Document Intelligence
Are you a government or organization that depends on semi-structured data, which, in the end, are not very structured at all?
consulting, writing, research, and development | low-tech solutions | human intelligence
Are you a government or organization that depends on semi-structured data, which, in the end, are not very structured at all?
Are you a provider of public data that wants to make these data actually accessible, understandable, and useful to the public?
Are you a minority language community in need of models and tools to document, teach, and revitalize your language in the digital world?
Feb 27, 2023
Let’s set up CMake to build everything with maximum optimization with the -Oz
option, which should be passed at both compile and link time. (Note: I will not discuss -flto
here, because it is only useful when dealing with the eldritch horrors of C++). While we’re at it we’ll also disable support for the longjmp
function which we know our library doesn’t use:
[read more]
Feb 24, 2023
Just by screwing around on the command line, we were previously able to produce a more or less useful CommonJS module wrapping the real-valued FFT function from the Kiss FFT library (though not as useful as the existing one on npmjs.org). Now let’s look at how we can build a module with CMake as part of the library’s build system. [read more]
Feb 24, 2023
When I set out to create an NPM package for SoundSwallower, I was unable to find much relevant information in the Emscripten documentation or elsewhere on the Web, so I have written this guide, which walks through the process of compiling a library to WebAssembly and packaging it as a CommonJS or ES6 module. [read more]
Originally trained in linguistics, with over 25 years of experience in speech and natural language processing technology, I have also developed a wide range of expertise in open-source software development, as a maintainer and valued contributor to numerous projects.