ecolingui.ca

consulting, writing, research, and development | low-tech solutions | human intelligence

technology for people and planet

Document Intelligence

Are you a government or organization that depends on semi-structured data, which, in the end, are not very structured at all?

Annotation

Data Visualization

Are you a provider of public data that wants to make these data actually accessible, understandable, and useful to the public?

géomatique

Language Technologies

Are you a minority language community in need of models and tools to document, teach, and revitalize your language in the digital world?

linguistique

Recent posts

Presenting PLAYA-PDF (and PAVÉS)

Feb 25, 2026

If you need to delve into the murky depths of a PDF to return with spices and silk extract metadata, images, and yes, even text, I have some excellent Free Software for you: PLAYA-PDF and PAVÉS. If you’d like to know how this came to be, then continue reading. And if you need a consultant for document intelligence tasks, large and small, I’m currently available for contracts of all sorts!

[read more]

TypeScript modules With Emscripten and CMake, part 5

Feb 27, 2023

Let’s set up CMake to build everything with maximum optimization with the -Oz option, which should be passed at both compile and link time. (Note: I will not discuss -flto here, because it is only useful when dealing with the eldritch horrors of C++). While we’re at it we’ll also disable support for the longjmp function which we know our library doesn’t use: [read more]

About me

Originally trained in linguistics, with over 25 years of experience in speech and natural language processing technology, I have also developed a wide range of expertise in open-source software development, as a maintainer and valued contributor to numerous projects.