AI and NLP solutions
for oriental languages

Delivering new technologies for oriental languages. Dedicated to organizations, companies and cultural heritage professionals.

Our expertise


OCR/HTR: printed and handwritten text recognition

Calfa has developed an artificial intelligence for Arabic, Armenian, Georgian, Syriac, and other languages. This technology analyses the page layout and reads the text of the document, opening it up to new digital uses.


Text Analysis technology

We offer an AI dedicated to grammatical analysis and morphological tagging of texts in oriental languages, making it possible to process the data contained in the documents.


Digitization and promotion projects

Calfa works with institutions and companies on their digitization projects and automated document processing (OCR systems, recognition APIs, collection browsing tools, search engines, and expert assessment for catalogues and collection enhancement). Calfa tools are customized according to the specific features and language of your project.


Actions for cultural heritage preservation

We lead or co-finance projects to preserve written or printed cultural heritage. Discover our digitization and conservation actions.

Our latest projects

About Calfa

Founded in 2014, Calfa develops text detection and automated analysis technologies for manuscripts written in oriental languages. Our team is made up of PhD students and engineers in artificial intelligence who specialize in deep learning. Today we continue to build our expertise in text recognition and digitization to meet the needs of our partners, who face a lack of effective tools dedicated to oriental languages.

Learn more about Calfa

Our partners

Stay up to date with our projects

Receive the latest news and updates on our projects and conferences by email