This plugin performs optical character recognition (OCR) on three types of documents: bank account details (RIB), tax assessments, and identity documents.
The plugin queries the A2iA engine (https://www.a2ia.com/en), which performs the OCR, and then returns the results in a HashMap.
To communicate with A2iA, the plugin uses the Jacob library (https://sourceforge.net/projects/jacob-project). Jacob requires loading the Windows DLL file jacob-1.19-x64.dll, which is why a Lutèce site using this plugin must be deployed on a Windows server.
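Since the DLL constraint is easy to overlook at deployment time, a small pre-flight check can make the failure mode explicit. The sketch below is not part of the plugin: the class name JacobPreflight and the DLL location are hypothetical, and it assumes that Jacob's library loader honors the jacob.dll.path system property, which should be verified against the Jacob version in use.

```java
import java.io.File;
import java.util.Locale;

public class JacobPreflight {

    // Hypothetical DLL location; adjust it to the actual deployment.
    static final String ASSUMED_DLL_PATH = "C:\\lutece\\lib\\jacob-1.19-x64.dll";

    // Returns true when the given os.name value denotes a Windows system.
    static boolean isWindows(String osName) {
        return osName != null && osName.toLowerCase(Locale.ROOT).startsWith("windows");
    }

    // Returns a description of the problem, or null when the environment looks usable.
    static String check(String osName, File dll) {
        if (!isWindows(osName)) {
            return "Unsupported operating system for Jacob: " + osName;
        }
        if (!dll.isFile()) {
            return "Jacob DLL not found at " + dll.getPath();
        }
        return null;
    }

    public static void main(String[] args) {
        String problem = check(System.getProperty("os.name"), new File(ASSUMED_DLL_PATH));
        if (problem != null) {
            System.err.println(problem);
            return;
        }
        // Assumption: Jacob reads this property to locate its DLL.
        System.setProperty("jacob.dll.path", ASSUMED_DLL_PATH);
        System.out.println("Jacob DLL path configured: " + ASSUMED_DLL_PATH);
    }
}
```

Running such a check at startup, before any OCR request is made, turns a native library loading error into a readable message.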
The plugin contains a single Spring bean, OcrService, which offers a "proceed" method to launch the OCR and retrieve the results.
/**
 * Perform OCR with A2iA.
 *
 * @param byteImageContent
 *            image to process
 * @param strFileExtension
 *            image extension : values allowed : Tiff, Bmp, Jpeg
 * @param strDocumentType
 *            document type : values allowed : Rib, TaxAssessment, Identity
 * @return Map result of OCR
 * @throws OcrException
 *             the OcrException
 *
 */
public Map<String, String> proceed( byte[] byteImageContent, String strFileExtension, String strDocumentType ) throws OcrException
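A minimal sketch of how a caller might use this method is given here. To stay runnable without the plugin or an A2iA installation, it declares local stand-ins: the nested OcrService interface and OcrException class only mirror the signature documented above, and StubOcrService, runOcr, and the result keys "documentType" and "imageSizeInBytes" are hypothetical. Rejecting unsupported values with an OcrException is also an assumption about the real service, as is the way the bean is obtained in a real Lutèce site.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class OcrUsageSketch {

    // Stand-in for the plugin's exception, declared so the sketch compiles on its own.
    static class OcrException extends Exception {
        OcrException(String message) {
            super(message);
        }
    }

    // Stand-in mirroring the documented signature of OcrService.proceed.
    interface OcrService {
        Map<String, String> proceed(byte[] byteImageContent, String strFileExtension,
                String strDocumentType) throws OcrException;
    }

    // Allowed values as listed in the Javadoc.
    static final List<String> EXTENSIONS = Arrays.asList("Tiff", "Bmp", "Jpeg");
    static final List<String> DOCUMENT_TYPES = Arrays.asList("Rib", "TaxAssessment", "Identity");

    // Hypothetical stub: returns canned values instead of querying A2iA.
    static class StubOcrService implements OcrService {
        @Override
        public Map<String, String> proceed(byte[] byteImageContent, String strFileExtension,
                String strDocumentType) throws OcrException {
            if (!EXTENSIONS.contains(strFileExtension)) {
                throw new OcrException("Unsupported file extension: " + strFileExtension);
            }
            if (!DOCUMENT_TYPES.contains(strDocumentType)) {
                throw new OcrException("Unsupported document type: " + strDocumentType);
            }
            Map<String, String> result = new HashMap<>();
            result.put("documentType", strDocumentType);
            result.put("imageSizeInBytes", String.valueOf(byteImageContent.length));
            return result;
        }
    }

    // Calls the service and returns an empty map when the OCR fails.
    static Map<String, String> runOcr(OcrService service, byte[] image, String extension,
            String documentType) {
        try {
            return service.proceed(image, extension, documentType);
        } catch (OcrException e) {
            System.err.println("OCR failed: " + e.getMessage());
            return new HashMap<>();
        }
    }

    public static void main(String[] args) {
        byte[] image = new byte[] { 1, 2, 3 }; // placeholder for real image bytes
        Map<String, String> result = runOcr(new StubOcrService(), image, "Tiff", "Rib");
        result.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

In a real deployment the stub would be replaced by the OcrService bean provided by the plugin, and the keys of the returned map would be those produced by the A2iA engine for the chosen document type.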