Vysoke uceni technicke v Brne (Brno University of Technology) (Czech Republic)
Digital Single Market (DSM) strategy
DSM - Connecting Europe Facility
CEF Digital portal
Innovation and Networks Executive Agency (INEA)
The objective of the Action (OCCAM - OCR, ClassificAtion & Machine Translation) is to integrate and extend the CEF Automated Translation DSI within multilingual digital cross-border services.
The Action aims to integrate image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT) to support the Automated Translation of scanned documents, a document type that the CEF eTranslation service currently cannot process. Scanned forms typically contain template text for which translations are already available in TMs. The Action aims to recognise such text in scanned images and translate it automatically. By filling this gap, OCCAM will directly contribute to lowering the language barrier.
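The translation step described above, where template text recognised in a scan is looked up in a TM and only the remaining text is sent to MT, can be illustrated with a minimal sketch. It is not the OCCAM implementation: the function names, the exact-match lookup after whitespace and case normalisation, and the `mt` callable standing in for a machine translation service are all assumptions made for illustration. OCR and image classification are assumed to have already produced the list of text segments.

```python
from typing import Callable, Dict, List, Tuple


def normalize(text: str) -> str:
    """Collapse whitespace and lowercase, so small OCR spacing differences still match."""
    return " ".join(text.split()).lower()


def translate_segments(
    segments: List[str],
    tm: Dict[str, str],
    mt: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Translate OCR output segment by segment (hypothetical helper).

    Returns (translation, origin) pairs, where origin is "TM" when the
    segment matched a Translation Memory entry and "MT" otherwise.
    """
    # Index the TM by normalized source text for exact-match lookup.
    index = {normalize(src): tgt for src, tgt in tm.items()}
    results: List[Tuple[str, str]] = []
    for seg in segments:
        key = normalize(seg)
        if key in index:
            results.append((index[key], "TM"))
        else:
            # Fall back to machine translation for non-template text.
            results.append((mt(seg), "MT"))
    return results
```

For example, with a one-entry TM mapping "Company name" to its French equivalent, a segment recognised as "Company  name" (with a stray extra space) would be served from the TM, while a free-text value such as a company's actual name would go to the `mt` callable. A real system would likely need fuzzy matching to tolerate OCR character errors; exact matching is used here only to keep the sketch short.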
OCCAM will develop two use cases: (a) a Business Registers Interconnection System (BRIS) use case focusing on the Belgian and Czech Business Registers, and (b) a Digital Humanities use case. For use case (a), OCCAM will develop an eDelivery-compliant Reference Implementation for the BRIS DSI. For use case (b), OCCAM will provide extensive guidance on introducing the combination of OCR and MT into the day-to-day workflow of Digital Humanities researchers. Both use cases will help the OCCAM Action become sustainable in the long term.