Dublin City University (Ireland)
Digital Single Market (DSM) strategy
DSM - Connecting Europe Facility
CEF Digital portal
Innovation and Networks Executive Agency (INEA)
The Action (PRINCIPLE) focuses on the identification, collection and processing of language resources, for four under-resourced languages, Croatian, Icelandic, Irish and Norwegian (covering both official varieties Bokmål and Nynorsk). It focuses on providing data to and improving 2 DSIs, eJustice and eProcurement.
PRINCIPLE will promote awareness and use of National Relay Stations (NRSs), which are designed to collect, process and share language resources that can be used for Machine Translation training. It will encourage the extension of new NRSs to Croatia, Iceland and Norway for the respective languages.
Once the utility of the datasets has been verified, parallel data in 50K sentence pairs quantities to ELRC-SHARE for use by eTranslation engines will be uploaded. This will provide multilingual access to a number of DSIs, especially eJustice and eProcurement, by European and national public administrations for the languages covered under this Action.
The Action will result in the provision of high-quality curated monolingual and parallel corpora to the ELRC-SHARE repository. Machine Translation engines will be made available to the 'early adopters' in the four countries to validate the language resources collected and based on the specific use cases determined by public bodies within each country.