EU Science Hub

EAC-Translation Memory

Introduction

In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. The data gets distributed via the web pages of the EC's Joint Research Centre (JRC). Here we describe this resource, which bears the name EAC Translation Memory, short EAC-TM.

 

Languages / File Format

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

 

Text types / Domain

EAC-TM was built in the context of translating electronic forms: application and report forms for decentralised actions of EAC's Life-long Learning Programme (LLP) and the Youth in Action Programme.

 

Statistics for the EAC Translation Memory

The EAC Translation Memory is available in 25 languages: the English source language and its translations into up to 24 other languages. The following tables show the coverage, expressed in the approximate total number of translation units available for each language, the number of words and the number of characters.

 

Conditions for Use

The Commission's copright notice applies

 

Further Translation Memories available here

The public release of the EAC-Translation Memory follows the release of various other multilingual resources via the JRC's website.

 

Download the EAC Translation Memory

The distribution of the EAC Translation Memory consists of a single zip file (EAC-TM-all.zip), which can be downloaded by clicking on the link below. In the zip file, you find two TMX files (EAC_FORMS.tmx and EAC_REFERENCE_DATA.tmx) containing the English sentences and their translations into up to 25 other languages; the DTD file, which should be kept in the same directory; two PDF files with the statistics on the corpora; the Java utility CreateLanguagePair.jar that allows you to extract a TMX file containing only one single language pair. The language codes used are those defined by the norm ISO 639-1.

EAC-TM (August 2012)

Download size

EAC-TM-all.zip

3.5 MB

 

Referring to this resources

When referring to the EAC Translation Memory EAC-TM in publications, please use the following reference:

Acknowledgement and Contact

For more information, you can contact the following persons:

Directorate General for Education and Culture

Mr Marek Przybyszewski (Email address format: Firstname.lastname@ec.europa.eu)
European Commission - Directorate General for Education and Culture (DG EAC)
Brussels, Belgium
URL: http://ec.europa.eu/dgs/education_culture/

 

Joint Research Centre (JRC)
Ralf Steinberger (Email address format: Firstname.Lastname@jrc.ec.europa.eu)
IPSC - GlobeSec - OPTIMA
Via E. Fermi 2749, T.P. 267
I-21027 Ispra (VA)