Following my post in April, I would like to let you know that the translation memories made available by the Directorate-General for Translation at the European Commission Joint Research Centre have been updated for the second time this year.
In the April 2012 release, documents up to 2010 were included. Now, 2011 data has been added – a further 6 million translation units. Somewhat confusingly, the new update is called “DGT-TM-2012” even though the data is from 2011.
New features of DGT-TM-2012 are:
- Small amounts of Irish data are now included for the first time;
- Significantly more data for the Bulgarian, Maltese and Romanian languages;
- Mostly about 285K new translation units per language.
The translation memories are parallel texts of the entire body of European legislation, comprising all the treaties, regulations and directives adopted by the European Union (EU), in 23 languages: Bulgarian, Czech, Danish, Dutch, English, Estonian, German, Greek, Finnish, French, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovene, Spanish and Swedish.
You can download them from this page. Scroll down to “DGT-TM-release 2012” for the 2011 data. Information on how to produce bilingual extractions appears on the same page.