Corpus-based collocation dictionaries in 230 languages

I posted early last year about the fantastic resource offered by Projekt Deutscher Wortschatz at Leipzig University’s Department of Computer Science – the number of languages has now increased to 230!

Although the texts making up the databases are general and not specific to law, you can get some really useful information about words, including ‘technical’ ones as you can see below (click to enlarge). In particular you can find collocations – words that often occur nearby – because when you enter a word, you are presented with significant co-occurrences, as well as left and right neighbours of the word, with their frequencies.

leipzig

There is also a graphical presentation – a kind of spider’s web showing related words that can be clicked on and explored.

2733370

Let me know what you think!  http://corpora.informatik.uni-leipzig.de/

5 thoughts on “Corpus-based collocation dictionaries in 230 languages

  1. Wow. Don’t exactly know whether it will actually be useful for translation, but, then, it just might, for those cases when you are looking for a certain expression which keeps eluding you but it’s there, in the back of your mind, and if you see it, it immediately clicks and you know it’s the right one. Be that as it may: I have bookmarked it and hope to use it.

    It must have involved an enormous amount of work and certainly is a commendable effort. And so many languages, to boot!

    Thank you for sharing and providing us always with these amazing, awesome (being a Saturday, you will allow me a bit of fun with “fashionable, trendy” adjectives) links…

    • Thanks Nelida 🙂 And awesome adjectives are welcome any day of the week!

      I find this kind of corpus-based word grouping really useful when translating actually (but maybe that’s because the “right word” often eludes me!!).

  2. Pingback: Corpus-based collocation dictionaries in 230 languages | Terminology, Computing and Translation

  3. Pingback: Weekly favorites 90 (Apr 15-21) | Adventures in Freelance Translation

  4. Pingback: Most read posts 2013 | From Words to Deeds: translation & the law

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.