
I posted early last year about the fantastic resource offered by Projekt Deutscher Wortschatz at Leipzig University’s Department of Computer Science – the number of languages has now increased to 230!

Although the texts making up the databases are general and not specific to law, you can get some really useful information about words, including ‘technical’ ones, as you can see below. In particular you can find collocations – words that often occur nearby – because when you enter a word, you are presented with significant co-occurrences, as well as left and right neighbours of the word, with their frequencies.
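For anyone curious about what "left and right neighbours with their frequencies" means in practice, here is a minimal Python sketch of the basic idea. It is only an illustration: the function name `neighbours` and the toy sentences are my own inventions, and it simply counts adjacent words. It is not the method the Leipzig project uses, which also applies statistical significance measures that are not reproduced here.

```python
from collections import Counter


def neighbours(tokens, word):
    """Count the words immediately to the left and right of `word`."""
    left, right = Counter(), Counter()
    for i, tok in enumerate(tokens):
        if tok != word:
            continue
        if i > 0:
            left[tokens[i - 1]] += 1
        if i + 1 < len(tokens):
            right[tokens[i + 1]] += 1
    return left, right


# Invented toy text, not taken from the Leipzig corpora.
text = (
    "the court granted the injunction . "
    "the court refused the injunction . "
    "an interim injunction was granted ."
)
tokens = text.split()

left, right = neighbours(tokens, "injunction")
print("left neighbours: ", left.most_common())
print("right neighbours:", right.most_common())
```

On a real corpus you would want proper tokenisation and far more text before the counts told you anything about usage, but the principle is the same: the words that keep turning up next to your term are the candidates for collocations.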


There is also a graphical presentation – a kind of spider’s web showing related words that can be clicked on and explored.


Let me know what you think!  http://corpora.informatik.uni-leipzig.de/