Criminocorpus is an online scholarly publication on the subject of criminal justice history.
It provides researchers and the general public with free access to a wide range of primary sources, together with analysis of key subjects by leading specialists in the field. There are also a growing number of online multimedia exhibitions.
As part of the 20th European Symposium on Languages for Special Purposes, entitled “Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age“, to be held from 8 -10 July 2015 at the Centre for Translation Studies, University of Vienna, Vienna, Austria, there will be a 1-day workshop “Corpus approaches to legal phraseology“.
I posted early last year about the fantastic resource offered by Projekt Deutscher Wortschatz at Leipzig University’s Department of Computer Science – the number of languages has now increased to 230!
This post, in my mini-series of posts entitled ‘What exactly is…”, will try to give an overview of Corpus Linguistics and hopefully pique your interest to find out more.
First of all, a definition: a corpus is a collection of texts, often used to study language. These days, corpora are generally held electronically – access is much faster and analysis can be more powerful.
I love this resource, with fast and immediate results, made available by Leipzig University’s Department of Computer Science. At present 158 languages or sub-languages have been included. The texts making up the databases are general and not specific to law.
When you enter a word, you are presented with significant co-occurrences, as well as left and right neighbours of the word, with their frequencies, and two graphical presentations – a kind of spider’s web showing related words that can be clicked on and explored.
Try it out and let me know what you think!
A few weeks ago, I told you about a dictionary on steroids (see here). Today’s post is about a multilingual semantic map and thesaurus on steroids! It’s called the Sketch Engine. At present it can be used in 42 languages.
The Sketch Engine is an awesome tool. It is extremely useful for everyone who manipulates words and needs ideas.
Here are just a few examples of how it can be used:
– To create brand names: The Most Powerful Naming Tool I’ve Ever Used
– To help translators looking for collocations (the words that ‘sound right’ together)
– To give inspiration to lawyers when wording their pleadings
– To help academics when writing papers or theses
– To help journalists and authors get around ‘writer’s block’
– For non-native speakers of a language to check which words are used together and how
The Google N-grams tool (see my recent post and update) has now fostered a new application to search 200 years of US Supreme Court decisions: Legal Language Explorer. It has been developed by Professor Dr. Daniel Martin Katz of Michigan State University College of Law, Michael J. Bommarito II of Computational Legal Studies, and their colleagues.
The tool is lightning fast, and I really like the way that the corpus tool allows you to click through to list of cases and then to see the original text, and to export lists to Excel, for example. I also support the move, of which the authors are part, to make more full-text legal resources openly available to the public.
You can try out the web-based interface here. Let me know what you think!
Thanks to Rob L. indirectly via Australia, and Robert at Legal Informatics Blog for bringing the project to my attention.