DANTE – Database of Analysed Texts of English

Posted in Resources by WordstoDeeds

DANTE stands for Database of Analysed Texts of English. “Its lexical entries
offer a thorough, systematic, and fine-grained description of the core vocabulary of English, derived entirely from corpus data.

Continue reading →

30Jun2015

Criminocorpus

Posted in Resources by WordstoDeeds

La Santé prison, Paris, France

Criminocorpus is an online scholarly publication on the subject of criminal justice history.

It provides researchers and the general public with free access to a wide range of primary sources, together with analysis of key subjects by leading specialists in the field. There are also a growing number of online multimedia exhibitions.

Continue reading →

31Mar2015

Symposium – Corpus approaches to legal phraseology

Posted in Conferences by WordstoDeeds

As part of the 20^th European Symposium on Languages for Special Purposes, entitled “Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age“, to be held from 8 -10 July 2015 at the Centre for Translation Studies, University of Vienna, Vienna, Austria, there will be a 1-day workshop “Corpus approaches to legal phraseology“.

Continue reading →

5Apr2013

Corpus-based collocation dictionaries in 230 languages

Posted in Resources by WordstoDeeds

I posted early last year about the fantastic resource offered by Projekt Deutscher Wortschatz at Leipzig University’s Department of Computer Science – the number of languages has now increased to 230!

Continue reading →

10Sep2012

What exactly is corpus linguistics?

Posted in Articles, What exactly is... by WordstoDeeds

This post, in my mini-series of posts entitled ‘What exactly is…”, will try to give an overview of Corpus Linguistics and hopefully pique your interest to find out more.

First of all, a definition: a corpus is a collection of texts, often used to study language. These days, corpora are generally held electronically – access is much faster and analysis can be more powerful.

Continue reading →

7Feb2012

135 Corpus-Based Monolingual Dictionaries

Posted in Resources by WordstoDeeds

I love this resource, with fast and immediate results, made available by Leipzig University’s Department of Computer Science. At present 158 languages or sub-languages have been included. The texts making up the databases are general and not specific to law.

When you enter a word, you are presented with significant co-occurrences, as well as left and right neighbours of the word, with their frequencies, and two graphical presentations – a kind of spider’s web showing related words that can be clicked on and explored.

http://corpora.uni-leipzig.de/

Try it out and let me know what you think!

12Jan2012

Multilingual semantic map on steroids

Posted in Resources by WordstoDeeds

A few weeks ago, I told you about a dictionary on steroids (see here). Today’s post is about a multilingual semantic map and thesaurus on steroids! It’s called the Sketch Engine. At present it can be used in 42 languages.

The Sketch Engine is an awesome tool. It is extremely useful for everyone who manipulates words and needs ideas.

Here are just a few examples of how it can be used:
– To create brand names: The Most Powerful Naming Tool I’ve Ever Used
– To help translators looking for collocations (the words that ‘sound right’ together)
– To give inspiration to lawyers when wording their pleadings
– To help academics when writing papers or theses
– To help journalists and authors get around ‘writer’s block’
– For non-native speakers of a language to check which words are used together and how

Continue reading →

8Jan2012

Legal Language Explorer

Posted in Resources by WordstoDeeds

The Google N-grams tool (see my recent post and update) has now fostered a new application to search 200 years of US Supreme Court decisions: Legal Language Explorer. It has been developed by Professor Dr. Daniel Martin Katz of Michigan State University College of Law, Michael J. Bommarito II of Computational Legal Studies, and their colleagues.

The tool is lightning fast, and I really like the way that the corpus tool allows you to click through to list of cases and then to see the original text, and to export lists to Excel, for example. I also support the move, of which the authors are part, to make more full-text legal resources openly available to the public.

Click here to see a presentation of the service, given at the conference JURIX 2011 in mid-December. Bommarito has also posted a more technical description of the project on his blog.

You can try out the web-based interface here. Let me know what you think!

Thanks to Rob L. indirectly via Australia, and Robert at Legal Informatics Blog for bringing the project to my attention.