[Corpora-List] Legal-domain corpora

Stella Neumann st.neumann at mx.uni-saarland.de
Wed Oct 18 16:27:50 UTC 2006


Seth,
check out the HOLJ Corpus built in the framework of the SUM project in 
Edinburgh (http://www.ltg.ed.ac.uk/SUM/index.html).
It contains court decisions by the House of Lords, is annotated and can 
be downloaded for free.
Best,
Stella

Seth Grimes schrieb:
> Hello all,
> 
> 	I'm researching legal-domain application of NLP with machine
> learning.  What annotated corpora are available in this domain, either for
> free or for a license fee?  I'd be interested in --
> 
> - legislation and statutes
> - case law
> - briefs, depositions & testimony, crime reports, and evidentiary
> materials
> - court judgments
> - patent filings
> 
> -- and also in parallel, multi-lingual corpora, for instance that might
> have been created in the EU, Switzerland, Canada, and other areas with
> multiple official languages.
> 
> 	I've been told that news-media text can provide good training
> material for the legal domain.  I'd also be interested in hearing
> reactions to that claim, especially if anyone has formally studied the
> question.
> 
> 	Thanks very much for all help,
> 
> 					Seth
> 
> 
> --
> Seth Grimes   Alta Plana Corp, analytical computing & data management
>               Intelligent Enterprise magazine (CMP), Contributing Editor
> grimes at altaplana.com       http://altaplana.com    301-270-0795
> 

-- 
Dr. Stella Neumann
Englische Sprach- und
Übersetzungswissenschaft

Universität des Saarlandes
Fachrichtung 4.6
Angewandte Sprachwissenschaft
sowie Übersetzen und Dolmetschen
Postfach 15 11 50
D-66041 Saarbrücken

Tel.: +49(681) 302 64307
Fax: +49(681) 302 64375
e-mail: st.neumann at mx.uni-saarland.de

http://fr46.uni-saarland.de/steiner.php
http://www.uni-saarland.de/~st.neumann



More information about the Corpora mailing list