[Corpora-List] Legal-domain corpora
Stella Neumann
st.neumann at mx.uni-saarland.de
Wed Oct 18 16:27:50 UTC 2006
Seth,
check out the HOLJ Corpus built in the framework of the SUM project in
Edinburgh (http://www.ltg.ed.ac.uk/SUM/index.html).
It contains court decisions by the House of Lords, is annotated and can
be downloaded for free.
Best,
Stella
Seth Grimes schrieb:
> Hello all,
>
> I'm researching legal-domain application of NLP with machine
> learning. What annotated corpora are available in this domain, either for
> free or for a license fee? I'd be interested in --
>
> - legislation and statutes
> - case law
> - briefs, depositions & testimony, crime reports, and evidentiary
> materials
> - court judgments
> - patent filings
>
> -- and also in parallel, multi-lingual corpora, for instance that might
> have been created in the EU, Switzerland, Canada, and other areas with
> multiple official languages.
>
> I've been told that news-media text can provide good training
> material for the legal domain. I'd also be interested in hearing
> reactions to that claim, especially if anyone has formally studied the
> question.
>
> Thanks very much for all help,
>
> Seth
>
>
> --
> Seth Grimes Alta Plana Corp, analytical computing & data management
> Intelligent Enterprise magazine (CMP), Contributing Editor
> grimes at altaplana.com http://altaplana.com 301-270-0795
>
--
Dr. Stella Neumann
Englische Sprach- und
Übersetzungswissenschaft
Universität des Saarlandes
Fachrichtung 4.6
Angewandte Sprachwissenschaft
sowie Übersetzen und Dolmetschen
Postfach 15 11 50
D-66041 Saarbrücken
Tel.: +49(681) 302 64307
Fax: +49(681) 302 64375
e-mail: st.neumann at mx.uni-saarland.de
http://fr46.uni-saarland.de/steiner.php
http://www.uni-saarland.de/~st.neumann
More information about the Corpora
mailing list