Release of the ICE-GB Corpus

Survey of English Usage ucleseu at ucl.ac.uk
Mon Nov 30 11:04:09 UTC 1998


The Survey of English Usage, University College London, is pleased to 
announce the release of the ICE-GB corpus, the British component of the 
International Corpus of English (ICE).  

ICE-GB is a fully parsed corpus of adult British English from the 1990s.  It 
contains 300 spoken texts and 200 written texts  -  a total of 1 million 
words. The texts are distributed across 32 categories, including private 
conversations, telephone calls, court proceedings, broadcasts, social 
letters, examination scripts, and academic writing. 

ICE-GB has been grammatically analysed at wordclass level, and at the 
function and category levels. The analyses are presented as labelled 
syntactic trees  -  83,419 trees in total. 

The corpus is distributed with its own dedicated retrieval software, ICECUP.  

ICE-GB and ICECUP are available now on CD-ROM. 

A Sample Corpus of ten parsed texts, together with ICECUP, may be downloaded 
free from our website, at http://www.ucl.ac.uk/english-usage/

With apologies for cross postings. 
-----------------------------------------------------------------
Survey of English Usage
Department of English
University College London
Gower Street
London WC1E 6BT
UK

Telephone:  0171-419-3119 Marie Gibney (Administrator)
                      0171-419-3120  SEU Research Unit
Email:           ucleseu at ucl.ac.uk
Fax:               0171-916-2054
--------------------------------------------------------------





More information about the LFG mailing list