16.2978, FYI: Developing Linguistic Corpora, Available Online

LINGUIST List linguist at LINGUISTLIST.ORG
Fri Oct 14 14:41:03 UTC 2005


LINGUIST List: Vol-16-2978. Fri Oct 14 2005. ISSN: 1068 - 4875.

Subject: 16.2978, FYI: Developing Linguistic Corpora, Available Online

Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews (reviews at linguistlist.org) 
        Sheila Dooley, U of Arizona  
        Terry Langendoen, U of Arizona  

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Svetlana Aksenova <svetlana at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================  

1)
Date: 14-Oct-2005
From: Martin Wynne < martin.wynne at oucs.ox.ac.uk >
Subject: Developing Linguistic Corpora: a Guide to Good Practice, Online 

	
-------------------------Message 1 ---------------------------------- 
Date: Fri, 14 Oct 2005 10:35:19
From: Martin Wynne < martin.wynne at oucs.ox.ac.uk >
Subject: Developing Linguistic Corpora: a Guide to Good Practice, Online 
 

'Developing Linguistic Corpora: a guide to good practice', edited by Martin
Wynne of the Oxford Text Archive, is now available for free online at
http://ahds.ac.uk/linguistic-corpora/. This is the latest in the series of
Guides to Good Practice from the Arts and Humanities Data Service.

In this guide, a selection of leading experts offers advice to help
readers ensure that their corpus is well designed and fit for its
intended purpose.

As John Sinclair writes in the first chapter: "A corpus is a remarkable
thing, not so much because it is a collection of language text, but because
of the properties that it acquires if it is well-designed and
carefully-constructed."

The collection includes the following chapters:

* 'Corpus and text: basic principles' by John Sinclair
* 'Adding linguistic annotation' by Geoffrey Leech
* 'Metadata for corpus work' by Lou Burnard
* 'Character encoding in corpus construction' by Tony McEnery and Richard Xiao
* 'Spoken language corpora' by Paul Thompson
* 'Archiving, distribution and preservation' by Martin Wynne

This and other guides in the series (in print and online) are available
from http://www.ahds.ac.uk/creating/guides/.

 
Martin Wynne
Head of the Oxford Text Archive and
AHDS Literature, Languages and Linguistics

martin.wynne at oucs.ox.ac.uk 


Linguistic Field(s): Text/Corpus Linguistics





-----------------------------------------------------------
LINGUIST List: Vol-16-2978	

	


