[Corpora-List] consistency of spoken transcription

Karin Axelsson karin.axelsson at his.se
Thu May 9 14:30:13 UTC 2013


Dear Chris,

When the BNC was converted into XML, many pauses were lost, which means that the latest version of the BNC is less reliable in this respect than earlier versions.
You can read about this in the following report compiled by Sebastian Hoffmann and Stefan Evert:

http://corpora.lancs.ac.uk/BNCweb/BNC-errors_and_inconsistencies.pdf

Best wishes,
Karin

---------------------------
Karin Axelsson, Ph.D.
Senior Lecturer in English
School of Humanities and Informatics
University of Skövde
Sweden
karin.axelsson at his.se