Corpora: Santa Barbara Corpus
Lou Burnard
lou.burnard at computing-services.oxford.ac.uk
Mon Aug 7 15:31:27 UTC 2000
On Fri, 4 Aug 2000, Christopher Cieri wrote:
|I can certainly try to help. Because we expected the Santa Barbara
|Corpus of Spoken American English (SBCSAE) to be used in multiple
|research communties where different computer platforms and software are
|common, we have tried to avoid depending upon any specific set of tools.
|The corpus contains only data; there is no software to install. Indeed,
|the data is stored on the CDs in uncompressed format so that you can
|read the transcripts or listen to the audio directly from CD.
Hmm. So instead of using pre-existing standards which at least have a
chance of being implemented across different computer platforms, it's
better to make up an entirely arbitrary set of codes of your own for
which *everyone* has to write their own software?
Ah well.
----------------------------------------------------------------
Lou Burnard http://users.ox.ac.uk/~lou
----------------------------------------------------------------
More information about the Corpora
mailing list