Corpora: Santa Barbara Corpus

Lou Burnard lou.burnard at computing-services.oxford.ac.uk
Mon Aug 7 15:31:27 UTC 2000


On Fri, 4 Aug 2000, Christopher Cieri wrote:

|I can certainly try to help. Because we expected the Santa Barbara
|Corpus of Spoken American English (SBCSAE) to be used in multiple
|research communties where different computer platforms and software are
|common, we have tried to avoid depending upon any specific set of tools.
|The corpus contains only data; there is no software to install. Indeed,
|the data is stored on the CDs in uncompressed format so that you can
|read the transcripts or listen to the audio directly from CD.


Hmm. So instead of using pre-existing standards which at least have a
chance of being implemented across different computer platforms, it's
better to make up an entirely arbitrary set of codes of your own for
which *everyone* has to write their own software?

Ah well.

 ----------------------------------------------------------------
 Lou Burnard                           http://users.ox.ac.uk/~lou
 ----------------------------------------------------------------



More information about the Corpora mailing list