Creating a corpus

Angus B. Grieve-Smith grvsmth at UNM.EDU
Wed Jun 2 12:45:03 UTC 2004


        An elicited corpus can be very useful, but I wanted to mention a
topic that came up at the North American Symposium on Corpus Linguistics a
little more than a week ago: representativeness.  It's definitely valuable
to say "native signers of x language produced this utterance and didn't
act as though it were a speech error."  But if you do any frequency
counts, you need to make sure that they can be generalized to the entire
language (or at least an entire register, genre or dialect), and for this
you need to have a representative sample.  It might be possible to get
this through elicitation, but you'd first need to study the balance of
"text types" that a typical Deaf person is exposed to in a typical day,
week or month, and then recreate that balance.

                                        -Angus B. Grieve-Smith
                                        Linguistics Department
                                        University of New Mexico
                                        grvsmth at unm.edu
                                        grvsmth at panix.com



More information about the Slling-l mailing list