[Corpora-List] Can corpora help to distinguish a dialect and a language?

CRuehlemann at aol.com CRuehlemann at aol.com
Tue Feb 16 14:38:47 UTC 2010


It is interesting to see how a serious question sparks a funny discussion,  
which is somewhat carnivalesque even, featuring soldiers, rabbis, schmucks, 
 dead dogs and what have you galore. On a more sober note one might return  
to the original question (Can corpora help distinguish dialect and a 
language?)  and say yes, to an extent, provided they have been carefully marked up 
 for  social categories. One such well-designed and annotated corpus is the 
BNC. In  pursuit of the above question, it can be used in two basic ways. 
In  a top down approach, one may start with a set of pre-defined dialect  
features, run queries for them using relevant restrictions such as speaker  
location or speaker dialect. Such a query would show, for example, that the  
quotative phrase *I says* is found predominatly in northern dialect areas of  
Great Britain (and Ireland). A bottom up approach would be to start without 
a  set of pre-defined dialect features and instead examine a set of randomly 
 selected features using the same set of restrictions as above. Such a 
query type  might, perhaps, help discover dialect features that researchers have 
not  yet been aware of as dialect features.
 
Apologies for this totally unMardi-Graslike note.  
 
Cheers
 
Chris
 
--------------------------------------------------------
Dr. Chrstoph Rühlemann, Munich
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20100216/77454b21/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list