[Corpora] [Corpora-List] English corpus for specific domains

liling tan alvations at gmail.com
Thu Nov 13 14:33:41 UTC 2014


Dear linguists,

Traditional corpora such as British National Corpus, American COCA corpus
and International Corpus of English holds on to the notion of a balance
corpus and allowed corpora of different registers, domains and types.

Web corpora like wikipedia corpora, web as corpus corpora and many others
used crawling techniques or crowdsourcing texts for compilation and it also
ends up with some sort of balance corpora.

Thus finding corpora for specific domains is a task of resourcefulness. And
we require your help in locating them.

Are there corpora that are specifically for the following domain:

   - *Chemical*: the taxonomy rooted on "chemical", examples of terminology
   concepts are ("ammonium carbonate", "beta hydroxybutyric acid", "butyl
   rubber" );
   - *Equipment*: the taxonomy rooted on "equipment", examples of
   terminology concepts are ("acoustic modem", "parasail", "clock pendulum");
   - *Food*: the taxonomy rooted on "food", examples of terminology
   concepts are ("jacket potato", "lemonade", "bolognese pasta sauce");
   - *Science*: the taxonomy rooted on "science", examples of terminology
   concepts are ( "neuropsychiatry", "craniometry", "microelectronics");

Best Regards,
Liling
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20141113/2122e2a6/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list