[Corpora-List] SourceForge as a corpus

radev at umich.edu radev at umich.edu
Thu Jan 24 16:06:13 UTC 2008


> 
> On Jan 24, 2008 8:56 AM,  <radev at umich.edu> wrote:
> 
> > with a list of candidate corpora and contact people for each of
> > them. Here are some examples: Google n-grams, Enron email, GENIA, etc.
> 
> The list would indeed be quite large.  Just for the biomedical domain
> alone, there are more than twenty linked to at
> 
> http://compbio.uchsc.edu/ccp/corpora/obtaining.shtml

If the list is too large, perhaps we could pick two or three "easier" ones
and work on releasing them as case studies first.

Drago

> 
> Kev
> 
> 
> -- 
> K. B. Cohen
> Biomedical Text Mining Group Lead
> Center for Computational Pharmacology
> 303-916-2417 (cell) 303-377-9194 (home)
> http://compbio.uchsc.edu/Hunter_lab/Cohen
> 
> 


-- 
Dragomir R. Radev                    Associate Professor
SI, CSE, Ling                     U. Michigan, Ann Arbor 
http://www.eecs.umich.edu/~radev         radev at umich.edu              



