[Corpora-List] Seeking corpus for academic domain

Lushan Han lushan1 at umbc.edu
Wed Jul 18 19:30:20 UTC 2012


A corpus of smaller size (e.g. millions of words) can also be very helpful
to me.  Please inform me if you happen to know it.

Thanks,

Lushan

On Wed, Jul 18, 2012 at 11:03 AM, Lushan Han <lushan1 at umbc.edu> wrote:

> Dear all,
>
> I am looking for a very large corpus ( > 1 billion words) made for
> academic domain, mainly describing university, project, conference, paper,
> author and etc. I will compute statistics from it, which is used in
> building a query system on structured data for academic domain.
>
> Does anyone know such a corpus? Any information will be appreciated.
>
>
> Thanks,
>
> Lushan Han
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120718/146d221c/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list