[Corpora-List] Dataset for Different Research Areas

Shayan A Tabrizi shayantabrizi at gmail.com
Sun Oct 5 19:32:55 UTC 2014


Hi Everybody,

I want to estimate how relevant each research paper in my dataset is to
each of several research areas, such as Physics, CS, Math, Social Sciences,
etc.
To do this, I need a dataset that covers all research areas and provides
some sample texts (preferably papers) for each area, so that I can estimate
the similarity of each of my papers to each area.
*Is there any such dataset?*

Some points:

   1. It would be much better if the dataset covered areas at different
   granularities, e.g. Mathematics, Physics, CS, etc. at one level, and at
   a finer-grained level a division of CS into Networks, Artificial
   Intelligence, etc.
   2. Even a dataset that covers only one specific domain (especially CS)
   and its sub-domains would still be usable.
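
To make the intended use concrete, here is a minimal sketch of one way such
a dataset could be used to score a paper against each area. It assumes
TF-IDF weighting with cosine similarity, which is only one possible choice
of similarity measure. The function names and the toy area texts are
hypothetical placeholders, not part of any existing dataset.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict) per document."""
    token_lists = [tokenize(d) for d in docs]
    n_docs = len(token_lists)
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))
    # Smoothed inverse document frequency, so no weight is ever zero.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0
           for t, c in doc_freq.items()}
    vectors = []
    for tokens in token_lists:
        tf = Counter(tokens)
        vectors.append({t: tf[t] * idf[t] for t in tf})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def area_similarities(paper, area_samples):
    """Score one paper against each area's concatenated sample texts."""
    areas = list(area_samples)
    docs = [" ".join(area_samples[a]) for a in areas] + [paper]
    vectors = tfidf_vectors(docs)
    paper_vec = vectors[-1]
    return {a: cosine(paper_vec, vectors[i]) for i, a in enumerate(areas)}


# Toy placeholder texts (assumption); a real dataset would supply papers.
AREA_SAMPLES = {
    "Physics": ["quantum particles energy momentum field",
                "gravity relativity spacetime energy"],
    "CS": ["algorithm complexity network routing protocol",
           "neural network learning algorithm data"],
    "Math": ["theorem proof algebra topology",
             "prime number theorem proof"],
}

scores = area_similarities("a learning algorithm for routing in a network",
                           AREA_SAMPLES)
best_area = max(scores, key=scores.get)
```

With a hierarchical dataset (point 1), the same scoring could be repeated
within the best-matching top-level area to rank its sub-areas.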

Regards,
Shayan Tabrizi