<div>Dear all,</div>
<div> </div>
<div>I am looking for a very large corpus ( > 1 billion words) made for academic domain, mainly describing university, project, conference, paper, author and etc. I will compute statistics from it, which is used in building a query system on structured data for academic domain.</div>
<div> </div>
<div>Does anyone know such a corpus? Any information will be appreciated.</div>
<div> </div>
<div> </div>
<div>Thanks,</div>
<div> </div>
<div>Lushan Han </div>