<style type="text/css">
p{margin:0;padding:0;}
</style>
<div style="color:#000000;font-size:11pt;font-family:돋움;background-color: transparent;">
<p><br></p><p><br></p><p><span style="font-family: '맑은 고딕';">Dear all,</span></p><p><br></p><p><span style="font-family: '맑은 고딕';">I am trying to evaluate my NER system against a gold-standard corpus.</span></p><p><span style="font-family: '맑은 고딕';"><br></span></p><p><span style="font-family: '맑은 고딕';">The CoNLL-03 English annotations are freely available, but they require the Reuters corpus (RCV1) as the raw text.</span></p><p><br></p><p><span style="font-family: '맑은 고딕';">However, the provider, NIST, is closed. (</span><a href="http://trec.nist.gov/data/reuters/reuters.html" style="font-size: 11pt; line-height: 1.2;"><span style="font-family: '맑은 고딕';">http://trec.nist.gov/data/reuters/reuters.html</span></a><span style="font-family: '맑은 고딕';">)</span></p><p><span style="font-family: '맑은 고딕';"><br></span></p><p><span style="font-size: 11pt; line-height: 1.2; font-family: '맑은 고딕';">So I cannot generate the CoNLL NER corpus from the Reuters corpus.</span></p><p><span style="font-size: 11pt; line-height: 1.2; font-family: '맑은 고딕';"><br></span></p><p><span style="font-size: 11pt; line-height: 1.2;"><br></span></p><p><span style="font-size: 11pt; line-height: 1.2; font-family: '맑은 고딕';">- Is it possible to get the CoNLL / Reuters corpus? If so, where?</span></p><p><span style="font-size: 11pt; line-height: 1.2; font-family: '맑은 고딕';"><br></span></p><p><span style="font-family: '맑은 고딕';">- Are there any alternative free gold-standard corpora?</span></p><p><span style="font-family: '맑은 고딕';"><br></span></p><p><span style="font-family: '맑은 고딕';">Thanks!</span></p><p><span style="font-family: '맑은 고딕';"><br></span></p><p><span style="font-family: '맑은 고딕';">---</span></p><p><span style="font-family: '맑은 고딕';">Younggyun Hahm.</span></p><p><br></p><p><br></p>
</div>