<html dir="ltr">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style id="owaParaStyle" type="text/css">P {margin-top:0;margin-bottom:0;}</style>
</head>
<body ocsi="0" fpstyle="1" style="word-wrap:break-word">
<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">Dear Leon,<br>
<br>
I seem to remember than in the ACE tasks, the terms "entities" (involved in the key facts of interest in the information extraction task) and "mentions" (references to those entities) were being used. Using ACE terminology, can we consider "named entities"
to be those entities whose mentions are typically proper names? Then in clinical IE tasks, for example, entities include symptoms, clinical findings, and anatomical locations, etc., which are not referred to using proper names and so are not named entities.
By contrast, in MUC they are locations, organisations, etc., which are named entities.<br>
<br>
Best regards,<br>
<br>
Richard<br>
<br>
<br>
<div><br>
<div style="font-family:Tahoma; font-size:13px">
<div style="font-family:Tahoma; font-size:13px">
<div style="font-family:Tahoma; font-size:13px">
<div style="font-family:Tahoma; font-size:13px">
<div class="BodyFragment"><font size="2"><span style="font-size:10pt">
<div class="PlainText">Richard Evans,<br>
Research Fellow,<br>
Computational Linguistics Research Group,<br>
Research Institute of Information and Language Processing,<br>
University of Wolverhampton,<br>
United Kingdom.<br>
<a href="http://clg.wlv.ac.uk/~richard/">http://clg.wlv.ac.uk/~richard</a><br>
</div>
</span></font></div>
</div>
</div>
</div>
</div>
</div>
<div style="font-family: Times New Roman; color: #000000; font-size: 16px">
<hr tabindex="-1">
<div style="direction: ltr;" id="divRpF696712"><font color="#000000" face="Tahoma" size="2"><b>From:</b> corpora-bounces@uib.no [corpora-bounces@uib.no] on behalf of Satoshi Sekine [sekine@cs.nyu.edu]<br>
<b>Sent:</b> 03 November 2014 15:35<br>
<b>To:</b> leon@dcs.shef.ac.uk<br>
<b>Cc:</b> corpora@lists.uib.no<br>
<b>Subject:</b> Re: [Corpora-List] Named entities and abstract codes<br>
</font><br>
</div>
<div></div>
<div>Dear Leon,
<div><br>
</div>
<div><br>
</div>
<div>I think I'm not directly answering your question (asking related literature), but I just want to give a comment. If you are building an application which needs to recognize abstract codes, don't be afraid to make such category. We don't have to start designing
it from the definition of "named entity". Rather, we should start design it from what is important for your application. We may be able to call them "targets of interest" or "important designator".</div>
<div><br>
</div>
<div>Actually, when we designed 200 extended named entity, which includes color name, animal, ID number (like yours), job title etc, we got comments that these are not named entities. We spent sometime trying to coin a new term to describe it, but it was not
successful. (if any of you come up with a good name, please let me know).</div>
<div><br>
</div>
<div><a href="http://nlp.cs.nyu.edu/ene/version7_1_0Beng.html" target="_blank">http://nlp.cs.nyu.edu/ene/version7_1_0Beng.html</a></div>
<div><a href="http://cs.nyu.edu/~sekine/papers/lrec04-65.pdf" target="_blank">http://cs.nyu.edu/~sekine/papers/lrec04-65.pdf</a></div>
<div><br>
</div>
<div>Thanks,</div>
<div>Satoshi Sekine</div>
<div><br>
</div>
<div><br>
</div>
<div> <br>
<div>
<div>On 2014/11/03, at 7:24, Leon Derczynski wrote:</div>
<blockquote type="cite">
<div dir="ltr">Dear list,
<div><br>
</div>
<div>Are "abstract codes" named entities? For example, "The *15:07 train to Sheffield*", "*Flight MH17*", "The new *Canon 50D*", "pass me *document 123*"?<br>
</div>
<div><br>
</div>
<div>One generic definition of a named entity is, a phrase that is a rigid designator. When we talk about "Lars von Trier", it is fairly uncontroversial to claim that this fits the rigid designator definition well. But using a less descriptive, more abstract
phrase, like a document identifier, does the definition fit just as well? Is there some related literature?<br>
</div>
<div><br>
</div>
<div>All the best,</div>
<div><br>
</div>
<div><br>
</div>
<div>Leon</div>
<div>
<div><br>
</div>
-- <br>
<div>
<div dir="ltr">Leon R A Derczynski<br>
Research Associate, NLP Group
<div><br>
</div>
<div>Department of Computer Science</div>
<div>University of Sheffield, UK</div>
<div><br>
<div>Voted number one for student experience</div>
<div>Times Higher Education Student Experience Survey <a href="tel:2014-2015" value="+4520142015" target="_blank">
2014-2015</a></div>
<br>
<a href="http://www.dcs.shef.ac.uk/~leon/" target="_blank">http://www.dcs.shef.ac.uk/~leon/</a></div>
</div>
</div>
</div>
</div>
_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">
http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
http://mailman.uib.no/listinfo/corpora<br>
</blockquote>
</div>
<br>
<div>
<div>------------------------------</div>
<div>Satoshi Sekine</div>
<div><a href="mailto:sekine@cs.nyu.edu" target="_blank">sekine@cs.nyu.edu</a></div>
<div><br>
</div>
<br class="Apple-interchange-newline">
</div>
<br>
</div>
</div>
</div>
</div>
<br><p>--
<BR>Scanned by iCritical.
</p>
<br></body>
</html>