When you mention ATLAS, it's probably woth mentioning that this model of<div>having stand-off annotations of various kinds was already common in various</div><div>systems for information extraction, such as FASTUS and GATE.</div>
<div>(NLP, specifically for information extraction, has needed multiple layers</div><div>of annotation pretty early since it was the solution to merging the output</div><div>from different NLP components).</div><div>NITE and ATLAS are an attempt at making the complexity of</div>
<div>having multiple annotation layers more manageable (or palatable)</div><div>than the approach taken for e.g. MUC (where you have both NER and</div><div>coreference info annotated on the same base data, but distributed</div>
<div>as joint SGML).</div><div>Other people probably know more about this, but the Alembic Workbench</div><div>(from 1997, no less) allows you to create standoff annotation separately</div><div>from the document itself (and its markup):</div>
<div><meta http-equiv="content-type" content="text/html; charset=utf-8"><a href="http://www.timeml.org/site/terqas/alembic/AWB-ptf-format.html">http://www.timeml.org/site/terqas/alembic/AWB-ptf-format.html</a></div><div><br>
</div><div>Best,</div><div>Yannick</div><div><div><br><div class="gmail_quote">On Fri, Nov 19, 2010 at 12:56 PM, Piotr Bański <span dir="ltr"><<a href="mailto:bansp@o2.pl">bansp@o2.pl</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
Don't forget about ATLAS (Steven Bird, Mark Liberman; [1], look around<br>
1999/2000). It is also necessary to mention the work by Nancy Ide since<br>
the Corpus Encoding Standard[2], then, in the context of ISO TC 37 SC 4,<br>
mostly with Laurent Romary and Keith Suderman [3]. Some attempt at<br>
putting order into these notions has been made in Goecke et al., 2010 [4].<br>
<br>
Good luck,<br>
<br>
Piotr<br>
<br>
[1]:<br>
<a href="http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/l/Liberman:Mark.html" target="_blank">http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/l/Liberman:Mark.html</a><br>
[2]: <a href="http://www.cs.vassar.edu/CES/" target="_blank">http://www.cs.vassar.edu/CES/</a><br>
[3]: <a href="http://www.cs.vassar.edu/~ide/pubs.html" target="_blank">http://www.cs.vassar.edu/~ide/pubs.html</a><br>
[4]: Goecke, D., Metzing, D., Lüngen, H., Stührenberg, M., Witt, A.<br>
(2010). Different views on markup. distinguishing levels and layers. In<br>
Linguistic modeling of information and markup languages. Contributions<br>
to language technology. Springer Netherlands, pp. 1–21.<br>
<div class="im"><br>
On 2010-11-18 14:06, Philippe Blache wrote:<br>
> Hi Karen,<br>
> I don't know whether the idea comes from there, but it belongs to the NXT data model:<br>
><br>
> J. Carletta, S. Evert, U. Heid, J. Kilgour, J. Robertson, H. Voormann<br>
> "The NITE XML Toolkit: Flexible annotation for multimodal language data"<br>
> Behavior Research Methods, Instruments, & Computers 2003, 35 (3), 353-363<br>
><br>
><br>
> Philippe<br>
><br>
><br>
> Le 18 nov. 2010 à 11:04, Karen Fort a écrit :<br>
><br>
>> Dear members,<br>
>><br>
>> I'm looking for a reference on annotations layers.<br>
>> Can somebody tell me when the "idea" of annotation layers appeared?<br>
>><br>
>> I suppose it came from the speech community, but I cannot find a clear reference on that.<br>
>><br>
>> Thank you for your help!<br>
<br>
</div><div><div></div><div class="h5">_______________________________________________<br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</div></div></blockquote></div><br></div></div>