When you mention ATLAS, it's probably woth mentioning that this model of<div>having stand-off annotations of various kinds was already common in various</div><div>systems for information extraction, such as FASTUS and GATE.</div>

<div>(NLP, specifically for information extraction, has needed multiple layers</div><div>of annotation pretty early since it was the solution to merging the output</div><div>from different NLP components).</div><div>NITE and ATLAS are an attempt at making the complexity of</div>

<div>having multiple annotation layers more manageable (or palatable)</div><div>than the approach taken for e.g. MUC (where you have both NER and</div><div>coreference info annotated on the same base data, but distributed</div>

<div>as joint SGML).</div><div>Other people probably know more about this, but the Alembic Workbench</div><div>(from 1997, no less) allows you to create standoff annotation separately</div><div>from the document itself (and its markup):</div>

<div><meta http-equiv="content-type" content="text/html; charset=utf-8"><a href="http://www.timeml.org/site/terqas/alembic/AWB-ptf-format.html">http://www.timeml.org/site/terqas/alembic/AWB-ptf-format.html</a></div><div><br>

</div><div>Best,</div><div>Yannick</div><div><div><br><div class="gmail_quote">On Fri, Nov 19, 2010 at 12:56 PM, Piotr Bański <span dir="ltr"><<a href="mailto:bansp@o2.pl">bansp@o2.pl</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

Don't forget about ATLAS (Steven Bird, Mark Liberman; [1], look around<br>

1999/2000). It is also necessary to mention the work by Nancy Ide since<br>

the Corpus Encoding Standard[2], then, in the context of ISO TC 37 SC 4,<br>

mostly with Laurent Romary and Keith Suderman [3]. Some attempt at<br>

putting order into these notions has been made in Goecke et al., 2010 [4].<br>

<br>

Good luck,<br>

<br>

  Piotr<br>

<br>

[1]:<br>

<a href="http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/l/Liberman:Mark.html" target="_blank">http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/l/Liberman:Mark.html</a><br>

[2]: <a href="http://www.cs.vassar.edu/CES/" target="_blank">http://www.cs.vassar.edu/CES/</a><br>

[3]: <a href="http://www.cs.vassar.edu/~ide/pubs.html" target="_blank">http://www.cs.vassar.edu/~ide/pubs.html</a><br>

[4]: Goecke, D., Metzing, D., Lüngen, H., Stührenberg, M., Witt, A.<br>

(2010). Different views on markup. distinguishing levels and layers. In<br>

Linguistic modeling of information and markup languages. Contributions<br>

to language technology. Springer Netherlands, pp. 1–21.<br>

<div class="im"><br>

On 2010-11-18 14:06, Philippe Blache wrote:<br>

> Hi Karen,<br>

> I don't know whether the idea comes from there, but it belongs to the NXT data model:<br>

><br>

> J. Carletta, S. Evert, U. Heid, J. Kilgour, J. Robertson, H. Voormann<br>

> "The NITE XML Toolkit: Flexible annotation for multimodal language data"<br>

> Behavior Research Methods, Instruments, & Computers 2003, 35 (3), 353-363<br>

><br>

><br>

> Philippe<br>

><br>

><br>

> Le 18 nov. 2010 à 11:04, Karen Fort a écrit :<br>

><br>

>> Dear members,<br>

>><br>

>> I'm looking for a reference on annotations layers.<br>

>> Can somebody tell me when the "idea" of annotation layers appeared?<br>

>><br>

>> I suppose it came from the speech community, but I cannot find a clear reference on that.<br>

>><br>

>> Thank you for your help!<br>

<br>

</div><div><div></div><div class="h5">_______________________________________________<br>

Corpora mailing list<br>

<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>

<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>

</div></div></blockquote></div><br></div></div>