Albretch,<div><br></div><div>I just wrote up one perspective on corpus comparison, see</div><div><br></div><div><a class="attachment" href="http://trac.sketchengine.co.uk/attachment/wiki/AK/Papers/Kilgarriff_TSD2012.pdf?format=raw" title="Attachment 'Kilgarriff_TSD2012.pdf' in AK/Papers" style="text-decoration:none;color:rgb(51,113,186);border-bottom-width:1px;border-bottom-style:dotted;border-bottom-color:rgb(187,187,187);font-family:Verdana,sans-serif;font-size:12px;line-height:16px;background-color:rgb(255,255,255)">Getting to know your corpus</a><span class="noprint" style="font-family:Verdana,sans-serif;font-size:12px;line-height:16px;background-color:rgb(255,255,255)"> <font color="#000000"> To appear<i> </i></font><i>in: Proc. Text, Speech, Dialogue (TSD 2012)</i></span></div>
<div><span class="noprint" style="font-family:Verdana,sans-serif;font-size:12px;line-height:16px;background-color:rgb(255,255,255)"><i><br></i></span></div><div><font face="Verdana, sans-serif"><span style="font-size:12px;line-height:16px">(Note: this is about the corpora themselves, not the software or the markup. It's about what we ought to think about, as opposed to what people in the NLP community tend to think about :) Your questions mixed up these three issues.)</span></font></div>
<div><font face="Verdana, sans-serif"><span style="font-size:12px;line-height:16px"><br></span></font></div><div><span class="noprint" style="font-family:Verdana,sans-serif;font-size:12px;line-height:16px;background-color:rgb(255,255,255)">Adam<br>
</span><br><div class="gmail_quote">On 6 July 2012 05:53, Albretch Mueller <span dir="ltr">&lt;<a href="mailto:lbrtchx@gmail.com" target="_blank">lbrtchx@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> Professor Nancy Ide<br>
~<br>
<a href="http://www.cs.vassar.edu/~ide/" target="_blank">http://www.cs.vassar.edu/~ide/</a><br>
~<br>
has worked on such matters as part of the CES (Corpus Encoding Standard)<br>
~<br>
<a href="http://www.cs.vassar.edu/~ide/papers/ces.wvlc.pdf" target="_blank">http://www.cs.vassar.edu/~ide/papers/ces.wvlc.pdf</a><br>
~<br>
<a href="http://www.cs.vassar.edu/CES/" target="_blank">http://www.cs.vassar.edu/CES/</a><br>
~<br>
She talks about "inter-textual pointers", but only some of my points<br>
were addressed<br>
~<br>
Any other recent papers you guys know of? Any papers comparing<br>
corpus features? (Google doesn't turn up much.)<br>
<div><div>~<br>
lbrtchx<br>
<br>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br>========================================<br><a href="http://www.kilgarriff.co.uk/" target="_blank">Adam Kilgarriff</a> <a href="mailto:adam@lexmasterclass.com" target="_blank">adam@lexmasterclass.com</a> <br>
Director <a href="http://www.sketchengine.co.uk/" target="_blank">Lexical Computing Ltd</a> <br>Visiting Research Fellow <a href="http://leeds.ac.uk" target="_blank">University of Leeds</a> <div>
<i><font color="#006600">Corpora for all</font></i> with <a href="http://www.sketchengine.co.uk" target="_blank">the Sketch Engine</a> </div><div> <i><a href="http://www.webdante.com" target="_blank">DANTE: <font color="#009900">a lexical database for English</font></a><font color="#009900"> </font> </i><div>
========================================</div></div><br>
</div>