<div dir="ltr">The 2009 version of the corpus is searchable at <a href="https://the.sketchengine.co.uk/open/">https://the.sketchengine.co.uk/open/</a>, also we did a bit of tidying up to solve the problems you mention<div>

<div><br></div><div>Adam</div></div></div><div class="gmail_extra"><br><br><div class="gmail_quote">On 25 November 2013 22:14, Stephan Oepen <span dir="ltr"><<a href="mailto:oe@ifi.uio.no" target="_blank">oe@ifi.uio.no</a>></span> wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">hi christian,<br>
<div class="im"><br>
> I am looking for an ACL Anthology corpus which contains the extracted<br>
> full-texts of ACL papers (for example as textfile or xml file).<br>
<br>
</div>please see the following reference for a summary of a<br>
2012 community effort in this direction:<br>
<br>
  <a href="http://aclweb.org/anthology//W/W12/W12-3210.pdf" target="_blank">http://aclweb.org/anthology//W/W12/W12-3210.pdf</a><br>
<br>
the paper provides access information for two sets of<br>
full-text documents, including some logical structure,<br>
extracted from large parts of the ACL Anthology:<br>
<br>
  <a href="http://www.delph-in.net/aac" target="_blank">http://www.delph-in.net/aac</a><br>
<br>
we are aware of many remaining issues, but this may<br>
be a useful starting point for you, nevertheless?<br>
<br>
best wishes, oe<br>
<br>
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++<br>
+++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; <a href="tel:%28%2B47%29%202284%200125" value="+4722840125">(+47) 2284 0125</a><br>
+++    --- <a href="mailto:oe@ifi.uio.no">oe@ifi.uio.no</a>; <a href="mailto:stephan@oepen.net">stephan@oepen.net</a>; <a href="http://www.emmtee.net/oe/" target="_blank">http://www.emmtee.net/oe/</a> ---<br>
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++<br>
<div class="HOEnZb"><div class="h5"><br>
_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br>========================================<br><a href="http://www.kilgarriff.co.uk/" target="_blank">Adam Kilgarriff</a>                  <a href="mailto:adam@lexmasterclass.com" target="_blank">adam@lexmasterclass.com</a>                                             <br>

Director                                    <a href="http://www.sketchengine.co.uk/" target="_blank">Lexical Computing Ltd</a>                <br>Visiting Research Fellow                 <a href="http://leeds.ac.uk" target="_blank">University of Leeds</a>     <div>

<i><font color="#006600">Corpora for all</font></i> with <a href="http://www.sketchengine.co.uk" target="_blank">the Sketch Engine</a>                 </div><div>                        <i><a href="http://www.webdante.com" target="_blank">DANTE: <font color="#009900">a lexical database for English</font></a><font color="#009900"> </font>                 </i><div>

========================================</div></div>
</div>