<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
Thanks Martin and Mikhail!<br>
I'll be checking out your references.<br>
<br>
Ivelina<br>
<br>
<br>
<div class="moz-cite-prefix">On 8.11.2012 at 01:28, Mikhail
Kozhevnikov wrote:<br>
</div>
<blockquote
cite="mid:CAM2hxrcPjfRcK36nQujh0WB_2_nJWdHvutyXYyvQ_=AWVWJpPw@mail.gmail.com"
type="cite">Dear Martin,<br>
<br>
To my knowledge, even the parts already annotated are not available
yet, as the data has not been officially released. I tried to
obtain the SRL annotations described in <a moz-do-not-send="true"
href="http://lt3.hogent.be/media/uploads/publications/2012/FinalSRL.pdf"
target="_blank">this paper</a> at the end of September and got
the following reply:<br>
<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">The
SRL annotations are not part of the second release of the
intermediate SoNaR results. The final release will comprise SRL
annotations: a 500K corpus that has been automatically labeled
and a 500K corpus that has been completely manually verified.<br>
We do not know when the final release will be available, since
the project is still not officially closed: an evaluation has
shown that some alterations need to be made and documentation
needs to be added. We cannot start distribution before the
official end of the project.</blockquote>
<div><br>
</div>
<div>I too would be very interested in any new information
concerning the release date or (partial) availability of the
data.<br>
<br>
Regards,<br>
Mikhail</div>
<div> </div>
<div class="gmail_extra"><br>
<div class="gmail_quote">
On Wed, Nov 7, 2012 at 9:28 PM, Martin Reynaert <span
dir="ltr">&lt;<a moz-do-not-send="true"
href="mailto:reynaert@uvt.nl" target="_blank">reynaert@uvt.nl</a>&gt;</span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">Dear
Ivelina,<br>
<br>
For Dutch we now have the SoNaR-500 corpus (currently about
540 million word tokens of contemporary written Dutch,
automatically annotated) and the SoNaR-1 corpus (about 1
million word tokens of contemporary written Dutch, largely
manually annotated for semantics).<br>
<br>
For Named Entity Recognition, SoNaR-500 was labeled
automatically with a Support Vector Machine tool (called
'NERD' for 'Named Entity Recognition for Dutch', developed at
LT3, Ghent University, by Bart Desmet), which was trained on
the NEs manually labeled in SoNaR-1.<br>
<br>
To acquire the corpus, please enquire at the Dutch HLT
Agency:<br>
<br>
<a moz-do-not-send="true"
href="http://www.inl.nl/tst-centrale/" target="_blank">http://www.inl.nl/tst-centrale/</a><br>
<br>
The full corpus may not be available in its entirety yet, but
it should be soon, and you can at least sort out the licensing
at this stage. In fact, I am currently still curating parts of
its metadata.<br>
<br>
Best,<br>
<br>
Martin
<div>
<div><br>
<br>
<br>
<br>
<br>
On 11/07/2012 06:23 PM, Ivelina Nikolova wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">
On 11/07/2012 05:49 PM, Alberto Lavelli wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">
The CoNLL 2002 shared task concerned Named Entity
Recognition for<br>
Spanish and Dutch.<br>
You can find information about the CoNLL series
here:<br>
<br>
<a moz-do-not-send="true"
href="http://ifarm.nl/signll/conll/"
target="_blank">http://ifarm.nl/signll/conll/</a><br>
<br>
Hope this helps<br>
</blockquote>
<br>
Thanks Alberto!<br>
I received several references to this shared-task corpus
in particular. It seems to be the most widely used one.<br>
<br>
Best,<br>
Ivelina<br>
<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
alberto<br>
<br>
<br>
On Wed, Nov 07, 2012 at 04:13:07PM +0200, Ivelina
Nikolova wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">
Dear Corpora Members,<br>
<br>
I am searching for corpora in Dutch with Named
Entity annotations.<br>
I'm interested in Person, Location, Organization
and Event mentions.<br>
Do you have any suggestions on that?<br>
<br>
Thank you very much!<br>
Ivelina<br>
<br>
-- <br>
Ivelina Nikolova<br>
PhD student in Computer Science<br>
Linguistic Modelling Department<br>
Institute of Information and Communication
Technologies<br>
Bulgarian Academy of Sciences<br>
<br>
<br>
_______________________________________________<br>
UNSUBSCRIBE from this page: <a
moz-do-not-send="true"
href="http://mailman.uib.no/options/corpora"
target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a moz-do-not-send="true"
href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
<a moz-do-not-send="true"
href="http://mailman.uib.no/listinfo/corpora"
target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</blockquote>
</blockquote>
<br>
<br>
</blockquote>
<br>
<br>
</div>
</div>
</blockquote>
</div>
<br>
</div>
<br>
</blockquote>
<br>
</body>
</html>