<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    Thanks Martin and Mikhail!<br>

    I'll be checking out your references.<br>

    <br>

    Ivelina<br>

    <br>

    <br>

    <div class="moz-cite-prefix">На 8.11.2012 г. 01:28 ч., Mikhail

      Kozhevnikov написа:<br>

    </div>

    <blockquote

cite="mid:CAM2hxrcPjfRcK36nQujh0WB_2_nJWdHvutyXYyvQ_=AWVWJpPw@mail.gmail.com"

      type="cite">Dear Martin,<br>

      <br>

      To my knowledge even the bits already annotated are not available

      yet, as the data has not been officially released. I've tried to

      obtain the SRL annotations described in <a moz-do-not-send="true"

href="http://lt3.hogent.be/media/uploads/publications/2012/FinalSRL.pdf"

        target="_blank">this paper</a> in the end of September and got

      the following reply:<br>

      <br>

      <blockquote class="gmail_quote" style="margin:0px 0px 0px

0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">The

        SRL annotations are not part of the second release of the

        intermediate SoNaR results. The final release will comprise SRL

        annotations: a 500K corpus that has been automatically labeled

        and a 500K corpus that has been completely manually verified.<br>

        We do not know when the final release will be available, since

        the project is still not officially closed: an evaluation has

        shown that some alterations need to be made and documentation

        needs to be added. We can not start distribution before the

        official ending of the project.</blockquote>

      <div><br>

      </div>

      <div>I too would be very interested in any new information

        concerning the release date or (partial) availability of the

        data.<br>

        <br>

        Regards,<br>

        Mikhail</div>

      <div> </div>

      <div class="gmail_extra"><br>

        <div class="gmail_quote">

          On Wed, Nov 7, 2012 at 9:28 PM, Martin Reynaert <span

            dir="ltr"><<a moz-do-not-send="true"

              href="mailto:reynaert@uvt.nl" target="_blank">reynaert@uvt.nl</a>></span>

          wrote:<br>

          <blockquote class="gmail_quote" style="margin:0 0 0

            .8ex;border-left:1px #ccc solid;padding-left:1ex">Dear

            Ivelina,<br>

            <br>

            For Dutch we now have the SoNaR-500 corpus (currently about

            540 million word tokens of contemporary written Dutch,

            automatically annotated) and the SoNaR-1 corpus (about 1

            million word tokens of contemporary written Dutch, largely

            manually annotated for semantics).<br>

            <br>

            For Named Entity Recognition the Support-Vector Machine tool

            (called 'NERD' for 'Named Entity Recognition for Dutch',

            developed at LT3, Ghent University, by Bart Desmet) used to

            automatically label SoNaR-500 was trained on the NEs

            manually labeled in SoNaR-1.<br>

            <br>

            To acquire the corpus, please enquire at the Dutch HLT

            Agency:<br>

            <br>

            <a moz-do-not-send="true"

              href="http://www.inl.nl/tst-centrale/" target="_blank">http://www.inl.nl/tst-centrale/</a><br>

            <br>

            The full corpus itself may not be fully available yet, but

            should be soon, and you can at least sort out the licensing

            part at this stage. In fact, I am to date curating parts of

            its metadata.<br>

            <br>

            Best,<br>

            <br>

            Martin

            <div>

              <div><br>

                <br>

                <br>

                <br>

                <br>

                On 11/07/2012 06:23 PM, Ivelina Nikolova wrote:<br>

                <blockquote class="gmail_quote" style="margin:0 0 0

                  .8ex;border-left:1px #ccc solid;padding-left:1ex">

                  On 11/07/2012 05:49 PM, Alberto Lavelli wrote:<br>

                  <blockquote class="gmail_quote" style="margin:0 0 0

                    .8ex;border-left:1px #ccc solid;padding-left:1ex">

                    The CoNLL 2002 shared task concerned Named Entity

                    Recognition for<br>

                    Spanish and Dutch.<br>

                    You can find information about the CoNLL series

                    here:<br>

                    <br>

                    <a moz-do-not-send="true"

                      href="http://ifarm.nl/signll/conll/"

                      target="_blank">http://ifarm.nl/signll/conll/</a><br>

                    <br>

                    Hope this helps<br>

                  </blockquote>

                  <br>

                  Thanks Alberto!<br>

                  I got several references to this task corpus

                  especially. It seems to be the most used one.<br>

                  <br>

                  Best,<br>

                  Ivelina<br>

                  <br>

                  <br>

                  <blockquote class="gmail_quote" style="margin:0 0 0

                    .8ex;border-left:1px #ccc solid;padding-left:1ex">

                    <br>

                        alberto<br>

                    <br>

                    <br>

                    On Wed, Nov 07, 2012 at 04:13:07PM +0200, Ivelina

                    Nikolova wrote:<br>

                    <blockquote class="gmail_quote" style="margin:0 0 0

                      .8ex;border-left:1px #ccc solid;padding-left:1ex">

                      Dear Corpora Members,<br>

                      <br>

                      I am searching for corpora in Dutch with Named

                      Entity annotations.<br>

                      I'm interested in Person, Location, Organization

                      and Event mentions.<br>

                      Do you have any suggestions on that?<br>

                      <br>

                      Thank you very much!<br>

                      Ivelina<br>

                      <br>

                      -- <br>

                      Ivelina Nikolova<br>

                      PhD student in Computer Science<br>

                      Linguistic Modelling Department<br>

                      Institute of Information and Communication

                      Technologies<br>

                      Bulgarian Academy of Sciences<br>

                      <br>

                      <br>

                      _______________________________________________<br>

                      UNSUBSCRIBE from this page: <a

                        moz-do-not-send="true"

                        href="http://mailman.uib.no/options/corpora"

                        target="_blank">http://mailman.uib.no/options/corpora</a><br>

                      Corpora mailing list<br>

                      <a moz-do-not-send="true"

                        href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>

                      <a moz-do-not-send="true"

                        href="http://mailman.uib.no/listinfo/corpora"

                        target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>

                    </blockquote>

                  </blockquote>

                  <br>

                  <br>

                </blockquote>

                <br>

                <br>

                _______________________________________________<br>

                UNSUBSCRIBE from this page: <a moz-do-not-send="true"

                  href="http://mailman.uib.no/options/corpora"

                  target="_blank">http://mailman.uib.no/options/corpora</a><br>

                Corpora mailing list<br>

                <a moz-do-not-send="true" href="mailto:Corpora@uib.no"

                  target="_blank">Corpora@uib.no</a><br>

                <a moz-do-not-send="true"

                  href="http://mailman.uib.no/listinfo/corpora"

                  target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>

              </div>

            </div>

          </blockquote>

        </div>

        <br>

      </div>

      <br>

      <fieldset class="mimeAttachmentHeader"></fieldset>

      <br>

      <pre wrap="">_______________________________________________

UNSUBSCRIBE from this page: <a class="moz-txt-link-freetext" href="http://mailman.uib.no/options/corpora">http://mailman.uib.no/options/corpora</a>

Corpora mailing list

<a class="moz-txt-link-abbreviated" href="mailto:Corpora@uib.no">Corpora@uib.no</a>

<a class="moz-txt-link-freetext" href="http://mailman.uib.no/listinfo/corpora">http://mailman.uib.no/listinfo/corpora</a>

</pre>

    </blockquote>

    <br>

  </body>

</html>