<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">


<head>

<meta http-equiv=Content-Type content="text/html; charset=us-ascii">

<meta name=Generator content="Microsoft Word 12 (filtered medium)">

<style>

<!--

 /* Font Definitions */

 @font-face

        {font-family:Calibri;

        panose-1:2 15 5 2 2 2 4 3 2 4;}

 /* Style Definitions */

 p.MsoNormal, li.MsoNormal, div.MsoNormal

        {margin:0cm;

        margin-bottom:.0001pt;

        font-size:12.0pt;

        font-family:"Times New Roman","serif";}

a:link, span.MsoHyperlink

        {mso-style-priority:99;

        color:blue;

        text-decoration:underline;}

a:visited, span.MsoHyperlinkFollowed

        {mso-style-priority:99;

        color:purple;

        text-decoration:underline;}

span.E-MailFormatvorlage17

        {mso-style-type:personal-compose;

        font-family:"Times New Roman","serif";

        color:windowtext;

        font-weight:normal;

        font-style:normal;}

.MsoChpDefault

        {mso-style-type:export-only;

        font-size:10.0pt;}

@page Section1

        {size:612.0pt 792.0pt;

        margin:70.85pt 70.85pt 2.0cm 70.85pt;}

div.Section1

        {page:Section1;}

-->

</style>

<!--[if gte mso 9]><xml>

 <o:shapedefaults v:ext="edit" spidmax="1026" />

</xml><![endif]--><!--[if gte mso 9]><xml>

 <o:shapelayout v:ext="edit">

  <o:idmap v:ext="edit" data="1" />

 </o:shapelayout></xml><![endif]-->

</head>


<body lang=DE link=blue vlink=purple>


<div class=Section1>


<p class=MsoNormal><span lang=EN-GB>Dear Corpora List-members,<o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>


<p class=MsoNormal><span lang=EN-GB>For my PhD, I am searching for a corpus

with annotated definite, indefinite and generic noun phrases in German (or

English). <o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>


<p class=MsoNormal><span lang=EN-GB>Do you know some corpora and treebanks

preferably in German, where definite, indefinite and generic noun phrases are

annotated? <o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-GB>I already know the TIGER treebank, but

there are two problems with TIGER. First TIGER does not distinguish between

definite and indefinite articles or definite, indefinite and generic noun

phrases, so I would have to annotate on my own. Second the TIGER corpus only

contains articles of newspapers. The amount of generic noun phrases in the

TIGER corpus is much too small. A corpus which contains other types of texts

(i.e. encyclopaedic entries, probably wikipedia) would be better for my

research, because in such texts there are much more generic noun phrases. <o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>


<p class=MsoNormal><span lang=EN-GB>If you know a corpus that probably helps me

or have a hint, please answer. I am looking forward to hear from you. <o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>


<p class=MsoNormal><span lang=EN-GB>Angela Klutsch<o:p></o:p></span></p>


<p class=MsoNormal style='margin-bottom:12.0pt'><span lang=EN-US><o:p> </o:p></span></p>


<p class=MsoNormal><span lang=EN-US>-----<br>

Dipl.-Inform. Angela Klutsch<br>

Computational Linguistics<br>

University of Duisburg-Essen<br>

Lotharstr. 65, LF 116<o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-US>D-47057 Duisburg<br>

E-Mail: </span><a href="mailto:angela.klutsch@uni-due.de"><span lang=EN-US>angela.klutsch@uni-due.de</span></a>

<span lang=EN-US><o:p></o:p></span></p>


<p class=MsoNormal><span lang=EN-US><o:p> </o:p></span></p>


</div>


</body>


</html>