<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.E-MailFormatvorlage17
{mso-style-type:personal-compose;
font-family:"Times New Roman","serif";
color:windowtext;
font-weight:normal;
font-style:normal;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 2.0cm 70.85pt;}
div.Section1
{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=DE link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><span lang=EN-GB>Dear Corpora List-members,<o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-GB>For my PhD, I am searching for a corpus
with annotated definite, indefinite and generic noun phrases in German (or
English). <o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-GB>Do you know some corpora and treebanks
preferably in German, where definite, indefinite and generic noun phrases are
annotated? <o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-GB>I already know the TIGER treebank, but
there are two problems with TIGER. First TIGER does not distinguish between
definite and indefinite articles or definite, indefinite and generic noun
phrases, so I would have to annotate on my own. Second the TIGER corpus only
contains articles of newspapers. The amount of generic noun phrases in the
TIGER corpus is much too small. A corpus which contains other types of texts
(i.e. encyclopaedic entries, probably wikipedia) would be better for my
research, because in such texts there are much more generic noun phrases. <o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-GB>If you know a corpus that probably helps me
or have a hint, please answer. I am looking forward to hear from you. <o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-GB><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-GB>Angela Klutsch<o:p></o:p></span></p>
<p class=MsoNormal style='margin-bottom:12.0pt'><span lang=EN-US><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-US>-----<br>
Dipl.-Inform. Angela Klutsch<br>
Computational Linguistics<br>
University of Duisburg-Essen<br>
Lotharstr. 65, LF 116<o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-US>D-47057 Duisburg<br>
E-Mail: </span><a href="mailto:angela.klutsch@uni-due.de"><span lang=EN-US>angela.klutsch@uni-due.de</span></a>
<span lang=EN-US><o:p></o:p></span></p>
<p class=MsoNormal><span lang=EN-US><o:p> </o:p></span></p>
</div>
</body>
</html>