<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 11 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:SimSun;
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:"\@SimSun";
panose-1:0 0 0 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0pt;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman";}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{color:purple;
text-decoration:underline;}
p
{mso-margin-top-alt:auto;
margin-right:0pt;
mso-margin-bottom-alt:auto;
margin-left:0pt;
font-size:12.0pt;
font-family:"Times New Roman";}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:Arial;
color:windowtext;}
@page Section1
{size:612.0pt 792.0pt;
margin:72.0pt 90.0pt 72.0pt 90.0pt;}
div.Section1
{page:Section1;}
-->
</style>
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Hi everyone,<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>(It has been pointed out to me that, for some reason, my
message to the list appeared empty in some e-mail systems. Here is a second
try:)<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>The paper: “J. Hajic (2000) Morphological tagging:
Data vs. Dictionaries”, reports percentages of ambiguous tokens for
English (38.65%), Czech (45.97%), Estonian (40.24%), Hungarian (21.58%),
Romanian (40.00%) and Slovene (38.01%), using an annotated version of
Orwell’s 1984 novel for each of these languages.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>I need corresponding percentage number for Swedish, Danish
and Norwegian, calculated using ANY corpora.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Does anyone have this info (and preferably a reference to a
paper which discusses the issue)?<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Regards,<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Hrafn Loftsson<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Assistant professor<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Department of Computer Science<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>School of Science and Engineering<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Reykjavik University<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face=Arial><span lang=IS style='font-size:10.0pt;
font-family:Arial'>Iceland<o:p></o:p></span></font></p>
</div>
</body>
</html>