<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=utf-8"><meta name=Generator content="Microsoft Word 14 (filtered medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri","sans-serif";
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-GB link=blue vlink=purple><div class=WordSection1><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>Dear Hamid,<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>At the European Commission’s <i>Joint Research Centre</i> (JRC), we have developed the <i>Europe Media Monitor</i> (EMM) family of applications (<a href="http://emm.newsbrief.eu/overview.html">http://emm.newsbrief.eu/overview.html</a>), which includes Farsi. <o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>EMM collects Farsi news (together with news in some 50 other languages) and displays it in EMM-NewsBrief and in EMM-MedISys (Medical Information System). If you go to ‘advanced search’, you can display all the news sources monitored. Farsi news items are then classified according to the many EMM categories and displayed together with matching items in the other languages, where any are found.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>In EMM-NewsExplorer (<a href="http://emm.newsexplorer.eu/NewsExplorer/home/fa/latest.html">http://emm.newsexplorer.eu/NewsExplorer/home/fa/latest.html</a>), we display the biggest news cluster of any given calendar day (for 20 languages, including Farsi), together with the information we manage to extract. We aim to extract entities (person and organisation names), geo-locations and quotations. 
We also try to link the Farsi news to those in (a subset of) other languages and to the news published in previous days.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>NewsExplorer also collects information found on entities over time and in many languages, and it displays this information on mixed-language pages (e.g. <a href="http://emm.newsexplorer.eu/NewsExplorer/entities/en/101358.html">http://emm.newsexplorer.eu/NewsExplorer/entities/en/101358.html</a> for Mahmoud Ahmadinejad).<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>I do not think our Farsi information extraction tools work particularly well, but we intend to put some more effort into the Farsi tools soon.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>For an overview of the EMM applications, you can read:<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal style='margin-left:36.0pt'><span lang=DE>Steinberger Ralf, Bruno Pouliquen & Erik van der Goot (2009). </span><a href="http://langtech.jrc.ec.europa.eu/Documents/09_SIGIR-WS_Steinberger+frontmatter.pdf" target="_new" title="Overview article presenting the Europe Media Monitor Family of Applications, invited talk at a SIGIR workshop">An introduction to the Europe Media Monitor Family of Applications</a>. 
In: Fredric Gey, Noriko Kando & Jussi Karlgren (eds.): Information Access in a Multilingual World - Proceedings of the SIGIR 2009 Workshop (SIGIR-CLIR'2009), pp. 1-8. Boston, USA. 23 July 2009.<span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>Greetings, currently from LREC in Istanbul, and best wishes for your interesting effort.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>Ralf<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><b><span lang=DE style='font-size:9.0pt;font-family:"Calibri","sans-serif";color:#4A442A'>Ralf Steinberger</span></b><span lang=DE style='font-size:9.0pt;font-family:"Calibri","sans-serif";color:#4A442A'> <o:p></o:p></span></p><p class=MsoNormal><span lang=EN-US style='font-size:9.0pt;font-family:"Calibri","sans-serif";color:#4A442A'>European Commission – Joint Research Centre (JRC)<o:p></o:p></span></p><p class=MsoNormal><span lang=EN-US style='font-size:9.0pt;font-family:"Calibri","sans-serif";color:#4A442A'>URL of the lab: <a href="http://langtech.jrc.ec.europa.eu/">http://langtech.jrc.ec.europa.eu/</a> <o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><b><span lang=EN-US 
style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>From:</span></b><span lang=EN-US style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'> corpora-bounces@uib.no [mailto:corpora-bounces@uib.no] <b>On Behalf Of </b>Hamid Reza Ghader<br><b>Sent:</b> 24 May 2012 10:01<br><b>To:</b> corpora@uib.no<br><b>Subject:</b> [Corpora-List] NLP labs that have active projects on Persian<o:p></o:p></span></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Dear scientists,<br><br>We are compiling a list of all NLP labs around the world that have active projects on the Persian language. I would therefore like to ask you all to send me your lab name and homepage address if your lab has any project related to the Persian language. I would appreciate it if you could also provide a brief description of your Persian-related project.<br><br>Regards,<br>Hamidreza Ghader<br>Natural language and Text processing Laboratory<br>School of Electrical and Computer Engineering<br>University of Tehran<br>Iran<br><a href="http://ece.ut.ac.ir/nlp/" target="_blank">http://ece.ut.ac.ir/nlp/</a> <o:p></o:p></p></div></body></html>