21.922, Software: PANL10n Releases Linguistic Resources and Software

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Wed Feb 24 17:10:10 UTC 2010


LINGUIST List: Vol-21-922. Wed Feb 24 2010. ISSN: 1068 - 4875.

Subject: 21.922, Software: PANL10n Releases Linguistic Resources and Software

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Monica Macaulay, U of Wisconsin-Madison  
Eric Raimy, U of Wisconsin-Madison  
Joseph Salmons, U of Wisconsin-Madison  
Anja Wanner, U of Wisconsin-Madison  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Susanne Vejdemo <susanne at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.

===========================Directory==============================  

1)
Date: 24-Feb-2010
From: Sarmad Hussain < sarmad.hussain at nu.edu.pk >
Subject: PANL10n Releases Linguistic Resources and Software
 

	
-------------------------Message 1 ---------------------------------- 
Date: Wed, 24 Feb 2010 12:09:04
From: Sarmad Hussain [sarmad.hussain at nu.edu.pk]
Subject: PANL10n Releases Linguistic Resources and Software

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=21-922.html&submissionid=2612514&topicid=13&msgnumber=1
  

Editor's note: This issue contains non-ISO-8859-1 characters.
To view the correct characters, go to http://linguistlist.org/issues/21/21-922.html.


PAN Localization project (http://www.panl10n.net) has been a regional
initiative promoting language technology across developing Asia. The
project, initiated in 2003, has developed and disseminated computing
solutions for Bahasa Indonesia, Bangla, Dzongkha, Khmer, Lao, Mongolian,
Nepali, Pashto, Sinhala, Tamil Tibetan and Urdu. 

On the occasion of the eleventh International Mother Language Day, 21st
February 2010, PAN Localization project is pleased to release its research,
technology and resources through its website.

This project has been carried out with collaboration of IDRC, Canada
(www.idrc.ca), National University of Computer and Emerging Sciences,
Pakistan (www.crulp.org; www.nu.edu.pk) and the following partner
organizations:

- Afghan Computer Science Association, Afghanistan
- BRAC University and Development Research Network, Bangladesh
- Department of IT, Bhutan
- Ministry of Education, Youth and Sport, Institute of Technology, and
National ICT Development Authority, Cambodia
- Tibet University, Institute of Science and Technology, Tibet Academy of
Agricultural and Animal Husbandry Sciences, China
- University of Indonesia, Agency for the Assessment and Application of
Technology, Indonesia
- National Authority for Science and Technology, Laos
- InfoCon Co. Ltd., Mongolian University of Science and Technology and
National University of Mongolia, Mongolia
- Madan Puraskar Pustakalaya, and E-Network Research and Development, Nepal
- University of Colombo School of Computing, Sri Lanka

Salient Outputs (and more ? on the project website: http://www.panl10n.net/):

Bahasa Indonesia
Statistical Machine Translation (Awarded), English-Bahasa Parallel Corpus
(1 Million words), POS Tagged Bahasa Corpus (500,000 words), Part of Speech
Tagset and Tagger

Bangla
Text to Speech System (Awarded), Optical Character Recognition System
(Shortlisted for Award), Bangla Pad, Spell Checker, Lexicon, Language Table
for IDNs, Part of Speech Tagset and Tagger, Wordnet (1000 words), Tagged
Corpus (5 Million words), English-Bangla Parallel Corpus

Dzongkha
DzongkhaLinux, Optical Character Recognition System, Language Table for
IDNs, Part of Speech Tagset, Corpus (600,000 words), Lexicon (23,000
words), Text to Speech System (prototype), Dzongkha Terminology, Collation,
Locale, Fonts and Keyboard

Khmer
Optical Character Recognition System, Java Applications and OpenOffice.org
Plug-ins for Collation, Encoding Conversion, Word Segmentation, Locale,
Mobile SMS, Language Table for IDNs, Part of Speech Tagset and Tagger,
Lexicon, Text to Speech System (prototype), Tagged Corpus (150,000 words),
Online Khmer Content on Veticar.com

Lao
Optical Character Recognition System, OpenOffice.org and MS Office Plug-in
for Word Segmentation, Collation, Spell Checker, Lao Pad, Fonts, Keyboard,
Language Table for IDNs, Part of Speech Tagset, POS Tagged Corpus, Parallel
Corpus (37,000 words), Online Lao Content

Mongolian
Part of Speech Tagset and Tagger, Spell Checker, Corpus (1,000,000 words),
Tagged Corpus (100,000 words), Lexicon (10,000 words), Automatic Speech
Recognition, Localization of Pidgin and SeaMonkey

Nepali
NepaLinux (Awarded), Spell Checker, Grammar Checker, Parallel Corpus
(100,000 words), Tagged Corpus (80,000 words), Lexicon (37,000 words),
Optical Character Recognition System (prototype)

Pashto
Localized SeaMonkey (Awarded), Keyboard, Fonts

Sinhala & Tamil
Sinhala Optical Character Recognition System, Sinhala Text to Speech System
(Awarded), Screen Reader for Sinhala for Blind, Language Learning Tool for
Tamil in Sinhala and English, Sinhala Wordnet, Localized OpenTM, Collation
Standard, Encoding Conversion tool 
Linguistic Field(s): Computational Linguistics
                     General Linguistics
                     Translation
                     Writing Systems

Subject Language(s): Dzongkha (dzo)
                     Indonesian (ind)
                     Mongolian, Halh (khk)
                     Khmer, Central (khm)
                     Samba Leko (ndi)
                     Nepali (nep)
                     Lao (lao)
                     Pashto, Central (pst)
                     Sinhala (sin)
                     Tamil (tam)
                     Urdu (urd)




-----------------------------------------------------------
LINGUIST List: Vol-21-922	

	



More information about the LINGUIST mailing list