21.1167, Diss: Comp Ling: Mishra: 'Sanskrit Karaka Analyzer for Machine...'
linguist at LINGUISTLIST.ORG
Tue Mar 9 21:31:38 UTC 2010
LINGUIST List: Vol-21-1167. Tue Mar 09 2010. ISSN: 1068 - 4875.
Subject: 21.1167, Diss: Comp Ling: Mishra: 'Sanskrit Karaka Analyzer for Machine...'
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Monica Macaulay, U of Wisconsin-Madison
Eric Raimy, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Di Wdzenczny <di at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.
===========================Directory==============================
1)
Date: 08-Mar-2010
From: Sudhir Mishra < sudhirkumarmishra at gmail.com >
Subject: Sanskrit Karaka Analyzer for Machine Translation
-------------------------Message 1 ----------------------------------
Date: Tue, 09 Mar 2010 16:29:15
From: Sudhir Mishra [sudhirkumarmishra at gmail.com]
Subject: Sanskrit Karaka Analyzer for Machine Translation
Institution: Jawaharlal Nehru University
Program: Ph.D. in Linguistics
Dissertation Status: Completed
Degree Date: 2007
Author: Sudhir K. Mishra
Dissertation Title: Sanskrit Karaka Analyzer for Machine Translation
Dissertation URL: http://sites.google.com/site/mishrasudhirk/education
Linguistic Field(s): Computational Linguistics
Dissertation Director(s):
Dissertation Abstract:
The present research and development (R&D) work describes the
research that went into developing a karaka analyzer for laukika
Sanskrit text. The system, called KAS (Karaka Analyzer for Sanskrit),
is a partial implementation and is live at
http://sanskrit.jnu.ac.in/karaka/analyzer.jsp. Such an analyzer will be
required for any processing of Sanskrit in larger Natural Language
Processing (NLP) tools. KAS takes Unicode Devanagari text, free of
sandhi and samasa, as input and returns karaka-analyzed text as output.
Developing an M(A)TS with Sanskrit as the Source Language (SL) is both
important and challenging. It is important because Sanskrit is the only
language in India that can truly be considered a 'donor' language:
the vast knowledge reserves in Sanskrit can be 'transferred' (read:
translated) to other Indian languages with the help of computers. Having
Sanskrit as the SL is challenging because of the difficulty of parsing
it, owing to its synthetic nature, in which a single word (sentence) can
run up to 32 pages (Banabhatta's Kadambari). Therefore, a processing
system for Sanskrit is critical for any further work on the language in
this area.
-----------------------------------------------------------
LINGUIST List: Vol-21-1167