13.511, FYI: Master's in Computational Ling, New Corpora
LINGUIST List
linguist at linguistlist.org
Sat Feb 23 20:54:50 UTC 2002
LINGUIST List: Vol-13-511. Sat Feb 23 2002. ISSN: 1068-4875.
Subject: 13.511, FYI: Master's in Computational Ling, New Corpora
Moderators: Anthony Aristar, Wayne State U.<aristar at linguistlist.org>
Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
Andrew Carnie, U. of Arizona <carnie at linguistlist.org>
Reviews (reviews at linguistlist.org):
Simin Karimi, U. of Arizona
Terence Langendoen, U. of Arizona
Editors (linguist at linguistlist.org):
Karen Milligan, WSU Naomi Ogasawara, EMU
James Yuells, EMU Marie Klopfenstein, WSU
Michael Appleby, EMU Heather Taylor-Loring, EMU
Ljuba Veselinova, Stockholm U. Richard John Harvey, EMU
Dina Kapetangianni, EMU Renee Galvis, WSU
Karolina Owczarzak, EMU
Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>
Home Page: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Marie Klopfenstein <marie at linguistlist.org>
=================================Directory=================================
1)
Date: Fri, 22 Feb 2002 09:41:56 GMT
From: jpmg at eng.cam.ac.uk
Subject: Graduate Course: COMPUTER SPEECH, TEXT, AND INTERNET TECHNOLOGY
2)
Date: Fri, 22 Feb 2002 11:58:45 -0500
From: LDC Office <ldc at ldc.upenn.edu>
Subject: New LDC Corpora
-------------------------------- Message 1 -------------------------------
Date: Fri, 22 Feb 2002 09:41:56 GMT
From: jpmg at eng.cam.ac.uk
Subject: Graduate Course: COMPUTER SPEECH, TEXT, AND INTERNET TECHNOLOGY
*** ONE YEAR GRADUATE COURSE ***
** STUDENTSHIPS AVAILABLE **
- ----------------------------------------------------------------------
COMPUTER SPEECH, TEXT, AND INTERNET TECHNOLOGY
ONE YEAR MASTERS COURSE
THE DEPARTMENT OF ENGINEERING AND THE COMPUTER LABORATORY
UNIVERSITY OF CAMBRIDGE
- ----------------------------------------------------------------------
This new Masters course has replaced the highly successful M.Phil in
Computer Speech and Language Processing.
Like its predecessor, a key aim of the new Masters course in
Computer Speech, Text and Internet Technology is to teach the
fundamental theory of speech and natural language processing.
However, the new course also focuses on its application to
information management and access within the framework of emerging
Internet and W3C standards, such as XML text and speech annotation.
It runs from early October to August and consists of two terms of
lectures and practicals followed by a three month project. The
final degree is awarded on the basis of coursework, examination
and project.
The course differs from some other programmes by providing an
in-depth practical and theoretical grounding in the techniques for
speech and language processing which form the basis for today's
commercial and research prototype systems. There are strong links
with industry and many of our past students have gone on to work
for high-tech start-ups and industrial research laboratories, either
immediately or after completing a PhD.
To further strengthen our links with industry, we are making this
course available to students wishing to pursue it on a part-time
basis. We currently have two part-time students in their first year
and strongly encourage others to apply. (Note that part-time enrolment
requires attendance in Cambridge 1+1/2 days / week during term time.)
Cambridge is a major international centre for research in both
speech and language processing. The course is taught by leading
researchers in these areas who have active collaborations with
industrial and academic laboratories in Europe, the US and Japan.
The EPSRC have funded a number of studentships for the course which
are currently available to qualifying applicants. We especially
encourage applications from students with a background in
engineering, computer science, mathematics, linguistics and/or
psychology.
For further details please consult the course URL:
http://svr-www.eng.cam.ac.uk/cstit/
or contact:
Mrs Mavis Barber (Computer Speech, Text, and Internet Technology)
Department of Engineering, University of Cambridge
Trumpington Street, Cambridge CB2 1PZ, UK
Tel: +44-1223-332752
Fax: +44-1223-332662
Email: cstit-enquiries at eng.cam.ac.uk
- ----------------------------------------------------------------------
-------------------------------- Message 2 -------------------------------
Date: Fri, 22 Feb 2002 11:58:45 -0500
From: LDC Office <ldc at ldc.upenn.edu>
Subject: New LDC Corpora
* RST Discourse Treebank *
* Multiple-Translation Chinese Corpus *
The Linguistic Data Consortium (LDC) is pleased to announce the
availability of the RST Discourse Treebank. This ftp publication
has been authored by Lynn Carlson, Daniel Marcu, and Mary Ellen
Okurowski. It contains a selection of 385 Wall Street Journal articles
from the Penn Treebank which have been annotated with discourse
structure in the framework of Rhetorical Structure Theory (RST).
Additionally, the
corpus includes a number of human generated extracts and abstracts
associated with the original documents.
For further information, including a link to the discourse annotation
tool used for this database, please visit:
http://www.ldc.upenn.edu/Catalog/LDC2002T07.html
Institutions that have membership in the LDC during the 2002
Membership Year will be able to receive this corpus free of charge.
Nonmembers may purchase this publication for $100.
*
The Linguistic Data Consortium (LDC) would like to announce the
availability of the Multiple-Translation Chinese Corpus. This ftp
publication was designed to support the development of automatic means
for evaluating translation quality. The corpus consists of 105 stories
drawn from Mandarin Chinese journalistic text. These stories were
translated several times into English by both human translators and MT
systems.
For further information, including a Chinese text with a sample English
translation, please visit:
http://www.ldc.upenn.edu/Catalog/LDC2002T01.html
Institutions that have membership in the LDC during the 2002
Membership Year will be able to receive this corpus free of charge.
Nonmembers may purchase this publication for $400.
*
If you need additional information before placing your order, or
would like to inquire about membership in the LDC, please send email to
<ldc at ldc.upenn.edu> or call (215) 573-1275.
- ------------------------------------------------------------------
Linguistic Data Consortium Phone: (215) 573-1275
3615 Market Street Fax: (215) 573-2175
Suite 200 email: ldc at unagi.cis.upenn.edu
Philadelphia, PA 19104-2608 www: http://www.ldc.upenn.edu
---------------------------------------------------------------------------
LINGUIST List: Vol-13-511
More information about the LINGUIST
mailing list