Corpora: New Corpus from LDC

LDC Office ldc at unagi.cis.upenn.edu
Thu Jul 6 15:23:21 UTC 2000


The Linguistic Data Consortium is pleased to announce
the availability of the Korean Newswire Text Corpus.
The collection contains 143,137 articles collected
from Korean Press Agency during the period of 2 June
1994 through 20 March 2000.  The articles are encoded
in the the KSC-5601 Korean character encoding and
SGML tagging has been added.

Institutions that have membership in the LDC during
the 2000 Membership Year will be able to receive this
corpus free of charge.  Nonmembers may purchase this
collection for $1000.

If you would like to order a copy of this corpus,
please email your request to <ldc at ldc.upenn.edu>. If
you need additional information before placing your
order, or would like to inquire about membership in
the LDC, please send email or call (215) 573-1275.



More information about the Corpora mailing list