29.1462, FYI: Oxford-NINJAL Corpus of Old Japanese (ONCOJ)

The LINGUIST List linguist at listserv.linguistlist.org
Tue Apr 3 21:13:29 UTC 2018


LINGUIST List: Vol-29-1462. Tue Apr 03 2018. ISSN: 1069 - 4875.

Subject: 29.1462, FYI: Oxford-NINJAL Corpus of Old Japanese (ONCOJ)

Moderators: linguist at linguistlist.org (Damir Cavar, Malgorzata E. Cavar)
Reviews: reviews at linguistlist.org (Helen Aristar-Dry, Robert Coté,
                                   Michael Czerniakowski)
Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           http://funddrive.linguistlist.org/donate/

Editor for this issue: Kenneth Steimel <ken at linguistlist.org>
================================================================


Date: Tue, 03 Apr 2018 17:13:21
From: Bjarke Frellesvig [bjarke.frellesvig at hertford.ox.ac.uk]
Subject: Oxford-NINJAL Corpus of Old Japanese (ONCOJ)

 
Dear colleagues

We are very pleased to announce the first public release of the Oxford-NINJAL
Corpus of Old Japanese (ONCOJ). We will be grateful if you would circulate and
share this information as appropriate.

The corpus is avallable through this website: http://oncoj.ninjal.ac.jp/

Old Japanese is the earliest attested stage of the Japanese language (mainly
the 8th century AD). The texts from the period are mainly poetry. The ONCOJ is
an ongoing, long-term collaborative research project between the Research
Centre for Japanese Language and Linguistics in the University of Oxford, and
the National Institute for Japanese Language and Linguistics, Tokyo.

The ONCOJ contains the texts in original script and in a phonemic
transcription. It is lemmatized and has annotation for mode of writing
(phonographic or logographic), morphology, constituency, and grammatical
function. This release presents the poetic texts from the period,
approximately 90,000 words of text. 

The corpus is searchable through a suite of online search facilities and both
the full data in the corpus and individual search results are downloadable for
offline use. The data is primarily presented in a Penn Historical style
bracketed tree format, but will also soon be available in a TEI convertible
xml format.

Bjarke Frellesvig (University of Oxford)
Stephen Wright Horn (NINJAL)
Toshinobu Ogiso (NINJAL)
 



Linguistic Field(s): Text/Corpus Linguistics

Subject Language(s): Japanese, Old (ojp)





 



------------------------------------------------------------------------------

*****************    LINGUIST List Support    *****************
Please support the LL editors and operation with a donation at:
            http://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-29-1462	
----------------------------------------------------------






More information about the LINGUIST mailing list