<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p>We are glad to announce that the first version of the Dolgan
corpus developed in the <a
href="https://inel.corpora.uni-hamburg.de/">INEL project</a> (<a
href="https://inel.corpora.uni-hamburg.de/">https://inel.corpora.uni-hamburg.de/</a>)
is now published online.</p>
<p><a href="http://hdl.handle.net/11022/0000-0007-CAE7-1">http://hdl.handle.net/11022/0000-0007-CAE7-1</a></p>
<p>Dolgan is an endangered Turkic language of Northern Siberia. It
is spoken by approximately 1,000 people on the Taymyr Peninsula
and in adjacent areas. Dolgan is closely related to Yakut (Sakha)
but nevertheless differs from it in many respects. Dolgan is in
close contact with the neighboring languages Nganasan, Enets and
Evenki, as well as with Russian.</p>
<p>The corpus contains folklore and narrative texts as well as
spontaneous conversations. All material is interlinearly glossed;
part of it is additionally annotated for Semantic Roles, Syntactic
Functions, Information Status and Structure, and Borrowing and
Code-Switching. Roughly half of the material is aligned with the
respective sound files, amounting to ca. 10 hours of Dolgan speech
in total.</p>
<p>The INEL Dolgan corpus is composed of texts from different
sources:<br>
1. Published folklore texts from an edited volume ("Fol'klor
Dolgan", P.E. Efremov 2000),<br>
2. Transcripts of recordings provided by the Taymyr House of Folk
Art (TDNT) in Dudinka (1970s-2000s),<br>
3. Transcripts from the collection of Dr. Eugénie Stapert recorded
on several fieldwork trips in 2007-2010,<br>
4. Transcripts of recordings made on a fieldwork trip in 2017.</p>
<p><strong>Accessing the corpus<br>
</strong></p>
<p>An online search interface, similar to those for the <a
href="https://inel.corpora.uni-hamburg.de/SelkupCorpus/search">Selkup</a>
and <a
href="https://inel.corpora.uni-hamburg.de/KamasCorpus/search">Kamas</a>
corpora, will be made available in the near future.</p>
<p>The corpus data (annotated texts and the corresponding metadata)
are stored in the XML formats of the freely available
EXMARaLDA suite (<a href="http://exmaralda.org/en/">http://exmaralda.org/en/</a>).</p>
<p>User documentation (in English) is available here: <a
href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/file:dolgan-1.0_INEL_Dolgan_Corpus_1.0_User_Documentation/datastream/PDF/INEL_Dolgan_Corpus.pdf">INEL_Dolgan_Corpus.pdf</a></p>
<p>For browsing (and playback) of individual texts, use the «<a
href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/spoken-corpus:dolgan-1.0#corpus-content">Sessions</a>»
tab on the main corpus page. Each text can be viewed in one of
three online formats (e.g. Visualizations: <a
href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/transcript:dolgan-1.0_AnIM_2009_Argish_nar/datastream/SCORE/AnIM_2009_Argish_nar-score.html">Score</a>)
and downloaded in EXB (an EXMARaLDA format). The sources of the
texts, i.e. scanned pages (PDF) or sound files (WAV, MP3), can also
be viewed or downloaded.</p>
<p>For searching across the whole corpus, the complete archive of
the <a
href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/spoken-corpus:dolgan-1.0#additional-files">corpus
files</a> can be downloaded and searched with the EXAKT program
of the EXMARaLDA suite.<br>
Furthermore, in the next few weeks, an online search interface
will be launched, based on the Tsakonian Corpus Platform (<a
href="https://bitbucket.org/tsakorpus/">Tsakorpus</a>).</p>
<p>Please send your comments and suggestions to: <a
href="mailto:inel@uni-hamburg.de">inel@uni-hamburg.de</a>.</p>
<p><br>
Best regards,<br>
Alexandre Arkhipov<br>
</p>
</body>
</html>