<html>

  <head>


    <meta http-equiv="content-type" content="text/html; charset=UTF-8">

  </head>

  <body text="#000000" bgcolor="#FFFFFF">

    <p>We are glad to announce that the first version of the Dolgan

      corpus developed in the <a

        href="https://inel.corpora.uni-hamburg.de/">INEL project</a> (<a

        href="https://inel.corpora.uni-hamburg.de/">https://inel.corpora.uni-hamburg.de/</a>)

      is now published online.</p>

    <p><a href="http://hdl.handle.net/11022/0000-0007-CAE7-1">http://hdl.handle.net/11022/0000-0007-CAE7-1</a></p>

    <p>Dolgan is an endangered Turkic language of Northern Siberia. It

      is spoken by approximately 1,000 people on the Taymyr Peninsula

      and in adjacent areas. Dolgan is closely related to Yakut (Sakha)

      but nevertheless differs from it in many respects. It is in close

      contact with the neighboring languages Nganasan, Enets and Evenki

      as well as with Russian.</p>

    <p>The corpus contains folklore and narrative texts as well as

      spontaneous conversations. All material has interlinear glosses;

      part of it is additionally annotated for Semantic Roles, Syntactic

      Functions, Information Status and Structure as well as Borrowing

      and Code-Switching. Roughly half of the material is aligned with

      the corresponding sound files, amounting to ca. 10 hours

      of Dolgan speech in total.</p>

    <p>The INEL Dolgan corpus is composed of texts from different

      sources:<br>

      1. Published folklore texts from an edited volume ("Fol'klor

      Dolgan", P.E. Efremov 2000),<br>

      2. Transcripts of recordings provided by the Taymyr House of Folk

      Art (TDNT) in Dudinka (1970s-2000s),<br>

      3. Transcripts from the collection of Dr. Eugénie Stapert recorded

      on several fieldwork trips in 2007-2010,<br>

      4. Transcripts of recordings made on a fieldwork trip in 2017.</p>

    <p><strong>Accessing the corpus<br>

      </strong></p>

    <p>An online search interface, similar to those for the <a

        href="https://inel.corpora.uni-hamburg.de/SelkupCorpus/search">Selkup</a>

      and <a

        href="https://inel.corpora.uni-hamburg.de/KamasCorpus/search">Kamas</a>

      corpora, will be made available in the near future.</p>

    <p>The data in the corpus (annotated texts as well as the corresponding

      metadata) are stored in the XML formats of the freely distributed

      EXMARaLDA suite (<a href="http://exmaralda.org/en/">http://exmaralda.org/en/</a>).</p>

    <p>User documentation (in English) is available here: <a

href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/file:dolgan-1.0_INEL_Dolgan_Corpus_1.0_User_Documentation/datastream/PDF/INEL_Dolgan_Corpus.pdf">INEL_Dolgan_Corpus.pdf</a></p>

    <p>For browsing (and playback) of individual texts, use the «<a

href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/spoken-corpus:dolgan-1.0#corpus-content">Sessions</a>»

      tab on the main corpus page. Each text can be viewed in one of

      three online formats (e.g. Visualizations: <a

href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/transcript:dolgan-1.0_AnIM_2009_Argish_nar/datastream/SCORE/AnIM_2009_Argish_nar-score.html">Score</a>)

      and downloaded in EXB (an EXMARaLDA format). The sources of the

      texts, i.e. scanned pages (PDF) or sound files (WAV, MP3), can also

      be viewed or downloaded.</p>

    <p>For searching across the whole corpus, the complete archive of

      the <a

href="https://corpora.uni-hamburg.de/hzsk/de/islandora/object/spoken-corpus:dolgan-1.0#additional-files">corpus

        files</a> can be downloaded and searched with the EXAKT program

      of the EXMARaLDA suite.<br>

      Furthermore, in the next few weeks, an online search interface

      will be launched, based on the Tsakonian Corpus Platform (<a

        href="https://bitbucket.org/tsakorpus/">Tsakorpus</a>).</p>

    <p>Please send your comments and suggestions to: <a

        href="mailto:inel@uni-hamburg.de">inel@uni-hamburg.de</a>.</p>

    <p><br>

      Best regards,<br>

      Alexandre Arkhipov<br>

    </p>

  </body>

</html>