[Lingtyp] FYI: New INEL corpora released

Alexandre Arkhipov sarkipo at yandex.ru
Wed Feb 12 11:38:24 UTC 2025


Dear colleagues,

We are happy to announce three new corpora published within the project 
“Grammars, Corpora and Language Technology for Indigenous Northern 
Eurasian Languages” (INEL).
Project resources portal: https://inel.corpora.uni-hamburg.de/portal/#en 
<https://inel.corpora.uni-hamburg.de/portal/#en>
The entire INEL collection in the 
repository:https://hdl.handle.net/11022/0000-0007-F45A-1 
<https://hdl.handle.net/11022/0000-0007-F45A-1>

*INEL Enets Corpus 1.0
*Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2024
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/enets/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/enets/#en>
Online search: https://inel.corpora.uni-hamburg.de/EnetsCorpus/search 
<https://inel.corpora.uni-hamburg.de/EnetsCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE1D-C 
<https://hdl.handle.net/11022/0000-0007-FE1D-C>
(Forest Enets & Tundra Enets < Samoyedic < Uralic; 218,710 tokens)

*INEL Nenets corpus 1.0*
Budzisch, Josefina; Wagner-Nagy, Beáta. 2024
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/nenets/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/nenets/#en>
Online search: https://inel.corpora.uni-hamburg.de/NenetsCorpus/search 
<https://inel.corpora.uni-hamburg.de/NenetsCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE37-E 
<https://hdl.handle.net/11022/0000-0007-FE37-E>
(Forest Nenets & Tundra Nenets < Samoyedic < Uralic; 61,278 tokens)

*INEL Evenki Corpus 2.0*
Däbritz, Chris Lasse; Gusev, Valentin; Stoynova, Natalia. 2024
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/#en>
Online search: https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search 
<https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE38-D 
<https://hdl.handle.net/11022/0000-0007-FE38-D>
(Northern and Southern dialects of Evenki < Tungusic; 93,264 tokens)

We also take this opportunity to provide links to our previously 
published corpora:

*INEL Selkup Corpus 2.0*
Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2021
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/selkup/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/selkup/#en>
Online version: https://inel.corpora.uni-hamburg.de/SelkupCorpus/search 
<https://inel.corpora.uni-hamburg.de/SelkupCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-F4D9-1 
<https://hdl.handle.net/11022/0000-0007-F4D9-1>
(Northern, Central and Southern varieties of Selkup < Samoyedic < 
Uralic; 81,498 tokens)

*INEL Kamas Corpus 2.0*
Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta. 2023
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/kamas/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/kamas/#en>
Online search: https://inel.corpora.uni-hamburg.de/KamasCorpus/search 
<https://inel.corpora.uni-hamburg.de/KamasCorpus/search>
Repository: http://hdl.handle.net/11022/0000-0007-FC25-4 
<http://hdl.handle.net/11022/0000-0007-FC25-4>
(Kamas < Samoyedic < Uralic; ca. 49,000 tokens)

*INEL Dolgan Corpus 2.0*
Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2022
Corpus home page: 
https://inel.corpora.uni-hamburg.de/portal/corpora/dolgan/#en 
<https://inel.corpora.uni-hamburg.de/portal/corpora/dolgan/#en>
Online version: https://inel.corpora.uni-hamburg.de/DolganCorpus/search 
<https://inel.corpora.uni-hamburg.de/DolganCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-F9A7-4 
<https://hdl.handle.net/11022/0000-0007-F9A7-4>
(Dolgan < Turkic; 97,757tokens)

On behalf of the INEL Project team,
Alexandre Arkhipov

* * *
Apologies for cross-posting
* * *
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20250212/d3ea3c5a/attachment.htm>


More information about the Lingtyp mailing list