[Lingtyp] FYI: New INEL corpora released
Alexandre Arkhipov
sarkipo at yandex.ru
Wed Feb 12 11:38:24 UTC 2025
Dear colleagues,
We are happy to announce three new corpora published within the project
“Grammars, Corpora and Language Technology for Indigenous Northern
Eurasian Languages” (INEL).
Project resources portal: https://inel.corpora.uni-hamburg.de/portal/#en
<https://inel.corpora.uni-hamburg.de/portal/#en>
The entire INEL collection in the
repository:https://hdl.handle.net/11022/0000-0007-F45A-1
<https://hdl.handle.net/11022/0000-0007-F45A-1>
*INEL Enets Corpus 1.0
*Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2024
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/enets/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/enets/#en>
Online search: https://inel.corpora.uni-hamburg.de/EnetsCorpus/search
<https://inel.corpora.uni-hamburg.de/EnetsCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE1D-C
<https://hdl.handle.net/11022/0000-0007-FE1D-C>
(Forest Enets & Tundra Enets < Samoyedic < Uralic; 218,710 tokens)
*INEL Nenets corpus 1.0*
Budzisch, Josefina; Wagner-Nagy, Beáta. 2024
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/nenets/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/nenets/#en>
Online search: https://inel.corpora.uni-hamburg.de/NenetsCorpus/search
<https://inel.corpora.uni-hamburg.de/NenetsCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE37-E
<https://hdl.handle.net/11022/0000-0007-FE37-E>
(Forest Nenets & Tundra Nenets < Samoyedic < Uralic; 61,278 tokens)
*INEL Evenki Corpus 2.0*
Däbritz, Chris Lasse; Gusev, Valentin; Stoynova, Natalia. 2024
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/#en>
Online search: https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search
<https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-FE38-D
<https://hdl.handle.net/11022/0000-0007-FE38-D>
(Northern and Southern dialects of Evenki < Tungusic; 93,264 tokens)
We also take this opportunity to provide links to our previously
published corpora:
*INEL Selkup Corpus 2.0*
Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2021
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/selkup/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/selkup/#en>
Online version: https://inel.corpora.uni-hamburg.de/SelkupCorpus/search
<https://inel.corpora.uni-hamburg.de/SelkupCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-F4D9-1
<https://hdl.handle.net/11022/0000-0007-F4D9-1>
(Northern, Central and Southern varieties of Selkup < Samoyedic <
Uralic; 81,498 tokens)
*INEL Kamas Corpus 2.0*
Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta. 2023
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/kamas/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/kamas/#en>
Online search: https://inel.corpora.uni-hamburg.de/KamasCorpus/search
<https://inel.corpora.uni-hamburg.de/KamasCorpus/search>
Repository: http://hdl.handle.net/11022/0000-0007-FC25-4
<http://hdl.handle.net/11022/0000-0007-FC25-4>
(Kamas < Samoyedic < Uralic; ca. 49,000 tokens)
*INEL Dolgan Corpus 2.0*
Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2022
Corpus home page:
https://inel.corpora.uni-hamburg.de/portal/corpora/dolgan/#en
<https://inel.corpora.uni-hamburg.de/portal/corpora/dolgan/#en>
Online version: https://inel.corpora.uni-hamburg.de/DolganCorpus/search
<https://inel.corpora.uni-hamburg.de/DolganCorpus/search>
Repository: https://hdl.handle.net/11022/0000-0007-F9A7-4
<https://hdl.handle.net/11022/0000-0007-F9A7-4>
(Dolgan < Turkic; 97,757tokens)
On behalf of the INEL Project team,
Alexandre Arkhipov
* * *
Apologies for cross-posting
* * *
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lingtyp/attachments/20250212/d3ea3c5a/attachment.htm>
More information about the Lingtyp
mailing list