[Lingtyp] FYI: New INEL corpora released

Sebastian Nordhoff sebastian.nordhoff at glottotopia.de
Thu Feb 13 09:37:52 UTC 2025


Dear Alexandre,
thank you very much for offering this resource. It is especially nice
that you offer bulk downloads. I have tried to find licensing
information on the site but failed. Could you let us know what the
framework for reuse is?
Best wishes
Sebastian

On 2/12/25 12:38, Alexandre Arkhipov via Lingtyp wrote:
> Dear colleagues,
>
> We are happy to announce three new corpora published within the project
> “Grammars, Corpora and Language Technology for Indigenous Northern
> Eurasian Languages” (INEL).
> Project resources portal: https://inel.corpora.uni-hamburg.de/portal/#en
> <https://inel.corpora.uni-hamburg.de/portal/#en>
> The entire INEL collection in the repository:https://
> hdl.handle.net/11022/0000-0007-F45A-1 <https://
> hdl.handle.net/11022/0000-0007-F45A-1>
>
> *INEL Enets Corpus 1.0
> *Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2024
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> enets/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/enets/#en>
> Online search: https://inel.corpora.uni-hamburg.de/EnetsCorpus/search
> <https://inel.corpora.uni-hamburg.de/EnetsCorpus/search>
> Repository: https://hdl.handle.net/11022/0000-0007-FE1D-C <https://
> hdl.handle.net/11022/0000-0007-FE1D-C>
> (Forest Enets & Tundra Enets < Samoyedic < Uralic; 218,710 tokens)
>
> *INEL Nenets corpus 1.0*
> Budzisch, Josefina; Wagner-Nagy, Beáta. 2024
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> nenets/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/nenets/#en>
> Online search: https://inel.corpora.uni-hamburg.de/NenetsCorpus/search
> <https://inel.corpora.uni-hamburg.de/NenetsCorpus/search>
> Repository: https://hdl.handle.net/11022/0000-0007-FE37-E <https://
> hdl.handle.net/11022/0000-0007-FE37-E>
> (Forest Nenets & Tundra Nenets < Samoyedic < Uralic; 61,278 tokens)
>
> *INEL Evenki Corpus 2.0*
> Däbritz, Chris Lasse; Gusev, Valentin; Stoynova, Natalia. 2024
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> evenki/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/#en>
> Online search: https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search
> <https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search>
> Repository: https://hdl.handle.net/11022/0000-0007-FE38-D <https://
> hdl.handle.net/11022/0000-0007-FE38-D>
> (Northern and Southern dialects of Evenki < Tungusic; 93,264 tokens)
>
> We also take this opportunity to provide links to our previously
> published corpora:
>
> *INEL Selkup Corpus 2.0*
> Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2021
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> selkup/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/selkup/#en>
> Online version: https://inel.corpora.uni-hamburg.de/SelkupCorpus/search
> <https://inel.corpora.uni-hamburg.de/SelkupCorpus/search>
> Repository: https://hdl.handle.net/11022/0000-0007-F4D9-1 <https://
> hdl.handle.net/11022/0000-0007-F4D9-1>
> (Northern, Central and Southern varieties of Selkup < Samoyedic <
> Uralic; 81,498 tokens)
>
> *INEL Kamas Corpus 2.0*
> Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta. 2023
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> kamas/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/kamas/#en>
> Online search: https://inel.corpora.uni-hamburg.de/KamasCorpus/search
> <https://inel.corpora.uni-hamburg.de/KamasCorpus/search>
> Repository: http://hdl.handle.net/11022/0000-0007-FC25-4 <http://
> hdl.handle.net/11022/0000-0007-FC25-4>
> (Kamas < Samoyedic < Uralic; ca. 49,000 tokens)
>
> *INEL Dolgan Corpus 2.0*
> Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2022
> Corpus home page: https://inel.corpora.uni-hamburg.de/portal/corpora/
> dolgan/#en <https://inel.corpora.uni-hamburg.de/portal/corpora/dolgan/#en>
> Online version: https://inel.corpora.uni-hamburg.de/DolganCorpus/search
> <https://inel.corpora.uni-hamburg.de/DolganCorpus/search>
> Repository: https://hdl.handle.net/11022/0000-0007-F9A7-4 <https://
> hdl.handle.net/11022/0000-0007-F9A7-4>
> (Dolgan < Turkic; 97,757tokens)
>
> On behalf of the INEL Project team,
> Alexandre Arkhipov
>
> * * *
> Apologies for cross-posting
> * * *
>
>
> _______________________________________________
> Lingtyp mailing list
> Lingtyp at listserv.linguistlist.org
> https://listserv.linguistlist.org/cgi-bin/mailman/listinfo/lingtyp



More information about the Lingtyp mailing list