[Lingtyp] AUTOTYP database v1.0.0 now public

Johanna Nichols johanna at berkeley.edu
Thu Feb 10 23:46:13 UTC 2022


We are happy to announce a new release of the AUTOTYP database v1.0.0,
available here:

    Zenodo: https://doi.org/10.5281/zenodo.5931509
    GitHub: https://github.com/autotyp/autotyp-data/tree/v1.0.0

This is a completely new release, radically overhauled from the
earlier 0.1.x version, and focuses on usability, documentation, and
completeness.  New features include:

• Over 260 typological variables that describe 1319 languages across
approximately 260,000 datapoints or, together with the derived
(aggregated) data, over 1,700,000 datapoints.

• New naming conventions for datasets and variables, focusing on
usability and clarity.

• Language name and Glottolog code now accompany every dataset, so
each dataset is a self-standing table of a typological variable (but
can also be linked to any and all of the others via the internal
language ID).

• Published data now includes the raw exported database data as well
as derived aggregated tables.  All aggregation scripts used to compute
derived data are published as well.

• New R and JSON exports for users who prefer those environments.

For a complete list of major new features see:
   https://github.com/autotyp/autotyp-data/blob/v1.0.0/CHANGES-1.0.0.md

For general information about the database:
   https://github.com/autotyp/autotyp-data/blob/v1.0.0/readme.md


Balthasar Bickel, Johanna Nichols, Taras Zakharko, Alena Witzlack-Makarevich



More information about the Lingtyp mailing list