35.1781, FYI: June 2024 Newsletter - LDC

The LINGUIST List linguist at listserv.linguistlist.org
Mon Jun 17 21:05:02 UTC 2024


LINGUIST List: Vol-35-1781. Mon Jun 17 2024. ISSN: 1069 - 4875.

Subject: 35.1781, FYI: June 2024 Newsletter - LDC

Moderator: Francis Tyers (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Daniel Swanson, Erin Steitz
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Justin Fuller <justin at linguistlist.org>

LINGUIST List is hosted by Indiana University College of Arts and Sciences.
================================================================


Date: 17-Jun-2024
From: Membership Coordinator [ldc at ldc.upenn.edu]
Subject: June 2024 Newsletter - LDC


In this newsletter:
LDC data and commercial technology development

New publications:
Diaspora Tibetan Speech
AIDA Scenario 2 Practice Topic Annotation

________________________________________
LDC data and commercial technology development
For-profit organizations are reminded that an LDC membership is a
prerequisite for obtaining a commercial license to almost all LDC
databases. Non-member organizations, including non-member for-profit
organizations, cannot use LDC data to develop or test products for
commercialization, nor can they use LDC data in any commercial product
or for any commercial purpose. LDC data users should consult
corpus-specific license agreements for limitations on the use of
certain corpora. Visit the Licensing page for further information.
________________________________________

New publications:
Diaspora Tibetan Speech was developed at Yale University. It contains
28 hours of Tibetan elicited speech by 73 speakers from the diaspora
Tibetan community in Kathmandu, Nepal, along with transcripts,
elicitation materials, and speaker metadata.

Recordings were collected in 2016. All speakers were adults and varied
in age as well as age of diaspora. A substantial number of speakers
were born in Nepal. Each speaker contributed one recording comprising
a series of elicitation tasks: some demographic information; a word
list and numbers; some sentences in isolation; a scripted story; and
free speech based on "frog story" type illustrations. Annotation and
metadata formats include PDF and Word (some transcripts), Excel (some
transcripts, speaker metadata) and Praat TextGrids (word and number
lists).

2024 members can access this corpus through their LDC accounts.
Non-members may license this data for a fee.

*

AIDA Scenario 2 Practice Topic Annotation was developed by LDC and
consists of annotations for 29 English, Russian, and Spanish
documents (text, image, and video) from AIDA Scenario 2 Practice
Topic Source Data (LDC2024T04), specifically, the set of practice
documents designated for annotation in Phase 2.

Annotations are presented as tab-separated files in the following
categories for each topic:
•       Mentions: single references in source data to a real-world
entity or filler, event, or relation.
•       Slots: pre-defined roles in an event or relation filled by an
argument (entity mention).
•       Linking: entity mentions linked to entries in the knowledge
base as a method of indicating the real-world entity to which an
entity mention refers.
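
For readers who want to inspect tab-separated annotation files of this
kind, the following is a minimal sketch in Python using only the
standard library. The column names (mention_id, mention_type, text) and
the sample rows are hypothetical placeholders invented for illustration;
they are not taken from the corpus documentation, which should be
consulted for the actual file layout.

```python
import csv
import io

# Hypothetical sample: the column names and rows below are illustrative
# only and are NOT taken from the corpus documentation.
SAMPLE_TSV = (
    "mention_id\tmention_type\ttext\n"
    "M001\tentity\texample entity\n"
    "M002\tevent\texample event\n"
)


def read_tab_separated(handle):
    """Return each row of a tab-separated file as a dict keyed by header."""
    # QUOTE_NONE treats quote characters as ordinary text, which is a
    # common (but here assumed) convention for annotation tables.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


def count_by_column(rows, column):
    """Count how many rows carry each distinct value in the given column."""
    counts = {}
    for row in rows:
        counts[row[column]] = counts.get(row[column], 0) + 1
    return counts


rows = read_tab_separated(io.StringIO(SAMPLE_TSV))
print(len(rows))
print(count_by_column(rows, "mention_type"))
```

To read a file from disk instead of the in-memory sample, pass a handle
opened with open(path, newline="", encoding="utf-8") to
read_tab_separated; the UTF-8 encoding is likewise an assumption.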

2024 members can access this corpus through their LDC accounts.
Non-members may license this data for a fee.

To unsubscribe from this newsletter, log in to your LDC account and
uncheck the box next to “Receive Newsletter” under Account Options or
contact LDC for assistance.

Membership Coordinator
Linguistic Data Consortium
University of Pennsylvania
T: +1-215-573-1275
E: ldc at ldc.upenn.edu
M: 3600 Market St. Suite 810
      Philadelphia, PA 19104

Linguistic Field(s): Computational Linguistics




------------------------------------------------------------------------------

Please consider donating to the Linguist List https://give.myiu.org/iu-bloomington/I320011968.html


LINGUIST List is supported by the following publishers:

Cambridge University Press http://www.cambridge.org/linguistics

De Gruyter Mouton https://cloud.newsletter.degruyter.com/mouton

Equinox Publishing Ltd http://www.equinoxpub.com/

John Benjamins http://www.benjamins.com/

Lincom GmbH https://lincom-shop.eu/

Multilingual Matters http://www.multilingual-matters.com/

Narr Francke Attempto Verlag GmbH + Co. KG http://www.narr.de/

Wiley http://www.wiley.com




