35.543, FYI: February 2024 Newsletter - LDC

The LINGUIST List linguist at listserv.linguistlist.org
Fri Feb 16 00:05:04 UTC 2024


LINGUIST List: Vol-35-543. Fri Feb 16 2024. ISSN: 1069 - 4875.

Subject: 35.543, FYI: February 2024 Newsletter - LDC

Moderators: Malgorzata E. Cavar, Francis Tyers (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Everett Green, Daniel Swanson, Maria Lucero Guillen Puon, Zackary Leech, Lynzie Coburn, Natasha Singh, Erin Steitz
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Justin Fuller <justin at linguistlist.org>
================================================================


Date: 16-Feb-2024
From: Membership Coordinator [ldc at ldc.upenn.edu]
Subject: February 2024 Newsletter - LDC


In this newsletter:
LDC membership discounts expire March 1
Spring 2024 data scholarship recipients
Four corpora withdrawn from the LDC Catalog

New publications:
Second Language University Speech Intelligibility Corpus
AIDA Scenario 1 Practice Topic Annotation
________________________________________
LDC membership discounts expire March 1
Time is running out to save on 2024 membership fees. Renew your LDC
membership, rejoin the Consortium, or become a new member by March 1
to receive a discount of up to 10%. For more information on membership
benefits and options, visit Join LDC.

Spring 2024 data scholarship recipients
Congratulations to the recipients of LDC’s Spring 2024 data
scholarships:

Jordan Chandler: Université Rennes 2 (France): Master’s student,
English Studies. Jordan is awarded a copy of Penn Parsed Corpora of
Historical English LDC2020T16 to continue his research on the
historical development of adjective, quantifier, and article
indefiniteness in the English language.

Nikhil Raghav: TCG Crest (India): PhD candidate, Institute for
Advancing Intelligence. Nikhil is awarded copies of Third DIHARD
Challenge Development LDC2022S12 and Third DIHARD Challenge Evaluation
LDC2022S14 for his work in speaker diarization.

Abraham Sanders: Rensselaer Polytechnical Institute (USA): PhD
candidate, Cognitive Science. Abraham is awarded copies of Fisher
English Training Speech Part 1 Speech LDC2004S13, Fisher English
Training Speech Part 1 Transcripts LDC2004T19, Fisher English Training
Part 2 Speech LDC2005S13 and Fisher English Training Part 2
Transcripts LDC2005T19, for his work in spoken dialogue systems.

The next round of applications will be accepted in September 2024. For
information about the program, visit the Data Scholarships page.

Four corpora withdrawn from the LDC Catalog
We regret to announce that The New York Times Annotated Corpus,
LDC2008T19, has been withdrawn from the LDC Catalog by the data
provider. Because they contain data from LDC2008T19, the following
three corpora are also withdrawn from the Catalog: Benchmarks for Open
Relation Extraction LDC2014T27, Concretely Annotated New York Times
LDC2018T12, and News Sub-domain Named Entity Recognition LDC2023T12.
Organizations and individuals who have previously licensed any of
these data sets can continue to use them under the terms of their
respective special license agreements.
________________________________________
New publications:
Second Language University Speech Intelligibility Corpus was developed
by Northern Arizona University, The Pennsylvania State University, and
The University of Texas at Dallas. It contains 10.5 hours of English
speech collected from 66 international faculty and university students
representing 15 language backgrounds at 10 North American
universities. This release also includes orthographic transcriptions
for all recordings, intelligibility scores for 73% of the files,
speaker metadata, and aligned Praat textgrids.

2024 members can access this corpus through their LDC accounts
provided they have submitted a completed copy of the special license
agreement. Non-members may license this data for a fee.

*

AIDA Scenario 1 Practice Topic Annotation was developed by LDC and is
comprised of annotations for 212 English, Russian, and Ukrainian web
documents (text, image, and video) from AIDA Scenario 1 Practice Topic
Source Data (LDC2023T11), specifically, the set of practice documents
designated for annotation in Phase 1.

2024 members can access this corpus through their LDC accounts.
Non-members may license this data for a fee.

To unsubscribe from this newsletter, log in to your LDC account and
uncheck the box next to “Receive Newsletter” under Account Options or
contact LDC for assistance.

Membership Coordinator
Linguistic Data Consortium
University of Pennsylvania
T: +1-215-573-1275
E: ldc at ldc.upenn.edu

Linguistic Field(s): Computational Linguistics




------------------------------------------------------------------------------

Please consider donating to the Linguist List https://give.myiu.org/iu-bloomington/I320011968.html


LINGUIST List is supported by the following publishers:

Cambridge University Press http://www.cambridge.org/linguistics

John Benjamins http://www.benjamins.com/

Lincom GmbH https://lincom-shop.eu/

Linguistic Association of Finland http://www.ling.helsinki.fi/sky/

Multilingual Matters http://www.multilingual-matters.com/

Netherlands Graduate School of Linguistics / Landelijke (LOT) http://www.lotpublications.nl/

Wiley http://www.wiley.com


----------------------------------------------------------
LINGUIST List: Vol-35-543
----------------------------------------------------------



More information about the LINGUIST mailing list