33.1010, FYI: March 2022 Newsletter - LDC

The LINGUIST List linguist at listserv.linguistlist.org
Thu Mar 17 06:09:41 UTC 2022


LINGUIST List: Vol-33-1010. Thu Mar 17 2022. ISSN: 1069 - 4875.

Subject: 33.1010, FYI: March 2022 Newsletter - LDC

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
      Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Thu, 17 Mar 2022 01:49:50
From: Membership Coordinator [ldc at ldc.upenn.edu]
Subject: March 2022 Newsletter - LDC

 
In this newsletter: 
LDC data and commercial technology development 

New Publications:
AttImam 
HAVIC MED Novel 1 Test – Videos, Metadata and Annotation
________________________________________
LDC data and commercial technology development
For-profit organizations are reminded that an LDC membership is a
pre-requisite for obtaining a commercial license to almost all LDC databases.
Non-member organizations, including non-member for-profit organizations,
cannot use LDC data to develop or test products for commercialization, nor can
they use LDC data in any commercial product or for any commercial purpose. LDC
data users should consult corpus-specific license agreements for limitations
on the use of certain corpora. Visit the Licensing page for further
information.
________________________________________

New publications:
(1)  AttImam was developed by Al-Imam Mohammad Ibn Saud Islamic University and
consists of approximately 2,000 attribution relations applied to Arabic
newswire text from Arabic Treebank: Part 1 v 4.1 (LDC2010T13). Attribution
refers to the process of reporting or assigning an utterance to the correct
speaker.  

The source Arabic newswire was collected by LDC from Agence France Presse
articles published in 2000. Files were annotated by native Arabic speakers and
contain the following elements:
- Cue: the lexical anchor that connects the source with the content.
- Source: the entity or the agent that owns the content.
- Content: the basic element expressing the claim or the reported news.
- General Features: these can include such features as attribution style
(direct or indirect), determinacy (factual or non-factual), and purpose (e.g.,
assertion, expression).

AttImam is distributed via web download.  

2022 Subscription Members will automatically receive copies of this corpus.
2022 Standard Members may request a copy as part of their 16 free membership
corpora. Non-members may license this data for a fee.
*
(2)  HAVIC MED Novel 1 Test – Videos, Metadata and Annotation is comprised of
3,800 hours of user-generated videos with annotation and metadata developed by
LDC for the 2015 NIST Multimedia Event Detection tasks. The data consists of
videos of various events (event videos) and videos completely unrelated to
events (background videos). Each event video was manually annotated with
judgments describing its event properties and other salient features.
Background videos were labeled with topic and genre categories.

HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation is distributed via
web download. 

2022 Subscription Members will automatically receive copies of this corpus.
2022 Standard Members may request a copy as part of their 16 free membership
corpora. This corpus is a members-only release and is not available for
non-member licensing. Contact ldc at ldc.upenn.edu for information about
membership.

Membership Coordinator
Linguistic Data Consortium
University of Pennsylvania
T: +1-215-573-1275
E: ldc at ldc.upenn.edu
M: 3600 Market St. Suite 810
Philadelphia, PA 19104

 



Linguistic Field(s): Computational Linguistics





 



------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-33-1010	
----------------------------------------------------------






More information about the LINGUIST mailing list