[Corpora-List] Italian media corpora

Stefania Spina stefania.spina at gmail.com
Sat Nov 23 13:57:28 UTC 2013


Hi Stefan,
you can take a look at The Perugia Corpus (
http://perugiacorpus.unistrapg.it/), a 26 million words reference corpus of
Italian. There are three sections that may be of interest for media
language: a tv section, a film section and a web section (with texts taken
from chat and social networks interactions and blog posts).
Best regards,

Stefania Spina





> ---------- Messaggio inoltrato ----------
> From: Stefan Schneider <stefan.schneider at uni-graz.at>
> To: "CORPORA at UIB.NO" <CORPORA at UIB.NO>
> Cc:
> Date: Fri, 22 Nov 2013 15:35:28 +0100
> Subject: [Corpora-List] Italian media corpora
> Dear colleagues,
> I am preparing a little survey of Italian media corpora. I am already
> aware of the following:
> - Corpora e lessici dell'italiano parlato e scritto (CLIPS)
> - Corpora LABLITA (Corpus di italiano parlato, corpus Stammerjohann, etc.)
> - Corpus di parlato cinematografico
> - Corpus di parlato telegiornalistico. Anni Sessanta vs. 2005 (CPT)
> - Integrated reference corpora for spoken romance languages (C-Oral-Rom)
> - Corpus del Lessico di frequenza dell'italiano parlato (LIP corpus)
> - Lessico italiano radiofonico (LIR corpus)
> - Lessico italiano televisivo (LIT corpus or LIT 2006 corpus)
> - Newsgroup UseNet Corpora (NUNC)
> - Corpus della Piattaforma per l’apprendimento dell’italiano su corpora
> annotati (PAISÀ corpus)
> - SMS Monitor Studies
> - Corpus Stammerjohann
> - Corpus TWITA
> - Web as corpus kool ynitiative (corpus itWaC)
> All these corpora contain smaller or larger portions of media language
> (radio, television, telephone, etc.). I'd like to know whether there are
> other corpora documenting Italian media language, especially SMS, tweets
> and E-mails.
> Thank you
> Stefan Schneider (University of Graz)
>
>
>
>
>
>
>


-- 
Stefania Spina
Università per Stranieri di Perugia
Dipartimento di Scienze Umane e Sociali
stefania.spina at unistrapg.it
http://webclass.unistrapg.it/webclass/mod/data/view.php?d=1&rid=6
Twitter: @sspina
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20131123/9de1d4d8/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list