17.308, FYI: Thesaurus Occitan (THESOC) Online
LINGUIST List
linguist at LINGUISTLIST.ORG
Mon Jan 30 14:27:12 UTC 2006
LINGUIST List: Vol-17-308. Mon Jan 30 2006. ISSN: 1068 - 4875.
Subject: 17.308, FYI: Thesaurus Occitan (THESOC) Online
Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews (reviews at linguistlist.org)
Sheila Dooley, U of Arizona
Terry Langendoen, U of Arizona
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Svetlana Aksenova <svetlana at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
===========================Directory==============================
1)
Date: 29-Jan-2006
From: Michèle Oliviéri < olivieri at unice.fr >
Subject: Thesaurus Occitan (THESOC) Online
-------------------------Message 1 ----------------------------------
Date: Mon, 30 Jan 2006 09:24:41
From: Michèle Oliviéri < olivieri at unice.fr >
Subject: Thesaurus Occitan (THESOC) Online
The ''Thesaurus Occitan'' (THESOC) is a dialectological multimedia database
developed for several years at the CNRS research lab ''Bases, Corpus,
Langage'' (UMR 6039) at the University of Nice - Sophia Antipolis (France),
and supervised by Prof. J.Ph. DALBERA. Its aim is to collect all existing
dialectological material regarding the Occitan language (spoken in the
South of France), such as those published in the ''Atlas linguistiques de
la France par régions'', but also results of unpublished fieldwork. The
data available constitute a large corpus that contributes to the picture of
diatopic variation in Romance (concerning the lexicon, phonetics, phonology
and morphology). For every word, it offers several layers of searchable
items: IPA transcriptions, lemma, etymons, sounds and localisation on
geographical maps. It is continually increasing and contains at the moment:
- 8345 questions
- 819 localities
- 803874 records
- 29058 lemma
- 3754 etymons
- 988 sounds
- 500 pictures
Part of this corpus (about 400 000 items for the moment) is now available at:
http://thesaurus.unice.fr
Multiple ways of searching the database are already implemented in the
THESOC; all of them are not yet available on-line; they will be
progressively added.
Linguistic Field(s): Genetic Classification
Lexicography
Morphology
Phonetics
Phonology
Text/Corpus Linguistics
Subject Language(s): Auvergnat (auv)
-----------------------------------------------------------
LINGUIST List: Vol-17-308
More information about the LINGUIST
mailing list