12.1369, Support: Rosetta Project 1,000 Lang Archive: Researcher

Fri May 18 19:20:50 UTC 2001

LINGUIST List:  Vol-12-1369. Fri May 18 2001. ISSN: 1068-4875.

Subject: 12.1369, Support: Rosetta Project 1,000 Lang Archive: Researcher

Moderators: Anthony Aristar, Wayne State U.<aristar at linguistlist.org>
            Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
            Andrew Carnie, U. of Arizona <carnie at linguistlist.org>

Reviews (reviews at linguistlist.org):
	Simin Karimi, U. of Arizona
	Terence Langendoen, U. of Arizona

Editors (linguist at linguistlist.org):
	Karen Milligan, WSU 		Naomi Ogasawara, EMU
	Lydia Grebenyova, EMU		Jody Huellmantel, WSU
	James Yuells, WSU		Michael Appleby, EMU
	Marie Klopfenstein, WSU		Ljuba Veselinova, Stockholm U.
	Heather Taylor-Loring, EMU	Dina Kapetangianni, EMU

Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
          Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>

Home Page:  http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Jody Huellmantel <jody at linguistlist.org>

=================================Directory=================================

1)
Date:  Thu, 17 May 2001 17:22:28 -0700
From:  "Jim Mason" <jimmason at longnow.org>
Subject:  Rosetta Project 1,000 Language Archive: Researcher/Intern, San Francisco, CA

-------------------------------- Message 1 -------------------------------

Date:  Thu, 17 May 2001 17:22:28 -0700
From:  "Jim Mason" <jimmason at longnow.org>
Subject:  Rosetta Project 1,000 Language Archive: Researcher/Intern, San Francisco, CA

Researcher/Intern opportunity:  The Rosetta Project 1,000 Language Archive

We are looking for linguistics students/professionals interested in
helping with archive research for The Rosetta Project 1,000 Language
Archive.  The Rosetta Project is an attempt to create a broad corpus
of language descriptions, vernacular texts, analytic materials and
audio files for 1,000+ languages in a publicly accessible, online
archive (http://www.RosettaProject.org).  Our goal is to create a
meaningful survey and near permanent archive of 1,000 languages as
well as a unique platform for contemporary comparative linguistic
research and education.

We are assembling a group of 5 researchers for the summer of 2001 to
help collect and assess a variety of materials to build the archive.
Most of this research will take place in the stacks at Stanford and
Berkeley as well as scanning and image processing in the offices of
the Long Now Foundation in San Francisco.  Payment will be on a
"pay-per-text" basis.  We pay $10 per text collected, which
should work out to a minimum of $15 an hour.  If you get efficient at
the process, you can make significantly more.

Most of the materials in the Rosetta archive are excerpts of already
published texts so the collection effort focuses on locating,
excepting and formatting published materials in various archives and
personal libraries.  We are excerpting and disseminating these
materials under Fair Use provisions where appropriate or with specific
permission when we are reproducing entire publications.

The texts we are collecting for each language are as follows:

- Genesis translations: We have collected Genesis Ch 1-3 translations
in 1,000 languages, most of which can be seen at
http://www.rosettaproject.org.  We invite more, but this component is
mostly completed.

- Glossed vernacular texts:  A cultural specific counterpoint to the Genesis
text with an interlinear morphemic analysis.  We will substitute other
vernacular texts if a glossed origin story is unavailable or culturally
inappropriate.

- Orthographies: The writing system(s) of the language with pronunciation
guide ideally in IPA.  Multiple or competing or historic orthographies are
especially encouraged.

- Swadesh word lists:  The Swadesh 100 word list.

- Inventories of phonemes.

- Morphology and Syntax:  Short sketches of 7 pages or under.  We do not
want full descriptive grammars.

- Audio files: Sample of spoken language with transcription and ideally a
translation.

- Detailed descriptions: Origin and current distribution of language, number
of speakers, family, typology, history, etc.  Descriptions that extend past
the current Ethnologue description for a language.

Though we are primarily looking for people to work in and around our office
in San Francisco, proposals to work in other archives in the US or around
the world will also be considered.   Off-site collection efforts will
likewise be paid via a "pay-per-text" basis and collections must focus on
materials needed for the Rosetta Archive.

For more information on the project, please see http://www.RosettaProject.org
and/or email jimmason at longnow.org

Thank you,

Jim Mason
Director, The Rosetta Project

---------------------------------------------------------------------------
LINGUIST List: Vol-12-1369