13.1207, Books: Corpus Ling: Frequency list for Russian
LINGUIST List
linguist at linguistlist.org
Wed May 1 18:02:28 UTC 2002
LINGUIST List: Vol-13-1207. Wed May 1 2002. ISSN: 1068-4875.
Subject: 13.1207, Books: Corpus Ling: Frequency list for Russian
Moderators: Anthony Aristar, Wayne State U. <aristar at linguistlist.org>
Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
Reviews (reviews at linguistlist.org):
Simin Karimi, U. of Arizona
Terence Langendoen, U. of Arizona
Consulting Editor:
Andrew Carnie, U. of Arizona <carnie at linguistlist.org>
Editors (linguist at linguistlist.org):
Karen Milligan, WSU             Naomi Ogasawara, EMU
James Yuells, EMU               Marie Klopfenstein, WSU
Michael Appleby, EMU            Heather Taylor-Loring, EMU
Ljuba Veselinova, Stockholm U.  Richard John Harvey, EMU
Dina Kapetangianni, EMU         Renee Galvis, WSU
Karolina Owczarzak, EMU
Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>
Home Page: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Dina Kapetangianni <dina at linguistlist.org>
==========================================================================
Links to the websites of all LINGUIST's supporting publishers are
available at the end of this issue.
-------------------------------- Message 1 -------------------------------
Date: Thu, 18 Apr 2002 10:13:46 +0200
From: Serge Sharoff <serge.sharoff at uni-bielefeld.de>
Subject: Frequency list for Russian
A list of the most frequent Russian words is available at:
http://www.artint.ru/projects/frqlist/frqlist-en.asp
Currently, Chastotnyj slovarj russkogo jazyka (Zasorina, 1977) provides
the most widely used frequency list for Russian. However, the corpus
used by Zasorina is small by modern standards (about 1 million words).
It is also outdated: it mostly covers usage from the 1920s to the
1960s and includes a high proportion of ideological sources, such as
texts by Lenin and Khrushchev and Soviet newspapers, so its word
frequencies are severely biased. Finally, Zasorina's list is not
available electronically.
The announced list was compiled from a corpus of modern Russian
fiction and political texts (more than 35 million words). The list
includes about 33000 words whose frequency is greater than 1 ipm
(instances per million words). A shorter selection of the 5000 most
frequent words is also available.
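As an illustration of the ipm measure, the sketch below normalises a
raw count to instances per million words. The function names and the
example figures are hypothetical and are not taken from the announced
list; the corpus size is rounded to 35 million words for simplicity.

```python
def instances_per_million(count: int, corpus_size: int) -> float:
    """Normalise a raw occurrence count to instances per million words."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return count * 1_000_000 / corpus_size


def above_threshold(count: int, corpus_size: int, min_ipm: float = 1.0) -> bool:
    """True if the word's frequency is strictly greater than min_ipm."""
    return instances_per_million(count, corpus_size) > min_ipm


# Hypothetical example: a lemma seen 70 times in a 35-million-word corpus.
print(instances_per_million(70, 35_000_000))  # 2.0
print(above_threshold(35, 35_000_000))        # False: exactly 1 ipm
```

Under these assumptions, a word needs more than 35 occurrences in a
35-million-word corpus to pass the 1 ipm cut-off.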
The structure of the lists follows the template of the lemmatised BNC
lists produced by Adam Kilgarriff
(http://www.itri.bton.ac.uk/~Adam.Kilgarriff/bnc-readme.html), namely:
word rank, frequency (in ipm), word, part of speech.
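A minimal sketch of reading entries in this four-field layout follows.
It assumes the fields are separated by whitespace, which the
announcement does not state; the Entry class, the function names and
the sample rows are placeholders invented for illustration, not data
from the actual list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    rank: int    # position in the frequency ordering
    ipm: float   # frequency in instances per million words
    word: str    # lemma
    pos: str     # part-of-speech label


def parse_line(line: str) -> Entry:
    """Parse one 'rank ipm word pos' line (whitespace-separated, assumed)."""
    rank, ipm, word, pos = line.split()
    return Entry(int(rank), float(ipm), word, pos)


def parse_list(text: str) -> list[Entry]:
    """Parse every non-blank line of a frequency list."""
    return [parse_line(line) for line in text.splitlines() if line.strip()]


# Placeholder rows, not real entries from the list.
SAMPLE = """\
1 100.5 alpha noun
2 50.25 beta verb
"""

for entry in parse_list(SAMPLE):
    print(entry.rank, entry.word, entry.ipm, entry.pos)
```

If the real files use a different separator or field order, only
parse_line would need to change.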
In addition, some analytical information about the lexical stock is
provided, such as the coverage of total language use by word bands;
for example, the first 3000 lemmas cover 76.6824% of the total number
of word forms.
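Coverage figures of this kind can be derived from the ipm column
alone: since ipm is already normalised per million word forms, the
coverage of the top N lemmas is the sum of their ipm values divided by
one million. The sketch below shows one way to compute it; the
function name and the toy numbers are hypothetical and do not
reproduce the published figures.

```python
from itertools import accumulate


def coverage_by_band(ipm_values, bands):
    """Return {N: percent of all word forms covered by the top N lemmas}.

    ipm_values: frequencies in instances per million words, any order.
    bands: band sizes N (positive integers); N larger than the list
    is clamped to the whole list.
    """
    ordered = sorted(ipm_values, reverse=True)
    cumulative = list(accumulate(ordered))
    result = {}
    for n in bands:
        if n <= 0:
            raise ValueError("band sizes must be positive")
        top = cumulative[min(n, len(cumulative)) - 1] if cumulative else 0
        result[n] = 100 * top / 1_000_000
    return result


# Toy frequencies chosen for easy arithmetic, not real data.
toy = [400_000, 200_000, 100_000, 50_000]
print(coverage_by_band(toy, [1, 2, 4]))  # {1: 40.0, 2: 60.0, 4: 75.0}
```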
The corpus, the tools for working with it, and an aligned parallel
English-Russian corpus are discussed in the forthcoming publication:
Sharoff, Serge (2002). Meaning as use: exploitation of aligned
corpora for the contrastive study of lexical semantics. Proc. of the
Language Resources and Evaluation Conference (LREC02). May 2002, Las
Palmas, Spain.
http://www.artint.ru/projects/frqlist/lrec-02.pdf
---------------------------------------------------------------------------
If you buy one of these books, please tell the publisher or author
that you saw it on LINGUIST.
The following publishers contribute to the support of The LINGUIST List:
MAJOR SUPPORTERS
Academic Press
http://www.academicpress.com
Arnold Publishers
http://www.arnoldpublishers.com
Athelstan Publications
http://www.athel.com
Blackwell Publishers
http://www.blackwellpublishers.co.uk/
Cambridge University Press
http://www.cup.org
Cascadilla Press
http://www.cascadilla.com/
Continuum International Publishing Group Ltd
http://www.continuumbooks.com
CSLI Publications
http://csli-www.stanford.edu/publications/
Distribution Fides
Elsevier Science Ltd.
http://www.elsevier.nl/locate/linguistics
John Benjamins
http://www.benjamins.com/
http://www.benjamins.nl/
Kluwer Academic Publishers
http://www.wkap.nl/
Lernout & Hauspie
http://www.lhsl.com
Lincom Europa
http://www.lincom-europa.com
MIT Press
http://mitpress.mit.edu/books-legacy.tcl
Mouton de Gruyter
http://www.deGruyter.de/hling.html
Multilingual Matters
http://www.multilingual-matters.com/
Oxford UP
http://www.oup-usa.org/
Pearson Education
http://www.pearsoneduc.com/catalog.html
Rodopi
http://www.rodopi.nl/
Routledge
http://www.routledge.com/
Springer-Verlag
http://www.springer.de
Summer Institute of Linguistics
http://www.sil.org/
OTHER SUPPORTING PUBLISHERS
Anthropological Linguistics
http://www.indiana.edu/~anthling/
Bedford/St. Martin's
http://www.bedfordstmartins.com/
Finno-Ugrian Society
http://www.helsinki.fi/jarj/sus/
Graduate Linguistic Students' Assoc., Umass
http://www.umass.edu/linguist/GLSA/
International Pragmatics Assoc.
http://ipra-www.uia.ac.be/ipra/
Kingston Press Ltd.
http://www.kingstonpress.com
Linguistic Assoc. of Finland
http://www.ling.helsinki.fi/sky/
Linguistic Society of Southern Africa (LSSA)
http://www.safest.org.za/bsp
MIT Working Papers in Linguistics
http://web.mit.edu/mitwpl/
Pacific Linguistics
http://pacling.anu.edu.au
Pacini Editore Spa
http://www.pacinieditore.it/
Utrecht Institute of Linguistics
http://www-uilots.let.uu.nl/
Virittaja Aikakauslehti
http://www.helsinki.fi/jarj/kks/virittaja.html
---------------------------------------------------------------------------