11.2604, Sum: JPNS Frequency

The LINGUIST Network linguist at linguistlist.org
Sat Dec 2 04:23:00 UTC 2000


LINGUIST List:  Vol-11-2604. Fri Dec 1 2000. ISSN: 1068-4875.

Subject: 11.2604, Sum: JPNS Frequency

Moderators: Anthony Aristar, Wayne State U. <aristar at linguistlist.org>
            Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
            Andrew Carnie, U. of Arizona <carnie at linguistlist.org>

Reviews: Andrew Carnie: U. of Arizona <carnie at linguistlist.org>

Editors: Karen Milligan, Wayne State U. <karen at linguistlist.org>
         Michael Appleby, E. Michigan U. <michael at linguistlist.org>
         Rob Beltz, E. Michigan U. <rob at linguistlist.org>
         Lydia Grebenyova, E. Michigan U. <lydia at linguistlist.org>
         Jody Huellmantel, Wayne State U. <jody at linguistlist.org>
         Marie Klopfenstein, Wayne State U. <marie at linguistlist.org>
         Naomi Ogasawara, E. Michigan U. <naomi at linguistlist.org>
         James Yuells, Wayne State U. <james at linguistlist.org>
         Ljuba Veselinova, Stockholm U. <ljuba at linguistlist.org>

Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
          Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>

Home Page:  http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.


Editor for this issue: Marie Klopfenstein <marie at linguistlist.org>

=================================Directory=================================

1)
Date:  Fri, 1 Dec 2000 10:30:04 -0700
From:  Tim Mills <tmills at zicorp.com>
Subject:  JPNS frequency

-------------------------------- Message 1 -------------------------------

Date:  Fri, 1 Dec 2000 10:30:04 -0700
From:  Tim Mills <tmills at zicorp.com>
Subject:  JPNS frequency

Hello, fellow linguists!

As promised, here is a summary of the responses I received to my query for
information on Japanese kanji, kana, and word frequency.

- --
My original query:

I seek the following data.  All frequencies are, ideally, counts from a
corpus of informal written correspondence.

1.	Frequency of individual kanji characters.
2.	Frequency of individual kana characters (hiragana & katakana).
3.	Frequency of words.  (Preferably roots, with cross-referenced affix
frequencies.)

If anyone has or knows of research involving any of the above data, please
contact me off the list at "tmills at zicorp.com".
- --

The responses:

- --
From Edson Miyamoto [etm at is.s.u-tokyo.ac.jp]:

There's a database that has come out recently.  Take a look at:

http://www.sanseido-publ.co.jp/publ/NTT_english.html

- --
From Heidi Frank [h-frank2 at nwu.edu]:

I recently completed my master's thesis on character counts in Japanese
lesbian and Japanese housewife letters to and from the editor of their
respective periodicals.  I counted a total of 8,400 characters from each
group.  Is this the kind of data that you are looking for?  Is it
informal enough?  I counted kanji, hiragana, katakana, romaji, and
various symbols.  Let me know if this would help you out.

- --
From Atsuko Hayashi [hayashi at OREGON.UOREGON.EDU],
	through Scott McGinnis [smcginnis at nflc.org]

Hayashi-san sent a file with kanji frequency counts.  Unfortunately, I was
unable to open the file and so cannot comment on the contents.  But thank you
for the effort, and thank you, Mr. McGinnis, for forwarding the information.

- --
From Mike Roberts [robertsm at waikato.ac.nz],
	also through Scott McGinnis [smcginnis at nflc.org]

This study is quite old now and I understand that the book is out of
print; but you may be able to access it through the Kokuritsu Kokugo
Kenkyuujo.

It's called

Gendai Zasshi Kyuujuushu Yoogo Yooji Hindosuu

- --

Thanks to all who responded, and especially to Scott McGinnis for relaying
the message to the Japanese SLA listserv and passing the replies on to me.

If anyone has further information pertaining to this query, please contact
me off the list at "tmills at zicorp.com".  Anyone wishing further information
regarding any of these responses may contact me.

Sincerely,

	- Tim Mills -
	Zi Corporation

- --------------------------------------------

Tim Mills, Computational Linguist
Zi Corporation
Suite 300, 500 - 4 Avenue SW
Calgary, Alberta
Canada T2P 2V6

Main:  (403) 233.8875
Direct:  (403) 231.4591
Fax:  (403) 231.4595
E-mail:  tmills at zicorp.com
Website:  www.zicorp.com

---------------------------------------------------------------------------
LINGUIST List: Vol-11-2604


