Assessing well-formedness using Google counts

Danko Sipka danko.sipka at ASU.EDU
Mon Sep 13 00:11:23 UTC 2004


Dear Seelangers,

I frequently use Google hit counts to decide which of two options is lexically or morphosyntactically well formed in various languages, and I advise my students to do the same. To save the time needed to query Google twice for a single inquiry, I have created a simple script at:

http://cli.la.asu.edu/togoogleornot.htm

which lets you enter two options, choose the target language, and then get the hits for both options in one window. For example, if a student of Russian enters в ВУЗ as one option and на ВУЗ as the other and selects Russian as the target language, the hit counts will make it obvious that the first option is well formed while the second is not.
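
For readers who want to try the same comparison by hand, here is a minimal sketch of the idea in Python. It is not the code behind the script above. The function names are hypothetical, and it assumes Google's public search URL with the `q` (query) and `lr` (language restriction) parameters. It only builds the two exact-phrase query URLs; the hit counts still have to be read off the result pages.

```python
from typing import Dict
from urllib.parse import urlencode

# Assumption: the public Google search endpoint, not the script's actual target.
GOOGLE_SEARCH = "https://www.google.com/search"


def build_query_url(phrase: str, lang_code: str) -> str:
    """Return a search URL for an exact phrase, restricted to one language."""
    params = {
        "q": f'"{phrase}"',          # quotes request an exact-phrase match
        "lr": f"lang_{lang_code}",   # e.g. lang_ru restricts results to Russian
    }
    return f"{GOOGLE_SEARCH}?{urlencode(params)}"


def comparison_urls(option_a: str, option_b: str, lang_code: str) -> Dict[str, str]:
    """Map each of the two competing options to its query URL."""
    return {
        option_a: build_query_url(option_a, lang_code),
        option_b: build_query_url(option_b, lang_code),
    }


def more_frequent(hits: Dict[str, int]) -> str:
    """Given hit counts read off the result pages, return the option with more hits."""
    return max(hits, key=hits.get)


if __name__ == "__main__":
    for option, url in comparison_urls("в ВУЗ", "на ВУЗ", "ru").items():
        print(option, "->", url)
```

Once the two counts have been noted, passing them to `more_frequent` as a dictionary returns the option with the larger count. A large gap between the counts is the useful signal; a small gap says little.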

I plan to add lemmatizers for several Slavic languages, which would make it possible to search for words in all their inflectional forms, but even in its present form the script may be of interest.

Best,

Danko

Danko Sipka
Research Associate Professor and Acting Director
Critical Languages Institute (http://cli.la.asu.edu)
Arizona State University
E-mail: Danko.Sipka at asu.edu
Web: http://www.public.asu.edu/~dsipka
