16.1291, Qs: WebCorp Concordance Counts; Children's First Words

LINGUIST List linguist at linguistlist.org
Fri Apr 22 17:17:36 UTC 2005


LINGUIST List: Vol-16-1291. Fri Apr 22 2005. ISSN: 1068 - 4875.

Subject: 16.1291, Qs: WebCorp Concordance Counts; Children's First Words

Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews (reviews at linguistlist.org)
        Sheila Dooley, U of Arizona
        Terry Langendoen, U of Arizona

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Jessica Boynton <jessica at linguistlist.org>
================================================================

We'd like to remind readers that the responses to queries are usually
best posted to the individual asking the question. That individual is
then strongly encouraged to post a summary to the list. This policy was
instituted to help control the huge volume of mail on LINGUIST; so we
would appreciate your cooperating with it whenever it seems appropriate.

In addition to posting a summary, we'd like to remind people that it
is usually a good idea to personally thank those individuals who have
taken the trouble to respond to the query.

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================

1)
Date: 21-Apr-2005
From: Jerry Kurjian < jkurjian at mail.sdsu.edu >
Subject: WebCorp Concordance Counts

2)
Date: 21-Apr-2005
From: Sue Hagen < suezqzeus at yahoo.com >
Subject: Children's First Words

	
-------------------------Message 1 ----------------------------------
Date: Fri, 22 Apr 2005 13:15:20
From: Jerry Kurjian < jkurjian at mail.sdsu.edu >
Subject: WebCorp Concordance Counts


Hi all,
I have a question about the concordance counts produced by the WebCorp site:

http://www.webcorp.org.uk/wcadvanced.html

For example, if I search ''suggest you don't'' vs. ''suggest that you
don't'' using WebCorp (via Google) I get, at the bottom of the page, a
concordance count of 187 vs. 96 kwics respectively.  However, if I search
the same two terms, in quotes, on Google, I get 34,200 vs. 16,200 hits.
The ratios are similar though not the same.

Does anyone have insight into how WebCorp calculates/filters its
concordances or why these two engines are so different in the number of
hits they return?

In fact, it is nice to have the more manageable number produced by WebCorp,
and the external collocate counts it creates.  But if I am interested in
the frequency of ''I'' collocating with the two search terms based on
WebCorp, I'd like to be clearer how those two counts are derived.

Jerry

Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics



	
-------------------------Message 2 ----------------------------------
Date: Fri, 22 Apr 2005 13:15:24
From: Sue Hagen < suezqzeus at yahoo.com >
Subject: Children's First Words

	

Hello,

I'm a graduate student at California State University, Fresno, studying
first word phonology.  Does anyone have available lists of children's first
ten to twenty words (with phonetic representations)?  I'm not looking for
any specific languages,and would appreciate any help.

Thanks,

Sue Hagen

California State University, Fresno
General Linguistics
suezqzeus at yahoo.com

Linguistic Field(s): Language Acquisition
                     Phonology






-----------------------------------------------------------
LINGUIST List: Vol-16-1291	

	



More information about the LINGUIST mailing list