[Corpora-List] looking for spacial metaphors
Mark Davies
Mark_Davies at byu.edu
Fri May 6 00:09:02 UTC 2011
>>From Google Books (American books dataset; 155 **billion** words):
285,342 tokens, 33 types for "preposition + Internet" (top results = of 102468, for 39495, in 34214, on 22085)
772,579 tokens, 45 types for "preposition + the + Internet" (top results = on 329826, of 172917, from 43319, via 36636, through 25682, in 24057)
(Note: the 155 billion words Google Books corpus will be released next week, using more or less the same interface as the other corpora from http://corpus.byu.edu)
------------------
>>From COCA (http://corpus.byu.edu/coca); 425 million words, 1990-2011:
2995 tokens, 34 types for "preposition + Internet": http://corpus.byu.edu/coca/?c=coca&q=10133470
12,859 tokens, 42 types for "preposition + the + Internet": http://corpus.byu.edu/coca/?c=coca&q=10133477
Best,
Mark Davies
============================================
Mark Davies
Professor of (Corpus) Linguistics
Brigham Young University
(phone) 801-422-9168 / (fax) 801-422-0906
Web: http://davies-linguistics.byu.edu
** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================
________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] On Behalf Of Eric Atwell [csc6ea at leeds.ac.uk]
Sent: Thursday, May 05, 2011 4:27 PM
To: Ilze Black
Cc: corpora at uib.no
Subject: Re: [Corpora-List] looking for spacial metaphors
Ilze,
a starting point could be to collect corpus evidence of the prepositions
used with "internet" (and similar terms) in a range of languages. For
this, you could use Serge Sharoff's Querying Internet Corpora website
http://corpus.leeds.ac.uk/internet.html
- you can query large web-corpora of 13 languages, looking for
collocates with a given PoS-tag. For example, prepositions
with "internet" for English:
Collocation Joint Freq1 Freq2 LL score Concordance
via internet 10 16738 8.77 Examples
with internet 84 1004574 8.74 Examples
for internet 104 1597989 4.97 Examples
about internet 30 350787 3.28 Examples
on internet 68 1109319 2.51 Examples
of internet 222 4658360 1.34 Examples
against internet 5 67629 0.37 Examples
through internet 8 130426 0.29 Examples
over internet 9 157134 0.23 Examples
without internet 4 80007 0.04 Examples
compared to, for example, Italian:
su internet 947 207279 1776.05 Examples
via internet 240 83931 387.79 Examples
attraverso internet 126 43705 203.57 Examples
ad internet 206 309169 187.39 Examples
tramite internet 85 11838 175.70 Examples
in internet 383 1834798 154.12 Examples
da internet 183 803566 79.17 Examples
di internet 388 4773196 33.50 Examples
a internet 115 1856391 3.46 Examples
contro internet 10 85763 1.83 Examples
con internet 53 890519 1.27 Examples
per internet 78 1533484 0.53 Examples
It's up to you to identify the interesting conclusions!
good luck,
Eric Atwell, Leeds University
On Thu, 5 May 2011, Ilze Black wrote:
> Dear List,
>
> I am a PhD student and quite new to this list but I hope you dont mind
> me asking for help here. I have just started my research on spacial
> perception of the Internet. In my initial phase I am trying to establish
> if there are any relevant study been done on a similar questions and in
> particular on any spacial metaphors. In English we often say 'on' the
> Internet, while in Latvia, for example, they say 'in' the Internet. What
> drives these linguistic metaphors? May be someone has some suggestion on
> previous ethnographic studies, method I am thinking to use. Or other
> methodology or models I should look at? Does anyone has any thoughts on
> this? I am also looking to collect spacial metaphors used of the
> Internet space in different languages. Would love to hear your thoughts,
> suggestions and metaphors.
> Thank you in advance and looking forward to hear.
>
> Ilze Black
> black at eecs.qmul.ac.uk
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
--
Eric Atwell, Senior Lecturer, Language research group,
I-AIBS Institute for Artificial Intelligence and Biological Systems
School of Computing, Faculty of Engineering, UNIVERSITY OF LEEDS
Leeds LS2 9JT, England. TEL: 0113-3435430 FAX: 0113-3435468
WWW: http://www.comp.leeds.ac.uk/arabic
http://www.comp.leeds.ac.uk/nlp
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list