<html><head><meta http-equiv="Content-Type" content="text/html charset=iso-8859-1"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div>Dear Corpora list members,</div><div>on behalf of a colleague I ask your help in order to find a transliteration/romanization tool for modern greek texts. </div><div>Is there anything available (for free or for purchase?)?</div><div>Thank you in advance for your help,</div><div>Isabella</div><div><br></div><br><div><div>Il giorno 25/feb/2013, alle ore 16:23, Gill Philip <<a href="mailto:g.philip.polidoro@gmail.com">g.philip.polidoro@gmail.com</a>> ha scritto:</div><br class="Apple-interchange-newline"><blockquote type="cite">Although it has its critics and its weak points, a pretty good point of reference is Berlin & Kay 1969. Their listing of colour words actually refers to existence in languages: if a language has a "blue" colour term, then it already has black, white, red, green & yellow: no language (in their study) can have, e.g. "pink" if it doesn't already have "blue".<br>
<br>Anyway, as a rough guide, their order is (Berlin and Kay 1969: 4)<br>white & black<br>red<br>yellow & green<br>blue<br>brown<br>pink / purple / grey/ orange<br><br>When I looked at colour words in English and Italian, I got these figures (freq. per million)<br>
<br>ENGLISH (Bank of English, circa 2003)<br>white (316) & black (294)<br>red (182)<br>green (139), brown (136), blue (122)<br>grey (63)<br>yellow (51)<br>pink (37) & purple (15)<br>orange (35)<br><br>ITALIAN (CORIS, circa 2003)<br>
White (Bianco, 308)<br>Red (Rosso, 267) and Black (Nero, 265)<br>Green (Verde, 176)<br>Blue (=143: Azzurro, 85 plus Blu, 58)<br>Pink (Rosa, 90), Yellow (Giallo, 82), Grey (Grigio, 63)<br>Purple (Viola, 22)<br>Brown (Marrone, 13)<br>
Orange (Arancione, 9)<br><br>They're not an exact match with B&K's sequencing, but you can see the basic principle at work. Black, white and red are clearly more common than the other colours;
blue and green are similar in frequency; pink & purple form another group. I should mention, though, that this is a fairly crude measure, and not based on POS-tagged data. There are problems with homographs, e.g. "orange" is also the fruit in English (but not in Italian); Brown is a surname in English (and was the name of the then Chancellor, subsequently Prime Minister, so cropped up disproportionately in the data).<br>
<br>This data comes from my long-forgotten PhD dissertation "Collocation and Connotation": I believe it's still hanging around on the web somewhere.<br><br>hope this helps,<br>Gill<br><br><div class="gmail_quote">
On 25 February 2013 14:31, H.A.E Viethen <span dir="ltr"><<a href="mailto:H.A.E.Viethen@uvt.nl" target="_blank">H.A.E.Viethen@uvt.nl</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi,<br>
<br>
we are looking for a way to estimate the relative frequency of colour<br>
terms in different languages, in particular Greek and Dutch. So for<br>
example, we'd like to know how frequent the term 'rood' (red) is in<br>
Dutch compared to the term 'roze' (pink), or how the frequencies of<br>
the terms 'ble' and 'galázio' compare in Greek.<br>
<br>
We only need ballpark figures, the kind of thing one might estimate<br>
with hit counts in web searches, altough having slightly more<br>
reliable numbers than that would be nice. In any case, many Greek<br>
colour terms are derived from common nouns for objects in the natural<br>
environment and usually even spelled the same. This makes it difficult<br>
to distinguish the use of a word as a colour term from its use as a<br>
common noun.<br>
<br>
Does anyone know of a resource (paper, website, anything) that might<br>
readily list relative frequencies for colour terms in Greek and Dutch?<br>
Alternatively, can anyone point us to a POS-tagged corpus of Greek or<br>
Dutch which would be suitable for counting the use of colour terms?<br>
<br>
Many thanks,<br>
<br>
Jette Viethen<br>
Tilburg University<br>
<br>
_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</blockquote></div><br><br clear="all"><br>-- <br>*********************************<br>Dr. Gill Philip<br>Universitŕ degli Studi di Macerata<br>Dipartimento di Scienze della Formazione, dei Beni Culturali, e del Turismo<br>
Piazzale L. Bertelli<br>Contrada Vallebona<br>62100 Macerata<br>Italy
_______________________________________________<br>UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora">http://mailman.uib.no/options/corpora</a><br>Corpora mailing list<br><a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>http://mailman.uib.no/listinfo/corpora<br></blockquote></div><br></body></html>