<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Dear Isabella,<br>
<br>
you coult try with ICU project:<br>
<br>
<a class="moz-txt-link-freetext" href="http://icu-project.org/apiref/icu4j/com/ibm/icu/text/Transliterator.html">http://icu-project.org/apiref/icu4j/com/ibm/icu/text/Transliterator.html</a><br>
<br>
Best regards,<br>
Alberto<br>
<pre class="moz-signature" cols="72">Alberto Barrón Cedeño (Ph.D.)
Departament de Llenguatges i Sistemes Informàtics
Universitat Politècnica de Catalunya
<a class="moz-txt-link-freetext" href="http://www.lsi.upc.edu/~albarron">http://www.lsi.upc.edu/~albarron</a></pre>
On 27/02/13 17:06, Isabella Chiari wrote:<br>
</div>
<blockquote
cite="mid:7618952F-66C7-481B-8C6E-B0C0EDB9ED3D@uniroma1.it"
type="cite">
<meta http-equiv="Content-Type" content="text/html;
charset=ISO-8859-1">
<div>Dear Corpora list members,</div>
<div>on behalf of a colleague I ask your help in order to find a
transliteration/romanization tool for modern greek texts. </div>
<div>Is there anything available (for free or for purchase?)?</div>
<div>Thank you in advance for your help,</div>
<div>Isabella</div>
<div><br>
</div>
<br>
<div>
<div>Il giorno 25/feb/2013, alle ore 16:23, Gill Philip <<a
moz-do-not-send="true"
href="mailto:g.philip.polidoro@gmail.com">g.philip.polidoro@gmail.com</a>>
ha scritto:</div>
<br class="Apple-interchange-newline">
<blockquote type="cite">Although it has its critics and its weak
points, a pretty good point of reference is Berlin & Kay
1969. Their listing of colour words actually refers to
existence in languages: if a language has a "blue" colour
term, then it already has black, white, red, green &
yellow: no language (in their study) can have, e.g. "pink" if
it doesn't already have "blue".<br>
<br>
Anyway, as a rough guide, their order is (Berlin and Kay 1969:
4)<br>
white & black<br>
red<br>
yellow & green<br>
blue<br>
brown<br>
pink / purple / grey/ orange<br>
<br>
When I looked at colour words in English and Italian, I got
these figures (freq. per million)<br>
<br>
ENGLISH (Bank of English, circa 2003)<br>
white (316) & black (294)<br>
red (182)<br>
green (139), brown (136), blue (122)<br>
grey (63)<br>
yellow (51)<br>
pink (37) & purple (15)<br>
orange (35)<br>
<br>
ITALIAN (CORIS, circa 2003)<br>
White (Bianco, 308)<br>
Red (Rosso, 267) and Black (Nero, 265)<br>
Green (Verde, 176)<br>
Blue (=143: Azzurro, 85 plus Blu, 58)<br>
Pink (Rosa, 90), Yellow (Giallo, 82), Grey (Grigio, 63)<br>
Purple (Viola, 22)<br>
Brown (Marrone, 13)<br>
Orange (Arancione, 9)<br>
<br>
They're not an exact match with B&K's sequencing, but you
can see the basic principle at work. Black, white and red are
clearly more common than the other colours; blue and green are
similar in frequency; pink & purple form another group. I
should mention, though, that this is a fairly crude measure,
and not based on POS-tagged data. There are problems with
homographs, e.g. "orange" is also the fruit in English (but
not in Italian); Brown is a surname in English (and was the
name of the then Chancellor, subsequently Prime Minister, so
cropped up disproportionately in the data).<br>
<br>
This data comes from my long-forgotten PhD dissertation
"Collocation and Connotation": I believe it's still hanging
around on the web somewhere.<br>
<br>
hope this helps,<br>
Gill<br>
<br>
<div class="gmail_quote">
On 25 February 2013 14:31, H.A.E Viethen <span dir="ltr"><<a
moz-do-not-send="true"
href="mailto:H.A.E.Viethen@uvt.nl" target="_blank">H.A.E.Viethen@uvt.nl</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi,<br>
<br>
we are looking for a way to estimate the relative
frequency of colour<br>
terms in different languages, in particular Greek and
Dutch. So for<br>
example, we'd like to know how frequent the term 'rood'
(red) is in<br>
Dutch compared to the term 'roze' (pink), or how the
frequencies of<br>
the terms 'ble' and 'galázio' compare in Greek.<br>
<br>
We only need ballpark figures, the kind of thing one might
estimate<br>
with hit counts in web searches, altough having slightly
more<br>
reliable numbers than that would be nice. In any case,
many Greek<br>
colour terms are derived from common nouns for objects in
the natural<br>
environment and usually even spelled the same. This makes
it difficult<br>
to distinguish the use of a word as a colour term from its
use as a<br>
common noun.<br>
<br>
Does anyone know of a resource (paper, website, anything)
that might<br>
readily list relative frequencies for colour terms in
Greek and Dutch?<br>
Alternatively, can anyone point us to a POS-tagged corpus
of Greek or<br>
Dutch which would be suitable for counting the use of
colour terms?<br>
<br>
Many thanks,<br>
<br>
Jette Viethen<br>
Tilburg University<br>
<br>
_______________________________________________<br>
UNSUBSCRIBE from this page: <a moz-do-not-send="true"
href="http://mailman.uib.no/options/corpora"
target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a moz-do-not-send="true" href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a moz-do-not-send="true"
href="http://mailman.uib.no/listinfo/corpora"
target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
</blockquote>
</div>
<br>
<br clear="all">
<br>
-- <br>
*********************************<br>
Dr. Gill Philip<br>
Università degli Studi di Macerata<br>
Dipartimento di Scienze della Formazione, dei Beni Culturali,
e del Turismo<br>
Piazzale L. Bertelli<br>
Contrada Vallebona<br>
62100 Macerata<br>
Italy
_______________________________________________<br>
UNSUBSCRIBE from this page: <a moz-do-not-send="true"
href="http://mailman.uib.no/options/corpora">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a moz-do-not-send="true" href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a class="moz-txt-link-freetext" href="http://mailman.uib.no/listinfo/corpora">http://mailman.uib.no/listinfo/corpora</a><br>
</blockquote>
</div>
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
UNSUBSCRIBE from this page: <a class="moz-txt-link-freetext" href="http://mailman.uib.no/options/corpora">http://mailman.uib.no/options/corpora</a>
Corpora mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Corpora@uib.no">Corpora@uib.no</a>
<a class="moz-txt-link-freetext" href="http://mailman.uib.no/listinfo/corpora">http://mailman.uib.no/listinfo/corpora</a>
</pre>
</blockquote>
<br>
</body>
</html>