[Lexicog] help with N-grams
J.L. DeLucca
jldlme at YAHOO.COM
Sun Oct 26 14:11:32 UTC 2008
Hi Marc,
I have a software tool for generating n-grams (bi-, tri-, tetra- and pentagrams), but I know you are looking for something more precise. Could you send me a short sample of your database or your text?
Best for now,
J. L. De Lucca
Universidad Politécnica de Valencia
Departamento de Linguistica Aplicada
--- On Sat, 10/25/08, Marc FRYD <marc.fryd at univ-poitiers.fr> wrote:
From: Marc FRYD <marc.fryd at univ-poitiers.fr>
Subject: [Lexicog] help with N-grams
To: lexicographylist at yahoogroups.com
Date: Saturday, October 25, 2008, 12:49 AM
Hi all,
I wonder if anyone could help a linguist with moderate programming
abilities with the following task.
I am currently working on a corpus of isolated words with
grapheme-to-phoneme alignments.
I would like to produce an N-gram parsing of both levels of data (the
graphemic and the phonemic) with a view to extracting trends favouring
particular realisations (i.e. this grapheme will be realised as that
phoneme with an x rate of occurrence if preceded/followed by such and
such graphemes). The db is currently c. 3000 words, but it will keep growing.
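To make the kind of count described above concrete, here is a minimal Python sketch. It assumes each word is stored as a list of (grapheme, phoneme) pairs; that layout, the function names, the "#" word-boundary marker and the toy sample are all hypothetical illustrations, not the actual database format.

```python
from collections import Counter, defaultdict

BOUNDARY = "#"  # assumed marker for positions outside the word


def context_counts(words, left=1, right=1):
    """Count how each grapheme is realised, keyed by its graphemic context.

    words: list of words, each a list of (grapheme, phoneme) pairs.
    left/right: number of neighbouring graphemes kept on each side.
    Returns {(before, grapheme, after): Counter({phoneme: count})}.
    """
    counts = defaultdict(Counter)
    for word in words:
        # Pad the grapheme sequence so edge positions still have a context.
        padded = [BOUNDARY] * left + [g for g, _ in word] + [BOUNDARY] * right
        for i, (grapheme, phoneme) in enumerate(word):
            j = i + left  # index of this grapheme in the padded list
            before = tuple(padded[j - left:j])
            after = tuple(padded[j + 1:j + 1 + right])
            counts[(before, grapheme, after)][phoneme] += 1
    return counts


def realisation_rates(counts):
    """Turn raw counts into relative frequencies per context."""
    rates = {}
    for context, phoneme_counts in counts.items():
        total = sum(phoneme_counts.values())
        rates[context] = {p: n / total for p, n in phoneme_counts.items()}
    return rates


# Invented toy data, only to show the expected input shape.
sample = [
    [("c", "k"), ("a", "æ"), ("t", "t")],
    [("c", "s"), ("i", "ɪ"), ("t", "t"), ("y", "i")],
    [("c", "s"), ("e", "ɛ"), ("ll", "l")],
    [("c", "k"), ("o", "ɒ"), ("t", "t")],
]

if __name__ == "__main__":
    rates = realisation_rates(context_counts(sample, left=0, right=1))
    for (before, grapheme, after), dist in sorted(rates.items()):
        print(before, grapheme, after, dist)
```

Setting left and right to 0 gives the context-free rate for each grapheme, and larger values give wider windows. The same function could be run over the phonemic level by swapping the roles of the two members of each pair.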
Cheers,
Marc
--
Dr. Marc FRYD
Senior Lecturer in English Linguistics
Faculté des Lettres et des Langues
Université de Poitiers
95 avenue du Recteur Pineau
86022, Poitiers, France
Office: 05 49 45 48 11
Cell: 06 76 28 18 50