[Lexicog] help with N-grams

Sun Oct 26 14:11:32 UTC 2008

Hi Marc,

I have a software tool for generating n-grams (bi-, tri-, tetra- and pentagrams), but I know you are looking for something more precise. Could you send me a short sample of your database or your text?

Best for now,

J. L. De Lucca
Universidad Politécnica de Valencia
Departamento de Linguistica Aplicada

--- On Sat, 10/25/08, Marc FRYD <marc.fryd at univ-poitiers.fr> wrote:

From: Marc FRYD <marc.fryd at univ-poitiers.fr>
Subject: [Lexicog] help with N-grams
To: lexicographylist at yahoogroups.com
Date: Saturday, October 25, 2008, 12:49 AM

Hi all,
I wonder if anyone could help a linguist with moderate programming 
abilities with the following task.
I am currently working on a corpus of isolated words with 
grapheme-to-phoneme alignment.
I would like to produce an N-gram parsing of both levels of data (the 
graphemic and the phonemic) with a view to extracting trends favouring 
realisations (i.e. this grapheme will realise as that phoneme with an x 
rate of occurrence if preceded/followed by such and such graphemes). The 
db is currently c. 3000 words, but it will keep growing.
Cheers,
Marc
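
The task described above (how often a grapheme is realised as a given phoneme, depending on the neighbouring graphemes) can be sketched roughly as follows. This is only a minimal illustration, not the tool mentioned in the reply: the function name context_rates, the "#" word-boundary symbol, the representation of each word as a list of (grapheme, phoneme) pairs, and the four toy alignments are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def context_rates(words, left=1, right=1, pad="#"):
    """Relative frequency of each phoneme for a grapheme in its context.

    words: list of words, each a list of (grapheme, phoneme) pairs
           (assumed input format; the real database may differ).
    left/right: number of neighbouring graphemes used as context.
    Returns {(left_ctx, grapheme, right_ctx): {phoneme: rate}}.
    """
    counts = defaultdict(Counter)
    for word in words:
        graphemes = [g for g, _ in word]
        # Pad with a boundary symbol so word-initial and word-final
        # positions also get a context of the requested width.
        padded = [pad] * left + graphemes + [pad] * right
        for i, (g, p) in enumerate(word):
            j = i + left  # position of this grapheme in the padded list
            l_ctx = tuple(padded[j - left:j])
            r_ctx = tuple(padded[j + 1:j + 1 + right])
            counts[(l_ctx, g, r_ctx)][p] += 1

    rates = {}
    for key, phoneme_counts in counts.items():
        total = sum(phoneme_counts.values())
        rates[key] = {p: n / total for p, n in phoneme_counts.items()}
    return rates


# Hypothetical toy alignments, for illustration only.
toy = [
    [("c", "k"), ("a", "æ"), ("t", "t")],
    [("c", "k"), ("o", "ɒ"), ("t", "t")],
    [("c", "s"), ("i", "ɪ"), ("t", "t"), ("y", "i")],
    [("c", "s"), ("e", "ɛ"), ("ll", "l")],
]

no_context = context_rates(toy, left=0, right=0)
following = context_rates(toy, left=0, right=1)
print(no_context[((), "c", ())])
print(following[((), "c", ("i",))])
```

With no context, "c" is split evenly between the two realisations in the toy data; with one following grapheme as context, each toy case becomes unambiguous. Widening left and right gives tri-, tetra- and pentagram contexts, and the same counting could be keyed on neighbouring phonemes instead of graphemes to cover the phonemic level.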

-- 
Dr. Marc FRYD
Senior Lecturer in English Linguistics

Faculté des Lettres et des Langues
Université de Poitiers
95 avenue du Recteur Pineau
86022, Poitiers, France

Office: 05 49 45 48 11
Cell: 06 76 28 18 50
