<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
<title></title>
</head>
<body bgcolor="#ffffff" text="#000000">
Hi J.L.,<br>
Thank you for your kind interest.<br>
Please find attached a .txt sample. Words are presented in sequences of
three lines, in each case:<br>
- Line 1 provides the full graphemic string (English surnames in this
corpus), followed by its phonemic transcription, in which phonemes are
separated by a space.<br>
- Lines 2 and 3 provide the graphemic and phonemic alignments. Note
that conjoined letters are indicated with the "+" sign in graphemic
clusters:<br>
<br>
---- aagaard = "eI g A: d ----<br>
a+a g a+a+r d<br>
"eI g A: d<br>
<br>
In the example above (the English surname Aagard), the clusters are:
<a+a> and <a+a+r>).<br>
Note that primary stress is marked with a double quote < " >, and
secondary stress with the <br>
percent sign < % >, placed directly before the stressed vowels.<br>
Do note that the phonemic symbols are case-sensitive.<br>
Just ask if there is anything else you need to know.<br>
With kind regards,<br>
Marc<br>
<br>
J.L. DeLucca wrote:
<blockquote cite="mid:802557.56350.qm@web35907.mail.mud.yahoo.com"
type="cite">
<div id="ygrp-text">
<p>
<table border="0" cellpadding="0" cellspacing="0">
<tbody>
<tr>
<td
style="font-family: inherit; font-style: inherit; font-variant: inherit; font-weight: inherit; font-size: inherit; line-height: inherit; font-size-adjust: inherit; font-stretch: inherit;"
valign="top">
<div>Hi Mark,</div>
<div> </div>
<div>I have a software tool for doing ngrams (bi,tri,tetra y
penta), but I know I you are looking for something more precise. Could
you send me a short piece of your database or your text?</div>
<div> </div>
<div>Best for now,<br>
<br>
J. L. De Lucca<br>
Universidad Politécnica de Valencia<br>
Departamento de Linguistica Aplicada<br>
<br>
--- On <b>Sat, 10/25/08, Marc FRYD <i><marc.fryd@univ-<wbr>poitiers.<wbr>fr></i></b>
wrote:<br>
</div>
<blockquote style="border-left: 2px solid rgb(16, 16, 255);">From:
Marc FRYD <marc.fryd@univ-<wbr>poitiers.<wbr>fr><br>
Subject: [Lexicog] help with N-grams<br>
To: lexicographylist@<wbr>yahoogroups.<wbr>com<br>
Date: Saturday, October 25, 2008, 12:49 AM<br>
<br>
<div id="yiv1844995831">
<div id="ygrp-text">
<div>Hi all,<br>
I wonder if anyone could help a linguist with moderate programming <br>
abilities with the following task.<br>
I am currently working on a corpus of aligned grapheme-to- phoneme <br>
isolated words.<br>
I would like to produce an N-gram parsing of both levels of data (the <br>
graphemic and the phonemic) with a view to extracting trends favouring <br>
realisations (i.e. this grapheme will realise as that phoneme with an x
<br>
rate of occurrence if preceded/followed by such and such graphemes).
The <br>
db is currently c3000 words, but it will keep growing.<br>
Cheers,<br>
Marc<br>
<br>
-- <br>
Dr. Marc FRYD<br>
Senior Lecturer in English Linguistics<br>
<br>
Faculté des Lettres et des Langues<br>
Université de Poitiers<br>
95 avenue du Recteur Pineau<br>
86022, Poitiers, France<br>
<br>
Office: 05 49 45 48 11<br>
Cell: 06 76 28 18 50<br>
<br>
</div>
</div>
</div>
</blockquote>
</td>
</tr>
</tbody>
</table>
<br>
</p>
</div>
<!--End group email --> </blockquote>
<br>
<br>
<span width="1" style="color: white;"/>__._,_.___</span>
<!-- Start Recommendations -->
<!-- End Recommendations -->
<!-- |**|begin egp html banner|**| -->
<img src="http://geo.yahoo.com/serv?s=97476590/grpId=11682781/grpspId=1709195911/msgId=4723/stime=1225043729" width="1" height="1"> <br>
<!-- |**|end egp html banner|**| -->
<!-- |**|begin egp html banner|**| -->
<br>
<div style="font-family: verdana; font-size: 77%; border-top: 1px solid #666; padding: 5px 0;" >
Your email settings: Individual Email|Traditional <br>
<a href="http://groups.yahoo.com/group/lexicographylist/join;_ylc=X3oDMTJnYnJlZ3Z0BF9TAzk3NDc2NTkwBGdycElkAzExNjgyNzgxBGdycHNwSWQDMTcwOTE5NTkxMQRzZWMDZnRyBHNsawNzdG5ncwRzdGltZQMxMjI1MDQzNzI5">Change settings via the Web</a> (Yahoo! ID required) <br>
Change settings via email: <a href="mailto:lexicographylist-digest@yahoogroups.com?subject=Email Delivery: Digest">Switch delivery to Daily Digest</a> | <a href = "mailto:lexicographylist-fullfeatured@yahoogroups.com?subject=Change Delivery Format: Fully Featured">Switch to Fully Featured</a> <br>
<a href="http://groups.yahoo.com/group/lexicographylist;_ylc=X3oDMTJlNHBqcjF1BF9TAzk3NDc2NTkwBGdycElkAzExNjgyNzgxBGdycHNwSWQDMTcwOTE5NTkxMQRzZWMDZnRyBHNsawNocGYEc3RpbWUDMTIyNTA0MzcyOQ--">
Visit Your Group
</a> |
<a href="http://docs.yahoo.com/info/terms/">
Yahoo! Groups Terms of Use
</a> |
<a href="mailto:lexicographylist-unsubscribe@yahoogroups.com?subject=Unsubscribe">
Unsubscribe
</a>
<br>
</div>
<br>
<!-- |**|end egp html banner|**| -->
<span style="color: white;"/>__,_._,___</span>
</body>
</html>