[Lexicog] help with N-grams

Piotr Bański bansp at O2.PL
Sat Oct 25 10:50:14 UTC 2008


Dear Marc,

Your query may be best answered on the Corpora list:

http://gandalf.aksis.uib.no/corpora/

Do try it, you're going to get lots of useful hints there.

Good luck,

  Piotr

Marc FRYD pisze:
> Hi all,
> I wonder if anyone could help a linguist with moderate programming 
> abilities with the following task.
> I am currently working on a corpus of aligned grapheme-to-phoneme 
> isolated words.
> I would like to produce an N-gram parsing of both levels of data (the 
> graphemic and the phonemic) with a view to extracting trends favouring 
> realisations (i.e. this grapheme will realise as that phoneme with an x 
> rate of occurrence if preceded/followed by such and such graphemes). The 
> db is currently c3000 words, but it will keep growing.
> Cheers,
> Marc
> 


------------------------------------

Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/lexicographylist/

<*> Your email settings:
    Individual Email | Traditional

<*> To change settings online go to:
    http://groups.yahoo.com/group/lexicographylist/join
    (Yahoo! ID required)

<*> To change settings via email:
    mailto:lexicographylist-digest at yahoogroups.com 
    mailto:lexicographylist-fullfeatured at yahoogroups.com

<*> To unsubscribe from this group, send an email to:
    lexicographylist-unsubscribe at yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/



More information about the Lexicography mailing list