[Lexicog] Dictionary software

Bill Poser billposer2 at GMAIL.COM
Tue Apr 29 03:21:58 UTC 2014


Interesting. The built-in stuff sounds like it won't work well for some
kinds of languages, but if you can escape to Lua then you can do pretty
much anything. I have only used Lua a little but it seems like a good
language for this purpose. Among other things, although its built-in
pattern matching facilities are very limited, the lpeg library, which
implements Roberto Ierusalimschy's Parsing Expression Grammar, is very
impressive.


On Mon, Apr 28, 2014 at 8:04 PM, <benjaminbarrett85 at gmail.com> wrote:

>
>
> I was pretty sure we were talking about the same thing :)
>
> While complex morphology can be handled, I don't think it's generally
> done. String manipulation can be executed by adding a string software
> extension, and more complex operations are possible with the programming
> language lua.
>
> However, the way I see conjugations and declensions generally handled is
> to just provide the necessary forms and have the generator add endings as
> appropriate. Here are the sections showing how plurals of English nouns are
> derived:
>
> {{en-noun}} - nothing provided, resulting in "(plural faces)"
> {{en-noun | wolves}} - the entire plural form is just spelled out
> {{en-noun | sheep}} - the entire plural form is just spelled out
> {{en-noun | es | tomatos}} - This results in "(plural tomatoes or
> tomatos)"
> {{en-noun | fish | fishes}} - both forms are just spelled out
>
> Here are the conjugation portions of two sample verbs (adding spaces for
> ease of reading):
>
> 1. The verb andare in Italian (template at
> https://en.wiktionary.org/wiki/Template:it-conj-are):
>
> {{it-conj-are | and | essere
>  | pres1s=vado | pres1s2=vo | pres2s=vai | pres3s=va | pres3p=vanno
>  | prem1s=andai | prem2s=andasti | prem3s=andò
>  | fut1s=andrò | fut2s=andrai | fut3s=andrà | fut1p=andremo |
> fut2p=andrete | fut3p=andranno
>  | cond1s=andrei | cond2s=andresti | cond3s=andrebbe | cond1p=andremmo |
> cond2p=andreste | cond3p=andrebbero
>  | sub123s=vada | sub3p=vadano
>  | imp2s=va' | imp2s2=va | imp2s3=vai | imp3s=vada | imp3p=vadano}}
>
> The main parameters are that it's an -are verb that takes essere as the
> auxiliary verb. The rest provides irregular forms. From these parameters,
> about 50 conjugation pages are created.
>
> 2. The Latin verb ambio (template at
> https://en.wiktionary.org/wiki/Template:la-verb):
>
> {{la-verb | ambio | ambiō | ambire | ambīre | ambivi | ambīvī | ambitum |
> ambītum}}
>
> From this, about 120 conjugation pages are generated.
>
> One bonus feature that I think was added fairly recently is that
> automatically generated pages are in green until they are edited just so
> people can be aware that there might be a problem.
>
>
> Ben Barrett
> La Conner, WA
>
> Learn Ainu! https://sites.google.com/site/aynuitak1/videos
>
>
>
>
> 
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/lexicography/attachments/20140428/0c0dd3a4/attachment.htm>


More information about the Lexicography mailing list