[Lexicog] Collaborative lexicography software?
Mike Maxwell
maxwell at LDC.UPENN.EDU
Fri May 9 15:17:48 UTC 2008
Allan Johnson wrote:
> For a dictionary user to be able to look up any wordform in a
> computer-based (maybe online) dictionary, another approach would be to
> explicitly list all forms in the dictionary...
>
> Automated parsing could still have a role in such a dictionary, but the
> role would be to assist in building the dictionary rather than to assist
> in reading it. When analyzing words that it encounters in vernacular
> texts, the parser would draw its conclusion regarding what roots and
> affixes make up the word, and thus what entries it should be linked to.
> Based on his knowledge of the actual meanings of the words, the human
> dictionary compiler would then evaluate whether to accept the parser's
> choice or make links that the parser didn't predict. If it involves some
> regularity of the language that the parser just doesn't yet handle, the
> dictionary compiler could use this parser failure as feedback to help
> improve the parser's success in future predictions...
This is a description of part of the "stealth to wealth" idea being
implemented for parsing in FLEx. The idea is in part that the program
learns new affixes and stems automatically from the user's hand parses,
as described above. But also, at some point the user is likely to get
frustrated at having to sort through computer-produced parses that are
obviously wrong (like 'cats' as a noun with a 3rd person singular
present tense suffix). This (hopefully) motivates the user to tell the
stupid computer why this parse doesn't make sense, in the process
improving the parser and the grammatical description used by the parser.
It would also be possible for the computer to guess new roots/ stems (or
allomorphs thereof), on the basis of what's left over after it removes
what it believes to be affixes from a previously unseen word. Of
course, these will sometimes be wrong (like calling 'r' a root in the
word 'ring', after removing a supposed -ing suffix), and the computer
won't have a clue about the meaning of these stems. But it might still
be useful.
--
Mike Maxwell
maxwell at ldc.upenn.edu
------------------------------------
Yahoo! Groups Links
<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/lexicographylist/
<*> Your email settings:
Individual Email | Traditional
<*> To change settings online go to:
http://groups.yahoo.com/group/lexicographylist/join
(Yahoo! ID required)
<*> To change settings via email:
mailto:lexicographylist-digest at yahoogroups.com
mailto:lexicographylist-fullfeatured at yahoogroups.com
<*> To unsubscribe from this group, send an email to:
lexicographylist-unsubscribe at yahoogroups.com
<*> Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/
More information about the Lexicography
mailing list