[Lexicog] Features of a lexical entry in hard copy dictionary

maxwell at LDC.UPENN.EDU maxwell at LDC.UPENN.EDU
Wed Jan 3 12:57:03 UTC 2007


Quoting ali72678 <ali72678 at yahoo.com>:
> I am working on Urdu Lexicon. If I want to see an entry like
> "dekhna"(to see)What features should a standard dictiony have?

I guess the first question to ask is who the dictionary is for: 
learners of Urdu, or learners of English, or both?  At what 
level--grade school, High School, college, tourists,...  (I'm assuming 
this is intended for people to use, not for machines--i.e. it's not for 
machine translation.)

The next question (influenced, but not determined, by the answer to the 
previous question) is how complete it's intended to be.  Is it a pocket 
dictionary, a desk dictionary, or _?  That may have more to do with how 
many entries it will have, but it also influences how complete each 
individual entry should be.

When I worked with smallish (several thousand entry) bilingual 
dictionaries intended to document a language (thus in theory intended 
for learners of that language), what I considered to be a minimal entry 
on the target language side was:
  The headword (citation form);
  Any irregular forms;
  A part of speech (for the target language word--
     people sometimes get confused and put the part     of speech of 
the glossing language word instead);
  Inflection (paradigm or declension) class where
     relevant;
  One or more senses with glosses (not definitions);
  One or more example sentences for each sense of     any word that 
wasn't a concrete noun.
On this last point, my feeling was that concrete nouns (dog, house, 
book...) don't need example sentences, but that most anything else 
does: verbs most definitely, but also some adjectives, adverbs (because 
this term usually covers a multitude of categories, i.e. it's 
ill-defined), and abstract nouns (because the argument-taking 
properties of the latter are not always obvious: "the destruction _of_ 
Carthage" but "the attack _on_ Rome"), and function words.  (The latter 
may be documented instead in a grammar, but there are usually some odd 
ones that don't fit the pattern, like "between" or "ago" in English.)

You'll notice I don't mention etymology.  I suppose for Urdu, though, 
that it might be relevant, since the usage of Urdu words is influenced, 
I am told, by whether they come from Sanskrit or Arabic (or some other 
source).  Speaking of which, you will doubtless have to make decisions 
about whether to include recent loanwords, particularly technical 
terminology taken from English (and maybe words which are more common 
in Hindi than in Urdu?).

I find a lot of good ideas on making up example sentences in 
Bartholomew and Schoenhall's older book:

   Bartholomew, Doris A. and Louise C. Schoenhals. 1983. Bilingual   
dictionaries for indigenous languages. Mexico: Summer Institute   of 
Linguistics.

Doubtless out of print now and hard to find, and written before 
lexicography became a computational project, but still useful.  I 
suppose there are better ones now...  And call me olde fashioned, but I 
believe example sentences should be made up, not (just) found in 
corpora (unless you have a huge corpus).

   Mike Maxwell

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.



 
Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/lexicographylist/

<*> Your email settings:
    Individual Email | Traditional

<*> To change settings online go to:
    http://groups.yahoo.com/group/lexicographylist/join
    (Yahoo! ID required)

<*> To change settings via email:
    mailto:lexicographylist-digest at yahoogroups.com 
    mailto:lexicographylist-fullfeatured at yahoogroups.com

<*> To unsubscribe from this group, send an email to:
    lexicographylist-unsubscribe at yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/
 



More information about the Lexicography mailing list