[Corpora-List] Chinese Name Gender Recognition

Mark Lewellen lewellen at erols.com
Wed Dec 21 15:49:16 UTC 2005


Since Chinese given names are not limited to a set of
lexical items that are prototypically 'names' (i.e. they
can be just about any lexical item), they often give,
as you probably know, no clue about gender.
There has been some discussion of 'traits' that are
more feminine or masculine and would be reflected in names,
but a lot of ambiguity remains.  I doubt there is any
statistical method, algorithm, or even native speaker that
can make up for that problem!
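
To make the ambiguity concrete, here is a minimal sketch of the kind of
character-level Bayes model mentioned in the quoted message below. It is
only an illustration under stated assumptions: the training names are
made-up toy data (not taken from any corpus), and the function names
`train` and `posterior` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy data (given name, gender); NOT drawn from any real corpus.
TRAIN = [
    ("伟强", "M"), ("建军", "M"), ("志华", "M"), ("文强", "M"),
    ("丽芳", "F"), ("秀娟", "F"), ("丽华", "F"), ("文娟", "F"),
]

def train(examples):
    """Count how often each character occurs in names of each gender."""
    char_counts = {"M": Counter(), "F": Counter()}
    name_counts = Counter()
    for name, gender in examples:
        name_counts[gender] += 1
        char_counts[gender].update(name)  # a string updates per character
    vocab = set(char_counts["M"]) | set(char_counts["F"])
    return char_counts, name_counts, vocab

def posterior(name, model):
    """Return P(gender | name) under naive Bayes with add-one smoothing."""
    char_counts, name_counts, vocab = model
    total_names = sum(name_counts.values())
    log_scores = {}
    for gender in ("M", "F"):
        total_chars = sum(char_counts[gender].values())
        # The extra +1 in the denominator reserves mass for unseen characters.
        denom = total_chars + len(vocab) + 1
        score = math.log(name_counts[gender] / total_names)  # class prior
        for ch in name:
            score += math.log((char_counts[gender][ch] + 1) / denom)
        log_scores[gender] = score
    # Normalise the log scores into probabilities.
    top = max(log_scores.values())
    weights = {g: math.exp(s - top) for g, s in log_scores.items()}
    z = sum(weights.values())
    return {g: w / z for g, w in weights.items()}

if __name__ == "__main__":
    model = train(TRAIN)
    for name in ("伟强", "丽娟", "文华"):
        print(name, posterior(name, model))
```

In this toy data the characters 文 and 华 occur equally often in male and
female names, so the model gives 文华 a posterior of 0.5 for each gender.
No amount of smoothing or tuning changes that. Only more informative
features, or more context than the name itself, could.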

Mark Lewellen

> -----Original Message-----
> From: owner-corpora at lists.uib.no 
> [mailto:owner-corpora at lists.uib.no] On Behalf Of Jun Lang
> Sent: Tuesday, December 13, 2005 7:31 AM
> To: 'Xiaofei Lu'
> Cc: corpora at uib.no
> Subject: [Corpora-List] Reply: [Corpora-List] Chinese Name 
> Gender Recognition
> 
> 
> Yeah! There are many names which could be used for both male and
> female. It is a difficult problem. So far I have done some simple
> research on this topic, and recently I have been trying to get more
> data. Since the parameter space is huge, decision trees cannot
> produce the final result quickly. I want to use a Bayes model again.
> 
> Can you give me some ideas about it?  Thanks a lot!
> 
> Best wishes,
> Jun Lang
> 
> -----Original Message-----
> From: Xiaofei Lu [mailto:xflu at ling.ohio-state.edu] 
> Sent: December 13, 2005 13:56
> To: Jun Lang
> Subject: Re: [Corpora-List] Chinese Name Gender Recognition
> 
> Interesting. What is the baseline, and how do you establish it?
> Many names can be either male or female, can't they?
> 
> On Tue, 13 Dec 2005, Jun Lang wrote:
> 
> > Hi all Corpora Members,
> >
> >    I am now studying Chinese name gender recognition. The input is a
> > Chinese name. The output is the corresponding gender. I used the
> > decision tree method, but in the end the accuracy is only about 70%.
> >
> >    Do you know any other method which can achieve higher accuracy? And
> > has anybody done any similar research?
> >
> >    Thanks a lot!
> >
> > Best wishes,
> >
> > Bill_Lang (Jun Lang): Ph.D. Candidate
> > Information Retrieval Laboratory
> > Harbin Institute of Technology
> > Mail: bill_lang at gmail.com
> > Homepage: http://ir.hit.edu.cn/~bill_lang
> >
> 


