[Corpora-List] Google's translations

Francis Tyers ftyers at prompsit.com
Sun Mar 14 21:52:41 UTC 2010


El dg 14 de 03 de 2010 a les 18:50 -0400, en/na Angus Grieve-Smith va
escriure:
> On 3/11/2010 10:10 AM, chris brew wrote: 
> > 
> > 
> > On Thu, Mar 11, 2010 at 8:18 AM, Peter Kolb <pekoli at gmail.com>
> > wrote:
> >         I have three comments:
> >         
> >         1. The text by Kant contains a lot of anaphoric pronouns.
> >         From Google's translation it is obvious that their system
> >         does not perform any pronoun resolution (or at least none
> >         that works better than a random baseline). However, there
> >         exist German to English translation engines on the market
> >         that incorporate such components.
> > 
> > 
> > I would moderate that conclusion. If, as I suspect, the Google
> > engine for German to English is a statistical
> > one, it will be choosing a translation by optimizing a complex
> > internal criterion that involves tradeoffs between multiple
> > criteria. Because SMT systems are not conventionally modular, it is
> > hard to 
> > say what components they have or do not have.
> > 
> 
>     Which is why it would be a huge boon to the science of language if
> more of the statistical machine translation systems produced some kind
> of human-readable report of what they "learn" from their "training"
> data.

Wouldn't that be the phrase table and language model ? :)

Fran


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list