[Corpora-List] Google's translations
Angus Grieve-Smith
grvsmth at panix.com
Sun Mar 14 22:50:39 UTC 2010
On 3/11/2010 10:10 AM, chris brew wrote:
>
>
> On Thu, Mar 11, 2010 at 8:18 AM, Peter Kolb <pekoli at gmail.com
> <mailto:pekoli at gmail.com>> wrote:
>
> I have three comments:
>
> 1. The text by Kant contains a lot of anaphoric pronouns. From
> Google's translation it is obvious that their system does not
> perform any pronoun resolution (or at least none that works better
> than a random baseline). However, there exist German to English
> translation engines on the market that incorporate such components.
>
>
> I would moderate that conclusion. If, as I suspect, the Google engine
> for German to English is a statistical
> one, it will be choosing a translation by optimizing a complex
> internal criterion that involves tradeoffs between multiple criteria.
> Because SMT systems are not conventionally modular, it is hard to
> say what components they have or do not have.
Which is why it would be a huge boon to the science of language if
more of the statistical machine translation systems produced some kind
of human-readable report of what they "learn" from their "training" data.
--
-Angus B. Grieve-Smith
grvsmth at panix.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20100314/eaa93f81/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list