[Corpora-List] Compared or Parallel Corpus (corrected)
Hilde Hasselgård
hilde.hasselgard at ilos.uio.no
Mon Mar 2 10:40:55 UTC 2009
In his book *Seeing through Multilingual Corpora* (Benjamins 2007), Stig
Johansson proposes the following terminology for different kinds of
multilingual corpora:
- *translation corpora* "contain original texts and their translations
into one or more other languages" (Johansson 2007: 9)
- *comparable corpora* "contain original texts in two or more languages
matched by criteria such as genre, time of publication, etc." (Johansson
2007: 9)
- the term *parallel corpus* is reserved for "bidirectional translation
corpora", i.e. the type of corpus that combines translation corpora and
comparable corpora within the same framework (comparable originals in at
least two languages plus their translation into the other language(s))
(Johansson 2007: 10-11)
Best wishes,
Hilde Hasselgård
On 01.03.2009 18:11, Alberto Simões wrote:
> Hello, Colt
>
> J. R. Colt Clint wrote:
>
>> Dear All,
>>
>> Could someone tell me: if texts (e.g., novels) were originally written
>> in other languages but have been translated into English, and these
>> texts are then aligned sentence by sentence, are we dealing with a
>> parallel corpus or a compared corpus?
>>
>> Alternatively, if texts (e.g., documents from the World Bank or the EU)
>> were originally written in several languages at the same time, and these
>> texts are then aligned sentence by sentence, are we dealing with a
>> compared corpus or a parallel corpus?
>>
>
> In either case you are dealing with parallel corpora, as the texts say
> exactly the same thing (OK, in some cases almost the same).
>
> Compared or comparable corpora are what you have when you are dealing
> with two texts, originally written in two different languages, that are
> about the same subject.
>
> Cheers
> Alberto
>
>> Best regards
>>
>> J.R. Colt Clint
>>
>>
>> ------------------------------------------------------------------------
>>
>> _______________________________________________
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
>>
>
>
--
Hilde Hasselgård
http://www.hf.uio.no/ilos/om-instituttet/ansatte/vit/hhasselg.xml