[Corpora-List] How to create a corpus

Christoph Ruehlemann chrisruehlemann at googlemail.com
Tue Oct 9 12:38:16 UTC 2012


Meganathan,

refs to get you started might be these two (out of many):

1.
www.ahds.ac.uk/guides/linguistic-corpora/appendix.htm

2.
Anne O’Keeffe & M.J. McCarthy. 2010. Routledge Handbook of Corpus
Linguistics. Routledge (esp. Section 2 “Building a corpus: what are
the key considerations?”)

Best
Chris

On 10/9/12, Christoph Ruehlemann <chrisruehlemann at googlemail.com> wrote:
> Meganathan,
>
> useful refs might be these two (out of many):
>
> 1.
> www.ahds.ac.uk/guides/linguistic-corpora/appendix.htm
>
> 2.
> Anne O’Keeffe & M.J. McCarthy. 2010. Routledge Handbook of Corpus
> Linguistics. Routledge (esp. Section 2 “Building a corpus: what are
> the key considerations?”)
>
> Best
> Chris
>
> On 10/9/12, Martin Reynaert <reynaert at uvt.nl> wrote:
>> On 10/09/2012 01:37 PM, Krishnamurthy, Ramesh wrote:
>>> I think you need to:
>>>
>>> a) consider copyright issues, if you intend to use the corpus for
>>> commercial purposes
>>>
>> Hi Meganathan,
>>
>> A corpus is only really useful if it can be shared, at least for
>> research purposes, if not also commercial ones.
>>
>> To be able to do that, Intellectual Property Rights (IPR) issues need to be
>> settled.
>>
>> How we have gone about that for the 500 million word reference corpus of
>> contemporary written Dutch SoNaR is detailed in:
>>
>> De Clercq O. and Reynaert M. (2010), SoNaR Acquisition Manual,
>> LT3 Technical Report LT3 10-02, Hogeschool Gent, Gent, Belgium.
>> <http://taalunieversum.org/taal/technologie/stevin/documenten/sonar_manual.pdf>
>> http://lt3.hogent.be/media/uploads/publications/2010/DeClercq2010a.pdf
>>
>> Good luck!
>>
>> Martin
>>
>
