[Corpora-List] plain text extraction from ICE-GB...
Stefan Th. Gries
STGries at sitkom.sdu.dk
Mon Nov 22 14:37:02 UTC 2004
Just use grep or a similar utility to extract all sequences of the
form {*}, i.e. anything enclosed in curly braces, at the end of a line.
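
For anyone who prefers a script to grep, here is a minimal sketch of the same idea in Python. It assumes that each word of the transcript sits in curly braces at the very end of a line in the corpus files; the sample lines and the function name extract_words are made up for illustration and are not copied from ICE-GB, so the pattern may need adjusting to the real files.

```python
import re

# Assumption: the word we want is enclosed in {...} at the end of a line,
# optionally followed by trailing whitespace.
BRACED_AT_EOL = re.compile(r"\{([^{}]*)\}\s*$")

def extract_words(lines):
    """Return the brace-enclosed text found at the end of each line."""
    words = []
    for line in lines:
        match = BRACED_AT_EOL.search(line)
        if match:
            # Keep only the text between the braces.
            words.append(match.group(1))
    return words

# Hypothetical markup lines, for illustration only.
sample = [
    "PU,CL(main)",
    " SU,NP PRON(pers) {I}",
    " VB,VP V(montr,pres) {like}",
    " OD,NP N(com,sing) {corpora}",
]
print(" ".join(extract_words(sample)))
```

To run this on a real file, pass an open file object instead of the sample list; lines without a brace-enclosed sequence at the end are simply skipped.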
Best,
STG
Stefan Th. Gries
----------------------------------------
IFKI, Southern Denmark University
http://people.freenet.de/Stefan_Th_Gries
----------------------------------------
Ute Römer wrote:
> Dear all,
>
> In a pragmatics class I would like to use some real samples of classroom
> interaction (for my students to analyse the move structure in terms of IRF
> etc.). I thought I could simply take one or two of the transcripts included
> in ICE-GB but now I don't seem to find a way of copying (cutting/pasting)
> texts from ICECUP (in the browse text mode) to a Word or text file. My
> not-so-ideal solution was to enlarge font size and use screenshots, but I am
> not quite happy with that (some lines are cut off). When I go to the corpus
> files proper, I get all the markup material which I don't want either. Does
> anyone know of a way of extracting plain text from this corpus?
>
> Thanks and best wishes... Ute