[Corpora-List] Query about Internet corpora (different text types)
Angus B. Grieve-Smith
grvsmth at panix.com
Fri Jun 3 02:13:47 UTC 2011
On 6/2/2011 7:03 PM, Christine Amling wrote:
> Hello everyone.
> My name is Christine Amling and I am a master's student at the
> university of Mainz. At the moment I am working on my master's paper
> which is about the distribution and usage of the so-called "New
> Quotative" be like in Internet contexts
> (http://martina.lampert-mainz.de/belike.php ) and for this purpose I
> need an Internet corpus. The study aims to analyze different text
> types and at the moment I want to look at message boards, Twitter, IRC
> chat and blogs. If possible also Instant Messaging, but access is
> restricted.
> My problem is that I don't find any valid IRC chat data and I was
> wondering if there is already an existing corpus about that, but in
> general I would be thankful for all kinds of information anybody has
> on Internet text types and registers. (Instant Messaging corpora would
> be appreciated as well - I know that there is study from Jones &
> Schiefflin 2009, is there any way to access the corpus they used?)
I don't know about existing IRC corpora of any other languages
(since you're clearly looking for English), but in 1999 I compiled my
own corpus of IRC French by running a bot on #france for several days to
collect a log. I used the corpus to investigate right-dislocation
constructions, aka "antitopic," as I described in this working paper:
http://stjohns.academia.edu/grvsmth/Papers/646687/Antitopic_and_Word_Order_in_Conversational_French
--
-Angus B. Grieve-Smith
Saint John's University
grvsmth at panix.com
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list