[Corpora-List] Query about Internet corpora (different text types)

Angus B. Grieve-Smith grvsmth at panix.com
Fri Jun 3 02:13:47 UTC 2011


On 6/2/2011 7:03 PM, Christine Amling wrote:
> Hello everyone.
> My name is Christine Amling and I am a master's student at the
> University of Mainz. At the moment I am working on my master's paper
> which is about the distribution and usage of the so-called "New 
> Quotative" be like in Internet contexts 
> (http://martina.lampert-mainz.de/belike.php ) and for this purpose I 
> need an Internet corpus. The study aims to analyze different text 
> types and at the moment I want to look at message boards, Twitter, IRC 
> chat and blogs. If possible also Instant Messaging, but access is 
> restricted.
> My problem is that I can't find any usable IRC chat data and I was
> wondering if there is already an existing corpus of that kind, but in
> general I would be thankful for any information anybody has
> on Internet text types and registers. (Instant Messaging corpora would
> be appreciated as well - I know that there is a study by Jones &
> Schieffelin 2009, is there any way to access the corpus they used?)

     I don't know of any existing IRC corpora in other languages
(and you're clearly looking for English), but in 1999 I compiled my
own corpus of IRC French by running a bot on #france for several days to
collect a log.  I used the corpus to investigate right-dislocation
constructions, also known as "antitopic," as I described in this working paper:

http://stjohns.academia.edu/grvsmth/Papers/646687/Antitopic_and_Word_Order_in_Conversational_French
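
If you end up collecting your own IRC data the same way, the logging step is small. What follows is only a rough sketch in Python, not the bot I ran in 1999: it assumes the raw lines follow the usual IRC message layout (":nick!user@host PRIVMSG #channel :text"), and the function names and the tab-separated output format are invented here for illustration. Connecting to a server and joining the channel are left out.

```python
from datetime import datetime
from typing import Optional, Tuple


def parse_privmsg(raw: str, channel: str = "#france") -> Optional[Tuple[str, str]]:
    """Return (nick, text) if `raw` is a PRIVMSG sent to `channel`, else None.

    Hypothetical helper: assumes the standard ":prefix COMMAND params" layout.
    """
    line = raw.rstrip("\r\n")
    if not line.startswith(":"):
        return None  # no sender prefix, e.g. a server PING
    prefix, _, rest = line[1:].partition(" ")
    command, _, params = rest.partition(" ")
    if command != "PRIVMSG":
        return None  # joins, parts, notices, etc. are skipped
    target, _, trailing = params.partition(" ")
    if target.lower() != channel.lower():
        return None  # private message or another channel
    text = trailing[1:] if trailing.startswith(":") else trailing
    nick = prefix.split("!", 1)[0]  # drop the "!user@host" part
    return nick, text


def format_log_line(raw: str, when: datetime, channel: str = "#france") -> Optional[str]:
    """Turn one raw IRC line into a tab-separated log record, or None."""
    parsed = parse_privmsg(raw, channel)
    if parsed is None:
        return None
    nick, text = parsed
    return f"{when:%Y-%m-%d %H:%M:%S}\t{nick}\t{text}"
```

Keeping the timestamp, nick and text in separate columns makes it easy to anonymize the nicks before analysis and to load the log into a concordancer or spreadsheet afterwards.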

-- 
				-Angus B. Grieve-Smith
				Saint John's University
				grvsmth at panix.com

