[Corpora-List] a corpus of recent German youth language??
Roman Klinger
roman.klinger at scai.fraunhofer.de
Thu Apr 23 13:10:56 UTC 2009
Hi,
Herrmann, J.B. wrote:
> [...] I am also interested in using the web as a corpus - does
> anybody know of a way of filtering for youth language?
I would use a forum/newsgroup/chat system for young people.
And if "web" does not need to be www: you could e.g. log an IRC
chatroom. This is technically easy because there are command line tools
available so that you could pipe the traffic into a file.
If you are not used to IRC, I remember the IRCNet to be a network with a
lot of well visited channels. A starting point could be
http://irc.netsplit.de/channels/?net=IRCnet
But: IRC is not representative for normal use of language ;-).
Regards,
Roman
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list