<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content=text/html;charset=utf-8 http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.7600.16625"></HEAD>
<BODY style="PADDING-LEFT: 10px; PADDING-RIGHT: 10px; PADDING-TOP: 15px"
id=MailContainerBody leftMargin=0 topMargin=0 CanvasTabStop="true"
name="Compose message area">
<DIV><FONT face=Calibri>Alan, it might be a good idea to start with David Lee's
'Bookmarks for corpus-based linguists', at <A
title="http://tiny.cc/corpora
CTRL + Click to follow link"
href="http://tiny.cc/corpora">http://tiny.cc/corpora</A> if you don't already
know about this useful resource.</FONT></DIV>
<DIV><FONT face=Calibri></FONT> </DIV>
<DIV><FONT face=Calibri>Chris Butler</FONT></DIV>
<DIV><FONT face=Calibri></FONT> </DIV>
<DIV style="FONT: 10pt Tahoma">
<DIV><BR></DIV>
<DIV style="BACKGROUND: #f5f5f5">
<DIV style="font-color: black"><B>From:</B> <A title=eahogue@gmail.com
href="mailto:eahogue@gmail.com">Alan Hogue</A> </DIV>
<DIV><B>Sent:</B> Saturday, October 09, 2010 9:59 AM</DIV>
<DIV><B>To:</B> <A title=corpora@uib.no
href="mailto:corpora@uib.no">corpora@uib.no</A> </DIV>
<DIV><B>Subject:</B> [Corpora-List] Where is corpus linguistics on the
web?</DIV></DIV></DIV>
<DIV><BR></DIV>Hello,
<DIV><BR></DIV>
<DIV>I am pretty new to corpus linguistics. I have a fair amount of experience
in general programming, and<I> </I>I am a 3rd year grad student in
linguistics. I have embarked on a research project which requires corpora. Since
then, I have been surprised at the (as far as I can tell) complete lack of good,
reliable web presence of the corpus linguistics community. </DIV>
<DIV><BR></DIV>
<DIV>Looking at what I can find on the web, I would have to conclude that corpus
linguistics is a dead discipline. I know that is certainly not true, so I wonder
why it looks that way. If you haven't recently, google "corpus
linguistics".</DIV>
<DIV><BR></DIV>
<DIV>I mostly see book listings and a few mailing lists.</DIV>
<DIV><BR></DIV>
<DIV>Shouldn't there be at least one general site which helps people find their
way?</DIV>
<DIV><BR></DIV>
<DIV>Or, and forgive me, but this was particularly irritating to me, try looking
at the websites of virtually any corpus, and see how long it takes you to find
solid information about the type of annotation employed. Even something as
simple as this. Maybe I am just not looking hard enough, this sort of essential
information is perversely hard to find on many sites, as far as I can
tell.</DIV>
<DIV><BR></DIV>
<DIV>My question to all of you is simply Why? If corpus linguistics is such a
vibrant field, why is it so damn hard to find what people (perhaps not die-hard
corpus linguists) need to find? Why should the Penn Treebank and others like it
have no reasonably documented query software? Why can't you easily find corpora
of a particular kind online? </DIV>
<DIV><BR></DIV>
<DIV>To put it simply, it seems to me there is no infrastructure here.</DIV>
<DIV><BR></DIV>
<DIV>I have heard over and over again that this is the future of linguistics,
and I would agree. So why is there no presence on the web? Are we stuck in the
newsgroup dark ages? What's going on here? Honestly, I'd really like to
know.</DIV>
<DIV><BR></DIV>
<DIV>And if I have somehow got everything wrong, then I'd love to be
enlightened. </DIV>
<DIV><BR></DIV>
<DIV>Sincerely,</DIV>
<DIV>Alan Hogue</DIV>
<DIV><BR clear=all><BR>-- <BR>"A good traveler has no fixed plans<BR> and
is not intent on arriving."<BR><BR>-Lao Tzu<BR></DIV>
<P>
<HR>
<P></P>_______________________________________________<BR>Corpora mailing
list<BR>Corpora@uib.no<BR>http://mailman.uib.no/listinfo/corpora<BR></BODY></HTML>