<HTML dir=ltr><HEAD>
<META http-equiv=Content-Type content="text/html; charset=unicode">
<META content="MSHTML 6.00.6000.16640" name=GENERATOR></HEAD>
<BODY>
<DIV id=idOWAReplyText12862 dir=ltr>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>Hi Imtiaz</FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>The current version of WebCorp (<A href="http://www.webcorp.org.uk/">http://www.webcorp.org.uk/</A>) relies on standard search engines such as Google to access the web, adding layers of refinement specifically for linguistic analysis. This means that the ‘corpus’ you are searching is not Part-of-Speech tagged and, thus, you cannot run the type of search you suggest.</FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>However, we are currently working on the new WebCorp Linguist’s Search Engine, which crawls the web, downloading texts and building structured, POS-tagged corpora. Using this system it is possible to search for your pattern as:</FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>'the {ADJ*} man and woman'</FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>You can see a screenshot of the first 20 results from a small test corpus at <A href="http://www.webcorp.org.uk/WebCorpLSE.gif">http://www.webcorp.org.uk/WebCorpLSE.gif</A></FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>The WebCorp LSE prototype is currently being beta tested by volunteers from the community. For more information please visit <A href="http://wse1.webcorp.org.uk/preview/">http://wse1.webcorp.org.uk/preview/</A></FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>Best wishes</FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV></DIV>
<DIV id=idSignature11235 dir=ltr>
<DIV><FONT face=Arial color=#000000 size=2>Andrew Kehoe</FONT></DIV>
<DIV><FONT face=Arial size=2>Research & Development Unit for English Studies</FONT></DIV>
<DIV><FONT face=Arial size=2>Birmingham City University</FONT></DIV>
<DIV><FONT face=Arial size=2><A href="http://rdues.bcu.ac.uk/">http://rdues.bcu.ac.uk/</A></FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2><A href="http://www.webcorp.org.uk/">http://www.webcorp.org.uk/</A> </FONT></DIV></DIV>
<DIV dir=ltr><BR>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> corpora-bounces@uib.no on behalf of Khan, I. H.<BR><B>Sent:</B> Mon 12/05/2008 2:54 PM<BR><B>To:</B> corpora@uib.no<BR><B>Subject:</B> [Corpora-List] Using WebCorp<BR></FONT><BR></DIV>
<DIV dir=ltr>
<DIV><FONT face=Arial color=#000000 size=2>Hi</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Does any one know how to use regular expressions (with Part-of-Speech tags) in WebCorp corpus? </FONT></DIV>
<DIV><FONT face=Arial size=2>For example, to extarct the phrases of the form 'the Adj man and woman' from the BNC, I can use the following regexp in SkE:</FONT></DIV>
<DIV><FONT face=Arial color=#000000 size=2></FONT> </DIV>
<DIV><FONT face=Arial color=#000000 size=2>[word = "the"] [tag = "AJ.*"] [word = "man"] [word = "and"] [word = "woman"];</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Any help?</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Regards</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2>Imtiaz</FONT></DIV>
<P></P>
<P style="FONT-SIZE: 10pt; COLOR: #772244; FONT-FAMILY: Arial, sans-serif">The University of Aberdeen is a charity registered in Scotland, No SC013683.</P></DIV>
<DIV><P><HR>
Birmingham City University is the new name unveiled for the former University of Central England in Birmingham<BR>
For more information about the name change go to http://www.bcu.ac.uk/namechange/official_announcement.html
</P></DIV>
</BODY></HTML>