<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

<HTML><HEAD>

<META content="text/html; charset=utf-8" http-equiv=Content-Type>

<STYLE>

BLOCKQUOTE {

        MARGIN-BOTTOM: 0px; MARGIN-LEFT: 2em; MARGIN-TOP: 0px

}

OL {

        MARGIN-BOTTOM: 0px; MARGIN-TOP: 0px

}

UL {

        MARGIN-BOTTOM: 0px; MARGIN-TOP: 0px

}

P {

        MARGIN-BOTTOM: 0px; MARGIN-TOP: 0px

}

BODY {

        FONT-SIZE: 10.5pt; FONT-FAMILY: 微软雅黑; COLOR: #000080; LINE-HEIGHT: 1.5

}

</STYLE>


<META name=GENERATOR content="MSHTML 10.00.9200.16576"></HEAD>

<BODY style="MARGIN: 10px">

<DIV>Indeed, if none of the above mentioned approaches work for your 

purpose.  You can have your corpora POS-tagged (if they are not) and quite 

conveniently retrieve Verb+ particle with Regular Expressions in text editors 

like Powergrep or Editpad Pro, or some other Reg-supporting concordancers like 

AntConc.</DIV>

<DIV 

style="BORDER-TOP: #b5c4df 1pt solid; BORDER-RIGHT: medium none; BORDER-BOTTOM: medium none; PADDING-BOTTOM: 0cm; PADDING-TOP: 3pt; PADDING-LEFT: 0cm; BORDER-LEFT: medium none; PADDING-RIGHT: 0cm">

<DIV 

style="FONT-SIZE: 12px; BACKGROUND: #efefef; COLOR: #000000; PADDING-BOTTOM: 8px; PADDING-TOP: 8px; PADDING-LEFT: 8px; PADDING-RIGHT: 8px">

<DIV><B>From:</B> <A 

href="mailto:corpora-request@uib.no">corpora-request</A></DIV>

<DIV><B>Date:</B> 2013-06-17 18:00</DIV>

<DIV><B>To:</B> <A href="mailto:corpora@uib.no">corpora</A></DIV>

<DIV><B>Subject:</B> Corpora Digest, Vol 72, Issue 15</DIV></DIV></DIV>

<DIV>

<DIV>Today's Topics:</DIV>

<DIV> </DIV>

<DIV>   1. Re:  Looking for corpus tool/software to extract phrasal</DIV>

<DIV>      verbs (Phil Gooch)</DIV>

<DIV> </DIV>

<DIV> </DIV>

<DIV>----------------------------------------------------------------------</DIV>

<DIV> </DIV>

<DIV>Message: 1</DIV>

<DIV>Date: Mon, 17 Jun 2013 10:57:54 +0100</DIV>

<DIV>From: Phil Gooch <philgooch@gmail.com></DIV>

<DIV>Subject: Re: [Corpora-List] Looking for corpus tool/software to</DIV>

<DIV>extract phrasal verbs</DIV>

<DIV>To: "corpora@uib.no" <corpora@uib.no></DIV>

<DIV> </DIV>

<DIV>If you use GATE, you could try the predicate phrase chunker at</DIV>

<DIV>https://github.com/philgooch/BioPred--Biomedical-Predicate-VerbGroup-Chunker</DIV>

<DIV>although this was developed for biomedical texts, it might work in other</DIV>

<DIV>contexts. You can then use the GATE Orthomatcher to extract co-occurrences.</DIV>

<DIV> </DIV>

<DIV>Otherwise, if you are using Python, you could write some rules using the</DIV>

<DIV>CliPs library, http://www.clips.ua.ac.be/pages/pattern-search</DIV>

<DIV> </DIV>

<DIV>Phil</DIV>

<DIV> </DIV>

<DIV> </DIV>

<DIV>On Mon, Jun 17, 2013 at 9:48 AM, LEUNG, Maggie SN [12901991r] <</DIV>

<DIV>maggie.sn.leung@connect.polyu.hk> wrote:</DIV>

<DIV> </DIV>

<DIV>>  Dear Corpora-List users,</DIV>

<DIV>></DIV>

<DIV>>  I would like to have your recommendation on any corpus linguistics tool</DIV>

<DIV>> or software which can be used to automatically extract all co-occurrences</DIV>

<DIV>> of any verb + particle combinations (irrespective of whether a particle is</DIV>

<DIV>> adverbial or prepositional) from a corpus. The corpus is about 9 million</DIV>

<DIV>> words.</DIV>

<DIV>></DIV>

<DIV>></DIV>

<DIV>>  Thanks and best regards,</DIV>

<DIV>> Maggie</DIV>

<DIV>></DIV>

<DIV>>        ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------</DIV>

<DIV>></DIV>

<DIV>>        * Disclaimer:*</DIV>

<DIV>></DIV>

<DIV>> *This message (including any attachments) contains confidential</DIV>

<DIV>> information intended for a specific individual and purpose. If you are not</DIV>

<DIV>> the intended recipient, you should delete this message and notify the</DIV>

<DIV>> sender and the University immediately. Any disclosure, copying, or</DIV>

<DIV>> distribution of this message, or the taking of any action based on it, is</DIV>

<DIV>> strictly prohibited and may be unlawful.*</DIV>

<DIV>></DIV>

<DIV>> *The University specifically denies any responsibility for the accuracy</DIV>

<DIV>> or quality of information obtained through University E-mail Facilities.</DIV>

<DIV>> Any views and opinions expressed are only those of the author(s) and do not</DIV>

<DIV>> necessarily represent those of the University and the University accepts no</DIV>

<DIV>> liability whatsoever for any losses or damages incurred or caused to any</DIV>

<DIV>> party as a result of the use of such information.*</DIV>

<DIV>></DIV>

<DIV>> _______________________________________________</DIV>

<DIV>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora</DIV>

<DIV>> Corpora mailing list</DIV>

<DIV>> Corpora@uib.no</DIV>

<DIV>> http://mailman.uib.no/listinfo/corpora</DIV>

<DIV>></DIV>

<DIV>></DIV>

<DIV>-------------- next part --------------</DIV>

<DIV>A non-text attachment was scrubbed...</DIV>

<DIV>Name: not available</DIV>

<DIV>Type: text/html</DIV>

<DIV>Size: 3373 bytes</DIV>

<DIV>Desc: not available</DIV>

<DIV>URL: <http://www.uib.no/mailman/public/corpora/attachments/20130617/c158f185/attachment.txt></DIV>

<DIV> </DIV>

<DIV>----------------------------------------------------------------------</DIV>

<DIV>Send Corpora mailing list submissions to</DIV>

<DIV>corpora@uib.no</DIV>

<DIV> </DIV>

<DIV>To subscribe or unsubscribe via the World Wide Web, visit</DIV>

<DIV>http://mailman.uib.no/listinfo/corpora</DIV>

<DIV>or, via email, send a message with subject or body 'help' to</DIV>

<DIV>corpora-request@uib.no</DIV>

<DIV> </DIV>

<DIV>You can reach the person managing the list at</DIV>

<DIV>corpora-owner@uib.no</DIV>

<DIV> </DIV>

<DIV>When replying, please edit your Subject line so it is more specific</DIV>

<DIV>than "Re: Contents of Corpora digest..."</DIV>

<DIV> </DIV>

<DIV> </DIV>

<DIV>_______________________________________________</DIV>

<DIV>Corpora mailing list</DIV>

<DIV>Corpora@uib.no</DIV>

<DIV>http://mailman.uib.no/listinfo/corpora</DIV>

<DIV> </DIV>

<DIV> </DIV>

<DIV>End of Corpora Digest, Vol 72, Issue 15</DIV>

<DIV>***************************************</DIV></DIV></BODY></HTML>