<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>

<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">


<meta name=Generator content="Microsoft Word 10 (filtered)">

<style>
<!--
 /* Font Definitions */
 @font-face
        {font-family:SimSun;
        panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
        {font-family:"\@SimSun";
        panose-1:2 1 6 0 3 1 1 1 1 1;}
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman";
        color:black;}
a:link, span.MsoHyperlink
        {color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {color:purple;
        text-decoration:underline;}
pre
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:10.0pt;
        font-family:"Courier New";
        color:black;}
span.EmailStyle18
        {font-family:Arial;
        color:navy;}
@page Section1
        {size:612.0pt 792.0pt;
        margin:72.0pt 90.0pt 72.0pt 90.0pt;}
div.Section1
        {page:Section1;}
-->
</style>

</head>

<body bgcolor=white lang=EN-GB link=blue vlink=purple>

<div class=Section1>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Mikhail,</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>The algorithm you want is</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>In a large corpus</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>     For each verb</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>            Find
how often it occurs in pattern <VERB PRONOUN> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>            Find
how often it occurs in pattern <VERB to PRONOUN></span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>            Compute
a statistic to see how high both these numbers are, relative to overall freq of
verb</span></font></p>

<p class=MsoNormal style='text-indent:12.0pt'><font size=2 color=navy
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:navy'>Sort
verbs according to the statistic</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Now you have a starter set for examining which
verbs show the behaviour you want to investigate.</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>All relevant frequencies are available
for, eg, the BNC, in the Sketch Engine <a href="http://www.sketchengine.co.uk/">http://www.sketchengine.co.uk</a>
where you can define the patterns in CQL (Corpus Query Language from Stuttgart Uni). 
We don’t currently have a nice web interface for robots but will have
shortly, in the meantime, ask us and we can set things up to help you (eg by allowing
you robot access  and then you’d need to scrape web pages)</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Regards,</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>            Adam</span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Tahoma><span lang=EN-US style='font-size:10.0pt;font-family:Tahoma;
color:windowtext'>-----Original Message-----<br>
<b><span style='font-weight:bold'>From:</span></b> owner-corpora@lists.uib.no
[mailto:owner-corpora@lists.uib.no] <b><span style='font-weight:bold'>On Behalf
Of </span></b>Mikhail Kopotev<br>
<b><span style='font-weight:bold'>Sent:</span></b> </span></font><font size=2 color=black face=Tahoma><span lang=EN-US style='font-size:10.0pt;font-family:
 Tahoma;color:windowtext'>22 February 2007</span></font><font size=2
color=black face=Tahoma><span lang=EN-US style='font-size:10.0pt;font-family:
Tahoma;color:windowtext'> </span></font><font size=2 color=black face=Tahoma><span
 lang=EN-US style='font-size:10.0pt;font-family:Tahoma;color:windowtext'>13:15</span></font><font
size=2 color=black face=Tahoma><span lang=EN-US style='font-size:10.0pt;
font-family:Tahoma;color:windowtext'><br>
<b><span style='font-weight:bold'>Cc:</span></b> CORPORA@UIB.NO<br>
<b><span style='font-weight:bold'>Subject:</span></b> [Corpora-List] Variant
verbal government extraction</span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face="Times New Roman"><span style='font-size:12.0pt'> </span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face="Times New Roman"><span lang=EN-US style='font-size:12.0pt'>Dear all,<u1:p></u1:p></span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face="Times New Roman"><span lang=EN-US style='font-size:12.0pt'>does anyone
know how to recognize and extract variations of verbal government such as
“to write you/to you’ from a corpus?<u1:p></u1:p></span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face="Times New Roman"><span lang=EN-US style='font-size:12.0pt'>As far as I am
interested in Russian morphosyntactic changes, I would like you to point me any
tools, methods rather than obtained results, concerning English or any other
irrelevant languages. <u1:p></u1:p></span></font></p>

<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face="Times New Roman"><span lang=EN-US style='font-size:12.0pt'>Many thanks,<u1:p></u1:p></span></font></p>

<pre style='margin-left:36.0pt' cols=72><font size=2 color=black
face="Courier New"><span style='font-size:10.0pt'>Mikhail Kopotev</span></font></pre><pre
style='margin-left:36.0pt'><font size=2 color=black face="Courier New"><span
style='font-size:10.0pt'>Researcher</span></font></pre><pre style='margin-left:
36.0pt'><font size=2 color=black face="Courier New"><span style='font-size:
10.0pt'>Department of Slavonic</span></font></pre><pre style='margin-left:36.0pt'><font
size=2 color=black face="Courier New"><span style='font-size:10.0pt'>and Baltic Languages and Literatures</span></font></pre><pre
style='margin-left:36.0pt'><font size=2 color=black face="Courier New"><span
style='font-size:10.0pt'>University of Helsinki</span></font></pre></div>

</body>

</html>