<html>
<head>
<style><!--
.hmmessage P
{
margin:0px;
padding:0px
}
body.hmmessage
{
font-size: 12pt;
font-family:΢ÈíÑźÚ
}
--></style></head>
<body class='hmmessage'><div dir='ltr'><p class="MsoNormal"><font size="3"><a name="OLE_LINK63"></a><a name="OLE_LINK58"></a><a name="OLE_LINK57"><span lang="EN-US" style="background-color: white; ">Hi,
dear all,</span></a></font></p><p class="MsoNormal"><font size="3"><a name="OLE_LINK57"><span lang="EN-US" style="background-color: white; "><br></span></a></font></p>
<p class="MsoNormal" align="left"><font size="3"><a name="OLE_LINK59"><span lang="EN-US">I
am extremely interested in the new edition of Google N-grams
corpus.My research topic is using the sentence dependence parsing skill to
mining the web scale textual corpus for semantics understanding.<o:p></o:p></span></a></font></p><p class="MsoNormal" align="left"><br></p>
<p class="MsoNormal" align="left"><font size="3"><span lang="EN-US" style="color: rgb(68, 68, 68); ">And I want to ask two questions as following,</span><span lang="EN-US"><o:p></o:p></span></font></p><p class="MsoNormal" align="left"><font size="3"><span lang="EN-US" style="color: rgb(68, 68, 68); "><br></span></font></p>
<p class="MsoNormal" align="left"><font size="3"><span lang="EN-US" style="color: rgb(68, 68, 68); ">Q1: how to use this large scale data? Is there any existing
tools, e.g. indexing and search tools like lucene (maybe not available for this
big data)? Any other index tools?</span></font></p><p class="MsoNormal" align="left"><font size="3"><span lang="EN-US" style="color: rgb(68, 68, 68); "><br></span></font></p>
<span lang="EN-US" style="color: rgb(68, 68, 68); "><font size="3">Q2: I want to extract the typical triplets dependent
relations (S-V-O, e.g. "lion - chase - zebra"), could you help me for
how to do this efficiently?</font></span><br><br><p style="line-height:21.81818199157715px;color:rgb(68, 68, 68);font-family:'Microsoft YaHei UI', 'Microsoft YaHei', ËÎÌå, Calibri, sans-serif;font-size:15.454545021057129px;">Gang Tian | Phd Student</p><p style="line-height:21.81818199157715px;color:rgb(68, 68, 68);font-family:'Microsoft YaHei UI', 'Microsoft YaHei', ËÎÌå, Calibri, sans-serif;font-size:15.454545021057129px;"></p><p style="line-height:21.81818199157715px;color:rgb(68, 68, 68);font-family:'Microsoft YaHei UI', 'Microsoft YaHei', ËÎÌå, Calibri, sans-serif;font-size:15.454545021057129px;">School of Information Technologies | Faculty of Engineering & IT</p><p style="line-height:21.81818199157715px;color:rgb(68, 68, 68);font-family:'Microsoft YaHei UI', 'Microsoft YaHei', ËÎÌå, Calibri, sans-serif;font-size:15.454545021057129px;">THE UNIVERSITY OF SYDNEY</p> </div></body>
</html>