<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 10 (filtered)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:SimSun;
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"\@SimSun";
panose-1:2 1 6 0 3 1 1 1 1 1;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman";}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{color:purple;
text-decoration:underline;}
p
{margin-right:0cm;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman";}
span.EmailStyle18
{font-family:Arial;
color:navy;}
@page Section1
{size:612.0pt 792.0pt;
margin:72.0pt 90.0pt 72.0pt 90.0pt;}
div.Section1
{page:Section1;}
/* List Definitions */
ol
{margin-bottom:0cm;}
ul
{margin-bottom:0cm;}
-->
</style>
</head>
<body lang=EN-GB link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>In the general case, this is a very big
question. Once you limit it to particular types of document, e.g.
scientific papers, journalism, or CVs, it becomes somewhat tractable, and
this is what CiteSeer and DBLP are doing on an industrial scale for academic
papers. </span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>As a general rule, you depend on the conventions
that people use in structuring each particular document type: the stronger
the conventions, the more tractable the task, and the more different conventions (and
markup languages, etc.) there are, the more work it takes to cover them
all.</span></font></p>
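<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>As a rough illustration of leaning on such
conventions, here is a minimal Python sketch. It assumes plain text with one
heading per line and uses the section names from the scientific-paper example
in the question below. The pattern table, the function name label_lines and
the six-word heading limit are all hypothetical choices made for this example,
not part of any existing tool, and the title is left out because it has no
keyword cue.</span></font></p>
<pre>
```python
import re

# Hypothetical convention table: section names follow the scientific-paper
# example in the question; the regular expressions are assumptions.
PAPER_SECTIONS = {
    "abstract": re.compile(r"^\s*abstract\b", re.IGNORECASE),
    "keywords": re.compile(r"^\s*key\s*words\b", re.IGNORECASE),
    "introduction": re.compile(r"^\s*(\d+\.?\s+)?introduction\b", re.IGNORECASE),
    "conclusion": re.compile(r"^\s*(\d+\.?\s+)?conclusions?\b", re.IGNORECASE),
    "references": re.compile(r"^\s*references\b", re.IGNORECASE),
}

# Assumption: headings are short lines; longer lines are treated as body text.
MAX_HEADING_WORDS = 6


def label_lines(lines):
    """Return (line_index, section_label) pairs for lines that look like headings."""
    found = []
    for index, line in enumerate(lines):
        if len(line.split()) > MAX_HEADING_WORDS:
            continue
        for label, pattern in PAPER_SECTIONS.items():
            if pattern.match(line):
                found.append((index, label))
                break  # first matching convention wins
    return found
```
</pre>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Each further document type (a call for
papers, a CV) would need its own table of this kind, which is where the extra
work comes from.</span></font></p>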
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'>Adam</span></font></p>
<p class=MsoNormal><font size=2 color=navy face=Arial><span style='font-size:
10.0pt;font-family:Arial;color:navy'> </span></font></p>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 face=Tahoma><span
lang=EN-US style='font-size:10.0pt;font-family:Tahoma'>-----Original
Message-----<br>
<b><span style='font-weight:bold'>From:</span></b> owner-corpora@lists.uib.no
[mailto:owner-corpora@lists.uib.no] <b><span style='font-weight:bold'>On Behalf
Of </span></b>Chaker Jabbari<br>
<b><span style='font-weight:bold'>Sent:</span></b> 04 December 2005 08:40<br>
<b><span style='font-weight:bold'>To:</span></b> CORPORA@UIB.NO<br>
<b><span style='font-weight:bold'>Subject:</span></b> [Corpora-List] chaker
jebari</span></font></p>
<p class=MsoNormal style='margin-left:36.0pt'><font size=3
face="Times New Roman"><span style='font-size:12.0pt'> </span></font></p>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>Hi</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face=Arial><span style='font-size:12.0pt;font-family:Arial;color:black'> </span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>I need
a tool to identify the logical structure of a textual document. </span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>For
example:</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>The
logical structure of a scientific paper is: title, abstract, keywords,
introduction, text, conclusion, references.</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>The
logical structure of a call for papers is: title, topics, important dates,
submission, ...</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face=Arial><span style='font-size:12.0pt;font-family:Arial;color:black'> </span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>Does
anyone know of a tool or an algorithm to identify the
logical structure?</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face=Arial><span style='font-size:12.0pt;font-family:Arial;color:black'> </span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=3 color=black
face=Arial><span style='font-size:12.0pt;font-family:Arial;color:black'> </span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>regards</span></font></p>
</div>
<div>
<p class=MsoNormal style='margin-left:36.0pt'><font size=2 color=black
face=Arial><span style='font-size:10.0pt;font-family:Arial;color:black'>chaker
jebari</span></font><font color=black face=Arial><span style='font-family:Arial;
color:black'> </span></font></p>
</div>
</div>
</body>
</html>