<head>
<style type="text/css">
<!--
/* start of attachment style */
.ygrp-photo-title{
clear: both;
font-size: smaller;
height: 15px;
overflow: hidden;
text-align: center;
width: 75px;
}
div.ygrp-photo{
background-position: center;
background-repeat: no-repeat;
background-color: white;
border: 1px solid black;
height: 62px;
width: 62px;
}
div.photo-title
a,
div.photo-title a:active,
div.photo-title a:hover,
div.photo-title a:visited {
text-decoration: none;
}
div.attach-table div.attach-row {
clear: both;
}
div.attach-table div.attach-row div {
float: left;
/* margin: 2px;*/
}
p {
clear: both;
padding: 15px 0 3px 0;
overflow: hidden;
}
p span {
color: #628C2A;
}
div.ygrp-file {
width: 30px;
vertical-align: middle;
}
div.attach-table div.attach-row div div a {
text-decoration: none;
}
div.attach-table div.attach-row div div span {
font-weight: normal;
}
div.ygrp-file-title {
font-weight: bold;
}
/* end of attachment style */
-->
</style>
</head>
<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=utf-8">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p
{mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
code
{mso-style-priority:99;
font-family:"Courier New";}
pre
{mso-style-priority:99;
mso-style-link:"HTML Preformatted Char";
margin:0cm;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Courier New";}
tt
{mso-style-priority:99;
font-family:"Courier New";}
span.left
{mso-style-name:left;}
span.bld
{mso-style-name:bld;}
p.ad, li.ad, div.ad
{mso-style-name:ad;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ad1, li.ad1, div.ad1
{mso-style-name:ad1;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ad2, li.ad2, div.ad2
{mso-style-name:ad2;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ad3, li.ad3, div.ad3
{mso-style-name:ad3;
mso-margin-top-alt:auto;
margin-right:0cm;
margin-bottom:3.75pt;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
span.HTMLPreformattedChar
{mso-style-name:"HTML Preformatted Char";
mso-style-priority:99;
mso-style-link:"HTML Preformatted";
font-family:Consolas;}
p.replbq, li.replbq, div.replbq
{mso-style-name:replbq;
margin:1.5pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
span.yshortcuts
{mso-style-name:yshortcuts;}
p.ad4, li.ad4, div.ad4
{mso-style-name:ad4;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ad5, li.ad5, div.ad5
{mso-style-name:ad5;
mso-margin-top-alt:auto;
margin-right:0cm;
margin-bottom:3.75pt;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ad6, li.ad6, div.ad6
{mso-style-name:ad6;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
span.yshortcuts1
{mso-style-name:yshortcuts1;
font-family:"Verdana","sans-serif";
font-weight:bold;}
span.yshortcuts2
{mso-style-name:yshortcuts2;
font-family:"Verdana","sans-serif";
color:#1E66AE;
font-weight:normal;}
span.EmailStyle35
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;}
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-id:1976904548;
mso-list-template-ids:-1494696274;}
@list l0:level1
{mso-level-number-format:bullet;
mso-level-text:;
mso-level-tab-stop:36.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
ol
{margin-bottom:0cm;}
ul
{margin-bottom:0cm;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body bgcolor=white lang=CS link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><span lang=EN-US><o:p> </o:p></span></p>
<p class=MsoNormal><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>> I'm curious:
how many distinct fields do you have? <o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>I counted
forty-seven fields. But not all of them are exported for the printed version. <o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>I should also
mention that I employ some custom-defined fields and that I use several of the
MDF fields for different purposes and in different hierarchical positions from
what they were designed for. My programmer modified the export CCT according to
my needs. This CCT also checks the consistency of the hierarchical structure,
and we have another CCT that can actually fix the position of a field if it is
placed improperly. But we haven’t used it extensively, as the entry structure
was established quite early on and I think I have been able to work with
it quite consistently. It is possible to define a record template in Toolbox, a
feature I have been using from the beginning. You can also define which field
follows a particular field, so when you press Enter it automatically creates the
field that is supposed to follow. These functions are very helpful in keeping
the structure consistent.<o:p></o:p></span></p>
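<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>As a rough
illustration of this kind of structure check (a Python sketch, not the CCT
itself): the ordering table below is an assumption made up for the example,
using standard MDF marker names, and the actual database follows its own,
partly custom layout. The record text is placeholder data.<o:p></o:p></span></p>
<pre>
```python
# Hypothetical ordering table: for each marker, the markers that may come
# directly before it. The real dictionary uses its own (partly custom) layout.
ALLOWED_BEFORE = {
    "ps": {"lx"},
    "sn": {"ps", "de", "xe"},
    "de": {"sn"},
    "ue": {"de"},
    "xv": {"de", "ue", "xe"},
    "xe": {"xv"},
}


def parse_record(text):
    """Split one backslash-coded record into (marker, value) pairs."""
    fields = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("\\"):
            continue  # blank lines and wrapped continuation lines are skipped
        marker, _, value = line[1:].partition(" ")
        fields.append((marker, value.strip()))
    return fields


def check_order(fields):
    """Report every field whose predecessor is not listed in ALLOWED_BEFORE."""
    problems = []
    previous = None
    for marker, _ in fields:
        allowed = ALLOWED_BEFORE.get(marker)
        if allowed is not None and previous not in allowed:
            if previous is None:
                where = "the start of the record"
            else:
                where = "\\" + previous
            problems.append("\\" + marker + " must not follow " + where)
        previous = marker
    return problems


# Placeholder record: the last translation field has no example before it.
SAMPLE = r"""
\lx headword
\ps n
\sn 1
\de first definition
\xv example sentence
\xe its translation
\sn 2
\xe stray translation
"""

for problem in check_order(parse_record(SAMPLE)):
    print(problem)
```
</pre>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>A table of
allowed predecessors keeps the rules in one place, so a custom field can be
added by editing the table rather than the checking code.<o:p></o:p></span></p>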
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>> Can any
repeat, are any of them hierarchical (like example sentences need to <o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>> come under a
sense), and do any of them come in pairs or triplets (like example <br>
> sentence + translation)? <o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>Yes, exactly; the
pair \xv and \xe (an example sentence and its English translation) comes after
the sense definition. The field \ue (usage) precedes the example sentences when it
is present. The example sentence bundle also includes a reference to its source,
but it is not exported for the printed version.<o:p></o:p></span></p>
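<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>The \xv and \xe
pairing can be sketched in Python roughly as follows. This is only an
illustration under assumptions: the marker \rf for the example source and the
field order in the sample are hypothetical, and the sample text is placeholder
data rather than dictionary content.<o:p></o:p></span></p>
<pre>
```python
# Assumed marker for the source of an example; hypothetical, not confirmed.
PRINT_SKIP = {"rf"}


def parse_fields(text):
    """Split backslash-coded text into (marker, value) pairs."""
    fields = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("\\"):
            continue
        marker, _, value = line[1:].partition(" ")
        fields.append((marker, value.strip()))
    return fields


def printable(fields):
    """Drop the fields that stay in the database but are not printed."""
    return [(m, v) for m, v in fields if m not in PRINT_SKIP]


def example_bundles(fields):
    """Pair each xv field with the xe field that follows it.

    Returns (bundles, unpaired): complete pairs, and fields left without
    a partner.
    """
    bundles, unpaired = [], []
    pending = None  # an example still waiting for its translation
    for marker, value in fields:
        if marker == "xv":
            if pending is not None:
                unpaired.append(("xv", pending))
            pending = value
        elif marker == "xe":
            if pending is None:
                unpaired.append(("xe", value))
            else:
                bundles.append({"xv": pending, "xe": value})
                pending = None
    if pending is not None:
        unpaired.append(("xv", pending))
    return bundles, unpaired


SAMPLE = r"""
\sn 1
\de definition text
\ue usage note
\rf source of the example
\xv first example
\xe first translation
\xv second example
"""

fields = parse_fields(SAMPLE)
bundles, unpaired = example_bundles(printable(fields))
print(bundles)
print(unpaired)
```
</pre>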
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>I also use the
field bundle \lf \lv \le quite extensively. These fields are placed under
paradigm and contain references to various grammatical forms, such as
reduplicated and contracted forms, datives, benefactives, possessives, reflexives,
causatives etc. These forms are also given as separate records, but they are
always referenced from the base form. Another pair of fields is at the very end
of an entry and contains references to the Dakota dialect variants, whenever
these differ from the lemma at the phonemic or lexical level (this is the case in
about 30% of lemmas).<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>> How many
records do you have? <o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>As of now the
Lakota-English dictionary database contains 43,152 records and 112,263 example
sentences or collocations. But the printed version of the dictionary includes
only 20,000 records and 43,000 sentences. The other entries need more research
or editing, and we will be adding them to the printed version with each new edition,
and especially to the electronic version of the dictionary.<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>The English-Lakota
side of the dictionary is a separate database; it has about half the number of
records and a very different record structure. It is not a full-fledged
English-Lakota dictionary (for instance, it does not contain the example
sentences, because including them twice would make the book huge; it is already
1200 pages), but it is much more detailed than just a finder list.<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>Sample pages from
both the Lakota-English and English-Lakota sections can be seen here:<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><a
href="http://www.inext.cz/siouan/sample_nld.pdf">http://www.inext.cz/siouan/sample_nld.pdf</a><o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>We are now in the
process of programming a lemmatizing script that will help us create a list of
all possible inflected forms so that they can be included in the multimedia
version of the dictionary. We only have estimates now, but it will probably be
around 2 million lexemes. Each will be linked to its lemma in the electronic
dictionary.<o:p></o:p></span></p>
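<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>The linking step
alone (form to lemma) might look roughly like the Python sketch below. It is
not the lemmatizing script: it generates no forms and only indexes forms that
are already listed in a record, assuming they sit in \lv fields under the \lx
of their base form. All names and sample strings are placeholders.<o:p></o:p></span></p>
<pre>
```python
def parse_records(text):
    """Split backslash-coded text into records; each lx field starts a record."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("\\"):
            continue
        marker, _, value = line[1:].partition(" ")
        if marker == "lx" or not records:
            records.append([])
        records[-1].append((marker, value.strip()))
    return records


def form_index(records):
    """Map every listed form to the lemma of the record that lists it."""
    index = {}
    for record in records:
        lemma = None
        for marker, value in record:
            if marker == "lx":
                lemma = value
                index.setdefault(value, lemma)  # a lemma points to itself
            elif marker == "lv" and lemma is not None:
                # setdefault keeps the first lemma if a form is listed twice
                index.setdefault(value, lemma)
    return index


# Placeholder data: two base forms, each listing derived forms under lv.
SAMPLE = r"""
\lx lemma-a
\lf reduplicated
\lv form-a1
\lf dative
\lv form-a2

\lx lemma-b
\lf causative
\lv form-b1
"""

index = form_index(parse_records(SAMPLE))
for form in sorted(index):
    print(form, "links to", index[form])
```
</pre>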
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>> And how have
you verified that your field hierarchy is consistent?<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>The CCT that I
mentioned above is one way of doing this. The printed version of the dictionary
was of course proofread by several people. The only inconsistency was the
occasional omission of a sense number or a wrong value of the sense number. I
know that TshwaneLex automates sense and homonym numbers and I believe that to be
a great advantage.<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>Other than that we
have not found any inconsistencies in the record structure and hierarchy.<o:p></o:p></span></p>
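<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>A missing or
wrong sense number is the kind of slip a small script can flag before
proofreading. The Python sketch below assumes that senses are marked with \sn
and numbered 1, 2, 3 and so on within a record; both points are assumptions
for the example, and the sample record is placeholder data.<o:p></o:p></span></p>
<pre>
```python
def parse_fields(text):
    """Split backslash-coded text into (marker, value) pairs."""
    fields = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("\\"):
            continue
        marker, _, value = line[1:].partition(" ")
        fields.append((marker, value.strip()))
    return fields


def sense_number_problems(fields):
    """Check that the sn values in one record run 1, 2, 3, ... without gaps."""
    problems = []
    expected = 1
    for marker, value in fields:
        if marker != "sn":
            continue
        if not value.isdigit():
            problems.append("sense number missing or not numeric: %r" % value)
            expected += 1
            continue
        if int(value) != expected:
            problems.append("expected sense %d, found %s" % (expected, value))
        # Continue counting from the number actually found, so that one slip
        # is reported once rather than for every later sense.
        expected = int(value) + 1
    return problems


SAMPLE = r"""
\lx headword
\sn 1
\de first definition
\sn 3
\de second definition
\sn
\de third definition
\sn 5
\de fourth definition
"""

for problem in sense_number_problems(parse_fields(SAMPLE)):
    print(problem)
```
</pre>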
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>I still think
Toolbox is excellent software for a lexicographer, and it has allowed me to
do pretty much everything I wanted to with the dictionary. But it is also
important to say that I would not be able to do some of those things without
the excellent programmer that I have on my team. So, Toolbox offers flexibility
to do things, but with that flexibility comes additional work. TshwaneLex
probably automates a lot of the features that we handle with various CCTs and
scripts, but then maybe it isn’t flexible enough to allow us to do other things. My
main worry, though, is that the data entry process is much more time-consuming in
TshwaneLex than it is in Toolbox.<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US>Jan<o:p></o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
<p style='margin:0cm;margin-bottom:.0001pt'><span lang=EN-US><o:p> </o:p></span></p>
</div>
</body>
</html>