(Fwd) Re: [Lexicog] dialect tagging
David Joffe
david.joffe at TSHWANEDJE.COM
Wed Jul 1 10:21:38 UTC 2009
> Can you display this kind of info in some user-defined format in the
> output (like the printed dictionary)? e.g.
>
> trunk ... 3. (British) rear compartment in an automobile
>
> where the user defines the formatting, e.g. parens around the dialect
> tag (as above), or maybe italics; and of course the user decides where
> this information goes in the output, like in this case between the sense
> number (which I'm assuming is automatically generated in TshwaneLex) and
> the sense itself.
>
> If there were multiple tags, you might want some separating punctuation,
> like ', ':
>
> trunk ... 3. (British, Australian) rear compartment in
> an automobile
Certainly, you can do all of the above. These are in fact *basics*
of a dictionary editing system - TshwaneLex could do all of these
since before version 1 at least five years ago already - if it
couldn't, I doubt we would've sold much by now.
The TshwaneLex Styles system allows you to define the output order
of attributes and child elements relative to any element (including
the automatic number, which is optional and is in fact just another
attribute in the DTD), so you can easily configure it to output:
SenseNumber
Dialect
Definition*
or any order you prefer. You can also do this 'centrally' at any
time - i.e. you can change the desired output order of attributes
with just a few clicks even the day before you want to typeset the
dictionary, and it will change throughout the dictionary.
Each attribute also has its own plain old formatting styles - bold,
italics, font, font size, superscript, subscript, underline, etc. In
addition to that, you can configure in the Styles system the common
punctuation to be generated before/after (e.g. your brackets above)
any attribute element, as well as *between* both list attributes
(like the dialects) and between elements - so you can easily
configure if you want to separate the multi-list items with a comma
or any other string (comma is the default). (Can I post attachments
to the list? I could show screenshots of this.)
And again, you can configure this at any time - e.g. if literally
the day before you typeset you decide you want square brackets
instead of round brackets throughout the dictionary, or want a semi-
colon instead of a comma between the dialects, you can change this
centrally with just a few clicks, and all with a user-friendly
interface so anyone can do it (though advanced users could bypass
the interface and configure these things with scripts).
Again, sorry, but those are *basics* (I've long since taken these
aspects 'for granted' now). In version 3 (which is now two years
old) we even added programmable dynamic "smart" styles - for
advanced users, of course, but this allows every aspect of the
styles for every element or attribute to even optionally be decided
in real-time by a script:
http://tshwanedje.com/tshwanelex/overview.html#sma (as TshwaneLex
has a built-in scripting language embedded). The possibilities for
this are basically endless, but the screenshot at that link shows a
basic sample whereby headwords are highlighted differently if
they're amongst the most frequent as determined by a corpus (a
common dictionary thing). Important to note is that in that sample
this happens in real time - if you re-import new corpus frequencies,
for example, the highlighting immediately changes to reflect the new
information. Another sample included with TshwaneLex, and something
fairly commonly done in dictionaries, is to output different
punctuation depending on what appears before/after some specific
type of information (e.g. if a usage example is preceded by a
definition output ":", if preceded by (say) a cross-reference then
output ";" - or whatever).
> I.e. can the user edit the configuration files? (perhaps as in
> applications like Endnote)
Not sure what you mean by this.
- David
---
http://tshwanedje.com/
TshwaneDJe Human Language Technology
------------------------------------
Yahoo! Groups Links
<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/lexicographylist/
<*> Your email settings:
Individual Email | Traditional
<*> To change settings online go to:
http://groups.yahoo.com/group/lexicographylist/join
(Yahoo! ID required)
<*> To change settings via email:
mailto:lexicographylist-digest at yahoogroups.com
mailto:lexicographylist-fullfeatured at yahoogroups.com
<*> To unsubscribe from this group, send an email to:
lexicographylist-unsubscribe at yahoogroups.com
<*> Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/
More information about the Lexicography
mailing list