[Corpora-List] number and dates normalization
Shachar Mirkin
mirkins at macs.biu.ac.il
Tue Jul 29 15:17:06 UTC 2008
Hi,
Here's a summary of the pointers we got for the number and date
normalization inquiry:
- ICU4J (http://icu-project.org/index.html) - a set of libraries for
globalization purposes, including number and date formatting.
- A date normalizer by Mark Greenwood found at:
http://www.dcs.shef.ac.uk/~mark/dev/java/index.html
- Unix date program, part of GNU coreutils:
http://www.gnu.org/software/coreutils/
- hCalendar ( <http://microformats.org/wiki/hcalendar>
http://microformats.org/wiki/hcalendar , a microformats standard for
calendaring and events format.
- TempEx: for date and time expression tagging by George Wilson:
http://timex2.mitre.org/cgi-bin/download?file=TempEx_R1_05_03.tar
Thanks to Trevor Jenkins, Michael Hawkes, Mark Greenwood and George Wilson
for their help.
Shachar
_____
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of
Shachar Mirkin
Sent: Thursday, July 24, 2008 8:25 PM
To: corpora at uib.no
Subject: [Corpora-List] number and dates normalization
Hi,
I'm looking for an available package (preferably Java) for numbers and dates
normalization, that given "fifteen hundred" will return "1500" and given
"January, 23 1987" will return a date in some predefined schema, e.g.
"23/1/87".
Anyone knows of such a tool?
Thanks,
Shachar Mirkin
Bar-Ilan University, Israel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20080729/5e789fd2/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list