Corpora: Measures for the similarity between two sentences
Patrick Ruch
ruch at dim.hcuge.ch
Mon Nov 13 18:45:56 UTC 2000
Hi Constantin,
If what you need is a string edit distance (the kind of measure behind a diff tool),
a good introduction, covering both theory and practice in a didactic way,
can be found here:
http://www-igm.univ-mlv.fr/~lecroq/seqcomp/index.html
You will also find a couple of links there.
Some (Unix-like) C code can be downloaded here:
http://odur.let.rug.nl/~kleiweg/levenshtein/
Such generic metrics can be applied to any kind of material
(tokens, part-of-speech tags, word senses).
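
To illustrate the point about applying the metric to any material, here is a minimal Python sketch of the standard Levenshtein edit distance. It is not taken from either of the pages linked above; the function names edit_distance and similarity are only illustrative, and the normalisation by the longer sequence length is one assumed choice among several. Because it only compares elements for equality, the same function works on characters, tokens or tag sequences.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings, token lists, ...).

    Counts the minimum number of insertions, deletions and substitutions
    needed to turn sequence a into sequence b, each with unit cost.
    """
    # prev[j] holds the distance between the first i-1 items of a
    # and the first j items of b; start with the empty prefix of a.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Assumed normalisation: 1.0 for identical sequences, 0.0 for maximal distance."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


# Character level
chars = edit_distance("kitten", "sitting")

# Token level: split the sentences first, then compare the token lists
s1 = "the cat sat on the mat".split()
s2 = "the cat sat on a mat".split()
tokens = edit_distance(s1, s2)
```

For two sentences, comparing token lists (as in the last lines) usually gives a more meaningful figure than comparing raw characters; the same call works on a list of part-of-speech tags.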
Regards,
-Patrick
__________________________________
Patrick Ruch
HUG - Medical Informatics Division
CH-1211 Geneva 14
tel.: (+41 22) 372 61 64
fax: (+41 22) 372 48 55
email: Patrick.Ruch at dim.hcuge.ch