Corpora: Measures for the similarity between two sentences

Patrick Ruch ruch at dim.hcuge.ch
Mon Nov 13 18:45:56 UTC 2000


Hi Constantin,

If what you need is a string edit distance (the kind of measure
behind diff tools), then a good introduction, covering both theory
and practice in a didactic way, can be found here:
http://www-igm.univ-mlv.fr/~lecroq/seqcomp/index.html
That page also has a couple of further links.

Some C code (for Unix-like systems) can be downloaded here:
http://odur.let.rug.nl/~kleiweg/levenshtein/

Such generic metrics can be applied to any kind of material
(tokens, part-of-speech tags, word senses).
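
To illustrate that last point, here is a minimal sketch in Python of
the classic unit-cost edit distance (Levenshtein), written so that it
compares sequences of arbitrary items rather than only characters.
It is not taken from either of the resources above; the function
name edit_distance and the example sentences are my own assumptions,
and real applications may want different costs per operation.

```python
from typing import Hashable, Sequence


def edit_distance(a: Sequence[Hashable], b: Sequence[Hashable]) -> int:
    """Unit-cost edit distance between two sequences (hypothetical helper)."""
    # prev[j] = distance between the items of a seen so far (minus the
    # current one) and the first j items of b. Initially a is empty,
    # so the distance is j insertions.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        # Turning the first i items of a into an empty sequence costs i deletions.
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x from a
                curr[j - 1] + 1,         # insert y into a
                prev[j - 1] + (x != y),  # substitute x by y (free if equal)
            ))
        prev = curr
    return prev[-1]


# Characters, word tokens, or tags: the items only need to support ==.
chars = edit_distance("kitten", "sitting")
tokens = edit_distance("the cat sat".split(), "the dog sat down".split())
```

The same function would work on part-of-speech tag sequences or
word-sense identifiers, since it only tests items for equality.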

Regards,
-Patrick
__________________________________
Patrick Ruch
HUG - Medical Informatics Division
CH-1211 Geneva 14
tel.: (+41 22) 372 61 64
fax: (+41 22) 372 48 55
email: Patrick.Ruch at dim.hcuge.ch


