Corpora: approximations (bounds) for edit distance
Patrick Ruch
patrick.ruch at dim.hcuge.ch
Mon Dec 3 09:32:07 UTC 2001
Hello,
My feeling is that all these bit-parallelism and speed-up approaches to the
basic string-to-string edit distance depend heavily on the raw data
you are working on.
If I understand correctly, you plan to filter the text in order to reduce the
area where dynamic programming needs to be used. Such algorithms can achieve
sublinear time in most cases, but this holds only for a low error ratio
(useful in biocomputing, not in OCR!). What is the foreseen application?
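
To make the filtering idea concrete, here is a minimal sketch, assuming a
simple pigeonhole partition filter (one possible filter, not necessarily the
one you have in mind). If a match has at most k errors, then at least one of
k+1 disjoint pieces of the pattern must occur in it unchanged, so dynamic
programming only has to run in a small window around each exact piece hit.
The names sellers_matches and filtered_matches are hypothetical, chosen for
illustration only.

```python
def sellers_matches(pattern, text, k):
    """Plain dynamic programming (Sellers' variant of edit distance).

    Returns the sorted end positions j (exclusive) such that some
    substring text[s:j] is within edit distance k of pattern.
    """
    m = len(pattern)
    prev = list(range(m + 1))      # column for the empty text prefix
    ends = []
    for j, c in enumerate(text, 1):
        cur = [0]                  # row 0 is 0: a match may start anywhere
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == c else 1
            cur.append(min(prev[i - 1] + cost,   # match or substitution
                           prev[i] + 1,          # extra character in text
                           cur[i - 1] + 1))      # missing pattern character
        if cur[m] <= k:
            ends.append(j)
        prev = cur
    return ends


def filtered_matches(pattern, text, k):
    """Pigeonhole filter: verify with DP only around exact piece hits."""
    m = len(pattern)
    pieces = k + 1
    if m < pieces:
        raise ValueError("pattern must have at least k + 1 characters")
    # Cut the pattern into k + 1 non-empty, disjoint pieces.
    bounds = [p * m // pieces for p in range(pieces + 1)]
    ends = set()
    for p in range(pieces):
        piece = pattern[bounds[p]:bounds[p + 1]]
        start = text.find(piece)
        while start != -1:
            # A match containing this hit can extend at most k characters
            # beyond the pattern's own extent on either side.
            lo = max(0, start - bounds[p] - k)
            hi = min(len(text), start + (m - bounds[p]) + k)
            for e in sellers_matches(pattern, text[lo:hi], k):
                ends.add(lo + e)
            start = text.find(piece, start + 1)   # allow overlapping hits
    return sorted(ends)
```

Both functions should report the same end positions; the filtered version
just skips the regions with no piece hit. As k grows relative to the pattern
length, the pieces get shorter and hit almost everywhere, so the windows end
up covering most of the text and the saving disappears, which is the low
error ratio caveat above.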
Thx,
-Patrick