OCR advice?

James Bailey jobailey at FACSTAFF.WISC.EDU
Wed Feb 2 00:29:59 UTC 2000


Dear Michael,
     Since other people may be interested in this I will try to answer your
question.  I have used Fine Reader to scan Cyrillic texts with fairly good
results.  Formating does tend to fall apart and word in italic may come out
with different letters.  On the whole I'm satisfied with it.  It can be
used easily to do a batch file with multiple pages.  I've copied as many as
45 at the rate of about 1 minute for page through the scanner.  Page
numbers are usually confused with the result that you have to edit the top
of each page.
     I do not know anything about other program that may be available.  You
can get Fine Reader from Smart Link (800) 256 4814  or info at smartlinkcorp.com
     Best of luck,
     James Bailey


At 05:22 PM 2/1/2000 -0600, you wrote:
>Mark: read this for me & offer any advice. Questions I should ask? Gaffes
>I've made?
>
>Dear SEELANGers,
>
>I'm about to begin a project that will involve converting 200+ pages of
>Cyrillic text into something that will eventually be in HTML format. Since
>there are many tech-savvy people and companies that read this usenet, I
>thought I'd start here.
>
>I'm looking for any recommendations for OCR software: The text that needs to
>be scanned is clear & fairly homogeneous, but it's poetry, so formatting is
>a complicated affair. Since this will eventually be used in HTML documents,
>the scanner should convert the text into  (I think) KOI-8, preferably to
>other formats as well (like the MAC- or PC-related codes for HTML editing in
>Cyrillic). Ideally, it should scan directly into Microsoft Word, since I've
>had good luck converting Cyrillic documents from Word to DreamWeaver (the
>HTML editor I use).
>
>Has anyone had any experience with OCR technology? Any problems using the
>data in HTML format? Does Microsoft have integrated software to use with
>Cyrillic? Any and all advice appreciated. Please respond off list, unless
>you believe that your response will be of general interest.
>
>Michael A. Denner
>Northwestern University
>
>
>+++***+++
>the preacher should shout... with thundering voice: "'pause, avast, why so
>seeming fast, but deadly slow?'"
>thoreau. walden. 1854.
>
>-------------------------------------------------------------------------
> Use your web browser to search the archives, control your subscription
>  options, and more.  Visit and bookmark the SEELANGS Web Interface at:
>                http://members.home.net/lists/seelangs/
>-------------------------------------------------------------------------

James Bailey
1102 Hathaway Dr.
Madison, WI  53711
(608) 271-3824

-------------------------------------------------------------------------
 Use your web browser to search the archives, control your subscription
  options, and more.  Visit and bookmark the SEELANGS Web Interface at:
                http://members.home.net/lists/seelangs/
-------------------------------------------------------------------------



More information about the SEELANG mailing list