Google now doing on-the-fly OCR on scanned PDF images

Grant Barrett gbarrett at WORLDNEWYORK.ORG
Thu Oct 30 21:59:22 UTC 2008


 From Google is an announcement that increases findability:

http://googleblog.blogspot.com/2008/10/picture-of-thousand-words.html

"We are now able to perform OCR on any scanned documents that we find
stored in Adobe's PDF format. This Optical Character Recognition (OCR)
technology lets us convert a picture (of a thousand words) a thousand
words -- words that can be searched and indexed, so that these
valuable documents are more easily found. This is a small but
important step forward in our mission of making all the world's
information accessible and useful."

Grant Barrett
gbarrett at worldnewyork.org

------------------------------------------------------------
The American Dialect Society - http://www.americandialect.org



More information about the Ads-l mailing list