Linux.com

Re(1): gscan2pdf

Posted by: Anonymous [ip: 88.162.36.171] on June 29, 2008 01:30 AM
It's completely possible to cut or copy the OCR text from the output window in gscan2pdf and then paste it into whatever text manipulation application you prefer. This works fine for smaller projects, although I agree that it's probably not the best choice for dealing with a whole book.

There's also the xsane2tess script, a wrapper for tesseract-ocr that can be used with xsane's scan to text feature. See: http://doc.ubuntu-fr.org/xsane2tess (in French, but try Google translation).

#

Return to How to scan and OCR like a pro with open source tools