Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Text to OCR - offline
#5
The problem with a basic Tesseract, is it is command line. Obviously the best way if OCR-ing a whole book. One problem is loss of formatting, tend to get long lines of text with no breaks and no headings etc.

I use it in Linux for small 'screen captured' text images using a GUI (prefer YAGF but not working in 'buntu 18.04 so gImageReader) .

For a screen capture always need some pre-processing in Gimp, scaling up 200% - 300%, clean background etc.

There is a Tesseract for Windows with GUI here: https://ocr.space/blog/p/free-ocr-windows.html

And a quick try-out in a Win10 VM https://i.imgur.com/H7fvKCu.jpg and that is typical, some post OCR corrections needed. Still better than typing out the whole thing Wink
Reply


Messages In This Thread
Text to OCR - offline - by Krikor - 12-12-2019, 10:14 PM
RE: Text to OCR - offline - by Ofnuts - 12-13-2019, 10:55 AM
RE: Text to OCR - offline - by Krikor - 12-13-2019, 11:52 PM
RE: Text to OCR - offline - by Ofnuts - 12-14-2019, 12:33 AM
RE: Text to OCR - offline - by rich2005 - 12-14-2019, 10:26 AM
RE: Text to OCR - offline - by Krikor - 12-14-2019, 09:10 PM

Forum Jump: