Tesseract OCR Chopper

Utilization/last two weeks

This project is a box file editor for training the Tesseract OCR engine. It converts an uploaded image to TIFF with ImageMagick, then runs Tesseract with its makebox option to generate the box file. For more information, see the full writeup and comments.
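The conversion and makebox steps described above can be sketched roughly as follows. This is a minimal illustration, not this site's actual code: it assumes the `convert` and `tesseract` binaries are on the PATH, uses the `batch.nochop makebox` form from the Tesseract 3.x training instructions, and the function names and the `eng.demo.exp0` base name are hypothetical. The exact ImageMagick options used on the server are not known.

```python
import subprocess
from pathlib import Path


def build_commands(upload, base):
    """Return the two command lines: ImageMagick conversion, then Tesseract makebox.

    `upload` is the uploaded image path and `base` is an output base name
    (both hypothetical; the real tool's naming scheme is not documented here).
    """
    tif = f"{base}.tif"
    # ImageMagick picks the output format from the .tif extension.
    convert_cmd = ["convert", upload, tif]
    # Tesseract 3.x training form; it writes <base>.box next to the image.
    makebox_cmd = ["tesseract", tif, base, "batch.nochop", "makebox"]
    return [convert_cmd, makebox_cmd]


def make_box_file(upload, base):
    """Run both steps and return the path of the generated box file."""
    for cmd in build_commands(upload, base):
        subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return Path(f"{base}.box")


def parse_box_line(line):
    """Parse one box file line: '<glyph> <left> <bottom> <right> <top> [<page>]'.

    Coordinates are in pixels with the origin at the bottom-left of the image.
    """
    parts = line.split()
    glyph = parts[0]
    numbers = [int(p) for p in parts[1:]]
    left, bottom, right, top = numbers[:4]
    page = numbers[4] if len(numbers) > 4 else 0  # page column is optional
    return {"glyph": glyph, "left": left, "bottom": bottom,
            "right": right, "top": top, "page": page}
```

An editor would load each parsed line, let the user correct the glyph or drag the box edges, and write the lines back in the same space-separated format.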

Running tesseract 3.01 and ImageMagick 6.5.4-7. Note: please don't abuse this tool/live demo or repurpose it as a web service. In case of resource abuse, I'll be forced to throttle offending IP addresses.


Update: sorry, a server migration last month disabled tesseract.

We're back in business now. Please report any issues.

Upload new image




Note: this is a beta; no error message is shown if ImageMagick is unable to process the uploaded file (a PDF, for example).
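For anyone building something similar, one way to avoid the silent failure mentioned in the note is to check the exit status of each external command and pass its error output back to the user. The sketch below is an assumption about how that could look, not how this tool works; `run_checked` is a hypothetical helper name.

```python
import subprocess


def run_checked(cmd):
    """Run an external command and return (ok, message) instead of failing silently."""
    try:
        result = subprocess.run(cmd, capture_output=True, text=True)
    except FileNotFoundError:
        # The executable itself could not be found.
        return False, f"{cmd[0]} is not installed or not on PATH"
    if result.returncode != 0:
        # Most command-line tools report the reason for a failure on stderr.
        detail = result.stderr.strip() or "no error output"
        return False, f"{cmd[0]} exited with status {result.returncode}: {detail}"
    return True, ""
```

A call such as `run_checked(["convert", "upload.pdf", "upload.tif"])` would then yield a message that can be shown on the upload page when the conversion fails.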

Posts on this blog solely represent my personal opinions and technical experience.

© 2009-2017 Edin (Dino) Beslagic