GOCR
From Wikipedia, the free encyclopedia
GOCR | |
---|---|
Developed by | Jörg Schulenburg |
Latest release | 0.44 / 1 March 2008 |
Genre | Optical character recognition |
License | GNU General Public License |
Website | jocr.sourceforge.net |
GOCR (or JOCR) is a free optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap or PCX) into text files.
According to the program's documentation, as of version 0.44 it is still in the early stages of development. It claims to handle single-column sans-serif fonts of 20-60 pixels in height, and reports trouble with serif fonts, overlapping characters, handwritten text, heterogeneous fonts, noisy images, large angles of skew, and text in anything other than a Latin alphabet.