OS/2 Warp Compatible Hardware List Web site: gOCR for OS/2OS/2 Scanning Solutions Overview, advantages of TAME/2.
GOCR (GNU Optical Character Recognition)
© 2003-2004, Franz Bakan, os2warp.be
GOCR, also often referred to as jOCR, is an OCR (Optical Character Recognition)
program, developed under the GNU Public License. It converts scanned images
of text back to text files. Joerg Schulenburg started the program, and now leads
a team of developers with Bruno Barberi Gnecco. It reads images in many formats
(pnm, pbm, pgm, ppm, some pcx and tga image files and outputs a text file. The
gOCR/2 port was developed by Franz Bakan.
| Here you can find OS/2 executables of GOCR Version 0.38 if you want to play with it.|
GOCR.EXE is compiled with GCC 3.2.1 (available via ftp from netlabs).
Here is the link:
- Type gocr -h for usage.
- Example 'one-liner' of a scan2text.cmd:
scanimage --device=epson --mode=Gray --resolution=300 | gocr - > textfile.txt
- Another example:
scanimage --device=epson --mode=Gray --resolution=300 1>out.pnm 2>out.error && gocr out.pnm > ocr.txt
How to compile with GCC 3.2.1:
- If the image is complex or the letters are small, gocr is quite slooow. (expect duration of serveral minutes!).
- I suggest that you make your first tests with small scans.
Compiling wiht emx-gcc required 2 more steps:
- get and install os2unix
- (probably only if you use 2.0 of os2unix) remove the 3 lines
LIBOBJS := $(LIBOBJS:.o=.obj)
LIBOBJS = pgm2asc.$(obj)
copy srclibPgm2asc.a srcPgm2asc.a