Originally Posted By: tanstaafl.
I'll scan through their forum, though, and see if I can pick up any pointers.
Okay, I did pick up one pointer that paid off. The forum members seem pretty fond of the ABBYY program for OCR. It's on sale for half price through the end of February, so I sprang $30 for it. At a rough approximation, it cut my OCR errors by about a factor of 10 compared to the OCR built into Adobe Acrobat.

Reading through the forum and looking at the questions being asked, I don't think there's much there that they can teach me. Well, that's a bit unfair: they are focused more on getting their (IMHO) somewhat Rube Goldberg machines to work properly, so they're more into the hardware end than the software end. They're doing non-destructive eBook conversions, whereas I am destroying my books, which greatly simplifies the process.

I can highly recommend the ABBYY program for OCR. It does what I call "Intelligent OCR": it has dictionaries for about 50 languages built into it, and rather than converting letter by letter, it works word by word. So if it sees "floor" with the troublesome "fl" ligature, it doesn't turn it into "hoor" or "Aoor".
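
For anyone curious what the word-by-word idea looks like in practice, here is a toy sketch in Python. To be clear, this is only my guess at the general principle, not how ABBYY actually does it. The confusion table, the tiny word list, and the function names are all made up for illustration.

```python
# Toy illustration of dictionary-aided OCR correction.
# Assumption: this is NOT ABBYY's algorithm, just the general idea of
# checking whole words against a dictionary instead of trusting each glyph.

# Hypothetical table: a glyph the OCR engine emitted -> what it may really be.
CONFUSIONS = {
    "h": ["fl"],  # the "fl" ligature misread as "h"
    "A": ["fl"],  # the "fl" ligature misread as "A"
}

# Hypothetical stand-in for a real language dictionary.
DICTIONARY = {"floor", "flat", "hoot", "the", "on"}


def candidates(word):
    """Yield spellings reachable by undoing known glyph confusions."""
    if not word:
        yield ""
        return
    # Try replacing a confusable fragment at the start of the word.
    for fragment, alternatives in CONFUSIONS.items():
        if word.startswith(fragment):
            for alt in alternatives:
                for rest in candidates(word[len(fragment):]):
                    yield alt + rest
    # Also try keeping the first character exactly as read.
    for rest in candidates(word[1:]):
        yield word[0] + rest


def correct_word(word):
    """Return the word unchanged if it is known, else the first known candidate."""
    if word.lower() in DICTIONARY:
        return word
    for cand in candidates(word):
        if cand.lower() in DICTIONARY:
            return cand
    return word  # nothing better found, so leave the OCR output alone


def correct_text(text):
    """Apply word-level correction to whitespace-separated text."""
    return " ".join(correct_word(w) for w in text.split())
```

With this sketch, correct_text("on the Aoor") gives "on the floor", while a real word like "hoot" is left alone because it is already in the word list. A real engine would presumably weigh glyph confidence and context as well, which this toy ignores.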

tanstaafl.
_________________________
"There Ain't No Such Thing As A Free Lunch"