MariaAux Posted November 29, 2011

Hi, I am undertaking a massive project: converting dozens of PDF books into text-searchable PDFs, having the text extracted by outsourced employees into their various categories, and then importing it into an FM database for personal use.

The problem with OCR, as you know, is that words often come out with irregular spacing, as in this text:

O n e c o p y of Sheet 7 (page 55) p e r g r o u p of four. O n e paper fastener for each group. O n e dice p e r g r o u p

The regular FM spellchecker won't pick up the mistakes when the letters are spaced out, each by itself. Other applications exist to do this job, such as AfterScan, but copying 200,000 text blocks from a PDF into AfterScan and then into Excel makes it a lot more work.

Is there an OCR spell-check plug-in for FileMaker Pro? Or maybe even one for Excel, where all the text will first be copied before being imported?

Thank you so much for your advice.
Guest Posted June 5, 2012

This isn't for FileMaker, but have a look at a product called 'Trim' by HP. It is used extensively to OCR documents and store them. We use it at my company for archiving hard-copy student records, where it is excellent.
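For the spaced-out letters described in the original post, one possible pre-import cleanup step is sketched below in Python. This is not a FileMaker or Excel plug-in, and it is only a rough illustration under stated assumptions: the text blocks can be passed through a script before import, and a word list is available. The names `VOCAB`, `segment`, and `repair` are made up for this example, and the tiny vocabulary is a placeholder for a real dictionary. The idea is to find runs of three or more single letters separated by spaces, join them, and split the result back into known words. Runs that cannot be split into known words are left untouched.

```python
import re

# Hypothetical, deliberately tiny vocabulary; a real run would load a full word list.
VOCAB = {"one", "copy", "per", "group"}

# A run of at least three single letters, each separated by exactly one space.
RUN = re.compile(r"\b(?:[A-Za-z] ){2,}[A-Za-z]\b")


def segment(letters, vocab):
    """Split a string of letters into the fewest vocabulary words, or return None."""
    n = len(letters)
    best = [None] * (n + 1)  # best[i] holds the words covering letters[:i]
    best[0] = []
    for i in range(1, n + 1):
        for j in range(i):
            if best[j] is not None and letters[j:i].lower() in vocab:
                candidate = best[j] + [letters[j:i]]
                if best[i] is None or len(candidate) < len(best[i]):
                    best[i] = candidate
    return best[n]


def repair(text, vocab=VOCAB):
    """Rejoin spaced-out letter runs when they split cleanly into known words."""
    def fix(match):
        letters = match.group(0).replace(" ", "")
        words = segment(letters, vocab)
        # Leave the run as it was if the vocabulary cannot explain it.
        return " ".join(words) if words else match.group(0)
    return RUN.sub(fix, text)


sample = ("O n e c o p y of Sheet 7 (page 55) p e r g r o u p of four. "
          "O n e paper fastener for each group. O n e dice p e r g r o u p")
print(repair(sample))
```

With the placeholder vocabulary above, the sample from the original post comes out as "One copy of Sheet 7 (page 55) per group of four. One paper fastener for each group. One dice per group". Results on real scans depend entirely on the word list, so the output would still need a normal spell check afterwards.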