
This topic is 4545 days old. Please don't post here. Open a new topic instead.


Posted

Hi,

I am undertaking a massive project: converting dozens of PDF books into text-searchable PDFs, having the text extracted into categories by outsourced employees, and then importing it into an FM database for personal use.

The problem with OCR, as you know, is that words often come out with irregular spacing, as in this text:

O n e c o p y of Sheet 7 (page 55) p e r g r o u p of four. O n e paper fastener

for each group. O n e dice p e r g r o u p

The regular FM spellchecker won't pick up the mistakes when the letters are spaced out individually. Other applications, such as AfterScan, exist to do this job, but copying 200,000 text blocks from a PDF into AfterScan and then into Excel makes it a lot more work.

Is there an OCR spell-check plug-in for FileMaker Pro? Or maybe even one for Excel, where all the text will first be copied before being imported?

Thank you so much for your advice.
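
For anyone hitting the same spacing problem, here is a minimal sketch of a pre-processing step that could run before the text is imported. It is not a FileMaker or Excel plug-in, just a standalone Python script, and it assumes that a damaged word shows up as a run of single letters separated by single spaces. The `WORDS` set and the names `segment` and `fix_spaced_letters` are hypothetical; a real run would load a full dictionary file instead of the tiny word list used here.

```python
import re

# Hypothetical mini word list for illustration only;
# in practice, load a full dictionary file into this set.
WORDS = {"one", "copy", "per", "group", "dice", "paper", "of", "four"}

def segment(letters, words=WORDS):
    """Split a string of letters into dictionary words.

    Returns the split with the fewest words, or None if no split exists.
    """
    n = len(letters)
    best = [None] * (n + 1)  # best[i] = words covering letters[:i]
    best[0] = []
    for i in range(1, n + 1):
        for j in range(i):
            piece = letters[j:i]
            if best[j] is not None and piece.lower() in words:
                cand = best[j] + [piece]
                if best[i] is None or len(cand) < len(best[i]):
                    best[i] = cand
    return best[n]

# A run is two or more single letters separated by single spaces,
# bounded by whitespace or the start/end of the text.
RUN = re.compile(r"(?<!\S)[A-Za-z](?: [A-Za-z])+(?!\S)")

def fix_spaced_letters(text, words=WORDS):
    """Rejoin spaced-out letter runs when they split cleanly into known words."""
    def repl(match):
        parts = segment(match.group(0).replace(" ", ""), words)
        # Leave the run untouched if the dictionary cannot explain it.
        return " ".join(parts) if parts else match.group(0)
    return RUN.sub(repl, text)

if __name__ == "__main__":
    sample = "O n e c o p y of Sheet 7 (page 55) p e r g r o u p of four."
    print(fix_spaced_letters(sample))
```

Runs that the word list cannot explain are left exactly as they were, so they can still be flagged for manual review rather than silently changed.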

  • 6 months later...
Posted

This isn't for FileMaker, but have a look at a product called 'Trim' by HP. It is used extensively to OCR documents and store them. We use it at my company for archiving hard-copy student records, where it is excellent.
