
This topic is 3594 days old. Please don't post here. Open a new topic instead.


There are basically two types of PDFs: 1) text/vector-based ones, with a subcategory (X-PDF, or was it PDF-X?) where the scanner has already OCRed the pages, and 2) scanned, pixel-based ones. For case 1, I'd use pdftotext from the poppler-utils Linux package (probably available for Mac through Homebrew, http://brew.sh, or you can compile it from source: http://poppler.freedesktop.org/). For case 2, I would use ImageMagick with Tesseract or OCRopus.
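A rough sketch of how the two cases could be chained together, assuming pdftotext, ImageMagick, and tesseract are all installed and on the PATH. The function names, the 20-character threshold for "this PDF has no text layer", and the 300 DPI setting are my own assumptions for illustration, not anything these tools prescribe:

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def pdftotext_cmd(pdf):
    # "-" as the output file makes pdftotext write to stdout.
    return ["pdftotext", "-layout", str(pdf), "-"]


def rasterize_cmd(pdf, out_pattern, dpi=300):
    # ImageMagick 7 ships "magick"; older installs only have "convert".
    tool = "magick" if shutil.which("magick") else "convert"
    return [tool, "-density", str(dpi), str(pdf), str(out_pattern)]


def tesseract_cmd(image):
    # "stdout" as the output base makes tesseract print the recognized text.
    return ["tesseract", str(image), "stdout"]


def needs_ocr(text, min_chars=20):
    # Assumed heuristic: almost no extracted text means a pixel-only scan.
    return len(text.strip()) < min_chars


def extract_text(pdf):
    # Case 1: try the embedded text layer first.
    result = subprocess.run(pdftotext_cmd(pdf), capture_output=True,
                            text=True, check=True)
    if not needs_ocr(result.stdout):
        return result.stdout

    # Case 2: render each page to PNG, then OCR the pages in order.
    with tempfile.TemporaryDirectory() as tmp:
        pattern = Path(tmp) / "page-%03d.png"
        subprocess.run(rasterize_cmd(pdf, pattern), check=True)
        pages = sorted(Path(tmp).glob("page-*.png"))
        return "\n".join(
            subprocess.run(tesseract_cmd(page), capture_output=True,
                           text=True, check=True).stdout
            for page in pages
        )
```

Calling `extract_text("scan.pdf")` (a hypothetical file name) would return the text either way. OCRopus is not covered here; it would replace the tesseract step.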

