
Pulling text from a PDF file imported to FileMaker?


This topic is 2818 days old. Please don't post here. Open a new topic instead.


There are basically two types of PDFs: 1) text/vector-based ones, including the subcategory of scans that the scanner has already OCRed (I'm not sure of the name; it is not PDF/X, which is a print-exchange standard), and 2) scanned, pixel-based ones. For case 1 I'd use pdftotext from the poppler-utils Linux package (probably available from brew for Mac, http://brew.sh, or you can compile from source: http://poppler.freedesktop.org/). For case 2 I would use ImageMagick with Tesseract or OCRopus.
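A rough sketch of how the two cases could be wired together in Python, in case it helps. It assumes `pdftotext`, ImageMagick's `convert` (version 6; on version 7 the command is `magick`, and rasterizing PDFs needs Ghostscript installed), and `tesseract` are all on the PATH. The function names and the "fewer than 20 characters means it is a scan" rule are my own illustration, not something any of these tools provide.

```python
import subprocess
import tempfile
from pathlib import Path


def pdftotext_cmd(pdf_path, txt_path):
    # Usage: pdftotext [options] PDF-file [text-file]; -layout keeps the page layout.
    return ["pdftotext", "-layout", str(pdf_path), str(txt_path)]


def rasterize_cmd(pdf_path, out_pattern, dpi=300):
    # ImageMagick 6 command; -density sets the resolution used to render the PDF.
    return ["convert", "-density", str(dpi), str(pdf_path), str(out_pattern)]


def tesseract_cmd(image_path, out_base, lang="eng"):
    # tesseract writes its result to <out_base>.txt
    return ["tesseract", str(image_path), str(out_base), "-l", lang]


def looks_scanned(text, min_chars=20):
    # Heuristic (an assumption): almost no non-whitespace characters
    # extracted means the PDF has no usable text layer.
    return len("".join(text.split())) < min_chars


def extract_text(pdf_path):
    """Case 1: try pdftotext. Case 2: fall back to rasterize + OCR."""
    pdf_path = Path(pdf_path)
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        txt = tmp / "out.txt"
        subprocess.run(pdftotext_cmd(pdf_path, txt), check=True)
        text = txt.read_text(encoding="utf-8", errors="replace")
        if not looks_scanned(text):
            return text

        # No text layer found: render one PNG per page, then OCR each page.
        subprocess.run(rasterize_cmd(pdf_path, tmp / "page-%03d.png"), check=True)
        pages = []
        for image in sorted(tmp.glob("page-*.png")):
            base = image.with_suffix("")  # e.g. page-000
            subprocess.run(tesseract_cmd(image, base), check=True)
            ocr_file = base.with_suffix(".txt")
            pages.append(ocr_file.read_text(encoding="utf-8", errors="replace"))
        return "\n".join(pages)
```

Calling `extract_text("scan.pdf")` (the file name is just a placeholder) would return the text as a single string. The threshold and the 300 dpi setting are starting points to tune against your own files.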
