Click to See Complete Forum and Search --> : Reading text from PDF files
JWiz
May 7th, 2001, 11:01 PM
I need to read text out of PDF files, and use it in my program. Can anyone help? I have found some code that does this (called PDF Text Extractor), but you have to have Adobe Acrobat (full) installed to use it. I need to distribute this program, so that's not an option.
Please help if you know how to do this!
Thanks!
aficio1279
June 7th, 2001, 11:09 AM
Can you tell me where I can get that or post the code for the PDF text extractor
Thanks,
Scott
JWiz
June 9th, 2001, 12:36 PM
I can't remember where I got it, but if you send me your email address I'll email it to you. My email is zancate@excite.com.
JWiz
June 20th, 2001, 08:31 PM
Holycow, I didn't expect so many responses! Please do NOT email me anymore. Thanks :)
After more research I found some better seeming options. One is to use Ghostscript (http://www.cs.wisc.edu/~ghost/). Another, even better one, is the command line tools available in the Xpdf package. The pdftotext program contained therein does an excellent job. It is available for all sorts of platforms, under the GNU public license. http://www.foolabs.com/xpdf/download.html.
If you're still set on that VB code, I have posted the file online at http://www.thegoldensun.com/jack/pdf_text_extractor.zip. You can download it from there.
NOTE: I have not even attempted to actually use this code, but it claims to require Adobe Acrobat the FULL VERSION (not just the reader) to work. That kind of discouraged me, 'cause I need to distribute the sucker. Um, I forgot where I downloaded this code from, but I think they wanted credit to the author if it is used... I don't know who the author is, though, so you can just give credit to whoever you feel like, I guess :)
nolc
December 21st, 2001, 11:40 AM
Do you have any ideas how I could parse through a PDF file? My goal is to parse through a PDF and then send the desired data into a database table. If you could help me that would be SWEET!
Happy Holidays!!!!!!!
No L c
VB Developer
codeguru.com
Copyright Internet.com Inc., All Rights Reserved.