  1. #1
    Join Date
    May 2001
    Location
    Asheville, NC
    Posts
    3

    Reading text from PDF files

    I need to read text out of PDF files and use it in my program. Can anyone help? I have found some code that does this (called PDF Text Extractor), but it requires the full version of Adobe Acrobat to be installed. I need to distribute this program, so that's not an option.

    Please help if you know how to do this!
    Thanks!


  2. #2
    Join Date
    Jun 2001
    Posts
    1

    Re: Reading text from PDF files

    Can you tell me where I can get that, or post the code for the PDF Text Extractor?
    Thanks,
    Scott


  3. #3
    Join Date
    May 2001
    Location
    Asheville, NC
    Posts
    3

    Re: Reading text from PDF files

    I can't remember where I got it, but if you send me your email address I'll email it to you. My email is [email protected].


  4. #4
    Join Date
    May 2001
    Location
    Asheville, NC
    Posts
    3

    Re: Reading text from PDF files

    Holy cow, I didn't expect so many responses! Please do NOT email me anymore. Thanks.

    After more research I found some better-seeming options. One is to use Ghostscript (http://www.cs.wisc.edu/~ghost/). Another, even better, option is the command-line tools in the Xpdf package. The pdftotext program included there does an excellent job. It is available for all sorts of platforms, under the GNU General Public License: http://www.foolabs.com/xpdf/download.html
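
    For anyone who wants to call pdftotext from their own program, here is a minimal sketch in Python (not VB, and not something from the Xpdf docs themselves). It assumes the pdftotext executable is installed and on the PATH. The function names build_pdftotext_command and extract_text are made up for illustration. Passing "-" as the output file asks pdftotext to write the text to standard output, and -layout asks it to keep the page layout as far as it can.

```python
import shutil
import subprocess


def build_pdftotext_command(pdf_path, keep_layout=False, exe="pdftotext"):
    """Build the argument list for pdftotext; "-" sends the text to stdout."""
    cmd = [exe]
    if keep_layout:
        cmd.append("-layout")  # try to preserve the physical page layout
    cmd += ["-enc", "UTF-8", pdf_path, "-"]
    return cmd


def extract_text(pdf_path, keep_layout=False):
    """Run pdftotext (assumed to be on PATH) and return the text as a string."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext was not found on PATH")
    result = subprocess.run(
        build_pdftotext_command(pdf_path, keep_layout),
        capture_output=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.decode("utf-8", errors="replace")
```

    Calling extract_text("some_file.pdf") should then give you the document text as one string. Scanned PDFs that contain only images will most likely come back empty, since there is no text layer to extract.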

    If you're still set on that VB code, I have posted the file online at http://www.thegoldensun.com/jack/pdf_text_extractor.zip. You can download it from there.

    NOTE: I have not even attempted to use this code, but it claims to require the FULL VERSION of Adobe Acrobat (not just the Reader) to work. That kind of discouraged me, 'cause I need to distribute the sucker. Um, I forgot where I downloaded this code from, but I think they wanted credit given to the author if it is used... I don't know who the author is, though, so you can just give credit to whoever you feel like, I guess.


  5. #5
    Join Date
    Aug 2001
    Location
    Minneapolis, MN, USA
    Posts
    150

    Re: Reading text from PDF files

    Do you have any ideas how I could parse through a PDF file? My goal is to parse through a PDF and then send the desired data into a database table. If you could help me that would be SWEET!
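
    One possible shape for this, as a rough sketch only: get the text out of the PDF first (for example with the pdftotext tool mentioned earlier in the thread), then pick out the lines you care about and insert them into a table. The Python below assumes the extracted text contains "Name: value" lines, which is purely a made-up format for illustration. It uses an in-memory SQLite database as a stand-in for whatever database you actually have. The names parse_fields, store_fields and pdf_fields are hypothetical.

```python
import sqlite3


def parse_fields(text):
    """Collect "Name: value" lines from extracted text into (name, value) pairs."""
    rows = []
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        # Keep only lines that have a colon with text on both sides of it.
        if sep and name.strip() and value.strip():
            rows.append((name.strip(), value.strip()))
    return rows


def store_fields(conn, source, rows):
    """Insert the parsed pairs into a hypothetical pdf_fields table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pdf_fields (source TEXT, name TEXT, value TEXT)"
    )
    conn.executemany(
        "INSERT INTO pdf_fields (source, name, value) VALUES (?, ?, ?)",
        [(source, name, value) for name, value in rows],
    )
    conn.commit()


# Sample text standing in for what a PDF text extractor might return.
sample = "Invoice: 1042\nCustomer: Acme Corp\nthis line has no field\n"
conn = sqlite3.connect(":memory:")
store_fields(conn, "sample.pdf", parse_fields(sample))
```

    Real PDFs rarely come out this tidy, so the parsing step is where most of the work will be. The insert uses ? placeholders so that values from the PDF are never pasted directly into the SQL string.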

    Happy Holidays!!!!!!!

    No L c
    VB Developer
