CodeGuru Home VC++ / MFC / C++ .NET / C# Visual Basic VB Forums Developer.com
Results 1 to 6 of 6
  1. #1
    Join Date
    May 2009
    Location
    Hyderabad
    Posts
    4

    Extract text from Scanned Image

    Hi all,
    I want to extract text from scanned Image using c#.net. I have tried by converting the scanned image to PDF and then extract text from PDF.
    I am extracting text from PDF using pdfbox , and able to get the text from normal PDF file. I am not able to get the text from the PDF file which I converted the Image to PDF.
    Some tools(OCR) are there, but I have to implement using C#.NET.

    Please give some Ideas how to get the text from scanned Image.
    Thanks in advance.

  2. #2
    Join Date
    Jul 2001
    Location
    Sunny South Africa
    Posts
    11,284

    Re: Extract text from Scanned Image

    From what I've heared, OCR with .NET is a bit problematic at this stage. Yes, there are numerous OCR SDKs on Microsoft's site, but I haven't heared much good about them yet.

  3. #3
    Join Date
    May 2009
    Location
    Hyderabad
    Posts
    4

    Re: Extract text from Scanned Image

    Thanks for the response..

    I am able to extract English text from the scanned Image using MODI C#.NET.
    "http://www.devsource.com/c/a/Languages/Using-The-Office-2007-OCR-Component-in-C/".

    But I am not able to get my Local Language text(Telugu) from the Image Is it possible to achieve this?

    Please help me regarding this...

    Thanks in advance.

  4. #4
    Join Date
    Nov 2015
    Posts
    2

    Re: Extract text from Scanned Image

    Quote Originally Posted by Post2Naresh View Post
    Hi all,
    I want to extract text from scanned Image using c#.net. I have tried by converting the scanned image to PDF and then extract text from PDF.
    I am extracting text from PDF using pdfbox , and able to get the text from normal PDF file. I am not able to get the text from the PDF file which I converted the Image to PDF.
    Some tools(OCR) are there, but I have to implement using C#.NET.

    Please give some Ideas how to get the text from scanned Image.
    Thanks in advance.
    maybe you can try Tesseract OCR engine, it's the best open source library. in may opinion, it's higher accurate than MODI.

  5. #5
    Join Date
    Jul 2001
    Location
    Sunny South Africa
    Posts
    11,284

    Re: Extract text from Scanned Image

    Erm, thanks for your efforts in helping. Keep in mind that this thread is over 6 years old. please do not reply to old threads as it messes with the forum-flow.

    There are plenty of current threads that needs assistance

  6. #6
    Join Date
    Mar 2001
    Posts
    2,529

    Re: Extract text from Scanned Image

    ahoodin
    To keep the plot moving, that's why.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  





Click Here to Expand Forum to Full Width

Featured