Click to See Complete Forum and Search --> : Extract text from Scanned Image


Post2Naresh
May 13th, 2009, 07:51 AM
Hi all,
I want to extract text from scanned Image using c#.net. I have tried by converting the scanned image to PDF and then extract text from PDF.
I am extracting text from PDF using pdfbox , and able to get the text from normal PDF file. I am not able to get the text from the PDF file which I converted the Image to PDF.
Some tools(OCR) are there, but I have to implement using C#.NET.

Please give some Ideas how to get the text from scanned Image.
Thanks in advance.

HanneSThEGreaT
May 13th, 2009, 08:48 AM
From what I've heared, OCR with .NET is a bit problematic at this stage. Yes, there are numerous OCR SDKs on Microsoft's site, but I haven't heared much good about them yet.

Post2Naresh
May 14th, 2009, 05:37 AM
Thanks for the response..

I am able to extract English text from the scanned Image using MODI C#.NET.
"http://www.devsource.com/c/a/Languages/Using-The-Office-2007-OCR-Component-in-C/".

But I am not able to get my Local Language text(Telugu) from the Image Is it possible to achieve this?

Please help me regarding this...

Thanks in advance.