Powerful OCR API for .NET to Perform Precise Text Recognition

Aspose.OCR for .NET Installation

You can install the OCR library for C# .NET by downloading the MSI installer or DLLs from the downloads section. Or, you can use the NuGet package for setting it up. The command for the Visual Studio package manager console is provided below:

PM > Install-Package Aspose.OCR

Efficiently Perform OCR on PDF Documents and Convert Images to Text

Aspose.OCR for .NET API is an excellent choice for seamlessly performing optical character recognition (OCR) for .NET developers. Applying OCR to PDF documents and converting the recognized text to Word format requires only a few lines of code. Similarly, you can build image to text converter solutions using the text recognition library for C# .NET.

Recognize and extract text from PDF documents in .NET

Optical character recognition for scanned PDF documents is made simple with the help of Aspose.OCR for .NET. It is one of the most well-liked features of the .NET PDF document OCR API and equips anyone working with the API with all the required tools to recognize, extract, and convert text from PDF documents to multiple formats. Let’s take a look at the steps needed to perform OCR on PDF documents and convert the extracted text from PDF to Word format:

Instantiate the AsposeOcr class.
Create an object of the DocumentRecognitionSettings class.
Specify the OCR language.
Call the RecognizePdf() method to get RecognitionResult. Add file path and DocumentRecognitionSettings object as arguments.
Save the text using the SaveMultipageDocument() method. Add the output file path, the SaveFormat, and RecognitionResult objects as arguments

// This code example demonstrates how to OCR PDF documents and save the recognized text as DOCX.
// Initialize the PCR engine
AsposeOcr recognitionEngine = new AsposeOcr();
// Initialize recognition settings
DocumentRecognitionSettings recognitionSettings = new DocumentRecognitionSettings();
// Specify language for OCR. Multi-language by default
recognitionSettings.Language = Language.Eng;
// Recognize text from PDF
List results = recognitionEngine.RecognizePdf("C:\\MyFiles\\test.pdf", recognitionSettings);

// Save the recognized text as Word (DOCX) file
AsposeOcr.SaveMultipageDocument("C:\\Files\\OCR_result.docx", SaveFormat.Docx, results);

Effortlessly Convert Images to Searchable PDFs in .NET

The high-performance .NET OCR API allows developers to extract searchable text from scanned images and photos of different commonly-used image formats such as GIF, PNG, JPG, BMP, and TIFF. If you are looking to integrate OCR capabilities into your .NET apps, the image to text conversion feature could bolster them. Here, we will review the steps required for efficiently converting JPG to PDF using the robust OCR API for .NET:

Set input image file path for text recognition.
Create an AsposeOcr class instance
Use the RecognizeImage method to recognize text in your selected image file.
Save the processed file using the Save method, and add the output file name and the SaveFormat as arguments.
Ensure setting the resultant file to a searchable PDF document using SaveFormat.

// Set path for input image to recognize
string image = "OCR_test.jpg";
// Initialize AsposeOcr class instance
AsposeOcr api = new AsposeOcr();
// Recognize input image with RecognizeImage method
RecognitionResult result = api.RecognizeImage(image, new RecognitionSettings());
// Save output to searchable PDF file
result.Save("OCR_output.pdf", SaveFormat.Pdf);

The GitHub examples page of the Aspose Optical Character Recognition API for .NET includes more working code samples, please feel free to check them out. We also provide Free Online PDF OCR and Image to Text Converter, text recognition, and extract apps. You can use these free OCR apps on the fly using your mobile devices.

Easy-to-use .NET OCR Library for Text Extraction in Photos

Are you having difficulty reading and extracting text from scanned images? If yes, use Aspose.OCR for .NET, as it is the best OCR API for image analysis and text extraction from scanned photos, smartphone images, and more. Using this API, you can programmatically recognize text in images and convert it to different file formats. Furthermore, the image to text conversion API includes advanced functions such as spell-checking and character replacement for visually defective texts. It could easily read such texts, check the spelling, and replace any distorted characters from the misspelled text on the image. While running image OCR, you can provide the link to an image file as a source. It is possible to scan and recognize text from a complete image or only a select part of the image for this process. Proficiently develop full-featured image to text converter apps using the .NET image OCR API and easily convert JPEG to text, PNG to text, BMP to text, and more.

FAQ

1. How can I convert images to text in .NET?

Converting images to text in .NET is quick and easy using the .NET OCR API. Only a few lines of .NET coding are needed to convert JPG, BMP, GIF, PNG, TIFF, and other image files to PDF, Word, Excel, JSON, Plain Text, and XML formats.

2. How long it takes to convert image or PDF files to text?

The best OCR API for .NET works fast and completes your text recognition, extraction, and conversion requests in no time.

3. Is it safe to process scanned images and PDFs using the .NET API?

You can be sure of the security of your data files while processing them using the optical character recognition API for .NET. We ensure the privacy of your data and take all required measures to provide you with a secure user experience.

4. Can I process photos and PDF files on Mac OS, Windows, or Linux?

Yes, the text recognition and image to text conversion API for .NET works across different OS, frameworks, and operating environments. You can use it on the platform of your choice and it does not require any additional software installation to function.

Optical Character Recognition Tools

JPEG to TXT PNG to TXT GIF to TXT BMP to TXT TIFF to TXT JPEG to PDF JPEG to Word JPEG to Excel JPEG to XML PNG to Word PNG to Excel PNG to PDF GIF to Word GIF to PDF BMP to Word BMP to Excel

Powerful .NET OCR API to Perform Text Recognition

Integrate advanced OCR capabilities into your apps with Aspose Optical Character Recognition API for .NET. Seamlessly recognize and extract text from PNG, JPG, GIF, and BMP images, scanned photos, and PDFs, or convert images or PDFs to text.

.NET OCR Engine with Superior Speed and Accuracy

Getting Started

Aspose.OCR for .NET Installation

Efficiently Perform OCR on PDF Documents and Convert Images to Text

Recognize and extract text from PDF documents in .NET

Effortlessly Convert Images to Searchable PDFs in .NET

Easy-to-use .NET OCR Library for Text Extraction in Photos

FAQ

1. How can I convert images to text in .NET?

2. How long it takes to convert image or PDF files to text?

3. Is it safe to process scanned images and PDFs using the .NET API?

4. Can I process photos and PDF files on Mac OS, Windows, or Linux?

Optical Character Recognition Tools

Looking for help?

Resources

Blogs

Documentation

Knowledge Base

Ready to get started?