How to Implement Optical Character Recognition in Python

Python Programming (136 Blogs) Become a Certified Professional

Optical Character Recognition is vital and a key aspect and python programming language. The application of such concepts in real-world scenarios is numerous. In this article, we will discuss how to implement Optical Character Recognition in Python

Applications of Optical Character Recognition

Ticket counters use this extensively for scanning and detecting of key information on the ticket to track routes and commuters details. Conversion of paper text into digital formats where cameras capture high-resolution photographs and then OCR is used to bring them into a word or a PDF format.

The introduction of OCR with python is credited to the addition of versatile libraries like “Tesseract” and “Orcad”. These libraries have helped many coders and developers to simplify their code design and allow them to spend more time on other aspects of their projects. Since the benefits are enormous, let us peek into what it is and how it is done.

Building an Optical Character Recognition in Python

We first need to make a class using “pytesseract”. This class will enable us to import images and scan them. In the process it will output files with the extension “ocr.py”. Let us see the below code. The function block “process_image” is used to sharpen the text we get.

The following route handler and view function are added to the application (app.py).

Router Handler Code

//ROUTE HANDLER
@app.route('/v{}/ocr'.format(_VERSION), methods=["POST"])
def ocr():
    try:
        url = request.json['image_url']
        if 'jpg' in url:
            output = process_image(url)
            return jsonify({"output": output})
        else:
            return jsonify({"error": "only .jpg files, please"})
    except:
        return jsonify(
            {"error": "Did you mean to send: {'image_url': 'some_jpeg_url'}"}
        )

OCR Engine Code

// OCR ENGINE
import pytesseract
import requests
from PIL import Image
from PIL import ImageFilter
from StringIO import StringIO

def process_image(url):
    image = _get_image(url)
    image.filter(ImageFilter.SHARPEN)
    return pytesseract.image_to_string(image)

def _get_image(url):
    return Image.open(StringIO(requests.get(url).content))
//

Please ensure to update the imports and add API version number.

import os
import logging
from logging import Formatter, FileHandler
from flask import Flask, request, jsonify

from ocr import process_image
_VERSION = 1  # API version

We are adding in the JSON response of the OCR Engine’s function that is “process_image()”. JSON is used for gathering information going into and out of the API. We pass the response in an object file using “Image” library from PIL to install it.

Please note that this code only performs best with .jpg images only. If we use complex libraries that can feature multiple image formats then all images can be processed effectively. Also note, in case you are interested in trying out this code by yourself, then please install PIL which is procured out of “Pillow” library first

• Start out by running the app, which is “app.py”:

//
$ cd ../home/flask_server/
$ python app.py
//

• Then, in another terminal run:

//$ curl -X POST http://localhost:5000/v1/ocr -d '{"image_url": "some_url"}' -H "Content-Type: application/json"

For Example:

//
$ curl -X POST http://localhost:5000/v1/ocr -d '{" C:UsersakashDownloadsPic1 ": "https://edureka.com/images/blog_images/ocr/ocr.jpg"}' -H "Content-Type: application/json"
{
  "output": "ABCDEnFGH I JnKLMNOnPQRST"
}
//

Advantages and Disadvantages of OCR Engine

Out of the many applications of using OCR in python, the popular one is handwriting recognition. People apply this is to recreate written text which can then be populated into numerous copies rather than just photocopying the original script. This is to bring about uniformity and legibility.

OCR is also useful in converting PDF’s to texts and store them as variables. This can later be then subjected to any amount of pre-processing for additional tasks. Although the concept of OCR seems to be a beneficial topic in the world of Python, it sure does share its part of disadvantages.

The OCR cannot always guarantee 100% accuracy. A lot of hours of training need to be applied with the help of Artificial Intelligence concepts which can enable the OCR engine to learn and recognize poor images. Handwriting images can be recognized but they depend on several factors like the style of the writing, the color of the page, the contrast of the image and the resolution of the image.

With this, we come to an end of this Optical Character Recognition in Python article. I hope you go an understanding of how exactly OCR works.

To get in-depth knowledge on Python along with its various applications, you can enroll now for the best Python course training with 24/7 support and lifetime access.

Got a question for us? Mention them in the comments section of “Optical Character Recognition in Python” and we will get back to you.

Introduction to Python

Python Installation

Python Fundamentals

Python OOPs

Python Libraries

Web Scraping

Django

Python Programs

Career Oppurtunities

Interview Questions

Data Science

How to Implement Optical Character Recognition in Python

Applications of Optical Character Recognition

Building an Optical Character Recognition in Python

Advantages and Disadvantages of OCR Engine

Recommended videos for you

Web Scraping And Analytics With Python

Application of Clustering in Data Science Using Real-Time Examples

Business Analytics with R

Diversity Of Python Programming

Linear Regression With R

Python for Big Data Analytics

Mastering Python : An Excellent tool for Web Scraping and Data Analysis

Python Loops – While, For and Nested Loops in Python Programming

Sentiment Analysis In Retail Domain

Introduction to Business Analytics with R

The Whys and Hows of Predictive Modelling-I

Python Tutorial – All You Need To Know In Python Programming

Python Classes – Python Programming Tutorial

3 Scenarios Where Predictive Analytics is a Must

Data Science : Make Smarter Business Decisions

Know The Science Behind Product Recommendation With R Programming

Python List, Tuple, String, Set And Dictonary – Python Sequences

Machine Learning with Python

Python Numpy Tutorial – Arrays In Python

Business Analytics Decision Tree in R

Recommended blogs for you

How To Install OpenCV Python On Windows

What is Alpha Beta Pruning in Artificial Intelligence?

Support Vector Machine In R: Using SVM To Predict Heart Diseases

How To Input a List in Python?

A Comprehensive Guide To Naive Bayes In R

Init In Python: Everything You Need To Know

What is Python JSON and How to implement it?

Top 10 Reasons to Learn R

Introduction To Supervised Learning

What is Data Analytics? Introduction to Data Analysis

Machine Learning Career and Future Scope

What Are GANs? How and why you should use them!

Types of Data Scientist

Java vs Python : Comparison between the Best Programming Languages

SciPy Tutorial: What is Python SciPy and How to use it?

Data Analytics Projects: 9 Project Ideas for Your Portfolio

Future Scope of Data Science in 2025

Python Requests Tutorial: GET and POST Requests in Python

Time Series Forecasting: Mastering Techniques and Applications

Introduction To Markov Chains With Examples – Markov Chains With Python

Join the discussionCancel reply

Trending Courses in Data Science

Python Programming Certification Course

Data Science with Python Certification Course

Data Science and Machine Learning Internship ...

Statistics Essentials for Analytics

SAS Training and Certification

Data Analytics with R Programming Certificati ...

Data Science with R Programming Certification ...

Advanced Python for Data Analytics by PwC Aca ...

Analytics for Retail Banks

Decision Tree Modeling Using R Certification ...

Browse Categories

Subscribe to our Newsletter, and get personalized recommendations.

How to Implement Optical Character Recognition in Python