Confidence in KTP-OCR using Pytesseract

ByFirhan Maulana Rusli July 18, 2020December 31, 2023

In previous blog, we already learn how to crop an image https://about.lovia.id/getting-cordinate-and-cropping-an-image-with-opencv/. Then we will learn how to got confidence using pytesseract, After much searching, there was some some ways to got confidence in my KTP-OCR. Pytesseract give us a lot of syntax that can we use, such as :

#this line of code will extract your image into string
print(pytesseract.image_to_string(Image.open('test.png')))

# Batch processing with a single file containing the list of multiple image file paths print(pytesseract.image_to_string('images.txt'))

# Get information about orientation and script detection 
print(pytesseract.image_to_osd(Image.open('test.png')))

And many others, to got the confidence, pythesseract already give line of code, it was:

text1 = pytesseract.image_to_data(Image.open('test.png'))

This line of code will output confidence, boxes on image, page number, line number, etc. This code give us the confidence each word not each line, so i will change it then we will got the confidence each line.

text = text1[text1.conf != -1]
lines = text.groupby('block_num')['text'].apply(list)
conf = text.groupby(['block_num'])['conf'].mean()

print(text)
print(lines)
print(conf)

the output would be like this:

and if u want to see the box the boxes of text on the image, just use this code:

n_boxes = len(text1['text'])
for i in range(n_boxes):
    if int(text1['conf'][i]) > 60:
        (x, y, w, h) = (text1['left'][i], text1['top'][i], text1['width'][i], text1['height'][i])
        img = cv2.rectangle(img, (x, y), (x + w, y + h), (0, 255, 0), 2)

plt.imshow(img)

here some source that might help:

Firhan Maulana Rusli

Linkedin : www.linkedin.com/in/firhan-rusli

Artificial Intelligence | Computer Vision | Internships

Image Scoring oleh Angky

ByAngky Musa April 11, 2020December 31, 2023

Proyek yang dinamakan Image Scoring ini berfungsi sebagaimana namanya, yaitu memberikan nilai terhadap gambar digital yang diinput oleh user. Adapun penilaian yang dimaksud mencakup: Black-white image detection : mendeteksi apakah gambar tersebut merupakan gambar hitam-putih atau bukan. Cartoon image detection : mendeteksi apakah gambar tersebut merupakan gambar kartun atau bukan. Face detection : mendeteksi apakah…

Artificial Intelligence | Chatbot | Natural Language Processing (NLP) | RASA

RASA Form Actions: Calling SatukanCinta API

ByAngela Marpaung July 13, 2020December 31, 2023

Hi there! This post is the continuation of the blog post before. In this post, I’ll be giving a tutorial in getting the information if user has registered an account in SatukanCinta. So the information needed to check if the user has registered an account is full name, email, and also phone number.Here we are…

Artificial Intelligence | Chatbot | Natural Language Processing (NLP) | ngrok | RASA

Integrating RASA chatbot assistant to Facebook Messenger

ByAngela Marpaung June 15, 2020December 31, 2023

Here is tutorial on how we can integrate RASA chatbot assistant to Facebook Messenger. Here we’ll be using ngrok to expose a local server on the internet, so make sure you have installed ngrok before.If you haven’t installed ngrok, you can install it here. Make sure you have created an account in facebook for developers…

Artificial Intelligence | BPMN Workflow Automation

Importance of BPMN for Companies Development

ByAngela Marpaung April 27, 2020December 31, 2023

BPMN (Business Process Modelling Notation) is a method used to visualize business processes and is mainly used by enterprises to help relevant stakeholders understand their job. BPMN is also a tool that can be used not only for visualizing current state but also future state of business process. And how does BPMN benefits enterprises? As…

Artificial Intelligence | Chatbot | Natural Language Processing (NLP)

Components in RASA NLU

ByAngela Marpaung June 8, 2020December 31, 2023

In RASA, user messages is excecuted for every sequence of components. All components executed in RASA can be customized to meet any requirements in pipeline defined in config.yml file. We can even build our own (custom) component in RASA NLU. Configurating the Right Components Every components have different functions whether its for pre-processing text, intent…

Artificial Intelligence | Chatbot | RASA

Forms in RASA

ByAngela Marpaung May 30, 2020December 31, 2023

When building a chatbot as conversational assistant, we may need some of user’s information in order to answers and give suggestions to the user in a right context.This proccess of collecting the user’s required information is called slot filling. (If you want to understand better about slots, you may want to check out the previous…

Similar Posts