Getting Coordinate and Cropping an Image with OpenCV

ByFirhan Maulana Rusli June 6, 2020December 31, 2023

OpenCV is popular library for computer vision. OpenCV is an open source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in the commercial products.

OpenCV uses machine learning to detect object/faces in picture. for detecting face, the algorithm start from top left on picture unltil right down on the picture.

Why we use OpenCV for getting the cordinate?

there’se so much way to goes to Roma, same as here, there so much way to getting the cordinate. We can use countours detection or maybe R-cnn to get the cordinate. But for this situation we will use Opencv and Cascades.

First of all we need to download the xml of haar-cascades, here is the link

https://github.com/shantnu/FaceDetect/blob/master/haarcascade_frontalface_default.xml

After that we need to import some library:

import cv2
import matplotlib.pyplot as plt
from PIL import Image
%matplotlib inline

And then import the xml files:

cascPath = "haarcascade_frontalface_default.xml"

And then create the haar-cascade:

faceCascade = cv2.CascadeClassifier(cascPath)

After that, just import the image that we want to use:

path = "/content/ktp3.png"
image = cv2.imread(path)
image_crop = Image.open(path)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

After that, detect the face in the image:

faces = faceCascade.detectMultiScale(
    gray,
    scaleFactor=1.1,
    minNeighbors=2,
    minSize=(40, 60),
    flags = cv2.CASCADE_SCALE_IMAGE
)

print("Found {0} faces!".format(len(faces)))

Here is some explanation about the parameters:

Parameters: cascade – Haar classifier cascade (OpenCV 1.x API only). It can be loaded from XML or YAML file using Load(). When the cascade is not needed anymore, release it using cvReleaseHaarClassifierCascade(&cascade).
image – Matrix of the type CV_8U containing an image where objects are detected.
objects – Vector of rectangles where each rectangle contains the detected object.
scaleFactor – Parameter specifying how much the image size is reduced at each image scale.
minNeighbors – Parameter specifying how many neighbors each candidate rectangle should have to retain it.
flags – Parameter with the same meaning for an old cascade as in the function cvHaarDetectObjects. It is not used for a new cascade.
minSize – Minimum possible object size. Objects smaller than that are ignored.
maxSize – Maximum possible object size. Objects larger than that are ignored.

Parameters:	cascade – Haar classifier cascade (OpenCV 1.x API only). It can be loaded from XML or YAML file using `Load()`. When the cascade is not needed anymore, release it using `cvReleaseHaarClassifierCascade(&cascade)`. image – Matrix of the type `CV_8U` containing an image where objects are detected. objects – Vector of rectangles where each rectangle contains the detected object. scaleFactor – Parameter specifying how much the image size is reduced at each image scale. minNeighbors – Parameter specifying how many neighbors each candidate rectangle should have to retain it. flags – Parameter with the same meaning for an old cascade as in the function `cvHaarDetectObjects`. It is not used for a new cascade. minSize – Minimum possible object size. Objects smaller than that are ignored. maxSize – Maximum possible object size. Objects larger than that are ignored.

And then display the image:

for (x, y, w, h) in faces:
    cv2.rectangle(image, (x, y), (x+w, y+h), (0, 255, 0), 2)

plt.imshow(image)

And the result would be like this:

And then for cropping the image, we just need to use the cordinate that we just got from the image detector, it would be like this:

im_crop = image_crop.crop((x, y, (x+w), (y+h)))
plt.imshow(im_crop)

The result would be like this:

Here is some source that might be help:

Firhan Maulana Rusli

Linkedin : www.linkedin.com/in/firhan-rusli

Artificial Intelligence | Chatbot | RASA

Forms in RASA

ByAngela Marpaung May 30, 2020December 31, 2023

When building a chatbot as conversational assistant, we may need some of user’s information in order to answers and give suggestions to the user in a right context.This proccess of collecting the user’s required information is called slot filling. (If you want to understand better about slots, you may want to check out the previous…

Artificial Intelligence | Chatbot | RASA

Build Your First RASA Chatbot

ByAngela Marpaung May 7, 2020December 31, 2023

What is chatbot? Chatbot is an Artificial Intelligence (AI) agent that can understand as well as repond to human conversation. Chatbot can be deployed in many platform such as websites, mobile apps, and messaging channels. Chatbot works in a way that it is able to immitate human conversations and make it seems like they are…

Data Science | Internships

Meeting 11/04/2020 – Amran – Scraping web

ByAmran April 11, 2020December 31, 2023

Overview : Scraping detail page Challenges : Struktur page tiap halaman berbeda-beda Baca-baca artikel tentang teknik penanganan struktur yang berbeda dengna tag yang sama Feedback : https://gitlab.com/lovia/jobsid-crawler/-/issues/2 Next Steps : Insert or update MongoDB jobPosting collection from JobPosting object Wrap crawler into Zeebe worker using zeebe-grpc (Python) Store HiringOrganization Logo Image to S3+MongoDB Run Workflow…

Artificial Intelligence | Chatbot | Natural Language Processing (NLP) | RASA

RASA Form Actions: Calling SatukanCinta API

ByAngela Marpaung July 13, 2020December 31, 2023

Hi there! This post is the continuation of the blog post before. In this post, I’ll be giving a tutorial in getting the information if user has registered an account in SatukanCinta. So the information needed to check if the user has registered an account is full name, email, and also phone number.Here we are…

Artificial Intelligence | Computer Vision | Data Science | Natural Language Processing (NLP)

Confidence in KTP-OCR using Pytesseract

ByFirhan Maulana Rusli July 18, 2020December 31, 2023

In previous blog, we already learn how to crop an image https://about.lovia.id/getting-cordinate-and-cropping-an-image-with-opencv/. Then we will learn how to got confidence using pytesseract, After much searching, there was some some ways to got confidence in my KTP-OCR. Pytesseract give us a lot of syntax that can we use, such as : #this line of code will…

Artificial Intelligence | Chatbot | Natural Language Processing (NLP)

Components in RASA NLU

ByAngela Marpaung June 8, 2020December 31, 2023

In RASA, user messages is excecuted for every sequence of components. All components executed in RASA can be customized to meet any requirements in pipeline defined in config.yml file. We can even build our own (custom) component in RASA NLU. Configurating the Right Components Every components have different functions whether its for pre-processing text, intent…

One Comment

Pingback: Confidence in KTP-OCR using Pytesseract - About Lovia

Comments are closed.

Similar Posts

One Comment