Published September 7, 2024 © GPL3+

Simple ESP32 CAM Object detection using Open CV

This is certainly the simplest and cheapest object detection system that can serve perfectly for the presentation.

BeginnerFull instructions provided2 hours8,543

Simple ESP32 CAM Object detection using Open CV

Things used in this project

Hardware components

ESP32 Camera module

Software apps and online services

Arduino IDE

Hand tools and fabrication machines

Multitool, Screwdriver

Story

Object detection is a computer vision technique that involves identifying and locating objects within an image or video. It is a fundamental task in various applications, such as surveillance, autonomous driving, and image retrieval.

This time I will explain to you how to make a powerful object detection device that uses only an inexpensive ESP32 Camera module with a built-in FTDI USB to serial converter.

This means that there is no need for any soldering or connection of external components. We only need to connect the Module directly to the USB port of the PC. Basically the whole system consists of two parts

Esp32 Camera module with arduino code installed
and the second part is а pc software, actually Python code that uses Open CV, which is a powerful library for computer vision tasks, including identifying and localizing objects, as well as object detection. In our case are processed series of images received from the camera module.

This project is sponsored by PCBWay. They has all the services you need to create your project at the best price, whether is a scool project, or complex professional project. On PCBWay you can share your experiences, or get inspiration for your next project. They also provide completed Surface mount SMT PCB assemblY service at a best price, and ISO9001 quality control. Visit pcbway.com for more services.

Now I will explain the installation method in order.

First we need to enable camera module support in the Arduino environment. For this purpose we go to Arduino IDE - File - Preferences - where we add the ESP32 URL to "Board Manager URLs" as follows：

(https://raw.githubusercontent.com/espressif/arduino-esp32/gh-pages/package_esp32_index.json)

Now click "Tool-->Board-->Board Manager", and search for "esp32". It is recommended to

install version 2.0.6 or newer ESP32 core. I installed the latest version.

Next, on Arduino IDE -> Tools -> ESP32 Arduino we choose: AI Thihker ESP32-Cam

With this, the procedure for entering support for the specific Camera Мodule in the Arduino IDE is completed.

Next we install the ESP32cam library from the attached.zip file.

We go to Sketch - Include Library - add ZIP library and select the given library

After this we upload the provided Arduino code. Just don't forget to enter the credentials of our Wi-Fi network beforehand in the code. Now in the Arduino Serial Monitor we check if the camera is initialized and working, and we also need to remember the IP address that was assigned to it in the local network because we will need it when starting the Python code.

Next comes the installation of the Python environment section. For this purpose, we go to the Python page, download the latest version, and install it with default settings, noting that we need to mark the checkbox "add python.exe to path"

As I mentioned at the beginning, in order for Python code to work, several necessary libraries need to be installed, namely NumPy, OpenCV and cvlib libraries. For this purpose, we go to the command prompt and execute the following commands

type: pip install numpy and press enter. After the installation is done.
type: pip install opencv-python and press enter.
type: pip install cvlib and press enter

Now we start the Python IDLE editor which is an integral part of the Python installation, or any other Python editor. We go to File - Open - and search for the provided Python code. Let me mention that together with the code, there are three more files that must be located in the same folder as the code. When we open the code, we need to enter the IP address from the camera that was previously given to the Arduino Serial Monitor.

We press RUN, and if we have completed the previous steps, a video from the camera appears on the screen for a few moments, on which various objects surrounded by a rectangular green frame are detected. The name of the detected object is written on the upper part of the frame.

And now let's see how it looks in real conditions. As can be seen from the examples, the system is capable of detecting objects with high precision.

In particular, this system uses a pre-trained object detection model. The file "coco.names" contains the names of the 90+ objects that the YOLOv3 model is trained to detect.

And finally a short conclusion. Object detection is having uses in almost all sorts of industries. It is used for tracking objects, people counting, automated CCTV surveillance, vehicle detection, etc. This is certainly the simplest and cheapest object detection system that can serve perfectly for the presentation of the possibilities of this technology, and for powerful object detection and identification even without using the Python Code we can use the AMB82-Mini IoT AI Camera, which will probably be the subject of analysis in one of my next videos.

#include <WebServer.h>
#include <WiFi.h>
#include <esp32cam.h>

//THIS PROGRAM SENDS IMAGE IF IT IS PLACED IN WEB IP, BUT IF IT IS PLACED IN PYTHON IT SENDS VIDEO THROUGH THE ITERATIONS. . . (IF IT WORKS IN PYTHON)
const char* WIFI_SSID = "ESP Repeater";
const char* WIFI_PASS = "77777777";

WebServer server(80); //server on port 80

static auto loRes = esp32cam::Resolution::find(320, 240); //low resolution
static auto hiRes = esp32cam::Resolution::find(800, 600); //high resolution
//static auto hiRes = esp32cam::Resolution::find(640, 480); //high resolution (for fps rates) (IP CAM APP)

void
serveJpg() //capture image .jpg
{
  auto frame = esp32cam::capture();
  if (frame == nullptr) {
    Serial.println("Capture Fail");
    server.send(503, "", "");
    return;
  }
  Serial.printf("CAPTURE OK %dx%d %db\n", frame->getWidth(), frame->getHeight(),
                static_cast<int>(frame->size()));

  server.setContentLength(frame->size());
  server.send(200, "image/jpeg");
  WiFiClient client = server.client();
  frame->writeTo(client);  //and send to a client (in this case it will be python)
}

void
handleJpgLo()  //allows to send low resolution image
{
  if (!esp32cam::Camera.changeResolution(loRes)) {
    Serial.println("SET-LO-RES FAIL");
  }
  serveJpg();
}

void
handleJpgHi() //allows to send high resolution image
{
  if (!esp32cam::Camera.changeResolution(hiRes)) {
    Serial.println("SET-HI-RES FAIL");
  }
  serveJpg();
}

void setup()
{
  Serial.begin(115200);
  Serial.println();

  {
    using namespace esp32cam;
    Config cfg;
    cfg.setPins(pins::AiThinker);
    cfg.setResolution(hiRes);
    cfg.setBufferCount(2);
    cfg.setJpeg(80);

    bool ok = Camera.begin(cfg);
    Serial.println(ok ? "CAMERA OK" : "CAMERA FAIL");
  }

  WiFi.persistent(false);
  WiFi.mode(WIFI_STA);
  WiFi.begin(WIFI_SSID, WIFI_PASS); //connect to the WiFi network
  while (WiFi.status() != WL_CONNECTED) {
    delay(500);
  }

  Serial.print("http://");
  Serial.print(WiFi.localIP());
  Serial.println("/cam-lo.jpg");//to connect IP low res

  Serial.print("http://");
  Serial.print(WiFi.localIP());
  Serial.println("/cam-hi.jpg");//to connect high res IP
  server.on("/cam-lo.jpg",handleJpgLo);//send to the server
  server.on("/cam-hi.jpg", handleJpgHi);

  server.begin();
}

void loop()
{
  server.handleClient();
}

import cv2 #opencv
import urllib.request #to open and read URL
import numpy as np

#OBJECT CLASSIFICATION PROGRAM FOR VIDEO IN IP ADDRESS

url = 'http://192.168.100.14/cam-hi.jpg'
#url = 'http://192.168.0.159/'
winName = 'ESP32 CAMERA'
cv2.namedWindow(winName,cv2.WINDOW_AUTOSIZE)
#scale_percent = 80 # percent of original size    #for image processing

classNames = []
classFile = 'coco.names'
with open(classFile,'rt') as f:
    classNames = f.read().rstrip('\n').split('\n')

configPath = 'ssd_mobilenet_v3_large_coco_2020_01_14.pbtxt'
weightsPath = 'frozen_inference_graph.pb'

net = cv2.dnn_DetectionModel(weightsPath,configPath)
net.setInputSize(320,320)
#net.setInputSize(480,480)
net.setInputScale(1.0/127.5)
net.setInputMean((127.5, 127.5, 127.5))
net.setInputSwapRB(True)

while(1):
    imgResponse = urllib.request.urlopen (url) # here open the URL
    imgNp = np.array(bytearray(imgResponse.read()),dtype=np.uint8)
    img = cv2.imdecode (imgNp,-1) #decodificamos

    img = cv2.rotate(img, cv2.ROTATE_90_CLOCKWISE) # vertical
    #img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY) #black and white

    

    classIds, confs, bbox = net.detect(img,confThreshold=0.5)
    print(classIds,bbox)

    if len(classIds) != 0:
        for classId, confidence,box in zip(classIds.flatten(),confs.flatten(),bbox):
            cv2.rectangle(img,box,color=(0,255,0),thickness = 3) #mostramos en rectangulo lo que se encuentra
            cv2.putText(img, classNames[classId-1], (box[0]+10,box[1]+30), cv2.FONT_HERSHEY_COMPLEX, 1, (0,255,0),2)


    cv2.imshow(winName,img) #  show the picture

    #wait for ESC to be pressed to end the program
    tecla = cv2.waitKey(5) & 0xFF
    if tecla == 27:
        break
cv2.destroyAllWindows()

Simple ESP32 CAM Object detection using Open CV

Things used in this project

Hardware components

Software apps and online services

Hand tools and fabrication machines

Story

Code

Arduino code

Python code

Libraries

Credits

Mirko Pavleski

Comments

Embed the widget on your own site

Simple ESP32 CAM Object detection using Open CV

Simple ESP32 CAM Object detection using Open CV

Things used in this project

Hardware components

Software apps and online services

Hand tools and fabrication machines

Story

Code

Arduino code

Python code

Libraries

Credits

Mirko Pavleski

Comments

Related channels and tags