Working Assistant with W5100S-EVB-Pico

This project uses W5100S-EVB-Pico and Arducam to deliver images to a webapp and identify my work patterns through object recognition with AI

BeginnerFull instructions provided1 hour182

Story

W5100S-EVB-PICO-WORKINGASSISTANT

0. Original Link

For more information, please visit the original project page.

1. Project Overview

This is a work assistance project utilizing W5100S-EVB-PICO and Arducam.

2. Project Details

The project involves capturing images every 5 seconds through a camera connected to Pico and transmitting them via ethernet.
The transmitted images are processed using a Fine-tuned YOLOv8 model to perform object detection on the user's work status. The model is trained to recognize five classes: Normal, Drowsiness, Yawning, Distraction, and Mobile usage.
The recognized objects are counted to analyze work patterns, and the results are transmitted and displayed on a web interface.

3. Code and Libraries

The code was written using Thonny IDE and CircuitPython.
Copy the lib folder, boot_out.txt, and code.py files from the repo to W5100S-EVB-PICO and run code.py.
Refer to the link above to build the hardware by combining the W5100s-evb-pico board with the arducam and circuitpython to get the webcam working.
We used the Bundle for Version 7.x of the CircuitPython libraries, and for the Adafruit_CircuitPython_wiznet5k library, we used the 1.12.15 release version.

Download links:

The code from the previous project Upscaling Image with AI using W5100S-EVB-PICO and Arducam was refined and simplified using ChatGP

The ArduCam OV2640 Module requires CS, MOSI, MISO, SCLK pins for SPI connection, and SDA, SCL pins for I2C connection. This project modified the source code of ArduCam to use SPI1.

SPI1 configuration for ArduCam OV2640:

CS --> GPIO 13
MOSI --> GPIO 11
MISO --> GPIO 12
SCLK --> GPIO 10

I2C configuration for ArduCam OV2640:

SDA --> GPIO 8
SCL --> GPIO 9

4. YOLOv8 Model Training

YOLOv8 Model

YOLOv8 is a powerful object detection model. It is the latest iteration in the YOLO series of models. YOLOv8 is designed to be fast and accurate, with a focus on simultaneous detection and classification. It is capable of detecting multiple objects in an image or video and classifying them in real-time, making it highly effective for many computer vision tasks.

📷 📷

Nano Model

The Nano model is a smaller and faster version of the standard model. It is designed for efficiency and can run at higher speeds with less computational resources, making it ideal for deployment on devices with limited computational power such as mobile devices and embedded systems.

Usage

To use these models in your project, you can install the Ultralytics package in a Python>=3.8 environment with PyTorch>=1.7 using the following command:

pip install ultralytics

Dataset: Roboflow Project

Model: Ultralytics's YOLOv8 Nano (fast and small)

mAP50: 0.857, mAP50-95: 0.498.

We utilized the free GPU provided by Google Colab.

Google Colab Notebook.

Inference: 📷

5. Inference to Web

Run the working_assistant.py file.

Code Explanation:

Setup: The script first sets up a Flask application and defines some global variables. url is the URL from which images are fetched. save_dir is the directory where fetched images are saved. inference_dir is the directory where images after inference are saved. results is a dictionary that keeps track of the count of different object detection results.
fetch_image_and_inference(): This function runs in an infinite loop. In each iteration, it fetches an image from the URL, saves it to the save_dir, runs a YOLO object detection model on the image, and updates the results dictionary based on the output of the model. The function then waits for 10 seconds before starting the next iteration.
serve_image(): This is a Flask route that serves an HTML page. The page contains an image element that displays the latest inferred image and several paragraph elements that display the counts of different object detection results. The page refreshes every 10 second
get_inferred_image(): This is another Flask route that serves the latest inferred image. The image is fetched from the inference_dir.
Main Execution: If the script is run as the main program, it starts the fetch_image_and_inference() function in a new thread and then starts the Flask application.

6. Similar Projects

For other similar projects, please visit Upscaling image with AI using W5100S-EVB-Pico and Arducam.

Working Assistant with W5100S-EVB-Pico