In this project, we aimed to implement a system that can track a person as they move around the frame or try to leave it. There is usually a tradeoff between placing the camera far from the subject, which reduces the viewers' experience quality, and placing it closer, which limits how much of the board's content is covered. Although there are many platforms that provide free courses, the technology they use is often compromised due to the associated costs.
In some free platform lectures, even though the professors are really enthusiastic, there is usually a crew member asking them to stay within the frame :)
Some other applications in which it could prove useful are:
- With the rise of remote learning and online workspaces, it is important to maintain clarity of the subject and keep the focus on the relevant content.
- In cooking shows/videos, it's essential to have a close-up view of the chef's actions while still covering the other parts of the kitchen.
- Technical assembly procedures and healthcare procedures
- Download the official SD card image from here; the Kria KR260 Getting Started Guide explains the steps to flash the image onto the SD card. To explain a bit: this is similar to putting the Windows or macOS of your computer onto an SD card as an "image"; the SD card is then inserted into the board. The board now has the files required to boot into Ubuntu, set up the file systems, and everything else needed to get Ubuntu running on the board. Make sure to enable the GNOME desktop as well if you want to use the GUI. If you have a DisplayPort cable, you can connect the board to your monitor (check your monitor's port, preferably DisplayPort, and get a cable accordingly) to use this feature. The rest of the login process and setup is explained in the above link.
- A serial terminal like PuTTY or Tera Term can be used. Open the serial terminal of your choice, set the connection type to Serial in PuTTY, and select the COM port (you can find this in Device Manager on Windows) or, as Tera Term likes it, the USB serial port (such as /dev/ttyUSB1, which you can find using lsusb on Linux). After the boot process you will see the login screen.
- Make sure to connect the board to your router over Ethernet to have internet on the board.
Tip: scp <source> <dest_user_name>@<ip_address>:/path_to_destination can be used to copy files from your computer to the board, provided both are on the same network. The -r flag can be used to scp a folder.
Vivado flow for PMOD [refer to Whitney Knitter's posts Getting Started with the Kria KR260 in Vivado 2022.1 and RPi+PMOD Connector GPIO with Custom PL Design in Kria KR260]
[STEP 1] Launch Vivado and create a project:
~$ source <path_to_Vivado_installation>/Vivado/<Vivado_version>/settings64.sh
~$ vivado &
source sets the environment variables for that terminal session. If you close the terminal, you will have to source the settings script again. But if you close Vivado and want to open it again, you do not need to source it every time, provided you are working in the same terminal.
- Select the option under Quick Start for Create Project:
- On the Project Type page, check the option that the project is an extensible Vitis platform if you plan on making a hardware-accelerated design later on.
- In the Default Part page, select Kria KR260 Robotics Starter Kit SOM.
- Click on Connections and for Connector 1 select Robotics Starter Kit Carrier (SOM240_1) and for Connector 2 select Robotics Starter Kit Carrier (SOM240_2), or as per your requirements.
- Click Finish to create the new Vivado project.
[STEP 2] Create a Block Design:
- In the Project Manager window under the IP Integrator tab, select the Create Block Design option and give it the desired name in the following pop up window.
- Click the + button in the Diagram window and add the Zynq UltraScale+ MPSoC IP.
- Add the Clocking Wizard, Processor System Reset, AXI GPIO, and AXI Interrupt Controller IPs.
[STEP 3] Validate and generate Block Design:
- In the Diagram tab of the block design, click the checkbox icon to run a design validation.
- One critical warning appears about the input interrupt not being connected on the AXI interrupt controller, which can safely be ignored in this case.
Tip: Disable Incremental Synthesis for the Synthesis run in Settings under the Flow Navigator window.
- Select Generate Block Design from the Flow Navigator window and change the Synthesis Options from Out of context per IP to Global.
[STEP 4] Create HDL Wrapper:
- In the Sources window, right-click on the block design file and select the option to Create HDL Wrapper.
- Then select the option to let Vivado manage the wrapper and auto-update it before clicking OK in the pop-up window.
[STEP 5] Generate Bitstream:
- Select to generate a bitstream from the Flow Navigator window
- Click OK to launch the runs for synthesis, implementation, and bitstream generation.
[STEP 6] Export Platform for SW Development:
- Select the option to Export Platform from the Flow Navigator window.
- In the platform packaging window, select Hardware for Platform Type (the Kria doesn't currently have emulation support) and be sure to check the option to include the bitstream in Platform State.
[STEP 1] Hardware setup:
The pan-tilt system can be assembled by following this tutorial; the pan-tilt bracket can be purchased here and the servo motors here.
Here is a view of how the assembled 2-axis tilt mechanism looks.
The clamp of the Logitech webcam can be removed by following this tutorial.
Now, we can clamp the webcam to the 2-axis tilt mechanism using rubber bands to avoid using glue :)
This is how the end product looks:
[STEP 2] Rotating the motors using the PYNQ overlay
Transfer the pmod_kria_wrapper.bit and pmod_kria.hwh (found in pmod_kria.gen -> sources_1 -> bd -> hw_handoff) to the Kria board. Make sure to rename both the .bit and .hwh files to the same base name to avoid getting an "unable to parse metadata" error.
Now that we have the pmod_kria block design and the mechanical 2-axis pan-tilt mechanism ready, we can work on the logic that actuates the servos.
Connect GND to the brown wire, 5 V to the red wire, and gpio[2] and gpio[4] to the orange signal wires of the two servos, on whichever PMOD connector you prefer. Refer to this diagram for the PMOD pinout.
Create an Overlay using the pmod_kria.bit and pmod_kria.hwh. Let's create and use the AXI GPIO instance of pmod_1 to access the GPIO pins, as in the sketch below.
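A minimal sketch of this (assuming the AXI GPIO block for PMOD 1 appears as pmod_1 in the overlay's ip_dict, which is how we named it in the block design):
from pynq import Overlay
from pynq.lib import AxiGPIO

# Load the bitstream; the .hwh with the same base name is parsed automatically
overlay = Overlay("pmod_kria.bit")

# Grab channel 1 of the AXI GPIO mapped to PMOD 1 and toggle a pin as a quick wiring test
pmod1 = AxiGPIO(overlay.ip_dict['pmod_1']).channel1
pmod1.setdirection("out")
pmod1[2].write(0x1)  # pull gpio[2] (one servo signal line) high
pmod1[2].write(0x0)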
Assuming we get the DPU working correctly, running the YOLOv3 algorithm gives us the bounding box dimensions (height, width, center_x, center_y) of the objects detected in the 480x640 input image. Hence we came up with an algorithm that keeps track of the camera's current orientation and moves it according to the dimensions of the bounding box generated from the camera image, thereby giving us a feedback loop.
In order to actuate the servo motors (referring to this link), the servo PWM signal should have a frequency of 50 Hz (T = 20 ms). The servo's physical zero angle maps to Ton = 0.2 ms (Toff = 19.8 ms) and the 180-degree angle maps to Ton = 2.4 ms (Toff = 17.6 ms).
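In other words, the pulse width scales linearly with the angle:
Ton(angle) = 0.2 ms + (angle / 180) * 2.2 ms, with Toff = 20 ms - Ton
For example, the 90-degree center position gives Ton = 0.2 + 1.1 = 1.3 ms, which is exactly the mapping used in the code below.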
Here is the algorithm we came up with, to be used once the DPU was integrated.
import time
from pynq import Overlay
from pynq.lib import AxiGPIO

# Load the bitstream; the .hwh with the same base name is parsed automatically
overlay = Overlay("pmod_kria.bit")

# Get the AXI GPIO channel for PMOD 1 and configure its pins as outputs
gpio = AxiGPIO(overlay.ip_dict['pmod_1']).channel1
gpio.setdirection("out")

# Initial angles for vertical and horizontal servos
vertical_angle = 90    # Initial angle for vertical servo
horizontal_angle = 90  # Initial angle for horizontal servo

def drive_servos(vertical_angle, horizontal_angle):
    # Map each angle to its pulse width in ms: 0 deg -> 0.2 ms, 180 deg -> 2.4 ms
    vertical_ton = 0.2 + (vertical_angle * 2.2 / 180)      # Ton for vertical servo
    horizontal_ton = 0.2 + (horizontal_angle * 2.2 / 180)  # Ton for horizontal servo

    # Bit-bang the PWM signals on the GPIO pins for 5 cycles
    for _ in range(5):
        # Pulse the vertical servo on gpio[4]
        gpio[4].write(0x1)
        time.sleep(vertical_ton / 1000)
        gpio[4].write(0x0)
        # Pulse the horizontal servo on gpio[2]
        gpio[2].write(0x1)
        time.sleep(horizontal_ton / 1000)
        gpio[2].write(0x0)
        # Pad the remainder so the full PWM cycle time is 20 ms
        time.sleep((20 - vertical_ton - horizontal_ton) / 1000)

def update_angles(center_x, center_y, width, height):
    global vertical_angle, horizontal_angle

    # Image dimensions
    image_width = 640
    image_height = 480

    # Calculate the center of the image
    image_center_x = image_width / 2
    image_center_y = image_height / 2

    # Dead-band thresholds (in pixels) for angle adjustments
    horizontal_threshold = 50  # Adjust this threshold as needed
    vertical_threshold = 50    # Adjust this threshold as needed

    # Check horizontal position of the bounding box center
    if center_x < image_center_x - horizontal_threshold:
        horizontal_angle -= 5  # Decrease angle to move left
    elif center_x > image_center_x + horizontal_threshold:
        horizontal_angle += 5  # Increase angle to move right

    # Check vertical position of the bounding box center
    if center_y < image_center_y - vertical_threshold:
        vertical_angle -= 5  # Decrease angle to move up
    elif center_y > image_center_y + vertical_threshold:
        vertical_angle += 5  # Increase angle to move down

    # Call the servo control function with updated angles
    drive_servos(vertical_angle, horizontal_angle)
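As a quick usage example (with a hypothetical detection): a bounding box centered at (500, 240) in the 640x480 frame lies right of the horizontal threshold, so the call below bumps the pan servo from 90 to 95 degrees while leaving the tilt servo untouched.
# Hypothetical detection, well right of the 320-pixel horizontal center
update_angles(center_x=500, center_y=240, width=80, height=120)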
Here is a snap of the 2-axis servo tilt mechanism, along with the camera, being driven by the KR260 in action.
Running a face detection model
We chose a Logitech C270 HD USB webcam owing to its ease of integration with the Kria KR260 Robotics Starter Kit. The process was as simple as connecting the USB cable to the board. It can be bought from here.
[STEP 1]: Using the PYNQ DPUOverlay:
Installation instructions for PYNQ can be found here, along with the pynq-helloworld example. You can access the PYNQ Jupyter Notebook by going to a browser of your choice, typing <ip_address>:9090, and logging in with username xilinx and password xilinx.
Tip: An internet connection is not required to access the PYNQ Jupyter Notebook. Just connect the board and your computer with an Ethernet cable. Assign a static IP to the Ethernet interface of your computer by going to Control Panel > Network and Internet > Network and Sharing Center. Now find the Ethernet interface and go to Properties > IPv4; the IP address can be 192.168.2.105 with a subnet mask of 255.255.255.0. The default gateway need not be assigned. The same can be done on Linux using
sudo ifconfig <interface_name> 192.168.2.105
On the board, assign a different static IP in the same network range by typing
sudo ifconfig <interface_name> 192.168.2.102
in the shell. Again, look for the right interface, such as eth0 or eth1, on the board side. In your computer's browser, now type 192.168.2.102:9090 to open up the PYNQ Jupyter Notebook.
[STEP 2]: Running the yolov3 model on the PYNQ Jupyter Notebook:
The YOLO (You Only Look Once) algorithm is a real-time object detection system that identifies multiple objects in an image with a single network pass. It segments the input image into a grid, predicting bounding boxes and class labels for each cell simultaneously. YOLO is celebrated for its rapid processing and efficiency, making it ideal for real-time tasks. It achieves this by leveraging a unified convolutional network for both object detection and classification.
To deploy YOLO on the Kria board, we use the provided example files and create a DpuOverlay. This requires the yolo.xclbin, yolo.bit, and yolo.hwh files. Since generating a custom build that integrates GPIO access with the DpuOverlay wasn't feasible, we used the default gen_platform.tcl file and executed the make flow to produce the necessary files.
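Loading the resulting design from PYNQ then looks roughly like this (a sketch assuming the DPU-PYNQ package is installed; the .xmodel file name below is a placeholder for a YOLOv3 model compiled for the DPU):
from pynq_dpu import DpuOverlay

# Loads yolo.bit and picks up yolo.hwh and yolo.xclbin with the same base name
overlay = DpuOverlay("yolo.bit")
overlay.load_model("yolov3_voc.xmodel")  # placeholder model name
dpu = overlay.runner  # VART runner used to submit inference jobs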
The YOLO algorithm outputs predicted bounding boxes and their dimensions. We capture frames using OpenCV, process them through YOLOv3 on the DPU, and then utilize the bounding box coordinates and dimensions to control the rotation of servo motors.
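Stitched together, the capture-detect-actuate loop looks roughly like the sketch below, where run_yolov3 is a placeholder for the DPU inference and post-processing step that returns boxes as (center_x, center_y, width, height), and update_angles is the servo routine from the previous section:
import cv2

cap = cv2.VideoCapture(0)  # the Logitech C270, typically /dev/video0
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    boxes = run_yolov3(frame)  # placeholder: DPU inference + post-processing
    if boxes:
        cx, cy, w, h = boxes[0]  # track the first detection
        update_angles(cx, cy, w, h)
cap.release()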
As discussed in the previous sections, we were able to run the face detection model and turn the motors using PWM signals. While trying to integrate the two, we faced some issues, which we resolved to some extent.
[ISSUE 1] DPU version mismatch in Vivado: FIXED
- Download the latest version of DPU IP from here
- While running Generate Block Design select Global instead of Out-of-context IP
[ISSUE 2] Vitis platform Vitis_AIE_DIR not found: FIXED
- In Vivado, go to Help > Add Design Tools or Devices and Enable Install Devices for Alveo and edge acceleration platforms and Install Devices for Kria SOMs and Starter Kits
[ISSUE 3] Vitis platform v++ linker error while trying to generate the .xclbin file:
- Export the .xsa from the hardware platform
- Build an application platform and follow the steps from here
- This issue was faced while running the Vector Addition application.
[ISSUE 4] DCP file doesn't exist:
- In this tutorial, while running make BOARD=kr260_som, the DCP file it was looking for in binary_container1 was not found.
- While trying to integrate the DPU using the Vitis flow, we faced issues trying to generate the .xclbin, as mentioned above.
- While trying to obtain the .xclbin file as in this tutorial, running make BOARD=kr260_som hit the same issue.
- While trying to integrate the DPU IP in the Vivado block design, we followed this tutorial using PetaLinux but couldn't implement the whole system.
Face detection using the YOLOv3 algorithm was implemented by capturing frames from the webcam video, and using the Vivado block design we were able to add the AXI GPIOs, generate PWM signals from the PYNQ overlay, and turn the motors. However, integration of the DPU was not possible in the Vitis flow. As future work, we can create a PetaLinux image by integrating the DPU into the Vivado block design and then generating the PetaLinux image and boot components for the board. Using this, we can integrate the DPU with the AXI GPIOs and use the existing Vitis AI libraries (which can be loaded while configuring the PetaLinux root filesystem) for face detection in the video while also turning the motors through the AXI GPIOs.