This project shows how to transform a Raspberry Pi + Pi Camera powered RC car into one capable of object detection and autonomous driving. To do this we will deploy two deep neural networks: one for object detection and one for autonomous driving, using inference for steering and throttle. An RPi 3 serves as the vehicle computer. Due to the limited resources on the RPi, only one of the networks can run at a time.
The idea is to train the neural network to identify garbage bins so that the car can pick them up autonomously.
The project consists of two parts. In the first part the plan is to use a moderately sized convolutional network to recognize objects in the input video feed from the Pi Camera. TensorFlow will be used to deploy the CNN model and OpenCV will be used for managing the video feed from the Pi Camera.
In the second part we are going to use behavioral cloning to get the car to navigate autonomously. The modified car will also be augmented with additional sensors, such as an ultrasonic distance sensor, a GPS and a 6-DOF IMU, and will gain additional telemetry features.
Introduction
Back in 2017 (a year ago), Google released MobileNet, and earlier this year MobileNetV2. These networks are custom-optimized for working on embedded devices such as smartphones. Conveniently, the processor used on the RPi 3 falls into this category, since it can run either Linux or Android.
The first problem one encounters is the limited RAM and computational capacity of the Raspberry Pi 3. Even though it has a quad-core processor, it is still not enough for the massive weight files needed by YOLO (You Only Look Once) type networks.
The first solution that comes to mind is to send the image acquired by the Pi Camera over WiFi to an external PC, run the object-detection inference there, and then send commands back to the Donkey Car. In essence we would have to contact the mother-ship at every step. This is inefficient, not to mention impractical in scenarios where communication with an external laptop is not possible.
I initially tested a VGG16 network, which was relatively accurate when detecting garbage bins. However, it could not run on the RPi since the weights alone were around 350MB! To test the VGG network, refer to the attached code at the end with similar images as input.
To run the test, issue:
python KerasVGG_test.py
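For reference, a minimal sketch of what such a test script can look like, using the stock keras.applications VGG16 with ImageNet weights (the file name bin.jpg is just a placeholder for your own test image):
# Minimal VGG16 classification sketch (keras.applications API; 'bin.jpg' is a placeholder image)
import numpy as np
from keras.applications.vgg16 import VGG16, preprocess_input, decode_predictions
from keras.preprocessing import image

model = VGG16(weights='imagenet')            # downloads the ImageNet weights on first run
img = image.load_img('bin.jpg', target_size=(224, 224))
x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))
preds = model.predict(x)
print(decode_predictions(preds, top=3)[0])   # top-3 ImageNet classes with probabilities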
So, to solve the problem of huge weight sizes, we are going to run all models on the RPi using slimmer networks. Specifically, we are going to use a MobileNetV2 Single Shot Detector based CNN. This is a DNN that has been optimized to have (relatively) very small weights.
The technique is called transfer learning, since we are using pre-trained network weights.
Before we delve into the software we have to make some hardware modifications.
Hardware
A Magnet car was used for the Donkey Car project. Magnet is an RC (remote controlled) car that operates using a 2.4GHz multi-channel radio. To transform the Magnet into a Donkey Car, a couple of steps have to be undertaken.
1. Dis-assembly
First remove the top cover by removing the clips and the two screws on the back. You will find a cage with two drivers; remove this too, and then remove the outer top cage. Now you have access to the circuits on the car. From the top one can see the receiver, the ESC (Electronic Speed Controller) and the servo.
The receiver is a 4-channel receiver with a BEC (battery elimination circuit). Each channel uses a 3-wire connector. Channels CH3 and CH4 are not used. The ESC takes the battery as input, along with the power switch and the input from receiver channel 1. The servo is connected to channel 0 of the receiver and is used for steering. The steering angle can be trimmed manually if driving via a joystick; otherwise it has to be calibrated.
2. Mount the adapters
Two 3D-printed plastic adapters are used. After removing the two screws and the body top, screw the two adapters in place of the existing clips using the same screws. After replacing the two clips with the two 3D-printed adapters, we can mount the top wooden Donkey Car plate.
Next, screw down the camera handle on the base board plate. Then place the plastic threaded parts into each hole. These are used for securing the Raspberry Pi and the servo controller in place.
3. Servo controller and RPi
Mount the RPi and the servo controller on the wooden plate. I ended up using zip-ties to secure the RPi since I did not want to put a metallic screw near the antenna. After screwing down the servo controller, connect the I2C bus pins from the RPi to the servo controller. Next, take a small knife and cut the zip-ties that hold together the ESC and servo 3-pin wires.
When connecting to the external servo controller, both of the connections to the receiver have to be disconnected from it and connected to channels 0 and 1 of the servo controller, which we will later mount on the Donkey Car top plate.
4. Wooden plate
Mount the wooden plate on the adapters. Now use the clips to attach the Donkey Car plate to the Magnet chassis adapters.
Mount the Donkey Car plate on top and use a short USB cable to connect the USB battery to the RPi. The throttle and steering cables will protrude from the opening in the plate and connect to channels 0 and 1 of the servo controller mounted on the Donkey Car plate.
5. Additional sensors
The main issue with the standard configuration is that there is no sensor to measure speed or distance to obstacles. I added a 6-DOF MPU6050 IMU, which allows the RPi to measure 3D acceleration and rotation. Next I added a GPS to the serial port, as well as an HC-SR04 sensor for distance measurement. The HC-SR04, however, works at 5V and needs a level shifter.
This completes the hardware stage of the unit. The Donkey Car has been fully converted into a 4-wheel vehicle equipped with:
a) Monocular wide-angle camera
b) Servo controller
c) 6-DOF IMU sensor
d) GPS
e) Distance sensor
All the additional sensor readings will be time-stamped upon acquisition and used to augment the training vectors for the deep neural network.
To support the extra sensors, one has to modify the manage.py script to add this functionality.
To use the IMU, I initially tried a Python library for the FXOS8700 on Debian Stretch. This did not work out of the box due to the repeated-start I2C bug of the RPi, so I ended up using an MPU6050, which also comes with a gyroscope.
To test the IMU, use the snippet below:
from IMU import MPU6050
imu = MPU6050()
a = imu.run()
print(a)  # print the accelerometer and gyroscope readings
The following packages need to be installed from within the virtualenv for the MPU6050:
sudo apt install python3-smbus
pip install mpu6050-raspberrypi
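If you want to test the mpu6050-raspberrypi package directly, a minimal sketch of an IMU wrapper compatible with the test snippet above could be saved as IMU.py. It assumes the sensor sits at the default I2C address 0x68; the class and method names here are just illustrative:
# Minimal IMU wrapper sketch built on the mpu6050-raspberrypi package.
# Assumes the sensor is wired to the I2C bus at the default address 0x68.
from mpu6050 import mpu6050

class MPU6050:
    def __init__(self, address=0x68):
        self.sensor = mpu6050(address)

    def run(self):
        # Return accelerations (m/s^2) and angular rates (deg/s) in the order
        # used by the 'imu/acl_*' and 'imu/gyr_*' vehicle outputs below.
        a = self.sensor.get_accel_data()
        g = self.sensor.get_gyro_data()
        return a['x'], a['y'], a['z'], g['x'], g['y'], g['z']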
The meta.json file under the tub folder has to be augmented to support logging IMU data.
{"inputs": ["cam/image_array", "user/angle", "user/throttle", "user/mode", "imu/acl_x", "imu/acl_y", "imu/acl_z","imu/gyr_x", "imu/gyr_y", "imu/gyr_z"], "types": ["image_array", "float", "float", "str", "float", "float", "float","float", "float", "float"]}
The manage.py file also has to be modified as below:
# assuming the Mpu6050 part shipped with DonkeyCar
from donkeycar.parts.imu import Mpu6050

imu = Mpu6050()
V.add(imu, outputs=['imu/acl_x', 'imu/acl_y', 'imu/acl_z', 'imu/gyr_x', 'imu/gyr_y', 'imu/gyr_z'], threaded=True)
# add tub to save data
inputs = ['cam/image_array', 'user/angle', 'user/throttle', 'user/mode', 'imu/acl_x', 'imu/acl_y', 'imu/acl_z', 'imu/gyr_x', 'imu/gyr_y', 'imu/gyr_z']
types = ['image_array', 'float', 'float', 'str', 'float', 'float', 'float', 'float', 'float', 'float']
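For context, in the stock donkey2 template these inputs/types lists are then handed to the tub writer a few lines further down. A rough sketch of that wiring, assuming the DonkeyCar 2.x datastore API and reusing the cfg, V, inputs and types names from manage.py:
# sketch: the augmented inputs/types lists feed the tub writer (DonkeyCar 2.x API assumed)
from donkeycar.parts.datastore import TubHandler

th = TubHandler(path=cfg.DATA_PATH)
tub = th.new_tub_writer(inputs=inputs, types=types)
V.add(tub, inputs=inputs, run_condition='recording')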
Finally, I also added a GPS module to the unit. While this cannot be used indoors, it is useful for outdoor tests in areas where you can connect to a WiFi network.
If one needs to log GPS data, the same modifications as with the IMU have to be implemented.
To use the HC-SR04 distance sensor, one has to install the RPi.GPIO library from the Python environment.
pip install RPi.GPIO
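A minimal sketch of reading the HC-SR04 with RPi.GPIO is shown below. The TRIG and ECHO pin numbers are placeholders for your own wiring, and the 5V echo line must come through the level shifter mentioned above:
# HC-SR04 distance read sketch using RPi.GPIO (pin numbers are example wiring only)
import time
import RPi.GPIO as GPIO

TRIG, ECHO = 23, 24  # BCM pin numbers, adapt to your wiring

GPIO.setmode(GPIO.BCM)
GPIO.setup(TRIG, GPIO.OUT)
GPIO.setup(ECHO, GPIO.IN)

def read_distance_cm():
    GPIO.output(TRIG, True)
    time.sleep(0.00001)            # 10 us trigger pulse starts a measurement
    GPIO.output(TRIG, False)
    start = stop = time.time()
    while GPIO.input(ECHO) == 0:   # wait for the echo pulse to start
        start = time.time()
    while GPIO.input(ECHO) == 1:   # ...and measure how long it stays high
        stop = time.time()
    return (stop - start) * 34300.0 / 2.0  # speed of sound ~343 m/s, round trip halved

print("%.1f cm" % read_distance_cm())
GPIO.cleanup()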
This sums up all the hardware modifications. You end up with a Donkey Car that looks like this:
The idea here is to implement an AI pipeline for object detection running on the RPi. The first step is to deploy an object-detection DNN that works on the RPi 3 without depending on external devices. Let's get started by installing the software needed.
1. Install DNN libs
The project uses TensorFlow and OpenCV. In simple terms, in order to do inference on the Raspberry Pi we use an already trained network. After the weights are loaded, object detection and inference are done for each camera frame.
pip install tensorflow[pi]
pip install matplotlib raspberry
sudo apt-get install libjpeg-dev libtiff5-dev libjasper-dev libpng12-dev
sudo apt-get install libavcodec-dev libavformat-dev libswscale-dev libv4l-dev
sudo apt-get install libxvidcore-dev libx264-dev
sudo apt-get install qt4-dev-tools
pip3 install opencv-python
One thing that needs to be pointed out is that TensorFlow uses a different file format from Keras, which has a relatively simple workflow for loading weights as h5 files.
sudo pip3 install keras --upgrade
Clone the official TensorFlow model repository.
git clone --recurse-submodules https://github.com/tensorflow/models.git
and export the path:
export PYTHONPATH=$PYTHONPATH:/home/pi/tensorflow1/models/research:/home/pi/tensorflow1/models/research/slim
Finally, when everything is installed, and before you restart, issue:
deactivate #to get out of the virtual env workspace if you are using one
sudo shutdown -h now
The next step is to install the Protocol Buffers compiler, which is needed to build the TensorFlow object detection API that we will use with the MobileNetV2 SSD.
2. Install ProtoBuffer compiler
Keras uses a different file format from TensorFlow, so we have to deal with Protocol Buffers, which is the native format for TensorFlow.
I installed version 3.5.1
sudo apt-get install autoconf automake libtool curl
wget https://github.com/google/protobuf/releases/download/v3.5.1/protobuf-all-3.5.1.tar.gz
tar -zxvf protobuf-all-3.5.1.tar.gz
cd protobuf-3.5.1
./configure
Compiling this will take quite a bit of time (~1.5 hours) on the RPi. The other solution is to cross-compile, but we'll keep it simple for now. Issue:
make
Then issue:
make check
sudo make install
cd python
export LD_LIBRARY_PATH=../src/.libs
Finally, build and install the Python bindings:
python3 setup.py build --cpp_implementation
python3 setup.py test --cpp_implementation
sudo python3 setup.py install --cpp_implementation
Now export the environment variables:
export PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=cpp
export PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION_VERSION=3
sudo ldconfig
Finally to test the compiler just type:
protoc
Now we are ready to rumble. This will allow us to compile the object detection protobuf files into a format that the TensorFlow code can use.
3. MobileNetV2 weights
The MobileNetV2 SSD model we are going to use is fetched from this page.
wget http://download.tensorflow.org/models/object_detection/ssdlite_mobilenet_v2_coco_2018_05_09.tar.gz
tar -xzvf ssdlite_mobilenet_v2_coco_2018_05_09.tar.gz
The model we are using is the one with the smallest image dimensions, 160x128. The Protocol Buffers compiler installed above will be used to compile the object detection API's .proto files so that TensorFlow can load and run the model.
4. Set up image detection model on RPi
Here we are going to use SSD (Single Shot Detection) with a MobileNetV2 in order to be able to run the program standalone on the RPi.
Copy the object detection script under /models/research/object_detection and compile the protobuf files:
cd /home/pi/Documents/model/research/
# compile the protobuf definitions used by the SSD MobileNetV2 detection code (run from the research folder)
protoc object_detection/protos/*.proto --python_out=.
The object detection program can be tested and run independently from the autonomous driving program.
To test it just issue:
source ~/.profile
workon cv
python ObjectDetectionDonkeyCar.py
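The heart of such a script is a frozen-graph inference loop. The condensed sketch below is not the actual ObjectDetectionDonkeyCar.py, but shows the same idea using the standard TensorFlow 1.x object detection tensor names; it assumes the Pi Camera is exposed as /dev/video0 via the bcm2835-v4l2 driver so OpenCV can read it:
# Condensed detection-loop sketch (TF 1.x frozen graph + OpenCV)
import cv2
import numpy as np
import tensorflow as tf

PATH_TO_GRAPH = 'ssdlite_mobilenet_v2_coco_2018_05_09/frozen_inference_graph.pb'

graph = tf.Graph()
with graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile(PATH_TO_GRAPH, 'rb') as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name='')

cap = cv2.VideoCapture(0)  # Pi Camera via the V4L2 driver (assumption)
with tf.Session(graph=graph) as sess:
    image_tensor = graph.get_tensor_by_name('image_tensor:0')
    boxes = graph.get_tensor_by_name('detection_boxes:0')
    scores = graph.get_tensor_by_name('detection_scores:0')
    classes = graph.get_tensor_by_name('detection_classes:0')
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)   # model expects a batch of RGB images
        b, s, c = sess.run([boxes, scores, classes],
                           feed_dict={image_tensor: np.expand_dims(rgb, 0)})
        h, w = frame.shape[:2]
        for box, score in zip(b[0], s[0]):             # draw detections above a threshold
            if score < 0.5:
                continue
            y1, x1, y2, x2 = (box * [h, w, h, w]).astype(int)
            cv2.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0), 2)
        cv2.imshow('detections', frame)
        if cv2.waitKey(1) & 0xFF == ord('q'):
            break
cap.release()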
Since the network is trained on a limited number of categories, it only recognizes objects from those categories correctly. I tested it on people, cars, potted plants, bins and bottles.
As you can see it's not very fast, clocking in at 0.8-1.3 frames per second on average. It does, however, run entirely on the Raspberry Pi 3, so not much more can be expected.
And that concludes the object recognition stage on the RPi.
5. Set up Donkey car Software stack
The idea here is to use behavioral cloning to predict the steering angle. This will allow the car to autonomously navigate around a track. To do that we first install DonkeyCar from GitHub, acquire some images during the training phase and then train the model on a laptop. At the end we transfer the model weights to the RPi via SCP and test it on the track.
This task is independent of the object detection model used above, so the two models have to run separately.
We are going to use virtual environments so first load the profile and then start it.
source ~/.profile
workon cv
From the same virtualenv, issue:
pip install donkeycar[pi]
python -c "import donkeycar as dk; print(dk.__version__)"
Now create a DonkeyCar project:
donkey createcar ~/mycar
# or if issuing from anywhere else
donkey createcar --template donkey2 --path ~/home
6. Calibrating
Before training we have to calibrate the car so that the wheels are straight in the default position and the throttle works correctly. I found that steering PWM values between 300 and 400 work fine.
To calibrate the car issue:
# for steering
donkey calibrate --channel 1
#for throttle
donkey calibrate --channel 0
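The PWM values you settle on during calibration end up in the car's config.py. A hedged example of what the relevant entries look like, with illustrative numbers in the 300-400 range mentioned above (use the values you found with donkey calibrate for your own car):
# Example calibration entries in the DonkeyCar config.py (numbers are illustrative only)
STEERING_CHANNEL = 1
STEERING_LEFT_PWM = 400
STEERING_RIGHT_PWM = 300

THROTTLE_CHANNEL = 0
THROTTLE_FORWARD_PWM = 400
THROTTLE_STOPPED_PWM = 370
THROTTLE_REVERSE_PWM = 310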
To train the car to drive autonomously around a track, we need to drive the vehicle manually. Recording 10 laps should be enough. The Pi Camera will grab 160×128 frames, and the software stores those images under the /tub folder while the throttle and steering angles are time-stamped and saved as JSON files. The next steps are to record images while driving manually and then use those images to train the network. Under the tub folder you'll end up with a bunch of images and JSON files that have this format:
{"inputs": ["cam/image_array", "user/angle", "user/throttle", "user/mode", "timestamp"], "types": ["image_array", "float", "float", "str", "str"]}
The throttle and steering data are cross-referenced with the time-stamped camera frames. These files are then fed to a CNN, which learns to infer throttle and steering parameters from new images. The current software also logs acceleration and rotation from the MPU6050 accelerometer and gyroscope; these are currently used only for telemetry. From the acquired data we will now train a model with the deep neural network.
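For reference, each record_*.json in the tub pairs one camera frame with the controls captured at the same moment; a record looks roughly like this (file name and values are illustrative):
{"cam/image_array": "1250_cam-image_array_.jpg", "user/angle": -0.18, "user/throttle": 0.32, "user/mode": "user", "timestamp": "2018-10-06 14:21:33"}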
The first problem one encounters is that there is no WiFi on an open track, so I used an indoor track for training.
Install donkeycar with GPU support on a laptop equipped with an NVIDIA graphics card:
pip install donkeycar[tf_gpu]
To train the car from the current environment on the laptop, I copied the files under the tub folder from the RPi to the Windows Donkey project and issued:
python manage.py train --tub=tub --model=tt1.h5
This does not work, however, since the script gives an error stating:
Cannot import Keras.categorical
The issue is that the default Keras categorical class in the keras.py file is not included when the package is cloned from GitHub. The correct keras.py file is attached below and can be found at:
https://github.com/wroscoe/donkey/blob/master/donkeycar/parts/keras.py
This has to be included in the DonkeyCar package before you install it.
After that, issuing the above command works; as you can see in the image below, the training loss decreases steadily.
The training is done on a laptop equipped with a CUDA-capable GTX 860 with compute capability 5.0.
Initially I started implementing a basic training network on my own. This uses all the acquired images together with the throttle and steering data to train the network. In the end a Keras h5 weight file is obtained, which should be transferred to the RPi using WinSCP or rsync.
The thing to keep in mind here is that the deeper the network, the larger the weights, so you have to strike a balance. In the end the default network provides a good compromise, so I ended up using that.
Algorithm
Since throttle and steering are continuous floating-point values, the algorithm splits these two values into discrete bins. Each bin is then matched with images and passed to the network as key-value pairs. The network then learns to control the throttle and steering based on the features in the images, in this case the rope.
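A minimal sketch of that binning step, written in the spirit of the linear_bin helper in the DonkeyCar utilities (which maps a steering value in [-1, 1] onto 15 one-hot bins); the function name and bin count here are illustrative:
# Sketch: bin a continuous steering value into a one-hot categorical target
import numpy as np

def steering_to_bin(angle, n_bins=15):
    # map angle from [-1, 1] to a bin index 0..n_bins-1
    idx = int(round((angle + 1.0) / 2.0 * (n_bins - 1)))
    one_hot = np.zeros(n_bins)
    one_hot[idx] = 1.0
    return one_hot

print(steering_to_bin(-1.0))  # leftmost bin
print(steering_to_bin(0.0))   # centre bin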
Performance
Object detection is done simply by running the MobileNet TensorFlow DNN. Change directory to /models/research/object_detection and run the Python script by issuing:
python ObjectDetection.py
The normal frame rate for object detection is on average around 0.8 frames per second since this runs natively on the RPi, although sometimes I can get 1.3-1.4 fps.
Obviously you are not going to get state-of-the-art detection with a network like MobileNetV2. However, in comparison to much bigger networks such as VGG16 or ResNet, its performance is excellent for its size.
While running, this script takes around 78-80% of the RPi's resources with no video recording.
Future improvements I would like to implement are:
a) Use a Kalman filter for sensor fusion
b) Implement a DNN for depth detection
c) Add Lidar, or an array of ultrasonic transceivers
d) Upgrade battery and ESC
Conclusion
This project showcased how to build an autonomous AI capable of object detection and autonomous driving based on behavioral cloning. An SSD MobileNetV2 DNN was leveraged for object detection and the DonkeyCar stack was used for autonomous driving.