Introduction
Nowadays, intelligent household appliances such as washing machines or refrigerators have become indispensable in everyday life. Welfare robotics aspires to the same status: it develops robots that specialize in caring for people with some degree of dependency, such as nurse robots that remind patients to take their medicines. The aim is to integrate these robots into the home as if they were just another appliance, so that living with them becomes ordinary. Robots of this type run complex algorithms that allow them to interact with and identify people or recommend activities; however, the vast majority are large and expensive.
With the latest advances in IoT devices, which offer greater computing capacity and can run small neural networks, their applications in this area are becoming very attractive. Moreover, in recent years robotics has been simplified to the point that it is accessible to almost anyone. In fact, it is now common for people with little knowledge of programming and electronics to design and build their own robot, thanks to the rise of open-source prototyping platforms.
In this context, this project set out to build a low-cost assistant robot around an IoT device. The Sony Spresense development board was used for this purpose (Figure 1).
This board incorporates a six-core CXD5602 processor, a camera interface, and a high-resolution audio subsystem: a 192 kHz/24-bit advanced audio codec and amplifier for audio output, plus support for up to 8 microphone input channels. It also integrates GNSS with support for GPS, QZSS and GLONASS, enabling applications that require positioning. These features make this development board a useful and ideal tool for robotics, IoT and tracking applications.
Project Description
According to the World Health Organization (WHO), the number of people over 60 years of age is expected to double by 2050. This creates a logistical problem in assisting these people, since the demand for personnel trained to care for older adults will increase. Robotics is a good option for meeting this need; there are already robots on the market that remind people of their medication schedule. These are generally stationary robots, i.e. they stay fixed in one specific place in the home.
The robot presented in this project is a first approximation of a companion robot, which can be used either as a stationary robot or as a robot that accompanies the user everywhere. Several robots of this type already exist; one of the first was the Archimedes robot (https://www.hackster.io/glowascii/archimedes-the-ai-robot-owl-325ff5), followed by the Asi robot.
Robot Construction
The construction of the robot was divided into three phases. The first was the mechanical design of the robot using 3D design software (SolidWorks). The second phase focused on low-level programming, i.e. programming the Spresense (camera control, WiFi access, servo motor control). Finally, the last phase focused on programming the Flask-based web service, which is in charge of analyzing the information gathered by the robot.
Design and Construction of the Robot
SolidWorks was used to create the 3D model of the robot, which was then 3D printed. The robot has two parts: a head, which houses the electronics, and a base that allows the head to move.
Figure 2 shows the printed robot. It has two eyes and a mouth, where the camera has been placed (Figure 3).
The Spresense development board does not have a WiFi module, so an ESP-8266 was added. This module allows the robot to send its information to the web service; Figure 4 shows the location of the ESP-8266 inside the head.
In order to accommodate the electronics inside the head, a piece was designed to hold the Spresense (Figure 5).
To perform the robot's movements, a joint was incorporated in the neck, which allows the head to move from left to right and forward and back (Figure 6).
Spresense Programming
The Spresense is a development board with a multicore processor and integrated peripherals such as a camera, high-definition audio and GPS. However, it has no built-in communication modules (WiFi or Bluetooth) to talk to the outside world. For this reason, an ESP-8266 was added, through which the system sends the acquired data to the web service. The information to send includes the image acquired by the camera, the GPS position and readings from other sensors.
One of the problems that arose with this communication was sending the image. The camera captures 640x480 images, so the image has to be resized. This option is not provided by the Spresense camera library, so the JPEGDecoder library was used instead, modified to work with the Spresense. It is important to note that the image has to be sent in grayscale rather than RGB, and it must be sent in packets that do not exceed 1024 bytes, which is the maximum size of the ESP-8266's send buffer.
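As a rough illustration of this framing, the following Python sketch shows how a resized grayscale frame could be split into packets that respect the 1024-byte limit. The real robot performs this step in its Arduino sketch on the Spresense, and the 160x120 target size used here is only an assumption for the example.

```python
import numpy as np

MAX_PACKET = 1024  # ESP-8266 send-buffer limit described above

def packetize_frame(gray_frame: np.ndarray) -> list:
    """Split a grayscale frame into <=1024-byte packets,
    preceded by its dimensions and followed by an END marker."""
    h, w = gray_frame.shape
    payload = gray_frame.astype(np.uint8).tobytes()
    packets = [f"{w},{h}".encode()]                     # size header sent first
    packets += [payload[i:i + MAX_PACKET]               # raw pixel chunks
                for i in range(0, len(payload), MAX_PACKET)]
    packets.append(b"END")                              # end-of-image marker
    return packets

# Example: a 160x120 frame (assumed resize target) needs 19 pixel packets.
frame = np.zeros((120, 160), dtype=np.uint8)
print(len(packetize_frame(frame)))  # header + 19 chunks + END = 21
```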
These images are sent to the web service developed in Flask, which is in charge of reconstructing the image sent by the robot. To reconstruct the image, the robot first sends the size of the image, then the data for each pixel, and finally an "END" marker to indicate that the transfer has finished.
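A minimal sketch of the reconstruction side is shown below, assuming a simple Flask route that accumulates the incoming packets. The route name, the per-robot buffer and the JSON responses are assumptions for illustration, not the project's exact API.

```python
import numpy as np
from flask import Flask, request

app = Flask(__name__)
buffers = {}  # per-robot accumulation of incoming packets

@app.route("/upload/<robot_id>", methods=["POST"])
def upload(robot_id):
    chunk = request.get_data()
    state = buffers.setdefault(robot_id, {"size": None, "pixels": b""})
    if state["size"] is None:                       # first packet: "width,height"
        w, h = map(int, chunk.decode().split(","))
        state["size"] = (w, h)
    elif chunk == b"END":                           # last packet: rebuild the image
        w, h = state["size"]
        image = np.frombuffer(state["pixels"], dtype=np.uint8).reshape(h, w)
        buffers.pop(robot_id)
        # "image" is now ready for the analysis described in the next section
        return {"status": "received", "shape": list(image.shape)}
    else:                                           # intermediate packet: raw pixels
        state["pixels"] += chunk
    return {"status": "ok"}
```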
Web Service
Once the reconstruction is done, the system analyzes the image, identifying objects or detecting faces. For these tasks the system uses different artificial intelligence algorithms, trained on public databases such as CIFAR-100 (https://www.cs.toronto.edu/~kriz/cifar.html). In addition, we created a database of objects found around the home, such as spoons, lamps, tables, etc. All of these images were pre-processed and resized to 32x32 pixels to facilitate the training of the different neural networks.

The system also allows the identification of people and the classification of emotional states. To perform this task, the web service needs to locate the face within the image. Using the "dlib" library for Python, it is possible to detect the face and extract the main characteristics for identifying the person. To carry out the identification, the face image is encoded as a one-dimensional vector of size 1x128. This encoding allows the robot to recognize a person from a single image; compared with other techniques (training a dedicated neural network with deep learning), it is faster and does not require lengthy training. Figures 7 and 8 show the vectors for two users, and Figure 9 shows the comparison between them. For identification, the system calculates the distance between the two vectors; if this distance is below a threshold, the two images belong to the same person.
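The sketch below illustrates this identification step using dlib's pretrained face detector, landmark predictor and ResNet-based encoder. The model files are downloaded separately from dlib.net, the file names here are the standard ones, and the 0.6 threshold is the commonly used default rather than a value taken from this project.

```python
import numpy as np
import dlib

# Pretrained dlib models (downloadable from dlib.net); paths are placeholders.
detector = dlib.get_frontal_face_detector()
shape_predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")
encoder = dlib.face_recognition_model_v1("dlib_face_recognition_resnet_model_v1.dat")

def encode_face(path):
    """Return the 1x128 descriptor for the first face found in the image."""
    img = dlib.load_rgb_image(path)
    face = detector(img, 1)[0]                 # locate the face
    landmarks = shape_predictor(img, face)     # extract facial landmarks
    return np.array(encoder.compute_face_descriptor(img, landmarks))

def same_person(vec_a, vec_b, threshold=0.6):
    """Compare two encodings by Euclidean distance against a threshold."""
    return np.linalg.norm(vec_a - vec_b) < threshold

# Usage: one reference image per user is enough for identification.
known = encode_face("user_reference.jpg")
query = encode_face("camera_frame.jpg")
print(same_person(known, query))
```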
The classification of emotions was carried out using the KDEF dataset (http://www.kdef.se). Seven emotional states are classified: AF = afraid, AN = angry, DI = disgusted, HA = happy, NE = neutral, SA = sad, SU = surprised. To classify these emotions, deep learning was used with TensorFlow as the development tool. The dataset contains a total of 4900 images of 562x762 pixels at 72 dpi, and was divided into 80% for training, 10% for testing and 10% for validation. The results obtained from the training process can be seen in Figures 10 and 11.
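For reference, a small convolutional network of the kind typically used for this task is sketched below in TensorFlow/Keras. The exact architecture, input resolution (48x48 here) and hyperparameters of the project's model are not documented, so everything beyond the seven output classes is an assumption.

```python
import tensorflow as tf

NUM_EMOTIONS = 7   # afraid, angry, disgusted, happy, neutral, sad, surprised
IMG_SIZE = 48      # assumed training resolution; KDEF originals are 562x762

# A small CNN for 7-class emotion classification (illustrative architecture).
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(IMG_SIZE, IMG_SIZE, 1)),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(NUM_EMOTIONS, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# The KDEF images would be cropped to the face, converted to grayscale,
# resized to IMG_SIZE x IMG_SIZE, and split 80/10/10 into train/test/validation
# before calling model.fit(x_train, y_train, validation_data=(x_val, y_val)).
```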