I created this as part of a school project on Home Automation, and it is my own take on the concept. In general, I think some Home Automation devices take away from the everyday human interactions that make us human. Without getting too sidetracked by my philosophical beliefs about automating your life, I decided to make something practical that does not take away from the actions I already do in my day-to-day life. With mental health being such a prominent topic during this pandemic, and with a little brainstorming help from my teacher, I decided it would be a cool idea to create an automatic emotion journal that keeps track of my emotions throughout the day.
(As a short disclaimer, this project is just a proof of concept. I think that with the contents of this tutorial, it could be turned into something much more practical. Right now it can only detect very intense smiles and frowns, though there is room to expand the algorithm to recognize more emotions and to make the detections more accurate on subtle expressions.)
Hardware Setup
First off, I have used my Raspberry Pi 3B+ before and already have it all set up. If you are starting from scratch and want to follow along with the rest of this tutorial, I suggest following Raspberry Pi's official setup guide, linked here and in the sources below. I would explain all of the steps here, but I am a little rusty on them now, so the linked tutorial will explain them better than I can.
Once your Raspberry Pi is set up, you will need to install the camera and configure some settings. First, make sure your Raspberry Pi is turned off. The next step is to find the camera connector, which is a slot located between the audio jack and the HDMI port (shown below).
Once you have identified where the camera's ribbon cable is supposed to be inserted, gently pull up on the black plastic clip and insert the ribbon cable with the blue side facing the USB ports. The final step is to push the black clip back into place, securing the ribbon cable in the port. If any of this is confusing, feel free to follow the official Raspberry Pi camera guide, also linked here.
With the camera plugged in, I decided to build a stand out of cardboard to hold it in the correct position. To make this, I took a strip of cardboard and folded the ends in to create legs. I then drew small triangles at the bottom of each leg so the stand would angle the camera up towards my face when it sits on my desk. My last step was taping the camera to the top of the stand. Here are some photos of the stand creation process:
Now that the camera has been successfully installed, some settings need to be configured to get it working. Head to the start menu, click on Preferences, and then open the Raspberry Pi Configuration menu. From here, enable the camera and then reboot your Raspberry Pi.
(Start —> Preferences —> Raspberry Pi Configuration)
To check that the camera has been installed correctly and the settings properly configured, open the terminal and type the following command:
raspistill -o Desktop/test.jpg
This should open a window displaying a preview of what the camera sees for five seconds before saving the photo to the desktop. That is all the hardware setup required for this project. With the hardware ready, we can move on to the software and library installations.
Library Installations
There is a lot that needs to be installed to get this project up and running. This part of the project will probably take a bit of time just because of the sheer quantity of libraries required to make it work.
I really suggest using pip for these installations because they would be a nightmare without it. If you don't have pip installed and/or don't know how to use it, you can follow the tutorial linked here: https://pip.pypa.io/en/stable/installing/. For some reason, I was able to get everything installed using pip except for cmake. I used Homebrew to install cmake, but since I did not already have Homebrew installed on my computer, I will cover that installation below.
To recreate this project, you will need a computer as well. The computer is used to train the emotion recognition model that the Raspberry Pi uses. First, I will cover the installations required to get the code I have below running on your computer.
Do the following library installations in order from top to bottom. Some of the lower libraries are dependent on the first few installations.
Homebrew Setup:
$ ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"
$ brew update
$ echo -e "\n# Homebrew" >> ~/.bash_profile
$ echo "export PATH=/usr/local/bin:$PATH" >> ~/.bash_profile
$ source ~/.bash_profile
Cmake Installation Using Homebrew:
$ brew install cmake
PIP Installations:
$ pip install numpy
$ pip install dlib
$ pip install face-recognition
$ pip install --upgrade tensorflow
$ pip install Pillow
If you run into any trouble with the above installations, Google tutorials on how to install them. There are much more detailed explanations out there of how to get these libraries working and what your issue might be. I hit plenty of system-specific and circumstantial issues during these installations. I am sharing what worked for me above, but that doesn't necessarily mean it will work for you. If you type one of these commands into your console/terminal and get a wall of red error text, don't get discouraged. Luckily, all of these libraries have been around for a while, which means you are not the first person to hit these errors. If you run into any serious problems during the installation process, try Googling the error message along with the name of the library you are trying to install.
Now that the installations required for your computer are out of the way, we can move on to installing the libraries required on the Raspberry Pi. Between the cmake and dlib installations, this will take roughly an hour to complete. These installations take a very long time, so give them time and don't worry.
TensorFlow Lite Runtime:
I actually have instructions for this installation later in the tutorial as well, but the commands go as follows:
echo "deb https://packages.cloud.google.com/apt coral-edgetpu-stable main" | sudo tee /etc/apt/sources.list.d/coral-edgetpu.list
curl https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add -
sudo apt-get update
sudo apt-get install python3-tflite-runtime
Cmake Installation:
$ sudo apt-get update
$ sudo apt-get install build-essential cmake
$ sudo apt-get install libgtk-3-dev
$ sudo apt-get install libboost-all-dev
Dlib and Face_recognition Installation:
$ pip install numpy
$ pip install dlib
$ pip install face-recognition
Now that all of these are out of the way, we can get to the really exciting software contents of this project.
Building The Emotion Recognition Model
I chose to build my emotion recognition model using TensorFlow Keras, but before I dive into the details, I think it's important to briefly explain what TensorFlow is to the best of my knowledge. TensorFlow is a machine learning framework that allows for the easier creation and training of machine learning models. Keras is just one of the tools that TensorFlow offers to build those models. TensorFlow's fancy, mysterious machine learning models are really just data-flow graphs, which sounds just as confusing but is actually much easier to break down.
Graphs in this sense are mathematical structures of interconnected nodes/vertices and edges, and they are part of a broader area of math called graph theory. It is actually a lot easier to explain visually, so below is my best attempt at explaining graphs in a picture.
So with this in mind, when we think of data-flow graphs, we can think of a series of structured nodes and edges through which we can send information not too dissimilar to the way neurons work in brains. Unlike the graph I drew above, the graphs created by TensorFlow have very specific, user-controlled structures that allow the data to flow and emerge in meaningful ways. The machine learning model I built and trained was set up with the following structure:
The model I trained and used throughout this project has essentially the same structure as the graph shown above. The only difference between the drawing and the model I used is that the model has weights and biases between each layer. These weights and biases help determine what value goes to which node depending on the input, and the activation functions work in conjunction with them. The weights and biases are the only trainable variables in the entire model, so they are what gets adjusted to train the model to make accurate predictions. In other words, the weights and biases between each layer are nudged in directions that maximize the accuracy of the model. This is done through feedforward and backpropagation. This is where my knowledge of neural networks and machine learning models breaks down, but luckily for us, we don't need to know much more about how feedforward and backpropagation work to effectively create and train a machine learning model. TensorFlow is more than capable of handling all the hard, complicated math for us, so all that is important is recognizing the general structure of the data-flow graph we are building.
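To make the idea of weights and biases a bit more concrete, here is a toy example (not taken from the project) of what a single dense layer computes: each node takes a weighted sum of the previous layer's values, adds a bias, and passes the result through an activation function.
# Toy illustration of one dense layer: weighted sum + bias, then activation.
# The numbers here are made up purely for demonstration.
import numpy as np

x = np.array([0.2, 0.7, 0.1])        # values coming out of the previous layer
W = np.random.rand(4, 3)             # weights: one row per node in this layer
b = np.random.rand(4)                # one bias per node in this layer
output = np.maximum(0, W @ x + b)    # ReLU activation, like the hidden layer used below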
Translating the model drawn above into code, we get this:
from tensorflow import keras

model = keras.Sequential([
    # Flatten the 20 x 41 input grid into a single vector
    keras.layers.Flatten(input_shape=(20, 41)),
    # Hidden layer of 160 nodes with ReLU activation
    keras.layers.Dense(160, activation="relu"),
    # Output layer: one node per detectable emotion
    keras.layers.Dense(2, activation="softmax")
])
That's all it takes to create the model. The hard part comes from training it and aggregating enough properly standardized data for the model to be accurate under broader use cases.
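For completeness, compiling and fitting a model like this in Keras only takes a few more lines. This is just a minimal sketch; training_data and training_labels are assumed to be NumPy arrays of the standardized data and numeric labels discussed in the next section.
# Minimal training sketch; training_data and training_labels are assumed to be
# NumPy arrays built from the standardized landmark data described below.
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(training_data, training_labels, epochs=10)
model.save("saved_model")  # the saved model folder is converted to .tflite later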
Training The Model
This was one of the hardest parts of this entire project. It turns out to be very hard to get large amounts of data that fit both my parameters for use and the shape of the model. To avoid giving my model a massive input layer that fits an entire image, I chose to use the pre-trained face-recognition library to find facial landmarks, which I could then pass to my own model. This allowed me to greatly reduce the size of the input layer and, though I didn't test this, increase the speed at which the model can run. Fewer inputs mean fewer nodes, edges, weights, and biases, which ultimately means faster training and running times. In addition, it is much easier to work with a constant, small number of points than an entire photo. I quickly realized that to get a proof of concept up and running, I would only really need the landmarks of the top and bottom lips. This way I had even less data to deal with, and I could store it in a .csv file so I would only have to extract it from the photos once.
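As a rough illustration of that extraction step, here is a minimal sketch of pulling just the lip landmarks out of a photo with the face-recognition library and appending them to a .csv file; the file names here are placeholders.
# Minimal sketch of extracting lip landmarks and saving them to a CSV.
# "training_photo.jpg" and "lip_landmarks.csv" are placeholder names.
import csv
import face_recognition

image = face_recognition.load_image_file("training_photo.jpg")
landmarks = face_recognition.face_landmarks(image)  # one dict of features per detected face

if landmarks:
    lip_points = landmarks[0]["top_lip"] + landmarks[0]["bottom_lip"]
    # Flatten the (x, y) pixel pairs into one CSV row so extraction only happens once
    with open("lip_landmarks.csv", "a", newline="") as f:
        csv.writer(f).writerow([value for point in lip_points for value in point])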
Each facial landmark was an x, y coordinate pair, with the x and y measured in pixels. While very useful for locating the landmarks, in this format they could not be sent to the emotion recognition model I was training; it would be impossible to pass the values held in any given coordinate pair to a single node. To work around this, I plotted the facial landmarks of the upper and lower lip into a set box 40 pixels wide and 20 pixels tall. Using some simple geometry shown below, I was able to standardize all of my training data so that it would fit the shape of my model's input layer.
In action the landmark plots looked like the following:
An issue I encountered while standardizing the data was that a' and b' were often not whole numbers. This meant that I had to round the a' and b' values to fit the plotted facial landmarks into the indexes of a 2D array. This led to some errors where I would get an index of 40 when the array only had indexes 0-39. Naturally, I made the array one index longer instead of finding a way to fix the rounding that was causing the error. If I have more time in the future, I might revisit this project and properly fix this issue.
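To make the geometry concrete, here is a rough sketch of the standardization step, assuming the 40 x 20 box described above and the extra column that absorbs a rounded index of 40; the exact scaling formula here is a reconstruction, not copied from my original script.
# Rough sketch of the standardization step: scale the lip points into a
# 40-wide by 20-tall box and plot them into a 20 x 41 grid (the extra
# column absorbs a rounded x index of 40).
import numpy as np

def standardize(points):
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    grid = np.zeros((20, 41))
    for x, y in points:
        # Scale each coordinate proportionally into the box, then round it
        # so it can be used as an array index
        a = round((x - min(xs)) / (max(xs) - min(xs)) * 40)
        b = round((y - min(ys)) / (max(ys) - min(ys)) * 19)
        grid[b][a] = 1
    return grid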
When it actually came to training the model, I made minor tweaks and edits that left me with four different versions. With each version, I was able to increase the model's accuracy and decrease its bias toward recognizing only the limited training data I was able to accumulate. Here are some photos of the best model version predicting photos it had never seen before:
Something to take note of is that the model's certainty is scarily high for almost every picture. This is most likely an indication that there is not enough breadth in the data I used to train the model and in the data the model has never seen before. In short, I just do not have enough varied data: all my training and testing data looks practically identical. This is something I have struggled to fix because it is challenging to accumulate and label enough data, but I am at least aware of the issue.
There is plenty of room for improvement in the training of the model, but using the approach I explained above gets the model roughly working which is fine for now. If you are interested in using this emotion recognition model in an upcoming project, I highly suggest spending the bulk of your time collecting diverse data and corresponding labels to greatly improve how well the model will work.
Converting to and Using a TensorFlow Lite Model
The TensorFlow model I defined and trained above was built using TensorFlow 2 on my MacBook Pro, but I want to use the model on my Raspberry Pi. Since my laptop has plenty of extra disk space, 8 gigabytes of memory, and a decent processor, it had no problem whatsoever running the TensorFlow 2 model I created. However, a Raspberry Pi has comparatively limited storage, memory, and processing capacity, which means it might run into some serious performance issues trying to run the full TensorFlow model. By converting the trained model to a TensorFlow Lite model, it should be possible to circumvent these performance issues.
Converting the Model:
Before transferring the trained model over to the Raspberry Pi, it should be converted into a .tflite file. By following the guide on TensorFlow's website (linked here), I was able to convert my saved model with the following script (also from TensorFlow's website linked above):
import tensorflow as tf

# Convert the saved Keras model (saved_model_path points to its folder) to TFLite
converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_path)
tflite_model = converter.convert()
# Write the converted model out as a .tflite file
with open("model.tflite", "wb") as file:
    file.write(tflite_model)
Using the Model:
Before the converted model can be run on the Raspberry Pi, TensorFlow Lite needs to be installed. TensorFlow Lite can be installed on the Raspberry Pi with the following terminal commands:
echo "deb https://packages.cloud.google.com/apt coral-edgetpu-stable main" | sudo tee /etc/apt/sources.list.d/coral-edgetpu.list
curl https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add -
sudo apt-get update
sudo apt-get install python3-tflite-runtime
Once tflite_runtime has been installed, the .tflite model can be used to make predictions by creating an Interpreter object. Shown below is an example from TensorFlow's website (linked here) of how to use the Interpreter.
import numpy as np
import tensorflow as tf
# Load the TFLite model and allocate tensors.
interpreter = tf.lite.Interpreter(model_path="converted_model.tflite")
interpreter.allocate_tensors()
# Get input and output tensors.
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()
# Test the model on random input data.
input_shape = input_details[0]['shape']
input_data = np.array(np.random.random_sample(input_shape), dtype=np.float32)
interpreter.set_tensor(input_details[0]['index'], input_data)
interpreter.invoke()
# The function `get_tensor()` returns a copy of the tensor data.
# Use `tensor()` in order to get a pointer to the tensor.
output_data = interpreter.get_tensor(output_details[0]['index'])
print(output_data)
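One thing to note: the example above imports the full tensorflow package, which we did not install on the Raspberry Pi. With only tflite_runtime installed, the same idea should look roughly like this sketch:
import numpy as np
from tflite_runtime.interpreter import Interpreter  # lighter-weight stand-in for tf.lite

# Load the converted model with the standalone TFLite runtime
interpreter = Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Test with random input data, just like the example above
input_shape = input_details[0]['shape']
input_data = np.array(np.random.random_sample(input_shape), dtype=np.float32)
interpreter.set_tensor(input_details[0]['index'], input_data)
interpreter.invoke()
output_data = interpreter.get_tensor(output_details[0]['index'])
print(output_data)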
Automatic Emotion Journal
The final part of this project was bringing all these pieces together to create an automatic emotion journal. The PiCamera and the TensorFlow model would ideally work in unison to document my mood throughout the day.
The emotion journal script starts by taking a photo using the PiCamera module, and the photo is saved as unknown.jpg for the time being. The next step is loading the recently taken photo as a numpy array using the face-recognition module. This numpy array can then be passed to the face-recognition module's face_landmarks() function, which identifies the important facial features in the photo using 68 points. The facial landmarks defining the top and bottom lips are then sent to the standardization function I discussed earlier, which scales the facial landmarks into set bounds. These standardized points are then sent to the format_data() function, which simply reformats the array of standardized points into a format the TensorFlow model will accept. The properly formatted data is then sent to the emotion recognition model, and the model's prediction is extracted. A time stamp is created just after the model makes its prediction. The prediction and the time stamp are then appended to a running csv file that keeps track of all the model's predictions. This csv file is essentially the emotion journal, and each entry reads something like this: "happy; 23-49-31". The time stamp is the military time in hours, minutes, and then seconds.
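Pulling the whole flow together, a rough sketch of the journal script looks like this. The standardize() call refers to the standardization sketch from the training section, and the label order, model path, and journal file name are placeholder choices.
# Rough sketch of the emotion journal flow; standardize() is the sketch from
# the training section, and the label order, model path, and journal file
# name are placeholders.
import time
import numpy as np
import face_recognition
from picamera import PiCamera
from tflite_runtime.interpreter import Interpreter

LABELS = ["happy", "sad"]  # assumed order of the model's two output nodes

camera = PiCamera()
interpreter = Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# 1. Take a photo and temporarily save it as unknown.jpg
camera.capture("unknown.jpg")

# 2. Load the photo as a numpy array and find the facial landmark points
image = face_recognition.load_image_file("unknown.jpg")
landmarks = face_recognition.face_landmarks(image)

if landmarks:
    lips = landmarks[0]["top_lip"] + landmarks[0]["bottom_lip"]

    # 3. Standardize the lip points and reshape them into the batch format
    #    the model expects (this stands in for the format_data() step)
    grid = standardize(lips)
    input_data = np.array([grid], dtype=np.float32)

    # 4. Run the TensorFlow Lite model and take the most likely emotion
    interpreter.set_tensor(input_details[0]["index"], input_data)
    interpreter.invoke()
    prediction = interpreter.get_tensor(output_details[0]["index"])
    emotion = LABELS[int(np.argmax(prediction))]

    # 5. Append the prediction and a time stamp to the journal csv
    with open("emotion_journal.csv", "a") as journal:
        journal.write(f"{emotion}; {time.strftime('%H-%M-%S')}\n")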
Sources
- Raspberry Pi Setup Guide: https://projects.raspberrypi.org/en/projects/raspberry-pi-setting-up
- Raspberry Pi Camera Setup Guide: https://projects.raspberrypi.org/en/projects/getting-started-with-picamera
- pip Installation tutorial: https://pip.pypa.io/en/stable/installing/
- TensorFlow Keras tutorial (1/2): https://www.youtube.com/watch?v=cvNtZqphr6A
- TensorFlow Keras tutorial (2/2): https://www.youtube.com/watch?v=RqLD1INA_cQ
- Installing dlib (computer and Raspberry Pi): https://www.pyimagesearch.com/2018/01/22/install-dlib-easy-complete-guide/
- Converting to a .tflite model: https://www.tensorflow.org/lite/convert
- Installing TensorFlow Lite: https://www.tensorflow.org/lite/guide/python
- Using a TensorFlow Lite Model: https://www.tensorflow.org/lite/guide/inference#load_and_run_a_model_in_python
- face-recognition Library: https://pypi.org/project/face-recognition/
- Converting Video to Images with OpenCV: https://medium.com/@iKhushPatel/convert-video-to-images-images-to-video-using-opencv-python-db27a128a481