Published November 30, 2023 © MIT

Play and study with M5Stack Core 2!

Children with ADHD can find learning difficult, and this project addresses that problem.

Intermediate • Full instructions provided • 4 hours • 520

Things used in this project

Hardware components

M5Stack Core2 ESP32 IoT Development Kit

Software apps and online services

Arduino IDE

Neuton TinyML

Story

Attention-deficit/hyperactivity disorder (ADHD) is a neurodevelopmental condition that can make it difficult for people to focus. It is the most common neurodevelopmental condition in children.

However, children affected by ADHD can be taught in a more interactive way, and this project is a simple example of that approach. Children usually start to learn colors as toddlers, and color coding plays a significant role in enhancing memory performance.

Overview - Hardware and Software

M5Stack Core 2 ESP32 IoT Development Kit (I will call it M5Stack from here) and Neuton TinyML made this project possible. By the way, though Neuton is not an open-source tool, all functions are free and its models can run on almost any MCU.

I focused on preparing a proper dataset. I was inspired to apply this idea after taking a look at the MNIST dataset. I will explain why as you read further.

I draw the initial of a colour on the M5Stack's touchscreen, the TinyML model predicts the handwritten letter, and the model's output is used to display the corresponding colour on the RGB LEDs. I have programmed this prototype for fifteen colours. One drawback is that the LEDs are unable to display any dark colours.

MNIST Dataset & Its relevance to the Project

The MNIST (Modified National Institute of Standards and Technology) dataset is the "hello world" dataset of computer vision, sourced from the MNIST database which is a large collection of handwritten digits.

Each sample is a 28x28 grayscale image of a hand-drawn digit between 0 and 9. There are 784 pixels in total and each pixel value indicates the lightness or darkness of that pixel: a higher value means the pixel is darker and a lower value means it is lighter. The pixel value ranges from 0 to 255 (inclusive).

The pixel location of a pixel is calculated using the following formula:

x = i * 28 + j

Here x is the pixel location, and i and j are integers between 0 and 27 (inclusive). i denotes the pixel's row and j denotes the pixel's column, both zero-indexed.
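The row-major formula above can be sketched in C++ as follows. The function and constant names are only illustrative and do not come from any library.

```cpp
// Width of an MNIST image in pixels (28 x 28 = 784 pixels in total).
constexpr int kMnistWidth = 28;

// Row-major pixel location: x = i * width + j (i = row, j = column).
constexpr int pixelLocation(int row, int col, int width = kMnistWidth) {
    return row * width + col;
}

// Inverse mapping: recover the row from a pixel location.
constexpr int pixelRow(int location, int width = kMnistWidth) {
    return location / width;
}

// Inverse mapping: recover the column from a pixel location.
constexpr int pixelCol(int location, int width = kMnistWidth) {
    return location % width;
}
```

For example, the pixel in row 1, column 2 sits at location 1 * 28 + 2 = 30, and the last pixel (row 27, column 27) sits at location 783.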

I decided to create my own dataset based on this idea, using letters instead of digits.

Data Collection: Preparing the Training and Test datasets

I used my M5Stack to collect the data and prepare the training and test datasets. I decided to assign 75% of the data as the training dataset and the remaining 25% as the test dataset.

The target variable of my training dataset will be the 'Label' variable and it will contain the initial of the color. The letters and corresponding colors are as follows:

A - Aquamarine
B - Blue
C - Cyan
F - Fluorescent Blue
G - Green
I - Indigo
L - Lime
M - Magenta
O - Orange
P - Pink
R - Red
S - Sky Blue
V - Violet
W - White
Y - Yellow
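As a rough sketch, the list above could be held in a lookup table like the one below. The struct, the function name and especially the RGB triples are assumptions: the RGB values are CSS named colours where one exists (Fluorescent Blue is an approximation) and are not taken from the project's source code.

```cpp
#include <cstddef>
#include <cstdint>

// One entry per colour: the initial drawn on screen, the colour name and an
// RGB triple. The RGB values are ASSUMED, not taken from the project.
struct ColourEntry {
    char initial;
    const char* name;
    uint8_t r, g, b;
};

constexpr ColourEntry kColours[] = {
    {'A', "Aquamarine",       127, 255, 212},
    {'B', "Blue",               0,   0, 255},
    {'C', "Cyan",               0, 255, 255},
    {'F', "Fluorescent Blue",  21, 244, 238},  // approximation
    {'G', "Green",              0, 128,   0},
    {'I', "Indigo",            75,   0, 130},
    {'L', "Lime",               0, 255,   0},
    {'M', "Magenta",          255,   0, 255},
    {'O', "Orange",           255, 165,   0},
    {'P', "Pink",             255, 192, 203},
    {'R', "Red",              255,   0,   0},
    {'S', "Sky Blue",         135, 206, 235},
    {'V', "Violet",           238, 130, 238},
    {'W', "White",            255, 255, 255},
    {'Y', "Yellow",           255, 255,   0},
};
constexpr std::size_t kColourCount = sizeof(kColours) / sizeof(kColours[0]);

// Returns the entry for a given initial, or nullptr if the letter is not one
// of the fifteen supported initials.
const ColourEntry* findColour(char initial) {
    for (std::size_t n = 0; n < kColourCount; ++n) {
        if (kColours[n].initial == initial) {
            return &kColours[n];
        }
    }
    return nullptr;
}
```

A table like this keeps the letter, the name and the LED colour in one place, so adding a colour means adding a single row.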

I collected 400 samples for each letter, 300 of which went into the training dataset and 100 into the test dataset. Each sample has 255 feature variables, each holding the location of a pixel touched while drawing. I checked roughly how many pixels were registered as I drew my letters and settled on 255 feature variables.
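The figures above can be checked with a few compile-time constants. This is only a sanity check under the assumption that every collected sample becomes exactly one CSV row; the constant names are illustrative.

```cpp
// Figures taken from the text above.
constexpr int kLetters = 15;            // A, B, C, F, G, I, L, M, O, P, R, S, V, W, Y
constexpr int kSamplesPerLetter = 400;
constexpr int kTrainPerLetter = 300;
constexpr int kTestPerLetter = 100;
constexpr int kFeatures = 255;

// Derived sizes (assuming one CSV row per sample).
constexpr int kTotalSamples = kLetters * kSamplesPerLetter;  // 6000
constexpr int kTrainRows = kLetters * kTrainPerLetter;       // 4500
constexpr int kTestRows = kLetters * kTestPerLetter;         // 1500
constexpr int kColumnsPerRow = 1 + kFeatures;                // Label + 255 features

static_assert(kTrainPerLetter + kTestPerLetter == kSamplesPerLetter,
              "training and test samples must add up to the total per letter");
static_assert(kTrainRows * 100 / kTotalSamples == 75,
              "the training share should be 75%");
```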

The screen resolution of the M5Stack is 320 x 240 pixels. To calculate the pixel location, I used the following formula:

x = i * 320 + j

The factor by which you multiply the pixel's row number is simply the width of your touchscreen. To store the pixel locations, I first tried a list: I declared one and appended to it inside a for loop (with 255 iterations). This failed with an error saying that the pixel location did not have an appropriate data type. To save time, I switched to a buffer instead, allocating memory up front to store the values.
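The buffer idea can be sketched in plain C++ as shown below. This is a simplified illustration rather than the project's code: it uses a fixed-size array where the sketch in the Code section uses calloc, and the struct and function names are hypothetical. The width of 320 and the 255 slots come from the text above.

```cpp
#include <cstddef>
#include <cstdint>

constexpr int kScreenWidth = 320;         // M5Stack display width in pixels
constexpr std::size_t kBufferSize = 255;  // number of feature variables

// Holds the pixel locations recorded for one drawn letter.
struct StrokeBuffer {
    int32_t locations[kBufferSize] = {};  // unused slots stay 0
    std::size_t count = 0;                // number of slots filled so far
    int32_t last = -1;                    // last stored location, -1 = none yet
};

// Records one touch point. Returns true if the location was stored, false if
// it repeats the previous location or the buffer is already full.
bool recordTouch(StrokeBuffer& buf, int x, int y) {
    const int32_t location = y * kScreenWidth + x;
    if (location == buf.last || buf.count >= kBufferSize) {
        return false;
    }
    buf.locations[buf.count++] = location;
    buf.last = location;
    return true;
}

// Clears the buffer so the next letter starts from an empty state.
void resetStroke(StrokeBuffer& buf) {
    buf = StrokeBuffer{};
}
```

A touch at x = 10, y = 2 is stored as 2 * 320 + 10 = 650. Because the array size is fixed, every sample ends up with the same number of columns, which is what a CSV file with a fixed header needs.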

Each person will have a different way to write a letter, so I made sure to draw out each letter in all possible ways and collect sufficient samples for each way.

Neuton TinyML requires datasets in CSV format, so I prepared my training and test datasets as CSV files. Both datasets have to meet some other requirements as well, but don't worry, you can always view them in the Support Library on the platform.
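A minimal sketch of the CSV layout, assuming a 'Label' column followed by columns named pixel0, pixel1 and so on (the naming used by the data collection code in the Code section). The two helper functions are hypothetical and build the rows as strings instead of printing them over Serial.

```cpp
#include <cstddef>
#include <string>

// Builds the header row: "Label,pixel0,pixel1,...".
std::string csvHeader(std::size_t featureCount) {
    std::string row = "Label";
    for (std::size_t n = 0; n < featureCount; ++n) {
        row += ",pixel" + std::to_string(n);
    }
    return row;
}

// Builds one data row: the label followed by every feature value.
std::string csvRow(const std::string& label, const int* features,
                   std::size_t featureCount) {
    std::string row = label;
    for (std::size_t n = 0; n < featureCount; ++n) {
        row += "," + std::to_string(features[n]);
    }
    return row;
}
```

With three features, the header is Label,pixel0,pixel1,pixel2 and a sample for the letter A with the values 650, 651 and 0 becomes A,650,651,0.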

The code for data collection is available in the Code section below.

The next step is model training.

Training the Model

Visit Neuton TinyML's web page (neuton.ai) and click on ‘Get Started’. Click on the ‘Start for Free’ button and you will be redirected to the welcome page, where you can sign in with your Google account. Set up your GCP account and you will receive free credits to upload your own data and train your models. Subscribe to Neuton's Zero Gravity Plan and you are good to go!

Click on 'Add New Solution' to start a new solution.

Once you are done, click ‘Next’ and you will be required to upload your training dataset. The dataset will be validated and if it meets all requirements, it will show a green tick and allow you to continue. You should not have duplicate rows or any missing values.

Click ‘OK’ and proceed to the next step. Choose your target variable which is the 'Label' variable and if you want to eliminate any other variables, you can also do that.

The next step will require you to specify the task type, the metric, and TinyML model settings. The platform can identify the target metric and task type itself but I will explain why I used the Classification task type.

This model should be able to classify the given input as a letter. This is supervised machine learning, as we are training the model with both the target and the feature variables.

It is classification rather than regression because we are not predicting a continuous dependent variable from independent variables, as we would when predicting yearly income from the number of hours worked per week. There are two types of classification: binary and multi. Binary classification assigns the input to one of two classes. In this project, the input is assigned to one of fifteen classes (one per letter), so the task type is Multi Classification.

The target metric is Accuracy, and you will see why the platform chose it once your model's training is complete. The target metric measures how well the model's predictions match the validation dataset and represents the model quality. Accuracy is the share of samples that are classified correctly.
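As a small illustration, accuracy can be computed by hand as the number of correct predictions divided by the total number of predictions. The function below is a generic sketch with a hypothetical name, not the platform's own implementation.

```cpp
#include <cstddef>
#include <vector>

// Fraction of predictions that match the true labels (0.0 to 1.0).
// Returns 0.0 if the inputs are empty or differ in length.
double accuracy(const std::vector<int>& predicted, const std::vector<int>& actual) {
    if (predicted.empty() || predicted.size() != actual.size()) {
        return 0.0;
    }
    std::size_t correct = 0;
    for (std::size_t n = 0; n < predicted.size(); ++n) {
        if (predicted[n] == actual[n]) {
            ++correct;
        }
    }
    return static_cast<double>(correct) / static_cast<double>(predicted.size());
}
```

If three out of four predictions match the true labels, the accuracy is 0.75 and the error rate is 1 - 0.75 = 0.25.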

If you want to create tiny models for microcontrollers, enable the TinyML mode using the slider and set the model settings.

The input data type is FLOAT32 and the normalization type is 'Unified scale for all features'. You will need to choose this normalization type if the data from your feature variables are within the same range and doing this will also reduce the time required for training. Enable float datatype support and select 8 bits as the bit depth for calculations. Once you are done, click 'Start training' and the training process will start.
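To give an idea of what a unified scale means, here is a sketch of min-max scaling with one shared minimum and maximum for all features. This is only one plausible reading of the setting and is not Neuton's actual implementation; the function name is hypothetical.

```cpp
#include <algorithm>
#include <vector>

// Scales every value in the table to the range 0..1 using a single minimum
// and maximum taken over ALL features, instead of one pair per feature.
void normalizeUnified(std::vector<std::vector<float>>& rows) {
    bool any = false;
    float lo = 0.0f;
    float hi = 0.0f;
    for (const auto& row : rows) {
        for (float v : row) {
            if (!any) {
                lo = hi = v;
                any = true;
            }
            lo = std::min(lo, v);
            hi = std::max(hi, v);
        }
    }
    if (!any || hi == lo) {
        return;  // nothing to scale
    }
    for (auto& row : rows) {
        for (float& v : row) {
            v = (v - lo) / (hi - lo);
        }
    }
}
```

Sharing one scale only makes sense when all features measure the same kind of quantity, as they do here, where every feature lies in the same range.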

You can view the quality of your model, its accuracy, and other analytics once your training is complete.

Training results

My model had an accuracy of 97.2% and a model quality index of 97%. I am satisfied with the results!

Prediction

I enabled prediction to see how well my model performed. For this, I used my test dataset.

The results were better than expected and I felt quite confident about my TinyML model. I downloaded the C library and got ready to deploy it on my M5Stack.

Embedding the TinyML Model on M5Stack

Create an Arduino sketch file to deploy your model. The C library becomes available for download once training is complete. Download it, extract the zipped folder and copy its contents into the folder that holds your sketch file. Read the README text file in the downloaded content to learn how to embed your model.

According to the README file, the two main functions are:

neuton_model_set_inputs - to set input values
neuton_model_run_inference - to make predictions

You will need to make an array with model inputs. In my case, I used a buffer as my input data type was not suitable for an array. Please make sure that the input count and order are the same as in the training dataset. Pass the inputs to the neuton_model_set_inputs function. It returns 0 when the buffer is full, which indicates that the model is ready for prediction.

Call the neuton_model_run_inference function with two arguments when your buffer is ready. These two arguments are:

pointer to index of predicted class
pointer to neural net outputs

As you can see in the code below, the neuton_model_run_inference function returns 0 when the prediction is successful.

if (neuton_model_set_inputs(inputs) == 0)
{
    uint16_t index;
    float* outputs;
    
    if (neuton_model_run_inference(&index, &outputs) == 0)
    {
        // code for handling prediction result
    }
}

After a successful prediction, the inference results are mapped to your classes. Note that the inference results are encoded (0..n). Use the dictionaries binary_target_dict_csv.csv / multi_target_dict_csv.csv for the mapping process.
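The mapping step could look roughly like the sketch below. The order of the letters here is an assumption (alphabetical). The real index-to-label order has to be read from multi_target_dict_csv.csv, and the function name is hypothetical.

```cpp
#include <cstddef>
#include <cstdint>

// Class labels in the order the encoded indices (0..n) are ASSUMED to follow.
// Check multi_target_dict_csv.csv for the real order before relying on this.
constexpr char kLabels[] = {'A', 'B', 'C', 'F', 'G', 'I', 'L', 'M',
                            'O', 'P', 'R', 'S', 'V', 'W', 'Y'};
constexpr std::size_t kLabelCount = sizeof(kLabels) / sizeof(kLabels[0]);

// Returns the letter for a predicted class index, or '?' if the index is
// outside the known range.
char labelForIndex(uint16_t index) {
    return index < kLabelCount ? kLabels[index] : '?';
}
```

With this in place, the index filled in by neuton_model_run_inference can be passed to labelForIndex to get the predicted letter, which in turn selects the colour to show on the LEDs.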

I have uploaded the complete source code in the Code section for your convenience.

Final Look

Conclusion

I had a nice experience while working on this project. I hope you liked my tutorial and found it helpful. I also hope that children are more intrigued by this project and show more interest while learning the colors. I'm always open to suggestions so please feel free to share your feedback below.

Code

Touch Test

#include <M5GFX.h>

M5GFX display;

void setup(void)
{
  display.init();
  display.setFont(&fonts::Font4);

  if (!display.touch())
  {
    display.setTextDatum(textdatum_t::middle_center);
    display.drawString("Touch not found.", display.width() / 2, display.height() / 2);
  }

  display.setEpdMode(epd_mode_t::epd_fastest);
  display.startWrite();
}

void loop(void)
{
  static bool drawed = false;
  lgfx::touch_point_t tp[3];

  int nums = display.getTouchRaw(tp, 3);
  if (nums)
  {
    for (int i = 0; i < nums; ++i)
    {
      display.setCursor(16, 16 + i * 24);
      display.printf("Raw X:%03d  Y:%03d", tp[i].x, tp[i].y);
    }

    display.convertRawXY(tp, nums);

    for (int i = 0; i < nums; ++i)
    {
      display.setCursor(16, 128 + i * 24);
      display.printf("Convert X:%03d  Y:%03d", tp[i].x, tp[i].y);
    }
    display.display();

    display.setColor(display.isEPD() ? TFT_BLACK : TFT_WHITE);
    for (int i = 0; i < nums; ++i)
    {
      int s = tp[i].size + 3;
      switch (tp[i].id)
      {
      case 0:
        display.fillCircle(tp[i].x, tp[i].y, s);
        break;
      case 1:
        display.drawLine(tp[i].x-s, tp[i].y-s, tp[i].x+s, tp[i].y+s);
        display.drawLine(tp[i].x-s, tp[i].y+s, tp[i].x+s, tp[i].y-s);
        break;
      default:
        display.fillTriangle(tp[i].x-s, tp[i].y +s, tp[i].x+s, tp[i].y+s, tp[i].x, tp[i].y-s);
        break;
      }
      display.display();
    }
    drawed = true;
  }
  else if (drawed)
  {
    drawed = false;
    display.waitDisplay();
    display.clear();
    display.display();
  }
  vTaskDelay(1);
}

Data Collection

#include <M5GFX.h>

M5GFX display;
int val;
int iteration;
const int Buffer_Size = 255; //Estimated number of pixels touched when you draw a letter (255 in this project)
int* Buffer = (int*) calloc(Buffer_Size, sizeof(int));  

void setup() {
  // put your setup code here, to run once:
  Serial.begin(115200);
  display.init();
  display.setFont(&fonts::Font4);

  if (!display.touch())
  {
    display.setTextDatum(textdatum_t::middle_center);
    display.drawString("Touch not found.", display.width() / 2, display.height() / 2);
  }

  display.setEpdMode(epd_mode_t::epd_fastest);
  display.startWrite();
  
  //Creating the header for the CSV file
  Serial.print("Label");
  for (int i=0;i<Buffer_Size;i++){
    Serial.print(",");
    Serial.print("pixel"+String(i));
  }
  Serial.println(); //Starts new row
  Serial.print("A"); //Replace "A" with the letter you are collecting samples for
}

void loop() {
  // put your main code here, to run repeatedly:
  static bool drawed = false;
  lgfx::touch_point_t tp[3];

  int nums = display.getTouchRaw(tp, 3);

  if(nums)
  {
    display.convertRawXY(tp, nums);
    for (int i = 0; i < nums; ++i){
      if((tp[i].y * 320 + tp[i].x) != val && iteration < Buffer_Size){ //Prevents duplicate data
        Buffer[iteration] = (tp[i].y * 320) + tp[i].x; //Store the pixel location (row * width + column). 320 is the display width and can vary with different touchscreens
        val = Buffer[iteration];
        iteration++;
      }                 
     }
     display.display();
     display.setColor(display.isEPD() ? TFT_BLACK : TFT_WHITE);
     for (int i = 0; i < nums; ++i)
     {
      int s = tp[i].size + 3;
      switch (tp[i].id)
      {
      case 0:
        display.fillCircle(tp[i].x, tp[i].y, s);
        break;
      case 1:
        display.drawLine(tp[i].x-s, tp[i].y-s, tp[i].x+s, tp[i].y+s);
        display.drawLine(tp[i].x-s, tp[i].y+s, tp[i].x+s, tp[i].y-s);
        break;
      default:
        display.fillTriangle(tp[i].x-s, tp[i].y +s, tp[i].x+s, tp[i].y+s, tp[i].x, tp[i].y-s);
        break;
      }
      display.display();
    }
    drawed = true;
   }

   else if (drawed) //Runs after you finish drawing the letter
   {   
    for(int i = 0; i < Buffer_Size; i++){
      Serial.print(",");
      Serial.print(Buffer[i]);
      
    }
    Serial.println(); //Create a new row
    Serial.print("A"); //Replace "A" with the letter you are collecting samples for

    drawed = false;
    display.waitDisplay();
    display.clear(); //Clear display to draw the letter again
    display.display();
    val=iteration=0; //Reset iteration and val variables
    free(Buffer); //Release the old buffer, then allocate a zeroed one for the next letter
    Buffer= (int*) calloc(Buffer_Size, sizeof(int));     
   }
   vTaskDelay(1);    
}

Credits

Rucksikaa Raajkumar

43 projects • 94 followers

Amateur Arduino Developer. Undergraduate. YouTuber (https://www.youtube.com/c/RucksikaaRaajkumar/videos) and Blogger (Arduino Projects by R)
