Story
Today there are four main types of audio content: radio services, streaming music services, podcasts, and audiobooks. The first is gaining popularity. Why is this happening? Because in 2018 neither I nor you have time to select music, so we rely on the taste of a DJ. At the same time, I have a rich personal digital music collection, and artificial intelligence could select tracks from it for playback by recognizing a person's emotions. I plan to create an algorithm for training a neural network that selects music based on an analysis of the listener's emotions: it will build a profile of the listener and pick the optimal compositions for them according to their emotional state at a given moment.
Before I obtained the Thundercomm AI Kit, I had used the MobileNet SSD model due to its relatively small size and the fact that it already had a method for deployment in an Android app. SSD is a unified framework for object detection with a single network, and its code can be used to train and evaluate a network for an object detection task. Having received the AI Kit, I turned to the Face SDK and Object Detection SDK from ThunderSoft. The key task at the moment is to use their face recognition and emotion detection capabilities to select and play music from the database.
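The selection logic itself can start as a simple mapping from the detected emotion to tracks in the collection. Here is a minimal sketch; the playlist contents and file names are hypothetical, and the ThunderSoft SDK call that actually produces the emotion label is not shown:

```python
import random

# Hypothetical mapping from a detected emotion to tracks in my collection.
# In the final system this profile would be learned per listener rather
# than hard-coded.
PLAYLISTS = {
    "happiness": ["upbeat_01.mp3", "upbeat_02.mp3"],
    "sadness":   ["calm_01.mp3", "piano_02.mp3"],
    "anger":     ["ambient_01.mp3"],
    "neutral":   ["mix_01.mp3", "mix_02.mp3"],
}

def pick_track(emotion: str) -> str:
    """Return a track for the detected emotion, falling back to neutral."""
    tracks = PLAYLISTS.get(emotion, PLAYLISTS["neutral"])
    return random.choice(tracks)
```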
Theory
Face recognition is a classic topic that has been studied for decades and still attracts much attention in the fields of computer vision and pattern recognition. Emotion recognition is challenging because several input modalities play a significant role in understanding it. Recognizing emotions is difficult mostly for two reasons: 1) there is no large, widely available database of training images, and 2) classifying an emotion is not simple, because it depends on whether the input is a static image or a sequence of frames evolving into a facial expression. The latter difficulty applies mostly to real-time detection, where facial expressions change continuously. There are six basic expressions (surprise, fear, happiness, anger, disgust, and sadness) that are common among human beings, and the large overlap between the emotion classes makes the classification task very difficult. The facial recognition process consists of three main stages: acquisition, feature extraction, and emotion classification. The figure shows the six basic emotions plus the neutral state as outputs.
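As a minimal sketch of those three stages, assuming placeholder functions for the extractor and classifier (whatever the project ends up using, e.g. a trained CNN):

```python
import cv2
import numpy as np

# The seven output classes: six basic emotions plus the neutral state.
EMOTIONS = ["surprise", "fear", "happiness", "anger", "disgust", "sadness", "neutral"]

def recognize_emotion(frame, extract_features, classify):
    """Three-stage pipeline: acquisition -> feature extraction -> classification.

    `extract_features` and `classify` are hypothetical placeholders, not a
    real SDK API.
    """
    # Stage 1: acquisition - convert the captured frame to grayscale.
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    # Stage 2: feature extraction.
    features = extract_features(gray)
    # Stage 3: emotion classification - pick the most probable class.
    scores = classify(features)  # length-7 array of class scores
    return EMOTIONS[int(np.argmax(scores))]
```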
Recently, a new family of machine learning techniques emerged, namely deep learning, which automatically discovers adequate and relevant representations from raw data such as images. These techniques extract several levels of representation, moving from the low-level input to higher and more abstract ones. In the case of an image, the first layer of representation detects the presence or absence of edges at specific orientations and locations. The next one detects motifs by spotting particular arrangements of edges. The third layer combines the detected motifs to spot parts of the object. The last layers merge the detected parts to match the entire object.
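This layered idea maps directly onto a small convolutional network. Here is a minimal sketch in Keras; the 48x48 grayscale input and the layer sizes are my illustrative assumptions, not the final architecture:

```python
import tensorflow as tf

# A minimal CNN for 7-class emotion classification on 48x48 grayscale faces.
# Each conv block corresponds to one level of representation described
# above: edges -> motifs -> object parts -> whole object.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(48, 48, 1)),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),   # edges
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),   # motifs
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(128, 3, activation="relu"),  # object parts
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),      # merge parts
    tf.keras.layers.Dense(7, activation="softmax"),     # 6 emotions + neutral
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```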
Tests
AlgoSample's main features are face registration, face recognition, object recognition, built-in camera/USB camera switching, AI Kit LED light control, and more. It can be built with Android Studio. I use it to detect faces and add them to the database. The captured dataset then needs to be trained with an OpenCV training algorithm. The idea is to create a database of face emotions: capture images of the face, compare them with the basic ones, and determine the emotion.
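A minimal sketch of that training step, assuming the captured faces are stored as grayscale images grouped in one folder per emotion label (the directory layout is my assumption; LBPH is one OpenCV recognizer suitable for this, available in opencv-contrib):

```python
import os
import cv2
import numpy as np

# Assumed layout: dataset/<emotion_label>/<image>.png
DATASET_DIR = "dataset"

images, labels, label_names = [], [], []
for label_id, emotion in enumerate(sorted(os.listdir(DATASET_DIR))):
    label_names.append(emotion)
    folder = os.path.join(DATASET_DIR, emotion)
    for fname in os.listdir(folder):
        img = cv2.imread(os.path.join(folder, fname), cv2.IMREAD_GRAYSCALE)
        if img is not None:
            images.append(cv2.resize(img, (200, 200)))
            labels.append(label_id)

# Train an LBPH recognizer on the captured dataset and save the model.
recognizer = cv2.face.LBPHFaceRecognizer_create()
recognizer.train(images, np.array(labels))
recognizer.save("emotions.yml")

# Later, a new face crop is compared against the basic emotions:
# label_id, distance = recognizer.predict(new_face_gray)
```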