Emotion recognition with FER+ ONNX Model

Hi all,

I am experimenting with getting the FER+ emotion recognition model working with Sentis. I’ve got Unity hooked up with the webcam and the model loaded, but unfortunately I keep getting a “neutral” result regardless of facial expression. The output numbers do vary slightly, but not by much. This is the code I am using:

// Copy the current webcam frame into a readable texture
Texture2D texture2D = new Texture2D(webcamTexture.width, webcamTexture.height);
texture2D.SetPixels(webcamTexture.GetPixels());
texture2D.Apply();

// Resize to the 64x64 single-channel input FER+ expects
TextureTransform transform = new TextureTransform().SetDimensions(64, 64, 1);
TensorFloat inputTensor = TextureConverter.ToTensor(texture2D, transform);

// Run inference and read the emotion scores back to the CPU
worker.Execute(inputTensor);
TensorFloat outputTensor = worker.PeekOutput() as TensorFloat;
float[] values = outputTensor.ToReadOnlyArray();

I’m quite new to AI so I’m probably missing something! Any pointers would be greatly appreciated.
Thanks
Liam


Seems like you’re sending it colour values in the range 0…1, whereas from the documentation it looks like it’s expecting values in the range 0…255 (well, to be fair, the documentation is unclear about what it expects!).

Seems like TextureTransform() could do with a way to specify an output range for your tensor, e.g. [-1…1], [0…1], [0…255]. That would be a useful feature to have (unless I missed it).


Yes, looking at the model, it expects input in the 0-255 range, which it then re-normalizes internally to between -1 and 1.
In Unity, texture values are in the 0-1 range, so it makes sense that your model doesn’t produce the right results.
You have two solutions to this (a sketch of the first one follows the list):

  • You can use an Op, cf. the ExecuteOperatorOnTensor sample:
op.Mul(tensorInput, new TensorFloat(255f));
  • You can edit the imported Model:
model.constants.Add(new Constant("newconstant_name", new TensorFloat(255f)));
model.layers.Insert(0, new Layers.Mul("newlayer_name", model_input, "newconstant_name"));
model.layers[1].inputs[0] = "newlayer_name";
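
For reference, here’s a minimal sketch of the Op approach, assuming the Ops API from the ExecuteOperatorOnTensor sample (the allocator and backend names may differ in your Sentis version):

// Create an Ops object for running individual operations on tensors
ITensorAllocator allocator = new TensorCachingAllocator();
Ops ops = WorkerFactory.CreateOps(BackendType.GPUCompute, allocator);

// Scale the 0..1 texture values up to the 0..255 range the model expects
using TensorFloat scale = new TensorFloat(255f);
TensorFloat scaledInput = ops.Mul(inputTensor, scale);
worker.Execute(scaledInput);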

Thanks for these replies! Adding the line to multiply the tensor by 255 seemed to help a bit. It is still erring towards neutral as a result, though.

Do you think I need to do any more processing of the image? Will the TextureTransform automatically make it monochrome, or should I do that manually?

Thanks
Liam

Which line(s) did you add?

I found the second option works fine after Model model = ModelLoader.Load(…);

// Add a constant scale factor of 255 to the model
model.constants.Add(new Unity.Sentis.Layers.Constant("scale_factor", new TensorFloat(255)));
// Insert a Mul layer at the front that scales the input
model.layers.Insert(0, new Unity.Sentis.Layers.Mul("scaled_input", "Input3", "scale_factor"));
// Point the first original layer at the scaled input instead
model.layers[1].inputs[0] = "scaled_input";

(Or replace “Input3” with whichever is the name of the input in your model).
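
If you’re not sure of the input name, you can log it after loading the model (a quick check; in Sentis 1.x the inputs are listed on model.inputs):

// Print the model's input names so you know what to wire the Mul layer to
foreach (var input in model.inputs)
    Debug.Log(input.name);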

I tried it (not with a webcam, just with images of different exaggerated faces as textures) and got “happy” and “angry” to come up top.
So if it works with images but still not with the webcam, you are right, it might be due to a noisy or dark webcam image (or even an upside-down one!). Also, it’s an old model, so it’s not state of the art.

Hope you get it working. Here are the images I used to test it:

happy: [image: exaggerated smiling face]
angry: [image: exaggerated angry face]
:hushed: Not subtle lol.


Hi yoonitee,

Thanks for your help! I tried it out with the two test images you attached and it works perfectly! So this must mean there’s too much going on in the webcam photo. I’m thinking my next step will be to use a model that can detect a face in the picture, then use that to crop the webcam image before feeding it into the emotion recognition model. Roughly something like the sketch below.
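
(faceRect here is a made-up placeholder for whatever rectangle a face-detection model returns, in pixel coordinates:)

// Hypothetical: faceRect comes from a separate face-detection step
RectInt faceRect = new RectInt(120, 80, 256, 256);

// Copy just the face region out of the webcam frame
Color[] facePixels = texture2D.GetPixels(faceRect.x, faceRect.y, faceRect.width, faceRect.height);
Texture2D faceTexture = new Texture2D(faceRect.width, faceRect.height);
faceTexture.SetPixels(facePixels);
faceTexture.Apply();

// Then feed the cropped face to FER+ as before (64x64, single channel)
TensorFloat faceTensor = TextureConverter.ToTensor(faceTexture, new TextureTransform().SetDimensions(64, 64, 1));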

Liam


Hi!

I also ran into this issue while using the GoogleNet age and gender classification models and Inception v2.
Scaling the input by 255 got me one step further, yet many of the results I get still seem wrong. The GoogleNet documentation mentions that the models expect data in BGR format. Could this be the issue?

In that case, try TextureTransform.SetChannelSwizzle(ChannelSwizzle.BGRA).
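
For example (a sketch; the 224x224 size is an assumption based on the usual GoogleNet/Inception input, so use whatever your model expects):

// Swizzle RGBA -> BGRA so BGR-trained models see channels in the expected order
TextureTransform transform = new TextureTransform()
    .SetDimensions(224, 224, 3)
    .SetChannelSwizzle(ChannelSwizzle.BGRA);
TensorFloat inputTensor = TextureConverter.ToTensor(texture2D, transform);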

Thanks. That somewhat improved my results, but I will re-evaluate my test set, as Inception still gives me mostly wrong outputs.

In addition to the BGR issue, I’ve come across this super-resolution model and played around with it. It accepts images in RGB format, but it seems to convert them internally into YCbCr format, which is also what the output is in. Are conversions to and from this (and other) formats possible? If not, will they be in the future?

I’d suggest writing your own compute shader to do the conversion; that’s probably the best approach.
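
For reference, the per-pixel math is the standard BT.601 conversion; a compute shader version is the same arithmetic per pixel. A CPU-side sketch (assuming Y, Cb, Cr in 0..1 with Cb/Cr centred on 0.5):

// BT.601 YCbCr -> RGB, all values in 0..1
Color YCbCrToRgb(float y, float cb, float cr)
{
    float r = y + 1.402f * (cr - 0.5f);
    float g = y - 0.344136f * (cb - 0.5f) - 0.714136f * (cr - 0.5f);
    float b = y + 1.772f * (cb - 0.5f);
    return new Color(Mathf.Clamp01(r), Mathf.Clamp01(g), Mathf.Clamp01(b));
}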

Hey @liamgh, I used that FER dataset from Kaggle five years ago. That dataset is very unbalanced and unclean: many images have the wrong label (bad for both training and classification), others have large black regions that make them useless, and there are other artifacts.
So even if your code is working well, it may simply be the unclean dataset the model was trained on that leads to wrong classifications during inference.


@Christin2015 might you know of other open-source, pre-trained ONNX models trained on cleaner data, and perhaps more state of the art? If anyone else happens to know, please let me know!

I’m also exploring FER/emotion recognition models and am looking for something that can reliably output scores, at least for happy/sad.

Edit: According to this article, FER+ is Microsoft’s extension of the original Kaggle competition dataset - a reviewed and cleaned-up version of the labels.

Best,
Richard
