What makes emotion recognition so difficult for AI models?
On episode 104 of WebRTC Live, Yahia Salman breaks down why detecting emotions remains a major challenge for VLMs and AI in general.
Watch the full episode and read highlights on the WebRTC.ventures blog: https://webrtc.ventures/2025/07/watch-webrtc-live-104-why-vision-language-models-deserve-a-closer-look/
“I worked on the paper that the research I was talking about earlier was in emotion recognition and things like that. One insight, one of the biggest insights I got regarding emotions, is that a lot of times, the emotions are so subtle you feel them way stronger in your head than you feel them on your face. If you think back to a time you were really mad, you won't sit there like that for five minutes or something like that. It won't really happen. You'll probably just sit there with a blank expression. So if you gave it somebody that's like this, I'm so sure I'd be able to tell they were angry or somebody smiling that they're happy but in the real world in terms of application based emotions a very difficult thing because we don't show it a lot of times and we don't show what we're thinking a lot of times.”
#visionlanguagemodels #emotiondetection #aiemotions #webrtc
On episode 104 of WebRTC Live, Yahia Salman breaks down why detecting emotions remains a major challenge for VLMs and AI in general.
Watch the full episode and read highlights on the WebRTC.ventures blog: https://webrtc.ventures/2025/07/watch-webrtc-live-104-why-vision-language-models-deserve-a-closer-look/
“I worked on the paper that the research I was talking about earlier was in emotion recognition and things like that. One insight, one of the biggest insights I got regarding emotions, is that a lot of times, the emotions are so subtle you feel them way stronger in your head than you feel them on your face. If you think back to a time you were really mad, you won’t sit there like that for five minutes or something like that. It won’t really happen. You’ll probably just sit there with a blank expression. So if you gave it somebody that’s like this, I’m so sure I’d be able to tell they were angry or somebody smiling that they’re happy but in the real world in terms of application based emotions a very difficult thing because we don’t show it a lot of times and we don’t show what we’re thinking a lot of times.”
#visionlanguagemodels #emotiondetection #aiemotions #webrtc