CSE-GResNet: A Simple and Highly Efficient Network for Facial Expression Recognition

An Efficient Expression Recognition Network Based on Gabor Convolution: CSE-GResNet Academic Background Facial Expression Recognition (FER) is an important research direction in the field of computer vision, with wide applications in social robots, healthcare, social psychology, customer service, and psychoanalysis. Facial expressions are natural a...

Phonetically-Anchored Domain Adaptation for Cross-Lingual Speech Emotion Recognition

Phonetic-Anchored Domain Adaptation in Cross-Lingual Speech Emotion Recognition Academic Background Speech Emotion Recognition (SER) has broad application prospects in intelligent agents, social robots, voice assistants, and automated call center systems. With the development of globalization, the demand for cross-lingual SER is increasing. However...

Facial 3D Regional Structural Motion Representation Using Lightweight Point Cloud Networks for Micro-Expression Recognition

3D Regional Structural Motion Representation Using Lightweight Point Cloud Networks for Micro-Expression Recognition Academic Background Micro-expressions (MEs) are brief and subtle facial expressions in human emotional expression, typically lasting between 1⁄25 and 1⁄5 of a second. Due to their spontaneity, rapidity, and difficulty to control, mic...

Multi-scale Hyperbolic Contrastive Learning for Cross-subject EEG Emotion Recognition

Cross-Subject EEG Emotion Recognition Research Based on Multi-Scale Hyperbolic Contrastive Learning Academic Background Electroencephalography (EEG), as a physiological signal, plays an important role in the field of affective computing. Compared with traditional non-physiological cues (such as facial expressions or voice), EEG signals have higher ...

Multimodal Sentiment Analysis with Mutual Information-Based Disentangled Representation Learning

Disentangled Representation Learning in Multimodal Sentiment Analysis Using Mutual Information: An Innovative Study Academic Background With the rapid development of social media, the amount of user-generated multimedia content (such as tweets and videos) has increased dramatically. These multimedia data typically include three modalities: visual (...

Spectro-Temporal Modulations Incorporated Two-Stream Robust Speech Emotion Recognition

Research on Two-Stream Robust Speech Emotion Recognition Based on Spectro-Temporal Modulation Features Academic Background Speech Emotion Recognition (SER) is a technology that identifies emotions by analyzing the emotional content in human speech. It has broad application potential in areas such as human-computer interaction, customer service mana...