Key Issues and Application Analysis of Multimodal Emotion Recognition
DOI:
https://doi.org/10.54097/qt2d7e63Keywords:
Multimodal emotion recognition, data quality and coverage, semantic understanding and modal association, Application.Abstract
This article mainly explores the key issues of multimodal emotion recognition and analyzes some applications. It discusses the key issues from two core problems: data quality and coverage, semantic understanding and modal association. In the issue of data quality coverage, it is pointed out whether the data is good enough and comprehensive enough. In semantic understanding and modal association, it is pointed out whether the semantics are clear and whether they can be coordinated with other modalities. It also combines two core issues to discover the problems that occur in multimodal scenarios. At the application level, research on learning emotion recognition for the education field is listed. By real-time judgment of learning concentration and emotional state, it provides support for personalized teaching. The design and application of multimodal emotion recognition technology in security systems, the role of multimodal emotion recognition in security, MemoCMT: Multimodal emotion recognition is achieved through feature fusion based on cross-modal converters. By using an innovative feature fusion strategy, the three application aspects of multimodal emotion features are effectively integrated.
Downloads
References
[1] Sepideh Kalateh, Luis a. Estrada-jimenez, Sanaz Nikghadam-hojjati, et al. A systematic review on multimodal emotionrecognition: building blocks, current state, applications, and challenges. Centre of Technology and Systems (CTS-UNINOVA),2829-516 Caparica, Portugal Associated Laboratory on Intelligent Systems (LASI), 2829-516 Caparica, Portugal Department of Electrical Engineering, NOVA School of Science and Technology, NOVA University of Lisbon, 1099-085 Lisbon, Portugal,2024.
[2] Xiaofei Zhu, Chenyao Li, Xu Chen, et al. Multimodal conversation sentiment recognition based on modal de-heterogeneity and adaptive fusion. Chongqing University of Technology, 2025.
[3] Changzeng Fu, Fengkui Qian, Kaifeng Su, et al. HiMul-LGG: A hierarchical decision fusion-based local–global graph neural network for multimodal emotion recognition in conversation. Hebei Key Laboratory of Marine Perception Network and Data Processing,2025.
[4] Wei Ai1, Fuchen Zhang1, Yuntao Shou1, et al. Revisiting multimodal emotion recognition in conversation from the perspective of graph spectrum. college of computer and mathematics, central south university of forestry and technology, 410004, china. college of computer science and electronic engineering, hunan university, 410082, China Department of Computer Science, State University of New York, 12561, USA, 2010, 30(1): 158-160,2025.
[5] Jiaxing Zhao, Xihan Wei, Liefeng Bo. R1-Omni: explainable omni-multimodal emotion recognition with reinforcement learning. Tongyi Lab, Alibaba Group, 2025.
[6] Xiaofei Luo. A review of multimodal human-computer interaction evaluation. Hangzhou Guyun Business Consulting Co., LTD, 2025.
[7] Manju Priya Arthanarisamy Ramaswamy, Suja Palaniswamy. Multimodal emotion recognition: A comprehensive review, trends, and challenges. Multimodal emotion recognition: A comprehensive review, trends, and challenges, 2024.
[8] Yumei Tan, Shuxiang Song, Haiying Xia. A review of learning emotion recognition research in the field of education. key laboratory of integrated circuits and microsystems, Guangxi Normal University, Guangxi Higher Education Institutions, 2025.
[9] Qingqing Li. A design and application of multimodal emotion recognition technology in security systems. Jiangsu Fushite Electrical Technology Co, LTD,2025.
[10] Mustaqeem Khan, Phuong-NamTran, Nhat Truong Pham, et al. MemoCMT: multimodal emotion recognition using cross-modal transformer-based feature fusion. Scientific Reports, 2025, 15;5473.
[11] Karthik Parvathinathan, Sudarshana Karkala, Sazzad Hossain, et al. Multimodal emotion recognition from text and audio using cross-attention fusion. Federal University Oye Ekiti, 2025.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







