Modalities Missing in Multimodal Sentiment Analysis
DOI:
https://doi.org/10.54097/pjawnx15Keywords:
Multimodal sentiment analysis, Modal absence, Feature fusion, Model framework optimization.Abstract
In recent years, sentiment analysis has been widely applied in various fields such as smart home, intelligent healthcare, and education. Multiple technologies such as eye movement recognition and text analysis are employed as means to determine the emotional state of users. However, during this process, factors such as sensor failures, privacy restrictions, and environmental disturbances can all lead to the problem of modal absence in sentiment analysis. Traditional multimodal approaches often fail to handle the correlations between different modalities during feature extraction, and when integrating features, they often neglect multi-scale emotional cues. This research aims to summarize typical multimodal feature fusion models. Based on four optimized models developed in recent years, it aims to summarize and outline the technical breakthroughs in the emerging model architectures in addressing the issue of modality absence in multimodal sentiment analysis, and also makes assumptions and prospects for future multimodal sentiment analysis in dealing with modality absence.
Downloads
References
[1] Diraco G, Rescio G, Siciliano P, et al. Review on human action recognition in smart living: Sensing technology, multimodality, real-time processing, interoperability, and resource-constrained processing. Sensors, 2023, 23(11): 5281.
[2] Wu R, Wang H, Chen H T, et al. Deep multimodal learning with missing modality: A survey. arXiv preprint arXiv:2409.07825, 2024.
[3] Ahmad Z, Jaffri Z A, Chen M, et al. Understanding GANs: Fundamentals, variants, training challenges, applications, and open problems. Multimedia Tools and Applications, 2025, 84(12): 10347-10423.
[4] Xu P, Zhu X, Clifton D A. Multimodal learning with transformers: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(10): 12113-12132.
[5] Wu Z, Gong Z, Koo J, et al. Multimodal multi-loss fusion network for sentiment analysis. arXiv preprint arXiv:2308.00264, 2023.
[6] Guo Z, Jin T, Zhao Z. Multimodal prompt learning with missing modalities for sentiment analysis and emotion recognition. arXiv preprint arXiv:2407.05374, 2024.
[7] Li M, Yang D, Zhang L. Towards robust multimodal sentiment analysis under uncertain signal missing. IEEE Signal Processing Letters, 2023, 30: 1497-1501.
[8] Liu Z, Zhou B, Chu D, et al. Modality translation-based multimodal sentiment analysis under uncertain missing modalities. Information Fusion, 2024, 101: 101973.
[9] Yang H, Zhao Y, Wu Y, et al. Large language models meet text-centric multimodal sentiment analysis: A survey. arXiv preprint arXiv:2406.08068, 2024.
[10] Zong Y, Mac Aodha O, Hospedales T. Self-supervised multimodal learning: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







