Exploration of Medicine

Table 1. Deep learning applications for facial-expression-based emotion recognition.

Approach/Model^†	Description	Strengths (relevance to APA)	Main limitations	Ref.
CNN	Extraction of spatial features from static facial images for emotion classification or regression	Effective detection of facial muscle activation patterns (AUs); suitable for baseline pain/no-pain discrimination	Limited ability to capture temporal dynamics; reduced performance in real-world conditions (occlusions, variability)	[24]
Hybrid CNN	A combination of convolutional feature extraction with attention mechanisms focusing on salient facial regions	Improved discrimination of subtle expressions; better robustness to noise and inter-individual variability	Increased architectural complexity; requires large annotated datasets	[25]
Transformer-based models (TFE, Swin)	Attention-based architectures model global dependencies and dynamically focus on informative regions	Strong robustness to occlusions and pose variations; improved generalization across datasets	High computational cost; data-intensive training	[28–30]
CNN + temporal models (TCN, LSTM, 3D CNN)	Integration of spatial feature extraction with temporal modeling of video sequences and facial dynamics	Capture of microexpressions and temporal evolution of pain-related facial patterns; critical for continuous and real-time APA	Requires temporally annotated datasets; higher computational burden	[32–34]

Return to article view

^† These approaches differ in their ability to capture static versus dynamic emotional features, with temporal models being particularly relevant for continuous pain assessment in clinical settings. CNN: convolutional neural network; LSTM: long short-term memory; TFE: transformer facial encoder; 3D CNN: three-dimensional CNN; TCN: Temporal Convolutional Network; AUs: action units; APA: automatic pain assessment.

Declarations

Author contributions

MC: Resources, Data curation, Formal analysis, Software. AZ: Resources, Data curation, Formal analysis, Software. OP: Conceptualization, Formal analysis, Data curation. CG: Investigation, Visualization. FDL: Formal analysis, Supervision. V Cerrone and AZ: Validation, Formal analysis, Writing—original draft, Writing—review & editing. AF: Methodology, Investigation. V Conti: Software, Formal analysis. GP and SC: Methodology, Software. FS, RDF, and DE: Conceptualization, Formal analysis. MPB: Validation, Investigation. All authors read and approved the submitted version.

Conflicts of interest

Marco Cascella, who is the Editorial Board Member and Guest Editor of Exploration of Medicine, had no involvement in the decision-making or the review process of this manuscript. The other authors declare no conflicts of interest.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent to publication

Not applicable.

Availability of data and materials

This study does not involve original data; all data analyzed are publicly available and have been appropriately cited.

Funding

This research received no external funding.

Copyright

Publisher’s note

Open Exploration maintains a neutral stance on jurisdictional claims in published institutional affiliations and maps. All opinions expressed in this article are the personal views of the author(s) and do not represent the stance of the editorial team or the publisher.

References

Scherer KR, Moors A. The Emotion Process: Event Appraisal and Component Differentiation. Annu Rev Psychol. 2019;70:719–45. [DOI] [PubMed]

Jack RE, Garrod OG, Yu H, Caldara R, Schyns PG. Facial expressions of emotion are not culturally universal. Proc Natl Acad Sci U S A. 2012;109:7241–4. [DOI] [PubMed] [PMC]

Posner J, Russell JA, Peterson BS. The circumplex model of affect: an integrative approach to affective neuroscience, cognitive development, and psychopathology. Dev Psychopathol. 2005;17:715–34. [DOI] [PubMed] [PMC]

Raja SN, Carr DB, Cohen M, Finnerup NB, Flor H, Gibson S, et al. The revised International Association for the Study of Pain definition of pain: concepts, challenges, and compromises. Pain. 2020;161:1976–82. [DOI] [PubMed] [PMC]

Wiech K. Deconstructing the sensation of pain: The influence of cognitive processes on pain perception. Science. 2016;354:584–7. [DOI] [PubMed]

Murphy SE, Downham C, Cowen PJ, Harmer CJ. Direct effects of diazepam on emotional processing in healthy volunteers. Psychopharmacology (Berl). 2008;199:503–13. [DOI] [PubMed] [PMC]

Cascella M, Bimonte S, Saettini F, Muzio MR. The challenge of pain assessment in children with cognitive disabilities: Features and clinical applicability of different observational tools. J Paediatr Child Health. 2019;55:129–35. [DOI] [PubMed]

Picard RW. Affective computing. Cambridge (MA): The MIT Press; 1997. [DOI]

Cowen AS, Keltner D. Self-report captures 27 distinct categories of emotion bridged by continuous gradients. Proc Natl Acad Sci U S A. 2017;114:E7900–9. [DOI] [PubMed] [PMC]

10.

Schuller B, Steidl S, Batliner A, Epps J, Eyben F, Ringeval F, et al. The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. Proc Interspeech. 2014:427–31. [DOI]

11.

Brigham TJ. Merging Technology and Emotions: Introduction to Affective Computing. Med Ref Serv Q. 2017;36:399–407. [DOI]

12.

Cascella M, Schiavo D, Cuomo A, Ottaiano A, Perri F, Patrone R, et al. Artificial Intelligence for Automatic Pain Assessment: Research Methods and Perspectives. Pain Res Manag. 2023;2023:6018736. [DOI] [PubMed] [PMC]

13.

Gkikas S, Tsiknakis M. Automatic assessment of pain based on deep learning methods: A systematic review. Comput Methods Programs Biomed. 2023;231:107365. [DOI]

14.

Gkikas S, Tachos NS, Andreadis S, Pezoulas VC, Zaridis D, Gkois G, et al. Multimodal automatic assessment of acute pain through facial videos and heart rate signals utilizing transformer-based architectures. Front Pain Res (Lausanne). 2024;5:1372814. [DOI] [PubMed] [PMC]

15.

Kunz M, Lautenbacher S, LeBlanc N, Rainville P. Are both the sensory and the affective dimensions of pain encoded in the face? Pain. 2012;153:350–8. [DOI] [PubMed]

16.

Barrett LF, Adolphs R, Marsella S, Martinez AM, Pollak SD. Emotional Expressions Reconsidered: Challenges to Inferring Emotion From Human Facial Movements. Psychol Sci Public Interest. 2019;20:1–68. [DOI] [PubMed] [PMC]

17.

Colares WG, Costa MGF, Costa Filho CFF. Enhancing Emotion Recognition: A Dual-Input Model for Facial Expression Recognition Using Images and Facial Landmarks. Annu Int Conf IEEE Eng Med Biol Soc. 2024;2024:1–5. [DOI] [PubMed]

18.

Cascella M, Vitale VN, Mariani F, Iuorio M, Cutugno F. Development of a binary classifier model from extended facial codes toward video-based pain recognition in cancer patients. Scand J Pain. 2023;23:638–45. [DOI] [PubMed]

19.

Saarimäki H, Glerean E, Smirnov D, Mynttinen H, Jääskeläinen IP, Sams M, et al. Classification of emotion categories based on functional connectivity patterns of the human brain. NeuroImage. 2022;247:118800. [DOI]

20.

Li S, Deng W. Deep Facial Expression Recognition: A Survey. IEEE Trans Affect Comput. 2022;13:1195–215. [DOI]

21.

Krichen M. Convolutional Neural Networks: A Survey. Computers. 2023;12:151. [DOI]

22.

Vinay BV, M R, Math S. Comprehensive Study of Low Light Facial Recognition Based on Conventional and Deep Learning Models. In: 2025 3rd International Conference on Data Science and Network Security (ICDSNS). 2025. pp. 1–4. [DOI]

23.

Caruccio L, Polese G, Tortora G, Iannone D. EDCAR: A knowledge representation framework to enhance automatic video surveillance. Expert Syst Appl. 2019;131:190–207. [DOI]

24.

Cascella M, Shariff MN, Lo Bianco G, Monaco F, Gargano F, Simonini A, et al. Employing the Artificial Intelligence Object Detection Tool YOLOv8 for Real-Time Pain Detection: A Feasibility Study. J Pain Res. 2024;17:3681–96. [DOI] [PubMed] [PMC]

25.

Wang W, Jia M. A facial expression recognition network based on attention double branch enhanced fusion. PeerJ Comput Sci. 2024;10:e2266. [DOI] [PubMed] [PMC]

26.

Liang X, Xu L, Zhang W, Zhang Y, Liu J, Liu Z. A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition. Vis Comput. 2022;39:2277–90. [DOI]

27.

Kim J, Lee D. Facial expression recognition robust to occlusion and to intra-similarity problem using relevant subsampling. Sensors. 2023;23:2619. [DOI] [PubMed] [PMC]

28.

Gao J, Zhao Y. TFE: A transformer architecture for occlusion aware facial expression recognition. Front Neurorobot. 2021;15:763100. [DOI] [PubMed] [PMC]

29.

Huang ZY, Chiang CC, Chen JH, Chen YC, Chung HL, Cai YP, et al. A study on computer vision for facial emotion recognition. Sci Rep. 2023;13:8425. [DOI] [PubMed] [PMC]

30.

Li Y, Liu H, Liang J, Jiang D. Occlusion-robust facial expression recognition based on multi-angle feature extraction. Appl Sci. 2025;15:5139. [DOI]

31.

Zhang Y, Zheng X, Liang C, Hu J, Deng W. Generalizable facial expression recognition. In: Computer Vision – ECCV 2024: 18th European Conference; 2024 Sep 29–Oct 4; Milan, Italy. Berlin, Heidelberg: Springer-Verlag; 2024. pp. 231–48. [DOI]

32.

Aly M. Revolutionizing online education: Advanced facial expression recognition for real-time student progress tracking via deep learning model. Multimed Tools Appl. 2025;84:12575–614. [DOI]

33.

Duraj A, Szczepaniak PS, Sadok A. Detection of Anomalies in Data Streams Using the LSTM-CNN Model. Sensors (Basel). 2025;25:1610. [DOI] [PubMed] [PMC]

34.

Wu Y, Mi Q, Gao T. A Comprehensive Review of Multimodal Emotion Recognition: Techniques, Challenges, and Future Directions. Biomimetics (Basel). 2025;10:418. [DOI] [PubMed] [PMC]

35.

Pordoy J, Farman H, Dicheva NK, Anwar A, Nasralla MM, Khilji N, et al. Multi-frame transfer learning framework for facial emotion recognition in e-learning contexts. IEEE Access. 2024;12:151360–81. [DOI]

36.

Elgendi M, Galli V, Ahmadizadeh C, Menon C. Dataset of psychological scales and physiological signals collected for anxiety assessment using a portable device. Data. 2022;7:132. [DOI]

37.

Reiss A, Indlekofer I, Schmidt P, Van Laerhoven K. Deep PPG: large-scale heart rate estimation with convolutional neural networks. Sensors. 2019;19:3079. [DOI] [PubMed] [PMC]

38.

Nooh S, Ragab M, Aboalela R, Al-Ghamdi AA, Abdulkader OA, Alghamdi G. An exploratory analysis of longitudinal artificial intelligence for cognitive fatigue detection using neurophysiological based biosignal data. Sci Rep. 2025;15:15736. [DOI] [PubMed] [PMC]

39.

Cascella M, Di Gennaro P, Crispo A, Vittori A, Petrucci E, Sciorio F, et al. Advancing the integration of biosignal-based automated pain assessment methods into a comprehensive model for addressing cancer pain. BMC Palliat Care. 2024;23:198. [DOI] [PubMed] [PMC]

40.

Moscato S, Orlandi S, Giannelli A, Ostan R, Chiari L. Automatic pain assessment on cancer patients using physiological signals recorded in real-world contexts. Annu Int Conf IEEE Eng Med Biol Soc. 2022;2022:1931–4. [DOI] [PubMed]

41.

Cascella M, Vitale VN, D’Antò M, Cuomo A, Amato F, Romano M, et al. Exploring Biosignals for Quantitative Pain Assessment in Cancer Patients: A Proof of Concept. Electronics. 2023;12:3716. [DOI]

42.

Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J. LSTM: a search space odyssey. IEEE Trans Neural Netw Learn Syst. 2017;28:2222–32. [DOI]

43.

Banerjee R, Ghose A. Synthesis of realistic ECG waveforms using a composite generative adversarial network for classification of atrial fibrillation. In: 2021 29th European Signal Processing Conference (EUSIPCO). 2021. pp. 1145–9. [DOI]

44.

Engel E, Hudy C, Li L, Schleusner R. Multi-modal transfer learning for dynamic facial emotion recognition in the wild. arXiv [Preprint]. 2025 [cited 2026 May 5]. Available from: https://arxiv.org/pdf/2504.21248v1

45.

Li Q, Li Q, Cakmak AS, Da Poian G, Bliwise DL, Vaccarino V, et al. Transfer learning from ECG to PPG for improved sleep staging from wrist-worn wearables. Physiol Meas. 2021;42:044004. [DOI]

46.

Ezzat A, Omer OA, Mohamed US, Mubarak AS. ECG signal reconstruction from PPG using a hybrid attention-based deep learning network. EURASIP J Adv Signal Process. 2024;2024:95. [DOI]

47.

Fadhlullah BR, Nuha HH, Putrada AG. Quality control for PPG-based hypertension detection: an imbalance-aware deep learning approach. In: 2025 International Seminar on Intelligent Technology and Its Applications (ISITIA); 2025. pp. 795–800. [DOI]

48.

Zhang H, Wang Z, Zhuang Y, Yin S, Chen Z, Liang Y. Assessment of mental workload level based on PPG signal fusion continuous wavelet transform and cardiopulmonary coupling technology. Electronics. 2024;13:1238. [DOI]

49.

Luo Y, Chen Y, Salekin A, Rahman T. Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals. ArXiv [Preprint]. 2025 [cited 2026 Feb 24]. Available from: https://doi.org/10.48550/arXiv.2412.09758

50.

Webster MB, Lee D, Lee J. Self-supervised autoencoder network for robust heart rate extraction from noisy photoplethysmogram: Applying blind source separation to biosignal analysis. Comput Biol Med. 2025;199:111319. [DOI]

51.

Napoletano F, Cutugno F, Cascella M, Maffia M. Can you describe your pain? Combining psycho-emotional and pragmatic analysis on cancer patients’ utterances. Int J Linguist. 2025;17:86–103. [DOI]

52.

Calvo RA, D’Mello S. Affect Detection: An Interdisciplinary Review of Models, Methods, and Their Applications. IEEE Trans Affect Comput. 2010;1:18–37. [DOI]

53.

Fisher H, Jaffe N, Pidvirny K, Tierney A, Pizzagalli D, Webb C. Using Natural Language Processing to Track Negative Emotions in the Daily Lives of Adolescents. Res Sq [Preprint]. 2025 [cited 2026 Feb 24]. Available from: https://doi.org/10.21203/rs.3.rs-6414400/v1

54.

Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv [Preprint]. 2019 [cited 2026 Feb 24]. Available from: https://doi.org/10.48550/arXiv.1907.11692

55.

Mohammad SM, Turney PD. Crowdsourcing a Word–Emotion Association Lexicon. Comput Intell. 2013;29:436–65. [DOI]

56.

Godard R, Holtzman S. The multidimensional lexicon of emojis: a new tool to assess the emotional content of emojis. Front Psychol. 2022;13:921388. [DOI]

57.

Shen J, Zhang S, Tong Y, Dong X, Wang X, Fu G, et al. Establishment and psychometric characteristics of emotional words list for suicidal risk assessment in speech emotion recognition. Front Psychiatry. 2022;13:1022036. [DOI]

58.

Zhang T, Schoene AM, Ji S, Ananiadou S. Natural language processing applied to mental illness detection: a narrative review. NPJ Digit Med. 2022;5:46. [DOI]

59.

Sakai H, Lam SS. Large Language Models for Health Care Text Classification: Systematic Review. JMIR AI. 2026;5:e79202. [DOI]

60.

Baltrušaitis T, Ahuja C, Morency LP. Multimodal Machine Learning: A Survey and Taxonomy. IEEE Trans Pattern Anal Mach Intell. 2019;41:423–43. [DOI]

61.

Kächele M, Thiam P, Amirian M, Werner P, Walter S, Schwenker F, et al. Multimodal Data Fusion for Person-Independent, Continuous Estimation of Pain Intensity. In: Iliadis L, Jayne C, editors. Engineering Applications of Neural Networks. EANN 2015. Communications in Computer and Information Science. Springer, Cham; 2015. pp. 275–85. [DOI]

62.

Costantini G, Iaderola I, Paoloni A, Todisco M. EMOVO corpus: an Italian emotional speech database. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14). 2014. pp. 3501–4.

63.

Cascella M, Cutugno F, Mariani F, Vitale VN, Iuorio M, Cuomo A, et al. AI-based cancer pain assessment through speech emotion recognition and video facial expressions classification. Signa Vitae. 2024;20;28–38. [DOI]

64.

Limbrecht-Ecklundt K, Werner P, Traue HC, Al-Hamadi A, Walter S. Mimic activity of differentiated pain intensities: Correlation of characteristics of Facial Action Coding System and electromyography. Schmerz. 2016;30:248–56. German. [DOI] [PubMed]

65.

Lopez-Martinez D, Picard R. Continuous Pain Intensity Estimation from Autonomic Signals with Recurrent Neural Networks. Annu Int Conf IEEE Eng Med Biol Soc. 2018;2018:5624–7. [DOI]

66.

Chen Y, Qiu Z, Meng F, Li H, Xu L, Wu Q. Leveraging pre-trained models for multimodal class-incremental learning under adaptive fusion. In: ICASSP 2025 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1–5. [DOI]

67.

Yang Z. Research on multi-source data fusion analysis model based on deep learning. In: 2025 5th Asia-Pacific Conference on Communications Technology and Computer Science (ACCTCS). 2025; pp. 1178–83. [DOI]

68.

Yue Y. Multimodal learning data fusion and analysis based on self-attention mechanism. In: 2025 IEEE 5th International Conference on Electronic Technology, Communication and Information (ICETCI). 2025. pp. 1040–7. [DOI]

69.

Zhang J, Xue S, Wang X, Liu J. Survey of multimodal sentiment analysis based on deep learning. In: 2023 IEEE 9th International Conference on Cloud Computing and Intelligent Systems (CCIS). 2023. pp. 446–50. [DOI]

70.

Shivappa ST, Trivedi MM, Rao BD. Audiovisual information fusion in human–computer interfaces and intelligent environments: a survey. Proc IEEE. 2010;98:1692–715. [DOI]

71.

Islam MM, Yasar MS, Iqbal T. MAVEN: a memory augmented recurrent approach for multimodal fusion. IEEE Trans Multimedia. 2023;25:3694–708. [DOI]

72.

Yu J, Pu J, Cheng Y, Feng R, Shan Y. Learning music-dance representations through explicit-implicit rhythm synchronization. IEEE Trans Multimedia. 2024;26:8454–63. [DOI]

73.

Wu Y, Jia T, Yang B, Li W, Yang T. Design of acquisition system based on RGB-D-T multi-modal images and research on alignment techniques. In: 2024 IEEE 14th International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER). 2024. pp. 260–5. [DOI]

74.

Freeman ED, Ipser A, Palmbaha A, Paunoiu D, Brown P, Lambert C, et al. Sight and sound out of synch: fragmentation and renormalisation of audiovisual integration and subjective timing. Cortex. 2013;49:2875–87. [DOI]

75.

Al-Dailami A, Kuang H, Wang J. Multimodal representation learning based on personalized graph-based fusion for mortality prediction using electronic medical records. Big Data Min Anal. 2025;8:933–50. [DOI]

76.

Cai L, Zeng W, Chen H, Zhang H, Li Y, Feng Y, et al. MM-GTUNets: unified multi-modal graph deep learning for brain disorders prediction. IEEE Trans Med Imaging. 2025;44:3705–16. [DOI]

77.

Wang J, Zhang O, Jiang Y. Multimodal diffusion framework for collaborative text image audio generation and applications. Sci Rep. 2025;15:20604. [DOI] [PubMed] [PMC]

78.

Ai W, Shou Y, Meng T, Li K. DER-GCN: dialog and event relation-aware graph convolutional neural network for multimodal dialog emotion recognition. IEEE Trans Neural Netw Learn Syst. 2025;36:4908–21. [DOI]

79.

Boie SD, Giesa N, Sekutowicz M, Zhumagambetov R, Haufe S, Grünewald E, et al. Multimodal data for predictive medicine: algorithmic fusion of clinical data in anesthesiology and intensive care. Front Med. 2026;13:1746867. [DOI]

80.

Khader F, Kather JN, Müller-Franzes G, Wang T, Han T, Tayebi Arasteh S, et al. Medical transformer for multimodal survival prediction in intensive care: integration of imaging and non-imaging data. Sci Rep. 2023;13:10666. [DOI]

81.

Xiang H, Zhang H, Cheng Y, Quan X, Huang W. SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis. IEEE J Biomed Health Inform. 2025;PP. [DOI]

82.

Intriago JA, Estevez P, Cortes-Briones JA, Okuma CA, Henriquez F, Lillo P, et al. Detecting early risk of Alzheimer’s disease using self-supervised multimodal representation learning. In: 2023 IEEE Conference on Artificial Intelligence (CAI). 2023. pp. 158–60. [DOI]

83.

Shinde RK, Sodhi A, Mane PB, Mehta H. Adaptive multimodal learning for robot decision-making in dynamic environments. In: 2025 11th International Conference on Control, Automation and Robotics (ICCAR); 2025. pp. 157–62. [DOI]

84.

Dimakatso T, Kuthadi V, Selvaraj R, Dinakenyane O. Pragmatic review on progressions in multimodal disease prediction with combination of machine learning, deep learning and electronic health records. In: 2024 IEEE 4th International Conference on ICT in Business Industry & Government (ICTBIG). 2024. pp. 1–7. [DOI]

85.

Lee CH, Kim H, Yoon BC, Kim DJ. Toward foundational model for sleep analysis using a multimodal hybrid-self-supervised learning framework. IEEE Trans Cybern. 2025;55:5619–32. [DOI]

86.

Li Y, Chen J, Li F, Fu B, Wu H, Ji Y, et al. GMSS: graph-based multi-task self-supervised learning for EEG emotion recognition. IEEE Trans Affect Comput. 2023;14:2512–25. [DOI]

87.

Rajan V, Brutti A, Cavallaro A. Robust latent representations via cross-modal translation and alignment. In: ICASSP 2021 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2021. pp. 4315–9. [DOI]

88.

Pa Aung KP, Yin HL, Ma TF, Zheng WL, Lu BL. A multimodal Myanmar emotion dataset for emotion recognition. In: 2024 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2024. pp. 1–4. [DOI]

89.

Gao P, Liu T, Liu JW, Lu BL, Zheng WL. Multimodal multi-view spectral-spatial-temporal masked autoencoder for self-supervised emotion recognition. In: ICASSP 2024 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2024. pp. 1926–30. [DOI]

90.

Hou M, Zhang Z, Liu C, Lu G. Semantic alignment network for multi-modal emotion recognition. IEEE Trans Circuits Syst Video Technol. 2023;33:5318–29. [DOI]

91.

Heo S, Kyung J, Chang JH. Multimodal emotion recognition with target speaker-based facial embeddings. In: ICASSP 2025 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2025. pp. 1–5. [DOI]

92.

Qiu F, Du H, Zhang W, Liu C, Li L, Guo T, et al. Learning transferable compound expressions from masked autoencoder pretraining. In: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); 2024. pp. 4733–41. [DOI]

93.

Hjuler MJ, Clemmensen LH, Das S. Exploring local interpretable model-agnostic explanations for speech emotion recognition with distribution-shift. In: ICASSP 2025 – IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2025. pp. 1–5. [DOI]

94.

Cascella M, Perri F, Ottaiano A, Santorsola M, Marciano ML, Rampetta FR, et al. Linking Cancer Pain Features and Biosignals for Automatic Pain Assessment. Cancers (Basel). 2026;18:646. [DOI] [PubMed] [PMC]

95.

Xu Y, Pinkney JNM, Yang YL, Shao T, Zhou K. Emotion amplification of facial videos using a fine-tuned StyleGAN. Comput Vis Media. 2025;11:587–601. [DOI]

96.

Cross JL, Choma MA, Onofrey JA. Bias in medical AI: Implications for clinical decision-making. PLOS Digit Health. 2024;3:e0000651. [DOI] [PubMed] [PMC]

97.

Han B, Yau C, Lei S, Gratch J. Knowledge-based emotion recognition using large language models. In: 2024 12th International Conference on Affective Computing and Intelligent Interaction (ACII); 2024. pp. 1–9. [DOI]

98.

Wang H, Kim DH. Graph Neural Network-Based Speech Emotion Recognition: A Fusion of Skip Graph Convolutional Networks and Graph Attention Networks. Electronics. 2024;13:4208. [DOI]

99.

Liu J, Li J, Dong J, Mo Z, Liu N, Li Q, et al. Adaptive Graph Learning with Multimodal Fusion for Emotion Recognition in Conversation. Biomimetics (Basel). 2025;10:414. [DOI] [PubMed] [PMC]

100.

Sukumar A, Desai A, Singhal P, Gokhale S, Jain DK, Walambe R, et al. Training against disguises: addressing and mitigating bias in facial emotion recognition with synthetic data. In: 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG). 2024. pp. 1–6. [DOI]

101.

Bodyanskiy Y, Kulishova N, Malysheva D. The multidimensional extended neo-fuzzy system and its fast learning for emotions online recognition. In: 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP); 2018. pp. 473–7. [DOI]

102.

Shehada D, Tawfik H, Bouridane A, Hussain A. An Explainable Framework for Mental Health Monitoring Using Lightweight and Privacy-Preserving Federated Facial Emotion Recognition. Sensors (Basel). 2025;25:7320. [DOI] [PubMed] [PMC]

103.

Chang SK, Deufemia V, Polese G, Vacca M. A Normalization Framework for Multimedia Databases. IEEE Trans Knowl Data Eng. 2007;19:1666–79. [DOI]

104.

Dzedzickis A, Kaklauskas A, Bucinskas V. Human Emotion Recognition: Review of Sensors and Methods. Sensors (Basel). 2020;20:592. [DOI] [PubMed] [PMC]

105.

Mohammad GB, Potluri S, Kumar A, A RK, P D, Tiwari R, et al. An Artificial Intelligence-Based Reactive Health Care System for Emotion Detections. Comput Intell Neurosci. 2022;2022:8787023. [DOI] [PubMed] [PMC]

106.

Montomoli J, Bitondo MM, Cascella M, Rezoagli E, Romeo L, Bellini V, et al. Algor-ethics: charting the ethical path for AI in critical care. J Clin Monit Comput. 2024;38:931–9. [DOI]

107.

Fang J, Wu W, Liu J, Zhang S. Deep learning-guided postoperative pain assessment in children. Pain. 2023;164:2029–35. [DOI] [PubMed] [PMC]