AudioVisual Video Summarization

Zhao, B; Gong, MG; Li, XL

Zhao, B (corresponding author), Xidian Univ, Acad Adv Interdisciplinary Res, Xian 710071, Peoples R China.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023; 34 (8): 5181

Abstract

Audio and vision are two main modalities in video data. Multimodal learning, especially for audiovisual learning, has drawn considerable attention rec......
