Abstract
Currently, datasets that support audio-visual recognition of people in videos are scarce and limited. In this paper, we introduce an expansion of vide......