Visual Question Answering Research on Multi-layer Attention Mechanism Based on Image Target Features

Cao, DY; Ren, X; Zhu, MG; Song, W

Cao, DY (corresponding author), North China Univ Technol, Sch Informat Sci & Technol, Beijing, Peoples R China.; Cao, DY (corresponding author), Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing, Peoples R China.

HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2021; 11 ():

Abstract

Visual question answering (VQA) aims to output a natural language answer based on a picture and a related question in order to achieve machine languag......

Full Text Link