Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention

Pan, XR; Ye, TZ; Xia, ZF; Song, SJ; Huang, G

Huang, G (corresponding author), Tsinghua Univ, BNRist, Dept Automat, Beijing, Peoples R China.

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 2082

Abstract

Self-attention mechanism has been a key factor in the recent progress of Vision Transformer (ViT), which enables adaptive feature extraction from glob......
