Jigsaw-ViT: Learning jigsaw puzzles in vision transformer

Chen, YY; Shen, X; Liu, YH; Tao, QH; Suykens, JAK

Chen, YY (corresponding author), Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium; Shen, X (corresponding author), Tencent AI Lab, Shenzhen, Peoples R China.

PATTERN RECOGNITION LETTERS, 2023; 166: 53

Abstract

The success of Vision Transformer (ViT) in various computer vision tasks has promoted the ever-increasing prevalence of this convolution-free network...
