
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Citation:
Baraldi, Lorenzo; Amoroso, Roberto; Cornia, Marcella; Baraldi, Lorenzo; Pilzer, Andrea; Cucchiara, Rita, "Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training", COMPUTER VISION AND IMAGE UNDERSTANDING, pp. 1-10, 2025