Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training


Citation:

Baraldi, Lorenzo; Amoroso, Roberto; Cornia, Marcella; Baraldi, Lorenzo; Pilzer, Andrea; Cucchiara, Rita "Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training" COMPUTER VISION AND IMAGE UNDERSTANDING, pp. 1-10, 2025

 not available

Paper download:

  • Author version: