AImageLab

Worldly eyes on video: Learnt vs. reactive deployment of attention to dynamic stimuli

Abstract: Computational visual attention is a hot topic in computer vision. However, most efforts are devoted to modelling saliency, whilst the actual eye guidance problem, which brings into play the sequence of gaze shifts characterising overt attention, is overlooked. Further, in those cases where the generation of gaze behaviour is considered, the stimuli of interest are by and large static (still images) rather than dynamic (videos). Under such circumstances, the work described in this note has a twofold aim: (i) addressing the problem of estimating and generating visual scan paths, that is, the sequences of gaze shifts over videos; (ii) investigating the effectiveness in scan path generation of features dynamically learned on the basis of human observers' attention dynamics, as opposed to bottom-up derived features. To this end, a probabilistic model is proposed. Using a publicly available dataset, our approach is compared against a model of scan path simulation that does not rely on a learning step.


Citation:

Cuculo, V.; D'Amelio, A.; Grossi, G.; Lanzarotti, R. "Worldly eyes on video: Learnt vs. reactive deployment of attention to dynamic stimuli" Image Analysis and Processing – ICIAP 2019, vol. 11751, Trento, pp. 128-138, 2019. DOI: 10.1007/978-3-030-30642-7_12
