Give Ear to My Face: Modelling Multimodal Attention to Social Interactions
Abstract: We address the deployment of perceptual attention to social interactions as displayed in conversational clips, when relying on multimodal information (audio and video). A probabilistic modelling framework is proposed that goes beyond the classic saliency paradigm while integrating multiple information cues. Attentional allocation is determined not just by stimulus-driven selection but, importantly, by social value as modulating the selection history of relevant multimodal items. Thus, the construction of attentional priority is the result of a sampling procedure conditioned on the potential value dynamics of socially relevant objects emerging moment to moment within the scene. Preliminary experiments on a publicly available dataset are presented.
Citation:
Boccignone, Giuseppe; Cuculo, Vittorio; D’Amelio, Alessandro; Grossi, Giuliano; Lanzarotti, Raffaella "Give Ear to My Face: Modelling Multimodal Attention to Social Interactions" Computer Vision – ECCV 2018 Workshops, vol. 11130, Munich, Germany, pp. 331 -345 , 2018, 2019 DOI: 10.1007/978-3-030-11012-3_27not available