Prof. Lorenzo Baraldi

Homepage:

http://www.lorenzobaraldi.com

Position at AImageLab:

Associate Professor
Dipartimento d'Ingegneria "Enzo Ferrari", Modena, Italy

Email:

lorenzo_DOT_baraldi_AT_unimore_DOT_it

Phone:

+39-059-2058790

Lorenzo Baraldi is an Associate Professor at the University of Modena and Reggio Emilia. He works with Prof. Rita Cucchiara on Deep Learning, Video Analysis and Multimedia, and teaches the courses Computer Architecture, Computer Vision and Cognitive Systems, Scalable AI, and AI for Automotive. His research has covered Egocentric Vision and Gesture Recognition, Temporal Video Segmentation and Retrieval, Saliency, Video Captioning, Visual-Semantic Alignment and Embodied AI.

He is the author of more than 120 publications in international journals and conferences, and serves as Associate Editor for Pattern Recognition and Pattern Recognition Letters and as Area Chair for major multimedia conferences. He has been elected as a Scholar in the ELLIS society, the European Laboratory for Learning and Intelligent Systems, and coordinates the Modena ELLIS Unit. Since 2021, he has served as deputy director of the Interdepartmental Center on Digital Humanities of the University of Modena and Reggio Emilia.

In 2016, together with Prof. Rita Cucchiara, Prof. Costantino Grana and Dr. Simone Calderara, he co-authored the winning proposal for the Facebook AI Research Partnership, through which AImageLab was selected as one of the 15 world-class research labs in Europe to receive a GPU-based server. In 2017, he worked at the Facebook AI Research laboratory in Paris, under the supervision of Hervé Jégou, where he developed a video copy detection algorithm that has been adopted in production on the social network.

He is a member of IEEE, ACM and CVPL, the Italian Association for Computer Vision, Pattern Recognition and Machine Learning.

Thesis proposals can be found at this link: https://www.lorenzobaraldi.com/thesis_proposals/

 

Curriculum vitae: download

Keywords: Deep Learning, Video Analysis, Image and Video Captioning, Saliency Prediction


Teaching

  • Computer Architecture
  • Computer Vision and Cognitive Systems
  • Scalable AI
  • AI for Automotive

Thesis supervision

  • Matteo Tomei (MSc, currently PhD Student) - Constrained image-to-image translation
  • Jørgen Wilhelmsen, Bjørn Hoxmark (MSc) - Active learning
  • Matteo Stefanini (MSc, currently Research Fellow) - Spectral pooling techniques
  • Angelo Carraggi (MSc) - Visual-semantic embeddings
  • Stefano Pini (MSc, currently PhD Student) - Linking people and objects with their proper names in videos
  • Gianluca Puglia (MSc) - Image and Video Captioning with Transferred Semantic Attributes
  • Federico Bolelli (MSc, currently PhD Student) - Connected Components Labeling
  • Marcella Cornia (MSc, currently PhD Student) - Deeply learned Saliency prediction
  • Fabio Pozzi (MSc) - Shot and scene detection in broadcast videos
  • Angelo Perri (BSc) - Optimization of convolution algorithms on GPU architectures
  • Dodiane Carole Ngatcha Nana (BSc) - Optimization of convolution algorithms on multicore architectures

Publications

1 Zini, Leonardo; Frigieri, Elia; Aloscari, Sebastiano; Baraldi, Lorenzo "vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models" Proceedings of the 39th Conference on Neural Information Processing Systems, San Diego, Dec 2nd - Dec 7th, 2025 Conference
2 Compagnoni, Alberto; Caffagni, Davide; Moratelli, Nicholas; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization" Proceedings of the 36th British Machine Vision Conference, Sheffield, UK, 24th - 27th November 2025, 2025 Conference
3 Cocchi, Federico; Moratelli, Nicholas; Caffagni, Davide; Sarto, Sara; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning" Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, Honolulu, Hawaii, Oct 19 – 23, 2025 Conference
4 Baraldi, Lorenzo; Bucciarelli, Davide; Betti, Federico; Cornia, Marcella; Baraldi, Lorenzo; Sebe, Nicu; Cucchiara, Rita "What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models" Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision, Honolulu, Hawaii, Oct 19 – 23, 2025, 2025 Conference
5 Barsellotti, Luca; Bianchi, Lorenzo; Messina, Nicola; Carrara, Fabio; Cornia, Marcella; Baraldi, Lorenzo; Falchi, Fabrizio; Cucchiara, Rita "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation" Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision, Honolulu, Hawaii, Oct 19 – 23, 2025, 2025 Conference
6 Pipoli, Vittorio; Saporita, Alessia; Bolelli, Federico; Cornia, Marcella; Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita; Ficarra, Elisa "MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models" Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision, Honolulu, Hawaii, Oct 19-23, 2025 Conference
7 Rawal, Niyati; Singh Maharjan, Rahul; Salici, Giacomo; Catalini, Riccardo; Romeo, Marta; Bigazzi, Roberto; Baraldi, Lorenzo; Vezzani, Roberto; Cucchiara, Rita; Cangelosi, Angelo "Multimodal Dialogue for Empathetic Human-Robot Interaction" Proceedings of the International Conference on Social Robotics, Naples, Italy, September 10-12, 2025 Conference
8 Saporita, Alessia; Pipoli, Vittorio; Bolelli, Federico; Baraldi, Lorenzo; Acquaviva, Andrea; Ficarra, Elisa "Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding" Proceedings of the 21st International Conference in Computer Analysis of Images and Patterns, Las Palmas de Gran Canaria, Spain, 22 - 25 Sep, 2025 Conference
9 Rawal, Niyati; Xia, Matteo; Tessaro, David; Baraldi, Lorenzo; Cucchiara, Rita "MATE: Multimodal Agent that Talks and Empathizes" Proceedings of the 23rd International Conference on Image Analysis and Processing, Rome, Italy, 15-19 September 2025, 2025 Conference
10 Caffagni, Davide; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval" Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville TN, June 11th - June 15th, 2025 Conference
11 Poppi, Tobia; Kasarla, Tejaswi; Mettes, Pascal; Baraldi, Lorenzo; Cucchiara, Rita "Hyperbolic Safety-Aware Vision-Language Models" Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville TN, June 11th - June 15th, 2025 Conference
12 Cocchi, Federico; Moratelli, Nicholas; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering" Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville TN, June 11th - June 15th, 2025 Conference
13 Sarto, Sara; Moratelli, Nicholas; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training" INTERNATIONAL JOURNAL OF COMPUTER VISION, pp. 1 -28 , 2025 | DOI: 10.1007/s11263-025-02535-y Journal
14 Parascandolo, Fiorenzo; Moratelli, Nicholas; Sangineto, Enver; Baraldi, Lorenzo; Cucchiara, Rita "Causal Graphical Models for Vision-Language Compositional Understanding" Proceedings of the 13th International Conference on Learning Representations, ICLR 2025, Singapore, pp. 41219 -41244 , Apr 24 - Apr 28th, 2025, 2025 Conference
15 Singh Maharjan, Rahul; Rawal, Niyati; Romeo, Marta; Baraldi, Lorenzo; Cucchiara, Rita; Cangelosi, Angelo "Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection" Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, Hyderabad, India, 6-11 April 2025, 2025 | DOI: 10.1109/ICASSP49660.2025.10888172 Conference
16 Pipoli, Vittorio; Bolelli, Federico; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita; Ficarra, Elisa "Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios" Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, Arizona, pp. 4968 -4977 , Feb 28 - Mar 4 2025, 2025 | DOI: 10.1109/WACV61041.2025.00486 Conference
17 Rawal, Niyati; Baraldi, Lorenzo; Cucchiara, Rita "AIGeN-Llama: An Adversarial Approach for Instruction Generation in VLN using Llama2 Model" Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science, IRCDL 2025, vol. 3937, Udine, ITALY, Feb 20-21, 2025, 2025 Conference
18 Amoroso, Roberto; Zhang, Gengyuan; Koner, Rajat; Baraldi, Lorenzo; Cucchiara, Rita; Tresp, Volker "Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries" Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2025, Tucson, Arizona, Feb 28 – Mar 4, 2025 Conference
19 Caffagni, Davide; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Augmenting and Mixing Transformers with Synthetic Data for Image Captioning" IMAGE AND VISION COMPUTING, vol. 162, pp. 1 -31 , 2025 | DOI: 10.1016/j.imavis.2025.105661 Journal
20 Baraldi, Lorenzo; Amoroso, Roberto; Cornia, Marcella; Pilzer, Andrea; Cucchiara, Rita "Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training" COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 252, pp. 1 -10 , 2025 | DOI: 10.1016/j.cviu.2025.104294 Journal
21 Baraldi, Lorenzo; Bucciarelli, Davide; Zeng, Zifan; Zhang, Chongzhe; Zhang, Qunli; Cornia, Marcella; Baraldi, Lorenzo; Liu, Feng; Hu, Zheng; Cucchiara, Rita "Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models" Proceedings of the 36th British Machine Vision Conference, Sheffield, UK, 24th - 27th November 2025, 2025 Conference
22 Rawal, Niyati "Integrazione di visione e linguaggio per l'interazione fisica e cognitiva uomo-robot" 2025 Other
23 Amoroso, Roberto "Architetture Multimodali Attentive di Deep Learning per la Comprensione Visivo-Semantica" 2025 Other
24 Barsellotti, Luca; Bigazzi, Roberto; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments" Proceedings of the 38th Conference on Neural Information Processing Systems, NeurIPS 2024, vol. 37, Vancouver, Canada, December 9-15, 2024, 2024 Conference
25 Moratelli, Nicholas; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Fluent and Accurate Image Captioning with a Self-Trained Reward Model" Proceedings of the 27th International Conference on Pattern Recognition, Kolkata, India, December 01-05, 2024, 2024 Conference
26 Cappelletti, Silvia; Baraldi, Lorenzo; Cocchi, Federico; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation" Proceedings of the 27th International Conference on Pattern Recognition, Kolkata, India, December 01-05, 2024, 2024 Conference
27 Moratelli, Nicholas; Caffagni, Davide; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization" Proceedings of the 35th British Machine Vision Conference, Glasgow, UK, 25th - 28th November 2024, 2024 Conference
28 Bucciarelli, Davide; Moratelli, Nicholas; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis" Proceedings of the European Conference on Computer Vision Workshops, Milan, Sep 29th - Oct 4th, 2024 Conference
29 Rawal, Niyati; Maharjan, Rahul Singh; Romeo, Marta; Bigazzi, Roberto; Baraldi, Lorenzo; Cucchiara, Rita; Cangelosi, Angelo "Intelligent Multimodal Artificial Agents that Talk and Express Emotions" Springer Proceedings in Advanced Robotics, Lugano, 30 September to 1 October 2024, 2024 Conference
30 Poppi, Samuele; Poppi, Tobia; Cocchi, Federico; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models" Proceedings of the European Conference on Computer Vision, Milan, Sep 29th - Oct 4th, 2024 Conference
31 Baraldi, Lorenzo; Cocchi, Federico; Cornia, Marcella; Baraldi, Lorenzo; Nicolosi, Alessandro; Cucchiara, Rita "Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities" Proceedings of the European Conference on Computer Vision, Milan, Sep 29th - Oct 4th, 2024 Conference
32 Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues" Proceedings of the European Conference on Computer Vision, Milan, Sep 29th - Oct 4th, 2024 Conference
33 Cucchiara, Rita; Baraldi, Lorenzo; Cornia, Marcella; Sarto, Sara "Video Surveillance and Privacy: A Solvable Paradox?" COMPUTER, vol. 57, pp. 91 -100 , 2024 | DOI: 10.1109/MC.2023.3316696 Journal
34 Caffagni, Davide; Cocchi, Federico; Barsellotti, Luca; Moratelli, Nicholas; Sarto, Sara; Baraldi, Lorenzo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "The Revolution of Multimodal Large Language Models: A Survey" FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, Bangkok, Thailand, pp. 13590 -13618 , August 11–16, 2024, 2024 Conference
35 Poppi, Samuele; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Multi-Class Unlearning for Image Classification via Weight Filtering" IEEE INTELLIGENT SYSTEMS, vol. 39, pp. 40 -47 , 2024 | DOI: 10.1109/MIS.2024.3412742 Journal
36 Rawal, Niyati; Bigazzi, Roberto; Baraldi, Lorenzo; Cucchiara, Rita "AIGeN: An Adversarial Approach for Instruction Generation in VLN" Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024, Seattle, pp. 2070 -2080 , 16th-22nd June 2024, 2024 | DOI: 10.1109/CVPRW63382.2024.00212 Conference
37 Caffagni, Davide; Cocchi, Federico; Moratelli, Nicholas; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs" Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024, Seattle, USA, pp. 1818 -1826 , Jun 17-21 2024, 2024 | DOI: 10.1109/CVPRW63382.2024.00188 Conference
38 Barsellotti, Luca; Amoroso, Roberto; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation" Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, pp. 3689 -3698 , 17th-21st June 2024, 2024 | DOI: 10.1109/CVPR52733.2024.00354 Conference
39 Amoroso, Roberto; Morelli, Davide; Cornia, Marcella; Baraldi, Lorenzo; Del Bimbo, Alberto; Cucchiara, Rita "Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images" ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS, vol. 21, pp. 1 -22 , 2024 | DOI: 10.1145/3665497 Journal
40 Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Nicolosi, Alessandro; Cucchiara, Rita "Towards Retrieval-Augmented Architectures for Image Captioning" ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS, vol. 20, pp. 1 -22 , 2024 | DOI: 10.1145/3663667 Journal
41 Bigazzi, Roberto; Baraldi, Lorenzo; Kousik, Shreyas; Cucchiara, Rita; Pavone, Marco "Mapping High-level Semantic Regions in Indoor Environments without Object Recognition" Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), vol. 2024, Yokohama, pp. 7686 -7693 , May 13th-17th, 2024, 2024 | DOI: 10.1109/ICRA57147.2024.10610897 Conference
42 Moratelli, Nicholas; Barraco, Manuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization" IEEE INTELLIGENT SYSTEMS, vol. 39, pp. 26 -34 , 2024 | DOI: 10.1109/MIS.2024.3386099 Journal
43 Cornia, Marcella; Baraldi, Lorenzo; Fiameni, Giuseppe; Cucchiara, Rita "Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets" INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 132, pp. 1701 -1720 , 2024 | DOI: 10.1007/s11263-023-01949-w Journal
44 Bernhard, Maximilian; Amoroso, Roberto; Kindermann, Yannic; Baraldi, Lorenzo; Cucchiara, Rita; Tresp, Volker; Schubert, Matthias "What’s Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU" Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, Hawaii, Jan 4-8, 2024 Conference
45 Barsellotti, Luca; Amoroso, Roberto; Baraldi, Lorenzo; Cucchiara, Rita "FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval" Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024, Waikoloa, Hawaii, pp. 1453 -1462 , Jan 4-8, 2024, 2024 | DOI: 10.1109/WACV57701.2024.00149 Conference
46 Betti, Federico; Baraldi, Lorenzo; Baraldi, Lorenzo; Cucchiara, Rita; Sebe, Nicu "Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection" Proceedings of the European Conference on Computer Vision Workshops, Milan, Sep 29th - Oct 4th, 2024 Conference
47 Poppi, Samuele; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Unlearning Vision Transformers without Retaining Data via Low-Rank Decompositions" Proceedings of the 27th International Conference on Pattern Recognition, Kolkata, India, December 01-05, 2024, 2024 Conference
48 Amoroso, Roberto; Tomei, Matteo; Baraldi, Lorenzo; Cucchiara, Rita "Superpixel Positional Encoding to Improve ViT-based Semantic Segmentation Models" Proceedings of the British Machine Vision Conference 2023, Aberdeen, UK, 20th - 24th November 2023, 2023 Conference
49 Betti, Federico; Staiano, Jacopo; Baraldi, Lorenzo; Baraldi, Lorenzo; Cucchiara, Rita; Sebe, Nicu "Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation" Proceedings of the 31st ACM International Conference on Multimedia - MM 2023, Ottawa, pp. 9306 -9312 , October 29 - November 3, 2023, 2023 | DOI: 10.1145/3581783.3612706 Conference
50 Cornia, Marcella; Baraldi, Lorenzo; Tal, Ayellet; Cucchiara, Rita "Fully-Attentive Iterative Networks for Region-based Controllable Image and Video Captioning" COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 237, pp. 1 -10 , 2023 | DOI: 10.1016/j.cviu.2023.103857 Journal
51 Barraco, Manuele; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning" Proceedings of the IEEE International Conference on Computer Vision, ICCV 2023, Paris, France, pp. 3009 -3019 , October 2-6, 2023, 2023 | DOI: 10.1109/ICCV51070.2023.00282 Conference
52 Cocchi, Federico; Baraldi, Lorenzo; Poppi, Samuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis" IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, vol. 14234, Udine, Italy, pp. 345 -356 , September 11-15, 2023, 2023 | DOI: 10.1007/978-3-031-43153-1_29 Conference
53 Poppi, Samuele; Rawal, Niyati; Bigazzi, Roberto; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Towards Explainable Navigation and Recounting" IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, vol. 14233, Udine, Italy, pp. 171 -183 , September 11-15, 2023, 2023 | DOI: 10.1007/978-3-031-43148-7_15 Conference
54 Caffagni, Davide; Barraco, Manuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning" Proceedings of the 22nd International Conference on Image Analysis and Processing, vol. 14233, Udine, Italy, pp. 112 -123 , September 11-15, 2023, 2023 | DOI: 10.1007/978-3-031-43148-7_10 Conference
55 Barsellotti, Luca; Amoroso, Roberto; Baraldi, Lorenzo; Cucchiara, Rita "Enhancing Open-Vocabulary Semantic Segmentation with Prototype Retrieval" Proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023, vol. 14234, Udine, Italy, pp. 196 -208 , September 11-15, 2023, 2023 | DOI: 10.1007/978-3-031-43153-1_17 Conference
56 Sarto, Sara; Barraco, Manuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation" Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, vol. 2023, Vancouver, Canada, pp. 6914 -6924 , Jun 18-22 2023, 2023 | DOI: 10.1109/CVPR52729.2023.00668 Conference
57 Moratelli, Nicholas; Barraco, Manuele; Morelli, Davide; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates" SENSORS, vol. 23, pp. 1 -16 , 2023 | DOI: 10.3390/s23031286 Journal
58 Bigazzi, Roberto; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Embodied Agents for Efficient Exploration and Smart Scene Description" Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), vol. 2023-May, London, pp. 6057 -6064 , 29 May - 2 June 2023, 2023 | DOI: 10.1109/ICRA48891.2023.10160668 Conference
59 Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cascianelli, Silvia; Fiameni, Giuseppe; Cucchiara, Rita "From Show to Tell: A Survey on Deep Learning-based Image Captioning" IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 45, pp. 539 -559 , 2023 | DOI: 10.1109/TPAMI.2022.3148210 Journal
60 Al Kalak, Matteo; Baraldi, Lorenzo "Sharing Cultural Heritage—The Case of the Lodovico Media Library" MULTIMODAL TECHNOLOGIES AND INTERACTION, vol. 7, pp. 1 -15 , 2023 | DOI: 10.3390/mti7120115 Journal
61 Pippi, V.; Cascianelli, S.; Baraldi, L.; Cucchiara, R. "Evaluating synthetic pre-Training for handwriting processing tasks" PATTERN RECOGNITION LETTERS, vol. 172, pp. 44 -50 , 2023 | DOI: 10.1016/j.patrec.2023.06.003 Journal
62 Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Retrieval-Augmented Transformer for Image Captioning" Proceedings of the 19th International Conference on Content-based Multimedia Indexing, CBMI 2022, Graz, Austria, pp. 1 -7 , SEP 14-16, 2022, 2022 | DOI: 10.1145/3549555.3549585 Conference
63 Messina, Nicola; Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Falchi, Fabrizio; Amato, Giuseppe; Cucchiara, Rita "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval" Proceedings of the 19th International Conference on Content-based Multimedia Indexing, Graz, Austria, pp. 64 -70 , Sept. 14-16, 2022, 2022 | DOI: 10.1145/3549555.3549576 Conference
64 Tomei, Matteo; Baraldi, Lorenzo; Fiameni, Giuseppe; Bronzin, Simone; Cucchiara, Rita "A Computational Approach for Progressive Architecture Shrinkage in Action Recognition" SOFTWARE, PRACTICE AND EXPERIENCE, vol. 52, pp. 537 -554 , 2022 | DOI: 10.1002/spe.3035 Journal
65 Cascianelli, Silvia; Pippi, Vittorio; Maarand, Martin; Cornia, Marcella; Baraldi, Lorenzo; Kermorvant, Christopher; Cucchiara, Rita "The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition" Proceedings of the 26th International Conference on Pattern Recognition, vol. 2022-, Montréal Québec, pp. 1506 -1513 , August 21-25, 2022, 2022 | DOI: 10.1109/ICPR56361.2022.9956189 Conference
66 Landi, Federico; Bigazzi, Roberto; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Spot the Difference: A Novel Task for Embodied Agents in Changing Environments" Proceedings of the 26th International Conference on Pattern Recognition, vol. 2022-, Montréal Québec, pp. 4182 -4188 , August 21-25, 2022, 2022 | DOI: 10.1109/ICPR56361.2022.9956538 Conference
67 Barraco, Manuele; Stefanini, Matteo; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "CaMEL: Mean Teacher Learning for Image Captioning" Proceedings of the 26th International Conference on Pattern Recognition, vol. 2022-, Montréal Québec, pp. 4087 -4094 , August 21-25, 2022, 2022 | DOI: 10.1109/ICPR56361.2022.9955644 Conference
68 Barraco, Manuele; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis" IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, vol. 2022-, New Orleans, Louisiana, pp. 4661 -4669 , June 19-24, 2022, 2022 | DOI: 10.1109/CVPRW56347.2022.00512 Conference
69 Bruno, Paolo; Amoroso, Roberto; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Investigating Bidimensional Downsampling in Vision Transformer Models" Proceedings of the 21st International Conference on Image Analysis and Processing, vol. 13232, Lecce, Italy, pp. 287 -299 , 23 - 27 May 2022, 2022 | DOI: 10.1007/978-3-031-06430-2_24 Conference
70 Bigazzi, Roberto; Landi, Federico; Cascianelli, Silvia; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Embodied Navigation at the Art Gallery" Proceedings of the 21st International Conference on Image Analysis and Processing, vol. 13231, Lecce, Italy, pp. 739 -750 , May 23-27, 2022, 2022 | DOI: 10.1007/978-3-031-06427-2_61 Conference
71 Cascianelli, Silvia; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions" INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, vol. 25, pp. 207 -217 , 2022 | DOI: 10.1007/s10032-022-00401-y Journal
72 Bigazzi, Roberto; Landi, Federico; Cascianelli, Silvia; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "Focus on Impact: Indoor Exploration with Intrinsic Motivation" IEEE ROBOTICS AND AUTOMATION LETTERS, vol. 7, pp. 2985 -2992 , 2022 | DOI: 10.1109/LRA.2022.3145971 Journal
73 Fenocchi, Emanuele; Morelli, Davide; Cornia, Marcella; Baraldi, Lorenzo; Cesari, Fabio; Cucchiara, Rita "Dual-Branch Collaborative Transformer for Virtual Try-On" Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, vol. 2022-, New Orleans, Louisiana, pp. 2246 -2250 , June 19-24, 2022, 2022 | DOI: 10.1109/CVPRW56347.2022.00246 Conference
74 Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Explaining Transformer-based Image Captioning Models: An Empirical Analysis" AI COMMUNICATIONS, vol. 35, pp. 111 -129 , 2022 | DOI: 10.3233/AIC-210172 Journal
75 Cornia, Marcella; Tomei, Matteo; Baraldi, Lorenzo; Cucchiara, Rita "Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach" ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS, vol. 18, pp. 1 -23 , 2022 | DOI: 10.1145/3490033 Journal
76 Tomei, Matteo; Baraldi, Lorenzo; Calderara, Simone; Bronzin, Simone; Cucchiara, Rita "RMS-Net: Regression and Masking for Soccer Event Spotting" Proceedings of the 25th International Conference on Pattern Recognition, Milan, Italy, pp. 7699 -7706 , 10-15 January 2021, 2021 | DOI: 10.1109/ICPR48806.2021.9412268 Conference
77 Cojocaru, Iulian; Cascianelli, Silvia; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions" Proceedings of the 25th International Conference on Pattern Recognition, Milan, Italy, pp. 6096 -6103 , 10-15 January 2021, 2021 | DOI: 10.1109/ICPR48806.2021.9412392 Conference
78 Bigazzi, Roberto; Landi, Federico; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Explore and Explain: Self-supervised Navigation and Recounting" Proceedings of the 25th International Conference on Pattern Recognition, Milan, Italy, pp. 1152 -1159 , 10-15 January 2021, 2021 | DOI: 10.1109/ICPR48806.2021.9412628 Conference
79 Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "A Novel Attention-based Aggregation Function to Combine Vision and Language" Proceedings of the 25th International Conference on Pattern Recognition, Milan, Italy, pp. 1212 -1219 , 10-15 January 2021, 2021 | DOI: 10.1109/ICPR48806.2021.9413269 Conference
80 Landi, Federico; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "Working Memory Connections for LSTM" NEURAL NETWORKS, vol. 144, pp. 334 -341 , 2021 | DOI: 10.1016/j.neunet.2021.08.030 Journal
81 Bigazzi, Roberto; Landi, Federico; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita "Out of the Box: Embodied Navigation in the Real World" Proceedings of the 19th International Conference on Computer Analysis of Images and Patterns, vol. 13052, Virtual, pp. 47 -57 , 27 September - 01 October 2021, 2021 | DOI: 10.1007/978-3-030-89128-2_5 Conference
82 Cascianelli, Silvia; Cornia, Marcella; Baraldi, Lorenzo; Piazzi, Maria Ludovica; Schiuma, Rosiana; Cucchiara, Rita "Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data" Proceedings of the 19th International Conference on Computer Analysis of Images and Patterns, vol. 13053, Virtual, pp. 340 -350 , 27 September - 01 October 2021, 2021 | DOI: 10.1007/978-3-030-89131-2_31 Conference
83 Amoroso, Roberto; Baraldi, Lorenzo; Cucchiara, Rita "Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation" COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2021, PT 1, vol. 13052 LNCS, Virtual, pp. 455 -465 , 27 September - 01 October 2021, 2021 | DOI: 10.1007/978-3-030-89128-2_44 Conference
84 Landi, Federico; Baraldi, Lorenzo; Cornia, Marcella; Corsini, Massimiliano; Cucchiara, Rita "Multimodal Attention Networks for Low-Level Vision-and-Language Navigation" COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 210, pp. 1 -10 , 2021 | DOI: 10.1016/j.cviu.2021.103255 Journal
85 Cagrandi, Marco; Cornia, Marcella; Stefanini, Matteo; Baraldi, Lorenzo; Cucchiara, Rita "Learning to Select: A Fully Attentive Approach for Novel Object Captioning" Proceedings of the ACM International Conference on Multimedia Retrieval, Taipei, Taiwan, pp. 437 -441 , August 21-24, 2021, 2021 | DOI: 10.1145/3460426.3463587 Conference
86 Amoroso, Roberto; Baraldi, Lorenzo; Cucchiara, Rita "Improving Indoor Semantic Segmentation with Boundary-level Objectives" Proceedings of the 16th International Work-conference on Artificial Neural Networks, vol. 12862, Online, pp. 318 -329 , June 16-18, 2021, 2021 | DOI: 10.1007/978-3-030-85099-9_26 Conference
87 Tomei, Matteo; Baraldi, Lorenzo; Bronzin, Simone; Cucchiara, Rita "Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition" 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Virtual, pp. 3257 -3263 , June 19-25, 2021, 2021 | DOI: 10.1109/CVPRW53098.2021.00364 Conference
88 Poppi, Samuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis" 2021 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Virtual, pp. 2299 -2304 , June 19-25, 2021, 2021 | DOI: 10.1109/CVPRW53098.2021.00260 Conference
89 Tomei, Matteo; Baraldi, Lorenzo; Calderara, Simone; Bronzin, Simone; Cucchiara, Rita "Video action detection by learning graph-based spatio-temporal interactions" COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 206, pp. 1 -9 , 2021 | DOI: 10.1016/j.cviu.2021.103187 Journal
90 Cornia, Marcella; Baraldi, Lorenzo; Tavakoli, Hamed R.; Cucchiara, Rita "A Unified Cycle-Consistent Neural Model for Text and Image Retrieval" MULTIMEDIA TOOLS AND APPLICATIONS, vol. 79, pp. 25697 -25721 , 2020 | DOI: 10.1007/s11042-020-09251-4 Journal
91 Cornia, Marcella; Stefanini, Matteo; Baraldi, Lorenzo; Cucchiara, Rita "Meshed-Memory Transformer for Image Captioning" 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), Seattle, WA, USA, pp. 10575 -10584 , June 14-19 2020, 2020 | DOI: 10.1109/CVPR42600.2020.01059 Conference
92 Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability" International Conference on Robotics and Automation, Paris, France, pp. 1128 -1134 , May 31 - June 4, 2020 | DOI: 10.1109/ICRA40945.2020.9196653 Conference
93 Bolelli, Federico; Cancilla, Michele; Baraldi, Lorenzo; Grana, Costantino "Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms" JOURNAL OF REAL-TIME IMAGE PROCESSING, vol. 17, pp. 229 -244 , 2020 | DOI: 10.1007/s11554-018-0756-1 Journal
94 Cornia, Marcella; Stefanini, Matteo; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Explaining Digital Humanities by Aligning Images and Textual Descriptions" PATTERN RECOGNITION LETTERS, vol. 129, pp. 166 -172 , 2020 | DOI: 10.1016/j.patrec.2019.11.018 Journal
95 Pierdicca, R.; Paolanti, M.; Frontoni, E.; Baraldi, L. "AI4AR: An AI-based Mobile Application for the Automatic Generation of AR Contents" AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, AVR 2020, PT I, vol. 12242, Italy, pp. 273 -288 , 2020, 2020 | DOI: 10.1007/978-3-030-58465-8_21 Conference
96 Bolelli, Federico; Allegretti, Stefano; Baraldi, Lorenzo; Grana, Costantino "Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling" IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 29, pp. 1999 -2012 , 2020 | DOI: 10.1109/TIP.2019.2946979 Journal
97 Pini, Stefano; Cornia, Marcella; Bolelli, Federico; Baraldi, Lorenzo; Cucchiara, Rita "M-VAD Names: a Dataset for Video Captioning with Naming" MULTIMEDIA TOOLS AND APPLICATIONS, vol. 78, pp. 14007 -14027 , 2019 | DOI: 10.1007/s11042-018-7040-z Journal
98 Landi, Federico; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters" Proceedings of 30th British Machine Vision Conference, Cardiff, UK, pp. 1 -12 , 9th-12th September 2019, 2019 Conference
99 Tomei, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis" Image Analysis and Processing – ICIAP 2019, Trento, Italy, pp. 741 -752 , 9-13 September, 2019, 2019 | DOI: 10.1007/978-3-030-30645-8_67 Conference
100 Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain" Image Analysis and Processing – ICIAP 2019, Trento, Italy, pp. 729 -740 , 9-13 September, 2019, 2019 | DOI: 10.1007/978-3-030-30645-8_66 Conference
101 Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "What was Monet seeing while painting? Translating artworks to photo-realistic images" Computer Vision – ECCV 2018 Workshops, Munich, Germany, 8-14 September 2018, 2019 | DOI: 10.1007/978-3-030-11012-3_46 Conference
102 Carraggi, Angelo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach" Computer Vision – ECCV 2018 Workshops, vol. 11134, Munich, Germany, pp. 625 -640 , 8-14 September 2018, 2019 | DOI: 10.1007/978-3-030-11024-6_47 Conference
103 Cornia, Marcella; Baraldi, Lorenzo; Rezazadegan Tavakoli, Hamed; Cucchiara, Rita "Towards Cycle-Consistent Models for Text and Image Retrieval" Computer Vision – ECCV 2018 Workshops, Munich, Germany, 8-14 September 2018, 2019 | DOI: 10.1007/978-3-030-11018-5_58 Conference
104 Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions" 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, vol. 2019-, Long Beach, CA, USA, pp. 8299 -8308 , June 16-20 2019, 2019 | DOI: 10.1109/CVPR.2019.00850 Conference
105 Tomei, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation" 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, vol. 2019-, Long Beach, CA, USA, pp. 5842 -5852 , June 16-20 2019, 2019 | DOI: 10.1109/CVPR.2019.00600 Conference
106 Alletto, Stefano; Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Recognizing social relationships from an egocentric vision perspective" MULTIMODAL BEHAVIOR ANALYSIS IN THE WILD: ADVANCES AND CHALLENGES, pp. 199 -224 , 2019 | DOI: 10.1016/B978-0-12-814601-9.00015-8 Chapter in Book
107 Stefanini, M.; Lancellotti, R.; Baraldi, L.; Calderara, S. "A Deep-learning-based approach to VM behavior Identification in Cloud Systems" CLOSER 2019 - Proceedings of the 9th International Conference on Cloud Computing and Services Science, Heraklion, Greece, pp. 308 -315 , May, 2019, 2019 | DOI: 10.5220/0007708403080315 Conference
108 Bolelli, Federico; Cancilla, Michele; Baraldi, Lorenzo; Grana, Costantino "Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes" Reproducible Research in Pattern Recognition, vol. 11455, Beijing, China, pp. 89 -93 , Aug 20-24, 2019 | DOI: 10.1007/978-3-030-23987-9_7 Conference
109 Bolelli, Federico; Baraldi, Lorenzo; Grana, Costantino "A Hierarchical Quasi-Recurrent approach to Video Captioning" 2018 IEEE International Conference on Image Processing, Applications and Systems (IPAS), Inria Sophia Antipolis, France, pp. 162 -167 , Dec 12-14, 2018 | DOI: 10.1109/IPAS.2018.8708893 Conference
110 Bolelli, Federico; Baraldi, Lorenzo; Cancilla, Michele; Grana, Costantino "Connected Components Labeling on DRAGs" 2018 24th International Conference on Pattern Recognition (ICPR), vol. 2018-, Beijing, China, pp. 121 -126 , Aug 20-24, 2018 | DOI: 10.1109/ICPR.2018.8545505 Conference
111 Baraldi, Lorenzo; Cornia, Marcella; Grana, Costantino; Cucchiara, Rita "Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities" Proceedings of the 24th International Conference on Pattern Recognition, Beijing, China, pp. 1097 -1102 , August 20th-24th, 2018, 2018 | DOI: 10.1109/ICPR.2018.8545064 Conference
112 Cornia, Marcella; Abati, Davide; Baraldi, Lorenzo; Palazzi, Andrea; Calderara, Simone; Cucchiara, Rita "Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era" INTELLIGENZA ARTIFICIALE, vol. 12, pp. 161 -175 , 2018 | DOI: 10.3233/IA-170033 Journal
113 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model" IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 27, pp. 5142 -5154 , 2018 | DOI: 10.1109/TIP.2018.2851672 Journal
114 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "SAM: Pushing the Limits of Saliency Prediction Models" 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, pp. 1971 -1973 , June 18-22 2018, 2018 | DOI: 10.1109/CVPRW.2018.00250 Conference
115 Baraldi, Lorenzo; Douze, Matthijs; Cucchiara, Rita; Jégou, Hervé "LAMV: Learning to align and match videos with kernelized temporal layers" 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, pp. 7804 -7813 , June 18-22, 2018 | DOI: 10.1109/CVPR.2018.00814 Conference
116 Cornia, Marcella; Pini, Stefano; Baraldi, Lorenzo; Cucchiara, Rita "Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts" Digital Libraries and Multimedia Archives, vol. 806, Udine, pp. 169 -179 , January 25-26, 2018, 2018 | DOI: 10.1007/978-3-319-73165-0_17 Conference
117 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention" ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS, vol. 14, pp. 1 -21 , 2018 | DOI: 10.1145/3177745 Journal
118 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks" IEEE TRANSACTIONS ON MULTIMEDIA, vol. 19, pp. 955 -968 , 2017 | DOI: 10.1109/TMM.2016.2644872 Journal
119 Pini, Stefano; Ben Ahmed, Olfa; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita; Huet, Benoit "Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild" Proceedings of the 19th ACM International Conference on Multimodal Interaction, Glasgow, Scotland, pp. 536 -543 , November 13-17th, 2017, 2017 | DOI: 10.1145/3136755.3143006 Conference
120 Cornia, Marcella; Abati, Davide; Baraldi, Lorenzo; Palazzi, Andrea; Calderara, Simone; Cucchiara, Rita "Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era" AI*IA 2017 Advances in Artificial Intelligence, vol. 10640, Bari, Italy, pp. 387 -399 , November 14-17, 2017, 2017 | DOI: 10.1007/978-3-319-70169-1_29 Conference
121 Pini, Stefano; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach" Image Analysis and Processing - ICIAP 2017, vol. 10485, Catania, Italy, pp. 384 -395 , 11-15 September 2017, 2017 | DOI: 10.1007/978-3-319-68548-9_36 Conference
122 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Visual Saliency for Image Captioning in New Multimedia Services" Multimedia & Expo Workshops (ICMEW), 2017 IEEE International Conference on, Hong Kong, pp. 309 -314 , July 10-14, 2017, 2017 | DOI: 10.1109/ICMEW.2017.8026277 Conference
123 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Hierarchical Boundary-Aware Neural Encoder for Video Captioning" Computer Vision and Pattern Recognition (CVPR), 2017 IEEE Conference on, vol. 2017-, Honolulu, Hawaii, pp. 3185 -3194 , July 22-25, 2017 | DOI: 10.1109/CVPR.2017.339 Conference
124 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use" Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, Florence, Italy, 19-21 June 2017, 2017 | DOI: 10.1145/3095713.3095735 Conference
125 Corbelli, Andrea; Baraldi, Lorenzo; Balducci, Fabrizio; Grana, Costantino; Cucchiara, Rita "Layout analysis and content classification in digitized books" Digital Libraries and Multimedia Archives, vol. 701, Firenze, pp. 153 -165 , Feb. 4-5, 2017 | DOI: 10.1007/978-3-319-56300-8_14 Conference
126 Grana, C.; Baraldi, L. "Preface" Communications in Computer and Information Science, vol. 733, Modena, Italy, pp. 5 -6 , 26-27 January 2017, 2017 Conference
127 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "A Video Library System Using Scene Detection and Automatic Tagging" Digital Libraries and Archives, vol. 733, Modena, January 26-27, 2017, 2017 | DOI: 10.1007/978-3-319-68130-6_5 Conference
128 Grana, Costantino; Bolelli, Federico; Baraldi, Lorenzo; Vezzani, Roberto "YACCLAB - Yet Another Connected Components Labeling Benchmark" 2016 23rd International Conference on Pattern Recognition (ICPR), Cancun, Mexico, pp. 3109 -3114 , Dec 4-8, 2016 | DOI: 10.1109/ICPR.2016.7900112 Conference
129 Corbelli, Andrea; Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Historical Document Digitization through Layout Analysis and Deep Content Classification" Proceedings of the 23rd International Conference on Pattern Recognition, Cancun, Mexico, 4-8 Dec 2016, 2016 | DOI: 10.1109/ICPR.2016.7900272 Conference
130 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "A Deep Multi-Level Network for Saliency Prediction" Pattern Recognition (ICPR), 2016 23rd International Conference on, Cancun, Mexico, pp. 3488 -3493 , 4-8 Dec 2016, 2016 | DOI: 10.1109/ICPR.2016.7900174 Conference
131 Grana, Costantino; Baraldi, Lorenzo; Bolelli, Federico "Optimized Connected Components Labeling with Pixel Prediction" Advanced Concepts for Intelligent Vision Systems, vol. 10016, Lecce, Italy, pp. 431 -440 , Oct 24-27, 2016 | DOI: 10.1007/978-3-319-48680-2_38 Conference
132 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Multi-Level Net: a Visual Saliency Prediction Model" Computer Vision – ECCV 2016 Workshops, vol. 9914, Amsterdam, The Netherlands, pp. 302 -315 , October 9th, 2016, 2016 | DOI: 10.1007/978-3-319-48881-3_21 Conference
133 Paci, Francesco; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita; Benini, Luca "Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager" Computer Vision – ECCV 2016 Workshops, vol. 9913, Amsterdam, The Netherlands, pp. 589 -602 , October 8-10, 2016, 2016 | DOI: 10.1007/978-3-319-46604-0_42 Conference
134 Baraldi, Lorenzo; Grana, Costantino; Messina, Alberto; Cucchiara, Rita "A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation" Proceedings of the 2016 ACM on Multimedia Conference, Amsterdam, The Netherlands, pp. 733 -734 , 15 - 19 October 2016, 2016 | DOI: 10.1145/2964284.2973825 Conference
135 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features" Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, New York, USA, pp. 23 -29 , 6-9 June 2016, 2016 | DOI: 10.1145/2911996.2912012 Conference
136 Baraldi, Lorenzo; Grana, Costantino; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita "Shot, scene and keyframe ordering for interactive video re-use" Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, vol. 4, Rome, pp. 626 -631 , Feb 27-29, 2016, 2016 | DOI: 10.5220/0005768706260631 Conference
137 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection" Digital Libraries on the Move, vol. 612, Bolzano, pp. 155 -164 , Jan. 29-30, 2016 | DOI: 10.1007/978-3-319-41938-1_16 Conference
138 Baraldi, L.; Grana, C.; Borghi, G.; Vezzani, R.; Cucchiara, R. "Shot, scene and keyframe ordering for interactive video re-use" VISIGRAPP 2016 - Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, vol. 2016, Rome, pp. 626 -631 , Feb 27-29, 2016, 2016 | DOI: 10.5220/0005768706260631 Conference
139 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "A Deep Siamese Network for Scene Detection in Broadcast Videos" Proceedings of the 23rd ACM international conference on Multimedia, Brisbane, Australia, pp. 1199 -1202 , 26-30 October 2015, 2015 | DOI: 10.1145/2733373.2806316 Conference
140 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video" Computer Analysis of Images and Patterns. Part I, vol. 9256, Valletta, Malta, pp. 801 -811 , 2-4 September 2015, 2015 | DOI: 10.1007/978-3-319-23192-1_67 Conference
141 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Scene segmentation using temporal clustering for accessing and re-using broadcast video" Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2015-, Torino, Italy, pp. 1 -6 , 2015, 2015 | DOI: 10.1109/ICME.2015.7177476 Conference
142 Baraldi, Lorenzo; Paci, Francesco; Serra, Giuseppe; Cucchiara, Rita "Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences" IEEE SENSORS JOURNAL, vol. 15, pp. 2705 -2714 , 2015 | DOI: 10.1109/JSEN.2015.2411994 Journal
143 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Measuring scene detection performance" Pattern Recognition and Image Analysis, vol. 9117, Santiago de Compostela, Spain, pp. 395 -403 , 17-19 June 2015, 2015 | DOI: 10.1007/978-3-319-19390-8_45 Conference
144 Baraldi, Lorenzo; Paci, Francesco; Serra, Giuseppe; Benini, Luca; Cucchiara, Rita "Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation" Computer Vision and Pattern Recognition Workshops (CVPRW), 2014 IEEE Conference on, Columbus, Ohio, pp. 702 -707 , 23-28 June 2014, 2014 | DOI: 10.1109/CVPRW.2014.107 Conference
145 Serra, Giuseppe; Camurri, Marco; Baraldi, Lorenzo; Benedetti, Michela; Cucchiara, Rita "Hand Segmentation for Gesture Recognition in EGO-Vision" Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, Barcelona, Spain, pp. 31 -36 , 21 October 2013, 2013 | DOI: 10.1145/2505483.2505490 Conference