Davide Caffagni

Homepage:

https://github.com/dcaffo98

Position at AImageLab:

PhD Student
Dipartimento d'Ingegneria "Enzo Ferrari", Modena, Italy

Email:

davide_DOT_caffagni_AT_unimore_DOT_it



Publications

1 Caffagni, Davide; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita. "Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, June 11-15, 2025. (Conference)
2 Caffagni, Davide; Cocchi, Federico; Mambelli, Anna; Tutrone, Fabio; Zanella, Marco; Cornia, Marcella; Cucchiara, Rita. "Benchmarking BERT-based Models for Latin: A Case Study on Biblical References in Ancient Christian Literature." Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science, Udine, Italy, February 20-21, 2025. (Conference)
3 Moratelli, Nicholas; Caffagni, Davide; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita. "Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization." Proceedings of the 35th British Machine Vision Conference, Glasgow, UK, November 25-28, 2024. (Conference)
4 Caffagni, Davide; Cocchi, Federico; Barsellotti, Luca; Moratelli, Nicholas; Sarto, Sara; Baraldi, Lorenzo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita. "The Revolution of Multimodal Large Language Models: A Survey." Proceedings of the Annual Meeting of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand, pp. 13590-13618, August 11-16, 2024. (Conference)
5 Caffagni, Davide; Cocchi, Federico; Moratelli, Nicholas; Sarto, Sara; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita. "Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs." Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024, Seattle, USA, pp. 1818-1826, June 17-21, 2024. DOI: 10.1109/CVPRW63382.2024.00188. (Conference)
6 Caffagni, Davide; Barraco, Manuele; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita. "SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning." Proceedings of the 22nd International Conference on Image Analysis and Processing, vol. 14233, Udine, Italy, pp. 112-123, September 11-15, 2023. DOI: 10.1007/978-3-031-43148-7_10. (Conference)