Dr. Lorenzo Baraldi

Homepage:

http://www.lorenzobaraldi.com

Position at Imagelab:

Assistant Professor (RTD)
Dipartimento d'Ingegneria "Enzo Ferrari", Modena, Italy

Email:

lorenzo_DOT_baraldi_AT_unimore_DOT_it

Phone:

+39-059-2058790

Lorenzo Baraldi

Lorenzo Baraldi is an Assistant Professor (RTD-A) at AImageLab. He works under the supervision of Prof. Rita Cucchiara on Deep Learning, Video Analysis and Multimedia. His research has covered Egocentric Vision and Gesture Recognition, Temporal Video Segmentation and Retrieval, Saliency Prediction, Video Captioning and Visual-Semantic Alignment.

He has served as a reviewer for IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), IEEE Transactions on Multimedia, IEEE Transactions on Image Processing, Computer Vision and Image Understanding, and IEEE Transactions on Human-Machine Systems. In 2017, together with Prof. Costantino Grana, he organized the 13th Italian Research Conference on Digital Libraries. He is a member of the Program Committee for ACM Multimedia 2017-2019, in the Multimedia Search and Recommendation track. He has also served as a reviewer for AVSS 2017, ICCV 2017 and 2019, and CVPR 2018 and 2019.

In 2016, together with Prof. Rita Cucchiara, Prof. Costantino Grana and Dr. Simone Calderara, he co-authored the winning proposal for the Facebook AI Research Partnership, through which AImageLab was selected as one of the 15 world-class research labs in Europe to receive a GPU-based server. In 2017 he worked at the FAIR (Facebook AI Research) lab in Paris, under the supervision of Hervé Jégou and Matthijs Douze.

As part of the Città Educante project, he developed NeuralStory, an interactive multimedia system for video indexing and re-use. He develops and maintains Speaksee, a PyTorch library developed at AImageLab that provides utilities for working with Visual-Semantic data.

He is a member of IEEE, ACM and CVPL, the Italian Association for Computer Vision, Pattern Recognition and Machine Learning.

Curriculum vitae: download

Keywords: Deep Learning, Video Analysis, Image and Video Captioning, Saliency Prediction


Teaching

  • Vision and Cognitive Systems
  • Fundamentals of Computer Science I
  • Neural Network Computing, AI and Machine Learning for Automotive

Theses supervision

Go to the list of available theses

  • Matteo Tomei (MSc, currently PhD Student) - Constrained image-to-image translation
  • Jørgen Wilhelmsen, Bjørn Hoxmark (MSc) - Active learning
  • Matteo Stefanini (MSc, currently Research Fellow) - Spectral pooling techniques
  • Angelo Carraggi (MSc) - Visual-semantic embeddings
  • Stefano Pini (MSc, currently PhD Student) - Linking people and objects with their proper names in videos
  • Gianluca Puglia (MSc) - Image and Video Captioning with Transferred Semantic Attributes
  • Federico Bolelli (MSc, currently PhD Student) - Connected Components Labeling
  • Marcella Cornia (MSc, currently PhD Student) - Deeply learned Saliency prediction
  • Fabio Pozzi (MSc) - Shot and scene detection in broadcast videos
  • Angelo Perri (BSc) - Optimization of convolution algorithms on GPU architectures
  • Dodiane Carole Ngatcha Nana (BSc) - Optimization of convolution algorithms on multicore architectures

Publications

1 Cornia, Marcella; Stefanini, Matteo; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Explaining Digital Humanities by Aligning Images and Textual Descriptions" PATTERN RECOGNITION LETTERS, pp. 1 -8 , 2019 Journal
2 Landi, Federico; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters" Proceedings of 30th British Machine Vision Conference, Cardiff, UK, 9th-12th September 2019, 2019 Conference
3 Tomei, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis" Image Analysis and Processing ICIAP 2019, Trento, Italy, pp. 741 -752 , 9-13 September, 2019, 2019 | DOI: 10.1007/978-3-030-30645-8_67 Conference
4 Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Corsini, Massimiliano; Cucchiara, Rita "Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain" Image Analysis and Processing ICIAP 2019, Trento, Italy, pp. 729 -740 , 9-13 September, 2019, 2019 | DOI: 10.1007/978-3-030-30645-8_66 Conference
5 Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions" 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, June 16-20, 2019 Conference
6 Tomei, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation" 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, June 16-20, 2019 Conference
7 Alletto, Stefano; Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Recognizing social relationships from an egocentric vision perspective" Multimodal Behavior Analysis in the Wild, pp. 199 -224 , 2019 | DOI: 10.1016/B978-0-12-814601-9.00015-8 Chapter in Book
8 Stefanini, M.; Lancellotti, R.; Baraldi, L.; Calderara, S. "A Deep-learning-based approach to VM behavior Identification in Cloud Systems" CLOSER 2019 - Proceedings of the 9th International Conference on Cloud Computing and Services Science, Heraklion, Greece, pp. 308 -315 , May, 2019, 2019 Conference
9 Bolelli, Federico; Allegretti, Stefano; Baraldi, Lorenzo; Grana, Costantino "Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling" IEEE TRANSACTIONS ON IMAGE PROCESSING, pp. 1 -14 , 2019 | DOI: 10.1109/TIP.2019.2946979 Journal
10 Bolelli, Federico; Cancilla, Michele; Baraldi, Lorenzo; Grana, Costantino "Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes" Reproducible Research in Pattern Recognition, vol. 11455, Beijing, China, pp. 89 -93 , Aug 20-24, 2019 | DOI: 10.1007/978-3-030-23987-9_7 Conference
11 Pini, Stefano; Cornia, Marcella; Bolelli, Federico; Baraldi, Lorenzo; Cucchiara, Rita "M-VAD Names: a Dataset for Video Captioning with Naming" MULTIMEDIA TOOLS AND APPLICATIONS, vol. 78, pp. 14007 -14027 , 2018 | DOI: 10.1007/s11042-018-7040-z Journal
12 Bolelli, Federico; Baraldi, Lorenzo; Grana, Costantino "A Hierarchical Quasi-Recurrent approach to Video Captioning" 2018 IEEE International Conference on Image Processing, Applications and Systems (IPAS), Inria Sophia Antipolis, France, pp. 162 -167 , Dec 12-14, 2018 | DOI: 10.1109/IPAS.2018.8708893 Conference
13 Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita "What was Monet seeing while painting? Translating artworks to photo-realistic images" Computer Vision ECCV 2018 Workshops, Munich, Germany, 8-14 September 2018, 2018 | DOI: 10.1007/978-3-030-11012-3_46 Conference
14 Carraggi, Angelo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach" Computer Vision ECCV 2018 Workshops, Munich, Germany, 8-14 September 2018, 2018 | DOI: 10.1007/978-3-030-11024-6_47 Conference
15 Cornia, Marcella; Baraldi, Lorenzo; Rezazadegan Tavakoli, Hamed; Cucchiara, Rita "Towards Cycle-Consistent Models for Text and Image Retrieval" Computer Vision ECCV 2018 Workshops, Munich, Germany, 8-14 September 2018, 2018 | DOI: 10.1007/978-3-030-11018-5_58 Conference
16 Bolelli, Federico; Baraldi, Lorenzo; Cancilla, Michele; Grana, Costantino "Connected Components Labeling on DRAGs" 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, China, pp. 121 -126 , Aug 20-24, 2018 | DOI: 10.1109/ICPR.2018.8545505 Conference
17 Baraldi, Lorenzo; Cornia, Marcella; Grana, Costantino; Cucchiara, Rita "Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities" Proceedings of the 24th International Conference on Pattern Recognition, Beijing, China, pp. 1097 -1102 , August 20th-24th, 2018, 2018 | DOI: 10.1109/ICPR.2018.8545064 Conference
18 Cornia, Marcella; Abati, Davide; Baraldi, Lorenzo; Palazzi, Andrea; Calderara, Simone; Cucchiara, Rita "Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era" INTELLIGENZA ARTIFICIALE, vol. 12, pp. 161 -175 , 2018 | DOI: 10.3233/IA-170033 Conference
19 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model" IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 27, pp. 5142 -5154 , 2018 | DOI: 10.1109/TIP.2018.2851672 Journal
20 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "SAM: Pushing the Limits of Saliency Prediction Models" 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, pp. 1890 -1892 , June 18-22, 2018 | DOI: 10.1109/CVPRW.2018.00250 Conference
21 Baraldi, Lorenzo; Douze, Matthijs; Cucchiara, Rita; Jégou, Hervé "LAMV: Learning to align and match videos with kernelized temporal layers" 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 18-22, 2018 | DOI: 10.1109/CVPR.2018.00814 Conference
22 Bolelli, Federico; Cancilla, Michele; Baraldi, Lorenzo; Grana, Costantino "Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms" JOURNAL OF REAL-TIME IMAGE PROCESSING, pp. 1 -16 , 2018 | DOI: 10.1007/s11554-018-0756-1 Journal
23 Cornia, Marcella; Pini, Stefano; Baraldi, Lorenzo; Cucchiara, Rita "Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts" Digital Libraries and Multimedia Archives, vol. 806, Udine, pp. 169 -179 , January 25-26, 2018, 2018 | DOI: 10.1007/978-3-319-73165-0_17 Conference
24 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention" ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS, vol. 14, pp. 1 -21 , 2018 | DOI: 10.1145/3177745 Journal
25 Pini, Stefano; Ben Ahmed, Olfa; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita; Huet, Benoit "Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild" Proceedings of the 19th ACM International Conference on Multimodal Interaction, Glasgow, Scotland, pp. 536 -543 , November 13-17th, 2017, 2017 | DOI: 10.1145/3136755.3143006 Conference
26 Cornia, Marcella; Abati, Davide; Baraldi, Lorenzo; Palazzi, Andrea; Calderara, Simone; Cucchiara, Rita "Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era" AI*IA 2017 Advances in Artificial Intelligence, vol. 10640, Bari, Italy, pp. 387 -399 , November 14-17, 2017, 2017 | DOI: 10.1007/978-3-319-70169-1_29 Conference
27 Pini, Stefano; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach" Image Analysis and Processing - ICIAP 2017, vol. 10485, Catania, Italy, pp. 384 -395 , 11-15 September 2017, 2017 | DOI: 10.1007/978-3-319-68548-9_36 Conference
28 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Visual Saliency for Image Captioning in New Multimedia Services" Multimedia & Expo Workshops (ICMEW), 2017 IEEE International Conference on, Hong Kong, pp. 309 -314 , July 10-14, 2017, 2017 | DOI: 10.1109/ICMEW.2017.8026277 Conference
29 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Hierarchical Boundary-Aware Neural Encoder for Video Captioning" Computer Vision and Pattern Recognition (CVPR), 2017 IEEE Conference on, Honolulu, Hawaii, pp. 3185 -3194 , July, 22-25, 2017 | DOI: 10.1109/CVPR.2017.339 Conference
30 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use" Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, Florence, Italy, 19-21 June 2017, 2017 | DOI: 10.1145/3095713.3095735 Conference
31 Corbelli, Andrea; Baraldi, Lorenzo; Balducci, Fabrizio; Grana, Costantino; Cucchiara, Rita "Layout analysis and content classification in digitized books" Digital Libraries and Multimedia Archives, vol. 701, Florence, pp. 153 -165 , Feb. 4-5, 2017 | DOI: 10.1007/978-3-319-56300-8_14 Conference
32 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "A Video Library System Using Scene Detection and Automatic Tagging" Digital Libraries and Archives, vol. 733, Modena, January 26-27, 2017, 2017 | DOI: 10.1007/978-3-319-68130-6_5 Conference
33 Grana, Costantino; Bolelli, Federico; Baraldi, Lorenzo; Vezzani, Roberto "YACCLAB - Yet Another Connected Components Labeling Benchmark" 2016 23rd International Conference on Pattern Recognition (ICPR), Cancun, Mexico, pp. 3109 -3114 , Dec 4-8, 2016 | DOI: 10.1109/ICPR.2016.7900112 Conference
34 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks" IEEE TRANSACTIONS ON MULTIMEDIA, vol. 19, pp. 955 -968 , 2016 | DOI: 10.1109/TMM.2016.2644872 Journal
35 Corbelli, Andrea; Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Historical Document Digitization through Layout Analysis and Deep Content Classification" Proceedings of the 23rd International Conference on Pattern Recognition, Cancun, Mexico, 4-8 Dec 2016, 2016 | DOI: 10.1109/ICPR.2016.7900272 Conference
36 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "A Deep Multi-Level Network for Saliency Prediction" Pattern Recognition (ICPR), 2016 23rd International Conference on, Cancun, Mexico, 4-8 Dec 2016, 2016 | DOI: 10.1109/ICPR.2016.7900174 Conference
37 Grana, Costantino; Baraldi, Lorenzo; Bolelli, Federico "Optimized Connected Components Labeling with Pixel Prediction" Advanced Concepts for Intelligent Vision Systems, vol. 10016, Lecce, Italy, pp. 431 -440 , Oct 24-27, 2016 | DOI: 10.1007/978-3-319-48680-2_38 Conference
38 Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita "Multi-Level Net: a Visual Saliency Prediction Model" Computer Vision ECCV 2016 Workshops, vol. 9914, Amsterdam, The Netherlands, pp. 302 -315 , October 9th, 2016, 2016 | DOI: 10.1007/978-3-319-48881-3_21 Conference
39 Paci, Francesco; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita; Benini, Luca "Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager" Computer Vision ECCV 2016 Workshops, vol. 9913, Amsterdam, The Netherlands, pp. 589 -602 , October 8-10, 2016, 2016 | DOI: 10.1007/978-3-319-46604-0_42 Conference
40 Baraldi, Lorenzo; Grana, Costantino; Messina, Alberto; Cucchiara, Rita "A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation" Proceedings of the 2016 ACM on Multimedia Conference, Amsterdam, The Netherlands, pp. 733 -734 , 15 - 19 October 2016, 2016 | DOI: 10.1145/2964284.2973825 Conference
41 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features" Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, New York, USA, pp. 23 -29 , 6-9 June 2016, 2016 | DOI: 10.1145/2911996.2912012 Conference
42 Baraldi, Lorenzo; Grana, Costantino; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita "Shot, scene and keyframe ordering for interactive video re-use" Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, vol. 4, Rome, pp. 626 -631 , Feb 27-29, 2016, 2016 | DOI: 10.5220/0005768706260631 Conference
43 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection" Digital Libraries on the Move, vol. 612, Bolzano, pp. 155 -164 , Jan. 29-30, 2016 | DOI: 10.1007/978-3-319-41938-1_16 Conference
44 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "A Deep Siamese Network for Scene Detection in Broadcast Videos" Proceedings of the 23rd ACM international conference on Multimedia, Brisbane, Australia, pp. 1199 -1202 , 26-30 October 2015, 2015 | DOI: 10.1145/2733373.2806316 Conference
45 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video" Computer Analysis of Images and Patterns. Part I, vol. 9256, Valletta, Malta, pp. 801 -811 , 2-4 September 2015, 2015 | DOI: 10.1007/978-3-319-23192-1_67 Conference
46 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Scene segmentation using temporal clustering for accessing and re-using broadcast video" Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2015-, Turin, Italy, pp. 1 -6 , 2015, 2015 | DOI: 10.1109/ICME.2015.7177476 Conference
47 Baraldi, Lorenzo; Paci, Francesco; Serra, Giuseppe; Cucchiara, Rita "Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences" IEEE SENSORS JOURNAL, vol. 15, pp. 2705 -2714 , 2015 | DOI: 10.1109/JSEN.2015.2411994 Journal
48 Baraldi, Lorenzo; Grana, Costantino; Cucchiara, Rita "Measuring scene detection performance" Pattern Recognition and Image Analysis, vol. 9117, Santiago de Compostela, Spain, pp. 395 -403 , 17-19 June 2015, 2015 | DOI: 10.1007/978-3-319-19390-8_45 Conference
49 Baraldi, Lorenzo; Paci, Francesco; Serra, Giuseppe; Benini, Luca; Cucchiara, Rita "Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation" Computer Vision and Pattern Recognition Workshops (CVPRW), 2014 IEEE Conference on, Columbus, Ohio, 23-28 June 2014, 2014 | DOI: 10.1109/CVPRW.2014.107 Conference
50 Serra, Giuseppe; Camurri, Marco; Baraldi, Lorenzo; Michela, Benedetti; Cucchiara, Rita "Hand Segmentation for Gesture Recognition in EGO-Vision" Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, Barcelona, Spain, pp. 31 -36 , 21 October 2013, 2013 | DOI: 10.1145/2505483.2505490 Conference