Unimore logo AImageLab

Research fields at AImageLab

Multimedia and Big Visual Data

Transformer-based Image Captioning Controllable Captioning Video Captioning with Naming Video matching and retrieval Scene detection in Broadcast Videos Deep Learning in videos Video Captioning Class Specific Segmentation Garment Selection and Color Classification Egocentric Video Summarization Face Recognition in News Streams	From Show to Tell: A Survey on Image Captioning

Medical Imaging

IAN detection from maxillofacial 3D images Skin Lesion Analysis Deep Renal Biopsy Immunofluorescence Image Analysis 3D Reconstruction of Skin Lesions for Tumor Diagnosis in OCT imaging

Videosurveillance and HBU

3D Human Pose Estimation from Depth Maps Soccer Event Spotting GAN4Surveillance: Generative Adversarial Networks for Attribute Classification Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World Duke Imagelab Multi-Target, Multi-Camera Tracking Project Spotting Prejudice Action and Gesture Recognition for Human Computer Interaction Group Detection and Crowd Analysis Multiple People Tracking People Re-identification ViSOR - Video Surveillance Online Repository People trajectory analysis and anomaly detection People Tracking From Multiple Cameras Video Action Detection Trajectory Prediction

Cultural Heritage and Digital Humanties

Handwritten Text Recognition on Historical Documments Visual-Semantic Domain Adaptation in Digital Humanities Art2Real: Translating Artworks to Photo-Realistic Images Layout and content analysis in Digitized Books Egocentric Video Registration and Architectural Details Retrieval EgoVision and Human Augmentation for Cultural Heritage

Embodied AI

Self-Supervised Navigation and Recounting Embodied Vision-and-Language Navigation Acquisition of 3D Environments for Robotic Navigation

Computer Vision and Pattern Recognition

Visualization Techniques for Explainable AI Connected Components Labeling Visual Saliency Prediction Animal welfare analysis from 3D sensors 3D Computer Vision Novelty Detection


Hand Monitoring and Gesture Recognition for Human-Car Interaction Video synthesis from Intensity and Event Frames Learning to Generate Faces from RGB and Depth data Mercury: a framework for Driver Monitoring and Human Car Interaction Face Verification with Depth Images Dr(eye)ve a Dataset for Attention-Based Tasks with Applications to Autonomous Driving Driver Attention through Head Localization and Pose Estimation Landmark Localization in Depth Images Learning to Map Vehicles into Bird

New Visions: Sensors, Mobile, and Embedding

Egocentric Vision for Detecting Social Relationships Sensing floors Real Time Ellipse Detection on Mobile Devices Collaborative robot programming

Continual Learning

General Continual Learning