Research fields at AImageLab

Embodied AI

Embodied Vision-and-Language Navigation Self-Supervised Navigation and Recounting Acquisition of 3D Environments for Robotic Navigation

Videosurveillance and HBU

Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World GAN4Surveillance: Generative Adversarial Networks for Attribute Classification Duke Imagelab Multi-Target, Multi-Camera Tracking Project Spotting Prejudice Action and Gesture Recognition for Human Computer Interaction Group Detection and Crowd Analysis Multiple People Tracking People Re-identification ViSOR - Video Surveillance Online Repository People trajectory analysis and anomaly detection People Tracking From Multiple Cameras Video Action Detection Trajectory Prediction

Computer Vision and Pattern Recognition

Connected Components Labeling Visual Saliency Prediction Animal welfare analysis from 3D sensors 3D Computer Vision 3D Reconstruction of Skin Lesions for Tumor Diagnosis in OCT imaging Novelty Detection

Multimedia and Big Visual Data

Controllable Captioning Video Captioning with Naming Video matching and retrieval Scene detection in Broadcast Videos Deep Learning in videos Video Captioning Class Specific Segmentation Garment Selection and Color Classification Egocentric Video Summarization Face Recognition in News Streams	Transformer-based Image Captioning

Cultural Heritage and Digital Humanties

Art2Real: Translating Artworks to Photo-Realistic Images Visual-Semantic Domain Adaptation in Digital Humanities Layout and content analysis in Digitized Books Egocentric Video Registration and Architectural Details Retrieval EgoVision and Human Augmentation for Cultural Heritage


Video synthesis from Intensity and Event Frames Face Verification with Depth Images Hand Monitoring and Gesture Recognition for Human-Car Interaction Learning to Generate Faces from RGB and Depth data Mercury: a framework for Driver Monitoring and Human Car Interaction Landmark Localization in Depth Images Driver Attention through Head Localization and Pose Estimation Dr(eye)ve a Dataset for Attention-Based Tasks with Applications to Autonomous Driving Learning to Map Vehicles into Bird

New Visions: Sensors, Mobile, and Embedding

Egocentric Vision for Detecting Social Relationships Sensing floors Real Time Ellipse Detection on Mobile Devices Collaborative robot programming

Continual Learning

General Continual Learning