Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis
Abstract: State-of-the-art Computer Vision pipelines show poor performances on artworks and data coming from the artistic domain, thus limiting the applicability of current architectures to the automatic understanding of the cultural heritage. This is mainly due to the difference in texture and low-level feature distribution between artistic and real images, on which state-of-the-art approaches are usually trained. To enhance the applicability of pre-trained architectures on artistic data, we have recently proposed an unpaired domain translation approach which can translate artworks to photo-realistic visualizations. Our approach leverages semantically-aware memory banks of real patches, which are used to drive the generation of the translated image while improving its realism. In this paper, we provide additional analyses and experimental results which demonstrate the effectiveness of our approach. In particular, we evaluate the quality of generated results in the case of the translation of landscapes, portraits and of paintings coming from four different styles using automatic distance metrics. Also, we analyze the response of pre-trained architecture for classification, detection and segmentation both in terms of feature distribution and entropy of prediction, and show that our approach effectively reduces the domain shift of paintings. As an additional contribution, we also provide a qualitative analysis of the reduction of the domain shift for detection, segmentation and image captioning.
Citation:
Tomei, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita "Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis" Image Analysis and Processing – ICIAP 2019, Trento, Italy, pp. 741 -752 , 9-13 September, 2019, 2019 DOI: 10.1007/978-3-030-30645-8_67not available
Paper download:
- Author version:
- DOI: 10.1007/978-3-030-30645-8_67