Layout analysis and content enrichment of digitized books
Abstract: In this paper we describe a system for automatically analyzing old documents and creating hyper linking between different epochs, thus opening ancient documents to young people and to make them available on the web with old and current content. We propose a supervised learning approach to segment text and illustration of digitized old documents using a texture feature based on local correlation aimed at detecting the repeating patterns of text regions and differentiate them from pictorial elements. Moreover we present a solution to help the user in finding contemporary content connected to what is automatically extracted from the ancient documents.
Citation:Grana, Costantino; Serra, Giuseppe; Manfredi, Marco; Coppi, Dalia; Cucchiara, Rita "Layout analysis and content enrichment of digitized books" MULTIMEDIA TOOLS AND APPLICATIONS, vol. 75, pp. 3879 -3900 , 2016 DOI: 10.1007/s11042-014-2360-0
- Author version:
- DOI: 10.1007/s11042-014-2360-0