Conference paper, Year: 2017

Semantic Text Detection in Born-Digital Images via Fully Convolutional Networks

Abstract

Traditional layout analysis methods cannot be easily adapted to born-digital images, which carry properties of both regular document images and natural scene images. One approach to analyzing the layout of born-digital images is to separate the text layer from the graphics layer before analyzing either of them further. In this paper, we propose a method for detecting text regions in such images by casting the detection problem as a semantic object segmentation problem. Text classification is performed holistically using fully convolutional networks: the full image is fed as input to the network, and the output is a pixel-level heat map of the same size as the input image. This addresses the problems of low-resolution images and of variable text scale within a single image. It also eliminates the need to find interest points, candidate text locations, or low-level components. Experimental evaluation on the ICDAR 2013 dataset shows that our method outperforms state-of-the-art methods. The detected text regions also leave flexibility to later apply methods for finding text components at the character, word, or text-line level in different orientations.
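
The central idea in the abstract, feeding in a full image and getting back a per-pixel heat map of the same size, can be illustrated with a toy example. The sketch below is not the authors' network: the two-layer structure, the function names, and the hand-picked weights (`EDGE_KERNEL`, `SCORE_KERNEL`, the bias of -2.0) are all assumptions made for illustration, whereas a real model would learn its weights from data and use many more layers. It only shows why a network built purely from convolutions accepts any input size and returns a score map with matching dimensions.

```python
import math

def conv2d_same(image, kernel, bias=0.0):
    """Zero-padded 2D cross-correlation; output height/width equal the input's."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = bias
            for i in range(kh):
                for j in range(kw):
                    yy, xx = y + i - ph, x + j - pw
                    # Pixels outside the image count as zero (zero padding).
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += image[yy][xx] * kernel[i][j]
            out[y][x] = acc
    return out

def relu(fmap):
    return [[max(0.0, v) for v in row] for row in fmap]

def sigmoid(fmap):
    return [[1.0 / (1.0 + math.exp(-v)) for v in row] for row in fmap]

# Hypothetical, hand-picked weights; a trained network would learn these.
EDGE_KERNEL = [[-1, -1, -1],
               [-1,  8, -1],
               [-1, -1, -1]]   # 3x3 filter responding to local contrast
SCORE_KERNEL = [[1.0]]          # 1x1 convolution mapping features to a score

def text_heat_map(image):
    """Toy fully convolutional pipeline: conv -> ReLU -> 1x1 conv -> sigmoid."""
    features = relu(conv2d_same(image, EDGE_KERNEL))
    scores = conv2d_same(features, SCORE_KERNEL, bias=-2.0)
    return sigmoid(scores)      # one value in (0, 1) per input pixel

# A 5x7 grayscale image with a single high-contrast pixel in the middle.
image = [[0.0] * 7 for _ in range(5)]
image[2][3] = 1.0
heat = text_heat_map(image)
```

Because no layer depends on a fixed input size, the same function can be applied to an image of any resolution, which is the property the abstract relies on for low-resolution images and varying text scale.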
Main file
Nayef2017.pdf (248.18 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03030193 , version 1 (08-05-2022)

License

Attribution - NonCommercial

Cite

Nibal Nayef, Jean-Marc Ogier. Semantic Text Detection in Born-Digital Images via Fully Convolutional Networks. 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017, Nov 2017, Kyoto, Japan. pp.859-864, ⟨10.1109/ICDAR.2017.145⟩. ⟨hal-03030193⟩

Collections

L3I UNIV-ROCHELLE