Skip to Main content Skip to Navigation
Conference papers

Semantic Text Detection in Born-Digital Images via Fully Convolutional Networks

Abstract : Traditional layout analysis methods cannot be easily adapted to born-digital images which carry properties from both regular document images and natural scene images. One layout approach for analyzing born-digital images is to separate the text layer from the graphics layer before further analyzing any of them. In this paper, we propose a method for detecting text regions in such images by casting the detection problem as a semantic object segmentation problem. The text classification is done in a holistic approach using fully convolutional networks where the full image is fed as input to the network and the output is a pixel heat map of the same input image size. This solves the problem of low resolution images, and the variability of text scale within one image. It also eliminates the need for finding interest points, candidate text locations or low level components. The experimental evaluation of our method on the ICDAR 2013 dataset shows that our method outperforms state-of-the-art methods. The detected text regions also allow flexibility to later apply methods for finding text components at character, word or textline levels in different orientations.
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03030193
Contributor : Antoine Doucet Connect in order to contact the contributor
Submitted on : Sunday, May 8, 2022 - 3:36:53 PM
Last modification on : Thursday, May 12, 2022 - 3:40:05 PM
Long-term archiving on: : Tuesday, August 9, 2022 - 6:06:18 PM

File

Nayef2017.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution - NonCommercial 4.0 International License

Identifiers

Collections

Citation

Nibal Nayef, Jean-Marc Ogier. Semantic Text Detection in Born-Digital Images via Fully Convolutional Networks. 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017, Nov 2017, Kyoto, Japan. pp.859-864, ⟨10.1109/ICDAR.2017.145⟩. ⟨hal-03030193⟩

Share

Metrics

Record views

31

Files downloads

10