Toward speech text recognition for comic books

Abstract : Speech text in comic books is placed and written in a particular manner by the letterers which raises unusual challenges for text recognition. We first detail these challenges and present different approaches to solve them. We compare the performances of generic versus specifically trained OCR systems for typewritten and handwritten text lines from French comic books. This work is evaluated over a subset of public (eBDtheque) and private (Sequencity) datasets. We demonstrate that generic OCR systems perform best on typewritten-like and lowercase fonts while specifically trained OCR can be very powerful on skewed, uppercase and even cursive fonts.
Document type :
Conference papers
Proceedings of the 1st International Workshop on coMics ANalysis, Processing and Understanding, Dec 2016, Cancun, Mexico. ACM Press, 〈10.1145/3011549.3011557〉
Liste complète des métadonnées

Cited literature [12 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01719530
Contributor : Christophe Rigaud <>
Submitted on : Wednesday, February 28, 2018 - 11:42:22 AM
Last modification on : Wednesday, March 7, 2018 - 1:02:02 AM
Document(s) archivé(s) le : Monday, May 28, 2018 - 9:52:46 AM

File

2016_Rigaud_Toward_speech_text...
Files produced by the author(s)

Identifiers

Collections

Citation

Christophe Rigaud, Srikanta Pal, Jean-Christophe Burie, Jean-Marc Ogier. Toward speech text recognition for comic books. Proceedings of the 1st International Workshop on coMics ANalysis, Processing and Understanding, Dec 2016, Cancun, Mexico. ACM Press, 〈10.1145/3011549.3011557〉. 〈hal-01719530〉

Share

Metrics

Record views

11

Files downloads

24