Toward speech text recognition for comic books

Christophe Rigaud; Srikanta Pal; Jean-Christophe Burie; Jean-Marc Ogier

doi:10.1145/3011549.3011557

Communication Dans Un Congrès Année : 2016

Toward speech text recognition for comic books

Vers la reconnaissance automatique du texte de bandes dessinées

(1) , (1) , (1) , (1)

Christophe Rigaud

Fonction : Auteur

Laboratoire Informatique, Image et Interaction - EA 2118

Srikanta Pal

Fonction : Auteur

Laboratoire Informatique, Image et Interaction - EA 2118

Jean-Christophe Burie

Fonction : Auteur
PersonId : 735515
IdHAL : jean-christophe-burie
ORCID : 0000-0001-7323-2855
IdRef : 119612151

Laboratoire Informatique, Image et Interaction - EA 2118

Jean-Marc Ogier

Fonction : Auteur
PersonId : 833747

Laboratoire Informatique, Image et Interaction - EA 2118

Résumé

Speech text in comic books is placed and written in a particular manner by the letterers which raises unusual challenges for text recognition. We first detail these challenges and present different approaches to solve them. We compare the performances of generic versus specifically trained OCR systems for typewritten and handwritten text lines from French comic books. This work is evaluated over a subset of public (eBDtheque) and private (Sequencity) datasets. We demonstrate that generic OCR systems perform best on typewritten-like and lowercase fonts while specifically trained OCR can be very powerful on skewed, uppercase and even cursive fonts.

Mots clés

CCS Concepts •Information systems → Content analysis and fea- ture selection Keywords Handwritten text recognition comics image analysis

Domaines

Traitement des images [eess.IV] Traitement du texte et du document

Fichier principal

2016_Rigaud_Toward_speech_text_recognition_for_comic_books.pdf (789.51 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christophe Rigaud : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01719530

Soumis le : mercredi 28 février 2018-11:42:22

Dernière modification le : jeudi 12 mai 2022-15:35:17

Archivage à long terme le : lundi 28 mai 2018-09:52:46

Dates et versions

hal-01719530 , version 1 (28-02-2018)

Identifiants

HAL Id : hal-01719530 , version 1
DOI : 10.1145/3011549.3011557

Citer

Christophe Rigaud, Srikanta Pal, Jean-Christophe Burie, Jean-Marc Ogier. Toward speech text recognition for comic books. Proceedings of the 1st International Workshop on coMics ANalysis, Processing and Understanding, Dec 2016, Cancun, Mexico. ⟨10.1145/3011549.3011557⟩. ⟨hal-01719530⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

L3I UNIV-ROCHELLE

64 Consultations

646 Téléchargements

Toward speech text recognition for comic books

Vers la reconnaissance automatique du texte de bandes dessinées

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager