Conference paper. Year: 2018

Automatic Matching and Expansion of Abbreviated Phrases without Context

Abstract

In many documents, such as receipts and invoices, textual information is constrained by the space available and by the layout of the document. The text has no natural-language context, and expressions are often abbreviated, at both word level and phrase level, to fit the graphical layout. To analyze the semantic content of such documents, we need to understand each phrase, and in particular the name of each product sold. In this paper, we propose an approach for finding the correct expansion of abbreviations and acronyms without context. First, we extract information about the products sold from our receipt corpus and analyze the linguistic processes of abbreviation involved. Then, we retrieve a list of the expanded names of products sold by the company that issued the receipts, and we propose an algorithm that pairs each extracted product name with its corresponding expansion. We provide the research community with a unique document collection for abbreviation expansion.
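
The pairing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not detail. It assumes one simple heuristic: an abbreviated token keeps the first letter of a word and a subset of its remaining letters in order, and tokens follow the word order of the expanded name. The function names (`is_abbreviation`, `phrase_matches`, `best_expansion`) and the product names are hypothetical, and acronyms are not handled.

```python
from typing import Optional


def is_abbreviation(short: str, full: str) -> bool:
    """True if `short` keeps the first letter of `full` and its remaining
    letters appear in `full` in the same order (a subsequence test)."""
    short, full = short.lower(), full.lower()
    if not short or not full or short[0] != full[0]:
        return False
    remaining = iter(full[1:])
    # `ch in remaining` consumes the iterator, which enforces letter order.
    return all(ch in remaining for ch in short[1:])


def phrase_matches(abbreviated: str, expansion: str) -> bool:
    """True if every abbreviated token maps, in order, to a distinct word
    of the expansion. Words of the expansion may be skipped."""
    tokens = abbreviated.replace(".", " ").split()
    words = iter(expansion.split())
    # Each `any` resumes from the word after the previous match.
    return all(any(is_abbreviation(tok, w) for w in words) for tok in tokens)


def best_expansion(abbreviated: str, catalogue: list[str]) -> Optional[str]:
    """Return the shortest catalogue name compatible with the abbreviation,
    or None if no name matches (an assumed tie-breaking rule)."""
    candidates = [name for name in catalogue if phrase_matches(abbreviated, name)]
    return min(candidates, key=len, default=None)


# Hypothetical catalogue of expanded product names.
catalogue = ["dark chocolate", "milk chocolate", "orange juice"]
print(best_expansion("drk choc.", catalogue))
```

Choosing the shortest compatible name is only one possible tie-breaker. A real system would likely need a scoring function and explicit handling of acronyms and dropped or reordered words.
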
Main file
paper_74.pdf (235.34 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02316286, version 1 (15-10-2019)

Identifiers

  • HAL Id: hal-02316286, version 1

Cite

Chloé Artaud, Antoine Doucet, Vincent Poulain d'Andecy, Jean-Marc Ogier. Automatic Matching and Expansion of Abbreviated Phrases without Context. 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing2018), Mar 2018, Hanoi, Vietnam. ⟨hal-02316286⟩

Collections

L3I UNIV-ROCHELLE
67 views
387 downloads
