First-Order Ambisonic Coding with PCA Matrixing and Quaternion-Based Interpolation - La Rochelle Université
Conference paper. Year: 2019

Abstract

We present a spatial audio coding method that extends existing speech/audio codecs, such as EVS or Opus, to represent first-order ambisonic (FOA) signals at low bit rates. The proposed method uses principal component analysis (PCA) to decorrelate the ambisonic components prior to multi-mono coding. The PCA rotation matrices are quantized in the generalized Euler angle domain and interpolated in the quaternion domain to avoid discontinuities between successive signal blocks. We also describe an adaptive bit allocation algorithm that optimizes the multi-mono coding of the principal components. A subjective evaluation using the MUSHRA methodology compares the proposed method with naive multi-mono coding using a fixed bit allocation. Results show significant quality improvements at bit rates from 52.8 kbit/s (4 × 13.2) to 97.6 kbit/s (4 × 24.4) using the EVS codec.
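
The PCA matrixing step mentioned in the abstract can be illustrated with a minimal sketch. It assumes NumPy, a per-block covariance estimate on zero-mean audio, and a 4-channel FOA block; the function names `pca_matrixing` and `pca_dematrixing` are hypothetical and are not taken from the paper. Quantization of the rotation matrix, quaternion-based interpolation, bit allocation, and the EVS coding itself are deliberately left out.

```python
import numpy as np

def pca_matrixing(block):
    """Decorrelate one FOA block of shape (4, n_samples).

    Returns a 4x4 rotation matrix and the principal components.
    Assumption: the audio is treated as zero-mean, so no mean removal is done.
    """
    cov = block @ block.T / block.shape[1]      # 4x4 inter-channel covariance
    eigvals, eigvecs = np.linalg.eigh(cov)      # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]           # strongest component first
    rotation = eigvecs[:, order]
    if np.linalg.det(rotation) < 0:             # force a proper rotation (det = +1)
        rotation[:, -1] *= -1
    components = rotation.T @ block             # decorrelated channels
    return rotation, components

def pca_dematrixing(rotation, components):
    """Invert the matrixing; the rotation is orthogonal, so its inverse is its transpose."""
    return rotation @ components

# Illustrative use on synthetic, correlated 4-channel data (not real ambisonic audio).
rng = np.random.default_rng(0)
block = rng.standard_normal((4, 4)) @ rng.standard_normal((4, 960))
rotation, components = pca_matrixing(block)
restored = pca_dematrixing(rotation, components)
```

In a codec built along these lines, each row of `components` would be passed to a separate mono encoder instance, and the decoder would apply the inverse rotation after decoding. Forcing the determinant to +1 keeps the matrix a pure rotation, which is what makes an angle or quaternion parameterization applicable.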
Main file
dafx_MAHE_Pierre.pdf (373.06 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02289558, version 1 (16-09-2019)

Identifiers

  • HAL Id: hal-02289558, version 1

Cite

Pierre Mahé, Stéphane Ragot, Sylvain Marchand. First-Order Ambisonic Coding with PCA Matrixing and Quaternion-Based Interpolation. DAFx19, Sep 2019, Birmingham, United Kingdom. pp. 284-291. ⟨hal-02289558⟩

Collections

L3I UNIV-ROCHELLE
205 views
179 downloads
