Audio Signal Representations for Indexing in the Transform Domain
Emmanuel Ravelli, Gael Richard, Laurent Daudet
To cite this version: Emmanuel Ravelli, Gael Richard, Laurent Daudet. Audio signal representations for indexing in the transform domain. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2010. hal-02652798

HAL Id: hal-02652798
https://hal.archives-ouvertes.fr/hal-02652798
Submitted on 29 May 2020

Emmanuel Ravelli, Gaël Richard, Senior Member, IEEE, and Laurent Daudet, Member, IEEE

Abstract—Indexing audio signals directly in the transform domain can potentially save a significant amount of computation when working on a large database of signals stored in a lossy compression format, without having to fully decode the signals. Here, we show that the representations used in standard transform-based audio codecs (e.g. [...]

[...] called Advanced Audio Coding (AAC), was first introduced in the MPEG-2 standard [2] in 1997 and included in the MPEG-4 standard [3] in 1999. AAC is based on a pure MDCT (without PQF filterbank), an improved encoding algorithm, [...]