Non-linear speech representation based on local predictability exponents
6 pages, 3 figures
Main Authors: | Khanagha, V.; Daoudi, K.; Pont, Oriol; Yahia, Hussein; Turiel, Antonio |
---|---|
Format: | article, library |
Published: | Elsevier |
Subjects: | Multiscale signal processing, Nonlinear speech processing, Complex signals and system |
Online Access: | http://hdl.handle.net/10261/97550 |
id |
dig-icm-es-10261-97550 |
---|---|
record_format |
koha |
spelling |
dig-icm-es-10261-97550 2020-12-09T17:46:24Z Non-linear speech representation based on local predictability exponents Khanagha, V. Daoudi, K. Pont, Oriol Yahia, Hussein Turiel, Antonio Multiscale signal processing Nonlinear speech processing Complex signals and system 6 pages, 3 figures Looking for new perspectives to analyze non-linear dynamics of speech, this paper presents a novel approach based on a microcanonical multiscale formulation which allows the geometric and statistical description of multiscale properties of the complex dynamics. Speech is a complex system whose dynamics can be, to some extent, geometrically and statistically accessed by the computation of Local Predictability Exponents (LPEs) unlocking the determination of the most informative subset (Most Singular Manifold or MSM), leading to associated compact representation and reconstruction. But the complex intertwining of different dynamics in speech (added to purely turbulent descriptions) suggests the definition of appropriate multiscale functionals that might influence the evaluation of LPEs, hence leading to more compact MSM. Consequently, by using the classical and generic Sauer/Allebach algorithm for signal reconstruction from irregularly spaced samples, we show that speech reconstruction of good quality can be achieved using MSM of low cardinality. Moreover, in order to further show the potential of the new methodology, we develop a simple and efficient waveform coder which achieves almost the same level of perceptual quality as a standard coder, while having a lower bit-rate. © 2013 Elsevier B.V. This work was funded by the INRIA CORDIS doctoral program. Peer Reviewed 2014-05 2014-06-02T10:59:25Z article http://purl.org/coar/resource_type/c_6501 doi: 10.1016/j.neucom.2012.12.061 issn: 0925-2312 Neurocomputing 132: 136-141 (2014) http://hdl.handle.net/10261/97550 10.1016/j.neucom.2012.12.061 https://doi.org/10.1016/j.neucom.2012.12.061 open Elsevier |
institution |
ICM ES |
collection |
DSpace |
country |
Spain |
countrycode |
ES |
component |
Bibliographic |
access |
Online |
databasecode |
dig-icm-es |
tag |
library |
region |
Southern Europe |
libraryname |
Biblioteca del ICM España |
topic |
Multiscale signal processing Nonlinear speech processing Complex signals and system |
spellingShingle |
Multiscale signal processing Nonlinear speech processing Complex signals and system Khanagha, V. Daoudi, K. Pont, Oriol Yahia, Hussein Turiel, Antonio Non-linear speech representation based on local predictability exponents |
description |
6 pages, 3 figures |
format |
article |
topic_facet |
Multiscale signal processing Nonlinear speech processing Complex signals and system |
author |
Khanagha, V. Daoudi, K. Pont, Oriol Yahia, Hussein Turiel, Antonio |
author_facet |
Khanagha, V. Daoudi, K. Pont, Oriol Yahia, Hussein Turiel, Antonio |
author_sort |
Khanagha, V. |
title |
Non-linear speech representation based on local predictability exponents |
title_short |
Non-linear speech representation based on local predictability exponents |
title_full |
Non-linear speech representation based on local predictability exponents |
title_fullStr |
Non-linear speech representation based on local predictability exponents |
title_full_unstemmed |
Non-linear speech representation based on local predictability exponents |
title_sort |
non-linear speech representation based on local predictability exponents |
publisher |
Elsevier |
url |
http://hdl.handle.net/10261/97550 |
work_keys_str_mv |
AT khanaghav nonlinearspeechrepresentationbasedonlocalpredictabilityexponents AT daoudik nonlinearspeechrepresentationbasedonlocalpredictabilityexponents AT pontoriol nonlinearspeechrepresentationbasedonlocalpredictabilityexponents AT yahiahussein nonlinearspeechrepresentationbasedonlocalpredictabilityexponents AT turielantonio nonlinearspeechrepresentationbasedonlocalpredictabilityexponents |
_version_ |
1777666154073948160 |
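
The abstract in this record describes keeping a small, informative subset of speech samples (the MSM) and reconstructing the signal from those irregularly spaced samples. The sketch below only illustrates that general idea and is not the authors' method: the score is a crude stand-in (absolute second difference) rather than a Local Predictability Exponent, and plain piecewise-linear interpolation replaces the Sauer/Allebach algorithm. All function names and the test signal are hypothetical.

```python
import math


def irregularity(signal):
    """Crude per-sample irregularity proxy: |second difference|.
    Assumption: a stand-in for illustration, NOT an LPE estimator."""
    n = len(signal)
    scores = [0.0] * n
    for i in range(1, n - 1):
        scores[i] = abs(signal[i - 1] - 2.0 * signal[i] + signal[i + 1])
    return scores


def select_subset(signal, fraction):
    """Return sorted indices of the retained samples: both endpoints
    plus the highest-scoring interior samples."""
    n = len(signal)
    keep = max(2, int(round(fraction * n)))
    scores = irregularity(signal)
    interior = sorted(range(1, n - 1), key=lambda i: scores[i], reverse=True)
    chosen = {0, n - 1}
    chosen.update(interior[: keep - 2])
    return sorted(chosen)


def reconstruct(indices, values, n):
    """Piecewise-linear interpolation between retained samples.
    Assumption: used here in place of Sauer/Allebach reconstruction."""
    out = [0.0] * n
    knots = list(zip(indices, values))
    for (i0, v0), (i1, v1) in zip(knots, knots[1:]):
        for i in range(i0, i1 + 1):
            t = (i - i0) / (i1 - i0)
            out[i] = v0 + t * (v1 - v0)
    return out


def snr_db(reference, estimate):
    """Signal-to-noise ratio of the reconstruction, in decibels."""
    signal_power = sum(x * x for x in reference)
    noise_power = sum((x - y) ** 2 for x, y in zip(reference, estimate))
    if noise_power == 0.0:
        return float("inf")
    return 10.0 * math.log10(signal_power / noise_power)


def demo(n=400, fraction=0.25):
    """Synthetic two-tone signal (hypothetical test input, not speech)."""
    signal = [
        math.sin(2 * math.pi * 5 * i / n) + 0.5 * math.sin(2 * math.pi * 23 * i / n)
        for i in range(n)
    ]
    idx = select_subset(signal, fraction)
    rec = reconstruct(idx, [signal[i] for i in idx], n)
    return signal, idx, rec, snr_db(signal, rec)
```

Calling `demo()` returns the original signal, the retained indices, the reconstruction, and its SNR, so the trade-off between `fraction` and reconstruction quality can be explored. Results from this toy say nothing about the quality figures reported in the paper.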