Non-linear speech representation based on local predictability exponents

6 pages, 3 figures

Bibliographic Details
Main Authors: Khanagha, V., Daoudi, K., Pont, Oriol, Yahia, Hussein, Turiel, Antonio
Format: Article (library)
Published: Elsevier
Subjects: Multiscale signal processing, Nonlinear speech processing, Complex signals and system
Online Access: http://hdl.handle.net/10261/97550
id dig-icm-es-10261-97550
record_format koha
spelling dig-icm-es-10261-97550 2020-12-09T17:46:24Z
Non-linear speech representation based on local predictability exponents
Khanagha, V.; Daoudi, K.; Pont, Oriol; Yahia, Hussein; Turiel, Antonio
Multiscale signal processing; Nonlinear speech processing; Complex signals and system
6 pages, 3 figures
Seeking new perspectives for analyzing the non-linear dynamics of speech, this paper presents a novel approach based on a microcanonical multiscale formulation, which allows a geometric and statistical description of the multiscale properties of complex dynamics. Speech is a complex system whose dynamics can be accessed, to some extent, geometrically and statistically by computing Local Predictability Exponents (LPEs). The LPEs make it possible to determine the most informative subset of the signal (the Most Singular Manifold, or MSM), which in turn yields a compact representation and reconstruction. However, the complex intertwining of different dynamics in speech (in addition to purely turbulent descriptions) suggests defining appropriate multiscale functionals that might influence the evaluation of LPEs and hence lead to a more compact MSM. Using the classical and generic Sauer/Allebach algorithm for signal reconstruction from irregularly spaced samples, we show that good-quality speech reconstruction can be achieved using an MSM of low cardinality. Moreover, to further show the potential of the new methodology, we develop a simple and efficient waveform coder that achieves almost the same level of perceptual quality as a standard coder at a lower bit-rate.
© 2013 Elsevier B.V.
This work was funded by the INRIA CORDIS doctoral program.
Peer Reviewed
2014-05; 2014-06-02T10:59:25Z
article; http://purl.org/coar/resource_type/c_6501
doi: 10.1016/j.neucom.2012.12.061
issn: 0925-2312
Neurocomputing 132: 136-141 (2014)
http://hdl.handle.net/10261/97550
https://doi.org/10.1016/j.neucom.2012.12.061
open
Elsevier
institution ICM ES
collection DSpace
country Spain
countrycode ES
component Bibliographic
access Online
databasecode dig-icm-es
tag biblioteca
region Southern Europe
libraryname Biblioteca del ICM España
topic Multiscale signal processing
Nonlinear speech processing
Complex signals and system
spellingShingle Multiscale signal processing
Nonlinear speech processing
Complex signals and system
Khanagha, V.
Daoudi, K.
Pont, Oriol
Yahia, Hussein
Turiel, Antonio
Non-linear speech representation based on local predictability exponents
description 6 pages, 3 figures
format article
topic_facet Multiscale signal processing
Nonlinear speech processing
Complex signals and system
author Khanagha, V.
Daoudi, K.
Pont, Oriol
Yahia, Hussein
Turiel, Antonio
author_facet Khanagha, V.
Daoudi, K.
Pont, Oriol
Yahia, Hussein
Turiel, Antonio
author_sort Khanagha, V.
title Non-linear speech representation based on local predictability exponents
title_short Non-linear speech representation based on local predictability exponents
title_full Non-linear speech representation based on local predictability exponents
title_fullStr Non-linear speech representation based on local predictability exponents
title_full_unstemmed Non-linear speech representation based on local predictability exponents
title_sort non-linear speech representation based on local predictability exponents
publisher Elsevier
url http://hdl.handle.net/10261/97550
work_keys_str_mv AT khanaghav nonlinearspeechrepresentationbasedonlocalpredictabilityexponents
AT daoudik nonlinearspeechrepresentationbasedonlocalpredictabilityexponents
AT pontoriol nonlinearspeechrepresentationbasedonlocalpredictabilityexponents
AT yahiahussein nonlinearspeechrepresentationbasedonlocalpredictabilityexponents
AT turielantonio nonlinearspeechrepresentationbasedonlocalpredictabilityexponents
_version_ 1777666154073948160