Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal
Shimmer is a classical acoustic measure of the amplitude perturbation of a signal. This kind of variation in the human voice allow to characterize some properties, not only of the voice itself, but of the person who speaks. During the last years deep learning techniques have become the state of the...
Guardado en:
Autores principales: | , |
---|---|
Formato: | Objeto de conferencia |
Lenguaje: | Inglés |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/63484 |
Aporte de: |
id |
I19-R120-10915-63484 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas shimmer voice quality deep learning deep neural network convolutional neural network |
spellingShingle |
Ciencias Informáticas shimmer voice quality deep learning deep neural network convolutional neural network García, Mario Alejandro Destéfanis, Eduardo A. Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
topic_facet |
Ciencias Informáticas shimmer voice quality deep learning deep neural network convolutional neural network |
description |
Shimmer is a classical acoustic measure of the amplitude perturbation of a signal. This kind of variation in the human voice allow to characterize some properties, not only of the voice itself, but of the person who speaks. During the last years deep learning techniques have become the state of the art for recognition tasks on the voice. In this work the relationship between shimmer and deep neural networks is analyzed.
A deep learning model is created. It is able to approximate shimmer value of a simple synthesized audio signal (stationary and without formants) taking the spectrogram as input feature. It is concluded firstly, that for this kind of synthesized signal, a neural network like the one we proposed can approximate shimmer, and secondly, that the convolution layers can be designed in order to preserve the information of shimmer and transmit it to the following layers. |
format |
Objeto de conferencia Objeto de conferencia |
author |
García, Mario Alejandro Destéfanis, Eduardo A. |
author_facet |
García, Mario Alejandro Destéfanis, Eduardo A. |
author_sort |
García, Mario Alejandro |
title |
Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
title_short |
Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
title_full |
Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
title_fullStr |
Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
title_full_unstemmed |
Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal |
title_sort |
deep neural networks for shimmer approximation in synthesized audio signal |
publishDate |
2017 |
url |
http://sedici.unlp.edu.ar/handle/10915/63484 |
work_keys_str_mv |
AT garciamarioalejandro deepneuralnetworksforshimmerapproximationinsynthesizedaudiosignal AT destefaniseduardoa deepneuralnetworksforshimmerapproximationinsynthesizedaudiosignal |
bdutipo_str |
Repositorios |
_version_ |
1764820480839122944 |