Improving speech synthesis quality by reducing pitch peaks in the source recordings
We present a method for improving the perceived naturalness of corpus-based speech synthesizers. It consists in removing pronounced pitch peaks in the original recordings, which typically lead to noticeable discontinuities in the synthesized speech. We perceptually evaluated this method using two co...
Guardado en:
Autor principal: | Gravano, Agustín |
---|---|
Publicado: |
2013
|
Materias: | |
Acceso en línea: | https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_97819372_v_n_p502_Violante http://hdl.handle.net/20.500.12110/paper_97819372_v_n_p502_Violante |
Aporte de: |
Ejemplares similares
-
Improving speech synthesis quality by reducing pitch peaks in the source recordings
por: Violante, L., et al. -
Prosodic facilitation and interference while judging on the veracity of synthesized statements
por: Gálvez, R.H., et al. -
Prosodic facilitation and interference while judging on the veracity of synthesized statements
Publicado: (2017) -
Techniques for noise robustness in automatic speech recognition
por: Virtanen, Tuomas
Publicado: (2012) -
Emilia: a speech corpus for Argentine Spanish text to speech synthesis
por: Torres, H.M., et al.