Improving speech synthesis quality by reducing pitch peaks in the source recordings

We present a method for improving the perceived naturalness of corpus-based speech synthesizers. It consists in removing pronounced pitch peaks in the original recordings, which typically lead to noticeable discontinuities in the synthesized speech. We perceptually evaluated this method using two co...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: Gravano, Agustín
Publicado: 2013
Materias:
Acceso en línea:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_97819372_v_n_p502_Violante
http://hdl.handle.net/20.500.12110/paper_97819372_v_n_p502_Violante
Aporte de:

Ejemplares similares