Improving speech synthesis quality by reducing pitch peaks in the source recordings
We present a method for improving the perceived naturalness of corpus-based speech synthesizers. It consists in removing pronounced pitch peaks in the original recordings, which typically lead to noticeable discontinuities in the synthesized speech. We perceptually evaluated this method using two co...
| Main authors: | Violante, L., Rodríguez Zivic, P., Gravano, A., Appen ButlerHill; et al.; ETS; Google; Microsoft Research; Rakuten |
|---|---|
| Format: | CONF |
| Subjects: | |
| Online access: | http://hdl.handle.net/20.500.12110/paper_97819372_v_n_p502_Violante |
| Contributed by: | |
Similar items
- Improving speech synthesis quality by reducing pitch peaks in the source recordings
  by: Gravano, Agustín
  Published: (2013)
- Prosodic facilitation and interference while judging on the veracity of synthesized statements
  by: Gálvez, R.H., et al.
- Prosodic facilitation and interference while judging on the veracity of synthesized statements
  Published: (2017)
- Emilia: a speech corpus for Argentine Spanish text to speech synthesis
  by: Torres, H.M., et al.
- Emilia: a speech corpus for Argentine Spanish text to speech synthesis
  Published: (2019)