Spanish statistical parametric speech synthesis using a neural vocoder

Authors
Bonafonte, A.; Pascual, S.; Dorca, G.
Activity type
Conference paper presentation
Edition name
19th Annual Conference of the International Speech Communication
Edition year
2018
Presentation date
2018-09-01
Proceedings
Interspeech 2018: 2-6 September 2018, Hyderabad
First page
1998
Last page
2001
DOI
https://doi.org/10.21437/Interspeech.2018-2417
Funding project
Tecnologías de aprendizaje profundo aplicadas al procesado de voz y audio
Repository
http://hdl.handle.net/2117/123852
URL
https://www.isca-speech.org/archive/Interspeech_2018/pdfs/2417.pdf
Abstract
During the 2000s, unit-selection-based text-to-speech was the dominant commercial technology. Meanwhile, the TTS research community made a considerable effort to push statistical parametric speech synthesis towards similar quality and greater flexibility in the synthetically generated voice. In recent years, deep learning advances applied to speech synthesis have closed the gap, especially when neural vocoders replace traditional signal-processing-based vocoders. In this paper we propose to su...
Keywords
Commercial technology, Deep learning, Linguistics, Neural vocoder, Recurrent neural networks, Research communities, SPSS, SampleRNN, Signal processing, Spanish TTS, Speech communication, Speech synthesis, Statistical parametric speech synthesis, Subjective evaluations, Vocoders, Waveform generation
Research groups
IDEAI-UPC Intelligent Data Science and Artificial Intelligence
TALP - Centre de Tecnologies i Aplicacions del Llenguatge i la Parla
VEU - Grup de Tractament de la Parla

Participants