Loading...
Loading...

Go to the content (press return)

Spanish statistical parametric speech synthesis using a neural vocoder

Author
Bonafonte, A.; Pascual, S.; Dorca, G.
Type of activity
Presentation of work at congresses
Name of edition
19th Annual Conference of the International Speech Communication
Date of publication
2018
Presentation's date
2018-09-01
Book of congress proceedings
Interspeech 2018: 2-6 September 2018, Hyderabad
First page
1998
Last page
2001
Publisher
International Speech Communication Association (ISCA)
DOI
https://doi.org/10.21437/Interspeech.2018-2417 Open in new window
Project funding
Tecnologías de aprendizaje profundo aplicadas al procesado de voz y audio
Repository
http://hdl.handle.net/2117/123852 Open in new window
URL
https://www.isca-speech.org/archive/Interspeech_2018/pdfs/2417.pdf Open in new window
Abstract
During the 2000s decade, unit-selection based text-to-speech was the dominant commercial technology. Meanwhile, the TTS research community has made a big effort to push statistical-parametric speech synthesis to get similar quality and more flexibility on the synthetically generated voice. During last years, deep learning advances applied to speech synthesis have filled the gap, specially when neural vocoders substitute traditional signal-processing based vocoders. In this paper we propose to su...
Citation
Bonafonte, A., Pascual, S., Dorca, G. Spanish statistical parametric speech synthesis using a neural vocoder. A: Annual Conference of the International Speech Communication Association. "Interspeech 2018: 2-6 September 2018, Hyderabad". Baixas: International Speech Communication Association (ISCA), 2018, p. 1998-2001.
Keywords
Commercial technology, Deep learning, Linguistics, Neural vocoder, Recurrent neural networks, Research communities, SPSS, SampleRNN, Signal processing, Spanish TTS, Speech communication, Speech synthesis, Statistical parametric speech synthesis, Subjective evaluations, Vocoders, Waveform generation
Group of research
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants

Attachments