Loading...
Loading...

Go to the content (press return)

Expressive speech synthesis using sentiment embeddings

Author
Jauk, I.; Lorenzo Trueba, J.; Yamagishi, J.; Bonafonte, A.
Type of activity
Presentation of work at congresses
Name of edition
19th Annual Conference of the International Speech Communication
Date of publication
2018
Presentation's date
2018-09-01
Book of congress proceedings
Interspeech 2018: 2-6 September 2018, Hyderabad
First page
3062
Last page
3066
Publisher
International Speech Communication Association (ISCA)
DOI
https://doi.org/10.21437/Interspeech.2018-2467 Open in new window
Repository
http://hdl.handle.net/2117/123860 Open in new window
URL
https://www.isca-speech.org/archive/Interspeech_2018/pdfs/2467.pdf Open in new window
Abstract
In this paper we present a DNN based speech synthesis system trained on an audiobook including sentiment features predicted by the Stanford sentiment parser. The baseline system uses DNN to predict acoustic parameters based on conventional linguistic features, as they have been used in statistical parametric speech synthesis. The predicted parameters are transformed into speech using a conventional high-quality vocoder. In this paper, the conventional linguistic features are enriched using senti...
Citation
Jauk, I., Lorenzo Trueba, J., Yamagishi, J., Bonafonte, A. Expressive speech synthesis using sentiment embeddings. A: Annual Conference of the International Speech Communication Association. "Interspeech 2018: 2-6 September 2018, Hyderabad". Baixas: International Speech Communication Association (ISCA), 2018, p. 3062-3066.
Keywords
Acoustic parameters, Baseline systems, DNN, Expressive speech synthesis, Linguistic features, Preliminary analysis, Sentiment analysis, Sentiment features, Speech communication, Speech synthesis, Speech synthesis system, Statistical parametric speech synthesis, TTS Linguistics
Group of research
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants

  • Jauk, Igor  (author and speaker )
  • Lorenzo Trueba, J.  (author and speaker )
  • Yamagishi, J.  (author and speaker )
  • Bonafonte Cavez, Antonio Jesus  (author and speaker )

Attachments