Loading...
Loading...

Go to the content (press return)

Restricted Boltzmann Machine vectors for speaker clustering

Author
Khan, U.; Safari, P.; Hernando, J.
Type of activity
Presentation of work at congresses
Name of edition
International Conference on Advances in Speech and Language Technologies for Iberian Languages 2018
Date of publication
2018
Presentation's date
2018-11-21
Book of congress proceedings
IberSPEECH 2018: program and proceedings: 21-23 November 2018: Barcelona, Spain
First page
10
Last page
14
Publisher
International Speech Communication Association (ISCA)
DOI
https://doi.org/10.21437/IberSPEECH.2018-3 Open in new window
Project funding
Tecnologías de aprendizaje profundo aplicadas al procesado de voz y audio
Repository
http://hdl.handle.net/2117/127730 Open in new window
URL
https://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_O1-3_Khan.html Open in new window
Abstract
Restricted Boltzmann Machines (RBMs) have been used both in the front-end and backend of speaker verification systems. In this work, we apply RBMs as a front-end in the context of speaker clustering. Speakers' utterances are transformed into a vector representation by means of RBMs. These vectors, referred to as RBM vectors, have shown to preserve speaker-specific information and are used for the task of speaker clustering. In this work, we perform the traditional bottom-up Agglomerative Hierarc...
Citation
Khan, U.; Safari, P.; Hernando, J. Restricted Boltzmann Machine vectors for speaker clustering. A: International Conference on Advances in Speech and Language Technologies for Iberian Languages. "IberSPEECH 2018: program and proceedings: 21-23 November 2018: Barcelona, Spain". Baixas: International Speech Communication Association (ISCA), 2018, p. 10-14.
Keywords
Agglomerative hierarchical clustering, Restricted Boltzmann Machine adaptation, Speaker clustering
Group of research
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants

Attachments