Loading...
Loading...

Go to the content (press return)

Restricted Boltzmann Machine vectors for speaker clustering

Author
Khan, U.; Safari, P.; Hernando, J.
Type of activity
Presentation of work at congresses
Name of edition
International Conference on Advances in Speech and Language Technologies for Iberian Languages
Date of publication
2018
Presentation's date
2018-11-21
Book of congress proceedings
IberSPEECH 2018: program and proceedings: 21-23 November 2018: Barcelona, Spain
First page
10
Last page
14
Publisher
International Speech Communication Association (ISCA)
DOI
https://doi.org/10.21437/IberSPEECH.2018-3 Open in new window
Project funding
Deep learning technologies for speech and audio processing
Repository
http://hdl.handle.net/2117/127730 Open in new window
URL
https://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_O1-3_Khan.html Open in new window
Abstract
Restricted Boltzmann Machines (RBMs) have been used both in the front-end and backend of speaker verification systems. In this work, we apply RBMs as a front-end in the context of speaker clustering. Speakers' utterances are transformed into a vector representation by means of RBMs. These vectors, referred to as RBM vectors, have shown to preserve speaker-specific information and are used for the task of speaker clustering. In this work, we perform the traditional bottom-up Agglomerative Hierarc...
Citation
Khan, U.; Safari, P.; Hernando, J. Restricted Boltzmann Machine vectors for speaker clustering. A: International Conference on Advances in Speech and Language Technologies for Iberian Languages. "IberSPEECH 2018: program and proceedings: 21-23 November 2018: Barcelona, Spain". Baixas: International Speech Communication Association (ISCA), 2018, p. 10-14.
Keywords
Agglomerative hierarchical clustering, Restricted Boltzmann Machine adaptation, Speaker clustering
Group of research
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants

Attachments