Loading...
Loading...

Go to the content (press return)

Restricted Boltzmann machine vectors for speaker clustering and tracking tasks in TV broadcast shows

Author
Khan, U.; Safari, P.; Hernando, J.
Type of activity
Journal article
Journal
Applied sciences
Date of publication
2019-07-09
Volume
9
Number
13
First page
1
Last page
17
DOI
10.3390/app9132761
Repository
http://hdl.handle.net/2117/179837 Open in new window
URL
https://www.mdpi.com/2076-3417/9/13/2761 Open in new window
Abstract
(This article belongs to the Special Issue IberSPEECH 2018: Speech and Language Technologies for Iberian Languages) Restricted Boltzmann Machines (RBMs) have shown success in both the front-end and backend of speaker verification systems. In this paper, we propose applying RBMs to the front-end for the tasks of speaker clustering and speaker tracking in TV broadcast shows. RBMs are trained to transform utterances into a vector based representation. Because of the lack of data for a test speaker,...
Citation
Khan, U.; Safari, P.; Hernando, J. Restricted Boltzmann machine vectors for speaker clustering and tracking tasks in TV broadcast shows. "Applied sciences", 9 Juliol 2019, vol. 9, núm. 13, p. 1-17.
Keywords
Agglomerative hierarchical clustering, Restricted Boltzmann machine adaptation, Speaker clustering, Speaker segmentation, Speaker tracking
Group of research
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants

Attachments