Carregant...
Carregant...

Vés al contingut (premeu Retorn)

Visualizing punctuation restoration in speech transcripts with prosograph

Autor
Oktem, A.; Farrús, M.; Bonafonte, A.
Tipus d'activitat
Presentació treball a congrés
Nom de l'edició
19th Annual Conference of the International Speech Communication
Any de l'edició
2018
Data de presentació
2018-09-01
Llibre d'actes
Interspeech 2018: 2-6 September 2018, Hyderabad
Pàgina inicial
1493
Pàgina final
1494
DOI
10.21437/Interspeech.2018-3028
Repositori
http://hdl.handle.net/2117/123861 Obrir en finestra nova
URL
https://www.isca-speech.org/archive/Interspeech_2018/pdfs/3028.pdf Obrir en finestra nova
Resum
We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision and recall, we further extend our performance tests by attaching it in a speech recognition pipeline. The visual and interactive testing environment that we prepared helps us observe how our models generalizes in unseen data and also plan our next steps for improvement.
Paraules clau
Automatic speech recognition, Neural architectures, Precision and recall, Prosody, Punctuation, Speech communication, Speech processing, Speech recognition, Speech transcriptions, Speech transcripts, Speech transmission, Testing environment, Transcription
Grup de recerca
IDEAI-UPC Intelligent Data Science and Artificial Intelligence
TALP - Centre de Tecnologies i Aplicacions del Llenguatge i la Parla
VEU - Grup de Tractament de la Parla

Participants