Audiovisual event detection towards scene understanding

Author
Canton, C.; Butko, T.; Segura, C.; Giro, X.; Nadeu, C.; Hernando, J.; Casas, J.
Type of activity
Presentation of work at congresses
Name of edition
2009 IEEE Computer-Society Conference on Computer Vision and Pattern Recognition Workshops
Date of publication
2009
Date of presentation
2009-06
Book of congress proceedings
2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops: CVPR workshops 2009: Miami Beach, Florida, USA: 20-25 June 2009
First page
840
Last page
847
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
DOI
https://doi.org/10.1109/CVPRW.2009.5204264
Repository
http://hdl.handle.net/2117/23653
URL
http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05204264
Abstract
Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, sever...
Citation
Canton, C. [et al.]. Audiovisual event detection towards scene understanding. In: IEEE Conference on Computer Vision and Pattern Recognition. "2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops: CVPR workshops 2009: Miami Beach, Florida, USA: 20-25 June 2009". Institute of Electrical and Electronics Engineers (IEEE), 2009, p. 840-847.
Keywords
Audio signal processing, Face recognition, Motion estimation, Object detection, Sensor fusion, Transforms, Video signal processing
Research groups
GPI - Image and Video Processing Group
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications
VEU - Speech Processing Group

Participants