Giro Nieto, Xavier
Total activity: 99
Areas of expertise
Human computer interfaces, Image processing, Multimedia retrieval
Doctoral courses
Teoria del Senyal i Comunicacions
University degree
Enginyeria Superior de Telecomunicacions
Research group
GPI - Image and Video Processing Group
Department
Department of Signal Theory and Communications
School
Terrassa School of Engineering (EET)
E-mail
xavier.giroupc.edu
Contact details
UPC directory Open in new window
Orcid
0000-0002-9935-5332 Open in new window
Scopus Author ID
35098596700 Open in new window
Collaborative networks
       
Links of interest
Home page Open in new window

Graphic summary
  • Show / hide key
  • Information


Scientific and technological production
  •  

1 to 50 of 99 results
  • From global image annotation to interactive object segmentation

     Giro Nieto, Xavier; Martos Asensio, Manel; Mohedano Robles, Eva; Pont Tuset, Jordi
    Multimedia tools and applications
    Vol. 70, num. 1, p. 475-493
    DOI: 10.1007/s11042-013-1374-3
    Date of publication: 2014-05-01
    Journal article

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    This paper presents a graphical environment for the annotation of still images that works both at the global and local scales. At the global scale, each image can be tagged with positive, negative and neutral labels referred to a semantic class from an ontology. These annotations can be used to train and evaluate an image classifier. A finer annotation at a local scale is also available for interactive segmentation of objects. This process is formulated as a selection of regions from a precomputed hierarchical partition called Binary Partition Tree. Three different semi-supervised methods have been presented and evaluated: bounding boxes, scribbles and hierarchical navigation. The implemented Java source code is published under a free software license.

    This paper presents a graphical environment for the annotation of still images that works both at the global and local scales. At the global scale, each image can be tagged with positive, negative and neutral labels referred to a semantic class from an ontology. These annotations can be used to train and evaluate an image classifier. A finer annotation at a local scale is also available for interactive segmentation of objects. This process is formulated as a selection of regions from a precomputed hierarchical partition called Binary Partition Tree. Three different semi-supervised methods have been presented and evaluated: bounding boxes, scribbles and hierarchical navigation. The implemented Java source code is published under a free software license

  • Procesado de información heterogénea y señales en grafos para Big Data:aplicación en cribado de alto rendimiento,teledetección,multimedia y HCI

     Gasull Llampallas, Antoni; Ruiz Hidalgo, Javier; Giro Nieto, Xavier; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Oliveras Verges, Albert; Salembier Clairon, Philippe Jean; Sayrol Clols, Elisa; Vilaplana Besler, Veronica; Casas Pla, Josep Ramon; Pardas Feliu, Montserrat
    Competitive project

     Share

  • Photo clustering of social events by extending photoTOC to a rich context

     Manchon Vizuete, Daniel; Gris-Sarabia, Irene; Giro Nieto, Xavier
    ACM International Conference on Multimedia Retrieval
    Presentation's date: 2014-04-01
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    The popularisation of the storage of photos on the cloud has opened new opportunities and challenges for the organisation and extension of photo collections. This paper presents a light computational solution for the clustering of web photos based on social events. The proposal combines a first over-segmentation of the photo collections of each user based on temporal cues, as previously proposed in PhotoTOC. On a second stage, the resulting mini-clusters are merged based on contextual metadata such as geolocation, keywords and user IDs. Results indicate that, although temporal cues are very relevant for event clustering, robust solutions should also consider all these additional features

    The popularisation of the storage of photos on the cloud has opened new opportunities and challenges for the organisation and extension of photo collections. This paper presents a light computational solution for the clustering of web photos based on social events. The proposal combines a first over-segmentation of the photo collections of each user based on temporal cues, as previously proposed in PhotoTOC. On a second stage, the resulting mini-clusters are merged based on contextual metadata such as geolocation, keywords and user IDs. Results indicate that, although temporal cues are very relevant for event clustering, robust solutions should also consider all these additional features

  • Improving retrieval accuracy of Hierarchical Cellular Trees for generic metric spaces

     Ventura Royo, Carles; Vilaplana Besler, Veronica; Giro Nieto, Xavier; Marques Acosta, Fernando
    Multimedia tools and applications
    DOI: 10.1007/s11042-013-1686-3
    Date of publication: 2013-09
    Journal article

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    Metric Access Methods (MAMs) are indexing techniques which allow working in generic metric spaces. Therefore, MAMs are specially useful for Content- Based Image Retrieval systems based on features which use non Lp norms as similarity measures. MAMs naturally allow the design of image browsers due to their inherent hierarchical structure. The Hierarchical Cellular Tree (HCT), a MAM-based indexing technique, provides the starting point of our work. In this paper, we describe some limitations detected in the original formulation of the HCT and propose some modifications to both the index building and the search algorithm. First, the covering radius, which is defined as the distance from the representative to the furthest element in a node, may not cover all the elements belonging to the node¿s subtree. Therefore, we propose to redefine the covering radius as the distance from the representative to the furthest element in the node¿s subtree. This new definition is essential to guarantee a correct construction of the HCT. Second, the proposed Progressive Query retrieval scheme can be redesigned to perform the nearest neighbor operation in a more efficient way. We propose a new retrieval scheme which takes advantage of the benefits of the search algorithm used in the index building. Furthermore, while the evaluation of the HCT in the original work was only subjective, we propose an objective evaluation based on two aspects which are crucial in any approximate search algorithm: the retrieval time and the retrieval accuracy. Finally, we illustrate the usefulness of the proposal by presenting some actual applications.

    Abstract Metric Access Methods (MAMs) are indexing techniques which al- low working in generic metric spaces. Therefore, MAMs are specially useful for Content-Based Image Retrieval systems based on features which use non Lp norms as similarity measures. MAMs naturally allow the design of image browsers due to their inherent hierarchical structure. The Hierarchical Cellular Tree (HCT), a MAM-based indexing technique, provides the starting point of our work. In this paper, we describe some limitations detected in the original formulation of the HCT and propose some modi cations to both the index building and the search algorithm. First, the covering radius, which is de ned as the distance from the representative to the furthest element in a node, may not cover all the elements belonging to the node's subtree. Therefore, we pro- pose to rede ne the covering radius as the distance from the representative to the furthest element in the node's subtree. This new de nition is essen- tial to guarantee a correct construction of the HCT. Second, the proposed Progressive Query retrieval scheme can be redesigned to perform the nearest neighbor operation in a more e cient way. We propose a new retrieval scheme which takes advantage of the bene ts of the search algorithm used in the index building. Furthermore, while the evaluation of the HCT in the original work was only subjective, we propose an objective evaluation based on two aspects which are crucial in any approximate search algorithm: the retrieval time and the retrieval accuracy. Finally, we illustrate the usefulness of the proposal by presenting some actual applications.

  • Access to the full text
    UPC at MediaEval 2013 Hyperlinking Task  Open access

     Ventura Royo, Carles; Tella-Amo, Marcel; Giro Nieto, Xavier
    Multimedia Benchmark Workshop
    Presentation's date: 2013-10-18
    Presentation of work at congresses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    These working notes paper present the contribution of the UPC team to the Hyperlinking sub-task of the Search and Hyperlinking Task in MediaEval 2013. Our contribution explores the potential of a solution based only on visual cues. In particular, every automatically generated shot is represented by a keyframe. The linking between video segments is based on the visual similarity of the keyframes they contain. Visual similarity is assessed with the intersection of bag of features histograms generated with the SURF descriptor.

    These working notes paper present the contribution of the UPC team to the Hyperlinking sub-task of the Search and Hyperlinking Task in MediaEval 2013. Our contribution ex- plores the potential of a solution based only on visual cues. In particular, every automatically generated shot is repre- sented by a keyframe. The linking between video segments is based on the visual similarity of the keyframes they contain. Visual similarity is assessed with the intersection of bag of features histograms generated with the SURF descriptor.

  • Access to the full text
    Crowdsourced object segmentation with a game  Open access

     Salvador Aguilera, Amaia; Carlier, Axel; Giro Nieto, Xavier; Marques, Ogé; Charvillat, Vincent
    ACM international workshop on Crowdsourcing for multimedia
    p. 15-20
    DOI: 10.1145/2506364.2506367
    Presentation's date: 2013-10-22
    Presentation of work at congresses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    We introduce a new algorithm for image segmentation based on crowdsourcing through a game : Ask'nSeek. The game provides information on the objects of an image, under the form of clicks that are either on the object, or on the back-ground. These logs are then used in order to determine the best segmentation for an object among a set of candidates generated by the state-of-the-art CPMC algorithm. We also introduce a simulator that allows the generation of game logs and therefore gives insight about the number of games needed on an image to perform acceptable segmentation.

    We introduce a new algorithm for image segmentation based on crowdsourcing through a game : Ask'nSeek. The game provides information on the objects of an image, under the form of clicks that are either on the object, or on the background. These logs are then used in order to determine the best segmentation for an object among a set of candidates generated by the state-of-the-art CPMC algorithm. We also introduce a simulator that allows the generation of game logs and therefore gives insight about the number of games needed on an image to perform acceptable segmentation.

  • Automatic keyframe selection based on mutual reinforcement algorithm

     Ventura Royo, Carles; Giro Nieto, Xavier; Vilaplana Besler, Veronica; farre Giribet, Daniel; Carasusan, Eusebio
    International Workshop on Content-Based Multimedia Indexing
    p. 29-34
    DOI: 10.1109/CBMI.2013.6576548
    Presentation's date: 2013-06
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected thanks to its highest and most frequent similarity to the rest of considered frames. Two variations of the algorithm are explored: a first one where only frames within the same video are used (intra-clip mode) and a second one where the decision also depends on the previously selected keyframes of related videos (inter-clip mode). These two algorithms were evaluated by a set of professional documentalists from a broadcaster's archive, and results concluded that the proposed techniques outperform the semi-manual solution adopted so far in the company.

    This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected thanks to its highest and most frequent similarity to the rest of considered frames. Two variations of the algorithm are explored: a first one where only frames within the same video are used (intraclip mode) and a second one where the decision also depends on the previously selected keyframes of related videos (interclip mode). These two algorithms were evaluated by a set of professional documentalists from a broadcaster’s archive, and results concluded that the proposed techniques outperform the semi-manual solution adopted so far in the company.

  • Access to the full text
    UPC at MediaEval 2013 social event detection task  Open access

     Manchon Vizuete, Daniel; Giro Nieto, Xavier
    Multimedia Benchmark Workshop
    Presentation's date: 2013-10-19
    Presentation of work at congresses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    These working notes present the contribution of the UPC team to the Social Event Detection (SED) task in MediaEval 2013. The proposal extends the previous PhotoTOC work in the domain of shared collections of photographs stored in cloud services. An initial over-segmentation of the photo collection is later refined by merging pairs of similar clusters.

    These working notes present the contribution of the UPC team to the Social Event Detection (SED) task in MediaEval 2013. The proposal extends the previous PhotoTOC work in the domain of shared collections of photographs stored in cloud services. An initial over-segmentation of the photo collection is later re ned by merging pairs of similar clus- ters.

  • Rich internet application for semi-automatic annotation of semantic shots on keyframes

     Carcel, Elisabet; Martos, Manuel; Giro Nieto, Xavier; Marques Acosta, Fernando
    DOI: 10.1007/978-3-642-32436-9
    Date of publication: 2012-09
    Book chapter

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Part-based Object Retrieval with Binary Partition Trees  Open access

     Giro Nieto, Xavier
    Department of Signal Theory and Communications, Universitat Politècnica de Catalunya
    Theses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    This thesis addresses the problem of visual object retrieval, where a user formulates a query to an image database by providing one or multiple examples of an object of interest. The presented techniques aim both at finding those images in the database that contain the object as well as locating the object in the image and segmenting it from the background. Every considered image, both the ones used as queries and the ones contained in the target database, is represented as a Binary Partition Tree (BPT), the hierarchy of regions previously proposed by Salembier and Garrido (2000). This data structure offers multiple opportunities and challenges when applied to the object retrieval problem. A first application of BPTs appears during the formulation of the query, when the user must interactively segment the query object from the background. Firstly, the BPT can assist in adjusting an initial marker, such as a scribble or bounding box, to the object contours. Secondly, BPT can also define a navigation path for the user to adjust an initial selection to the appropriate spatial scale. The hierarchical structure of the BPT is also exploited to extract a new type of visual words named Hierarchical Bag of Regions (HBoR). Each region defined in the BPT is described with a feature vector that combines a soft quantization on a visual codebook with an efficient bottom-up computation through the BPT. These descriptors allow the definition of a novel feature space, the Parts Space, where each object is located according to the parts that compose it. HBoR descriptors have been applied to two scenarios for object retrieval, both of them solved by considering the decomposition of the objects in parts. In the first scenario, the query is formulated with a single object exemplar which is to be matched with each BPT in the target database. The matching problem is solved in two stages: an initial top-down one that assumes that the hierarchy from the query is respected in the target BPT, and a second bottom-up one that relaxes this condition and considers region merges which are not in the target BPT. The second scenario where HBoR descriptors are applied considers a query composed of several visual objects. In this case, the provided exemplars are considered as a training set to build a model of the query concept. This model is composed of two levels, a first one where each part is modelled and detected separately, and a second one that characterises the combinations of parts that describe the complete object. The analysis process exploits the hierarchical nature of the BPT by using a novel classifier that drives an efficient top-down analysis of the target BPTs.

  • Video Browser Showdown

     Ventura Royo, Carles; Martos Asensio, Manel; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Marques Acosta, Fernando
    Award or recognition

    View View Open in new window  Share

  • Hierarchical navigation and visual search for video keyframe retrieval

     Ventura Royo, Carles; Martos Asensio, Manuel; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Marques Acosta, Fernando
    International Conference on MultiMedia Modeling
    p. 652-654
    DOI: 10.1007/978-3-642-27355-1_67
    Presentation's date: 2012-01-06
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Teaching students to teach computers

     Mohedano Robles, Eva; Giro Nieto, Xavier
    Congrés Internacional de Docència Universitària i Innovació
    Presentation's date: 2012-07-05
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Part-based Object Retrieval with Binary Partition Trees

     Giro Nieto, Xavier
    IEEE Conference on Computer Vision and Pattern Recognition
    Presentation's date: 2012-06-18
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Interactive segmentation and tracking of video objects

     Giro Nieto, Xavier; Martos Asensio, Manuel
    International Workshop on Image Analysis for Multimedia Application Services
    Presentation's date: 2012-05-23
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Access to the full text
    Multiscale annotation of still images with GAT  Open access

     Giro Nieto, Xavier; Martos Asensio, Manel
    International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
    Presentation's date: 2012-05-21
    Presentation of work at congresses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    This paper presents GAT, a Graphical Annotation Tool for still images that works both at the global and local scales. This interface has been designed to assist users in the an- notation of images with relation to the semantic classes de- scribed in an ontology. Positive, negative and neutral labels can be assigned to both the whole images or parts of them. The user interface is capable of exploiting segmentation data to assist in the selection of objects. Moreover, the annota- tion capabilities are complemented with additional function- alities that allow the creation and evaluation of an image classi er. The implemented Java source code is published under a free software license.

  • Access to the full text
    Acoustic event detection based on feature-level fusion of audio and video modalities  Open access

     Butko, Taras; Canton Ferrer, Cristian; Segura, Carlos; Giro Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericas, Francisco Javier; Casas Pla, Josep Ramon
    Eurasip journal on advances in signal processing
    Vol. 2011, p. 1-11
    DOI: 10.1155/2011/485738
    Date of publication: 2011-03-15
    Journal article

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a large amount of errors, which are mostly due to temporal overlaps. Actually, temporal overlaps accounted for more than 70% of errors in the realworld interactive seminar recordings used in CLEAR 2007 evaluations. In this paper, we improve the recognition rate of acoustic events using information from both audio and video modalities. First, the acoustic data are processed to obtain both a set of spectrotemporal features and the 3D localization coordinates of the sound source. Second, a number of features are extracted from video recordings by means of object detection, motion analysis, and multicamera person tracking to represent the visual counterpart of several acoustic events. A feature-level fusion strategy is used, and a parallel structure of binary HMM-based detectors is employed in our work. The experimental results show that information from both the microphone array and video cameras is useful to improve the detection rate of isolated as well as spontaneously generated acoustic events.

  • (1874-4796) Recent patents on computer science

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • IEEE transactions on image processing

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • Procesado de vídeo multicámara empleando información de la escena: aplicación a eventos deportivos, interacción visual y 3DTV

     Gasull Llampallas, Antoni; Oliveras Verges, Albert; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Sayrol Clols, Elisa; Marques Acosta, Fernando; Salembier Clairon, Philippe Jean; Casas Pla, Josep Ramon
    Competitive project

     Share

  • Diversity ranking for video retrieval from a broadcaster archive

     Giro Nieto, Xavier; Alfaro Vendrell, Mónica; Marques Acosta, Fernando
    ACM International Conference on Multimedia Retrieval
    p. 1-8
    DOI: 10.1145/1991996.1992052
    Presentation's date: 2011-04-18
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Rich internet applications for semi-automatic annotation of semantic shots in keyframes

     Carcel Folch, Elisabet; Martos Asensio, Manel; Giro Nieto, Xavier; Marques Acosta, Fernando
    International Workshop on Computational Intelligence for Multimedia Understanding
    Presentation's date: 2011-12-14
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Rich internet application for semi-automatic annotation of semantic shots on keyframes

     Carcel, Elisabet; Martos Asensio, Manel; Giro Nieto, Xavier; Marques Acosta, Fernando
    International Workshop on Computational Intelligence for Multimedia Understanding
    p. 172-182
    DOI: 10.1007/978-3-642-32436-9_15
    Presentation's date: 2011
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    This paper describes a system developed for the semi-automatic annotation of keyframes in a broadcasting company. The tool aims at assisting archivists who traditionally label every keyframe manually by suggesting them an automatic annotation that they can intuitively edit and validate. The system is valid for any domain as it uses generic MPEG-7 visual descriptors and binary SVM classifiers. The classification engine has been tested on the multiclass problem of semantic shot detection, a type of metadata used in the company to index new content ingested in the system. The detection performance has been tested in two different domains: soccer and parliament. The core engine is accessed by a Rich Internet Application via a web service. The graphical user interface allows the edition of the suggested labels with an intuitive drag and drop mechanism between rows of thumbnails, each row representing a different semantic shot class. The system has been described as complete and easy to use by the professional archivists at the company

  • GOS: búsqueda visual de imágenes

     Cortés Yuste, Silvia; Giro Nieto, Xavier; Marques Acosta, Fernando
    Buran
    num. 25, p. 36-44
    Date of publication: 2010-04-21
    Journal article

     Share Reference managers Reference managers Open in new window

  • Access to the full text
    GAT, a Graphical Annotation Tool for semantic regions  Open access

     Giro Nieto, Xavier; Camps Hernando, Neus; Marques Acosta, Fernando
    Multimedia tools and applications
    Vol. 46, num. 2/3 (2010), p. 155-174
    DOI: 10.1007/s11042-009-0389-2
    Date of publication: 2010
    Journal article

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    This article presents GAT, a Graphical Annotation Tool based on a region-based hierarchical representation of images. The proposed solution uses Partition Trees to navigate through the image segments which are automatically defined at different spatial scales. Moreover, the system focuses on the navigation through ontologies for a semantic annotation of objects and of the parts that compose them. The tool has been designed under usability criteria to minimize the user interaction by trying to predict the future selection of regions and semantic classes. The implementation uses MPEG-7/XML input and output data to allow interoperability with any type of Partition Tree. This tool is publicly available and its source code can be downloaded under a free software license.

    The original publication is available at http://www.springerlink.com/content/j78782k762617352/

  • Versión final de los algoritmos de búsqueda mediante ejemplos

     Giro Nieto, Xavier; Martos Asensio, Manel; Ventura, Carles; Pont Tuset, Jordi; Marques Acosta, Fernando
    Date: 2010-11-28
    Report

     Share Reference managers Reference managers Open in new window

  • Arquitectura del sistema

     Giro Nieto, Xavier; Martos Asensio, Manel; Ventura, Carles
    Date: 2010-11-27
    Report

     Share Reference managers Reference managers Open in new window

  • Activo Experimental sobre integración y validación de tecnologías de detección, reconocimiento y Query by Example en los casos de uso

     Espinosa, Pedro; Giro Nieto, Xavier; Martos Asensio, Manel; Pont Tuset, Jordi; Sastre, Jaume; Varas, David; Marques Acosta, Fernando
    Date: 2010-11-28
    Report

     Share Reference managers Reference managers Open in new window

  • Visual Content Identification and Search. IEEE multimedia

     Giro Nieto, Xavier
    Vol. 18, num. 3
    Collaboration in journals

     Share

  • IEEE multimedia

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • Versión final del Activo Experimental sobre algoritmos de creación de árboles, de población de sus nodos y de anotación de imágenes basada en regiones

     Camps Hernando, Neus; Giro Nieto, Xavier; Pont Tuset, Jordi; Ventura, Carles; Marques Acosta, Fernando
    Date: 2010-11-27
    Report

     Share Reference managers Reference managers Open in new window

  • Activo experimental sobre algoritmos de detección de objetos

     Gasull Llampallas, Antoni; Giro Nieto, Xavier; León Cristóbal, Miriam; López Bakerkina, Khristina; Tapaswi, Makarand; Varas, David; Vilaplana Besler, Veronica; Marques Acosta, Fernando
    Date: 2010-11-28
    Report

     Share Reference managers Reference managers Open in new window

  • Premi al millor PFC a l'ETSETB en l'àrea de Serveis Telemàtics

     Giro Nieto, Xavier; Salla, Ramon; Vives, Xavier
    Award or recognition

     Share

  • Premios Liberalización de las Telecomunicaciones 2010

     Ruiz Sancho, Cristina; Cucurella, Eduard; Giro Nieto, Xavier
    Award or recognition

    View View Open in new window  Share

  • Adquisición multicámara para Free Viewpoint Video (MC4FVV)

     Pardas Feliu, Montserrat; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Salembier Clairon, Philippe Jean; Marques Acosta, Fernando; Gasull Llampallas, Antoni; Oliveras Verges, Albert; Sayrol Clols, Elisa; Casas Pla, Josep Ramon
    Competitive project

     Share

  • Digimatge, a Rich Internet Application for video retrieval from a Multimedia Asset Management system

     Giro Nieto, Xavier; Salla, Ramon; Vives, Xavier
    ACM SIGMM International Conference on Multimedia Information Retrieval
    Presentation's date: 2010-03-30
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • BitSearch, the blog before the thesis

     Giro Nieto, Xavier
    International Congress of University Teaching and Innovation
    Presentation's date: 2010-07-01
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Access to the full text
    System architecture of a web service for Content-Based Image Retrieval  Open access

     Giro Nieto, Xavier; Ventura, Carles; Pont Tuset, Jordi; Cortés Yuste, Silvia; Marques Acosta, Fernando
    ACM International Conference On Image And Video Retrieval
    p. 358-365
    DOI: 10.1145/1816041.1816093
    Presentation's date: 2010-07-06
    Presentation of work at congresses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    This paper presents the system architecture of a Content- Based Image Retrieval system implemented as a web service. The proposed solution is composed of two parts, a client run- ning a graphical user interface for query formulation and a server where the search engine explores an image repository. The separation of the user interface and the search engine follows a Service as a Software (SaaS) model, a type of cloud computing design where a single core system is online and available to authorized clients. The proposed architecture follows the REST software architecture and HTTP proto- col for communications, two solutions that combined with metadata coded in RDF, make the proposed system ready for its integration in the semantic web. User queries are formulated by visual examples through a graphical inter- face and content is remotely accessed also through HTTP communication. Visual descriptors and similarity measures implemented in this work are mostly de ned in the MPEG-7 standard, while textual metadata is coded according to the Dublin Core speci cations.

  • GRUP DE PROECESSAMENT D'IMATGE I VIDEO (GPI)

     Ruiz Hidalgo, Javier; Salembier Clairon, Philippe Jean; Gasull Llampallas, Antoni; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Sayrol Clols, Elisa; Oliveras Verges, Albert; Marques Acosta, Fernando; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Casas Pla, Josep Ramon
    Competitive project

     Share

  • Multimedia tools and applications

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • Multimedia tools and applications

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • Signal, Image and Video Processing

     Giro Nieto, Xavier
    Collaboration in journals

     Share

  • Audiovisual event detection towards scene understanding

     Canton Ferrer, Cristian; Butko, Taras; Segura, C.; Giro Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericas, Francisco Javier; Casas Pla, Josep Ramon
    IEEE Conference on Computer Vision and Pattern Recognition
    p. 840-847
    DOI: 10.1109/CVPRW.2009.5204264
    Presentation's date: 2009-06
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded including manual annotations of the data. A set of metrics allow assessing the performance of the presented algorithms. This dataset is made publicly available for research purposes.

    Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded including manual annotations of the data. A set of metrics allow assessing the performance of the presented algorithms. This dataset is made publicly available for research purposes.

  • Improving Detection of Acoustic Events Using Audiovisual Data and Feature Level Fusion

     Butko, Taras; Canton Ferrer, Cristian; Segura, C.; Giro Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericas, Francisco Javier; Casas Pla, Josep Ramon
    Annual Conference of the International Speech Communication Association
    p. 1147-1150
    Presentation's date: 2009-09
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Activo experimental sobre algoritmos de detección de objetos

     Giro Nieto, Xavier; Camps, N; Leon, M; Pont, J; Marques Acosta, Fernando
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window

  • Manual de uso del Graphic Annotation Tool (GAT)

     Camps, Neus; Giro Nieto, Xavier; Marques Acosta, Fernando
    Date: 2008-12
    Report

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Activo Experimental sobre algoritmos de consultas con ejemplos

     Giro Nieto, Xavier; Jordi, Pont; Silvia, Cortés; Marques Acosta, Fernando
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window

  • Estudio de los algoritmos para la consulta con ejemplos

     Giro Nieto, Xavier; Marques Acosta, Fernando
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window

  • Estudio de los algoritmos genéricos para el reconocimiento de cámaras

     Giro Nieto, Xavier; Marques Acosta, Fernando
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window

  • Desarrollo de los primeros algoritmos de indexación

     Calderero, Felipe; Casas Pla, Josep Ramon; Gasull Llampallas, Antoni; Giro Nieto, Xavier; Miriam, Leon; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Jordi, Pont; Salembier Clairon, Philippe Jean; Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window