Graphic summary
  • Show / hide key
  • Information


Scientific and technological production
  •  

1 to 50 of 50 results
  • Access to the full text
    Gesture control interface for immersive panoramic displays  Open access

     Alcoverro Vidal, Marcel; Suau, Xavier; Morros Rubió, Josep Ramon; López Méndez, Adolfo; Gil, Albert; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon
    Multimedia tools and applications
    Date of publication: 2013-07-25
    Journal article

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    In this paper, we propose a gesture-based interface designed to interact with panoramic scenes. The system combines novel static gestures with a fast hand tracking method. Our proposal is to use static gestures as shortcuts to activate functionalities of the system (i.e. volume up/down, mute, pause, etc.), and hand tracking to freely explore the panoramic video. The overall system is multi-user, and incorporates a user identification module based on face recognition, which is able both to recognize returning users and to add new users online. The system exploits depth data, making it robust to challenging illumination conditions.We show through experimental results the performance of every component of the system compared to the state of the art. We also show the results of a usability study performed with several untrained users.

    Aquest article es pot consultar a: http://link.springer.com/article/10.1007%2Fs11042-013-1605-7

  • Real-time AV renderer with support for WFS and full interactivity. FascinatE deliverable D5.1.4

     Kochale, Axel; Borsum, Malte; Spille, Jens; Kropp, Holger; Alcoverro Vidal, Marcel; Gil, Albert; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Suau Cuadros, Xavier; Macq, Jean François; Verzijp, Nico; Oldfield, Rob; Zoric, Goranka
    Date: 2013-07-01
    Report

     Share Reference managers Reference managers Open in new window

  • Collaborative Annotation of multi-MOdal, MultI-Lingual and multi-mEdia documents

     Hernando Pericas, Francisco Javier; Morros Rubió, Josep Ramon
    Participation in a competitive project

     Share

  • Corpus selection

     Adda, Gilles; Barras, Claude; Kernal Ekenel, Hazim; Morros Rubió, Josep Ramon; Hernando Pericas, Francisco Javier
    Date: 2013-03-31
    Report

     Share Reference managers Reference managers Open in new window

  • Fusion of colour and depth partitions for depth map coding

     Maceira Duch, Marc; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier
    International Conference on Digital Signal Processing
    Presentation's date: 2013-07-02
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    3D video coding includes the use of multiple color views and depth maps associated to each view. An adequate coding of depth maps should be adapted to the characteristics of depth maps: smooth regions and sharp edges. In this paper a segmentation-based technique is proposed for improving the depth map compression while preserving the main discontinuities that exploits the color-depth similarity of 3D video. An initial coarse depth map segmentation is used to locate the main discontinuities in depth. The resulting partition is improved by fusing a color partition. We assume that the color image is first encoded and available when the associated depth map is encoded, therefore the color partition can be segmented in the decoder without introducing any extra cost. A new segmentation criterion inspired by super-pixels techniques is proposed to obtain the color partition. Initial experimental results show similar compression efficiency to HEVC with a big potential for further improvements.

    3D video coding includes the use of multiple color views and depth maps associated to each view. An adequate coding of depth maps should be adapted to the characteristics of depth maps: smooth regions and sharp edges. In this paper a segmentation-based technique is proposed for improving the depth map compression while preserving the main discontinuities that exploits the color-depth similarity of 3D video. An initial coarse depth map segmentation is used to locate the main discontinuities in depth. The resulting partition is improved by fusing a color partition. We assume that the color image is first encoded and available when the associated depth map is encoded, therefore the color partition can be segmented in the decoder without introducing any extra cost. A new segmentation criterion inspired by super-pixels techniques is proposed to obtain the color partition. Initial experimental results show similar compression efficiency to HEVC with a big potential for further improvements.

  • Algoritmos de anotación e indexado de vídeo: reconocimiento de caras y reconocimiento de texto

     Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon
    Date: 2012-12-31
    Report

     Share Reference managers Reference managers Open in new window

  • Fusión de la anotación multimedia: análisis de cromos y selección de keyframe representativos.

     Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon; Ventura Royo, Carles
    Date: 2012-12-31
    Report

     Share Reference managers Reference managers Open in new window

  • Depth map coding based on a optimal hierarchical region representation

     Maceira Duch, Marc; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon
    3DTV Conference
    Presentation's date: 2012-10-15
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    Multiview color information used jointly with depth maps is a widespread technique for 3D video. Using this depth information, 3D functionalities such as free view point video can be provided by means of depth-image-based rendering techniques. In this pa- per, a new technique to encode depth maps is proposed. Based on the usually smooth structure and the sharp edges of depth map, our proposal segments the depth map into homogeneous regions of ar- bitrary shape and encodes the contents of these regions using dif- ferent texture coding strategies. An optimal lagrangian approach is applied to the hierarchical region representation provided by our segmentation technique. This approach automatically selects the best encoding strategy for each region and the optimal partition to encode the depth map. To avoid the high coding costs of coding the resulting partition, a prediction is made using the associated decoded color image

  • Promeds: an adaptive robust fundamental matrix estimation approach

     Irurueta Carro, Alberto; Morros Rubió, Josep Ramon
    3DTV Conference
    Presentation's date: 2012-10-17
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Multiview depth coding based on combined color/depth segmentation

     Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Aflaki, Payman; Calderero Patino, Felipe; Marques Acosta, Fernando
    Journal of visual communication and image representation
    Date of publication: 2011-08-18
    Journal article

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    In this paper, a new coding method for multiview depth video is presented. Considering the smooth structure and sharp edges of depth maps, a segmentation based approach is proposed. This allows further preserving the depth contours thus introducing fewer artifacts in the depth perception of the video. To reduce the cost associated with partition coding, an approximation of the depth partition is built using the decoded color view segmentation. This approximation is refined by sending some complementary information about the relevant differences between color and depth partitions. For coding the depth content of each region, a decomposition into orthogonal basis is used in this paper although similar decompositions may be also employed. Experimental results show that the proposed segmentation based depth coding method outperforms H.264/AVC and H.264/MVC by more than 2 dB at similar bitrates.

  • Fusión de la anotación multimedia

     Alvarez, Federico; Garcia Serrano, Ana; Iturraspe, Urtza; Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon; Loscos, Alex
    Date: 2011-12-31
    Report

     Share Reference managers Reference managers Open in new window

  • Algoritmos de anotación e indexado de vídeo, segunda versión

     Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon; Ventura Royo, Carles; Iturraspe, Urtza; Alvarez, Federico; Garcia Serrano, Ana
    Date: 2011-12-30
    Report

     Share Reference managers Reference managers Open in new window

  • Procesado de vídeo multicámara empleando información de la escena: aplicación a eventos deportivos, interacción visual y 3DTV

     Gasull Llampallas, Antoni; Salembier Clairon, Philippe Jean; Marques Acosta, Fernando; Sayrol Clols, Elisa; Pardas Feliu, Montserrat; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Vilaplana Besler, Veronica; Giro Nieto, Xavier; Oliveras Verges, Albert; Casas Pla, Josep Ramon
    Participation in a competitive project

     Share

  • HESPERIA Homeland security: tecnologías para la seguridad integral en espacios públicos e infraestructuras CENIT-2005 Entregable 3.1.1 Revisión del estado del arte 2009

     Ruiz Hidalgo, Javier; Sainz, Félix; Albiol Colomer, Antonio; Albiol Colomer, Alberto; Morros Rubió, Josep Ramon
    Date: 2010-01-01
    Report

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • HESPERIA Homeland security: tecnologías para la seguridad integral en espacios públicos e infraestructuras CENIT-2005 Paquete de Trabajo 5, Actividad 5.2 E.5.2.1 Descripción del plan de pruebas

     Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Albiol Colomer, Antonio; Albiol Colomer, Alberto; Silla Martínez, María Julia; Sainz, Félix
    Date: 2010-02-01
    Report

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • E.3.2.3a.: detección de gestos con el sistema EyeRIS

     Alba, Luis; Morros Rubió, Josep Ramon; García, Mario
    Date: 2010-01-19
    Report

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    El documento presenta una descripción de una interfaz gestual implementada sobre el sistema de visión EyeRIS. Se especifican en primer lugar el tipo de gestos a detectar para revisar a continuación las técnicas utilizadas para la detección y reconocimiento de los diferentes gestos.

  • Format-Agnostic SCript-based INterAcTive Experience

     Morros Rubió, Josep Ramon; Casas Pla, Josep Ramon; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Ruiz Hidalgo, Javier
    Participation in a competitive project

     Share

  • Adquisición multicámara para Free Viewpoint Video (MC4FVV)

     Pardas Feliu, Montserrat; Giro Nieto, Xavier; Vilaplana Besler, Veronica; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Salembier Clairon, Philippe Jean; Marques Acosta, Fernando; Gasull Llampallas, Antoni; Oliveras Verges, Albert; Sayrol Clols, Elisa; Casas Pla, Josep Ramon
    Participation in a competitive project

     Share

  • FascinatE: Format-Agnostic SCript-based INterAcTive Experience

     Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon; Suau Cuadros, Xavier; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Marques Acosta, Fernando
    Participation in a competitive project

     Share

  • GRUP DE PROECESSAMENT D'IMATGE I VIDEO (GPI)

     Oliveras Verges, Albert; Sayrol Clols, Elisa; Pardas Feliu, Montserrat; Morros Rubió, Josep Ramon; Vilaplana Besler, Veronica; Ruiz Hidalgo, Javier; Giro Nieto, Xavier; Marques Acosta, Fernando; Gasull Llampallas, Antoni; Salembier Clairon, Philippe Jean; Casas Pla, Josep Ramon
    Participation in a competitive project

     Share

  • Gray-scale erosion algorithm based on image bitwise decomposition: application to focal plane processors

     Frías-Velázquez, Andrés; Morros Rubió, Josep Ramon
    IEEE International Conference on Acoustics, Speech and Signal Processing
    Presentation's date: 2009-04-24
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    A novel approach to implement gray-scale morphological operations is presented in this work. This new technique is based on the bitwise decomposition of the gray-scale image, yielding bitplanes disposed according to their bit of significance. It is of particular interest for implementations on Focal Plane Processors. Our approach relies on the binary search method to obtain either the maximum or minimum on a local neighborhood by manipulating the binary levels resulting from the bitwise decomposition with simple logic functions. This contrasts significantly with the classical Threshold Decomposition (TD) approach, on which most of the current techniques are based on. Our method shows better efficiency than TD implementations. Further gains can be obtained because our method shows a strong dependency on the image dynamic range.

  • Histogram computation based on image bitwise decomposition

     Frías-Velázquez, Andrés; Morros Rubió, Josep Ramon
    IEEE International Conference on Image Processing
    Presentation's date: 2009-11-10
    Presentation of work at congresses

    Read the abstract Read the abstract View View Open in new window  Share Reference managers Reference managers Open in new window

    In this paper, a new method to compute the image histogram is presented, along with the image maximum and minimum values. It is intended for highly paral- lel architectures such as the ones found in Focal Plane Processors (FPP). This new approach exploits this par- allelism relying on the privatization technique to avoid the memory collision problem, while the bin frequency is obtained through image bitwise manipulation. Unlike traditional privatization techniques, our method exhibits a trade-o between processing time and bin size. That is, it can be adapted as a power-of-two bin size histogram and the computation time decreases exponentially as the bin size is reduced on each power of two, allowing high computational exibility.

  • Access to the full text
    Multimodal identification and localization of users in a smart environment  Open access

     Salah, Albert Ali; Morros Rubió, Josep Ramon; Luque Serrano, Jordi; Segura, Carlos; Hernando Pericas, Francisco Javier; Ambekar, Onkar; Schouten, Ben; Pauwels, Eric
    Journal on Multimodal user interfaces
    Date of publication: 2008-09
    Journal article

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    Detecting the location and identity of users is a first step in creating contextaware applications for technologically-endowed environments. We propose a system that makes use of motion detection, person tracking, face identification, feature-based identification, audio-based localization, and audio-based identification modules, fusing information with particle filters to obtain robust localization and identification. The data streams are processed with the help of the generic client-server middleware SmartFlow, resulting in a flexible architecture that runs across different platforms.

    The original publication is available at www.springerlink.com

  • Revisión del estado del arte 2007

     Ruiz Hidalgo, Javier; Félix, Sainz; Antonio, Albiol; Alberto, Albiol; Marques Acosta, Fernando; Casas Pla, Josep Ramon; Morros Rubió, Josep Ramon; Canton Ferrer, Cristian; Pardas Feliu, Montserrat; Batalle, Dafnis Demian Bola
    Date: 2008-01
    Report

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Desarrollo de los primeros algoritmos de indexación

     Calderero, Felipe; Casas Pla, Josep Ramon; Gasull Llampallas, Antoni; Giro Nieto, Xavier; Miriam, Leon; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Jordi, Pont; Salembier Clairon, Philippe Jean; Vilaplana Besler, Veronica; Morros Rubió, Josep Ramon
    Date: 2008-12
    Report

     Share Reference managers Reference managers Open in new window

  • Selección de agentes a detectar y descriptores de bajo nivel asociados

     Ruiz-Hidalgo, J; Ruiz Hidalgo, Javier; Sainz, F; Albiol, A; Marques Acosta, Fernando; Casas Pla, Josep Ramon; Morros Rubió, Josep Ramon; Canton Ferrer, Cristian; Pardas Feliu, Montserrat
    Date: 2007-03
    Report

     Share Reference managers Reference managers Open in new window

  • Diccionari de Telecomunicacions

     Aguilar Igartua, Mónica; Alcober Segura, Jesus Angel; Altes Bosch, Jorge; Aragones Cervera, Xavier; Artigas Garcia, David; Bardes Llorensi, Daniel; Barlabe Dalmau, Antoni; Bragos Bardia, Ramon; Calderer Cardona, Josep; Cardama Aznar, Angel; Casademont Serra, Jordi; Casals Ibañez, Lluis; Comeron Tejero, Adolfo; Cotrina Navau, Josep; Cruz Llopis, Luis Javier de La; Dios Otin, Victor Federico; Duxans Barrobes, Helena; Esparza Martin, Oscar; Esquerra Llucià, Ignasi; Garcia Vizcaino, David; Garcies Salva, Pau; Gomez Montenegro, Carlos; Gorricho Moreno, Juan Luis; Guinjoan Gispert, Francisco; Hesselbach Serra, Xavier; Liria Righetti, Antoni; Lopez Salcedo, Jose Antonio; Madrenas Boadas, Jordi; Madueño Ruiz, María Isabel; Mestre Pons, Francesc Xavier; Monte Moreno, Enrique; Morros Rubió, Josep Ramon; Muñoz Tapia, Jose Luis; Pallares Segarra, Esteve; Pons Nin, Joan; Recolons Martos, Jaume; Rincon Rivera, David; Riu Costa, Pere Joan; Ruiz Vela, Inmaculada; Pradell Cara, Lluis; Pascual Iserte, Antonio; Prat Viñas, Luis; Rey Micolau, Francesc; Villares Piera, N. Javier
    Date of publication: 2007-03
    Book

     Share Reference managers Reference managers Open in new window

  • Procesado de vídeo en entornos controlados: aplicación a seguridad, salas inteligentes y telepresencia (PROVEC)

     Casas Pla, Josep Ramon; Gasull Llampallas, Antoni; Giro Nieto, Xavier; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Oliveras Verges, Albert; Pardas Feliu, Montserrat; Ruiz Hidalgo, Javier; Salembier Clairon, Philippe Jean; Sayrol Clols, Elisa; Vilaplana Besler, Veronica
    Participation in a competitive project

     Share

  • Comunicaciones de video de nueva generación (VISION A3) - CENIT

     Morros Rubió, Josep Ramon; Salembier Clairon, Philippe Jean
    Participation in a competitive project

     Share

  • Estado del arte de las tecnologías de conocimiento, visión y audio cognitivo

     Ruiz Hidalgo, Javier; Pardas Feliu, Montserrat; Casas Pla, Josep Ramon; Canton Ferrer, Cristian; Ferran Bennström, Igorchristian; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Vilaplana Besler, Veronica; Giro Nieto, Xavier
    Date: 2006-11
    Report

     Share Reference managers Reference managers Open in new window

  • Audio, Video and Multimodal Person Identification in a Smart Room

     Luque, J; Morros Rubió, Josep Ramon; Garde, A; Anguita Ortega, Jan; Farrús Cabecerán, Mireia; Macho, D; Marques Acosta, Fernando; Martínez, C; Vilaplana Besler, Veronica; Hernando Pericas, Francisco Javier
    Lecture notes in computer science
    Date of publication: 2006-01
    Journal article

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Multimodal Person Identification in a Smart Room

     Luque, J; Morros Rubió, Josep Ramon; Anguita, J; Farrús, M; Macho, D; Marques Acosta, Fernando; Martínez, C; Vilaplana Besler, Veronica; Hernando Pericas, Francisco Javier
    Jornadas en Tecnología del Habla
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Audio, Video and Multimodal Person Identification in a Smart Room

     Luque, J; Morros Rubió, Josep Ramon; Garde, A; Anguita Ortega, Jan; Farrús Cabecerán, Mireia; Macho, D; Marques Acosta, Fernando; Martínez, C; Vilaplana Besler, Veronica; Hernando Pericas, Francisco Javier
    CLEAR'06 Evaluation Campaign and Workshop - Classification of Events, Activities and Relationships
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • Overview 2D Face Detection

     Morros Rubió, Josep Ramon
    CLEAR'06 Evaluation Campaign and Workshop - Classification of Events, Activities and Relationships
    Presentation's date: 2006-04-06
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • - Homeland security tecnologías para la seguridad integral en espacios públicos e infraestructuras, HESPERIA

     Salembier Clairon, Philippe Jean; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Marques Acosta, Fernando
    Participation in a competitive project

     Share

  • Audio, Video and Multimodal Person Identification in a Smart Room

     Morros Rubió, Josep Ramon
    CLEAR'06 Evaluation Campaign and Workshop - Classification of Events, Activities and Relationships
    Presentation's date: 2006-04-06
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • IEEE transactions on image processing

     Morros Rubió, Josep Ramon
    Collaboration in journals

     Share

  • 2005SGR-00341 GRUP DE PROCESSAMENT D'IMATGE I VIDEO

     Gasull Llampallas, Antoni; Casas Pla, Josep Ramon; Giro Nieto, Xavier; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Oliveras Verges, Albert; Pardas Feliu, Montserrat; Ruiz Hidalgo, Javier; Salembier Clairon, Philippe Jean; Sayrol Clols, Elisa; Vilaplana Besler, Veronica
    Participation in a competitive project

     Share

  • Eurasip Journal on Applied Signal Processing

     Morros Rubió, Josep Ramon
    Collaboration in journals

     Share

  • OPTIMIZATION OF SEGMENTATION-BASED VIDEO SEQUENCE CODING TECHNIQUES. APPLICATION TO CONTENT BASED FUNCTIONALITIES  Open access

     Morros Rubió, Josep Ramon
    Defense's date: 2004-12-23
    Department of Signal Theory and Communications, Universitat Politècnica de Catalunya
    Theses

    Read the abstract Read the abstract Access to the full text Access to the full text Open in new window  Share Reference managers Reference managers Open in new window

    En aquest treball s'estudia el problema de la compressió de video utilitzant funcionalitats basades en el contingut en el marc teòric dels sistemes de codificació de seqüències de video basats en regions. Es tracten bàsicament dos problemes: El primer està relacionat amb com es pot aconseguir una codificació òptima en sistemes de codificació de video basats en regions. En concret, es mostra com es pot utilitzar un metodologia de 'rate-distortion' en aquest tipus de problemes. El segon problema que es tracta és com introduir funcionalitats basades en el contingut en un d'aquests sistemes de codificació de video.La teoria de 'rate-distortion' defineix l'optimalitat en la codificació com la representació d'un senyal que, per una taxa de bits donada, resulta en una distorsió mínima al reconstruir el senyal. En el cas de sistemes de codificació basats en regions, això implica obtenir una partició òptima i al mateix temps, un repartiment òptim dels bits entre les diferents regions d'aquesta partició. Aquest problema es formalitza per sistemes de codificació no escalables i es proposa un algorisme per solucionar-lo. Aquest algorisme s'aplica a un sistema de codificació concret anomenat SESAME. En el SESAME, cada quadre de la seqüència de video es segmenta en un conjunt de regions que es codifiquen de forma independent. La segmentació es fa seguint criteris d'homogeneitat espaial i temporal. Per eliminar la redundància temporal, s'utilitza un sistema predictiu basat en la informació de moviment tant per la partició com per la textura. El sistema permet seguir l'evolució temporal de cada regió per tota la seqüència. Els resultats de la codificació són òptims (o quasi-òptims) pel marc donat en un sentit de 'rate-distortion'. El procés de codificació inclou trobar una partició òptima i també trobar la tècnica de codificació i nivell de qualitat més adient per cada regió. Més endavant s'investiga el problema de codificació de video en sistemes amb escalabilitat i que suporten funcionalitats basades en el contingut. El problema es generalitza incloent en l'esquema de codificació les dependències espaials i temporals entre els diferents quadres o entre les diferents capes d'escalabilitat. En aquest cas, la solució requereix trobar la partició òptima i les tècniques de codificació de textura òptimes tant per la capa base com per la capa de millora. A causa de les dependències que hi ha entre aquestes capes, la partició i el conjunt de tècniques de codificació per la capa de millora dependran de les decisions preses en la capa base. Donat que aquest tipus de solucions generalment són molt costoses computacionalment, també es proposa una solució que no té en compte aquestes dependències.Els algorismes obtinguts s'apliquen per extendre SESAME. El sistema de codificació extès, anomenat XSESAME suporta diferents tipus d'escalabilitat (PSNR, espaial i temporal) així com funcionalitats basades en el contingut i la possibilitat de seguiment d'objectes a través de la seqüència de video. El sistema de codificació permet utilitzar dos modes diferents pel que fa a la selecció de les regions de la partició de la capa de millora: El primer mode (supervisat) està pensat per utilitzar funcionalitats basades en el contingut. El segon mode (no supervisat) no suporta funcionalitats basades en el contingut i el seu objectiu és simplement obtenir una codificació òptima a la capa de millora.Un altre tema que s'ha investigat és la integració d'un mètode de seguiment d'objectes en el sistema de codificació. En el cas general, el seguiment d'objectes en seqüències de video és un problema molt complex. Si a més aquest seguiment es vol integrar en un sistema de codificació apareixen problemes addicionals degut a que els requisits necessaris per obtenir eficiència en la codificació poden entrar en conflicte amb els requisits per una bona precisió en el seguiment d'objectes. Aquesta aparent incompatibilitat es soluciona utilitzant un enfocament basat en una doble partició de cada quadre de la seqüència. La partició que s'utilitza per la codificació es resegmenta utilitzant criteris purament espaials. Al projectar aquesta segona partició permet una millor adaptació dels contorns de l'objecte a seguir. L'excés de regions que implicaria aquesta re-segmentació s'elimina amb una etapa de fusió de regions realitzada a posteriori.

    En este trabajo se estudia el problema de la compresión de vídeo utilizando funcionalidades basadas en el contenido en el marco teórico de los sistemas de codificación de secuencias de vídeo basados en regiones. Se tratan básicamente dos problemas: El primero está relacionado con la obtención de una codificación óptima en sistemas de codificación de vídeo basados en regiones. En concreto, se muestra como se puede utilizar un metodología de 'rate-distortion' para este tipo de problemas. El segundo problema tratado es como introducir funcionalidades basadas en el contenido en uno de estos sistemas de codificación de vídeo.La teoría de 'rate-distortion' define la optimalidad en la codificación como la representación de una señal que, para un tasa de bits dada, resulta en una distorsión mínima al reconstruir la señal. En el caso de sistemas de codificación basados en regiones, esto implica obtener una partición óptima y al mismo tiempo, un reparto óptimo de los bits entre las diferentes regiones de esta partición. Este problema se formaliza para sistemas de codificación no escalables y se propone un algoritmo para solucionar este problema. Este algoritmo se aplica a un sistema de codificación concreto llamado SESAME. En SESAME, cada cuadro de la secuencia de vídeo se segmenta en un conjunto de regiones que se codifican de forma independiente. La segmentación se hace siguiendo criterios de homogeneidad espacial y temporal. Para eliminar la redundancia temporal, se utiliza un sistema predictivo basado en la información de movimiento tanto para la partición como para la textura. El sistema permite seguir la evolución temporal de cada región a lo largo de la secuencia. Los resultados de la codificación son óptimos (o casi-óptimos) para el marco dado en un sentido de 'rate-distortion'. El proceso de codificación incluye encontrar una partición óptima y también encontrar la técnica de codificación y nivel de calidad más adecuados para cada región.Más adelante se investiga el problema de la codificación de vídeo en sistemas con escalabilidad y que suporten funcionalidades basadas en el contenido. El problema se generaliza incluyendo en el esquema de codificación las dependencias espaciales y temporales entre los diferentes cuadros o entre las diferentes capas de escalabilidad. En este caso, la solución requiere encontrar la partición óptima y las técnicas de codificación de textura óptimas tanto para la capa base como para la capa de mejora. A causa de les dependencias que hay entre estas capas, la partición y el conjunto de técnicas de codificación para la capa de mejora dependerán de las decisiones tomadas en la capa base. Dado que este tipo de soluciones generalmente son muy costosas computacionalmente, también se propone una solución que no tiene en cuenta estas dependencias.Los algoritmos obtenido se usan en la extensión de SESAME. El sistema de codificación extendido, llamado XSESAME soporta diferentes tipos de escalabilidad (PSNR, espacial y temporal) así como funcionalidades basadas en el contenido y la posibilidad de seguimiento de objetos a través de la secuencia de vídeo. El sistema de codificación permite utilizar dos modos diferentes por lo que hace referencia a la selección de les regiones de la partición de la capa de mejora: El primer modo (supervisado) está pensado para utilizar funcionalidades basadas en el contenido. El segundo modo (no supervisado) no soporta funcionalidades basadas en el contenido y su objetivo es simplemente obtener una codificación óptima en la capa de mejora.Otro tema investigado es la integración de un método de seguimiento de objetos en el sistema de codificación.En el caso general, el seguimiento de objetos en secuencias de vídeo es un problema muy complejo. Si este seguimiento se quiere integrar en un sistema de codificación aparecen problemas adicionales debido a que los requisitos necesarios para obtener eficiencia en la codificación pueden entrar en conflicto con los requisitos para obtener una buena precisión en el seguimiento de objetos. Esta aparente incompatibilidad se soluciona usando un enfoque basado en una doble partición de cada cuadro de la secuencia. La partición que se usa para codificar se resegmenta usando criterios puramente espaciales. Proyectando esta segunda partición se obtiene una mejor adaptación de los contornos al objeto a seguir. El exceso de regiones que implicaría esta resegmentación se elimina con una etapa de fusión de regiones realizada a posteriori.

    This work addresses the problem of video compression with content-based functionalities in the framework of segmentation-based video coding systems. Two major problems are considered. The first one is related with coding optimality in segmentation-based coding systems. Regarding this subject, the feasibility of a rate-distortion approach for a complete region-based coding system is shown. The second one is how to address content-based functionalities in the coding system proposed as a solution of the first problem. Optimality, as defined in the framework of rate-distortion theory, deals with obtaining a representation of the video sequence that leads to a minimum distortion of the coded signal for a given bit budget. In the case of segmentation-based coding systems this means to obtain an 'optimal' partition together with the best coding technique for each region of this partition so that the result is optimal in an operational rate-distortion sense. The problem is formalized for independent, non-scalable coding.An algorithm to solve this problem is provided as well.This algorithms is applied to a specific segmentation-based coding system, the so called SESAME. In SESAME, each frame is segmented into a set of regions, that are coded independently. Segmentation involves both spatial and motion homogeneity criteria. To exploit temporal redundancy, a prediction for both the partition and the texture of the current frame is created by using motion information. The time evolution of each region is defined along the sequence (time tracking). The results are optimal (or near-optimal) for the given framework in a rate-distortion sense. The definition of the coding strategy involves a global optimization of the partition as well as of the coding technique/quality level for each region. Later, the investigation is also extended to the problem of video coding optimization in the framework of a scalable video coding system that can address content-based functionalities. The focus is set in the various types of content-based scalability and object tracking. The generality of the problem has also been extended by including the spatial and temporal dependencies between frames and scalability layers into the optimization schema. In this case the solution implies finding the optimal partition and set of quantizers for both the base and the enhancement layers. Due to the coding dependencies of the enhancement layer with respect to the base layer, the partition and the set of quantizers of the enhancement layer depend on the decisions made on the base layer. Also, a solution for the independent optimization problem (i.e. without tacking into account dependencies between different frames of scalability layers) has been proposed to reduce the computational complexity. These solutions are used to extend the SESAME coding system. The extended coding system, named XSESAME, supports different types of scalability (PSNR, Spatial and temporal) as well as content-based functionalities, such as content-based scalability and object tracking. Two different operating modes for region selection in the enhancement layer have been presented: One (supervised) aimed at providing content-based functionalities at the enhancement layer and the other (unsupervised) aimed at coding efficiency, without content-based functionalities. Integration of object tracking into the segmentation-based coding system is also investigated.In the general case, tracking is a very complex problem. If this capability has to be integrated into a coding system, additional problems arise due to conflicting requirements between coding efficiency and tracking accuracy. This is solved by using a double partition approach, where pure spatial criteria are used to re-segment the partition used for coding. The projection of the re-segmented partition results in more precise adaptation to object contours. A merging step is performed a posteriori to eliminate the excess of regions originated by the re-segmentation.

  • COMPUTERS IN THE HUMAN INTERACTION LOOP

     Casas Pla, Josep Ramon; Pardas Feliu, Montserrat; Ruiz Hidalgo, Javier; Morros Rubió, Josep Ramon; Giro Nieto, Xavier; Marques Acosta, Fernando
    Participation in a competitive project

     Share

  • Object matching based on partition information

     Marques Acosta, Fernando; Pardas Feliu, Montserrat; Morros Rubió, Josep Ramon
    IEEE International Conference on Image Processing
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window

  • 2001SGR-00265 GRUP DE TRACTAMENT DE LA IMATGE

     Casas Pla, Josep Ramon; Gasull Llampallas, Antoni; Giro Nieto, Xavier; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Oliveras Verges, Albert; Pardas Feliu, Montserrat; Ruiz Hidalgo, Javier; Salembier Clairon, Philippe Jean; Vilaplana Besler, Veronica
    Participation in a competitive project

     Share

  • Segmentation of video sequences and rate control

     Marcotegui, Beatriz; Marques Acosta, Fernando; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Salembier Clairon, Philippe Jean
    Annales des télecommunications. Annals of telecommunications
    Date of publication: 1997-07
    Journal article

     Share Reference managers Reference managers Open in new window

  • Segmentation-based video coding system allowing the manipulation of objects

     Salembier Clairon, Philippe Jean; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Morros Rubió, Josep Ramon; Corset, I; Jeannin, S; Bouchard, L; Meyer, F; Marcotegui, B
    IEEE transactions on circuits and systems for video technology
    Date of publication: 1997-02
    Journal article

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Scalable segmentation-based coding of video sequences addressing content-based functionalities

     Morros Rubió, Josep Ramon; Marques Acosta, Fernando
    IEEE International Conference on Image Processing
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Video sequence segmentation based on rate-distortion theory

     Morros Rubió, Josep Ramon; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Salembier Clairon, Philippe Jean
    Visual Communications and image Processing
    Presentation's date: 1996-03-18
    Presentation of work at congresses

    View View Open in new window  Share Reference managers Reference managers Open in new window

  • Segmented picture coding method and system ad corresponding decoding method and system

     Salembier Clairon, Philippe Jean; Marques Acosta, Fernando; Pardas Feliu, Montserrat; Corset, I; Bouchard, L; Jeannin, S; Morros Rubió, Josep Ramon; Meyer, F; Marcotegui, B
    Date of request: 1996-04-29
    Invention patent

     Share Reference managers Reference managers Open in new window

  • Interleaved segmentation and motion estimation by means of morphological tools

     Marques Acosta, Fernando; Bouchard, L; Corset, I; Jeannin, S; Morros Rubió, Josep Ramon; Pardas Feliu, Montserrat; Salembier Clairon, Philippe Jean; Torres Urgell, Luis
    Workshop on Image Analysis and Synthesis in Image Coding
    Presentation of work at congresses

     Share Reference managers Reference managers Open in new window