Ranjan, Rakesh
Total activity: 5
Department
Department of Computer Architecture
E-mail
rakesh.ranjan@estudiant.upc.edu
Contact details
UPC directory

Scientific and technological production
  • Fg-STP: fine-grain single thread partitioning on multicores

     Ranjan, Rakesh; Latorre, Fernando; Marcuello, Pedro; Gonzalez Colas, Antonio Maria
    International Symposium on High-Performance Computer Architecture (HPCA)
    p. 15-24
    DOI: 10.1109/HPCA.2011.5749713
    Presentation date: 2011-02-12
    Presentation of work at congresses

    Power and complexity issues have led the microprocessor industry to shift to chip multiprocessors (CMPs) in order to better utilize the additional transistors provided by Moore's law. While parallel programs will be able to take the most advantage of these CMPs, single-threaded applications are not equipped to benefit from them. In this paper we propose Fine-Grain Single-Thread Partitioning (Fg-STP), a hardware-only scheme that takes advantage of CMP designs to speed up single-threaded applications. Our proposal improves single-thread performance by reconfiguring two cores so that they collaborate on the fetching and execution of instructions. These cores are essentially conventional out-of-order cores in which execution is orchestrated by dedicated hardware that has a minimal and localized impact on the original core design. This approach partitions the code at instruction granularity and differs from previous proposals in its extensive use of dependence speculation, replication and communication. These features are combined with the ability to look for parallelism in large instruction windows without any software intervention (no recompilation or profiling hints are needed). These characteristics allow Fg-STP to speed up single-threaded execution by 18% and 7% on average over similar hardware-only approaches such as Core Fusion, on medium-sized and small-sized 2-core CMPs respectively, for the SPEC 2006 benchmarks.

  • Speeding up sequential applications on multicore platforms

     Ranjan, Rakesh
    Department of Computer Architecture, Universitat Politècnica de Catalunya
    Theses

  • P-slice based efficient speculative multithreading (open access)

     Marcuello Pascual, Pedro; Latorre Salinas, Fernando; Gonzalez Colas, Antonio Maria; Ranjan, Rakesh
    International Conference on High Performance Computing
    p. 119-128
    DOI: 10.1109/HIPC.2009.5433216
    Presentation date: 2009-12-16
    Presentation of work at congresses

    The microprocessor industry has recently shifted towards multi-core designs to take advantage of the ever-increasing number of transistors provided by new technologies. Unfortunately, the multi-core approach does not allow single-threaded applications to benefit from the additional cores to improve their execution time. Speculative multithreading (SpMT) has been proposed in the past to boost the performance of irregular applications in multi-core environments. In this work, we study the main bottlenecks of these architectures, such as the memory behavior and the pre-computation slices, and propose two novel schemes that allow SpMT to achieve a 25% average speedup over single-threaded execution. We propose Selective Replication as a technique to improve the performance of the SpMT memory system. This technique does not introduce additional traffic on the bus and improves the performance of a conventional SpMT memory model by 6% on average and by up to 21% for some applications. We also propose a scheme called Slice Specialization that reduces the number of instructions in the pre-computation slices by adapting the slice to every single speculative thread spawned. The latter proposal outperforms previous slice-based schemes by 15%, and overall both techniques combined achieve an improvement of 20% over a conventional SpMT processor.

  • Microarquitectura i compiladors (ARCO)

     Tubella Murgadas, Jordi; Gonzalez Colas, Antonio Maria; Parcerisa Bundó, Joan Manuel; Canal Corretger, Ramon; Cruz Diaz, Josep-llorenç; Molina Clemente, Carlos Maria; Aliagas Castell, Carles; Aleta Ortega, Alexandre; Deb, Abhishek; Sreekar Shenoy, Govind; Pavlou, Demos; Herrero Abellanas, Enric; Yazdanpanah Ahmadabadi, Fahimeh; Bhagat, Indu; Lira Rueda, Javier; Lupon Navazo, Marc; Pons Sole, Marc; Ranjan, Rakesh; Ganapathy, Shrikanth; Jaksic, Zoran
    Competitive project

  • Pre-computation slice based efficient speculative multithreading

     Marcuello, Pedro; Latorre, Fernando; Gonzalez Colas, Antonio Maria; Ranjan, Rakesh
    Date: 2009-02-04
    Report
