Chiesi, Matteo
  
(2014)
Heterogeneous Multi-core Architectures for High Performance Computing, [Dissertation thesis], Alma Mater Studiorum Università di Bologna. 
 Dottorato di ricerca in 
Tecnologie dell'informazione, 26 Ciclo. DOI 10.6092/unibo/amsdottorato/6469.
  
 
  
  
        
        
        
  
  
  
  
  
  
  
    
  
    
      Documenti full-text disponibili:
      
    
  
  
    
      Abstract
      This thesis deals with heterogeneous architectures in standard workstations.  
Heterogeneous architectures represent an appealing alternative to traditional supercomputers because they are based on commodity components fabricated in large quantities. Hence their price-performance ratio is unparalleled in the world of high performance computing (HPC). 
In particular, different aspects related to the performance and consumption of heterogeneous architectures have been explored.
The thesis initially focuses on an efficient implementation of a parallel application, where the execution time is dominated by an high number of floating point instructions. Then the thesis touches the central problem of efficient management of power peaks in heterogeneous computing systems. Finally it discusses a memory-bounded problem, where the execution time is dominated by the memory latency.    
Specifically, the following main contributions have been carried out:
A novel framework for the design and analysis of solar field for Central Receiver Systems (CRS) has been developed. The implementation based on desktop workstation equipped with multiple Graphics Processing Units (GPUs) is motivated by the need to have an accurate and fast simulation environment for studying mirror imperfection and non-planar geometries.
Secondly, a power-aware scheduling algorithm on heterogeneous CPU-GPU architectures, based on an efficient distribution of the computing workload to the resources, has been realized. The scheduler manages the resources of several computing nodes with a view to reducing the peak power. The two main contributions of this work follow:
the approach reduces the supply cost due to high peak power whilst having negligible impact on the parallelism of computational nodes. 
from another point of view the developed model allows designer to increase the number of cores without increasing the capacity of the power supply unit. 
 
Finally, an implementation for efficient graph exploration on reconfigurable architectures is presented.
The purpose is to accelerate graph exploration, reducing the number of random memory accesses.
     
    
      Abstract
      This thesis deals with heterogeneous architectures in standard workstations.  
Heterogeneous architectures represent an appealing alternative to traditional supercomputers because they are based on commodity components fabricated in large quantities. Hence their price-performance ratio is unparalleled in the world of high performance computing (HPC). 
In particular, different aspects related to the performance and consumption of heterogeneous architectures have been explored.
The thesis initially focuses on an efficient implementation of a parallel application, where the execution time is dominated by an high number of floating point instructions. Then the thesis touches the central problem of efficient management of power peaks in heterogeneous computing systems. Finally it discusses a memory-bounded problem, where the execution time is dominated by the memory latency.    
Specifically, the following main contributions have been carried out:
A novel framework for the design and analysis of solar field for Central Receiver Systems (CRS) has been developed. The implementation based on desktop workstation equipped with multiple Graphics Processing Units (GPUs) is motivated by the need to have an accurate and fast simulation environment for studying mirror imperfection and non-planar geometries.
Secondly, a power-aware scheduling algorithm on heterogeneous CPU-GPU architectures, based on an efficient distribution of the computing workload to the resources, has been realized. The scheduler manages the resources of several computing nodes with a view to reducing the peak power. The two main contributions of this work follow:
the approach reduces the supply cost due to high peak power whilst having negligible impact on the parallelism of computational nodes. 
from another point of view the developed model allows designer to increase the number of cores without increasing the capacity of the power supply unit. 
 
Finally, an implementation for efficient graph exploration on reconfigurable architectures is presented.
The purpose is to accelerate graph exploration, reducing the number of random memory accesses.
     
  
  
    
    
      Tipologia del documento
      Tesi di dottorato
      
      
      
      
        
      
        
          Autore
          Chiesi, Matteo
          
        
      
        
          Supervisore
          
          
        
      
        
      
        
          Dottorato di ricerca
          
          
        
      
        
          Scuola di dottorato
          Scienze e ingegneria dell'informazione
          
        
      
        
          Ciclo
          26
          
        
      
        
          Coordinatore
          
          
        
      
        
          Settore disciplinare
          
          
        
      
        
          Settore concorsuale
          
          
        
      
        
          Parole chiave
          Heterogeneous Architectures, GPU, FPGA
          
        
      
        
          URN:NBN
          
          
        
      
        
          DOI
          10.6092/unibo/amsdottorato/6469
          
        
      
        
          Data di discussione
          28 Aprile 2014
          
        
      
      URI
      
      
     
   
  
    Altri metadati
    
      Tipologia del documento
      Tesi di dottorato
      
      
      
      
        
      
        
          Autore
          Chiesi, Matteo
          
        
      
        
          Supervisore
          
          
        
      
        
      
        
          Dottorato di ricerca
          
          
        
      
        
          Scuola di dottorato
          Scienze e ingegneria dell'informazione
          
        
      
        
          Ciclo
          26
          
        
      
        
          Coordinatore
          
          
        
      
        
          Settore disciplinare
          
          
        
      
        
          Settore concorsuale
          
          
        
      
        
          Parole chiave
          Heterogeneous Architectures, GPU, FPGA
          
        
      
        
          URN:NBN
          
          
        
      
        
          DOI
          10.6092/unibo/amsdottorato/6469
          
        
      
        
          Data di discussione
          28 Aprile 2014
          
        
      
      URI
      
      
     
   
  
  
  
  
  
    
    Statistica sui download
    
    
  
  
    
      Gestione del documento: