Documenti full-text disponibili:
|
Documento PDF (English)
- Richiede un lettore di PDF come Xpdf o Adobe Acrobat Reader
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato.
Download (3MB)
|
Abstract
The focus of this dissertation is the design and the implementation of a computing platform which can accelerate data processing in the embedded computation domain. We focus on a heterogeneous computing platform, whose hardware implementation can approach the power and area efficiency of specialized designs, while remaining flexible across the application domain.
The multi-core architectures require parallel programming, which is widely-regarded as more challenging than sequential programming. Although shared memory parallel programs may be fairly easy to write (using OpenMP, for example), they are quite hard to optimize; providing embedded application developers with optimizing tools and programming frameworks is a challenge. The heterogeneous
specialized elements make the problem even more difficult.
Dataflow is a parallel computation model that relies exclusively on message passing, and that has some advantages over parallel programming tools in wide use today: simplicity, graphical representation, and determinism. Dataflow model is also a good match to streaming applications, such as audio, video and image processing, which operate on large sequences of data and are characterized by abundant parallelism and regular memory access patterns. Dataflow model of computation has gained acceptance in simulation and signal-processing communities. This thesis evaluates the applicability
of the dataflow model for implementing domain-specific embedded accelerators for streaming applications.
Abstract
The focus of this dissertation is the design and the implementation of a computing platform which can accelerate data processing in the embedded computation domain. We focus on a heterogeneous computing platform, whose hardware implementation can approach the power and area efficiency of specialized designs, while remaining flexible across the application domain.
The multi-core architectures require parallel programming, which is widely-regarded as more challenging than sequential programming. Although shared memory parallel programs may be fairly easy to write (using OpenMP, for example), they are quite hard to optimize; providing embedded application developers with optimizing tools and programming frameworks is a challenge. The heterogeneous
specialized elements make the problem even more difficult.
Dataflow is a parallel computation model that relies exclusively on message passing, and that has some advantages over parallel programming tools in wide use today: simplicity, graphical representation, and determinism. Dataflow model is also a good match to streaming applications, such as audio, video and image processing, which operate on large sequences of data and are characterized by abundant parallelism and regular memory access patterns. Dataflow model of computation has gained acceptance in simulation and signal-processing communities. This thesis evaluates the applicability
of the dataflow model for implementing domain-specific embedded accelerators for streaming applications.
Tipologia del documento
Tesi di dottorato
Autore
Stoutchinin, Arthur
Supervisore
Dottorato di ricerca
Ciclo
31
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
dataflow, computer vision, shared memory cluster, optimization
URN:NBN
DOI
10.6092/unibo/amsdottorato/9026
Data di discussione
8 Aprile 2019
URI
Altri metadati
Tipologia del documento
Tesi di dottorato
Autore
Stoutchinin, Arthur
Supervisore
Dottorato di ricerca
Ciclo
31
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
dataflow, computer vision, shared memory cluster, optimization
URN:NBN
DOI
10.6092/unibo/amsdottorato/9026
Data di discussione
8 Aprile 2019
URI
Statistica sui download
Gestione del documento: