Bersanelli, Matteo
(2017)
Mathematical Physics Techniques for Omics Data Integration, [Dissertation thesis], Alma Mater Studiorum Università di Bologna.
Dottorato di ricerca in
Fisica, 29 Ciclo. DOI 10.6092/unibo/amsdottorato/7812.
Documenti full-text disponibili:
Anteprima |
|
Documento PDF (English)
- Richiede un lettore di PDF come Xpdf o Adobe Acrobat Reader
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato.
Download (8MB)
| Anteprima
|
Abstract
Nowadays different types of high-throughput technologies allow us to collect information on the molecular components of biological systems. Each of such technologies is designed to simultaneously collect large sets of molecular data of a specific omic-kind. In order to draw a more comprehensive view of biological processes, experimental data made on different layers have to be integrated and analyzed. The complexity of biological systems, the technological limits, the large number of biological variables and the relatively low number of biological samples make integrative analyses a challenge. Hence, the development of methods for omics integration is one of the most relevant problems computational scientists are addressing nowadays. The most representative and promising techniques for the analysis of omics data are presented and broadly divided into categories. In the literature we notice a growing interest around approaches that use graphs for modeling the relationships among omic variables. In particular we found that algorithms propagating molecular information on networks are being proposed in several applications and are often related to actual physical models. We considered the chemical master equation (CME) framework to model the exchange of information in biological networks as a stochastic process on the network. In this context we defined new algorithms and pipelines for the analysis of omics. In particular we propose two network-based methods with applications to both synthetic and prostate ardenocarcinoma data. In both the applications the molecular alterations are mapped on the protein-protein interaction network. In the first application we defined a novel methodology for extracting modules of connected genes that present the most significant differential molecular information between two classes of samples. In the second application we measure to which degree a distribution of deleterious molecular information on a given network deviates the normal trajectories of information flow using a perturbative approach to the CME.
Abstract
Nowadays different types of high-throughput technologies allow us to collect information on the molecular components of biological systems. Each of such technologies is designed to simultaneously collect large sets of molecular data of a specific omic-kind. In order to draw a more comprehensive view of biological processes, experimental data made on different layers have to be integrated and analyzed. The complexity of biological systems, the technological limits, the large number of biological variables and the relatively low number of biological samples make integrative analyses a challenge. Hence, the development of methods for omics integration is one of the most relevant problems computational scientists are addressing nowadays. The most representative and promising techniques for the analysis of omics data are presented and broadly divided into categories. In the literature we notice a growing interest around approaches that use graphs for modeling the relationships among omic variables. In particular we found that algorithms propagating molecular information on networks are being proposed in several applications and are often related to actual physical models. We considered the chemical master equation (CME) framework to model the exchange of information in biological networks as a stochastic process on the network. In this context we defined new algorithms and pipelines for the analysis of omics. In particular we propose two network-based methods with applications to both synthetic and prostate ardenocarcinoma data. In both the applications the molecular alterations are mapped on the protein-protein interaction network. In the first application we defined a novel methodology for extracting modules of connected genes that present the most significant differential molecular information between two classes of samples. In the second application we measure to which degree a distribution of deleterious molecular information on a given network deviates the normal trajectories of information flow using a perturbative approach to the CME.
Tipologia del documento
Tesi di dottorato
Autore
Bersanelli, Matteo
Supervisore
Co-supervisore
Dottorato di ricerca
Ciclo
29
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
omics, multi-omics, data integration, master equation, random walk, biological network, Laplacian dynamics, network diffusion, network smoothing index, network resampling, prostate cancer, eigenvalue perturbation, molecular alterations, network stability, control theory
URN:NBN
DOI
10.6092/unibo/amsdottorato/7812
Data di discussione
22 Marzo 2017
URI
Altri metadati
Tipologia del documento
Tesi di dottorato
Autore
Bersanelli, Matteo
Supervisore
Co-supervisore
Dottorato di ricerca
Ciclo
29
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
omics, multi-omics, data integration, master equation, random walk, biological network, Laplacian dynamics, network diffusion, network smoothing index, network resampling, prostate cancer, eigenvalue perturbation, molecular alterations, network stability, control theory
URN:NBN
DOI
10.6092/unibo/amsdottorato/7812
Data di discussione
22 Marzo 2017
URI
Statistica sui download
Gestione del documento: