Macedoni, Pietro
(2025)
Optimization of data collection and analysis methods to reduce the impact of oil and gas industry upstream activities on the environment: a focus on SCOPE 1 emissions, [Dissertation thesis], Alma Mater Studiorum Università di Bologna.
Dottorato di ricerca in
Scienze statistiche, 37 Ciclo.
Documenti full-text disponibili:
![Macedoni_PhdThesis.pdf [thumbnail of Macedoni_PhdThesis.pdf]](https://amsdottorato.unibo.it/style/images/fileicons/application_pdf.png) |
Documento PDF (English)
- Accesso riservato fino a 27 Aprile 2028
- Richiede un lettore di PDF come Xpdf o Adobe Acrobat Reader
Disponibile con Licenza: Salvo eventuali più ampie autorizzazioni dell'autore, la tesi può essere liberamente consultata e può essere effettuato il salvataggio e la stampa di una copia per fini strettamente personali di studio, di ricerca e di insegnamento, con espresso divieto di qualunque utilizzo direttamente o indirettamente commerciale. Ogni altro diritto sul materiale è riservato.
Download (9MB)
| Contatta l'autore
|
Abstract
As the oil and gas industry faces growing pressure to mitigate its climate impact, this thesis develops a robust, data-driven framework to estimate direct (Scope 1) greenhouse gas (GHG) emissions from upstream treatment plants. The study leverages administrative datasets that require significant preprocessing to extract meaningful insights into key emission sources. Fuel gas consumption, identified as the dominant contributor to emissions, served as the primary modeling focus due to its critical importance and the availability of detailed data. To account for heterogeneity across treatment facilities, unsupervised clustering techniques—particularly a random forest algorithm—were applied, resulting in the identification of four distinct operational groups. Principal Component Analysis (PCA) was used to address issues arising from highly correlated variables, improving interpretability and model accuracy. These clusters informed a split-panel regression approach, enabling the development of models tailored to specific operational behaviors. Model robustness was assessed using diagnostics such as Cook’s distance and hierarchical bootstrapping, ensuring adaptability to newly introduced treatment plants while maintaining internal homogeneity within clusters. Following the successful validation of fuel gas models, the study expanded to include other emission sources, such as flaring and venting. To quantify uncertainty and prioritize key variables, a Global Sensitivity Analysis (GSA) was conducted. This method allowed for the systematic evaluation of input variability under both uncorrelated and correlated assumptions, using a two-step procedure. The approach produced comprehensive emission estimates and associated uncertainty ranges for selected treatment plants. By integrating clustering, regression, and sensitivity analysis, the thesis offers a versatile and scalable methodology for estimating upstream GHG emissions. This framework supports ongoing monitoring and provides a foundation for future regulatory reporting, scenario planning, and emission-reduction strategies across diverse operational settings in the oil and gas sector.
Abstract
As the oil and gas industry faces growing pressure to mitigate its climate impact, this thesis develops a robust, data-driven framework to estimate direct (Scope 1) greenhouse gas (GHG) emissions from upstream treatment plants. The study leverages administrative datasets that require significant preprocessing to extract meaningful insights into key emission sources. Fuel gas consumption, identified as the dominant contributor to emissions, served as the primary modeling focus due to its critical importance and the availability of detailed data. To account for heterogeneity across treatment facilities, unsupervised clustering techniques—particularly a random forest algorithm—were applied, resulting in the identification of four distinct operational groups. Principal Component Analysis (PCA) was used to address issues arising from highly correlated variables, improving interpretability and model accuracy. These clusters informed a split-panel regression approach, enabling the development of models tailored to specific operational behaviors. Model robustness was assessed using diagnostics such as Cook’s distance and hierarchical bootstrapping, ensuring adaptability to newly introduced treatment plants while maintaining internal homogeneity within clusters. Following the successful validation of fuel gas models, the study expanded to include other emission sources, such as flaring and venting. To quantify uncertainty and prioritize key variables, a Global Sensitivity Analysis (GSA) was conducted. This method allowed for the systematic evaluation of input variability under both uncorrelated and correlated assumptions, using a two-step procedure. The approach produced comprehensive emission estimates and associated uncertainty ranges for selected treatment plants. By integrating clustering, regression, and sensitivity analysis, the thesis offers a versatile and scalable methodology for estimating upstream GHG emissions. This framework supports ongoing monitoring and provides a foundation for future regulatory reporting, scenario planning, and emission-reduction strategies across diverse operational settings in the oil and gas sector.
Tipologia del documento
Tesi di dottorato
Autore
Macedoni, Pietro
Supervisore
Dottorato di ricerca
Ciclo
37
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
Random Forest; Global Sensitivity Analysis; Oil and Gas; Environmental Monitoring;Greenhouse Gas Emissions
Data di discussione
24 Giugno 2025
URI
Altri metadati
Tipologia del documento
Tesi di dottorato
Autore
Macedoni, Pietro
Supervisore
Dottorato di ricerca
Ciclo
37
Coordinatore
Settore disciplinare
Settore concorsuale
Parole chiave
Random Forest; Global Sensitivity Analysis; Oil and Gas; Environmental Monitoring;Greenhouse Gas Emissions
Data di discussione
24 Giugno 2025
URI
Gestione del documento: