thesis_annexes

Software developed and main results obtained in the Master dissertation in Bioinformatics:

A model validation pipeline for healthy tissue genome-scale metabolic models.

Abstract:

In the past few years, high-throughput experimental methods have made omics data available for several layers of biological organization, enabling the integration of knowledge from individual components into complex modelssuch as genome-scale metabolic models (GSMMs). These can be analysed by constraintbased modelling (CBM) methods, which facilitate in silico predictive approaches.

Human metabolic models have been used to study healthy human tissues and their associated metabolic diseases, such as obesity, diabetes, and cancer. Generic human models can be integrated with contextual data through reconstruction algorithms to produce context-specific models (CSMs), which are typically better at capturing the variation between different tissues and cell types. As the human body contains a multitude of tissues and cell types, CSMs are frequently adopted as a means to obtain accurate metabolic models of healthy human tissues.

However, unlike microorganisms’ or cancer models, which allow several methods of validation such as the comparison of in silico fluxes or gene essentiality predictions to experimental data, the validation methods easily applicable to CSMs of healthy human tissue are more limited. Consequently, despite continued efforts to update generic human models and reconstruction algorithms to extract high quality CSMs, their validation remains a concern.

This work presents a pipeline for the extraction and basic validation of CSMs of normal human tissues derived from the integration of transcriptomics data with a generic human model. All CSMs were extracted from the Human1 generic model recently published by Robinson et al. (2020), relied on the open-source Troppo Python package and in the fastCORE and tINIT reconstruction algorithms implemented therein. CSMs were extracted for 11 healthy tissues available in the GTEx v8 dataset.

Prior to extraction, machine learning methods were applied to threshold selection for gene scores conversion. The highest quality models were obtained with a global threshold applied to the omics data directly. The CSM validation strategy focused on the total number of metabolic tasks passed as a performance indicator. Lastly, this work is accompanied by Jupyter Notebooks, which include a beginner friendly model extraction guide.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
CODE		CODE
DATA		DATA
RESULTS		RESULTS
LICENSE		LICENSE
README.md		README.md
extraction_example_guide.ipynb		extraction_example_guide.ipynb
omics_to_genescores.ipynb		omics_to_genescores.ipynb
results_graphs_clean.ipynb		results_graphs_clean.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CODE

CODE

DATA

DATA

RESULTS

RESULTS

LICENSE

LICENSE

README.md

README.md

extraction_example_guide.ipynb

extraction_example_guide.ipynb

omics_to_genescores.ipynb

omics_to_genescores.ipynb

results_graphs_clean.ipynb

results_graphs_clean.ipynb

Repository files navigation

thesis_annexes

Software developed and main results obtained in the Master dissertation in Bioinformatics:

Abstract:

About

Releases

Packages

Languages

License

MariaPessoa/thesis_annexes

Folders and files

Latest commit

History

Repository files navigation

thesis_annexes

Software developed and main results obtained in the Master dissertation in Bioinformatics:

Abstract:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages