GEA_Community_Detection

Summary

This repository performs gene enrichment analysis using either the KEGG, or PID databases. The experiment is set up to contain both a control and experimental arm where the control arm is enrichment of a gene list of m pathways using only p% of the genes in each pathway with a% additional random genes from the ontology. This gene list is then subjected to enrichment analysis and the relevant enriched pathways are determined. The experimental condition is just like the control except that community detection is performed before enrichment analysis. In particular, one can select Fastgreedy, Walktrap, Infomap, or Multilevel as the possible grouping method. For all methods, the F1-score, false positive ratio, and false negative ratio are returned.

All figures from the simulations are included in the Paper_Figs folder and results from the simulations are included in the Data folder as all_iterations_data.csv.

Reproducibility

To reproduce all analyses including simulations and HGSC applications:

# Create and activate reproducible conda environment
conda env create --force --file environment.yml
source activate gea_community_detection

# Data for this project can be downloaded using the script and URL text file
# located in the Data folder. This is required before running the pipeline.
bash Data/data_files.sh

# Reproduce all results
bash Scripts/gea_pipeline.sh

Contact

About the code: Lia Harrington (lia.x.harrington.gr@dartmouth.edu)
About the project or collaboration: Jennifer Doherty (jennifer.a.doherty@dartmouth.edu) or Casey Greene at (csgreene@mail.med.upenn.edu).

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
Data		Data
Paper_Figs		Paper_Figs
Scripts		Scripts
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data

Data

Paper_Figs

Paper_Figs

Scripts

Scripts

.gitignore

.gitignore

LICENSE.txt

LICENSE.txt

README.md

README.md

environment.yml

environment.yml

Repository files navigation

GEA_Community_Detection

Summary

Reproducibility

Contact

About

Releases 1

Packages

Contributors 3

Languages

License

greenelab/GEA_Community_Detection

Folders and files

Latest commit

History

Repository files navigation

GEA_Community_Detection

Summary

Reproducibility

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages