mrpipeline

Workflow overview

Complete pipeline for generating analysis for manuscript and poster:

Generation of triangular graph
Generation of power graph
Search for proxy SNPs in the exposure dataset
Synchronization of exposure and outcome datasets
Generation of tables (causal estimates using multiple MR methods, measures of heterogeneity and power analysis)
Generation of heat maps
Generation of sensitivity analysis graphs
Idenification of overlapping loci across multiple exposures
Plotting of Venn graphs
Causal estimates after removal of overlapping loci
Import of expression data
Calculation of tissue specific causal estimates
Import of Phenoscanner data
Causal estimates after removal of SNPs associated with potential confounders

Installation

# Install mrpipeline:
# From the directory containing .Rpoj file of the package
devtools::intstall()

# From the external repository
install.packages("mrpipeline")

# Or the the development version from GitHub:
# install.packages("devtools")
devtools::install_github("Sandyyy123/mrpipeline")

Description of the input files

Main data file

(xlsx file)

Trait="Exposure of interest"
Leading SNP dbSNP id="Genetic instrument"
Chromosome position (GRCh37.p13)="Chromosomal position"
Effect Allele="Effect Allele in exposure dataset"
Other Allele="Other Allele in exposure dataset"
Effect Allele Frequency (EAF)="Effect Allele Frequency in exposure dataset"
Gene/ nearby gene="Gene"
Effect of lead variant on exposure levels (β)="Effect of lead variant on exposure levels"
Standard Error of Effect on exposure (SE)="Standard Error of Effect on exposure"
P-value="Association P-value of genetic instrument with exposure dataset"
Outcome="Outcome of interest"
Use of Proxy SNP="Yes/No"
Proxy SNP id="Proxy SNP id if Yes in the previosu step"
Distance from lead SNP (bp)="Distance of proxy SNP from lead SNP"
LD (r2) with lead SNP="LD of proxy SNP with lead SNP in the exposure dataset"
Effect Allele="Effect Allele in outcome dataset"
Other Allele="Other Allele in outcome dataset"
Effect of lead variant on outcome levels (β)="Effect of lead variant on outcome levels"
Standard Error of Effect on outcome (SE)="Standard Error of Effect on outcome"
P-value="Association P-value of genetic instrument with outcome dataset"

Meta file with additional details

(yaml config file)

rnd_seed: 123                           # RNG seed
input_path: "/path/to/input/file.xlsx"  # Path to input file (see above)
metadata:
  n_out: 66164                          # Number of individuals in the outcome datase
  n_cas: 4127                           # Number of cases in the outcome dataset (categorical outcome variable)
  n_exp:
    Trait01: 518633                     # Number of individuals in the exposure dataset on trait01
    Trait02: 370711                     # Number of individuals in the exposure dataset on trait02
    Trait03: 318633
    ...
    TraitNN: 420711                     # Number of individuals in the exposure dataset on traitNN

Usage

run_mr generates the tables and figures required for the MR pipeline. The output could be an html or doc file.

# load the library
library(mrpipeline)

# Set working directory
mr_dir <- "/path/to/working/directory"
setwd(mr_dir)

# Our working directory also contains input files (data file and config file)
config_file_name <- "config.yaml"
config <- file.path(getwd(), config_file_name)


# Run the MR results with output in html file
report_mr(config = config, output_format = "html_document", output_file = "mr_output.html", output_dir= mr_dir)

# Alterantively,Run the MR results with output in doc file
# Several other output options can be used as stated here: "https://rmarkdown.rstudio.com/lesson-9.html"
report_mr(config = config, output_format = "word_document", output_file = "mr_output.doc", output_dir= mr_dir)

# Check the results directly for html ouput file from console
browseURL(paste('file://', getwd(),'mr_output.html', sep='/'))

To guide your reading, here's a translation between the terminology used in different places:

function	role
mr_raggr	proxy snp identification
import_mr_input	wrangling
compute_result_for_exposure	post-hoc power analysis
tangram_results	generate table for manuscript
experiment_heatmap	generate heatamps
plot_scatter	generate plots for sensitivit analysis
venn_diagram	generate venn diagram for overlap among variants
mr_gtex	generate plots for overlap among variants
mr_pheno	generate plots for overlap among variants
mr_uni	generate causal estimates for unique loci
mr_tissue	generate tissue specific causal estimates
mr_conf	generate causal estimates without SNPs associated with confounders
mr_forest	generate forest plots for tissue specific SNPs or SNPs not associated with confounders

Cheatsheet

Getting help

Extended documentation or Further reading

Understanding MR metholodogy and workflow

Critical appraisal of an MR manuscript

Writing an MR manuscript

Creating a poster: Example 01

Creating a poster: Example 02

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
R		R
figures		figures
inst/templates		inst/templates
man		man
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
mrpipeline.Rproj		mrpipeline.Rproj

License

Sandyyy123/mrpipeline

Folders and files

Latest commit

History

Repository files navigation

mrpipeline

Table of contents

Workflow overview

Installation

Description of the input files

Main data file

Meta file with additional details

Usage

Cheatsheet

Getting help

Extended documentation or Further reading

About

Topics

Resources

License

Stars

Watchers

Forks

Languages