Skip to content

adrianodemarino/build-detection-VCF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Build Detection VCF/BCF

This Python script can identify the genomic build or assembly of a VCF/BCF file by utilizing the coordinates of commonly occurring variants.

Dependencies

import os 
from cyvcf2 import VCF
import pandas as pd
import argparse

Usage

tabix <file_name.vcf.gz>
python3 build_detection_VCF.py <file_name.vcf.gz>

Output

The script returns one of the following cases:

Message Genome Build
hg19/GRCh37 build 37
hg38/GRCh38 build 38
No target variants found to detect the build no detection
Error in build detection Error

About

Detect a build from a VCF/BCF in python

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages