Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fasta2bed #167

Open
yoann-dufresne opened this issue Jul 17, 2018 · 2 comments
Open

fasta2bed #167

yoann-dufresne opened this issue Jul 17, 2018 · 2 comments

Comments

@yoann-dufresne
Copy link
Collaborator

yoann-dufresne commented Jul 17, 2018

This converter should transform a fasta file representing a scaffold (ordered contigs) into a genome annotation following this specifications:

Fasta:

>contig_1 my first contig
ACT
>contig_2 my second contig
GTGT

Bed:

contig_1 0 2
contig_2 3 6

@cokelaer
Copy link
Contributor

cokelaer commented Aug 1, 2018

at some point we should define precisely what BED means. We have some BED with 3,6,12 columns.
Here, the BED file has 3 columns. What are the second and third column meaning here. should be in the class documentation. thanks

@blaiseli
Copy link
Collaborator

blaiseli commented Aug 2, 2018

BED has optional columns, but the first 3 should always be present and are well defined.

The first three required BED fields are:

  1. chrom - The name of the chromosome (e.g. chr3, chrY, chr2_random) or scaffold (e.g. scaffold10671).
  2. chromStart - The starting position of the feature in the chromosome or scaffold. The first base in a chromosome is numbered 0.
  3. chromEnd - The ending position of the feature in the chromosome or scaffold. The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.

(see https://genome.ucsc.edu/FAQ/FAQformat.html#format1)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants