
Gene Annotations

Peter Kerpedjiev edited this page Mar 22, 2017 · 5 revisions

Each gene can have multiple isoforms (different combinations of exons and introns). These isoforms can overlap:

chr4    115519557       115599381       UGT8    25      +       NM_001128174    7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115519557,115544034,115585150,115586835,115589240,115597080,    115520130,115544858,115585293,115586912,115589460,115599381,
chr4    115519557       115599381       UGT8    25      +       NM_001322112    7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115519557,115540578,115544034,115585150,115586835,115589240,115597080,  115520130,115540681,115544858,115585293,115586912,115589460,115599381,
chr4    115519557       115599381       UGT8    25      +       NM_001322113    7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115519557,115544034,115585150,115586835,115589240,115597080,    115520213,115544858,115585293,115586912,115589460,115599381,
chr4    115520440       115599381       UGT8    25      +       NM_001322114    7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115520440,115544034,115585150,115586835,115589240,115597080,    115520942,115544858,115585293,115586912,115589460,115599381,
chr4    115543522       115599381       UGT8    25      +       NM_003360       7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115543522,115585150,115586835,115589240,115597080,      115544858,115585293,115586912,115589460,115599381,

or they can be located in distant regions (even on different chromosomes):

chr1    367658  368597  OR4F16  2       +       NM_001005277    81399   protein-coding  olfactory receptor family 4 subfamily F member 16       367658  368597  367658, 368597,
chr1    621095  622034  OR4F16  2       -       NM_001005277    81399   protein-coding  olfactory receptor family 4 subfamily F member 16       621095  622034  621095, 622034,
chr5    180794287       180795226       OR4F16  2       +       NM_001005277    81399   protein-coding  olfactory receptor family 4 subfamily F member 16       180794287       180795226       180794287,      180795226,

We want to display an overview of all known exons, but we don't want a single gene to extend across chromosomes. To resolve this, we show each set of overlapping isoforms as a single entity containing all of their exons. Annotations of the same gene that are far away from each other and don't overlap are displayed separately:

7368    115519557       115599381       UGT8    25      +       union_7368      7368    protein-coding  UDP glycosyltransferase 8       115544036       115597444       115519557,115519557,115520440,115540578,115543522,115544034,115585150,115586835,115589240,115597080  115520130,115520213,115520942,115540681,115544858,115544858,115585293,115586912,115589460,115599381
81399   367658  368597  OR4F16  2       +       union_81399     81399   protein-coding  olfactory receptor family 4 subfamily F member 16       367658  368597  367658  368597
81399   621095  622034  OR4F16  2       -       union_81399     81399   protein-coding  olfactory receptor family 4 subfamily F member 16       621095  622034  621095  622034
81399   180794287       180795226       OR4F16  2       +       union_81399     81399   protein-coding  olfactory receptor family 4 subfamily F member 16       180794287       180795226       180794287       180795226
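
The merging step described above can be sketched in Python as follows. This is a minimal illustration, not the script that produced the rows above. The function names (`parse_exons`, `merge_isoforms`) and the dictionary keys are hypothetical. Grouping by gene ID and chromosome, and treating two isoforms as overlapping when one starts before the other ends, are assumptions.

```python
from collections import defaultdict


def parse_exons(field):
    """Turn a comma-separated coordinate list such as '367658,' into ints."""
    return [int(x) for x in field.split(",") if x.strip()]


def merge_isoforms(rows):
    """Merge overlapping isoforms of the same gene into union entries.

    Each row is a dict with the (hypothetical) keys:
    chrom, start, end, gene_id, exon_starts, exon_ends.
    """
    # Assumption: isoforms are only merged within one gene and one chromosome.
    groups = defaultdict(list)
    for row in rows:
        groups[(row["gene_id"], row["chrom"])].append(row)

    merged = []
    for (gene_id, chrom), members in groups.items():
        current = None
        for row in sorted(members, key=lambda r: r["start"]):
            exons = set(zip(row["exon_starts"], row["exon_ends"]))
            # Assumption: an isoform overlaps the running union
            # if it starts before the union ends.
            if current is not None and row["start"] < current["end"]:
                current["end"] = max(current["end"], row["end"])
                current["exons"] |= exons
            else:
                current = {
                    "gene_id": gene_id,
                    "chrom": chrom,
                    "start": row["start"],
                    "end": row["end"],
                    "exons": exons,
                }
                merged.append(current)

    # Report each distinct (start, end) exon once, in sorted order.
    for entry in merged:
        entry["exons"] = sorted(entry["exons"])
    return merged
```

Under these assumptions, the five UGT8 isoforms collapse into one entry spanning 115519557 to 115599381, while the three OR4F16 annotations stay separate because none of them overlap.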

Links