Schema for wgEncodeGencodeCompV47lift37
|
|
Database: hg19 Primary Table: wgEncodeGencodeCompV47lift37 Row Count: 366,510   Data last updated: 2024-10-17
Format description: A gene prediction with some additional info. On download server: MariaDB table dump directory
field | example | SQL type | info | description |
bin | 585 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | ENST00000833856.1_1 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
chrom | chr1 | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | + | char(1) | values | + or - for strand |
txStart | 10369 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 10582 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 10369 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 10369 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 2 | int(10) unsigned | range | Number of exons |
exonStarts | 10369,10465, | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 10377,10582, | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | score |
name2 | DDX11L2 | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | -1,-1, | longblob | | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
|
| |
|
|
Connected Tables and Joining Fields
|
|
Sample Rows
|
|
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|
585 | ENST00000833856.1_1 | chr1 | + | 10369 | 10582 | 10369 | 10369 | 2 | 10369,10465, | 10377,10582, | 0 | DDX11L2 | none | none | -1,-1, |
585 | ENST00000832824.1_1 | chr1 | + | 11120 | 14413 | 11120 | 11120 | 4 | 11120,12009,12612,13452, | 11211,12227,12721,14413, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832825.1_1 | chr1 | + | 11124 | 14405 | 11124 | 11124 | 4 | 11124,12009,12612,13402, | 11211,12227,12721,14405, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832826.1_1 | chr1 | + | 11409 | 14413 | 11409 | 11409 | 4 | 11409,12009,12594,13482, | 11671,12227,12721,14413, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832827.1_1 | chr1 | + | 11410 | 14413 | 11410 | 11410 | 4 | 11410,12009,12612,13482, | 11671,12227,12721,14413, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832828.1_1 | chr1 | + | 11425 | 14409 | 11425 | 11425 | 4 | 11425,12009,12612,13220, | 11671,12227,12721,14409, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832829.1_1 | chr1 | + | 11769 | 14416 | 11769 | 11769 | 4 | 11769,12009,12612,13482, | 11844,12227,12721,14416, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832830.1_1 | chr1 | + | 11818 | 14413 | 11818 | 11818 | 4 | 11818,12009,12612,13482, | 11844,12227,12697,14413, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832837.1_1 | chr1 | + | 11822 | 14406 | 11822 | 11822 | 4 | 11822,12009,12594,13220, | 11844,12227,12721,14406, | 0 | DDX11L16 | none | none | -1,-1,-1,-1, |
585 | ENST00000832836.1_1 | chr1 | + | 11823 | 14409 | 11823 | 11823 | 3 | 11823,12009,13482, | 11844,12697,14409, | 0 | DDX11L16 | none | none | -1,-1,-1, |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
|