Schema for wgEncodeGencodeCompV41lift37
Database: hg19
Primary Table: wgEncodeGencodeCompV41lift37
Row Count: 232,631
Data last updated: 2022-07-12
Format description: A gene prediction with some additional info.
On download server: MariaDB table dump directory
field | example | SQL type | info | description |
---|---|---|---|---|
bin | 585 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | ENST00000473358.1_4 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
chrom | chr1 | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | + | char(1) | values | + or - for strand |
txStart | 29553 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 31097 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 29553 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 29553 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 3 | int(10) unsigned | range | Number of exons |
exonStarts | 29553,30563,30975, | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 30039,30667,31097, | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | score |
name2 | MIR1302-2HG | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | -1,-1,-1, | longblob | | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
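
The bin field can be reproduced from txStart and txEnd. The sketch below assumes this table uses the standard UCSC binning scheme (five levels, with the finest bins covering 128 kb), which this page does not state explicitly. The names `bin_from_range` and `overlapping_bins` are illustrative and not part of any UCSC API.

```python
# Assumed: the standard (non-extended) UCSC binning scheme.
# Level offsets run from the finest bins (128 kb) to the single top-level bin.
BIN_OFFSETS = (512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0)
BIN_FIRST_SHIFT = 17  # 2**17 = 128 kb, size of the finest bins
BIN_NEXT_SHIFT = 3    # each coarser level is 8 times larger


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains the half-open range [start, end)."""
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is outside the standard binning scheme")


def overlapping_bins(start: int, end: int) -> list[int]:
    """Return every bin, at every level, that could hold an item overlapping [start, end)."""
    bins = []
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        bins.extend(range(offset + start_bin, offset + end_bin + 1))
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    return bins


print(bin_from_range(29553, 31097))
print(overlapping_bins(29553, 31097))
```

Under this assumption, `bin_from_range(29553, 31097)` gives 585, which agrees with the example value in the table above. A range query would typically restrict `bin` to the values from `overlapping_bins` in addition to comparing txStart and txEnd against the query interval.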
Connected Tables and Joining Fields
Sample Rows
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
585 | ENST00000473358.1_4 | chr1 | + | 29553 | 31097 | 29553 | 29553 | 3 | 29553,30563,30975, | 30039,30667,31097, | 0 | MIR1302-2HG | none | none | -1,-1,-1, |
585 | ENST00000469289.1_1 | chr1 | + | 30266 | 31109 | 30266 | 30266 | 2 | 30266,30975, | 30667,31109, | 0 | MIR1302-2HG | none | none | -1,-1, |
585 | ENST00000417324.1_4 | chr1 | - | 34553 | 36081 | 34553 | 34553 | 3 | 34553,35276,35720, | 35174,35481,36081, | 0 | FAM138A | none | none | -1,-1,-1, |
585 | ENST00000461467.1_3 | chr1 | - | 35244 | 36073 | 35244 | 35244 | 2 | 35244,35720, | 35481,36073, | 0 | FAM138A | none | none | -1,-1, |
585 | ENST00000641515.2_4 | chr1 | + | 65418 | 71585 | 65564 | 70008 | 3 | 65418,65519,69036, | 65433,65573,71585, | 0 | OR4F5 | cmpl | cmpl | -1,0,0, |
585 | ENST00000466430.5_3 | chr1 | - | 89294 | 120932 | 89294 | 89294 | 4 | 89294,92090,112699,120774, | 91629,92240,112804,120932, | 0 | ENSG00000238009 | none | none | -1,-1,-1,-1, |
585 | ENST00000495576.1_4 | chr1 | - | 89550 | 91105 | 89550 | 89550 | 2 | 89550,90286, | 90050,91105, | 0 | ENSG00000239945 | none | none | -1,-1, |
585 | ENST00000477740.5_4 | chr1 | - | 92229 | 129217 | 92229 | 92229 | 4 | 92229,112699,120720,129054, | 92240,112804,120932,129217, | 0 | ENSG00000238009 | none | none | -1,-1,-1,-1, |
585 | ENST00000471248.1_7 | chr1 | - | 110952 | 129173 | 110952 | 110952 | 3 | 110952,112699,129054, | 111357,112804,129173, | 0 | ENSG00000238009 | none | none | -1,-1,-1, |
73 | ENST00000610542.1_3 | chr1 | - | 120724 | 133723 | 120724 | 120724 | 3 | 120724,129054,133373, | 120932,129223,133723, | 0 | ENSG00000238009 | none | none | -1,-1,-1, |
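
A row such as the ones above can be turned into a small record for downstream work. This is a minimal sketch that parses the pipe-delimited form shown on this page; the `Transcript` class and helper names are hypothetical. The table dump on the download server is tab-separated, so the splitting step would need adjusting for that file.

```python
from dataclasses import dataclass

# Column order as listed in the schema.
FIELDS = (
    "bin", "name", "chrom", "strand", "txStart", "txEnd", "cdsStart", "cdsEnd",
    "exonCount", "exonStarts", "exonEnds", "score", "name2",
    "cdsStartStat", "cdsEndStat", "exonFrames",
)


@dataclass
class Transcript:
    name: str
    chrom: str
    strand: str
    cds_start: int
    cds_end: int
    exon_starts: list[int]
    exon_ends: list[int]
    name2: str

    def exon_sizes(self) -> list[int]:
        """Exon lengths; starts are 0-based and ends are treated as exclusive."""
        return [e - s for s, e in zip(self.exon_starts, self.exon_ends)]

    def cds_length(self) -> int:
        """Total number of exonic bases between cdsStart and cdsEnd."""
        return sum(
            max(0, min(e, self.cds_end) - max(s, self.cds_start))
            for s, e in zip(self.exon_starts, self.exon_ends)
        )


def parse_coord_list(text: str) -> list[int]:
    """Parse a comma-separated list with a trailing comma, e.g. '29553,30563,'."""
    return [int(value) for value in text.split(",") if value]


def parse_row(line: str) -> Transcript:
    """Parse one pipe-delimited sample row into a Transcript."""
    cols = [c.strip() for c in line.strip().strip("|").split("|")]
    if len(cols) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(cols)}")
    row = dict(zip(FIELDS, cols))
    starts = parse_coord_list(row["exonStarts"])
    ends = parse_coord_list(row["exonEnds"])
    if not len(starts) == len(ends) == int(row["exonCount"]):
        raise ValueError("exonCount does not match exonStarts/exonEnds")
    return Transcript(
        name=row["name"], chrom=row["chrom"], strand=row["strand"],
        cds_start=int(row["cdsStart"]), cds_end=int(row["cdsEnd"]),
        exon_starts=starts, exon_ends=ends, name2=row["name2"],
    )


sample = (
    "585 | ENST00000641515.2_4 | chr1 | + | 65418 | 71585 | 65564 | 70008 | 3 | "
    "65418,65519,69036, | 65433,65573,71585, | 0 | OR4F5 | cmpl | cmpl | -1,0,0, |"
)
tx = parse_row(sample)
print(tx.name2, tx.exon_sizes(), tx.cds_length())
```

For the OR4F5 row the coding length comes out as a multiple of three, as expected for a transcript whose cdsStartStat and cdsEndStat are both `cmpl`. Rows where cdsStart equals cdsEnd yield a coding length of zero.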
Note: all start coordinates in our database are 0-based, not 1-based.
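
To illustrate the note, the sketch below converts between the 0-based starts stored in the table and a 1-based position string of the kind usually shown to users. It assumes end coordinates are exclusive (half-open intervals), which is the usual UCSC convention but is not stated on this page; the function names are hypothetical.

```python
def to_position(chrom: str, start: int, end: int) -> str:
    """Format a 0-based, half-open interval as a 1-based, inclusive position string."""
    if not 0 <= start < end:
        raise ValueError(f"invalid interval: {start}-{end}")
    # Only the start shifts by one; the exclusive end equals the inclusive 1-based end.
    return f"{chrom}:{start + 1}-{end}"


def from_position(position: str) -> tuple[str, int, int]:
    """Parse 'chrom:first-last' (1-based, inclusive) into (chrom, start, end), 0-based half-open."""
    chrom, span = position.rsplit(":", 1)
    first, last = span.replace(",", "").split("-")
    return chrom, int(first) - 1, int(last)


print(to_position("chr1", 29553, 31097))
```

Under this convention the length of an item is simply `end - start`, with no off-by-one correction.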