|
|
Database: sacCer3 Primary Table: xenoRefGene Row Count: 31,843   Data last updated: 2020-08-20
Format description: A gene prediction with some additional info. On download server: MariaDB table dump directory
field | example | SQL type | info | description |
bin | 586 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | NR_052819 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
chrom | chrVII | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | - | char(1) | values | + or - for strand |
txStart | 158898 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 158985 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 158985 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 158985 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 1 | int(10) unsigned | range | Number of exons |
exonStarts | 158898, | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 158985, | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | score |
name2 | F54F12.3 | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | unk | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | unk | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | -1, | longblob | | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
|
| |
|
|
Connected Tables and Joining Fields
|
|
hgFixed.gbCdnaInfo.acc (via xenoRefGene.name)
hgFixed.gbMiscDiff.acc (via xenoRefGene.name)
hgFixed.gbWarn.acc (via xenoRefGene.name)
sacCer3.all_est.qName (via xenoRefGene.name)
sacCer3.all_mrna.qName (via xenoRefGene.name)
sacCer3.xenoRefSeqAli.qName (via xenoRefGene.name)
| |
|
|
Sample Rows
|
|
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|
586 | NR_052819 | chrVII | - | 158898 | 158985 | 158985 | 158985 | 1 | 158898, | 158985, | 0 | F54F12.3 | unk | unk | -1, |
73 | NM_001086217 | chrXIV | - | 506353 | 619553 | 506353 | 619553 | 2 | 506353,619361, | 506569,619553, | 0 | dnaja2.L | incmpl | incmpl | 0,0, |
589 | NM_001086214 | chrXV | + | 545103 | 546306 | 545103 | 546306 | 2 | 545103,545334, | 545280,546306, | 0 | psmc3.S | incmpl | incmpl | 0,0, |
587 | NM_001086211 | chrII | - | 290812 | 291631 | 290812 | 291631 | 4 | 290812,291028,291340,291523, | 290998,291202,291493,291631, | 0 | ola1.L | incmpl | incmpl | 0,0,0,0, |
591 | NM_001155440 | chrXVI | + | 911566 | 911791 | 911566 | 911791 | 1 | 911566, | 911791, | 0 | LOC100282532 | incmpl | incmpl | 0, |
587 | NM_001155439 | chrVII | - | 281711 | 283595 | 281711 | 283595 | 4 | 281711,282365,282968,283544, | 281930,282863,283511,283595, | 0 | LOC100282531 | incmpl | incmpl | 0,0,0,0, |
585 | NM_001185145 | chrVI | - | 53259 | 54378 | 53259 | 54378 | 1 | 53259, | 54378, | 0 | Arp1 | incmpl | cmpl | 0, |
585 | NM_001185146 | chrVI | - | 53259 | 54378 | 53259 | 54378 | 1 | 53259, | 54378, | 0 | Arp1 | incmpl | cmpl | 0, |
590 | NM_001324843 | chrXIII | + | 702299 | 703040 | 702299 | 703040 | 5 | 702299,702422,702638,702761,702842, | 702344,702575,702725,702824,703040, | 0 | LOC107764380 | incmpl | incmpl | 0,0,0,0,0, |
591 | NM_001183691 | chrXV | + | 832812 | 834195 | 832812 | 834195 | 1 | 832812, | 834195, | 0 | YTM1 | cmpl | cmpl | 0, |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
|