|
|
Database: sacCer2 Primary Table: xenoRefGene Row Count: 31,870   Data last updated: 2020-09-01
Format description: A gene prediction with some additional info. On download server: MariaDB table dump directory
field | example | SQL type | info | description |
bin | 585 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | NM_001180043 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
chrom | chrI | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | - | char(1) | values | + or - for strand |
txStart | 1806 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 2169 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 1806 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 2169 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 1 | int(10) unsigned | range | Number of exons |
exonStarts | 1806, | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 2169, | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | score |
name2 | PAU8 | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | cmpl | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | cmpl | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | 0, | longblob | | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
|
| |
|
|
Connected Tables and Joining Fields
|
|
hgFixed.gbCdnaInfo.acc (via xenoRefGene.name)
hgFixed.gbMiscDiff.acc (via xenoRefGene.name)
hgFixed.gbWarn.acc (via xenoRefGene.name)
sacCer2.all_est.qName (via xenoRefGene.name)
sacCer2.all_mrna.qName (via xenoRefGene.name)
sacCer2.xenoRefSeqAli.qName (via xenoRefGene.name)
| |
|
|
Sample Rows
|
|
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|
585 | NM_001180043 | chrI | - | 1806 | 2169 | 1806 | 2169 | 1 | 1806, | 2169, | 0 | PAU8 | cmpl | cmpl | 0, |
585 | NM_001184582 | chrI | + | 2479 | 2707 | 2479 | 2707 | 1 | 2479, | 2707, | 0 | YAL067W-A | cmpl | cmpl | 0, |
585 | NM_001178208 | chrI | - | 7235 | 9017 | 7235 | 9017 | 1 | 7235, | 9017, | 0 | SEO1 | cmpl | cmpl | 0, |
585 | NM_210746 | chrI | - | 7340 | 9008 | 7340 | 9008 | 3 | 7340,8702,8942, | 8666,8765,9008, | 0 | AGOS_AFL158C | incmpl | incmpl | 0,0,0, |
585 | NM_001179897 | chrI | - | 11565 | 11952 | 11565 | 11952 | 1 | 11565, | 11952, | 0 | YAL065C | cmpl | cmpl | 0, |
585 | NM_001180042 | chrI | + | 12046 | 12427 | 12046 | 12427 | 1 | 12046, | 12427, | 0 | YAL064W-B | cmpl | cmpl | 0, |
585 | NM_001180041 | chrI | - | 13363 | 13744 | 13363 | 13744 | 1 | 13363, | 13744, | 0 | TDA8 | cmpl | cmpl | 0, |
585 | NM_001178206 | chrI | + | 21567 | 21852 | 21567 | 21852 | 1 | 21567, | 21852, | 0 | YAL064W | cmpl | cmpl | 0, |
585 | NM_001184642 | chrI | - | 22396 | 22687 | 22396 | 22687 | 1 | 22396, | 22687, | 0 | YAL063C-A | cmpl | cmpl | 0, |
585 | NM_001178205 | chrI | - | 24000 | 27969 | 24000 | 27969 | 1 | 24000, | 27969, | 0 | FLO9 | cmpl | cmpl | 0, |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
|