|
|
Database: sarHar1 Primary Table: xenoRefGene Row Count: 502,601   Data last updated: 2020-08-20
Format description: A gene prediction with some additional info. On download server: MariaDB table dump directory
field | example | SQL type | description |
bin | 586 | smallint(5) unsigned | Indexing field to speed chromosome range queries. |
name | NM_001245365 | varchar(255) | Name of gene (usually transcript_id from GTF) |
chrom | chr1_GL834412_random | varchar(255) | Reference sequence chromosome or scaffold |
strand | + | char(1) | + or - for strand |
txStart | 183608 | int(10) unsigned | Transcription start position (or end position for minus strand item) |
txEnd | 189467 | int(10) unsigned | Transcription end position (or start position for minus strand item) |
cdsStart | 183608 | int(10) unsigned | Coding region start (or end position for minus strand item) |
cdsEnd | 189467 | int(10) unsigned | Coding region end (or start position for minus strand item) |
exonCount | 4 | int(10) unsigned | Number of exons |
exonStarts | 183608,184755,184869,189294, | longblob | Exon start positions (or end positions for minus strand item) |
exonEnds | 183680,184824,184957,189467, | longblob | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | score |
name2 | LOC100190035 | varchar(255) | Alternate name (e.g. gene_id from GTF) |
cdsStartStat | incmpl | enum('none', 'unk', 'incmpl', 'cmpl') | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | incmpl | enum('none', 'unk', 'incmpl', 'cmpl') | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | 0,0,0,1, | longblob | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
|
| |
|
|
Connected Tables and Joining Fields
|
|
hgFixed.gbCdnaInfo.acc (via xenoRefGene.name)
hgFixed.gbMiscDiff.acc (via xenoRefGene.name)
hgFixed.gbSeq.acc (via xenoRefGene.name)
hgFixed.gbWarn.acc (via xenoRefGene.name)
hgFixed.imageClone.acc (via xenoRefGene.name)
sarHar1.all_mrna.qName (via xenoRefGene.name)
sarHar1.refGene.name (via xenoRefGene.name)
sarHar1.refSeqAli.qName (via xenoRefGene.name)
sarHar1.xenoMrna.qName (via xenoRefGene.name)
sarHar1.xenoRefFlat.name (via xenoRefGene.name)
sarHar1.xenoRefSeqAli.qName (via xenoRefGene.name)
| |
|
|
Sample Rows
|
|
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|
586 | NM_001245365 | chr1_GL834412_random | + | 183608 | 189467 | 183608 | 189467 | 4 | 183608,184755,184869,189294, | 183680,184824,184957,189467, | 0 | LOC100190035 | incmpl | incmpl | 0,0,0,1, |
600 | NM_001078307 | chr3_GL849648_random | + | 2048606 | 2048837 | 2048606 | 2048837 | 1 | 2048606, | 2048837, | 0 | nab1/2 | incmpl | incmpl | 0, |
73 | NM_001094891 | chr2_GL841448_random | - | 148417 | 637330 | 148417 | 637330 | 9 | 148417,172343,172917,221371,221782,221848,386542,637037,637282, | 148465,172497,173090,221410,221830,221889,386610,637102,637330, | 0 | znf268.S | incmpl | incmpl | 0,2,0,0,0,1,2,0,0, |
585 | NM_001365069 | chr1_GL835609_random | + | 11065 | 11218 | 11065 | 11218 | 1 | 11065, | 11218, | 0 | ASTN2 | incmpl | incmpl | 0, |
602 | NM_001000978 | chr2_GL841550_random | - | 2335566 | 2336475 | 2335566 | 2336475 | 3 | 2335566,2335806,2336235, | 2335801,2336214,2336475, | 0 | Olr1374 | incmpl | incmpl | 2,0,0, |
585 | NM_001099269 | chr5_GL861552_random | - | 4723 | 36341 | 5025 | 36341 | 7 | 4723,5008,5356,34114,34595,35542,36287, | 4738,5077,5455,34175,34870,35699,36341, | 0 | ZNF506 | incmpl | cmpl | -1,2,0,2,0,2,2, |
585 | NM_001206370 | chr5_GL862015_random | - | 382 | 25465 | 382 | 25465 | 38 | 382,589,823,4736,8954,9891,9895,9949,10370,16442,16776,16994,18247,18304,18461,18743,19441,19696,20185,20587,20750,20859,20992,2 ... | 459,724,919,4790,9033,9894,9946,10040,10562,16680,16883,17173,18295,18366,18657,18803,19615,19926,20342,20656,20858,20901,21103, ... | 0 | SBF1 | incmpl | incmpl | 0,0,0,0,0,0,0,2,2,1,2,0,0,1,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,1,0,0,0,0,1, |
593 | NR_134506 | chr1_GL834656_random | + | 1128455 | 1128878 | 1128878 | 1128878 | 1 | 1128455, | 1128878, | 0 | TMEM250 | unk | unk | -1, |
585 | NM_001092934 | chr2_GL841683_random | + | 9434 | 10091 | 9434 | 10091 | 1 | 9434, | 10091, | 0 | ywhah.L | incmpl | cmpl | 0, |
585 | NM_001093034 | chr1_GL834554_random | + | 60031 | 69226 | 60031 | 69226 | 4 | 60031,61138,67549,69121, | 60151,61348,67720,69226, | 0 | ccdc102a.S | incmpl | incmpl | 0,0,0,0, |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
|