Human Gene SP1 (ENST00000327443.9_10) from GENCODE V47lift37
  Description: Sp1 transcription factor, transcript variant 1 (from RefSeq NM_138473.3)
Gencode Transcript: ENST00000327443.9_10
Gencode Gene: ENSG00000185591.10_14
Transcript (Including UTRs)
   Position: hg19 chr12:53,773,960-53,810,230 Size: 36,271 Total Exon Count: 6 Strand: +
Coding Region
   Position: hg19 chr12:53,774,076-53,805,024 Size: 30,949 Coding Exon Count: 6 

Page IndexSequence and LinksUniProtKB CommentsPrimersMalaCardsCTD
Gene AllelesRNA-Seq ExpressionMicroarray ExpressionRNA StructureProtein StructureOther Species
GO AnnotationsmRNA DescriptionsPathwaysOther NamesModel InformationMethods
Data last updated at UCSC: 2024-08-22 23:36:26

-  Sequence and Links to Tools and Databases
 
Genomic Sequence (chr12:53,773,960-53,810,230)mRNA (may differ from genome)Protein (785 aa)
Gene SorterGenome BrowserOther Species FASTAVisiGeneGene interactionsTable Schema
AlphaFoldBioGPSEnsemblEntrez GeneExonPrimerGeneCards
HGNCMalacardsMGIOMIMPubMedReactome
UniProtKBWikipediaBioGrid CRISPR DB

-  Comments and Description Text from UniProtKB
  ID: SP1_HUMAN
DESCRIPTION: RecName: Full=Transcription factor Sp1;
FUNCTION: Transcription factor that can activate or repress transcription in response to physiological and pathological stimuli. Binds with high affinity to GC-rich motifs and regulates the expression of a large number of genes involved in a variety of processes such as cell growth, apoptosis, differentiation and immune responses. Highly regulated by post-translational modifications (phosphorylations, sumoylation, proteolytic cleavage, glycosylation and acetylation). Binds also the PDGFR- alpha G-box promoter. May have a role in modulating the cellular response to DNA damage. Implicated in chromatin remodeling. Plays a role in the recruitment of SMARCA4/BRG1 on the c-FOS promoter. Plays an essential role in the regulation of FE65 gene expression. In complex with ATF7IP, maintains telomerase activity in cancer cells by inducing TERT and TERC gene expression.
SUBUNIT: Interacts with ATF7IP, ATF7IP2, BAHD1, POGZ, HCFC1, AATF and PHC2. Interacts with varicella-zoster virus IE62 protein. Interacts with HIV-1 Vpr; the interaction is inhibited by SP1 O- glycosylation. Interacts with SV40 VP2/3 proteins. Interacts with SV40 major capsid protein VP1; this interaction leads to a cooperativity between the 2 proteins in DNA binding. Interacts with HLTF; the interaction may be required for basal transcriptional activity of HLTF. Interacts (deacetylated form) with EP300; the interaction enhances gene expression. Interacts with HDAC1 and JUN. Interacts with ELF1; the interaction is inhibited by glycosylation of SP1. Interaction with NFYA; the interaction is inhibited by glycosylation of SP1. Interacts with SMARCA4/BRG1 (By similarity). Interacts with ATF7IP and TBP.
INTERACTION: Q92988:DLX4; NbExp=4; IntAct=EBI-298336, EBI-1752755; Q01094:E2F1; NbExp=2; IntAct=EBI-298336, EBI-448924; P32519:ELF1; NbExp=2; IntAct=EBI-298336, EBI-765526; P03372:ESR1; NbExp=2; IntAct=EBI-298336, EBI-78473; P51610:HCFC1; NbExp=4; IntAct=EBI-298336, EBI-396176; Q13118:KLF10; NbExp=2; IntAct=EBI-298336, EBI-1389509; P01106:MYC; NbExp=4; IntAct=EBI-298336, EBI-447544; P16333:NCK1; NbExp=2; IntAct=EBI-298336, EBI-389883; P23708:Nfya (xeno); NbExp=18; IntAct=EBI-298336, EBI-862337; Q8IXK0:PHC2; NbExp=2; IntAct=EBI-298336, EBI-713786; Q7Z3K3:POGZ; NbExp=2; IntAct=EBI-298336, EBI-1389308; P14859:POU2F1; NbExp=2; IntAct=EBI-298336, EBI-624770; Q15459:SF3A1; NbExp=2; IntAct=EBI-298336, EBI-1054743; Q13485:SMAD4; NbExp=2; IntAct=EBI-298336, EBI-347263; Q12772:SREBF2; NbExp=3; IntAct=EBI-298336, EBI-465059;
SUBCELLULAR LOCATION: Nucleus. Cytoplasm. Note=Nuclear location is governed by glycosylated/phosphorylated states. Insulin promotes nuclear location, while glucagon favors cytoplasmic location.
TISSUE SPECIFICITY: Up-regulated in adenocarcinomas of the stomach (at protein level).
INDUCTION: By insulin.
PTM: Phosphorylated on multiple serine and threonine residues. Phosphorylation is coupled to ubiquitination, sumoylation and proteolytic processing. Phosphorylation on Ser-59 enhances proteolytic cleavage. Phosphorylation on Ser-7 enhances ubiquitination and protein degradation. Hyperphosphorylation on Ser-101 in response to DNA damage has no effect on transcriptional activity. MAPK1/MAPK3-mediated phosphorylation on Thr-453 and Thr- 739 enhances VEGF transcription but, represses FGF2-triggered PDGFR-alpha transcription. Also implicated in the repression of RECK by ERBB2. Hyperphosphorylated on Thr-278 and Thr-739 during mitosis by MAPK8 shielding SP1 from degradation by the ubiquitin- dependent pathway. Phosphorylated in the zinc-finger domain by calmodulin-activated PKCzeta. Phosphorylation on Ser-641 by PKCzeta is critical for TSA-activated LHR gene expression through release of its repressor, p107. Phosphorylation on Thr-668, Ser- 670 and Thr-681 is stimulated by angiotensin II via the AT1 receptor inducing increased binding to the PDGF-D promoter. This phosphorylation is increased in injured artey wall. Ser-59 and Thr-681 can both be dephosphorylated by PP2A during cell-cycle interphase. Dephosphorylation on Ser-59 leads to increased chromatin association during interphase and increases the transcriptional activity. On insulin stimulation, sequentially glycosylated and phosphorylated on several C-terminal serine and threonine residues.
PTM: Acetylated. Acetylation/deacetylation events affect transcriptional activity. Deacetylation leads to an increase in the expression the 12(s)-lipooxygenase gene though recruitment of p300 to the promoter.
PTM: Ubiquitinated. Ubiquitination occurs on the C-terminal proteolytically-cleaved peptide and is triggered by phosphorylation.
PTM: Sumoylated with SUMO1. Sumoylation modulates proteolytic cleavage of the N-terminal repressor domain. Sumoylation levels are attenuated during tumorigenesis. Phosphorylation mediates SP1 desumoylation.
PTM: Proteolytic cleavage in the N-terminal repressor domain is prevented by sumoylation. The C-terminal cleaved product is susceptible to degradation.
PTM: O-glycosylated; contains at least 8 N-acetylglucosamine side chains. Levels are controlled by insulin and the SP1 phosphorylation states. Insulin-mediated O-glycosylation locates SP1 to the nucleus, where it is sequentially deglycosylated and phosphorylated. O-glycosylation affects transcriptional activity through disrupting the interaction with a number of transcription factors including ELF1 and NFYA. Also inhibits interaction with the HIV1 promoter. Inhibited by peroxisomome proliferator receptor gamma (PPARgamma).
MISCELLANEOUS: In the hepatoma cell line Hep-G2, SP1 precursor mRNA may undergo homotype trans-splicing leading to the duplication of exons 2 and 3.
SIMILARITY: Belongs to the Sp1 C2H2-type zinc-finger protein family.
SIMILARITY: Contains 3 C2H2-type zinc fingers.
SEQUENCE CAUTION: Sequence=AAH43224.1; Type=Erroneous initiation;

-  Primer design for this transcript
 

Primer3Plus can design qPCR Primers that straddle exon-exon-junctions, which amplify only cDNA, not genomic DNA.
Click here to load the transcript sequence and exon structure into Primer3Plus

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.
Click here to open Exonprimer with this transcript

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3


-  MalaCards Disease Associations
  MalaCards Gene Search: SP1
Diseases sorted by gene-association score: huntington disease (4), hiv-1 (4), keratoconus (3), breast cancer (3), colorectal cancer (2), deafness, autosomal dominant 15 (1), adrenal cortical adenocarcinoma (1), hypoparathyroidism-deafness-renal disease syndrome (1)

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene

+  Common Gene Haplotype Alleles
  Press "+" in the title bar above to open this section.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 20.42 RPKM in Esophagus - Mucosa
Total median expression: 634.19 RPKM



View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

+  Microarray Expression Data
  Press "+" in the title bar above to open this section.

-  mRNA Secondary Structure of 3' and 5' UTRs
 
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -31.80116-0.274 Picture PostScript Text
3' UTR -1450.205206-0.279 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR007087 - Znf_C2H2
IPR015880 - Znf_C2H2-like
IPR013087 - Znf_C2H2/integrase_DNA-bd

Pfam Domains:
PF00096 - Zinc finger, C2H2 type
PF13894 - C2H2-type zinc finger
PF13912 - C2H2-type zinc finger

SCOP Domains:
57667 - beta-beta-alpha zinc fingers

Protein Data Bank (PDB) 3-D Structure
MuPIT help
1SP1 - NMR MuPIT 1SP2 - NMR MuPIT 1VA1 - NMR MuPIT 1VA2 - NMR MuPIT 1VA3 - NMR MuPIT


ModBase Predicted Comparative 3D Structure on P08047
FrontTopSide
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
No orthologNo orthologNo orthologNo orthologNo orthologNo ortholog
Gene DetailsGene Details    
Gene SorterGene Sorter    
 RGDEnsembl   
      
      

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0000977 RNA polymerase II regulatory region sequence-specific DNA binding
GO:0000978 RNA polymerase II core promoter proximal region sequence-specific DNA binding
GO:0000981 RNA polymerase II transcription factor activity, sequence-specific DNA binding
GO:0000982 transcription factor activity, RNA polymerase II core promoter proximal region sequence-specific binding
GO:0000987 core promoter proximal region sequence-specific DNA binding
GO:0001046 core promoter sequence-specific DNA binding
GO:0001077 transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding
GO:0001103 RNA polymerase II repressing transcription factor binding
GO:0003676 nucleic acid binding
GO:0003677 DNA binding
GO:0003690 double-stranded DNA binding
GO:0003700 transcription factor activity, sequence-specific DNA binding
GO:0005515 protein binding
GO:0008022 protein C-terminus binding
GO:0008134 transcription factor binding
GO:0035035 histone acetyltransferase binding
GO:0042803 protein homodimerization activity
GO:0042826 histone deacetylase binding
GO:0043425 bHLH transcription factor binding
GO:0043565 sequence-specific DNA binding
GO:0044212 transcription regulatory region DNA binding
GO:0046872 metal ion binding
GO:0070491 repressing transcription factor binding
GO:0071837 HMG box domain binding

Biological Process:
GO:0006351 transcription, DNA-templated
GO:0006355 regulation of transcription, DNA-templated
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0010628 positive regulation of gene expression
GO:0016032 viral process
GO:0032869 cellular response to insulin stimulus
GO:0033194 response to hydroperoxide
GO:0042795 snRNA transcription from RNA polymerase II promoter
GO:0043536 positive regulation of blood vessel endothelial cell migration
GO:0043923 positive regulation by host of viral transcription
GO:0045540 regulation of cholesterol biosynthetic process
GO:0045766 positive regulation of angiogenesis
GO:0045893 positive regulation of transcription, DNA-templated
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0048511 rhythmic process
GO:0100057 regulation of phenotypic switching by transcription from RNA polymerase II promoter
GO:1904828 positive regulation of hydrogen sulfide biosynthetic process
GO:1905564 positive regulation of vascular endothelial cell proliferation

Cellular Component:
GO:0000790 nuclear chromatin
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0005737 cytoplasm
GO:0017053 transcriptional repressor complex
GO:0032993 protein-DNA complex


-  Descriptions from all associated GenBank mRNAs
  BC062539 - Homo sapiens Sp1 transcription factor, mRNA (cDNA clone MGC:71333 IMAGE:5928633), complete cds.
AF255682 - Homo sapiens transcription factor Sp1 mRNA, partial cds.
AJ272134 - Homo sapiens partial mRNA for transcription factor SP1 (SP1 gene).
BC043224 - Homo sapiens Sp1 transcription factor, mRNA (cDNA clone IMAGE:5295465), partial cds.
AB528280 - Synthetic construct DNA, clone: pF1KB6290, Homo sapiens SP1 gene for Sp1 transcription factor, without stop codon, in Flexi system.
FN908228 - Homo sapiens mRNA for transcription factor Sp1 (SP1 gene), splice variant c.
AF252284 - Homo sapiens transcription specificity factor Sp1 mRNA, partial cds.
J03133 - Human transcription factor SP1 mRNA, 3' end.
DQ582593 - Homo sapiens piRNA piR-32705, complete sequence.
JD381694 - Sequence 362718 from Patent EP1572962.
AB209678 - Homo sapiens mRNA for Sp1 transcription factor variant protein.
FJ949572 - Homo sapiens specificity protein 1 (Sp1) mRNA, complete cds, alternatively spliced.
JD515057 - Sequence 496081 from Patent EP1572962.
JD279434 - Sequence 260458 from Patent EP1572962.
JD047249 - Sequence 28273 from Patent EP1572962.
JD450006 - Sequence 431030 from Patent EP1572962.
JD519023 - Sequence 500047 from Patent EP1572962.
JD285846 - Sequence 266870 from Patent EP1572962.
JD055976 - Sequence 37000 from Patent EP1572962.
JD156690 - Sequence 137714 from Patent EP1572962.
JD349719 - Sequence 330743 from Patent EP1572962.
BC012008 - Homo sapiens Sp1 transcription factor, mRNA (cDNA clone IMAGE:4516234).
JD396617 - Sequence 377641 from Patent EP1572962.
JD377142 - Sequence 358166 from Patent EP1572962.
JD501647 - Sequence 482671 from Patent EP1572962.
BC078158 - Homo sapiens cDNA clone IMAGE:6160021, partial cds.
AK311995 - Homo sapiens cDNA, FLJ92269.
JD304964 - Sequence 285988 from Patent EP1572962.
JD402916 - Sequence 383940 from Patent EP1572962.
JD181839 - Sequence 162863 from Patent EP1572962.
JD438448 - Sequence 419472 from Patent EP1572962.
JD183256 - Sequence 164280 from Patent EP1572962.
JD137215 - Sequence 118239 from Patent EP1572962.
JD085213 - Sequence 66237 from Patent EP1572962.
JD469545 - Sequence 450569 from Patent EP1572962.
JD109402 - Sequence 90426 from Patent EP1572962.
JD162144 - Sequence 143168 from Patent EP1572962.
JD322509 - Sequence 303533 from Patent EP1572962.
JD418779 - Sequence 399803 from Patent EP1572962.
JD510709 - Sequence 491733 from Patent EP1572962.
JD087226 - Sequence 68250 from Patent EP1572962.
JD087227 - Sequence 68251 from Patent EP1572962.
JD262473 - Sequence 243497 from Patent EP1572962.
JD296989 - Sequence 278013 from Patent EP1572962.
JD126103 - Sequence 107127 from Patent EP1572962.
JD080647 - Sequence 61671 from Patent EP1572962.
JD524083 - Sequence 505107 from Patent EP1572962.
AL442093 - Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244).

-  Biochemical and Signaling Pathways
  Reactome (by CSHL, EBI, and GO)

Protein P08047 (Reactome details) participates in the following event(s):

R-HSA-2187309 SMAD2/3:SMAD4 heterotrimer binds SP1
R-HSA-2426146 SREBP1A,1C,2 binds the FDFT1 promoter
R-HSA-2426148 SREBP1A,1C binds the FASN promoter
R-HSA-2426152 SREBP1A,2 binds the IDI1 promoter
R-HSA-2426153 SREBP1A,1C binds the ACACB promoter
R-HSA-2426155 SREBP1A,2 binds the DHCR7 promoter
R-HSA-2426156 SREBP1A,2 binds the PMVK promoter
R-HSA-2426157 SREBP1A,2 binds the MVK promoter
R-HSA-2426161 SREBP1A,2 binds the GGPS1 promoter
R-HSA-2426163 SREBP1A,2 binds the CYP51A1 promoter
R-HSA-3209096 E2F1, E2F2, or E2F3 and SP1 bind p14-ARF promoter
R-HSA-6807496 POU2F1 (OCT1) or POU2F2 (OCT2), SP1, and ZNF143 (STAF) bind the DSE of snRNA gene (U1, U2, U4, U4atac, U5, U11, U12)
R-HSA-9023840 HATs and coactivators are recruited to the CTSD gene
R-HSA-6810239 SNAPc binds PSE of snRNA gene (U1, U2, U4, U4atac, U5, U11, U12)
R-HSA-6810234 General transcription factors bind SNAPc:POU2F1:ZNF143:snRNA gene
R-HSA-6810233 CDK7 phosphorylates serine-5 and serine-7 of heptad repeats in C-terminal domain of RNA polymerase II at snRNA promoter
R-HSA-6810238 RNA polymerase II binds initiation factors at promoter of snRNA gene (U1, U2, U4, U4atac, U5, U11, U12)
R-HSA-6810235 RPAP2 binds RNA polymerase II phosphorylated at serine-7 residues of heptad repeats in the C-terminal domain
R-HSA-6814885 CBCAP complex binds 7-methylguanosine cap of snRNA
R-HSA-2173796 SMAD2/SMAD3:SMAD4 heterotrimer regulates transcription
R-HSA-2426168 Activation of gene expression by SREBF (SREBP)
R-HSA-2559585 Oncogene Induced Senescence
R-HSA-6807505 RNA polymerase II transcribes snRNA genes
R-HSA-9018519 Estrogen-dependent gene expression
R-HSA-2173793 Transcriptional activity of SMAD2/SMAD3:SMAD4 heterotrimer
R-HSA-1655829 Regulation of cholesterol biosynthesis by SREBP (SREBF)
R-HSA-2559583 Cellular Senescence
R-HSA-73857 RNA Polymerase II Transcription
R-HSA-8939211 ESR-mediated signaling
R-HSA-170834 Signaling by TGF-beta Receptor Complex
R-HSA-212436 Generic Transcription Pathway
R-HSA-8957322 Metabolism of steroids
R-HSA-2262752 Cellular responses to stress
R-HSA-74160 Gene expression (Transcription)
R-HSA-9006931 Signaling by Nuclear Receptors
R-HSA-9006936 Signaling by TGF-beta family members
R-HSA-556833 Metabolism of lipids
R-HSA-8953897 Cellular responses to external stimuli
R-HSA-162582 Signal Transduction
R-HSA-1430728 Metabolism

-  Other Names for This Gene
  Alternate Gene Symbols: E4Z9M7, ENST00000327443.1, ENST00000327443.2, ENST00000327443.3, ENST00000327443.4, ENST00000327443.5, ENST00000327443.6, ENST00000327443.7, ENST00000327443.8, G5E9M8, NM_138473, P08047, Q86TN8, Q9H3Q5, Q9NR51, Q9NY21, Q9NYE7, SP1_HUMAN, TSFP1, uc317shb.1, uc317shb.2
UCSC ID: ENST00000327443.9_10
RefSeq Accession: NM_138473.3
Protein: P08047 (aka SP1_HUMAN)

-  Gene Model Information
  Click here for a detailed description of the fields of the table above.

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.