Human Gene GTF2H4 (ENST00000259895.9_4) from GENCODE V47lift37
  Description: general transcription factor IIH subunit 4 (from RefSeq NM_001517.5)
Gencode Transcript: ENST00000259895.9_4
Gencode Gene: ENSG00000213780.11_8
Transcript (Including UTRs)
   Position: hg19 chr6:30,875,984-30,881,883 Size: 5,900 Total Exon Count: 14 Strand: +
Coding Region
   Position: hg19 chr6:30,876,814-30,881,760 Size: 4,947 Coding Exon Count: 13 

Page IndexSequence and LinksUniProtKB CommentsPrimersMalaCardsCTD
Gene AllelesRNA-Seq ExpressionMicroarray ExpressionRNA StructureProtein StructureOther Species
GO AnnotationsmRNA DescriptionsPathwaysOther NamesModel InformationMethods
Data last updated at UCSC: 2024-08-22 23:36:26

-  Sequence and Links to Tools and Databases
 
Genomic Sequence (chr6:30,875,984-30,881,883)mRNA (may differ from genome)Protein (462 aa)
Gene SorterGenome BrowserOther Species FASTAVisiGeneGene interactionsTable Schema
AlphaFoldBioGPSEnsemblEntrez GeneExonPrimerGeneCards
HGNCMalacardsMGIOMIMPubMedReactome
UniProtKBWikipediaBioGrid CRISPR DB

-  Comments and Description Text from UniProtKB
  ID: TF2H4_HUMAN
DESCRIPTION: RecName: Full=General transcription factor IIH subunit 4; AltName: Full=Basic transcription factor 2 52 kDa subunit; Short=BTF2 p52; AltName: Full=General transcription factor IIH polypeptide 4; AltName: Full=TFIIH basal transcription factor complex p52 subunit;
FUNCTION: Component of the core-TFIIH basal transcription factor involved in nucleotide excision repair (NER) of DNA and, when complexed to CAK, in RNA transcription by RNA polymerase II.
SUBUNIT: One of the 6 subunits forming the core-TFIIH basal transcription factor which associates with the CAK complex composed of CDK7, CCNH/cyclin H and MNAT1 to form the TFIIH basal transcription factor.
SUBCELLULAR LOCATION: Nucleus.
SIMILARITY: Belongs to the TFB2 family.
WEB RESOURCE: Name=NIEHS-SNPs; URL="http://egp.gs.washington.edu/data/gtf2h4/";

-  Primer design for this transcript
 

Primer3Plus can design qPCR Primers that straddle exon-exon-junctions, which amplify only cDNA, not genomic DNA.
Click here to load the transcript sequence and exon structure into Primer3Plus

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.
Click here to open Exonprimer with this transcript

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3


-  MalaCards Disease Associations
  MalaCards Gene Search: GTF2H4
Diseases sorted by gene-association score: trichothiodystrophy 1, photosensitive (6), orchitis (5), xeroderma pigmentosum, group b (5), cockayne syndrome (3)

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene

+  Common Gene Haplotype Alleles
  Press "+" in the title bar above to open this section.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 22.40 RPKM in Brain - Cerebellum
Total median expression: 579.54 RPKM



View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

+  Microarray Expression Data
  Press "+" in the title bar above to open this section.

-  mRNA Secondary Structure of 3' and 5' UTRs
 
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -53.60200-0.268 Picture PostScript Text
3' UTR -27.50123-0.224 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR004598 - Tfb2

Pfam Domains:
PF03849 - Transcription factor Tfb2
PF18307 - Transcription factor Tfb2 (p52) C-terminal domain

SCOP Domains:
46785 - "Winged helix" DNA-binding domain

ModBase Predicted Comparative 3D Structure on Q92759
FrontTopSide
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
No orthologNo orthologNo orthologNo orthologNo orthologNo ortholog
Gene DetailsGene Details    
Gene SorterGene Sorter    
 RGD    
      
      

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0003690 double-stranded DNA binding
GO:0003700 transcription factor activity, sequence-specific DNA binding
GO:0004003 ATP-dependent DNA helicase activity
GO:0005515 protein binding
GO:0004672 protein kinase activity
GO:0008094 DNA-dependent ATPase activity
GO:0008353 RNA polymerase II carboxy-terminal domain kinase activity

Biological Process:
GO:0006281 DNA repair
GO:0006283 transcription-coupled nucleotide-excision repair
GO:0006289 nucleotide-excision repair
GO:0006293 nucleotide-excision repair, preincision complex stabilization
GO:0006294 nucleotide-excision repair, preincision complex assembly
GO:0006296 nucleotide-excision repair, DNA incision, 5'-to lesion
GO:0006351 transcription, DNA-templated
GO:0006355 regulation of transcription, DNA-templated
GO:0006361 transcription initiation from RNA polymerase I promoter
GO:0006363 termination of RNA polymerase I transcription
GO:0006366 transcription from RNA polymerase II promoter
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0006368 transcription elongation from RNA polymerase II promoter
GO:0006370 7-methylguanosine mRNA capping
GO:0006974 cellular response to DNA damage stimulus
GO:0032508 DNA duplex unwinding
GO:0033683 nucleotide-excision repair, DNA incision
GO:0070816 phosphorylation of RNA polymerase II C-terminal domain
GO:0070911 global genome nucleotide-excision repair

Cellular Component:
GO:0000438 core TFIIH complex portion of holo TFIIH complex
GO:0000439 core TFIIH complex
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0005669 transcription factor TFIID complex
GO:0005675 holo TFIIH complex
GO:0016607 nuclear speck


-  Descriptions from all associated GenBank mRNAs
  AK309869 - Homo sapiens cDNA, FLJ99910.
LF210804 - JP 2014500723-A/18307: Polycomb-Associated Non-Coding RNAs.
AK298058 - Homo sapiens cDNA FLJ58492 complete cds, highly similar to TFIIH basal transcription factor complex p52 subunit.
AK300239 - Homo sapiens cDNA FLJ50212 complete cds, highly similar to TFIIH basal transcription factor complex p52 subunit.
BC004935 - Homo sapiens general transcription factor IIH, polypeptide 4, 52kDa, mRNA (cDNA clone MGC:10768 IMAGE:3606718), complete cds.
BC016302 - Homo sapiens general transcription factor IIH, polypeptide 4, 52kDa, mRNA (cDNA clone MGC:16269 IMAGE:3830902), complete cds.
AB209479 - Homo sapiens premature mRNA for VARS2L protein variant.
JD291035 - Sequence 272059 from Patent EP1572962.
Y07595 - Homo sapiens mRNA for 52 kD subunit of transcription factor TFIIH (p52 gene).
AK222607 - Homo sapiens mRNA for general transcription factor IIH, polypeptide 4, 52kDa variant, clone: CAS07035.
JD299779 - Sequence 280803 from Patent EP1572962.
BT007321 - Homo sapiens general transcription factor IIH, polypeptide 4, 52kDa mRNA, complete cds.
DQ893165 - Synthetic construct clone IMAGE:100005795; FLH194551.01X; RZPDo839C0780D general transcription factor IIH, polypeptide 4, 52kDa (GTF2H4) gene, encodes complete protein.
DQ896463 - Synthetic construct Homo sapiens clone IMAGE:100010923; FLH194547.01L; RZPDo839C0770D general transcription factor IIH, polypeptide 4, 52kDa (GTF2H4) gene, encodes complete protein.
JD025396 - Sequence 6420 from Patent EP1572962.
JD020906 - Sequence 1930 from Patent EP1572962.
JD025382 - Sequence 6406 from Patent EP1572962.
JD462423 - Sequence 443447 from Patent EP1572962.
JD052100 - Sequence 33124 from Patent EP1572962.
MA446381 - JP 2018138019-A/18307: Polycomb-Associated Non-Coding RNAs.

-  Biochemical and Signaling Pathways
  Reactome (by CSHL, EBI, and GO)

Protein Q92759 (Reactome details) participates in the following event(s):

R-HSA-73758 Recruitment of Active RNA Polymerase I to SL1:phos.UBF-1:rDNA Promoter
R-HSA-109639 Formation of the closed pre-initiation complex
R-HSA-112379 Recruitment of elongation factors to form elongation complex
R-HSA-112383 Hypophosphorylation of RNA Pol II CTD by FCP1P protein
R-HSA-167072 Hypophosphorylation of RNA Pol II CTD by FCP1P protein
R-HSA-167077 Recruitment of elongation factors to form HIV-1 elongation complex
R-HSA-167196 Recruitment of elongation factors to form HIV-1 elongation complex
R-HSA-5691000 TFIIH binds GG-NER site to form a verification complex
R-HSA-73946 Abortive initiation
R-HSA-75856 Abortive Initiation Before Second Transition
R-HSA-75891 Abortive Initiation After Second Transition
R-HSA-77090 Methylation of GMP-cap by RNA Methyltransferase
R-HSA-112385 Addition of nucleotides leads to transcript elongation
R-HSA-167181 Addition of nucleotides leads to HIV-1 transcript elongation
R-HSA-167468 Abortive HIV-1 Initiation After Second Transition
R-HSA-167474 Abortive HIV-1 Initiation Before Second Transition
R-HSA-167477 Abortive HIV-1 initiation after formation of the first phosphodiester bond
R-HSA-5690988 3'-incision of DNA by ERCC5 (XPG) in GG-NER
R-HSA-73769 Loss of Rrn3 from RNA Polymerase I promoter escape complex
R-HSA-74994 Polymerase I Transcription Complex/Nascent Pre rRNA Complex pauses at the TTF-I:Sal Box
R-HSA-74992 Dissociation of PTRF:Polymerase I/Nascent Pre rRNA Complex:TTF-I:Sal Box
R-HSA-75873 Addition of Nucleotides 5 through 9 on the growing Transcript
R-HSA-76576 Addition of nucleotides 10 and 11 on the growing transcript: Third Transition
R-HSA-111264 Addition of nucleotides between position +11 and +30
R-HSA-77068 Activation of GT
R-HSA-77069 RNA Polymerase II CTD (phosphorylated) binds to CE
R-HSA-77073 SPT5 subunit of Pol II binds the RNA triphosphatase (RTP)
R-HSA-77077 Capping complex formation
R-HSA-75864 Newly Formed Phosphodiester Bond Stabilized and PPi Released
R-HSA-75866 Nucleophillic Attack by 3'-hydroxyl Oxygen of nascent transcript on the Alpha Phosphate of NTP
R-HSA-75949 RNA Polymerase II Promoter Opening: First Transition
R-HSA-75862 Fall Back to Closed Pre-initiation Complex
R-HSA-75861 NTP Binds Active Site of RNA Polymerase II
R-HSA-113430 Extrusion of 5'-end of 30 nt long transcript through the pore in Pol II complex
R-HSA-77071 Phosphorylation (Ser5) of RNA pol II CTD
R-HSA-167117 Addition of nucleotides 10 and 11 on the growing HIV-1 transcript: Third Transition
R-HSA-167136 Addition of nucleotides 5 through 9 on the growing HIV-1 transcript
R-HSA-167134 Newly formed phosphodiester bond stabilized and PPi released
R-HSA-167098 Phosphorylation (Ser5) of RNA pol II CTD
R-HSA-167111 Extrusion of 5'-end of 30 nt long HIV-1 transcript through the pore in Pol II complex
R-HSA-167130 Nucleophillic attack by 3'-hydroxyl oxygen of nascent HIV-1 transcript on the Alpha phosphate of NTP
R-HSA-167133 Activation of GT
R-HSA-167128 RNA Polymerase II CTD (phosphorylated) binds to CE
R-HSA-167115 Addition of nucleotides between position +11 and +30 on HIV-1 transcript
R-HSA-167153 SPT5 subunit of Pol II binds the RNA triphosphatase (RTP)
R-HSA-167097 HIV Promoter Opening: First Transition
R-HSA-167484 Fall Back to Closed Pre-initiation Complex
R-HSA-167118 NTP binds active site of RNA Polymerase II in HIV-1 open pre-initiation complex
R-HSA-5689861 Recruitment of XPA and release of CAK
R-HSA-6781840 ERCC6 binds stalled RNA Pol II
R-HSA-6782211 DNA polymerases delta, epsilon or kappa bind the TC-NER site
R-HSA-6782204 5' incision of damaged DNA strand by ERCC1:ERCC4 in TC-NER
R-HSA-6782224 3' incision by ERCC5 (XPG) in TC-NER
R-HSA-6782227 Ligation of newly synthesized repair patch to incised DNA in TC-NER
R-HSA-6782208 Repair DNA synthesis of ~27-30 bases long patch by POLD, POLE or POLK in TC-NER
R-HSA-6797616 CCNK:CDK12 binds RNA Pol II at DNA repair genes
R-HSA-5696670 CHD1L is recruited to GG-NER site
R-HSA-5690213 DNA polymerases delta, epsilon or kappa bind the GG-NER site
R-HSA-6790454 SUMOylation of XPC
R-HSA-5690996 ERCC2 and ERCC3 DNA helicases form an open bubble structure in damaged DNA
R-HSA-5690990 5'- incision of DNA by ERCC1:ERCC4 in GG-NER
R-HSA-6790487 RNF111 ubiquitinates SUMOylated XPC
R-HSA-5689317 Formation of the pre-incision complex in GG-NER
R-HSA-74993 PTRF Binds the Polymerase I Transcription Complex/Nascent Pre rRNA Complex paused at the TTF-I:Sal Box
R-HSA-74986 Elongation of pre-rRNA transcript
R-HSA-427366 Transcription of intergenic spacer of the rRNA gene
R-HSA-77078 Hydrolysis of the 5'-end of the nascent transcript by the capping enzyme
R-HSA-77081 Formation of the CE:GMP intermediate complex
R-HSA-77085 Dissociation of transcript with 5'-GMP from GT
R-HSA-77083 Transfer of GMP from the capping enzyme GT site to 5'-end of mRNA
R-HSA-6781833 ERCC8 (CSA) binds stalled RNA Pol II
R-HSA-6797606 CDK12 phosphorylates RNA Pol II CTD at DNA repair genes
R-HSA-5690991 Binding of ERCC1:ERCC4 (ERCC1:XPF) to pre-incision complex in GG-NER
R-HSA-6781867 ERCC8:DDB1:CUL4:RBX1 ubiquitinates ERCC6 and RNA Pol II
R-HSA-6782004 Assembly of the pre-incision complex in TC-NER
R-HSA-6782069 UVSSA:USP7 deubiquitinates ERCC6
R-HSA-6782131 RNA Pol II backtracking in TC-NER
R-HSA-6782138 ERCC5 and RPA bind TC-NER site
R-HSA-6782141 Binding of ERCC1:ERCC4 (ERCC1:XPF) to pre-incision complex in TC-NER
R-HSA-73762 RNA Polymerase I Transcription Initiation
R-HSA-73779 RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-112382 Formation of RNA Pol II elongation complex
R-HSA-113418 Formation of the Early Elongation Complex
R-HSA-167158 Formation of the HIV-1 Early Elongation Complex
R-HSA-167152 Formation of HIV elongation complex in the absence of HIV Tat
R-HSA-167200 Formation of HIV-1 elongation complex containing HIV-1 Tat
R-HSA-5696395 Formation of Incision Complex in GG-NER
R-HSA-6781823 Formation of TC-NER Pre-Incision Complex
R-HSA-674695 RNA Polymerase II Pre-transcription Events
R-HSA-72086 mRNA Capping
R-HSA-75955 RNA Polymerase II Transcription Elongation
R-HSA-167246 Tat-mediated elongation of the HIV-1 transcript
R-HSA-167162 RNA Polymerase II HIV Promoter Escape
R-HSA-167161 HIV Transcription Initiation
R-HSA-6781827 Transcription-Coupled Nucleotide Excision Repair (TC-NER)
R-HSA-5696400 Dual Incision in GG-NER
R-HSA-73772 RNA Polymerase I Promoter Escape
R-HSA-73863 RNA Polymerase I Transcription Termination
R-HSA-73776 RNA Polymerase II Promoter Escape
R-HSA-77075 RNA Pol II CTD phosphorylation and interaction with CE
R-HSA-75953 RNA Polymerase II Transcription Initiation
R-HSA-76042 RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-167160 RNA Pol II CTD phosphorylation and interaction with CE during HIV infection
R-HSA-167172 Transcription of the HIV genome
R-HSA-6782135 Dual incision in TC-NER
R-HSA-6782210 Gap-filling DNA repair synthesis and ligation in TC-NER
R-HSA-6796648 TP53 Regulates Transcription of DNA Repair Genes
R-HSA-73854 RNA Polymerase I Promoter Clearance
R-HSA-73857 RNA Polymerase II Transcription
R-HSA-167169 HIV Transcription Elongation
R-HSA-5696399 Global Genome Nucleotide Excision Repair (GG-NER)
R-HSA-8953854 Metabolism of RNA
R-HSA-5696398 Nucleotide Excision Repair
R-HSA-73864 RNA Polymerase I Transcription
R-HSA-73777 RNA Polymerase I Chain Elongation
R-HSA-427413 NoRC negatively regulates rRNA expression
R-HSA-162599 Late Phase of HIV Life Cycle
R-HSA-3700989 Transcriptional Regulation by TP53
R-HSA-74160 Gene expression (Transcription)
R-HSA-73894 DNA Repair
R-HSA-5250941 Negative epigenetic regulation of rRNA expression
R-HSA-162587 HIV Life Cycle
R-HSA-212436 Generic Transcription Pathway
R-HSA-212165 Epigenetic regulation of gene expression
R-HSA-162906 HIV Infection
R-HSA-5663205 Infectious disease
R-HSA-1643685 Disease

-  Other Names for This Gene
  Alternate Gene Symbols: B4DTJ5, ENST00000259895.1, ENST00000259895.2, ENST00000259895.3, ENST00000259895.4, ENST00000259895.5, ENST00000259895.6, ENST00000259895.7, ENST00000259895.8, NM_001517, Q76KU4, Q92759, TF2H4_HUMAN, uc317gic.1, uc317gic.2
UCSC ID: ENST00000259895.9_4
RefSeq Accession: NM_001517.5
Protein: Q92759 (aka TF2H4_HUMAN or TFH4_HUMAN)

-  Gene Model Information
  Click here for a detailed description of the fields of the table above.

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.