Human Gene CENPI (ENST00000372926.5_4) from GENCODE V47lift37
  Description: centromere protein I, transcript variant 3 (from RefSeq NM_001318523.1)
Gencode Transcript: ENST00000372926.5_4
Gencode Gene: ENSG00000102384.14_12
Transcript (Including UTRs)
   Position: hg19 chrX:100,355,426-100,396,282 Size: 40,857 Total Exon Count: 15 Strand: +
Coding Region
   Position: hg19 chrX:100,356,060-100,395,753 Size: 39,694 Coding Exon Count: 14 
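
The sizes reported above follow from the positions if the displayed coordinates are read as 1-based and inclusive. The short sketch below checks that arithmetic; the helper name `span_size` and the variable names are illustrative and not part of the browser. The flanking spans it derives are genomic distances on the + strand and may include intronic sequence, so they need not equal the spliced UTR lengths.

```python
def span_size(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


# Positions copied from the transcript and coding-region lines above (hg19, chrX, + strand).
transcript = (100_355_426, 100_396_282)
coding = (100_356_060, 100_395_753)

transcript_size = span_size(*transcript)
coding_size = span_size(*coding)

# Genomic sequence upstream and downstream of the coding region (introns included).
upstream_flank = coding[0] - transcript[0]
downstream_flank = transcript[1] - coding[1]

print(transcript_size, coding_size, upstream_flank, downstream_flank)
```

The three parts (upstream flank, coding span, downstream flank) add up to the full transcript span.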

Data last updated at UCSC: 2024-08-22 23:36:26

-  Sequence and Links to Tools and Databases
 
Genomic Sequence (chrX:100,355,426-100,396,282); mRNA (may differ from genome); Protein (522 aa)

-  Comments and Description Text from UniProtKB
  ID: CENPI_HUMAN
DESCRIPTION: RecName: Full=Centromere protein I; Short=CENP-I; AltName: Full=FSH primary response protein 1; AltName: Full=Follicle-stimulating hormone primary response protein; AltName: Full=Interphase centromere complex protein 19; AltName: Full=Leucine-rich primary response protein 1;
FUNCTION: Component of the CENPA-CAD (nucleosome distal) complex, a complex recruited to centromeres which is involved in assembly of kinetochore proteins, mitotic progression and chromosome segregation. May be involved in incorporation of newly synthesized CENPA into centromeres via its interaction with the CENPA-NAC complex. Required for the localization of CENPF, MAD1L1 and MAD2 (MAD2L1 or MAD2L2) to kinetochores. Involved in the response of gonadal tissues to follicle-stimulating hormone.
SUBUNIT: Component of the CENPA-CAD complex, composed of CENPI, CENPK, CENPL, CENPO, CENPP, CENPQ, CENPR and CENPS. The CENPA-CAD complex interacts with the CENPA-NAC complex, at least composed of CENPA, CENPC, CENPH, CENPM, CENPN, CENPT and MLF1IP/CENPU. Interacts with SENP6.
SUBCELLULAR LOCATION: Nucleus. Chromosome, centromere. Note=Localizes exclusively in the centromeres. The CENPA-CAD complex is probably recruited on centromeres by the CENPA-NAC complex.
INDUCTION: By follicle-stimulating hormone (FSH).
PTM: Sumoylated by RNF4, leading to its degradation. Desumoylated by SENP6, preventing its degradation.
SIMILARITY: Belongs to the mis6 family.

-  Primer design for this transcript
 

Primer3Plus can design qPCR primers that straddle exon-exon junctions, so that they amplify only cDNA, not genomic DNA.

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3


-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 4.10 RPKM in Cells - EBV-transformed lymphocytes
Total median expression: 15.15 RPKM




-  mRNA Secondary Structure of 3' and 5' UTRs
 
Region    Fold Energy (kcal/mol)    Bases    Energy/Base
5' UTR    -103.10                   277      -0.372
3' UTR    -90.80                    529      -0.172

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.
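
The Energy/Base column appears to be the fold energy divided by the number of bases. The sketch below reproduces that column from the two UTR rows listed above. The function name `energy_per_base` is hypothetical, and rounding to three decimals is an assumption based on how the values are displayed.

```python
def energy_per_base(fold_energy_kcal_mol: float, bases: int) -> float:
    """Fold energy normalized by sequence length, rounded to three decimals."""
    return round(fold_energy_kcal_mol / bases, 3)


# (fold energy in kcal/mol, length in bases), taken from the UTR rows above.
utrs = {
    "5' UTR": (-103.10, 277),
    "3' UTR": (-90.80, 529),
}

per_base = {region: energy_per_base(e, n) for region, (e, n) in utrs.items()}

for region, value in per_base.items():
    print(f"{region}: {value} kcal/mol per base")
```

Normalizing by length makes the two regions comparable: the 5' UTR is shorter but has the more negative energy per base, which suggests denser predicted structure.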

-  Protein Domain and Structure Information
  InterPro Domains:
IPR012485 - Centromere_CenpI

Pfam Domains:
PF07778 - Mis6

ModBase Predicted Comparative 3D Structure on Q92674

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
Species            Ortholog
Mouse              No ortholog
Rat                No ortholog
Zebrafish          No ortholog
D. melanogaster    No ortholog
C. elegans         No ortholog
S. cerevisiae      No ortholog
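
The reciprocal-best-hit rule mentioned for the more distant species can be sketched as follows. This is a minimal illustration, not the browser's pipeline: the protein identifiers and scores are invented toy data, the function names are hypothetical, and the synteny filter used for mouse and rat is not modeled.

```python
def best_hits(scores: dict) -> dict:
    """Map each query to its highest-scoring subject."""
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b: dict, b_vs_a: dict) -> set:
    """Pairs (a, b) where b is a's best hit and a is b's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return {(a, b) for a, b in forward.items() if reverse.get(b) == a}


# Toy alignment scores keyed by (query, subject); all names are made up.
human_vs_fly = {("h1", "f1"): 200, ("h1", "f2"): 150, ("h2", "f2"): 90}
fly_vs_human = {("f1", "h1"): 210, ("f2", "h1"): 140, ("f2", "h2"): 80}

pairs = reciprocal_best_hits(human_vs_fly, fly_vs_human)
print(pairs)
```

In this toy data `h2` has a best hit (`f2`), but `f2` prefers `h1`, so `h2` is left without an ortholog. A gene can end up in the "No ortholog" column the same way even when similar sequences exist.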

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0005515 protein binding

Biological Process:
GO:0007548 sex differentiation
GO:0034080 CENP-A containing nucleosome assembly
GO:0034508 centromere complex assembly

Cellular Component:
GO:0000775 chromosome, centromeric region
GO:0000776 kinetochore
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0005694 chromosome
GO:0005829 cytosol
GO:0016604 nuclear body


-  Descriptions from all associated GenBank mRNAs
  AK302986 - Homo sapiens cDNA FLJ60999 complete cds, highly similar to Centromere protein I.
X97249 - H.sapiens mRNA for leucine-rich primary response protein 1.
JD330133 - Sequence 311157 from Patent EP1572962.
JD413901 - Sequence 394925 from Patent EP1572962.
BC012462 - Homo sapiens centromere protein I, mRNA (cDNA clone MGC:21750 IMAGE:4537558), complete cds.
KJ901433 - Synthetic construct Homo sapiens clone ccsbBroadEn_10827 CENPI gene, encodes complete protein.

-  Biochemical and Signaling Pathways
  Reactome (by CSHL, EBI, and GO)

Protein Q92674 (Reactome details) participates in the following event(s):

R-HSA-606349 Mis18 complex binds the centromere
R-HSA-141409 Mad1 binds kinetochore
R-HSA-375302 Kinetochore capture of astral microtubules
R-HSA-5666129 CDC42:GTP recruits DIAPH2-2 to kinetochores
R-HSA-5666169 Kinetochore capture of astral microtubules is positively regulated by CDC42:GTP:p-S196-DIAPH2-2
R-HSA-606326 HJURP:CENPA complex localizes to the centromere
R-HSA-141431 MAD2 associates with the Mad1 kinetochore complex
R-HSA-141439 Release of activated MAD2 from kinetochores
R-HSA-2467811 Separation of sister chromatids
R-HSA-2467809 ESPL1 (Separase) cleaves centromeric cohesin
R-HSA-5666160 AURKB phosphorylates DIAPH2-2 at kinetochores
R-HSA-141422 MAD2 converted to an inhibitory state via interaction with Mad1
R-HSA-1638821 PP2A-B56 dephosphorylates centromeric cohesin
R-HSA-1638803 Phosphorylation of cohesin by PLK1 at centromeres
R-HSA-2468287 CDK1 phosphorylates CDCA5 (Sororin) at centromeres
R-HSA-606279 Deposition of new CENPA-containing nucleosomes at the centromere
R-HSA-141444 Amplification of signal from unattached kinetochores via a MAD2 inhibitory signal
R-HSA-68877 Mitotic Prometaphase
R-HSA-5663220 RHO GTPases Activate Formins
R-HSA-774815 Nucleosome assembly
R-HSA-2500257 Resolution of Sister Chromatid Cohesion
R-HSA-2467813 Separation of Sister Chromatids
R-HSA-141424 Amplification of signal from the kinetochores
R-HSA-68886 M Phase
R-HSA-195258 RHO GTPase Effectors
R-HSA-73886 Chromosome Maintenance
R-HSA-68882 Mitotic Anaphase
R-HSA-69618 Mitotic Spindle Checkpoint
R-HSA-69278 Cell Cycle (Mitotic)
R-HSA-194315 Signaling by Rho GTPases
R-HSA-1640170 Cell Cycle
R-HSA-2555396 Mitotic Metaphase and Anaphase
R-HSA-69620 Cell Cycle Checkpoints
R-HSA-162582 Signal Transduction

-  Other Names for This Gene
  Alternate Gene Symbols: CENPI_HUMAN, ENST00000372926.1, ENST00000372926.2, ENST00000372926.3, ENST00000372926.4, FSHPRH1, ICEN19, LRPR1, NM_001318523, Q5JWZ9, Q92674, Q96ED0, uc318kii.1
UCSC ID: ENST00000372926.5_4
RefSeq Accession: NM_001318523.1
Protein: Q92674 (aka CENPI_HUMAN)
