Cenpu centromere protein U [ Mus musculus (house mouse) ]

Gene ID: 71876, updated on 5-Mar-2024

Summary

Official Symbol
Cenpu (provided by MGI)
Official Full Name
centromere protein U (provided by MGI)
Primary source
MGI:MGI:1919126
See related
Ensembl:ENSMUSG00000031629 AllianceGenome:MGI:1919126
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
Mlf1ip; 1700029A22Rik
Summary
Acts upstream of or within chordate embryonic development. Located in cytoplasm and nucleus. Is expressed in several structures, including alimentary system; central nervous system; gonad; hemolymphoid system; and sensory organ. Orthologous to human CENPU (centromere protein U). [provided by Alliance of Genome Resources, Apr 2022]
Expression
Broad expression in testis adult (RPKM 4.2), liver E14 (RPKM 3.4) and 21 other tissues

Genomic context

Location:
8 B1.1; 8 26.38 cM
Exon count:
14
Annotation release  Status             Assembly                      Chr  Location
RS_2024_02          current            GRCm39 (GCF_000001635.27)     8    NC_000074.7 (47005054..47033603)
108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  8    NC_000074.6 (46552019..46580568)
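
The two location entries above can be compared with a little arithmetic. The following is a minimal Python sketch, assuming the ranges are 1-based and inclusive (the usual convention for GenBank-style "start..end" ranges); the function names parse_location and span_length are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Location strings copied from the assembly table above.
LOCATIONS = {
    "GRCm39": "NC_000074.7 (47005054..47033603)",
    "GRCm38.p6": "NC_000074.6 (46552019..46580568)",
}


def parse_location(text):
    """Split 'ACCESSION (start..end)' into (accession, start, end)."""
    match = re.fullmatch(r"(\S+) \((\d+)\.\.(\d+)\)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    accession, start, end = match.groups()
    return accession, int(start), int(end)


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


spans = {name: parse_location(loc) for name, loc in LOCATIONS.items()}
lengths = {name: span_length(s, e) for name, (_, s, e) in spans.items()}

# Offset of the gene start between the two assemblies.
shift = spans["GRCm39"][1] - spans["GRCm38.p6"][1]

print(lengths)
print(shift)
```

Under that assumption the gene spans the same number of bases in both assemblies, so the change between GRCm38.p6 and GRCm39 appears to be a pure coordinate shift on chromosome 8 rather than a change in the gene's extent.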

Chromosome 8 - NC_000074.7

Neighboring genes:

  • predicted gene, 30931
  • STARR-seq mESC enhancer starr_21383
  • STARR-seq mESC enhancer starr_21384
  • STARR-positive B cell enhancer ABC_E2261
  • acyl-CoA synthetase long-chain family member 1
  • STARR-positive B cell enhancer ABC_E6635
  • CapStarr-seq enhancer MGSCv37_chr8:47637172-47637355
  • primase and polymerase (DNA-directed)
  • predicted gene 45607
  • STARR-positive B cell enhancer ABC_E1365
  • caspase 3

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Bibliography

GeneRIFs: Gene References Into Functions

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics (MGI)

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Clone Names

  • MGC143675, MGC143676

Gene Ontology Provided by MGI

Function                                                   Evidence Code                                      Pubs
enables protein binding                                    IPI (Inferred from Physical Interaction)           PubMed

Process                                                    Evidence Code                                      Pubs
acts_upstream_of_or_within chordate embryonic development  IMP (Inferred from Mutant Phenotype)               PubMed
involved_in chromosome segregation                         NAS (Non-traceable Author Statement)               PubMed

Component                                                  Evidence Code                                      Pubs
located_in centriolar satellite                            ISO (Inferred from Sequence Orthology)
located_in chromosome                                      IEA (Inferred from Electronic Annotation)
located_in chromosome, centromeric region                  IEA (Inferred from Electronic Annotation)
located_in cytoplasm                                       IDA (Inferred from Direct Assay)                   PubMed
part_of inner kinetochore                                  ISO (Inferred from Sequence Orthology)
located_in kinetochore                                     IEA (Inferred from Electronic Annotation)
located_in nucleoplasm                                     ISO (Inferred from Sequence Orthology)
is_active_in nucleus                                       IBA (Inferred from Biological aspect of Ancestor)
located_in nucleus                                         IDA (Inferred from Direct Assay)                   PubMed
located_in nucleus                                         NAS (Non-traceable Author Statement)               PubMed
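
As a reading aid for the Gene Ontology table above, the annotations can be tallied by the kind of evidence behind them. This is a minimal Python sketch; the grouping in EVIDENCE_CLASS follows the Gene Ontology Consortium's broad evidence categories as best understood here and should be treated as an assumption, and the variable names are illustrative only.

```python
from collections import Counter

# (qualifier, term, evidence code) transcribed from the table above.
ANNOTATIONS = [
    ("enables", "protein binding", "IPI"),
    ("acts_upstream_of_or_within", "chordate embryonic development", "IMP"),
    ("involved_in", "chromosome segregation", "NAS"),
    ("located_in", "centriolar satellite", "ISO"),
    ("located_in", "chromosome", "IEA"),
    ("located_in", "chromosome, centromeric region", "IEA"),
    ("located_in", "cytoplasm", "IDA"),
    ("part_of", "inner kinetochore", "ISO"),
    ("located_in", "kinetochore", "IEA"),
    ("located_in", "nucleoplasm", "ISO"),
    ("is_active_in", "nucleus", "IBA"),
    ("located_in", "nucleus", "IDA"),
    ("located_in", "nucleus", "NAS"),
]

# ASSUMED grouping of evidence codes into broad categories.
EVIDENCE_CLASS = {
    "IPI": "experimental",
    "IMP": "experimental",
    "IDA": "experimental",
    "ISO": "computational analysis",
    "IBA": "phylogenetic",
    "IEA": "electronic",
    "NAS": "author statement",
}

counts = Counter(EVIDENCE_CLASS[code] for _, _, code in ANNOTATIONS)

# Terms backed by at least one experimental evidence code.
experimental_terms = sorted(
    {term for _, term, code in ANNOTATIONS
     if EVIDENCE_CLASS[code] == "experimental"}
)

print(counts)
print(experimental_terms)
```

Separating experimental codes from electronic or orthology-based ones makes it easier to see which localizations were observed directly in mouse and which were carried over from other evidence.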

General protein information

Preferred Names
centromere protein U
Names
CENP-U
MLF1-interacting protein
myeloid leukemia factor 1 interacting protein

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
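
The version comparison described above can be sketched in Python as follows. The helper names split_accession and find_version_mismatches are hypothetical, and the annotated-build list in the usage lines is a made-up placeholder: this page does not reproduce the versions reported under Genomic regions, transcripts, and products, so real values would have to be copied from that section.

```python
def split_accession(accession_version):
    """Split an 'ACCESSION.VERSION' string such as 'NM_027973.4'."""
    accession, _, version = accession_version.rpartition(".")
    if not accession or not version.isdigit():
        raise ValueError(f"not an accession.version: {accession_version!r}")
    return accession, int(version)


def find_version_mismatches(standalone, annotated):
    """Return {accession: (standalone_version, annotated_version)}
    for accessions present in both lists with differing versions."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = {}
    for item in standalone:
        accession, version = split_accession(item)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches[accession] = (version, other)
    return mismatches


# Accessions listed in this section of the record.
standalone = ["NM_001368403.1", "NM_027973.4", "NR_160797.1", "NR_160798.1"]

# HYPOTHETICAL placeholder values, for illustration only; they are not
# taken from the genome build.
annotated_example = ["NM_001368403.1", "NM_027973.3"]

mismatches = find_version_mismatches(standalone, annotated_example)
print(mismatches)
```

Accessions that appear in only one of the two lists are ignored in this sketch; whether they should be reported separately depends on the use case.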

mRNA and Protein(s)

  1. NM_001368403.1 → NP_001355332.1  centromere protein U isoform 2

    Status: VALIDATED

    Source sequence(s)
    AC119267
    UniProtKB/TrEMBL
    Q149H7
    Conserved Domains (2) summary
    pfam13097
    Location: 139 → 307
    CENP-U; CENP-A nucleosome associated complex (NAC) subunit
    pfam01496
    Location: 242 → 384
    V_ATPase_I; V-type ATPase 116kDa subunit family
  2. NM_027973.4 → NP_082249.1  centromere protein U isoform 1

    Status: VALIDATED

    Source sequence(s)
    AC119267
    Consensus CDS
    CCDS22292.1
    UniProtKB/Swiss-Prot
    Q6UNA2, Q8C4M7, Q9D9U1
    UniProtKB/TrEMBL
    Q149H7
    Related
    ENSMUSP00000034045.8, ENSMUST00000034045.15
    Conserved Domains (1) summary
    pfam13097
    Location: 144 → 312
    CENP-U; CENP-A nucleosome associated complex (NAC) subunit

RNA

  1. NR_160797.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC119267
    Related
    ENSMUST00000135432.8
  2. NR_160798.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC119267

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000074.7 Reference GRCm39 C57BL/6J

    Range
    47005054..47033603

RNA

  1. XR_004934878.1 RNA Sequence

  2. XR_004934879.1 RNA Sequence