Wdr33 WD repeat domain 33 [ Mus musculus (house mouse) ]

Gene ID: 74320, updated on 11-Apr-2024

Summary

Official Symbol
Wdr33 (provided by MGI)
Official Full Name
WD repeat domain 33 (provided by MGI)
Primary source
MGI:MGI:1921570
See related
Ensembl:ENSMUSG00000024400 AllianceGenome:MGI:1921570
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
WDC146; 1110001N06Rik; 2310011G05Rik; 2810021O11Rik; 8430413N20Rik
Summary
Predicted to be involved in mRNA polyadenylation. Predicted to act upstream of or within mRNA processing. Located in nucleus. Orthologous to human WDR33 (WD repeat domain 33). [provided by Alliance of Genome Resources, Apr 2022]
Expression
Ubiquitous expression in CNS E11.5 (RPKM 5.8), limb E14.5 (RPKM 5.4) and 28 other tissues

Genomic context

Location:
18 B1; 18 17.85 cM
Exon count:
26
Annotation release  Status             Assembly                      Chr  Location
RS_2024_02          current            GRCm39 (GCF_000001635.27)     18   NC_000084.7 (31937079..32042040)
108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  18   NC_000084.6 (31804057..31908987)
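
The Location values above are written as an accession followed by a start..end range. Such ranges are 1-based and inclusive, so the span covered is end - start + 1. The following is a minimal sketch of that arithmetic; the helper names `parse_location` and `span_length` are illustrative and not part of any NCBI tool.

```python
import re

# Matches "ACCESSION.VERSION (START..END)", the Location format shown in the table.
LOCATION_RE = re.compile(r"([A-Z]{2}_\d+\.\d+)\s*\((\d+)\.\.(\d+)\)")


def parse_location(text):
    """Return (accession, start, end) parsed from a Location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    accession, start, end = match.groups()
    return accession, int(start), int(end)


def span_length(start, end):
    """Length in bp of a 1-based, inclusive range."""
    return end - start + 1


locations = {
    "GRCm39": "NC_000084.7 (31937079..32042040)",
    "GRCm38.p6": "NC_000084.6 (31804057..31908987)",
}

for assembly, location in locations.items():
    accession, start, end = parse_location(location)
    print(assembly, accession, span_length(start, end), "bp")
```

With the coordinates from the table, this reports a span of 104962 bp on GRCm39 and 104931 bp on GRCm38.p6.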

Chromosome 18 - NC_000084.7

Neighboring genes:
  • AMME chromosomal region gene 1-like
  • STARR-positive B cell enhancer ABC_E3226
  • predicted gene, 26533
  • STARR-positive B cell enhancer mm9_chr18:31963552-31963852
  • polymerase (RNA) II (DNA directed) polypeptide D
  • STARR-positive B cell enhancer ABC_E10983
  • glutathione S-transferase, mu 2 pseudogene
  • STARR-seq mESC enhancer starr_44149
  • CapStarr-seq enhancer MGSCv37_chr18:32069964-32070147
  • SFT2 domain containing 3
  • STARR-seq mESC enhancer starr_44150
  • LIM and senescent cell antigen like domains 2
  • predicted gene, 46619

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics (MGI)
  • Endonuclease-mediated (2)

General gene information

Gene Ontology Provided by MGI

Process                                                               Evidence Code                                      Pubs
involved_in mRNA 3'-end processing                                    IEA (Inferred from Electronic Annotation)
acts_upstream_of_or_within mRNA processing                            IEA (Inferred from Electronic Annotation)

Component                                                             Evidence Code                                      Pubs
part_of collagen trimer                                               IEA (Inferred from Electronic Annotation)
located_in fibrillar center                                           ISO (Inferred from Sequence Orthology)
part_of mRNA cleavage and polyadenylation specificity factor complex  IBA (Inferred from Biological aspect of Ancestor)
located_in nucleoplasm                                                ISO (Inferred from Sequence Orthology)
located_in nucleus                                                    IDA (Inferred from Direct Assay)                   PubMed
located_in nucleus                                                    ISO (Inferred from Sequence Orthology)

General protein information

Preferred Names
pre-mRNA 3' end processing protein WDR33
Names
WD repeat-containing protein 33
WD repeat-containing protein WDC146
WD repeat-containing protein of 146 kDa

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
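
The comparison described above can be sketched as follows. A RefSeq identifier has the form accession.version, so a mismatch is the same accession carrying a different version number in the two sections. The function names are illustrative, and the `annotated` list is HYPOTHETICAL: the versions reported under Genomic regions, transcripts, and products are not reproduced in this record, so placeholder values are used purely to show the mechanics.

```python
def split_accession(acc_ver):
    """Split 'NM_028866.3' into ('NM_028866', 3)."""
    accession, sep, version = acc_ver.rpartition(".")
    if not sep or not version.isdigit():
        raise ValueError(f"not an accession.version identifier: {acc_ver!r}")
    return accession, int(version)


def version_mismatches(curated, annotated):
    """Return (accession, curated_version, annotated_version) for each
    accession present in both lists with differing versions."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = []
    for acc_ver in curated:
        accession, version = split_accession(acc_ver)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches.append((accession, version, other))
    return mismatches


# Curated mRNA RefSeqs listed in this section.
curated = ["NM_001170966.1", "NM_001170967.1", "NM_001170970.1", "NM_028866.3"]

# HYPOTHETICAL versions standing in for the annotated genome build;
# these are invented for illustration and are not taken from this record.
annotated = ["NM_001170966.1", "NM_028866.2"]

print(version_mismatches(curated, annotated))
```

Accessions that appear in only one of the two lists are ignored here; a stricter check might report them separately.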

mRNA and Protein(s)

  1. NM_001170966.1 → NP_001164437.1  pre-mRNA 3' end processing protein WDR33 isoform 4

    Status: VALIDATED

    Description
    Transcript Variant: This variant (4) has multiple differences, compared to variant 1. The encoded isoform (4) is shorter and has a distinct C-terminus, compared to isoform 1.
    Source sequence(s)
    AC124393, AC161511, AK045923
    Consensus CDS
    CCDS89211.1
    UniProtKB/TrEMBL
    Q8BRC5, Q9D1P6
    Related
    ENSMUSP00000157238.2, ENSMUST00000234344.2
    Conserved Domains (3) summary
    COG2319
    Location: 127 → 219
    WD40; WD40 repeat [General function prediction only]
    sd00039
    Location: 122 → 158
    7WD40; WD40 repeat [structural motif]
    cl02567
    Location: 119 → 205
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
  2. NM_001170967.1 → NP_001164438.1  pre-mRNA 3' end processing protein WDR33 isoform 3

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3) has multiple differences, compared to variant 1. The encoded isoform (3) is shorter and has a distinct C-terminus, compared to isoform 1.
    Source sequence(s)
    AC124393, AC161511, AK078286
    Consensus CDS
    CCDS89210.1
    UniProtKB/TrEMBL
    D3YX80, Q8K1G7
    Related
    ENSMUSP00000080936.9, ENSMUST00000082319.15
    Conserved Domains (3) summary
    COG2319
    Location: 104 → 231
    WD40; WD40 repeat [General function prediction only]
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    cl02567
    Location: 119 → 230
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
  3. NM_001170970.1 → NP_001164441.1  pre-mRNA 3' end processing protein WDR33 isoform 2

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) has multiple differences, compared to variant 1. The encoded isoform (2) is shorter and has a distinct C-terminus, compared to isoform 1.
    Source sequence(s)
    AC124393, AC161511, AK009297
    Consensus CDS
    CCDS50242.1
    UniProtKB/TrEMBL
    A0A3Q4EGD8
    Related
    ENSMUSP00000157157.2, ENSMUST00000234957.2
    Conserved Domains (2) summary
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    cl29593
    Location: 119 → 230
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
  4. NM_028866.3 → NP_083142.2  pre-mRNA 3' end processing protein WDR33 isoform 1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).
    Source sequence(s)
    AC124393, AC131761, AC161511
    Consensus CDS
    CCDS29112.1
    UniProtKB/Swiss-Prot
    Q8C7C6, Q8CD02, Q8K4P0
    Related
    ENSMUSP00000025264.7, ENSMUST00000025264.8
    Conserved Domains (5) summary
    pfam01391
    Location: 717 → 769
    Collagen; Collagen triple helix repeat (20 copies)
    COG2319
    Location: 121 → 405
    WD40; WD40 repeat [General function prediction only]
    cd00200
    Location: 121 → 402
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    pfam09606
    Location: 602 → 963
    Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000084.7 Reference GRCm39 C57BL/6J

    Range
    31937079..32042040

mRNA and Protein(s)

  1. XM_006526300.2 → XP_006526363.1  pre-mRNA 3' end processing protein WDR33 isoform X2

    Conserved Domains (4) summary
    cd00200
    Location: 121 → 402
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    pfam09606
    Location: 590 → 926
    Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
    cl26593
    Location: 537 → 640
    DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
  2. XM_006526301.2 → XP_006526364.1  pre-mRNA 3' end processing protein WDR33 isoform X3

    Conserved Domains (3) summary
    cd00200
    Location: 121 → 402
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    pfam09606
    Location: 590 → 926
    Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
  3. XM_006526298.2 → XP_006526361.1  pre-mRNA 3' end processing protein WDR33 isoform X1

    Conserved Domains (4) summary
    cd00200
    Location: 121 → 402
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    pfam09606
    Location: 590 → 926
    Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
    cl26593
    Location: 537 → 640
    DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
  4. XM_036161297.1 → XP_036017190.1  pre-mRNA 3' end processing protein WDR33 isoform X5

    Conserved Domains (2) summary
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    cl29593
    Location: 119 → 230
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
  5. XM_017317996.3 → XP_017173485.1  pre-mRNA 3' end processing protein WDR33 isoform X1

    Conserved Domains (4) summary
    cd00200
    Location: 121 → 402
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    pfam09606
    Location: 590 → 926
    Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
    cl26593
    Location: 537 → 640
    DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
  6. XM_036161296.1 → XP_036017189.1  pre-mRNA 3' end processing protein WDR33 isoform X4

    UniProtKB/TrEMBL
    A0A3Q4EGD8
    Conserved Domains (2) summary
    sd00039
    Location: 122 → 159
    7WD40; WD40 repeat [structural motif]
    cl29593
    Location: 119 → 230
    WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...