AI987944 expressed sequence AI987944 [ Mus musculus (house mouse) ]

Gene ID: 233168, updated on 8-Feb-2024

Summary

Official Symbol
AI987944 (provided by MGI)
Official Full Name
expressed sequence AI987944 (provided by MGI)
Primary source
MGI:MGI:2142079
See related
Ensembl:ENSMUSG00000056383; AllianceGenome:MGI:2142079
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Summary
Predicted to enable DNA-binding transcription repressor activity, RNA polymerase II-specific and RNA polymerase II transcription regulatory region sequence-specific DNA binding activity. Predicted to be involved in negative regulation of transcription by RNA polymerase II. Is expressed in dorsal root ganglion and trigeminal ganglion. Orthologous to human ZNF101 (zinc finger protein 101). [provided by Alliance of Genome Resources, Apr 2022]
Expression
Ubiquitous expression in adult testis (RPKM 3.3), adult bladder (RPKM 2.9), and 28 other tissues.

Genomic context

Location:
7 B3; 7 28.25 cM
Exon count:
7
Annotation release  Status             Assembly                      Chr  Location
RS_2024_02          current            GRCm39 (GCF_000001635.27)     7    NC_000073.7 (41022347..41042772, complement)
108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  7    NC_000073.6 (41372923..41393379, complement)
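
The two placements above can be compared programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that "complement" denotes the minus strand; the `Placement` class and its field names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Placement:
    """One row of the location table (hypothetical helper type)."""
    assembly: str
    accession: str
    start: int   # assumed 1-based, inclusive
    stop: int    # assumed 1-based, inclusive
    strand: str  # "-" stands for "complement" in the table

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.stop - self.start + 1


placements = [
    Placement("GRCm39", "NC_000073.7", 41022347, 41042772, "-"),
    Placement("GRCm38.p6", "NC_000073.6", 41372923, 41393379, "-"),
]

for p in placements:
    print(f"{p.assembly}: {p.accession} strand {p.strand}, span {p.length} bp")
```

Under these assumptions the gene spans 20,426 bp on GRCm39 and 20,457 bp on GRCm38.p6, so the annotated extent differs slightly between the two assemblies.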

Chromosome 7 - NC_000073.7
Genomic context describing neighboring genes:

  • baculoviral IAP repeat-containing 4 pseudogene
  • zinc finger, MYND-type containing 8 pseudogene 2
  • STARR-positive B cell enhancer ABC_E3715
  • predicted gene 17102
  • vomeronasal 2, receptor 57
  • STARR-seq mESC enhancer starr_18727
  • STARR-seq mESC enhancer starr_18728

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

General gene information

Markers

Clone Names

  • MGC60751, MGC117553

Gene Ontology (provided by MGI)

Process

  • involved_in regulation of transcription by RNA polymerase II
    Evidence code: IBA (Inferred from Biological aspect of Ancestor)

Component

  • is_active_in nucleus
    Evidence code: IBA (Inferred from Biological aspect of Ancestor)

General protein information

Preferred Names
uncharacterized protein LOC233168

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
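
The comparison described above can be sketched in a few lines. This is only an illustration: it assumes the `accession.version` format used throughout this record, the function names are hypothetical, and the `annotated` list contains made-up versions because the "Genomic regions, transcripts, and products" section of this record lists none.

```python
from typing import Dict, Iterable, Tuple


def split_accession(acc_ver: str) -> Tuple[str, int]:
    """Split 'NM_183167.5' into ('NM_183167', 5)."""
    accession, _, version = acc_ver.partition(".")
    return accession, int(version)


def version_mismatches(
    standalone: Iterable[str], annotated: Iterable[str]
) -> Dict[str, Tuple[int, int]]:
    """Map each shared accession to (standalone, annotated) versions when they differ."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = {}
    for acc_ver in standalone:
        accession, version = split_accession(acc_ver)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches[accession] = (version, other)
    return mismatches


# Versions listed in this section of the record.
standalone = ["NM_183167.5", "NM_001199330.2"]
# Hypothetical versions from a genome build, for illustration only.
annotated = ["NM_183167.4", "NM_001199330.2"]

print(version_mismatches(standalone, annotated))
```

Accessions that appear in only one of the two lists are ignored here; a stricter check might report them separately.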

mRNA and Protein(s)

  1. NM_001199330.2 → NP_001186259.1  uncharacterized protein LOC233168 isoform 2

    See identical proteins and their annotated locations for NP_001186259.1

    Status: VALIDATED

    Source sequence(s)
    AC137152
    Consensus CDS
    CCDS85272.1
    UniProtKB/TrEMBL
    Q4KL68
    Related
    ENSMUSP00000145621.2, ENSMUST00000205338.2
    Conserved Domains (3)
    COG5048
    Location: 48..402
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location: 264..284
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam01352
    Location: 1..41
    KRAB; KRAB box
  2. NM_001421750.1 → NP_001408679.1  uncharacterized protein LOC233168 isoform 2

    Status: VALIDATED

    Source sequence(s)
    AC137152
    UniProtKB/TrEMBL
    Q4KL68
  3. NM_001421752.1 → NP_001408681.1  uncharacterized protein LOC233168 isoform 2

    Status: VALIDATED

    Source sequence(s)
    AC137152
    UniProtKB/TrEMBL
    Q4KL68
  4. NM_001421753.1 → NP_001408682.1  uncharacterized protein LOC233168 isoform 4

    Status: VALIDATED

    Source sequence(s)
    AC137152
  5. NM_001421754.1 → NP_001408683.1  uncharacterized protein LOC233168 isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC137152
  6. NM_001421755.1 → NP_001408684.1  uncharacterized protein LOC233168 isoform 6

    Status: VALIDATED

    Source sequence(s)
    AC137152
    UniProtKB/TrEMBL
    A0A0U1RNH4
  7. NM_001421756.1 → NP_001408685.1  uncharacterized protein LOC233168 isoform 7

    Status: VALIDATED

    Source sequence(s)
    AC137152
  8. NM_001421757.1 → NP_001408686.1  uncharacterized protein LOC233168 isoform 7

    Status: VALIDATED

    Source sequence(s)
    AC137152
  9. NM_183167.5 → NP_898990.1  uncharacterized protein LOC233168 isoform 1

    See identical proteins and their annotated locations for NP_898990.1

    Status: VALIDATED

    Source sequence(s)
    AC137152
    Consensus CDS
    CCDS39920.1
    UniProtKB/TrEMBL
    Q7TPX5
    Related
    ENSMUSP00000071708.8, ENSMUST00000071804.10
    Conserved Domains (5)
    smart00349
    Location: 4..45
    KRAB; krueppel associated box
    COG5048
    Location: 51..405
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location: 267..287
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam01352
    Location: 4..43
    KRAB; KRAB box
    pfam13465
    Location: 307..332
    zf-H2C2_2; Zinc-finger double domain

RNA

  1. NR_185325.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC137152

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000073.7 Reference GRCm39 C57BL/6J

    Range
    41022347..41042772 complement

mRNA and Protein(s)

  1. XM_011250853.3 → XP_011249155.1  uncharacterized protein LOC233168 isoform X1

    See identical proteins and their annotated locations for XP_011249155.1

    UniProtKB/TrEMBL
    Q4KL68
    Conserved Domains (3)
    COG5048
    Location: 48..402
    COG5048; FOG: Zn-finger [General function prediction only]
    sd00017
    Location: 264..284
    ZF_C2H2; C2H2 Zn finger [structural motif]
    pfam01352
    Location: 1..41
    KRAB; KRAB box