NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907132489|ref|XP_036017500|]
View 

cleavage and polyadenylation specificity factor subunit 7 isoform X4 [Mus musculus]

Protein Classification

RNA-binding protein( domain architecture ID 106745)

RNA-binding protein containing an RNA recognition motif (RRM)

CATH:  3.30.70.330
Gene Ontology:  GO:0003723
PubMed:  15853797
SCOP:  3000110

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RRM_SF super family cl17169
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
23-101 1.79e-49

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


The actual alignment was detected with superfamily member cd12644:

Pssm-ID: 473069 [Multi-domain]  Cd Length: 90  Bit Score: 161.90  E-value: 1.79e-49
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907132489  23 WTTDQQLIQVIRSIGVYDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQNLSQFEA 101
Cdd:cd12644    12 WTTDQDLINLIRSLGVKDVVELKFAENRANGQSKGYAEVVVASENSVHLLLELLPGKKLNGEKVDVRLATRQNLSQFEA 90
 
Name Accession Description Interval E-value
RRM_CFIm59 cd12644
RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or ...
23-101 1.79e-49

RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or CPSF7) and similar proteins; This subgroup corresponds to the RRM of CFIm59. Cleavage factor Im (CFIm) is a highly conserved component of the eukaryotic mRNA 3' processing machinery that functions in UGUA-mediated poly(A) site recognition, the regulation of alternative poly(A) site selection, mRNA export, and mRNA splicing. It is a complex composed of a small 25 kDa (CFIm25) subunit and a larger 59/68/72 kDa subunit. The two separate genes, CPSF6 and CPSF7, code for two isoforms of the large subunit, CFIm68 and CFIm59. The family includes CFIm59, also termed cleavage and polyadenylation specificity factor subunit 6 (CPSF7), or cleavage and polyadenylation specificity factor 59 kDa subunit (CPSF59). CFIm59 contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a central proline-rich region, and a C-terminal RS-like domain. The N-terminal RRM of CFIm59 mediates the interaction with CFIm25. It also serves to enhance RNA binding and facilitate RNA looping.


Pssm-ID: 410049 [Multi-domain]  Cd Length: 90  Bit Score: 161.90  E-value: 1.79e-49
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907132489  23 WTTDQQLIQVIRSIGVYDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQNLSQFEA 101
Cdd:cd12644    12 WTTDQDLINLIRSLGVKDVVELKFAENRANGQSKGYAEVVVASENSVHLLLELLPGKKLNGEKVDVRLATRQNLSQFEA 90
PLN03134 PLN03134
glycine-rich RNA-binding protein 4; Provisional
24-94 1.32e-04

glycine-rich RNA-binding protein 4; Provisional


Pssm-ID: 178680 [Multi-domain]  Cd Length: 144  Bit Score: 41.95  E-value: 1.32e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQ 94
Cdd:PLN03134   46 TDDASLRDAFAHFG--DVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVNPANDR 114
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
24-86 2.54e-03

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 36.06  E-value: 2.54e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAeNRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKV 86
Cdd:pfam00076  10 TTEEDLKDLFSKFG--PIKSIRLV-RDETGRSKGFAFVEFEDEEDAEKAIEALNGKELGGREL 69
 
Name Accession Description Interval E-value
RRM_CFIm59 cd12644
RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or ...
23-101 1.79e-49

RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or CPSF7) and similar proteins; This subgroup corresponds to the RRM of CFIm59. Cleavage factor Im (CFIm) is a highly conserved component of the eukaryotic mRNA 3' processing machinery that functions in UGUA-mediated poly(A) site recognition, the regulation of alternative poly(A) site selection, mRNA export, and mRNA splicing. It is a complex composed of a small 25 kDa (CFIm25) subunit and a larger 59/68/72 kDa subunit. The two separate genes, CPSF6 and CPSF7, code for two isoforms of the large subunit, CFIm68 and CFIm59. The family includes CFIm59, also termed cleavage and polyadenylation specificity factor subunit 6 (CPSF7), or cleavage and polyadenylation specificity factor 59 kDa subunit (CPSF59). CFIm59 contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a central proline-rich region, and a C-terminal RS-like domain. The N-terminal RRM of CFIm59 mediates the interaction with CFIm25. It also serves to enhance RNA binding and facilitate RNA looping.


Pssm-ID: 410049 [Multi-domain]  Cd Length: 90  Bit Score: 161.90  E-value: 1.79e-49
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907132489  23 WTTDQQLIQVIRSIGVYDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQNLSQFEA 101
Cdd:cd12644    12 WTTDQDLINLIRSLGVKDVVELKFAENRANGQSKGYAEVVVASENSVHLLLELLPGKKLNGEKVDVRLATRQNLSQFEA 90
RRM_CFIm68_CFIm59 cd12372
RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or ...
23-90 1.41e-28

RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or CPSF6), pre-mRNA cleavage factor Im 59 kDa subunit (CFIm59 or CPSF7), and similar proteins; This subfamily corresponds to the RRM of cleavage factor Im (CFIm) subunits. Cleavage factor Im (CFIm) is a highly conserved component of the eukaryotic mRNA 3' processing machinery that functions in UGUA-mediated poly(A) site recognition, the regulation of alternative poly(A) site selection, mRNA export, and mRNA splicing. It is a complex composed of a small 25 kDa (CFIm25) subunit and a larger 59/68/72 kDa subunit. Two separate genes, CPSF6 and CPSF7, code for two isoforms of the large subunit, CFIm68 and CFIm59. Structurally related CFIm68 and CFIm59, also termed cleavage and polyadenylation specificity factor subunit 6 (CPSF7), or cleavage and polyadenylation specificity factor 59 kDa subunit (CPSF59), are functionally redundant. Both contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a central proline-rich region, and a C-terminal RS-like domain. Their N-terminal RRM mediates the interaction with CFIm25, and also serves to enhance RNA binding and facilitate RNA looping.


Pssm-ID: 409807 [Multi-domain]  Cd Length: 76  Bit Score: 106.63  E-value: 1.41e-28
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907132489  23 WTTDQQLIQVIRSIGVYDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRP 90
Cdd:cd12372     9 WTTDEDLEGACASFGVVDVKEIKFFEHKANGKSKGYAYVEFASPAAAAAVKEKLEKREFNGRPCVVTP 76
RRM_CFIm68 cd12643
RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or ...
23-90 8.19e-25

RNA recognition motif (RRM) found in pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or CPSF6) and similar proteins; This subgroup corresponds to the RRM of CFIm68. Cleavage factor Im (CFIm) is a highly conserved component of the eukaryotic mRNA 3' processing machinery that functions in UGUA-mediated poly(A) site recognition, the regulation of alternative poly(A) site selection, mRNA export, and mRNA splicing. It is a complex composed of a small 25 kDa (CFIm25) subunit and a larger 59/68/72 kDa subunit. Two separate genes, CPSF6 and CPSF7, code for two isoforms of the large subunit, CFIm68 and CFIm59. The family includes CFIm68, also termed cleavage and polyadenylation specificity factor subunit 6 (CPSF6), or cleavage and polyadenylation specificity factor 68 kDa subunit (CPSF68), or protein HPBRII-4/7. CFIm68 contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a central proline-rich region, and a C-terminal RS-like domain. The N-terminal RRM of CFIm68 mediates the interaction with CFIm25. It also serves to enhance RNA binding and facilitate RNA looping.


Pssm-ID: 410048 [Multi-domain]  Cd Length: 77  Bit Score: 96.34  E-value: 8.19e-25
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907132489  23 WTTDQQLIQVIRSIGVYDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRP 90
Cdd:cd12643    10 WTTDEDLTEALHSIGVNDLLEIKFFENRANGQSKGFALIVVGSEASSRKLMDKLPKKELHGQNPVVTP 77
RRM2_NsCP33_like cd21608
RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ...
24-91 1.23e-04

RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ribonucleoprotein (NsCP33) and similar proteins; The family includes NsCP33, Arabidopsis thaliana chloroplastic 31 kDa ribonucleoprotein (CP31A) and mitochondrial glycine-rich RNA-binding protein 2 (AtGR-RBP2). NsCP33 may be involved in splicing and/or processing of chloroplast RNA's. AtCP31A, also called RNA-binding protein 1/2/3 (AtRBP33), or RNA-binding protein CP31A, or RNA-binding protein RNP-T, or RNA-binding protein cp31, is required for specific RNA editing events in chloroplasts and stabilizes specific chloroplast mRNAs, as well as for normal chloroplast development under cold stress conditions by stabilizing transcripts of numerous mRNAs under these conditions. CP31A may modulate telomere replication through RNA binding domains. AtGR-RBP2, also called AtRBG2, or glycine-rich protein 2 (AtGRP2), or mitochondrial RNA-binding protein 1a (At-mRBP1a), plays a role in RNA transcription or processing during stress. It binds RNAs and DNAs sequence with a preference to single-stranded nucleic acids. AtGR-RBP2 displays strong affinity to poly(U) sequence. It exerts cold and freezing tolerance, probably by exhibiting an RNA chaperone activity during the cold and freezing adaptation process. Some members in this family contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the second RRM motif.


Pssm-ID: 410187 [Multi-domain]  Cd Length: 76  Bit Score: 40.23  E-value: 1.23e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPA 91
Cdd:cd21608    11 TTEDDLRDLFSEFG--EVESAKVITDRETGRSRGFGFVTFSTAEAAEAAIDALNGKELDGRSIVVNEA 76
PLN03134 PLN03134
glycine-rich RNA-binding protein 4; Provisional
24-94 1.32e-04

glycine-rich RNA-binding protein 4; Provisional


Pssm-ID: 178680 [Multi-domain]  Cd Length: 144  Bit Score: 41.95  E-value: 1.32e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAENRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQ 94
Cdd:PLN03134   46 TDDASLRDAFAHFG--DVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVNPANDR 114
RRM_DAZL_BOULE cd12412
RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and ...
24-94 1.43e-04

RNA recognition motif (RRM) found in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE; This subfamily corresponds to the RRM domain of two Deleted in AZoospermia (DAZ) autosomal homologs, DAZL (DAZ-like) and BOULE. BOULE is the founder member of the family and DAZL arose from BOULE in an ancestor of vertebrates. The DAZ gene subsequently originated from a duplication transposition of the DAZL gene. Invertebrates contain a single DAZ homolog, BOULE, while vertebrates, other than catarrhine primates, possess both BOULE and DAZL genes. The catarrhine primates possess BOULE, DAZL, and DAZ genes. The family members encode closely related RNA-binding proteins that are required for fertility in numerous organisms. These proteins contain an RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a varying number of copies of a DAZ motif, believed to mediate protein-protein interactions. DAZL and BOULE contain a single copy of the DAZ motif, while DAZ proteins can contain 8-24 copies of this repeat. Although their specific biochemical functions remain to be investigated, DAZL proteins may interact with poly(A)-binding proteins (PABPs), and act as translational activators of specific mRNAs during gametogenesis.


Pssm-ID: 409846 [Multi-domain]  Cd Length: 81  Bit Score: 39.90  E-value: 1.43e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAENRAnGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPATRQ 94
Cdd:cd12412    14 TTEEELREFFSKFG--KVKDVKIIKDRA-GVSKGYGFVTFETQEDAEKIQKWGANLVFKGKKLNVGPAIRK 81
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
24-86 2.54e-03

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 36.06  E-value: 2.54e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907132489  24 TTDQQLIQVIRSIGvyDVVELKFAeNRANGQSKGYAEVVVASENSVHKLLELLPGKVLNGEKV 86
Cdd:pfam00076  10 TTEEDLKDLFSKFG--PIKSIRLV-RDETGRSKGFAFVEFEDEEDAEKAIEALNGKELGGREL 69
RRM_RBM18 cd12355
RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; ...
16-91 9.16e-03

RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; This subfamily corresponds to the RRM of RBM18, a putative RNA-binding protein containing a well-conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of RBM18 remains unclear.


Pssm-ID: 409791 [Multi-domain]  Cd Length: 80  Bit Score: 34.97  E-value: 9.16e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907132489  16 LDGPVimwtTDQQLIQVIRSIGvyDVVELKFAENRAN---GQSKGYAEVVVASENSVHKLLELLPGKVLNGEKVDVRPA 91
Cdd:cd12355     7 LDPRL----TEYHLLKLLSKYG--KIKKFDFLFHKTGplkGQPRGYCFVTFETKEEAEKAIECLNGKLALGKKLVVRWA 79
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH