NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907191599|ref|XP_036010251|]
View 

zinc finger protein 821 isoform X2 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COG2433 super family cl43687
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 6.61e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


The actual alignment was detected with superfamily member COG2433:

Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.77  E-value: 6.61e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433   432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 6.61e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.77  E-value: 6.61e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433   432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
143-204 1.39e-04

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.55  E-value: 1.39e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 143 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 204
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
125-237 4.61e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 41.03  E-value: 4.61e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 125 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 192
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 193 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 237
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
PTZ00121 PTZ00121
MAEBL; Provisional
122-208 9.96e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 40.51  E-value: 9.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  122 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 196
Cdd:PTZ00121  1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                           90
                   ....*....|..
gi 1907191599  197 REAKRLKRRLEK 208
Cdd:PTZ00121  1286 AEEKKKADEAKK 1297
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 6.61e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.77  E-value: 6.61e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433   432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
109-214 1.10e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.31  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 109 ALRRQNEPLEVRlqRLERERTAKKSRRDNETPEEREVRRMRDR------EAKRLQRMQETDEQRARRLQRDREAMRLKRA 182
Cdd:COG2433   381 ALEELIEKELPE--EEPEAEREKEHEERELTEEEEEIRRLEEQverleaEVEELEAELEEKDERIERLERELSEARSEER 458
                          90       100       110
                  ....*....|....*....|....*....|..
gi 1907191599 183 NETPEKRQARlIREREAKRLKRRLEKMDMMLR 214
Cdd:COG2433   459 REIRKDREIS-RLDREIERLERELEEERERIE 489
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
143-204 1.39e-04

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.55  E-value: 1.39e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 143 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 204
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
92-216 2.27e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.06  E-value: 2.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  92 AAYRKLLETQTPSVRKWALRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQ 171
Cdd:COG4717   119 EKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLA 198
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1907191599 172 RDREAMRLKRANETPEKRQARlireREAKRLKRRLEKMDMMLRAQ 216
Cdd:COG4717   199 EELEELQQRLAELEEELEEAQ----EELEELEEELEQLENELEAA 239
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
99-220 4.03e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 41.38  E-value: 4.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  99 ETQTPSVRKWALRRQNEPLEVRLQRLERErtAKKSRRDNETpEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMR 178
Cdd:COG2433   393 EEPEAEREKEHEERELTEEEEEIRRLEEQ--VERLEAEVEE-LEAELEEKDERIERLERELSEARSEERREIRKDREISR 469
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1907191599 179 LKRANETPEKRQARLirEREAKRLKRRLEKMDMMLRAQFGQD 220
Cdd:COG2433   470 LDREIERLERELEEE--RERIEELKRKLERLKELWKLEHSGE 509
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
125-237 4.61e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 41.03  E-value: 4.61e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 125 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 192
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1907191599 193 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 237
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
PTZ00121 PTZ00121
MAEBL; Provisional
122-208 9.96e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 40.51  E-value: 9.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  122 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 196
Cdd:PTZ00121  1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                           90
                   ....*....|..
gi 1907191599  197 REAKRLKRRLEK 208
Cdd:PTZ00121  1286 AEEKKKADEAKK 1297
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
113-208 1.06e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 40.11  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 113 QNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRM----QETDEQRARRLQRDREAMRLKRA------ 182
Cdd:pfam17380 416 QQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVerlrQQEEERKRKKLELEKEKRDRKRAeeqrrk 495
                          90       100
                  ....*....|....*....|....*....
gi 1907191599 183 ---NETPEKRQARLIREREAKRLKRRLEK 208
Cdd:pfam17380 496 ileKELEERKQAMIEEERKRKLLEKEMEE 524
PTZ00121 PTZ00121
MAEBL; Provisional
112-208 2.97e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 38.97  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  112 RQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmrdreAKRLQRMQETDEQRARRLQRD-----REAMRLKRANETP 186
Cdd:PTZ00121  1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK-----AEELKKAEEENKIKAAEEAKKaeedkKKAEEAKKAEEDE 1687
                           90       100
                   ....*....|....*....|..
gi 1907191599  187 EKRQARLIREREAKRLKRRLEK 208
Cdd:PTZ00121  1688 KKAAEALKKEAEEAKKAEELKK 1709
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
110-216 3.24e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 38.76  E-value: 3.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETpEKR 189
Cdd:COG1196   307 LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELE-ELA 385
                          90       100
                  ....*....|....*....|....*..
gi 1907191599 190 QARLIREREAKRLKRRLEKMDMMLRAQ 216
Cdd:COG1196   386 EELLEALRAAAELAAQLEELEEAEEAL 412
PTZ00121 PTZ00121
MAEBL; Provisional
122-208 6.07e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 37.81  E-value: 6.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599  122 QRLERERTAKKSRRDNETPEEREVRRM----RDREAKRLQRMQETDEqrARRLQRDREAMRLKRANETPEKRQARLIRER 197
Cdd:PTZ00121  1128 RKAEEARKAEDARKAEEARKAEDAKRVeiarKAEDARKAEEARKAED--AKKAEAARKAEEVRKAEELRKAEDARKAEAA 1205
                           90
                   ....*....|.
gi 1907191599  198 EAKRLKRRLEK 208
Cdd:PTZ00121  1206 RKAEEERKAEE 1216
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
120-233 6.96e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 37.61  E-value: 6.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 120 RLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIREREA 199
Cdd:COG1196   285 EAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAE 364
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1907191599 200 KRLKRRLEKMDMMLRAQFGQDPSAMAALAAEMNF 233
Cdd:COG1196   365 EALLEAEAELAEAEEELEELAEELLEALRAAAEL 398
PRK00247 PRK00247
putative inner membrane protein translocase component YidC; Validated
106-191 7.04e-03

putative inner membrane protein translocase component YidC; Validated


Pssm-ID: 178945 [Multi-domain]  Cd Length: 429  Bit Score: 37.52  E-value: 7.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 106 RKWALRRQN----------EPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREA---KRLQRMQETDEQRARRLQR 172
Cdd:PRK00247  304 FLWTLRRNRlrmiitpwraPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREInreARQERAAAMARARARRAAV 383
                          90
                  ....*....|....*....
gi 1907191599 173 DREAMRLKRANETPEKRQA 191
Cdd:PRK00247  384 KAKKKGLIDASPNEDTPSE 402
PRK12705 PRK12705
hypothetical protein; Provisional
120-216 7.78e-03

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 37.38  E-value: 7.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907191599 120 RLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLirerea 199
Cdd:PRK12705   27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKL------ 100
                          90
                  ....*....|....*..
gi 1907191599 200 KRLKRRLEKMDMMLRAQ 216
Cdd:PRK12705  101 DNLENQLEEREKALSAR 117
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH