NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568998040|ref|XP_006523311|]
View 

tubby-related protein 4 isoform X1 [Mus musculus]

Protein Classification

WD40 and Tub domain-containing protein( domain architecture ID 11455620)

protein containing domains WD40, SOCS, PHA03247, and Tub

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tub super family cl08308
Tub family;
1513-1583 1.79e-23

Tub family;


The actual alignment was detected with superfamily member pfam01167:

Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568998040  1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 7.02e-11

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 7.02e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319   176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319   246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
                         170
                  ....*....|...
gi 568998040  205 LLHESDGILSMSW 217
Cdd:COG2319   326 LTGHTGAVRSVAF 338
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1049-1380 2.10e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.10e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 568998040 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
SOCS super family cl02533
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region ...
373-408 3.85e-04

SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region of CIS/SOCS family proteins (in combination with a SH2 domain), ASBs (ankyrin repeat-containing proteins with a SOCS box), SSBs (SPRY domain-containing proteins with a SOCS box), and WSBs (WD40 repeat-containing proteins with a SOCS box), as well as, other miscellaneous proteins. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.


The actual alignment was detected with superfamily member cd03717:

Pssm-ID: 470605  Cd Length: 39  Bit Score: 39.50  E-value: 3.85e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 568998040  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717     2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
 
Name Accession Description Interval E-value
Tub pfam01167
Tub family;
1513-1583 1.79e-23

Tub family;


Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568998040  1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 7.02e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 7.02e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319   176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319   246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
                         170
                  ....*....|...
gi 568998040  205 LLHESDGILSMSW 217
Cdd:COG2319   326 LTGHTGAVRSVAF 338
PHA03247 PHA03247
large tegument protein UL36; Provisional
1049-1380 2.10e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.10e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 568998040 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
78-218 8.00e-07

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 52.72  E-value: 8.00e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200     5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568998040  158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200    84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1093-1349 8.02e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 8.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1093 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1161
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1162 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1234
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1235 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1314
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 568998040  1315 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1349
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
146-219 3.06e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 3.06e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568998040   146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894   16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
373-408 3.85e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.50  E-value: 3.85e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 568998040  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717     2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ...
375-409 6.51e-04

The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.51e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 568998040    375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ...
374-408 1.96e-03

SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.53  E-value: 1.96e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 568998040   374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.67e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.67e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 568998040     78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
Name Accession Description Interval E-value
Tub pfam01167
Tub family;
1513-1583 1.79e-23

Tub family;


Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568998040  1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 7.02e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 7.02e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319   176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319   246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
                         170
                  ....*....|...
gi 568998040  205 LLHESDGILSMSW 217
Cdd:COG2319   326 LTGHTGAVRSVAF 338
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 1.93e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 64.93  E-value: 1.93e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319   218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLW-DLATGELLRTLTGH 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319   288 SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
                         170
                  ....*....|...
gi 568998040  205 LLHESDGILSMSW 217
Cdd:COG2319   368 LTGHTGAVTSVAF 380
PHA03247 PHA03247
large tegument protein UL36; Provisional
1049-1380 2.10e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.10e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 568998040 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
WD40 COG2319
WD40 repeat [General function prediction only];
45-193 2.82e-07

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 54.92  E-value: 2.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   45 WLATGNGRGVVGVtftsshcrRDRSTPQRINFnLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319   260 LLASGSADGTVRL--------WDLATGELLRT-LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW-DLATGKLLRTLTGH 329
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568998040  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:COG2319   330 TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
78-218 8.00e-07

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 52.72  E-value: 8.00e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200     5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568998040  158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200    84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
PHA03247 PHA03247
large tegument protein UL36; Provisional
992-1344 2.21e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 2.21e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  992 RLTVPRYSIPTGDPPPYP------EIASQLAQGRSAA----QRLDNSLIHATLRRNNREVALKMAQLADSS-RAPLQPLA 1060
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPpppspsPAANEPDPHPPPTvpppERPRDDPAPGRVSRPRRARRLGRAAQASSPpQRPRRRAA 2688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1061 KPKGGAAGAVAQLPARPP---PALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAfte 1137
Cdd:PHA03247 2689 RPTVGSLTSLADPPPPPPtpePAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG--- 2765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1138 dealsqhcqlekplrhPPLPEAAVTMKRPPPYQWDPMLGEDVWVpqERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP 1217
Cdd:PHA03247 2766 ----------------PPAPAPPAAPAAGPPRRLTRPAVASLSE--SRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1218 PKSPSSPTATFPtgygmgmPYPGSYNNPSLP--GVQAPCSPkdaLSQAQFAQQESAVVLQPAYPPSLSyctLPPTYPGSS 1295
Cdd:PHA03247 2828 LPPPTSAQPTAP-------PPPPGPPPPSLPlgGSVAPGGD---VRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRS 2894
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 568998040 1296 TCSSVQLPPIALHPWNSYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPA 1344
Cdd:PHA03247 2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
67-218 1.77e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 48.49  E-value: 1.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   67 DRSTPQRINfNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiqyEGRWSVELVNDRG--AQVSDFTWSHDGTQALISY 144
Cdd:cd00200    79 DLETGECVR-TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW---DVETGKCLTTLRGhtDWVNSVAFSPDGTFVASSS 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  145 RDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:cd00200   155 QDGTIKLWDLRTGKcvatltgH-------TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAF 227

                  .
gi 568998040  218 N 218
Cdd:cd00200   228 S 228
PHA03378 PHA03378
EBNA-3B; Provisional
944-1346 2.27e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.30  E-value: 2.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  944 VEEVCRPRTRMLCSQNTYTLPGPGSSATLRLTATEKkVPQPCTSatlnrLTVPrYSIPTGDPPPYPEIASQLAQGRSAAQ 1023
Cdd:PHA03378  426 IEEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQP-LEGPTGP-----LSVQ-APLEPWQPLPHPQVTPVILHQPPAQG 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1024 RLDNSLIHATLRRNNREVALK-MAQLADssRAPLQPLAKPKG-----GAAGAVAQLPARPPPALYTCSQCSGAGP---SS 1094
Cdd:PHA03378  499 VQAHGSMLDLLEKDDEDMEQRvMATLLP--PSPPQPRAGRRApcvytEDLDIESDEPASTEPVHDQLLPAPGLGPlqiQP 576
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1095 QSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTdyVNSAFTEDEALSQHCQLEKPLRHPPLPEAAVTMK---RPPPY-- 1169
Cdd:PHA03378  577 LTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPT--TQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNvlvFPTPHqp 654
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1170 -QWDPMLGEDVWV-PQERTAQPTVPNPLKLSPLMLGQGQHLDVARVP--FVPPKSPSSPtATFPTGYGMGMPYPGSYNNP 1245
Cdd:PHA03378  655 pQVEITPYKPTWTqIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPtpMRPPAAPPGR-AQRPAAATGRARPPAAAPGR 733
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1246 SLPGVQAPcSPKDALSQAQFAQQESAVVLQPAYPPSLSYCTLPPTYPGsstcssvQLPPIalhpwnsystcpPMQNTQGT 1325
Cdd:PHA03378  734 ARPPAAAP-GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP-------QAPPA------------PQQRPRGA 793
                         410       420
                  ....*....|....*....|.
gi 568998040 1326 LPPKPhlvveKPLVSPPPAEL 1346
Cdd:PHA03378  794 PTPQP-----PPQAGPTSMQL 809
WD40 COG2319
WD40 repeat [General function prediction only];
65-218 3.46e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.98  E-value: 3.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319    61 LLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLW-DLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGS 139
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568998040  145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:COG2319   140 ADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS 213
WD40 COG2319
WD40 repeat [General function prediction only];
65-217 4.76e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.60  E-value: 4.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVnDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319    19 ALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASAS 97
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568998040  145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:COG2319    98 ADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1093-1349 8.02e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 8.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1093 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1161
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1162 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1234
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  1235 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1314
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 568998040  1315 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1349
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
989-1237 2.89e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 2.89e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  989 TLNRLTVPRYSIPTGDPPPYPEIASQLAQGRSAAqrldnslihatlrrnnreVALKMAQLADSSRAPLQPLAKPKGGAAG 1068
Cdd:PRK12323  356 TLLRMLAFRPGQSGGGAGPATAAAAPVAQPAPAA------------------AAPAAAAPAPAAPPAAPAAAPAAAAAAR 417
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1069 AVAQLPARPPP---ALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSrdrtdyVNSAFTEDEALSQHC 1145
Cdd:PRK12323  418 AVAAAPARRSPapeALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA------AAAPARAAPAAAPAP 491
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040 1146 QLEKPlrhPPLPEAAVTMKRPPPYQWDPMLGEDVW--VPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVPPKSPSS 1223
Cdd:PRK12323  492 ADDDP---PPWEELPPEFASPAPAQPDAAPAGWVAesIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA 568
                         250
                  ....*....|....
gi 568998040 1224 PTATFPTGYGMGMP 1237
Cdd:PRK12323  569 SASGLPDMFDGDWP 582
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
146-219 3.06e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 3.06e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568998040   146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894   16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
373-408 3.85e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.50  E-value: 3.85e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 568998040  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717     2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ...
375-409 6.51e-04

The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.51e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 568998040    375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ...
374-408 1.96e-03

SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.53  E-value: 1.96e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 568998040   374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
53-217 2.76e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.55  E-value: 2.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040   53 GVVGVTFTSSH-----CRRDRS-------TPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRwSVEL 120
Cdd:cd00200    95 YVSSVAFSPDGrilssSSRDKTikvwdveTGKCL-TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK-CVAT 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568998040  121 VNDRGAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:cd00200   173 LTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKclgtlrgH-------ENGVNSVAFSPDGYLLASGSEDGTIRV 245
                         170       180
                  ....*....|....*....|....*
gi 568998040  194 MD-CHGRMLAHVLLHESdGILSMSW 217
Cdd:cd00200   246 WDlRTGECVQTLSGHTN-SVTSLAW 269
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.67e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.67e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 568998040     78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH