NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1908918643|ref|NP_001374109|]
View 

ataxin-2-like protein isoform 16 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.64e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


:

Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.64e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.56e-17

LsmAD domain; This domain is found associated with Lsm domain.


:

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.56e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 super family cl33720
large tegument protein UL36; Provisional
398-936 3.38e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.38e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  851 QALYATVHQSYPHHATQLHAHQPQPATTPTGSQPQSQHA-APSPVQAGQAPHLGSGQPQQnlyHPGALTGTPPSLPPGPS 929
Cdd:PHA03247  2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGALVPGR---VAVPRFRVPQPAPSREA 2988

                   ....*..
gi 1908918643  930 AQSPQSS 936
Cdd:PHA03247  2989 PASSTPP 2995
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.64e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.64e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.56e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.56e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-936 3.38e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.38e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  851 QALYATVHQSYPHHATQLHAHQPQPATTPTGSQPQSQHA-APSPVQAGQAPHLGSGQPQQnlyHPGALTGTPPSLPPGPS 929
Cdd:PHA03247  2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGALVPGR---VAVPRFRVPQPAPSREA 2988

                   ....*..
gi 1908918643  930 AQSPQSS 936
Cdd:PHA03247  2989 PASSTPP 2995
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
829-989 3.13e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 51.58  E-value: 3.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  829 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAHQPQPATTPtgsQPQSQHAAPSPVQAGQAPHLGSGQPQ 908
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQ---RPQSPQPDPAQPSIQPQAQQFHQQPP 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  909 QNLYHPGALTGTPPSLPPgPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphPPQVML 988
Cdd:pfam09770  294 PVPVQPTQILQNPNRLSA-ARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------PQQLAQ 354

                   .
gi 1908918643  989 L 989
Cdd:pfam09770  355 L 355
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.64e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.64e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.56e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.56e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-936 3.38e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.38e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  851 QALYATVHQSYPHHATQLHAHQPQPATTPTGSQPQSQHA-APSPVQAGQAPHLGSGQPQQnlyHPGALTGTPPSLPPGPS 929
Cdd:PHA03247  2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGALVPGR---VAVPRFRVPQPAPSREA 2988

                   ....*..
gi 1908918643  930 AQSPQSS 936
Cdd:PHA03247  2989 PASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
336-850 8.47e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 8.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  336 GRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPGP-----------GSEARGINGG 404
Cdd:PHA03247  2513 SRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPrpaprpsepavTSRARRPDAP 2592
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  405 PSRMSPKAQRPLRGAKTLSSPSNRPSGETSVPPPPAVGRmyPPRSPKSAAPAPISASCPEPPIGSAVPTSSASIPVTSSV 484
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  485 SDPGVGSISPASPKISLAPTDVKELST----KEPGRTLEPQELARIAGKVPGLQNEQKRfqleelrkfgaqfklqpSSSP 560
Cdd:PHA03247  2671 GRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEPAPHALVSATPLPPGPAAAR-----------------QASP 2733
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  561 ENSLDPFPPrilkeePKGKEKEVDGllTSEPMGSPVSSKTESVSdkedKPPLAPSGGTEGPEQPPPPCPSQTGSPpvGLI 640
Cdd:PHA03247  2734 ALPAAPAPP------AVPAGPATPG--GPARPARPPTTAGPPAP----APPAAPAAGPPRRLTRPAVASLSESRE--SLP 2799
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  641 KGEDKDEGPVAEQVKKSTLNPNAKEFNPTKPllsvnKSTSTPTSPGPRthSTPSIPVLTAGqsGLYSPQyisyipqihmG 720
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPP-----PTSAQPTAPPPP--PGPPPPSLPLG--GSVAPG----------G 2860
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  721 PAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQQFPGQP- 799
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPq 2940
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1908918643  800 AMMQPMAHYPSQPV---FAPMLQSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2941 PPLAPTTDPAGAGEpsgAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
829-989 3.13e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 51.58  E-value: 3.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  829 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAHQPQPATTPtgsQPQSQHAAPSPVQAGQAPHLGSGQPQ 908
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQ---RPQSPQPDPAQPSIQPQAQQFHQQPP 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  909 QNLYHPGALTGTPPSLPPgPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphPPQVML 988
Cdd:pfam09770  294 PVPVQPTQILQNPNRLSA-ARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------PQQLAQ 354

                   .
gi 1908918643  989 L 989
Cdd:pfam09770  355 L 355
PHA03247 PHA03247
large tegument protein UL36; Provisional
668-1035 3.52e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 3.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  668 PTKPLLSVNKSTSTPTSPGPrTHSTPSIPVLTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQ-QGKYRG 746
Cdd:PHA03247  2595 SARPRAPVDDRGDPRGPAPP-SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRaRRLGRA 2673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  747 AKGSLPPQRSdqHQPASAPPMMQAAAAAGPPLVAATPYssyiPYNPQQFPGQPAMMQPMAHYPSQPVfAPMLQSNPRMLT 826
Cdd:PHA03247  2674 AQASSPPQRP--RRRAARPTVGSLTSLADPPPPPPTPE----PAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPA 2746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  827 SGSHPQAIVSSSTPQYPSA-EQPTPQALYATVhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQAGQAPHLGSG 905
Cdd:PHA03247  2747 GPATPGGPARPARPPTTAGpPAPAPPAAPAAG----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  906 QPQQNLYHPGALTGTPPSLPPGPSAQS------------------PQSSFPQPAAVYAIHHQQLPHGFTNMAHVTQAHVQ 967
Cdd:PHA03247  2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSlplggsvapggdvrrrppSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPP 2902
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  968 TGITAAPPPHPGAPHPPQVMLLHPPQSHGGPPQGAVPQSGVPALSASTPSPYPYIGHPQGEQPGQAPG 1035
Cdd:PHA03247  2903 DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPG 2970
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
652-953 3.41e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 48.11  E-value: 3.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  652 EQVKKSTLNPNAKEFNPTKPLLSVNKSTSTPTSPGPrthSTPSIPVLTagqsGLYSPQYISYIPQIH-------MGPAVQ 724
Cdd:pfam09770   98 EQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQS---QQPSKPVRT----GYEKYKEPEPIPDLQvdaslwgVAPKKA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  725 APQMYPYPVSNSVPGQQGKYR---------------GAKGSLPPQRSDQHQPASAppmmqaaaaagpplvaatPYSSYIP 789
Cdd:pfam09770  171 AAPAPAPQPAAQPASLPAPSRkmmsleeveaamraqAKKPAQQPAPAPAQPPAAP------------------PAQQAQQ 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  790 YnpQQFPGQPAMMQPMAHYPSQPVFAPmlqsnprmltSGSHPQAIVssstpQYPSAEQPTPqalyatvhqsyphhatqlh 869
Cdd:pfam09770  233 Q--QQFPPQIQQQQQPQQQPQQPQQHP----------GQGHPVTIL-----QRPQSPQPDP------------------- 276
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  870 aHQPQPATTPTGSQPQSQHAAPSPVQAGQAPHLGSgqPQQNLYHPGALTGTPPslPPGPSAQSPQSSFPQPAAVYAiHHQ 949
Cdd:pfam09770  277 -AQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLS--AARVGYPQNPQPGVQP--APAHQAHRQQGSFGRQAPIIT-HPQ 350

                   ....
gi 1908918643  950 QLPH 953
Cdd:pfam09770  351 QLAQ 354
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
791-1006 6.97e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.95  E-value: 6.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  791 NPQQFPGQPAMMQPMAHYPSQPVFAPMLQSNPRMLTSG----SHPQAI----VSSS------TPQYPSAEQPTPQALYAT 856
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyekyKEPEPIpdlqVDASlwgvapKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  857 VH------------------QSYPHHATQLHAHQPQPATTPTGSQPQSQHAaPSPVQAGQAPHLGSGQPQQNLYHpgalt 918
Cdd:pfam09770  186 LPapsrkmmsleeveaamraQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQF-PPQIQQQQQPQQQPQQPQQHPGQ----- 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  919 GTPPSLPPGPSAQSPQSSFPQPAAVYAIHHQQLPhgfTNMAHVTQA-------HVQTGITAAPPPHPGAPHPPQVMLLHP 991
Cdd:pfam09770  260 GHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPP---PVPVQPTQIlqnpnrlSAARVGYPQNPQPGVQPAPAHQAHRQQ 336
                          250
                   ....*....|....*
gi 1908918643  992 PQSHGGPPQGAVPQS 1006
Cdd:pfam09770  337 GSFGRQAPIITHPQQ 351
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
374-696 9.35e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.61  E-value: 9.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  374 PGLSSLPPRGPHHLDNSS----PGPGSEARGINGGPSRMSP----KAQRPLRGAKTLSSP--SNRPSGETSVPPPPavgR 443
Cdd:PTZ00449   514 PEASGLPPKAPGDKEGEEgeheDSKESDEPKEGGKPGETKEgevgKKPGPAKEHKPSKIPtlSKKPEFPKDPKHPK---D 590
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  444 MYPPRSPKS--AAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPgVGSISPASPKISLAPTDVKE-LSTKEPGRTLEP 520
Cdd:PTZ00449   591 PEEPKKPKRprSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRP-PPPQRPSSPERPEGPKIIKSpKPPKSPKPPFDP 669
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  521 ---QELARIAGKVPGLQNEQKRFQL--EELRKFGAQFKLQPSSSPENSLDPFPPRIlkeepkgkekEVDGLLTSEPMGSP 595
Cdd:PTZ00449   670 kfkEKFYDDYLDAAAKSKETKTTVVldESFESILKETLPETPGTPFTTPRPLPPKL----------PRDEEFPFEPIGDP 739
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  596 VSsktESVSDKE-DKPPLAPSggtegpeqpPPPCPSQTGSPPVGLIKGEDKDEGPVAEqvkksTLNPNAKEFNPTKPlls 674
Cdd:PTZ00449   740 DA---EQPDDIEfFTPPEEER---------TFFHETPADTPLPDILAEEFKEEDIHAE-----TGEPDEAMKRPDSP--- 799
                          330       340
                   ....*....|....*....|..
gi 1908918643  675 vnkSTSTPTSPGprTHstPSIP 696
Cdd:PTZ00449   800 ---SEHEDKPPG--DH--PSLP 814
PRK10263 PRK10263
DNA translocase FtsK; Provisional
661-939 3.14e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.08  E-value: 3.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  661 PNAKEFNP-------TKPLLSVNKSTS-TPTSPGPRTHSTPSIPVLTAgQSGLYSPQY-ISYIPQIHMGPAVQAPQMYPY 731
Cdd:PRK10263   302 PEYDEYDPllngapiTEPVAVAAAATTaTQSWAAPVEPVTQTPPVASV-DVPPAQPTVaWQPVPGPQTGEPVIAPAPEGY 380
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  732 PvsnsvPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQqfPGQPAMMQPMAHYPSQ 811
Cdd:PRK10263   381 P-----QQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA--PEQPVAGNAWQAEEQQ 453
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  812 PVFAPmlqsNPRMLTSGSHPQAIVSSSTPQYPSAEQPT-----------------PQALYATVHQSYPHHATQLHA-HQ- 872
Cdd:PRK10263   454 STFAP----QSTYQTEQTYQQPAAQEPLYQQPQPVEQQpvvepepvveetkparpPLYYFEEVEEKRAREREQLAAwYQp 529
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1908918643  873 -PQPATTP---TGSQPQSQHAAPSPVQAGQA-PHLGSGQPQQNLYHPGALTGTPPSLPP----GPSAQSPQSSFPQ 939
Cdd:PRK10263   530 iPEPVKEPepiKSSLKAPSVAAVPPVEAAAAvSPLASGVKKATLATGAAATVAAPVFSLansgGPRPQVKEGIGPQ 605
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
870-943 6.88e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 6.88e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1908918643  870 AHQPQPATTPTGSQPQSqHAAPSPVQAGQAPHlGSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSSFPQPAAV 943
Cdd:PRK14971   389 APQPSAAAAASPSPSQS-SAAAQPSAPQSATQ-PAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPV 460
PRK10263 PRK10263
DNA translocase FtsK; Provisional
793-940 1.65e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 1.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  793 QQFPG-QPAMMQPMAHypSQPVFAPMlqsnPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAH 871
Cdd:PRK10263   709 QRYSGeQPAGANPFSL--DDFEFSPM----KALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQ 782
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1908918643  872 QPQPATtPTGSQPQSQHAAPSPVQAGQAPHlgsgQPQQNLYHPGALTGTPPSL--PPGPSAQSPQSSFPQP 940
Cdd:PRK10263   783 QPVAPQ-PQYQQPQQPVAPQPQYQQPQQPV----APQPQYQQPQQPVAPQPQYqqPQQPVAPQPQDTLLHP 848
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
395-552 2.58e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 41.68  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  395 GSEARGINGGPSRMSPKAQRPLRGAKTLSSPSNRPS-GETSVPPPPAvgrmypprspksaAPAPISASCPEPPIGSAVPT 473
Cdd:PRK14971   366 GDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSpSQSSAAAQPS-------------APQSATQPAGTPPTVSVDPP 432
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1908918643  474 SSASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRtlEPQELARIAGKVPGLQNEQKRFQLEELRKFGAQF 552
Cdd:PRK14971   433 AAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQ--EKAEQATGNIKEAPTGTQKEIFTEEDLQYYWQEF 509
PHA03247 PHA03247
large tegument protein UL36; Provisional
332-532 2.99e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 2.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  332 RQGSGRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHH-------------LDNSSPGPGSEA 398
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraarptvgsltslADPPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  399 RGINGGPSRMSPKAQRPLRGAK--TLSSPSNRPSGETSVPP-----------PPAVGRMYPPRSPKSAAP--APISASCP 463
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATPggparparpptTAGPPAPAPPAAPAAGPPrrLTRPAVAS 2790
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1908918643  464 EPPIGSAVPTSSASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRTLEPQELARIAGKVPG 532
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
359-528 3.19e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 3.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  359 GPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPGPGSEARGINGGPSRMSPKAQRPLRGAKTLSSPS-NRPSGETSVPP 437
Cdd:PRK07003   366 GAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPaTADRGDDAADG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  438 PPAVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRT 517
Cdd:PRK07003   446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAA 525
                          170
                   ....*....|.
gi 1908918643  518 LEPQELARIAG 528
Cdd:PRK07003   526 APPAPEARPPT 536
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
394-555 5.62e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 5.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  394 PGSEARGINGGPSRMSPKAQRPLRGAKTLSSPsnRPSGETSVPPPPAVGRMYPPRSP-----KSAAPAPISASCPEPPIG 468
Cdd:PRK14951   374 APAEKKTPARPEAAAPAAAPVAQAAAAPAPAA--APAAAASAPAAPPAAAPPAPVAApaaaaPAAAPAAAPAAVALAPAP 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  469 SAVPTSSASIPVTSSVSDPGVGS----ISPASPKISLAPTD--------VKELSTKEPGRTLePQELAriagkvpgLQNE 536
Cdd:PRK14951   452 PAQAAPETVAIPVRVAPEPAVASaapaPAAAPAAARLTPTEegdvwhatVQQLAAAEAITAL-ARELA--------LQSE 522
                          170       180
                   ....*....|....*....|....*...
gi 1908918643  537 ---------QKRFQLEELRKFGAQFKLQ 555
Cdd:PRK14951   523 lvardgdqwLLRVERESLNQPGARERLR 550
PHA03369 PHA03369
capsid maturational protease; Provisional
396-709 6.09e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 40.75  E-value: 6.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  396 SEARGINGGPSRMSPKAQRPLRGAKTLSSPSnRPSGETSVPPPPAVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTss 475
Cdd:PHA03369   355 APSRVLAAAAKVAVIAAPQTHTGPADRQRPQ-RPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPE-- 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  476 asiPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRTLEP--QELARIAGKVPGL--QNEQKRFQLEELRKFGAQ 551
Cdd:PHA03369   432 ---PVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElkEELIETLKLVKKLkeEQESLAKELEATAHKSEI 508
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  552 fklqpSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVSDKEDKPPLAPSGGTEGPEQPPPPCPSQ 631
Cdd:PHA03369   509 -----KKIAESEFKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAA 583
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918643  632 TGSPpvgliKGEDKDEGPVAEQVKKSTLNPnaKEFNPTKPLLSVNKSTSTPTSPgPRTHSTPSIPVLTAGQSGLYSPQ 709
Cdd:PHA03369   584 GQGS-----DTAEALAGAIETLLTQASAQP--AGLSLPAPAVPVNASTPASTPP-PLAPQEPPQPGTSAPSLETSLPQ 653
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
327-488 9.37e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 9.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  327 HSAVQRQGSGRESPSLASREGKyIPLPQRVREGPRGGVRcssSRGGRPGLSSLPPRGPHhldnSSPGPGSEARGINGGPS 406
Cdd:PHA03307   241 SSESSGCGWGPENECPLPRPAP-ITLPTRIWEASGWNGP---SSRPGPASSSSSPRERS----PSPSPSSPGSGPAPSSP 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918643  407 RMSPKAQRPLRGAKTLSSPSNRPSGETSVPPPPAVGRMYPPRSPKS-AAPAPISASCPEPPIGSAVPTSSASIPVTSSVS 485
Cdd:PHA03307   313 RASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPpADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392

                   ...
gi 1908918643  486 DPG 488
Cdd:PHA03307   393 AVA 395
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH