|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1799-2131 |
4.53e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules. :
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 317.20 E-value: 4.53e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
393-466 |
1.50e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region. :
Pssm-ID: 465870 Cd Length: 74 Bit Score: 133.06 E-value: 1.50e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244 393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
43-94 |
4.45e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin. :
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 4.45e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
161-241 |
4.04e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils. :
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.04e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1730154244 240 TS 241
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 super family |
cl25003 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
732-978 |
4.86e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. The actual alignment was detected with superfamily member pfam16629:
Pssm-ID: 435476 Cd Length: 293 Bit Score: 84.25 E-value: 4.86e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1730154244 951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
650-689 |
1.37e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats. :
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.37e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1730154244 650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1547-1986 |
7.41e-05 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| PTZ00449 super family |
cl33186 |
104 kDa microneme/rhoptry antigen; Provisional |
2041-2238 |
2.90e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional The actual alignment was detected with superfamily member PTZ00449:
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 46.22 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1412-1434 |
5.83e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 5.83e-04
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
691-731 |
3.01e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats. :
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.01e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1730154244 691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1289-1310 |
3.18e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.18e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1177-1200 |
5.15e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.15e-03
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1327-1664 |
5.44e-03 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 5.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1799-2131 |
4.53e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 317.20 E-value: 4.53e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
393-466 |
1.50e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 133.06 E-value: 1.50e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244 393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
43-94 |
4.45e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 4.45e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
161-241 |
4.04e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.04e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1730154244 240 TS 241
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
732-978 |
4.86e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 84.25 E-value: 4.86e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1730154244 951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
650-689 |
1.37e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.37e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1730154244 650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
649-689 |
2.07e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.07e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1730154244 649 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1811-1964 |
2.17e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 50.07 E-value: 2.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1811 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1887
Cdd:PTZ00449 523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 1888 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1964
Cdd:PTZ00449 596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1547-1986 |
7.41e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1633-1654 |
8.72e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.42 E-value: 8.72e-05
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
2041-2238 |
2.90e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 46.22 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1412-1434 |
5.83e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 5.83e-04
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
21-123 |
2.67e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 21 RPRTqtpGRIVALQELKMTSSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQearv 100
Cdd:COG4372 19 RPKT---GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA---- 91
|
90 100
....*....|....*....|...
gi 1730154244 101 lVSSGQTEVLEQLKALQTDISSL 123
Cdd:COG4372 92 -AQAELAQAQEELESLQEEAEEL 113
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
691-731 |
3.01e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.01e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1730154244 691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1289-1310 |
3.18e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.18e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1177-1200 |
5.15e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.15e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1327-1664 |
5.44e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 5.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1799-2131 |
4.53e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 317.20 E-value: 4.53e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
393-466 |
1.50e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 133.06 E-value: 1.50e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244 393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
43-94 |
4.45e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 102.37 E-value: 4.45e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
161-241 |
4.04e-18 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 80.76 E-value: 4.04e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1730154244 240 TS 241
Cdd:pfam11414 81 LI 82
|
|
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
732-978 |
4.86e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 84.25 E-value: 4.86e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1730154244 951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
650-689 |
1.37e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.37e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1730154244 650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
649-689 |
2.07e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.07e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1730154244 649 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1811-1964 |
2.17e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 50.07 E-value: 2.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1811 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1887
Cdd:PTZ00449 523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 1888 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1964
Cdd:PTZ00449 596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1834-2316 |
4.08e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 4.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1834 SGTTQPETVTKAPSPEQQRSRSLHRPGkiselaalrhpPRSATPPARLAKTPSSSSSQTSPASQPL-PRRSPLATPTGGP 1912
Cdd:PHA03247 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSP 2616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1913 LPGPGGSLVPKSPARALLAKQhKTQKSPVRIPFMQRPARRVPPPLARPSPEPGSRGRAGAEGTPGARGSRLGL-----VR 1987
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAArptvgSL 2695
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1988 MASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTASTSQAASP-------RRGRPALPAvflcssrcdel 2060
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpggpaRPARPPTTA----------- 2764
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2061 rvSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATRRG 2140
Cdd:PHA03247 2765 --GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2141 SDGEARPLP---RVAPPGTTWRR-------IKDEDVPHILRSTLPATALPLRVSSPEDSPAGTPQRKTSDAVVQTEDVAT 2210
Cdd:PHA03247 2843 PGPPPPSLPlggSVAPGGDVRRRppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2211 SKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEgLSAVIAGFPTSRHGSPSRAARVPPFNYVPS 2290
Cdd:PHA03247 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR-VAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
|
490 500
....*....|....*....|....*.
gi 1730154244 2291 PmAAATMASDSAVEKAPVSSPASLLE 2316
Cdd:PHA03247 3002 S-RVSSWASSLALHEETDPPPVSLKQ 3026
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1547-1986 |
7.41e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 7.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247 2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247 2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247 2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1633-1654 |
8.72e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.42 E-value: 8.72e-05
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
2041-2238 |
2.90e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 46.22 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449 608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449 765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1947-2281 |
3.32e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 46.32 E-value: 3.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1947 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRMASARSSGSESSDRSGFRRqltfiKESPGLLRRRRSELS 2026
Cdd:PHA03307 80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML-----RPVGSPGPPPAASPP 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2027 SADSTASTSQAASPRRGRPALPAvflcsSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPE 2106
Cdd:PHA03307 154 AAGASPAAVASDAASSRQAALPL-----SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2107 TVKRYASlphisvsrRSDSAVSVPTTQANATRrgsdgEARPLPRvAPPGTTWRRIkDEDVPHILRSTL-----PATALPL 2181
Cdd:PHA03307 229 ADDAGAS--------SSDSSSSESSGCGWGPE-----NECPLPR-PAPITLPTRI-WEASGWNGPSSRpgpasSSSSPRE 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2182 RVSSPEDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPP----ASAPFP 2257
Cdd:PHA03307 294 RSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSR 373
|
330 340
....*....|....*....|....
gi 1730154244 2258 HEGLSAVIAGFPTSRHGSPSRAAR 2281
Cdd:PHA03307 374 APSSPAASAGRPTRRRARAAVAGR 397
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
2068-2298 |
5.35e-04 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 45.33 E-value: 5.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2068 LAAQRSPqakPGlaPRAPRRTSSESPsrlPVRASPGRPETVKRYASlphisvSRRSDSAVSVPTTQANATRRGSDGEAR- 2146
Cdd:PHA03321 424 LLSSRQP---PG--APAPRRDNDPPP---PPRARPGSTPACARRAR------AQRARDAGPEYVDPLGALRRLPAGAAPp 489
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2147 PLPRVAPPGTTWRRIKDEDVPHiLRSTLPATALPLRVSSPedSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDP 2226
Cdd:PHA03321 490 PEPAAAPSPATYYTRMGGGPPR-LPPRNRATETLRPDWGP--PAAAPPEQMEDPYLEPDDDRFDRRDGAAAAATSHPREA 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2227 PqAPASGPVAPQGSDVDGPV-------------LTKPPASAPFPHEGLS---------AVIAGFPTSRHGSPSRAARVPP 2284
Cdd:PHA03321 567 P-APDDDPIYEGVSDSEEPVyeeiptprvyqnpLPRPMEGAGEPPDLDAptspwveeeNPIYGWGDSPLFSPPPAARFPP 645
|
250
....*....|....
gi 1730154244 2285 FNYVPSPMAAATMA 2298
Cdd:PHA03321 646 PDPALSPEPPALPA 659
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1412-1434 |
5.83e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 5.83e-04
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2014-2234 |
1.78e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2014 SPGLLRRRRSELSSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSP-RQPLAAQRSPQAKPGLAPRAP---RRTS 2089
Cdd:PHA03247 256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDdedGAME 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2090 SESP-----SRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATR-------RGSDGEARPLPRVAPPGTT 2157
Cdd:PHA03247 336 VVSPlprprQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPVPASV 415
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 2158 wrriKDEDVPHILRSTLPATALPLRVSSP--EDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGP 2234
Cdd:PHA03247 416 ----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADL 490
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
21-123 |
2.67e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 21 RPRTqtpGRIVALQELKMTSSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQearv 100
Cdd:COG4372 19 RPKT---GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA---- 91
|
90 100
....*....|....*....|...
gi 1730154244 101 lVSSGQTEVLEQLKALQTDISSL 123
Cdd:COG4372 92 -AQAELAQAQEELESLQEEAEEL 113
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
691-731 |
3.01e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.01e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1730154244 691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1289-1310 |
3.18e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.18e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1177-1200 |
5.15e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.15e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1327-1664 |
5.44e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 5.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247 2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247 2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
|
330 340 350
....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247 2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
|
|
| CENP-F_leu_zip |
pfam10473 |
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ... |
28-123 |
6.21e-03 |
|
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.
Pssm-ID: 463102 [Multi-domain] Cd Length: 140 Bit Score: 39.20 E-value: 6.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 28 GRIVALQ-ELKMtsSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEarvlvssgQ 106
Cdd:pfam10473 24 DKVENLErELEM--SEENQELAILEAENSKAEVETLKAEIEEMAQNLRDLELDLVTLRSEKENLTKELQKK--------Q 93
|
90
....*....|....*..
gi 1730154244 107 TEVLEqLKALQTDISSL 123
Cdd:pfam10473 94 ERVSE-LESLNSSLENL 109
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1898-2283 |
7.06e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 7.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1898 PLPRRSPLATPTGGPLpgpggslvPKSPARALLAKQHKTQKSPVRI----------------------PFMQRPA---RR 1952
Cdd:PHA03247 2496 PDPGGGGPPDPDAPPA--------PSRLAPAILPDEPVGEPVHPRMltwirgleelasddagdpppplPPAAPPAapdRS 2567
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1953 VPPPLARPSP-EPGSRGRAGAEGTP--GARGSRLGLVRMASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSAD 2029
Cdd:PHA03247 2568 VPPPRPAPRPsEPAVTSRARRPDAPpqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2030 STASTSQAASPRRGRPalpavflcssrcdelrvsPRQPLAAQRSPQAKPglAPRAPRRTSSEsPSRLPVRASPGRPETVK 2109
Cdd:PHA03247 2648 PPERPRDDPAPGRVSR------------------PRRARRLGRAAQASS--PPQRPRRRAAR-PTVGSLTSLADPPPPPP 2706
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2110 RYASLPHISVsrrsdSAVSVPTTQANATRRGSDGEARPLPRVAPPGTTwrrikdEDVPHILRSTLPATALPLRVSSPEDs 2189
Cdd:PHA03247 2707 TPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPA------TPGGPARPARPPTTAGPPAPAPPAA- 2774
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2190 PAGTPQRKTSDAVVQTEDVATSKTNSSTSPS---------LESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEG 2260
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPAdppaavlapAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
410 420
....*....|....*....|...
gi 1730154244 2261 lsAVIAGFPTSRHGSPSRAARVP 2283
Cdd:PHA03247 2855 --SVAPGGDVRRRPPSRSPAAKP 2875
|
|
|