NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1730154244|ref|NP_001359170|]
View 

adenomatous polyposis coli protein 2 isoform 6 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1799-2131 4.53e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


:

Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 317.20  E-value: 4.53e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
393-466 1.50e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


:

Pssm-ID: 465870  Cd Length: 74  Bit Score: 133.06  E-value: 1.50e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244  393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
43-94 4.45e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


:

Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 4.45e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244   43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
161-241 4.04e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


:

Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 1730154244  240 TS 241
Cdd:pfam11414   81 LI 82
Arm_APC_u3 super family cl25003
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
732-978 4.86e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


The actual alignment was detected with superfamily member pfam16629:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 84.25  E-value: 4.86e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1730154244  951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
650-689 1.37e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.37e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1730154244  650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1547-1986 7.41e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247  2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247  2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247  2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247  2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
2041-2238 2.90e-04

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.22  E-value: 2.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449   608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449   686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449   765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1412-1434 5.83e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 5.83e-04
                           10        20
                   ....*....|....*....|...
gi 1730154244 1412 DDSGTDSAEGTPVNFSSAASLSD 1434
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 3.01e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.43  E-value: 3.01e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1730154244  691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1289-1310 3.18e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.18e-03
                           10        20
                   ....*....|....*....|..
gi 1730154244 1289 SVRFTVEKPDENFSCASSLSAL 1310
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1177-1200 5.15e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.15e-03
                           10        20
                   ....*....|....*....|....
gi 1730154244 1177 SSSSENCVQETPLVLSRCSSVSSL 1200
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1327-1664 5.44e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 5.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247  2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247  2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247  2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1799-2131 4.53e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 317.20  E-value: 4.53e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
393-466 1.50e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 133.06  E-value: 1.50e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244  393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
43-94 4.45e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 4.45e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244   43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
161-241 4.04e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 1730154244  240 TS 241
Cdd:pfam11414   81 LI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
732-978 4.86e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 84.25  E-value: 4.86e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1730154244  951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
650-689 1.37e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.37e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1730154244  650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 2.07e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.07e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1730154244   649 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1811-1964 2.17e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 50.07  E-value: 2.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1811 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1887
Cdd:PTZ00449   523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 1888 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1964
Cdd:PTZ00449   596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
PHA03247 PHA03247
large tegument protein UL36; Provisional
1547-1986 7.41e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247  2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247  2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247  2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247  2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1633-1654 8.72e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 8.72e-05
                           10        20
                   ....*....|....*....|..
gi 1730154244 1633 SPRAEEELLQRCISLAMPRRRT 1654
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2041-2238 2.90e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.22  E-value: 2.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449   608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449   686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449   765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1412-1434 5.83e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 5.83e-04
                           10        20
                   ....*....|....*....|...
gi 1730154244 1412 DDSGTDSAEGTPVNFSSAASLSD 1434
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
21-123 2.67e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.58  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244   21 RPRTqtpGRIVALQELKMTSSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQearv 100
Cdd:COG4372     19 RPKT---GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA---- 91
                           90       100
                   ....*....|....*....|...
gi 1730154244  101 lVSSGQTEVLEQLKALQTDISSL 123
Cdd:COG4372     92 -AQAELAQAQEELESLQEEAEEL 113
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 3.01e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.43  E-value: 3.01e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1730154244  691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1289-1310 3.18e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.18e-03
                           10        20
                   ....*....|....*....|..
gi 1730154244 1289 SVRFTVEKPDENFSCASSLSAL 1310
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1177-1200 5.15e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.15e-03
                           10        20
                   ....*....|....*....|....
gi 1730154244 1177 SSSSENCVQETPLVLSRCSSVSSL 1200
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
PHA03247 PHA03247
large tegument protein UL36; Provisional
1327-1664 5.44e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 5.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247  2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247  2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247  2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1799-2131 4.53e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 317.20  E-value: 4.53e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1799 TMLRGRTVIYSAGpASRTQSKGISGPCTTPKKTgtsgttqpETVTKAPSPEQQRSRSLHRPGKISELAALRHPPRSATPP 1878
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1879 ARLAKTPSSSSSQTSPASQPLPRRSPLATPTGGPLPGPGGS------LVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1950
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1951 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRMASARSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2025
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2026 SSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSPRQPLAA--QRSPQAKPGLAPRAPRRTSSESPSRLPVRASPG 2103
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1730154244 2104 RPETVKRYASLPHISVSRRSDSAVSVPT 2131
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
393-466 1.50e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 133.06  E-value: 1.50e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1730154244  393 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQATCAVMKLSFDEEYRRAM 466
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
43-94 4.45e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 102.37  E-value: 4.45e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244   43 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 94
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
161-241 4.04e-18

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 80.76  E-value: 4.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  161 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDT-FSMQMDLIRQQLEFEAQHIRSLMEERFG 239
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTyFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 1730154244  240 TS 241
Cdd:pfam11414   81 LI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
732-978 4.86e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 84.25  E-value: 4.86e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  732 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQSlPEAETTSKKplpplRHLDGLVQDYASDSG 811
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  812 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEK-----------------EAGGEAAVA 874
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKdrsldrergaglsnfhpATENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244  875 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 950
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1730154244  951 AAHTSLSNDSLNSGSTSDGYCTREHMTP 978
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
650-689 1.37e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.37e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1730154244  650 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 2.07e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.07e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1730154244   649 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1811-1964 2.17e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 50.07  E-value: 2.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1811 GPASRTQSKGISGPCT---TPKKTGTSGTTQPETVTKAPSPEQQrsrslHRPGKISELAALRHPPRSATPPARlaKTPSS 1887
Cdd:PTZ00449   523 APGDKEGEEGEHEDSKesdEPKEGGKPGETKEGEVGKKPGPAKE-----HKPSKIPTLSKKPEFPKDPKHPKD--PEEPK 595
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 1888 SSSQTSPASQPLPRRSPlatptggplPGPGGSLVPKSPARALLAKQHKTQKSPVRIPFMQRPARRVPPPLARP--SPEP 1964
Cdd:PTZ00449   596 KPKRPRSAQRPTRPKSP---------KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkSPKP 665
PHA03247 PHA03247
large tegument protein UL36; Provisional
1834-2316 4.08e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 4.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1834 SGTTQPETVTKAPSPEQQRSRSLHRPGkiselaalrhpPRSATPPARLAKTPSSSSSQTSPASQPL-PRRSPLATPTGGP 1912
Cdd:PHA03247  2548 AGDPPPPLPPAAPPAAPDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSP 2616
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1913 LPGPGGSLVPKSPARALLAKQhKTQKSPVRIPFMQRPARRVPPPLARPSPEPGSRGRAGAEGTPGARGSRLGL-----VR 1987
Cdd:PHA03247  2617 LPPDTHAPDPPPPSPSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAArptvgSL 2695
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1988 MASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTASTSQAASP-------RRGRPALPAvflcssrcdel 2060
Cdd:PHA03247  2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpggpaRPARPPTTA----------- 2764
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2061 rvSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATRRG 2140
Cdd:PHA03247  2765 --GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2141 SDGEARPLP---RVAPPGTTWRR-------IKDEDVPHILRSTLPATALPLRVSSPEDSPAGTPQRKTSDAVVQTEDVAT 2210
Cdd:PHA03247  2843 PGPPPPSLPlggSVAPGGDVRRRppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2211 SKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEgLSAVIAGFPTSRHGSPSRAARVPPFNYVPS 2290
Cdd:PHA03247  2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR-VAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
                          490       500
                   ....*....|....*....|....*.
gi 1730154244 2291 PmAAATMASDSAVEKAPVSSPASLLE 2316
Cdd:PHA03247  3002 S-RVSSWASSLALHEETDPPPVSLKQ 3026
PHA03247 PHA03247
large tegument protein UL36; Provisional
1547-1986 7.41e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1547 PPPRRASAIPRALKREKPAGRKETP---SRAAQPATLPVRAQPRLIVDETPPcysltssaSSLSEPEAPEQPANHARGPE 1623
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGD--------PRGPAPPSPLPPDTHAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1624 QgskqdSSPSPRAEEELLQRCISLAMPRRRTQVP-GSRRRKPRALRSDIRPTEITQKCQ--EEVAGSDPASDLDSVEWQA 1700
Cdd:PHA03247  2628 P-----PSPSPAANEPDPHPPPTVPPPERPRDDPaPGRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1701 IQEGAnsivtwlhQAAAKASLEASSESDSLLSLVSGVSAGSTLQPSKLRKGRKPAAEAGGAWRPEKRGTTSTKINGSPRL 1780
Cdd:PHA03247  2703 PPPPT--------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1781 PNGPEKAKGTQKMMAGESTMLRGRTVIYSAGPASRTQSKGISGPCTTPKKTGTSGTTQPETVTKAPSPEQQRSRSLHRPG 1860
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1861 KISELAALRHPPRSATPPARLAktpsssssqtsPASQPLPRRSPLATPTGGPLPGPGGSLVPKSPARALLAKQHKTQKSP 1940
Cdd:PHA03247  2855 SVAPGGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1730154244 1941 VRIPFMQRPARrvPPPLARPSPEPGSRGRAGAEGTPGARGSRLGLV 1986
Cdd:PHA03247  2924 PPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1633-1654 8.72e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 8.72e-05
                           10        20
                   ....*....|....*....|..
gi 1730154244 1633 SPRAEEELLQRCISLAMPRRRT 1654
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2041-2238 2.90e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.22  E-value: 2.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2041 RRGRPALPAVFLC--SSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRrtSSESPSRLPVRASPGRPETV------KRYA 2112
Cdd:PTZ00449   608 RPKSPKLPELLDIpkSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK--SPKPPKSPKPPFDPKFKEKFyddyldAAAK 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2113 SLPHISVSRRSDSAVSVPTTQANATRRGSDGEARPLPRVAP--PGTTWRRIKDEDVPHILRSTLPATALPLRVSSPEdSP 2190
Cdd:PTZ00449   686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPrdEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHE-TP 764
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1730154244 2191 AGTPQRKTSDAVVQTEDVaTSKTNSSTSPSLESRDP----PQAPASGPVAPQ 2238
Cdd:PTZ00449   765 ADTPLPDILAEEFKEEDI-HAETGEPDEAMKRPDSPseheDKPPGDHPSLPK 815
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1947-2281 3.32e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 3.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1947 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRMASARSSGSESSDRSGFRRqltfiKESPGLLRRRRSELS 2026
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML-----RPVGSPGPPPAASPP 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2027 SADSTASTSQAASPRRGRPALPAvflcsSRCDELRVSPRQPLAAQRSPQAKPGLAPRAPRRTSSESPSRLPVRASPGRPE 2106
Cdd:PHA03307   154 AAGASPAAVASDAASSRQAALPL-----SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2107 TVKRYASlphisvsrRSDSAVSVPTTQANATRrgsdgEARPLPRvAPPGTTWRRIkDEDVPHILRSTL-----PATALPL 2181
Cdd:PHA03307   229 ADDAGAS--------SSDSSSSESSGCGWGPE-----NECPLPR-PAPITLPTRI-WEASGWNGPSSRpgpasSSSSPRE 293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2182 RVSSPEDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGPVAPQGSDVDGPVLTKPP----ASAPFP 2257
Cdd:PHA03307   294 RSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSR 373
                          330       340
                   ....*....|....*....|....
gi 1730154244 2258 HEGLSAVIAGFPTSRHGSPSRAAR 2281
Cdd:PHA03307   374 APSSPAASAGRPTRRRARAAVAGR 397
PHA03321 PHA03321
tegument protein VP11/12; Provisional
2068-2298 5.35e-04

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 45.33  E-value: 5.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2068 LAAQRSPqakPGlaPRAPRRTSSESPsrlPVRASPGRPETVKRYASlphisvSRRSDSAVSVPTTQANATRRGSDGEAR- 2146
Cdd:PHA03321   424 LLSSRQP---PG--APAPRRDNDPPP---PPRARPGSTPACARRAR------AQRARDAGPEYVDPLGALRRLPAGAAPp 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2147 PLPRVAPPGTTWRRIKDEDVPHiLRSTLPATALPLRVSSPedSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDP 2226
Cdd:PHA03321   490 PEPAAAPSPATYYTRMGGGPPR-LPPRNRATETLRPDWGP--PAAAPPEQMEDPYLEPDDDRFDRRDGAAAAATSHPREA 566
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2227 PqAPASGPVAPQGSDVDGPV-------------LTKPPASAPFPHEGLS---------AVIAGFPTSRHGSPSRAARVPP 2284
Cdd:PHA03321   567 P-APDDDPIYEGVSDSEEPVyeeiptprvyqnpLPRPMEGAGEPPDLDAptspwveeeNPIYGWGDSPLFSPPPAARFPP 645
                          250
                   ....*....|....
gi 1730154244 2285 FNYVPSPMAAATMA 2298
Cdd:PHA03321   646 PDPALSPEPPALPA 659
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1412-1434 5.83e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 5.83e-04
                           10        20
                   ....*....|....*....|...
gi 1730154244 1412 DDSGTDSAEGTPVNFSSAASLSD 1434
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
PHA03247 PHA03247
large tegument protein UL36; Provisional
2014-2234 1.78e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2014 SPGLLRRRRSELSSADSTASTSQAASPRRGRPALPAVFLCSSRCDELRVSP-RQPLAAQRSPQAKPGLAPRAP---RRTS 2089
Cdd:PHA03247   256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDdedGAME 335
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2090 SESP-----SRLPVRASPGRPETVKRYASLPHISVSRRSDSAVSVPTTQANATR-------RGSDGEARPLPRVAPPGTT 2157
Cdd:PHA03247   336 VVSPlprprQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPVPASV 415
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1730154244 2158 wrriKDEDVPHILRSTLPATALPLRVSSP--EDSPAGTPQRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPASGP 2234
Cdd:PHA03247   416 ----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADL 490
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
21-123 2.67e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.58  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244   21 RPRTqtpGRIVALQELKMTSSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQearv 100
Cdd:COG4372     19 RPKT---GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA---- 91
                           90       100
                   ....*....|....*....|...
gi 1730154244  101 lVSSGQTEVLEQLKALQTDISSL 123
Cdd:COG4372     92 -AQAELAQAQEELESLQEEAEEL 113
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 3.01e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.43  E-value: 3.01e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1730154244  691 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1289-1310 3.18e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.18e-03
                           10        20
                   ....*....|....*....|..
gi 1730154244 1289 SVRFTVEKPDENFSCASSLSAL 1310
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1177-1200 5.15e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.15e-03
                           10        20
                   ....*....|....*....|....
gi 1730154244 1177 SSSSENCVQETPLVLSRCSSVSSL 1200
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
PHA03247 PHA03247
large tegument protein UL36; Provisional
1327-1664 5.44e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 5.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1327 PPACPERAVGGGGHRRRDEAASRLDGPAPAGSRARSATDKeleALRECLG-----AAMPARLRKVASALVPGRRSLPVPv 1401
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGsltslADPPPPPPTPEPAPHALVSATPLP- 2722
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1402 ymLVPAPARGDDSGTDSAEGTPVNFSSAASLSDETlQGPSRDKPAGPGDRQKPTGRAAPArQTRSHRPKAAGAGKSTEHT 1481
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGP-PRRLTRPAVASLSESRESL 2798
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1482 RGPcRNRAGLELPLSRPQSARSNRDSSCQTRTRGDGALQS--------LCLTTPTEEAVYCFYD-SDEEPPATAPPPRRA 1552
Cdd:PHA03247  2799 PSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTapppppgpPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1553 SAIPRALKREKPAgrketPSRAAQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEAPEQPANHAR---GPEQGSKQD 1629
Cdd:PHA03247  2878 PARPPVRRLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQpplAPTTDPAGA 2952
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1730154244 1630 SSPSPRAEEELLQRCISLAMPRRRTQVPGSRRRKP 1664
Cdd:PHA03247  2953 GEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
CENP-F_leu_zip pfam10473
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ...
28-123 6.21e-03

Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.


Pssm-ID: 463102 [Multi-domain]  Cd Length: 140  Bit Score: 39.20  E-value: 6.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244   28 GRIVALQ-ELKMtsSMASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEarvlvssgQ 106
Cdd:pfam10473   24 DKVENLErELEM--SEENQELAILEAENSKAEVETLKAEIEEMAQNLRDLELDLVTLRSEKENLTKELQKK--------Q 93
                           90
                   ....*....|....*..
gi 1730154244  107 TEVLEqLKALQTDISSL 123
Cdd:pfam10473   94 ERVSE-LESLNSSLENL 109
PHA03247 PHA03247
large tegument protein UL36; Provisional
1898-2283 7.06e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1898 PLPRRSPLATPTGGPLpgpggslvPKSPARALLAKQHKTQKSPVRI----------------------PFMQRPA---RR 1952
Cdd:PHA03247  2496 PDPGGGGPPDPDAPPA--------PSRLAPAILPDEPVGEPVHPRMltwirgleelasddagdpppplPPAAPPAapdRS 2567
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 1953 VPPPLARPSP-EPGSRGRAGAEGTP--GARGSRLGLVRMASARSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSAD 2029
Cdd:PHA03247  2568 VPPPRPAPRPsEPAVTSRARRPDAPpqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2030 STASTSQAASPRRGRPalpavflcssrcdelrvsPRQPLAAQRSPQAKPglAPRAPRRTSSEsPSRLPVRASPGRPETVK 2109
Cdd:PHA03247  2648 PPERPRDDPAPGRVSR------------------PRRARRLGRAAQASS--PPQRPRRRAAR-PTVGSLTSLADPPPPPP 2706
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2110 RYASLPHISVsrrsdSAVSVPTTQANATRRGSDGEARPLPRVAPPGTTwrrikdEDVPHILRSTLPATALPLRVSSPEDs 2189
Cdd:PHA03247  2707 TPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPA------TPGGPARPARPPTTAGPPAPAPPAA- 2774
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1730154244 2190 PAGTPQRKTSDAVVQTEDVATSKTNSSTSPS---------LESRDPPQAPASGPVAPQGSDVDGPVLTKPPASAPFPHEG 2260
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPAdppaavlapAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
                          410       420
                   ....*....|....*....|...
gi 1730154244 2261 lsAVIAGFPTSRHGSPSRAARVP 2283
Cdd:PHA03247  2855 --SVAPGGDVRRRPPSRSPAAKP 2875
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH