Conserved domains on  [gi|568999406|ref|XP_006523908|]

polycystin-1 isoform X12 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
97-2723 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 3842.37  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    97 DLSNNRISTLEEGVFANLFNLSEINLSGNPFECNCGLAWLPRWAKEHQVHVVQSEATTCRGPIPLAGQPLLSIPLLDNAC 176
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   177 GEEYVACLPDNSSGAVAA----VPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWCSSIS 252
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAArselVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   253 LSLNSACGGPTLLQHTFPASPGATLVGPHGPLASGQPADFHITSSLPISSTRWNFGDGSPEVDMASP----AATHFYVLP 328
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   329 GSYHMTVVLALGAGSALLETEVQVEATPTVLELVCPSFVHSNESLELGIRHRGGSALEVTYSILALDKEPAQVVHPLCPL 408
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   409 DTEIFPGNGHCYRLVAEKAPWLQAQEQCRTWAGAALAMVDSPAIQHFLVSKVTRSLD--VWIGFSSVEGTE-GLDPRGEA 485
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEkGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   486 FSLESCQNWLPGEPHPATAEHCVRLGPAGQCNTDLCSAPHSYVCELRPGGPVWDTENFVMGMSGGGLSGPLHPLAQQETV 565
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   566 QGPLR-PVEVMVFPGLSPSREAFLTAAEFSTQKLEEPAQMRLQVYRPSGG--AAAVPEGSSEP-----DNRTEPAPKCVP 637
Cdd:TIGR00864  481 PAPPHeGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRlpGAGGPACGPEAecrppDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   638 EELWCPGANVCIPFDASCNSHVCINGSVSRLGLSRAS----YTLWKEFFFSVPAGPPTQYLVTLHSQDVPMLPGDLIGLQ 713
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   714 HDAGPGTLLQCPLASSCPG-QALYLSTNASDWM------------------------TNLPVHLEEAWAG----PVCSLQ 764
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGpRAPVFAANASEWFghnntpvppdnlagdgadplpdpeLDLKALLEGTRASwlecAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   765 LLLVTERLTPLLGLGPNPGLQHPGHYEVRATVGNSVSRQNLSCSFSVVSPIAGLRVIHPIPLDGHIYVPTNGSVLVLQVD 844
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   845 SGANATATAQWFGGNISAPFEDACPPEVDFLKQD-------CTEEANGTLFSVLMLPRLKEGDHT----VEIVAQNGASQ 913
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFASRLCHpstfeggCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   914 ANLSLRVTAEEPICGLRAVPSPEARVLQGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLTAS 993
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSLTAM 960
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   994 NHVSNITVNYNVTVERMNKMHGLWVSAVPTVLPPNATLALTGGVLVDSAVEVAFLWNFGDGEQVLRQFKPPYDESFQVPD 1073
Cdd:TIGR00864  961 NHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPD 1040
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1074 PTVAQVLVEHNTTHIYTTPGEYNLTVLVSNTYENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGVLY 1153
Cdd:TIGR00864 1041 PSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHY 1120
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1154 TWDFGDGSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVESGD 1233
Cdd:TIGR00864 1121 EWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGD 1200
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1234 NITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFVLEVLHIEPSTCIPTQPSAQLMAHVTG 1313
Cdd:TIGR00864 1201 NITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSG 1280
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1314 DPVHYLFDWTFGDGSSNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSICVEPEIRNITLQPERQFVKLGDEAR 1393
Cdd:TIGR00864 1281 NAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQ 1360
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1394 LVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQEPVLVTGIRINGSH-- 1471
Cdd:TIGR00864 1361 FQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHgn 1440
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1472 VLELQQPYLLSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINASR 1551
Cdd:TIGR00864 1441 NLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASL 1520
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1552 TVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQV 1631
Cdd:TIGR00864 1521 TNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQI 1600
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1632 AGGD-----------NGCCFPTNYTLQLQAAVRDGTNISYSWTA--QQEGSLITLFGSGKCFSLTSLKASTYYVHLRATN 1698
Cdd:TIGR00864 1601 LGETaegggggvqelDGCYFETNHTVQFHAGFKDGTNLSFSWNAilDNEPDGPAFAGSGKGAKLNPLEAGPCDIFLQAAN 1680
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1699 MLGSAAANRTIDFVEPVESLILSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRV 1778
Cdd:TIGR00864 1681 LLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTM 1760
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1779 TAENQLGSVNATVEVAIQVPVGGLSIRTSEP-DSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGS-KDSQYIAVRFSTAG 1856
Cdd:TIGR00864 1761 KAFNELGSANASEEVDVQEPISGLKIRAADAgEQNFFAADSSVCFQGELATGTNVSWCWAIDGGSsKMGKHACMTFPDAG 1840
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1857 SFSLQLNASNAVSWVSAMYNLTVEEPIVNLMLWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVL-PSPHFSHS 1935
Cdd:TIGR00864 1841 TFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHS 1920
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1936 FFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDSLV 2015
Cdd:TIGR00864 1921 FPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLV 2000
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2016 ILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWDFG 2093
Cdd:TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWDFG 2080
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2094 DGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDLRN 2173
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2174 CVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLARS 2240
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2241 IQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SSVV 2319
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2320 TIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYKQG 2399
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2400 CWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL-- 2477
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGge 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2478 ------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PLFV 2544
Cdd:TIGR00864 2481 tgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2545 VSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAPD- 2623
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDs 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2624 VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGTLT 2703
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 568999406  2704 PTTIADSILNITGDLIHLAS 2723
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3704-3882 6.23e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP), including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.



Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.23e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3704 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3768
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3769 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3843
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 568999406  3844 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3882
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3883-4104 2.09e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.



Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.09e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3883 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3959
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3960 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4039
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  4040 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4104
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3110-3229 6.16e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans, that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause of autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.



Pssm-ID: 238850  Cd Length: 120  Bit Score: 194.03  E-value: 6.16e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3187
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 568999406 3188 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3229
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.37e-13

Leucine rich repeat;



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.16  E-value: 2.37e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406    61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
GPS super family cl02559
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
3003-3052 9.14e-10

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


The actual alignment was detected with superfamily member smart00303:

Pssm-ID: 470616  Cd Length: 49  Bit Score: 56.63  E-value: 9.14e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 568999406   3003 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3052
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.69e-05

Leucine rich repeat N-terminal domain;



Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.69e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 568999406     32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
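Each alignment block in the hit list above pairs a query row (`gi 568999406`) with a model row (`Cdd:...`), and every row carries a label, a start residue, the aligned sequence, and an end residue. The following is a minimal Python sketch, assuming that row layout and assuming `-` marks a gap, that parses one such pair and counts identical columns. The function names (`parse_row`, `span_is_consistent`, `identity`) are illustrative and not part of any NCBI tool, and the identity convention used here (identical residues over columns with no gap in either row, compared case-insensitively) is only one of several in common use.

```python
import re

# Assumed row layout: label, start residue, aligned sequence, end residue.
ROW = re.compile(
    r"^(?P<label>\S.*?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Split one alignment row into (label, start, sequence, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def span_is_consistent(start, seq, end):
    """Check that the residue span matches the number of non-gap characters."""
    residues = sum(1 for ch in seq if ch != "-")
    return end - start + 1 == residues


def identity(query_seq, subject_seq):
    """Return (identical, compared), skipping columns with a gap in either row."""
    if len(query_seq) != len(subject_seq):
        raise ValueError("aligned rows must have the same length")
    identical = compared = 0
    for q, s in zip(query_seq, subject_seq):
        if q == "-" or s == "-":
            continue
        compared += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, compared


# The LRRNT (smart00013) rows from the hit list above.
query = parse_row("gi 568999406     32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71")
model = parse_row("Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33")

same, total = identity(query[2], model[2])
print(f"{same} identical of {total} gap-free columns")  # 15 identical of 33 gap-free columns
```

The span check is a cheap way to catch rows damaged by copy and paste: for an intact row, the end residue minus the start residue plus one equals the number of non-gap characters.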
 
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1936 FFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDSLV 2015
Cdd:TIGR00864 1921 FPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLV 2000
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2016 ILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWDFG 2093
Cdd:TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWDFG 2080
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2094 DGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDLRN 2173
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2174 CVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLARS 2240
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2241 IQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SSVV 2319
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2320 TIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYKQG 2399
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2400 CWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL-- 2477
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGge 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2478 ------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PLFV 2544
Cdd:TIGR00864 2481 tgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2545 VSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAPD- 2623
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDs 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2624 VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGTLT 2703
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 568999406  2704 PTTIADSILNITGDLIHLAS 2723
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2167-2610 2.40e-116

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 378.00  E-value: 2.40e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2167 AHVDLRNCVS-YQTEYRWEIYRTASCqrpGRMAQMVLPgVDVSRPQLVVPRLALPVGHYCFVFVVSFGDTP-LARSIQAN 2244
Cdd:pfam02010    1 ASVELNGCFSaYTIDYLWSVFTVSSN---LNLQTISSP-KDLVLPQLTIPSGTLPYGTYVFTLTVSLSSTPsLAGTDIIT 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2245 VTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVAST------QSETGGCV-----LNFGP 2313
Cdd:pfam02010   77 VTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGTLLQ 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2314 RGSSVVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGHCHNCS 2393
Cdd:pfam02010  157 STSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCLNCS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2394 RGYKQGC--WAARTFSNKTLVLN--ETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLG 2469
Cdd:pfam02010  235 SDLSDVTyrWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAPPTG 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2470 GSCRLFPLDSVRgLTTKVHFECTGWRDAEDggaPLVYALLLKRCRQSYCENFCIYKGSLST-YGAVLPPGF-QPLFVVSL 2547
Cdd:pfam02010  315 GTCSVTPTEGTA-LETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQIsISTFLPPGLpANDYQVTV 390
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  2548 AVVVQDQLGAAvVALNRSLTIVLPEPSgnpaDLVPWLHSLTASVLPGLLKQADPQHVIEYSLA 2610
Cdd:pfam02010  391 VVVVYDSLGAA-TSVSLTITVTPPSSS----DELLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3704-3882 6.23e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.23e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3704 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3768
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3769 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3843
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 568999406  3844 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3882
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3883-4104 2.09e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.09e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3883 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3959
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3960 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4039
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  4040 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4104
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3110-3229 6.16e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause of autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 194.03  E-value: 6.16e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3187
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 568999406 3188 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3229
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3112-3215 1.86e-22

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.19  E-value: 1.86e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3112 YEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGDR-AFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3190
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 568999406  3191 HIIVRDLQSARSTF-FLVNDWLSVET 3215
Cdd:pfam01477   81 SITVEVPGETGGKYtFPCNSWVYGSK 106
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
177-271 3.54e-19

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 85.21  E-value: 3.54e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    177 GEEYVACLPDNSSGAVAAVPFYFAHegPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWC--SSISLS 254
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSVSASDSSQcsTTCSGY 78
                            90
                    ....*....|....*..
gi 568999406    255 LNSACGGPTLLQHTFPA 271
Cdd:smart00321   79 PAEVCGGPNRLSVYVLA 95
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3110-3212 5.52e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 85.00  E-value: 5.52e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD--GDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKglSPAW 3187
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 568999406   3188 FLQHIIVRDLQSARSTFFLVNDWLS 3212
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
418-530 1.18e-17

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 81.51  E-value: 1.18e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  418 HCYRLVAEKAPWLQAQEQCRTWaGAALAMVDSPAIQHFLVSKVTRSL--DVWIG---------FSSVEGTEGLDPrgeaf 486
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGlndlssegtWKWSDGSPLVDY----- 74
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 568999406  487 slescQNWLPGEPHPATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd00037    75 -----TNWAPGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.37e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.16  E-value: 2.37e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406    61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1481-1768 8.30e-11

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 67.00  E-value: 8.30e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1481 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEV-SRSEAQLNITVKQRVRGLTINASRTVVPLNGS 1559
Cdd:COG3291    16 FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLAN 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1560 VSFSTLLEVGSDVHYSWVlcdrCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCC 1639
Cdd:COG3291    96 TANGGATTVVAGSTVGTG----VATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTS 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1640 FPTNYTLQLQAAVRDGTNISYSWTAQQEGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAANRTIDFVEPVESLI 1719
Cdd:COG3291   172 ASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTV 251
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 568999406 1720 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTF 1768
Cdd:COG3291   252 TTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVS 300
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
56-127 1.20e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 64.42  E-value: 1.20e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406   56 LQTLGPSLRIpadataLDLSHNLLQTLDIglLVNLSALVELDLSNNRISTLEE--GVFANLFNLSEINLSGNPF 127
Cdd:cd21340   115 LAALSNSLRV------LNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3003-3052 9.14e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 56.63  E-value: 9.14e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 568999406   3003 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3052
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
48-128 5.81e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 5.81e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   48 RVNCSGRWLQTLGPSLripADATAL---DLSHNLLQTLDIGLLvNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSG 124
Cdd:COG4886   140 ELDLSNNQLTDLPEPL---GNLTNLkslDLSNNQLTDLPEELG-NLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSG 214

                  ....
gi 568999406  125 NPFE 128
Cdd:COG4886   215 NQLT 218
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.69e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.69e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 568999406     32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
myxo_dep_M36 NF038112
myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family ...
1551-1879 3.09e-04

myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family have an M36 family metallopeptidase domain, like fungalysin (see PF02128), and a C-terminal MYXO-CTERM domain (see TIGR03901), suggesting processing and surface-anchoring by the still-unknown putative transpeptidase, myxosortase. Members of this family include MXAN_3564 (mepA), part of the effector cargo of outer membrane vesicles that the species produces in large numbers during predation on other microbes.


Pssm-ID: 468355 [Multi-domain]  Cd Length: 1597  Bit Score: 47.34  E-value: 3.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1551 RTVVPLNGSVSFStllEVGSDVHYSW--------VLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVgSAQDSIFIYV 1622
Cdd:NF038112 1201 RTTVTLNGSGSFD---PDGDPLTYAWtqvsgpavTLTGADTATPSFTAPEVTADTVLTFQLVVSDGTKT-SAPDTVTVLV 1276
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1623 LQFIEG-LQVAGGDNGCCFPTNYTLQLQAAVRDGTNISYSWTaQQEGSLITLFGSGKC---FSLTSLKASTYYVhLRATN 1698
Cdd:NF038112 1277 RNVNRApVAVAGAPATVDERSTVTLDGSGTDADGDALTYAWT-QTSGPAVTLTGATTAtatFTAPEVTADTQLT-FTLTV 1354
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1699 MLGSAAANRTIDFV------EPVESLILSASPNPAAVnMSLTLCAELAGGSGVVYTWYLEEGLSWK-TSMPSTTHTFAAP 1771
Cdd:NF038112 1355 SDGTASATDTVTVTvrnvnrAPVANAGADQTVDERST-VTLSGSATDPDGDALTYAWTQTAGPTVTlTGADTATASFTAP 1433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1772 glhlvRVTAENQLG---------------SVNATVEVAIQVPVGglsirtSEPDSIFVAAGSTLPFWGQL--AEGTNVTW 1834
Cdd:NF038112 1434 -----EVAADTELTfqltvsadgqasadvTVTVTVRNVNRAPVA------HAGESITVDEGSTVTLDASAtdPDGDTLTY 1502
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 568999406 1835 CWTLPGGSkdsqyiAVRFSTAGSFSLQLNASNAVSWVSAMYNLTV 1879
Cdd:NF038112 1503 AWTQVAGP------SVTLTGADSAKLTFTAPEVSADTTLTFSLTV 1541
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
97-2723 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 3842.37  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    97 DLSNNRISTLEEGVFANLFNLSEINLSGNPFECNCGLAWLPRWAKEHQVHVVQSEATTCRGPIPLAGQPLLSIPLLDNAC 176
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   177 GEEYVACLPDNSSGAVAA----VPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWCSSIS 252
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAArselVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   253 LSLNSACGGPTLLQHTFPASPGATLVGPHGPLASGQPADFHITSSLPISSTRWNFGDGSPEVDMASP----AATHFYVLP 328
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   329 GSYHMTVVLALGAGSALLETEVQVEATPTVLELVCPSFVHSNESLELGIRHRGGSALEVTYSILALDKEPAQVVHPLCPL 408
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   409 DTEIFPGNGHCYRLVAEKAPWLQAQEQCRTWAGAALAMVDSPAIQHFLVSKVTRSLD--VWIGFSSVEGTE-GLDPRGEA 485
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEkGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   486 FSLESCQNWLPGEPHPATAEHCVRLGPAGQCNTDLCSAPHSYVCELRPGGPVWDTENFVMGMSGGGLSGPLHPLAQQETV 565
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   566 QGPLR-PVEVMVFPGLSPSREAFLTAAEFSTQKLEEPAQMRLQVYRPSGG--AAAVPEGSSEP-----DNRTEPAPKCVP 637
Cdd:TIGR00864  481 PAPPHeGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRlpGAGGPACGPEAecrppDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   638 EELWCPGANVCIPFDASCNSHVCINGSVSRLGLSRAS----YTLWKEFFFSVPAGPPTQYLVTLHSQDVPMLPGDLIGLQ 713
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   714 HDAGPGTLLQCPLASSCPG-QALYLSTNASDWM------------------------TNLPVHLEEAWAG----PVCSLQ 764
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGpRAPVFAANASEWFghnntpvppdnlagdgadplpdpeLDLKALLEGTRASwlecAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   765 LLLVTERLTPLLGLGPNPGLQHPGHYEVRATVGNSVSRQNLSCSFSVVSPIAGLRVIHPIPLDGHIYVPTNGSVLVLQVD 844
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   845 SGANATATAQWFGGNISAPFEDACPPEVDFLKQD-------CTEEANGTLFSVLMLPRLKEGDHT----VEIVAQNGASQ 913
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFASRLCHpstfeggCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   914 ANLSLRVTAEEPICGLRAVPSPEARVLQGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLTAS 993
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSLTAM 960
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   994 NHVSNITVNYNVTVERMNKMHGLWVSAVPTVLPPNATLALTGGVLVDSAVEVAFLWNFGDGEQVLRQFKPPYDESFQVPD 1073
Cdd:TIGR00864  961 NHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPD 1040
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1074 PTVAQVLVEHNTTHIYTTPGEYNLTVLVSNTYENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGVLY 1153
Cdd:TIGR00864 1041 PSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHY 1120
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1154 TWDFGDGSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVESGD 1233
Cdd:TIGR00864 1121 EWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGD 1200
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1234 NITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFVLEVLHIEPSTCIPTQPSAQLMAHVTG 1313
Cdd:TIGR00864 1201 NITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSG 1280
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1314 DPVHYLFDWTFGDGSSNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSICVEPEIRNITLQPERQFVKLGDEAR 1393
Cdd:TIGR00864 1281 NAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQ 1360
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1394 LVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQEPVLVTGIRINGSH-- 1471
Cdd:TIGR00864 1361 FQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHgn 1440
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1472 VLELQQPYLLSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINASR 1551
Cdd:TIGR00864 1441 NLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASL 1520
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1552 TVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQV 1631
Cdd:TIGR00864 1521 TNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQI 1600
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1632 AGGD-----------NGCCFPTNYTLQLQAAVRDGTNISYSWTA--QQEGSLITLFGSGKCFSLTSLKASTYYVHLRATN 1698
Cdd:TIGR00864 1601 LGETaegggggvqelDGCYFETNHTVQFHAGFKDGTNLSFSWNAilDNEPDGPAFAGSGKGAKLNPLEAGPCDIFLQAAN 1680
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1699 MLGSAAANRTIDFVEPVESLILSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRV 1778
Cdd:TIGR00864 1681 LLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTM 1760
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1779 TAENQLGSVNATVEVAIQVPVGGLSIRTSEP-DSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGS-KDSQYIAVRFSTAG 1856
Cdd:TIGR00864 1761 KAFNELGSANASEEVDVQEPISGLKIRAADAgEQNFFAADSSVCFQGELATGTNVSWCWAIDGGSsKMGKHACMTFPDAG 1840
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1857 SFSLQLNASNAVSWVSAMYNLTVEEPIVNLMLWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVL-PSPHFSHS 1935
Cdd:TIGR00864 1841 TFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHS 1920
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1936 FFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDSLV 2015
Cdd:TIGR00864 1921 FPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLV 2000
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2016 ILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWDFG 2093
Cdd:TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWDFG 2080
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2094 DGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDLRN 2173
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2174 CVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLARS 2240
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2241 IQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SSVV 2319
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2320 TIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYKQG 2399
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2400 CWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL-- 2477
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGge 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2478 ------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PLFV 2544
Cdd:TIGR00864 2481 tgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2545 VSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAPD- 2623
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDs 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2624 VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGTLT 2703
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 568999406  2704 PTTIADSILNITGDLIHLAS 2723
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2167-2610 2.40e-116

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 378.00  E-value: 2.40e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2167 AHVDLRNCVS-YQTEYRWEIYRTASCqrpGRMAQMVLPgVDVSRPQLVVPRLALPVGHYCFVFVVSFGDTP-LARSIQAN 2244
Cdd:pfam02010    1 ASVELNGCFSaYTIDYLWSVFTVSSN---LNLQTISSP-KDLVLPQLTIPSGTLPYGTYVFTLTVSLSSTPsLAGTDIIT 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2245 VTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVAST------QSETGGCV-----LNFGP 2313
Cdd:pfam02010   77 VTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGTLLQ 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2314 RGSSVVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGHCHNCS 2393
Cdd:pfam02010  157 STSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCLNCS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2394 RGYKQGC--WAARTFSNKTLVLN--ETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLG 2469
Cdd:pfam02010  235 SDLSDVTyrWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAPPTG 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  2470 GSCRLFPLDSVRgLTTKVHFECTGWRDAEDggaPLVYALLLKRCRQSYCENFCIYKGSLST-YGAVLPPGF-QPLFVVSL 2547
Cdd:pfam02010  315 GTCSVTPTEGTA-LETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQIsISTFLPPGLpANDYQVTV 390
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  2548 AVVVQDQLGAAvVALNRSLTIVLPEPSgnpaDLVPWLHSLTASVLPGLLKQADPQHVIEYSLA 2610
Cdd:pfam02010  391 VVVVYDSLGAA-TSVSLTITVTPPSSS----DELLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3704-3882 6.23e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.23e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3704 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3768
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3769 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3843
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 568999406  3844 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3882
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3883-4104 2.09e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.09e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3883 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3959
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3960 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4039
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  4040 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4104
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3110-3229 6.16e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 194.03  E-value: 6.16e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3187
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 568999406 3188 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3229
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
3110-3229 2.32e-29

PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 consists of an eight-stranded beta-barrel and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 114.96  E-value: 2.32e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD---GDRAFHRNSLDIFQIATPhSLGSVWKIRVWHDNKGLSPA 3186
Cdd:cd01756     1 VTYEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKksnNKNKFERGQTDKFTVEAV-DLGKLKKIRIGHDNSGLGAG 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 568999406 3187 WFLQHIIVRDLQSARSTFFLVNDWLSveTEANGGLVEKEVLAA 3229
Cdd:cd01756    80 WFLDKVEIREPGTGDEYTFPCNRWLD--KDEDDGQIVRELYPS 120
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3112-3215 1.86e-22

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.19  E-value: 1.86e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  3112 YEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGDR-AFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3190
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 568999406  3191 HIIVRDLQSARSTF-FLVNDWLSVET 3215
Cdd:pfam01477   81 SITVEVPGETGGKYtFPCNSWVYGSK 106
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
177-271 3.54e-19

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 85.21  E-value: 3.54e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    177 GEEYVACLPDNSSGAVAAVPFYFAHegPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWC--SSISLS 254
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSVSASDSSQcsTTCSGY 78
                            90
                    ....*....|....*..
gi 568999406    255 LNSACGGPTLLQHTFPA 271
Cdd:smart00321   79 PAEVCGGPNRLSVYVLA 95
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
3111-3212 4.06e-19

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight-stranded beta-barrel. The domain can be found in various domain architectures, in the case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats. The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 85.85  E-value: 4.06e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3111 KYEILVKTGWSRGSGTTAHVGIMLYGED-NRSGHRHLDGDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFL 3189
Cdd:cd00113     2 RYTVTIKTGDKKGAGTDSNISLALYGENgNSSDIPILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWYC 81
                          90       100
                  ....*....|....*....|...
gi 568999406 3190 QHIIVRDLQSARSTFFLVNDWLS 3212
Cdd:cd00113    82 ESITVQALGTKKVYTFPVNRWVL 104
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3110-3212 5.52e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 85.00  E-value: 5.52e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   3110 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD--GDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKglSPAW 3187
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 568999406   3188 FLQHIIVRDLQSARSTFFLVNDWLS 3212
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
418-530 1.18e-17

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)-selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent, including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 81.51  E-value: 1.18e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  418 HCYRLVAEKAPWLQAQEQCRTWaGAALAMVDSPAIQHFLVSKVTRSL--DVWIG---------FSSVEGTEGLDPrgeaf 486
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGlndlssegtWKWSDGSPLVDY----- 74
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 568999406  487 slescQNWLPGEPHPATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd00037    75 -----TNWAPGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1720-1789 1.48e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 79.74  E-value: 1.48e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1720 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRVTAENQLGSVNA 1789
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1888-1957 3.38e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 78.58  E-value: 3.38e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1888 LWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVLPSPHFSHSFFRVGDHLVNVQAENHVSHAQA 1957
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
406-530 4.64e-17

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 80.34  E-value: 4.64e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRtWAGAALAMVDSPAIQHFL---VSKVTRSLDVWIGFS--SVEGT-EGL 479
Cdd:smart00034    1 CPSGWISY--GGKCYKFSTEKKTWEDAQAFCQ-SLGGHLASIHSEAENDFVaslLKNSGSSDYYWIGLSdpDSNGSwQWS 77
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 568999406    480 DPRGeafsLESCQNWLPGEPhPATAEHCVRLGPAGQC-NTDLCSAPHSYVCE 530
Cdd:smart00034   78 DGSG----PVSYSNWAPGEP-NNSSGDCVVLSTSGGKwNDVSCTSKLPFVCE 124
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1379-1450 1.74e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.66  E-value: 1.74e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568999406  1379 LQPERQFVKLGDEARLVAYSWPPFPYRYTWDFGteDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTND 1450
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFG--DSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1214-1279 2.28e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.27  E-value: 2.28e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406  1214 SPSVEQGAPMVVSASVESGDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQ 1279
Cdd:pfam00801    5 GTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1125-1196 4.06e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 4.06e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568999406  1125 IGMSSNVLVAGQPITFSPYpLPSTDGVLYTWDFGDgSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTA 1196
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTAT-LADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1547-1616 8.76e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 74.73  E-value: 8.76e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1547 INASRTVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD 1616
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
2064-2131 4.25e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 72.80  E-value: 4.25e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568999406  2064 SGRCFTNRSARFEAaTSPSPRRVTYHWDFGDGtPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVA 2131
Cdd:pfam00801    5 GTVVAAGQPVTFTA-TLADGSNVTYTWDFGDS-PGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT_DC-SIGN_like cd03590
C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific ...
406-530 1.34e-13

C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR); CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in humans. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues as well as on pathogens, including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV, enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor that binds and internalizes desialylated glycoproteins having terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2,6GalNAc and Sia alpha2,6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars, mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled coil of alpha helices.


Pssm-ID: 153060 [Multi-domain]  Cd Length: 126  Bit Score: 70.41  E-value: 1.34e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFsSVEGTEG----LDp 481
Cdd:cd03590     1 CPTNWKSF--QSSCYFFSTEKKSWEESRQFCED-MGAHLVIINSQEEQEFISKILSGNRSYWIGL-SDEETEGewkwVD- 75
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568999406  482 rGEAFSLEScQNWLPGEP--HPATAEHCVRL-GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03590    76 -GTPLNSSK-TFWHPGEPnnWGGGGEDCAELvYDSGGWNDVPCNLEYRWICE 125
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1128-1203 2.08e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 68.25  E-value: 2.08e-13
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406   1128 SSNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSpvlIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVF 1203
Cdd:smart00089    7 SPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1211-1287 2.11e-13

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 68.29  E-value: 2.11e-13
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568999406 1211 VYLSPSVEQGAPMVVSASVES-GDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGhLSQSLHVQVFV 1287
Cdd:cd00146     5 VSAPPVAELGASVTFSASDSSgGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.37e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.16  E-value: 2.37e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406    61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1485-1539 2.50e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.50e-13
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 568999406   1485 GSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVK 1539
Cdd:smart00089   25 DDGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1208-1286 3.10e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.48  E-value: 3.10e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   1208 GLTVYLSPSVEQ-GAPMVVSASVE-SGDNITWTFDMGDGTVFTGPeaTVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQV 1285
Cdd:smart00089    1 VADVSASPTVGVaGESVTFTATSSdDGSIVSYTWDFGDGTSSTGP--TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78

                    .
gi 568999406   1286 F 1286
Cdd:smart00089   79 Q 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1636-1705 3.67e-13

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 67.03  E-value: 3.67e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1636 NGCCFPTNYTLQLQAAVRDGTNISYSWTAqqeGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAA 1705
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF---GDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1469-1532 4.92e-13

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 66.64  E-value: 4.92e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406  1469 GSHVLELQQPYLLSAMG-SGSPATYLWELGD--GSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEA 1532
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLaDGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1544-1622 2.22e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 65.17  E-value: 2.22e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   1544 GLTINASRTVVPLNGSVSFS-TLLEVGSDVHYSWVLCDRctPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYV 1622
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTaTSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
277-344 3.98e-12

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 64.33  E-value: 3.98e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   277 LVGPHGPLASGQPADFHITS-SLPISSTRWNFGDgSPEVDMASPAATHFYVLPGSYHMTVVLALGAGSA 344
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLaDGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSA 68
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
2069-2138 6.76e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 64.01  E-value: 6.76e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   2069 TNRSARFEAATSPSPRRVTYHWDFGDGTPVQKTeefWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQ 2138
Cdd:smart00089   13 AGESVTFTATSSDDGSIVSYTWDFGDGTSSTGP---TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1809-1873 1.10e-11

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 62.79  E-value: 1.10e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568999406  1809 PDSIFVAAGSTLPFWGQLAEGTNVTWCWTL---PGGSKDSQYIAVRFSTAGSFSLQLNASNAVSWVSA 1873
Cdd:pfam00801    3 ASGTVVAAGQPVTFTATLADGSNVTYTWDFgdsPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1129-1202 2.94e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 62.13  E-value: 2.94e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568999406 1129 SNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSpVLIQSQPVLNHTYSMTGAYRITLEV-NNTVSSVTAHADIRV 1202
Cdd:cd00146     8 PPVAELGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVtNAVGSSSTKTTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1721-1796 6.96e-11

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 60.93  E-value: 6.96e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406   1721 SASPNPAAVNMSLTLCAELAG-GSGVVYTWYLEEGLSWKTsmPSTTHTFAAPGLHLVRVTAENQLGSVNATVEVAIQ 1796
Cdd:smart00089    5 SASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSSTG--PTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1481-1768 8.30e-11

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 67.00  E-value: 8.30e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1481 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEV-SRSEAQLNITVKQRVRGLTINASRTVVPLNGS 1559
Cdd:COG3291    16 FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLAN 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1560 VSFSTLLEVGSDVHYSWVlcdrCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCC 1639
Cdd:COG3291    96 TANGGATTVVAGSTVGTG----VATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTS 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1640 FPTNYTLQLQAAVRDGTNISYSWTAQQEGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAANRTIDFVEPVESLI 1719
Cdd:COG3291   172 ASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTV 251
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 568999406 1720 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTF 1768
Cdd:COG3291   252 TTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVS 300
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
56-127 1.20e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 64.42  E-value: 1.20e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406   56 LQTLGPSLRIpadataLDLSHNLLQTLDIglLVNLSALVELDLSNNRISTLEE--GVFANLFNLSEINLSGNPF 127
Cdd:cd21340   115 LAALSNSLRV------LNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1018-1116 2.31e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 59.39  E-value: 2.31e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   1018 VSAVPTVLPPNATLALTGGVLVDSAVeVAFLWNFGDGeqvlrqfkppydesfqvpdptvaQVLVEHNTTHIYTTPGEYNL 1097
Cdd:smart00089    4 VSASPTVGVAGESVTFTATSSDDGSI-VSYTWDFGDG-----------------------TSSTGPTVTHTYTKPGTYTV 59
                            90
                    ....*....|....*....
gi 568999406   1098 TVLVSNTYENLTQQVTVSV 1116
Cdd:smart00089   60 TLTVTNAVGSASATVTVVV 78
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1976-2047 2.90e-10

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 58.94  E-value: 2.90e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568999406  1976 CEPGMATGTEKNFTARVQRGSRVAYAWYFslqkvqGDSLV-ILSGRDVTYTPVAAGLLEIHVRAFNELGGVNL 2047
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF------GDSPGtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1545-1622 7.29e-10

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 58.28  E-value: 7.29e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1545 LTINASRTVVPLNG-SVSFS-TLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQ-DSIFIY 1621
Cdd:cd00146     1 PTASVSAPPVAELGaSVTFSaSDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVV 80

                  .
gi 568999406 1622 V 1622
Cdd:cd00146    81 V 81
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3003-3052 9.14e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 56.63  E-value: 9.14e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 568999406   3003 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3052
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1018-1110 2.28e-09

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 56.24  E-value: 2.28e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1018 VSAVPTVLPPNATLALTGGVLvdSAVEVAFLWNFGDGeqvlrqfkppydesfqvpdptVAQVLVEHNTTHIYTTPGEYNL 1097
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLA--DGSNVTYTWDFGDS---------------------PGTSGSGPTVTHTYLSPGTYTV 57
                           90
                   ....*....|...
gi 568999406  1098 TVLVSNTYENLTQ 1110
Cdd:pfam00801   58 TLTASNAVGSANA 70
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 4.63e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 4.63e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   70 TALDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEgvFANLFNLSEINLSGNPFE 128
Cdd:COG4886   208 EELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1465-1538 5.29e-09

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 55.58  E-value: 5.29e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406 1465 IRINGSHVLELQQPYLLSAMGS--GSPATYLWELGDGSQ--SEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQ-LNITV 1538
Cdd:cd00146     3 ASVSAPPVAELGASVTFSASDSsgGSIVSYKWDFGDGEVssSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKtTTVVV 81
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
48-128 5.81e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 5.81e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   48 RVNCSGRWLQTLGPSLripADATAL---DLSHNLLQTLDIGLLvNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSG 124
Cdd:COG4886   140 ELDLSNNQLTDLPEPL---GNLTNLkslDLSNNQLTDLPEELG-NLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSG 214

                  ....
gi 568999406  125 NPFE 128
Cdd:COG4886   215 NQLT 218
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
2069-2139 1.46e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 54.42  E-value: 1.46e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568999406 2069 TNRSARFEAATSPSPRRVTYHWDFGDGTpVQKTEEFWADHYYLRPGDYHVEVNATNLVSfFVAQATVTVQV 2139
Cdd:cd00146    13 LGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
72-128 1.74e-08

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 60.33  E-value: 1.74e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406   72 LDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEGvFANLFNLSEINLSGNPFE 128
Cdd:COG4886   187 LDLSNNQITDLPEPL-GNLTNLEELDLSGNQLTDLPEP-LANLTNLETLDLSNNQLT 241
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
276-352 1.92e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 54.04  E-value: 1.92e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  276 TLVGPHGPLAS-GQPADFHITSSLP--ISSTRWNFGDGSPEVdMASPAATHFYVLPGSYHMTVVLALGAGSALLET-EVQ 351
Cdd:cd00146     2 TASVSAPPVAElGASVTFSASDSSGgsIVSYKWDFGDGEVSS-SGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTtTVV 80

                  .
gi 568999406  352 V 352
Cdd:cd00146    81 V 81
WSC pfam01822
WSC domain; This domain is involved in carbohydrate binding.
180-233 2.26e-08

WSC domain; This domain is involved in carbohydrate binding.


Pssm-ID: 460348  Cd Length: 82  Bit Score: 54.00  E-value: 2.26e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 568999406   180 YVACLPDNSSGaVAAVPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCG 233
Cdd:pfam01822    1 YLGCYSDGTGG-RRLLLGSSGDYDDMTPEKCIAFCSAAGYTYAGLEYGGECYCG 53
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1119-1202 3.82e-08

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 53.43  E-value: 3.82e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1119 VLPNVAIGMSSNVlVAGQPITFSPypLPSTDG----VLYTWDFGDGSPVliqSQPVLNHTYSMTGAYRITLEVNNT--VS 1192
Cdd:pfam18911    2 AAPVADAGGDRIV-AEGETVTFDA--SASDDPdgdiLSYRWDFGDGTTA---TGANVSHTYAAPGTYTVTLTVTDDsgAS 75
                           90
                   ....*....|
gi 568999406  1193 SVTAHADIRV 1202
Cdd:pfam18911   76 NSTATDTVTV 85
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
286-353 4.02e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 53.22  E-value: 4.02e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    286 SGQPADFHITSSLP--ISSTRWNFGDGSpevDMASPAATHFYVLPGSYHMTVVLALGAGSALLETEVQVE 353
Cdd:smart00089   13 AGESVTFTATSSDDgsIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1485-1538 4.25e-08

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 53.43  E-value: 4.25e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406  1485 GSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNE--VSRSEAQLNITV 1538
Cdd:pfam18911   30 PDGDILSYRWDFGDGTTATGANVSHTYAAPGTYTVTLTVTDDsgASNSTATDTVTV 85
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 6.31e-08

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 58.41  E-value: 6.31e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   70 TALDLSHNLLQtlDIGLLVNLSALVELDLSNNRISTLEEgvFANLFNLSEINLSGNPFE 128
Cdd:COG4886   231 ETLDLSNNQLT--DLPELGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1380-1457 8.88e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 52.07  E-value: 8.88e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   1380 QPERQFVKLGDEARLVAYSWP-PFPYRYTWDFGTEDTTHTQTggseVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQ 1457
Cdd:smart00089    5 SASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSSTGPT----VTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1721-1793 1.09e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 52.11  E-value: 1.09e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406 1721 SASPNPAAVNMSLTLcaeLAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRVTAENQLGSVN---ATVEV 1793
Cdd:cd00146     9 PVAELGASVTFSASD---SSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStktTTVVV 81
CLECT_REG-1_like cd03594
C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and ...
406-530 1.50e-07

C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1 (SCA-1), and -2 (SCA-2); CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1 (SCA-1), and -2 (SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration, including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps regeneration of damaged heart tissue. REG-1 may play a role in the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora, and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is upregulated in pancreatic, gastric, hepatocellular, and prostate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin and SCA-1 and -2 are found at high concentration in the calcified eggshell layer of goose and ostrich, respectively, and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.


Pssm-ID: 153064 [Multi-domain]  Cd Length: 129  Bit Score: 53.14  E-value: 1.50e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  406 CPLDTeiFPGNGHCYRLVAEKAPWLQAQEQCRTW-AGAALAMVDSPAIQHFLVSKV----TRSLDVWIGFSsvegteglD 480
Cdd:cd03594     1 CPKGW--LPYKGNCYGYFRQPLSWSDAELFCQKYgPGAHLASIHSPAEAAAIASLIssyqKAYQPVWIGLH--------D 70
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568999406  481 P---RGEAFSLESCQNW--LPGEPHPATAEHCVRL-GPAG--QCNTDLCSAPHSYVCE 530
Cdd:cd03594    71 PqqsRGWEWSDGSKLDYrsWDRNPPYARGGYCAELsRSTGflKWNDANCEERNPFICK 128
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1290-1371 1.57e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 51.34  E-value: 1.57e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1290 VLHIEPSTCIPTQPSAQLMAHVTGDPVHYLFDWTFGDGssNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHyFTSIC 1369
Cdd:cd00146     2 TASVSAPPVAELGASVTFSASDSSGGSIVSYKWDFGDG--EVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSS-TKTTT 78

                  ..
gi 568999406 1370 VE 1371
Cdd:cd00146    79 VV 80
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1290-1371 1.80e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 51.30  E-value: 1.80e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   1290 VLHIEPSTCIPTQP-SAQLMAHVTGDPVHYLFDWTFGDGssnvTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSI 1368
Cdd:smart00089    1 VADVSASPTVGVAGeSVTFTATSSDDGSIVSYTWDFGDG----TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 568999406   1369 CVE 1371
Cdd:smart00089   77 VVQ 79
CLECT_CEL-1_like cd03589
C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and ...
406-531 3.22e-07

C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina; CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acetylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.


Pssm-ID: 153059  Cd Length: 137  Bit Score: 52.36  E-value: 3.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRTWAG----AALAMVDSPAIQHFL------VSKVTRSLDVWIGF--SSV 473
Cdd:cd03589     1 CPTFWTAF--GGYCYRFFGDRLTWEEAELRCRSFSIpgliAHLVSIHSQEENDFVydlfesSRGPDTPYGLWIGLhdRTS 78
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406  474 EGT-EGLDPRGEAFSlescqNWLPGEPHPA-TAEHCV---RLGPAGQCNTDL-CSAPHSYVCEL 531
Cdd:cd03589    79 EGPfEWTDGSPVDFT-----KWAGGQPDNYgGNEDCVqmwRRGDAGQSWNDMpCDAVFPYICKM 137
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 3.87e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 56.10  E-value: 3.87e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   70 TALDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSGNPFE 128
Cdd:COG4886   116 ESLDLSGNQLTDLPEEL-ANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLT 172
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1968-2054 4.01e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 50.14  E-value: 4.01e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406   1968 VGLQVPNCCEPgmATGTEKNFTARVQR-GSRVAYAWYFslqkvqGDSLViLSGRDVTYTPVAAGLLEIHVRAFNELGGVN 2046
Cdd:smart00089    1 VADVSASPTVG--VAGESVTFTATSSDdGSIVSYTWDF------GDGTS-STGPTVTHTYTKPGTYTVTLTVTNAVGSAS 71

                    ....*...
gi 568999406   2047 LTLMVEVQ 2054
Cdd:smart00089   72 ATVTVVVQ 79
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
3112-3219 5.57e-07

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight-stranded beta-barrel and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 51.39  E-value: 5.57e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 3112 YEILVKTGWSRGSGTTAHVGIMLYGEDNRS---------GHRHLDGDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKG 3182
Cdd:cd01754     3 YTIYVQTGSIWKAGTDSRISLQIYDADGPGlrianleawGGLMGAGHDYFERGNLDRFSGRGPCLPSPPCWMNLTSDGTG 82
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 568999406 3183 LSPAWFLQHIIVRDLQSAR---STFFLVNDWLSVETEANG 3219
Cdd:cd01754    83 NHPGWYVNYVEVTQAGQHApcmQHLFAVEQWLATDESPYM 122
LRRCT smart00082
Leucine rich repeat C-terminal domain;
125-177 1.17e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 48.20  E-value: 1.17e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 568999406    125 NPFECNCGLAWLPRWAKEhQVHVVQSEATTCRGPIPLAGqPLLSIPLLDNACG 177
Cdd:smart00082    1 NPFICDCELRWLLRWLQA-NEHLQDPVDLRCASPSSLRG-PLLELLHSEFKCP 51
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1019-1275 1.42e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 53.91  E-value: 1.42e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1019 SAVPTVLPPNATLALTGgvlVDSAVEVAFLWNFGDGEqvlrqfkppydesfqvpDPTVAqvlvehNTTHIYTTPGEYNLT 1098
Cdd:COG3291     2 TATPTSGCAPLTVQFTD---TSSGNATSYEWDFGDGT-----------------TSTEA------NPSHTYTTPGTYTVT 55
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1099 VLVSNTY-ENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSPVLIQSQPVLNHTYSM 1177
Cdd:COG3291    56 LTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTT 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1178 TGayriTLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVESGDNITWTFDMGDGTVFTGPEATVQHV 1257
Cdd:COG3291   136 TG----TDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVT 211
                         250
                  ....*....|....*...
gi 568999406 1258 YLRAQNFTVTVEAANPAG 1275
Cdd:COG3291   212 TGATSGTSGTGSATSGVA 229
CLECT_collectin_like cd03591
C-type lectin-like domain (CTLD) of the type found in human collectins including lung ...
420-531 2.29e-06

C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1); CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, are components of pulmonary surfactant which play a role in surfactant homeostasis. Pulmonary surfactant is a phospholipid-protein complex which reduces the surface tension within the lungs. SP-A binds the major surfactant lipid: dipalmitoylphosphatidylcholine (DPPC). SP-D binds two minor components of surfactant that contain sugar moieties: glucosylceramide and phosphatidylinositol (PI). MBP and SP-A, -D monomers are homotrimers with an N-terminal collagen region and three CTLDs. Multiple homotrimeric units associate to form supramolecular complexes. MBL deficiency results in an increased susceptibility to a large number of different infections and to inflammatory disease, such as rheumatoid arthritis.


Pssm-ID: 153061 [Multi-domain]  Cd Length: 114  Bit Score: 49.22  E-value: 2.29e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  420 YRLVAEKAPWLQAQEQCRTwAGAALAMVDSP----AIQHFLVSKVTRsldVWIGFSSVEgTEG----LDPRGEAFSlesc 491
Cdd:cd03591     4 FVTNGEEKNFDDAQKLCSE-AGGTLAMPRNAaenaAIASYVKKGNTY---AFIGITDLE-TEGqfvyLDGGPLTYT---- 74
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 568999406  492 qNWLPGEPHPAT-AEHCVRLGPAGQCNTDLCSAPHSYVCEL 531
Cdd:cd03591    75 -NWKPGEPNNAGgGEDCVEMYTSGKWNDVACNLTRLFVCEF 114
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1386-1456 6.59e-06

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 47.11  E-value: 6.59e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406 1386 VKLGDEARLVAYSWPPF----PYRYTWDFGTEDTTHTqtGGSEVKFIYREPGSYLVIVTVSNNISSTN-DSAFVEV 1456
Cdd:cd00146     8 PPVAELGASVTFSASDSsggsIVSYKWDFGDGEVSSS--GEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1816-1880 1.09e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 46.29  E-value: 1.09e-05
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568999406   1816 AGSTLPFWGQ-LAEGTNVTWCWTLPGGSKDSQYIAV-RFSTAGSFSLQLNASNAVSWVSAMYNLTVE 1880
Cdd:smart00089   13 AGESVTFTATsSDDGSIVSYTWDFGDGTSSTGPTVThTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1296-1360 1.70e-05

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 45.46  E-value: 1.70e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406  1296 STCIPTQPSAQLMAHV-TGDPVHYLfdWTFGDGssNVTVHGHPSVTHNFTRSGIFPLALVLSSHVN 1360
Cdd:pfam00801    5 GTVVAAGQPVTFTATLaDGSNVTYT--WDFGDS--PGTSGSGPTVTHTYLSPGTYTVTLTASNAVG 66
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
55-128 1.73e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 50.70  E-value: 1.73e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406   55 WLQTLGPSLRIPADATALDLSHNLLqtldiglLVNLSALVELDLSNNRISTLEEGvFANLFNLSEINLSGNPFE 128
Cdd:COG4886    84 LLLLGLTDLGDLTNLTELDLSGNEE-------LSNLTNLESLDLSGNQLTDLPEE-LANLTNLKELDLSNNQLT 149
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1018-1116 4.49e-05

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 44.41  E-value: 4.49e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1018 VSAVPTVLPPNATLALTGGVLVDSAVeVAFLWNFGDGEQVLRQfkppydesfqvpdptvaqvlvEHNTTHIYTTPGEYNL 1097
Cdd:cd00146     4 SVSAPPVAELGASVTFSASDSSGGSI-VSYKWDFGDGEVSSSG---------------------EPTVTHTYTKPGTYTV 61
                          90       100
                  ....*....|....*....|
gi 568999406 1098 TVLVSNTY-ENLTQQVTVSV 1116
Cdd:cd00146    62 TLTVTNAVgSSSTKTTTVVV 81
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.69e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.69e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 568999406     32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
928-1008 1.28e-04

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 43.21  E-value: 1.28e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406    928 GLRAVPSPEARVLqGILVRYSPMVE-AGSDVAFRWTIDDkqSLTFHNTVFNVIYQSAAIFKLSLTASNHVSNITVNYNVT 1006
Cdd:smart00089    1 VADVSASPTVGVA-GESVTFTATSSdDGSIVSYTWDFGD--GTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    ..
gi 568999406   1007 VE 1008
Cdd:smart00089   78 VQ 79
CLECT_NK_receptors_like cd03593
C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); ...
417-530 1.60e-04

C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL, in addition to inhibiting NK cell function, inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cell's secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress-induced MHC-I homologs MICA and MICB and the ULBP family of glycoproteins. Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and is involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and is thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g. Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.


Pssm-ID: 153063  Cd Length: 116  Bit Score: 43.86  E-value: 1.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  417 GHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLvSKVTRSLDVWIGFSsVEGTEG----LDprGEAFSlescq 492
Cdd:cd03593    10 NKCYYFSMEKKTWNESKEACSS-KNSSLLKIDDEEELEFL-QSQIGSSSYWIGLS-REKSEKpwkwID--GSPLN----- 79
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 568999406  493 NWLpgEPHPATAE-HCVRLGPAGqCNTDLCSAPHSYVCE 530
Cdd:cd03593    80 NLF--NIRGSTKSgNCAYLSSTG-IYSEDCSTKKRWICE 115
CLECT_CSPGs cd03588
C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core ...
417-530 1.86e-04

C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins; CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.


Pssm-ID: 153058  Cd Length: 124  Bit Score: 44.11  E-value: 1.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  417 GHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLdvWIGFS--SVEGteglDPR---GEAFSLEsc 491
Cdd:cd03588    10 GHCYRHFPDRETWEDAERRCRE-QQGHLSSIVTPEEQEFVNNNAQDYQ--WIGLNdrTIEG----DFRwsdGHPLQFE-- 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 568999406  492 qNWLPGEPHP--ATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03588    81 -NWRPNQPDNffATGEDCVVMiwHEEGEWNDVPCNYHLPFTCK 122
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1721-1876 2.08e-04

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 46.97  E-value: 2.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1721 SASPNPAAVNMSLTLCAeLAGGSGVVYTWYLEEGLSwkTSMPSTTHTFAAPGLHLVRVTAENQLGSvNATVEVAIQVPVG 1800
Cdd:COG3291     2 TATPTSGCAPLTVQFTD-TSSGNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGC-SDTTTKTITVGAP 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568999406 1801 GLSIRTSEPDSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGSKDSQYIAVRFSTAGSFSLQLNASNAVSWVSAMYN 1876
Cdd:COG3291    78 NPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTA 153
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1225-1287 2.68e-04

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 42.64  E-value: 2.68e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406  1225 VSASVES-GDNITWTFDMGDGTVFTGPEATvqHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFV 1287
Cdd:pfam18911   24 ASASDDPdGDILSYRWDFGDGTTATGANVS--HTYAAPGTYTVTLTVTDDSGASNSTATDTVTV 85
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1402-1663 2.70e-04

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 46.59  E-value: 2.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1402 FPYRYTWDFGtEDTTHTQTggsEVKFIYREPGSYLVIVTVSNNI-SSTNDSAFVEVQEPVLVTGIRINGShvlelqqpYL 1480
Cdd:COG3291    23 NATSYEWDFG-DGTTSTEA---NPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTST--------TV 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1481 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINASRTVVPLNGSV 1560
Cdd:COG3291    91 TTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1561 SFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCCF 1640
Cdd:COG3291   171 SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNT 250
                         250       260
                  ....*....|....*....|...
gi 568999406 1641 PTNYTLQLQAAVRDGTNISYSWT 1663
Cdd:COG3291   251 VTTSGANTAGTSTITGGTSGVVT 273
myxo_dep_M36 NF038112
myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family ...
1551-1879 3.09e-04

myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family have an M36 family metallopeptidase domain, like fungalysin (see PF02128), and a C-terminal MYXO-CTERM domain (see TIGR03901), suggesting processing and surface-anchoring by the still-unknown putative transpeptidase, myxosortase. Members of this family include MXAN_3564 (mepA), part of the effector cargo of outer membrane vesicles that the species produces in large numbers during predation on other microbes.


Pssm-ID: 468355 [Multi-domain]  Cd Length: 1597  Bit Score: 47.34  E-value: 3.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1551 RTVVPLNGSVSFStllEVGSDVHYSW--------VLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVgSAQDSIFIYV 1622
Cdd:NF038112 1201 RTTVTLNGSGSFD---PDGDPLTYAWtqvsgpavTLTGADTATPSFTAPEVTADTVLTFQLVVSDGTKT-SAPDTVTVLV 1276
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1623 LQFIEG-LQVAGGDNGCCFPTNYTLQLQAAVRDGTNISYSWTaQQEGSLITLFGSGKC---FSLTSLKASTYYVhLRATN 1698
Cdd:NF038112 1277 RNVNRApVAVAGAPATVDERSTVTLDGSGTDADGDALTYAWT-QTSGPAVTLTGATTAtatFTAPEVTADTQLT-FTLTV 1354
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1699 MLGSAAANRTIDFV------EPVESLILSASPNPAAVnMSLTLCAELAGGSGVVYTWYLEEGLSWK-TSMPSTTHTFAAP 1771
Cdd:NF038112 1355 SDGTASATDTVTVTvrnvnrAPVANAGADQTVDERST-VTLSGSATDPDGDALTYAWTQTAGPTVTlTGADTATASFTAP 1433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1772 glhlvRVTAENQLG---------------SVNATVEVAIQVPVGglsirtSEPDSIFVAAGSTLPFWGQL--AEGTNVTW 1834
Cdd:NF038112 1434 -----EVAADTELTfqltvsadgqasadvTVTVTVRNVNRAPVA------HAGESITVDEGSTVTLDASAtdPDGDTLTY 1502
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 568999406 1835 CWTLPGGSkdsqyiAVRFSTAGSFSLQLNASNAVSWVSAMYNLTV 1879
Cdd:NF038112 1503 AWTQVAGP------SVTLTGADSAKLTFTAPEVSADTTLTFSLTV 1541
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1405-1456 3.13e-04

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 42.26  E-value: 3.13e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 568999406  1405 RYTWDFGteDTThtQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEV 1456
Cdd:pfam18911   36 SYRWDFG--DGT--TATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
CLECT_VCBS cd03603
A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein ...
418-501 6.90e-04

A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins; CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in different orientations. In some CTLDs a loop extends to the adjoining domain to form a loop-swapped dimer.


Pssm-ID: 153073 [Multi-domain]  Cd Length: 118  Bit Score: 42.41  E-value: 6.90e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  418 HCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFSSVEgTEG--LDPRGEAfslESCQNWL 495
Cdd:cd03603     1 HFYKFVDGGMTWEAAQTLAES-LGGHLVTINSAEENDWLLSNFGGYGASWIGASDAA-TEGtwKWSDGEE---STYTNWG 75

                  ....*.
gi 568999406  496 PGEPHP 501
Cdd:cd03603    76 SGEPHN 81
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
61-108 1.46e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.15  E-value: 1.46e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 568999406    61 PSLRIpadataLDLSHNLLQtlDIGLLVNLSALVELDLS-NNRISTLEE 108
Cdd:pfam12799    1 PNLEV------LDLSNNQIT--DIPPLAKLPNLETLDLSgNNKITDLSD 41
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
271-344 1.59e-03


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 40.33  E-value: 1.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568999406   271 ASPGATLVGPHgPLASGQPADFHITSSLP----ISSTRWNFGDGSpEVDMASPaaTHFYVLPGSYHMTVVLALGAGSA 344
Cdd:pfam18911    2 AAPVADAGGDR-IVAEGETVTFDASASDDpdgdILSYRWDFGDGT-TATGANV--SHTYAAPGTYTVTLTVTDDSGAS 75
CLECT_TC14_like cd03601
C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 ...
435-530 1.85e-03

C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the acorn worm. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TC14 is homodimeric. The CTLD of TC14 binds D-galactose and D-fucose. TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays a role during budding, inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial-forming tissue. TC14-2 and TC14-3 show calcium-dependent galactose-binding activity. TC14-3 is a cytostatic factor that blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproduction. It may also act as a differentiation-inducing factor. Galactose inhibits the cytostatic activity of TC14-3. The gene for acorn worm PfG6 is gill-specific; PfG6 may be a secreted protein.


Pssm-ID: 153071  Cd Length: 119  Bit Score: 40.98  E-value: 1.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  435 QCRTWAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFSSVEGTEG--LDPRGEAFSLEScQNWLPGEP-HPATAEHCVRL- 510
Cdd:cd03601    20 RSRGMRLASLAMRDSEMRDAILAFTLVKGHGYWVGADNLQDGEYdfLWNDGVSLPTDS-DLWAPNEPsNPQSRQLCVQLw 98
                          90       100
                  ....*....|....*....|
gi 568999406  511 GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03601    99 SKYNLLDDEYCGRAKRVICE 118
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1296-1622 1.96e-03


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 43.89  E-value: 1.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1296 STCIPTqpSAQLMAHVTGDPVHYLfdWTFGDGSSNVTvhghPSVTHNFTRSGIFPLALVLSSHVNKAHYFTS-ICVEPEI 1374
Cdd:COG3291     7 SGCAPL--TVQFTDTSSGNATSYE--WDFGDGTTSTE----ANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKtITVGAPN 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1375 RNITLQPERQFVKLGDEARLVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFV 1454
Cdd:COG3291    79 PGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTS 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1455 EVQEPVLVTGIRINGSHVLELQQPYLLSAMGSGSPATYLWELGDGSQSEGpevTHIYSSTGDFTVRVSGWNEVSRSEAQL 1534
Cdd:COG3291   159 VSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTA---GVTTGATSGTSGTGSATSGVAVTDVTL 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406 1535 NITVKQRVRGLTINASRTVVPLNGSVSFSTllEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSA 1614
Cdd:COG3291   236 TGISTGDAGTPGTNTVTTSGANTAGTSTIT--GGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGTATLA 313

                  ....*...
gi 568999406 1615 QDSIFIYV 1622
Cdd:COG3291   314 VSSTLTTN 321
LRR smart00370
Leucine-rich repeats, outliers;
90-113 2.06e-03


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 38.10  E-value: 2.06e-03
                            10        20
                    ....*....|....*....|....
gi 568999406     90 LSALVELDLSNNRISTLEEGVFAN 113
Cdd:smart00370    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
90-113 2.06e-03


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 38.10  E-value: 2.06e-03
                            10        20
                    ....*....|....*....|....
gi 568999406     90 LSALVELDLSNNRISTLEEGVFAN 113
Cdd:smart00369    1 LPNLRELDLSNNQLSSLPPGAFQG 24
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
2072-2124 2.22e-03


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.95  E-value: 2.22e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568999406  2072 SARFEAATS--PSPRRVTYHWDFGDGTPVQKTEefwADHYYLRPGDYHVEVNATN 2124
Cdd:pfam18911   19 TVTFDASASddPDGDILSYRWDFGDGTTATGAN---VSHTYAAPGTYTVTLTVTD 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1890-1963 3.31e-03

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 39.36  E-value: 3.31e-03
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568999406   1890 ASSKVVAPGQPVHFEI-LLAAGSALTFRLQVGGSvpEVLPSPHFSHSFFRVGDHLVNVQAENHVSHAQAQVRILV 1963
Cdd:smart00089    6 ASPTVGVAGESVTFTAtSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
933-1000 3.48e-03

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 38.91  E-value: 3.48e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568999406   933 PSPEARVL-QGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLTASNHVSNIT 1000
Cdd:pfam00801    1 VSASGTVVaAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSAN 69
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1022-1116 4.15e-03


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.18  E-value: 4.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568999406  1022 PTVLPPNATLALTGGVLVDS-AVEVAFLWNFGDGeqvlrqfkppydesfqvpdpTVAQVLvehNTTHIYTTPGEYNLTVL 1100
Cdd:pfam18911   11 DRIVAEGETVTFDASASDDPdGDILSYRWDFGDG--------------------TTATGA---NVSHTYAAPGTYTVTLT 67
                           90
                   ....*....|....*...
gi 568999406  1101 VSNTY--ENLTQQVTVSV 1116
Cdd:pfam18911   68 VTDDSgaSNSTATDTVTV 85
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
93-126 4.23e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.61  E-value: 4.23e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 568999406    93 LVELDLSNNRISTLEEgvFANLFNLSEINLSGNP 126
Cdd:pfam12799    3 LEVLDLSNNQITDIPP--LAKLPNLETLDLSGNN 34
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1641-1709 4.40e-03

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 38.97  E-value: 4.40e-03
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568999406   1641 PTNYTLQLQAAVR-DGTNISYSWTaqqegslitlFG-----SGKCFSLTSLKASTYYVHLRATNMLGSAAANRTI 1709
Cdd:smart00089   12 VAGESVTFTATSSdDGSIVSYTWD----------FGdgtssTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
929-1001 4.99e-03

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 38.63  E-value: 4.99e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406  929 LRAVPSPEARVLQGILVRYSPMVEA-GSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLTASNHVSNITV 1001
Cdd:cd00146     1 PTASVSAPPVAELGASVTFSASDSSgGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSST 74
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
2072-2138 9.60e-03


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 41.58  E-value: 9.60e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568999406 2072 SARFEAATSPSPrrVTYHWDFGDGT------PVqkteefwadHYYLRPGDYHVEVNATNLV-SFFVAQATVTVQ 2138
Cdd:COG3291    13 TVQFTDTSSGNA--TSYEWDFGDGTtsteanPS---------HTYTTPGTYTVTLTVTDAAgCSDTTTKTITVG 75
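The alignments listed above can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical positions in one query/model pair. It assumes that lowercase letters and dashes mark columns that are not aligned to the domain model, so only columns where both rows carry an uppercase residue are counted. The function name alignment_identity is hypothetical; the two example rows are copied from the smart00370 hit (residues 90-113).

```python
def alignment_identity(query, subject):
    """Return (identical, aligned) column counts for two aligned rows.

    Assumption: lowercase letters and '-' mark unaligned or gapped
    columns, so only columns with uppercase residues in both rows count.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        both_aligned = q.isalpha() and q.isupper() and s.isalpha() and s.isupper()
        if not both_aligned:
            continue  # skip gaps and lowercase (unaligned) residues
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the smart00370 alignment in this report.
query = "LSALVELDLSNNRISTLEEGVFAN"
subject = "LPNLRELDLSNNQLSSLPPGAFQG"

identical, aligned = alignment_identity(query, subject)
print(f"{identical}/{aligned} identical aligned positions")
```

Identity over short repeats such as this one is only a rough guide; the bit score and E-value reported with each hit remain the primary measures of significance.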
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
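
Every hit in this report carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As an illustration only, the Python sketch below parses such lines and checks each hit against the E-value threshold of 0.01 given in the preset options. The regular expression and the names SUMMARY_RE and parse_summary are assumptions made for this example, not part of any NCBI tool; the three sample lines are copied from hits in this report.

```python
import re

# Assumed layout of a hit summary line, based on the lines in this report.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Sample lines copied from this report.
lines = [
    "Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 42.26  E-value: 3.13e-04",
    "Pssm-ID: 153071  Cd Length: 119  Bit Score: 40.98  E-value: 1.85e-03",
    "Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 41.58  E-value: 9.60e-03",
]

E_VALUE_THRESHOLD = 0.01  # from the preset options of this search

hits = [parse_summary(line) for line in lines]
passing = [h for h in hits if h["e_value"] <= E_VALUE_THRESHOLD]

for h in passing:
    print(h["pssm_id"], h["bit_score"], h["e_value"])
```

Because the report is already filtered at this threshold, all listed hits are expected to pass; the check is mainly useful when re-filtering with a stricter cutoff.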

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.