|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N super family |
cl24184 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
10-222 |
1.01e-108 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats). The actual alignment was detected with superfamily member pfam06484:
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 352.36 E-value: 1.01e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 10 KPSAEAGRPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSG 86
Cdd:pfam06484 154 KSDNENGPPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQ 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 87 PPNHHSQSTLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLET 163
Cdd:pfam06484 229 PPNFQNHSRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLET 308
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081759 164 RHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 222
Cdd:pfam06484 309 RHFLFKTGTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL super family |
cl18310 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1079-1409 |
7.56e-47 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats. The actual alignment was detected with superfamily member cd14953:
Pssm-ID: 302697 [Multi-domain] Cd Length: 323 Bit Score: 172.33 E-value: 7.56e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1152
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1153 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1230
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1231 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1308
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1388
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 1907081759 1389 AVAPDGTIYIADLGNIRIRAV 1409
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2529-2606 |
3.96e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus. :
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 3.96e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081759 2529 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2606
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1370-2307 |
5.63e-32 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only]; :
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 137.19 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1370 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1446
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1447 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1526
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1527 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1606
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1607 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1686
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1687 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1766
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1767 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1838
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1839 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 1918
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1919 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 1998
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1999 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2072
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2073 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2152
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2153 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2232
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081759 2233 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2307
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
682-712 |
2.63e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids. :
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.36 E-value: 2.63e-08
10 20 30
....*....|....*....|....*....|.
gi 1907081759 682 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 712
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| C_rich_MXAN6577 super family |
cl49352 |
MXAN_6577-like cysteine-rich domain; |
492-638 |
5.54e-07 |
|
MXAN_6577-like cysteine-rich domain; The actual alignment was detected with superfamily member NF041328:
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 51.30 E-value: 5.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 492 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 561
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 562 gtylpdsgLCSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 635
Cdd:NF041328 75 --------DTASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 1907081759 636 REG 638
Cdd:NF041328 140 RGG 142
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
428-450 |
5.87e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. :
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.25 E-value: 5.87e-04
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
646-679 |
3.47e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements. :
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.47e-03
10 20 30
....*....|....*....|....*....|....*.
gi 1907081759 646 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 679
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| I-EGF_1 |
pfam18372 |
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ... |
459-476 |
9.00e-03 |
|
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveal an epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains, this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end. :
Pssm-ID: 465729 Cd Length: 29 Bit Score: 35.93 E-value: 9.00e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
10-222 |
1.01e-108 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 352.36 E-value: 1.01e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 10 KPSAEAGRPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSG 86
Cdd:pfam06484 154 KSDNENGPPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQ 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 87 PPNHHSQSTLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLET 163
Cdd:pfam06484 229 PPNFQNHSRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLET 308
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081759 164 RHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 222
Cdd:pfam06484 309 RHFLFKTGTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1079-1409 |
7.56e-47 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 172.33 E-value: 7.56e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1152
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1153 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1230
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1231 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1308
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1388
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 1907081759 1389 AVAPDGTIYIADLGNIRIRAV 1409
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2529-2606 |
3.96e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 3.96e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081759 2529 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2606
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1370-2307 |
5.63e-32 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 137.19 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1370 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1446
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1447 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1526
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1527 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1606
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1607 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1686
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1687 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1766
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1767 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1838
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1839 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 1918
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1919 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 1998
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1999 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2072
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2073 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2152
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2153 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2232
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081759 2233 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2307
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1079-1409 |
2.94e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 69.66 E-value: 2.94e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1154
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1155 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1231
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1232 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1308
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1388
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 1907081759 1389 AVAPDGTIYIADLGNIRIRAV 1409
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2230-2307 |
6.51e-10 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 57.51 E-value: 6.51e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2230 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2305
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
..
gi 1907081759 2306 SN 2307
Cdd:TIGR03696 70 NW 71
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
682-712 |
2.63e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.36 E-value: 2.63e-08
10 20 30
....*....|....*....|....*....|.
gi 1907081759 682 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 712
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
492-638 |
5.54e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 51.30 E-value: 5.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 492 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 561
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 562 gtylpdsgLCSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 635
Cdd:NF041328 75 --------DTASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 1907081759 636 REG 638
Cdd:NF041328 140 RGG 142
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
423-582 |
5.78e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 53.47 E-value: 5.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 423 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 472
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 473 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 521
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 522 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 573
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 1907081759 574 ------------DPN---WMGPDC 582
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1521-1557 |
4.98e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.59 E-value: 4.98e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1907081759 1521 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1557
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
428-450 |
5.87e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.25 E-value: 5.87e-04
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
519-548 |
2.14e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 38.00 E-value: 2.14e-03
10 20 30
....*....|....*....|....*....|....*
gi 1907081759 519 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 548
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
646-679 |
3.47e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.47e-03
10 20 30
....*....|....*....|....*....|....*.
gi 1907081759 646 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 679
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
587-659 |
3.57e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.13 E-value: 3.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 587 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 654
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 1907081759 655 GNGRC 659
Cdd:NF041328 122 DPLHC 126
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1382-1473 |
8.63e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 40.74 E-value: 8.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1382 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1452
Cdd:cd14963 55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
|
90 100
....*....|....*....|....*....
gi 1907081759 1453 LYNF--------TYSADNDVTelIDNNGN 1473
Cdd:cd14963 134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
|
|
| I-EGF_1 |
pfam18372 |
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ... |
459-476 |
9.00e-03 |
|
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveal an epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains, this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end.
Pssm-ID: 465729 Cd Length: 29 Bit Score: 35.93 E-value: 9.00e-03
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
431-541 |
9.22e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 38.97 E-value: 9.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 431 NGECVSglchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGSCIDGNC 504
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGAACAPGQ 94
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 1907081759 505 VCAAGYKGEHCEE--VDCldptcssHGVCVN--------GEC--LCSPG 541
Cdd:NF041328 95 VCEGGACREACSEglTRC-------GGACVDlatdplhcGACgvACDPG 136
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
10-222 |
1.01e-108 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 352.36 E-value: 1.01e-108
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 10 KPSAEAGRPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSG 86
Cdd:pfam06484 154 KSDNENGPPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQ 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 87 PPNHHSQSTLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLET 163
Cdd:pfam06484 229 PPNFQNHSRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLET 308
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081759 164 RHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 222
Cdd:pfam06484 309 RHFLFKTGTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1079-1409 |
7.56e-47 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 172.33 E-value: 7.56e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1152
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1153 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1230
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1231 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1308
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1388
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 1907081759 1389 AVAPDGTIYIADLGNIRIRAV 1409
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1130-1410 |
1.77e-40 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 153.84 E-value: 1.77e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1130 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGEqclpfdEARCGDGGKAvdATLMSPRGIAVDKNGLMY 1209
Cdd:cd14953 28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1210 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1285
Cdd:cd14953 92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1286 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1365
Cdd:cd14953 165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1907081759 1366 ncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1410
Cdd:cd14953 229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2529-2606 |
3.96e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 3.96e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081759 2529 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2606
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1370-2307 |
5.63e-32 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 137.19 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1370 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1446
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1447 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1526
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1527 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1606
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1607 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1686
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1687 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1766
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1767 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1838
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1839 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 1918
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1919 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 1998
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1999 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2072
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2073 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2152
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2153 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2232
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081759 2233 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2307
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1167-1410 |
5.78e-32 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 128.80 E-value: 5.78e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1167 VVAGTGeqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1239
Cdd:cd14953 3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1240 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1317
Cdd:cd14953 72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1318 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIY 1397
Cdd:cd14953 136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGGA-----------GYAGDGPATAAQFNNPTGVAVDAAGNLY 201
|
250
....*....|...
gi 1907081759 1398 IADLGNIRIRAVS 1410
Cdd:cd14953 202 VADRGNHRIRKIT 214
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1130-1428 |
1.18e-17 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 85.45 E-value: 1.18e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1130 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMY 1209
Cdd:cd05819 13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1210 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1285
Cdd:cd05819 70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1286 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1365
Cdd:cd05819 135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907081759 1366 ncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1428
Cdd:cd05819 187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1049-1219 |
1.96e-17 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 86.04 E-value: 1.96e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1049 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 1121
Cdd:cd14953 163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1122 ---NSPghkYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeQCLPfdearcGDGGKAVDATLMSPR 1198
Cdd:cd14953 239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
|
170 180
....*....|....*....|...
gi 1907081759 1199 GIAVDKNGLMYFVDAT--MIRKV 1219
Cdd:cd14953 301 GVAVDAAGNLYVADTGnnRIRKI 323
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1074-1407 |
2.34e-16 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 81.60 E-value: 2.34e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1074 NKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1151
Cdd:cd05819 5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEP---AGVAVDS-DGNLYVADTGNHRIQK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1152 VkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 1229
Cdd:cd05819 81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1230 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1307
Cdd:cd05819 139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1308 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncicysgdDAYATDAILNSPSS 1387
Cdd:cd05819 194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
|
330 340
....*....|....*....|
gi 1907081759 1388 LAVAPDGTIYIADLGNIRIR 1407
Cdd:cd05819 248 LAVDSDGNLYVADTGNNRIQ 267
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1068-1279 |
2.97e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 78.51 E-value: 2.97e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1068 NGLAEGNkLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVS 1142
Cdd:cd05819 94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQF---NGP---TGVAVDS-DGNIYVA 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1143 DTNSRRIYRVKSlsgakdlagNSEVVAGTGEQClpfdearcgdggkAVDATLMSPRGIAVDKNGLMYFVDATM--IRKVD 1220
Cdd:cd05819 166 DTGNHRIQVFDP---------DGNFLTTFGSTG-------------TGPGQFNYPTGIAVDSDGNIYVADSGNnrVQVFD 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081759 1221 QNGIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1279
Cdd:cd05819 224 PDGAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1191-1422 |
7.42e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 77.36 E-value: 7.42e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1191 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1268
Cdd:cd05819 4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1269 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1346
Cdd:cd05819 69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081759 1347 GEIcllagaasdcdckndVNCICYSGddayATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1422
Cdd:cd05819 132 GEF---------------LTTFGSGG----SGPGQFNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1075-1340 |
1.02e-13 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 73.89 E-value: 1.02e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1075 KLLAPVALAVGIDGSLFVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRI 1149
Cdd:cd05819 53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1150 YRVkslsgakDLAGNSEVVAGtgeqclpfdearcgdGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 1227
Cdd:cd05819 126 QKF-------DPDGEFLTTFG---------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1228 LLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1305
Cdd:cd05819 184 TFGST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
|
250 260 270
....*....|....*....|....*....|....*
gi 1907081759 1306 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1340
Cdd:cd05819 235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1079-1409 |
2.94e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 69.66 E-value: 2.94e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1154
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1155 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1231
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1232 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1308
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1388
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 1907081759 1389 AVAPDGTIYIADLGNIRIRAV 1409
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2230-2307 |
6.51e-10 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 57.51 E-value: 6.51e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 2230 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2305
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
..
gi 1907081759 2306 SN 2307
Cdd:TIGR03696 70 NW 71
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1116-1439 |
1.92e-09 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 61.19 E-value: 1.92e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1116 KEFKHSNSPGHKYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAgnsevvagTGEqclpFDEARCGDGGkavdatlm 1195
Cdd:COG4257 8 TEYPVPAPGSGPRDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS-------- 59
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1196 SPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV-- 1270
Cdd:COG4257 60 GPHGIAVDPDGNLWFTDngNNRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtd 119
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1271 LENNVILRIT-ENHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGE 1348
Cdd:COG4257 120 QGGNRIGRLDpATGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGT 177
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1349 IcllagaasdcdckndvncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPG 1428
Cdd:COG4257 178 L------------------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGG 229
|
330
....*....|...
gi 1907081759 1429 EQELY--VFNADG 1439
Cdd:COG4257 230 GARPYgvAVDGDG 242
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1074-1349 |
1.98e-08 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 58.11 E-value: 1.98e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1074 NKLLAPVALAVGIDGSLFVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1151
Cdd:COG4257 56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1152 VkslsgakDLAGNsEVVAGTgeqcLPFDEARcgdggkavdatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 1228
Cdd:COG4257 128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1229 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1306
Cdd:COG4257 182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1907081759 1307 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1349
Cdd:COG4257 224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
682-712 |
2.63e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.36 E-value: 2.63e-08
10 20 30
....*....|....*....|....*....|.
gi 1907081759 682 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 712
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1130-1406 |
8.20e-08 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 55.68 E-value: 8.20e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1130 LAVDPvTGSLYVSDTNSRRIYRvkslsgakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLM 1208
Cdd:cd14952 15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1209 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1286
Cdd:cd14952 66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1287 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEkkiNRLRQvttngeicLLAGAASdcdckndvn 1366
Cdd:cd14952 122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSTT--------- 168
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1907081759 1367 cicysgddayATD---AILNSPSSLAVAPDGTIYIADLGNIRI 1406
Cdd:cd14952 169 ----------QTVlpfTGLNSPSGVAVDTAGNVYVTDHGNNRV 201
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
492-638 |
5.54e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 51.30 E-value: 5.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 492 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 561
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 562 gtylpdsgLCSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 635
Cdd:NF041328 75 --------DTASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 1907081759 636 REG 638
Cdd:NF041328 140 RGG 142
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
423-582 |
5.78e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 53.47 E-value: 5.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 423 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 472
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 473 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 521
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 522 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 573
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 1907081759 574 ------------DPN---WMGPDC 582
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1194-1473 |
5.38e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 50.73 E-value: 5.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1194 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1271
Cdd:cd14957 17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1272 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1348
Cdd:cd14957 82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1349 ICllagaasdcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1428
Cdd:cd14957 144 FS-------------------YSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 1907081759 1429 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYsadnDVTelIDNNGN 1473
Cdd:cd14957 184 -----VFTSSGTFQYTFgSSGSGPGQFSDPY----GIA--VDSDGN 218
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1079-1407 |
1.48e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 49.19 E-value: 1.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIyRVK 1153
Cdd:cd14957 20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1154 SLSGAKDLAgnsevVAGTGEQCLPFDEarcgdggkavdatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 1231
Cdd:cd14957 92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1232 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1308
Cdd:cd14957 151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1309 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncicySGDDAYA------TDAIL 1382
Cdd:cd14957 201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
|
330 340
....*....|....*....|....*
gi 1907081759 1383 NSPSSLAVAPDGTIYIADLGNIRIR 1407
Cdd:cd14957 253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1371-1412 |
1.90e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 49.45 E-value: 1.90e-05
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1907081759 1371 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1412
Cdd:cd14953 11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1521-1557 |
4.98e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.59 E-value: 4.98e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1907081759 1521 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1557
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1076-1270 |
8.65e-05 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 46.82 E-value: 8.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1076 LLAPVALAVGIDGSLFVGDFNYIR--RIFPSRNVTSILELRNKEFKHSnspghkyyLAVDPVtGSLYVSDTNSRRIYRVK 1153
Cdd:cd14952 51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLPFTGLNDPTG--------VAVDAA-GNVYVADTGNNRVLKLA 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1154 S------------LSGAKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKN 1205
Cdd:cd14952 122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081759 1206 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1270
Cdd:cd14952 189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
|
|
| Keratin_B2 |
pfam01500 |
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ... |
515-679 |
1.80e-04 |
|
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
Pssm-ID: 366678 [Multi-domain] Cd Length: 161 Bit Score: 44.40 E-value: 1.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 515 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCelarvqCPDQCSGHGTYLPDSGLCSCDPNWMGPDCSVEVCSVDCGTH 594
Cdd:pfam01500 4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC------CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQT 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 595 GVCIGGACRCEEGWTGAA----CDQRVCHPRCIEHGTCKdGKCecregwngehCTIDGCPDLCngngrCTLGQNSWQCvC 670
Cdd:pfam01500 76 SSCQTGCGGIGYGQEGSSgavsSRTRWCRPDCRVEGTCL-PPC----------CVVSCTPPTC-----CQLHHAQASC-C 138
|
....*....
gi 1907081759 671 QTGWRGPGC 679
Cdd:pfam01500 139 RPSYCGQSC 147
|
|
| YvrE |
COG3386 |
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ... |
1082-1242 |
3.22e-04 |
|
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway
Pssm-ID: 442613 [Multi-domain] Cd Length: 266 Bit Score: 44.88 E-value: 3.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1082 LAVGIDGSLFVGDFNY------IRRIFPSRNVTSILElrnkEFKHSNSpghkyyLAVDPVTGSLYVSDTNSRRIYRVksl 1155
Cdd:COG3386 98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF--- 164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1156 sgakDLAGNSEVVAGTgeqclPFDEARCGDGGkavdatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 1233
Cdd:COG3386 165 ----DLDADGTLGNRR-----VFADLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223
|
....*....
gi 1907081759 1234 LTAVRPLSC 1242
Cdd:COG3386 224 LPERRPTNV 232
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
428-450 |
5.87e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.25 E-value: 5.87e-04
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
604-646 |
1.42e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.76 E-value: 1.42e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907081759 604 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 646
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
493-515 |
2.08e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 37.71 E-value: 2.08e-03
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
519-548 |
2.14e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 38.00 E-value: 2.14e-03
10 20 30
....*....|....*....|....*....|....*
gi 1907081759 519 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 548
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1069-1233 |
2.60e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 42.28 E-value: 2.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1069 GLAEGnKLLAPVALAVGIDGSLFVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNSPGHkyyLAVDpvT 1136
Cdd:cd14963 49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1137 GSLYVSDTNSRRIYrvkslsgakdlagnseVVAGTGEQCLPFDEARCGDGgkavdaTLMSPRGIAVDKNGLMYFVDATMI 1216
Cdd:cd14963 112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
|
170 180
....*....|....*....|
gi 1907081759 1217 R-KV-DQNG-IISTLLGSND 1233
Cdd:cd14963 170 RiQVfDKNGkFIKELNGSPD 189
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1521-1563 |
2.76e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.57 E-value: 2.76e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 1907081759 1521 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1563
Cdd:TIGR01643 1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| NHL_like_6 |
cd14962 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1128-1332 |
2.97e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271332 [Multi-domain] Cd Length: 271 Bit Score: 42.19 E-value: 2.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1128 YYLAVDPvTGSLYVSDTNSRRIYRvkslsgakdlagnsevvagtgeqclpFDEARcgdG-----GKAVDATLMSPRGIAV 1202
Cdd:cd14962 15 YGVAADG-RGRIYVADTGRGAVFV--------------------------FDLPN---GkvfviGNAGPNRFVSPIGVAI 64
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1203 DKNGLMYFVDAT--MIRKVDQNGIISTLLGSNDLtavrplscdssmdvaQVRlewPTDLAVNPMDNSLYVLEnnvilriT 1280
Cdd:cd14962 65 DANGNLYVSDAElgKVFVFDRDGKFLRAIGAGAL---------------FKR---PTGIAVDPAGKRLYVVD-------T 119
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907081759 1281 ENHQVSIIAGRPMHCQVPGIDYSlsklaIHSALESASAIAISHTGVLYITET 1332
Cdd:cd14962 120 LAHKVKVFDLDGRLLFDIGKRGS-----GPGEFNLPTDLAVDRDGNLYVTDT 166
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
558-582 |
3.03e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 37.33 E-value: 3.03e-03
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
486-516 |
3.11e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.11e-03
10 20 30
....*....|....*....|....*....|....*.
gi 1907081759 486 NQCIDPS-CGGHGSCIDG----NCVCAAGYKGEHCE 516
Cdd:cd00054 3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
646-679 |
3.47e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.47e-03
10 20 30
....*....|....*....|....*....|....*.
gi 1907081759 646 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 679
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
587-659 |
3.57e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.13 E-value: 3.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 587 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 654
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 1907081759 655 GNGRC 659
Cdd:NF041328 122 DPLHC 126
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
33-197 |
5.43e-03 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 41.72 E-value: 5.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 33 LPSSHNPPPVScQMPLLDSNTSHQIMDTNPDEEF----SPNSYLLRACSGPQQ---ASSSGPPNHHSQSTLRPPLPPPhn 105
Cdd:pfam15279 127 APKPHEPPSLP-PPPLPPKKGRRHRPGLHPPLGRppgsPPMSMTPRGLLGKPQqhpPPSPLPAFMEPSSMPPPFLRPP-- 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 106 htlsHHHSSANSlnrnSLTNRRSQIHAPAPAP-NDLATTPEsvqlqdswvlnsnvPLEtRHFLFKTSSGSTPLFSSSSPG 184
Cdd:pfam15279 204 ----PSIPQPNS----PLSNPMLPGIGPPPKPpRNLGPPSN--------------PMH-RPPFSPHHPPPPPTPPGPPPG 260
|
170
....*....|...
gi 1907081759 185 YPLTSGTVYTPPP 197
Cdd:pfam15279 261 LPPPPPRGFTPPF 273
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1079-1212 |
6.76e-03 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 40.65 E-value: 6.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1079 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNSPGHkyyLAVDPvTGSLYVSDTNSRRIYRvksls 1156
Cdd:cd14952 138 PDGVAVDGAGNVYVTDTgnNRVLKLAAGSTTQTVLP-----FTGLNSPSG---VAVDT-AGNVYVTDHGNNRVLK----- 203
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081759 1157 gakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLMYFVD 1212
Cdd:cd14952 204 ----LA------AGSTTPTvLPFTG-------------LNGPLGVAVDAAGNVYVAD 237
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1324-1409 |
7.82e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 41.02 E-value: 7.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1324 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncicySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1398
Cdd:cd14951 206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
|
90
....*....|.
gi 1907081759 1399 ADLGNIRIRAV 1409
Cdd:cd14951 264 ADTNNHRIRRL 274
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1382-1473 |
8.63e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 40.74 E-value: 8.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 1382 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1452
Cdd:cd14963 55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
|
90 100
....*....|....*....|....*....
gi 1907081759 1453 LYNF--------TYSADNDVTelIDNNGN 1473
Cdd:cd14963 134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
|
|
| I-EGF_1 |
pfam18372 |
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ... |
459-476 |
9.00e-03 |
|
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveal an epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains, this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end.
Pssm-ID: 465729 Cd Length: 29 Bit Score: 35.93 E-value: 9.00e-03
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
431-541 |
9.22e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 38.97 E-value: 9.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081759 431 NGECVSglchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGSCIDGNC 504
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGAACAPGQ 94
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 1907081759 505 VCAAGYKGEHCEE--VDCldptcssHGVCVN--------GEC--LCSPG 541
Cdd:NF041328 95 VCEGGACREACSEglTRC-------GGACVDlatdplhcGACgvACDPG 136
|
|
|