NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|20149764|ref|NP_619614|]
View 

stabilin-2 precursor [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Link_Domain super family cl02612
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2206-2297 8.38e-38

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain comprised of a link module flanked with N-and C- extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) which contains link modules 3 and 4 which lack HA-binding activity. HAPLNs contain two contiguous link modules.


The actual alignment was detected with superfamily member cd03515:

Pssm-ID: 470631  Cd Length: 93  Bit Score: 137.59  E-value: 8.38e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2206 GVFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:cd03515    1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGfGHVGIVDYGP 80
                         90
                 ....*....|...
gi 20149764 2285 RTNKSEMWDVFCY 2297
Cdd:cd03515   81 RLNLSERWDAYCY 93
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1758-1889 4.06e-30

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 116.58  E-value: 4.06e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1758 HGYTKFSKLIQDSGLLKVITDPMHtPVTLFWPTDKALQALPQEQQDFLFNedNKDKLKAYLKFHVIRDTMaLASDLPRSA 1837
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQG-PFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPGRL-TSSDLKNGG 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 20149764   1838 SWKTLQGSELSVRCGTGSdvgeLFLNGqmCRIIQRRLLFDGGVAYGIDCLLM 1889
Cdd:pfam02469   77 TLATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1628-1733 1.74e-28

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 111.96  E-value: 1.74e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1628 VQELAGP-GPFTVFVPSSDSFN---SESKLKVWDKQGLMSQILRYHVVACQqLLLENLKVITSATTLQGEPISISVSQDT 1703
Cdd:pfam02469   16 VDTLNGSqGPFTVFAPTNEAFAklpAGTLNFLLKDKEQLKNLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGS 94
                           90       100       110
                   ....*....|....*....|....*....|
gi 20149764   1704 VLINKkAKVLSSDIISTNGVIHVIDTLLSP 1733
Cdd:pfam02469   95 VTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 2.04e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 2.04e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 20149764   1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 7.33e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 7.33e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 20149764    613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 1.17e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 1.17e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 20149764   1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 1.18e-21

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 1.18e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 20149764    465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2363-2452 5.20e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


:

Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.22  E-value: 5.20e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    2363 TLFVPQNSGL----PKNKSLSG----RDIEHHLTNVNVSfYDDLVNGTVLKTRLGSQLLITSSQDQlhqEARFVDGRAIL 2434
Cdd:smart00554    1 TVFAPTDEAFqklpPDLNSLLAdklkNLLLYHVVPGRLS-SADLLNGGTLPTLAGSKLRITRSGGS---GTVTVNGARIV 76
                            90
                    ....*....|....*...
gi 20149764    2435 QWDIIASNGVLHIISEPL 2452
Cdd:smart00554   77 EADIAATNGVVHVIDRVL 94
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 6.51e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 50.68  E-value: 6.51e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1566-1602 2.75e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1566 CLTNNGGCSPFAFCNHTEqDQRTCTCKPDYTGDGIVC 1602
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2135-2172 4.49e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 4.49e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 20149764   2135 CANGvNGGCHEHATCRMTgPGKQKCECKSHYVGDGRDC 2172
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 9.94e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.51  E-value: 9.94e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2094-2129 1.62e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.62e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764   2094 CKQNNGGCAKVAKCSQKGTQVSCSCQKGYKGDGHSC 2129
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 5.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 5.17e-05
                           10        20
                   ....*....|....*....|....*....
gi 20149764    844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 3.31e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 3.31e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 4.36e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 4.36e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 20149764    927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 1.35e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.35e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 2.71e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.71e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 20149764    254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 4.23e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.81  E-value: 4.23e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764    881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1978-2022 4.33e-03

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 37.33  E-value: 4.33e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 20149764   1978 CNNRGMCYDQ-YKPTGQCQCHTGFNGTACELCLPGRFG-PDCQPCGC 2022
Cdd:pfam00053    3 CNPHGSLSDTcDPETGQCLCKPGVTGRHCDRCKPGYYGlPSDPPQGC 49
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2206-2297 8.38e-38

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages. Stabilin-2 is absent from the latter. In macrophages: stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 137.59  E-value: 8.38e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2206 GVFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:cd03515    1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGfGHVGIVDYGP 80
                         90
                 ....*....|...
gi 20149764 2285 RTNKSEMWDVFCY 2297
Cdd:cd03515   81 RLNLSERWDAYCY 93
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1758-1889 4.06e-30

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 116.58  E-value: 4.06e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1758 HGYTKFSKLIQDSGLLKVITDPMHtPVTLFWPTDKALQALPQEQQDFLFNedNKDKLKAYLKFHVIRDTMaLASDLPRSA 1837
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQG-PFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPGRL-TSSDLKNGG 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 20149764   1838 SWKTLQGSELSVRCGTGSdvgeLFLNGqmCRIIQRRLLFDGGVAYGIDCLLM 1889
Cdd:pfam02469   77 TLATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1628-1733 1.74e-28

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 111.96  E-value: 1.74e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1628 VQELAGP-GPFTVFVPSSDSFN---SESKLKVWDKQGLMSQILRYHVVACQqLLLENLKVITSATTLQGEPISISVSQDT 1703
Cdd:pfam02469   16 VDTLNGSqGPFTVFAPTNEAFAklpAGTLNFLLKDKEQLKNLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGS 94
                           90       100       110
                   ....*....|....*....|....*....|
gi 20149764   1704 VLINKkAKVLSSDIISTNGVIHVIDTLLSP 1733
Cdd:pfam02469   95 VTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1628-1733 9.07e-26

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 105.76  E-value: 9.07e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1628 VQELAGPGPFTVFVPSSDSFNS------ESKLKVWDKQgLMSQILRYHVVAcQQLLLENLKVITSATTLQGEPISISVSQ 1701
Cdd:COG2335   56 VDTLSGEGPFTVFAPTDAAFAAlpagtlDALLKPENKA-TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG 133
                         90       100       110
                 ....*....|....*....|....*....|..
gi 20149764 1702 DTVLINKkAKVLSSDIISTNGVIHVIDTLLSP 1733
Cdd:COG2335  134 GGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Xlink pfam00193
Extracellular link domain;
2206-2297 9.32e-26

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 103.04  E-value: 9.32e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   2206 GVFHLRSPlGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:pfam00193    1 GVFHLESP-GRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGgNMPGVRQYGF 79
                           90
                   ....*....|...
gi 20149764   2285 RTNKSEMWDVFCY 2297
Cdd:pfam00193   80 RDPLSERYDAYCY 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 2.04e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 2.04e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 20149764   1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 7.33e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 7.33e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 20149764    613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 1.17e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 1.17e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 20149764   1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 1.18e-21

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 1.18e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 20149764    465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-661 1.33e-21

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 1.33e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764  511 AMDKIEPTLESNPQQTIMTMLQ--PRYGKFRSLLEKTNVGQALEKGGidePYTIFVPSNEALSNMTAGVLDYLLSPEGSR 588
Cdd:COG2335   17 ASSAAAEGAAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKA 93
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20149764  589 KLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:COG2335   94 TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1638-1733 1.46e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 88.57  E-value: 1.46e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1638 TVFVPSSDSF-NSESKLKVWDKQgLMSQILRYHVVAcQQLLLENLKVITSATTLQGEPISISVSQD--TVLINKkAKVLS 1714
Cdd:smart00554    1 TVFAPTDEAFqKLPPDLNSLLAD-KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGGsgTVTVNG-ARIVE 77
                            90
                    ....*....|....*....
gi 20149764    1715 SDIISTNGVIHVIDTLLSP 1733
Cdd:smart00554   78 ADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1008-1137 7.75e-20

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 88.81  E-value: 7.75e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1008 ELSFLSEAavfyqwINNASLQSMLSATSNLTVLVPSLQAIKDMDQNEKSFWLSRNNIPAL---IKYHTLLGTYRVADLQT 1084
Cdd:COG2335   42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|...
gi 20149764 1085 LPSshmlATSLQGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:COG2335  116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1785-1891 1.34e-18

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 82.80  E-value: 1.34e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1785 TLFWPTDKALQALPQEQQDFLfnednKDKLKAYLKFHVIRDTMaLASDLPRSASWKTLQGSELSVRCGTGSDVgeLFLNG 1864
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVVPGRL-SSADLLNGGTLPTLAGSKLRITRSGGSGT--VTVNG 72
                            90       100
                    ....*....|....*....|....*..
gi 20149764    1865 QmcRIIQRRLLFDGGVAYGIDCLLMDP 1891
Cdd:smart00554   73 A--RIVEADIAATNGVVHVIDRVLLPP 97
LINK smart00445
Link (Hyaluronan-binding);
2206-2298 7.68e-18

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 80.46  E-value: 7.68e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    2206 GVFHLRsPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:smart00445    3 GVFHVE-KNGRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGgNLPGVRQYGF 81
                            90
                    ....*....|....
gi 20149764    2285 RTNKSeMWDVFCYR 2298
Cdd:smart00445   82 PDPTS-RYDAYCFN 94
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1729-1889 1.35e-17

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 82.26  E-value: 1.35e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1729 TLLSPQNLLITPKGASGRVLLNLTTVAANHG-YTKFSKLIQDSGLLKVITDPmhTPVTLFWPTDKALQALPQEQQDFLFN 1807
Cdd:COG2335   11 ALLAACASSAAAEGAAMAPTKNIVETAANNPdFSTLVAALKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGTLDALLK 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1808 EDNKDKLKAYLKFHVIRDTMAlASDLPRSASWKTLQGSELSVrcgTGSDvGELFLNGQmcRIIQRRLLFDGGVAYGIDCL 1887
Cdd:COG2335   89 PENKATLTKILTYHVVPGKVT-AADLKDGKTLTTLQGQTLTV---TVSG-GGVTVNGA--NVITADIEASNGVIHVIDKV 161

                 ..
gi 20149764 1888 LM 1889
Cdd:COG2335  162 LL 163
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1146-1273 3.45e-17

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 81.11  E-value: 3.45e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1146 PSLLTRLEQMPDYSIFRGYIIHYNLASAIEAADAYTVFVPNNEAIESYIREKKATSLKE-------DILQYHVVLGeKLL 1218
Cdd:COG2335   31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVVPG-KVT 109
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 20149764 1219 RNDLHNGMHRETMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:COG2335  110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
561-662 2.91e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 70.47  E-value: 2.91e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764     561 TIFVPSNEALSNMTAGvLDYLLSPegsrKLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISS-KGQILANNVA 639
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 20149764     640 VDETEVAAKNGRIYTLTGVLIPP 662
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1038-1137 7.48e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 69.31  E-value: 7.48e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1038 TVLVPSLQAIKDMDQNEKSfwLSRNNIPALIKYHTLLGTYRVADLQtlpsSHMLATSLQGSFLRL--DKADGNITIEGAS 1115
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNS--LLADKLKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSKLRItrSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 20149764    1116 FVDGDNAATNGVVHIINKVLIP 1137
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
390-510 2.89e-13

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 69.94  E-value: 2.89e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764  390 GQLTSFISILDRT-YAWPLSNLGPFTVLLPSDKGLKGVD---VKELLM--DKEAARYFVKLHIIAGQMSTEQMYNLDTFY 463
Cdd:COG2335   41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAFAALPagtLDALLKpeNKATLTKILTYHVVPGKVTAADLKDGKTLT 120
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 20149764  464 TLTGKSgeiinkdkdnqLKLKLYGSKIV----QIIQGNIVASNGLVHILDR 510
Cdd:COG2335  121 TLQGQT-----------LTVTVSGGGVTvngaNVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1181-1273 2.12e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 62.38  E-value: 2.12e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1181 TVFVPNNEAIESYIREKKA--TSLKEDILQYHVVLGeKLLRNDLHNGMHRETMLGFSylLAFFLHND--QLYVNEAPINY 1256
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSllADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSK--LRITRSGGsgTVTVNGARIVE 77
                            90
                    ....*....|....*..
gi 20149764    1257 TNVATDKGVIHGLEKVL 1273
Cdd:smart00554   78 ADIAATNGVVHVIDRVL 94
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
414-510 2.68e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.99  E-value: 2.68e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764     414 TVLLPSDKGLKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYTLTGKSGEIINKDKDNQLKLklygsKIVQI 493
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV-----NGARI 75
                            90
                    ....*....|....*..
gi 20149764     494 IQGNIVASNGLVHILDR 510
Cdd:smart00554   76 VEADIAATNGVVHVIDR 92
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2363-2452 5.20e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.22  E-value: 5.20e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    2363 TLFVPQNSGL----PKNKSLSG----RDIEHHLTNVNVSfYDDLVNGTVLKTRLGSQLLITSSQDQlhqEARFVDGRAIL 2434
Cdd:smart00554    1 TVFAPTDEAFqklpPDLNSLLAdklkNLLLYHVVPGRLS-SADLLNGGTLPTLAGSKLRITRSGGS---GTVTVNGARIV 76
                            90
                    ....*....|....*...
gi 20149764    2435 QWDIIASNGVLHIISEPL 2452
Cdd:smart00554   77 EADIAATNGVVHVIDRVL 94
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2363-2452 2.06e-08

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 54.57  E-value: 2.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   2363 TLFVPQN-----------SGLPKNKSLSGRDIEHHLTNVNVSfYDDLVNGTVLKTRLGSQLLITSSQDQLhqearFVDGR 2431
Cdd:pfam02469   27 TVFAPTNeafaklpagtlNFLLKDKEQLKNLLKYHVVPGRLT-SSDLKNGGTLATLQGSKLRVNVTGGSV-----TVNGA 100
                           90       100
                   ....*....|....*....|.
gi 20149764   2432 AILQWDIIASNGVLHIISEPL 2452
Cdd:pfam02469  101 RVVQADIEATNGVIHVIDKVL 121
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 6.51e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 50.68  E-value: 6.51e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1566-1602 2.75e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1566 CLTNNGGCSPFAFCNHTEqDQRTCTCKPDYTGDGIVC 1602
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
2302-2448 3.54e-06

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 49.52  E-value: 3.54e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2302 VNCTCKAGYVGDGFSCNGNLLQVLMSFPSLTNFLTEVlvfsrsSAQGraflkhLTD-LSISG--TLFVPQNSG---LPK- 2374
Cdd:COG2335   14 AACASSAAAEGAAMAPTKNIVETAANNPDFSTLVAAL------KAAG------LVDtLSGEGpfTVFAPTDAAfaaLPAg 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2375 --NKSLSGRDIE-------HHLTNVNVSFyDDLVNGTVLKTRLGSQLLITSSQDQLHqearfVDGRAILQWDIIASNGVL 2445
Cdd:COG2335   82 tlDALLKPENKAtltkiltYHVVPGKVTA-ADLKDGKTLTTLQGQTLTVTVSGGGVT-----VNGANVITADIEASNGVI 155

                 ...
gi 20149764 2446 HII 2448
Cdd:COG2335  156 HVI 158
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2135-2172 4.49e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 4.49e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 20149764   2135 CANGvNGGCHEHATCRMTgPGKQKCECKSHYVGDGRDC 2172
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 9.94e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.51  E-value: 9.94e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2094-2129 1.62e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.62e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764   2094 CKQNNGGCAKVAKCSQKGTQVSCSCQKGYKGDGHSC 2129
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 5.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 5.17e-05
                           10        20
                   ....*....|....*....|....*....
gi 20149764    844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 3.31e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 3.31e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 4.36e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 4.36e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 20149764    927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 1.35e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.35e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 2.71e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.71e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 20149764    254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 4.23e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.81  E-value: 4.23e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764    881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1978-2022 4.33e-03

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 37.33  E-value: 4.33e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 20149764   1978 CNNRGMCYDQ-YKPTGQCQCHTGFNGTACELCLPGRFG-PDCQPCGC 2022
Cdd:pfam00053    3 CNPHGSLSDTcDPETGQCLCKPGVTGRHCDRCKPGYYGlPSDPPQGC 49
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2206-2297 8.38e-38

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages. Stabilin-2 is absent from the latter. In macrophages: stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 137.59  E-value: 8.38e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2206 GVFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:cd03515    1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGfGHVGIVDYGP 80
                         90
                 ....*....|...
gi 20149764 2285 RTNKSEMWDVFCY 2297
Cdd:cd03515   81 RLNLSERWDAYCY 93
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1758-1889 4.06e-30

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 116.58  E-value: 4.06e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1758 HGYTKFSKLIQDSGLLKVITDPMHtPVTLFWPTDKALQALPQEQQDFLFNedNKDKLKAYLKFHVIRDTMaLASDLPRSA 1837
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQG-PFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPGRL-TSSDLKNGG 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 20149764   1838 SWKTLQGSELSVRCGTGSdvgeLFLNGqmCRIIQRRLLFDGGVAYGIDCLLM 1889
Cdd:pfam02469   77 TLATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1628-1733 1.74e-28

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 111.96  E-value: 1.74e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1628 VQELAGP-GPFTVFVPSSDSFN---SESKLKVWDKQGLMSQILRYHVVACQqLLLENLKVITSATTLQGEPISISVSQDT 1703
Cdd:pfam02469   16 VDTLNGSqGPFTVFAPTNEAFAklpAGTLNFLLKDKEQLKNLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGS 94
                           90       100       110
                   ....*....|....*....|....*....|
gi 20149764   1704 VLINKkAKVLSSDIISTNGVIHVIDTLLSP 1733
Cdd:pfam02469   95 VTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1628-1733 9.07e-26

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 105.76  E-value: 9.07e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1628 VQELAGPGPFTVFVPSSDSFNS------ESKLKVWDKQgLMSQILRYHVVAcQQLLLENLKVITSATTLQGEPISISVSQ 1701
Cdd:COG2335   56 VDTLSGEGPFTVFAPTDAAFAAlpagtlDALLKPENKA-TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG 133
                         90       100       110
                 ....*....|....*....|....*....|..
gi 20149764 1702 DTVLINKkAKVLSSDIISTNGVIHVIDTLLSP 1733
Cdd:COG2335  134 GGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Xlink pfam00193
Extracellular link domain;
2206-2297 9.32e-26

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 103.04  E-value: 9.32e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   2206 GVFHLRSPlGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:pfam00193    1 GVFHLESP-GRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGgNMPGVRQYGF 79
                           90
                   ....*....|...
gi 20149764   2285 RTNKSEMWDVFCY 2297
Cdd:pfam00193   80 RDPLSERYDAYCY 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 2.04e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 2.04e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 20149764   1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 7.33e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 7.33e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 20149764    613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 1.17e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 1.17e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 20149764   1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 1.18e-21

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 1.18e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 20149764    465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-661 1.33e-21

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 1.33e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764  511 AMDKIEPTLESNPQQTIMTMLQ--PRYGKFRSLLEKTNVGQALEKGGidePYTIFVPSNEALSNMTAGVLDYLLSPEGSR 588
Cdd:COG2335   17 ASSAAAEGAAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKA 93
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20149764  589 KLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:COG2335   94 TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1638-1733 1.46e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 88.57  E-value: 1.46e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1638 TVFVPSSDSF-NSESKLKVWDKQgLMSQILRYHVVAcQQLLLENLKVITSATTLQGEPISISVSQD--TVLINKkAKVLS 1714
Cdd:smart00554    1 TVFAPTDEAFqKLPPDLNSLLAD-KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGGsgTVTVNG-ARIVE 77
                            90
                    ....*....|....*....
gi 20149764    1715 SDIISTNGVIHVIDTLLSP 1733
Cdd:smart00554   78 ADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1008-1137 7.75e-20

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 88.81  E-value: 7.75e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1008 ELSFLSEAavfyqwINNASLQSMLSATSNLTVLVPSLQAIKDMDQNEKSFWLSRNNIPAL---IKYHTLLGTYRVADLQT 1084
Cdd:COG2335   42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|...
gi 20149764 1085 LPSshmlATSLQGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:COG2335  116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1785-1891 1.34e-18

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 82.80  E-value: 1.34e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1785 TLFWPTDKALQALPQEQQDFLfnednKDKLKAYLKFHVIRDTMaLASDLPRSASWKTLQGSELSVRCGTGSDVgeLFLNG 1864
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVVPGRL-SSADLLNGGTLPTLAGSKLRITRSGGSGT--VTVNG 72
                            90       100
                    ....*....|....*....|....*..
gi 20149764    1865 QmcRIIQRRLLFDGGVAYGIDCLLMDP 1891
Cdd:smart00554   73 A--RIVEADIAATNGVVHVIDRVLLPP 97
LINK smart00445
Link (Hyaluronan-binding);
2206-2298 7.68e-18

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 80.46  E-value: 7.68e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    2206 GVFHLRsPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:smart00445    3 GVFHVE-KNGRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGgNLPGVRQYGF 81
                            90
                    ....*....|....
gi 20149764    2285 RTNKSeMWDVFCYR 2298
Cdd:smart00445   82 PDPTS-RYDAYCFN 94
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1729-1889 1.35e-17

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 82.26  E-value: 1.35e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1729 TLLSPQNLLITPKGASGRVLLNLTTVAANHG-YTKFSKLIQDSGLLKVITDPmhTPVTLFWPTDKALQALPQEQQDFLFN 1807
Cdd:COG2335   11 ALLAACASSAAAEGAAMAPTKNIVETAANNPdFSTLVAALKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGTLDALLK 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1808 EDNKDKLKAYLKFHVIRDTMAlASDLPRSASWKTLQGSELSVrcgTGSDvGELFLNGQmcRIIQRRLLFDGGVAYGIDCL 1887
Cdd:COG2335   89 PENKATLTKILTYHVVPGKVT-AADLKDGKTLTTLQGQTLTV---TVSG-GGVTVNGA--NVITADIEASNGVIHVIDKV 161

                 ..
gi 20149764 1888 LM 1889
Cdd:COG2335  162 LL 163
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1146-1273 3.45e-17

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 81.11  E-value: 3.45e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 1146 PSLLTRLEQMPDYSIFRGYIIHYNLASAIEAADAYTVFVPNNEAIESYIREKKATSLKE-------DILQYHVVLGeKLL 1218
Cdd:COG2335   31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVVPG-KVT 109
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 20149764 1219 RNDLHNGMHRETMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:COG2335  110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
Link_Domain cd01102
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2206-2297 1.51e-15

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain comprised of a link module flanked with N-and C- extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) which contains link modules 3 and 4 which lack HA-binding activity. HAPLNs contain two contiguous link modules.


Pssm-ID: 238534  Cd Length: 92  Bit Score: 73.99  E-value: 1.51e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2206 GVFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:cd01102    1 VVFHLESQNGRYKLTFAEAALACKARGAHLATPGQLEAAWQDGFDVCTAGWLADGSVRYPIVTSRPNCGgRNPGVRSYGN 80
                         90
                 ....*....|...
gi 20149764 2285 RtNKSEMWDVFCY 2297
Cdd:cd01102   81 P-APSGRYDAYCF 92
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
561-662 2.91e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 70.47  E-value: 2.91e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764     561 TIFVPSNEALSNMTAGvLDYLLSPegsrKLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISS-KGQILANNVA 639
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 20149764     640 VDETEVAAKNGRIYTLTGVLIPP 662
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1038-1137 7.48e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 69.31  E-value: 7.48e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1038 TVLVPSLQAIKDMDQNEKSfwLSRNNIPALIKYHTLLGTYRVADLQtlpsSHMLATSLQGSFLRL--DKADGNITIEGAS 1115
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNS--LLADKLKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSKLRItrSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 20149764    1116 FVDGDNAATNGVVHIINKVLIP 1137
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
390-510 2.89e-13

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 69.94  E-value: 2.89e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764  390 GQLTSFISILDRT-YAWPLSNLGPFTVLLPSDKGLKGVD---VKELLM--DKEAARYFVKLHIIAGQMSTEQMYNLDTFY 463
Cdd:COG2335   41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAFAALPagtLDALLKpeNKATLTKILTYHVVPGKVTAADLKDGKTLT 120
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 20149764  464 TLTGKSgeiinkdkdnqLKLKLYGSKIV----QIIQGNIVASNGLVHILDR 510
Cdd:COG2335  121 TLQGQT-----------LTVTVSGGGVTvngaNVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1181-1273 2.12e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 62.38  E-value: 2.12e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    1181 TVFVPNNEAIESYIREKKA--TSLKEDILQYHVVLGeKLLRNDLHNGMHRETMLGFSylLAFFLHND--QLYVNEAPINY 1256
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSllADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSK--LRITRSGGsgTVTVNGARIVE 77
                            90
                    ....*....|....*..
gi 20149764    1257 TNVATDKGVIHGLEKVL 1273
Cdd:smart00554   78 ADIAATNGVVHVIDRVL 94
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
414-510 2.68e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.99  E-value: 2.68e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764     414 TVLLPSDKGLKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYTLTGKSGEIINKDKDNQLKLklygsKIVQI 493
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV-----NGARI 75
                            90
                    ....*....|....*..
gi 20149764     494 IQGNIVASNGLVHILDR 510
Cdd:smart00554   76 VEADIAATNGVVHVIDR 92
Link_domain_CSPGs_modules_1_3 cd03517
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third ...
2207-2297 4.12e-11

Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.


Pssm-ID: 239594  Cd Length: 95  Bit Score: 61.65  E-value: 4.12e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2207 VFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKC----ANIVGIVDY 2282
Cdd:cd03517    2 VFHYRDATARYALTFPRAQRACLDISAQIATPEQLLAAYEDGFEQCDAGWLADQTVRYPIQTPREGCygdmDGFPGVRNY 81
                         90
                 ....*....|....*
gi 20149764 2283 GTRtNKSEMWDVFCY 2297
Cdd:cd03517   82 GVR-DPDELYDVYCY 95
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2363-2452 5.20e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.22  E-value: 5.20e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764    2363 TLFVPQNSGL----PKNKSLSG----RDIEHHLTNVNVSfYDDLVNGTVLKTRLGSQLLITSSQDQlhqEARFVDGRAIL 2434
Cdd:smart00554    1 TVFAPTDEAFqklpPDLNSLLAdklkNLLLYHVVPGRLS-SADLLNGGTLPTLAGSKLRITRSGGS---GTVTVNGARIV 76
                            90
                    ....*....|....*...
gi 20149764    2435 QWDIIASNGVLHIISEPL 2452
Cdd:smart00554   77 EADIAATNGVVHVIDRVL 94
Link_domain_HAPLN_module_1 cd03518
Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins ...
2207-2297 1.91e-09

Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239595  Cd Length: 95  Bit Score: 56.67  E-value: 1.91e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2207 VFHLRSPLGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKC---ANIVGIVDYG 2283
Cdd:cd03518    2 VFPYQPRLGRYNLNFHEAQQACEEQDATLASFEQLYQAWTEGLDWCNAGWLSDGTVQYPITKPREPCggkRTVPGLRSYG 81
                         90
                 ....*....|....
gi 20149764 2284 TRTNKSEMWDVFCY 2297
Cdd:cd03518   82 ERDKMLSRYDAFCF 95
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2363-2452 2.06e-08

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 54.57  E-value: 2.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764   2363 TLFVPQN-----------SGLPKNKSLSGRDIEHHLTNVNVSfYDDLVNGTVLKTRLGSQLLITSSQDQLhqearFVDGR 2431
Cdd:pfam02469   27 TVFAPTNeafaklpagtlNFLLKDKEQLKNLLKYHVVPGRLT-SSDLKNGGTLATLQGSKLRVNVTGGSV-----TVNGA 100
                           90       100
                   ....*....|....*....|.
gi 20149764   2432 AILQWDIIASNGVLHIISEPL 2452
Cdd:pfam02469  101 RVVQADIEATNGVIHVIDKVL 121
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 6.51e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 50.68  E-value: 6.51e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
Link_domain_CD44_like cd03516
This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates ...
2206-2301 2.18e-07

This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marker for the detection and characterization of lymphatic vessels in tumors.


Pssm-ID: 239593  Cd Length: 144  Bit Score: 52.46  E-value: 2.18e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2206 GVFHLRSPlGQYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVGIVDYGT 2284
Cdd:cd03516    7 GVFLVEKN-GRYSLNFTEAKEACRALGLTLASKAQVETALKFGFETCRYGWVEDGFVVIPRIDPNPLCGkNGTGVYILNS 85
                         90
                 ....*....|....*..
gi 20149764 2285 RTNKSemWDVFCYRMKD 2301
Cdd:cd03516   86 NLSSR--YDAYCYNSSD 100
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1566-1602 2.75e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1566 CLTNNGGCSPFAFCNHTEqDQRTCTCKPDYTGDGIVC 1602
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
2302-2448 3.54e-06

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 49.52  E-value: 3.54e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2302 VNCTCKAGYVGDGFSCNGNLLQVLMSFPSLTNFLTEVlvfsrsSAQGraflkhLTD-LSISG--TLFVPQNSG---LPK- 2374
Cdd:COG2335   14 AACASSAAAEGAAMAPTKNIVETAANNPDFSTLVAAL------KAAG------LVDtLSGEGpfTVFAPTDAAfaaLPAg 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2375 --NKSLSGRDIE-------HHLTNVNVSFyDDLVNGTVLKTRLGSQLLITSSQDQLHqearfVDGRAILQWDIIASNGVL 2445
Cdd:COG2335   82 tlDALLKPENKAtltkiltYHVVPGKVTA-ADLKDGKTLTTLQGQTLTVTVSGGGVT-----VNGANVITADIEASNGVI 155

                 ...
gi 20149764 2446 HII 2448
Cdd:COG2335  156 HVI 158
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2135-2172 4.49e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 4.49e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 20149764   2135 CANGvNGGCHEHATCRMTgPGKQKCECKSHYVGDGRDC 2172
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 9.94e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.51  E-value: 9.94e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764   1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2094-2129 1.62e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.62e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764   2094 CKQNNGGCAKVAKCSQKGTQVSCSCQKGYKGDGHSC 2129
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 5.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 5.17e-05
                           10        20
                   ....*....|....*....|....*....
gi 20149764    844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
Link_domain_CSPGs_modules_2_4 cd03520
Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules ...
2207-2297 5.87e-05

Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.


Pssm-ID: 239597  Cd Length: 96  Bit Score: 44.23  E-value: 5.87e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20149764 2207 VFHLRSPlgqYKLTFDKAKEACAKEAASIATYNQLSYAQKAKYHLCSAGWLESGRVAYPTIYASKKCA-NIVG---IVDY 2282
Cdd:cd03520    2 VFYATAP---EKFTFQEARAECRSLGAVLATTGQLYAAWRQGLDQCDPGWLADGSVRYPISTPRPQCGgGLPGvrtLYRF 78
                         90
                 ....*....|....*...
gi 20149764 2283 GTRT---NKSEMWDVFCY 2297
Cdd:cd03520   79 PNQTgfpDPHSRFDAYCF 96
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 3.31e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 3.31e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 4.36e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.89  E-value: 4.36e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 20149764    927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 1.35e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.35e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 20149764    965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 2.71e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.71e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 20149764    254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 4.23e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.81  E-value: 4.23e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 20149764    881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1978-2022 4.33e-03

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 37.33  E-value: 4.33e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 20149764   1978 CNNRGMCYDQ-YKPTGQCQCHTGFNGTACELCLPGRFG-PDCQPCGC 2022
Cdd:pfam00053    3 CNPHGSLSDTcDPETGQCLCKPGVTGRHCDRCKPGYYGlPSDPPQGC 49
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH