NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907081931|ref|XP_036012621|]
View 

myosin phosphatase Rho-interacting protein isoform X5 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
429-530 5.44e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


:

Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.44e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 506
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1907081931  507 SGIRRNWIQTIMKHVLPASAPDVT 530
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
718-1032 8.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.20  E-value: 8.17e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 791
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 867
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 947
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1907081931 1028 RDLIK 1032
Cdd:COG1196    497 LEAEA 501
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1975-2240 4.11e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 4.11e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2053
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2054 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2134 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2213
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1680-2282 2.46e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 2.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1753
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1754 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1828
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRCmVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1908
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEG-VKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 1981
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2048
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2049 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2125
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2126 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPL 2200
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2201 TQGKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEAL 2280
Cdd:TIGR02168  828 SLERRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

                   ..
gi 1907081931 2281 GE 2282
Cdd:TIGR02168  904 RE 905
 
Name Accession Description Interval E-value
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
429-530 5.44e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.44e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 506
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1907081931  507 SGIRRNWIQTIMKHVLPASAPDVT 530
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
429-521 3.73e-16

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 76.05  E-value: 3.73e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931   429 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 502
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82
                            90
                    ....*....|....*....
gi 1907081931   503 SAMTSGIRRNWIQTIMKHV 521
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101
PH pfam00169
PH domain; PH stands for pleckstrin homology.
429-517 2.43e-15

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 73.75  E-value: 2.43e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 499
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82
                           90
                   ....*....|....*...
gi 1907081931  500 FTLSAMTSGIRRNWIQTI 517
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
718-1032 8.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.20  E-value: 8.17e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 791
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 867
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 947
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1907081931 1028 RDLIK 1032
Cdd:COG1196    497 LEAEA 501
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
706-987 6.42e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.16  E-value: 6.42e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  706 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 785
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  786 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 865
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  942 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
770-1135 2.23e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 66.24  E-value: 2.23e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  770 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 849
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  850 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 912
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  913 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 985
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  986 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1045
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1046 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1109
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544
                          410       420
                   ....*....|....*....|....*.
gi 1907081931 1110 NEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1975-2240 4.11e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 4.11e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2053
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2054 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2134 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2213
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
666-1161 5.50e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 58.65  E-value: 5.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  666 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 734
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  735 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 798
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  799 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 878
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  879 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 958
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  959 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1028 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1106
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 1107 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1161
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1986-2178 7.30e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.14  E-value: 7.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081931 2141 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2178
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1680-2282 2.46e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 2.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1753
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1754 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1828
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRCmVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1908
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEG-VKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 1981
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2048
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2049 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2125
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2126 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPL 2200
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2201 TQGKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEAL 2280
Cdd:TIGR02168  828 SLERRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

                   ..
gi 1907081931 2281 GE 2282
Cdd:TIGR02168  904 RE 905
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1819-2288 2.18e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 50.12  E-value: 2.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1819 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1890
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1891 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1970
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1971 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2027
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2028 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2096
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2097 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2170
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2171 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2234
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2235 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2288
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721
PTZ00121 PTZ00121
MAEBL; Provisional
1776-2180 9.29e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.21  E-value: 9.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1852
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1853 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1932
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2012
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2092
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2172
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773

                   ....*...
gi 1907081931 2173 NNRLAAEI 2180
Cdd:PTZ00121  1774 RKEKEAVI 1781
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1680-2157 5.25e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.14  E-value: 5.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1759
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1760 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1839
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1840 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1919
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1920 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1999
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2000 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2077
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2078 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2152
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506

                   ....*
gi 1907081931 2153 QALRQ 2157
Cdd:COG4717    507 EEYRE 511
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1639-2274 9.56e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 44.67  E-value: 9.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1639 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1718
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1719 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1797
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1798 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1877
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1878 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 1957
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1958 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2035
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2036 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2113
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2114 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2190
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2191 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2269
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726

                   ....*
gi 1907081931 2270 KEQLK 2274
Cdd:PRK03918   727 REKVK 731
TOPEUc smart00435
DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...
1971-2067 7.56e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras


Pssm-ID: 214661 [Multi-domain]  Cd Length: 391  Bit Score: 41.18  E-value: 7.56e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  1971 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2047
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346
                            90       100
                    ....*....|....*....|
gi 1907081931  2048 DLQRQHQReLEKLREEKDRL 2067
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365
 
Name Accession Description Interval E-value
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
429-530 5.44e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.44e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 506
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1907081931  507 SGIRRNWIQTIMKHVLPASAPDVT 530
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
429-521 3.73e-16

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 76.05  E-value: 3.73e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931   429 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 502
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82
                            90
                    ....*....|....*....
gi 1907081931   503 SAMTSGIRRNWIQTIMKHV 521
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101
PH pfam00169
PH domain; PH stands for pleckstrin homology.
429-517 2.43e-15

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 73.75  E-value: 2.43e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 499
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82
                           90
                   ....*....|....*...
gi 1907081931  500 FTLSAMTSGIRRNWIQTI 517
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
718-1032 8.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.20  E-value: 8.17e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 791
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 867
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 947
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1907081931 1028 RDLIK 1032
Cdd:COG1196    497 LEAEA 501
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
772-1131 1.43e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 73.43  E-value: 1.43e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNqDLQSELEAQcrRQELITQQIQTLKhsYGEAKDAIRHHEAEIQTLQTRlgNAAAELAIKEQALAKLKGELKME 851
Cdd:COG1196    186 ENLERLE-DILGELERQ--LEPLERQAEKAER--YRELKEELKELEAELLLLKLR--ELEAELEELEAELEELEAELEEL 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  852 QGKVREQLEEWQHSKAmlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV----QRLQECIAELSQQLGT 927
Cdd:COG1196    259 EAELAELEAELEELRL----ELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELeerlEELEEELAELEEELEE 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  928 SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSEtckgseqvhklEEEL 1007
Cdd:COG1196    335 LEEELEELEEELEEA--------EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEA-----------LRAA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1008 EAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREK 1087
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907081931 1088 EEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1131
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519
PH cd00821
Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...
429-517 2.63e-12

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 275388 [Multi-domain]  Cd Length: 92  Bit Score: 64.87  E-value: 2.63e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ--YEDGQWKKHWFVLADQSLRYYRDSvAEEAADLDGEINLSTCYDVTEY-PVQRNYGFQIHTKEGE-FTLSA 504
Cdd:cd00821      1 KEGYLLKRggGGLKSWKKRWFVLFEGVLLYYKSK-KDSSYKPKGSIPLSGILEVEEVsPKERPHCFELVTPDGRtYYLQA 79
                           90
                   ....*....|...
gi 1907081931  505 MTSGIRRNWIQTI 517
Cdd:cd00821     80 DSEEERQEWLKAL 92
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
706-987 6.42e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.16  E-value: 6.42e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  706 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 785
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  786 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 865
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  942 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
770-1135 2.23e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 66.24  E-value: 2.23e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  770 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 849
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  850 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 912
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  913 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 985
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  986 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1045
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1046 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1109
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544
                          410       420
                   ....*....|....*....|....*.
gi 1907081931 1110 NEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
772-1167 3.88e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 65.46  E-value: 3.88e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQ------SELEAQCRRqeLITQQIQTLKhsYGEAKDAIRHHEAEIQTLQtrlgnaaaelaiKEQALAKLK 845
Cdd:TIGR02168  179 RKLERTRENLDrledilNELERQLKS--LERQAEKAER--YKELKAELRELELALLVLR------------LEELREELE 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  846 gELKMEQGKVREQLEEwqhskamLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQL 925
Cdd:TIGR02168  243 -ELQEELKEAEEELEE-------LTAELQELEEKLEELRLEVSELEEEIEEL----------QKELYALANEISRLEQQK 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  926 GTSEQAQRLMEKKLKRnYTLLLESCEQEKQALLQNLKEVEDKasayEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEE 1005
Cdd:TIGR02168  305 QILRERLANLERQLEE-LEAQLEELESKLDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEE 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1006 EleareasIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEklrgkevdyQNLEHSHHRVSVQLQSVRTLLR 1085
Cdd:TIGR02168  380 Q-------LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ---------EIEELLKKLEEAELKELQAELE 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1086 EKEEELKHIKETHERV---LEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFvSDSPKDAKEPLsttEPTEEGS 1162
Cdd:TIGR02168  444 ELEEELEELQEELERLeeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENL-EGFSEGVKALL---KNQSGLS 519

                   ....*
gi 1907081931 1163 GILPL 1167
Cdd:TIGR02168  520 GILGV 524
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
720-929 1.42e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 63.78  E-value: 1.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  720 KELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPsgAWQRLHRVN------QDLQSELEAQCRRQE 793
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALR--LWFAQRRLElleaelEELRAELARLEAELE 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  794 LITQQIQTLKHSYGEAKDAIRHH--------EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHS 865
Cdd:COG4913    313 RLEARLDALREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAAL 392
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV-QRLQECIAELSQQLGTSE 929
Cdd:COG4913    393 LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIpARLLALRDALAEALGLDE 457
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
837-1135 1.43e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.92  E-value: 1.43e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  837 KEQALAKLKGELKMEQGKVREQLEEwqhsKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQqaLQRDRQkEVQRLQE 916
Cdd:TIGR02168  675 RRREIEELEEKIEELEEKIAELEKA----LAELRKELEELEEELEQLRKELEELSRQISALRKD--LARLEA-EVEQLEE 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  917 CIAELSQQLgtSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEkLSETckg 996
Cdd:TIGR02168  748 RIAQLSKEL--TELEAEIEELEER------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE-LTLL--- 815
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  997 SEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQ 1076
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 1077 LQSVRTLLREKE----------EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR02168  896 LEELSEELRELEskrselrrelEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIE 964
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
817-1125 1.45e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.92  E-value: 1.45e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  817 EAEIQTLQTRLGNAAAELAIKEQALAKLKGE---LKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE 893
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  894 LRDLETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYE 972
Cdd:TIGR02168  756 LTELEAEiEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA-----LDELRAELTLLNEEAANLRERLESLE 830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  973 DQLQGHVQQVEALQKEKLSEtckgSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1052
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEEL----SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL----KHIKETHERVLEKKDQDLNEALVKMIALGSSLEE 1125
Cdd:TIGR02168  907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeySLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
854-1138 3.56e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 3.56e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLsgQLRASEQKLRSTEARLLEKTQELRDLETQQALqrdRQKEVQRLQECIAELSQQLGTSEQAQR 933
Cdd:COG1196    217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAE---LEAELEELRLELEELELELEEAQAEEY 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  934 LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREAS 1013
Cdd:COG1196    292 ELLAELAR--------LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELE----EELEEAEEELEEAEAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1014 IRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKH 1093
Cdd:COG1196    360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907081931 1094 IKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLR 1138
Cdd:COG1196    440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1975-2240 4.11e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 4.11e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2053
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2054 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2134 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2213
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
772-1133 7.72e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 61.24  E-value: 7.72e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKme 851
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK-- 761
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  852 qgKVREQLEEWQHSKAMLSGQLRASEQKLRstEARLLEKTQELRDLEtqqalqrdrqKEVQRLQECIAELSQQLGTSEQA 931
Cdd:TIGR02169  762 --ELEARIEELEEDLHKLEEALNDLEARLS--HSRIPEIQAELSKLE----------EEVSRIEARLREIEQKLNRLTLE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  932 QRLMEKKLkrnytlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEARE 1011
Cdd:TIGR02169  828 KEYLEKEI------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1012 ASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE-E 1090
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPvN 974
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1091 LKHIKEtHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEK 1133
Cdd:TIGR02169  975 MLAIQE-YEEVLKRLD-ELKEKRAKLEEERKAILERIEEYEKK 1015
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
856-1160 1.10e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 60.85  E-value: 1.10e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  856 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQalqRDRQKEVQRLQEciaelsqqlgtSEQAQRLM 935
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKI---GEIEKEIEQLEQ-----------EEEKLKER 738
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  936 EKKLKRNytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL-QKEKLSETCKGSEQVHKLEEELEAREASI 1014
Cdd:TIGR02169  739 LEELEED----LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeARLSHSRIPEIQAELSKLEEEVSRIEARL 814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1015 RQLAQHVQSLHDERDLIKHQFQELMERVatsdgDVAELQEKLRGKEVDyqNLEHSHHRVSVQLQSVRTLLREKEEELKHI 1094
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQR-----IDLKEQIKSIEKEIE--NLNGKKEELEEELEELEAALRDLESRLGDL 887
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1095 K------ETHERVLEKKDQDLNEALVKmiaLGSSLEETEIKLQEKEECLRRFvsDSPKDAKEPLSTTEPTEE 1160
Cdd:TIGR02169  888 KkerdelEAQLRELERKIEELEAQIEK---KRKRLSELKAKLEALEEELSEI--EDPKGEDEEIPEEELSLE 954
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
772-988 1.83e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 59.01  E-value: 1.83e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDL---QSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGEl 848
Cdd:COG4942     27 AELEQLQQEIaelEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE- 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  849 kmeqgkvreqleewqhskamLSGQLRASEQKLRSTEARLL----EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQ 924
Cdd:COG4942    106 --------------------LAELLRALYRLGRQPPLALLlspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAAL 165
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  925 LGTSEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE 988
Cdd:COG4942    166 RAELEAERAELEALLAEL--------EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
PH2_MyoX cd13296
Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...
429-529 3.64e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270108  Cd Length: 103  Bit Score: 53.24  E-value: 3.64e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDG------QWKKHWFVLADQSLRYYRDsvAEEAADLDGEINLSTCYDVTEYPVQRNyGFQIHTKEGEFTL 502
Cdd:cd13296      1 KSGWLTKKGGGSstlsrrNWKSRWFVLRDTVLKYYEN--DQEGEKLLGTIDIRSAKEIVDNDPKEN-RLSITTEERTYHL 77
                           90       100
                   ....*....|....*....|....*..
gi 1907081931  503 SAMTSGIRRNWIQtIMKHVLPASAPDV 529
Cdd:cd13296     78 VAESPEDASQWVN-VLTRVISATDLEL 103
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
818-989 3.85e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 58.63  E-value: 3.85e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  818 AEIQTLQTRLGNAAAELAIKEQALAKLKgELKMEQGKVREQLEEWQHSKAMLSGQLRASE--QKLRSTEARLLEKTQELR 895
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELE-ELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLE 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQL 975
Cdd:COG4717    150 ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229
                          170
                   ....*....|....
gi 1907081931  976 QGHVQQVEALQKEK 989
Cdd:COG4717    230 EQLENELEAAALEE 243
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
666-1161 5.50e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 58.65  E-value: 5.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  666 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 734
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  735 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 798
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  799 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 878
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  879 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 958
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  959 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1028 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1106
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 1107 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1161
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
874-1135 6.81e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 58.02  E-value: 6.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  874 RASEQKLRSTEARLL-------EKTQELRDLETQ-------QALQ---RDRQKEVQRLQecIAELSQQLGTSEQAQRLME 936
Cdd:COG1196    175 EEAERKLEATEENLErledilgELERQLEPLERQaekaeryRELKeelKELEAELLLLK--LRELEAELEELEAELEELE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  937 KKLKRnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREASIRQ 1016
Cdd:COG1196    253 AELEE-LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEELAE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1017 LAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEvdyQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE---AELAEAEEELEELAEELLEALRAAAELAAQLEE 404
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907081931 1097 ThERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:COG1196    405 L-EEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1986-2178 7.30e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.14  E-value: 7.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081931 2141 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2178
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
797-987 1.01e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 57.62  E-value: 1.01e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  797 QQIQTLKHSYGEAKDAIRHHEA--EIQTLQTRLGNAAAELAIKEQALAKLK--------GELKMEQGKVREQLEEWQHSK 866
Cdd:COG4913    232 EHFDDLERAHEALEDAREQIELlePIRELAERYAAARERLAELEYLRAALRlwfaqrrlELLEAELEELRAELARLEAEL 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  867 AMLSGQLRASEQKLRSTEARLLE-KTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL 945
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGnGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAA 391
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907081931  946 LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:COG4913    392 LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1974-2240 1.38e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 57.23  E-value: 1.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1974 PAAKDEAESMSGLRERIQELEAQMGVMREELGHkeLEgDVAALQEKYQRDFESLKA--TCERGFAAmeETHQKKIEDLQR 2051
Cdd:COG4913    221 PDTFEAADALVEHFDDLERAHEALEDAREQIEL--LE-PIRELAERYAAARERLAEleYLRAALRL--WFAQRRLELLEA 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2052 qhqrELEKLREEKDRLLAEETAATISAIEAmkNAHREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLS 2131
Cdd:COG4913    296 ----ELEELRAELARLEAELERLEARLDAL--REELDELEAQIRGNGGDRLEQLEREIERLER----ELEERERRRARLE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2132 EQysqkcLENAHLAQALEAE--RQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqgkdayEL 2209
Cdd:COG4913    366 AL-----LAALGLPLPASAEefAALRAEAAALLEALEEELEALEEALAEAEAALR-----------------------DL 417
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG4913    418 RRELRELEAEIASLERRKSNIPARLLALRDA 448
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1801-2162 1.48e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.00  E-value: 1.48e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1801 QAVGALKEEYEELLhkqksEYQKVITliEKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSH 1880
Cdd:TIGR02169  198 QQLERLRREREKAE-----RYQALLK--EKREYEGYELLKEKEALERQKEAIERQLASL--------EEELEKLTEEISE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1881 TENTLqAERSRVLSQLDASVKDRQAMEQHHVQ--------QMKMLEDRFQLKVRELQ------AVHQEELRALQEHyIWS 1946
Cdd:TIGR02169  263 LEKRL-EEIEQLLEELNKKIKDLGEEEQLRVKekigeleaEIASLERSIAEKERELEdaeerlAKLEAEIDKLLAE-IEE 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1947 LRGALSLYQpshpdsslapgpSEPRAVPAA-KDEAESMSGLRERIQELEAQMGVMREELghkelegdvaalqEKYQRDFE 2025
Cdd:TIGR02169  341 LEREIEEER------------KRRDKLTEEyAELKEELEDLRAELEEVDKEFAETRDEL-------------KDYREKLE 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2026 SLKatcergfaameethqKKIEDLQRQHQRELEKLREEKDRLlaEETAATISAIEAMKNAHREEME--RELEKSQRSQIS 2103
Cdd:TIGR02169  396 KLK---------------REINELKRELDRLQEELQRLSEEL--ADLNAAIAGIEAKINELEEEKEdkALEIKKQEWKLE 458
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 2104 SINSDIEALRRQYL---EELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQREN 2162
Cdd:TIGR02169  459 QLAADLSKYEQELYdlkEEYDRVEKELSKLQRELAE-----------AEAQARASEERVRGG 509
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1992-2286 1.64e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.00  E-value: 1.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1992 ELEAQMGVMREELGhkELEGDVAALQEKyqrdFESLKATCERGFAAMEETHqKKIEDLQRQHQRELEKLREEKDRLlaEE 2071
Cdd:TIGR02169  671 SEPAELQRLRERLE--GLKRELSSLQSE----LRRIENRLDELSQELSDAS-RKIGEIEKEIEQLEQEEEKLKERL--EE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2072 TAATISAIEAMKNAHREEMErELEK---SQRSQISSINSDIEALRRQYLEE-LQSVQRELEVLSEQYSQKCLENAHLAQA 2147
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELK-ELEArieELEEDLHKLEEALNDLEARLSHSrIPEIQAELSKLEEEVSRIEARLREIEQK 820
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2148 LEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEStglpltqgkDAYELEVLLRVKESEIQYLKQEI 2227
Cdd:TIGR02169  821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE---------ELEELEAALRDLESRLGDLKKER 891
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2228 SSLKDELQTALRDKKYASDKYKDIYTELSIAKAKAdcdiSRLKEQLKAATEALGEKSPE 2286
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKL----EALEEELSEIEDPKGEDEEI 946
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
673-918 2.26e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 2.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  673 VEIEQRWHQVET--TPLREEKQVPIAPLHLSLEdrSERLSTHELTSLLEKELEQSQKE-ASDLLEQNRLLQD--QLRVAL 747
Cdd:TIGR02169  268 EEIEQLLEELNKkiKDLGEEEQLRVKEKIGELE--AEIASLERSIAEKERELEDAEERlAKLEAEIDKLLAEieELEREI 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  748 GREQSAREGyvLQTEVATSPsgawQRLHRVNQDLQS-ELEAQCRRQEL--ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQ 824
Cdd:TIGR02169  346 EEERKRRDK--LTEEYAELK----EELEDLRAELEEvDKEFAETRDELkdYREKLEKLKREINELKRELDRLQEELQRLS 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  825 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG---QLRASEQKLRSTEARLLEKTQELRDLETQQ 901
Cdd:TIGR02169  420 EELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKyeqELYDLKEEYDRVEKELSKLQRELAEAEAQA 499
                          250
                   ....*....|....*..
gi 1907081931  902 ALQRDRQKEVQRLQECI 918
Cdd:TIGR02169  500 RASEERVRGGRAVEEVL 516
PH_AtPH1 cd13276
Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...
429-525 2.69e-07

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270095  Cd Length: 106  Bit Score: 50.78  E-value: 2.69e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVT--EYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13276      1 KAGWLEKQGEFiKTWRRRWFVLKQGKLFWFKEPDVTPYSKPRGVIDLSKCLTVKsaEDATNKENAFELSTPEETFYFIAD 80
                           90       100
                   ....*....|....*....|
gi 1907081931  506 TSGIRRNWIQTIMKHVLPAS 525
Cdd:cd13276     81 NEKEKEEWIGAIGRAIVKHS 100
PH-GRAM1_AGT26 cd13215
Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, ...
428-521 2.78e-07

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, repeat 1; ATG26 (also called UGT51/UDP-glycosyltransferase 51), a member of the glycosyltransferase 28 family, resulting in the biosynthesis of sterol glucoside. ATG26 in decane metabolism and autophagy. There are 32 known autophagy-related (ATG) proteins, 17 are components of the core autophagic machinery essential for all autophagy-related pathways and 15 are the additional components required only for certain pathways or species. The core autophagic machinery includes 1) the ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and ATG27), 2) the phosphatidylinositol 3-kinase complex (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7, ATG8, ATG10, ATG12, and ATG16). Less is known about how the core machinery is adapted or modulated with additional components to accommodate the nonselective sequestration of bulk cytosol (autophagosome formation) or selective sequestration of specific cargos (Cvt vesicle, pexophagosome, or bacteria-containing autophagosome formation). The pexophagosome-specific additions include the ATG30-ATG11-ATG17 receptor-adaptors complex, the coiled-coil protein ATG25, and the sterol glucosyltransferase ATG26. ATG26 is necessary for the degradation of medium peroxisomes. It contains 2 GRAM domains and a single PH domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 275402  Cd Length: 116  Bit Score: 51.08  E-value: 2.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSvaeeaADL---DGEINLSTCY--DVTEYPVQRNYGFQIHTKEGEFT 501
Cdd:cd13215     22 IKSGYLSKRsKRTLRYTRYWFVLKGDTLSWYNSS-----TDLyfpAGTIDLRYATsiELSKSNGEATTSFKIVTNSRTYK 96
                           90       100
                   ....*....|....*....|
gi 1907081931  502 LSAMTSGIRRNWIQTIMKHV 521
Cdd:cd13215     97 FKADSETSADEWVKALKKQI 116
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
690-1095 3.35e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 55.89  E-value: 3.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  690 EKQVPIAPLHLSlEDRSER----LSTHELTSLLEK---ELEQSQKEASDLLEQNRLLQDQ-LRVALGREQSAREGYVLQT 761
Cdd:pfam15921  348 EKQLVLANSELT-EARTERdqfsQESGNLDDQLQKllaDLHKREKELSLEKEQNKRLWDRdTGNSITIDHLRRELDDRNM 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  762 EVatspsgawQRLHRVNQDLQSELEAQCRRQ--------------ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRL 827
Cdd:pfam15921  427 EV--------QRLEALLKAMKSECQGQMERQmaaiqgkneslekvSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTV 498
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  828 GNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHskamlsgqLRASEQKLRSTEArllektqELRDLETQQAlQRDR 907
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQH--------LKNEGDHLRNVQT-------ECEALKLQMA-EKDK 562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  908 QKEVQRLQ-ECIAELSQQLGTSEQAQRLMEKKLKRNYtlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEaLQ 986
Cdd:pfam15921  563 VIEILRQQiENMTQLVGQHGRTAGAMQVEKAQLEKEI--------NDRRLELQEFKILKDKKDAKIRELEARVSDLE-LE 633
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  987 KEKLSETckGSEQVhkleeeleareasirqlaQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQN- 1065
Cdd:pfam15921  634 KVKLVNA--GSERL------------------RAVKDIKQERD-------QLLNEVKTSRNELNSLSEDYEVLKRNFRNk 686
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1907081931 1066 ---LEHSHHRVSVQLQSVRTLLREKEEELKHIK 1095
Cdd:pfam15921  687 seeMETTTNKLKMQLKSAQSELEQTRNTLKSME 719
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1868-2165 3.44e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.84  E-value: 3.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1868 EEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLE--DRFQLKVRELQAVHQEELRALQehyiw 1945
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEqlEQEEEKLKERLEELEEDLSSLE----- 750
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1946 slrgalslyqpshpdsslapgpsepRAVPAAKDEaesMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKyQRDFE 2025
Cdd:TIGR02169  751 -------------------------QEIENVKSE---LKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI-QAELS 801
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2026 SLKATCERGFAAMEETHQKkiedLQRQHQRE--LEKLREEK--DRLLAEETAATISAIEAMKNAHREEMERELEKSQRS- 2100
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQK----LNRLTLEKeyLEKEIQELqeQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAl 877
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 2101 -QISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQEL 2165
Cdd:TIGR02169  878 rDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
818-1158 4.47e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.46  E-value: 4.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  818 AEIQTLQTRLGNAAAELAIKEQALAKLKGElkmeqgkvREQLEEWQHskamLSGQLRASEQKLRSTEARLLEKTQE--LR 895
Cdd:TIGR02169  177 EELEEVEENIERLDLIIDEKRQQLERLRRE--------REKAERYQA----LLKEKREYEGYELLKEKEALERQKEaiER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQALQRDRQKEVQRLQECIAELSQQLgtSEQAQRLMEKKLKRNYTLllesceQEKQALLQ-NLKEVEDKASAYEDQ 974
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLL--EELNKKIKDLGEEEQLRV------KEKIGELEaEIASLERSIAEKERE 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  975 LQghvqQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQE 1054
Cdd:TIGR02169  317 LE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1055 KLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETH---ERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1131
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKInelEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
                          330       340
                   ....*....|....*....|....*..
gi 1907081931 1132 EKEECLRRfVSDSPKDAKEPLSTTEPT 1158
Cdd:TIGR02169  473 DLKEEYDR-VEKELSKLQRELAEAEAQ 498
PH_CNK_mammalian-like cd01260
Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; ...
431-475 4.96e-07

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; CNK family members function as protein scaffolds, regulating the activity and the subcellular localization of RAS activated RAF. There is a single CNK protein present in Drosophila and Caenorhabditis elegans in contrast to mammals which have 3 CNK proteins (CNK1, CNK2, and CNK3). All of the CNK members contain a sterile a motif (SAM), a conserved region in CNK (CRIC) domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with the exception of CNK3, a PH domain. A CNK2 splice variant CNK2A also has a PDZ domain-binding motif at its C terminus and Drosophila CNK (D-CNK) also has a domain known as the Raf-interacting region (RIR) that mediates binding of the Drosophila Raf kinase. This cd contains CNKs from mammals, chickens, amphibians, fish, and crustacea. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269962  Cd Length: 114  Bit Score: 50.48  E-value: 4.96e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081931  431 GWLTKQYEDG-----QWKKHWFVLADQSLRYYRDSVAEEAadlDGEINLS 475
Cdd:cd01260     17 GWLWKKKEAKsffgqKWKKYWFVLKGSSLYWYSNQQDEKA---EGFINLP 63
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1984-2282 5.00e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 55.33  E-value: 5.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1984 SGLRERIQELEAQMGVMREELG-----HKELEGDVAALQE---------KYQRDFESLKAtcergfaameETHQKKIEDL 2049
Cdd:COG1196    168 SKYKERKEEAERKLEATEENLErlediLGELERQLEPLERqaekaeryrELKEELKELEA----------ELLLLKLREL 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEMERELEKSQR-----SQISSINSDIEAL---RRQYLEE 2119
Cdd:COG1196    238 EAELEELEAELEELEAELeeLEAELAELEAELEELRLELEELELELEEAQAEeyellAELARLEQDIARLeerRRELEER 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2120 LQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDgggestglp 2199
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL--------- 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2200 LTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2279
Cdd:COG1196    389 LEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468

                   ...
gi 1907081931 2280 LGE 2282
Cdd:COG1196    469 LEE 471
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1909-2235 5.08e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 55.45  E-value: 5.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 HHVQQMKMLEDRFQLKVRELQAVhQEELRALQEHYIWSLRGALSLYQPSHpdsslapgpsepravpaakDEAESMSGLRE 1988
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAEL-RKELEELEEELEQLRKELEELSRQIS-------------------ALRKDLARLEA 740
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATcergfaameETHQKKIEDLQRQHQRELEKLREEKDRL- 2067
Cdd:TIGR02168  741 EVEQLEERIAQLSKEL--TELEAEIEELEERLEEAEEELAEA---------EAEIEELEAQIEQLKEELKALREALDELr 809
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2068 -----LAEETAATISAIEAMKNAHREEmERELEKSQRsQISSINSDIEALRRQyLEELQS----VQRELEVLSEQYSQKC 2138
Cdd:TIGR02168  810 aeltlLNEEAANLRERLESLERRIAAT-ERRLEDLEE-QIEELSEDIESLAAE-IEELEElieeLESELEALLNERASLE 886
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2139 LENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLltgdgggESTGLPLTQ-----GKDAYE-LEVL 2212
Cdd:TIGR02168  887 EALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQErlseeYSLTLEeAEAL 959
                          330       340
                   ....*....|....*....|...
gi 1907081931 2213 LRVKESEIQYLKQEISSLKDELQ 2235
Cdd:TIGR02168  960 ENKIEDDEEEARRRLKRLENKIK 982
PH1_ARAP cd13253
ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, ...
429-522 5.51e-07

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, repeat 1; ARAP proteins (also called centaurin delta) are phosphatidylinositol 3,4,5-trisphosphate-dependent GTPase-activating proteins that modulate actin cytoskeleton remodeling by regulating ARF and RHO family members. They bind phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3) and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4,5)P2) binding. There are 3 mammalian ARAP proteins: ARAP1, ARAP2, and ARAP3. All ARAP proteins contain a N-terminal SAM (sterile alpha motif) domain, 5 PH domains, an ArfGAP domain, 2 ankyrin domain, A RhoGap domain, and a Ras-associating domain. This hierarchy contains the first PH domain in ARAP. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270073  Cd Length: 94  Bit Score: 49.69  E-value: 5.51e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDGQ---WKKHWFVLADQSLRYYRdsvAEEAADLDGEINLSTcydVTEYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13253      2 KSGYLDKQGGQGNnkgFQKRWVVFDGLSLRYFD---SEKDAYSKRIIPLSA---ISTVRAVGDNKFELVTTNRTFVFRAE 75
                           90
                   ....*....|....*..
gi 1907081931  506 TSGIRRNWIQTIMKHVL 522
Cdd:cd13253     76 SDDERNLWCSTLQAAIS 92
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
718-1135 7.86e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.64  E-value: 7.86e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVaLGREQSAREGYVLQTEvatspsgaWQRLhRVNQDLqSELEAQCRRQELITQ 797
Cdd:TIGR04523  150 KEKELEKLNNKYNDLKKQKEELENELNL-LEKEKLNIQKNIDKIK--------NKLL-KLELLL-SNLKKKIQKNKSLES 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHskamlsgQLRASE 877
Cdd:TIGR04523  219 QISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLN-----------QLKDEQNKIKKQLSEKQK-------ELEQNN 280
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRSTEARLLEKTQELRDL--ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQA-QRLMEK--KLKRNytllLESCEQ 952
Cdd:TIGR04523  281 KKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIiSQLNEQisQLKKE----LTNSES 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  953 EKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL--QKEKLSETCKGSEQVHKleeeleareasirQLAQHVQSLHDERDL 1030
Cdd:TIGR04523  357 ENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLesQINDLESKIQNQEKLNQ-------------QKDEQIKKLQQEKEL 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1031 IKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNL--------------EHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntresletqlkvlSRSINKIKQNLEQKQKELKSKEKELKKLNE 503
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1097 tHERVLEKKDQDLN----EALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR04523  504 -EKKELEEKVKDLTkkisSLKEKIEKLESEKKEKESKISDLED 545
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1829-2214 9.42e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 54.56  E-value: 9.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQ 1908
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLE-LEELELELEEAQAEEY 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 HHVQQMKMLEDRFQLKVRELQAVHQEELRALQEhyiwslrgalslyqpshpdsslapgpsepravpaakdEAEsmsgLRE 1988
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEERLEELEEE-------------------------------------LAE----LEE 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQEKYQRdfeslkatcergfaamEETHQKKIEDLQRQHQRELEKLREEKDRLL 2068
Cdd:COG1196    331 ELEELEEELEELEEEL--EEAEEELEEAEAELAE----------------AEEALLEAEAELAEAEEELEELAEELLEAL 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2069 AEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQAL 2148
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931 2149 EAERQALRQCQRENQELNA-----HNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLR 2214
Cdd:COG1196    473 ALLEAALAELLEELAEAAArllllLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA 543
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
797-1127 1.01e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.26  E-value: 1.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  797 QQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAI---KEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQL 873
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  874 RASEQKLRSTEA-----RLLEKtqELRDLETQQA-LQRDRQKEVQRLQECIAELSQqlgTSEQAQRLMEKKLKRNYTLll 947
Cdd:TIGR04523  197 LKLELLLSNLKKkiqknKSLES--QISELKKQNNqLKDNIEKKQQEINEKTTEISN---TQTQLNQLKDEQNKIKKQL-- 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 esceQEKQallQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKG--------SEQVHKLEEELEAREASIRQLAQ 1019
Cdd:TIGR04523  270 ----SEKQ---KELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNE 342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1020 HVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKhIKETHE 1099
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIK-KLQQEK 421
                          330       340
                   ....*....|....*....|....*...
gi 1907081931 1100 RVLEKKDQDLNEALVKMIALGSSLEETE 1127
Cdd:TIGR04523  422 ELLEKEIERLKETIIKNNSEIKDLTNQD 449
PH_PEPP1_2_3 cd13248
Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; ...
429-517 1.15e-06

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; PEPP1 (also called PLEKHA4/PH domain-containing family A member 4 and RHOXF1/Rhox homeobox family member 1), and related homologs PEPP2 (also called PLEKHA5/PH domain-containing family A member 5) and PEPP3 (also called PLEKHA6/PH domain-containing family A member 6), have PH domains that interact specifically with PtdIns(3,4)P3. Other proteins that bind PtdIns(3,4)P3 specifically are: TAPP1 (tandem PH-domain-containing protein-1) and TAPP2], PtdIns3P AtPH1, and Ptd- Ins(3,5)P2 (centaurin-beta2). All of these proteins contain at least 5 of the 6 conserved amino acids that make up the putative phosphatidylinositol 3,4,5- trisphosphate-binding motif (PPBM) located at their N-terminus. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270068  Cd Length: 104  Bit Score: 49.19  E-value: 1.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTcYDVT----EYPVQRNYGFQIhTKEGEFT- 501
Cdd:cd13248      9 MSGWLHKQGGSGlkNWRKRWFVLKDNCLYYYKD---PEEEKALGSILLPS-YTISpappSDEISRKFAFKA-EHANMRTy 83
                           90
                   ....*....|....*..
gi 1907081931  502 -LSAMTSGIRRNWIQTI 517
Cdd:cd13248     84 yFAADTAEEMEQWMNAM 100
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
718-1090 1.16e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 54.00  E-value: 1.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSARegyvLQTEVATSPSG---------AWQRLHRVNQDLQSEL-EA 787
Cdd:COG4717    100 LEEELEELEAELEELREELEKLEKLLQLLPLYQELEA----LEAELAELPERleeleerleELRELEEELEELEAELaEL 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  788 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgELKMEQGKVREQLEEWQHSKA 867
Cdd:COG4717    176 QEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL--ENELEAAALEERLKEARLLLL 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQL------------------------------RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQEC 917
Cdd:COG4717    254 IAAALLallglggsllsliltiagvlflvlgllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  918 I--AELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQ-----NLKEVEDKASAYED--QLQGHVQQVEALQKE 988
Cdd:COG4717    334 LspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAeagveDEEELRAALEQAEEyqELKEELEELEEQLEE 413
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  989 KLSETCKGSEQVHKLE--EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMErvatsDGDVAELQEKLRGKEVDYQNL 1066
Cdd:COG4717    414 LLGELEELLEALDEEEleEELEELEEELEELEEELEELREELAELEAELEQLEE-----DGELAELLQELEELKAELREL 488
                          410       420
                   ....*....|....*....|....
gi 1907081931 1067 EHSHHRVSVQLQSVRTLLREKEEE 1090
Cdd:COG4717    489 AEEWAALKLALELLEEAREEYREE 512
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
833-1134 1.24e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 54.21  E-value: 1.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  833 ELAIKEQALAKL------KGELKMEQGKVREQL--EEWQHSKAMLSGQLRASEQKLRSTEARLLEKT---QELRDLETQQ 901
Cdd:pfam02463  167 LKRKKKEALKKLieetenLAELIIDLEELKLQElkLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKlneERIDLLQELL 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  902 ALQRDRQKEVQRLQECIAELSQQ----LGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAY 971
Cdd:pfam02463  247 RDEQEEIESSKQEIEKEEEKLAQvlkeNKEEEKEKKLQEEELKLLAKEEeelkseLLKLERRKVDDEEKLKESEKEKKKA 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  972 EDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQ---FQELMERVATSDGD 1048
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAaklKEEELELKSEEEKE 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1049 VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEI 1128
Cdd:pfam02463  407 AQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQL 486

                   ....*.
gi 1907081931 1129 KLQEKE 1134
Cdd:pfam02463  487 ELLLSR 492
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
674-1139 1.84e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 53.40  E-value: 1.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  674 EIEQRWHQVETTPLREEKQvpIAPLHLSLEDRSERL-STHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQS 752
Cdd:COG1196    285 EAQAEEYELLAELARLEQD--IARLEERRRELEERLeELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  753 AREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAA 832
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  833 ELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgQLRASEQKLRSTEARLLE--KTQELRDLETQQALQRDRQKE 910
Cdd:COG1196    443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA-ELLEELAEAAARLLLLLEaeADYEGFLEGVKAALLLAGLRG 521
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  911 VQR---------------LQECIAELSQQLGT------SEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKAS 969
Cdd:COG1196    522 LAGavavligveaayeaaLEAALAAALQNIVVeddevaAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAV 601
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  970 AYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELME---RVATSD 1046
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAallEAEAEL 681
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1047 GDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEET 1126
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
                          490
                   ....*....|...
gi 1907081931 1127 EIKLQEKEECLRR 1139
Cdd:COG1196    762 LEELERELERLER 774
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
718-928 1.85e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 53.48  E-value: 1.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRL--LQDQLRVALGREQSAREGYV-LQTEVAtspsGAWQRLHRVNQDLQSELEAQcrRQEL 794
Cdd:COG3206    187 LRKELEEAEAALEEFRQKNGLvdLSEEAKLLLQQLSELESQLAeARAELA----EAEARLAALRAQLGSGPDAL--PELL 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  795 ITQQIQTLKHSYGEAkdairhhEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLE-EWQHSKAMLSgQL 873
Cdd:COG3206    261 QSPVIQQLRAQLAEL-------EAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEaELEALQAREA-SL 332
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  874 RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKE-VQRLQEciAELSQQLGTS 928
Cdd:COG3206    333 QAQLAQLEARLAELPELEAELRRLEREVEVARELYESlLQRLEE--ARLAEALTVG 386
PH2_ADAP cd01251
ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called ...
427-517 2.15e-06

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called centaurin alpha) is a phophatidlyinositide binding protein consisting of an N-terminal ArfGAP domain and two PH domains. In response to growth factor activation, PI3K phosphorylates phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 1 is recruited to the plasma membrane following growth factor stimulation by specific binding of its PH domain to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 2 is constitutively bound to the plasma membrane since it binds phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate with equal affinity. This cd contains the second PH domain repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241282  Cd Length: 105  Bit Score: 48.35  E-value: 2.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  427 NFKK-GWLTK----QYEdgQWKKHWFVLADQSLRYYRDSvaeeaadLD----GEINLSTC---YDVTE-----YPVQRNY 489
Cdd:cd01251      1 DFLKeGYLEKtgpkQTD--GFRKRWFTLDDRRLMYFKDP-------LDafpkGEIFIGSKeegYSVREglppgIKGHWGF 71
                           90       100
                   ....*....|....*....|....*...
gi 1907081931  490 GFQIHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd01251     72 GFTLVTPDRTFLLSAETEEERREWITAI 99
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1680-2282 2.46e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 2.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1753
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1754 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1828
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRCmVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1908
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEG-VKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 1981
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2048
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2049 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2125
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2126 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPL 2200
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2201 TQGKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEAL 2280
Cdd:TIGR02168  828 SLERRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

                   ..
gi 1907081931 2281 GE 2282
Cdd:TIGR02168  904 RE 905
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
830-1060 2.53e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.07  E-value: 2.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  830 AAAELAIKEQALAKLKGELKmeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ 908
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIA----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAElAELEKEIA 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  909 KEVQRLQECIAELSQQLgtseqaqRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQVEALQ 986
Cdd:COG4942     94 ELRAELEAQKEELAELL-------RALYRLGRQPPLALLLSPEDFLDAVrrLQYLKYLAPARREQAEELRADLAELAALR 166
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  987 KEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKE 1060
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
703-1032 3.71e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 52.38  E-value: 3.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  703 EDRSERLSTheLTSLLEKELEQSQKEASDLLEQNRLLqdqlrvalgREQSAREGYVLqtevatspSGAWQRLHRVNQDLQ 782
Cdd:TIGR02169  183 EENIERLDL--IIDEKRQQLERLRREREKAERYQALL---------KEKREYEGYEL--------LKEKEALERQKEAIE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  783 SELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQ--------TLQTRLGNAAAELAIKEQALAKLKGELKMEQGK 854
Cdd:TIGR02169  244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgeeeqlRVKEKIGELEAEIASLERSIAEKERELEDAEER 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  855 VR---EQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----QALQRDRQKEVQRlQECIAELSQQLG 926
Cdd:TIGR02169  324 LAkleAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdKEFAETRDELKDY-REKLEKLKREIN 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  927 TSEQAQ-RLMEKKLKRNYTLL-----LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKL---SETCKGS 997
Cdd:TIGR02169  403 ELKRELdRLQEELQRLSEELAdlnaaIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYdlkEEYDRVE 482
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907081931  998 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIK 1032
Cdd:TIGR02169  483 KELSKLQRELAEAEAQARASEERVRGGRAVEEVLK 517
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
856-1092 6.40e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 51.84  E-value: 6.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  856 REQLEEWQHSKAMLSgQLRASEQKLRSTEARLlEKTQELRD----------LETQQALQRDRQKEVQRLQECIAELSQQL 925
Cdd:COG4913    241 HEALEDAREQIELLE-PIRELAERYAAARERL-AELEYLRAalrlwfaqrrLELLEAELEELRAELARLEAELERLEARL 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  926 GTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQghvqqvealqkeKLSETCKGSEQVHKLEE 1005
Cdd:COG4913    319 DALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLA------------ALGLPLPASAEEFAALR 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1006 eleareasiRQLAQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLR 1085
Cdd:COG4913    387 ---------AEAAALLEALEEELE-------ALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALA 450
                          250
                   ....*....|.
gi 1907081931 1086 E----KEEELK 1092
Cdd:COG4913    451 EalglDEAELP 461
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2013-2322 6.42e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 6.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQRDFESLKATCERgfaamEETHQKKIEDLQRQhqreLEKLREEKDRLL--------AEETAATI--SAIEAM 2082
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEEN-----IERLDLIIDEKRQQ----LERLRREREKAEryqallkeKREYEGYEllKEKEAL 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2083 K------NAHREEMERELEKSQRsQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKCLENAHLAQA-LEAERQAL 2155
Cdd:TIGR02169  236 ErqkeaiERQLASLEEELEKLTE-EISELEKRLEEIEQ----LLEELNKKIKDLGEEEQLRVKEKIGELEAeIASLERSI 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2156 RQCQRENQELNAHNQELN---NRLAAEITRLRTLLtgdgggESTGLPLTQGKDAY-ELEVLLRVKESEIQYLKQEISSLK 2231
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEaeiDKLLAEIEELEREI------EEERKRRDKLTEEYaELKEELEDLRAELEEVDKEFAETR 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2232 DELQTALRDKKYASDKYKDIYTELSI---AKAKADCDISRLKEQLKAATEALGEkSPEGTTVSGYDIMKSKSNPDFLKKD 2308
Cdd:TIGR02169  385 DELKDYREKLEKLKREINELKRELDRlqeELQRLSEELADLNAAIAGIEAKINE-LEEEKEDKALEIKKQEWKLEQLAAD 463
                          330
                   ....*....|....
gi 1907081931 2309 RSCVTRQLRNIRSK 2322
Cdd:TIGR02169  464 LSKYEQELYDLKEE 477
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
779-1135 6.61e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 51.56  E-value: 6.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDA-------IRHHEAEIQTLQTRLGNAAAELAI----KEQALAK-LKG 846
Cdd:TIGR04523  235 EKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQlsekqkeLEQNNKKIKELEKQLNQLKSEISDlnnqKEQDWNKeLKS 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  847 ELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ---KEVQRLQECIA 919
Cdd:TIGR04523  315 ELKNQEKKLEEiqnQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEiEKLKKENQsykQEIKNLESQIN 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  920 ELSQQLGTSEQAQRLME---KKLKRNYTLLLESCEQEKQALLQN---LKEVEDKASAYEDQLQGHVQQVEAlQKEKLSEt 993
Cdd:TIGR04523  395 DLESKIQNQEKLNQQKDeqiKKLQQEKELLEKEIERLKETIIKNnseIKDLTNQDSVKELIIKNLDNTRES-LETQLKV- 472
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  994 ckgseqvhkleeeleaREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1073
Cdd:TIGR04523  473 ----------------LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1074 SVQLQSVRTLLREKEEELKhiKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR04523  537 ESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596
PH_DAPP1 cd10573
Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...
426-517 6.74e-06

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269977 [Multi-domain]  Cd Length: 96  Bit Score: 46.55  E-value: 6.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  426 LNFKKGWLTKQyedGQ----WKKHWFVLADQSLRYYRDSVAEEAADldgEINLSTCYDVTEYPVQ-RNYGFQIHTKEGEF 500
Cdd:cd10573      2 LGSKEGYLTKL---GGivknWKTRWFVLRRNELKYFKTRGDTKPIR---VLDLRECSSVQRDYSQgKVNCFCLVFPERTF 75
                           90
                   ....*....|....*..
gi 1907081931  501 TLSAMTSGIRRNWIQTI 517
Cdd:cd10573     76 YMYANTEEEADEWVKLL 92
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
717-964 7.37e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 51.05  E-value: 7.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  717 LLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQ----CRRQ 792
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKykelSASS 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  793 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHSKAMLSGQ 872
Cdd:pfam07888  111 EELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELE-----------RMKERAKKAGAQRKEEEAERKQLQAK 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  873 LRASEQKLRSTEARLlektQELRDLETQQALQrdrqkeVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL--LLESC 950
Cdd:pfam07888  180 LQQTEEELRSLSKEF----QELRNSLAQRDTQ------VLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLqeRLNAS 249
                          250
                   ....*....|....
gi 1907081931  951 EQEKQALLQNLKEV 964
Cdd:pfam07888  250 ERKVEGLGEELSSM 263
PH_TBC1D2A cd01265
TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...
442-520 7.54e-06

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269966  Cd Length: 102  Bit Score: 46.55  E-value: 7.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  442 WKKHWFVLADQS--LRYYRDSvaeeaADLD--GEINLSTC---YDVTEYPVQrnygFQIHTKEGEFTLSAMTSGIRRNWI 514
Cdd:cd01265     19 WKRRWFVLDESKcqLYYYRSP-----QDATplGSIDLSGAafsYDPEAEPGQ----FEIHTPGRVHILKASTRQAMLYWL 89

                   ....*.
gi 1907081931  515 QTIMKH 520
Cdd:cd01265     90 QALQSK 95
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
778-1134 1.10e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 50.79  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  778 NQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGElkmEQGKvRE 857
Cdd:TIGR04523  309 NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE---NQSY-KQ 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  858 QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQ----ALQRDRQKEVQRLQECIAELSQQLGTSEQAQR 933
Cdd:TIGR04523  385 EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIerlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRE 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  934 LMEKKLKrnytLLLESCEQEKQALLQNLKEVEDKASAYeDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREAS 1013
Cdd:TIGR04523  465 SLETQLK----VLSRSINKIKQNLEQKQKELKSKEKEL-KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESK 539
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1014 IRQLAQHVQSLHDE--RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL 1091
Cdd:TIGR04523  540 ISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKEL 619
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1092 KHIKETHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEKE 1134
Cdd:TIGR04523  620 EKAKKENEKLSSIIK-NIKSKKNKLKQEVKQIKETIKEIRNKW 661
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1797-2230 1.31e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.71  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1797 QKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAEN--KHSESMFALQGRYEEEIRCM 1874
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEleEAEEALLERLERLEEELEEL 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1875 VEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKmLEDRFQLKVRELQAVHQEELRALQEHyiWSLRGALSLY 1954
Cdd:COG1196    427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE-LLEEAALLEAALAELLEELAEAAARL--LLLLEAEADY 503
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1955 QPSHPDSSLAPGPSEPRAVPAAKDEAesMSGLRERIQELEAQMGVMREELGHKELEgdVAALQEKYQRDFESLKAT---- 2030
Cdd:COG1196    504 EGFLEGVKAALLLAGLRGLAGAVAVL--IGVEAAYEAALEAALAAALQNIVVEDDE--VAAAAIEYLKAAKAGRATflpl 579
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2031 --CERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMERELEKSQRSQISSI 2105
Cdd:COG1196    580 dkIRARAALAAALARGAIGAAVDLVASDLREADARYYVLgdtLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2106 NSDIEALRRQYLEELQSVQRELEVLSEQ--YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRL 2183
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERlaEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 2184 RTLltgdgggESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSL 2230
Cdd:COG1196    740 ELL-------EEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779
PH1_PLEKHH1_PLEKHH2 cd13282
Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 ...
429-517 1.45e-05

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 (PLEKHH1) PH domain, repeat 1; PLEKHH1 and PLEKHH2 (also called PLEKHH1L) are thought to function in phospholipid binding and signal transduction. There are 3 Human PLEKHH genes: PLEKHH1, PLEKHH2, and PLEKHH3. There are many isoforms, the longest of which contain a FERM domain, a MyTH4 domain, two PH domains, a peroximal domain, a vacuolar domain, and a coiled coil stretch. The FERM domain has a cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241436  Cd Length: 96  Bit Score: 45.75  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRD--SVAEEAAdldGEINLSTCYDVTeyPVQRNYGFQIHTKEGEFTLS 503
Cdd:cd13282      1 KAGYLTKL--GGKvktWKRRWFVLKNGELFYYKSpnDVIRKPQ---GQIALDGSCEIA--RAEGAQTFEIVTEKRTYYLT 73
                           90
                   ....*....|....
gi 1907081931  504 AMTSGIRRNWIQTI 517
Cdd:cd13282     74 ADSENDLDEWIRVI 87
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
793-1104 1.47e-05

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 49.52  E-value: 1.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  793 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtrlgNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhskamLSGQ 872
Cdd:COG1340      4 DELSSSLEELEEKIEELREEIEELKEKRDELN----EELKELAEKRDELNAQVKELREEAQELREKRDE-------LNEK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  873 LRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS----EQAQRLMEK--KLKRNYTLL 946
Cdd:COG1340     73 VKELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEvlspEEEKELVEKikELEKELEKA 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  947 LESCEQEKQallqnLKEVEDKASAYEDQLQGHVQQVEALQKEklsetckgSEQVHKleeeleareaSIRQLAQHVQSLHD 1026
Cdd:COG1340    153 KKALEKNEK-----LKELRAELKELRKEAEEIHKKIKELAEE--------AQELHE----------EMIELYKEADELRK 209
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 1027 ERDLIKHQFQELMERvatsdgdVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRtllreKEEELKHIKETHERVLEK 1104
Cdd:COG1340    210 EADELHKEIVEAQEK-------ADELHEEIIELQKELRELRKELKKLRKKQRALK-----REKEKEELEEKAEEIFEK 275
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1819-2288 2.18e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 50.12  E-value: 2.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1819 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1890
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1891 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1970
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1971 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2027
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2028 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2096
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2097 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2170
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2171 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2234
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2235 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2288
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721
mukB PRK04863
chromosome partition protein MukB;
702-1042 2.36e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 49.96  E-value: 2.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  702 LEDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQtevatspsgawQRLHRVNQDL 781
Cdd:PRK04863   289 LELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ-----------EKIERYQADL 357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  782 QsELEAQCRRQELITQQIQTLKHSYGEAKDAIrhhEAEIQTLQTRLGNAAAELAIKE----------QALAKLK---GEL 848
Cdd:PRK04863   358 E-ELEERLEEQNEVVEEADEQQEENEARAEAA---EEEVDELKSQLADYQQALDVQQtraiqyqqavQALERAKqlcGLP 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  849 KMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEA--RLLEKTQEL-----------------RDLETQQALQRDRQK 909
Cdd:PRK04863   434 DLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahSQFEQAYQLvrkiagevsrseawdvaRELLRRLREQRHLAE 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  910 EVQRLQECIAELSQQLGTSEQAQRLM---EKKLKRNYTL--LLESCEQEKQALLQNLKE----VEDKASAYEDQLQGHVQ 980
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLaefCKRLGKNLDDedELEQLQEELEARLESLSEsvseARERRMALRQQLEQLQA 593
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931  981 QVEALQK---------EKLSETCkgsEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERV 1042
Cdd:PRK04863   594 RIQRLAArapawlaaqDALARLR---EQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEI 661
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
703-1135 2.70e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 49.65  E-value: 2.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  703 EDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYvlqTEVATSPSGAWQRLHRVNQDLQ 782
Cdd:PRK02224   293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEELREEAAELE 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  783 SELEAqCRRQelitqqiqtlkhsYGEAKDAIRHHEAEIQTLQTRLGNAAAELaikeQALAKLKGELKMEQGKVREQLEEw 862
Cdd:PRK02224   370 SELEE-AREA-------------VEDRREEIEELEEEIEELRERFGDAPVDL----GNAEDFLEELREERDELREREAE- 430
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  863 qhskamLSGQLRASEQKLRSTEaRLLEK------TQELRDLETQQALQRDRQKevqrlqecIAELSQQLGTSEQAQRLME 936
Cdd:PRK02224   431 ------LEATLRTARERVEEAE-ALLEAgkcpecGQPVEGSPHVETIEEDRER--------VEELEAELEDLEEEVEEVE 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  937 KKLKRNYTllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQvhklEEELEAREASIRQ 1016
Cdd:PRK02224   496 ERLERAED--LVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEK----REAAAEAEEEAEE 569
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1017 LAQHVQSLHDERDLIKHQFQELmERVATSDGDVAELQ---EKLRGKEVDYQNLE-HSHHRVSVQLQSVRTLLREKE---- 1088
Cdd:PRK02224   570 AREEVAELNSKLAELKERIESL-ERIRTLLAAIADAEdeiERLREKREALAELNdERRERLAEKRERKRELEAEFDeari 648
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 1089 EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK02224   649 EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
885-1135 2.76e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 49.53  E-value: 2.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  885 ARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytlllescEQEKQALLQNLKEV 964
Cdd:COG4913    194 LRLLHKTQSFKPIGDLDDFVREYMLEEPDTFEAADALVEHFDDLERAHEALED-------------AREQIELLEPIREL 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  965 EDKASAYEDQLQGHVQQVEALQKEKlsetckGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVAT 1044
Cdd:COG4913    261 AERYAAARERLAELEYLRAALRLWF------AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRG 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1045 SDGD-VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSL 1123
Cdd:COG4913    335 NGGDrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
                          250
                   ....*....|..
gi 1907081931 1124 EETEIKLQEKEE 1135
Cdd:COG4913    415 RDLRRELRELEA 426
PH_SWAP-70 cd13273
Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called ...
429-517 2.93e-05

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called Differentially expressed in FDCP 6/DEF-6 or IRF4-binding protein) functions in cellular signal transduction pathways (in conjunction with Rac), regulates cell motility through actin rearrangement, and contributes to the transformation and invasion activity of mouse embryo fibroblasts. Metazoan SWAP-70 is found in B lymphocytes, mast cells, and in a variety of organs. Metazoan SWAP-70 contains an N-terminal EF-hand motif, a centrally located PH domain, and a C-terminal coiled-coil domain. The PH domain of Metazoan SWAP-70 contains a phosphoinositide-binding site and a nuclear localization signal (NLS), which localize SWAP-70 to the plasma membrane and nucleus, respectively. The NLS is a sequence of four Lys residues located at the N-terminus of the C-terminal a-helix; this is a unique characteristic of the Metazoan SWAP-70 PH domain. The SWAP-70 PH domain binds PtdIns(3,4,5)P3 and PtdIns(4,5)P2 embedded in lipid bilayer vesicles. There are additional plant SWAP70 proteins, but these are not included in this hierarchy. Rice SWAP70 (OsSWAP70) exhibits GEF activity toward the its Rho GTPase, OsRac1, and regulates chitin-induced production of reactive oxygen species and defense gene expression in rice. Arabidopsis SWAP70 (AtSWAP70) plays a role in both PAMP- and effector-triggered immunity. Plant SWAP70 contains both DH and PH domains, but their arrangement is the reverse of that in typical DH-PH-type Rho GEFs, wherein the DH domain is flanked by a C-terminal PH domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270092  Cd Length: 110  Bit Score: 45.36  E-value: 2.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYrdsVAEEAADLDGEINL--STCYDVTEYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13273     10 KKGYLWKKgHLLPTWTERWFVLKPNSLSYY---KSEDLKEKKGEIALdsNCCVESLPDREGKKCRFLVKTPDKTYELSAS 86
                           90
                   ....*....|..
gi 1907081931  506 TSGIRRNWIQTI 517
Cdd:cd13273     87 DHKTRQEWIAAI 98
PH_Osh1p_Osh2p_yeast cd13292
Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p ...
430-521 3.02e-05

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p is proposed to function in postsynthetic sterol regulation, piecemeal microautophagy of the nucleus, and cell polarity establishment. Yeast Osh2p is proposed to function in sterol metabolism and cell polarity establishment. Both Osh1p and Osh2p contain 3 N-terminal ankyrin repeats, a PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBP andOsh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. In general OSBPs and ORPs have been found to be involved in the transport and metabolism of cholesterol and related lipids in eukaryotes. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241446  Cd Length: 103  Bit Score: 44.99  E-value: 3.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  430 KGWLTK--QYEDGqWKKHWFVLADQSLRYYRDSVAEEAAdLDGEINLSTCYDVteYPVQRNYGFQIHTKEG---EFTLSA 504
Cdd:cd13292      5 KGYLKKwtNYAKG-YKTRWFVLEDGVLSYYRHQDDEGSA-CRGSINMKNARLV--SDPSEKLRFEVSSKTSgspKWYLKA 80
                           90
                   ....*....|....*..
gi 1907081931  505 MTSGIRRNWIQTIMKHV 521
Cdd:cd13292     81 NHPVEAARWIQALQKAI 97
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
718-931 3.24e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.61  E-value: 3.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLrvalgrEQSAREGYVLQTEVATspsgAWQRLHRVNQDLQSELEAQCRRQELITQ 797
Cdd:COG4942     39 LEKELAALKKEEKALLKQLAALERRI------AALARRIRALEQELAA----LEAELAELEKEIAELRAELEAQKEELAE 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTL----KHSY-------GEAKDAIRHHEAeIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSK 866
Cdd:COG4942    109 LLRALyrlgRQPPlalllspEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEER 187
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081931  867 AMLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQLGTSEQA 931
Cdd:COG4942    188 AALEALKAERQKLLARLEKELAELAAELAEL----------QQEAEELEALIARLEAEAAAAAER 242
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
683-1149 3.48e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 49.20  E-value: 3.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  683 ETTPLREEkQVPIAPLHLSLEDRSERLSTHELTSLLEKELEQ-----SQKEASDLLEQNRLLQDQLRVALGREQSAREGY 757
Cdd:TIGR00618  401 ELDILQRE-QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  758 VLQTEVATSPSGAWQRLHRVnQDLQSELEAQCRRQELITQQIQTL------------KHSY-GEAKDAIRHHEAEIQTLQ 824
Cdd:TIGR00618  480 QIHLQETRKKAVVLARLLEL-QEEPCPLCGSCIHPNPARQDIDNPgpltrrmqrgeqTYAQlETSEEDVYHQLTSERKQR 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  825 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgqlraseqklRSTEARLLEKTQELRDLETQQALQ 904
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLS----------EAEDMLACEQHALLRKLQPEQDLQ 628
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  905 RDRQKEvqrlQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQN-LKEVEDKASAYEDQLQGHVQQVE 983
Cdd:TIGR00618  629 DVRLHL----QQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLaLQKMQSEKEQLTYWKEMLAQCQT 704
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  984 ALQKEKLSETcKGSEQVHKLEEELEAREASIR-QLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVD 1062
Cdd:TIGR00618  705 LLRELETHIE-EYDREFNEIENASSSLGSDLAaREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHL 783
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1063 YQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFVS 1142
Cdd:TIGR00618  784 AAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQ 863

                   ....*..
gi 1907081931 1143 DSPKDAK 1149
Cdd:TIGR00618  864 LTQEQAK 870
PH_Gab-like cd13324
Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...
431-517 4.10e-05

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270133  Cd Length: 112  Bit Score: 44.71  E-value: 4.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  431 GWLTK-----QYEDGQWKKHWFVL------ADQS-LRYYRDsvaEEAADLDGEINLSTCYDVT-----EYPVQRN-YGFQ 492
Cdd:cd13324      5 GWLTKsppekKIWRAAWRRRWFVLrsgrlsGGQDvLEYYTD---DHCKKLKGIIDLDQCEQVDagltfEKKKFKNqFIFD 81
                           90       100
                   ....*....|....*....|....*
gi 1907081931  493 IHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd13324     82 IRTPKRTYYLVAETEEEMNKWVRCI 106
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
677-1132 6.16e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 48.21  E-value: 6.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  677 QRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLE-KELEQSQKEASDLLEQNRLLQDQLRVALGREQSARE 755
Cdd:pfam07111  146 QRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLETKRAGEaKQLAEAQKEAELLRKQLSKTQEELEAQVTLVESLRK 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  756 gYVLQTEVATSPSGAWqrlhrvnqdlqsELEaqcrRQELItQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELA 835
Cdd:pfam07111  226 -YVGEQVPPEVHSQTW------------ELE----RQELL-DTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELT 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  836 IKEQALAKLKGELKMeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA-----LQR---DR 907
Cdd:pfam07111  288 RKIQPSDSLEPEFPK---KCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQVAELQEQVTSQSqeqaiLQRalqDK 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  908 QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 981
Cdd:pfam07111  365 AAEVEVERMSAKGLQMELSRAQEARRRQQQQTASAEEQLkfvvnaMSSTQIWLETTMTRVEQAVARIPSLSNRLSYAVRK 444
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  982 V---EALQKEKLS------ETCKGSEqvhKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERvATSDGDVAEL 1052
Cdd:pfam07111  445 VhtiKGLMARKVAlaqlrqESCPPPP---PAPPVDADLSLELEQLREERNRLDAELQLSAHLIQQEVGR-AREQGEAERQ 520
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 Q--EKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKM-IALGSSLEETEIK 1129
Cdd:pfam07111  521 QlsEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELTQQQEIYGQALQEKVAEVeTRLREQLSDTKRR 600

                   ...
gi 1907081931 1130 LQE 1132
Cdd:pfam07111  601 LNE 603
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1787-2271 6.84e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 48.18  E-value: 6.84e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1787 ACQEAKGASGQKRAQAVGALKEEYEELLHKQKsEYQKVITLIEKENTELKAKVSqmdhqqrclqEAENKHSESMFALqgr 1866
Cdd:pfam05483  198 AFEELRVQAENARLEMHFKLKEDHEKIQHLEE-EYKKEINDKEKQVSLLLIQIT----------EKENKMKDLTFLL--- 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1867 yeEEIRCMVEQLSHtENTLQAERSRVLSQ----LDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEH 1942
Cdd:pfam05483  264 --EESRDKANQLEE-KTKLQDENLKELIEkkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEEL 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1943 YIWSLRGALSLYQPSHPDSSLAPG-PSEPRAVPAAKD-------EAESMSGLRERIQELEAQMGVMREELgHKELEGDVA 2014
Cdd:pfam05483  341 NKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKNEDqlkiitmELQKKSSELEEMTKFKNNKEVELEEL-KKILAEDEK 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2015 ALQEKYQRD--FESLKATcERGFAAMEETHQKKIEDLQRQ----------HQRELEKLREE--KDRLLAEETAATISAIE 2080
Cdd:pfam05483  420 LLDEKKQFEkiAEELKGK-EQELIFLLQAREKEIHDLEIQltaiktseehYLKEVEDLKTEleKEKLKNIELTAHCDKLL 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2081 AMKNAHREE---MERELEKSQRSQISSINSDIEALRR-QYLEELQSVQR-ELEVLSEQYSQ-----KCLENAHLAQALEA 2150
Cdd:pfam05483  499 LENKELTQEasdMTLELKKHQEDIINCKKQEERMLKQiENLEEKEMNLRdELESVREEFIQkgdevKCKLDKSEENARSI 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2151 ERQALRQCQRENQELNAHNQ-----ELNNRLAAEITRLRTLLTGDGGGESTGLpltqgkDAYELEVllRVKESEIQYLKQ 2225
Cdd:pfam05483  579 EYEVLKKEKQMKILENKCNNlkkqiENKNKNIEELHQENKALKKKGSAENKQL------NAYEIKV--NKLELELASAKQ 650
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931 2226 EISSLKDELQTALRDKKYASDKykdIYTELSIAKAKADCDISRLKE 2271
Cdd:pfam05483  651 KFEEIIDNYQKEIEDKKISEEK---LLEEVEKAKAIADEAVKLQKE 693
PH_evt cd13265
Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also ...
428-480 7.00e-05

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also called pleckstrin homology domain containing, family B): evt-1 (also called PLEKHB1) and evt-2 (also called PLEKHB2). evt-1 is specific to the nervous system, where it is expressed in photoreceptors and myelinating glia. evt-2 is widely expressed in both neural and nonneural tissues. Evectins possess a single N-terminal PH domain and a C-terminal hydrophobic region. evt-1 is thought to function as a mediator of post-Golgi trafficking in cells that produce large membrane-rich organelles. It is a candidate gene for the inherited human retinopathy autosomal dominant familial exudative vitreoretinopathy and a susceptibility gene for multiple sclerosis. evt-2 is essential for retrograde endosomal membrane transport from the plasma membrane (PM) to the Golgi. Two membrane trafficking pathways pass through recycling endosomes: a recycling pathway and a retrograde pathway that links the PM to the Golgi/ER. Its PH domain that is unique in that it specifically recognizes phosphatidylserine (PS), but not polyphosphoinositides. PS is an anionic phospholipid class in eukaryotic biomembranes, is highly enriched in the PM, and plays key roles in various physiological processes such as the coagulation cascade, recruitment and activation of signaling molecules, and clearance of apoptotic cells. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270085  Cd Length: 108  Bit Score: 44.22  E-value: 7.00e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  428 FKKGWLTKQYE-DGQWKKHWFVL-ADQSLRYYRDsvaEEAADLDGEINL-STCYDV 480
Cdd:cd13265      4 VKSGWLLRQSTiLKRWKKNWFVLyGDGNLVYYED---ETRREVEGRINMpRECRNI 56
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1963-2193 7.08e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 47.45  E-value: 7.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1963 LAPGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLKATcERGFAAMEeth 2042
Cdd:COG4942     10 LLALAAAAQADAAAEAEAE-LEQLQQEIAELEKELAALKKE--EKALLKQLAALERRIAALARRIRAL-EQELAALE--- 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2043 qKKIEDLQRQH---QRELEKLREEKDRLLA--------EETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEA 2111
Cdd:COG4942     83 -AELAELEKEIaelRAELEAQKEELAELLRalyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2112 LRR------QYLEELQSVQRELE----VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEIT 2181
Cdd:COG4942    162 LAAlraeleAERAELEALLAELEeeraALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
                          250
                   ....*....|..
gi 1907081931 2182 RLRTLLTGDGGG 2193
Cdd:COG4942    242 RTPAAGFAALKG 253
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
895-1143 8.14e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.13  E-value: 8.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  895 RDLETQQALqRDRQKEVQRLQECIAELSQQLGT----SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASA 970
Cdd:TIGR02168  173 RRKETERKL-ERTRENLDRLEDILNELERQLKSlerqAEKAERYKELKAELR--------ELELALLVLRLEELREELEE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  971 YEDQLQGHVQQVEALQKEKlsetckgseqvhkleeelEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVA 1050
Cdd:TIGR02168  244 LQEELKEAEEELEELTAEL------------------QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1051 ELQEKLRgkevdyqNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHErVLEKKDQDLNEALVKMIALgssLEETEIKL 1130
Cdd:TIGR02168  306 ILRERLA-------NLERQLEELEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAE---LEELESRL 374
                          250
                   ....*....|...
gi 1907081931 1131 QEKEECLRRFVSD 1143
Cdd:TIGR02168  375 EELEEQLETLRSK 387
PTZ00121 PTZ00121
MAEBL; Provisional
1776-2180 9.29e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.21  E-value: 9.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1852
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1853 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1932
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2012
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2092
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2172
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773

                   ....*...
gi 1907081931 2173 NNRLAAEI 2180
Cdd:PTZ00121  1774 RKEKEAVI 1781
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
688-1137 9.50e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.12  E-value: 9.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  688 REEKQVPIAPLHLSLEDRSERlSTHELTSLLEKELEQSQKEASDLLEQNRLLQdqlrVALGREQSARegyVLQTEVAtsp 767
Cdd:TIGR00606  724 RRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQKVNRDIQRLKNDIEEQETLLG----TIMPEEESAK---VCLTDVT--- 792
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  768 sgawqrlhrVNQDLQSELEAQCRRqelITQQIQTLKHSygeakDAIRHHEAEIQTLQTrlgnaaaelaiKEQALAKLKGE 847
Cdd:TIGR00606  793 ---------IMERFQMELKDVERK---IAQQAAKLQGS-----DLDRTVQQVNQEKQE-----------KQHELDTVVSK 844
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  848 LKMEQGKVREQLEEWQHskamlsgqLRASEQKLRStearllEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGT 927
Cdd:TIGR00606  845 IELNRKLIQDQQEQIQH--------LKSKTNELKS------EKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  928 SEQAQRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQ-VEALQKEKLSETCKGSEQVHKLE 1004
Cdd:TIGR00606  911 DSPLETFLEKDQQEKEELISSKETSNKKAQdkVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECE 990
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1005 EELEAREASIRQLAQHVQSLHDERDLIKHQF---------QELMERVATSDGDVAELQekLRGKEVDYQNLEHSHHRVSV 1075
Cdd:TIGR00606  991 KHQEKINEDMRLMRQDIDTQKIQERWLQDNLtlrkrenelKEVEEELKQHLKEMGQMQ--VLQMKQEHQKLEENIDLIKR 1068
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1076 QLQSVRTLLREKEEELKHIKethERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECL 1137
Cdd:TIGR00606 1069 NHVLALGRQKGYEKEIKHFK---KELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL 1127
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1805-2286 1.00e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 1.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1805 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1882
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1883 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 1960
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1961 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2040
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2041 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2107
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2108 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2187
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2188 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2253
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612
                          490       500       510
                   ....*....|....*....|....*....|...
gi 1907081931 2254 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2286
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1981-2246 1.00e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 47.76  E-value: 1.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1981 ESMSGLRERIQELEAQMGVMREELGH------KELEGDVAALQEKYqRDFESLKATCERGFAAMEEtHQKKIEDLQRQHQ 2054
Cdd:TIGR02169  251 EELEKLTEEISELEKRLEEIEQLLEElnkkikDLGEEEQLRVKEKI-GELEAEIASLERSIAEKER-ELEDAEERLAKLE 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMEReleksqRSQISSINSDIEALRR---QYLEELQSVQRELE 2128
Cdd:TIGR02169  329 AEIDKLLAEIEELereIEEERKRRDKLTEEYAELKEELEDL------RAELEEVDKEFAETRDelkDYREKLEKLKREIN 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2129 VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTlltgdgggestglpLTQGKDAYE 2208
Cdd:TIGR02169  403 ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--------------LAADLSKYE 468
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907081931 2209 LEvlLRVKESEIQYLKQEISSLKDELQTALRDKKYASD 2246
Cdd:TIGR02169  469 QE--LYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1988-2164 1.17e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 47.43  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1988 ERIQELEA-QMGVMRE-ELGHKELEG--DVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKLREE 2063
Cdd:pfam17380  375 SRMRELERlQMERQQKnERVRQELEAarKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2064 KdrLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISS---INSDIEALRRQYLEELQS---VQRELE-----VLSE 2132
Cdd:pfam17380  455 E--QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRrkiLEKELEERKQAMIEEERKrklLEKEMEerqkaIYEE 532
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907081931 2133 QYSQKCLENAHLAQALEAERQALRQCQRENQE 2164
Cdd:pfam17380  533 ERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
779-993 1.35e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 46.75  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgelkmeqgkvREQ 858
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-----------REE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  859 LEEWQHSKAMLSGQLRASEQKLRSTE-ARLLEKTQELRDL-ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLME 936
Cdd:COG3883     88 LGERARALYRSGGSVSYLDVLLGSESfSDFLDRLSALSKIaDADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931  937 KKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 993
Cdd:COG3883    168 AAKAE-----LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAA 219
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
702-1139 1.40e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.37  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  702 LEDRSERLSTheltslLEKELEQSQKEASDLLEQNRLLQD--QLRVALGREQSAREGYVLQtevatspsgawqrlhrvnq 779
Cdd:PRK03918   333 LEEKEERLEE------LKKKLKELEKRLEELEERHELYEEakAKKEELERLKKRLTGLTPE------------------- 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  780 DLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNA---AAELAikEQALAKLKGELKMEQGKVR 856
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcGRELT--EEHRKELLEEYTAELKRIE 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  857 EQLEEWQHSKAMLSGQLRASEQKLR--STEARLLEKTQELRDLETQqaLQRDRQKEVQRLQECIAELSQQLGTSEQAQRL 934
Cdd:PRK03918   466 KELKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEK--LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKS 543
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  935 MEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEalqkEKLSETCKGSEQVHKLEEELEAREASI 1014
Cdd:PRK03918   544 LKKELEK-----LEELKKKLAELEKKLDELEEELAELLKELEELGFESV----EELEERLKELEPFYNEYLELKDAEKEL 614
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1015 RQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL-----QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE 1089
Cdd:PRK03918   615 EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAELEELEKRREEIKK 694
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1090 ELKHIKETHERVLEKKD--QDLNEALVKMIALGSSLEetEIKLQEKEECLRR 1139
Cdd:PRK03918   695 TLEKLKEELEEREKAKKelEKLEKALERVEELREKVK--KYKALLKERALSK 744
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1986-2184 1.42e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 47.32  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELghKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQhqreLEKLREEKD 2065
Cdd:COG3206    173 ARKALEFLEEQLPELRKEL--EEAEAALEEFRQKNG----------LVDLSEEAKLLLQQLSELESQ----LAEARAELA 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2066 RLLAEETAATiSAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEE---LQSVQRELEVLSEQYSQkclENA 2142
Cdd:COG3206    237 EAEARLAALR-AQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQ---EAQ 312
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907081931 2143 HLAQALEAERQALRQCQRE-NQELNAHNQELN--NRLAAEITRLR 2184
Cdd:COG3206    313 RILASLEAELEALQAREASlQAQLAQLEARLAelPELEAELRRLE 357
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
878-1081 1.52e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.22  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQ-QLGTSEQAQRLMEKKLKRNyTLLLESCEQEKQA 956
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAAlRLWFAQRRLELLEAELEEL-RAELARLEAELER 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  957 LLQNLKEVEDKASAYEDQLQGH-VQQVEALQKEklsetckgseqVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF 1035
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNgGDRLEQLERE-----------IERLERELEERERRRARLEALLAALGLPLPASAEEF 382
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1036 QEL----MERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVR 1081
Cdd:COG4913    383 AALraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1869-2281 1.62e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 47.14  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1869 EEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHyIWSLR 1948
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLESAELR-LSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGE-LSAAD 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1949 GALSLYQpSHPDSSlapgpsEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLK 2028
Cdd:pfam12128  315 AAVAKDR-SELEAL------EDQHGAFLDADIETAAADQEQLPSWQSELENLEER--LKALTGKHQDVTAKYNRRRSKIK 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2029 ATCERGFAAMEEthqkkiedlqrqhqrELEKLREEKDRLLAEETAatisAIEAMKNAHREEME------RELEKSQRSQI 2102
Cdd:pfam12128  386 EQNNRDIAGIKD---------------KLAKIREARDRQLAVAED----DLQALESELREQLEagklefNEEEYRLKSRL 446
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2103 SSIN---------SDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELN 2173
Cdd:pfam12128  447 GELKlrlnqatatPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELE 526
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2174 NRLAAEITRLRTLLTGDGGG--ESTG----------------LPLTQGKDAYEL-EVLLRVKESEIQ---YLKQEISSLK 2231
Cdd:pfam12128  527 LQLFPQAGTLLHFLRKEAPDweQSIGkvispellhrtdldpeVWDGSVGGELNLyGVKLDLKRIDVPewaASEEELRERL 606
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2232 DELQTALRDkkyASDKYKDIYTELSIAKA---KADCDISRLKEQLKAATEALG 2281
Cdd:pfam12128  607 DKAEEALQS---AREKQAAAEEQLVQANGeleKASREETFARTALKNARLDLR 656
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1982-2339 1.66e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.98  E-value: 1.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGVMREELghKELEGDVAALQE------------KYQRDFESLKATCERGFAAMEETH---QKKI 2046
Cdd:PRK03918   253 SKRKLEEKIRELEERIEELKKEI--EELEEKVKELKElkekaeeyiklsEFYEEYLDELREIEKRLSRLEEEIngiEERI 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2047 EDLQRQHQR--ELEKLREEKDRLLA--EETAATISAIEAMKnahrEEMERELEKSQRSQISSINSDIEALRRQYLEelqs 2122
Cdd:PRK03918   331 KELEEKEERleELKKKLKELEKRLEelEERHELYEEAKAKK----EELERLKKRLTGLTPEKLEKELEELEKAKEE---- 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2123 VQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNA-HNQELNNRLAAEITRLRTLLTGDGGGESTGLplt 2201
Cdd:PRK03918   403 IEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEeHRKELLEEYTAELKRIEKELKEIEEKERKLR--- 479
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2202 qgKDAYELEVLLRvKESEIQYLKQ---EISSLKDELqtalrdKKYASDKYKDIYTELSIAKAKAD---CDISRLKEQLKA 2275
Cdd:PRK03918   480 --KELRELEKVLK-KESELIKLKElaeQLKELEEKL------KKYNLEELEKKAEEYEKLKEKLIklkGEIKSLKKELEK 550
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2276 ATEALGEKSpegttvsgydimKSKSNPDFLKKDRSCVTRQLRNIRSKSLKEgltVQERLKLFES 2339
Cdd:PRK03918   551 LEELKKKLA------------ELEKKLDELEEELAELLKELEELGFESVEE---LEERLKELEP 599
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
773-947 1.80e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.68  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  773 RLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKE--QALAKLKGELKM 850
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  851 EQGKVRE------QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE-----LRDLETQQALQRDRQKEVQRLQECIA 919
Cdd:COG4717    144 LPERLEEleerleELRELEEELEELEAELAELQEELEELLEQLSLATEEelqdlAEELEELQQRLAELEEELEEAQEELE 223
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081931  920 ELSQQLGTSEQAQRL--MEKKLKRNYTLLL 947
Cdd:COG4717    224 ELEEELEQLENELEAaaLEERLKEARLLLL 253
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1975-2132 1.84e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 46.83  E-value: 1.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAES-MSGLRERIQELEAQM---GVMREElghkELEGDVAALQEKY---QRDFESLKATCER-GFAAmeETHQKKI 2046
Cdd:COG4913    309 AELERLEArLDALREELDELEAQIrgnGGDRLE----QLEREIERLERELeerERRRARLEALLAAlGLPL--PASAEEF 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2047 EDLQRQHQRELEKLREEKDRLLAEETAAtISAIEAMKNAHREeMERELEkSQRSQISSINSDIEALRRQYLEELQSVQRE 2126
Cdd:COG4913    383 AALRAEAAALLEALEEELEALEEALAEA-EAALRDLRRELRE-LEAEIA-SLERRKSNIPARLLALRDALAEALGLDEAE 459

                   ....*.
gi 1907081931 2127 LEVLSE 2132
Cdd:COG4913    460 LPFVGE 465
PH_Boi cd13316
Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally ...
430-519 2.15e-04

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally redundant and important for cell growth with Boi mutants displaying defects in bud formation and in the maintenance of cell polarity.They appear to be linked to Rho-type GTPase, Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid interactions with the GTP-bound ("active") form of Cdc42, while Rho3 can suppress of the lethality caused by deletion of Boi1 and Boi2. These findings suggest that Boi1 and Boi2 are targets of Cdc42 that promote cell growth in a manner that is regulated by Rho3. Boi proteins contain a N-terminal SH3 domain, followed by a SAM (sterile alpha motif) domain, a proline-rich region, which mediates binding to the second SH3 domain of Bem1, and C-terminal PH domain. The PH domain is essential for its function in cell growth and is important for localization to the bud, while the SH3 domain is needed for localization to the neck. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270126  Cd Length: 97  Bit Score: 42.36  E-value: 2.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  430 KGWLTKQYED-GQWKKHWFVLADQSLRYYRdsvAEEAADLDGEINLsTCYDVT----EYPVQRNYGFQI--HTKEGEFTL 502
Cdd:cd13316      3 SGWMKKRGERyGTWKTRYFVLKGTRLYYLK---SENDDKEKGLIDL-TGHRVVpddsNSPFRGSYGFKLvpPAVPKVHYF 78
                           90
                   ....*....|....*..
gi 1907081931  503 SAMTSGIRRNWIQTIMK 519
Cdd:cd13316     79 AVDEKEELREWMKALMK 95
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2055-2322 2.51e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.60  E-value: 2.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEKDRLLAEEtaatiSAIEAMKnahrEEMERELEKSQRsQISSINSDIEALRRQyLEELQSVQRELEVLSEQY 2134
Cdd:PRK03918   172 KEIKRRIERLEKFIKRT-----ENIEELI----KEKEKELEEVLR-EINEISSELPELREE-LEKLEKEVKELEELKEEI 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2135 SQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRlAAEITRLRTLltgdgggestglpltqgKDAY-ELEVLL 2213
Cdd:PRK03918   241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK-VKELKELKEK-----------------AEEYiKLSEFY 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIyTELSIAKAKADCDISRLKEQLKAATEA---LGEKSPEGTTV 2290
Cdd:PRK03918   303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEKRLEELEERHELYEEAkakKEELERLKKRL 381
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1907081931 2291 SGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSK 2322
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITAR 413
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
674-976 2.64e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.60  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  674 EIEQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLEK--ELEQSQKEASDLLEQNRLLQDQLRVALGREQ 751
Cdd:TIGR02169  699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDlsSLEQEIENVKSELKELEARIEELEEDLHKLE 778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  752 SAREGyvLQTEVATSPsgaWQRLhrvnQDLQSELEAQCRRQELITQQIQ------TLKHSYgeAKDAIRHHEAEIQTLQT 825
Cdd:TIGR02169  779 EALND--LEARLSHSR---IPEI----QAELSKLEEEVSRIEARLREIEqklnrlTLEKEY--LEKEIQELQEQRIDLKE 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  826 RLGNAAAELAIKEQALAKLKGELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA 902
Cdd:TIGR02169  848 QIKSIEKEIENLNGKKEELEEELEELEAALRDlesRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  903 LQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQ 976
Cdd:TIGR02169  928 ALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEE-----IRALEPVNMLAIQEYEEVLKRLDELKEKRA 996
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
718-1139 2.80e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.50  E-value: 2.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSA-------REGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCR 790
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLkqlrariEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  791 RQ-ELITQQIQTLKHSYGEAKDAIRHHEAEIQTL--QTRLGN---AAAELAIKEQALAKLKGELKMEQGKVREQLEEWQH 864
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIeeQRRLLQtlhSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQ 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  865 SKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR00618  387 QKTTLTQKLQSLCKeldILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  942 NYTLLLEScEQEKQALLQNLKEVEDKASAYEDQLQG------------HVQQVEALQKEKLSETCKGSEQVHKLEEELEA 1009
Cdd:TIGR00618  467 SLKEREQQ-LQTKEQIHLQETRKKAVVLARLLELQEepcplcgscihpNPARQDIDNPGPLTRRMQRGEQTYAQLETSEE 545
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1010 REASIRQ-LAQHVQSLHDERDLIKHQFQELmervATSDGDVAELQEKLRGKEVDYQNL--EHSHHRVSVQLQSVRTLLRE 1086
Cdd:TIGR00618  546 DVYHQLTsERKQRASLKEQMQEIQQSFSIL----TQCDNRSKEDIPNLQNITVRLQDLteKLSEAEDMLACEQHALLRKL 621
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 1087 KEEELKHIKETHERVLEKKDQDLNEALVKmialgsslEETEIKLQEKEECLRR 1139
Cdd:TIGR00618  622 QPEQDLQDVRLHLQQCSQELALKLTALHA--------LQLTLTQERVREHALS 666
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1975-2271 3.13e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 46.10  E-value: 3.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCE-----RGFAAMEETHQKKIEDL 2049
Cdd:COG5185    269 KLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEAEQEleeskRETETGIQNLTAEIEQG 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRLLAEETAATisaieamknahREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEV 2129
Cdd:COG5185    349 QESLTENLEAIKEEIENIVGEVELSK-----------SSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2130 LSEQysqkclenahlaqaLEAERQALRQCQRENQElnahNQELNNRLAAEITRLRTLLTGDGggeSTGLPLTQGKDAYEL 2209
Cdd:COG5185    418 ADRQ--------------IEELQRQIEQATSSNEE----VSKLLNELISELNKVMREADEES---QSRLEEAYDEINRSV 476
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQT--ALRDKKYASDKYKDIYTELSIAKAKADCDISRLKE 2271
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLKATLEKlrAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILA 540
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1841-2259 3.13e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 46.26  E-value: 3.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1841 QMDHQQRCLQEAENKHSESMFAlqgRYEEEIRCMVEQLSHTeNTLQAERSRVLSQldaSVKDRQAmeqhHVQQMKMLEDR 1920
Cdd:pfam15921   60 ELDSPRKIIAYPGKEHIERVLE---EYSHQVKDLQRRLNES-NELHEKQKFYLRQ---SVIDLQT----KLQEMQMERDA 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1921 FqLKVRELQAVHQEELRALQEHYIWSLRGALSLYQPSHPDSSLAPGP------------SEPRAVPAAKDEAESmsglrE 1988
Cdd:pfam15921  129 M-ADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQlrkmmlshegvlQEIRSILVDFEEASG-----K 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELGH------KELEGDVAALQEK---YQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEK 2059
Cdd:pfam15921  203 KIYEHDSMSTMHFRSLGSaiskilRELDTEISYLKGRifpVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITG 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKdrllaeetaatiSAIEAMKNAHREEME--RELEKSQRSQISSINSDIEALRRQYLEELQSVQRELE-VLSEQYSQ 2136
Cdd:pfam15921  283 LTEKA------------SSARSQANSIQSQLEiiQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdKIEELEKQ 350
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2137 KCLENAHLAQAlEAERQALRQ----CQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVl 2212
Cdd:pfam15921  351 LVLANSELTEA-RTERDQFSQesgnLDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV- 428
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 2213 lRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAK 2259
Cdd:pfam15921  429 -QRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTK 474
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
667-1092 3.33e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 45.87  E-value: 3.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  667 KTQNVHVEIEQRWHQVETT--PLREEKQVPIAPLHLSLEDRSERLstHELTSLLEKELEQSQKEASDLLEQNRLLQDQLR 744
Cdd:pfam05483  180 ETRQVYMDLNNNIEKMILAfeELRVQAENARLEMHFKLKEDHEKI--QHLEEEYKKEINDKEKQVSLLLIQITEKENKMK 257
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  745 VALGREQSAREGyVLQTEVATSpsgawqrlhrvnqdLQSE-LEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTL 823
Cdd:pfam05483  258 DLTFLLEESRDK-ANQLEEKTK--------------LQDEnLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIA 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  824 QTRLGNAAAELAIKEQALAKLKGELKMeqgkvreQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR----DLET 899
Cdd:pfam05483  323 TKTICQLTEEKEAQMEELNKAKAAHSF-------VVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQkkssELEE 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  900 QQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytllLESCEQEKQALLQNL-KEVED---KASAYEDQL 975
Cdd:pfam05483  396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEE---------LKGKEQELIFLLQAReKEIHDleiQLTAIKTSE 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  976 QGHVQQVEALQKEKLSETCKGSEqvhkleeeleareasirqLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEK 1055
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIE------------------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ 528
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1907081931 1056 LRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELK 1092
Cdd:pfam05483  529 EERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVK 565
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1977-2157 3.90e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.53  E-value: 3.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1977 KDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQrE 2056
Cdd:COG4717     91 AELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPER------------LEELEERLE-E 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2057 LEKLREEKDRLLAEetaatisaiEAMKNAHREEMERELEKSQRSQISSINSDIEALR---RQYLEELQSVQRELEVLSEQ 2133
Cdd:COG4717    158 LRELEEELEELEAE---------LAELQEELEELLEQLSLATEEELQDLAEELEELQqrlAELEEELEEAQEELEELEEE 228
                          170       180
                   ....*....|....*....|....
gi 1907081931 2134 YSQkcLENAHLAQALEAERQALRQ 2157
Cdd:COG4717    229 LEQ--LENELEAAALEERLKEARL 250
PH_GRP1-like cd01252
General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 ...
429-517 4.03e-04

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 and the related proteins ARNO (ARF nucleotide-binding site opener)/cytohesin-2 and cytohesin-1 are ARF exchange factors that contain a pleckstrin homology (PH) domain thought to target these proteins to cell membranes through binding polyphosphoinositides. The PH domains of all three proteins exhibit relatively high affinity for PtdIns(3,4,5)P3. Within the Grp1 family, diglycine (2G) and triglycine (3G) splice variants, differing only in the number of glycine residues in the PH domain, strongly influence the affinity and specificity for phosphoinositides. The 2G variants selectively bind PtdIns(3,4,5)P3 with high affinity,the 3G variants bind PtdIns(3,4,5)P3 with about 30-fold lower affinity and require the polybasic region for plasma membrane targeting. These ARF-GEFs share a common, tripartite structure consisting of an N-terminal coiled-coil domain, a central domain with homology to the yeast protein Sec7, a PH domain, and a C-terminal polybasic region. The Sec7 domain is autoinhibited by conserved elements proximal to the PH domain. GRP1 binds to the DNA binding domain of certain nuclear receptors (TRalpha, TRbeta, AR, ER, but not RXR), and can repress thyroid hormone receptor (TR)-mediated transactivation by decreasing TR-complex formation on thyroid hormone response elements. ARNO promotes sequential activation of Arf6, Cdc42 and Rac1 and insulin secretion. Cytohesin acts as a PI 3-kinase effector mediating biological responses including cell spreading and adhesion, chemotaxis, protein trafficking, and cytoskeletal rearrangements, only some of which appear to depend on their ability to activate ARFs. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269954  Cd Length: 119  Bit Score: 42.30  E-value: 4.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRDSvaeEAADLDGEI---NLStcydVTEYP-VQRNYGFQIH------- 494
Cdd:cd01252      5 REGWLLKL--GGRvksWKRRWFILTDNCLYYFEYT---TDKEPRGIIpleNLS----VREVEdKKKPFCFELYspsngqv 75
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1907081931  495 -----------TKEGEFT---LSAMTSGIRRNWIQTI 517
Cdd:cd01252     76 ikacktdsdgkVVEGNHTvyrISAASEEERDEWIKSI 112
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1776-2067 4.47e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.82  E-value: 4.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1855
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1856 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 1933
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1934 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 2006
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931 2007 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 2067
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984
PH_OSBP_ORP4 cd13284
Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; ...
429-515 4.62e-04

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; Human OSBP is proposed to function is sterol-dependent regulation of ERK dephosphorylation and sphingomyelin synthesis as well as modulation of insulin signaling and hepatic lipogenesis. It contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBPs and Osh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. ORP4 is proposed to function in Vimentin-dependent sterol transport and/or signaling. Human ORP4 has 2 forms, a long (ORP4L) and a short (ORP4S). ORP4L contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. ORP4S is truncated and contains only an OSBP-related domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270101  Cd Length: 99  Bit Score: 41.59  E-value: 4.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTK--QYEDGqWKKHWFVLADQSLRYYRdSVAEEAADLDGEINLSTCYDVTEYPVQrnygFQIHT-KEGEFTLSAM 505
Cdd:cd13284      1 MKGWLLKwtNYIKG-YQRRWFVLSNGLLSYYR-NQAEMAHTCRGTINLAGAEIHTEDSCN----FVISNgGTQTFHLKAS 74
                           90
                   ....*....|
gi 1907081931  506 TSGIRRNWIQ 515
Cdd:cd13284     75 SEVERQRWVT 84
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
834-1090 4.69e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.14  E-value: 4.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  834 LAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQrdrQKEVQR 913
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL---EAELAE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  914 LQECIAELSQQLGTSEQ--AQRL--MEKKLKRNYTLLLESCEqekqallqNLKEVEDKASAYEDQLQGHVQQVEALqkek 989
Cdd:COG4942     88 LEKEIAELRAELEAQKEelAELLraLYRLGRQPPLALLLSPE--------DFLDAVRRLQYLKYLAPARREQAEEL---- 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  990 lsetckgseqvhklEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHS 1069
Cdd:COG4942    156 --------------RADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
                          250       260
                   ....*....|....*....|.
gi 1907081931 1070 HHRVSVQLQSVRTLLREKEEE 1090
Cdd:COG4942    222 AEELEALIARLEAEAAAAAER 242
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1680-2157 5.25e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.14  E-value: 5.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1759
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1760 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1839
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1840 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1919
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1920 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1999
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2000 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2077
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2078 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2152
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506

                   ....*
gi 1907081931 2153 QALRQ 2157
Cdd:COG4717    507 EEYRE 511
PH_Gab2_2 cd13384
Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...
428-517 5.65e-04

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241535  Cd Length: 115  Bit Score: 41.66  E-value: 5.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTK-----QYEDGQWKKHWFVLADQS------LRYYRDsvaEEAADLDGEINLSTCYDV-----TEYPVQRNYG- 490
Cdd:cd13384      4 VYEGWLTKsppekRIWRAKWRRRYFVLRQSEipgqyfLEYYTD---RTCRKLKGSIDLDQCEQVdagltFETKNKLKDQh 80
                           90       100
                   ....*....|....*....|....*...
gi 1907081931  491 -FQIHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd13384     81 iFDIRTPKRTYYLVADTEDEMNKWVNCI 108
46 PHA02562
endonuclease subunit; Provisional
774-999 5.76e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.01  E-value: 5.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  774 LHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKlkgeLKMEQG 853
Cdd:PHA02562   190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNK----LNTAAA 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLS--GQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQKEVQRLQEcIAELSQQLgtseq 930
Cdd:PHA02562   266 KIKSKIEQFQKVIKMYEkgGVCPTCTQQISEGPDRITKIKDKLKELQHSlEKLDTAIDELEEIMDE-FNEQSKKL----- 339
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931  931 aqrlmeKKLKRNYtlllESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE--KLSETCKGSEQ 999
Cdd:PHA02562   340 ------LELKNKI----STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEldKIVKTKSELVK 400
PH_Btk cd01238
Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of ...
429-517 5.78e-04

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of cytoplasmic protein tyrosine kinases that includes BMX, IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a role in the maturation of B cells. Tec proteins general have an N-terminal PH domain, followed by a Tek homology (TH) domain, a SH3 domain, a SH2 domain and a kinase domain. The Btk PH domain binds phosphatidylinositol 3,4,5-trisphosphate and responds to signalling via phosphatidylinositol 3-kinase. The PH domain is also involved in membrane anchoring which is confirmed by the discovery of a mutation of a critical arginine residue in the BTK PH domain. This results in severe human immunodeficiency known as X-linked agammaglobulinemia (XLA) in humans and a related disorder is mice.PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269944 [Multi-domain]  Cd Length: 140  Bit Score: 42.22  E-value: 5.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyedGQ---------WKKHWFVLADQSLRYYrDSVAEEAADLDGEINLSTCYDV----TEYPVQRNYGFQIHT 495
Cdd:cd01238      1 LEGLLVKR---SQgkkrfgpvnYKERWFVLTKSSLSYY-EGDGEKRGKEKGSIDLSKVRCVeevkDEAFFERKYPFQVVY 76
                           90       100
                   ....*....|....*....|..
gi 1907081931  496 KEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd01238     77 DDYTLYVFAPSEEDRDEWIAAL 98
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1975-2237 6.81e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.03  E-value: 6.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAEsmsgLRERIQELEAQMGVMREELGHKELEGDVAALQ-----------EKYQRDFESLKATCERGFAAMEETHQ 2043
Cdd:PRK02224   197 EEKEEKD----LHERLNGLESELAELDEEIERYEEQREQARETrdeadevleehEERREELETLEAEIEDLRETIAETER 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2044 KK--IEDLQRQHQRELEKLREEKDRLLAEE--TAATISAIEAMKN---AHREEMERELEKsQRSQISSINSDIEALRrqy 2116
Cdd:PRK02224   273 EReeLAEEVRDLRERLEELEEERDDLLAEAglDDADAEAVEARREeleDRDEELRDRLEE-CRVAAQAHNEEAESLR--- 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2117 leelqsvqRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAaeitrlrtlltgdgggest 2196
Cdd:PRK02224   349 --------EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFG------------------- 401
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1907081931 2197 GLPLTQGKDAYELEVLLrvkeSEIQYLKQEISSLKDELQTA 2237
Cdd:PRK02224   402 DAPVDLGNAEDFLEELR----EERDELREREAELEATLRTA 438
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
774-1139 7.47e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 44.81  E-value: 7.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  774 LHRVNQDLQSELEAQCRRQELITQQIQT--------------LKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAE------ 833
Cdd:pfam10174  245 LERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevykshskfMKNKIDQLKQELSKKESELLALQTKLETLTNQnsdckq 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  834 --------LAIKEQALAKLKGE-----LKMEQ-----GKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR 895
Cdd:pfam10174  325 hievlkesLTAKEQRAAILQTEvdalrLRLEEkesflNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQqalQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDkasaYEDQL 975
Cdd:pfam10174  405 NLQEQ---LRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELES----LKKEN 477
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  976 QGHVQQVEALQKEKLSETCKGS---EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDvAEL 1052
Cdd:pfam10174  478 KDLKEKVSALQPELTEKESSLIdlkEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVRTN-PEI 556
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEElKHIKETHERVLEK------KDQDLNEALVKMialgSSLEET 1126
Cdd:pfam10174  557 NDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENE-KNDKDKKIAELESltlrqmKEQNKKVANIKH----GQQEMK 631
                          410
                   ....*....|...
gi 1907081931 1127 EIKLQEKEECLRR 1139
Cdd:pfam10174  632 KKGAQLLEEARRR 644
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
718-1137 7.73e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 45.11  E-value: 7.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNrllQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQelITQ 797
Cdd:pfam15921  247 LEALKSESQNKIELLLQQH---QDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQ--LSD 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQAlaklKGELKMEQGKVREQLEEwqhskamLSGQLRASE 877
Cdd:pfam15921  322 LESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQK-------LLADLHKRE 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRstearlLEKTQELR--DLETQQA-----LQR---DRQKEVQRLQ--------ECIAELSQQLGTSEQAQRLMEKkl 939
Cdd:pfam15921  391 KELS------LEKEQNKRlwDRDTGNSitidhLRReldDRNMEVQRLEallkamksECQGQMERQMAAIQGKNESLEK-- 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  940 krnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSE--QVHKLEEELEAREASIRQL 1017
Cdd:pfam15921  463 ---VSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEitKLRSRVDLKLQELQHLKNE 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1018 AQHVQSLHDERDLIKHQFQELMERVatsdgdvaelqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLrEKE--------E 1089
Cdd:pfam15921  540 GDHLRNVQTECEALKLQMAEKDKVI-----------EILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEindrrlelQ 607
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1090 ELKHIKETHE---RVLEKKDQDLNEALVKMIALGSS-LEETEIKLQEKEECL 1137
Cdd:pfam15921  608 EFKILKDKKDakiRELEARVSDLELEKVKLVNAGSErLRAVKDIKQERDQLL 659
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1799-2261 7.90e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 44.76  E-value: 7.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1799 RAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQEAENKHSESMFALQGRYEE---EIRCMV 1875
Cdd:COG4717     44 RAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAE-LQEELEELEEELEELEAELEElreELEKLE 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1876 EQLSHTENTLQAER-SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQlKVRELQAVHQEELRALQEHYIWSLRGALSLY 1954
Cdd:COG4717    123 KLLQLLPLYQELEAlEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQLSLATEEELQDLAEEL 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1955 QpshpdsslapgpsepravpAAKDEAESmsgLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCER- 2033
Cdd:COG4717    202 E-------------------ELQQRLAE---LEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALl 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2034 GFAAMEETHQKKIEDLQRQHQ-----RELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSD 2108
Cdd:COG4717    260 ALLGLGGSLLSLILTIAGVLFlvlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2109 IEALRRqyLEELQSVQRELEVLSEQYSQKCLE---NAHLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRT 2185
Cdd:COG4717    340 LELLDR--IEELQELLREAEELEEELQLEELEqeiAALLAEAGVEDEEELRAALEQAEEY----QELKEELEELEEQLEE 413
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 2186 LLTGDGGGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYAsdkykDIYTELSIAKAK 2261
Cdd:COG4717    414 LLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELA-----ELLQELEELKAE 484
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
683-1098 8.34e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 44.72  E-value: 8.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  683 ETTPLREEKQVPIAPLH-----LSLE-DRSERLSTHELTSL-----LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQ 751
Cdd:pfam15921  371 ESGNLDDQLQKLLADLHkrekeLSLEkEQNKRLWDRDTGNSitidhLRRELDDRNMEVQRLEALLKAMKSECQGQMERQM 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  752 SAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGnaa 831
Cdd:pfam15921  451 AAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD--- 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  832 aelaIKEQALAKLKGE---------------LKM-EQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLlEKTQELR 895
Cdd:pfam15921  528 ----LKLQELQHLKNEgdhlrnvqtecealkLQMaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEINDR 602
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQ--ALQRDRQKEVQRLQECIAEL-----------SQQLGT-----SEQAQRLMEKKLKRNYtllLESCEQEKQAL 957
Cdd:pfam15921  603 RLELQEfkILKDKKDAKIRELEARVSDLelekvklvnagSERLRAvkdikQERDQLLNEVKTSRNE---LNSLSEDYEVL 679
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  958 LQNLKEVEDKASAYEDQLQGHVQ--QVEALQKEKLSETCKGSE------------QVHKLEEELEAREASIRQLAQHVQS 1023
Cdd:pfam15921  680 KRNFRNKSEEMETTTNKLKMQLKsaQSELEQTRNTLKSMEGSDghamkvamgmqkQITAKRGQIDALQSKIQFLEEAMTN 759
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1024 LHDERDLIKHQFQEL---MERVATSDGDVAELQEKLRGKEVDYQ----NLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:pfam15921  760 ANKEKHFLKEEKNKLsqeLSTVATEKNKMAGELEVLRSQERRLKekvaNMEVALDKASLQFAECQDIIQRQEQESVRLKL 839

                   ..
gi 1907081931 1097 TH 1098
Cdd:pfam15921  840 QH 841
PRK09039 PRK09039
peptidoglycan -binding protein;
2037-2189 8.51e-04

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 44.19  E-value: 8.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2037 AMEETHQKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEM--ERELEKSQRSQISSINSDIEAL 2112
Cdd:PRK09039    70 SLERQGNQDLQDSVANLRASLSAAEAERSRLqaLLAELAGAGAAAEGRAGELAQELdsEKQVSARALAQVELLNQQIAAL 149
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 2113 RRQyleeLQSVQRELEVlSEQYSQkclenahlaqalEAERQALRQCQRENQELNAHNQELnNRLAAE-ITRLRTLLTG 2189
Cdd:PRK09039   150 RRQ----LAALEAALDA-SEKRDR------------ESQAKIADLGRRLNVALAQRVQEL-NRYRSEfFGRLREILGD 209
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
830-1105 8.98e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.57  E-value: 8.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  830 AAAELAIKEQALAK---LKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRD 906
Cdd:TIGR00618  182 ALMEFAKKKSLHGKaelLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQL 261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  907 RQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQ 986
Cdd:TIGR00618  262 LKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  987 KEKLSETCKGSEQVH------------KLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSD-------- 1046
Cdd:TIGR00618  342 EQRRLLQTLHSQEIHirdahevatsirEISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDtrtsafrd 421
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 1047 --GDVAEL-------QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKK 1105
Cdd:TIGR00618  422 lqGQLAHAkkqqelqQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKK 489
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1980-2180 9.45e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.52  E-value: 9.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMR--------------EELGHKELEGDVAALQEKYQR------DFESLK---ATCERGFA 2036
Cdd:COG4913    623 EEELAEAEERLEALEAELDALQerrealqrlaeyswDEIDVASAEREIAELEAELERldassdDLAALEeqlEELEAELE 702
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2037 AMEEtHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAI------------EAMKNAHREEMERELEKSQ---RSQ 2101
Cdd:COG4913    703 ELEE-ELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARlelralleerfaAALGDAVERELRENLEERIdalRAR 781
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2102 ISSINSDIEALRRQYLEE----LQSVQRELEVLSEqYSQKC--LENAHLAQALEAERQALRQCQRENQElnahnqELNNR 2175
Cdd:COG4913    782 LNRAEEELERAMRAFNREwpaeTADLDADLESLPE-YLALLdrLEEDGLPEYEERFKELLNENSIEFVA------DLLSK 854

                   ....*
gi 1907081931 2176 LAAEI 2180
Cdd:COG4913    855 LRRAI 859
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1639-2274 9.56e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 44.67  E-value: 9.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1639 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1718
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1719 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1797
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1798 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1877
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1878 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 1957
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1958 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2035
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2036 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2113
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2114 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2190
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2191 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2269
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726

                   ....*
gi 1907081931 2270 KEQLK 2274
Cdd:PRK03918   727 REKVK 731
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
716-885 9.87e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.98  E-value: 9.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  716 SLLEKELEQSQKEAS----DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRR 791
Cdd:COG4942     79 AALEAELAELEKEIAelraELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRAD 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QELITQQIQTLKhsygEAKDAIRHHEAEIQTLQTRLgnaAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG 871
Cdd:COG4942    159 LAELAALRAELE----AERAELEALLAELEEERAAL---EALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
                          170
                   ....*....|....
gi 1907081931  872 QLRASEQKLRSTEA 885
Cdd:COG4942    232 LEAEAAAAAERTPA 245
COG5022 COG5022
Myosin heavy chain [General function prediction only];
778-1244 1.03e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 44.68  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  778 NQDLQSELEAQCRRQELITQQI----QTLKHSYGEAkdAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKmEQG 853
Cdd:COG5022    819 IIKLQKTIKREKKLRETEEVEFslkaEVLIQKFGRS--LKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVK-SIS 895
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseq 930
Cdd:COG5022    896 SLKLVNLELESEIIELKKSLSSDLIenlEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS----- 970
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  931 aqrlmekklkrnytlllesceQEKQALLqnlkeveDKASAYEDQLQGHVQQVEALQKEkLSETCKGSEQVHKLEEELEAR 1010
Cdd:COG5022    971 ---------------------EEYEDLL-------KKSTILVREGNKANSELKNFKKE-LAELSKQYGALQESTKQLKEL 1021
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1011 EASIRQLAQHVQSLHDERDlIKHQFQELMERVATSDGDVAELQEKLrgKEVDYQN-LEHSHHRVSVQLQSVRTLlrEKEE 1089
Cdd:COG5022   1022 PVEVAELQSASKIISSEST-ELSILKPLQKLKGLLLLENNQLQARY--KALKLRReNSLLDDKQLYQLESTENL--LKTI 1096
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1090 ELKHIKETHERVLEKKdqdlnEALVKMIALGSSLEEteikLQEKEECLRRFVSDSPkDAKEPLSTTEPTEEGSGILPLGS 1169
Cdd:COG5022   1097 NVKDLEVTNRNLVKPA-----NVLQFIVAQMIKLNL----LQEISKFLSQLVNTLE-PVFQKLSVLQLELDGLFWEANLE 1166
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1170 VTRVFPGFpHSQPEDEDPSAGLGEEGSSGS-------------LSREENTILPKSADMPE--REG-HLQSTSKSDPGapI 1233
Cdd:COG5022   1167 ALPSPPPF-AALSEKRLYQSALYDEKSKLSssevndlkneliaLFSKIFSGWPRGDKLKKliSEGwVPTEYSTSLKG--F 1243
                          490
                   ....*....|.
gi 1907081931 1234 KRPRIRFSTIQ 1244
Cdd:COG5022   1244 NNLNKKFDTPA 1254
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1980-2165 1.26e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 44.26  E-value: 1.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQekyQRDFESLKATCERgfaAMEETHQKkiedlQRQHQRELEK 2059
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEAR---REELEDRDEELRD---RLEECRVA-----AQAHNEEAES 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKDRLlaEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY---LEELQSVQRELEVLSEQysq 2136
Cdd:PRK02224   347 LREDADDL--EERAEELREEAAELESELEEAREAVED-RREEIEELEEEIEELRERFgdaPVDLGNAEDFLEELREE--- 420
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081931 2137 kcLENAHLAQA-LEAERQALRQCQRENQEL 2165
Cdd:PRK02224   421 --RDELREREAeLEATLRTARERVEEAEAL 448
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1790-2177 1.39e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 44.01  E-value: 1.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1790 EAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKEN------TELKAkVSQMDHQQRCLQEAENKHSESMFAL 1863
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqmlAEEKA-ISARYAEERDRAEAEAREKETRALS 640
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1864 QGRYEEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKD-------RQAMEQhHVQQMKM----LEDrfqlkvrELQAVH 1932
Cdd:pfam01576  641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNvhelersKRALEQ-QVEEMKTqleeLED-------ELQATE 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELR------ALQEHYIWSLRgalslyqpshpdsslapgpsepravpaAKDEA--ESMSGLRERIQELEAQMGVMREEl 2004
Cdd:pfam01576  713 DAKLRlevnmqALKAQFERDLQ---------------------------ARDEQgeEKRRQLVKQVRELEAELEDERKQ- 764
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2005 ghkelEGDVAALQEKYQRDFESLKATCERGFAAMEET--HQKKIEDLQRQHQRELEKLREEKDRLLA--EETAATISAIE 2080
Cdd:pfam01576  765 -----RAQAVAAKKKLELDLKELEAQIDAANKGREEAvkQLKKLQAQMKDLQRELEEARASRDEILAqsKESEKKLKNLE 839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2081 A-----------------MKNAHREEMERELEK--SQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLEN 2141
Cdd:pfam01576  840 AellqlqedlaaserarrQAQQERDELADEIASgaSGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQV 919
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1907081931 2142 AHLAQALEAERQALRQCQRENQELNAHNQELNNRLA 2177
Cdd:pfam01576  920 EQLTTELAAERSTSQKSESARQQLERQNKELKAKLQ 955
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2096-2303 1.57e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.28  E-value: 1.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2096 KSQRSQISSINSDIEALRrqylEELQSVQRELEVLSEQYSQKcleNAHLAQALEAERQALRQCQRENQELNAHNQELNNR 2175
Cdd:COG3883     19 QAKQKELSELQAELEAAQ----AELDALQAELEELNEEYNEL---QAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2176 LAA------EITRLRTLLTGDGGGE--------STGLPLTQG--KDAYELEVLLRVKESEIQYLKQEISSLKDELQTALR 2239
Cdd:COG3883     92 ARAlyrsggSVSYLDVLLGSESFSDfldrlsalSKIADADADllEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2240 DKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPEGTTVSGYDIMKSKSNPD 2303
Cdd:COG3883    172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAA 235
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1772-2165 1.63e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 43.77  E-value: 1.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1772 RIRLEYEKELRFYKKACQEAKGASGQKRAQAvgALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQE 1851
Cdd:COG1196    380 ELEELAEELLEALRAAAELAAQLEELEEAEE--ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE-LEE 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1852 AENKHSESMFALQGRYEEEIrcmvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAV 1931
Cdd:COG1196    457 EEEALLELLAELLEEAALLE----AALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV 532
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1932 HQEELRALQEhyiwsLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElGHKELEG 2011
Cdd:COG1196    533 EAAYEAALEA-----ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA-AVDLVAS 606
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2012 DVAALQEKYQRDFESL------KATCERGFAAMEETHQKKIE---DLQRQHQRELEKLREEKDRLLAEETAATISAIEAM 2082
Cdd:COG1196    607 DLREADARYYVLGDTLlgrtlvAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAE 686
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2083 KNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAE----------R 2152
Cdd:COG1196    687 RLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEElpeppdleelE 766
                          410
                   ....*....|...
gi 1907081931 2153 QALRQCQRENQEL 2165
Cdd:COG1196    767 RELERLEREIEAL 779
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2050-2322 1.75e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 43.90  E-value: 1.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRLLAEEtAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALrrqyLEELQSVQRELEV 2129
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKREL-SSLQSELRRIEN-RLDELSQELSDASR-KIGEIEKEIEQL----EQEEEKLKERLEE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2130 LSEQYSQkclenahLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYEL 2209
Cdd:TIGR02169  742 LEEDLSS-------LEQEIENVKSELKELEARIEEL----EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKadcdISRLKEQLKAATEALgekspegtt 2289
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAAL--------- 877
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1907081931 2290 vsgYDIMKSKSNpdfLKKDRSCVTRQLRNIRSK 2322
Cdd:TIGR02169  878 ---RDLESRLGD---LKKERDELEAQLRELERK 904
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
718-981 1.75e-03

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 43.64  E-value: 1.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRvalgREQSAREGYVLQTEVATSpsgAWQRL-------HRVNQDLQSELEAQCR 790
Cdd:PRK10246   535 LEKEVKKLGEEGAALRGQLDALTKQLQ----RDESEAQSLRQEEQALTQ---QWQAVcaslnitLQPQDDIQPWLDAQEE 607
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  791 RQELITQ--QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIK------EQA-LAKLKGELKMEQGKVREQ--L 859
Cdd:PRK10246   608 HERQLRLlsQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTlpqedeEASwLATRQQEAQSWQQRQNELtaL 687
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  860 EEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD----LETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQR- 933
Cdd:PRK10246   688 QNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHEqclsLHSQlQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQq 767
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  934 ------LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 981
Cdd:PRK10246   768 aflaalLDEETLTQ--------LEQLKQNLENQRQQAQTLVTQTAQALAQHQQH 813
PH_RhoGap25-like cd13263
Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; ...
429-519 1.77e-03

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; RhoGAP25 (also called ArhGap25) like other RhoGaps are involved in cell polarity, cell morphology and cytoskeletal organization. They act as GTPase activators for the Rac-type GTPases by converting them to an inactive GDP-bound state and control actin remodeling by inactivating Rac downstream of Rho leading to suppress leading edge protrusion and promotes cell retraction to achieve cellular polarity and are able to suppress RAC1 and CDC42 activity in vitro. Overexpression of these proteins induces cell rounding with partial or complete disruption of actin stress fibers and formation of membrane ruffles, lamellipodia, and filopodia. This hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25. Members here contain an N-terminal PH domain followed by a RhoGAP domain and either a BAR or TATA Binding Protein (TBP) Associated Factor 4 (TAF4) domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270083  Cd Length: 114  Bit Score: 40.06  E-value: 1.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCyDVTEYPVQRN----YGFQIHTKEGE---- 499
Cdd:cd13263      5 KSGWLKKQGSIvKNWQQRWFVLRGDQLYYYKD---EDDTKPQGTIPLPGN-KVKEVPFNPEepgkFLFEIIPGGGGdrmt 80
                           90       100
                   ....*....|....*....|....*
gi 1907081931  500 -----FTLSAMTSGIRRNWIQTIMK 519
Cdd:cd13263     81 snhdsYLLMANSQAEMEEWVKVIRR 105
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
772-926 1.84e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.80  E-value: 1.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQSELEA-QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtRLGNAAAELAIKEQALAKLKGELKM 850
Cdd:TIGR00618  725 NASSSLGSDLAAREDAlNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGA-ELSHLAAEIQFFNRLREEDTHLLKT 803
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  851 EQGKVREQLEewqHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLG 926
Cdd:TIGR00618  804 LEAEIGQEIP---SDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2043-2186 1.98e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 43.66  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2043 QKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISA--------------IEAMKNAH-REEMER--ELEkSQRSQIS 2103
Cdd:pfam10174  400 QKKIENLQEQLRDKDKQLAGLKERVksLQTDSSNTDTAlttleealsekeriIERLKEQReREDRERleELE-SLKKENK 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2104 SINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQR-ENQELNAHNQELNNRLAAEIT- 2181
Cdd:pfam10174  479 DLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKlENQLKKAHNAEEAVRTNPEINd 558

                   ....*
gi 1907081931 2182 RLRTL 2186
Cdd:pfam10174  559 RIRLL 563
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1980-2149 2.14e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 43.47  E-value: 2.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATCErgfAAMEETHQKKIEDLQRQHQRELEK 2059
Cdd:COG3206    211 SEEAKLLLQQLSELESQLAEARAEL--AEAEARLAALRAQLGSGPDALPELLQ---SPVIQQLRAQLAELEAELAELSAR 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKDRL--LAEETAATISAIEAMKNAHREEMERELEkSQRSQISSINSDIEALRRQYLE------ELQSVQRELEVLS 2131
Cdd:COG3206    286 YTPNHPDViaLRAQIAALRAQLQQEAQRILASLEAELE-ALQAREASLQAQLAQLEARLAElpeleaELRRLEREVEVAR 364
                          170       180
                   ....*....|....*....|
gi 1907081931 2132 EQYSQ--KCLENAHLAQALE 2149
Cdd:COG3206    365 ELYESllQRLEEARLAEALT 384
mukB PRK04863
chromosome partition protein MukB;
1789-2186 2.16e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 43.41  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1789 QEAKGASGQ-KRAQA-VGALKEEYEE------LLHKQKSEYQKVITLIEKENTELKAKVSqmDHQQRcLQEAENKhsesm 1860
Cdd:PRK04863   341 QTALRQQEKiERYQAdLEELEERLEEqnevveEADEQQEENEARAEAAEEEVDELKSQLA--DYQQA-LDVQQTR----- 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1861 fALQGRyeeeircmveqlshteNTLQA-ERSRVLSQLDA----SVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEE 1935
Cdd:PRK04863   413 -AIQYQ----------------QAVQAlERAKQLCGLPDltadNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQF 475
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1936 LRALQehyiwSLRgalslyqpshpdsSLAPGPSEPRAVPAAKD---EAESMSGLRERIQELEAQmgvmreelgHKELEGD 2012
Cdd:PRK04863   476 EQAYQ-----LVR-------------KIAGEVSRSEAWDVAREllrRLREQRHLAEQLQQLRMR---------LSELEQR 528
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAAlqekyQRDFESLKATCERGFAAMEEThQKKIEDLQRQHQRELEKLREEKDRLlaeetaatisaieamkNAHREEMER 2092
Cdd:PRK04863   529 LRQ-----QQRAERLLAEFCKRLGKNLDD-EDELEQLQEELEARLESLSESVSEA----------------RERRMALRQ 586
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKsqrsqissINSDIEALRRQYLEELQSvQRELEVLSEQY------SQKCLEnaHLAQALEAERQALR---QCQRENQ 2163
Cdd:PRK04863   587 QLEQ--------LQARIQRLAARAPAWLAA-QDALARLREQSgeefedSQDVTE--YMQQLLERERELTVerdELAARKQ 655
                          410       420
                   ....*....|....*....|...
gi 1907081931 2164 ELNAHNQELNNRLAAEITRLRTL 2186
Cdd:PRK04863   656 ALDEEIERLSQPGGSEDPRLNAL 678
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1978-2242 2.29e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.51  E-value: 2.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1978 DEAESMSGLRERIQELEAQMGVMREELG-----HKELEGDVAALQ------EKYQRDFESLKATcERGFAAME-ETHQKK 2045
Cdd:TIGR02168  162 EEAAGISKYKERRKETERKLERTRENLDrlediLNELERQLKSLErqaekaERYKELKAELREL-ELALLVLRlEELREE 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2046 IEDLQRQhQRELEKLREEKDRLLAEETAAtisaIEAMKNAHREeMERELEKSQRS--QISSINSDIEALRRQYLEELQSV 2123
Cdd:TIGR02168  241 LEELQEE-LKEAEEELEELTAELQELEEK----LEELRLEVSE-LEEEIEELQKElyALANEISRLEQQKQILRERLANL 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2124 QRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqg 2203
Cdd:TIGR02168  315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE------------------- 375
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907081931 2204 kdayELEVLLRVKESEIQYLKQEISSLKDELQTALRDKK 2242
Cdd:TIGR02168  376 ----ELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1985-2273 2.81e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 43.02  E-value: 2.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1985 GLRERIQELEAQMGVMREElgHKELEGDVAALQE----------KYQRDFESLKATceRGFAAMEETHQKKIEDLQRQHQ 2054
Cdd:COG3096    372 EAAEQLAEAEARLEAAEEE--VDSLKSQLADYQQaldvqqtraiQYQQAVQALEKA--RALCGLPDLTPENAEDYLAAFR 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEkdrLLAEETAATISaiEAMKNAHREEME------------------RELEKSQRSQiSSINSDIEALRRQY 2116
Cdd:COG3096    448 AKEQQATEE---VLELEQKLSVA--DAARRQFEKAYElvckiageversqawqtaRELLRRYRSQ-QALAQRLQQLRAQL 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2117 --LEELQSVQRELEVLSEQYSQ---KCLENA----HLAQALEAERQAL-----------RQCQRENQELNAHNQELNNRL 2176
Cdd:COG3096    522 aeLEQRLRQQQNAERLLEEFCQrigQQLDAAeeleELLAELEAQLEELeeqaaeaveqrSELRQQLEQLRARIKELAARA 601
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2177 AAEIT---RLRTLltgdggGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRdkkyasdkykdiyt 2253
Cdd:COG3096    602 PAWLAaqdALERL------REQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQIE-------------- 661
                          330       340
                   ....*....|....*....|
gi 1907081931 2254 ELSIAKAKADCDISRLKEQL 2273
Cdd:COG3096    662 RLSQPGGAEDPRLLALAERL 681
PRK11281 PRK11281
mechanosensitive channel MscK;
779-987 2.87e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 42.98  E-value: 2.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQ---IQTLKHSYgEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKG--------- 846
Cdd:PRK11281    39 ADVQAQLDALNKQKLLEAEDklvQQDLEQTL-ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDdndeetret 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  847 -------ELKMEQGKVREQLEEWQHSKAMLSGQL--------RASEQkLRSTEARLLEKTQELRDLETQQALQRDRQKev 911
Cdd:PRK11281   118 lstlslrQLESRLAQTLDQLQNAQNDLAEYNSQLvslqtqpeRAQAA-LYANSQRLQQIRNLLKGGKVGGKALRPSQR-- 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  912 QRLQECIAELSQQ-------LGTSEQAQRLMEKklKRNYTLLLESCEQEKQALLQNLkeVEDKasaYEDQLQGHVQQVEA 984
Cdd:PRK11281   195 VLLQAEQALLNAQndlqrksLEGNTQLQDLLQK--QRDYLTARIQRLEHQLQLLQEA--INSK---RLTLSEKTVQEAQS 267

                   ...
gi 1907081931  985 LQK 987
Cdd:PRK11281   268 QDE 270
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
1981-2157 3.01e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 40.71  E-value: 3.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1981 ESMSGLRERIQELEAQMGVMREELgHKELEGDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:pfam01442    4 DSLDELSTYAEELQEQLGPVAQEL-VDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 ReekdrllaEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:pfam01442   83 R--------KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLAPYAEELRQKLAERLEELKESLAPYAEEVQAQLSQ 154
                          170
                   ....*....|....*...
gi 1907081931 2141 NA-HLAQALEAERQALRQ 2157
Cdd:pfam01442  155 RLqELREKLEPQAEDLRE 172
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
872-1042 3.16e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.98  E-value: 3.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  872 QLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKK---LKRNYTL--- 945
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELerlDASSDDLaal 690
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  946 --LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQS 1023
Cdd:COG4913    691 eeQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELREN 770
                          170
                   ....*....|....*....
gi 1907081931 1024 LHDERDLIKHQFQELMERV 1042
Cdd:COG4913    771 LEERIDALRARLNRAEEEL 789
PTZ00121 PTZ00121
MAEBL; Provisional
832-1109 3.56e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.82  E-value: 3.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  832 AELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV 911
Cdd:PTZ00121  1542 AEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM---ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  912 QRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLS 991
Cdd:PTZ00121  1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  992 ETCKGSEQVHKLEEELEAREASIRQlaqhvqslHDERDLIKhqfQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHH 1071
Cdd:PTZ00121  1699 EEAKKAEELKKKEAEEKKKAEELKK--------AEEENKIK---AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907081931 1072 RVSVQLQSVRTLLreKEEELKHIKETHERVLEKKDQDL 1109
Cdd:PTZ00121  1768 KKAEEIRKEKEAV--IEEELDEEDEKRRMEVDKKIKDI 1803
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1989-2133 3.78e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQ---EKYQRDFESLKATCERgFAAMEETHQKKIEDLQRQH-----QRELEKL 2060
Cdd:COG1579     18 ELDRLEHRLKELPAEL--AELEDELAALEarlEAAKTELEDLEKEIKR-LELEIEEVEARIKKYEEQLgnvrnNKEYEAL 94
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2061 REEKDRLLAEETAATISAIEAMKNahREEMERELEKSQrSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMER--IEELEEELAELE-AELAELEAELEEKKAELDEELAELEAELEELEAE 164
PH_ACAP cd13250
ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP ...
429-517 3.90e-03

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP (also called centaurin beta) functions both as a Rab35 effector and as an Arf6-GTPase-activating protein (GAP) by which it controls actin remodeling and membrane trafficking. ACAP contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain, a phospholipid-binding domain, a PH domain, a GAP domain, and four ankyrin repeats. The AZAPs constitute a family of Arf GAPs that are characterized by an NH2-terminal pleckstrin homology (PH) domain and a central Arf GAP domain followed by two or more ankyrin repeats. On the basis of sequence and domain organization, the AZAP family is further subdivided into four subfamilies: 1) the ACAPs contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain (a phospholipid-binding domain that is thought to sense membrane curvature), a single PH domain followed by the GAP domain, and four ankyrin repeats; 2) the ASAPs also contain an NH2-terminal BAR domain, the tandem PH domain/GAP domain, three ankyrin repeats, two proline-rich regions, and a COOH-terminal Src homology 3 domain; 3) the AGAPs contain an NH2-terminal GTPase-like domain (GLD), a split PH domain, and the GAP domain followed by four ankyrin repeats; and 4) the ARAPs contain both an Arf GAP domain and a Rho GAP domain, as well as an NH2-terminal sterile-a motif (SAM), a proline-rich region, a GTPase-binding domain, and five PH domains. PMID 18003747 and 19055940 Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270070  Cd Length: 98  Bit Score: 38.74  E-value: 3.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLdgEINLSTCYDVTEYPVQRNYGFQIHTKEGEFTLSAMT 506
Cdd:cd13250      1 KEGYLFKRSSNafKTWKRRWFSLQNGQLYYQKRDKKDEPTVM--VEDLRLCTVKPTEDSDRRFCFEVISPTKSYMLQAES 78
                           90
                   ....*....|.
gi 1907081931  507 SGIRRNWIQTI 517
Cdd:cd13250     79 EEDRQAWIQAI 89
PH_DOCK-D cd13267
Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...
428-519 3.98e-03

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also called Zizimin subfamily) consists of Dock9/Zizimin1, Dock10/Zizimin3, and Dock11/Zizimin2. DOCK-D has a N-terminal DUF3398 domain, a PH-like domain, a Dock Homology Region 1, DHR1 (also called CZH1), a C2 domain, and a C-terminal DHR2 domain (also called CZH2). Zizimin1 is enriched in the brain, lung, and kidney; zizimin2 is found in B and T lymphocytes, and zizimin3 is enriched in brain, lung, spleen and thymus. Zizimin1 functions in autoinhibition and membrane targeting. Zizimin2 is an immune-related and age-regulated guanine nucleotide exchange factor, which facilitates filopodial formation through activation of Cdc42, which results in activation of cell migration. No function has been determined for Zizimin3 to date. The N-terminal half of zizimin1 binds to the GEF domain through three distinct areas, including CZH1, to inhibit the interaction with Cdc42. In addition its PH domain binds phosphoinositides and mediates zizimin1 membrane targeting. DOCK is a family of proteins involved in intracellular signalling networks. They act as guanine nucleotide exchange factors for small G proteins of the Rho family, such as Rac and Cdc42. There are 4 subfamilies of DOCK family proteins based on their sequence homology: A-D. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270087  Cd Length: 126  Bit Score: 39.62  E-value: 3.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQYEDGQ----------WKKHWFVL---ADQS--LRYYRDsvaEEAADLDGEINLSTCYDVTEYPVQRNYGFQ 492
Cdd:cd13267      7 TKEGYLYKGPENSSdsfislamksFKRRFFHLkqlVDGSyiLEFYKD---EKKKEAKGTIFLDSCTGVVQNSKRRKFCFE 83
                           90       100
                   ....*....|....*....|....*...
gi 1907081931  493 IHTKEGE-FTLSAMTSGIRRNWIQTIMK 519
Cdd:cd13267     84 LRMQDKKsYVLAAESEAEMDEWISKLNK 111
PRK09039 PRK09039
peptidoglycan -binding protein;
819-961 4.00e-03

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 41.88  E-value: 4.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  819 EIQTLQTRLGNAAAELAIKEQALA---KLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELr 895
Cdd:PRK09039    47 EISGKDSALDRLNSQIAELADLLSlerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL- 125
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  896 DLETQQALQRDRQkeVQRLQECIAELSQQLGTSEQAQRLMEKKlkrnytlllescEQEKQALLQNL 961
Cdd:PRK09039   126 DSEKQVSARALAQ--VELLNQQIAALRRQLAALEAALDASEKR------------DRESQAKIADL 177
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1797-2237 4.05e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 42.47  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1797 QKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV-----SQMDHQQR------CLQEAENKHSESMFALQG 1865
Cdd:pfam01576  352 QKHTQALEELTEQLEQA-KRNKANLEKAKQALESENAELQAELrtlqqAKQDSEHKrkklegQLQELQARLSESERQRAE 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1866 RYEEEIRCMVEQLSHTENTLQAER-----SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRfqlkVRELQavhqEELRALQ 1940
Cdd:pfam01576  431 LAEKLSKLQSELESVSSLLNEAEGkniklSKDVSSLESQLQDTQELLQEETRQKLNLSTR----LRQLE----DERNSLQ 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1941 EHYiwslrgalslyqpshpdsslapgpsepravpaaKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQE-- 2018
Cdd:pfam01576  503 EQL---------------------------------EEEEEAKRNVERQLSTLQAQLSDMKKKL--EEDAGTLEALEEgk 547
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2019 -KYQRDFESLKATCERGFAAMEETH------QKKIEDL------QRQHQRELEKLREEKDRLLAEETAATISAIEAMKNA 2085
Cdd:pfam01576  548 kRLQRELEALTQQLEEKAAAYDKLEktknrlQQELDDLlvdldhQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRA 627
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2086 HREEMERELEksqrsqissinsdiealrrqyleelqsvqreleVLSeqysqkclenahLAQALEAERQALRQCQRENQEL 2165
Cdd:pfam01576  628 EAEAREKETR---------------------------------ALS------------LARALEEALEAKEELERTNKQL 662
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 2166 NAHNQELNNRLAAeitrlrtlltgdgggestglpltQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTA 2237
Cdd:pfam01576  663 RAEMEDLVSSKDD-----------------------VGKNVHELERSKRALEQQVEEMKTQLEELEDELQAT 711
PH_TAAP2-like cd13255
Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 ...
429-517 4.49e-03

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 (also called PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4, 5)P3, function as negative regulators of insulin and PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin complex). TAPP2 contains two sequential PH domains in which the C-terminal PH domain specifically binds PtdIns(3,4)P2 with high affinity. The N-terminal PH domain does not interact with any phosphoinositide tested. They also contain a C-terminal PDZ-binding motif that interacts with several PDZ-binding proteins, including PTPN13 (known previously as PTPL1 or FAP-1) as well as the scaffolding proteins MUPP1 (multiple PDZ-domain-containing protein 1), syntrophin and utrophin. The members here are most sequence similar to TAPP2 proteins, but may not be actual TAPP2 proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270075  Cd Length: 110  Bit Score: 38.93  E-value: 4.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDGQ-WKKHWFVLADQSLRYYRDSVAEEAADLdgeINLSTCYDVTEYPVQRN-YGFQIHTKEGEFTLSAMT 506
Cdd:cd13255      8 KAGYLEKKGERRKtWKKRWFVLRPTKLAYYKNDKEYRLLRL---IDLTDIHTCTEVQLKKHdNTFGIVTPARTFYVQADS 84
                           90
                   ....*....|.
gi 1907081931  507 SGIRRNWIQTI 517
Cdd:cd13255     85 KAEMESWISAI 95
PH1_PH_fungal cd13298
Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal ...
428-517 5.06e-03

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal proteins are unknown, but they all contain 2 PH domains. This cd represents the first PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270110  Cd Length: 106  Bit Score: 38.76  E-value: 5.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCYDVTEYPV-QRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13298      7 LKSGYLLKRSRKtKNWKKRWVVLRPCQLSYYKD---EKEYKLRRVINLSELLAVAPLKDkKRKNVFGIYTPSKNLHFRAT 83
                           90
                   ....*....|..
gi 1907081931  506 TSGIRRNWIQTI 517
Cdd:cd13298     84 SEKDANEWVEAL 95
PH_KIFIA_KIFIB cd01233
KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA ...
429-517 5.09e-03

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA (Caenorhabditis elegans homolog unc-104) and KIFIB transport synaptic vesicle precursors that contain synaptic vesicle proteins, such as synaptophysin, synaptotagmin and the small GTPase RAB3A, but they do not transport organelles that contain plasma membrane proteins. They have a N-terminal motor domain, followed by a coiled-coil domain, and a C-terminal PH domain. KIF1A adopts a monomeric form in vitro, but acts as a processive dimer in vivo. KIF1B has alternatively spliced isoforms distinguished by the presence or absence of insertion sequences in the conserved amino-terminal region of the protein; this results in their different motor activities. KIF1A and KIF1B bind to RAB3 proteins through the adaptor protein mitogen-activated protein kinase (MAPK) -activating death domain (MADD; also calledDENN), which was first identified as a RAB3 guanine nucleotide exchange factor (GEF). PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269939  Cd Length: 103  Bit Score: 38.73  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyEDG--QWKKHWFVLADQSLRYYRDSvaeeaADLD--GEINLSTC---YDV-TEYPVQRNYGFQIHTKEGEF 500
Cdd:cd01233      8 KRGYLLFL-EDAtdGWVRRWVVLRRPYLHIYSSE-----KDGDerGVINLSTArveYSPdQEALLGRPNVFAVYTPTNSY 81
                           90
                   ....*....|....*..
gi 1907081931  501 TLSAMTSGIRRNWIQTI 517
Cdd:cd01233     82 LLQARSEKEMQDWLYAI 98
DUF4618 pfam15397
Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins ...
1976-2121 5.19e-03

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins in this family are typically between 238 and 363 amino acids in length. There are two conserved sequence motifs: EYP and KCTPD.


Pssm-ID: 464704 [Multi-domain]  Cd Length: 258  Bit Score: 41.09  E-value: 5.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1976 AKDEAESMSGLRERIQELEAQMGVMREELG----HKELEGDVAALQ-EKYQRDFESLKatcergfaameETHQKKIEDLQ 2050
Cdd:pfam15397   76 EEKEESKLNKLEQQLEQLNAKIQKTQEELNflstYKDKEYPVKAVQiANLVRQLQQLK-----------DSQQDELDELE 144
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 2051 RQHQRELEKL----REEKDRLL---AEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQ 2121
Cdd:pfam15397  145 EMRRMVLESLsrkiQKKKEKILsslAEKTLSPYQESLLQKTRDNQVMLKEIEQ-FREFIDELEEEIPKLKAE-VQQLQ 220
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
817-1135 5.30e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 41.81  E-value: 5.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  817 EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD 896
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  897 LETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLEScEQEKQALLQNLKEVEDKASAYE 972
Cdd:COG4372     92 AQAELAQAQEEleslQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER-EEELKELEEQLESLQEELAALE 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  973 DQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1052
Cdd:COG4372    171 QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEE 250
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQE 1132
Cdd:COG4372    251 LLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLEL 330

                   ...
gi 1907081931 1133 KEE 1135
Cdd:COG4372    331 ALA 333
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
718-992 5.33e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 42.25  E-value: 5.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVA--TSPSGAWQRlhrvnqdlQSELEAQCRRQELI 795
Cdd:COG3096    439 AEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYELVCKIAgeVERSQAWQT--------ARELLRRYRSQQAL 510
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  796 TQQIQTLKHSYGEA-KDAIRHHEAEiqtlqtrlgnaaaelaikeqalaKLKGELKMEQGKVREQLEEWQHSKAMLSGQLR 874
Cdd:COG3096    511 AQRLQQLRAQLAELeQRLRQQQNAE-----------------------RLLEEFCQRIGQQLDAAEELEELLAELEAQLE 567
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  875 ASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS--------EQAQRLMEKklKRNYTLL 946
Cdd:COG3096    568 ELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALERLREQSGEAladsqevtAAMQQLLER--EREATVE 645
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  947 LESCEQEKQALLQNLKEVEDKASAYEDQLqghVQQVEALQKEKLSE 992
Cdd:COG3096    646 RDELAARKQALESQIERLSQPGGAEDPRL---LALAERLGGVLLSE 688
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
718-883 5.51e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.21  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGY---VLQTEVAtspsgAWQ-RLHRVNQD------LQSELEA 787
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIdvaSAEREIA-----ELEaELERLDASsddlaaLEEQLEE 696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  788 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKA 867
Cdd:COG4913    697 LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID 776
                          170
                   ....*....|....*.
gi 1907081931  868 MLSGQLRASEQKLRST 883
Cdd:COG4913    777 ALRARLNRAEEELERA 792
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1697-2282 5.89e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 41.88  E-value: 5.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1697 QDELAQQLREKASILEEISAALPVLPPTEPLGgcqrLLRMSQHLSYESCLEGLGQYSSLLVQDAIIQAQVCYAACRIRLE 1776
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLA----LMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEK 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1777 YEKELRFYKKACQEAKGASGQKRAQAVGALKEEYE-ELLHKQKSEYQKVITLIEKENTELK------------AKVSQMD 1843
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLlKQLRARIEELRAQEAVLEETQERINrarkaaplaahiKAVTQIE 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1844 HQ-QRCLQEAENKHSESMFALQGRYE-EEIRCMVEQLSHTENTLQAERSRVLSQLDA-----SVKDRQAMEQHHV----Q 1912
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAhVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVatsirEISCQQHTLTQHIhtlqQ 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1913 QMKMLEDRFQLKVRELQAVHQE---------ELRALQEHYIwSLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESM 1983
Cdd:TIGR00618  387 QKTTLTQKLQSLCKELDILQREqatidtrtsAFRDLQGQLA-HAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESA 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1984 SGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFE--SLKATCERGFAAMEETHQKK---IEDLQRQHQRELE 2058
Cdd:TIGR00618  466 QSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCgsCIHPNPARQDIDNPGPLTRRmqrGEQTYAQLETSEE 545
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2059 KLREEKDRLLaeETAATISAieamknahREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKC 2138
Cdd:TIGR00618  546 DVYHQLTSER--KQRASLKE--------QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSEAEDMLA 611
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2139 LENAHL------AQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTgdgggestglplTQGKDAYELEVL 2212
Cdd:TIGR00618  612 CEQHALlrklqpEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSI------------RVLPKELLASRQ 679
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2213 LrvKESEIQYLKQEISSLKDEL---QTALRDKKYASDKYKDIYTELSIAKAKAdcdISRLKEQLKAATEALGE 2282
Cdd:TIGR00618  680 L--ALQKMQSEKEQLTYWKEMLaqcQTLLRELETHIEEYDREFNEIENASSSL---GSDLAAREDALNQSLKE 747
Yuri_gagarin pfam15934
Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.
676-904 7.18e-03

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.


Pssm-ID: 318204 [Multi-domain]  Cd Length: 234  Bit Score: 40.33  E-value: 7.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  676 EQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLstheltslleKELEQSQKEASDLLEQNRLlqdqlrvalgreqsARE 755
Cdd:pfam15934   45 EQEQQLKEFTVQNQRLACQIDNLHETLKDRDHQI----------KQLQSMITGYSDISENNRL--------------KEE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  756 GYVLQTEVATspsgawqrLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNaaaela 835
Cdd:pfam15934  101 IHDLKQKNCV--------QARVVRKMGLELKGQEEQRVELCDKYESLLGSFEEQCQELKRANRRVQSLQTRLSQ------ 166
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931  836 ikeqaLAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQ 904
Cdd:pfam15934  167 -----VEKLQEELRTERKILREEVIALKEKDAKSNGRERALQDQLKCCQTEIEKSRTLIRNMQSHLQLE 230
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
809-990 7.28e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 41.67  E-value: 7.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  809 AKDAIRHHEAEIQTLQTRlGNAAAELAIKEQALAKLKGELKmeqgkVREQLEEwqhskamlsgQLRASEQKLRSTEARLL 888
Cdd:pfam09731  292 AHREIDQLSKKLAELKKR-EEKHIERALEKQKEELDKLAEE-----LSARLEE----------VRAADEAQLRLEFERER 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  889 EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseqaQRLMEKKLKrnytlllESCEQEKQALLQNLKEVEDKA 968
Cdd:pfam09731  356 EEIRESYEEKLRTELERQAEAHEEHLKDVLVEQEIEL------QREFLQDIK-------EKVEEERAGRLLKLNELLANL 422
                          170       180
                   ....*....|....*....|...
gi 1907081931  969 SAYEDQLQGHVQQV-EALQKEKL 990
Cdd:pfam09731  423 KGLEKATSSHSEVEdENRKAQQL 445
TOPEUc smart00435
DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...
1971-2067 7.56e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras


Pssm-ID: 214661 [Multi-domain]  Cd Length: 391  Bit Score: 41.18  E-value: 7.56e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  1971 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2047
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346
                            90       100
                    ....*....|....*....|
gi 1907081931  2048 DLQRQHQReLEKLREEKDRL 2067
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
2075-2188 8.21e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 40.90  E-value: 8.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2075 TISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQA 2154
Cdd:pfam09787   43 TALTLELEELRQERDLLREEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1907081931 2155 LRQCQRENQELNAHNQELNNRLAAEITRLRTLLT 2188
Cdd:pfam09787  123 LRYLEEELRRSKATLQSRIKDREAEIEKLRNQLT 156
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
718-893 8.81e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.97  E-value: 8.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQ---DQLRVALGR------EQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEaq 788
Cdd:COG3883     56 LQAELEALQAEIDKLQAEIAEAEaeiEERREELGEraralyRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADADAD-- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  789 crrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAM 868
Cdd:COG3883    134 ------LLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAA 207
                          170       180
                   ....*....|....*....|....*
gi 1907081931  869 LSGQLRASEQKLRSTEARLLEKTQE 893
Cdd:COG3883    208 AEAAAAAAAAAAAAAAAAAAAAAAA 232
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
718-1003 9.76e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 41.16  E-value: 9.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEvatspsgawqrlhRVNQDLQSELEAQCRRQELITQ 797
Cdd:TIGR04523  361 KQRELEEKQNEIEKLKKENQSYKQEIK-NLESQINDLESKIQNQE-------------KLNQQKDEQIKKLQQEKELLEK 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTLKHSYGEAKDAIRHHEAEIqtlqtrlgnAAAELAIKEqaLAKLKGELKmeqgkvrEQLEEWQHSKAMLSGQLRASE 877
Cdd:TIGR04523  427 EIERLKETIIKNNSEIKDLTNQD---------SVKELIIKN--LDNTRESLE-------TQLKVLSRSINKIKQNLEQKQ 488
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRSTEARLLEKTQELRDLETQQALQRDRQKE----VQRLQECIAELSQQLGTSEQAQRLMEKKLKRNytlLLESCEQE 953
Cdd:TIGR04523  489 KELKSKEKELKKLNEEKKELEEKVKDLTKKISSlkekIEKLESEKKEKESKISDLEDELNKDDFELKKE---NLEKEIDE 565
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081931  954 KQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKL 1003
Cdd:TIGR04523  566 KNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSL 615
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH