NCBI Conserved Domain Search

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

718-1032

8.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.20 E-value: 8.17e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 791
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 867
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 947
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1907081931 1028 RDLIK 1032
Cdd:COG1196    497 LEAEA 501

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

706-987

6.42e-11

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.16 E-value: 6.42e-11

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  706 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 785
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  786 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 865
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  942 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

770-1135

2.23e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 66.24 E-value: 2.23e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  770 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 849
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  850 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 912
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  913 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 985
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  986 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1045
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1046 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1109
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1907081931 1110 NEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1975-2240

4.11e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.26 E-value: 4.11e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2053
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2054 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2134 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2213
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

Myosin_tail_1

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

666-1161

5.50e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 58.65 E-value: 5.50e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  666 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 734
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  735 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 798
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  799 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 878
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  879 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 958
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  959 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1028 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1106
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470

                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 1107 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1161
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1986-2178

7.30e-08

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.14 E-value: 7.30e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081931 2141 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2178
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1680-2282

2.46e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.14 E-value: 2.46e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1753
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1754 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1828
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRCmVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1908
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEG-VKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 1981
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2048
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2049 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2125
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2126 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPL 2200
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLE 827

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2201 TQGKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEAL 2280
Cdd:TIGR02168  828 SLERRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903


                   ..
gi 1907081931 2281 GE 2282
Cdd:TIGR02168  904 RE 905

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1819-2288

2.18e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 50.12 E-value: 2.18e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1819 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1890
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1891 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1970
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1971 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2027
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2028 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2096
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2097 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2170
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2171 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2234
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662

                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2235 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2288
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

PTZ00121

MAEBL; Provisional

1776-2180

9.29e-05

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 48.21 E-value: 9.29e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1852
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1853 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1932
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2012
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2092
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2172
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773


                   ....*...
gi 1907081931 2173 NNRLAAEI 2180
Cdd:PTZ00121  1774 RKEKEAVI 1781

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1680-2157

5.25e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.14 E-value: 5.25e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1759
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1760 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1839
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1840 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1919
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1920 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1999
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2000 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2077
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2078 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2152
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1907081931 2153 QALRQ 2157
Cdd:COG4717    507 EEYRE 511

PRK03918

DNA double-strand break repair ATPase Rad50;

1639-2274

9.56e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 44.67 E-value: 9.56e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1639 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1718
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1719 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1797
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1798 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1877
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1878 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 1957
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1958 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2035
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2036 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2113
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2114 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2190
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2191 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2269
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726


                   ....*
gi 1907081931 2270 KEQLK 2274
Cdd:PRK03918   727 REKVK 731

TOPEUc

smart00435

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...

1971-2067

7.56e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras

Pssm-ID: 214661 [Multi-domain] Cd Length: 391 Bit Score: 41.18 E-value: 7.56e-03

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  1971 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2047
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346

                            90       100
                    ....*....|....*....|
gi 1907081931  2048 DLQRQHQReLEKLREEKDRL 2067
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365

Name

Accession

Description

Interval

E-value

PH_M-RIP

cd13275

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...

429-530

5.44e-47

Pssm-ID: 270094 Cd Length: 104 Bit Score: 164.04 E-value: 5.44e-47

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 506
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80

                           90       100
                   ....*....|....*....|....
gi 1907081931  507 SGIRRNWIQTIMKHVLPASAPDVT 530
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

429-521

3.73e-16

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 76.05 E-value: 3.73e-16

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931   429 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 502
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82

                            90
                    ....*....|....*....
gi 1907081931   503 SAMTSGIRRNWIQTIMKHV 521
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101

pfam00169

PH domain; PH stands for pleckstrin homology.

429-517

2.43e-15

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 73.75 E-value: 2.43e-15

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 499
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82

                           90
                   ....*....|....*...
gi 1907081931  500 FTLSAMTSGIRRNWIQTI 517
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

718-1032

8.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.20 E-value: 8.17e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 791
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 867
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 947
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1907081931 1028 RDLIK 1032
Cdd:COG1196    497 LEAEA 501

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

772-1131

1.43e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.43 E-value: 1.43e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNqDLQSELEAQcrRQELITQQIQTLKhsYGEAKDAIRHHEAEIQTLQTRlgNAAAELAIKEQALAKLKGELKME 851
Cdd:COG1196    186 ENLERLE-DILGELERQ--LEPLERQAEKAER--YRELKEELKELEAELLLLKLR--ELEAELEELEAELEELEAELEEL 258

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  852 QGKVREQLEEWQHSKAmlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV----QRLQECIAELSQQLGT 927
Cdd:COG1196    259 EAELAELEAELEELRL----ELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELeerlEELEEELAELEEELEE 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  928 SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSEtckgseqvhklEEEL 1007
Cdd:COG1196    335 LEEELEELEEELEEA--------EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEA-----------LRAA 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1008 EAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREK 1087
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907081931 1088 EEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1131
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519

cd00821

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...

429-517

2.63e-12

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275388 [Multi-domain] Cd Length: 92 Bit Score: 64.87 E-value: 2.63e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ--YEDGQWKKHWFVLADQSLRYYRDSvAEEAADLDGEINLSTCYDVTEY-PVQRNYGFQIHTKEGE-FTLSA 504
Cdd:cd00821      1 KEGYLLKRggGGLKSWKKRWFVLFEGVLLYYKSK-KDSSYKPKGSIPLSGILEVEEVsPKERPHCFELVTPDGRtYYLQA 79

                           90
                   ....*....|...
gi 1907081931  505 MTSGIRRNWIQTI 517
Cdd:cd00821     80 DSEEERQEWLKAL 92

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

706-987

6.42e-11

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.16 E-value: 6.42e-11

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  706 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 785
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  786 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 865
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  942 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

770-1135

2.23e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 66.24 E-value: 2.23e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  770 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 849
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  850 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 912
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  913 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 985
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  986 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1045
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1046 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1109
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1907081931 1110 NEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

772-1167

3.88e-10

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 65.46 E-value: 3.88e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQ------SELEAQCRRqeLITQQIQTLKhsYGEAKDAIRHHEAEIQTLQtrlgnaaaelaiKEQALAKLK 845
Cdd:TIGR02168  179 RKLERTRENLDrledilNELERQLKS--LERQAEKAER--YKELKAELRELELALLVLR------------LEELREELE 242

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  846 gELKMEQGKVREQLEEwqhskamLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQL 925
Cdd:TIGR02168  243 -ELQEELKEAEEELEE-------LTAELQELEEKLEELRLEVSELEEEIEEL----------QKELYALANEISRLEQQK 304

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  926 GTSEQAQRLMEKKLKRnYTLLLESCEQEKQALLQNLKEVEDKasayEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEE 1005
Cdd:TIGR02168  305 QILRERLANLERQLEE-LEAQLEELESKLDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEE 379

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1006 EleareasIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEklrgkevdyQNLEHSHHRVSVQLQSVRTLLR 1085
Cdd:TIGR02168  380 Q-------LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ---------EIEELLKKLEEAELKELQAELE 443

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1086 EKEEELKHIKETHERV---LEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFvSDSPKDAKEPLsttEPTEEGS 1162
Cdd:TIGR02168  444 ELEEELEELQEELERLeeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENL-EGFSEGVKALL---KNQSGLS 519


                   ....*
gi 1907081931 1163 GILPL 1167
Cdd:TIGR02168  520 GILGV 524

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

720-929

1.42e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 63.78 E-value: 1.42e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  720 KELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPsgAWQRLHRVN------QDLQSELEAQCRRQE 793
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALR--LWFAQRRLElleaelEELRAELARLEAELE 312

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  794 LITQQIQTLKHSYGEAKDAIRHH--------EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHS 865
Cdd:COG4913    313 RLEARLDALREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAAL 392

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081931  866 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV-QRLQECIAELSQQLGTSE 929
Cdd:COG4913    393 LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIpARLLALRDALAEALGLDE 457

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

837-1135

1.43e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.92 E-value: 1.43e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  837 KEQALAKLKGELKMEQGKVREQLEEwqhsKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQqaLQRDRQkEVQRLQE 916
Cdd:TIGR02168  675 RRREIEELEEKIEELEEKIAELEKA----LAELRKELEELEEELEQLRKELEELSRQISALRKD--LARLEA-EVEQLEE 747

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  917 CIAELSQQLgtSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEkLSETckg 996
Cdd:TIGR02168  748 RIAQLSKEL--TELEAEIEELEER------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE-LTLL--- 815

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  997 SEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQ 1076
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 1077 LQSVRTLLREKE----------EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR02168  896 LEELSEELRELEskrselrrelEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIE 964

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

817-1125

1.45e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.92 E-value: 1.45e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  817 EAEIQTLQTRLGNAAAELAIKEQALAKLKGE---LKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE 893
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  894 LRDLETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYE 972
Cdd:TIGR02168  756 LTELEAEiEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA-----LDELRAELTLLNEEAANLRERLESLE 830

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  973 DQLQGHVQQVEALQKEKLSEtckgSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1052
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEEL----SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL----KHIKETHERVLEKKDQDLNEALVKMIALGSSLEE 1125
Cdd:TIGR02168  907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeySLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

854-1138

3.56e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.26 E-value: 3.56e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLsgQLRASEQKLRSTEARLLEKTQELRDLETQQALqrdRQKEVQRLQECIAELSQQLGTSEQAQR 933
Cdd:COG1196    217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAE---LEAELEELRLELEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  934 LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREAS 1013
Cdd:COG1196    292 ELLAELAR--------LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELE----EELEEAEEELEEAEAE 359

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1014 IRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKH 1093
Cdd:COG1196    360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907081931 1094 IKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLR 1138
Cdd:COG1196    440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1975-2240

4.11e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.26 E-value: 4.11e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2053
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2054 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2134 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2213
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

772-1133

7.72e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 61.24 E-value: 7.72e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKme 851
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK-- 761

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  852 qgKVREQLEEWQHSKAMLSGQLRASEQKLRstEARLLEKTQELRDLEtqqalqrdrqKEVQRLQECIAELSQQLGTSEQA 931
Cdd:TIGR02169  762 --ELEARIEELEEDLHKLEEALNDLEARLS--HSRIPEIQAELSKLE----------EEVSRIEARLREIEQKLNRLTLE 827

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  932 QRLMEKKLkrnytlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEARE 1011
Cdd:TIGR02169  828 KEYLEKEI------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1012 ASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE-E 1090
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPvN 974

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1091 LKHIKEtHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEK 1133
Cdd:TIGR02169  975 MLAIQE-YEEVLKRLD-ELKEKRAKLEEERKAILERIEEYEKK 1015

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

856-1160

1.10e-08

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.85 E-value: 1.10e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  856 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQalqRDRQKEVQRLQEciaelsqqlgtSEQAQRLM 935
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKI---GEIEKEIEQLEQ-----------EEEKLKER 738

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  936 EKKLKRNytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL-QKEKLSETCKGSEQVHKLEEELEAREASI 1014
Cdd:TIGR02169  739 LEELEED----LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeARLSHSRIPEIQAELSKLEEEVSRIEARL 814

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1015 RQLAQHVQSLHDERDLIKHQFQELMERVatsdgDVAELQEKLRGKEVDyqNLEHSHHRVSVQLQSVRTLLREKEEELKHI 1094
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQR-----IDLKEQIKSIEKEIE--NLNGKKEELEEELEELEAALRDLESRLGDL 887

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1095 K------ETHERVLEKKDQDLNEALVKmiaLGSSLEETEIKLQEKEECLRRFvsDSPKDAKEPLSTTEPTEE 1160
Cdd:TIGR02169  888 KkerdelEAQLRELERKIEELEAQIEK---KRKRLSELKAKLEALEEELSEI--EDPKGEDEEIPEEELSLE 954

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

772-988

1.83e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 59.01 E-value: 1.83e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDL---QSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGEl 848
Cdd:COG4942     27 AELEQLQQEIaelEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE- 105

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  849 kmeqgkvreqleewqhskamLSGQLRASEQKLRSTEARLL----EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQ 924
Cdd:COG4942    106 --------------------LAELLRALYRLGRQPPLALLlspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAAL 165

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  925 LGTSEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE 988
Cdd:COG4942    166 RAELEAERAELEALLAEL--------EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

PH2_MyoX

cd13296

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...

429-529

3.64e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270108 Cd Length: 103 Bit Score: 53.24 E-value: 3.64e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDG------QWKKHWFVLADQSLRYYRDsvAEEAADLDGEINLSTCYDVTEYPVQRNyGFQIHTKEGEFTL 502
Cdd:cd13296      1 KSGWLTKKGGGSstlsrrNWKSRWFVLRDTVLKYYEN--DQEGEKLLGTIDIRSAKEIVDNDPKEN-RLSITTEERTYHL 77

                           90       100
                   ....*....|....*....|....*..
gi 1907081931  503 SAMTSGIRRNWIQtIMKHVLPASAPDV 529
Cdd:cd13296     78 VAESPEDASQWVN-VLTRVISATDLEL 103

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

818-989

3.85e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 58.63 E-value: 3.85e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  818 AEIQTLQTRLGNAAAELAIKEQALAKLKgELKMEQGKVREQLEEWQHSKAMLSGQLRASE--QKLRSTEARLLEKTQELR 895
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELE-ELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLE 149

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQL 975
Cdd:COG4717    150 ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229

                          170
                   ....*....|....
gi 1907081931  976 QGHVQQVEALQKEK 989
Cdd:COG4717    230 EQLENELEAAALEE 243

Myosin_tail_1

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

666-1161

5.50e-08

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 58.65 E-value: 5.50e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  666 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 734
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  735 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 798
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  799 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 878
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  879 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 958
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  959 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1027
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1028 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1106
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470

                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 1107 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1161
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

874-1135

6.81e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.02 E-value: 6.81e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  874 RASEQKLRSTEARLL-------EKTQELRDLETQ-------QALQ---RDRQKEVQRLQecIAELSQQLGTSEQAQRLME 936
Cdd:COG1196    175 EEAERKLEATEENLErledilgELERQLEPLERQaekaeryRELKeelKELEAELLLLK--LRELEAELEELEAELEELE 252

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  937 KKLKRnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREASIRQ 1016
Cdd:COG1196    253 AELEE-LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEELAE 327

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1017 LAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEvdyQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE---AELAEAEEELEELAEELLEALRAAAELAAQLEE 404

                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907081931 1097 ThERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:COG1196    405 L-EEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1986-2178

7.30e-08

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.14 E-value: 7.30e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081931 2141 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2178
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

797-987

1.01e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 57.62 E-value: 1.01e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  797 QQIQTLKHSYGEAKDAIRHHEA--EIQTLQTRLGNAAAELAIKEQALAKLK--------GELKMEQGKVREQLEEWQHSK 866
Cdd:COG4913    232 EHFDDLERAHEALEDAREQIELlePIRELAERYAAARERLAELEYLRAALRlwfaqrrlELLEAELEELRAELARLEAEL 311

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  867 AMLSGQLRASEQKLRSTEARLLE-KTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL 945
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGnGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAA 391

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907081931  946 LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 987
Cdd:COG4913    392 LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1974-2240

1.38e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 57.23 E-value: 1.38e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1974 PAAKDEAESMSGLRERIQELEAQMGVMREELGHkeLEgDVAALQEKYQRDFESLKA--TCERGFAAmeETHQKKIEDLQR 2051
Cdd:COG4913    221 PDTFEAADALVEHFDDLERAHEALEDAREQIEL--LE-PIRELAERYAAARERLAEleYLRAALRL--WFAQRRLELLEA 295

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2052 qhqrELEKLREEKDRLLAEETAATISAIEAmkNAHREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLS 2131
Cdd:COG4913    296 ----ELEELRAELARLEAELERLEARLDAL--REELDELEAQIRGNGGDRLEQLEREIERLER----ELEERERRRARLE 365

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2132 EQysqkcLENAHLAQALEAE--RQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqgkdayEL 2209
Cdd:COG4913    366 AL-----LAALGLPLPASAEefAALRAEAAALLEALEEELEALEEALAEAEAALR-----------------------DL 417

                          250       260       270
                   ....*....|....*....|....*....|.
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQTALRD 2240
Cdd:COG4913    418 RRELRELEAEIASLERRKSNIPARLLALRDA 448

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1801-2162

1.48e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 57.00 E-value: 1.48e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1801 QAVGALKEEYEELLhkqksEYQKVITliEKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSH 1880
Cdd:TIGR02169  198 QQLERLRREREKAE-----RYQALLK--EKREYEGYELLKEKEALERQKEAIERQLASL--------EEELEKLTEEISE 262

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1881 TENTLqAERSRVLSQLDASVKDRQAMEQHHVQ--------QMKMLEDRFQLKVRELQ------AVHQEELRALQEHyIWS 1946
Cdd:TIGR02169  263 LEKRL-EEIEQLLEELNKKIKDLGEEEQLRVKekigeleaEIASLERSIAEKERELEdaeerlAKLEAEIDKLLAE-IEE 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1947 LRGALSLYQpshpdsslapgpSEPRAVPAA-KDEAESMSGLRERIQELEAQMGVMREELghkelegdvaalqEKYQRDFE 2025
Cdd:TIGR02169  341 LEREIEEER------------KRRDKLTEEyAELKEELEDLRAELEEVDKEFAETRDEL-------------KDYREKLE 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2026 SLKatcergfaameethqKKIEDLQRQHQRELEKLREEKDRLlaEETAATISAIEAMKNAHREEME--RELEKSQRSQIS 2103
Cdd:TIGR02169  396 KLK---------------REINELKRELDRLQEELQRLSEEL--ADLNAAIAGIEAKINELEEEKEdkALEIKKQEWKLE 458

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 2104 SINSDIEALRRQYL---EELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQREN 2162
Cdd:TIGR02169  459 QLAADLSKYEQELYdlkEEYDRVEKELSKLQRELAE-----------AEAQARASEERVRGG 509

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1992-2286

1.64e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 57.00 E-value: 1.64e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1992 ELEAQMGVMREELGhkELEGDVAALQEKyqrdFESLKATCERGFAAMEETHqKKIEDLQRQHQRELEKLREEKDRLlaEE 2071
Cdd:TIGR02169  671 SEPAELQRLRERLE--GLKRELSSLQSE----LRRIENRLDELSQELSDAS-RKIGEIEKEIEQLEQEEEKLKERL--EE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2072 TAATISAIEAMKNAHREEMErELEK---SQRSQISSINSDIEALRRQYLEE-LQSVQRELEVLSEQYSQKCLENAHLAQA 2147
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELK-ELEArieELEEDLHKLEEALNDLEARLSHSrIPEIQAELSKLEEEVSRIEARLREIEQK 820

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2148 LEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEStglpltqgkDAYELEVLLRVKESEIQYLKQEI 2227
Cdd:TIGR02169  821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE---------ELEELEAALRDLESRLGDLKKER 891

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2228 SSLKDELQTALRDKKYASDKYKDIYTELSIAKAKAdcdiSRLKEQLKAATEALGEKSPE 2286
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKL----EALEEELSEIEDPKGEDEEI 946

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

673-918

2.26e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 56.61 E-value: 2.26e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  673 VEIEQRWHQVET--TPLREEKQVPIAPLHLSLEdrSERLSTHELTSLLEKELEQSQKE-ASDLLEQNRLLQD--QLRVAL 747
Cdd:TIGR02169  268 EEIEQLLEELNKkiKDLGEEEQLRVKEKIGELE--AEIASLERSIAEKERELEDAEERlAKLEAEIDKLLAEieELEREI 345

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  748 GREQSAREGyvLQTEVATSPsgawQRLHRVNQDLQS-ELEAQCRRQEL--ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQ 824
Cdd:TIGR02169  346 EEERKRRDK--LTEEYAELK----EELEDLRAELEEvDKEFAETRDELkdYREKLEKLKREINELKRELDRLQEELQRLS 419

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  825 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG---QLRASEQKLRSTEARLLEKTQELRDLETQQ 901
Cdd:TIGR02169  420 EELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKyeqELYDLKEEYDRVEKELSKLQRELAEAEAQA 499

                          250
                   ....*....|....*..
gi 1907081931  902 ALQRDRQKEVQRLQECI 918
Cdd:TIGR02169  500 RASEERVRGGRAVEEVL 516

PH_AtPH1

cd13276

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...

429-525

2.69e-07

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270095 Cd Length: 106 Bit Score: 50.78 E-value: 2.69e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVT--EYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13276      1 KAGWLEKQGEFiKTWRRRWFVLKQGKLFWFKEPDVTPYSKPRGVIDLSKCLTVKsaEDATNKENAFELSTPEETFYFIAD 80

                           90       100
                   ....*....|....*....|
gi 1907081931  506 TSGIRRNWIQTIMKHVLPAS 525
Cdd:cd13276     81 NEKEKEEWIGAIGRAIVKHS 100

PH-GRAM1_AGT26

cd13215

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, ...

428-521

2.78e-07

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, repeat 1; ATG26 (also called UGT51/UDP-glycosyltransferase 51), a member of the glycosyltransferase 28 family, resulting in the biosynthesis of sterol glucoside. ATG26 in decane metabolism and autophagy. There are 32 known autophagy-related (ATG) proteins, 17 are components of the core autophagic machinery essential for all autophagy-related pathways and 15 are the additional components required only for certain pathways or species. The core autophagic machinery includes 1) the ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and ATG27), 2) the phosphatidylinositol 3-kinase complex (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7, ATG8, ATG10, ATG12, and ATG16). Less is known about how the core machinery is adapted or modulated with additional components to accommodate the nonselective sequestration of bulk cytosol (autophagosome formation) or selective sequestration of specific cargos (Cvt vesicle, pexophagosome, or bacteria-containing autophagosome formation). The pexophagosome-specific additions include the ATG30-ATG11-ATG17 receptor-adaptors complex, the coiled-coil protein ATG25, and the sterol glucosyltransferase ATG26. ATG26 is necessary for the degradation of medium peroxisomes. It contains 2 GRAM domains and a single PH domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275402 Cd Length: 116 Bit Score: 51.08 E-value: 2.78e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSvaeeaADL---DGEINLSTCY--DVTEYPVQRNYGFQIHTKEGEFT 501
Cdd:cd13215     22 IKSGYLSKRsKRTLRYTRYWFVLKGDTLSWYNSS-----TDLyfpAGTIDLRYATsiELSKSNGEATTSFKIVTNSRTYK 96

                           90       100
                   ....*....|....*....|
gi 1907081931  502 LSAMTSGIRRNWIQTIMKHV 521
Cdd:cd13215     97 FKADSETSADEWVKALKKQI 116

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

690-1095

3.35e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 55.89 E-value: 3.35e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  690 EKQVPIAPLHLSlEDRSER----LSTHELTSLLEK---ELEQSQKEASDLLEQNRLLQDQ-LRVALGREQSAREGYVLQT 761
Cdd:pfam15921  348 EKQLVLANSELT-EARTERdqfsQESGNLDDQLQKllaDLHKREKELSLEKEQNKRLWDRdTGNSITIDHLRRELDDRNM 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  762 EVatspsgawQRLHRVNQDLQSELEAQCRRQ--------------ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRL 827
Cdd:pfam15921  427 EV--------QRLEALLKAMKSECQGQMERQmaaiqgkneslekvSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTV 498

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  828 GNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHskamlsgqLRASEQKLRSTEArllektqELRDLETQQAlQRDR 907
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQH--------LKNEGDHLRNVQT-------ECEALKLQMA-EKDK 562

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  908 QKEVQRLQ-ECIAELSQQLGTSEQAQRLMEKKLKRNYtlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEaLQ 986
Cdd:pfam15921  563 VIEILRQQiENMTQLVGQHGRTAGAMQVEKAQLEKEI--------NDRRLELQEFKILKDKKDAKIRELEARVSDLE-LE 633

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  987 KEKLSETckGSEQVhkleeeleareasirqlaQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQN- 1065
Cdd:pfam15921  634 KVKLVNA--GSERL------------------RAVKDIKQERD-------QLLNEVKTSRNELNSLSEDYEVLKRNFRNk 686

                          410       420       430
                   ....*....|....*....|....*....|...
gi 1907081931 1066 ---LEHSHHRVSVQLQSVRTLLREKEEELKHIK 1095
Cdd:pfam15921  687 seeMETTTNKLKMQLKSAQSELEQTRNTLKSME 719

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1868-2165

3.44e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.84 E-value: 3.44e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1868 EEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLE--DRFQLKVRELQAVHQEELRALQehyiw 1945
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEqlEQEEEKLKERLEELEEDLSSLE----- 750

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1946 slrgalslyqpshpdsslapgpsepRAVPAAKDEaesMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKyQRDFE 2025
Cdd:TIGR02169  751 -------------------------QEIENVKSE---LKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI-QAELS 801

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2026 SLKATCERGFAAMEETHQKkiedLQRQHQRE--LEKLREEK--DRLLAEETAATISAIEAMKNAHREEMERELEKSQRS- 2100
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQK----LNRLTLEKeyLEKEIQELqeQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAl 877

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 2101 -QISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQEL 2165
Cdd:TIGR02169  878 rDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

818-1158

4.47e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.46 E-value: 4.47e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  818 AEIQTLQTRLGNAAAELAIKEQALAKLKGElkmeqgkvREQLEEWQHskamLSGQLRASEQKLRSTEARLLEKTQE--LR 895
Cdd:TIGR02169  177 EELEEVEENIERLDLIIDEKRQQLERLRRE--------REKAERYQA----LLKEKREYEGYELLKEKEALERQKEaiER 244

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQALQRDRQKEVQRLQECIAELSQQLgtSEQAQRLMEKKLKRNYTLllesceQEKQALLQ-NLKEVEDKASAYEDQ 974
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLL--EELNKKIKDLGEEEQLRV------KEKIGELEaEIASLERSIAEKERE 316

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  975 LQghvqQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQE 1054
Cdd:TIGR02169  317 LE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1055 KLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETH---ERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1131
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKInelEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

                          330       340
                   ....*....|....*....|....*..
gi 1907081931 1132 EKEECLRRfVSDSPKDAKEPLSTTEPT 1158
Cdd:TIGR02169  473 DLKEEYDR-VEKELSKLQRELAEAEAQ 498

PH_CNK_mammalian-like

cd01260

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; ...

431-475

4.96e-07

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; CNK family members function as protein scaffolds, regulating the activity and the subcellular localization of RAS activated RAF. There is a single CNK protein present in Drosophila and Caenorhabditis elegans in contrast to mammals which have 3 CNK proteins (CNK1, CNK2, and CNK3). All of the CNK members contain a sterile a motif (SAM), a conserved region in CNK (CRIC) domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with the exception of CNK3, a PH domain. A CNK2 splice variant CNK2A also has a PDZ domain-binding motif at its C terminus and Drosophila CNK (D-CNK) also has a domain known as the Raf-interacting region (RIR) that mediates binding of the Drosophila Raf kinase. This cd contains CNKs from mammals, chickens, amphibians, fish, and crustacea. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269962 Cd Length: 114 Bit Score: 50.48 E-value: 4.96e-07

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081931  431 GWLTKQYEDG-----QWKKHWFVLADQSLRYYRDSVAEEAadlDGEINLS 475
Cdd:cd01260     17 GWLWKKKEAKsffgqKWKKYWFVLKGSSLYWYSNQQDEKA---EGFINLP 63

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1984-2282

5.00e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 55.33 E-value: 5.00e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1984 SGLRERIQELEAQMGVMREELG-----HKELEGDVAALQE---------KYQRDFESLKAtcergfaameETHQKKIEDL 2049
Cdd:COG1196    168 SKYKERKEEAERKLEATEENLErlediLGELERQLEPLERqaekaeryrELKEELKELEA----------ELLLLKLREL 237

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEMERELEKSQR-----SQISSINSDIEAL---RRQYLEE 2119
Cdd:COG1196    238 EAELEELEAELEELEAELeeLEAELAELEAELEELRLELEELELELEEAQAEeyellAELARLEQDIARLeerRRELEER 317

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2120 LQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDgggestglp 2199
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL--------- 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2200 LTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2279
Cdd:COG1196    389 LEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468


                   ...
gi 1907081931 2280 LGE 2282
Cdd:COG1196    469 LEE 471

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1909-2235

5.08e-07

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 55.45 E-value: 5.08e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 HHVQQMKMLEDRFQLKVRELQAVhQEELRALQEHYIWSLRGALSLYQPSHpdsslapgpsepravpaakDEAESMSGLRE 1988
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAEL-RKELEELEEELEQLRKELEELSRQIS-------------------ALRKDLARLEA 740

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATcergfaameETHQKKIEDLQRQHQRELEKLREEKDRL- 2067
Cdd:TIGR02168  741 EVEQLEERIAQLSKEL--TELEAEIEELEERLEEAEEELAEA---------EAEIEELEAQIEQLKEELKALREALDELr 809

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2068 -----LAEETAATISAIEAMKNAHREEmERELEKSQRsQISSINSDIEALRRQyLEELQS----VQRELEVLSEQYSQKC 2138
Cdd:TIGR02168  810 aeltlLNEEAANLRERLESLERRIAAT-ERRLEDLEE-QIEELSEDIESLAAE-IEELEElieeLESELEALLNERASLE 886

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2139 LENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLltgdgggESTGLPLTQ-----GKDAYE-LEVL 2212
Cdd:TIGR02168  887 EALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQErlseeYSLTLEeAEAL 959

                          330       340
                   ....*....|....*....|...
gi 1907081931 2213 LRVKESEIQYLKQEISSLKDELQ 2235
Cdd:TIGR02168  960 ENKIEDDEEEARRRLKRLENKIK 982

PH1_ARAP

cd13253

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, ...

429-522

5.51e-07

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, repeat 1; ARAP proteins (also called centaurin delta) are phosphatidylinositol 3,4,5-trisphosphate-dependent GTPase-activating proteins that modulate actin cytoskeleton remodeling by regulating ARF and RHO family members. They bind phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3) and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4,5)P2) binding. There are 3 mammalian ARAP proteins: ARAP1, ARAP2, and ARAP3. All ARAP proteins contain a N-terminal SAM (sterile alpha motif) domain, 5 PH domains, an ArfGAP domain, 2 ankyrin domain, A RhoGap domain, and a Ras-associating domain. This hierarchy contains the first PH domain in ARAP. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270073 Cd Length: 94 Bit Score: 49.69 E-value: 5.51e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDGQ---WKKHWFVLADQSLRYYRdsvAEEAADLDGEINLSTcydVTEYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13253      2 KSGYLDKQGGQGNnkgFQKRWVVFDGLSLRYFD---SEKDAYSKRIIPLSA---ISTVRAVGDNKFELVTTNRTFVFRAE 75

                           90
                   ....*....|....*..
gi 1907081931  506 TSGIRRNWIQTIMKHVL 522
Cdd:cd13253     76 SDDERNLWCSTLQAAIS 92

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

718-1135

7.86e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.64 E-value: 7.86e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVaLGREQSAREGYVLQTEvatspsgaWQRLhRVNQDLqSELEAQCRRQELITQ 797
Cdd:TIGR04523  150 KEKELEKLNNKYNDLKKQKEELENELNL-LEKEKLNIQKNIDKIK--------NKLL-KLELLL-SNLKKKIQKNKSLES 218

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHskamlsgQLRASE 877
Cdd:TIGR04523  219 QISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLN-----------QLKDEQNKIKKQLSEKQK-------ELEQNN 280

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRSTEARLLEKTQELRDL--ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQA-QRLMEK--KLKRNytllLESCEQ 952
Cdd:TIGR04523  281 KKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIiSQLNEQisQLKKE----LTNSES 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  953 EKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL--QKEKLSETCKGSEQVHKleeeleareasirQLAQHVQSLHDERDL 1030
Cdd:TIGR04523  357 ENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLesQINDLESKIQNQEKLNQ-------------QKDEQIKKLQQEKEL 423

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1031 IKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNL--------------EHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntresletqlkvlSRSINKIKQNLEQKQKELKSKEKELKKLNE 503

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1097 tHERVLEKKDQDLN----EALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR04523  504 -EKKELEEKVKDLTkkisSLKEKIEKLESEKKEKESKISDLED 545

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1829-2214

9.42e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 54.56 E-value: 9.42e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQ 1908
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLE-LEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 HHVQQMKMLEDRFQLKVRELQAVHQEELRALQEhyiwslrgalslyqpshpdsslapgpsepravpaakdEAEsmsgLRE 1988
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEERLEELEEE-------------------------------------LAE----LEE 330

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQEKYQRdfeslkatcergfaamEETHQKKIEDLQRQHQRELEKLREEKDRLL 2068
Cdd:COG1196    331 ELEELEEELEELEEEL--EEAEEELEEAEAELAE----------------AEEALLEAEAELAEAEEELEELAEELLEAL 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2069 AEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQAL 2148
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931 2149 EAERQALRQCQRENQELNA-----HNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLR 2214
Cdd:COG1196    473 ALLEAALAELLEELAEAAArllllLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA 543

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

797-1127

1.01e-06

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.26 E-value: 1.01e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  797 QQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAI---KEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQL 873
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  874 RASEQKLRSTEA-----RLLEKtqELRDLETQQA-LQRDRQKEVQRLQECIAELSQqlgTSEQAQRLMEKKLKRNYTLll 947
Cdd:TIGR04523  197 LKLELLLSNLKKkiqknKSLES--QISELKKQNNqLKDNIEKKQQEINEKTTEISN---TQTQLNQLKDEQNKIKKQL-- 269

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  948 esceQEKQallQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKG--------SEQVHKLEEELEAREASIRQLAQ 1019
Cdd:TIGR04523  270 ----SEKQ---KELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNE 342

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1020 HVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKhIKETHE 1099
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIK-KLQQEK 421

                          330       340
                   ....*....|....*....|....*...
gi 1907081931 1100 RVLEKKDQDLNEALVKMIALGSSLEETE 1127
Cdd:TIGR04523  422 ELLEKEIERLKETIIKNNSEIKDLTNQD 449

PH_PEPP1_2_3

cd13248

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; ...

429-517

1.15e-06

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; PEPP1 (also called PLEKHA4/PH domain-containing family A member 4 and RHOXF1/Rhox homeobox family member 1), and related homologs PEPP2 (also called PLEKHA5/PH domain-containing family A member 5) and PEPP3 (also called PLEKHA6/PH domain-containing family A member 6), have PH domains that interact specifically with PtdIns(3,4)P3. Other proteins that bind PtdIns(3,4)P3 specifically are: TAPP1 (tandem PH-domain-containing protein-1) and TAPP2], PtdIns3P AtPH1, and Ptd- Ins(3,5)P2 (centaurin-beta2). All of these proteins contain at least 5 of the 6 conserved amino acids that make up the putative phosphatidylinositol 3,4,5- trisphosphate-binding motif (PPBM) located at their N-terminus. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270068 Cd Length: 104 Bit Score: 49.19 E-value: 1.15e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTcYDVT----EYPVQRNYGFQIhTKEGEFT- 501
Cdd:cd13248      9 MSGWLHKQGGSGlkNWRKRWFVLKDNCLYYYKD---PEEEKALGSILLPS-YTISpappSDEISRKFAFKA-EHANMRTy 83

                           90
                   ....*....|....*..
gi 1907081931  502 -LSAMTSGIRRNWIQTI 517
Cdd:cd13248     84 yFAADTAEEMEQWMNAM 100

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

718-1090

1.16e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 54.00 E-value: 1.16e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSARegyvLQTEVATSPSG---------AWQRLHRVNQDLQSEL-EA 787
Cdd:COG4717    100 LEEELEELEAELEELREELEKLEKLLQLLPLYQELEA----LEAELAELPERleeleerleELRELEEELEELEAELaEL 175

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  788 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgELKMEQGKVREQLEEWQHSKA 867
Cdd:COG4717    176 QEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL--ENELEAAALEERLKEARLLLL 253

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  868 MLSGQL------------------------------RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQEC 917
Cdd:COG4717    254 IAAALLallglggsllsliltiagvlflvlgllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  918 I--AELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQ-----NLKEVEDKASAYED--QLQGHVQQVEALQKE 988
Cdd:COG4717    334 LspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAeagveDEEELRAALEQAEEyqELKEELEELEEQLEE 413

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  989 KLSETCKGSEQVHKLE--EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMErvatsDGDVAELQEKLRGKEVDYQNL 1066
Cdd:COG4717    414 LLGELEELLEALDEEEleEELEELEEELEELEEELEELREELAELEAELEQLEE-----DGELAELLQELEELKAELREL 488

                          410       420
                   ....*....|....*....|....
gi 1907081931 1067 EHSHHRVSVQLQSVRTLLREKEEE 1090
Cdd:COG4717    489 AEEWAALKLALELLEEAREEYREE 512

SMC_N

pfam02463

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...

833-1134

1.24e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.

Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 54.21 E-value: 1.24e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  833 ELAIKEQALAKL------KGELKMEQGKVREQL--EEWQHSKAMLSGQLRASEQKLRSTEARLLEKT---QELRDLETQQ 901
Cdd:pfam02463  167 LKRKKKEALKKLieetenLAELIIDLEELKLQElkLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKlneERIDLLQELL 246

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  902 ALQRDRQKEVQRLQECIAELSQQ----LGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAY 971
Cdd:pfam02463  247 RDEQEEIESSKQEIEKEEEKLAQvlkeNKEEEKEKKLQEEELKLLAKEEeelkseLLKLERRKVDDEEKLKESEKEKKKA 326

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  972 EDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQ---FQELMERVATSDGD 1048
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAaklKEEELELKSEEEKE 406

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1049 VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEI 1128
Cdd:pfam02463  407 AQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQL 486


                   ....*.
gi 1907081931 1129 KLQEKE 1134
Cdd:pfam02463  487 ELLLSR 492

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

674-1139

1.84e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 53.40 E-value: 1.84e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  674 EIEQRWHQVETTPLREEKQvpIAPLHLSLEDRSERL-STHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQS 752
Cdd:COG1196    285 EAQAEEYELLAELARLEQD--IARLEERRRELEERLeELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  753 AREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAA 832
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  833 ELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgQLRASEQKLRSTEARLLE--KTQELRDLETQQALQRDRQKE 910
Cdd:COG1196    443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA-ELLEELAEAAARLLLLLEaeADYEGFLEGVKAALLLAGLRG 521

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  911 VQR---------------LQECIAELSQQLGT------SEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKAS 969
Cdd:COG1196    522 LAGavavligveaayeaaLEAALAAALQNIVVeddevaAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAV 601

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  970 AYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELME---RVATSD 1046
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAallEAEAEL 681

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1047 GDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEET 1126
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761

                          490
                   ....*....|...
gi 1907081931 1127 EIKLQEKEECLRR 1139
Cdd:COG1196    762 LEELERELERLER 774

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

718-928

1.85e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 53.48 E-value: 1.85e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRL--LQDQLRVALGREQSAREGYV-LQTEVAtspsGAWQRLHRVNQDLQSELEAQcrRQEL 794
Cdd:COG3206    187 LRKELEEAEAALEEFRQKNGLvdLSEEAKLLLQQLSELESQLAeARAELA----EAEARLAALRAQLGSGPDAL--PELL 260

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  795 ITQQIQTLKHSYGEAkdairhhEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLE-EWQHSKAMLSgQL 873
Cdd:COG3206    261 QSPVIQQLRAQLAEL-------EAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEaELEALQAREA-SL 332

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  874 RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKE-VQRLQEciAELSQQLGTS 928
Cdd:COG3206    333 QAQLAQLEARLAELPELEAELRRLEREVEVARELYESlLQRLEE--ARLAEALTVG 386

PH2_ADAP

cd01251

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called ...

427-517

2.15e-06

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called centaurin alpha) is a phophatidlyinositide binding protein consisting of an N-terminal ArfGAP domain and two PH domains. In response to growth factor activation, PI3K phosphorylates phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 1 is recruited to the plasma membrane following growth factor stimulation by specific binding of its PH domain to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 2 is constitutively bound to the plasma membrane since it binds phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate with equal affinity. This cd contains the second PH domain repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241282 Cd Length: 105 Bit Score: 48.35 E-value: 2.15e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  427 NFKK-GWLTK----QYEdgQWKKHWFVLADQSLRYYRDSvaeeaadLD----GEINLSTC---YDVTE-----YPVQRNY 489
Cdd:cd01251      1 DFLKeGYLEKtgpkQTD--GFRKRWFTLDDRRLMYFKDP-------LDafpkGEIFIGSKeegYSVREglppgIKGHWGF 71

                           90       100
                   ....*....|....*....|....*...
gi 1907081931  490 GFQIHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd01251     72 GFTLVTPDRTFLLSAETEEERREWITAI 99

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1680-2282

2.46e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.14 E-value: 2.46e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1753
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1754 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1828
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1829 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRCmVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1908
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEG-VKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1909 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 1981
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2048
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2049 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2125
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2126 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPL 2200
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLE 827

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2201 TQGKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEAL 2280
Cdd:TIGR02168  828 SLERRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903


                   ..
gi 1907081931 2281 GE 2282
Cdd:TIGR02168  904 RE 905

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

830-1060

2.53e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.07 E-value: 2.53e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  830 AAAELAIKEQALAKLKGELKmeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ 908
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIA----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAElAELEKEIA 93

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  909 KEVQRLQECIAELSQQLgtseqaqRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQVEALQ 986
Cdd:COG4942     94 ELRAELEAQKEELAELL-------RALYRLGRQPPLALLLSPEDFLDAVrrLQYLKYLAPARREQAEELRADLAELAALR 166

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  987 KEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKE 1060
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

703-1032

3.71e-06

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.38 E-value: 3.71e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  703 EDRSERLSTheLTSLLEKELEQSQKEASDLLEQNRLLqdqlrvalgREQSAREGYVLqtevatspSGAWQRLHRVNQDLQ 782
Cdd:TIGR02169  183 EENIERLDL--IIDEKRQQLERLRREREKAERYQALL---------KEKREYEGYEL--------LKEKEALERQKEAIE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  783 SELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQ--------TLQTRLGNAAAELAIKEQALAKLKGELKMEQGK 854
Cdd:TIGR02169  244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgeeeqlRVKEKIGELEAEIASLERSIAEKERELEDAEER 323

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  855 VR---EQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----QALQRDRQKEVQRlQECIAELSQQLG 926
Cdd:TIGR02169  324 LAkleAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdKEFAETRDELKDY-REKLEKLKREIN 402

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  927 TSEQAQ-RLMEKKLKRNYTLL-----LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKL---SETCKGS 997
Cdd:TIGR02169  403 ELKRELdRLQEELQRLSEELAdlnaaIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYdlkEEYDRVE 482

                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907081931  998 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIK 1032
Cdd:TIGR02169  483 KELSKLQRELAEAEAQARASEERVRGGRAVEEVLK 517

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

856-1092

6.40e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.84 E-value: 6.40e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  856 REQLEEWQHSKAMLSgQLRASEQKLRSTEARLlEKTQELRD----------LETQQALQRDRQKEVQRLQECIAELSQQL 925
Cdd:COG4913    241 HEALEDAREQIELLE-PIRELAERYAAARERL-AELEYLRAalrlwfaqrrLELLEAELEELRAELARLEAELERLEARL 318

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  926 GTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQghvqqvealqkeKLSETCKGSEQVHKLEE 1005
Cdd:COG4913    319 DALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLA------------ALGLPLPASAEEFAALR 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1006 eleareasiRQLAQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLR 1085
Cdd:COG4913    387 ---------AEAAALLEALEEELE-------ALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALA 450

                          250
                   ....*....|.
gi 1907081931 1086 E----KEEELK 1092
Cdd:COG4913    451 EalglDEAELP 461

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2013-2322

6.42e-06

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 51.61 E-value: 6.42e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQRDFESLKATCERgfaamEETHQKKIEDLQRQhqreLEKLREEKDRLL--------AEETAATI--SAIEAM 2082
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEEN-----IERLDLIIDEKRQQ----LERLRREREKAEryqallkeKREYEGYEllKEKEAL 235

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2083 K------NAHREEMERELEKSQRsQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKCLENAHLAQA-LEAERQAL 2155
Cdd:TIGR02169  236 ErqkeaiERQLASLEEELEKLTE-EISELEKRLEEIEQ----LLEELNKKIKDLGEEEQLRVKEKIGELEAeIASLERSI 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2156 RQCQRENQELNAHNQELN---NRLAAEITRLRTLLtgdgggESTGLPLTQGKDAY-ELEVLLRVKESEIQYLKQEISSLK 2231
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEaeiDKLLAEIEELEREI------EEERKRRDKLTEEYaELKEELEDLRAELEEVDKEFAETR 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2232 DELQTALRDKKYASDKYKDIYTELSI---AKAKADCDISRLKEQLKAATEALGEkSPEGTTVSGYDIMKSKSNPDFLKKD 2308
Cdd:TIGR02169  385 DELKDYREKLEKLKREINELKRELDRlqeELQRLSEELADLNAAIAGIEAKINE-LEEEKEDKALEIKKQEWKLEQLAAD 463

                          330
                   ....*....|....
gi 1907081931 2309 RSCVTRQLRNIRSK 2322
Cdd:TIGR02169  464 LSKYEQELYDLKEE 477

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

779-1135

6.61e-06

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 51.56 E-value: 6.61e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDA-------IRHHEAEIQTLQTRLGNAAAELAI----KEQALAK-LKG 846
Cdd:TIGR04523  235 EKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQlsekqkeLEQNNKKIKELEKQLNQLKSEISDlnnqKEQDWNKeLKS 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  847 ELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ---KEVQRLQECIA 919
Cdd:TIGR04523  315 ELKNQEKKLEEiqnQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEiEKLKKENQsykQEIKNLESQIN 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  920 ELSQQLGTSEQAQRLME---KKLKRNYTLLLESCEQEKQALLQN---LKEVEDKASAYEDQLQGHVQQVEAlQKEKLSEt 993
Cdd:TIGR04523  395 DLESKIQNQEKLNQQKDeqiKKLQQEKELLEKEIERLKETIIKNnseIKDLTNQDSVKELIIKNLDNTRES-LETQLKV- 472

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  994 ckgseqvhkleeeleaREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1073
Cdd:TIGR04523  473 ----------------LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1074 SVQLQSVRTLLREKEEELKhiKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:TIGR04523  537 ESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596

PH_DAPP1

cd10573

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...

426-517

6.74e-06

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269977 [Multi-domain] Cd Length: 96 Bit Score: 46.55 E-value: 6.74e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  426 LNFKKGWLTKQyedGQ----WKKHWFVLADQSLRYYRDSVAEEAADldgEINLSTCYDVTEYPVQ-RNYGFQIHTKEGEF 500
Cdd:cd10573      2 LGSKEGYLTKL---GGivknWKTRWFVLRRNELKYFKTRGDTKPIR---VLDLRECSSVQRDYSQgKVNCFCLVFPERTF 75

                           90
                   ....*....|....*..
gi 1907081931  501 TLSAMTSGIRRNWIQTI 517
Cdd:cd10573     76 YMYANTEEEADEWVKLL 92

CALCOCO1

pfam07888

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...

717-964

7.37e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.

Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 51.05 E-value: 7.37e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  717 LLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQ----CRRQ 792
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKykelSASS 110

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  793 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHSKAMLSGQ 872
Cdd:pfam07888  111 EELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELE-----------RMKERAKKAGAQRKEEEAERKQLQAK 179

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  873 LRASEQKLRSTEARLlektQELRDLETQQALQrdrqkeVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL--LLESC 950
Cdd:pfam07888  180 LQQTEEELRSLSKEF----QELRNSLAQRDTQ------VLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLqeRLNAS 249

                          250
                   ....*....|....
gi 1907081931  951 EQEKQALLQNLKEV 964
Cdd:pfam07888  250 ERKVEGLGEELSSM 263

PH_TBC1D2A

cd01265

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...

442-520

7.54e-06

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269966 Cd Length: 102 Bit Score: 46.55 E-value: 7.54e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  442 WKKHWFVLADQS--LRYYRDSvaeeaADLD--GEINLSTC---YDVTEYPVQrnygFQIHTKEGEFTLSAMTSGIRRNWI 514
Cdd:cd01265     19 WKRRWFVLDESKcqLYYYRSP-----QDATplGSIDLSGAafsYDPEAEPGQ----FEIHTPGRVHILKASTRQAMLYWL 89


                   ....*.
gi 1907081931  515 QTIMKH 520
Cdd:cd01265     90 QALQSK 95

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

778-1134

1.10e-05

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.79 E-value: 1.10e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  778 NQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGElkmEQGKvRE 857
Cdd:TIGR04523  309 NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE---NQSY-KQ 384

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  858 QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQ----ALQRDRQKEVQRLQECIAELSQQLGTSEQAQR 933
Cdd:TIGR04523  385 EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIerlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRE 464

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  934 LMEKKLKrnytLLLESCEQEKQALLQNLKEVEDKASAYeDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREAS 1013
Cdd:TIGR04523  465 SLETQLK----VLSRSINKIKQNLEQKQKELKSKEKEL-KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESK 539

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1014 IRQLAQHVQSLHDE--RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL 1091
Cdd:TIGR04523  540 ISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKEL 619

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081931 1092 KHIKETHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEKE 1134
Cdd:TIGR04523  620 EKAKKENEKLSSIIK-NIKSKKNKLKQEVKQIKETIKEIRNKW 661

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1797-2230

1.31e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 50.71 E-value: 1.31e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1797 QKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAEN--KHSESMFALQGRYEEEIRCM 1874
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEleEAEEALLERLERLEEELEEL 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1875 VEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKmLEDRFQLKVRELQAVHQEELRALQEHyiWSLRGALSLY 1954
Cdd:COG1196    427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE-LLEEAALLEAALAELLEELAEAAARL--LLLLEAEADY 503

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1955 QPSHPDSSLAPGPSEPRAVPAAKDEAesMSGLRERIQELEAQMGVMREELGHKELEgdVAALQEKYQRDFESLKAT---- 2030
Cdd:COG1196    504 EGFLEGVKAALLLAGLRGLAGAVAVL--IGVEAAYEAALEAALAAALQNIVVEDDE--VAAAAIEYLKAAKAGRATflpl 579

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2031 --CERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMERELEKSQRSQISSI 2105
Cdd:COG1196    580 dkIRARAALAAALARGAIGAAVDLVASDLREADARYYVLgdtLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2106 NSDIEALRRQYLEELQSVQRELEVLSEQ--YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRL 2183
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERlaEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 2184 RTLltgdgggESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSL 2230
Cdd:COG1196    740 ELL-------EEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779

PH1_PLEKHH1_PLEKHH2

cd13282

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 ...

429-517

1.45e-05

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 (PLEKHH1) PH domain, repeat 1; PLEKHH1 and PLEKHH2 (also called PLEKHH1L) are thought to function in phospholipid binding and signal transduction. There are 3 Human PLEKHH genes: PLEKHH1, PLEKHH2, and PLEKHH3. There are many isoforms, the longest of which contain a FERM domain, a MyTH4 domain, two PH domains, a peroximal domain, a vacuolar domain, and a coiled coil stretch. The FERM domain has a cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241436 Cd Length: 96 Bit Score: 45.75 E-value: 1.45e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRD--SVAEEAAdldGEINLSTCYDVTeyPVQRNYGFQIHTKEGEFTLS 503
Cdd:cd13282      1 KAGYLTKL--GGKvktWKRRWFVLKNGELFYYKSpnDVIRKPQ---GQIALDGSCEIA--RAEGAQTFEIVTEKRTYYLT 73

                           90
                   ....*....|....
gi 1907081931  504 AMTSGIRRNWIQTI 517
Cdd:cd13282     74 ADSENDLDEWIRVI 87

COG1340

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

793-1104

1.47e-05

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 49.52 E-value: 1.47e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  793 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtrlgNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhskamLSGQ 872
Cdd:COG1340      4 DELSSSLEELEEKIEELREEIEELKEKRDELN----EELKELAEKRDELNAQVKELREEAQELREKRDE-------LNEK 72

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  873 LRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS----EQAQRLMEK--KLKRNYTLL 946
Cdd:COG1340     73 VKELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEvlspEEEKELVEKikELEKELEKA 152

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  947 LESCEQEKQallqnLKEVEDKASAYEDQLQGHVQQVEALQKEklsetckgSEQVHKleeeleareaSIRQLAQHVQSLHD 1026
Cdd:COG1340    153 KKALEKNEK-----LKELRAELKELRKEAEEIHKKIKELAEE--------AQELHE----------EMIELYKEADELRK 209

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 1027 ERDLIKHQFQELMERvatsdgdVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRtllreKEEELKHIKETHERVLEK 1104
Cdd:COG1340    210 EADELHKEIVEAQEK-------ADELHEEIIELQKELRELRKELKKLRKKQRALK-----REKEKEELEEKAEEIFEK 275

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1819-2288

2.18e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 50.12 E-value: 2.18e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1819 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1890
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1891 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1970
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1971 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2027
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2028 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2096
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2097 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2170
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2171 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2234
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662

                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931 2235 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2288
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

mukB

PRK04863

chromosome partition protein MukB;

702-1042

2.36e-05

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 49.96 E-value: 2.36e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  702 LEDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQtevatspsgawQRLHRVNQDL 781
Cdd:PRK04863   289 LELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ-----------EKIERYQADL 357

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  782 QsELEAQCRRQELITQQIQTLKHSYGEAKDAIrhhEAEIQTLQTRLGNAAAELAIKE----------QALAKLK---GEL 848
Cdd:PRK04863   358 E-ELEERLEEQNEVVEEADEQQEENEARAEAA---EEEVDELKSQLADYQQALDVQQtraiqyqqavQALERAKqlcGLP 433

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  849 KMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEA--RLLEKTQEL-----------------RDLETQQALQRDRQK 909
Cdd:PRK04863   434 DLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahSQFEQAYQLvrkiagevsrseawdvaRELLRRLREQRHLAE 513

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  910 EVQRLQECIAELSQQLGTSEQAQRLM---EKKLKRNYTL--LLESCEQEKQALLQNLKE----VEDKASAYEDQLQGHVQ 980
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLaefCKRLGKNLDDedELEQLQEELEARLESLSEsvseARERRMALRQQLEQLQA 593

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931  981 QVEALQK---------EKLSETCkgsEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERV 1042
Cdd:PRK04863   594 RIQRLAArapawlaaqDALARLR---EQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEI 661

PRK02224

DNA double-strand break repair Rad50 ATPase;

703-1135

2.70e-05

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 49.65 E-value: 2.70e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  703 EDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYvlqTEVATSPSGAWQRLHRVNQDLQ 782
Cdd:PRK02224   293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEELREEAAELE 369

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  783 SELEAqCRRQelitqqiqtlkhsYGEAKDAIRHHEAEIQTLQTRLGNAAAELaikeQALAKLKGELKMEQGKVREQLEEw 862
Cdd:PRK02224   370 SELEE-AREA-------------VEDRREEIEELEEEIEELRERFGDAPVDL----GNAEDFLEELREERDELREREAE- 430

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  863 qhskamLSGQLRASEQKLRSTEaRLLEK------TQELRDLETQQALQRDRQKevqrlqecIAELSQQLGTSEQAQRLME 936
Cdd:PRK02224   431 ------LEATLRTARERVEEAE-ALLEAgkcpecGQPVEGSPHVETIEEDRER--------VEELEAELEDLEEEVEEVE 495

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  937 KKLKRNYTllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQvhklEEELEAREASIRQ 1016
Cdd:PRK02224   496 ERLERAED--LVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEK----REAAAEAEEEAEE 569

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1017 LAQHVQSLHDERDLIKHQFQELmERVATSDGDVAELQ---EKLRGKEVDYQNLE-HSHHRVSVQLQSVRTLLREKE---- 1088
Cdd:PRK02224   570 AREEVAELNSKLAELKERIESL-ERIRTLLAAIADAEdeiERLREKREALAELNdERRERLAEKRERKRELEAEFDeari 648

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 1089 EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1135
Cdd:PRK02224   649 EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

885-1135

2.76e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 2.76e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  885 ARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytlllescEQEKQALLQNLKEV 964
Cdd:COG4913    194 LRLLHKTQSFKPIGDLDDFVREYMLEEPDTFEAADALVEHFDDLERAHEALED-------------AREQIELLEPIREL 260

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  965 EDKASAYEDQLQGHVQQVEALQKEKlsetckGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVAT 1044
Cdd:COG4913    261 AERYAAARERLAELEYLRAALRLWF------AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRG 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1045 SDGD-VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSL 1123
Cdd:COG4913    335 NGGDrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414

                          250
                   ....*....|..
gi 1907081931 1124 EETEIKLQEKEE 1135
Cdd:COG4913    415 RDLRRELRELEA 426

PH_SWAP-70

cd13273

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called ...

429-517

2.93e-05

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called Differentially expressed in FDCP 6/DEF-6 or IRF4-binding protein) functions in cellular signal transduction pathways (in conjunction with Rac), regulates cell motility through actin rearrangement, and contributes to the transformation and invasion activity of mouse embryo fibroblasts. Metazoan SWAP-70 is found in B lymphocytes, mast cells, and in a variety of organs. Metazoan SWAP-70 contains an N-terminal EF-hand motif, a centrally located PH domain, and a C-terminal coiled-coil domain. The PH domain of Metazoan SWAP-70 contains a phosphoinositide-binding site and a nuclear localization signal (NLS), which localize SWAP-70 to the plasma membrane and nucleus, respectively. The NLS is a sequence of four Lys residues located at the N-terminus of the C-terminal a-helix; this is a unique characteristic of the Metazoan SWAP-70 PH domain. The SWAP-70 PH domain binds PtdIns(3,4,5)P3 and PtdIns(4,5)P2 embedded in lipid bilayer vesicles. There are additional plant SWAP70 proteins, but these are not included in this hierarchy. Rice SWAP70 (OsSWAP70) exhibits GEF activity toward the its Rho GTPase, OsRac1, and regulates chitin-induced production of reactive oxygen species and defense gene expression in rice. Arabidopsis SWAP70 (AtSWAP70) plays a role in both PAMP- and effector-triggered immunity. Plant SWAP70 contains both DH and PH domains, but their arrangement is the reverse of that in typical DH-PH-type Rho GEFs, wherein the DH domain is flanked by a C-terminal PH domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270092 Cd Length: 110 Bit Score: 45.36 E-value: 2.93e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYrdsVAEEAADLDGEINL--STCYDVTEYPVQRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13273     10 KKGYLWKKgHLLPTWTERWFVLKPNSLSYY---KSEDLKEKKGEIALdsNCCVESLPDREGKKCRFLVKTPDKTYELSAS 86

                           90
                   ....*....|..
gi 1907081931  506 TSGIRRNWIQTI 517
Cdd:cd13273     87 DHKTRQEWIAAI 98

PH_Osh1p_Osh2p_yeast

cd13292

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p ...

430-521

3.02e-05

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p is proposed to function in postsynthetic sterol regulation, piecemeal microautophagy of the nucleus, and cell polarity establishment. Yeast Osh2p is proposed to function in sterol metabolism and cell polarity establishment. Both Osh1p and Osh2p contain 3 N-terminal ankyrin repeats, a PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBP andOsh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. In general OSBPs and ORPs have been found to be involved in the transport and metabolism of cholesterol and related lipids in eukaryotes. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241446 Cd Length: 103 Bit Score: 44.99 E-value: 3.02e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  430 KGWLTK--QYEDGqWKKHWFVLADQSLRYYRDSVAEEAAdLDGEINLSTCYDVteYPVQRNYGFQIHTKEG---EFTLSA 504
Cdd:cd13292      5 KGYLKKwtNYAKG-YKTRWFVLEDGVLSYYRHQDDEGSA-CRGSINMKNARLV--SDPSEKLRFEVSSKTSgspKWYLKA 80

                           90
                   ....*....|....*..
gi 1907081931  505 MTSGIRRNWIQTIMKHV 521
Cdd:cd13292     81 NHPVEAARWIQALQKAI 97

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

718-931

3.24e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.61 E-value: 3.24e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLrvalgrEQSAREGYVLQTEVATspsgAWQRLHRVNQDLQSELEAQCRRQELITQ 797
Cdd:COG4942     39 LEKELAALKKEEKALLKQLAALERRI------AALARRIRALEQELAA----LEAELAELEKEIAELRAELEAQKEELAE 108

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTL----KHSY-------GEAKDAIRHHEAeIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSK 866
Cdd:COG4942    109 LLRALyrlgRQPPlalllspEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEER 187

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081931  867 AMLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQLGTSEQA 931
Cdd:COG4942    188 AALEALKAERQKLLARLEKELAELAAELAEL----------QQEAEELEALIARLEAEAAAAAER 242

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

683-1149

3.48e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 49.20 E-value: 3.48e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  683 ETTPLREEkQVPIAPLHLSLEDRSERLSTHELTSLLEKELEQ-----SQKEASDLLEQNRLLQDQLRVALGREQSAREGY 757
Cdd:TIGR00618  401 ELDILQRE-QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  758 VLQTEVATSPSGAWQRLHRVnQDLQSELEAQCRRQELITQQIQTL------------KHSY-GEAKDAIRHHEAEIQTLQ 824
Cdd:TIGR00618  480 QIHLQETRKKAVVLARLLEL-QEEPCPLCGSCIHPNPARQDIDNPgpltrrmqrgeqTYAQlETSEEDVYHQLTSERKQR 558

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  825 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgqlraseqklRSTEARLLEKTQELRDLETQQALQ 904
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLS----------EAEDMLACEQHALLRKLQPEQDLQ 628

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  905 RDRQKEvqrlQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQN-LKEVEDKASAYEDQLQGHVQQVE 983
Cdd:TIGR00618  629 DVRLHL----QQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLaLQKMQSEKEQLTYWKEMLAQCQT 704

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  984 ALQKEKLSETcKGSEQVHKLEEELEAREASIR-QLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVD 1062
Cdd:TIGR00618  705 LLRELETHIE-EYDREFNEIENASSSLGSDLAaREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHL 783

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1063 YQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFVS 1142
Cdd:TIGR00618  784 AAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQ 863


                   ....*..
gi 1907081931 1143 DSPKDAK 1149
Cdd:TIGR00618  864 LTQEQAK 870

PH_Gab-like

cd13324

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...

431-517

4.10e-05

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270133 Cd Length: 112 Bit Score: 44.71 E-value: 4.10e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  431 GWLTK-----QYEDGQWKKHWFVL------ADQS-LRYYRDsvaEEAADLDGEINLSTCYDVT-----EYPVQRN-YGFQ 492
Cdd:cd13324      5 GWLTKsppekKIWRAAWRRRWFVLrsgrlsGGQDvLEYYTD---DHCKKLKGIIDLDQCEQVDagltfEKKKFKNqFIFD 81

                           90       100
                   ....*....|....*....|....*
gi 1907081931  493 IHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd13324     82 IRTPKRTYYLVAETEEEMNKWVRCI 106

HCR

pfam07111

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...

677-1132

6.16e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.

Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 48.21 E-value: 6.16e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  677 QRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLE-KELEQSQKEASDLLEQNRLLQDQLRVALGREQSARE 755
Cdd:pfam07111  146 QRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLETKRAGEaKQLAEAQKEAELLRKQLSKTQEELEAQVTLVESLRK 225

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  756 gYVLQTEVATSPSGAWqrlhrvnqdlqsELEaqcrRQELItQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELA 835
Cdd:pfam07111  226 -YVGEQVPPEVHSQTW------------ELE----RQELL-DTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELT 287

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  836 IKEQALAKLKGELKMeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA-----LQR---DR 907
Cdd:pfam07111  288 RKIQPSDSLEPEFPK---KCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQVAELQEQVTSQSqeqaiLQRalqDK 364

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  908 QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 981
Cdd:pfam07111  365 AAEVEVERMSAKGLQMELSRAQEARRRQQQQTASAEEQLkfvvnaMSSTQIWLETTMTRVEQAVARIPSLSNRLSYAVRK 444

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  982 V---EALQKEKLS------ETCKGSEqvhKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERvATSDGDVAEL 1052
Cdd:pfam07111  445 VhtiKGLMARKVAlaqlrqESCPPPP---PAPPVDADLSLELEQLREERNRLDAELQLSAHLIQQEVGR-AREQGEAERQ 520

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 Q--EKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKM-IALGSSLEETEIK 1129
Cdd:pfam07111  521 QlsEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELTQQQEIYGQALQEKVAEVeTRLREQLSDTKRR 600


                   ...
gi 1907081931 1130 LQE 1132
Cdd:pfam07111  601 LNE 603

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

1787-2271

6.84e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 48.18 E-value: 6.84e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1787 ACQEAKGASGQKRAQAVGALKEEYEELLHKQKsEYQKVITLIEKENTELKAKVSqmdhqqrclqEAENKHSESMFALqgr 1866
Cdd:pfam05483  198 AFEELRVQAENARLEMHFKLKEDHEKIQHLEE-EYKKEINDKEKQVSLLLIQIT----------EKENKMKDLTFLL--- 263

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1867 yeEEIRCMVEQLSHtENTLQAERSRVLSQ----LDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEH 1942
Cdd:pfam05483  264 --EESRDKANQLEE-KTKLQDENLKELIEkkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEEL 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1943 YIWSLRGALSLYQPSHPDSSLAPG-PSEPRAVPAAKD-------EAESMSGLRERIQELEAQMGVMREELgHKELEGDVA 2014
Cdd:pfam05483  341 NKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKNEDqlkiitmELQKKSSELEEMTKFKNNKEVELEEL-KKILAEDEK 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2015 ALQEKYQRD--FESLKATcERGFAAMEETHQKKIEDLQRQ----------HQRELEKLREE--KDRLLAEETAATISAIE 2080
Cdd:pfam05483  420 LLDEKKQFEkiAEELKGK-EQELIFLLQAREKEIHDLEIQltaiktseehYLKEVEDLKTEleKEKLKNIELTAHCDKLL 498

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2081 AMKNAHREE---MERELEKSQRSQISSINSDIEALRR-QYLEELQSVQR-ELEVLSEQYSQ-----KCLENAHLAQALEA 2150
Cdd:pfam05483  499 LENKELTQEasdMTLELKKHQEDIINCKKQEERMLKQiENLEEKEMNLRdELESVREEFIQkgdevKCKLDKSEENARSI 578

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2151 ERQALRQCQRENQELNAHNQ-----ELNNRLAAEITRLRTLLTGDGGGESTGLpltqgkDAYELEVllRVKESEIQYLKQ 2225
Cdd:pfam05483  579 EYEVLKKEKQMKILENKCNNlkkqiENKNKNIEELHQENKALKKKGSAENKQL------NAYEIKV--NKLELELASAKQ 650

                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931 2226 EISSLKDELQTALRDKKYASDKykdIYTELSIAKAKADCDISRLKE 2271
Cdd:pfam05483  651 KFEEIIDNYQKEIEDKKISEEK---LLEEVEKAKAIADEAVKLQKE 693

PH_evt

cd13265

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also ...

428-480

7.00e-05

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also called pleckstrin homology domain containing, family B): evt-1 (also called PLEKHB1) and evt-2 (also called PLEKHB2). evt-1 is specific to the nervous system, where it is expressed in photoreceptors and myelinating glia. evt-2 is widely expressed in both neural and nonneural tissues. Evectins possess a single N-terminal PH domain and a C-terminal hydrophobic region. evt-1 is thought to function as a mediator of post-Golgi trafficking in cells that produce large membrane-rich organelles. It is a candidate gene for the inherited human retinopathy autosomal dominant familial exudative vitreoretinopathy and a susceptibility gene for multiple sclerosis. evt-2 is essential for retrograde endosomal membrane transport from the plasma membrane (PM) to the Golgi. Two membrane trafficking pathways pass through recycling endosomes: a recycling pathway and a retrograde pathway that links the PM to the Golgi/ER. Its PH domain that is unique in that it specifically recognizes phosphatidylserine (PS), but not polyphosphoinositides. PS is an anionic phospholipid class in eukaryotic biomembranes, is highly enriched in the PM, and plays key roles in various physiological processes such as the coagulation cascade, recruitment and activation of signaling molecules, and clearance of apoptotic cells. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270085 Cd Length: 108 Bit Score: 44.22 E-value: 7.00e-05

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  428 FKKGWLTKQYE-DGQWKKHWFVL-ADQSLRYYRDsvaEEAADLDGEINL-STCYDV 480
Cdd:cd13265      4 VKSGWLLRQSTiLKRWKKNWFVLyGDGNLVYYED---ETRREVEGRINMpRECRNI 56

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

1963-2193

7.08e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 47.45 E-value: 7.08e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1963 LAPGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLKATcERGFAAMEeth 2042
Cdd:COG4942     10 LLALAAAAQADAAAEAEAE-LEQLQQEIAELEKELAALKKE--EKALLKQLAALERRIAALARRIRAL-EQELAALE--- 82

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2043 qKKIEDLQRQH---QRELEKLREEKDRLLA--------EETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEA 2111
Cdd:COG4942     83 -AELAELEKEIaelRAELEAQKEELAELLRalyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2112 LRR------QYLEELQSVQRELE----VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEIT 2181
Cdd:COG4942    162 LAAlraeleAERAELEALLAELEeeraALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241

                          250
                   ....*....|..
gi 1907081931 2182 RLRTLLTGDGGG 2193
Cdd:COG4942    242 RTPAAGFAALKG 253

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

895-1143

8.14e-05

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.13 E-value: 8.14e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  895 RDLETQQALqRDRQKEVQRLQECIAELSQQLGT----SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASA 970
Cdd:TIGR02168  173 RRKETERKL-ERTRENLDRLEDILNELERQLKSlerqAEKAERYKELKAELR--------ELELALLVLRLEELREELEE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  971 YEDQLQGHVQQVEALQKEKlsetckgseqvhkleeelEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVA 1050
Cdd:TIGR02168  244 LQEELKEAEEELEELTAEL------------------QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1051 ELQEKLRgkevdyqNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHErVLEKKDQDLNEALVKMIALgssLEETEIKL 1130
Cdd:TIGR02168  306 ILRERLA-------NLERQLEELEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAE---LEELESRL 374

                          250
                   ....*....|...
gi 1907081931 1131 QEKEECLRRFVSD 1143
Cdd:TIGR02168  375 EELEEQLETLRSK 387

PTZ00121

MAEBL; Provisional

1776-2180

9.29e-05

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 48.21 E-value: 9.29e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1852
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1853 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1932
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2012
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2092
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2172
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773


                   ....*...
gi 1907081931 2173 NNRLAAEI 2180
Cdd:PTZ00121  1774 RKEKEAVI 1781

rad50

TIGR00606

rad50; All proteins in this family for which functions are known are involvedin recombination, ...

688-1137

9.50e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).

Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.12 E-value: 9.50e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  688 REEKQVPIAPLHLSLEDRSERlSTHELTSLLEKELEQSQKEASDLLEQNRLLQdqlrVALGREQSARegyVLQTEVAtsp 767
Cdd:TIGR00606  724 RRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQKVNRDIQRLKNDIEEQETLLG----TIMPEEESAK---VCLTDVT--- 792

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  768 sgawqrlhrVNQDLQSELEAQCRRqelITQQIQTLKHSygeakDAIRHHEAEIQTLQTrlgnaaaelaiKEQALAKLKGE 847
Cdd:TIGR00606  793 ---------IMERFQMELKDVERK---IAQQAAKLQGS-----DLDRTVQQVNQEKQE-----------KQHELDTVVSK 844

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  848 LKMEQGKVREQLEEWQHskamlsgqLRASEQKLRStearllEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGT 927
Cdd:TIGR00606  845 IELNRKLIQDQQEQIQH--------LKSKTNELKS------EKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  928 SEQAQRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQ-VEALQKEKLSETCKGSEQVHKLE 1004
Cdd:TIGR00606  911 DSPLETFLEKDQQEKEELISSKETSNKKAQdkVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECE 990

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1005 EELEAREASIRQLAQHVQSLHDERDLIKHQF---------QELMERVATSDGDVAELQekLRGKEVDYQNLEHSHHRVSV 1075
Cdd:TIGR00606  991 KHQEKINEDMRLMRQDIDTQKIQERWLQDNLtlrkrenelKEVEEELKQHLKEMGQMQ--VLQMKQEHQKLEENIDLIKR 1068

                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1076 QLQSVRTLLREKEEELKHIKethERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECL 1137
Cdd:TIGR00606 1069 NHVLALGRQKGYEKEIKHFK---KELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL 1127

PRK03918

DNA double-strand break repair ATPase Rad50;

1805-2286

1.00e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 1.00e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1805 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1882
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1883 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 1960
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1961 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2040
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2041 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2107
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2108 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2187
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2188 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2253
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612

                          490       500       510
                   ....*....|....*....|....*....|...
gi 1907081931 2254 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2286
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1981-2246

1.00e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 47.76 E-value: 1.00e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1981 ESMSGLRERIQELEAQMGVMREELGH------KELEGDVAALQEKYqRDFESLKATCERGFAAMEEtHQKKIEDLQRQHQ 2054
Cdd:TIGR02169  251 EELEKLTEEISELEKRLEEIEQLLEElnkkikDLGEEEQLRVKEKI-GELEAEIASLERSIAEKER-ELEDAEERLAKLE 328

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMEReleksqRSQISSINSDIEALRR---QYLEELQSVQRELE 2128
Cdd:TIGR02169  329 AEIDKLLAEIEELereIEEERKRRDKLTEEYAELKEELEDL------RAELEEVDKEFAETRDelkDYREKLEKLKREIN 402

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2129 VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTlltgdgggestglpLTQGKDAYE 2208
Cdd:TIGR02169  403 ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--------------LAADLSKYE 468

                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907081931 2209 LEvlLRVKESEIQYLKQEISSLKDELQTALRDKKYASD 2246
Cdd:TIGR02169  469 QE--LYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504

DUF5401

pfam17380

Family of unknown function (DUF5401); This is a family of unknown function found in ...

1988-2164

1.17e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.

Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 47.43 E-value: 1.17e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1988 ERIQELEA-QMGVMRE-ELGHKELEG--DVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKLREE 2063
Cdd:pfam17380  375 SRMRELERlQMERQQKnERVRQELEAarKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE 454

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2064 KdrLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISS---INSDIEALRRQYLEELQS---VQRELE-----VLSE 2132
Cdd:pfam17380  455 E--QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRrkiLEKELEERKQAMIEEERKrklLEKEMEerqkaIYEE 532

                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907081931 2133 QYSQKCLENAHLAQALEAERQALRQCQRENQE 2164
Cdd:pfam17380  533 ERRREAEEERRKQQEMEERRRIQEQMRKATEE 564

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

779-993

1.35e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 46.75 E-value: 1.35e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgelkmeqgkvREQ 858
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-----------REE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  859 LEEWQHSKAMLSGQLRASEQKLRSTE-ARLLEKTQELRDL-ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLME 936
Cdd:COG3883     88 LGERARALYRSGGSVSYLDVLLGSESfSDFLDRLSALSKIaDADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931  937 KKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 993
Cdd:COG3883    168 AAKAE-----LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAA 219

PRK03918

DNA double-strand break repair ATPase Rad50;

702-1139

1.40e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.37 E-value: 1.40e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  702 LEDRSERLSTheltslLEKELEQSQKEASDLLEQNRLLQD--QLRVALGREQSAREGYVLQtevatspsgawqrlhrvnq 779
Cdd:PRK03918   333 LEEKEERLEE------LKKKLKELEKRLEELEERHELYEEakAKKEELERLKKRLTGLTPE------------------- 387

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  780 DLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNA---AAELAikEQALAKLKGELKMEQGKVR 856
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcGRELT--EEHRKELLEEYTAELKRIE 465

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  857 EQLEEWQHSKAMLSGQLRASEQKLR--STEARLLEKTQELRDLETQqaLQRDRQKEVQRLQECIAELSQQLGTSEQAQRL 934
Cdd:PRK03918   466 KELKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEK--LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKS 543

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  935 MEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEalqkEKLSETCKGSEQVHKLEEELEAREASI 1014
Cdd:PRK03918   544 LKKELEK-----LEELKKKLAELEKKLDELEEELAELLKELEELGFESV----EELEERLKELEPFYNEYLELKDAEKEL 614

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1015 RQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL-----QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE 1089
Cdd:PRK03918   615 EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAELEELEKRREEIKK 694

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1090 ELKHIKETHERVLEKKD--QDLNEALVKMIALGSSLEetEIKLQEKEECLRR 1139
Cdd:PRK03918   695 TLEKLKEELEEREKAKKelEKLEKALERVEELREKVK--KYKALLKERALSK 744

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

1986-2184

1.42e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 47.32 E-value: 1.42e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1986 LRERIQELEAQMGVMREELghKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQhqreLEKLREEKD 2065
Cdd:COG3206    173 ARKALEFLEEQLPELRKEL--EEAEAALEEFRQKNG----------LVDLSEEAKLLLQQLSELESQ----LAEARAELA 236

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2066 RLLAEETAATiSAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEE---LQSVQRELEVLSEQYSQkclENA 2142
Cdd:COG3206    237 EAEARLAALR-AQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQ---EAQ 312

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907081931 2143 HLAQALEAERQALRQCQRE-NQELNAHNQELN--NRLAAEITRLR 2184
Cdd:COG3206    313 RILASLEAELEALQAREASlQAQLAQLEARLAelPELEAELRRLE 357

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

878-1081

1.52e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.22 E-value: 1.52e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQ-QLGTSEQAQRLMEKKLKRNyTLLLESCEQEKQA 956
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAAlRLWFAQRRLELLEAELEEL-RAELARLEAELER 313

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  957 LLQNLKEVEDKASAYEDQLQGH-VQQVEALQKEklsetckgseqVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF 1035
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNgGDRLEQLERE-----------IERLERELEERERRRARLEALLAALGLPLPASAEEF 382

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1036 QEL----MERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVR 1081
Cdd:COG4913    383 AALraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432

DUF3584

pfam12128

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...

1869-2281

1.62e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.

Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 47.14 E-value: 1.62e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1869 EEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHyIWSLR 1948
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLESAELR-LSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGE-LSAAD 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1949 GALSLYQpSHPDSSlapgpsEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLK 2028
Cdd:pfam12128  315 AAVAKDR-SELEAL------EDQHGAFLDADIETAAADQEQLPSWQSELENLEER--LKALTGKHQDVTAKYNRRRSKIK 385

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2029 ATCERGFAAMEEthqkkiedlqrqhqrELEKLREEKDRLLAEETAatisAIEAMKNAHREEME------RELEKSQRSQI 2102
Cdd:pfam12128  386 EQNNRDIAGIKD---------------KLAKIREARDRQLAVAED----DLQALESELREQLEagklefNEEEYRLKSRL 446

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2103 SSIN---------SDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELN 2173
Cdd:pfam12128  447 GELKlrlnqatatPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELE 526

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2174 NRLAAEITRLRTLLTGDGGG--ESTG----------------LPLTQGKDAYEL-EVLLRVKESEIQ---YLKQEISSLK 2231
Cdd:pfam12128  527 LQLFPQAGTLLHFLRKEAPDweQSIGkvispellhrtdldpeVWDGSVGGELNLyGVKLDLKRIDVPewaASEEELRERL 606

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2232 DELQTALRDkkyASDKYKDIYTELSIAKA---KADCDISRLKEQLKAATEALG 2281
Cdd:pfam12128  607 DKAEEALQS---AREKQAAAEEQLVQANGeleKASREETFARTALKNARLDLR 656

PRK03918

DNA double-strand break repair ATPase Rad50;

1982-2339

1.66e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.98 E-value: 1.66e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1982 SMSGLRERIQELEAQMGVMREELghKELEGDVAALQE------------KYQRDFESLKATCERGFAAMEETH---QKKI 2046
Cdd:PRK03918   253 SKRKLEEKIRELEERIEELKKEI--EELEEKVKELKElkekaeeyiklsEFYEEYLDELREIEKRLSRLEEEIngiEERI 330

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2047 EDLQRQHQR--ELEKLREEKDRLLA--EETAATISAIEAMKnahrEEMERELEKSQRSQISSINSDIEALRRQYLEelqs 2122
Cdd:PRK03918   331 KELEEKEERleELKKKLKELEKRLEelEERHELYEEAKAKK----EELERLKKRLTGLTPEKLEKELEELEKAKEE---- 402

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2123 VQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNA-HNQELNNRLAAEITRLRTLLTGDGGGESTGLplt 2201
Cdd:PRK03918   403 IEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEeHRKELLEEYTAELKRIEKELKEIEEKERKLR--- 479

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2202 qgKDAYELEVLLRvKESEIQYLKQ---EISSLKDELqtalrdKKYASDKYKDIYTELSIAKAKAD---CDISRLKEQLKA 2275
Cdd:PRK03918   480 --KELRELEKVLK-KESELIKLKElaeQLKELEEKL------KKYNLEELEKKAEEYEKLKEKLIklkGEIKSLKKELEK 550

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2276 ATEALGEKSpegttvsgydimKSKSNPDFLKKDRSCVTRQLRNIRSKSLKEgltVQERLKLFES 2339
Cdd:PRK03918   551 LEELKKKLA------------ELEKKLDELEEELAELLKELEELGFESVEE---LEERLKELEP 599

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

773-947

1.80e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 46.68 E-value: 1.80e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  773 RLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKE--QALAKLKGELKM 850
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAE 143

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  851 EQGKVRE------QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE-----LRDLETQQALQRDRQKEVQRLQECIA 919
Cdd:COG4717    144 LPERLEEleerleELRELEEELEELEAELAELQEELEELLEQLSLATEEelqdlAEELEELQQRLAELEEELEEAQEELE 223

                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081931  920 ELSQQLGTSEQAQRL--MEKKLKRNYTLLL 947
Cdd:COG4717    224 ELEEELEQLENELEAaaLEERLKEARLLLL 253

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1975-2132

1.84e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 46.83 E-value: 1.84e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAES-MSGLRERIQELEAQM---GVMREElghkELEGDVAALQEKY---QRDFESLKATCER-GFAAmeETHQKKI 2046
Cdd:COG4913    309 AELERLEArLDALREELDELEAQIrgnGGDRLE----QLEREIERLERELeerERRRARLEALLAAlGLPL--PASAEEF 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2047 EDLQRQHQRELEKLREEKDRLLAEETAAtISAIEAMKNAHREeMERELEkSQRSQISSINSDIEALRRQYLEELQSVQRE 2126
Cdd:COG4913    383 AALRAEAAALLEALEEELEALEEALAEA-EAALRDLRRELRE-LEAEIA-SLERRKSNIPARLLALRDALAEALGLDEAE 459


                   ....*.
gi 1907081931 2127 LEVLSE 2132
Cdd:COG4913    460 LPFVGE 465

PH_Boi

cd13316

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally ...

430-519

2.15e-04

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally redundant and important for cell growth with Boi mutants displaying defects in bud formation and in the maintenance of cell polarity.They appear to be linked to Rho-type GTPase, Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid interactions with the GTP-bound ("active") form of Cdc42, while Rho3 can suppress of the lethality caused by deletion of Boi1 and Boi2. These findings suggest that Boi1 and Boi2 are targets of Cdc42 that promote cell growth in a manner that is regulated by Rho3. Boi proteins contain a N-terminal SH3 domain, followed by a SAM (sterile alpha motif) domain, a proline-rich region, which mediates binding to the second SH3 domain of Bem1, and C-terminal PH domain. The PH domain is essential for its function in cell growth and is important for localization to the bud, while the SH3 domain is needed for localization to the neck. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270126 Cd Length: 97 Bit Score: 42.36 E-value: 2.15e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  430 KGWLTKQYED-GQWKKHWFVLADQSLRYYRdsvAEEAADLDGEINLsTCYDVT----EYPVQRNYGFQI--HTKEGEFTL 502
Cdd:cd13316      3 SGWMKKRGERyGTWKTRYFVLKGTRLYYLK---SENDDKEKGLIDL-TGHRVVpddsNSPFRGSYGFKLvpPAVPKVHYF 78

                           90
                   ....*....|....*..
gi 1907081931  503 SAMTSGIRRNWIQTIMK 519
Cdd:cd13316     79 AVDEKEELREWMKALMK 95

PRK03918

DNA double-strand break repair ATPase Rad50;

2055-2322

2.51e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.60 E-value: 2.51e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEKDRLLAEEtaatiSAIEAMKnahrEEMERELEKSQRsQISSINSDIEALRRQyLEELQSVQRELEVLSEQY 2134
Cdd:PRK03918   172 KEIKRRIERLEKFIKRT-----ENIEELI----KEKEKELEEVLR-EINEISSELPELREE-LEKLEKEVKELEELKEEI 240

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2135 SQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRlAAEITRLRTLltgdgggestglpltqgKDAY-ELEVLL 2213
Cdd:PRK03918   241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK-VKELKELKEK-----------------AEEYiKLSEFY 302

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2214 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIyTELSIAKAKADCDISRLKEQLKAATEA---LGEKSPEGTTV 2290
Cdd:PRK03918   303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEKRLEELEERHELYEEAkakKEELERLKKRL 381

                          250       260       270
                   ....*....|....*....|....*....|..
gi 1907081931 2291 SGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSK 2322
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITAR 413

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

674-976

2.64e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.60 E-value: 2.64e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  674 EIEQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLEK--ELEQSQKEASDLLEQNRLLQDQLRVALGREQ 751
Cdd:TIGR02169  699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDlsSLEQEIENVKSELKELEARIEELEEDLHKLE 778

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  752 SAREGyvLQTEVATSPsgaWQRLhrvnQDLQSELEAQCRRQELITQQIQ------TLKHSYgeAKDAIRHHEAEIQTLQT 825
Cdd:TIGR02169  779 EALND--LEARLSHSR---IPEI----QAELSKLEEEVSRIEARLREIEqklnrlTLEKEY--LEKEIQELQEQRIDLKE 847

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  826 RLGNAAAELAIKEQALAKLKGELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA 902
Cdd:TIGR02169  848 QIKSIEKEIENLNGKKEELEEELEELEAALRDlesRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  903 LQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQ 976
Cdd:TIGR02169  928 ALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEE-----IRALEPVNMLAIQEYEEVLKRLDELKEKRA 996

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

718-1139

2.80e-04

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 46.50 E-value: 2.80e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSA-------REGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCR 790
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLkqlrariEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  791 RQ-ELITQQIQTLKHSYGEAKDAIRHHEAEIQTL--QTRLGN---AAAELAIKEQALAKLKGELKMEQGKVREQLEEWQH 864
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIeeQRRLLQtlhSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQ 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  865 SKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 941
Cdd:TIGR00618  387 QKTTLTQKLQSLCKeldILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  942 NYTLLLEScEQEKQALLQNLKEVEDKASAYEDQLQG------------HVQQVEALQKEKLSETCKGSEQVHKLEEELEA 1009
Cdd:TIGR00618  467 SLKEREQQ-LQTKEQIHLQETRKKAVVLARLLELQEepcplcgscihpNPARQDIDNPGPLTRRMQRGEQTYAQLETSEE 545

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1010 REASIRQ-LAQHVQSLHDERDLIKHQFQELmervATSDGDVAELQEKLRGKEVDYQNL--EHSHHRVSVQLQSVRTLLRE 1086
Cdd:TIGR00618  546 DVYHQLTsERKQRASLKEQMQEIQQSFSIL----TQCDNRSKEDIPNLQNITVRLQDLteKLSEAEDMLACEQHALLRKL 621

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 1087 KEEELKHIKETHERVLEKKDQDLNEALVKmialgsslEETEIKLQEKEECLRR 1139
Cdd:TIGR00618  622 QPEQDLQDVRLHLQQCSQELALKLTALHA--------LQLTLTQERVREHALS 666

HEC1

COG5185

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...

1975-2271

3.13e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 46.10 E-value: 3.13e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCE-----RGFAAMEETHQKKIEDL 2049
Cdd:COG5185    269 KLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEAEQEleeskRETETGIQNLTAEIEQG 348

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRLLAEETAATisaieamknahREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEV 2129
Cdd:COG5185    349 QESLTENLEAIKEEIENIVGEVELSK-----------SSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2130 LSEQysqkclenahlaqaLEAERQALRQCQRENQElnahNQELNNRLAAEITRLRTLLTGDGggeSTGLPLTQGKDAYEL 2209
Cdd:COG5185    418 ADRQ--------------IEELQRQIEQATSSNEE----VSKLLNELISELNKVMREADEES---QSRLEEAYDEINRSV 476

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQT--ALRDKKYASDKYKDIYTELSIAKAKADCDISRLKE 2271
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLKATLEKlrAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILA 540

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1841-2259

3.13e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 46.26 E-value: 3.13e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1841 QMDHQQRCLQEAENKHSESMFAlqgRYEEEIRCMVEQLSHTeNTLQAERSRVLSQldaSVKDRQAmeqhHVQQMKMLEDR 1920
Cdd:pfam15921   60 ELDSPRKIIAYPGKEHIERVLE---EYSHQVKDLQRRLNES-NELHEKQKFYLRQ---SVIDLQT----KLQEMQMERDA 128

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1921 FqLKVRELQAVHQEELRALQEHYIWSLRGALSLYQPSHPDSSLAPGP------------SEPRAVPAAKDEAESmsglrE 1988
Cdd:pfam15921  129 M-ADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQlrkmmlshegvlQEIRSILVDFEEASG-----K 202

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELGH------KELEGDVAALQEK---YQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEK 2059
Cdd:pfam15921  203 KIYEHDSMSTMHFRSLGSaiskilRELDTEISYLKGRifpVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITG 282

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKdrllaeetaatiSAIEAMKNAHREEME--RELEKSQRSQISSINSDIEALRRQYLEELQSVQRELE-VLSEQYSQ 2136
Cdd:pfam15921  283 LTEKA------------SSARSQANSIQSQLEiiQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdKIEELEKQ 350

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2137 KCLENAHLAQAlEAERQALRQ----CQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVl 2212
Cdd:pfam15921  351 LVLANSELTEA-RTERDQFSQesgnLDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV- 428

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081931 2213 lRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAK 2259
Cdd:pfam15921  429 -QRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTK 474

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

667-1092

3.33e-04

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 45.87 E-value: 3.33e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  667 KTQNVHVEIEQRWHQVETT--PLREEKQVPIAPLHLSLEDRSERLstHELTSLLEKELEQSQKEASDLLEQNRLLQDQLR 744
Cdd:pfam05483  180 ETRQVYMDLNNNIEKMILAfeELRVQAENARLEMHFKLKEDHEKI--QHLEEEYKKEINDKEKQVSLLLIQITEKENKMK 257

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  745 VALGREQSAREGyVLQTEVATSpsgawqrlhrvnqdLQSE-LEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTL 823
Cdd:pfam05483  258 DLTFLLEESRDK-ANQLEEKTK--------------LQDEnLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIA 322

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  824 QTRLGNAAAELAIKEQALAKLKGELKMeqgkvreQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR----DLET 899
Cdd:pfam05483  323 TKTICQLTEEKEAQMEELNKAKAAHSF-------VVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQkkssELEE 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  900 QQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytllLESCEQEKQALLQNL-KEVED---KASAYEDQL 975
Cdd:pfam05483  396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEE---------LKGKEQELIFLLQAReKEIHDleiQLTAIKTSE 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  976 QGHVQQVEALQKEKLSETCKGSEqvhkleeeleareasirqLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEK 1055
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIE------------------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ 528

                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1907081931 1056 LRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELK 1092
Cdd:pfam05483  529 EERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVK 565

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1977-2157

3.90e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.53 E-value: 3.90e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1977 KDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQrE 2056
Cdd:COG4717     91 AELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPER------------LEELEERLE-E 157

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2057 LEKLREEKDRLLAEetaatisaiEAMKNAHREEMERELEKSQRSQISSINSDIEALR---RQYLEELQSVQRELEVLSEQ 2133
Cdd:COG4717    158 LRELEEELEELEAE---------LAELQEELEELLEQLSLATEEELQDLAEELEELQqrlAELEEELEEAQEELEELEEE 228

                          170       180
                   ....*....|....*....|....
gi 1907081931 2134 YSQkcLENAHLAQALEAERQALRQ 2157
Cdd:COG4717    229 LEQ--LENELEAAALEERLKEARL 250

PH_GRP1-like

cd01252

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 ...

429-517

4.03e-04

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 and the related proteins ARNO (ARF nucleotide-binding site opener)/cytohesin-2 and cytohesin-1 are ARF exchange factors that contain a pleckstrin homology (PH) domain thought to target these proteins to cell membranes through binding polyphosphoinositides. The PH domains of all three proteins exhibit relatively high affinity for PtdIns(3,4,5)P3. Within the Grp1 family, diglycine (2G) and triglycine (3G) splice variants, differing only in the number of glycine residues in the PH domain, strongly influence the affinity and specificity for phosphoinositides. The 2G variants selectively bind PtdIns(3,4,5)P3 with high affinity,the 3G variants bind PtdIns(3,4,5)P3 with about 30-fold lower affinity and require the polybasic region for plasma membrane targeting. These ARF-GEFs share a common, tripartite structure consisting of an N-terminal coiled-coil domain, a central domain with homology to the yeast protein Sec7, a PH domain, and a C-terminal polybasic region. The Sec7 domain is autoinhibited by conserved elements proximal to the PH domain. GRP1 binds to the DNA binding domain of certain nuclear receptors (TRalpha, TRbeta, AR, ER, but not RXR), and can repress thyroid hormone receptor (TR)-mediated transactivation by decreasing TR-complex formation on thyroid hormone response elements. ARNO promotes sequential activation of Arf6, Cdc42 and Rac1 and insulin secretion. Cytohesin acts as a PI 3-kinase effector mediating biological responses including cell spreading and adhesion, chemotaxis, protein trafficking, and cytoskeletal rearrangements, only some of which appear to depend on their ability to activate ARFs. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269954 Cd Length: 119 Bit Score: 42.30 E-value: 4.03e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRDSvaeEAADLDGEI---NLStcydVTEYP-VQRNYGFQIH------- 494
Cdd:cd01252      5 REGWLLKL--GGRvksWKRRWFILTDNCLYYFEYT---TDKEPRGIIpleNLS----VREVEdKKKPFCFELYspsngqv 75

                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1907081931  495 -----------TKEGEFT---LSAMTSGIRRNWIQTI 517
Cdd:cd01252     76 ikacktdsdgkVVEGNHTvyrISAASEEERDEWIKSI 112

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1776-2067

4.47e-04

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 45.82 E-value: 4.47e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1776 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1855
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1856 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 1933
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1934 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 2006
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081931 2007 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 2067
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984

PH_OSBP_ORP4

cd13284

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; ...

429-515

4.62e-04

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; Human OSBP is proposed to function is sterol-dependent regulation of ERK dephosphorylation and sphingomyelin synthesis as well as modulation of insulin signaling and hepatic lipogenesis. It contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBPs and Osh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. ORP4 is proposed to function in Vimentin-dependent sterol transport and/or signaling. Human ORP4 has 2 forms, a long (ORP4L) and a short (ORP4S). ORP4L contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. ORP4S is truncated and contains only an OSBP-related domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270101 Cd Length: 99 Bit Score: 41.59 E-value: 4.62e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTK--QYEDGqWKKHWFVLADQSLRYYRdSVAEEAADLDGEINLSTCYDVTEYPVQrnygFQIHT-KEGEFTLSAM 505
Cdd:cd13284      1 MKGWLLKwtNYIKG-YQRRWFVLSNGLLSYYR-NQAEMAHTCRGTINLAGAEIHTEDSCN----FVISNgGTQTFHLKAS 74

                           90
                   ....*....|
gi 1907081931  506 TSGIRRNWIQ 515
Cdd:cd13284     75 SEVERQRWVT 84

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

834-1090

4.69e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.14 E-value: 4.69e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  834 LAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQrdrQKEVQR 913
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL---EAELAE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  914 LQECIAELSQQLGTSEQ--AQRL--MEKKLKRNYTLLLESCEqekqallqNLKEVEDKASAYEDQLQGHVQQVEALqkek 989
Cdd:COG4942     88 LEKEIAELRAELEAQKEelAELLraLYRLGRQPPLALLLSPE--------DFLDAVRRLQYLKYLAPARREQAEEL---- 155

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  990 lsetckgseqvhklEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHS 1069
Cdd:COG4942    156 --------------RADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

                          250       260
                   ....*....|....*....|.
gi 1907081931 1070 HHRVSVQLQSVRTLLREKEEE 1090
Cdd:COG4942    222 AEELEALIARLEAEAAAAAER 242

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1680-2157

5.25e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.14 E-value: 5.25e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1680 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1759
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1760 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1839
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1840 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1919
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1920 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1999
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2000 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2077
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2078 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2152
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1907081931 2153 QALRQ 2157
Cdd:COG4717    507 EEYRE 511

PH_Gab2_2

cd13384

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...

428-517

5.65e-04

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241535 Cd Length: 115 Bit Score: 41.66 E-value: 5.65e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTK-----QYEDGQWKKHWFVLADQS------LRYYRDsvaEEAADLDGEINLSTCYDV-----TEYPVQRNYG- 490
Cdd:cd13384      4 VYEGWLTKsppekRIWRAKWRRRYFVLRQSEipgqyfLEYYTD---RTCRKLKGSIDLDQCEQVdagltFETKNKLKDQh 80

                           90       100
                   ....*....|....*....|....*...
gi 1907081931  491 -FQIHTKEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd13384     81 iFDIRTPKRTYYLVADTEDEMNKWVNCI 108

PHA02562

endonuclease subunit; Provisional

774-999

5.76e-04

endonuclease subunit; Provisional

Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 45.01 E-value: 5.76e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  774 LHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKlkgeLKMEQG 853
Cdd:PHA02562   190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNK----LNTAAA 265

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLS--GQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQKEVQRLQEcIAELSQQLgtseq 930
Cdd:PHA02562   266 KIKSKIEQFQKVIKMYEkgGVCPTCTQQISEGPDRITKIKDKLKELQHSlEKLDTAIDELEEIMDE-FNEQSKKL----- 339

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081931  931 aqrlmeKKLKRNYtlllESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE--KLSETCKGSEQ 999
Cdd:PHA02562   340 ------LELKNKI----STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEldKIVKTKSELVK 400

PH_Btk

cd01238

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of ...

429-517

5.78e-04

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of cytoplasmic protein tyrosine kinases that includes BMX, IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a role in the maturation of B cells. Tec proteins general have an N-terminal PH domain, followed by a Tek homology (TH) domain, a SH3 domain, a SH2 domain and a kinase domain. The Btk PH domain binds phosphatidylinositol 3,4,5-trisphosphate and responds to signalling via phosphatidylinositol 3-kinase. The PH domain is also involved in membrane anchoring which is confirmed by the discovery of a mutation of a critical arginine residue in the BTK PH domain. This results in severe human immunodeficiency known as X-linked agammaglobulinemia (XLA) in humans and a related disorder is mice.PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269944 [Multi-domain] Cd Length: 140 Bit Score: 42.22 E-value: 5.78e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyedGQ---------WKKHWFVLADQSLRYYrDSVAEEAADLDGEINLSTCYDV----TEYPVQRNYGFQIHT 495
Cdd:cd01238      1 LEGLLVKR---SQgkkrfgpvnYKERWFVLTKSSLSYY-EGDGEKRGKEKGSIDLSKVRCVeevkDEAFFERKYPFQVVY 76

                           90       100
                   ....*....|....*....|..
gi 1907081931  496 KEGEFTLSAMTSGIRRNWIQTI 517
Cdd:cd01238     77 DDYTLYVFAPSEEDRDEWIAAL 98

PRK02224

DNA double-strand break repair Rad50 ATPase;

1975-2237

6.81e-04

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 45.03 E-value: 6.81e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1975 AAKDEAEsmsgLRERIQELEAQMGVMREELGHKELEGDVAALQ-----------EKYQRDFESLKATCERGFAAMEETHQ 2043
Cdd:PRK02224   197 EEKEEKD----LHERLNGLESELAELDEEIERYEEQREQARETrdeadevleehEERREELETLEAEIEDLRETIAETER 272

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2044 KK--IEDLQRQHQRELEKLREEKDRLLAEE--TAATISAIEAMKN---AHREEMERELEKsQRSQISSINSDIEALRrqy 2116
Cdd:PRK02224   273 EReeLAEEVRDLRERLEELEEERDDLLAEAglDDADAEAVEARREeleDRDEELRDRLEE-CRVAAQAHNEEAESLR--- 348

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2117 leelqsvqRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAaeitrlrtlltgdgggest 2196
Cdd:PRK02224   349 --------EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFG------------------- 401

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1907081931 2197 GLPLTQGKDAYELEVLLrvkeSEIQYLKQEISSLKDELQTA 2237
Cdd:PRK02224   402 DAPVDLGNAEDFLEELR----EERDELREREAELEATLRTA 438

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

774-1139

7.47e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 44.81 E-value: 7.47e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  774 LHRVNQDLQSELEAQCRRQELITQQIQT--------------LKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAE------ 833
Cdd:pfam10174  245 LERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevykshskfMKNKIDQLKQELSKKESELLALQTKLETLTNQnsdckq 324

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  834 --------LAIKEQALAKLKGE-----LKMEQ-----GKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR 895
Cdd:pfam10174  325 hievlkesLTAKEQRAAILQTEvdalrLRLEEkesflNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQqalQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDkasaYEDQL 975
Cdd:pfam10174  405 NLQEQ---LRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELES----LKKEN 477

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  976 QGHVQQVEALQKEKLSETCKGS---EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDvAEL 1052
Cdd:pfam10174  478 KDLKEKVSALQPELTEKESSLIdlkEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVRTN-PEI 556

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEElKHIKETHERVLEK------KDQDLNEALVKMialgSSLEET 1126
Cdd:pfam10174  557 NDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENE-KNDKDKKIAELESltlrqmKEQNKKVANIKH----GQQEMK 631

                          410
                   ....*....|...
gi 1907081931 1127 EIKLQEKEECLRR 1139
Cdd:pfam10174  632 KKGAQLLEEARRR 644

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

718-1137

7.73e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 45.11 E-value: 7.73e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNrllQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQelITQ 797
Cdd:pfam15921  247 LEALKSESQNKIELLLQQH---QDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQ--LSD 321

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  798 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQAlaklKGELKMEQGKVREQLEEwqhskamLSGQLRASE 877
Cdd:pfam15921  322 LESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQK-------LLADLHKRE 390

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  878 QKLRstearlLEKTQELR--DLETQQA-----LQR---DRQKEVQRLQ--------ECIAELSQQLGTSEQAQRLMEKkl 939
Cdd:pfam15921  391 KELS------LEKEQNKRlwDRDTGNSitidhLRReldDRNMEVQRLEallkamksECQGQMERQMAAIQGKNESLEK-- 462

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  940 krnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSE--QVHKLEEELEAREASIRQL 1017
Cdd:pfam15921  463 ---VSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEitKLRSRVDLKLQELQHLKNE 539

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1018 AQHVQSLHDERDLIKHQFQELMERVatsdgdvaelqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLrEKE--------E 1089
Cdd:pfam15921  540 GDHLRNVQTECEALKLQMAEKDKVI-----------EILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEindrrlelQ 607

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 1090 ELKHIKETHE---RVLEKKDQDLNEALVKMIALGSS-LEETEIKLQEKEECL 1137
Cdd:pfam15921  608 EFKILKDKKDakiRELEARVSDLELEKVKLVNAGSErLRAVKDIKQERDQLL 659

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1799-2261

7.90e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 44.76 E-value: 7.90e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1799 RAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQEAENKHSESMFALQGRYEE---EIRCMV 1875
Cdd:COG4717     44 RAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAE-LQEELEELEEELEELEAELEElreELEKLE 122

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1876 EQLSHTENTLQAER-SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQlKVRELQAVHQEELRALQEHYIWSLRGALSLY 1954
Cdd:COG4717    123 KLLQLLPLYQELEAlEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQLSLATEEELQDLAEEL 201

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1955 QpshpdsslapgpsepravpAAKDEAESmsgLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCER- 2033
Cdd:COG4717    202 E-------------------ELQQRLAE---LEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALl 259

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2034 GFAAMEETHQKKIEDLQRQHQ-----RELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSD 2108
Cdd:COG4717    260 ALLGLGGSLLSLILTIAGVLFlvlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2109 IEALRRqyLEELQSVQRELEVLSEQYSQKCLE---NAHLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRT 2185
Cdd:COG4717    340 LELLDR--IEELQELLREAEELEEELQLEELEqeiAALLAEAGVEDEEELRAALEQAEEY----QELKEELEELEEQLEE 413

                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931 2186 LLTGDGGGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYAsdkykDIYTELSIAKAK 2261
Cdd:COG4717    414 LLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELA-----ELLQELEELKAE 484

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

683-1098

8.34e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 44.72 E-value: 8.34e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  683 ETTPLREEKQVPIAPLH-----LSLE-DRSERLSTHELTSL-----LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQ 751
Cdd:pfam15921  371 ESGNLDDQLQKLLADLHkrekeLSLEkEQNKRLWDRDTGNSitidhLRRELDDRNMEVQRLEALLKAMKSECQGQMERQM 450

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  752 SAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGnaa 831
Cdd:pfam15921  451 AAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD--- 527

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  832 aelaIKEQALAKLKGE---------------LKM-EQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLlEKTQELR 895
Cdd:pfam15921  528 ----LKLQELQHLKNEgdhlrnvqtecealkLQMaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEINDR 602

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  896 DLETQQ--ALQRDRQKEVQRLQECIAEL-----------SQQLGT-----SEQAQRLMEKKLKRNYtllLESCEQEKQAL 957
Cdd:pfam15921  603 RLELQEfkILKDKKDAKIRELEARVSDLelekvklvnagSERLRAvkdikQERDQLLNEVKTSRNE---LNSLSEDYEVL 679

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  958 LQNLKEVEDKASAYEDQLQGHVQ--QVEALQKEKLSETCKGSE------------QVHKLEEELEAREASIRQLAQHVQS 1023
Cdd:pfam15921  680 KRNFRNKSEEMETTTNKLKMQLKsaQSELEQTRNTLKSMEGSDghamkvamgmqkQITAKRGQIDALQSKIQFLEEAMTN 759

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1024 LHDERDLIKHQFQEL---MERVATSDGDVAELQEKLRGKEVDYQ----NLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1096
Cdd:pfam15921  760 ANKEKHFLKEEKNKLsqeLSTVATEKNKMAGELEVLRSQERRLKekvaNMEVALDKASLQFAECQDIIQRQEQESVRLKL 839


                   ..
gi 1907081931 1097 TH 1098
Cdd:pfam15921  840 QH 841

PRK09039

peptidoglycan -binding protein;

2037-2189

8.51e-04

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 44.19 E-value: 8.51e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2037 AMEETHQKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEM--ERELEKSQRSQISSINSDIEAL 2112
Cdd:PRK09039    70 SLERQGNQDLQDSVANLRASLSAAEAERSRLqaLLAELAGAGAAAEGRAGELAQELdsEKQVSARALAQVELLNQQIAAL 149

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 2113 RRQyleeLQSVQRELEVlSEQYSQkclenahlaqalEAERQALRQCQRENQELNAHNQELnNRLAAE-ITRLRTLLTG 2189
Cdd:PRK09039   150 RRQ----LAALEAALDA-SEKRDR------------ESQAKIADLGRRLNVALAQRVQEL-NRYRSEfFGRLREILGD 209

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

830-1105

8.98e-04

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.57 E-value: 8.98e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  830 AAAELAIKEQALAK---LKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRD 906
Cdd:TIGR00618  182 ALMEFAKKKSLHGKaelLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQL 261

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  907 RQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQ 986
Cdd:TIGR00618  262 LKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  987 KEKLSETCKGSEQVH------------KLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSD-------- 1046
Cdd:TIGR00618  342 EQRRLLQTLHSQEIHirdahevatsirEISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDtrtsafrd 421

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 1047 --GDVAEL-------QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKK 1105
Cdd:TIGR00618  422 lqGQLAHAkkqqelqQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKK 489

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1980-2180

9.45e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.52 E-value: 9.45e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMR--------------EELGHKELEGDVAALQEKYQR------DFESLK---ATCERGFA 2036
Cdd:COG4913    623 EEELAEAEERLEALEAELDALQerrealqrlaeyswDEIDVASAEREIAELEAELERldassdDLAALEeqlEELEAELE 702

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2037 AMEEtHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAI------------EAMKNAHREEMERELEKSQ---RSQ 2101
Cdd:COG4913    703 ELEE-ELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARlelralleerfaAALGDAVERELRENLEERIdalRAR 781

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2102 ISSINSDIEALRRQYLEE----LQSVQRELEVLSEqYSQKC--LENAHLAQALEAERQALRQCQRENQElnahnqELNNR 2175
Cdd:COG4913    782 LNRAEEELERAMRAFNREwpaeTADLDADLESLPE-YLALLdrLEEDGLPEYEERFKELLNENSIEFVA------DLLSK 854


                   ....*
gi 1907081931 2176 LAAEI 2180
Cdd:COG4913    855 LRRAI 859

PRK03918

DNA double-strand break repair ATPase Rad50;

1639-2274

9.56e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 44.67 E-value: 9.56e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1639 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1718
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1719 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1797
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1798 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1877
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1878 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 1957
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1958 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2035
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2036 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2113
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2114 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2190
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2191 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2269
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726


                   ....*
gi 1907081931 2270 KEQLK 2274
Cdd:PRK03918   727 REKVK 731

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

716-885

9.87e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 43.98 E-value: 9.87e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  716 SLLEKELEQSQKEAS----DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRR 791
Cdd:COG4942     79 AALEAELAELEKEIAelraELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRAD 158

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  792 QELITQQIQTLKhsygEAKDAIRHHEAEIQTLQTRLgnaAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG 871
Cdd:COG4942    159 LAELAALRAELE----AERAELEALLAELEEERAAL---EALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231

                          170
                   ....*....|....
gi 1907081931  872 QLRASEQKLRSTEA 885
Cdd:COG4942    232 LEAEAAAAAERTPA 245

COG5022

Myosin heavy chain [General function prediction only];

778-1244

1.03e-03

Myosin heavy chain [General function prediction only];

Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 44.68 E-value: 1.03e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  778 NQDLQSELEAQCRRQELITQQI----QTLKHSYGEAkdAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKmEQG 853
Cdd:COG5022    819 IIKLQKTIKREKKLRETEEVEFslkaEVLIQKFGRS--LKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVK-SIS 895

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  854 KVREQLEEWQHSKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseq 930
Cdd:COG5022    896 SLKLVNLELESEIIELKKSLSSDLIenlEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS----- 970

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  931 aqrlmekklkrnytlllesceQEKQALLqnlkeveDKASAYEDQLQGHVQQVEALQKEkLSETCKGSEQVHKLEEELEAR 1010
Cdd:COG5022    971 ---------------------EEYEDLL-------KKSTILVREGNKANSELKNFKKE-LAELSKQYGALQESTKQLKEL 1021

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1011 EASIRQLAQHVQSLHDERDlIKHQFQELMERVATSDGDVAELQEKLrgKEVDYQN-LEHSHHRVSVQLQSVRTLlrEKEE 1089
Cdd:COG5022   1022 PVEVAELQSASKIISSEST-ELSILKPLQKLKGLLLLENNQLQARY--KALKLRReNSLLDDKQLYQLESTENL--LKTI 1096

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1090 ELKHIKETHERVLEKKdqdlnEALVKMIALGSSLEEteikLQEKEECLRRFVSDSPkDAKEPLSTTEPTEEGSGILPLGS 1169
Cdd:COG5022   1097 NVKDLEVTNRNLVKPA-----NVLQFIVAQMIKLNL----LQEISKFLSQLVNTLE-PVFQKLSVLQLELDGLFWEANLE 1166

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1170 VTRVFPGFpHSQPEDEDPSAGLGEEGSSGS-------------LSREENTILPKSADMPE--REG-HLQSTSKSDPGapI 1233
Cdd:COG5022   1167 ALPSPPPF-AALSEKRLYQSALYDEKSKLSssevndlkneliaLFSKIFSGWPRGDKLKKliSEGwVPTEYSTSLKG--F 1243

                          490
                   ....*....|.
gi 1907081931 1234 KRPRIRFSTIQ 1244
Cdd:COG5022   1244 NNLNKKFDTPA 1254

PRK02224

DNA double-strand break repair Rad50 ATPase;

1980-2165

1.26e-03

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 44.26 E-value: 1.26e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQekyQRDFESLKATCERgfaAMEETHQKkiedlQRQHQRELEK 2059
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEAR---REELEDRDEELRD---RLEECRVA-----AQAHNEEAES 346

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKDRLlaEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY---LEELQSVQRELEVLSEQysq 2136
Cdd:PRK02224   347 LREDADDL--EERAEELREEAAELESELEEAREAVED-RREEIEELEEEIEELRERFgdaPVDLGNAEDFLEELREE--- 420

                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081931 2137 kcLENAHLAQA-LEAERQALRQCQRENQEL 2165
Cdd:PRK02224   421 --RDELREREAeLEATLRTARERVEEAEAL 448

Myosin_tail_1

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

1790-2177

1.39e-03

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 44.01 E-value: 1.39e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1790 EAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKEN------TELKAkVSQMDHQQRCLQEAENKHSESMFAL 1863
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqmlAEEKA-ISARYAEERDRAEAEAREKETRALS 640

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1864 QGRYEEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKD-------RQAMEQhHVQQMKM----LEDrfqlkvrELQAVH 1932
Cdd:pfam01576  641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNvhelersKRALEQ-QVEEMKTqleeLED-------ELQATE 712

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1933 QEELR------ALQEHYIWSLRgalslyqpshpdsslapgpsepravpaAKDEA--ESMSGLRERIQELEAQMGVMREEl 2004
Cdd:pfam01576  713 DAKLRlevnmqALKAQFERDLQ---------------------------ARDEQgeEKRRQLVKQVRELEAELEDERKQ- 764

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2005 ghkelEGDVAALQEKYQRDFESLKATCERGFAAMEET--HQKKIEDLQRQHQRELEKLREEKDRLLA--EETAATISAIE 2080
Cdd:pfam01576  765 -----RAQAVAAKKKLELDLKELEAQIDAANKGREEAvkQLKKLQAQMKDLQRELEEARASRDEILAqsKESEKKLKNLE 839

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2081 A-----------------MKNAHREEMERELEK--SQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLEN 2141
Cdd:pfam01576  840 AellqlqedlaaserarrQAQQERDELADEIASgaSGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQV 919

                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1907081931 2142 AHLAQALEAERQALRQCQRENQELNAHNQELNNRLA 2177
Cdd:pfam01576  920 EQLTTELAAERSTSQKSESARQQLERQNKELKAKLQ 955

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

2096-2303

1.57e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 43.28 E-value: 1.57e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2096 KSQRSQISSINSDIEALRrqylEELQSVQRELEVLSEQYSQKcleNAHLAQALEAERQALRQCQRENQELNAHNQELNNR 2175
Cdd:COG3883     19 QAKQKELSELQAELEAAQ----AELDALQAELEELNEEYNEL---QAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2176 LAA------EITRLRTLLTGDGGGE--------STGLPLTQG--KDAYELEVLLRVKESEIQYLKQEISSLKDELQTALR 2239
Cdd:COG3883     92 ARAlyrsggSVSYLDVLLGSESFSDfldrlsalSKIADADADllEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081931 2240 DKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPEGTTVSGYDIMKSKSNPD 2303
Cdd:COG3883    172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAA 235

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1772-2165

1.63e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.77 E-value: 1.63e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1772 RIRLEYEKELRFYKKACQEAKGASGQKRAQAvgALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQE 1851
Cdd:COG1196    380 ELEELAEELLEALRAAAELAAQLEELEEAEE--ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE-LEE 456

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1852 AENKHSESMFALQGRYEEEIrcmvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAV 1931
Cdd:COG1196    457 EEEALLELLAELLEEAALLE----AALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV 532

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1932 HQEELRALQEhyiwsLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElGHKELEG 2011
Cdd:COG1196    533 EAAYEAALEA-----ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA-AVDLVAS 606

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2012 DVAALQEKYQRDFESL------KATCERGFAAMEETHQKKIE---DLQRQHQRELEKLREEKDRLLAEETAATISAIEAM 2082
Cdd:COG1196    607 DLREADARYYVLGDTLlgrtlvAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAE 686

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2083 KNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAE----------R 2152
Cdd:COG1196    687 RLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEElpeppdleelE 766

                          410
                   ....*....|...
gi 1907081931 2153 QALRQCQRENQEL 2165
Cdd:COG1196    767 RELERLEREIEAL 779

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2050-2322

1.75e-03

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.90 E-value: 1.75e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2050 QRQHQRELEKLREEKDRLLAEEtAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALrrqyLEELQSVQRELEV 2129
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKREL-SSLQSELRRIEN-RLDELSQELSDASR-KIGEIEKEIEQL----EQEEEKLKERLEE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2130 LSEQYSQkclenahLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYEL 2209
Cdd:TIGR02169  742 LEEDLSS-------LEQEIENVKSELKELEARIEEL----EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2210 EVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKadcdISRLKEQLKAATEALgekspegtt 2289
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAAL--------- 877

                          250       260       270
                   ....*....|....*....|....*....|...
gi 1907081931 2290 vsgYDIMKSKSNpdfLKKDRSCVTRQLRNIRSK 2322
Cdd:TIGR02169  878 ---RDLESRLGD---LKKERDELEAQLRELERK 904

PRK10246

exonuclease subunit SbcC; Provisional

718-981

1.75e-03

exonuclease subunit SbcC; Provisional

Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 43.64 E-value: 1.75e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRvalgREQSAREGYVLQTEVATSpsgAWQRL-------HRVNQDLQSELEAQCR 790
Cdd:PRK10246   535 LEKEVKKLGEEGAALRGQLDALTKQLQ----RDESEAQSLRQEEQALTQ---QWQAVcaslnitLQPQDDIQPWLDAQEE 607

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  791 RQELITQ--QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIK------EQA-LAKLKGELKMEQGKVREQ--L 859
Cdd:PRK10246   608 HERQLRLlsQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTlpqedeEASwLATRQQEAQSWQQRQNELtaL 687

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  860 EEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD----LETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQR- 933
Cdd:PRK10246   688 QNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHEqclsLHSQlQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQq 767

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907081931  934 ------LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 981
Cdd:PRK10246   768 aflaalLDEETLTQ--------LEQLKQNLENQRQQAQTLVTQTAQALAQHQQH 813

PH_RhoGap25-like

cd13263

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; ...

429-519

1.77e-03

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; RhoGAP25 (also called ArhGap25) like other RhoGaps are involved in cell polarity, cell morphology and cytoskeletal organization. They act as GTPase activators for the Rac-type GTPases by converting them to an inactive GDP-bound state and control actin remodeling by inactivating Rac downstream of Rho leading to suppress leading edge protrusion and promotes cell retraction to achieve cellular polarity and are able to suppress RAC1 and CDC42 activity in vitro. Overexpression of these proteins induces cell rounding with partial or complete disruption of actin stress fibers and formation of membrane ruffles, lamellipodia, and filopodia. This hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25. Members here contain an N-terminal PH domain followed by a RhoGAP domain and either a BAR or TATA Binding Protein (TBP) Associated Factor 4 (TAF4) domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270083 Cd Length: 114 Bit Score: 40.06 E-value: 1.77e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCyDVTEYPVQRN----YGFQIHTKEGE---- 499
Cdd:cd13263      5 KSGWLKKQGSIvKNWQQRWFVLRGDQLYYYKD---EDDTKPQGTIPLPGN-KVKEVPFNPEepgkFLFEIIPGGGGdrmt 80

                           90       100
                   ....*....|....*....|....*
gi 1907081931  500 -----FTLSAMTSGIRRNWIQTIMK 519
Cdd:cd13263     81 snhdsYLLMANSQAEMEEWVKVIRR 105

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

772-926

1.84e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 43.80 E-value: 1.84e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  772 QRLHRVNQDLQSELEA-QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtRLGNAAAELAIKEQALAKLKGELKM 850
Cdd:TIGR00618  725 NASSSLGSDLAAREDAlNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGA-ELSHLAAEIQFFNRLREEDTHLLKT 803

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  851 EQGKVREQLEewqHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLG 926
Cdd:TIGR00618  804 LEAEIGQEIP---SDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

2043-2186

1.98e-03

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.66 E-value: 1.98e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2043 QKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISA--------------IEAMKNAH-REEMER--ELEkSQRSQIS 2103
Cdd:pfam10174  400 QKKIENLQEQLRDKDKQLAGLKERVksLQTDSSNTDTAlttleealsekeriIERLKEQReREDRERleELE-SLKKENK 478

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2104 SINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQR-ENQELNAHNQELNNRLAAEIT- 2181
Cdd:pfam10174  479 DLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKlENQLKKAHNAEEAVRTNPEINd 558


                   ....*
gi 1907081931 2182 RLRTL 2186
Cdd:pfam10174  559 RIRLL 563

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

1980-2149

2.14e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 43.47 E-value: 2.14e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1980 AESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATCErgfAAMEETHQKKIEDLQRQHQRELEK 2059
Cdd:COG3206    211 SEEAKLLLQQLSELESQLAEARAEL--AEAEARLAALRAQLGSGPDALPELLQ---SPVIQQLRAQLAELEAELAELSAR 285

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2060 LREEKDRL--LAEETAATISAIEAMKNAHREEMERELEkSQRSQISSINSDIEALRRQYLE------ELQSVQRELEVLS 2131
Cdd:COG3206    286 YTPNHPDViaLRAQIAALRAQLQQEAQRILASLEAELE-ALQAREASLQAQLAQLEARLAElpeleaELRRLEREVEVAR 364

                          170       180
                   ....*....|....*....|
gi 1907081931 2132 EQYSQ--KCLENAHLAQALE 2149
Cdd:COG3206    365 ELYESllQRLEEARLAEALT 384

mukB

PRK04863

chromosome partition protein MukB;

1789-2186

2.16e-03

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 43.41 E-value: 2.16e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1789 QEAKGASGQ-KRAQA-VGALKEEYEE------LLHKQKSEYQKVITLIEKENTELKAKVSqmDHQQRcLQEAENKhsesm 1860
Cdd:PRK04863   341 QTALRQQEKiERYQAdLEELEERLEEqnevveEADEQQEENEARAEAAEEEVDELKSQLA--DYQQA-LDVQQTR----- 412

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1861 fALQGRyeeeircmveqlshteNTLQA-ERSRVLSQLDA----SVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEE 1935
Cdd:PRK04863   413 -AIQYQ----------------QAVQAlERAKQLCGLPDltadNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQF 475

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1936 LRALQehyiwSLRgalslyqpshpdsSLAPGPSEPRAVPAAKD---EAESMSGLRERIQELEAQmgvmreelgHKELEGD 2012
Cdd:PRK04863   476 EQAYQ-----LVR-------------KIAGEVSRSEAWDVAREllrRLREQRHLAEQLQQLRMR---------LSELEQR 528

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2013 VAAlqekyQRDFESLKATCERGFAAMEEThQKKIEDLQRQHQRELEKLREEKDRLlaeetaatisaieamkNAHREEMER 2092
Cdd:PRK04863   529 LRQ-----QQRAERLLAEFCKRLGKNLDD-EDELEQLQEELEARLESLSESVSEA----------------RERRMALRQ 586

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2093 ELEKsqrsqissINSDIEALRRQYLEELQSvQRELEVLSEQY------SQKCLEnaHLAQALEAERQALR---QCQRENQ 2163
Cdd:PRK04863   587 QLEQ--------LQARIQRLAARAPAWLAA-QDALARLREQSgeefedSQDVTE--YMQQLLERERELTVerdELAARKQ 655

                          410       420
                   ....*....|....*....|...
gi 1907081931 2164 ELNAHNQELNNRLAAEITRLRTL 2186
Cdd:PRK04863   656 ALDEEIERLSQPGGSEDPRLNAL 678

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1978-2242

2.29e-03

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 43.51 E-value: 2.29e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1978 DEAESMSGLRERIQELEAQMGVMREELG-----HKELEGDVAALQ------EKYQRDFESLKATcERGFAAME-ETHQKK 2045
Cdd:TIGR02168  162 EEAAGISKYKERRKETERKLERTRENLDrlediLNELERQLKSLErqaekaERYKELKAELREL-ELALLVLRlEELREE 240

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2046 IEDLQRQhQRELEKLREEKDRLLAEETAAtisaIEAMKNAHREeMERELEKSQRS--QISSINSDIEALRRQYLEELQSV 2123
Cdd:TIGR02168  241 LEELQEE-LKEAEEELEELTAELQELEEK----LEELRLEVSE-LEEEIEELQKElyALANEISRLEQQKQILRERLANL 314

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2124 QRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqg 2203
Cdd:TIGR02168  315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE------------------- 375

                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907081931 2204 kdayELEVLLRVKESEIQYLKQEISSLKDELQTALRDKK 2242
Cdd:TIGR02168  376 ----ELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410

MukB

COG3096

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...

1985-2273

2.81e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 43.02 E-value: 2.81e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1985 GLRERIQELEAQMGVMREElgHKELEGDVAALQE----------KYQRDFESLKATceRGFAAMEETHQKKIEDLQRQHQ 2054
Cdd:COG3096    372 EAAEQLAEAEARLEAAEEE--VDSLKSQLADYQQaldvqqtraiQYQQAVQALEKA--RALCGLPDLTPENAEDYLAAFR 447

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2055 RELEKLREEkdrLLAEETAATISaiEAMKNAHREEME------------------RELEKSQRSQiSSINSDIEALRRQY 2116
Cdd:COG3096    448 AKEQQATEE---VLELEQKLSVA--DAARRQFEKAYElvckiageversqawqtaRELLRRYRSQ-QALAQRLQQLRAQL 521

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2117 --LEELQSVQRELEVLSEQYSQ---KCLENA----HLAQALEAERQAL-----------RQCQRENQELNAHNQELNNRL 2176
Cdd:COG3096    522 aeLEQRLRQQQNAERLLEEFCQrigQQLDAAeeleELLAELEAQLEELeeqaaeaveqrSELRQQLEQLRARIKELAARA 601

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2177 AAEIT---RLRTLltgdggGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRdkkyasdkykdiyt 2253
Cdd:COG3096    602 PAWLAaqdALERL------REQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQIE-------------- 661

                          330       340
                   ....*....|....*....|
gi 1907081931 2254 ELSIAKAKADCDISRLKEQL 2273
Cdd:COG3096    662 RLSQPGGAEDPRLLALAERL 681

PRK11281

mechanosensitive channel MscK;

779-987

2.87e-03

mechanosensitive channel MscK;

Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 42.98 E-value: 2.87e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  779 QDLQSELEAQCRRQELITQQ---IQTLKHSYgEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKG--------- 846
Cdd:PRK11281    39 ADVQAQLDALNKQKLLEAEDklvQQDLEQTL-ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDdndeetret 117

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  847 -------ELKMEQGKVREQLEEWQHSKAMLSGQL--------RASEQkLRSTEARLLEKTQELRDLETQQALQRDRQKev 911
Cdd:PRK11281   118 lstlslrQLESRLAQTLDQLQNAQNDLAEYNSQLvslqtqpeRAQAA-LYANSQRLQQIRNLLKGGKVGGKALRPSQR-- 194

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  912 QRLQECIAELSQQ-------LGTSEQAQRLMEKklKRNYTLLLESCEQEKQALLQNLkeVEDKasaYEDQLQGHVQQVEA 984
Cdd:PRK11281   195 VLLQAEQALLNAQndlqrksLEGNTQLQDLLQK--QRDYLTARIQRLEHQLQLLQEA--INSK---RLTLSEKTVQEAQS 267


                   ...
gi 1907081931  985 LQK 987
Cdd:PRK11281   268 QDE 270

Apolipoprotein

pfam01442

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...

1981-2157

3.01e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.

Pssm-ID: 460211 [Multi-domain] Cd Length: 175 Bit Score: 40.71 E-value: 3.01e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1981 ESMSGLRERIQELEAQMGVMREELgHKELEGDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2060
Cdd:pfam01442    4 DSLDELSTYAEELQEQLGPVAQEL-VDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEEL 82

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2061 ReekdrllaEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2140
Cdd:pfam01442   83 R--------KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLAPYAEELRQKLAERLEELKESLAPYAEEVQAQLSQ 154

                          170
                   ....*....|....*...
gi 1907081931 2141 NA-HLAQALEAERQALRQ 2157
Cdd:pfam01442  155 RLqELREKLEPQAEDLRE 172

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

872-1042

3.16e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.98 E-value: 3.16e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  872 QLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKK---LKRNYTL--- 945
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELerlDASSDDLaal 690

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  946 --LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQS 1023
Cdd:COG4913    691 eeQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELREN 770

                          170
                   ....*....|....*....
gi 1907081931 1024 LHDERDLIKHQFQELMERV 1042
Cdd:COG4913    771 LEERIDALRARLNRAEEEL 789

PTZ00121

MAEBL; Provisional

832-1109

3.56e-03

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.82 E-value: 3.56e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  832 AELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV 911
Cdd:PTZ00121  1542 AEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM---ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  912 QRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLS 991
Cdd:PTZ00121  1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  992 ETCKGSEQVHKLEEELEAREASIRQlaqhvqslHDERDLIKhqfQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHH 1071
Cdd:PTZ00121  1699 EEAKKAEELKKKEAEEKKKAEELKK--------AEEENKIK---AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767

                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907081931 1072 RVSVQLQSVRTLLreKEEELKHIKETHERVLEKKDQDL 1109
Cdd:PTZ00121  1768 KKAEEIRKEKEAV--IEEELDEEDEKRRMEVDKKIKDI 1803

DR0291

COG1579

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...

1989-2133

3.78e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];

Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.45 E-value: 3.78e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1989 RIQELEAQMGVMREELghKELEGDVAALQ---EKYQRDFESLKATCERgFAAMEETHQKKIEDLQRQH-----QRELEKL 2060
Cdd:COG1579     18 ELDRLEHRLKELPAEL--AELEDELAALEarlEAAKTELEDLEKEIKR-LELEIEEVEARIKKYEEQLgnvrnNKEYEAL 94

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2061 REEKDRLLAEETAATISAIEAMKNahREEMERELEKSQrSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2133
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMER--IEELEEELAELE-AELAELEAELEEKKAELDEELAELEAELEELEAE 164

PH_ACAP

cd13250

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP ...

429-517

3.90e-03

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP (also called centaurin beta) functions both as a Rab35 effector and as an Arf6-GTPase-activating protein (GAP) by which it controls actin remodeling and membrane trafficking. ACAP contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain, a phospholipid-binding domain, a PH domain, a GAP domain, and four ankyrin repeats. The AZAPs constitute a family of Arf GAPs that are characterized by an NH2-terminal pleckstrin homology (PH) domain and a central Arf GAP domain followed by two or more ankyrin repeats. On the basis of sequence and domain organization, the AZAP family is further subdivided into four subfamilies: 1) the ACAPs contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain (a phospholipid-binding domain that is thought to sense membrane curvature), a single PH domain followed by the GAP domain, and four ankyrin repeats; 2) the ASAPs also contain an NH2-terminal BAR domain, the tandem PH domain/GAP domain, three ankyrin repeats, two proline-rich regions, and a COOH-terminal Src homology 3 domain; 3) the AGAPs contain an NH2-terminal GTPase-like domain (GLD), a split PH domain, and the GAP domain followed by four ankyrin repeats; and 4) the ARAPs contain both an Arf GAP domain and a Rho GAP domain, as well as an NH2-terminal sterile-a motif (SAM), a proline-rich region, a GTPase-binding domain, and five PH domains. PMID 18003747 and 19055940 Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270070 Cd Length: 98 Bit Score: 38.74 E-value: 3.90e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLdgEINLSTCYDVTEYPVQRNYGFQIHTKEGEFTLSAMT 506
Cdd:cd13250      1 KEGYLFKRSSNafKTWKRRWFSLQNGQLYYQKRDKKDEPTVM--VEDLRLCTVKPTEDSDRRFCFEVISPTKSYMLQAES 78

                           90
                   ....*....|.
gi 1907081931  507 SGIRRNWIQTI 517
Cdd:cd13250     79 EEDRQAWIQAI 89

PH_DOCK-D

cd13267

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...

428-519

3.98e-03

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also called Zizimin subfamily) consists of Dock9/Zizimin1, Dock10/Zizimin3, and Dock11/Zizimin2. DOCK-D has a N-terminal DUF3398 domain, a PH-like domain, a Dock Homology Region 1, DHR1 (also called CZH1), a C2 domain, and a C-terminal DHR2 domain (also called CZH2). Zizimin1 is enriched in the brain, lung, and kidney; zizimin2 is found in B and T lymphocytes, and zizimin3 is enriched in brain, lung, spleen and thymus. Zizimin1 functions in autoinhibition and membrane targeting. Zizimin2 is an immune-related and age-regulated guanine nucleotide exchange factor, which facilitates filopodial formation through activation of Cdc42, which results in activation of cell migration. No function has been determined for Zizimin3 to date. The N-terminal half of zizimin1 binds to the GEF domain through three distinct areas, including CZH1, to inhibit the interaction with Cdc42. In addition its PH domain binds phosphoinositides and mediates zizimin1 membrane targeting. DOCK is a family of proteins involved in intracellular signalling networks. They act as guanine nucleotide exchange factors for small G proteins of the Rho family, such as Rac and Cdc42. There are 4 subfamilies of DOCK family proteins based on their sequence homology: A-D. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270087 Cd Length: 126 Bit Score: 39.62 E-value: 3.98e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQYEDGQ----------WKKHWFVL---ADQS--LRYYRDsvaEEAADLDGEINLSTCYDVTEYPVQRNYGFQ 492
Cdd:cd13267      7 TKEGYLYKGPENSSdsfislamksFKRRFFHLkqlVDGSyiLEFYKD---EKKKEAKGTIFLDSCTGVVQNSKRRKFCFE 83

                           90       100
                   ....*....|....*....|....*...
gi 1907081931  493 IHTKEGE-FTLSAMTSGIRRNWIQTIMK 519
Cdd:cd13267     84 LRMQDKKsYVLAAESEAEMDEWISKLNK 111

PRK09039

peptidoglycan -binding protein;

819-961

4.00e-03

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 41.88 E-value: 4.00e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  819 EIQTLQTRLGNAAAELAIKEQALA---KLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELr 895
Cdd:PRK09039    47 EISGKDSALDRLNSQIAELADLLSlerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL- 125

                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081931  896 DLETQQALQRDRQkeVQRLQECIAELSQQLGTSEQAQRLMEKKlkrnytlllescEQEKQALLQNL 961
Cdd:PRK09039   126 DSEKQVSARALAQ--VELLNQQIAALRRQLAALEAALDASEKR------------DRESQAKIADL 177

Myosin_tail_1

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

1797-2237

4.05e-03

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 42.47 E-value: 4.05e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1797 QKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV-----SQMDHQQR------CLQEAENKHSESMFALQG 1865
Cdd:pfam01576  352 QKHTQALEELTEQLEQA-KRNKANLEKAKQALESENAELQAELrtlqqAKQDSEHKrkklegQLQELQARLSESERQRAE 430

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1866 RYEEEIRCMVEQLSHTENTLQAER-----SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRfqlkVRELQavhqEELRALQ 1940
Cdd:pfam01576  431 LAEKLSKLQSELESVSSLLNEAEGkniklSKDVSSLESQLQDTQELLQEETRQKLNLSTR----LRQLE----DERNSLQ 502

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1941 EHYiwslrgalslyqpshpdsslapgpsepravpaaKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQE-- 2018
Cdd:pfam01576  503 EQL---------------------------------EEEEEAKRNVERQLSTLQAQLSDMKKKL--EEDAGTLEALEEgk 547

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2019 -KYQRDFESLKATCERGFAAMEETH------QKKIEDL------QRQHQRELEKLREEKDRLLAEETAATISAIEAMKNA 2085
Cdd:pfam01576  548 kRLQRELEALTQQLEEKAAAYDKLEktknrlQQELDDLlvdldhQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRA 627

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2086 HREEMERELEksqrsqissinsdiealrrqyleelqsvqreleVLSeqysqkclenahLAQALEAERQALRQCQRENQEL 2165
Cdd:pfam01576  628 EAEAREKETR---------------------------------ALS------------LARALEEALEAKEELERTNKQL 662

                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081931 2166 NAHNQELNNRLAAeitrlrtlltgdgggestglpltQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTA 2237
Cdd:pfam01576  663 RAEMEDLVSSKDD-----------------------VGKNVHELERSKRALEQQVEEMKTQLEELEDELQAT 711

PH_TAAP2-like

cd13255

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 ...

429-517

4.49e-03

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 (also called PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4, 5)P3, function as negative regulators of insulin and PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin complex). TAPP2 contains two sequential PH domains in which the C-terminal PH domain specifically binds PtdIns(3,4)P2 with high affinity. The N-terminal PH domain does not interact with any phosphoinositide tested. They also contain a C-terminal PDZ-binding motif that interacts with several PDZ-binding proteins, including PTPN13 (known previously as PTPL1 or FAP-1) as well as the scaffolding proteins MUPP1 (multiple PDZ-domain-containing protein 1), syntrophin and utrophin. The members here are most sequence similar to TAPP2 proteins, but may not be actual TAPP2 proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270075 Cd Length: 110 Bit Score: 38.93 E-value: 4.49e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQYEDGQ-WKKHWFVLADQSLRYYRDSVAEEAADLdgeINLSTCYDVTEYPVQRN-YGFQIHTKEGEFTLSAMT 506
Cdd:cd13255      8 KAGYLEKKGERRKtWKKRWFVLRPTKLAYYKNDKEYRLLRL---IDLTDIHTCTEVQLKKHdNTFGIVTPARTFYVQADS 84

                           90
                   ....*....|.
gi 1907081931  507 SGIRRNWIQTI 517
Cdd:cd13255     85 KAEMESWISAI 95

PH1_PH_fungal

cd13298

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal ...

428-517

5.06e-03

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal proteins are unknown, but they all contain 2 PH domains. This cd represents the first PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270110 Cd Length: 106 Bit Score: 38.76 E-value: 5.06e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  428 FKKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCYDVTEYPV-QRNYGFQIHTKEGEFTLSAM 505
Cdd:cd13298      7 LKSGYLLKRSRKtKNWKKRWVVLRPCQLSYYKD---EKEYKLRRVINLSELLAVAPLKDkKRKNVFGIYTPSKNLHFRAT 83

                           90
                   ....*....|..
gi 1907081931  506 TSGIRRNWIQTI 517
Cdd:cd13298     84 SEKDANEWVEAL 95

PH_KIFIA_KIFIB

cd01233

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA ...

429-517

5.09e-03

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA (Caenorhabditis elegans homolog unc-104) and KIFIB transport synaptic vesicle precursors that contain synaptic vesicle proteins, such as synaptophysin, synaptotagmin and the small GTPase RAB3A, but they do not transport organelles that contain plasma membrane proteins. They have a N-terminal motor domain, followed by a coiled-coil domain, and a C-terminal PH domain. KIF1A adopts a monomeric form in vitro, but acts as a processive dimer in vivo. KIF1B has alternatively spliced isoforms distinguished by the presence or absence of insertion sequences in the conserved amino-terminal region of the protein; this results in their different motor activities. KIF1A and KIF1B bind to RAB3 proteins through the adaptor protein mitogen-activated protein kinase (MAPK) -activating death domain (MADD; also calledDENN), which was first identified as a RAB3 guanine nucleotide exchange factor (GEF). PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269939 Cd Length: 103 Bit Score: 38.73 E-value: 5.09e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  429 KKGWLTKQyEDG--QWKKHWFVLADQSLRYYRDSvaeeaADLD--GEINLSTC---YDV-TEYPVQRNYGFQIHTKEGEF 500
Cdd:cd01233      8 KRGYLLFL-EDAtdGWVRRWVVLRRPYLHIYSSE-----KDGDerGVINLSTArveYSPdQEALLGRPNVFAVYTPTNSY 81

                           90
                   ....*....|....*..
gi 1907081931  501 TLSAMTSGIRRNWIQTI 517
Cdd:cd01233     82 LLQARSEKEMQDWLYAI 98

DUF4618

pfam15397

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins ...

1976-2121

5.19e-03

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins in this family are typically between 238 and 363 amino acids in length. There are two conserved sequence motifs: EYP and KCTPD.

Pssm-ID: 464704 [Multi-domain] Cd Length: 258 Bit Score: 41.09 E-value: 5.19e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1976 AKDEAESMSGLRERIQELEAQMGVMREELG----HKELEGDVAALQ-EKYQRDFESLKatcergfaameETHQKKIEDLQ 2050
Cdd:pfam15397   76 EEKEESKLNKLEQQLEQLNAKIQKTQEELNflstYKDKEYPVKAVQiANLVRQLQQLK-----------DSQQDELDELE 144

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081931 2051 RQHQRELEKL----REEKDRLL---AEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQ 2121
Cdd:pfam15397  145 EMRRMVLESLsrkiQKKKEKILsslAEKTLSPYQESLLQKTRDNQVMLKEIEQ-FREFIDELEEEIPKLKAE-VQQLQ 220

COG4372

Uncharacterized protein, contains DUF3084 domain [Function unknown];

817-1135

5.30e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];

Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.81 E-value: 5.30e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  817 EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD 896
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  897 LETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLEScEQEKQALLQNLKEVEDKASAYE 972
Cdd:COG4372     92 AQAELAQAQEEleslQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER-EEELKELEEQLESLQEELAALE 170

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  973 DQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1052
Cdd:COG4372    171 QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEE 250

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1053 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQE 1132
Cdd:COG4372    251 LLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLEL 330


                   ...
gi 1907081931 1133 KEE 1135
Cdd:COG4372    331 ALA 333

MukB

COG3096

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...

718-992

5.33e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 42.25 E-value: 5.33e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVA--TSPSGAWQRlhrvnqdlQSELEAQCRRQELI 795
Cdd:COG3096    439 AEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYELVCKIAgeVERSQAWQT--------ARELLRRYRSQQAL 510

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  796 TQQIQTLKHSYGEA-KDAIRHHEAEiqtlqtrlgnaaaelaikeqalaKLKGELKMEQGKVREQLEEWQHSKAMLSGQLR 874
Cdd:COG3096    511 AQRLQQLRAQLAELeQRLRQQQNAE-----------------------RLLEEFCQRIGQQLDAAEELEELLAELEAQLE 567

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  875 ASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS--------EQAQRLMEKklKRNYTLL 946
Cdd:COG3096    568 ELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALERLREQSGEAladsqevtAAMQQLLER--EREATVE 645

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081931  947 LESCEQEKQALLQNLKEVEDKASAYEDQLqghVQQVEALQKEKLSE 992
Cdd:COG3096    646 RDELAARKQALESQIERLSQPGGAEDPRL---LALAERLGGVLLSE 688

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

718-883

5.51e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 5.51e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGY---VLQTEVAtspsgAWQ-RLHRVNQD------LQSELEA 787
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIdvaSAEREIA-----ELEaELERLDASsddlaaLEEQLEE 696

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  788 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKA 867
Cdd:COG4913    697 LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID 776

                          170
                   ....*....|....*.
gi 1907081931  868 MLSGQLRASEQKLRST 883
Cdd:COG4913    777 ALRARLNRAEEELERA 792

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

1697-2282

5.89e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 41.88 E-value: 5.89e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1697 QDELAQQLREKASILEEISAALPVLPPTEPLGgcqrLLRMSQHLSYESCLEGLGQYSSLLVQDAIIQAQVCYAACRIRLE 1776
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLA----LMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEK 226

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1777 YEKELRFYKKACQEAKGASGQKRAQAVGALKEEYE-ELLHKQKSEYQKVITLIEKENTELK------------AKVSQMD 1843
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLlKQLRARIEELRAQEAVLEETQERINrarkaaplaahiKAVTQIE 306

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1844 HQ-QRCLQEAENKHSESMFALQGRYE-EEIRCMVEQLSHTENTLQAERSRVLSQLDA-----SVKDRQAMEQHHV----Q 1912
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAhVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVatsirEISCQQHTLTQHIhtlqQ 386

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1913 QMKMLEDRFQLKVRELQAVHQE---------ELRALQEHYIwSLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESM 1983
Cdd:TIGR00618  387 QKTTLTQKLQSLCKELDILQREqatidtrtsAFRDLQGQLA-HAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESA 465

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 1984 SGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFE--SLKATCERGFAAMEETHQKK---IEDLQRQHQRELE 2058
Cdd:TIGR00618  466 QSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCgsCIHPNPARQDIDNPGPLTRRmqrGEQTYAQLETSEE 545

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2059 KLREEKDRLLaeETAATISAieamknahREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKC 2138
Cdd:TIGR00618  546 DVYHQLTSER--KQRASLKE--------QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSEAEDMLA 611

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2139 LENAHL------AQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTgdgggestglplTQGKDAYELEVL 2212
Cdd:TIGR00618  612 CEQHALlrklqpEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSI------------RVLPKELLASRQ 679

                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907081931 2213 LrvKESEIQYLKQEISSLKDEL---QTALRDKKYASDKYKDIYTELSIAKAKAdcdISRLKEQLKAATEALGE 2282
Cdd:TIGR00618  680 L--ALQKMQSEKEQLTYWKEMLaqcQTLLRELETHIEEYDREFNEIENASSSL---GSDLAAREDALNQSLKE 747

Yuri_gagarin

pfam15934

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.

676-904

7.18e-03

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.

Pssm-ID: 318204 [Multi-domain] Cd Length: 234 Bit Score: 40.33 E-value: 7.18e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  676 EQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLstheltslleKELEQSQKEASDLLEQNRLlqdqlrvalgreqsARE 755
Cdd:pfam15934   45 EQEQQLKEFTVQNQRLACQIDNLHETLKDRDHQI----------KQLQSMITGYSDISENNRL--------------KEE 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  756 GYVLQTEVATspsgawqrLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNaaaela 835
Cdd:pfam15934  101 IHDLKQKNCV--------QARVVRKMGLELKGQEEQRVELCDKYESLLGSFEEQCQELKRANRRVQSLQTRLSQ------ 166

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081931  836 ikeqaLAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQ 904
Cdd:pfam15934  167 -----VEKLQEELRTERKILREEVIALKEKDAKSNGRERALQDQLKCCQTEIEKSRTLIRNMQSHLQLE 230

Mitofilin

pfam09731

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...

809-990

7.28e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.

Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 41.67 E-value: 7.28e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  809 AKDAIRHHEAEIQTLQTRlGNAAAELAIKEQALAKLKGELKmeqgkVREQLEEwqhskamlsgQLRASEQKLRSTEARLL 888
Cdd:pfam09731  292 AHREIDQLSKKLAELKKR-EEKHIERALEKQKEELDKLAEE-----LSARLEE----------VRAADEAQLRLEFERER 355

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  889 EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseqaQRLMEKKLKrnytlllESCEQEKQALLQNLKEVEDKA 968
Cdd:pfam09731  356 EEIRESYEEKLRTELERQAEAHEEHLKDVLVEQEIEL------QREFLQDIK-------EKVEEERAGRLLKLNELLANL 422

                          170       180
                   ....*....|....*....|...
gi 1907081931  969 SAYEDQLQGHVQQV-EALQKEKL 990
Cdd:pfam09731  423 KGLEKATSSHSEVEdENRKAQQL 445

TOPEUc

smart00435

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...

1971-2067

7.56e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras

Pssm-ID: 214661 [Multi-domain] Cd Length: 391 Bit Score: 41.18 E-value: 7.56e-03

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  1971 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2047
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346

                            90       100
                    ....*....|....*....|
gi 1907081931  2048 DLQRQHQReLEKLREEKDRL 2067
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365

Golgin_A5

pfam09787

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...

2075-2188

8.21e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.

Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 40.90 E-value: 8.21e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931 2075 TISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQA 2154
Cdd:pfam09787   43 TALTLELEELRQERDLLREEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122

                           90       100       110
                   ....*....|....*....|....*....|....
gi 1907081931 2155 LRQCQRENQELNAHNQELNNRLAAEITRLRTLLT 2188
Cdd:pfam09787  123 LRYLEEELRRSKATLQSRIKDREAEIEKLRNQLT 156

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

718-893

8.81e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.97 E-value: 8.81e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  718 LEKELEQSQKEASDLLEQNRLLQ---DQLRVALGR------EQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEaq 788
Cdd:COG3883     56 LQAELEALQAEIDKLQAEIAEAEaeiEERREELGEraralyRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADADAD-- 133

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081931  789 crrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAM 868
Cdd:COG3883    134 ------LLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAA 207

                          170       180
                   ....*....|....*....|....*
gi 1907081931  869 LSGQLRASEQKLRSTEARLLEKTQE 893
Cdd:COG3883    208 AEAAAAAAAAAAAAAAAAAAAAAAA 232

Mplasa_alph_rch