|
Name |
Accession |
Description |
Interval |
E-value |
| Macoilin |
pfam09726 |
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 ... |
2-435 |
0e+00 |
|
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 trasnmembrane helices, followed by a C-terminal coiled-coil region. Macoilin is a highly conserved protein present in eukaryotes. Macoilin appears to be found in the ER and be involved in the function of neurons.
Pssm-ID: 462859 [Multi-domain] Cd Length: 670 Bit Score: 607.23 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 2 KRRNADCSKLRRPLKRNRITEGIYGSTFLYLKFLVVWALVLLADFVLEFRFEYLWPFWLFIRSVYDSFRYQGLAFSVFFV 81
Cdd:pfam09726 1 KRRNADCSKLRRPLKRNRITEGIYGSTFLYLKFLVVWALVLLADFVLEFRFEYLWPFWLLIRSVYDSFKYQGLAFSVFFV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 82 CVAFTSNIICLLFIPIQWLFFAASTYVWVQYVWHTERGVCLPTVSLWILFVYIEAAIRFKDLKNFHVDLCRPFAAHWLEQ 161
Cdd:pfam09726 81 CIAFTSDIICLLFIPVQWLFFAASTYVWVQYVWHTEKGICLPTVSLWILFVYIEAAIRFKDLKNFHVDLCRPFAAHCIGY 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 162 DIKKL----------KADLQASRQVEQE---------------------------------------------------- 179
Cdd:pfam09726 161 PVVTLgfgfksyvsyKMRLRKQREVQKEnefymqllqqalpkeqqmldrqeretsetakglsevdplalnqnghslnkkd 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 180 LRSQISSLSSTERGI-----RSEMGQLRQENELLQNKLHNAVQMKQ---------------------------------- 220
Cdd:pfam09726 241 STLQLPELEYREKKNsgtssGSDSKKSHNHNIHNLNHVDSKLQEKEymenhsnskrlnistspgseedllvresvsskss 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 221 ---------------------------------------------KDKQN--------------------------ISQL 229
Cdd:pfam09726 321 ssssssnknyknasggsanssnssprshshnsgsvtsssssknskKQKGPggksgarhkdpaencipnnqlskpdaLVRL 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 230 E---KKLKA--------EQEARS--------------------------------------------------------- 241
Cdd:pfam09726 401 EqdiKKLKAelqasrqtEQELRSqissltslerslkselgqlrqendllqtklhnavsakqkdkqtvqqlekrlkaeqea 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 242 --FVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRKYKE 319
Cdd:pfam09726 481 raSAEKQLAEEKKRKKEEEATAARAVALAAASRGECTESLKQRKRELESEIKKLTHDIKLKEEQIRELEIKVQELRKYKE 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 320 NEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAEVMAVMPSity 399
Cdd:pfam09726 561 SEKDTEVLMSALSAMQDKNQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQIYQKDQEIKDLKQKIAEVMAVMPS--- 637
|
650 660 670
....*....|....*....|....*....|....*.
gi 543173121 400 saaTSPLSPVSPHYSSKFVETSPSGLDPNASVYQPL 435
Cdd:pfam09726 638 ---TSRITPVTPHYSSKFMDTSPSMRDPNASVYPPL 670
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
135-397 |
2.51e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 65.85 E-value: 2.51e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 135 EAAIRFKDLKN--FHVDLcRPFAAHW--LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQN 210
Cdd:TIGR02168 210 EKAERYKELKAelRELEL-ALLVLRLeeLREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQK 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 211 KLHNAvqmkqkdKQNISQLEKKLKAEQEARSFVEKQLmEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEG 290
Cdd:TIGR02168 289 ELYAL-------ANEISRLEQQKQILRERLANLERQL-EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 291 KKLTMDMKVKEDQIRELELKVQELR-KYKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAkrQLEIA 369
Cdd:TIGR02168 361 EELEAELEELESRLEELEEQLETLRsKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEA--ELKEL 438
|
250 260
....*....|....*....|....*...
gi 543173121 370 QGQILQKDQEIKDLKQKIAEVMAVMPSI 397
Cdd:TIGR02168 439 QAELEELEEELEELQEELERLEEALEEL 466
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
159-389 |
2.82e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.30 E-value: 2.82e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQE 238
Cdd:TIGR02168 682 LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA 761
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTET------LRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQ 312
Cdd:TIGR02168 762 EIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALrealdeLRAELTLLNEEAANLRERLESLERRIAATERRLE 841
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 313 ELRKYKEnekDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFS----ALGDAKRQLEIAQGQILQKDQEIKDLKQKIA 388
Cdd:TIGR02168 842 DLEEQIE---ELSEDIESLAAEIEELEELIEELESELEALLNERAsleeALALLRSELEELSEELRELESKRSELRRELE 918
|
.
gi 543173121 389 E 389
Cdd:TIGR02168 919 E 919
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
157-389 |
1.19e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 57.25 E-value: 1.19e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 157 HWLEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAvqmkqkdKQNISQLEKKLKAE 236
Cdd:COG1196 235 RELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYEL-------LAELARLEQDIARL 307
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 237 QEARSFVEKQLmeekkrkkleeataaravafaaasrgectETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELR- 315
Cdd:COG1196 308 EERRRELEERL-----------------------------EELEEELAELEEELEELEEELEELEEELEEAEEELEEAEa 358
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 543173121 316 KYKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:COG1196 359 ELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAE 432
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
159-389 |
2.44e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.48 E-value: 2.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQE 238
Cdd:COG1196 258 LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEE 337
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLTmdmKVKEDQIRELELKVQE---LR 315
Cdd:COG1196 338 ELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALR---AAAELAAQLEELEEAEealLE 414
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 543173121 316 KYKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:COG1196 415 RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAE 488
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
134-389 |
7.58e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.68 E-value: 7.58e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 134 IEAAIRFKDLKNFHVDLCRpfAAHWLEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLH 213
Cdd:TIGR02168 701 AELRKELEELEEELEQLRK--ELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 214 NAVQMKQKDKQNISQLEKKLKAEQEARSFVEKQLmEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKL 293
Cdd:TIGR02168 779 EAEAEIEELEAQIEQLKEELKALREALDELRAEL-TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 294 TMDMKVKEDQIRELELKVQELRK-YKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQ 372
Cdd:TIGR02168 858 AAEIEELEELIEELESELEALLNeRASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVR 937
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 543173121 373 ILQK-----------------------------DQEIKDLKQKIAE 389
Cdd:TIGR02168 938 IDNLqerlseeysltleeaealenkieddeeeaRRRLKRLENKIKE 983
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
153-424 |
1.37e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.15 E-value: 1.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 153 PFAAHWLEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAvqmkqkdKQNISQLEKK 232
Cdd:COG4942 12 ALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRAL-------EQELAALEAE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 233 LKA----EQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIREleaegKKLTMDMKVKEDQIRELE 308
Cdd:COG4942 85 LAElekeIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYL-----KYLAPARREQAEELRADL 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 309 LKVQELRKYKENEKDTevLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIA 388
Cdd:COG4942 160 AELAALRAELEAERAE--LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAA 237
|
250 260 270
....*....|....*....|....*....|....*..
gi 543173121 389 EVMAVMPSITYSAATSPLS-PVSPHYSSKFVETSPSG 424
Cdd:COG4942 238 AAAERTPAAGFAALKGKLPwPVSGRVVRRFGERDGGG 274
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
159-391 |
1.98e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 50.45 E-value: 1.98e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNaVQMKQKD------KQNISQLEKK 232
Cdd:TIGR02169 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK-LEEALNDlearlsHSRIPEIQAE 799
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 233 LKAEQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTEtLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQ 312
Cdd:TIGR02169 800 LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRID-LKEQIKSIEKEIENLNGKKEELEEELEELEAALR 878
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 313 ELRK-YKENEKDTEVLMSALSAMQDKTQHLENSLSaetriklDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAEVM 391
Cdd:TIGR02169 879 DLESrLGDLKKERDELEAQLRELERKIEELEAQIE-------KKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEEL 951
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
155-411 |
3.43e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 49.24 E-value: 3.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 155 AAHWLEQDIKKLKADLQASRQVEQELRSQISSLSSTErgirsEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLK 234
Cdd:COG3206 176 ALEFLEEQLPELRKELEEAEAALEEFRQKNGLVDLSE-----EAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLG 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 235 AEQEARSFVekqlmeekkrkkleeataaravafaaaSRGECTETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQEL 314
Cdd:COG3206 251 SGPDALPEL---------------------------LQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAAL 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 315 RKYKENEKDTEV--LMSALSAMQDKTQHLENSLsAETRIKLDLFSA----LGDAKRQLEIAQG---QILQKDQEIkdlkq 385
Cdd:COG3206 304 RAQLQQEAQRILasLEAELEALQAREASLQAQL-AQLEARLAELPEleaeLRRLEREVEVARElyeSLLQRLEEA----- 377
|
250 260
....*....|....*....|....*.
gi 543173121 386 KIAEVMAVMPSITYSAATSPLSPVSP 411
Cdd:COG3206 378 RLAEALTVGNVRVIDPAVVPLKPVSP 403
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
160-385 |
5.76e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.86 E-value: 5.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 160 EQDIKKLKADLQASRQVEQELRSQISSLsstergiRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQL-EKKLKAEQE 238
Cdd:TIGR04523 369 QNEIEKLKKENQSYKQEIKNLESQINDL-------ESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLkETIIKNNSE 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKqlmeekkrkkleeataARAVAFAAASRGECTETLRNRIRELEAEGKKLTMDMkvkEDQIRELELKVQELRKYK 318
Cdd:TIGR04523 442 IKDLTNQ----------------DSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNL---EQKQKELKSKEKELKKLN 502
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 543173121 319 ENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQL------EIAQGQILQKDQEIKDLKQ 385
Cdd:TIGR04523 503 EEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDdfelkkENLEKEIDEKNKEIEELKQ 575
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
159-389 |
8.08e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 48.39 E-value: 8.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQE 238
Cdd:COG1196 300 LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEE 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKkleeataaravafaaasrgectETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRKYK 318
Cdd:COG1196 380 ELEELAEELLEALRAA----------------------AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEE 437
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 543173121 319 ENEKDTEvlmsaLSAMQDKTQHLENSLSAETRIKLDLFSALGdAKRQLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:COG1196 438 EEEEEAL-----EEAAEEEAELEEEEEALLELLAELLEEAAL-LEAALAELLEELAEAAARLLLLLEAEAD 502
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
159-389 |
1.90e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.98 E-value: 1.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQ----------ISSLSSTERGIRSEMGQLR---QENELLQNKLHNAVQMKQKDKQN 225
Cdd:PRK03918 174 IKRRIERLEKFIKRTENIEELIKEKekeleevlreINEISSELPELREELEKLEkevKELEELKEEIEELEKELESLEGS 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 226 ISQLEKKLKAEQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASrgECTETLRNRIRELEAEGKKLTMDMKVKEDQIR 305
Cdd:PRK03918 254 KRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLS--EFYEEYLDELREIEKRLSRLEEEINGIEERIK 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 306 ELELKVQELR----KYKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDlfsALGDAKRQLEIAQGQILQKDQEIK 381
Cdd:PRK03918 332 ELEEKEERLEelkkKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL---TPEKLEKELEELEKAKEEIEEEIS 408
|
....*...
gi 543173121 382 DLKQKIAE 389
Cdd:PRK03918 409 KITARIGE 416
|
|
| CarR_dom_SF |
TIGR03462 |
lycopene cyclase domain; This domain is often repeated twice within the same polypeptide, as ... |
29-113 |
8.21e-05 |
|
lycopene cyclase domain; This domain is often repeated twice within the same polypeptide, as is observed in Archaea, Thermus, Sphingobacteria and Fungi. In the fungal sequences, this tandem domain pair is observed as the N-terminal half of a bifunctional protein, where it has been characterized as a lycopene beta-cyclase and the C-terminal half is a phytoene synthetase. In Myxococcus and Actinobacterial genomes this domain appears as a single polypeptide, tandemly repeated and usually in a genomic context consistent with a role in carotenoid biosynthesis. It is unclear whether any of the sequences in this family truly encode lycopene epsilon cyclases. However a number are annotated as such. The domain is generally hydrophobic with a number of predicted membrane spanning segments and contains a distinctive motif (hPhEEhhhhhh). In certain sequences one of either the proline or glutamates may vary, but always one of the tandem pair appear to match this canonical sequence exactly.
Pssm-ID: 274590 Cd Length: 89 Bit Score: 41.05 E-value: 8.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 29 FLYLKFLVVWALVLLA-DFVLEFRFEYLWPFWLFI------RSVYDSFryqGLAFSVFFVCVAFTSNiICLLFIPIQ-WL 100
Cdd:TIGR03462 1 YLYLGVLLVWALPVLAlLWVFRGPFLRLRALALALlialptFLVWDNL---AIRRGVWTYNPRYILG-IRLGDLPIEeFL 76
|
90
....*....|...
gi 543173121 101 FFAASTYVWVQYV 113
Cdd:TIGR03462 77 FFLLTPLLTVLWL 89
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
195-389 |
1.13e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 44.66 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 195 RSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGE 274
Cdd:TIGR02168 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 275 CTEtLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRK-YKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKL 353
Cdd:TIGR02168 756 LTE-LEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEeLKALREALDELRAELTLLNEEAANLRERLESLERRIA 834
|
170 180 190
....*....|....*....|....*....|....*.
gi 543173121 354 DLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:TIGR02168 835 ATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
160-389 |
2.27e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 43.47 E-value: 2.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 160 EQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKK---LKAE 236
Cdd:TIGR04523 320 EKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQindLESK 399
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 237 QEARSFVEKQLMEEKKRKKLEEataaravafaaasrgectETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRK 316
Cdd:TIGR04523 400 IQNQEKLNQQKDEQIKKLQQEK------------------ELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDN 461
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 543173121 317 YKENEKDTevlMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEiaqgqilqkdQEIKDLKQKIAE 389
Cdd:TIGR04523 462 TRESLETQ---LKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELE----------EKVKDLTKKISS 521
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
162-394 |
3.67e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 42.79 E-value: 3.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 162 DIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQEnellqnklhnavqMKQKDKQNISQLEkklKAEQEARS 241
Cdd:pfam05483 514 ELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREE-------------FIQKGDEVKCKLD---KSEENARS 577
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 242 FVEKQLMEEKKRKKLEEATAARAVAFAAASRgectetlrnRIRELEAEGKKLTMDMKVKEDQIRELELKVQELrkykENE 321
Cdd:pfam05483 578 IEYEVLKKEKQMKILENKCNNLKKQIENKNK---------NIEELHQENKALKKKGSAENKQLNAYEIKVNKL----ELE 644
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 543173121 322 kdtevlmsaLSAMQDKTQHLENSLSAETRI-KLDLFSALGDAKRQLEIAQGQI-LQKDQEIKdLKQKIAEVMAVM 394
Cdd:pfam05483 645 ---------LASAKQKFEEIIDNYQKEIEDkKISEEKLLEEVEKAKAIADEAVkLQKEIDKR-CQHKIAEMVALM 709
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
134-386 |
6.21e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 6.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 134 IEAAIR-FKDLKNFHVDLCRpfaahwLEQDIKKLkADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQEneLLQNKL 212
Cdd:COG4913 227 ADALVEhFDDLERAHEALED------AREQIELL-EPIRELAERYAAARERLAELEYLRAALRLWFAQRRLE--LLEAEL 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 213 HNAVQMKQKDKQNISQLEKKLKAEQEARSFVEKQLMEekkrkkleeataaravafaaaSRGECTETLRNRIRELEAEGKK 292
Cdd:COG4913 298 EELRAELARLEAELERLEARLDALREELDELEAQIRG---------------------NGGDRLEQLEREIERLERELEE 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 293 LTMDMKVKEDQIRELELKVQELRKykenekdtevlmsALSAMQDKTQHLENSLSAETRiklDLFSALGDAKRQLEIAQGQ 372
Cdd:COG4913 357 RERRRARLEALLAALGLPLPASAE-------------EFAALRAEAAALLEALEEELE---ALEEALAEAEAALRDLRRE 420
|
250
....*....|....
gi 543173121 373 ILQKDQEIKDLKQK 386
Cdd:COG4913 421 LRELEAEIASLERR 434
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
163-437 |
6.34e-04 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 42.37 E-value: 6.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 163 IKKLKADLQASRQVEQELRSQI-----SSLSSTERgiRSEMGQLRQENELLQNKLHNAVQMKQKDKQ--------NISQL 229
Cdd:COG5022 774 IQVIQHGFRLRRLVDYELKWRLfiklqPLLSLLGS--RKEYRSYLACIIKLQKTIKREKKLRETEEVefslkaevLIQKF 851
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 230 EKKLKAEQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLrnriRELEAEGKKLTMDmkVKEDQIRELEL 309
Cdd:COG5022 852 GRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVN----LELESEIIELKKS--LSSDLIENLEF 925
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 310 KVQELRKYKENeKDTEVLMSALSAMQDKTQHLENSLSAETRIKlDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:COG5022 926 KTELIARLKKL-LNNIDLEEGPSIEYVKLPELNKLHEVESKLK-ETSEEYEDLLKKSTILVREGNKANSELKNFKKELAE 1003
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 543173121 390 VMAVMPSITYSAATSPLSPVSP---HYSSKFVETSPSGLdpnaSVYQPLKK 437
Cdd:COG5022 1004 LSKQYGALQESTKQLKELPVEVaelQSASKIISSESTEL----SILKPLQK 1050
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
159-390 |
1.08e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.04 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQnklhnavQMKQKDKQNISQLEKKLKAEQE 238
Cdd:COG4372 78 LEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLE-------QQRKQLEAQIAELQSEIAEREE 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLtMDMKVKEDQIRELELKVQELRKYK 318
Cdd:COG4372 151 ELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEA-EKLIESLPRELAEELLEAKDSLEA 229
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 543173121 319 ENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAEV 390
Cdd:COG4372 230 KLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALL 301
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
166-390 |
1.40e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 41.36 E-value: 1.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 166 LKADLQA-----SRQVEQELRSQISSLSST-------ERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKL 233
Cdd:pfam12128 583 VKLDLKRidvpeWAASEEELRERLDKAEEAlqsarekQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRLFDEK 662
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 234 KAEQEArsfVEKQLMEEKKRKKLeeataaravafaaasrgectetlrnRIRELEAEGKKLTMDMKVKEDQIRE--LELKV 311
Cdd:pfam12128 663 QSEKDK---KNKALAERKDSANE-------------------------RLNSLEAQLKQLDKKHQAWLEEQKEqkREART 714
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 312 QELRKYKENEKDTEVLMSALSAMQDK--TQHLENSLSAETRIKLDLFSalgdakrqLEIAQGQILQKDQEIKDLKQKIAE 389
Cdd:pfam12128 715 EKQAYWQVVEGALDAQLALLKAAIAArrSGAKAELKALETWYKRDLAS--------LGVDPDVIAKLKREIRTLERKIER 786
|
.
gi 543173121 390 V 390
Cdd:pfam12128 787 I 787
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
161-393 |
1.47e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 41.26 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 161 QDIKKLKADLQAS----RQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQ------NKLHNAVQMKQKDKQNISQLE 230
Cdd:pfam15921 461 EKVSSLTAQLESTkemlRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEatnaeiTKLRSRVDLKLQELQHLKNEG 540
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 231 KKLKAEQEARSFVEKQlMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLTMDMK-------VKEDQ 303
Cdd:pfam15921 541 DHLRNVQTECEALKLQ-MAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQefkilkdKKDAK 619
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 304 IRELELKVQ--ELRKYKENEKDTEVLMSALSAMQDKTQHLE---------NSLSAETRI--------KLDLFSALGDAKR 364
Cdd:pfam15921 620 IRELEARVSdlELEKVKLVNAGSERLRAVKDIKQERDQLLNevktsrnelNSLSEDYEVlkrnfrnkSEEMETTTNKLKM 699
|
250 260
....*....|....*....|....*....
gi 543173121 365 QLEIAQGQILQKDQEIKDLKQKIAEVMAV 393
Cdd:pfam15921 700 QLKSAQSELEQTRNTLKSMEGSDGHAMKV 728
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
159-398 |
1.53e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 40.77 E-value: 1.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAvqmkqkdKQNISQLEKKLKAEQE 238
Cdd:TIGR04523 389 LESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDL-------TNQDSVKELIIKNLDN 461
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGEcTETLRNRIRELEAEGKKLTmdmkvkeDQIRELELKVQELRKYK 318
Cdd:TIGR04523 462 TRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKE-LKKLNEEKKELEEKVKDLT-------KKISSLKEKIEKLESEK 533
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 319 eNEKDTEvlmsaLSAMQDKTQHLE-----NSLSAETRIKLDLFSALGDAKRQLEIAQGQ----ILQKDQEIKDLKQKIAE 389
Cdd:TIGR04523 534 -KEKESK-----ISDLEDELNKDDfelkkENLEKEIDEKNKEIEELKQTQKSLKKKQEEkqelIDQKEKEKKDLIKEIEE 607
|
....*....
gi 543173121 390 VMAVMPSIT 398
Cdd:TIGR04523 608 KEKKISSLE 616
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
159-351 |
1.85e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 40.79 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQE 238
Cdd:PRK02224 319 LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGEcTETLRNRIRE---LEAEGKKLTMDMKVKE----DQIRELELKV 311
Cdd:PRK02224 399 RFGDAPVDLGNAEDFLEELREERDELREREAELEAT-LRTARERVEEaeaLLEAGKCPECGQPVEGsphvETIEEDRERV 477
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 543173121 312 QELrkykenEKDTEVLMSALSAMQDKTQHLENSLSAETRI 351
Cdd:PRK02224 478 EEL------EAELEDLEEEVEEVEERLERAEDLVEAEDRI 511
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
159-388 |
1.85e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 40.72 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERgIRSEMGQLRQENELLQNKLhnAVQMKQKDKQNISQLEKKLKAEQE 238
Cdd:TIGR00618 224 LEKELKHLREALQQTQQSHAYLTQKREAQEEQLK-KQQLLKQLRARIEELRAQE--AVLEETQERINRARKAAPLAAHIK 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 239 ARSFVEKQlmeekkrkkleeataaravafaaasRGECTETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRKYK 318
Cdd:TIGR00618 301 AVTQIEQQ-------------------------AQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEI 355
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 543173121 319 ENEKDTEVLMSALsAMQDKTQHLENSLSAETRIK---LDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIA 388
Cdd:TIGR00618 356 HIRDAHEVATSIR-EISCQQHTLTQHIHTLQQQKttlTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLA 427
|
|
| COG4026 |
COG4026 |
Uncharacterized conserved protein, contains TOPRIM domain, potential nuclease [General ... |
159-232 |
2.16e-03 |
|
Uncharacterized conserved protein, contains TOPRIM domain, potential nuclease [General function prediction only];
Pssm-ID: 443204 [Multi-domain] Cd Length: 287 Bit Score: 39.71 E-value: 2.16e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKK 232
Cdd:COG4026 133 LREELLELKEKIDEIAKEKEKLTKENEELESELEELREEYKKLREENSILEEEFDNIKSEYSDLKSRFEELLKK 206
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
279-348 |
2.26e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 40.23 E-value: 2.26e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 543173121 279 LRNRIRELEAEGKKLTMDMKVKEDQIRELELKVQELRK--YKENEKDTEVlmsalSAMQDKTQHLENSLSAE 348
Cdd:COG2433 418 LEEQVERLEAEVEELEAELEEKDERIERLERELSEARSeeRREIRKDREI-----SRLDREIERLERELEEE 484
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
159-394 |
4.65e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 39.33 E-value: 4.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADL-QASRQVE---QELRSQISSLSSTERGIRSEMGQLRQENELLQNKLhnavqmkQKDKQNISQLEKKLK 234
Cdd:pfam15921 322 LESTVSQLRSELrEAKRMYEdkiEELEKQLVLANSELTEARTERDQFSQESGNLDDQL-------QKLLADLHKREKELS 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 235 AEQEARSFVEKQLM-------EEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGK------KLTMDMKVKE 301
Cdd:pfam15921 395 LEKEQNKRLWDRDTgnsitidHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNEslekvsSLTAQLESTK 474
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 302 DQIRELelkVQELRKYKENEKDTEVLMSALSA-MQDKTQHLE--NSLSAETRIKLDLF-----------SALGDAKRQLE 367
Cdd:pfam15921 475 EMLRKV---VEELTAKKMTLESSERTVSDLTAsLQEKERAIEatNAEITKLRSRVDLKlqelqhlknegDHLRNVQTECE 551
|
250 260
....*....|....*....|....*..
gi 543173121 368 IAQGQILQKDQEIKDLKQKIAEVMAVM 394
Cdd:pfam15921 552 ALKLQMAEKDKVIEILRQQIENMTQLV 578
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
159-390 |
5.15e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 39.28 E-value: 5.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQ-ELRSQISSLSSTERGIRSEMGQLRQENELLQNKLHNAVQMKQKDKQNISQLEKKLKAEQ 237
Cdd:TIGR02169 270 IEQLLEELNKKIKDLGEEEQlRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 238 EARSfvekQLMEEkkrkkleeataaravafaAASRGECTETLRNRIRELEAEGKKL---TMDMKVK-EDQIREL-ELKVQ 312
Cdd:TIGR02169 350 KRRD----KLTEE------------------YAELKEELEDLRAELEEVDKEFAETrdeLKDYREKlEKLKREInELKRE 407
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 543173121 313 ELRKYKENEKdtevLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQGQILQKDQEIKDLKQKIAEV 390
Cdd:TIGR02169 408 LDRLQEELQR----LSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRV 481
|
|
| HMMR_N |
pfam15905 |
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ... |
159-386 |
8.20e-03 |
|
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.
Pssm-ID: 464932 [Multi-domain] Cd Length: 329 Bit Score: 38.25 E-value: 8.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 159 LEQDIKKLKADLQASRQVEQELRSQISSLsstERgirsEMGQLRQENELLQNK-------------------LHNAVQMK 219
Cdd:pfam15905 99 LEEELEKVEAKLNAAVREKTSLSASVASL---EK----QLLELTRVNELLKAKfsedgtqkkmsslsmelmkLRNKLEAK 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 220 QK---DKQniSQLEKKLKAEQearsfveKQLMEEKKRKKLEEATAARAVAFAAASRGEcTETLRNRIRELEAEGKKLTMd 296
Cdd:pfam15905 172 MKevmAKQ--EGMEGKLQVTQ-------KNLEHSKGKVAQLEEKLVSTEKEKIEEKSE-TEKLLEYITELSCVSEQVEK- 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 297 MKVKEDQIRE-LELKVQELRKYKENEKDTEVLMSALSAMQD-KTQHLENSLSAETRIKLDLFSALgdaKRQLEIAQGQIL 374
Cdd:pfam15905 241 YKLDIAQLEElLKEKNDEIESLKQSLEEKEQELSKQIKDLNeKCKLLESEKEELLREYEEKEQTL---NAELEELKEKLT 317
|
250
....*....|..
gi 543173121 375 QKDQEIKDLKQK 386
Cdd:pfam15905 318 LEEQEHQKLQQK 329
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
161-330 |
8.64e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 38.60 E-value: 8.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 161 QDIKKLKADLQASRQVEQELRSQISSLSSTERGI---RSEMGQLRQENELLQN--KLHNAVQMKQKDKQNISQLEKKLKA 235
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEELEELEEELeelEAELEELREELEKLEKllQLLPLYQELEALEAELAELPERLEE 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 543173121 236 ----------EQEARSFVEKQLMEEKKRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLTMDMKVKEDQIR 305
Cdd:COG4717 151 leerleelreLEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELE 230
|
170 180
....*....|....*....|....*
gi 543173121 306 ELELKVQELRKYKENEKDTEVLMSA 330
Cdd:COG4717 231 QLENELEAAALEERLKEARLLLLIA 255
|
|
|