View
Concise Results
Standard Results
Full Results
janus kinase and microtubule-interacting protein 1 isoform 1 [Mus musculus]
Protein Classification
List of domain hits
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
415-612
1.56e-74
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
:Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 241.32
E-value: 1.56e-74
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 415 ER ER L L R SKRH R G K SL K PPKKH - VVETFFG F DEE - SVDSET l S ET SYNTDRTD R TPATP E EDLDE TTTR EEA D LRF C QLT 492
Cdd:pfam16034 1 ER DK L I R ARKQ R R K EK K KKLRD m VVETFFG Y DEE a SVDSET - S SL SYNTDRTD S TPATP D EDLDE GVAA EEA E LRF R QLT 79
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 493 REYQALQRAYALLQEQ V GGTLDAEREA R TREQLQA D L L R C Q AK IEDLE KL L VE KGQD AA WVEEKQ V L M R T NQ D L L EKI YR 572
Cdd:pfam16034 80 REYQALQRAYALLQEQ S GGTLDAEREA K TREQLQA E L R R A Q SR IEDLE SA L AQ KGQD SK WVEEKQ A L I R R NQ E L V EKI RI 159
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1104469519 573 L E M EE NQ LK S E M QDA K DQNELLEFR V LELEERERRSPA F N 612
Cdd:pfam16034 160 M E Q EE GR LK N E I QDA R DQNELLEFR I LELEERERRSPA I N 199
Smc super family
cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
24-607
4.18e-11
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
The actual alignment was detected with superfamily member COG1196 :Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 66.88
E-value: 4.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 24 E EL R A K L TNIQIEFQQEKSKVGKLRERLQ EA K LE REQEQRRHTAYIS E LKAKLH EE KTK EL QALREAL i RQHEQ E A AR TA 103
Cdd:COG1196 223 K EL E A E L LLLKLRELEAELEELEAELEEL EA E LE ELEAELAELEAEL E ELRLEL EE LEL EL EEAQAEE - YELLA E L AR LE 301
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 104 KIKEGELQ R LQA tlnvlrdgaadkvktal L ADAR EE ARRTFDGERQR L QQEIL EL KAARKQA EEA lsncm QADKAK A ADL 183
Cdd:COG1196 302 QDIARLEE R RRE ----------------- L EERL EE LEEELAELEEE L EELEE EL EELEEEL EEA ----- EEELEE A EAE 359
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 184 R A AYQAHQD E VHRIKR E C E RDIRR L MD E IKGKE R VILA L EKE L GVQTGQTQR LL LQK E A L D E Q L VQVK EA ERHHSSPKR E 263
Cdd:COG1196 360 L A EAEEALL E AEAELA E A E EELEE L AE E LLEAL R AAAE L AAQ L EELEEAEEA LL ERL E R L E E E L EELE EA LAELEEEEE E 439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 264 L ppgigdmaelmggqdq HMDERDVRRFQLKIA E LNSVIRK L EDRNTLL A DERNEL L KRSR E TEVQLKPLVEKNKRMNKKN 343
Cdd:COG1196 440 E ---------------- EEALEEAAEEEAELE E EEEALLE L LAELLEE A ALLEAA L AELL E ELAEAAARLLLLLEAEADY 503
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 344 E EL L HSIQRMEEKLK -------- SLTRENVEMK E KLSAQ A SLKRHTSLNDLSLTRDEQE IE F L RLQVLEQQHVIDDLSLE 415
Cdd:COG1196 504 E GF L EGVKAALLLAG lrglagav AVLIGVEAAY E AALEA A LAAALQNIVVEDDEVAAAA IE Y L KAAKAGRATFLPLDKIR 583
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 416 RERL L RSKRH RG KSLKPPKKHVVETFFGFDEES V DSE TL SETSYNTD R TDRTPATPEEDLDETTTREEADLRFCQLTREY 495
Cdd:COG1196 584 ARAA L AAALA RG AIGAAVDLVASDLREADARYY V LGD TL LGRTLVAA R LEAALRRAVTLAGRLREVTLEGEGGSAGGSLT 663
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 496 QALQ R AYALLQEQ vggt LD AE R E ARTREQLQAD L LRCQ A KIEDL E KLLVEKGQDAAWV EE KQVLMRTNQD L LEKIYR L EM 575
Cdd:COG1196 664 GGSR R ELLAALLE ---- AE AE L E ELAERLAEEE L ELEE A LLAEE E EERELAEAEEERL EE ELEEEALEEQ L EAEREE L LE 739
570 580 590
....*....|....*....|....*....|..
gi 1104469519 576 E ENQLKSEMQDAKDQNELLEFRVL ELE ERER R 607
Cdd:COG1196 740 E LLEEEELLEEEALEELPEPPDLE ELE RELE R 771
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
415-612
1.56e-74
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 241.32
E-value: 1.56e-74
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 415 ER ER L L R SKRH R G K SL K PPKKH - VVETFFG F DEE - SVDSET l S ET SYNTDRTD R TPATP E EDLDE TTTR EEA D LRF C QLT 492
Cdd:pfam16034 1 ER DK L I R ARKQ R R K EK K KKLRD m VVETFFG Y DEE a SVDSET - S SL SYNTDRTD S TPATP D EDLDE GVAA EEA E LRF R QLT 79
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 493 REYQALQRAYALLQEQ V GGTLDAEREA R TREQLQA D L L R C Q AK IEDLE KL L VE KGQD AA WVEEKQ V L M R T NQ D L L EKI YR 572
Cdd:pfam16034 80 REYQALQRAYALLQEQ S GGTLDAEREA K TREQLQA E L R R A Q SR IEDLE SA L AQ KGQD SK WVEEKQ A L I R R NQ E L V EKI RI 159
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1104469519 573 L E M EE NQ LK S E M QDA K DQNELLEFR V LELEERERRSPA F N 612
Cdd:pfam16034 160 M E Q EE GR LK N E I QDA R DQNELLEFR I LELEERERRSPA I N 199
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
24-607
4.18e-11
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 66.88
E-value: 4.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 24 E EL R A K L TNIQIEFQQEKSKVGKLRERLQ EA K LE REQEQRRHTAYIS E LKAKLH EE KTK EL QALREAL i RQHEQ E A AR TA 103
Cdd:COG1196 223 K EL E A E L LLLKLRELEAELEELEAELEEL EA E LE ELEAELAELEAEL E ELRLEL EE LEL EL EEAQAEE - YELLA E L AR LE 301
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 104 KIKEGELQ R LQA tlnvlrdgaadkvktal L ADAR EE ARRTFDGERQR L QQEIL EL KAARKQA EEA lsncm QADKAK A ADL 183
Cdd:COG1196 302 QDIARLEE R RRE ----------------- L EERL EE LEEELAELEEE L EELEE EL EELEEEL EEA ----- EEELEE A EAE 359
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 184 R A AYQAHQD E VHRIKR E C E RDIRR L MD E IKGKE R VILA L EKE L GVQTGQTQR LL LQK E A L D E Q L VQVK EA ERHHSSPKR E 263
Cdd:COG1196 360 L A EAEEALL E AEAELA E A E EELEE L AE E LLEAL R AAAE L AAQ L EELEEAEEA LL ERL E R L E E E L EELE EA LAELEEEEE E 439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 264 L ppgigdmaelmggqdq HMDERDVRRFQLKIA E LNSVIRK L EDRNTLL A DERNEL L KRSR E TEVQLKPLVEKNKRMNKKN 343
Cdd:COG1196 440 E ---------------- EEALEEAAEEEAELE E EEEALLE L LAELLEE A ALLEAA L AELL E ELAEAAARLLLLLEAEADY 503
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 344 E EL L HSIQRMEEKLK -------- SLTRENVEMK E KLSAQ A SLKRHTSLNDLSLTRDEQE IE F L RLQVLEQQHVIDDLSLE 415
Cdd:COG1196 504 E GF L EGVKAALLLAG lrglagav AVLIGVEAAY E AALEA A LAAALQNIVVEDDEVAAAA IE Y L KAAKAGRATFLPLDKIR 583
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 416 RERL L RSKRH RG KSLKPPKKHVVETFFGFDEES V DSE TL SETSYNTD R TDRTPATPEEDLDETTTREEADLRFCQLTREY 495
Cdd:COG1196 584 ARAA L AAALA RG AIGAAVDLVASDLREADARYY V LGD TL LGRTLVAA R LEAALRRAVTLAGRLREVTLEGEGGSAGGSLT 663
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 496 QALQ R AYALLQEQ vggt LD AE R E ARTREQLQAD L LRCQ A KIEDL E KLLVEKGQDAAWV EE KQVLMRTNQD L LEKIYR L EM 575
Cdd:COG1196 664 GGSR R ELLAALLE ---- AE AE L E ELAERLAEEE L ELEE A LLAEE E EERELAEAEEERL EE ELEEEALEEQ L EAEREE L LE 739
570 580 590
....*....|....*....|....*....|..
gi 1104469519 576 E ENQLKSEMQDAKDQNELLEFRVL ELE ERER R 607
Cdd:COG1196 740 E LLEEEELLEEEALEELPEPPDLE ELE RELE R 771
SMC_prok_A
TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
6-363
3.72e-08
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain]
Cd Length: 1164
Bit Score: 57.39
E-value: 3.72e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 6 R S KGDKP E AET D SVQMANEELRA K LTN I QI E FQ Q EKSKVG KL R ERL Q E AKLEREQEQRRHTAYI SELK aklhe E KTKELQ 85
Cdd:TIGR02169 694 Q S ELRRI E NRL D ELSQELSDASR K IGE I EK E IE Q LEQEEE KL K ERL E E LEEDLSSLEQEIENVK SELK ----- E LEARIE 768
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 86 A L R E A L IRQH E QEAART A KIKEGELQRL QA T L NV L RDGAADKVKTALLADAREEAR ---- RTFDG E R Q R LQ QEILE LK AA 161
Cdd:TIGR02169 769 E L E E D L HKLE E ALNDLE A RLSHSRIPEI QA E L SK L EEEVSRIEARLREIEQKLNRL tlek EYLEK E I Q E LQ EQRID LK EQ 848
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 162 R K QA E EALS N c MQAD K AKAADLRAAYQ A HQDEVHRIKRECERDIRR L MDEIKGK ER V I LA LE KELGVQTGQTQR L LLQK E 241
Cdd:TIGR02169 849 I K SI E KEIE N - LNGK K EELEEELEELE A ALRDLESRLGDLKKERDE L EAQLREL ER K I EE LE AQIEKKRKRLSE L KAKL E 927
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 242 AL D E Q L VQVKEAERHHSSPKR E LP pgigdmaelmggqdqhmder DVRRF Q LKIAELNSV IR K LE DR N T L LAD E RN E L LKR 321
Cdd:TIGR02169 928 AL E E E L SEIEDPKGEDEEIPE E EL -------------------- SLEDV Q AELQRVEEE IR A LE PV N M L AIQ E YE E V LKR 987
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1104469519 322 sretevq L KP L V EK NKRMNKKNEEL L HS I QRM E E K LKSLTR E 363
Cdd:TIGR02169 988 ------- L DE L K EK RAKLEEERKAI L ER I EEY E K K KREVFM E 1022
PTZ00121
PTZ00121
MAEBL; Provisional
3-467
3.85e-06
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 50.91
E-value: 3.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 3 K KGRSKGDKP EA ET D SVQM A N E ELR A K ltni QIEFQQE K S K VGKLRERLQ E A K ---- LEREQ E QRRHT A yi S ELK AKLHE 78
Cdd:PTZ00121 1343 K AAEAAKAEA EA AA D EAEA A E E KAE A A ---- EKKKEEA K K K ADAAKKKAE E K K kade AKKKA E EDKKK A -- D ELK KAAAA 1416
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 79 E K TKE lqalr EA LIRQH E QEA A RT AK I K EG E LQRLQA tlnv LRDG A ADKV K TALLADAR EEA RRT fd G E RQRLQQ E ILEL 158
Cdd:PTZ00121 1417 K K KAD ----- EA KKKAE E KKK A DE AK K K AE E AKKADE ---- AKKK A EEAK K AEEAKKKA EEA KKA -- D E AKKKAE E AKKA 1485
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 159 KA A R K Q AEEA LSNCMQ A D KA ----- KA ADLRA A YQ A HQ - DE VHRIKRECER D IRRLMD E I K GKERVIL A L E KELGVQTGQ 232
Cdd:PTZ00121 1486 DE A K K K AEEA KKKADE A K KA aeakk KA DEAKK A EE A KK a DE AKKAEEAKKA D EAKKAE E K K KADELKK A E E LKKAEEKKK 1565
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 233 TQRLLLQK E ALDEQ L VQVK EA ERHHSSPKR E LPPGIGDMAELMGGQDQHMD E RDVRRFQ LK I AE ln SVIR K L E DRNTLL A 312
Cdd:PTZ00121 1566 AEEAKKAE E DKNMA L RKAE EA KKAEEARIE E VMKLYEEEKKMKAEEAKKAE E AKIKAEE LK K AE -- EEKK K V E QLKKKE A 1643
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 313 D E R --- N EL L K RSR E TEVQLKPLVE K NKRMN KK N EE LLHSIQRMEEKLKS L TR E NV E M K E kl SAQASL K RHTSLNDLSLT 389
Cdd:PTZ00121 1644 E E K kka E EL K K AEE E NKIKAAEEAK K AEEDK KK A EE AKKAEEDEKKAAEA L KK E AE E A K K -- AEELKK K EAEEKKKAEEL 1721
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1104469519 390 RDEQ E IEFLRLQVLEQQHVI D DLSL E RERLLRSKRHRGKS LK PPKKHVV E TFFGFD E ESVDS E TLS E TSYNTDRT D RT 467
Cdd:PTZ00121 1722 KKAE E ENKIKAEEAKKEAEE D KKKA E EAKKDEEEKKKIAH LK KEEEKKA E EIRKEK E AVIEE E LDE E DEKRRMEV D KK 1799
SMC_N
pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
42-809
6.59e-04
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain]
Cd Length: 1161
Bit Score: 43.42
E-value: 6.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 42 S KVGKLRERLQEAKL E REQEQRRHTAYIS ELK AKLHEE K TKELQ AL REALIRQHEQEAARTAKIKE g E L QRLQATLNV L R 121
Cdd:pfam02463 165 S RLKRKKKEALKKLI E ETENLAELIIDLE ELK LQELKL K EQAKK AL EYYQLKEKLELEEEYLLYLD - Y L KLNEERIDL L Q 243
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 122 DGAA D KVKTALLADARE E ARR tfdg E RQRLQQEILELKAAR K QAE E ALSNCMQADKAKAADLRAAYQAHQDEVHRIKR E C 201
Cdd:pfam02463 244 ELLR D EQEEIESSKQEI E KEE ---- E KLAQVLKENKEEEKE K KLQ E EELKLLAKEEEELKSELLKLERRKVDDEEKLK E S 319
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 202 E RDIRRLMD E I K GKERV I LA LEKEL GVQTGQTQRLLLQK E A L DEQLVQVKEA E RHHSSP K RELPPGIGDM A E L MGGQDQH 281
Cdd:pfam02463 320 E KEKKKAEK E L K KEKEE I EE LEKEL KELEIKREAEEEEE E E L EKLQEKLEQL E EELLAK K KLESERLSSA A K L KEEELEL 399
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 282 MD E rdvrrfqlkia E LNSVIRK LE DRNT L LADERN E LLKRSRET E VQLKPLVE K NKRMNKKN EEL LHSIQRMEEKLKS L T 361
Cdd:pfam02463 400 KS E ----------- E EKEAQLL LE LARQ L EDLLKE E KKEELEIL E EEEESIEL K QGKLTEEK EEL EKQELKLLKDELE L K 468
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 362 RENVEM KE KLSA qasl K RHTS L ND L SLTRDEQ E IEFLRLQVLEQQH V IDD L SLERERLLRSKR H RGKSLKPP kkh V VE TF 441
Cdd:pfam02463 469 KSEDLL KE TQLV ---- K LQEQ L EL L LSRQKLE E RSQKESKARSGLK V LLA L IKDGVGGRIISA H GRLGDLGV --- A VE NY 541
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 442 FGFDEES V DS E TLSETSYNTD R TDRTP A TP E ED L DETTT R EEADLRFCQ L t REYQA L QRAYA L LQE Q VGGTLDAER E ART 521
Cdd:pfam02463 542 KVAISTA V IV E VSATADEVEE R QKLVR A LT E LP L GARKL R LLIPKLKLP L - KSIAV L EIDPI L NLA Q LDKATLEAD E DDK 620
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 522 R EQLQADL L RCQAKIEDL E KLLVEKGQDAAW V EEKQV L M ---- RTNQDLLEKIYR LE ME E N Q L K S E MQD AK dqne LLEF R 597
Cdd:pfam02463 621 R AKVVEGI L KDTELTKLK E SAKAKESGLRKG V SLEEG L A ekse VKASLSELTKEL LE IQ E L Q E K A E SEL AK ---- EEIL R 696
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 598 VLELEERERRSPAFN L QITTFPENNSS A LQLFCH Q EGVKGVNISELM K NLDILGDNGNL R NEEQVAVIQAGTVLALCEKW 677
Cdd:pfam02463 697 RQLEIKKKEQREKEE L KKLKLEAEELL A DRVQEA Q DKINEELKLLKQ K IDEEEEEEEKS R LKKEEKEEEKSELSLKEKEL 776
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 678 LKQI E G TE AALTQKMMDLEKEKDQFSRQKGYL E QELDYRKEALD Q AYLVGP EKI QDL E ATLYNALQQ E PGRRASE A LSAS 757
Cdd:pfam02463 777 AEER E K TE KLKVEEEKEEKLKAQEEELRALEE E LKEEAELLEEE Q LLIEQE EKI KEE E LEELALELK E EQKLEKL A EEEL 856
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|..
gi 1104469519 758 Q R EDLQAAV E KVRRQ LL RQSR E FDS Q I L RERM E LLQQAQQKIREMEGKLELQ 809
Cdd:pfam02463 857 E R LEEEITK E ELLQE LL LKEE E LEE Q K L KDEL E SKEEKEKEEKKELEEESQK 908
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
23-374
3.98e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 40.77
E-value: 3.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 23 N EE L RA KL TN I QI E F ------ QQ EKS KV --- G K LRER L QE A KLEREQEQRRHTAYIS E LKA K L h EE KT K ELQALR E ALI R 93
Cdd:NF033838 86 N VA L NK KL SD I KT E Y lyelnv LK EKS EA elt S K TKKE L DA A FEQFKKDTLEPGKKVA E ATK K V - EE AE K KAKDQK E EDR R 164
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 94 QHEQEAAR T --------- AKI K EG EL QRLQATLNVL RD ga AD K V K T A lla D A RE E ARRT fdg E RQ RL Q qeil EL K AA R KQ 164
Cdd:NF033838 165 NYPTNTYK T leleiaesd VEV K KA EL ELVKEEAKEP RD -- EE K I K Q A --- K A KV E SKKA --- E AT RL E ---- KI K TD R EK 232
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 165 AEE ALSNCMQ A DKAK A ADLRA A YQ ahqd E VHRI KR ECE R DIRRLMDEIKG KE RVILALEKEL G VQ T GQTQR L LLQ K eald 244
Cdd:NF033838 233 AEE EAKRRAD A KLKE A VEKNV A TS ---- E QDKP KR RAK R GVLGEPATPDK KE NDAKSSDSSV G EE T LPSPS L KPE K ---- 304
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 245 eqlv Q V K EAE RHHSSP K REL ppgigdmaelmgg Q DQ hm D E R D V R RF -------- Q L K IAE LNSVIRKL E DR nt L LAD E RN 316
Cdd:NF033838 305 ---- K V A EAE KKVEEA K KKA ------------- K DQ -- K E E D R R NY ptntyktl E L E IAE SDVKVKEA E LE -- L VKE E AK 363
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 1104469519 317 E ll K R SR E TEV Q L K PL VE KN K RMNKKN E ELLHSIQRM EE KL K SLTR E NVEM KEK LSA Q 374
Cdd:NF033838 364 E -- P R NE E KIK Q A K AK VE SK K AEATRL E KIKTDRKKA EE EA K RKAA E EDKV KEK PAE Q 419
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
415-612
1.56e-74
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 241.32
E-value: 1.56e-74
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 415 ER ER L L R SKRH R G K SL K PPKKH - VVETFFG F DEE - SVDSET l S ET SYNTDRTD R TPATP E EDLDE TTTR EEA D LRF C QLT 492
Cdd:pfam16034 1 ER DK L I R ARKQ R R K EK K KKLRD m VVETFFG Y DEE a SVDSET - S SL SYNTDRTD S TPATP D EDLDE GVAA EEA E LRF R QLT 79
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 493 REYQALQRAYALLQEQ V GGTLDAEREA R TREQLQA D L L R C Q AK IEDLE KL L VE KGQD AA WVEEKQ V L M R T NQ D L L EKI YR 572
Cdd:pfam16034 80 REYQALQRAYALLQEQ S GGTLDAEREA K TREQLQA E L R R A Q SR IEDLE SA L AQ KGQD SK WVEEKQ A L I R R NQ E L V EKI RI 159
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1104469519 573 L E M EE NQ LK S E M QDA K DQNELLEFR V LELEERERRSPA F N 612
Cdd:pfam16034 160 M E Q EE GR LK N E I QDA R DQNELLEFR I LELEERERRSPA I N 199
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
24-607
4.18e-11
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 66.88
E-value: 4.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 24 E EL R A K L TNIQIEFQQEKSKVGKLRERLQ EA K LE REQEQRRHTAYIS E LKAKLH EE KTK EL QALREAL i RQHEQ E A AR TA 103
Cdd:COG1196 223 K EL E A E L LLLKLRELEAELEELEAELEEL EA E LE ELEAELAELEAEL E ELRLEL EE LEL EL EEAQAEE - YELLA E L AR LE 301
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 104 KIKEGELQ R LQA tlnvlrdgaadkvktal L ADAR EE ARRTFDGERQR L QQEIL EL KAARKQA EEA lsncm QADKAK A ADL 183
Cdd:COG1196 302 QDIARLEE R RRE ----------------- L EERL EE LEEELAELEEE L EELEE EL EELEEEL EEA ----- EEELEE A EAE 359
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 184 R A AYQAHQD E VHRIKR E C E RDIRR L MD E IKGKE R VILA L EKE L GVQTGQTQR LL LQK E A L D E Q L VQVK EA ERHHSSPKR E 263
Cdd:COG1196 360 L A EAEEALL E AEAELA E A E EELEE L AE E LLEAL R AAAE L AAQ L EELEEAEEA LL ERL E R L E E E L EELE EA LAELEEEEE E 439
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 264 L ppgigdmaelmggqdq HMDERDVRRFQLKIA E LNSVIRK L EDRNTLL A DERNEL L KRSR E TEVQLKPLVEKNKRMNKKN 343
Cdd:COG1196 440 E ---------------- EEALEEAAEEEAELE E EEEALLE L LAELLEE A ALLEAA L AELL E ELAEAAARLLLLLEAEADY 503
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 344 E EL L HSIQRMEEKLK -------- SLTRENVEMK E KLSAQ A SLKRHTSLNDLSLTRDEQE IE F L RLQVLEQQHVIDDLSLE 415
Cdd:COG1196 504 E GF L EGVKAALLLAG lrglagav AVLIGVEAAY E AALEA A LAAALQNIVVEDDEVAAAA IE Y L KAAKAGRATFLPLDKIR 583
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 416 RERL L RSKRH RG KSLKPPKKHVVETFFGFDEES V DSE TL SETSYNTD R TDRTPATPEEDLDETTTREEADLRFCQLTREY 495
Cdd:COG1196 584 ARAA L AAALA RG AIGAAVDLVASDLREADARYY V LGD TL LGRTLVAA R LEAALRRAVTLAGRLREVTLEGEGGSAGGSLT 663
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 496 QALQ R AYALLQEQ vggt LD AE R E ARTREQLQAD L LRCQ A KIEDL E KLLVEKGQDAAWV EE KQVLMRTNQD L LEKIYR L EM 575
Cdd:COG1196 664 GGSR R ELLAALLE ---- AE AE L E ELAERLAEEE L ELEE A LLAEE E EERELAEAEEERL EE ELEEEALEEQ L EAEREE L LE 739
570 580 590
....*....|....*....|....*....|..
gi 1104469519 576 E ENQLKSEMQDAKDQNELLEFRVL ELE ERER R 607
Cdd:COG1196 740 E LLEEEELLEEEALEELPEPPDLE ELE RELE R 771
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
13-541
1.05e-09
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 62.26
E-value: 1.05e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 13 EAE TDSVQMAN EEL R A K L TNIQI E FQQEKSKVGK L RER L Q EA --------- K L E R EQEQRRHTAYISELKAKLH EE KTK E 83
Cdd:COG1196 245 EAE LEELEAEL EEL E A E L AELEA E LEELRLELEE L ELE L E EA qaeeyella E L A R LEQDIARLEERRRELEERL EE LEE E 324
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 84 L QA L R E A L IRQH E QEAARTAKIK E g ELQR L QATLNV L RDGAADKVKTALLADAR EE ARRTFDG E RQRLQQEIL EL K A ARK 163
Cdd:COG1196 325 L AE L E E E L EELE E ELEELEEELE E - AEEE L EEAEAE L AEAEEALLEAEAELAEA EE ELEELAE E LLEALRAAA EL A A QLE 403
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 164 QA EEA LSN c MQADKAKAADLRAAYQAHQD E VHRIKR E C E RDIRRLMD E IKGK E RVIL AL EKE L GVQTGQTQR L LLQKEA L 243
Cdd:COG1196 404 EL EEA EEA - LLERLERLEEELEELEEALA E LEEEEE E E E EALEEAAE E EAEL E EEEE AL LEL L AELLEEAAL L EAALAE L 482
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 244 D E Q L VQVKEAERHHSSPKRELPPGIGDMAELMGGQDQHMDERD V RRFQLKI A ELNSVIRKLE -------- DRNTLL A DER 315
Cdd:COG1196 483 L E E L AEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGA V AVLIGVE A AYEAALEAAL aaalqniv VEDDEV A AAA 562
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 316 N E L LK RSRETEVQLK PL vek N K RMNKKNEELLHSIQRMEEKLKSLTRENV E MKEKLSAQASLKRHTS L NDLS L TRDEQEI 395
Cdd:COG1196 563 I E Y LK AAKAGRATFL PL --- D K IRARAALAAALARGAIGAAVDLVASDLR E ADARYYVLGDTLLGRT L VAAR L EAALRRA 639
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 396 EF L RLQVL E QQHVIDDL S LERERLLR S K R HRGKS L KPPKKHVV E tffg FD E ESVDS E TLS E TSYNTDRTDRTPATPE E DL 475
Cdd:COG1196 640 VT L AGRLR E VTLEGEGG S AGGSLTGG S R R ELLAA L LEAEAELE E ---- LA E RLAEE E LEL E EALLAEEEEERELAEA E EE 715
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1104469519 476 DETTTR EE AD L RFCQLTREYQA L QRAYALLQEQVGGT L DAER E ARTR E Q L QAD L L R CQAK IE D L EK 541
Cdd:COG1196 716 RLEEEL EE EA L EEQLEAEREEL L EELLEEEELLEEEA L EELP E PPDL E E L ERE L E R LERE IE A L GP 781
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
44-605
3.07e-08
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 57.64
E-value: 3.07e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 44 VG K LR ER LQ EA KLER E QEQR --- R HTAYIS EL KAK L HE ----- EK TKELQA L R E A L IRQHEQEAARTAKIK E G EL QR L Q A 115
Cdd:COG1196 167 IS K YK ER KE EA ERKL E ATEE nle R LEDILG EL ERQ L EP lerqa EK AERYRE L K E E L KELEAELLLLKLREL E A EL EE L E A 246
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 116 T L NV L RDGAADKVKTALLAD A RE E AR R ---- TFDG E RQRL Q Q E IL EL K A ARKQA E EALSNCMQADKAKAAD L RAAYQAHQ 191
Cdd:COG1196 247 E L EE L EAELEELEAELAELE A EL E EL R lele ELEL E LEEA Q A E EY EL L A ELARL E QDIARLEERRRELEER L EELEEELA 326
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 192 DEVHRIKRE c E RDIRR L MD E IKGK E RVILAL E K EL GVQTGQTQRLLLQKEALD E Q L VQVK E AERHHSSPKR EL PPGIGDM 271
Cdd:COG1196 327 ELEEELEEL - E EELEE L EE E LEEA E EELEEA E A EL AEAEEALLEAEAELAEAE E E L EELA E ELLEALRAAA EL AAQLEEL 405
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 272 A E LMGGQDQHMDERDVRRFQ L KI A ELNSVIRKL E DRNT L LAD -- E RN EL LKRSRETEVQ L KP L V E KNKRMNKKNE ELL HS 349
Cdd:COG1196 406 E E AEEALLERLERLEEELEE L EE A LAELEEEEE E EEEA L EEA ae E EA EL EEEEEALLEL L AE L L E EAALLEAALA ELL EE 485
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 350 IQRMEEK L KS L TRENVEMKEK L SAQASLKRHTS L ND L SLT --------- RD E QEI E f LR L QVLE Q QH V ID D LSLERERLL 420
Cdd:COG1196 486 LAEAAAR L LL L LEAEADYEGF L EGVKAALLLAG L RG L AGA vavligvea AY E AAL E - AA L AAAL Q NI V VE D DEVAAAAIE 564
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 421 RS K RHRG ----------- KSLKPPKKHVVETFF G FDEES V D S E tlsetsynt D R TDRTPATPEE D LDETT T REE A D L rfc 489
Cdd:COG1196 565 YL K AAKA gratflpldki RARAALAAALARGAI G AAVDL V A S D --------- L R EADARYYVLG D TLLGR T LVA A R L --- 632
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 490 q LTREYQ A LQR A YA L LQEQVG G TLDAEREAR T REQLQAD L LRCQAKIED LE K L LVEKGQDAAWV EE KQVLMRTNQDL L EK 569
Cdd:COG1196 633 - EAALRR A VTL A GR L REVTLE G EGGSAGGSL T GGSRREL L AALLEAEAE LE E L AERLAEEELEL EE ALLAEEEEERE L AE 711
570 580 590
....*....|....*....|....*....|....*.
gi 1104469519 570 IYRLEM EE NQLKSEMQDAKDQNELLEFRV L EL EE RE 605
Cdd:COG1196 712 AEEERL EE ELEEEALEEQLEAEREELLEE L LE EE EL 747
SMC_prok_A
TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
6-363
3.72e-08
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain]
Cd Length: 1164
Bit Score: 57.39
E-value: 3.72e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 6 R S KGDKP E AET D SVQMANEELRA K LTN I QI E FQ Q EKSKVG KL R ERL Q E AKLEREQEQRRHTAYI SELK aklhe E KTKELQ 85
Cdd:TIGR02169 694 Q S ELRRI E NRL D ELSQELSDASR K IGE I EK E IE Q LEQEEE KL K ERL E E LEEDLSSLEQEIENVK SELK ----- E LEARIE 768
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 86 A L R E A L IRQH E QEAART A KIKEGELQRL QA T L NV L RDGAADKVKTALLADAREEAR ---- RTFDG E R Q R LQ QEILE LK AA 161
Cdd:TIGR02169 769 E L E E D L HKLE E ALNDLE A RLSHSRIPEI QA E L SK L EEEVSRIEARLREIEQKLNRL tlek EYLEK E I Q E LQ EQRID LK EQ 848
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 162 R K QA E EALS N c MQAD K AKAADLRAAYQ A HQDEVHRIKRECERDIRR L MDEIKGK ER V I LA LE KELGVQTGQTQR L LLQK E 241
Cdd:TIGR02169 849 I K SI E KEIE N - LNGK K EELEEELEELE A ALRDLESRLGDLKKERDE L EAQLREL ER K I EE LE AQIEKKRKRLSE L KAKL E 927
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 242 AL D E Q L VQVKEAERHHSSPKR E LP pgigdmaelmggqdqhmder DVRRF Q LKIAELNSV IR K LE DR N T L LAD E RN E L LKR 321
Cdd:TIGR02169 928 AL E E E L SEIEDPKGEDEEIPE E EL -------------------- SLEDV Q AELQRVEEE IR A LE PV N M L AIQ E YE E V LKR 987
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1104469519 322 sretevq L KP L V EK NKRMNKKNEEL L HS I QRM E E K LKSLTR E 363
Cdd:TIGR02169 988 ------- L DE L K EK RAKLEEERKAI L ER I EEY E K K KREVFM E 1022
PTZ00121
PTZ00121
MAEBL; Provisional
3-467
3.85e-06
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 50.91
E-value: 3.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 3 K KGRSKGDKP EA ET D SVQM A N E ELR A K ltni QIEFQQE K S K VGKLRERLQ E A K ---- LEREQ E QRRHT A yi S ELK AKLHE 78
Cdd:PTZ00121 1343 K AAEAAKAEA EA AA D EAEA A E E KAE A A ---- EKKKEEA K K K ADAAKKKAE E K K kade AKKKA E EDKKK A -- D ELK KAAAA 1416
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 79 E K TKE lqalr EA LIRQH E QEA A RT AK I K EG E LQRLQA tlnv LRDG A ADKV K TALLADAR EEA RRT fd G E RQRLQQ E ILEL 158
Cdd:PTZ00121 1417 K K KAD ----- EA KKKAE E KKK A DE AK K K AE E AKKADE ---- AKKK A EEAK K AEEAKKKA EEA KKA -- D E AKKKAE E AKKA 1485
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 159 KA A R K Q AEEA LSNCMQ A D KA ----- KA ADLRA A YQ A HQ - DE VHRIKRECER D IRRLMD E I K GKERVIL A L E KELGVQTGQ 232
Cdd:PTZ00121 1486 DE A K K K AEEA KKKADE A K KA aeakk KA DEAKK A EE A KK a DE AKKAEEAKKA D EAKKAE E K K KADELKK A E E LKKAEEKKK 1565
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 233 TQRLLLQK E ALDEQ L VQVK EA ERHHSSPKR E LPPGIGDMAELMGGQDQHMD E RDVRRFQ LK I AE ln SVIR K L E DRNTLL A 312
Cdd:PTZ00121 1566 AEEAKKAE E DKNMA L RKAE EA KKAEEARIE E VMKLYEEEKKMKAEEAKKAE E AKIKAEE LK K AE -- EEKK K V E QLKKKE A 1643
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 313 D E R --- N EL L K RSR E TEVQLKPLVE K NKRMN KK N EE LLHSIQRMEEKLKS L TR E NV E M K E kl SAQASL K RHTSLNDLSLT 389
Cdd:PTZ00121 1644 E E K kka E EL K K AEE E NKIKAAEEAK K AEEDK KK A EE AKKAEEDEKKAAEA L KK E AE E A K K -- AEELKK K EAEEKKKAEEL 1721
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1104469519 390 RDEQ E IEFLRLQVLEQQHVI D DLSL E RERLLRSKRHRGKS LK PPKKHVV E TFFGFD E ESVDS E TLS E TSYNTDRT D RT 467
Cdd:PTZ00121 1722 KKAE E ENKIKAEEAKKEAEE D KKKA E EAKKDEEEKKKIAH LK KEEEKKA E EIRKEK E AVIEE E LDE E DEKRRMEV D KK 1799
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
174-807
1.16e-05
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 49.16
E-value: 1.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 174 QA D KA - KAAD L RAAYQAHQD E VHRI K - RE C E RDIRR L MD E IKGK E RVILA LE K EL GVQTGQTQR L L L QK E A L DEQ L VQVK 251
Cdd:COG1196 208 QA E KA e RYRE L KEELKELEA E LLLL K l RE L E AELEE L EA E LEEL E AELEE LE A EL AELEAELEE L R L EL E E L ELE L EEAQ 287
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 252 EA E RHHSSPKRE L ppgigdmaelmg G QD QHMD E RDV R RFQLKIA EL NSVIRK LE DRNTL L AD E RN EL LKRSR E T E VQ L K p 331
Cdd:COG1196 288 AE E YELLAELAR L ------------ E QD IARL E ERR R ELEERLE EL EEELAE LE EELEE L EE E LE EL EEELE E A E EE L E - 354
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 332 lv E KNKRMNKKN E E LL HSIQRME E KLKS L TRENV E MK E K L S A Q A S L KRHTSLNDLSLTRDEQEI E F L RLQVL E QQHVIDD 411
Cdd:COG1196 355 -- E AEAELAEAE E A LL EAEAELA E AEEE L EELAE E LL E A L R A A A E L AAQLEELEEAEEALLERL E R L EEELE E LEEALAE 432
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 412 L SL E R E RLL rskrhrgkslkppkkhvvetffgfdeesvdsetlsetsyntdrtdrtpatp E EDLDETTTRE E ADLRFCQ L 491
Cdd:COG1196 433 L EE E E E EEE --------------------------------------------------- E ALEEAAEEEA E LEEEEEA L 461
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 492 TREYQA L QRAY ALL QEQVGGT L DAER EA RT R EQ L QADLLR - CQAKI E DLEKL L VEK G QDAAWVEE kq VLMRTNQDLL E KI 570
Cdd:COG1196 462 LELLAE L LEEA ALL EAALAEL L EELA EA AA R LL L LLEAEA d YEGFL E GVKAA L LLA G LRGLAGAV -- AVLIGVEAAY E AA 539
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 571 YRLEMEENQLKSEMQ D AKDQNELL E FRVLELEE R ERRS P AFNLQITTFPENNSSALQLFCHQEG V KGVNISELMKNLDI l 650
Cdd:COG1196 540 LEAALAAALQNIVVE D DEVAAAAI E YLKAAKAG R ATFL P LDKIRARAALAAALARGAIGAAVDL V ASDLREADARYYVL - 618
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 651 GD NGNL R NEEQVAVIQ A GTVLALCEKW L KQIEGTEAALTQKMMDLEKEKDQFSRQKGYL E Q EL DYRK E A L DQAY L VGP E K 730
Cdd:COG1196 619 GD TLLG R TLVAARLEA A LRRAVTLAGR L REVTLEGEGGSAGGSLTGGSRRELLAALLEA E A EL EELA E R L AEEE L ELE E A 698
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 731 IQDL E ATLYNALQQ E p GR R AS E A L SASQR E DLQA A VEKVRRQL L RQSR E FDSQILR E RMEL --- L QQAQQKIREM E GKL E 807
Cdd:COG1196 699 LLAE E EEERELAEA E - EE R LE E E L EEEAL E EQLE A EREELLEE L LEEE E LLEEEAL E ELPE ppd L EELERELERL E REI E 777
EnvC
COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
22-255
4.89e-05
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain]
Cd Length: 377
Bit Score: 46.68
E-value: 4.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 22 A NE E LR A K L TNI Q I E FQQEKSKVGK L RERLQEAKLEREQEQ RR htay I SE L KAKLHE ek TKELQ A LR EA LIRQH E Q E A A R 101
Cdd:COG4942 21 A AA E AE A E L EQL Q Q E IAELEKELAA L KKEEKALLKQLAALE RR ---- I AA L ARRIRA -- LEQEL A AL EA ELAEL E K E I A E 94
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 102 TAKIK E GELQR L QAT L NV L - R D G AADKVKTA L LADAREE A R R tfdge R QRLQQEILE lk A A R K QAEE alsnc MQ AD K A KA 180
Cdd:COG4942 95 LRAEL E AQKEE L AEL L RA L y R L G RQPPLALL L SPEDFLD A V R ----- R LQYLKYLAP -- A R R E QAEE ----- LR AD L A EL 162
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1104469519 181 A D LRA AYQ A HQD E VHRIKR E C E RDIRR L MDEIKGKERVILA LEKEL GVQTGQTQR L LLQK E A L DEQLVQVKEAER 255
Cdd:COG4942 163 A A LRA ELE A ERA E LEALLA E L E EERAA L EALKAERQKLLAR LEKEL AELAAELAE L QQEA E E L EALIARLEAEAA 237
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
13-253
5.24e-05
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 46.97
E-value: 5.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 13 E A E TDSVQMAN EEL RAKLTNIQIEFQQEKSK V GK L R ER LQEAKL E REQEQRRHTAYISE L ----- KAKLH E EKTK EL Q A L 87
Cdd:TIGR02168 711 E E E LEQLRKEL EEL SRQISALRKDLARLEAE V EQ L E ER IAQLSK E LTELEAEIEELEER L eeaee ELAEA E AEIE EL E A Q 790
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 88 R E A L IRQHEQE a ARTAKIKEG EL QR L QATLNV LR DGAADKVKTA ll A DAREEARRT f DGERQR L QQE I LE L K A ARKQA EE 167
Cdd:TIGR02168 791 I E Q L KEELKAL - REALDELRA EL TL L NEEAAN LR ERLESLERRI -- A ATERRLEDL - EEQIEE L SED I ES L A A EIEEL EE 866
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 168 ALSN c MQADKAKAADL RA AYQAHQDEV hrikrec ERDIRR L MD E IKGK E RVILA L EK EL GVQTGQTQR L L L QK E A L DEQL 247
Cdd:TIGR02168 867 LIEE - LESELEALLNE RA SLEEALALL ------- RSELEE L SE E LREL E SKRSE L RR EL EELREKLAQ L E L RL E G L EVRI 938
....*.
gi 1104469519 248 VQVK E A 253
Cdd:TIGR02168 939 DNLQ E R 944
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
56-402
1.27e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 45.82
E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 56 LER EQ E QRRHTAY I S EL KA K LH E e KT K E L QA LR EA L ir QHEQ E AARTAKIKEG EL Q R L qatlnv LRDGAA D KVKTALLAD 135
Cdd:TIGR02168 673 LER RR E IEELEEK I E EL EE K IA E - LE K A L AE LR KE L -- EELE E ELEQLRKELE EL S R Q ------ ISALRK D LARLEAEVE 743
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 136 AR EE ARRTFDG E RQR L QQ EI L EL KAARKQ AEE A L sncm QADK A KAAD L R A AYQAHQD E V --- HRIKR E CERDIRR L MD E I 212
Cdd:TIGR02168 744 QL EE RIAQLSK E LTE L EA EI E EL EERLEE AEE E L ---- AEAE A EIEE L E A QIEQLKE E L kal REALD E LRAELTL L NE E A 819
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 213 KGKERVILA LE KELGVQTGQTQR L LL Q K E A L D E QLVQVKEA erhhsspkrelppgigdmaelmggqdqhmderd VRRFQL 292
Cdd:TIGR02168 820 ANLRERLES LE RRIAATERRLED L EE Q I E E L S E DIESLAAE --------------------------------- IEELEE 866
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 293 K I A EL N S VIRK L EDRNTL L ADERNE L LKRSR E TEVQ L KP L VE K NKRMNKKN EEL LHSIQRM E EK L KS L TRENVEMK E K LS 372
Cdd:TIGR02168 867 L I E EL E S ELEA L LNERAS L EEALAL L RSELE E LSEE L RE L ES K RSELRREL EEL REKLAQL E LR L EG L EVRIDNLQ E R LS 946
330 340 350
....*....|....*....|....*....|....*
gi 1104469519 373 AQA SL ----- KRHTSLNDLSLTRDEQEIEF L RLQV 402
Cdd:TIGR02168 947 EEY SL tleea EALENKIEDDEEEARRRLKR L ENKI 981
COG2433
COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
297-428
1.43e-04
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain]
Cd Length: 644
Bit Score: 45.62
E-value: 1.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 297 LNSVIRK L EDR nt L L AD E RN E LLKRSRET E VQ L KPLV E KNK R MNKKN E E L LHSIQRM E EK L KSLTRENVEMKEK LS AQA S 376
Cdd:COG2433 378 IEEALEE L IEK -- E L PE E EP E AEREKEHE E RE L TEEE E EIR R LEEQV E R L EAEVEEL E AE L EEKDERIERLERE LS EAR S 455
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1104469519 377 LK R HTSLN D LSLT R DEQ EIE F L RLQVL E QQHV I DD L SLER ERL -- L RSKR H R G K 428
Cdd:COG2433 456 EE R REIRK D REIS R LDR EIE R L ERELE E ERER I EE L KRKL ERL ke L WKLE H S G E 509
COG4913
COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
27-213
1.47e-04
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain]
Cd Length: 1089
Bit Score: 45.68
E-value: 1.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 27 RAKL TNIQI E FQQEKSKVGKLR ERL QEAKL E REQE Q R R HT A YISELKAKLH E EKTKELQALREA L IRQH E QEA A RTAKIK 106
Cdd:COG4913 609 RAKL AALEA E LAELEEELAEAE ERL EALEA E LDAL Q E R RE A LQRLAEYSWD E IDVASAEREIAE L EAEL E RLD A SSDDLA 688
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 107 -- E GE L QR L Q A T L NV L RDGAAD - K VKTAL L ADAR E E A RRTF D GERQ RL QQ - E I L ELKAA R KQA EE ALSN cm QADK A KAAD 182
Cdd:COG4913 689 al E EQ L EE L E A E L EE L EEELDE l K GEIGR L EKEL E Q A EEEL D ELQD RL EA a E D L ARLEL R ALL EE RFAA -- ALGD A VERE 766
170 180 190
....*....|....*....|....*....|.
gi 1104469519 183 LR AAYQAHQ D EVHRIKREC E RDIR R L M DEIK 213
Cdd:COG4913 767 LR ENLEERI D ALRARLNRA E EELE R A M RAFN 797
GumC
COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
23-246
2.89e-04
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain]
Cd Length: 687
Bit Score: 44.62
E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 23 N E ELR AKLTNIQI EF QQ E K sk VGK LR ER L Q EA KLER E QEQRRHTAYISELK AKL HEEKTK EL QALREALIRQHEQEA AR T 102
Cdd:COG3206 165 N L ELR REEARKAL EF LE E Q -- LPE LR KE L E EA EAAL E EFRQKNGLVDLSEE AKL LLQQLS EL ESQLAEARAELAEAE AR L 242
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 103 A KIKEGELQRLQ A TLNV L RDGAADKVKTA L LADAR E E A R rtfdg ERQ R LQQEILELK A A R K Q AEEALSNCM Q ADKAKA A D 182
Cdd:COG3206 243 A ALRAQLGSGPD A LPEL L QSPVIQQLRAQ L AELEA E L A E ----- LSA R YTPNHPDVI A L R A Q IAALRAQLQ Q EAQRIL A S 317
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1104469519 183 L R A AYQ A H Q DEVHRIKRE c ERDIRRLMD E IKGK E RVILA LE K E LG V QTGQTQR LL LQK E ALDEQ 246
Cdd:COG3206 318 L E A ELE A L Q AREASLQAQ - LAQLEARLA E LPEL E AELRR LE R E VE V ARELYES LL QRL E EARLA 380
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
25-362
5.41e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 43.89
E-value: 5.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 25 EL R A K L TNIQIEFQ -- QEKSKVGK L r E R LQE AKL E R E Q E QRRH TA YIS EL KA KL h EE KTK E LQA L R E alirqheqeaart 102
Cdd:TIGR02168 217 EL K A E L RELELALL vl RLEELREE L - E E LQE ELK E A E E E LEEL TA ELQ EL EE KL - EE LRL E VSE L E E ------------- 281
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 103 akikeg E LQR LQ AT L NV L RDGAADKVKTALLADA R EEARRT fdg ERQR L QQEIL EL KAARKQAE E A L s NCMQADKAKAAD 182
Cdd:TIGR02168 282 ------ E IEE LQ KE L YA L ANEISRLEQQKQILRE R LANLER --- QLEE L EAQLE EL ESKLDELA E E L - AELEEKLEELKE 351
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 183 LRAAYQ A HQD E VHRIKR E C E RDIRR L MDEIKGKERVILA LE KELGVQTGQTQ RL LLQK E A L DEQLVQVKEAER hhsspkr 262
Cdd:TIGR02168 352 ELESLE A ELE E LEAELE E L E SRLEE L EEQLETLRSKVAQ LE LQIASLNNEIE RL EARL E R L EDRRERLQQEIE ------- 424
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 263 elppgigdmaelmg GQDQHMD E RDVRRF Q LKIA EL NSVIRK L EDRN tlla DERN E L L KRS RE TEVQLKPLVEKNK R MNKK 342
Cdd:TIGR02168 425 -------------- ELLKKLE E AELKEL Q AELE EL EEELEE L QEEL ---- ERLE E A L EEL RE ELEEAEQALDAAE R ELAQ 486
330 340
....*....|....*....|
gi 1104469519 343 NEEL L H S IQ R ME E K L KSLTR 362
Cdd:TIGR02168 487 LQAR L D S LE R LQ E N L EGFSE 506
COG4913
COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
22-169
5.74e-04
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain]
Cd Length: 1089
Bit Score: 43.75
E-value: 5.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 22 AN E E LRA K L T -- NI Q IEFQQEKSKVGK LR E --- RL QEAKLER E QEQRRHTAYIS EL K A KLHEEKTKE L QA L r E AL I RQH E 96
Cdd:COG4913 273 EL E Y LRA A L R lw FA Q RRLELLEAELEE LR A ela RL EAELERL E ARLDALREELD EL E A QIRGNGGDR L EQ L - E RE I ERL E 351
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1104469519 97 Q E AAR taki K E GELQ RL Q A T L NV L rd G AADKVKTALL A DA R E EA RR --- TFDG E RQR L QQEIL E LK AA RKQAEEA L 169
Cdd:COG4913 352 R E LEE ---- R E RRRA RL E A L L AA L -- G LPLPASAEEF A AL R A EA AA lle ALEE E LEA L EEALA E AE AA LRDLRRE L 421
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
44-815
5.99e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 43.51
E-value: 5.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 44 VG K LR ER LQ E A -- KLER EQ E Q - R R HTAYIS EL KAK L hee K TK E L QA lrealirqheq E A A RTA K IKEG EL QR L QAT L N VL 120
Cdd:TIGR02168 167 IS K YK ER RK E T er KLER TR E N l D R LEDILN EL ERQ L --- K SL E R QA ----------- E K A ERY K ELKA EL RE L ELA L L VL 232
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 121 R dgaadkvktall ADARE E ARRTFDG E RQRLQQ E IL EL K A ARKQA EE A L SNC --- MQADKAKAAD L RAAYQ A HQD E VH R I 197
Cdd:TIGR02168 233 R ------------ LEELR E ELEELQE E LKEAEE E LE EL T A ELQEL EE K L EEL rle VSELEEEIEE L QKELY A LAN E IS R L 300
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 198 KRECERDIR RL MDEIKGK E RVILA LE KELGVQTGQTQR L LLQK E A L D E QLVQVKEA E RHHSSPKR EL P pgig DMAELMGG 277
Cdd:TIGR02168 301 EQQKQILRE RL ANLERQL E ELEAQ LE ELESKLDELAEE L AELE E K L E E LKEELESL E AELEELEA EL E ---- ELESRLEE 376
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 278 QDQHMDE -- RD V RRFQ L K IA E LN SV I RK LE D R NTL L A D E R NE L LKRSR E ----- T E VQ LK P L VEKNKRMNKKN EEL LHSI 350
Cdd:TIGR02168 377 LEEQLET lr SK V AQLE L Q IA S LN NE I ER LE A R LER L E D R R ER L QQEIE E llkkl E E AE LK E L QAELEELEEEL EEL QEEL 456
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 351 Q R M EE K L KS L TR E NV E MKEK L - S A QAS L KRHTSLN D l SL T R DEQEI E FLRLQ V LEQQHVIDD LS LERER L lrskrhr GKS 429
Cdd:TIGR02168 457 E R L EE A L EE L RE E LE E AEQA L d A A ERE L AQLQARL D - SL E R LQENL E GFSEG V KALLKNQSG LS GILGV L ------- SEL 528
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 430 LKPPKKH -- VV E TFF G FDEES V DS E T L SE -------- TSYNTD R ------ TDRTPATPEEDLD E TTTRE E AD L RFCQLTR 493
Cdd:TIGR02168 529 ISVDEGY ea AI E AAL G GRLQA V VV E N L NA akkaiafl KQNELG R vtflpl DSIKGTEIQGNDR E ILKNI E GF L GVAKDLV 608
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 494 EYQA lq RAYAL L QEQV GG T L DAEREARTR E Q lq A DL LR CQAK I ED L EKL LV EK G QDAAWVEE K QVLMRTNQ ----- D L L E 568
Cdd:TIGR02168 609 KFDP -- KLRKA L SYLL GG V L VVDDLDNAL E L -- A KK LR PGYR I VT L DGD LV RP G GVITGGSA K TNSSILER rreie E L E E 684
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 569 KI YR LE MEENQ L KSEMQDAKDQN E L LE FRVLE L EERERRS pafnlqittfp ENNS SAL QLFCHQEGVKGVNIS E LMKN L D 648
Cdd:TIGR02168 685 KI EE LE EKIAE L EKALAELRKEL E E LE EELEQ L RKELEEL ----------- SRQI SAL RKDLARLEAEVEQLE E RIAQ L S 753
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 649 I - L GDNGNLRN E EQVAVIQ A GTV LA LC E KWLKQI E GTEAA L TQKMMD L EKEK D QFSRQKGY L EQ E LDYRK E A L DQA ylvg 727
Cdd:TIGR02168 754 K e L TELEAEIE E LEERLEE A EEE LA EA E AEIEEL E AQIEQ L KEELKA L REAL D ELRAELTL L NE E AANLR E R L ESL ---- 829
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 728 PEK I QDL E AT L YNALQ Q EPGRRASEALS A SQR E D L QAAV E KVRRQ L LRQSR E FD S QI -- L RERMEL L QQAQQKI RE M E G K 805
Cdd:TIGR02168 830 ERR I AAT E RR L EDLEE Q IEELSEDIESL A AEI E E L EELI E ELESE L EALLN E RA S LE ea L ALLRSE L EELSEEL RE L E S K 909
810
....*....|
gi 1104469519 806 LELQ RR Q L K E 815
Cdd:TIGR02168 910 RSEL RR E L E E 919
SMC_N
pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
42-809
6.59e-04
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain]
Cd Length: 1161
Bit Score: 43.42
E-value: 6.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 42 S KVGKLRERLQEAKL E REQEQRRHTAYIS ELK AKLHEE K TKELQ AL REALIRQHEQEAARTAKIKE g E L QRLQATLNV L R 121
Cdd:pfam02463 165 S RLKRKKKEALKKLI E ETENLAELIIDLE ELK LQELKL K EQAKK AL EYYQLKEKLELEEEYLLYLD - Y L KLNEERIDL L Q 243
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 122 DGAA D KVKTALLADARE E ARR tfdg E RQRLQQEILELKAAR K QAE E ALSNCMQADKAKAADLRAAYQAHQDEVHRIKR E C 201
Cdd:pfam02463 244 ELLR D EQEEIESSKQEI E KEE ---- E KLAQVLKENKEEEKE K KLQ E EELKLLAKEEEELKSELLKLERRKVDDEEKLK E S 319
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 202 E RDIRRLMD E I K GKERV I LA LEKEL GVQTGQTQRLLLQK E A L DEQLVQVKEA E RHHSSP K RELPPGIGDM A E L MGGQDQH 281
Cdd:pfam02463 320 E KEKKKAEK E L K KEKEE I EE LEKEL KELEIKREAEEEEE E E L EKLQEKLEQL E EELLAK K KLESERLSSA A K L KEEELEL 399
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 282 MD E rdvrrfqlkia E LNSVIRK LE DRNT L LADERN E LLKRSRET E VQLKPLVE K NKRMNKKN EEL LHSIQRMEEKLKS L T 361
Cdd:pfam02463 400 KS E ----------- E EKEAQLL LE LARQ L EDLLKE E KKEELEIL E EEEESIEL K QGKLTEEK EEL EKQELKLLKDELE L K 468
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 362 RENVEM KE KLSA qasl K RHTS L ND L SLTRDEQ E IEFLRLQVLEQQH V IDD L SLERERLLRSKR H RGKSLKPP kkh V VE TF 441
Cdd:pfam02463 469 KSEDLL KE TQLV ---- K LQEQ L EL L LSRQKLE E RSQKESKARSGLK V LLA L IKDGVGGRIISA H GRLGDLGV --- A VE NY 541
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 442 FGFDEES V DS E TLSETSYNTD R TDRTP A TP E ED L DETTT R EEADLRFCQ L t REYQA L QRAYA L LQE Q VGGTLDAER E ART 521
Cdd:pfam02463 542 KVAISTA V IV E VSATADEVEE R QKLVR A LT E LP L GARKL R LLIPKLKLP L - KSIAV L EIDPI L NLA Q LDKATLEAD E DDK 620
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 522 R EQLQADL L RCQAKIEDL E KLLVEKGQDAAW V EEKQV L M ---- RTNQDLLEKIYR LE ME E N Q L K S E MQD AK dqne LLEF R 597
Cdd:pfam02463 621 R AKVVEGI L KDTELTKLK E SAKAKESGLRKG V SLEEG L A ekse VKASLSELTKEL LE IQ E L Q E K A E SEL AK ---- EEIL R 696
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 598 VLELEERERRSPAFN L QITTFPENNSS A LQLFCH Q EGVKGVNISELM K NLDILGDNGNL R NEEQVAVIQAGTVLALCEKW 677
Cdd:pfam02463 697 RQLEIKKKEQREKEE L KKLKLEAEELL A DRVQEA Q DKINEELKLLKQ K IDEEEEEEEKS R LKKEEKEEEKSELSLKEKEL 776
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 678 LKQI E G TE AALTQKMMDLEKEKDQFSRQKGYL E QELDYRKEALD Q AYLVGP EKI QDL E ATLYNALQQ E PGRRASE A LSAS 757
Cdd:pfam02463 777 AEER E K TE KLKVEEEKEEKLKAQEEELRALEE E LKEEAELLEEE Q LLIEQE EKI KEE E LEELALELK E EQKLEKL A EEEL 856
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|..
gi 1104469519 758 Q R EDLQAAV E KVRRQ LL RQSR E FDS Q I L RERM E LLQQAQQKIREMEGKLELQ 809
Cdd:pfam02463 857 E R LEEEITK E ELLQE LL LKEE E LEE Q K L KDEL E SKEEKEKEEKKELEEESQK 908
COG2433
COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
178-369
3.26e-03
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain]
Cd Length: 644
Bit Score: 41.00
E-value: 3.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 178 A K AA D L R A a Y Q A HQDEVH R IKRECER D IR R lm DE I K gkervilaleke LG V QT G QTQRLL L QKEALD E QLVQVK EAER HH 257
Cdd:COG2433 337 A L AA A L K A - Y D A YKNKFE R VEKKVPP D VD R -- DE V K ------------ AR V IR G LSIEEA L EELIEK E LPEEEP EAER EK 401
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 258 SSPK REL PPGIGDMAE L MGGQDQHMD E rd V RRFQLKIA E LNSV I RK LE DRNTLLAD E RNELLKRS RE T evqlkplvekn K 337
Cdd:COG2433 402 EHEE REL TEEEEEIRR L EEQVERLEA E -- V EELEAELE E KDER I ER LE RELSEARS E ERREIRKD RE I ----------- S 468
170 180 190
....*....|....*....|....*....|..
gi 1104469519 338 R MNKKN E E L LHSIQRME E KLKS L T R ENVEM KE 369
Cdd:COG2433 469 R LDREI E R L ERELEEER E RIEE L K R KLERL KE 500
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
23-374
3.98e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 40.77
E-value: 3.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 23 N EE L RA KL TN I QI E F ------ QQ EKS KV --- G K LRER L QE A KLEREQEQRRHTAYIS E LKA K L h EE KT K ELQALR E ALI R 93
Cdd:NF033838 86 N VA L NK KL SD I KT E Y lyelnv LK EKS EA elt S K TKKE L DA A FEQFKKDTLEPGKKVA E ATK K V - EE AE K KAKDQK E EDR R 164
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 94 QHEQEAAR T --------- AKI K EG EL QRLQATLNVL RD ga AD K V K T A lla D A RE E ARRT fdg E RQ RL Q qeil EL K AA R KQ 164
Cdd:NF033838 165 NYPTNTYK T leleiaesd VEV K KA EL ELVKEEAKEP RD -- EE K I K Q A --- K A KV E SKKA --- E AT RL E ---- KI K TD R EK 232
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 165 AEE ALSNCMQ A DKAK A ADLRA A YQ ahqd E VHRI KR ECE R DIRRLMDEIKG KE RVILALEKEL G VQ T GQTQR L LLQ K eald 244
Cdd:NF033838 233 AEE EAKRRAD A KLKE A VEKNV A TS ---- E QDKP KR RAK R GVLGEPATPDK KE NDAKSSDSSV G EE T LPSPS L KPE K ---- 304
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 245 eqlv Q V K EAE RHHSSP K REL ppgigdmaelmgg Q DQ hm D E R D V R RF -------- Q L K IAE LNSVIRKL E DR nt L LAD E RN 316
Cdd:NF033838 305 ---- K V A EAE KKVEEA K KKA ------------- K DQ -- K E E D R R NY ptntyktl E L E IAE SDVKVKEA E LE -- L VKE E AK 363
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 1104469519 317 E ll K R SR E TEV Q L K PL VE KN K RMNKKN E ELLHSIQRM EE KL K SLTR E NVEM KEK LSA Q 374
Cdd:NF033838 364 E -- P R NE E KIK Q A K AK VE SK K AEATRL E KIKTDRKKA EE EA K RKAA E EDKV KEK PAE Q 419
EnvC
COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
131-376
9.41e-03
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain]
Cd Length: 377
Bit Score: 39.36
E-value: 9.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 131 AL L A D A REE A RRTFDG E RQR LQQEI L EL KAARKQA eealsncm QADKAKAADLR AA YQAHQDEVH R IK R EC E RDIRR L MD 210
Cdd:COG4942 12 AL A A A A QAD A AAEAEA E LEQ LQQEI A EL EKELAAL -------- KKEEKALLKQL AA LERRIAALA R RI R AL E QELAA L EA 83
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 211 E IKGK E RV I LA L EK EL GV Q TGQTQR LL -- L QKEALDEQ L VQVKEA E R hhsspkrel PPGIGDMAELMGGQDQHMD E R d VR 288
Cdd:COG4942 84 E LAEL E KE I AE L RA EL EA Q KEELAE LL ra L YRLGRQPP L ALLLSP E D --------- FLDAVRRLQYLKYLAPARR E Q - AE 153
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 289 RFQLKI AEL NSVIRK LE DRNTL L ADERN EL LKRSRET E VQLKPLVEKNK R MN K KNE EL LHSIQRMEEKLKS L TRENVEMK 368
Cdd:COG4942 154 ELRADL AEL AALRAE LE AERAE L EALLA EL EEERAAL E ALKAERQKLLA R LE K ELA EL AAELAELQQEAEE L EALIARLE 233
....*...
gi 1104469519 369 EKLS A Q A S 376
Cdd:COG4942 234 AEAA A A A E 241
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
24-253
9.46e-03
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 39.65
E-value: 9.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 24 EEL R A KLTNIQI E FQQEKSKVGK LR ER L ------- QEAKLER E QEQ RR HT A YISE L KAK lh EE KTK EL QALR E A L IRQH E 96
Cdd:TIGR02168 785 EEL E A QIEQLKE E LKALREALDE LR AE L tllneea ANLRERL E SLE RR IA A TERR L EDL -- EE QIE EL SEDI E S L AAEI E 862
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 97 QEAARTAKIKEG ------ E LQR L QAT L NV LR DGAADKVKT all ADAR E EA R RTFDG E RQR L QQEILE L KAARKQA E EALS 170
Cdd:TIGR02168 863 ELEELIEELESE lealln E RAS L EEA L AL LR SELEELSEE --- LREL E SK R SELRR E LEE L REKLAQ L ELRLEGL E VRID 939
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1104469519 171 N CMQADKAKAADLRAAYQ A HQDEVHRIKR E CE R DIR RL MDE IK GKER V I LA LEK E LGVQTGQTQR L LL QKE A L D E QLVQV 250
Cdd:TIGR02168 940 N LQERLSEEYSLTLEEAE A LENKIEDDEE E AR R RLK RL ENK IK ELGP V N LA AIE E YEELKERYDF L TA QKE D L T E AKETL 1019
...
gi 1104469519 251 K EA 253
Cdd:TIGR02168 1020 E EA 1022
Blast search parameters
Data Source:
Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01