|
Name |
Accession |
Description |
Interval |
E-value |
| NPIP super family |
cl05750 |
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate ... |
41-303 |
6.08e-85 |
|
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate specific nuclear pore complex interacting protein (NPIP) sequences. The function of this family is unknown but is well conserved from African apes to humans. The actual alignment was detected with superfamily member pfam06409:
Pssm-ID: 461900 [Multi-domain] Cd Length: 267 Bit Score: 276.62 E-value: 6.08e-85
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 41 VINTLADHHHRGTDFGGSPWLHVIIAFPTSYKVVITLWIVYLWVSLLKTIFWSRNGHDGSTDVQQRAWRSNRRRQEGLrs 120
Cdd:pfam06409 22 LIITLADHRHKFADFGCSPWLCIIFLFLIFPKFAGHDCSSDLCQRALKSIFPRQEGHDGSLDDIFRARRQNERKQEAI-- 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 121 icmhtkkrvssfrgnKIGLKDVITLRRHVETKVRAKIRKRKVTTKINHHDKINGKRKTAR---------------KQKMF 185
Cdd:pfam06409 100 ---------------ICKLEDIFKLNRHDEIKGKAKIAKEHLRKKSMKEDEHGEKEKQAKeaeekgkldekehgeKEEMF 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 186 QRAQELRRRAEDYHKCKIPPSARKALCNWVRMAAAEHRHSSGLPYWPYLTAETLKNRMGHQPPPPTQQHSITDNSLSLKT 265
Cdd:pfam06409 165 QEAEALGKLAEDEIHCKIEMFARAPACNRRAEAAAECKHSPGAPKPLCLRAEMAAAEHGHQPGLPTQPHLIADNLKNLKG 244
|
250 260 270
....*....|....*....|....*....|....*...
gi 209529739 266 PPECLLTPLPPSADdnlktppecvltplpPSADDNLKT 303
Cdd:pfam06409 245 HPECLLTPLHPIAD---------------NSADDKLKP 267
|
|
| AFD_class_I super family |
cl17068 |
Adenylate forming domain, Class I superfamily; This family includes acyl- and aryl-CoA ligases, ... |
2-41 |
2.23e-11 |
|
Adenylate forming domain, Class I superfamily; This family includes acyl- and aryl-CoA ligases, as well as the adenylation domain of nonribosomal peptide synthetases and firefly luciferases. The adenylate-forming enzymes catalyze an ATP-dependent two-step reaction to first activate a carboxylate substrate as an adenylate and then transfer the carboxylate to the pantetheine group of either coenzyme A or an acyl-carrier protein. The active site of the domain is located at the interface of a large N-terminal subdomain and a smaller C-terminal subdomain. The actual alignment was detected with superfamily member cd05928:
Pssm-ID: 473059 [Multi-domain] Cd Length: 530 Bit Score: 67.88 E-value: 2.23e-11
10 20 30 40
....*....|....*....|....*....|....*....|
gi 209529739 2 VKLSIVLTPQFLSHDQGQLTKELQQHVKSVTCPCEYLRKV 41
Cdd:cd05928 467 VKAFVVLAPQFLSHDPEQLTKELQQHVKSVTAPYKYPRKV 506
|
|
| PHA03307 super family |
cl33723 |
transcriptional regulator ICP4; Provisional |
871-1131 |
2.80e-05 |
|
transcriptional regulator ICP4; Provisional The actual alignment was detected with superfamily member PHA03307:
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 2.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 871 RGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAP---PSADDNI 947
Cdd:PHA03307 142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSpisASASSPA 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 948 KTPAERLRGPLPPSADDNLKTPSERqltplppsappsaddniKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPK 1027
Cdd:PHA03307 222 PAPGRSAADDAGASSSDSSSSESSG-----------------CGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGP 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1028 RQRAA---EMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGD--------VEPSRKPKRRRAADVEPSSP 1096
Cdd:PHA03307 285 ASSSSsprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrgaavspgPSPSRSPSPSRPPPPADPSS 364
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 209529739 1097 EPKRRRVGDVEPSR-----KPKRRRAAdvePSLPEPKRRR 1131
Cdd:PHA03307 365 PRKRPRPSRAPSSPaasagRPTRRRAR---AAVAGRARRR 401
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NPIP |
pfam06409 |
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate ... |
41-303 |
6.08e-85 |
|
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate specific nuclear pore complex interacting protein (NPIP) sequences. The function of this family is unknown but is well conserved from African apes to humans.
Pssm-ID: 461900 [Multi-domain] Cd Length: 267 Bit Score: 276.62 E-value: 6.08e-85
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 41 VINTLADHHHRGTDFGGSPWLHVIIAFPTSYKVVITLWIVYLWVSLLKTIFWSRNGHDGSTDVQQRAWRSNRRRQEGLrs 120
Cdd:pfam06409 22 LIITLADHRHKFADFGCSPWLCIIFLFLIFPKFAGHDCSSDLCQRALKSIFPRQEGHDGSLDDIFRARRQNERKQEAI-- 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 121 icmhtkkrvssfrgnKIGLKDVITLRRHVETKVRAKIRKRKVTTKINHHDKINGKRKTAR---------------KQKMF 185
Cdd:pfam06409 100 ---------------ICKLEDIFKLNRHDEIKGKAKIAKEHLRKKSMKEDEHGEKEKQAKeaeekgkldekehgeKEEMF 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 186 QRAQELRRRAEDYHKCKIPPSARKALCNWVRMAAAEHRHSSGLPYWPYLTAETLKNRMGHQPPPPTQQHSITDNSLSLKT 265
Cdd:pfam06409 165 QEAEALGKLAEDEIHCKIEMFARAPACNRRAEAAAECKHSPGAPKPLCLRAEMAAAEHGHQPGLPTQPHLIADNLKNLKG 244
|
250 260 270
....*....|....*....|....*....|....*...
gi 209529739 266 PPECLLTPLPPSADdnlktppecvltplpPSADDNLKT 303
Cdd:pfam06409 245 HPECLLTPLHPIAD---------------NSADDKLKP 267
|
|
| MACS_euk |
cd05928 |
Eukaryotic Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step ... |
2-41 |
2.23e-11 |
|
Eukaryotic Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step activation of medium chain fatty acids (containing 4-12 carbons). The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. The acyl-CoA is a key intermediate in many important biosynthetic and catabolic processes. MACS enzymes are localized to mitochondria. Two murine MACS family proteins are found in liver and kidney. In rodents, a MACS member is detected particularly in the olfactory epithelium and is called O-MACS. O-MACS demonstrates substrate preference for the fatty acid lengths of C6-C12.
Pssm-ID: 341251 [Multi-domain] Cd Length: 530 Bit Score: 67.88 E-value: 2.23e-11
10 20 30 40
....*....|....*....|....*....|....*....|
gi 209529739 2 VKLSIVLTPQFLSHDQGQLTKELQQHVKSVTCPCEYLRKV 41
Cdd:cd05928 467 VKAFVVLAPQFLSHDPEQLTKELQQHVKSVTAPYKYPRKV 506
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
871-1131 |
2.80e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 2.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 871 RGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAP---PSADDNI 947
Cdd:PHA03307 142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSpisASASSPA 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 948 KTPAERLRGPLPPSADDNLKTPSERqltplppsappsaddniKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPK 1027
Cdd:PHA03307 222 PAPGRSAADDAGASSSDSSSSESSG-----------------CGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGP 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1028 RQRAA---EMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGD--------VEPSRKPKRRRAADVEPSSP 1096
Cdd:PHA03307 285 ASSSSsprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrgaavspgPSPSRSPSPSRPPPPADPSS 364
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 209529739 1097 EPKRRRVGDVEPSR-----KPKRRRAAdvePSLPEPKRRR 1131
Cdd:PHA03307 365 PRKRPRPSRAPSSPaasagRPTRRRAR---AAVAGRARRR 401
|
|
| SF-CC1 |
TIGR01622 |
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ... |
1022-1133 |
3.09e-04 |
|
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain] Cd Length: 494 Bit Score: 44.52 E-value: 3.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1022 KPRKPKRQRAAEMEPPPEPKRRRVGDVEPSRKPKRRRAadvepSSPEPKRRRVGDvePSRKPKRRRAADVEPSSPEPK-- 1099
Cdd:TIGR01622 3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS-----RSRRRDRHRDRD--YYRGRERRSRSRRPNRRYRPRek 75
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 209529739 1100 --------RRRVGDVEPSRKPKRRRAADVEPsLPEPKRRRLS 1133
Cdd:TIGR01622 76 rrrrgdsyRRRRDDRRSRREKPRARDGTPEP-LTEDERDRRT 116
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NPIP |
pfam06409 |
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate ... |
41-303 |
6.08e-85 |
|
Nuclear pore complex interacting protein (NPIP); This family consists of a series of primate specific nuclear pore complex interacting protein (NPIP) sequences. The function of this family is unknown but is well conserved from African apes to humans.
Pssm-ID: 461900 [Multi-domain] Cd Length: 267 Bit Score: 276.62 E-value: 6.08e-85
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 41 VINTLADHHHRGTDFGGSPWLHVIIAFPTSYKVVITLWIVYLWVSLLKTIFWSRNGHDGSTDVQQRAWRSNRRRQEGLrs 120
Cdd:pfam06409 22 LIITLADHRHKFADFGCSPWLCIIFLFLIFPKFAGHDCSSDLCQRALKSIFPRQEGHDGSLDDIFRARRQNERKQEAI-- 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 121 icmhtkkrvssfrgnKIGLKDVITLRRHVETKVRAKIRKRKVTTKINHHDKINGKRKTAR---------------KQKMF 185
Cdd:pfam06409 100 ---------------ICKLEDIFKLNRHDEIKGKAKIAKEHLRKKSMKEDEHGEKEKQAKeaeekgkldekehgeKEEMF 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 186 QRAQELRRRAEDYHKCKIPPSARKALCNWVRMAAAEHRHSSGLPYWPYLTAETLKNRMGHQPPPPTQQHSITDNSLSLKT 265
Cdd:pfam06409 165 QEAEALGKLAEDEIHCKIEMFARAPACNRRAEAAAECKHSPGAPKPLCLRAEMAAAEHGHQPGLPTQPHLIADNLKNLKG 244
|
250 260 270
....*....|....*....|....*....|....*...
gi 209529739 266 PPECLLTPLPPSADdnlktppecvltplpPSADDNLKT 303
Cdd:pfam06409 245 HPECLLTPLHPIAD---------------NSADDKLKP 267
|
|
| MACS_euk |
cd05928 |
Eukaryotic Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step ... |
2-41 |
2.23e-11 |
|
Eukaryotic Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step activation of medium chain fatty acids (containing 4-12 carbons). The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. The acyl-CoA is a key intermediate in many important biosynthetic and catabolic processes. MACS enzymes are localized to mitochondria. Two murine MACS family proteins are found in liver and kidney. In rodents, a MACS member is detected particularly in the olfactory epithelium and is called O-MACS. O-MACS demonstrates substrate preference for the fatty acid lengths of C6-C12.
Pssm-ID: 341251 [Multi-domain] Cd Length: 530 Bit Score: 67.88 E-value: 2.23e-11
10 20 30 40
....*....|....*....|....*....|....*....|
gi 209529739 2 VKLSIVLTPQFLSHDQGQLTKELQQHVKSVTCPCEYLRKV 41
Cdd:cd05928 467 VKAFVVLAPQFLSHDPEQLTKELQQHVKSVTAPYKYPRKV 506
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
871-1131 |
2.80e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 2.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 871 RGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAP---PSADDNI 947
Cdd:PHA03307 142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSpisASASSPA 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 948 KTPAERLRGPLPPSADDNLKTPSERqltplppsappsaddniKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPK 1027
Cdd:PHA03307 222 PAPGRSAADDAGASSSDSSSSESSG-----------------CGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGP 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1028 RQRAA---EMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGD--------VEPSRKPKRRRAADVEPSSP 1096
Cdd:PHA03307 285 ASSSSsprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrgaavspgPSPSRSPSPSRPPPPADPSS 364
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 209529739 1097 EPKRRRVGDVEPSR-----KPKRRRAAdvePSLPEPKRRR 1131
Cdd:PHA03307 365 PRKRPRPSRAPSSPaasagRPTRRRAR---AAVAGRARRR 401
|
|
| SF-CC1 |
TIGR01622 |
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ... |
1022-1133 |
3.09e-04 |
|
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain] Cd Length: 494 Bit Score: 44.52 E-value: 3.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1022 KPRKPKRQRAAEMEPPPEPKRRRVGDVEPSRKPKRRRAadvepSSPEPKRRRVGDvePSRKPKRRRAADVEPSSPEPK-- 1099
Cdd:TIGR01622 3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS-----RSRRRDRHRDRD--YYRGRERRSRSRRPNRRYRPRek 75
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 209529739 1100 --------RRRVGDVEPSRKPKRRRAADVEPsLPEPKRRRLS 1133
Cdd:TIGR01622 76 rrrrgdsyRRRRDDRRSRREKPRARDGTPEP-LTEDERDRRT 116
|
|
| PRK13709 |
PRK13709 |
conjugal transfer nickase/helicase TraI; Provisional |
999-1116 |
1.25e-03 |
|
conjugal transfer nickase/helicase TraI; Provisional
Pssm-ID: 237478 [Multi-domain] Cd Length: 1747 Bit Score: 43.24 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 999 PLPPSADDNLKTPPLATQEAEAEKPRKPKRQRAAEM------EPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRR 1072
Cdd:PRK13709 1629 VQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIvrkmaeNKPDLPDGKTEQAVRDIAGQERDRAAISEREAALPESV 1708
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 209529739 1073 rvgdvepSRKPKRRRAADVEPSSPEPKRRRVGDVE--PSRKPKRRR 1116
Cdd:PRK13709 1709 -------LREPQREREAVREVARENLLRERLQQMErdMVRDLQKEK 1747
|
|
| SF-CC1 |
TIGR01622 |
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ... |
1016-1116 |
2.40e-03 |
|
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain] Cd Length: 494 Bit Score: 41.83 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1016 QEAEAEKPRKPKRQRAAEMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPK---RRRVGDVEPSRKPKRRRAadve 1092
Cdd:TIGR01622 19 RDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPRekrRRRGDSYRRRRDDRRSRR---- 94
|
90 100
....*....|....*....|....
gi 209529739 1093 psspEPKRRRVGDVEPSRKPKRRR 1116
Cdd:TIGR01622 95 ----EKPRARDGTPEPLTEDERDR 114
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
952-1116 |
2.41e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 41.81 E-value: 2.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 952 ERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPKRQRA 1031
Cdd:PRK12678 57 EARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1032 AEMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGDVEPSRK 1111
Cdd:PRK12678 137 ARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRRE 216
|
....*
gi 209529739 1112 PKRRR 1116
Cdd:PRK12678 217 ERGRR 221
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
867-1133 |
4.62e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.31 E-value: 4.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 867 GERLRGPLPPSADdnlkTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDN 946
Cdd:PHA03307 100 PAREGSPTPPGPS----SPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALP 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 947 IKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAP---PSADDNIKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKP 1023
Cdd:PHA03307 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSpisASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1024 RKPkrqRAAEMEPPPEPKRRRVGDVEPSRKPKRR-RAADVEPS-SPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRR 1101
Cdd:PHA03307 256 PLP---RPAPITLPTRIWEASGWNGPSSRPGPASsSSSPRERSpSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 209529739 1102 RVGD--------VEPSRKPKRRRAADVEPSLPEPKRRRLS 1133
Cdd:PHA03307 333 SESSrgaavspgPSPSRSPSPSRPPPPADPSSPRKRPRPS 372
|
|
| MACS_like |
cd05972 |
Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step activation of ... |
1-41 |
7.09e-03 |
|
Medium-chain acyl-CoA synthetase (MACS or ACSM); MACS catalyzes the two-step activation of medium chain fatty acids (containing 4-12 carbons). The carboxylate substrate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. The acyl-CoA is a key intermediate in many important biosynthetic and catabolic processes.
Pssm-ID: 341276 [Multi-domain] Cd Length: 428 Bit Score: 40.40 E-value: 7.09e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 209529739 1 MVKLSIVLTPQFLSHDQgqLTKELQQHVKSVTCPCEYLRKV 41
Cdd:cd05972 369 VVKAFVVLTSGYEPSEE--LAEELQGHVKKVLAPYKYPREI 407
|
|
| SF-CC1 |
TIGR01622 |
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ... |
1016-1102 |
7.78e-03 |
|
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain] Cd Length: 494 Bit Score: 40.29 E-value: 7.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 1016 QEAEAEKPRKPKRQRAAEMEPPPEPKRRRVGDvEPSRKPKRRRAADVEPSSPEPKRrrvGDVEPSRKPKRRRAADVEPSS 1095
Cdd:TIGR01622 33 RSRDRERSRSRRRDRHRDRDYYRGRERRSRSR-RPNRRYRPREKRRRRGDSYRRRR---DDRRSRREKPRARDGTPEPLT 108
|
....*..
gi 209529739 1096 PEPKRRR 1102
Cdd:TIGR01622 109 EDERDRR 115
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
990-1126 |
9.31e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 40.06 E-value: 9.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 209529739 990 KTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPKRQRAAEMEP--------PPEPKRRRvgDVEPSRKPKRRRAAD 1061
Cdd:PTZ00449 527 KEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPtlskkpefPKDPKHPK--DPEEPKKPKRPRSAQ 604
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 209529739 1062 VEPSSPEPKRRRVGDVEPSRKPKRRRAADVEPSSPE----PKRRRVGDVEPSRKPKRRRAADVEPSLPE 1126
Cdd:PTZ00449 605 RPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQrpssPERPEGPKIIKSPKPPKSPKPPFDPKFKE 673
|
|
|