NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907155782|ref|XP_036019964|]
View 

eukaryotic translation initiation factor 4 gamma 3 isoform X12 [Mus musculus]

Protein Classification

MA3 and W2_eIF4G1_like domain-containing protein( domain architecture ID 10501431)

protein containing domains MIF4G, MA3, W2_eIF4G1_like, and W2

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
906-1134 7.42e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.42e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  906 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 985
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  986 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1065
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155782 1066 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1134
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1555-1687 6.24e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.54  E-value: 6.24e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1555 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1634
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907155782 1635 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1687
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1353-1465 2.85e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.47  E-value: 2.85e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1353 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1432
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907155782 1433 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1465
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
57-465 6.28e-12

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 6.28e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  137 YPSGQNAGPATLVYPQAPQTmnsqPQARSpffqrpqiQPPRAAIPNSSPSIRPGVQTPTAVYQANQHIMMVNHLPMPYPV 216
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPAR----PPTTA--------GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  217 T---QGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQQQPPPAKR 293
Cdd:PHA03247  2808 PaavLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAAPARPPVRR 2885
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  294 EKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPvvygtvesahlAASTPVTAASDQK 373
Cdd:PHA03247  2886 LARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP-----------PPRPQPPLAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  374 QEEKPKPDPVFQSPStvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQPLFTAED 453
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW--LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETDPPPVSLK 3025
                          410
                   ....*....|..
gi 1907155782  454 KCELPSSKEEDA 465
Cdd:PHA03247  3026 QTLWPPDDTEDS 3037
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1666-1712 9.63e-05

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 9.63e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155782 1666 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
906-1134 7.42e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.42e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  906 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 985
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  986 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1065
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155782 1066 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1134
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
907-1131 3.92e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 179.48  E-value: 3.92e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   907 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 986
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   987 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 1066
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907155782  1067 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1131
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1555-1687 6.24e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.54  E-value: 6.24e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1555 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1634
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907155782 1635 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1687
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1353-1465 2.85e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.47  E-value: 2.85e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1353 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1432
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907155782 1433 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1465
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1353-1465 1.20e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 123.12  E-value: 1.20e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  1353 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1432
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1907155782  1433 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1465
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1625-1709 1.21e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 110.46  E-value: 1.21e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  1625 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1704
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1907155782  1705 WLREA 1709
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1638-1714 3.71e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 97.60  E-value: 3.71e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907155782 1638 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-465 6.28e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 6.28e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  137 YPSGQNAGPATLVYPQAPQTmnsqPQARSpffqrpqiQPPRAAIPNSSPSIRPGVQTPTAVYQANQHIMMVNHLPMPYPV 216
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPAR----PPTTA--------GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  217 T---QGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQQQPPPAKR 293
Cdd:PHA03247  2808 PaavLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAAPARPPVRR 2885
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  294 EKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPvvygtvesahlAASTPVTAASDQK 373
Cdd:PHA03247  2886 LARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP-----------PPRPQPPLAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  374 QEEKPKPDPVFQSPStvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQPLFTAED 453
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW--LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETDPPPVSLK 3025
                          410
                   ....*....|..
gi 1907155782  454 KCELPSSKEEDA 465
Cdd:PHA03247  3026 QTLWPPDDTEDS 3037
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
99-507 5.53e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 61.32  E-value: 5.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   99 PGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAGPATLVYPQAPQTMNSQPQARSPFFQRPQIQPPRA 178
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  179 AIPNSSPSirPGVQTPTAvyqanqhimmvnhlPMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPG 258
Cdd:pfam03154  261 VSPQPLPQ--PSLHGQMP--------------PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQ 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  259 DFANAYgTPfyPSQPVYQSAPiivPTQQQP-PPAKRekktirirdpnqggkditeeimsgggsrnPTPPIGRPASTPTPP 337
Cdd:pfam03154  321 SQQRIH-TP--PSQSQLQSQQ---PPREQPlPPAPL-----------------------------SMPHIKPPPTTPIPQ 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  338 QQLPsQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP- 416
Cdd:pfam03154  366 LPNP-QSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPa 429
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  417 EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdnsdicKKPCSVA 496
Cdd:pfam03154  430 QPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI-------QPPSSAS 499
                          410
                   ....*....|.
gi 1907155782  497 PHDSQLISSTI 507
Cdd:pfam03154  500 VSSSGPVPAAV 510
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1666-1712 9.63e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 9.63e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155782 1666 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
906-1134 7.42e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.42e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  906 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 985
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  986 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1065
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155782 1066 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1134
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
907-1131 3.92e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 179.48  E-value: 3.92e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   907 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 986
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   987 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 1066
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907155782  1067 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1131
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1555-1687 6.24e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.54  E-value: 6.24e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1555 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1634
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907155782 1635 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1687
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1353-1465 2.85e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.47  E-value: 2.85e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1353 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1432
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907155782 1433 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1465
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1353-1465 1.20e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 123.12  E-value: 1.20e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  1353 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1432
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1907155782  1433 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1465
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1625-1709 1.21e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 110.46  E-value: 1.21e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  1625 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1704
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1907155782  1705 WLREA 1709
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1638-1714 3.71e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 97.60  E-value: 3.71e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907155782 1638 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1555-1681 5.39e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 84.84  E-value: 5.39e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1555 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1631
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907155782 1632 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1681
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1595-1714 2.63e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.37  E-value: 2.63e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1595 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1670
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1907155782 1671 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-465 6.28e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 6.28e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  137 YPSGQNAGPATLVYPQAPQTmnsqPQARSpffqrpqiQPPRAAIPNSSPSIRPGVQTPTAVYQANQHIMMVNHLPMPYPV 216
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPAR----PPTTA--------GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  217 T---QGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQQQPPPAKR 293
Cdd:PHA03247  2808 PaavLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAAPARPPVRR 2885
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  294 EKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPvvygtvesahlAASTPVTAASDQK 373
Cdd:PHA03247  2886 LARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP-----------PPRPQPPLAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  374 QEEKPKPDPVFQSPStvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQPLFTAED 453
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW--LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETDPPPVSLK 3025
                          410
                   ....*....|..
gi 1907155782  454 KCELPSSKEEDA 465
Cdd:PHA03247  3026 QTLWPPDDTEDS 3037
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
99-507 5.53e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 61.32  E-value: 5.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   99 PGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAGPATLVYPQAPQTMNSQPQARSPFFQRPQIQPPRA 178
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  179 AIPNSSPSirPGVQTPTAvyqanqhimmvnhlPMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPG 258
Cdd:pfam03154  261 VSPQPLPQ--PSLHGQMP--------------PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQ 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  259 DFANAYgTPfyPSQPVYQSAPiivPTQQQP-PPAKRekktirirdpnqggkditeeimsgggsrnPTPPIGRPASTPTPP 337
Cdd:pfam03154  321 SQQRIH-TP--PSQSQLQSQQ---PPREQPlPPAPL-----------------------------SMPHIKPPPTTPIPQ 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  338 QQLPsQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP- 416
Cdd:pfam03154  366 LPNP-QSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPa 429
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  417 EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdnsdicKKPCSVA 496
Cdd:pfam03154  430 QPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI-------QPPSSAS 499
                          410
                   ....*....|.
gi 1907155782  497 PHDSQLISSTI 507
Cdd:pfam03154  500 VSSSGPVPAAV 510
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1567-1714 6.95e-09

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 56.47  E-value: 6.95e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1567 EEKADD--ERIFDWVEANLDESQMS-------SPTFLRALMTAVCkaAIIADCsTFRVDTA-VIKQRVPILLKYLDSDte 1636
Cdd:cd11561      1 EEEEDErvDELGEFLKKNKDESGLSelkeilkEAERLDVVKDKAV--LVLAEV-LFDENIVkEIKKRKALLLKLVTDE-- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782 1637 kelQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKWeSSKDPAEQAGKGVA---LKSVTAFFTWLRE 1708
Cdd:cd11561     76 ---KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKW-YEKVSKKYVSKEKSkkvRKAAEPFVEWLEE 151

                   ....*.
gi 1907155782 1709 AEEESE 1714
Cdd:cd11561    152 AEEEEE 157
PHA03378 PHA03378
EBNA-3B; Provisional
60-477 8.85e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.92  E-value: 8.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   60 QTPAPQQIPRGPVQQPLEDRLFPPTVsavystvtQVARQPGPPTPAPYSAHEISKGLPSLAATPPGhaSSPGLSQTPYPS 139
Cdd:PHA03378   446 HSQAPTVVLHRPPTQPLEGPTGPLSV--------QAPLEPWQPLPHPQVTPVILHQPPAQGVQAHG--SMLDLLEKDDED 515
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  140 GQNAGPATLVYPQAPQTMNSQpqaRSPFFQR---------PQIQPPRAAIPNSSPSIRP-GVQTPTAVYQANQHIMMVNH 209
Cdd:PHA03378   516 MEQRVMATLLPPSPPQPRAGR---RAPCVYTedldiesdePASTEPVHDQLLPAPGLGPlQIQPLTSPTTSQLASSAPSY 592
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  210 LPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFaNAYGTPfYPSQPVYQSAPIIVPTQQQPP 289
Cdd:PHA03378   593 AQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITF-NVLVFP-TPHQPPQVEITPYKPTWTQIG 670
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  290 PAKREkktirirdPNQGGKDITEEIMSGGGSRNP---TPPIGRPASTPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPV 366
Cdd:PHA03378   671 HIPYQ--------PSPTGANTMLPIQWAPGTMQPpprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPG 742
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  367 TAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAA----GEPTPEPPRTSSPTSLPPLARSSLPSPMSAA 442
Cdd:PHA03378   743 RARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQqrprGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTK 822
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1907155782  443 LSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 477
Cdd:PHA03378   823 QILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
PHA03247 PHA03247
large tegument protein UL36; Provisional
61-434 1.01e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   61 TPAPQQIPRGPVQQPledrlfPP--TVSAVYSTVTQVARQPGPPTPAPYSAheiskGLPSLAATPPGHASSPGLSQTPYP 138
Cdd:PHA03247  2765 GPPAPAPPAAPAAGP------PRrlTRPAVASLSESRESLPSPWDPADPPA-----AVLAPAAALPPAASPAGPLPPPTS 2833
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  139 SGQNAGPATLVYPQAPQTMNSQPQARSPFFQRPqiqPPRAAIPNSSPSIRPGVQtptavyqanqhimmvnHLPMPYPVTQ 218
Cdd:PHA03247  2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRP---PSRSPAAKPAAPARPPVR----------------RLARPAVSRS 2894
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  219 GHQYCIPQYRHSGPPyvgppqqypvqppgpgpfypgpgpgdfanaygTPFYPSQPVYQSAPIIVPTQQQPPPAKRekkti 298
Cdd:PHA03247  2895 TESFALPPDQPERPP--------------------------------QPQAPPPPQPQPQPPPPPQPQPPPPPPP----- 2937
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  299 RIRDPNQGGKDITEEIMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKP 378
Cdd:PHA03247  2938 RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEET 3017
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907155782  379 KPDPVfqspsTVLRLVLSGEKKEQA-------GQMPETAAGEPTPEPPRTSSPTSLPPLARSS 434
Cdd:PHA03247  3018 DPPPV-----SLKQTLWPPDDTEDSdadslfdSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
317-481 1.41e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 50.23  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  317 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 395
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAvPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  396 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 469
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1907155782  470 SPTSCTAASGPS 481
Cdd:PRK07003   527 PPAPEARPPTPA 538
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-387 6.72e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 6.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782    5 QKPALKSGSAAAAGTGPGTGAAAAAAVPPPHPAAAAAAAAVAAAAApphpniralQTPAPQQIPRGP---VQQPLEdrLF 81
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATS---------QPPNQTQSTAAPhtlIQQTPT--LH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   82 PPTVSAVYSTVTQVARQPGP------PTPAPYSAHEISKGLPSLAATPPgHASSPGLSQ---TPYPSGQNAGPAtLVYPQ 152
Cdd:pfam03154  239 PQRLPSPHPPLQPMTQPPPPsqvspqPLPQPSLHGQMPPMPHSLQTGPS-HMQHPVPPQpfpLTPQSSQSQVPP-GPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  153 APQTMNSQPQARSPFFQRPQIQPPR----AAIPNSSPSIRPGVQTPTAVYQANQHIMMVNHLPMPYPVTQghqycipqyr 228
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPReqplPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQM---------- 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  229 HSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPvyqsaPIIVPTQQQPPPAKREKKTIRIRD-PNQgg 307
Cdd:pfam03154  387 NSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----PVLTQSQSLPPPAASHPPTSGLHQvPSQ-- 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  308 KDITEEIMSGGGSRNPTPPIGRPASTPT--PPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQE-EKPKPDPVF 384
Cdd:pfam03154  460 SPFPQHPFVPGGPPPITPPSGPPTSTSSamPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEpESPPPPPRS 539

                   ...
gi 1907155782  385 QSP 387
Cdd:pfam03154  540 PSP 542
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1666-1712 9.63e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 9.63e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155782 1666 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
90-233 1.36e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.95  E-value: 1.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   90 STVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGlsQTPYPSGQNAGPATLVYPQAPQTMNSQPQARSPFFQ 169
Cdd:pfam09770  210 PAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQ--QPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQ 287
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907155782  170 RPQIQPPRAaipnsspsirpgvQTPTAVYQaNQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPP 233
Cdd:pfam09770  288 FHQQPPPVP-------------VQPTQILQ-NPNRLSAARVGYPQNPQPGVQPAPAHQAHRQQG 337
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
317-487 1.99e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 1.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  317 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 395
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  396 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 475
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170
                   ....*....|..
gi 1907155782  476 AASGPSLTDNSD 487
Cdd:PRK12323   525 SIPDPATADPDD 536
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
266-452 3.16e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 45.04  E-value: 3.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  266 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQQLPSQVP 345
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRGPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  346 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 421
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907155782  422 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 452
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
PRK10263 PRK10263
DNA translocase FtsK; Provisional
75-425 4.78e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.08  E-value: 4.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   75 PLEDRLFPPTVSAVYSTVTQ--VARQ--PGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAGPATLVY 150
Cdd:PRK10263   336 PVEPVTQTPPVASVDVPPAQptVAWQpvPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQP 415
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  151 PQAPQTMNSQPQARSPFFQRPQIQPPRAAIPNSSPSIRPGVQtPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYrhs 230
Cdd:PRK10263   416 AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA-PQSTYQTEQ--------TYQQPAAQEPLYQQPQP--- 483
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  231 gppyvgppqqypvqppgpgpfypgpgpgdfanaygtpfYPSQPVYQSAPIIVPTQQQPPPAK--REKKTIRIRDPNQGG- 307
Cdd:PRK10263   484 --------------------------------------VEQQPVVEPEPVVEETKPARPPLYyfEEVEEKRAREREQLAa 525
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  308 --KDITEEIMSGGGSRNPTPPIGRPASTPTPPqqlpsqVPEHSPVVYGtVESAHLAASTPVTAASdqkqeekpkpdPVFQ 385
Cdd:PRK10263   526 wyQPIPEPVKEPEPIKSSLKAPSVAAVPPVEA------AAAVSPLASG-VKKATLATGAAATVAA-----------PVFS 587
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1907155782  386 spstvlrLVLSGEKKEQAGQmpetAAGEPTPEPPRTSSPT 425
Cdd:PRK10263   588 -------LANSGGPRPQVKE----GIGPQLPRPKRIRVPT 616
PRK10263 PRK10263
DNA translocase FtsK; Provisional
166-471 6.00e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 6.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  166 PFFQRPQIqpPRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPP 238
Cdd:PRK10263   339 PVTQTPPV--ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQ 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  239 QQYPVQPPGPgpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsgg 318
Cdd:PRK10263   406 PYYAPAAEQP---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ----- 459
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  319 GSRNPTPPIGRPAstPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEK------------PKPDPVfqs 386
Cdd:PRK10263   460 STYQTEQTYQQPA--AQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRarereqlaawyqPIPEPV--- 534
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  387 pstvlrlvlsgekKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEE 463
Cdd:PRK10263   535 -------------KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEG 601

                   ....*...
gi 1907155782  464 DAPPVPSP 471
Cdd:PRK10263   602 IGPQLPRP 609
PHA03378 PHA03378
EBNA-3B; Provisional
65-351 1.59e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   65 QQIPRGPVQQPLEDRlfPPTVSAVY--STVTQVARQPGPPTPAPYSAHEISKGLPSLAATP-------------PGHASS 129
Cdd:PHA03378   639 QPITFNVLVFPTPHQ--PPQVEITPykPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPpraptpmrppaapPGRAQR 716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  130 PGLSQTPYPSGQnAGPATLVYPQAPQTMNSQPQARSPFFQRPQIQPPRAAIPNSSpsirPGVQTPTAVYQANqhimmvnh 209
Cdd:PHA03378   717 PAAATGRARPPA-AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA----PGAPTPQPPPQAP-------- 783
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  210 lpmPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPF--YPS----------QPVYQS 277
Cdd:PHA03378   784 ---PAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSlkKPAalerqaaagpTPSPGS 860
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  278 --------APIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEI--MSGGGSRNPT--PPIGRPASTPTPPQQLPSQVP 345
Cdd:PHA03378   861 gtsdkivqAPVFYPPVLQPIQVMRQLGSVRAAAASTVTQAPTEYTgeRRGVGPMHPTdiPPSKRAKTDAYVESQPPHGGQ 940

                   ....*.
gi 1907155782  346 EHSPVV 351
Cdd:PHA03378   941 SHSFSV 946
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
329-484 1.59e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  329 RPASTPTPPQQLPSQvpEHSPVVygTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 399
Cdd:PHA03307    23 RPPATPGDAADDLLS--GSQGQL--VSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  400 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 478
Cdd:PHA03307    99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                   ....*.
gi 1907155782  479 GPSLTD 484
Cdd:PHA03307   179 PEETAR 184
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
316-481 2.05e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  316 SGGGSRNPTPPIGRPASTPTPPQQLPSQVPehSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 395
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAP 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  396 SGEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDA 465
Cdd:PRK07764   679 AAPPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQP 758
                          170
                   ....*....|....*.
gi 1907155782  466 PPVPSPTSCTAASGPS 481
Cdd:PRK07764   759 PPPPAPAPAAAPAAAP 774
PRK11901 PRK11901
hypothetical protein; Reviewed
270-397 2.22e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.36  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  270 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPPIGRPASTPTPPQQL 340
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  341 PSQVPEHSPVVYGT--VESAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 397
Cdd:PRK11901   193 AETHPTPPQKPATKkpAVNHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
55-232 2.98e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 42.30  E-value: 2.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   55 NIRALQTPAPQQIPRG--PVQQPLEdrlfpptvsavystVTQVARQPGPPTPAPYSAHEISK--------GLPSLAATPP 124
Cdd:pfam09606  305 NYQQQQTRQQQQQQGGnhPAAHQQQ--------------MNQSVGQGGQVVALGGLNHLETWnpgnfgglGANPMQRGQP 370
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  125 GHASSPglsqTPYPSGQnagpatLVYPQAPQTMNSQPQARSPFFQRPQIQPPRAAIPNSSPSirPGvqtptavyqanqhi 204
Cdd:pfam09606  371 GMMSSP----SPVPGQQ------VRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPS--PA-------------- 424
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907155782  205 mmvnHLPMPYP---VTQGHQYCIPQYRHSGP 232
Cdd:pfam09606  425 ----LIPSPSPqmsQQPAQQRTIGQDSPGGS 451
PRK10263 PRK10263
DNA translocase FtsK; Provisional
80-184 3.80e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.38  E-value: 3.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   80 LFPPTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNagPATLVYPQAPQTMNS 159
Cdd:PRK10263   744 LFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ--PQQPVAPQPQYQQPQ 821
                           90       100
                   ....*....|....*....|....*
gi 1907155782  160 QPQARSPFFQRPqiQPPRAAIPNSS 184
Cdd:PRK10263   822 QPVAPQPQYQQP--QQPVAPQPQDT 844
PHA03378 PHA03378
EBNA-3B; Provisional
57-196 3.95e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 3.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   57 RALQTPAPQQIPRGPVQQP--LEDRLFPPTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPglsq 134
Cdd:PHA03378   699 RAPTPMRPPAAPPGRAQRPaaATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAP---- 774
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907155782  135 TPYPSGQnAGPATLVYPQAPQTMNSQPQA----------RSPFFQRPQIQPPRAAIPNSSPSIRPGVQTPTA 196
Cdd:PHA03378   775 TPQPPPQ-APPAPQQRPRGAPTPQPPPQAgptsmqlmprAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAA 845
PRK10263 PRK10263
DNA translocase FtsK; Provisional
341-482 5.42e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 5.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  341 PSQVPEHSPVVYGTVESAHLAASTPVTAASdqkQEEKPKPDPVFQSPStvlrlVLSGEKKEQAGQMP-ETAAGEPTPEPP 419
Cdd:PRK10263   301 QPEYDEYDPLLNGAPITEPVAVAAAATTAT---QSWAAPVEPVTQTPP-----VASVDVPPAQPTVAwQPVPGPQTGEPV 372
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907155782  420 RTSSPTSLPPLARSSLPSPMSAALSSQPlFTAEDKCELPSSKEEDAPPVPSPTSCTAASGPSL 482
Cdd:PRK10263   373 IAPAPEGYPQQSQYAQPAVQYNEPLQQP-VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
283-466 7.25e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 40.92  E-value: 7.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  283 PTQQQPPPAKREKKTIRirdPNQGGKDiteeiMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPVVYGTVESAHLAA 362
Cdd:pfam13254  170 PSQPAQPAWMKELNKIR---QSRASVD-----LGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEA 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  363 STPVTaasdqKQEEKPKPDPVFQSPSTVLRLvlsgEKKEQAGQMPETAAGEPT--PEPPRTSSPTSLPPLARSSLPSPMS 440
Cdd:pfam13254  242 DTLST-----DKEQSPAPTSASEPPPKTKEL----PKDSEEPAAPSKSAEASTekKEPDTESSPETSSEKSAPSLLSPVS 312
                          170       180
                   ....*....|....*....|....*.
gi 1907155782  441 AALSSQPLFTAEDKCELPSSKEEDAP 466
Cdd:pfam13254  313 KASIDKPLSSPDRDPLSPKPKPQSPP 338
PHA03247 PHA03247
large tegument protein UL36; Provisional
20-487 7.75e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 7.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   20 GPGTGAAAAAAVPPPHPAAAAAAAAVAAAAAPPHPN----IRALQT--------PAPQQIPRGPVQQPleDRLFPPTVSA 87
Cdd:PHA03247  2497 DPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRmltwIRGLEElasddagdPPPPLPPAAPPAAP--DRSVPPPRPA 2574
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782   88 ---VYSTVTQVARQPG-PPTPA----PYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAG---PATLVYPQAPQT 156
Cdd:PHA03247  2575 prpSEPAVTSRARRPDaPPQSArpraPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDphpPPTVPPPERPRD 2654
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  157 MNSQPQARSPFFQRPQIQPPRAAIPNSSP---SIRPGVQTPTAVYQAnqhimmvnHLPMPYPVTQGHQYCIPQYRHSGPP 233
Cdd:PHA03247  2655 DPAPGRVSRPRRARRLGRAAQASSPPQRPrrrAARPTVGSLTSLADP--------PPPPPTPEPAPHALVSATPLPPGPA 2726
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  234 YVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQS---APIIVPTQQQPPPAKREKKTIRIRDPNQ----- 305
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAppaAPAAGPPRRLTRPAVASLSESRESLPSPwdpad 2806
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  306 -----GGKDITEEIMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPVVYG---------TVESAHLAASTPVTAASD 371
Cdd:PHA03247  2807 ppaavLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGgdvrrrppsRSPAAKPAAPARPPVRRL 2886
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155782  372 QKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTA 451
Cdd:PHA03247  2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA 2966
                          490       500       510
                   ....*....|....*....|....*....|....*..
gi 1907155782  452 EDKCELPSSKEEDAPPVPS-PTSctAASGPSLTDNSD 487
Cdd:PHA03247  2967 LVPGRVAVPRFRVPQPAPSrEAP--ASSTPPLTGHSL 3001
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH