NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1677538761|ref|NP_937887|]
View 

eukaryotic translation initiation factor 4 gamma 1 isoform 2 [Homo sapiens]

Protein Classification

MA3 and W2_eIF4G1_like domain-containing protein( domain architecture ID 10501430)

protein containing domains MIF4G, MA3, W2_eIF4G1_like, and W2

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
674-899 4.84e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.46  E-value: 4.84e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  674 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 753
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  754 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 833
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538761  834 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLR 899
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELR 200
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1351-1483 1.69e-57

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 194.81  E-value: 1.69e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1351 LPSEELNRQLEKLLKEGSSNQRVFDWIEANLSEQQIVSNTLVRALMTAVCYSAIIFETPLRVDVAVLKARAKLLQKYLC- 1429
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDd 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1677538761 1430 DEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1483
Cdd:cd11559     81 DEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1155-1267 4.38e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.38e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1155 LEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYE 1234
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1677538761 1235 ILELAEDMEIDIPHVWLYLAELVTPILQEGGVP 1267
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
28-489 8.07e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 8.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   28 VPTQQYPVQPGAPGF-----YPGASPTEfgtyAGAYYPAQGVQQFPTGVAPTPVlmnqPPQIAPKRERKTIRIRDPNQGG 102
Cdd:PHA03247  2568 VPPPRPAPRPSEPAVtsrarRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPL----PPDTHAPDPPPPSPSPAANEPD 2639
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  103 KDITEEIMSGARTASTPTPPQTGGGLEPQANGETPQV-AVIVRPDDRSQGAIIAD-----RPGLPGPEhSPSESQPSSPS 176
Cdd:PHA03247  2640 PHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAsSPPQRPRRRAARPTVGSltslaDPPPPPPT-PEPAPHALVSA 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  177 PTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPyrlSPEPTPLAEPILEVEVTLSKPVPESEFSSS 256
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG---PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  257 P-LQAPTPLASHTVEIHEPNGMVPsedlePEVESSPELAPPPACPSESPVPIAPTAQPEELLNGAPSPPAvDLSPVSEPE 335
Cdd:PHA03247  2796 EsLPSPWDPADPPAAVLAPAAALP-----PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPPSR 2869
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  336 EQAKEVTASMAPPTIPSATPATAPSATS----------PAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELLPPESTPIP 405
Cdd:PHA03247  2870 SPAAKPAAPARPPVRRLARPAVSRSTESfalppdqperPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  406 ANLSQNLEAAAATQVAVSVPKRrrkikelnkkeavgdlldafkeanPAVPEVENQPPAGSNPGPESEGSGVPPRPEEADE 485
Cdd:PHA03247  2950 AGAGEPSGAVPQPWLGALVPGR------------------------VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005

                   ....
gi 1677538761  486 TWDS 489
Cdd:PHA03247  3006 SWAS 3009
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
674-899 4.84e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.46  E-value: 4.84e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  674 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 753
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  754 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 833
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538761  834 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLR 899
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELR 200
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1351-1483 1.69e-57

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 194.81  E-value: 1.69e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1351 LPSEELNRQLEKLLKEGSSNQRVFDWIEANLSEQQIVSNTLVRALMTAVCYSAIIFETPLRVDVAVLKARAKLLQKYLC- 1429
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDd 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1677538761 1430 DEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1483
Cdd:cd11559     81 DEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
675-902 1.68e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 186.03  E-value: 1.68e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   675 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 754
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   755 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 834
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538761   835 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRGSN 902
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1155-1267 4.38e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.38e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1155 LEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYE 1234
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1677538761 1235 ILELAEDMEIDIPHVWLYLAELVTPILQEGGVP 1267
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1155-1267 5.63e-36

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 132.37  E-value: 5.63e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  1155 LEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYE 1234
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1677538761  1235 ILELAEDMEIDIPHVWLYLAELVTPILQEGGVP 1267
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1422-1505 1.32e-26

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 104.68  E-value: 1.32e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  1422 KLLQKYLCDEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFKW 1501
Cdd:smart00515    2 PLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTW 79

                    ....
gi 1677538761  1502 LREA 1505
Cdd:smart00515   80 LQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1434-1510 2.42e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 94.90  E-value: 2.42e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677538761 1434 ELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFKWLREAEEESD 1510
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
28-489 8.07e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 8.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   28 VPTQQYPVQPGAPGF-----YPGASPTEfgtyAGAYYPAQGVQQFPTGVAPTPVlmnqPPQIAPKRERKTIRIRDPNQGG 102
Cdd:PHA03247  2568 VPPPRPAPRPSEPAVtsrarRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPL----PPDTHAPDPPPPSPSPAANEPD 2639
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  103 KDITEEIMSGARTASTPTPPQTGGGLEPQANGETPQV-AVIVRPDDRSQGAIIAD-----RPGLPGPEhSPSESQPSSPS 176
Cdd:PHA03247  2640 PHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAsSPPQRPRRRAARPTVGSltslaDPPPPPPT-PEPAPHALVSA 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  177 PTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPyrlSPEPTPLAEPILEVEVTLSKPVPESEFSSS 256
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG---PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  257 P-LQAPTPLASHTVEIHEPNGMVPsedlePEVESSPELAPPPACPSESPVPIAPTAQPEELLNGAPSPPAvDLSPVSEPE 335
Cdd:PHA03247  2796 EsLPSPWDPADPPAAVLAPAAALP-----PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPPSR 2869
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  336 EQAKEVTASMAPPTIPSATPATAPSATS----------PAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELLPPESTPIP 405
Cdd:PHA03247  2870 SPAAKPAAPARPPVRRLARPAVSRSTESfalppdqperPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  406 ANLSQNLEAAAATQVAVSVPKRrrkikelnkkeavgdlldafkeanPAVPEVENQPPAGSNPGPESEGSGVPPRPEEADE 485
Cdd:PHA03247  2950 AGAGEPSGAVPQPWLGALVPGR------------------------VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005

                   ....
gi 1677538761  486 TWDS 489
Cdd:PHA03247  3006 SWAS 3009
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
53-366 5.27e-05

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 47.65  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   53 TYAGAYYPAQGVQQFPTGVAPTPVLMNQPPQIAPKRERKTIRIRDPNQGGKDiTEEIMSGARTA--STPTPPQTGGGLEP 130
Cdd:pfam17823  116 AAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASA-PHAASPAPRTAasSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  131 QANGETPQVAVIVRPDDRSQGAIIADRPGLpGPEHSPSESQPSSPSPTPSPSPVLEPGSepnLAVLSIPGDTMTTIQMSV 210
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGHPAA-GTALAAVGNSSPAAGTVTAAVGTVTPAA---LATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  211 EESTPISRetgepyRLSPeptplaepilevevtlSKPVPESEFSSSPL-----QAPTPLASHTVEihepngmvpsedlEP 285
Cdd:pfam17823  271 NMGDPHAR------RLSP----------------AKHMPSDTMARNPAapmgaQAQGPIIQVSTD-------------QP 315
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  286 EVESSPELAPPPACPSESPvpiaptaqpeellNGAPSPPAVDLSPVSEPEEQAKEVTASMAP-------PTIPSATPATA 358
Cdd:pfam17823  316 VHNTAGEPTPSPSNTTLEP-------------NTPKSVASTNLAVVTTTKAQAKEPSASPVPvlhtsmiPEVEATSPTTQ 382

                   ....*...
gi 1677538761  359 PSATSPAQ 366
Cdd:pfam17823  383 PSPLLPTQ 390
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
281-363 1.81e-04

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 46.15  E-value: 1.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  281 EDLEPEVESSPELAPPPACPSESPVPIAPTAQPEEllngAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATAPS 360
Cdd:COG5373     31 EELEAELAEAAEAASAPAEPEPEAAAAATAAAPEA----APAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAASS 106

                   ...
gi 1677538761  361 ATS 363
Cdd:COG5373    107 FEE 109
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
674-899 4.84e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.46  E-value: 4.84e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  674 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 753
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  754 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 833
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538761  834 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLR 899
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELR 200
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1351-1483 1.69e-57

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 194.81  E-value: 1.69e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1351 LPSEELNRQLEKLLKEGSSNQRVFDWIEANLSEQQIVSNTLVRALMTAVCYSAIIFETPLRVDVAVLKARAKLLQKYLC- 1429
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDd 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1677538761 1430 DEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1483
Cdd:cd11559     81 DEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
675-902 1.68e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 186.03  E-value: 1.68e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   675 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 754
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   755 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 834
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538761   835 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRGSN 902
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1155-1267 4.38e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.38e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1155 LEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYE 1234
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1677538761 1235 ILELAEDMEIDIPHVWLYLAELVTPILQEGGVP 1267
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1155-1267 5.63e-36

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 132.37  E-value: 5.63e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  1155 LEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYE 1234
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1677538761  1235 ILELAEDMEIDIPHVWLYLAELVTPILQEGGVP 1267
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1422-1505 1.32e-26

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 104.68  E-value: 1.32e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  1422 KLLQKYLCDEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFKW 1501
Cdd:smart00515    2 PLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTW 79

                    ....
gi 1677538761  1502 LREA 1505
Cdd:smart00515   80 LQEA 83
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1352-1477 2.12e-23

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 97.16  E-value: 2.12e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1352 PSEELNRQLEKLLKEG-SSNQRVFDWIEANLSEQQIVSNTLVRALMTAVCYSAIIFE----TPLRVDVAVLKARAKLLQK 1426
Cdd:cd11473      2 KNKKLRDSLLKELEEDkSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADsislTQKEQLVLVLKKYGPVLRE 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1677538761 1427 YLCD-EQKELQALYALQALVVT--LEQPPNLLRMFFDALYDEDVVKEDAFYSWE 1477
Cdd:cd11473     82 LLKLiKKDQLYLLLKIEKLCLQlkLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1434-1510 2.42e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 94.90  E-value: 2.42e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677538761 1434 ELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFKWLREAEEESD 1510
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1421-1510 4.82e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 74.60  E-value: 4.82e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1421 AKLLQKYLCDEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQGKGVALKSVTAFFK 1500
Cdd:cd11558     80 GPLLENYVKSQDDQVELLLALEEFCLESEEGGPLFAKLLHALYDLDILEEEAILEWWEEPDAGADEEMKKVRELVKKFIE 159
                           90
                   ....*....|
gi 1677538761 1501 WLREAEEESD 1510
Cdd:cd11558    160 WLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1352-1510 5.42e-09

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 56.47  E-value: 5.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1352 PSEELNR--QLEKLLKEGSSNQRVFDwIEANLSEQQIVSNTLVRALmtavcysAIIFETPLRVD-VAVLKARAKLLQKYL 1428
Cdd:cd11561      1 EEEEDERvdELGEFLKKNKDESGLSE-LKEILKEAERLDVVKDKAV-------LVLAEVLFDENiVKEIKKRKALLLKLV 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1429 CDEQKELQALYALQALVVtlEQPPNLLRMF---FDALYDEDVVKEDAFYSW---ESSKDPAEQQGKGVaLKSVTAFFKWL 1502
Cdd:cd11561     73 TDEKAQKALLGGIERFCG--KHSPELLKKVpliLKALYDNDILEEEVILKWyekVSKKYVSKEKSKKV-RKAAEPFVEWL 149

                   ....*...
gi 1677538761 1503 REAEEESD 1510
Cdd:cd11561    150 EEAEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1348-1508 4.69e-08

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 54.91  E-value: 4.69e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1348 QRALPSEELNRQLEKLLKEGSSNQRVFDWIEANLSEQQIVSN--------TLVRALMTAVCYSA---IIFETPLRVdvav 1416
Cdd:cd11560     29 YRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKSSlpehevvgLLWTALMDAVEWSKkedQIAEQALRH---- 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761 1417 LKARAKLLQKYLCDEQKELQALYALQalVVTLEQPpNLLRMFFD---ALYDEDVVKEDAFYSWesSKDPAEQQGKGVALK 1493
Cdd:cd11560    105 LKKYAPLLAAFCTTARAELALLNKIQ--EYCYENM-KFMKVFQKivkLLYKADVLSEDAILKW--YKKGHSPKGKQVFLK 179
                          170
                   ....*....|....*
gi 1677538761 1494 SVTAFFKWLREAEEE 1508
Cdd:cd11560    180 QMEPFVEWLQEAEEE 194
PHA03247 PHA03247
large tegument protein UL36; Provisional
28-489 8.07e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 8.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   28 VPTQQYPVQPGAPGF-----YPGASPTEfgtyAGAYYPAQGVQQFPTGVAPTPVlmnqPPQIAPKRERKTIRIRDPNQGG 102
Cdd:PHA03247  2568 VPPPRPAPRPSEPAVtsrarRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPL----PPDTHAPDPPPPSPSPAANEPD 2639
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  103 KDITEEIMSGARTASTPTPPQTGGGLEPQANGETPQV-AVIVRPDDRSQGAIIAD-----RPGLPGPEhSPSESQPSSPS 176
Cdd:PHA03247  2640 PHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAsSPPQRPRRRAARPTVGSltslaDPPPPPPT-PEPAPHALVSA 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  177 PTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPyrlSPEPTPLAEPILEVEVTLSKPVPESEFSSS 256
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG---PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  257 P-LQAPTPLASHTVEIHEPNGMVPsedlePEVESSPELAPPPACPSESPVPIAPTAQPEELLNGAPSPPAvDLSPVSEPE 335
Cdd:PHA03247  2796 EsLPSPWDPADPPAAVLAPAAALP-----PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPPSR 2869
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  336 EQAKEVTASMAPPTIPSATPATAPSATS----------PAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELLPPESTPIP 405
Cdd:PHA03247  2870 SPAAKPAAPARPPVRRLARPAVSRSTESfalppdqperPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  406 ANLSQNLEAAAATQVAVSVPKRrrkikelnkkeavgdlldafkeanPAVPEVENQPPAGSNPGPESEGSGVPPRPEEADE 485
Cdd:PHA03247  2950 AGAGEPSGAVPQPWLGALVPGR------------------------VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005

                   ....
gi 1677538761  486 TWDS 489
Cdd:PHA03247  3006 SWAS 3009
PHA03247 PHA03247
large tegument protein UL36; Provisional
29-485 3.00e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   29 PTQQYPVQPGAPgFYPGASPTEFGTyagayyPAQGVQQFPTGVAPTPVLMNQPPQIAPKRERKTIRIRDpnqggkdiTEE 108
Cdd:PHA03247  2478 PVYRRPAEARFP-FAAGAAPDPGGG------GPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRG--------LEE 2542
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  109 IMSGarTASTPTPPQTGGGLEPQANGETPQVAVIVRPDDRSQGAIiADRPGLPgPEHSPSESQPSSPSPTPSPspvLEPG 188
Cdd:PHA03247  2543 LASD--DAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRAPVDDRGDPRGP---APPS 2615
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  189 SEPnlavlsiPGDTMTTIQMSVEESTPISRETGEPYRLSPEPTPLAEPILEvEVTLSKPV---PESEFSSSPLQAPTPLA 265
Cdd:PHA03247  2616 PLP-------PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRArrlGRAAQASSPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  266 shtveIHEPNGMVPSEDLEPEVESSPELAPPPACPSeSPVPIAPTAQ-----PEELLNGAPSPPAVDLSPVSEPEEQAKE 340
Cdd:PHA03247  2688 -----ARPTVGSLTSLADPPPPPPTPEPAPHALVSA-TPLPPGPAAArqaspALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  341 VTASMAPPTIPSATPATAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELLPPESTPI-----PANLSQNLEAA 415
Cdd:PHA03247  2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgplppPTSAQPTAPPP 2841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  416 AATQVAVSVP-----------KRRRKIKELNKKEAVGDLLDAFKEANPAV---PEVENQPPAGSNPGPESEgSGVPPRPE 481
Cdd:PHA03247  2842 PPGPPPPSLPlggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVsrsTESFALPPDQPERPPQPQ-APPPPQPQ 2920

                   ....
gi 1677538761  482 EADE 485
Cdd:PHA03247  2921 PQPP 2924
PHA03247 PHA03247
large tegument protein UL36; Provisional
19-365 3.23e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   19 PGQGRSTYVVPTQQYPVQPGAP--GFYPGASPTEFGTYAG-AYYPAQGVQQFPtgvAPTPVLMNQPPQIAPKRERKTIRI 95
Cdd:PHA03247  2658 PGRVSRPRRARRLGRAAQASSPpqRPRRRAARPTVGSLTSlADPPPPPPTPEP---APHALVSATPLPPGPAAARQASPA 2734
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   96 RDPNQGGKDITEEIMSGARTASTPTPPQTGGGLEPQ--ANGETPQVAVIVRPDDRSQGAIIADRPGLPGPEHSPSESQPS 173
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAP 2814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  174 SPSPTPSPSPVlePGSEPNLAVL-------SIPGDTMTTIQMSVEESTPISREtgePYRLSPEPTPLAEPILEVEvTLSK 246
Cdd:PHA03247  2815 AAALPPAASPA--GPLPPPTSAQptappppPGPPPPSLPLGGSVAPGGDVRRR---PPSRSPAAKPAAPARPPVR-RLAR 2888
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  247 PVPESEFSSSPLQAPTPLASHTVEIHEPngmvPSEDLEPEVESSPELAPPPacPSESPVPIAPTAQPEELLNGAPSPPAV 326
Cdd:PHA03247  2889 PAVSRSTESFALPPDQPERPPQPQAPPP----PQPQPQPPPPPQPQPPPPP--PPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1677538761  327 DLSPVSEPEEQAKEVTASMAPPTIPSATPATAPSATSPA 365
Cdd:PHA03247  2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
318-486 9.58e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 9.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  318 NGAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELL 397
Cdd:PRK12323   369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  398 PPESTPIPANLSQNLEAAAATQ----VAVSVPKRRRKIKElnkKEAVGDLLDAFKEANPAVPEVenqPPAGSNPGPESEG 473
Cdd:PRK12323   449 APAPAPAAAPAAAARPAAAGPRpvaaAAAAAPARAAPAAA---PAPADDDPPPWEELPPEFASP---APAQPDAAPAGWV 522
                          170
                   ....*....|...
gi 1677538761  474 SGVPPRPEEADET 486
Cdd:PRK12323   523 AESIPDPATADPD 535
rne PRK10811
ribonuclease E; Reviewed
213-366 2.11e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 49.27  E-value: 2.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  213 STPISR-ETGEPYRLSPEPTPLAEPILeVEVTLSKPVPESEFSSSPLQAPT-PLASHTVEIHEPNGMVPSEDLEPEVESS 290
Cdd:PRK10811   844 RYPVVRpQDVQVEEQREAEEVQVQPVV-AEVPVAAAVEPVVSAPVVEAVAEvVEEPVVVAEPQPEEVVVVETTHPEVIAA 922
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  291 PELAPPP-------ACPSESPVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATAPSATS 363
Cdd:PRK10811   923 PVTEQPQvitesdvAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVA 1002

                   ...
gi 1677538761  364 PAQ 366
Cdd:PRK10811  1003 PAQ 1005
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
53-366 5.27e-05

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 47.65  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   53 TYAGAYYPAQGVQQFPTGVAPTPVLMNQPPQIAPKRERKTIRIRDPNQGGKDiTEEIMSGARTA--STPTPPQTGGGLEP 130
Cdd:pfam17823  116 AAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASA-PHAASPAPRTAasSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  131 QANGETPQVAVIVRPDDRSQGAIIADRPGLpGPEHSPSESQPSSPSPTPSPSPVLEPGSepnLAVLSIPGDTMTTIQMSV 210
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGHPAA-GTALAAVGNSSPAAGTVTAAVGTVTPAA---LATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  211 EESTPISRetgepyRLSPeptplaepilevevtlSKPVPESEFSSSPL-----QAPTPLASHTVEihepngmvpsedlEP 285
Cdd:pfam17823  271 NMGDPHAR------RLSP----------------AKHMPSDTMARNPAapmgaQAQGPIIQVSTD-------------QP 315
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  286 EVESSPELAPPPACPSESPvpiaptaqpeellNGAPSPPAVDLSPVSEPEEQAKEVTASMAP-------PTIPSATPATA 358
Cdd:pfam17823  316 VHNTAGEPTPSPSNTTLEP-------------NTPKSVASTNLAVVTTTKAQAKEPSASPVPvlhtsmiPEVEATSPTTQ 382

                   ....*...
gi 1677538761  359 PSATSPAQ 366
Cdd:pfam17823  383 PSPLLPTQ 390
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
231-477 6.26e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.54  E-value: 6.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  231 TPLAEPILEVEVTLSKPVPESEFSSSPLQAPTPLASHTVEIHEPNGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPT 310
Cdd:PRK07003   416 AAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPR 495
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  311 AQPEELLNGAPSPPAVDLSPVSEPEEqakevTASMAPPTiPSATPAtAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESE 390
Cdd:PRK07003   496 AAAPSAATPAAVPDARAPAAASREDA-----PAAAAPPA-PEARPP-TPAAAAPAARAGGAAAALDVLRNAGMRVSSDRG 568
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  391 KGgeellpPESTPIPANLSQNLEAAAATQVAVSVPKRRRKIKELNKKEAVGDLLDAFKEANPAVPEVENQPPAGSNPGPE 470
Cdd:PRK07003   569 AR------AAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPPWEDIPPDDYVPLSA 642

                   ....*..
gi 1677538761  471 SEGSGVP 477
Cdd:PRK07003   643 DEGFGGP 649
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
241-426 9.09e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.22  E-value: 9.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  241 EVTLSKpVPESEFSSSPLQ-----APTPLASHTVEIHEPNGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPTAQPEE 315
Cdd:pfam05109  418 KVIFSK-APESTTTSPTLNttgfaAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRD 496
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  316 llNG----------------APSPPAVDLSP-VSEPEEQAKEVTASMAPPTIPSATPatAPSATSPAQEEEMEEEEEEEE 378
Cdd:pfam05109  497 --NGteskapdmtsptsavtTPTPNATSPTPaVTTPTPNATSPTLGKTSPTSAVTTP--TPNATSPTPAVTTPTPNATIP 572
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1677538761  379 GEAGEAGEAESEKGGEELLPP---ESTPIPANLSQNLEAAAATQVAVSVPK 426
Cdd:pfam05109  573 TLGKTSPTSAVTTPTPNATSPtvgETSPQANTTNHTLGGTSSTPVVTSPPK 623
PRK10263 PRK10263
DNA translocase FtsK; Provisional
97-358 9.64e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.39  E-value: 9.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761   97 DPNQGGKDITEEIMSGARTAS------------TPTPPQTGgglePQANGETPQVAVIVRPDDRSQGAIIADRP------ 158
Cdd:PRK10263   308 DPLLNGAPITEPVAVAAAATTatqswaapvepvTQTPPVAS----VDVPPAQPTVAWQPVPGPQTGEPVIAPAPegypqq 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  159 ---GLPGPEHSPSESQPSSPSPTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPyrlsPEPTPLAE 235
Cdd:PRK10263   384 sqyAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEE----QQSTFAPQ 459
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  236 PILEVEVTLSKPVPESEFSSSPLQAPTPLASHTV-EIHEPNGMVPSEDLEPEVESS-----PELA----PPPAcPSESPV 305
Cdd:PRK10263   460 STYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEpVVEETKPARPPLYYFEEVEEKrarerEQLAawyqPIPE-PVKEPE 538
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1677538761  306 PIAPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATA 358
Cdd:PRK10263   539 PIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANS 591
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
281-363 1.81e-04

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 46.15  E-value: 1.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  281 EDLEPEVESSPELAPPPACPSESPVPIAPTAQPEEllngAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATAPS 360
Cdd:COG5373     31 EELEAELAEAAEAASAPAEPEPEAAAAATAAAPEA----APAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAASS 106

                   ...
gi 1677538761  361 ATS 363
Cdd:COG5373    107 FEE 109
PHA03377 PHA03377
EBNA-3C; Provisional
216-526 2.51e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 45.81  E-value: 2.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  216 ISRETGEPYRLSPEPTPLAEPILEVEVTLSKPVPESEFSSSPLQAPTPLASHTVEIhEPNGMVPSEdlepevESSPELAP 295
Cdd:PHA03377   408 VSRVPWRKPRTLPWPTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGPSDQPSVPV-EPAHLTPVE------HTTVILHQ 480
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  296 PPacPSESPVPIAPTAQPEELLNGA-----------------------PSPPAVDLSPVSEPEEQAKEVTASMAPPTIPS 352
Cdd:PHA03377   481 PP--QSPPTVAIKPAPPPSRRRRGAcvvydddiievidvetteeeesvTQPAKPHRKVQDGFQRSGRRQKRATPPKVSPS 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  353 atpATAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELLPPESTPIPANLSQNLEAAAATQVAVSVPKRRRKIK 432
Cdd:PHA03377   559 ---DRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVVRMFLR 635
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  433 ELNKKEAVGDLLDAFKEANPAVPEVENQPPAGSNPGPESEGSgvPPRPEEADETWDSkedKIHNAENIQPGEQKYEYKSD 512
Cdd:PHA03377   636 ERLLEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQST--PPRPSWLPSVFVL---PSVDAGRAQPSEESHLSSMS 710
                          330
                   ....*....|....
gi 1677538761  513 QWKPLNLEEKKRYD 526
Cdd:PHA03377   711 PTQPISHEEQPRYE 724
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
189-365 2.51e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 2.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  189 SEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPyrlSPEPTPLAEPILEVEVTLSKPVPESEFSSSPLQAPTPLASHT 268
Cdd:PRK12323   341 SELALAPDEYAGFTMTLLRMLAFRPGQSGGGAGPA---TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  269 VEIHEPNGMVPSEDLEPEVESSPELAP-----PPACPSESPVPIAPTAQPeellnGAPSPPAVDLSPVSEPEEQAKEVTA 343
Cdd:PRK12323   418 AVAAAPARRSPAPEALAAARQASARGPggapaPAPAPAAAPAAAARPAAA-----GPRPVAAAAAAAPARAAPAAAPAPA 492
                          170       180
                   ....*....|....*....|....*
gi 1677538761  344 SMAPP---TIPSATPATAPSATSPA 365
Cdd:PRK12323   493 DDDPPpweELPPEFASPAPAQPDAA 517
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
257-365 2.73e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 45.30  E-value: 2.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  257 PLQAPTPLASHTVEIHEPngmVPSEDLEPEVESSPELAPPPAC------------------PSESPVPIAPTAQPEEL-- 316
Cdd:PLN03209   324 PSQRVPPKESDAADGPKP---VPTKPVTPEAPSPPIEEEPPQPkavvprplspytayedlkPPTSPIPTPPSSSPASSks 400
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538761  317 LNGAPSPPAVDLSPVSE-----PEEQAKEVTASMA-------------PPTIPSATPATA--PSATSPA 365
Cdd:PLN03209   401 VDAVAKPAEPDVVPSPGsasnvPEVEPAQVEAKKTrplspyaryedlkPPTSPSPTAPTGvsPSVSSTS 469
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
221-364 7.08e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 7.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  221 GEPYRLSPEPTPLAEPilevEVTLSKPVPESEFSSSPLQAPTPLAShtveihepngMVPSEDLEPEVESSPELAPPPACP 300
Cdd:PRK14951   369 AAEAAAPAEKKTPARP----EAAAPAAAPVAQAAAAPAPAAAPAAA----------ASAPAAPPAAAPPAPVAAPAAAAP 434
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538761  301 SESPVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEeqakevTASMAPPTIPSATPATAPSATSP 364
Cdd:PRK14951   435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPE------PAVASAAPAPAAAPAAARLTPTE 492
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
245-480 1.49e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  245 SKPVP---ESEFSSSPLQapTPLASHTVEIHEPNGMVPSEDLEPEVESS-PELAPPPACPSESPVPIAPTAQPeellnga 320
Cdd:pfam03154  147 SIPSPqdnESDSDSSAQQ--QILQTQPPVLQAQSGAASPPSPPPPGTTQaATAGPTPSAPSVPPQGSPATSQP------- 217
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  321 PSPPAVDLSPVSEPEEqakevTASMAPPTIPSATPATAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESEKGGEELL--- 397
Cdd:pfam03154  218 PNQTQSTAAPHTLIQQ-----TPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMqhp 292
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  398 -PPESTPIPANLSQNlEAAAATQVAVSVPKRRRKIKELNKKEAVGDLLDAFKEANPA-------------------VPEV 457
Cdd:pfam03154  293 vPPQPFPLTPQSSQS-QVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAplsmphikpppttpipqlpNPQS 371
                          250       260
                   ....*....|....*....|...
gi 1677538761  458 ENQPPAGSNPGPESEGSGVPPRP 480
Cdd:pfam03154  372 HKHPPHLSGPSPFQMNSNLPPPP 394
rne PRK10811
ribonuclease E; Reviewed
257-425 2.57e-03

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 42.33  E-value: 2.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  257 PLQAPTPL---------ASHTVEIHEP-----------NGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPTAQPEEL 316
Cdd:PRK10811   820 PTQSPMPLtvacaspemASGKVWIRYPvvrpqdvqveeQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPV 899
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  317 LNGAPSPPAVDlsPVSEPEEQAKEVTASMAPPTIPSATPATAPSATSPAQEEEMEEEEEEEEGEAGEAGEAESEKGGEEL 396
Cdd:PRK10811   900 VVAEPQPEEVV--VVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVA 977
                          170       180
                   ....*....|....*....|....*....
gi 1677538761  397 LPPESTPIPANLSQNLEAAAATQVAVSVP 425
Cdd:PRK10811   978 QPAAPVVAEVAAEVETVTAVEPEVAPAQV 1006
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
245-366 2.99e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 2.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  245 SKPVPESEfSSSPLQAPTPLASHTVEIHEPNGMVPsedlePEVESSPELAPPPACPSESPVPIAPTAQPEELlnGAPSPP 324
Cdd:PRK14951   372 AAAPAEKK-TPARPEAAAPAAAPVAQAAAAPAPAA-----APAAAASAPAAPPAAAPPAPVAAPAAAAPAAA--PAAAPA 443
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1677538761  325 AVDLSPVSEPEEQAKEVTASMAPPTIPSATPATAPSATSPAQ 366
Cdd:PRK14951   444 AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
187-363 3.87e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 3.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  187 PGSEPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPYRLSPEPTPLAEPileVEVTLSKPVPESEFSSSPLQAPTPLAS 266
Cdd:PRK12323   387 PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAA---RQASARGPGGAPAPAPAPAAAPAAAAR 463
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  267 HTVeihepngmvpsedlepevessPELAPPPACPSESPVPIAPTAQP----------EELLNGAPSPPAVDLSPVSEPEE 336
Cdd:PRK12323   464 PAA---------------------AGPRPVAAAAAAAPARAAPAAAPapadddpppwEELPPEFASPAPAQPDAAPAGWV 522
                          170       180
                   ....*....|....*....|....*..
gi 1677538761  337 QAKEVTASMAPPTIPSATPATAPSATS 363
Cdd:PRK12323   523 AESIPDPATADPDDAFETLAPAPAAAP 549
Rib_recp_KP_reg pfam05104
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ...
272-365 4.19e-03

Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.


Pssm-ID: 461548 [Multi-domain]  Cd Length: 140  Bit Score: 39.33  E-value: 4.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  272 HEPNGMVPSEDLEPEVESSP-ELAPPPACPSESPVPiAPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKEVTASMAPPTI 350
Cdd:pfam05104   44 EKPNGKLPESEQADESEEEPrEFKTPDEAPSAALEP-EPVPTPVPAPVEPEPAPPSESPAPSPKEKKKKEKKSAKVEPAE 122
                           90
                   ....*....|....*
gi 1677538761  351 PSATPATAPSATSPA 365
Cdd:pfam05104  123 TPEAVQPKPALEKEE 137
PRK10263 PRK10263
DNA translocase FtsK; Provisional
190-366 6.89e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 6.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  190 EPNLAVLSIPGDTMTTIQMSVEESTPISRETGEPYrLSPEP------TPLAEPILEVEVTLSKPVPESEFSSSPLQAPTP 263
Cdd:PRK10263   338 EPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPV-IAPAPegypqqSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPA 416
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  264 LASHTVEihEPNGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPTAQPEellngapsppavdlSPVSEPEEQAKEVTA 343
Cdd:PRK10263   417 QQPYYAP--APEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE--------------QTYQQPAAQEPLYQQ 480
                          170       180
                   ....*....|....*....|...
gi 1677538761  344 SMAPPTIPSATPATAPSATSPAQ 366
Cdd:PRK10263   481 PQPVEQQPVVEPEPVVEETKPAR 503
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
236-349 7.70e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 40.72  E-value: 7.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  236 PILEVEVTLSKPVPESEFSSSPLQAPTPLASHTVEIHEPNGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPTAQPEE 315
Cdd:PRK14948   348 PRLWLEVTLLGLLPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPT 427
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1677538761  316 llngaPSPPAVDLSPVSEPEEQAKEVTASMAPPT 349
Cdd:PRK14948   428 -----PPANAANAPPSLNLEELWQQILAKLELPS 456
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
240-358 8.59e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 40.56  E-value: 8.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  240 VEVTLSK-PVPESEFSSSPlqAPTPLASHTVEIHEPNGMVPseDLEPEVESSPELAPPPacpsesPVPIAPTAQPEElln 318
Cdd:PRK14950   356 IEALLVPvPAPQPAKPTAA--APSPVRPTPAPSTRPKAAAA--ANIPPKEPVRETATPP------PVPPRPVAPPVP--- 422
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1677538761  319 gAPSPPAVDLSPVSEPEEQAKEVTASMAPPTIPSATPATA 358
Cdd:PRK14950   423 -HTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIADG 461
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
256-366 9.63e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 40.22  E-value: 9.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538761  256 SPLQAPTPLASHTVEIHEPNGMVPSEDLEPEVESspelAPPPACPSESPVPIAPTAQPEellNGAPSPPAVDLSPVSEPE 335
Cdd:COG3266    253 SALKAPSQASSASAPATTSLGEQQEVSLPPAVAA----QPAAAAAAQPSAVALPAAPAA---AAAAAAPAEAAAPQPTAA 325
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1677538761  336 EQAKEVTASMAPPTIPSATPATAPSATSPAQ 366
Cdd:COG3266    326 KPVVTETAAPAAPAPEAAAAAAAPAAPAVAK 356
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH