NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568965255|ref|XP_006512726|]
View 

androglobin isoform X8 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
363-775 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is an 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled, A-H, according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket: including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between the helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.


:

Pssm-ID: 412094  Cd Length: 416  Bit Score: 592.99  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  363 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 442
Cdd:cd22307     1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  443 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 521
Cdd:cd22307    81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  522 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 601
Cdd:cd22307   161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  602 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 681
Cdd:cd22307   241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  682 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 760
Cdd:cd22307   321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                         410
                  ....*....|....*
gi 568965255  761 TVGKGQAVIPAFYFL 775
Cdd:cd22307   401 ETGGGHVVIPVFRLL 415
PTZ00121 super family cl31754
MAEBL; Provisional
896-1263 1.15e-03

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  896 QALKDMEKMDIKAE----KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAP---RFEPQQVQMP 968
Cdd:PTZ00121 1354 AAADEAEAAEEKAEaaekKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAdeaKKKAEEKKKA 1433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  969 TAVHSQQEDPNKPYWILRLVSEHTDSDYVDVKKDTER-ADEIRamKQAWETTEPGRAIKAAQARLKYLTQFIKKPVTTDT 1047
Cdd:PTZ00121 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKkADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1048 TTSAPSPETLSVSQSQTKSSEEGELDTGKYADIKELPPNAAGSVLWKKWQMTKTITSLTK--------FTSSESVPKEEp 1119
Cdd:PTZ00121 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKaeedknmaLRKAEEAKKAE- 1590
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1120 pQKEIPVVRQRSPTILETSPQQIRKALEFLDFSHYVRKTAAE-------AVLQTEELNKQQAMQKAEEIHQFRQHRSRIL 1192
Cdd:PTZ00121 1591 -EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEkkkveqlKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568965255 1193 SIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1263
Cdd:PTZ00121 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
 
Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
363-775 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is an 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled, A-H, according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket: including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between the helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.


Pssm-ID: 412094  Cd Length: 416  Bit Score: 592.99  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  363 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 442
Cdd:cd22307     1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  443 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 521
Cdd:cd22307    81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  522 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 601
Cdd:cd22307   161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  602 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 681
Cdd:cd22307   241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  682 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 760
Cdd:cd22307   321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                         410
                  ....*....|....*
gi 568965255  761 TVGKGQAVIPAFYFL 775
Cdd:cd22307   401 ETGGGHVVIPVFRLL 415
PTZ00121 PTZ00121
MAEBL; Provisional
896-1263 1.15e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  896 QALKDMEKMDIKAE----KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAP---RFEPQQVQMP 968
Cdd:PTZ00121 1354 AAADEAEAAEEKAEaaekKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAdeaKKKAEEKKKA 1433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  969 TAVHSQQEDPNKPYWILRLVSEHTDSDYVDVKKDTER-ADEIRamKQAWETTEPGRAIKAAQARLKYLTQFIKKPVTTDT 1047
Cdd:PTZ00121 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKkADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1048 TTSAPSPETLSVSQSQTKSSEEGELDTGKYADIKELPPNAAGSVLWKKWQMTKTITSLTK--------FTSSESVPKEEp 1119
Cdd:PTZ00121 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKaeedknmaLRKAEEAKKAE- 1590
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1120 pQKEIPVVRQRSPTILETSPQQIRKALEFLDFSHYVRKTAAE-------AVLQTEELNKQQAMQKAEEIHQFRQHRSRIL 1192
Cdd:PTZ00121 1591 -EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEkkkveqlKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568965255 1193 SIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1263
Cdd:PTZ00121 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1112-1264 3.28e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 41.65  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  1112 ESVPKEEPPQKEIPVVRQRSptILETSPQqiRKALEFLDFSHYVRKTAAEA---VLQTEELNKQQAMQKAEEIHQFRQHR 1188
Cdd:pfam17380  386 ERQQKNERVRQELEAARKVK--ILEEERQ--RKIQQQKVEMEQIRAEQEEArqrEVRRLEEERAREMERVRLEEQERQQQ 461
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  1189 SRILSirdiDQEERFKQKDEVLEMYGEMRDSVDEARQKILD----------IREVYRNKLLEAE-RLRMEALAAQEAAVK 1257
Cdd:pfam17380  462 VERLR----QQEEERKRKKLELEKEKRDRKRAEEQRRKILEkeleerkqamIEEERKRKLLEKEmEERQKAIYEEERRRE 537

                   ....*..
gi 568965255  1258 IEIEKKS 1264
Cdd:pfam17380  538 AEEERRK 544
 
Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
363-775 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is an 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled, A-H, according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket: including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between the helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.


Pssm-ID: 412094  Cd Length: 416  Bit Score: 592.99  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  363 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 442
Cdd:cd22307     1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  443 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 521
Cdd:cd22307    81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  522 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 601
Cdd:cd22307   161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  602 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 681
Cdd:cd22307   241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  682 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 760
Cdd:cd22307   321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                         410
                  ....*....|....*
gi 568965255  761 TVGKGQAVIPAFYFL 775
Cdd:cd22307   401 ETGGGHVVIPVFRLL 415
IQCD cd23767
IQ (isoleucine-glutamine) motif containing D (IQCD); IQCD, also called dynein regulatory ...
500-535 2.87e-04

IQ (isoleucine-glutamine) motif containing D (IQCD); IQCD, also called dynein regulatory complex protein 10 (DRC10), belongs to the IQ motif-containing protein family which contains a C-terminal conserved IQ motif domain and two coiled-coil domains. The IQ motif ([ILV]QxxxRxxxx[RK]), where x stands for any amino-acid residue, interacts with calmodulin (CaM) in a calcium-independent manner and is present in proteins with a wide diversity of biological functions. The IQCD protein was found to primarily accumulate in the acrosome area of round and elongating spermatids of the testis during late stage of spermiogenesis and was then localized to the acrosome and tail regions of mature spermatozoa. The expression of IQCD follows the trajectory of acrosome development during spermatogenesis. IQCD is associated with neuroblastoma and neurodegenerative diseases, and is reported to interact with the nuclear retinoid X receptor in the presence of 9-cis-retinoic acid, thereby activating the transcriptional activity of the receptor.


Pssm-ID: 467745 [Multi-domain]  Cd Length: 37  Bit Score: 39.45  E-value: 2.87e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 568965255  500 EKEYTSEEIAAAVKIQSMWKGCYVRLLMKARKPETK 535
Cdd:cd23767     1 EEEELQRMNRAATLIQALWRGYKVRKELKKKKKKGK 36
PTZ00121 PTZ00121
MAEBL; Provisional
896-1263 1.15e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  896 QALKDMEKMDIKAE----KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAP---RFEPQQVQMP 968
Cdd:PTZ00121 1354 AAADEAEAAEEKAEaaekKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAdeaKKKAEEKKKA 1433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  969 TAVHSQQEDPNKPYWILRLVSEHTDSDYVDVKKDTER-ADEIRamKQAWETTEPGRAIKAAQARLKYLTQFIKKPVTTDT 1047
Cdd:PTZ00121 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKkADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1048 TTSAPSPETLSVSQSQTKSSEEGELDTGKYADIKELPPNAAGSVLWKKWQMTKTITSLTK--------FTSSESVPKEEp 1119
Cdd:PTZ00121 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKaeedknmaLRKAEEAKKAE- 1590
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255 1120 pQKEIPVVRQRSPTILETSPQQIRKALEFLDFSHYVRKTAAE-------AVLQTEELNKQQAMQKAEEIHQFRQHRSRIL 1192
Cdd:PTZ00121 1591 -EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEkkkveqlKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568965255 1193 SIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1263
Cdd:PTZ00121 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1112-1264 3.28e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 41.65  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  1112 ESVPKEEPPQKEIPVVRQRSptILETSPQqiRKALEFLDFSHYVRKTAAEA---VLQTEELNKQQAMQKAEEIHQFRQHR 1188
Cdd:pfam17380  386 ERQQKNERVRQELEAARKVK--ILEEERQ--RKIQQQKVEMEQIRAEQEEArqrEVRRLEEERAREMERVRLEEQERQQQ 461
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568965255  1189 SRILSirdiDQEERFKQKDEVLEMYGEMRDSVDEARQKILD----------IREVYRNKLLEAE-RLRMEALAAQEAAVK 1257
Cdd:pfam17380  462 VERLR----QQEEERKRKKLELEKEKRDRKRAEEQRRKILEkeleerkqamIEEERKRKLLEKEmEERQKAIYEEERRRE 537

                   ....*..
gi 568965255  1258 IEIEKKS 1264
Cdd:pfam17380  538 AEEERRK 544
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH