NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217311042|ref|XP_047291684|]
View 

keratin, type I cytoskeletal 23 isoform X1 [Homo sapiens]

Protein Classification

intermediate filament family protein( domain architecture ID 705869)

intermediate filament (IF) family protein is a primordial component of the cytoskeleton and the nuclear envelope; such as type I keratins

CATH:  1.20.5.170
Gene Ontology:  GO:0005882

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Filament super family cl25641
Intermediate filament protein;
71-380 1.85e-96

Intermediate filament protein;


The actual alignment was detected with superfamily member pfam00038:

Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 291.44  E-value: 1.85e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  71 NGKATMQNLNDRLASYLEKVRALEEANMKLESRILKWHQQRDPGSKKDYSQYEENITHLQEQIVDGKMTNAQIILLIDNA 150
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 151 RMAVDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEM-EKHHVPSDFNVNVKV 229
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVrELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 230 DTGPREDLIKVLEDMRQEYELIIKKKHRDLDTWYKEQSAAMSQEAA-SPATVQSRQGDIHELKRTFQALEIDLQTQYSTK 308
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAArNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217311042 309 SALENMLSETQSRYSCKLQDMQEIISHYEEELTQLRHELERQNNEYQVLLGIKTHLEKEITTYRRLLEGESE 380
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEEC 312
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
71-380 1.85e-96

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 291.44  E-value: 1.85e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  71 NGKATMQNLNDRLASYLEKVRALEEANMKLESRILKWHQQRDPGSKKDYSQYEENITHLQEQIVDGKMTNAQIILLIDNA 150
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 151 RMAVDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEM-EKHHVPSDFNVNVKV 229
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVrELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 230 DTGPREDLIKVLEDMRQEYELIIKKKHRDLDTWYKEQSAAMSQEAA-SPATVQSRQGDIHELKRTFQALEIDLQTQYSTK 308
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAArNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217311042 309 SALENMLSETQSRYSCKLQDMQEIISHYEEELTQLRHELERQNNEYQVLLGIKTHLEKEITTYRRLLEGESE 380
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEEC 312
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
170-387 2.57e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.51  E-value: 2.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  170 DLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEMEKHHVPSDFNVNVKVDTGPREDLIKVLEDMRQEYE 249
Cdd:TIGR02168  695 ELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAE 774
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  250 LIIKKKHRDLDTwyKEQSAAMSQEAASpatVQSRQGDihELKRTFQALEIDLQTQYSTKSALENMLSETQSRysckLQDM 329
Cdd:TIGR02168  775 EELAEAEAEIEE--LEAQIEQLKEELK---ALREALD--ELRAELTLLNEEAANLRERLESLERRIAATERR----LEDL 843
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217311042  330 QEIISHYEEELTQLRHELERQ-------NNEYQVLLGIKTHLEKEITTYRRLLEGESEGTREESK 387
Cdd:TIGR02168  844 EEQIEELSEDIESLAAEIEELeelieelESELEALLNERASLEEALALLRSELEELSEELRELES 908
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
141-390 3.88e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 39.66  E-value: 3.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 141 AQIILLIDNARMAVDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQ--EMEKHh 218
Cdd:PRK03918  168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEieELEKE- 246
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 219 vpsdfNVNVKVDTGPREDLIKVLEDMRQEYELIIK---------KKHRDLDTWYKEQSAAMSQEAASPATVQSRQGDIHE 289
Cdd:PRK03918  247 -----LESLEGSKRKLEEKIRELEERIEELKKEIEeleekvkelKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 290 LKRTFQALEIDLQTQYSTKSALENMLSETQSRYSCKLQDMQ--EIISHYEEELTQLRHEL-----ERQNNEYQVLLGIKT 362
Cdd:PRK03918  322 EINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHElyEEAKAKKEELERLKKRLtgltpEKLEKELEELEKAKE 401
                         250       260
                  ....*....|....*....|....*...
gi 2217311042 363 HLEKEITTYRRLLeGESEGTREESKSSM 390
Cdd:PRK03918  402 EIEEEISKITARI-GELKKEIKELKKAI 428
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
71-380 1.85e-96

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 291.44  E-value: 1.85e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  71 NGKATMQNLNDRLASYLEKVRALEEANMKLESRILKWHQQRDPGSKKDYSQYEENITHLQEQIVDGKMTNAQIILLIDNA 150
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 151 RMAVDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEM-EKHHVPSDFNVNVKV 229
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVrELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 230 DTGPREDLIKVLEDMRQEYELIIKKKHRDLDTWYKEQSAAMSQEAA-SPATVQSRQGDIHELKRTFQALEIDLQTQYSTK 308
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAArNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217311042 309 SALENMLSETQSRYSCKLQDMQEIISHYEEELTQLRHELERQNNEYQVLLGIKTHLEKEITTYRRLLEGESE 380
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEEC 312
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
170-387 2.57e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.51  E-value: 2.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  170 DLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEMEKHHVPSDFNVNVKVDTGPREDLIKVLEDMRQEYE 249
Cdd:TIGR02168  695 ELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAE 774
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042  250 LIIKKKHRDLDTwyKEQSAAMSQEAASpatVQSRQGDihELKRTFQALEIDLQTQYSTKSALENMLSETQSRysckLQDM 329
Cdd:TIGR02168  775 EELAEAEAEIEE--LEAQIEQLKEELK---ALREALD--ELRAELTLLNEEAANLRERLESLERRIAATERR----LEDL 843
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217311042  330 QEIISHYEEELTQLRHELERQ-------NNEYQVLLGIKTHLEKEITTYRRLLEGESEGTREESK 387
Cdd:TIGR02168  844 EEQIEELSEDIESLAAEIEELeelieelESELEALLNERASLEEALALLRSELEELSEELRELES 908
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
115-407 5.36e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 42.40  E-value: 5.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 115 SKKDYS-QYEENITHLQ-EQIVDGKMTNAQIILLIDNARMA--VDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTT 190
Cdd:pfam05483 465 SEEHYLkEVEDLKTELEkEKLKNIELTAHCDKLLLENKELTqeASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEM 544
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 191 DLEQEVEGMRKELILMKKHHEQEMEKhhvPSDFNVNVKVDTGPREDLIKVLEDMRQEYELIIKKKHRDLDTWYKEQSAAM 270
Cdd:pfam05483 545 NLRDELESVREEFIQKGDEVKCKLDK---SEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALK 621
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 271 SQeaaspATVQSRQGDIHELKrtFQALEIDLQtqySTKSALENMLSETQSRYSCKLQDMQEIISHYE------EELTQLR 344
Cdd:pfam05483 622 KK-----GSAENKQLNAYEIK--VNKLELELA---SAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEkakaiaDEAVKLQ 691
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217311042 345 HELE-RQNNEYQVLLGIkthLEKEITTYRRLLEG-ESE----GTREESKSSMKVSATPKIKAITQETIN 407
Cdd:pfam05483 692 KEIDkRCQHKIAEMVAL---MEKHKHQYDKIIEErDSElglyKNKEQEQSSAKAALEIELSNIKAELLS 757
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
141-390 3.88e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 39.66  E-value: 3.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 141 AQIILLIDNARMAVDDFNLKYENEHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQ--EMEKHh 218
Cdd:PRK03918  168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEieELEKE- 246
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 219 vpsdfNVNVKVDTGPREDLIKVLEDMRQEYELIIK---------KKHRDLDTWYKEQSAAMSQEAASPATVQSRQGDIHE 289
Cdd:PRK03918  247 -----LESLEGSKRKLEEKIRELEERIEELKKEIEeleekvkelKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 290 LKRTFQALEIDLQTQYSTKSALENMLSETQSRYSCKLQDMQ--EIISHYEEELTQLRHEL-----ERQNNEYQVLLGIKT 362
Cdd:PRK03918  322 EINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHElyEEAKAKKEELERLKKRLtgltpEKLEKELEELEKAKE 401
                         250       260
                  ....*....|....*....|....*...
gi 2217311042 363 HLEKEITTYRRLLeGESEGTREESKSSM 390
Cdd:PRK03918  402 EIEEEISKITARI-GELKKEIKELKKAI 428
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
168-389 7.93e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 38.34  E-value: 7.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 168 KKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEMEKHHVPSDFNVNVKvdtgpredlikvledmrqe 247
Cdd:pfam07888  54 NRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEELS------------------- 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217311042 248 yeliikkkhrdldtwyKEQSAAMSQEAASPATVQSRQGDIHELKRTFQALEIDLQTQYSTKSALENMLSETQS---RYSC 324
Cdd:pfam07888 115 ----------------EEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAerkQLQA 178
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217311042 325 KLQDMQEIISHYEEELTQLRHELERQNNEYQVLLGIKTHLEKEITTYRRlLEGESEGTREESKSS 389
Cdd:pfam07888 179 KLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHR-KEAENEALLEELRSL 242
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH