Entry information : CtheAPx-CcP01 (CthehyBpox)
Entry ID 10136
Creation 2012-01-09 (Marcel Zamocky)
Last sequence changes 2012-01-09 (Marcel Zamocky)
Sequence status complete
Reviewer Marcel Zamocky
Last annotation changes 2016-09-06 (Marcel Zamocky)
Peroxidase information: CtheAPx-CcP01 (CthehyBpox)
Name (synonym) CtheAPx-CcP01 (CthehyBpox)
Class Hybrid Ascorbate-Cytochrome C peroxidase    [Orthogroup: APx-CcP001]
Taxonomy Eukaryota Fungi Ascomycota Sordariomycetes Chaetomiaceae Chaetomium
Organism Chaetomium thermophilum    [TaxId: 209285 ]
Cellular localisation Secreted
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value CtheAPx-CcP01
start..stop
S start..stop
StheAPx-CcP01 1303 0 1..1023 1..1050
StheAPx-CcP01 326 8e-96 693..1024 594..931
StheAPx-CcP01 287 5e-82 577..908 706..1057
StheAPx-CcP01 202 9e-54 807..1027 594..806
CgAPx-CcP02 1276 0 1..1023 1..1049
CgAPx-CcP02 328 1e-96 693..1024 585..924
CgAPx-CcP02 199 1e-52 807..1027 585..797
CgAPx-CcP02 103 1e-22 585..673 962..1050
PanAPx-CcP01 1207 0 19..1024 24..1036
PanAPx-CcP01 213 4e-57 581..789 837..1038
AalcAPx-CcP 1038 0 20..1023 20..1038
AalcAPx-CcP 297 1e-85 575..905 694..1042
Literature and cross-references CtheAPx-CcP01 (CthehyBpox)
Literature REFERENCE #1 Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., Devos D.P., Arumugam M., Bork P., Hurt E. (2011) Insight into structure and assembly of the nuclear pore complex by utilizing the genome of a eukaryotic thermophile. Cell 146:277-289
REFERENCE #2 Zamocky M., Gasselhuber B., Furtmueller P.G., Obinger C. (2014) Turning Points in the evolution of peroxidase-catalase superfamily: molecular phylogeny of hybrid heme peroxidases. Cell. Mol. Life Sci. 71:4681-4696.
Protein ref. GenBank:   EGS17224.1 UniProtKB:   G0SG85
DNA ref. GenBank:   GL988047.1 (341824..338537)
Protein sequence: CtheAPx-CcP01 (CthehyBpox)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1027 (1007)
PWM (Da):   %s   107928.5 (105897.4)  
PI (pH):   %s   4.26 (4.24) Peptide Signal:   %s   cut: 21 range:21-1027
Sequence
Send to BLAST
Send to Peroxiscan
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MRTETLYAGL LSLAAVAAKA DPTWPSEVDH MEFIVYQLQG FKGSLFNDAI RPCDNEAAGP GRITASEWLR VAFHDMSTHN KYTGVGGLDA SIQFELNNGE NTGPGHRTSL EFYANYLTSR  SSMADLIAAG VYASVRSCGG PVVPLRVGRK DATAAGPLGV PQPQNSVVTF RQQFDRMGFT SSEMIQLVAC GHTLGSVHSA EFPNIVNASV GQIGLDSSNH VYDNKVVTEY LDGTTTNPLV  VGPAVGLNRH SDFKVFNSDG NATISAMADP NAYREICRTV LQKMIEVVPP GVTLTDPVEP YNVKPVDIKL KLNNGASTLQ LTGYIRLRAN GFAMSDVSNV VLTWKDRWGG SNCGSSGCST  TLTLRGVASG LDDNFGFFPI DVAIPAAYGI SSFTLVVNFN DGTSQSYDNN GHSYPVDDAI ILQIPQSCLL QTSGSLTVRA LVRNDIADVP VNLDVEYLTP RTTGTNPVPI LSKETIPMTE  GDCAGLYTFY EATFAIPGGM SYNARVSVSA GEHADTFIKG SDLGGSCATF SGGLACGNVT EPEPATSTSS IASSTSVPVT SASSSTSDTA TASPTPAHKS TIEGYQLVGC WTEGIGARAL  GGAAFAYDGM TLESCMANCT GFDYWGTEYG RECYCGNSLH STSSEAPLED CNMPCSGDAT EYCGAGNRLE LYSTTATRTT TATPTPTGAL AHKPAVGDYV FVGCQTEASG GRALSGAAHA  DDSMTLELCA SLCSGFIYFG TEYGRECYCG NSLNAGSTEA PLSECNMVCA GDQFEYCGAG NRLDLYVLAN APTVTANPTT TAAPSHQPTA SPFAFVGCWT EGTTGRTLSD KTFASGDMTV  ESCAAFCDGY KYFGVEYSSE CYCGNTINPT SSEAPSLNDC NMLCSGNPSQ YCGGPSRLDL YENEDVIAPT TPSSTSTPST PTQPSTVIAP TAQATWSSKG CYTEATGMRA LSEQTLASDN 
LTLEMCAEFC NGYQFFGTEY SRECYCGNVL NTGSVEVSDG DCSMPCAGDT SQLCGAGNRL SVYEVQA 

Retrieve as FASTA  
Remarks Complete sequence from genomic (2 introns). No EST. Strain="DSM 1495/CBS 144.50/IMI 039719". N-terminal peroxidase domain fused to 4 WSC domains (putative carbohydrate binding domains).
DNA
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGAGAACCG AAACACTTTA CGCAGGCCTG CTCTCGCTGG CAGCGGTGGC TGCCAAAGCT GACCCGACAT GGCCTTCCGA AGTTGACCAC ATGGAATTCA TCGTGTACCA GCTCCAGGGC  TTCAAGGGCA GTTTGTTCAA CGATGCTATC AGGCCATGCG ACAACGAAGC GGCCGGTCCT GGCCGTATCA CCGCCAGCGA GTGGCTGCGC GTGGCCTTCC AGTATGTGTG CTCTCCGTCC  TTGTGCTTGT TCATACGCTG GATTTCCTAA CACACCCGTA GTGATATGTC CACACACAAC AAATATACTG GTGTCGGTGG TCTGGACGCG TCGATTCAGT TCGAGCTCAA CAACGGCGAG  AATACGGGGC CTGGCCACCG GACCTCGCTC GAGTTCTATG CCAACTACCT AACCAGCCGC TCCAGCATGG CCGACTTGAT TGCCGCTGGT GTCTATGCGT CTGTCCGGTC GTGTGGCGGC  CCTGTCGTCC CGCTCCGTGT TGGCCGCAAA GACGCTACCG CAGCAGGCCC GCTCGGTGTG CCTCAGCCGC AGAACTCGGT GGTCACCTTC CGCCAGCAGT TCGATCGTAT GGGCTTCACC  AGCAGTGAAA TGATTCAGCT GGTCGCCTGC GGACATACTC TTGGCTCTGT CCACAGCGCC GAGTTCCCCA ACATTGTCAA TGCCTCTGTC GGGCAGATCG GCCTCGACTC GAGCAACCAT  GTCTATGACA ATAAGGTCGT CACGGAGTAC CTAGACGGCA CTACGACCAA TCCTCTGGTG GTTGGCCCGG CTGTTGGCCT CAACCGTCAT TCGGACTTCA AGGTTTTCAA CTCGGATGGT  AATGCGACCA TCAGCGCCAT GGCCGACCCG AATGCCTATC GCGAGATCTG CCGCACTGTC CTTCAGAAGA TGATCGAGGT CGTCCCTCCC GGCGTGACTC TCACCGATCC TGTCGAGCCC  TACAACGTCA AGCCCGTCGA TATCAAGTTG AAGCTGAACA ACGGTGCCAG CACCCTCCAG CTGACCGGCT ATATCCGCCT CAGGGCTAAC GGCTTCGCCA TGAGTGACGT CAGCAATGTC  GTCCTCACCT GGAAGGACCG CTGGGGTGGC AGCAACTGCG GCTCAAGCGG CTGCTCAACT ACTCTCACCC TGCGTGGCGT TGCTAGCGGT CTTGATGACA ACTTCGGCTT CTTCCCTATC  GACGTGGCTA TTCCAGCGGC GTACGGCATC TCGTCCTTCA CCTTGGTTGT CAACTTCAAT GACGGCACGA GCCAGAGCTA CGATAACAAC GGTCACTCGT ACCCCGTCGA TGATGCCATC  ATCCTGCAGA TTCCCCAGAG CTGCTTGCTC CAGACTTCTG GCTCTCTCAC TGTCCGTGCC CTCGTTCGCA ACGACATCGC TGACGTCCCC GTCAATCTCG ACGTCGAGTA CCTGACTCCG  CGTACTACGG GCACGAACCC GGTTCCAATC CTCAGCAAGG AAACCATTCC CATGACTGAG GGTGATTGCG CAGGCCTGTA TACGTTTTAT GAGGCCACCT TCGCTATTCC GGGCGGTATG  AGCTACAACG CCCGCGTCAG CGTGTCTGCC GGCGAGCATG CCGATACGTT CATTAAGGGT AGCGATTTGG GCGGCAGCTG CGCAACTTTC TCCGGGGGCC TTGCTTGCGG CAACGTCACC  GAGCCTGAGC CGGCGACCAG CACCTCCTCG ATTGCTTCGT CCACCAGTGT TCCTGTTACA AGCGCGAGCA GCAGCACTAG CGACACCGCG ACAGCTAGCC CGACTCCTGC TCACAAGTCG  ACCATCGAAG GCTACCAGCT TGTCGGCTGC TGGACCGAGG GCATTGGTGC TCGCGCTCTC GGTGGCGCTG CATTCGCCTA CGATGGCATG ACTCTGGAGT CCTGCATGGC AAACTGCACC  GGCTTTGACT ACTGGGGTAC CGAGTATGGC CGCGAGTGCT ATTGCGGCAA CAGCTTGCAT TCCACTAGCT CAGAGGCGCC GCTTGAGGAT TGCAACATGC CTTGCAGCGG TGACGCGACC  GAGTACTGCG GTGCTGGCAA TCGCCTCGAG CTCTACTCGA CGACTGCCAC TCGGACAACG ACCGCTACCC CGACTCCTAC CGGCGCTCTT GCCCACAAGC CTGCTGTTGG CGACTACGTC  TTCGTCGGTT GCCAGACAGA GGCTTCTGGC GGCCGTGCTC TCTCGGGTGC CGCCCATGCC GACGATTCGA TGACTCTGGA GCTGTGCGCT TCGCTCTGCA GCGGCTTCAT CTACTTCGGT  ACCGAGTACG GCCGTGAGTG CTACTGCGGC AACTCCCTCA ACGCCGGAAG CACCGAGGCT CCGCTGTCTG AGTGCAACAT GGTCTGCGCT GGCGATCAGT TTGAGTACTG CGGTGCCGGC  AACCGTCTTG ATCTCTACGT CTTGGCCAAT GCTCCGACAG GTCAGTCTCC AACTTCATCG GCAGCTTCGA CATCCTCACC AGCAAGCGAC CATGCCACAA CTTCTGGTGA GATTTTATTT  TTTTTATTTC TTTTCGAGCA TTGAAGATTT GAATTGAAGT TTTGACTGAC ACATTTAGGA ACAGTGACCG CCAACCCTAC CACGACTGCC GCTCCTTCTC ACCAGCCGAC GGCTTCTCCT  TTCGCCTTCG TCGGCTGCTG GACCGAGGGC ACGACTGGAC GCACCCTGTC AGACAAGACC TTTGCCTCGG GTGACATGAC TGTTGAGTCG TGCGCCGCTT TTTGTGACGG CTACAAGTAC  TTCGGCGTCG AGTACTCGTC CGAGTGCTAC TGCGGCAACA CGATCAACCC AACTTCCTCG GAAGCGCCCT CCCTCAATGA CTGCAACATG CTCTGCTCCG GCAACCCGTC CCAGTACTGC  GGTGGCCCGA GCCGTCTGGA TCTCTACGAG AACGAGGATG TCATCGCCCC AACCACCCCC TCCTCAACCT CGACGCCGTC AACCCCAACC CAGCCCTCTA CCGTCATCGC ACCCACCGCC  CAGGCCACCT GGTCTAGCAA GGGCTGCTAC ACTGAAGCGA CCGGCATGCG CGCTCTGAGC GAGCAGACCC TTGCCTCAGA CAACCTCACG CTTGAGATGT GCGCCGAGTT CTGCAATGGT  TACCAGTTCT TCGGCACGGA ATACTCTCGT GAGTGTTACT GCGGAAACGT CCTCAATACC GGGTCGGTTG AAGTAAGCGA TGGTGACTGC AGCATGCCGT GCGCGGGAGA CACTTCGCAA 
CTCTGCGGCG CTGGTAACAG ACTGAGCGTC TATGAGGTGC AAGCTTGA 

Retrieve as FASTA