Entry information: UmCP01
Entry ID 2327
Creation 2008-11-14 (Christophe Dunand)
Last sequence changes 2011-05-11 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Last annotation changes 2011-05-11 (Christophe Dunand)
Peroxidase information: UmCP01
Name UmCP01
Class Catalase peroxidase    [Orthogroup: CP001]
Taxonomy Eukaryota Fungi Basidiomycota Ustilaginomycetes Ustilaginaceae Ustilago
Organism Ustilago maydis    [TaxId: 5270]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox       Score  E-value  UmCP01 start..stop  S start..stop
SreCP01     1389   0        1..749              1..745
UhorCP      1271   0        3..749              4..745
CtheCP01    1093   0        1..741              1..722
CthedisCP1  1088   0        1..741              1..722
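The hit table can be summarised by how much of UmCP01 each alignment covers. The following is a minimal Python sketch, assuming the `start..stop` ranges are 1-based and inclusive and using the 749 aa length given under Sequence Properties; the `hits` list and the `query_coverage` helper are illustrative names, not part of the database.

```python
QUERY_LENGTH = 749  # UmCP01 length (aa), taken from the Sequence Properties of this entry

# (hit name, score, UmCP01 start..stop) copied from the Best BLASTp hits table
hits = [
    ("SreCP01", 1389, "1..749"),
    ("UhorCP", 1271, "3..749"),
    ("CtheCP01", 1093, "1..741"),
    ("CthedisCP1", 1088, "1..741"),
]

def query_coverage(span: str, query_length: int) -> float:
    """Fraction of the query covered by a 'start..stop' range (assumed 1-based, inclusive)."""
    start, stop = (int(part) for part in span.split(".."))
    return (stop - start + 1) / query_length

for name, score, span in hits:
    print(f"{name}: score {score}, query coverage {query_coverage(span, QUERY_LENGTH):.1%}")
```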
Gene structure
Exons
Exon   Start..End    Size
No. 1  18660..20909  2248
complement(join(18660..20909))
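The location string above follows the GenBank feature-location style. As a rough illustration, the Python sketch below extracts the strand and the coordinate pairs and computes the span length, assuming 1-based coordinates that are inclusive at both ends; `parse_location` and `span_length` are hypothetical helpers and only handle simple `join`/`complement` strings like this one.

```python
import re

def parse_location(location: str):
    """Return (strand, [(start, end), ...]) for a simple GenBank-style location string."""
    # A leading complement(...) means the feature lies on the reverse strand.
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def span_length(segments) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location("complement(join(18660..20909))")
print(strand, segments, span_length(segments))  # - [(18660, 20909)] 2250
```

Under this inclusive convention the single exon spans 2250 positions, which equals 3 × (749 + 1), i.e. 749 codons plus a stop codon. The Size value listed in the exon table (2248) differs by two, so it may follow a different counting convention.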



Literature and cross-references UmCP01
Literature REFERENCE 1 Birren,B., et al., The genome sequence of Ustilago maydis
REFERENCE 2 Chen,F., Carrick,B., Zeng,K., Gnirke,A., Cheung,L., Chong,A., Goldschmidt,S., Hussain,S., Laufer,A., Oliva,J., Park,C., Wong,M., Amundsen,C., Orton,A., Shao,A., Platt,D. and Swimmer,C. Exelixis Ustilago maydis EST project.
Protein ref. UniProtKB:   Q4P914 [Incorrect splicing]
DNA ref. GenBank:   AACP01000115 (20909..18661) [Incorrect splicing]
mRNA ref. GenBank:   XM_754453
EST ref. GenBank:   EH019647 [Fragment]
Protein sequence: UmCP01
Sequence Properties
First value: protein
Second value: mature protein
Length (aa):   749
PWM (Da):   82572.21
PI (pH):   6.01
Sequence
*.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MGECPFAHQA NVDRKRVPAA GFGTKNSDWW PNAVKLNVLR QHQAKSDPFN AEFDYAAAFN SLDYDALKKD LTHLMTDSQD WWPADYGHYG GFFIRMSWHA AGTYRVQDGR GGGGEGQQRF
APLNSWPDNG NLDKARRLLW PIKQKYGNKI SWADLLLLAG NVALESMGFK TFGFAGGRAD TWEADQSTYW GGETTWLAND VRYEEGTKNG GDINDLKNRN LDHALAASHM GLIYVNPEGP
NGEPDPVAAA HDIRTTFGRM AMNDEETVAL IAGGHTFGKT HGAGNPDLVG PEPNGAPIEA QGFGWTSKHG SGKAGDAITS GLEVVWTSKP TEWSNLYLKY LFEFEWEHDK SPAGANQFVA
KNADAIIPDP FDPSKKRRPT MLTTDLSLRY DPAYEKISRR FLENHDEFAD AFARAWFKLL HRDMGPRARW LGPEVPKEIL IWEDPVPTAD YALVDDRDLA GLKQAIFATG VEPSKFLATA
WASAASYRDS DKRGGANGAR IRLAPMKDWE VNNPQQLAEV IKALEGVQQQ FNSSNQGGKK ISIADLIVLA GNAALEKASG LPVPFTPGRT DATQEQTEVD TFEFLKPVAD GFRNYGQSTD
RVCAEQILID RANLLTLTPP ELTVLIGGLR ALGLNYNGSS HGVLTHRRGQ LSNDFFVNLL DMSTEWKAAD GGKGEVFDGV DRKSGQKKWS ATRADLVFGS QAELRALAEN YAQADNADKF
KKDFVTAWNK VMNLDRFDVK KSNIARARF
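The sequence is displayed in space-separated blocks of ten residues, which most tools will not accept directly. The sketch below shows one way to turn such a display into a FASTA record in Python; the `to_fasta` helper, the header text, and the line width are illustrative assumptions, and the example input is only a short excerpt (the first two blocks of the sequence), not the full protein.

```python
def to_fasta(name: str, formatted_sequence: str, width: int = 60) -> str:
    """Build a FASTA record from a sequence shown in whitespace-separated blocks."""
    # Remove the spaces and line breaks that separate the 10-residue blocks.
    residues = "".join(formatted_sequence.split())
    # Re-wrap the continuous sequence at a fixed line width.
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return "\n".join([f">{name}"] + lines) + "\n"

# Excerpt only: the first two blocks of the UmCP01 protein sequence.
print(to_fasta("UmCP01 (excerpt)", "MGECPFAHQA NVDRKRVPAA", width=10), end="")
```

Pasting the complete block-formatted sequence as the second argument would produce the full record; the same helper works for the DNA sequence.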

Remarks Complete sequence derived from genomic DNA (chromosome 9, no intron) and 18 ESTs. The existing splicing prediction is incorrect because of one missing residue; this was confirmed with the ESTs. Strain="521".
DNA
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGGGAGAGT GTCCATTTGC TCACCAGGCC AACGTCGATC GCAAGCGCGT TCCCGCTGCG GGTTTTGGTA CTAAGAACAG CGATTGGTGG CCTAACGCTG TCAAGCTCAA TGTGCTCCGT
CAGCACCAGG CAAAGTCGGA TCCGTTCAAT GCAGAGTTCG ACTATGCTGC TGCCTTCAAC AGCCTTGATT ACGATGCTCT CAAGAAGGAC CTCACCCATC TGATGACTGA TTCGCAGGAC
TGGTGGCCTG CCGACTATGG TCATTACGGT GGCTTCTTCA TTCGAATGTC GTGGCACGCT GCCGGTACTT ACCGTGTCCA AGACGGTCGC GGCGGCGGAG GAGAGGGTCA GCAGCGATTC
GCTCCCCTCA ATTCGTGGCC AGACAACGGA AACCTGGACA AGGCTCGTCG CCTCTTGTGG CCTATCAAGC AAAAGTACGG CAACAAGATC TCGTGGGCAG ATCTCCTGCT CCTTGCGGGC
AACGTCGCTC TCGAGTCGAT GGGCTTCAAG ACGTTTGGTT TTGCCGGCGG TCGTGCTGAC ACTTGGGAGG CTGACCAGTC CACCTACTGG GGTGGCGAGA CCACTTGGCT CGCAAACGAC
GTCCGTTACG AGGAGGGCAC CAAGAACGGT GGCGACATCA ACGACCTCAA GAACCGCAAT CTCGACCACG CTCTTGCCGC CTCGCACATG GGTCTTATCT ACGTCAACCC CGAGGGACCC
AACGGTGAGC CTGACCCGGT TGCTGCCGCC CACGATATCC GCACCACCTT CGGCCGCATG GCCATGAACG ACGAGGAAAC CGTCGCTCTT ATTGCCGGAG GCCACACCTT TGGCAAGACT
CATGGTGCTG GTAACCCAGA TCTCGTCGGC CCCGAACCCA ACGGCGCTCC CATCGAGGCT CAGGGCTTCG GTTGGACCAG CAAGCATGGT TCTGGTAAAG CTGGCGATGC GATTACCTCG
GGTCTCGAGG TTGTCTGGAC TAGCAAGCCT ACCGAGTGGT CCAACCTCTA CCTCAAGTAC CTCTTTGAGT TCGAGTGGGA GCACGACAAG TCGCCCGCTG GCGCCAACCA GTTTGTCGCC
AAGAATGCCG ACGCCATCAT CCCCGATCCC TTCGACCCAT CCAAGAAGCG TCGTCCTACT ATGCTCACCA CCGATCTATC GTTGCGCTAC GATCCTGCCT ACGAGAAGAT CTCGCGTCGC
TTCCTTGAGA ACCACGACGA GTTTGCCGAC GCCTTTGCCC GTGCCTGGTT CAAACTGCTC CACCGTGACA TGGGTCCTCG CGCCCGCTGG CTTGGACCCG AGGTGCCCAA GGAGATCCTT
ATCTGGGAGG ACCCCGTGCC TACCGCCGAT TACGCTCTCG TGGACGACCG CGACCTTGCC GGCTTGAAGC AGGCTATTTT TGCCACTGGC GTCGAGCCTT CCAAGTTCCT TGCCACCGCC
TGGGCTTCCG CTGCCAGCTA CCGAGACAGT GACAAGCGCG GCGGTGCCAA CGGTGCTCGC ATCCGCCTTG CACCGATGAA GGACTGGGAA GTCAACAATC CTCAGCAGCT CGCTGAGGTC
ATCAAGGCTC TCGAGGGCGT TCAGCAGCAG TTCAACTCTT CCAACCAAGG TGGCAAGAAG ATTTCGATTG CTGACTTGAT CGTTCTCGCC GGTAACGCAG CGCTTGAGAA GGCATCGGGT
CTCCCCGTTC CCTTCACTCC TGGTCGTACT GATGCTACCC AGGAGCAGAC CGAGGTCGAC ACCTTCGAGT TCCTCAAGCC GGTCGCCGAT GGCTTCCGCA ATTACGGCCA GTCCACCGAC
CGTGTTTGCG CTGAACAGAT CCTCATTGAC CGCGCCAACC TGCTCACTCT CACCCCTCCC GAGCTCACTG TCCTCATCGG CGGTCTCCGC GCTCTTGGTC TCAACTACAA CGGCTCGTCA
CACGGTGTCT TGACTCACCG CCGAGGCCAG CTCTCGAACG ACTTCTTTGT CAACCTCCTC GACATGAGCA CCGAGTGGAA GGCTGCTGAC GGTGGCAAGG GCGAGGTCTT CGACGGTGTC
GACCGCAAGT CAGGCCAGAA GAAGTGGTCT GCTACCCGTG CCGATCTTGT CTTTGGCTCT CAGGCTGAGC TTCGTGCCCT CGCCGAGAAC TACGCTCAGG CCGACAACGC CGACAAGTTC
AAGAAGGACT TTGTGACTGC CTGGAACAAG GTTATGAACC TGGATCGTTT TGACGTCAAG AAGAGCAACA TTGCCCGTGC CAGGTTCTAA
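Because the entry reports no intron, the DNA sequence should translate directly into the protein sequence given earlier. The Python sketch below illustrates such a check on short excerpts, assuming the standard nuclear genetic code applies to this gene; `CODON_TABLE` and `translate` are illustrative names written for this example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumption: this
# is the appropriate table for the nuclear gene described in this entry).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(formatted_dna: str) -> str:
    """Translate a block-formatted coding sequence; '*' marks a stop codon."""
    # Strip the spaces and line breaks between the 10-nucleotide blocks.
    dna = "".join(formatted_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First three blocks of the DNA sequence: matches the first protein block.
print(translate("ATGGGAGAGT GTCCATTTGC TCACCAGGCC"))  # MGECPFAHQA
# Last twelve nucleotides: the final residues followed by the stop codon.
print(translate("GC CAGGTTCTAA"))  # ARF*
```

Running `translate` on the full DNA block and comparing the result (minus the trailing `*`) with the protein sequence would confirm that the two records agree.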
