Entry information : GtCcP02
Entry ID 13356
Creation 2015-08-04 (Christophe Dunand)
Last sequence changes 2015-08-04 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-10 (Achraf Jemmat)
Peroxidase information: GtCcP02
Name GtCcP02
Class Cytochrome C peroxidase    [Orthogroup: CcP001]
Taxonomy Eukaryota Cryptophyta Geminigeraceae Guillardia
Organism Guillardia theta    [TaxId: 55529 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value GtCcP02
start..stop
S start..stop
PsuCcP02 466 1.19e-167 45..319 1..275
HvireCcP02 443 1.11e-158 59..319 1..261
ChrospCcP03 430 9.51e-154 59..319 1..261
RspCcP02 367 1.42e-129 100..319 1..219
Literature and cross-references GtCcP02
Literature Zauner,S. et al., EST project for full length transcripts in Guillardia theta. Unpublished (2013)
DNA ref. JGI genome:   scaffold_58 (243776..241776)
EST ref. GenBank:   HE991742.1 [5' end]
Cluster/Prediction ref. JGI gene:   164338 [Incorrect splicing]
Protein sequence: GtCcP02
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   330 (310)
PWM (Da):   %s   36699.5 (34097.9) Transmb domain:   %s   i32-54o
PI (pH):   %s   8.79 (8.88) Peptide Signal:   %s   cut: 23 range:23-332
Sequence
Send to BLAST
Send to Peroxiscan
*.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MLRSAGKQAA VIIRRQICSH TRNFGTSGFK RATSSLVVAS AFGVGFSAFT AYAYSEKPDY SKVRDAVKAI LDEDDYDDGS IGPILVRLAW HASGTYDAKT KTGGSDGATM RFTPEAAFGA  NAGLAEARKR LEPIKAQFPG LTYADLWILA SIVAIEEMGG PKIPFRPGRR DQISGEWCPP DGRLPDADKG TKPATIGHVR DIFYRMGFND QEIVALFGAH ALGRCHTDRS GYTGPWTRAP 
TTFSNEYYRL LLESKWVPKS WKGPKQFENE DGKDLMMLPT DLALIEDFHF RKWVEIYAKD EKRFFADFAK AYQKLTELGC NNLEGGKGWF * 

Retrieve as FASTA  
Remarks Complete sequence from genomic (15 introns) and 1 EST. Incorrect predcition from JGI (incorrect splicing border and minssing intron).
DNA
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGCTCAGGA GCGCTGGAAA GCAAGCAGGT TGAAGAGAGG AGAGGATGAA GAGAGGAGAG ACGAAGAGGG GGGAGACATA TGAGAGGAGA GACGAAGAGA GGGGGAGGCG GATGAGACGG  GAGGGGAGAT TGAGAAGAGA AAGCAAGGAC AAGAGAAGGA GATGAGAGTA ATAAGGGAGA AGGAAGGGGA AAAAAACGAA AATTATGGCG ATGACACACT TTGATGTCTC CTCTTTCTGC  AGCTGTCATC ATCCGCAGAC AGATCTGCAG TCATACCAGA AACTTTGGGA CTTCCGGTAG AACAAAGGTT GCCTTCTCCA GGAGTTGCAC TGACGTCTGG CCCATTCAGG GTTCAAGAGG  GCTACGAGCA GTCTGGTTGT AGCATCGGCC TTTGGCGTAG GCTTCTCGGC ATTCACAGCC TATGCATACA GTGAGAAACC TGATTACAGC AAGGTTTATC AAGTCTGCTC CGTGAAAAAG  CTCATATTCC TTTCAGGTCA GAGATGCAGT AAAGGCTATC TTGGACGAGG ATGACTACGA TGACGGAAGC ATCGGGCCTA TCCTGTGACT TGCTGCCTCG TTGCAACTCG GCGTCACCTC  TTGTAGTGTC AGGCTGGCGT GGCATGCCTC TGGGACTTAC GACGCCAAGA CAAAGACTGG TGCAAGAGAA AGCATGTGAT GGTGGTGAGT GAAAGCTTTG CAGGCGGAAG CGACGGGGCC  ACCATGCGCT TCACGCCAGA AGCAGGCAGA CAGTCACAAG TCGCTCTGTC ATCCTTGACC CTCCTCTCCA TGCAGCCTTC GGAGCCAACG CTGGGCTGGC TGAGGCTCGA AAGCGATTGG  AGCCCATCAA GGTGATTGCA AACTGCCTTG ATCGCAAAAT TCTGAGATTC TGATGGAGGC TCAATTCCCG GGACTGACGT ATGCTGATCT CTGGATCCTC GCATCGGTAC GTCTGTCTTT  GAAATATTTA CCTCGATCTT TGACCTTGCC AATAGATTGT CGCCATCGAA GAGATGGGTG GACCTAAAGT GATTACTTCT GCTCGCTTCT TGTGCTCTAA CATCGCATTC ATCAGATTCC  ATTCCGTCCT GGTCGTCGTG ACCAAATCAG TGGAGAGGTA AGAGACATCT CCGTGCACAG CTCGTGAGCT CAGGCCGTTG GTCCCAGTGG TGTCCACCCG ATGGAAGACT TCCTGACGCT  GACAAGGTAT GCCCGCGCCT CAAATCTTGA CGTGACTGGC TCACCTGGCG CAAGGGAACC AAACCTGCGA CCATCGGACA CGTGCGGTAC GTCGCTGTGA GTCTGACGGT AGCAAGAGTG  AGTGGAGGGC GGCACAGGGA CATCTTCTAC CGCATGGGCT TCAACGATCA AGAGATCGTC GCGCTCTTCG GAGCTCATGC CCTCGGGCGC TGTCACACCG ACAGGTCGGG ATACACCGGT  CCATGGACAA GGGCCCCAAC CACCTTCTCC AACGAGTACT ACCGTGAGAG AACCAGTGAC TCGGCTCGAT GACGCTGATC GTTTGATCGC AGGTCTACTT CTTGAGTCTA AGGTGACGGG  ACGTTAGCTC AGTTCTCTTC GCCTGACTGG ATTGCAGTGG GTCCCCAAGT CTTGGAAAGG ACCAAAACAA TTCGAAAACG AGGACGGAAA GGATCTGATG ATGCTTCCCA CCGACCTTGC  GCTGGTTGAG ACACGCATGA CCCCCTCGTT CCTCTCCTTG CAGCATGTCC TCTCTGCCTT GTCTTTTGCT GTTTTCCTCG CCCTTATCGA GGCTTCAGAT CGAAGACTTC CATTTCCGCA  AGTGGGTTGA AATCTACGCG AAAGATGAGA AGCGGTTTTT TGCCGACTTC GCCAAGGTTT GAGCTGTCAG CCGAGGGGGC GAATAGCTGA CATGAGGAGC AGGCGTATCA GAAGCTGACG 
GAGCTGGGTT GCAACAATCT GGAAGGAGGA AAGGGATGGT TCTGA 

Retrieve as FASTA  
CDS
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGCTCAGGA GCGCTGGAAA GCAAGCAGCT GTCATCATCC GCAGACAGAT CTGCAGTCAT ACCAGAAACT TTGGGACTTC CGGGTTCAAG AGGGCTACGA GCAGTCTGGT TGTAGCATCG  GCCTTTGGCG TAGGCTTCTC GGCATTCACA GCCTATGCAT ACAGTGAGAA ACCTGATTAC AGCAAGGTCA GAGATGCAGT AAAGGCTATC TTGGACGAGG ATGACTACGA TGACGGAAGC  ATCGGGCCTA TCCTTGTCAG GCTGGCGTGG CATGCCTCTG GGACTTACGA CGCCAAGACA AAGACTGGCG GAAGCGACGG GGCCACCATG CGCTTCACGC CAGAAGCAGC CTTCGGAGCC  AACGCTGGGC TGGCTGAGGC TCGAAAGCGA TTGGAGCCCA TCAAGGCTCA ATTCCCGGGA CTGACGTATG CTGATCTCTG GATCCTCGCA TCGATTGTCG CCATCGAAGA GATGGGTGGA  CCTAAAATTC CATTCCGTCC TGGTCGTCGT GACCAAATCA GTGGAGAGTG GTGTCCACCC GATGGAAGAC TTCCTGACGC TGACAAGGGA ACCAAACCTG CGACCATCGG ACACGTGCGG  GACATCTTCT ACCGCATGGG CTTCAACGAT CAAGAGATCG TCGCGCTCTT CGGAGCTCAT GCCCTCGGGC GCTGTCACAC CGACAGGTCG GGATACACCG GTCCATGGAC AAGGGCCCCA  ACCACCTTCT CCAACGAGTA CTACCGTCTA CTTCTTGAGT CTAAGTGGGT CCCCAAGTCT TGGAAAGGAC CAAAACAATT CGAAAACGAG GACGGAAAGG ATCTGATGAT GCTTCCCACC  GACCTTGCGC TGATCGAAGA CTTCCATTTC CGCAAGTGGG TTGAAATCTA CGCGAAAGAT GAGAAGCGGT TTTTTGCCGA CTTCGCCAAG GCGTATCAGA AGCTGACGGA GCTGGGTTGC 
AACAATCTGG AAGGAGGAAA GGGATGGTTC TGA 

Retrieve as FASTA