Entry information : EferPrx[P]74(Eurfer_22_g24236)
Entry ID 16818
Creation 2020-12-12 (Christophe Dunand)
Last sequence changes 2020-12-17 (Christophe Dunand)
Sequence status theoretical translation / pseudogene
Reviewer Not yet reviewed
Last annotation changes 2021-05-07 (Christophe Dunand)
Peroxidase information: EferPrx[P]74(Eurfer_22_g24236)
Name EferPrx[P]74(Eurfer_22_g24236)
Class Class III peroxidase     [Orthogroup: Prx079]*
Taxonomy Viridiplantae (green plants); Streptophyta; Angiospermae; Basal Magnoliophyta
Organism Euryale ferox    [TaxId: 4414 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value EferPrx[P]74
start..stop
S start..stop
EferPrx34 615 0 1..325 1..323
EferPrx10 577 0 1..325 1..326
EferPrx13 574 0 1..325 1..326
EferPrx15 501 0 1..324 1..325
Literature and cross-references EferPrx[P]74(Eurfer_22_g24236)
Protein sequence: EferPrx[P]74(Eurfer_22_g24236)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   324 (301)
PWM (Da):   %s   34713.74 (32350.4)  
PI (pH):   %s   6.5 (5.96) Peptide Signal:   %s   cut: 24 range:24-324
Sequence
Send to BLAST
Send to Peroxiscan
*.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MRKTSSFFLA TLALLGLVAG SQAQLKVGFY EPSCP*AECI IKEFVKQHIL NAPTLAAALL RMNFHDCFVR GCDGSLLLNS ASNSTAEKDA IPNSTLRGFD FIDRVKALLE AECPGVVSCA  DTLALVARDA IAVIGGPSWT VPTGHRDGTV SLASEALTDI PSPAFNFTTL KKNFAPKNLD VKDLVVLSGV HTIGIAHCAV VTNRLYNFTG KGDEDPTLDK FYAASLKKFK CKTPTDTTTI 
LEMDPGSFRT FDTHYYTNVV KRRGLFSSDS SLLTDATAAS YVTEILNGPI ENIFxFAASM EKMIEIEVKT GSEGEIRKVC GVVNG 

Retrieve as FASTA  
Remarks Incorrect prediction, splicing error end exon 1 (stop in frame), and missing part last exon (frame shift). Eurfer_22_8220487_8221988_g24236.t1_-1_CDS_2836141052_19239
DNA
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGAGAAAGA CCAGCTCCTT TTTCCTTGCA ACATTGGCTC TTCTGGGCCT CGTGGCTGGC TCCCAAGCTC AGCTTAAGGT CGGCTTCTAT GAACCATCCT GCCCATAAGC AGAGTGCATA  ATCAAGGAGT TCGTGAAGCA GCATATCCTC AATGCGCCTA CTTTGGCAGC AGCATTGCTT AGGATGAACT TCCATGACTG CTTTGTGAGG GTGAGTACTG CGTTGATCTG CAGTTTTTAA  TCTTGCCATC TTCAGATTCA TTTCCTAGCT ATACCTGTTT ATGAGAAAGC TGTGTCTCAA GCCCCTGTTT GTGAGAAAAA AAGGTGTCGA CAACTTCACT GTGAACTTAC ACTTCTCATT  TGCTTACGAG TTCGAATATT GAATTAATTC TTATGGTTTA GTTCATGGGA GCTGCCTTCA CTGGTACTGC TTATATATAT TGAACTGGTT GCCATCTTTC CCTGTGAAAG AACACTAGTA  AAGTTCAACT TTTCTAAATT GCACTGTATT TCATCGATGG TGATGATAGT CTCATTCAAA CTGCGATTTC ACAGGGCTGT GATGGATCCC TGCTGCTTAA TTCGGCCAGT AACAGCACGG  CAGAGAAGGA CGCCATACCA AATTCCACCC TCCGTGGCTT CGATTTCATT GACAGAGTGA AGGCATTGTT AGAAGCAGAG TGCCCGGGGG TCGTCTCCTG TGCAGATACT CTTGCACTAG  TAGCAAGAGA TGCAATAGCT GTCATTGTAT GACTGCCTTA CCACATCTAA AATAACTTTT CTTCTTAGTT GTTAGCATGA ACTCACTGTA CTGCAAATAT CCTGCCACAG GGTGGCCCGT  CATGGACAGT GCCTACAGGC CATAGGGATG GAACTGTTTC ATTAGCTTCT GAGGCGCTCA CCGACATTCC ATCACCAGCC TTCAATTTTA CTACTCTAAA GAAAAATTTT GCACCCAAGA  ATCTTGATGT GAAGGATCTA GTTGTCCTCT CAGGTGGGTC TCCATTTTTA TGCCATTTAG GATCTTTTAA CTAAGTCCAA TATTTATTTA AATGCTTAAT CAGTTCTTTT CTCTGTTTTC  TGCAGGTGTC CATACAATCG GCATTGCTCA CTGTGCGGTT GTCACAAACA GGCTCTATAA TTTCACTGGG AAAGGTGATG AAGACCCAAC CCTTGACAAA TTCTATGCTG CAAGCCTAAA  GAAATTCAAG TGCAAGACCC CCACAGACAC CACCACCATC CTAGAAATGG ATCCGGGCAG TTTCCGGACA TTCGACACGC ATTACTACAC CAATGTGGTA AAGAGAAGAG GCCTCTTCAG  CTCTGACTCT TCTCTGCTCA CAGACGCAAC TGCGGCGTCT TACGTGACTG AAATACTAAA CGGTCCTATT GAAAATATAT TTCACTCAGT TTGCTGCGTC CATGGAGAAG ATGATTGAGA 
TTGAAGTCAA GACTGGCAGT GAAGGTGAGA TAAGGAAGGT CTGCGGCGTG GTGAATGGTT AG 

Retrieve as FASTA