Entry information : AtaPrx23-D
Entry ID 12895
Creation 2014-03-18 (Maxime Neel)
Last sequence changes 2014-04-18 (Maxime Neel)
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2017-11-09 (Christophe Dunand)
Peroxidase information: AtaPrx23-D
Name AtaPrx23-D
Class Class III peroxidase    [Orthogroup: Prx003]
Taxonomy Eukaryota Viridiplantae Streptophyta Monocotyledons Poaceae Aegilops
Organism Aegilops tauschii    [TaxId: 37682 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value AtaPrx23-D
start..stop
S start..stop
TaPrx23 670 0 8..358 4..357
TaPrx23-1D 629 0 2..358 5..361
HvPrx105 611 0 33..347 34..348
BdiPrx120 537 0 27..355 27..355
Literature and cross-references AtaPrx23-D
Literature Jia,J. The Aegilops tauschii draft genome sequence reveals gene repertoire for wheat adaptation
Protein ref. GenBank:   EMT17413.1 [Incorrect prediction]   EMT20646.1 [Incorrect prediction] UniProtKB:   N1R1Z4 [Incorrect prediction]
DNA ref. GenBank:   AOCO010340309.1 (500..3) [5' end]   KD526289.1 (74448..73976) [5' end]   AOCO010415387.1 (7666..6820) [3' end]   KD534824.1 (45013..44410) [3' end]
Protein sequence: AtaPrx23-D
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   358 (330)
PWM (Da):   %s   37619.53 (35027.1) Transmb domain:   %s   i13-35o
PI (pH):   %s   8.01 (7.75) Peptide Signal:   %s   cut: 29 range:29-358
Sequence
Send to BLAST
Send to Peroxiscan
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MASSPSPSAA RALLVLAAAA LVLSSVSAAA GSPPLAKGLS FDFYSAKCPQ AESIVFSFLK DAVRKDVGLA AALLRIHFHD CFVQGCDGSV LLDKTNGVDS EKVSPPNVTL RPSAFKAINA  IRALLQRACG GPVVSCADIA ALAARDSVHL AGGPRYAVPL GRRDGLAPAS LDTILGALPP PTSKVPVLLS FLAKIGPNAA GLVALSGAHT LGIAHCGSFE ERLFPKDDPT MDKFFAGHLK 
LTCPRLKVDN FTANDIRTPD VFDNKFYVDL LNRQGLFTSD QDLHTDAKTK PMVTRFAVDQ AAFFDQFVKS MVKMGQINVL TGNQGQIRTD CSVPNAARSA GDELPWSVVE TAVESLVL 

Retrieve as FASTA  
Remarks Complete sequence from genomic (no intron). Incorrect prediction from NCBI: the first exon (the first 55 aa) is incorrect due to undetermined nt in the sequence together with frame shift between SEKV and SSPPNV .
DNA
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGGCTTCTT CTCCCTCTCC CTCGGCTGCT CGTGCCCTGC TCGTCCTCGC CGCAGCAGCT CTGGTGCTGA GCTCTGTGTC AGCCGCGGCT GGCTCGCCGC CGCTTGCCAA GGGCCTGTCG  TTCGACTTCT ACAGCGCCAA GTGCCCGCAG GCGGAGTCCA TCGTCTTCTC CTTCCTCAAG GACGCCGTCC GCAAGGACGT CGGCCTCGCC GCGGCGCTCC TCCGCATCCA CTTCCACGAC  TGCTTCGTGC AGGGCTGCGA CGGCTCCGTG CTCCTCGACA AGACCAACGG CGTCGACAGC GAGAAGGTGT CTCCTCCGAA CGTCACGCTC CGTCCGTCCG CGTTCAAGGC CATCAACGCC  ATCCGCGCGC TCCTCCAGAG GGCGTGCGGC GGGCCCGTGG TCTCCTGCGC CGACATCGCC GCGCTCGCCG CCCGCGACTC CGTCCACCTG GCCGGCGGGC CGAGGTACGC CGTCCCGCTC  GGCCGCCGTG ACGGGCTCGC CCCCGCGTCC CTGGACACCA TCCTGGGCGC GCTCCCGCCG CCGACCTCCA AGGTCCCCGT GCTCCTCAGC TTCCTCGCCA AGATCGGCCC CAACGCCGCC  GGCCTGGTAG CGCTCTCCGG CGCGCACACG CTCGGGATCG CGCACTGCGG CTCCTTCGAG GAGAGGCTGT TCCCCAAGGA CGACCCCACC ATGGACAAGT TTTTCGCCGG CCACCTCAAG  CTCACCTGCC CGCGGCTCAA GGTGGACAAC TTCACCGCCA ACGACATCCG CACGCCTGAC GTGTTCGACA ACAAGTTCTA CGTCGACCTG CTCAACCGGC AGGGTCTCTT CACCTCCGAC  CAGGACCTGC ACACCGACGC CAAGACCAAG CCCATGGTCA CCAGGTTCGC CGTCGACCAG GCCGCCTTCT TCGACCAGTT CGTCAAGTCC ATGGTGAAAA TGGGGCAGAT CAACGTGCTC 
ACTGGCAACC AGGGTCAGAT CCGCACGGAC TGCTCCGTGC CCAACGCCGC CCGCAGTGCC GGCGACGAGC TGCCGTGGTC CGTCGTCGAG ACCGCCGTGG AGAGCTTGGT GTTGTAG 

Retrieve as FASTA  
CDS
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGGCTTCTT CTCCCTCTCC CTCGGCTGCT CGTGCCCTGC TCGTCCTCGC CGCAGCAGCT CTGGTGCTGA GCTCTGTGTC AGCCGCGGCT GGCTCGCCGC CGCTTGCCAA GGGCCTGTCG  TTCGACTTCT ACAGCGCCAA GTGCCCGCAG GCGGAGTCCA TCGTCTTCTC CTTCCTCAAG GACGCCGTCC GCAAGGACGT CGGCCTCGCC GCGGCGCTCC TCCGCATCCA CTTCCACGAC  TGCTTCGTGC AGGGCTGCGA CGGCTCCGTG CTCCTCGACA AGACCAACGG CGTCGACAGC GAGAAGGTGT CTCCTCCGAA CGTCACGCTC CGTCCGTCCG CGTTCAAGGC CATCAACGCC  ATCCGCGCGC TCCTCCAGAG GGCGTGCGGC GGGCCCGTGG TCTCCTGCGC CGACATCGCC GCGCTCGCCG CCCGCGACTC CGTCCACCTG GCCGGCGGGC CGAGGTACGC CGTCCCGCTC  GGCCGCCGTG ACGGGCTCGC CCCCGCGTCC CTGGACACCA TCCTGGGCGC GCTCCCGCCG CCGACCTCCA AGGTCCCCGT GCTCCTCAGC TTCCTCGCCA AGATCGGCCC CAACGCCGCC  GGCCTGGTAG CGCTCTCCGG CGCGCACACG CTCGGGATCG CGCACTGCGG CTCCTTCGAG GAGAGGCTGT TCCCCAAGGA CGACCCCACC ATGGACAAGT TTTTCGCCGG CCACCTCAAG  CTCACCTGCC CGCGGCTCAA GGTGGACAAC TTCACCGCCA ACGACATCCG CACGCCTGAC GTGTTCGACA ACAAGTTCTA CGTCGACCTG CTCAACCGGC AGGGTCTCTT CACCTCCGAC  CAGGACCTGC ACACCGACGC CAAGACCAAG CCCATGGTCA CCAGGTTCGC CGTCGACCAG GCCGCCTTCT TCGACCAGTT CGTCAAGTCC ATGGTGAAAA TGGGGCAGAT CAACGTGCTC 
ACTGGCAACC AGGGTCAGAT CCGCACGGAC TGCTCCGTGC CCAACGCCGC CCGCAGTGCC GGCGACGAGC TGCCGTGGTC CGTCGTCGAG ACCGCCGTGG AGAGCTTGGT GTTGTAG 

Retrieve as FASTA