Entry information : PtroEPO
Entry ID 3367
Creation 2008-08-13 (Christophe Dunand)
Last sequence changes 2010-12-21 (Myriam Duval)
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2010-12-21 (Myriam Duval)
Peroxidase information: PtroEPO
Name PtroEPO
Class Eosinophil peroxidase    [Orthogroup: EPO001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism Pan troglodytes (chimpanzee)    [TaxId: 9598 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value PtroEPO
start..stop
S start..stop
GgoEPO 1457 0 1..715 1..715
HsEPO 1456 0 1..715 1..715
MmulEPO 1418 0 1..714 1..714
CjaEPO 1385 0 1..715 1..715
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '3367' 'join(57363864..57363939,57364055..57364148,57364388..57364563,57364731..57364848,57364980..57365109,57365971..57366177,57367958..57368276,57370052..57370212,57370552..57370807,57371238..57371408,57374064..57374301,57374878..57375079)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 57363864..57363939 74 N° 2 57364055..57364148 92 N° 3 57364388..57364563 174 N° 4 57364731..57364848 116
N° 5 57364980..57365109 128 N° 6 57365971..57366177 205 N° 7 57367958..57368276 317 N° 8 57370052..57370212 159
N° 9 57370552..57370807 254 N° 10 57371238..57371408 169 N° 11 57374064..57374301 236 N° 12 57374878..57375079 200
join(57363864..57363939,57364055..57364148,57364388..57364563,57364731..57364848 ,57364980..57365109,57365971..57366177,57367958..57368276,57370052..57370212,573 70552..57370807,57371238..57371408,57374064..57374301,57374878..57375079)


exon

Literature and cross-references PtroEPO
Literature REFERENCE 1 Chimpanzee Sequencing and Analysis Consortium Initial sequence of the chimpanzee genome and comparison with the human genome Nature 437 (7055), 69-87 (2005).
REFERENCE 2 Zamocky M. et al. (2007) Phylogenetic relationship in the peroxidase-cyclooxygenase superfamily (in preparation).
DNA ref. GenBank:   NC_006484.2 (57363864..57375079)
mRNA ref. GenBank:   XM_523809
Protein sequence: PtroEPO
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   715 (693)
PWM (Da):   %s   80890.13 (78677.9) Transmb domain:   %s   i5-22o
PI (pH):   %s   10.81 (10.90) Peptide Signal:   %s   cut: 23 range:23-715
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MHLLPALAGVLATLILAQPCEGTDPASPGAVETSVLRDCIAEAKLLVDAAYNWTQKSIKQRLRSGSASPMDLLSYFKQPVAATRTVVRAADYMHVALGLLEEKLQPQRSGPFNVDVLTEPQLRLLSQASGCVLRDQAERCSDKYR
TITGRCNN
KRRPLLGASNQALARWLPAEYEDGLSLPFGWTPSRRRNGFLLPVRAVSNQIVRFPNERLTSDRGRALMFMQWGQFIDHDLDFSPESPARVAFTAGVDCERTCAQLPPCFPIKIPPNDPRIKNQRDCIPFFRSAPSCPQNKNRVRNQINALTSFVDASMVYGSEVSLSLRLRNRTNYLGLLAINQRFQDNGRALLPFDNLHDDPCLLTNRSARIPCFLAGDTRSTETPKLAAM
HTLFMREHNRLATELRRLNPRWNGDKLYNEARKIMGAMV
IITYRDFLPLVLGKARARRTLGPYRGYCSNVDPRVANVFTLAFRFGHTMLQPFMFRLDSQYRASAPNSHVPLSSAFFASWR
IVYE
GGIDPILRGLMATPAKLNRQDAMLVDELRDRLFRQVRRIGLDLAALNMQRSRDHGLPGYNAWRRFCGLSQPRNLAQLSRVLKNQDLARKFLNLYGTPDNIDIWIGAIAEPLLPGAR
VGPLLACLFENQFRRARDGD
RFWWQKRGVFTKRQRKALSRISLSRIICDNTGITTVSRDIFRANIYPRGFVNCSRIPRLNLSAWRGT

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 17, 11 introns) and mRNA. No EST available.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCATCTGCTCCCAGCCCTGGCAGGGGTCCTGGCCACACTCATCCTTGCCCAGCCCTGTGAGGGCACTGACCCAGAGGTAATAGCCCCCTAGACAGGCAAGGAGGAGGGAGGGGAAATG
GAAGGGGAAGCACTTGGGTCTTGGAGGGGGTCTTGTGGCTTGCTGAACCCTGAGTCCCCATCTCTTTGAACAGCCTCCCCTGGGGCAGTGGAGACCTCGGTCCTGCGAGACTGCATAGCA
GAGGCCAAGTTGCTGGTGGATGCTGCCTACAATTGGACCCAGAAGAG
AGGTGGACTTGGGTCTGGGGGCTGCATGGGCCTGGGAGGATCAGTGAGGGGCATGGGCCCCAGCCAAGTTCAT
CTCACTCATCACCCACCTCTGGACCCCATGAACATTTCCTGTTGGTAGAGCCTCCCTTCCATCCATCCTTCTGTCCGCTTGCCCTGTCCTGACTGTGCCCCAGGACTGGGTCTCTGCTGG
GTGGGTCTGCACCCTCTCTCCAGCCCTCACTCCTCCTCTCCTGGGCAGCATCAAGCAGCGGCTTCGCAGCGGTTCAGCCAGCCCCATGGACCTCCTGTCCTACTTCAAACAACCGGTAGC
AGCCACCAGGACAGTTGTTCGGGCCGCAGATTATATGCATGTGGCCTTGGGGCTGCTTGAAGAGAAGTTACAACCCCAGCGGTCCGGACCCTTCAATGTCACTG
TGGTACTCTGATCCCC
ACTGAGCCCGCTGGGCCTGCCCTGGCCTGGAGTAGAAGGAATCCAGGAGAGGGAGGCAGGGTGCACAGGCTTGGGGTGCTGGGAGAAGAGAGGGTAAAGGGATGGGAGGTACAGAGCAGG
CCAGCTCAGGTCTGCCCACTTGCCTTCCCACAGATGTGCTAACAGAACCACAGCTGCGGCTGCTGTCCCAGGCCAGTGGCTGTGTTCTCCGGGACCAGGCCGAGCGCTGCAGCGACAAGT
ACCGCACCATCACTGGACGGTGCAACAACAA
AAGTGCGTGCGGGCCGGCAGGAGGGGCTGCCCCTGCCTGGGGGACCTCTCCCTTCCTGCACCCACCCTCTCCCTCCATGCTGAGCCATC
TCCAGGCCCTGCCCCCTGCTAACCTATCCCACCCATGGCTGCAGGAGGAGACCCTTGCTAGGGGCCTCCAACCAGGCTCTGGCTCGCTGGCTGCCCGCCGAGTATGAGGATGGGCTGTCG
CTCCCCTTCGGCTGGACCCCCAGCAGGAGGCGCAATGGCTTCCTTCTCCCTCTT
TTGTGAGTTGCGGCTGGGGGCTTGGGAGGTTGCTTGATCTCTTAAATGCGGGGAGTAAAACACAGC
CCAGAGTCACCCAGGCAGGGCTTGAACCCACTGACCAGTGCAGTGACTTGTGGCAAATAACACAGCTTTCTGAGGCTTAGTTTCCTCATCTGTAAAATGGCCCTAAAACCTACCTCGTAG
AGCCTGTGTGGATATTGAAGTCCTTGGTATAAATAGCTATAGAGGACATGATCTATTCATGCTAACCATTACGGTGTTGTAGGGTTTTTTCTGAGAAAATTAGTGTCAAAGTCTTGCTAA
ATATAAACAGCTCTATACCTTAATCTGGCTTTATCTATTGATCTATTTCACATTTATCAACTCATCAGTTTTCTATTGATATATCGATAGAGCAGTCTATCATCTACCCATCTATCAACA
TCTATTAATTAATTTTCTTCTATCCACCCTTCCATCTGCTGACCCCGTACTCCCTACGTGTATTATCCATCACCTGGCAACCAGCCAATGCACCTGTGAATCCGTGCTATCATCTGGTTA
GCTATCAGTCTACAAATTGATCTATCTATTTGATCTCCCTTCCCCTCTGGGAGAGCTGGTGAGGTCTGAGCCAGTCAACCTAGCCCCTCTCCTTCTCTTTACCACCGGAATCCTCAGGAG
CCCAGCCAGAAACCATCCTTCTAGGAATGAGAGCAGGAGGTGGCTACGCCTCCAGGGACAAAAGGGGCATGGAGGGCAGAAGAGGAGAGGCTGTCAATTCCAGCAGGGGAGCTGCTGCTC
CCTGAGTCCTGGGTTGGCTCTAATACCTTGTGGGGTCAGGGAGCCCATGTCCCGTGCTGATGTTATTTCCCCACCAGGTCCGGGCTGTCTCCAACCAGATTGTGCGCTTCCCCAATGAGA
GGCTGACCTCCGACCGTGGCCGGGCCCTCATGTTCATGCAGTGGGGCCAGTTCATTGACCATGACCTGGACTTCTCCCCGGAGTCCCCGGCCAGAGTGGCCTTCACTGCAGGCGTTGACT
GTGAGAGGACCTGCGCCCAGCTGCCCCCCTGCTTTCCCATCAAG
AGGTACCTACCCTCAGCCAATCTCCCATGCCCTTGTGTGGCCTCCCCCAAAGGCAAGGTGCTGGGGGTGGGGATCT
GGAAGACTGGAGCACCATCCTTAAGGAGCTGCCTGTGGAGCTAGGGTATGAGACAGAGACACAAGAAACACAGCTGAGCAGAGACCCCGCGCCGTGTGTGTTTGAGAGGTGGGGGTAGGG
CAATCTGCCAGGAGGCTCAGGTCAGGCTTCATGGGGTGGGAAGCCCTTGACACATGCCTTGACCCATGGGTCTGAATCCAACAGGGGAAGCCTCTGGGCCCCTGCTTTTGGCAACCTAAG
GGCCTCTTAGCTCTTGCCCTTCTCTCCTTCCCAGAGTAGTCCAAGGGTCCTGCGGCTCTTGCCAGCTTCTTGGGCTTGGGCTGTAAGGGTCCTAACCTTCAGGCTAGAAGCCAAGGACAG
TGTGGGGCACTAGGGAAGAAAAGATAGAATCCAGGGAGCAGAGTCTCTGCTAGGAACATAGGGTTGACACACGTGCACACACACACTCTCTCACACACACACGTGCACACACACACTCTC
TCCCTCTCACACACACACGTACACACACAGTCACCCTTAATAGGTGGGGCCACATTCATGAGGGATGACATGTGAGGGCCTCTGGATGGCCTCTCCCCGTTGCAACCTTATCTATTCCTC
AATCCCTGGCTTAGGGGCACCTCCTCTGGGGAGCCTTCTCAGATCTTCCTATTACACTTTTAACATATGTCCTCCCCTCCCCCTCCTCTCTCTCCTTCCTTCCTTCCTCTTTCTTTCTTT
CTTCTTTCTTTCCTTTCCTTCCTTCCCTTCCTTCCTTCCTTCCCTTCCTTCCTTCATTTCCTTCCTTCCTTCCTTTCTTTCTTTCTTTTTCTTTCTTTCTTTTTCTTTCTTCCGTCCTTT
CTTTCTTTATCTTTCCTTCTTTCTTTCTTCTTTATTATTATTATTATTGTTTTGAGACAGAATCTCACTCTGTTGTCCAGACTGGGGTACAGAGGGGCAATCTCAGCTCACTGCAACCTC
TGCCTCCTGGGTTCAAGCAATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGTGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATTGGGTTTCACCATGTT
GACAAGGCTGATCTCAAACTTCTGGCCTCATGTAATCCACCTGCCTCGGCCTCCCAAAGTGCTAGGATTATAGGCGTGAGCCGCCACACCTGGCCCACATATGGCTTTTCTATCAAGCAC
AGCAGATGCCACTTCAACTTAGTTTCTTAACTCTATGTGTGTGTGTCTTTTTCCACTGAACTGCCAGCTTCCTGCAACCTTTCCTGTGTTTTTTTAACCTTCATAGTCTCCAAAGCAGCA
AGCAAGAAAGGGGCTTAGAAAAGGTCTGTCGAATTGAACAGAATTAAAGAAGCTCAGGCCTAAGAGTCAGGAGAACTAAGTTCTAATCCCACCTTTCCTGCTAACTGGTTGTCACTTCCC
CTCTCTGAGGCTGTTTTCCTTGCTGTAAAATGAGGATCGAGATTGTTTCAAAAGCCTCATTCATTCCTAACCTTTTGAGACTCTATGAAACAAAATGTTAAGGACACTCAGACTGAAGGG
GTGAGCAGCATGAGCCTGGGTGAGTCAAGGAGGGCTTCCTGGAGGAAGGGAGTTTTAAGCAAGGTTTTGAGGAGGTGGAAGATGACGGAGAGGAGTGAGTCTGCTATTGAGGGGGCCCCA
TGTCACTGGCTCCTCTTCCATCTCAGATCCCACCCAATGACCCCCGCATCAAGAACCAGCGTGACTGCATCCCTTTCTTCCGCTCGGCACCCTCATGCCCCCAAAACAAGAACAGAGTCC
GCAACCAGATCAACGCGCTCACCTCCTTTGTGGACGCCAGCATGGTGTATGGCAGTGAGGTCTCCCTCTCGCTGCGGCTCCGCAACCGGACCAACTACCTGGGGCTGCTGGCCATCAACC
AGCGCTTTCAAGACAACGGCCGGGCCCTGCTGCCCTTCGACAACCTGCACGATGACCCCTGTCTCCTCACCAACCGCTCGGCACGCATCCCCTGCTTCCTGGCAG
AGGTCAGACAGGGAG
GAAGGTGGTGTCTTCCCAGGAAACAGCCATCCCTGGGGTCCCAACTGGGAAGCCATGGTGGGATGTGGTGAAGGTACATGGTTTGGGACCTCAGTATTAGGCACACCATAAGCATGGATC
TGTGCACAGCCATCATAGAATCAGAACGTTGGAGTCCCTTTGCAAGCTCCCTGTGTGTGACCAGAGATAACTGCTTAACCGTCTCCAGTGACAAGGCTCTCACTGCCTCTATGGCACCCC
AGCTCAGCCACAGGCAGCTCCGACTGTTAGAAAGTCTCCTTATTTTGATCACATATTCACCTATTCCCCTTAATCATGCTTCTGGAACCTCAGAAAAGACACTGTCTCTCTTTCACGCAG
GAATACCTCAAAGTAGCTGTTAACTCCATCTCCATTTTTTTTCTCTTCTCTGGGATAAATTTCCGTAGTTCCTCCCTGTCCCCGCTGTTGCTAGACAGTTAATACCTTCTTCTGGAAGCA
TAATTTGAACTTTCCTCAGGGTGTTTTGTCCCATGTACATGGTTACAATAATAATGGTAGTCATTTCTCATTAAGATATTTATCTTATTATTATTTTAAGATGGAGTCTCACTCTGTTGC
CCAGGCTGGAGTGCAGTGGCGTGATCTCAGCTCACTGCCACCTCTGCCTCCTGGGTTCAAGTGATTCCCCTGCCTCAGCCTCCCGAGTAGCTGGAATTACAGGTGCCTGCCACCACGTTT
GGCTAATTTTTGTATATTTAGTGGAGATAGAGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGATCTCAGGTGATTCACCTGCCTTGGCCTCCCAAAGTGTTGGGATTACAGGTG
TGAGCCACCACGCCCAACCTTATCAAGATATTTAAATGTGCCATCTCTAAACCTTGCAACAGCCCTGAAAGGAGGGAATTATTATTGTTCTCATTTTACGATAAGAAACTGAGACTCAGG
GAAAGTGAATCATGCGCCCAAGACCACAGAGTTAGTTTCAAATGGCAGAGTCCACTTGAGAACACAGTCAGTGTAGTATCAAGGTCTCTGCCCAGTCTACTGTATTCCACTGTTTTCCTC
ACAGAGAGTCGGCCCCAACCCCTCTTTTGAAGGATTGTACATTTGTGTGTCCTGGAAAGAGCAGCATGTGCCTTCGGGAGAGGTCAACTGAACTAAGACCTTGTGCAAAAGGGTTTAGTT
AAAACCAAGCAAATGACGGAATTAACTTTCTTTCCCTGCCTGTAGTAGGTCTGTTCGTAAGAACAGTCTCTGCCCAACCCTCCCTCTCTTCCTCTTTCCCTCTGGTTCACGGGGGAGAGC
AGTATTATACCAAGCCCAAAGATAGGAAGAGGTACACCAGCCCTTGGCAGAGAACACGCAGGCAGATGCAGCTCATGGACAGGTGGGCTCTCAGTCCCAAAGTCAAGGTGCCTTGCCCAT
AGCTGGTCCAGGCTATGGATCTGGTTGTCTTGTAGAGTAGAATCCTTAGGCAAAGATGCCCCTGCCTGAGACCAGCAGCTGCCCTGTGCTTTCCCACCTCACCCAAGCCTGCTAGGAACC
CCCGGTCCTCCCGCTCCTGGCCTGTGGTGTTAGGGAGGAGGGTGGCGATGAGGAGCCTCAGCACAGCTGAAGAGATGGAGGTCCAGTGAGGGCCAGGAGTTTGGCCCACCCCGTCTCTCC
CGTCCCCAGCCCTGGGTCTACCCTGGTAGAAAGACATTTCTCTGGGAAAGACTGCAGTAAATCTGAGCTTGGGGTTTTCAAGGTGACACCCGATCAACGGAAACCCCCAAACTGGCAGCC
ATGCACACCCTCTTTATGCGAGAGCACAACCGGCTGGCCACCGAGCTGAGACGCCTGAATCCCCGGTGGAATGGAGACAAACTGTACAATGAGGCTCGGAAGATCATGGGGGCCATGGTC
CAG
AGGTAAGGAGCTCTGCATCCCAGCATCCCCCAGATGACAAGCTTGGCATGAGAAGCAGTCCTTAACACATCCTTGCGGATGTGCCTAAAACCAGCTGGGTCTGGGCAGCTGGCGGAG
CACCTGGACCTGTCCTCTTGCCCCAATTCTGCCTCTAACTCCCTGTGTGACCCTGTCTGTGTCACTCACCCTCTCTGGGCTTTGTATCTCCACCCACCAATAGGAAATTAATGTTGTCAC
ATTTGACGTGATGACAATAAAGAATATGTCTGAGCCACCCTTTGAAAAGGCAAGGGTATGGGTGAGTAGCCTCTGGGGAATGTTCCTCCTGTCTTCCCTTCCAGATCATCACCTACCGAG
ACTTTCTGCCCCTGGTTCTGGGCAAGGCCCGGGCCAGGAGAACCCTGGGGCCCTACAGGGGGTACTGCTCCAATGTGGACCCACGGGTGGCCAATGTCTTCACCCTGGCTTTCCGCTTTG
GCCACACAATGCTCCAGCCCTTCATGTTCCGCTTGGACAGTCAGTACCGGGCCTCTGCACCCAACTCGCATGTCCCACTTAGCTCTGCCTTCTTTGCCAGCTGGCGGATCGTGTATGAAG
AGGTGACCAGGTTTTCCAGGGGGCAAATGGGGGTGAGGGTGGGGAGCATGCCTTCCCCTAGGTGGGCCAAGCTTACTGCCAGGAAGCCAGGCTGCTGCAGAGGCCACTGCTAATATCTCC
CCAGGACAGTGGAAACAAGGCAGGTGCAAGCAAGACCCTCAGTCACGGGCCTCCCATCCTCTGTGGAATGAGAGGATTTTTTAAAGGGGTGGAGACTAATGTCAGATTGATGGGGAGCTC
ACCTTCCATTCCTTAAGAAGTACCTCCCAGCTCCAGCTGCTTCATGTCTCTCCAGAACTCTGTTTCCTGACAAACGTTACTAACATACCCGACTGGCTTGTCCAGCTCTGGGCTAGCTTG
GCATCATGTGATAACCCAAGTAGCTTCCCAGAGGCTGGTCCAATCTGTGCTGCTCACATTCCCTGCCACCAGGGGGCATCGACCCCATCCTCCGGGGCCTCATGGCCACCCCTGCCAAGC
TGAACCGTCAGGATGCCATGTTAGTGGATGAGCTCCGGGACCGGCTGTTTCGGCAAGTGAGGAGGATTGGGCTGGACCTGGCAGCTCTCAACATGCAACGAAGCCGGGACCACGGCCTTC
CAG
AGGTGAGGGGGCTGTCCACCTCTTCTCCCAGCTTTGCTCAGGCCAGGCTGCTCAAGGGGTTCTGGGAAGACCCTGGTACCTCCTTTCTGACTGGGACTGTCTACAGGATGTGCAGGA
GTGCAGAGGCATGCAAGGCCAAGGTCGATGTCCCAAAGCACTCCTGGAACACCGCTTGCTGTCCTGCCATGGCTCCCCCATCCACTGTAGGGGGTTCCAGCTTTAGGTTCACCACAGGAC
CAGGACTTAGGCTCAGGAGCTGAAAAAGGGAGGCTCGAACTTTCCAAGTAGAAAAAAAGCAATAGGCTTAAAAGGCAGGGAAGAACTGCTCCATTCCCAACACGCTGGCAAGTTCCAAGG
AGGAAAGCTAAAATTCCAGAGCTACAGAAACGCTCCTTGCTCTAAGTGGAGAAAGGCAAAGAGCCTTAGAGGCAGGGAATAGGTGGAATAAAACTAAGGCTGCAGTGACTGTTTGGTGAG
GGCACAAATGACTTTATCCCAGGAAAGAAGAAAATGTAGGCACTTTGTTAGTCACTTTCATGTATATTAATTTATTTTGTCCTCCCAATCACCTTGAGAAGCAAATGATTACAGCCATTT
TATAGATGAGAAACTATGTTCAGAGACAGCTAGTAATTTGCCCAAGATCACATAAATAAGAAGTAGTAAAATCTGAATTTAAATTTACAATTTCTCCTTTCTTCCCCCTGTTCAACCCCA
GTAGGGGAAGGTTTTGGCCACTGCCTTGCTCCTCTCTCCTTCCCACCTCCTACCTCCCTGTCTAGAAAGAACCTGAAGGCCCATCTGCCCATGTATGAAGCAGTGGTGGCACCTCCTACT
GCCCACCCCACTTCACACAGCGCCTGGTAGAGCACAGCAGGCACTCTGGAAACCTGAGCTTCTTTCCTTTCTTAAACTTGGGCGCTCCTGGCAGCTAAGGAGCTGTGGGCACCCCCTGCT
GGAGAGCTGAGATCGCTAGCCACCCAAGTTTCTATTAGGTTTTGAAGGGAAGGCAAGGGTTAAGGAAAGACACAGAGAGGGCGGCTCAGCAGCCAATGCAGGTATTTATTTCCAGCATAA
AACCTACAGAGGTGGGGGACCAGCTTATTGCCAGAGCCTGCTGCCGCCTACAGCTGGGGTACTGGGTATGGGTATGGGCAGGAGGGGTCTGGGTTGTATGGCTTGCTGCCCAGCAGGATA
TTGATAAGATGTACCTATGATCAGGTCCTTTTTTCAGTGGGATGTGGTAAGATGTTCCTTGGACCTTTGCCCAGCAGGATATGATAGGGATGTTCCTTCAGTTGGGCCTTTGCCCATTAG
GGTATGTTATGTTTTTCATGGCCCGACCTCCTGTGGAACGTTTCACTTTGACCAAAGTCTGGGAAATGGTGGGGGGCTTACAAAATGGTGCAGTTTGGACTAACATTCTTGCCTTCTACT
TTGGTATAAAAGGAAGAGCTGGAGAGCTGAGATCGCTAGCCACCCAAGTTTCCACATTGCGAGCTTCTGCTCCCCGCAATACCTGTGCCCTCCAGAAATATTATCCTGACACTAGCTCTT
GCAATCTCCTCTAATCCCTTCCCAAGTCCAACATCAAACTGGGAACTGGACAACCTGGTTTGAGCTCCACTTAACAGTAAAGTGACCCTGAAGCCTATGGTTGCTCAACTCTAGTAAACA
TACCAGATATAGTTAGCATGCCATAAAAGCACCTCACAAAGGTGCTGTGAGGGTCCTGAGATATTCACCCAATAGTGCTCAGTAATGGTAAAATACTGTGTGGTTTTCATGTGTTCCGCT
GGGGGATGGTGTGTGTGAGCTGTAGTTCAATGGATGACATTGAGCTGGTGTTCTTTCGGGACATGGCAAAGCTATCTCCTACCTTGATTGTCACATCTTATGATGTGGCTTGGCCCTGTG
TGGGATCCTGATGGCTGACAAATTGATATGGCTGGTCTTTGGAAGTTTCTGAAAACTTTTTAAAAACTGCTTTATTGAGATGTACCATAAAATTCACCCATATGAAATAGACAATTCAAT
GATTTTAGTGTATTTAGAGTTATGCACCCATCACCATGATCTAATATCAAACCATTTTCATCAGTCCCCAAAGAAACCTTGTAAGAACAGTCAATCCCCATTCCCACCCCAGCCCCAATC
ACCTCTAATATACTTTGTGTCTCTATATTGGCCTATGCAGGACCTCTCATATCAACGGAATCATAGTGTGCCGTCCTTTTCTGAGTAGATGAATTTTATCATTGCTCTCCATTTGTTCGC
TCATTCATTCATTCTACAAACCTGATGGAGAGTCTTTTATATTCTGGGTACTGTCTTACTTGTGTCTAATGAATGCAGTCGAGCCCAGTAATATCACTAAGTGTCGGATTTGTGGTTATG
AGTTTTGGGTTAAAACAGCTTTTTAGGTTAGGACACACTCCTCAGTCTGATGCACATTGAGCACGGTTGTCCACTCACATTTTCAATCAGAGGACAGAAACCTGGTCTTTGGAAGGTCTA
GGGACTCTTCTGCCAGCAGGATCTTCTGGTTCTGCCCAATATTGACTGGCCACAGCTTCCCCCCAGAGGACTGGTGGAGAGAAACAGAAGCTAATGGGAGGTCAGCAAGACTGAAGCTGC
TTCTCCCCGTTCCCCTGCAGGGTACAATGCTTGGAGGCGCTTCTGTGGGCTCTCCCAGCCCCGGAATTTGGCACAGCTTAGCCGGGTGCTGAAAAACCAGGACTTGGCAAGGAAGTTCCT
GAATCTGTATGGAACACCTGACAACATTGACATCTGGATTGGGGCCATCGCTGAGCCTCTTTTGCCGGGGGCTCGAGTGGGGCCTCTTCTGGCTTGTCTGTTCGAGAACCAGTTCAGAAG
AGCCCGAGACGGAGACAG
AGGTAAGTGACCCTATCATAAAAGACATCAGCACCAGAGGCAGAGCAGAAAAACACTAGCATTTCAAGACTAAACATTGAAGAACATTGCTCTTTTTAGTAT
CATTTCTTCCAAGTTCACAGGATCTGAAATCAGGAGGCTCCTCTCTGAAAAGCTGGGTCAAAGGAAGAGAGACACTGACCCAGGCAAGGCCCATATTGCCTGAGCTGGAGTCATCTTAAA
CCCAGGAGGTTGCCTGGCAGCCTCACTGTTCTGGGTTGGTGGATCCCAGAAAACATGGGCAGAAAGGGCTAAATCTGGTTTCCCTCCAATACATTTGTGATTTAAGACCTGACTTCTTAT
TTGGAGTTGGTATTGCCCGAGTTCAAATCCAGCCCTGGCCACTTAAATTACTGTGCACTTAATACTGCGCAATTTATTTGGCCACTGAATTTGTTTCATTTCTGAGATTCCAGTCTTGCA
GGAATTTTGTGAGAGTTGAATGGAATAATATATGTAAAGTACCTGGCACACAACAGGTGCTCATTATAAGGTAATTCCTCCCCAACCTTCACCCACATCTCTCGACTGCCTGGTAGGTTC
TGGTGGCAGAAACGAGGTGTTTTCACCAAAAGACAGCGCAAGGCCCTGAGCAGAATTTCCTTGTCTCGAATTATATGTGACAATACCGGTATCACCACGGTTTCAAGGGACATCTTCAGA
GCCAACATCTACCCTCGGGGCTTTGTGAACTGCAGCCGTATCCCCAGGTTGAACCTGTCAGCCTGGCGAGGGACATGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCATCTGCTCCCAGCCCTGGCAGGGGTCCTGGCCACACTCATCCTTGCCCAGCCCTGTGAGGGCACTGACCCAGCCTCCCCTGGGGCAGTGGAGACCTCGGTCCTGCGAGACTGCATA
GCAGAGGCCAAGTTGCTGGTGGATGCTGCCTACAATTGGACCCAGAAGAG
CATCAAGCAGCGGCTTCGCAGCGGTTCAGCCAGCCCCATGGACCTCCTGTCCTACTTCAAACAACCGGTA
GCAGCCACCAGGACAGTTGTTCGGGCCGCAGATTATATGCATGTGGCCTTGGGGCTGCTTGAAGAGAAGTTACAACCCCAGCGGTCCGGACCCTTCAATGTCACTG
ATGTGCTAACAGAA
CCACAGCTGCGGCTGCTGTCCCAGGCCAGTGGCTGTGTTCTCCGGGACCAGGCCGAGCGCTGCAGCGACAAGTACCGCACCATCACTGGACGGTGCAACAACAA
GAGGAGACCCTTGCTA
GGGGCCTCCAACCAGGCTCTGGCTCGCTGGCTGCCCGCCGAGTATGAGGATGGGCTGTCGCTCCCCTTCGGCTGGACCCCCAGCAGGAGGCGCAATGGCTTCCTTCTCCCTCTT
GTCCGG
GCTGTCTCCAACCAGATTGTGCGCTTCCCCAATGAGAGGCTGACCTCCGACCGTGGCCGGGCCCTCATGTTCATGCAGTGGGGCCAGTTCATTGACCATGACCTGGACTTCTCCCCGGAG
TCCCCGGCCAGAGTGGCCTTCACTGCAGGCGTTGACTGTGAGAGGACCTGCGCCCAGCTGCCCCCCTGCTTTCCCATCAAG
ATCCCACCCAATGACCCCCGCATCAAGAACCAGCGTGAC
TGCATCCCTTTCTTCCGCTCGGCACCCTCATGCCCCCAAAACAAGAACAGAGTCCGCAACCAGATCAACGCGCTCACCTCCTTTGTGGACGCCAGCATGGTGTATGGCAGTGAGGTCTCC
CTCTCGCTGCGGCTCCGCAACCGGACCAACTACCTGGGGCTGCTGGCCATCAACCAGCGCTTTCAAGACAACGGCCGGGCCCTGCTGCCCTTCGACAACCTGCACGATGACCCCTGTCTC
CTCACCAACCGCTCGGCACGCATCCCCTGCTTCCTGGCAG
GTGACACCCGATCAACGGAAACCCCCAAACTGGCAGCCATGCACACCCTCTTTATGCGAGAGCACAACCGGCTGGCCACC
GAGCTGAGACGCCTGAATCCCCGGTGGAATGGAGACAAACTGTACAATGAGGCTCGGAAGATCATGGGGGCCATGGTCCAG
ATCATCACCTACCGAGACTTTCTGCCCCTGGTTCTGGGC
AAGGCCCGGGCCAGGAGAACCCTGGGGCCCTACAGGGGGTACTGCTCCAATGTGGACCCACGGGTGGCCAATGTCTTCACCCTGGCTTTCCGCTTTGGCCACACAATGCTCCAGCCCTTC
ATGTTCCGCTTGGACAGTCAGTACCGGGCCTCTGCACCCAACTCGCATGTCCCACTTAGCTCTGCCTTCTTTGCCAGCTGGCGGATCGTGTATGAAG
GGGGCATCGACCCCATCCTCCGG
GGCCTCATGGCCACCCCTGCCAAGCTGAACCGTCAGGATGCCATGTTAGTGGATGAGCTCCGGGACCGGCTGTTTCGGCAAGTGAGGAGGATTGGGCTGGACCTGGCAGCTCTCAACATG
CAACGAAGCCGGGACCACGGCCTTCCAG
GGTACAATGCTTGGAGGCGCTTCTGTGGGCTCTCCCAGCCCCGGAATTTGGCACAGCTTAGCCGGGTGCTGAAAAACCAGGACTTGGCAAGG
AAGTTCCTGAATCTGTATGGAACACCTGACAACATTGACATCTGGATTGGGGCCATCGCTGAGCCTCTTTTGCCGGGGGCTCGAGTGGGGCCTCTTCTGGCTTGTCTGTTCGAGAACCAG
TTCAGAAGAGCCCGAGACGGAGACAG
GTTCTGGTGGCAGAAACGAGGTGTTTTCACCAAAAGACAGCGCAAGGCCCTGAGCAGAATTTCCTTGTCTCGAATTATATGTGACAATACCGGT
ATCACCACGGTTTCAAGGGACATCTTCAGAGCCAACATCTACCCTCGGGGCTTTGTGAACTGCAGCCGTATCCCCAGGTTGAACCTGTCAGCCTGGCGAGGGACATGA

Retrieve as FASTA