Entry information : EsilPxd03 (Esi_0083_0098)
Entry ID 16972
Creation 2021-02-04 (Christophe Dunand)
Last sequence changes 2021-02-04 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Peroxidase information: EsilPxd03 (Esi_0083_0098)
Name (synonym) EsilPxd03 (Esi_0083_0098)
Class Peroxidasin    [Orthogroup: Pxd003]
Taxonomy Eukaryota Phaeophyceae Ectocarpaceae Ectocarpus
Organism Ectocarpus siliculosus    [TaxId: 2880]
Cellular localisation N/D
Tissue type
Inducer
Repressor
Best BLASTp hits
Perox      Score  E-value    EsilPxd03 start..stop  S start..stop
EsilPxd02  2566   0          1..1490                1..1489
EsilPxd01  1060   0          282..1262              356..1344
EsilPxd01  292    1.77e-80   10..220                82..302
EsilPxd01  73     5.52e-13   1329..1490             1445..1627
EsilPxd04  815    0          17..884                1..805
EsilPxd04  278    9.4e-77    987..1262              836..1095
EsilPxd05  504    4.23e-161  10..679                87..763
Gene structure
Exons
Exon   Start..End      Size | Exon   Start..End      Size | Exon   Start..End      Size | Exon   Start..End      Size
N° 1   565069..565109    39 | N° 2   565629..566025   395 | N° 3   566334..566453   118 | N° 4   567116..567646   529
N° 5   567965..568381   415 | N° 6   568980..569084   103 | N° 7   569808..570011   202 | N° 8   570274..570494   219
N° 9   570889..571144   254 | N° 10  571476..571610   133 | N° 11  572010..572252   241 | N° 12  572793..572981   187
N° 13  573312..573536   223 | N° 14  573929..574054   124 | N° 15  574435..574623   187 | N° 16  575159..575215    55
N° 17  576351..576461   109 | N° 18  576714..576818   103 | N° 19  577374..577488   113 | N° 20  578289..578385    95
N° 21  579197..579392   194 | N° 22  580156..580230    73 | N° 23  580422..580643   220 | N° 24  581122..581217    94
join(565069..565109,565629..566025,566334..566453,567116..567646,567965..568381,568980..569084,569808..570011,570274..570494,570889..571144,571476..571610,572010..572252,572793..572981,573312..573536,573929..574054,574435..574623,575159..575215,576351..576461,576714..576818,577374..577488,578289..578385,579197..579392,580156..580230,580422..580643,581122..581217)
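The exon coordinates can be cross-checked against the protein length reported for this entry. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention), so that each exon spans end - start + 1 nucleotides; under that assumption the computed spans differ from the values in the Size column by 2. The function name `parse_join` is hypothetical and not part of the database.

```python
import re

# Exon coordinates copied from the Gene structure field of this entry.
JOIN = (
    "join(565069..565109,565629..566025,566334..566453,567116..567646,"
    "567965..568381,568980..569084,569808..570011,570274..570494,"
    "570889..571144,571476..571610,572010..572252,572793..572981,"
    "573312..573536,573929..574054,574435..574623,575159..575215,"
    "576351..576461,576714..576818,577374..577488,578289..578385,"
    "579197..579392,580156..580230,580422..580643,581122..581217)"
)


def parse_join(location):
    """Return (start, end) integer pairs from a GenBank-style join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(JOIN)

# Assumption: coordinates are inclusive, so length = end - start + 1.
lengths = [end - start + 1 for start, end in exons]
cds_length = sum(lengths)

# One codon is the stop codon, so the protein has cds_length / 3 - 1 residues.
protein_length = cds_length // 3 - 1

print(len(exons), cds_length, protein_length)  # 24 4473 1490
```

With inclusive coordinates the 24 exons sum to 4473 nt, that is 1490 codons plus a stop codon, which agrees with the protein length of 1490 aa given under Sequence Properties.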


Literature and cross-references EsilPxd03 (Esi_0083_0098)
Protein ref. GenBank:   CBJ27734.1
DNA ref. GenBank:   FN649064 (565069..581217)
Protein sequence: EsilPxd03 (Esi_0083_0098)
Sequence Properties
First value: protein; second value: mature protein
Length (aa): 1490
PWM (Da): 162508.07
Transmb domain: o655-677i730-747o
PI (pH): 4.96
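The Transmb domain value is a compact topology string. The sketch below shows one way to read it, assuming a TMHMM-style notation in which `o` marks a segment outside the membrane, `i` a segment inside, and each `start-end` pair a predicted transmembrane helix. The entry does not name the prediction tool, so this interpretation and the helper `parse_topology` are assumptions.

```python
import re

# Topology string copied from the Sequence Properties of this entry.
TOPOLOGY = "o655-677i730-747o"


def parse_topology(topology):
    """Split a topology string into helix ranges and the side labels around them."""
    helices = [(int(a), int(b)) for a, b in re.findall(r"(\d+)-(\d+)", topology)]
    sides = re.findall(r"[io]", topology)
    return helices, sides


helices, sides = parse_topology(TOPOLOGY)

# Each helix sits between two consecutive side labels.
for (start, end), before, after in zip(helices, sides, sides[1:]):
    print(f"helix {start}-{end} ({end - start + 1} aa): {before} -> {after}")
```

Read this way, the string describes two predicted helices (residues 655-677 and 730-747) with both flanking regions of the protein on the outside.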
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MKTILLGALRGQPRELMTDVFLSSPPSVSDMNAVSIAWGQLLLLDLSYTVDNSSEPFEIACDDGGGSVDVWCPLGEASDPIPFFRSQATVTDSVRNPINYASSFIDLDFVYGRSKDAADA
LRTFDGGMLSMADDNMPIKNSDGTW
IADQRTARFPLTFALHVVLLLEHNRCCVDVAPALNYTSDEDMYQACRGWTIATFQHITEDEFLILLMGRSIGDWTVYQDDDGGSRRRLSPEQRRE
LLFTFDYYDDLLNPSADVFVTVAMTAAFESALPSTLRIVSEGYVATDYDHLELTVAAEDITGLFEHSAIGDILRGAVLSPAMAVWPHFASAVSNASPLFKLPVDMVQRARDHGVPSYNDV
RE
AYELSKATAFSDVSADDDVVQLLYAAYGGEIENLDACVGALAEEKEASLGGNFGDLLHTAWVNQLYRTFFGDRYHHLHSRPIENVSLASISGLINQTLGVTDLPASGFTVPEVTVCTG
ECEAAGISGVSLAERYAMSWE
VIDDQTISISLSVLGIGDSGMMGIGFGGLSMTDAQDFIICEVFSTGGAECIDRSPTGGRSEPQPDTLQSGLQVTNVTTDKTWTTVTFSRERATLDAEDY
DLFE
DIENEEDTLVIYAFKKGEGVGQHPNTNRGAATINFVTGDVDTQCDGETNFVSLHGALMLIAWMLIAPWGIYYARYRKGDAIKWAGREWYEMHEDIMIVASEAVLPLGITAVFASRG
RTSEAHAHWGYYMIAAVAMQIFTGWMRTKGLEAKHSNFSLLH
FNKHFHIWAGRFAYAAGVVQCYRGLELVSSDDELIFSAGDGLDLQLGSFGWVKDYLFPAWFALVAGGFLVLEAQKQYQ
RFFKKGAASVCGVVSIVNELHDGSMHKGRLIPRTLDLPIYSVAAFNDK
VLSGQSWLMVDEAVLDVSDFAQRHPGGRRLILNALGTDVTQELIGQENSVGHAMSFPPHVHTGSAWRIIRSL
VVGYIEEKDAAEPTAALEDDQEQEGEEKVDTTTGDIPVPDANNRRFRVAGKAVMLNNRLALGDDTL
ATKAMRLNDLAVIPAPTRRPSRTAASNNPASVVGNPPIEMDVAQRADENDGGWG
SSTDLLERFQVCPLLFRERMGAASAVGRGHLPSKRPVYRYIFSCPAKAQAQ
AQAVSGVCYFNMRAQEEGKGVVQRPYNAFAVRLLDVEPPTPGGRVAWSKTSAKLPKVVPAEETTEGVLC
IEMRIRMYHDGAMSKLLEKLSK
DTDNVAVQLQGPFLVNKLAPPPAHRNVIMIAAGTGVNPRTPRFPRGRGWSSCGRARRRLTSTVPTKSRPCRSNLDSSSSGKPKPPPYSTLARDGGRDE
QPPPYRTPGREEKKRSAADAGAKKERAPTGVGKLLTT
RTWRNSKWNESPTTSGRRRAPMQDTSNYQVGDGLVRGRVNREILETVFGEALISSIAAYNRQRALNSLACSDSDVGDNKEGEG
EDDRDLIGTDQTAGKL
QVVVSGPTAFVANVKQLLTEMGVPAGSTVLLD

Remarks
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAGACAATCCTACTTGGGGCGCTGCGGGGACAACCCAGAGGTGTGGTGACAAGCGCAACAATGCGTCCATGACCCACGACACTCTTCCTAAGTTGGTCGCACGTATATTCTATGGCG
GAAAACAACAGGGCTCGAAGGCCTTTGAGCTCATTAGCGGAAAATGACGTTAAGTTGGGGTTTTTCGACTCTGTTTTCGACGCTAGCCAGTTCACTCGGCCGTCCCCCAACACCTCGCTT
CGGTTGCACCTTCCTCCTCTTGCCATGCCCCGCGGCCTACTTTCCAGATACGTTCGCCAGCTGAATATTCGTTTGCGGACGACGCATTCACTCCGCTGGAGGAATTTAGGCCGACTGCGA
GGCAAGTACGGCCGTAAGAGGATCTTGTTTCATCGGAGGCGTCATGCCGAGAACCAGTTGTTTCTAACTTTGGTATGTTGGCACTGCCCTCACTCCGCGTTCGGGCTCGAACGGGCTTTG
ACATGTCTCGGCTACGTGCGCCTAACCCACTCATCACCGTCTGTACTCTGTACTCCTTTATTTCTCCCGCAACCGTTGACAGGGAGTTGATGACGGACGTGTTCCTGTCGTCGCCACCGT
CCGTTTCTGATATGAATGCGGTTTCCATCGCTTGGGGACAGCTGCTGCTTCTCGATTTGTCCTACACCGTCGACAACAGCTCGGAACCCTTCGAAATCGCTTGCGATGACGGAGGGGGCT
CCGTTGACGTTTGGTGTCCTCTGGGGGAGGCATCTGATCCAATCCCTTTTTTCAGATCCCAAGCTACGGTGACTGATTCTGTCCGAAACCCGATCAACTACGCCTCCTCGTTTATCGACC
TGGACTTCGTGTATGGAAGGAGCAAGGACGCTGCCGATGCCCTTCGTACCTTTGATGGCGGAATGCTGAGTATGGCAGATGACAACATGCCCATCAAGAACAGCGACGGGACTTGGTTG
T
GGTAGGTTGGCGAGGCCAGTGTACTTGTAGAAGTTTAGAACAGCAGTGTCGAGCGCGCAGCAGAAGATTTATCCAAGTTGCGAGAACCTAGATGGTCGACAGGTCGTGTCTGTGACAAGC
AACGGTTTGTATGTACATCTGCACTGCGGTCAACAACATGTTGTCTACATTCGTTGTCCCGATCTTCATTGTCCCGAACTTGTTCGGTTCGGGAACTATCCATCGATGTATTGCCAACCA
GTTGACAATCTGTATTGGCCTCCATATTTTCTTGATTTTGCCGCTGTCACCTTGTCACCTCTACGGTAGATCGCGGATCAACGGACAGCCAGATTTCCGCTGACGTTCGCCCTTCACGTC
GTTCTCCTCCTCGAGCACAACCGCTGCTGCGTTGACGTCGCGCCAGCTCTGAACTACACGAGTGATGAG
AGGTCAGTCATGCTGAAGATAAGATACCCGCTTCTTAGGGAGCACACAGGC
TTGTGTCGTGCCTGATTCCTTGCCGCAACTTCGATGGAGCATCATTACGTCGTCCTATTCGTTTCGATCGGAACAACAGAAAAAAATATTTGTAAAGAAAGAAATACGGAGTTGTCCGTT
AAGGTGATTGGCTTGGGTTGCCATACCCACCCCGCATAGCCAATAGTGGCAGTGGATCGGCGTGTCCCACGTGTGTGCGCCAGCAGGTGTGGCCACGTGCCCCCATTCATACGCAATCGC
GTCGAAATTGTTCGTTTGACCAACCCCACATGCATTTTTTGGGCTTTGAAATATCAGATTGAACCATGAACCAGAATCGTCCCCTTTCGCGCAGGAAGCCACATTATATGCTTGGCGAAC
GCAGAGCGTAGTGTCATGTTGAGATCAGGAATATTTCACACGAAGAACCACTCATTCCTCTCTTCAGAGACTCGAGACTCGCTCAAATATTATGTCATGAGCTCGAACAGCAGTGCACAT
CATGGCTTTTAACGTATATAATTTTTCCACTGCTCACCCGATGTTGTTGCCCTAAGACGTGCGGTTACCTCGCACCTCACTGACGTGCAACCCGCCGACCTCACACACAACAAACTCCTC
GGTTGCTGAACAGGATATGTATCAAGCTTGCCGAGGCTGGACGATCGCCACATTTCAGCACATAACTGAAGATGAGTTCTTGATTCTACTAATGGGAAGAAGCATCGGCGACTGGACCGT
GTACCAAGATGACGATGGCGGCAGCAGACGCAGGCTATCCCCAGAACAGCGGAGGGAGCTGTTGTTTACTTTCGACTACTACGACGACCTCTTGAACCCGTCCGCCGACGTCTTCGTGAC
GGTGGCCATGACCGCCGCGTTCGAGTCGGCGCTGCCTTCTACCCTCCGCATTGTGAGCGAGGGGTATGTTGCCACTGACTACGATCACCTTGAGCTCACCGTTGCGGCAGAAGATATCAC
GGGCCTGTTCGAGCACAGCGCGATCGGGGACATCCTGCGTGGAGCGGTTTTGTCTCCCGCGATGGCGGTATGGCCACACTTCGCTTCGGCGGTGTCGAACGCCTCGCCGTTGTTCAAACT
CCCAGTTGACATGGTGCAGCGGGCTCGTGATCATGGGGTGCCATCGTACAACGACGTCCGGGAG
AGGTGAGTCCTTAAAATTGTCAAAACGTTTTGGAGGCACCGGTTCCACCAACCTCG
CTGTGTCGAGGGGAGGGTAAGGAGCGAGCCCTCTCGGTCACAGTTGCACGTGGGGTATTGATTGTCCCACGCCTCAGGGCGTGGCCGAATGCTCATCCCTAGGGAAGAAATGCTGGAGTA
TCAACCGGAGGTGAACCTTGAAGTGTAGGCCCTTACTGTCATCTTCGCCACGAAGCACCCCCACGTGTTCCCAGGCTATTACGAGGGGACACATTTCCATGAAAAGAATAATCCACCTGT
TTACATGCACGAACAATACAACAGGCATATGAACTTTCCAAAGCCACGGCCTTTTCGGATGTCTCGGCGGACGACGACGTGGTGCAACTCCTATATGCTGCGTACGGTGGAGAAATCGAA
AACCTCGACGCGTGCGTCGGGGCCTTGGCGGAGGAGAAGGAGGCGAGCCTAGGTGGCAATTTCGGTGACCTGCTGCACACAGCATGGGTGAACCAGTTGTACAGAACATTCTTCGGGGAC
CGGTACCACCACCTTCACTCGAGGCCGATCGAGAACGTGTCACTCGCGTCTATCTCAGGACTAATCAATCAAACGCTCGGCGTGACTGACCTGCCGGCATCGGGTTTCACGGTGCCGGAA
GTTACCGTGTGCACCGGGGAATGCGAAGCTGCGGGAATATCGGGGGTCTCGTTGGCAGAGCGCTACGCCATGTCGTGGGAG
AGGTACACGATGTCGTTGAAGGAATGCTTGTTTCCTAGG
TTCTCGGATTAGGACTGTACTGGTATCACTCTGATCCCACAGAGTGGCTCATCGACACATTCACCGACGTCCAGCAGTGCGATATTCCCTTCTTCCCGGAGCTTTCGGATGTACACAGTT
TGGATTGTTTCTTGGGTGAACGAACGGGCGGGTGGAAACGAAGGGCCCCTCCTACACGCCGCGACCGTAATCTTTCTTGGGATGAGGGTAGACCCTTGCCCTAGTCATCCGAATTTTGAT
TCCTCGCGAACAAGCGACCTCGAAGGAGTGTTTCGACCGGGTTCGACTGCGTTGATCATCCATGTTTCACCACCTGGGGCCGCAAGCCCATGGGGGTTGTCGCCTAATGGGAAGTCTGCG
ATGGTCCCTTCTCTGCCACGTTTTGTCAGTAGTTGCGTGATCGACGATGTAGATGTGTTGGATGTTTGGACGCTGCACCACCATGCAGATTCTAATGATGCCGCTCCACGTTACTGAGCG
GCATACGGTGATAGCGAAACGAGTTTTCCGGGCTTGACTCCCACGGCGATGCATCTGTTGAAACCCTTGGCCATTTCACAGGTGATTGATGATCAAACGATATCGATTTCCTTGAGTGTC
CTGGGTATCGGCGACAGCGGGATGATGGGCATCGGCTTTGGAGGATTGTCCATGACGGATGCCCAG
AGGTGCGAGAAACCAAGCTATAATCTAATCGCTCCGAAATTCTGAACACAACAG
TCTGTTTAGTTCTTCCTCCGTTGAGCGCTTCCGGCTGTCCTCCTCACGCTCGAACCTTATTCGGGTTATTTCGCTCAGGGGGGTGCGGACTATTTTTTTTTCTAGAGCAGTTTTTCAGGT
GAAAGCGAAGATGTCACAATTTACAGGTCGTTCCACTGGTTGTCGTTACCGTTGTGTTCCACTTGAGTGTATATCAGCGCTATGCAATAAAGCCGCAAGCAGGGAATGGCGGAAAGAGGC
TCGAAAACAATGCTGTGTCTTGTCGTCCAGGCGGGAGGCTTTTCTCTCCTCCGGGGATGTGCTCAACTTTGGCAAACAAACCCCCGCTAGTGCCGGTGTTCGTTTGGGGAGAACTGTGTC
CTACACCTTGTACGAAGTACCTCCCCACACACTCGGCCAGCTCAAAGCTGCTTGCGTCATGGTTTTTTTCTGCTGTTCATGACGTCGTTACGCCAGTCGACAGCAGTTGATGTCCATACT
CTGTGACGGCCGAATACTCACCCGTATGATTGTGCGCGCGGGAGTATGTATGCATCAACGATTCCAACCAGCTCACGGTCGTGCTCTATTCACGTGTTGCAGCATGCTGTTCAACACAAT
AGTCTACTTCCCGCCGTCTGCTCCGCCAACGCTCACCGCTGGTGTGTTGATGCATATGGTGCCCTCTAAAGGATTTCATTATCTGTGAGGTCTTCTCAACGGGCGGTGCCGAATGCATTG
ACCGGTCTCCCACTGGAGGGCGATCGGAGCCGCAGCCGGACACTTTGCAATCGGGCCTCCAAGTTACCAATGTAACGACGGACAAGACCTGGACTACCGTGACGTTCTCGAGGGAACGGG
CGACGCTGGATGCAGAAGACTACGACCTCTTTGAG
AGGTGGGTGCTACACCTGCTGTGCATGTTATCTTTTGGAATATTTTTCGCGTCCACCTTGATGACGTCTTATATTTTGGGGAGTT
CATTCCGCGGGTTCCGGTATACGCCAGTGATGCTGCCGCTGTGAATTCAACGCCGATGTAAGGAAGAGCGTACCCGGCCTGTGTCAACCTTCCATATATGGACATTTTGGTTTCGGTACA
AACAATATCCGTTAGCGGCGTGCATGCCGCCTCAAGTACGTTTTTTTCGTCGCTTATAGGATATCGAAAACGAAGAGGACACCCTCGTCATCTACGCGTTCAAGAAGGGAGAGGGCGTGG
GCCAACATCCAAACACCAATCGTGGAGCGGCCACGATAAACTTCGTCACGGGAGACGTAGACACTCAGTGCGACGGCGAGACCAACTTCGTTTCGTTGCACGGCGCGCTGATGCTCATCG
CGTGGATGCTCATCGCACCGTGGGGCATCTACTACGCGAG
AGGTGAGCAACCTCAAACGATAATTTGTTCTAGAGTTTTGGATGTCACTACCTGAGTCAAATCGCGACTGAAAGCTTTCG
ATAGCAAGAAGATCCCCTCTTTTCCGTGGCTAGCTAGGTGAAACCACAACTGATTGGTACTGACAAGTCAGACGGGACGCTCGCTTTTTTGCCATTTGCGGAGCTGACCAGCAACAATAC
ATGTCCGAACGGCTGATCCAACCTTGGCGTTTTTGGCCAGCCGTGCCTCCCTCCCTCACAGCCGTAGTATTTTCTTGCGCGATCTCCAGCCGAATAACACGGTGATTGGTTGCCGAAAGC
ACAACCTCGCTCATAACGATAAGTCGTCTCGAACATTACTCGTAATGCATGTGACACACCATGGGCATTATCCCAGGTACCGCAAGGGCGACGCGATTAAGTGGGCTGGACGCGAATGGT
ACGAGATGCACGAGGACATCATGATCGTTGCCTCCGAAGCCGTGCTCCCCCTCGGGATCACCGCTGTATTTGCTTCCAGGGGCAGGACGTCGGAAGCACACGCGCACTGGGGGTACTACA
TGATCGCCGCGGTCGCAATGCAGATCTTCACGGGCTGGATGAGGACCAAGGGATTAGAGGCCAAACACTCGAACTTTTCCTTGCTTCACAGG
GGGTGAGATGAGCAAGACATGTTGGTTG
TCGATGAGACGTGTACACCATTTATTTTTTCCGCATGCCAGTACTTGGAGGACATTCTTCGCTATCCATTGACTCGCACTGGGTAGCTCTCGCAGGCGAAAGCCGGCAAGTTCCAGGCAC
GTGCCGTTCGCTTTGGGTGTTCGCTTGACGATCCACGCTGTTCATCTCAACAATCGAACCAAGGCTACCCAGCCCCAACAACATCACTGGATACCATGAGAAAAAATGACAACACGTTTG
AGGGCCTCAATTGCCAAGCATCTCCTAGGCTAATCAGCTACTGCGGGAACTCCCTTGCTCAACAGTTCAACAAGCACTTCCACATCTGGGCCGGACGATTCGCATACGCAGCCGGAGTGG
TGCAGTGCTACCGAGGGCTGGAGCTCGTGTCCTCGGACGATGAGCTCATATTCTCCGCAGGCGATGGCCTTGACCTGCAG
AGGTGAAGCCATGGGAGCGAATCGTAGTCCCTCCTCACCA
AGTATTCTACGTCACCCCGCACCTGCATTGATCGTGACGAATGCGGGCAATATAGTTGTTTCCTACTTCATGAGGCGTACTCCGTAAGTTCCCAGCGTTGGTGTGATGACTTCCGTCGCT
GAAGAGCTCGACTTGTATGTAGGAGTGCAAAGGACGCAGCAGAATCCTCGGGATATACAGGCAGAAACACCCCCGGCTGTGATATCGGAATATATGTGTCGCTGTACCCCTCTGCCTTCT
TGGATACTGTCCTTGACGGTGACAATCTCCTGTCAGATCGCGCTGCTTGTCACATTAGCCTCCTTGTGTTCGCTCACCTCTTGCTGAAATTCCTGTTCGCACTATTTCGACCGAAATACA
GCTCGGCAGCTTCGGTTGGGTCAAGGACTACCTGTTCCCTGCCTGGTTTGCGCTAGTCGCCGGCGGCTTTCTTGTCTTAGAAGCACAAAAGCAATACCAGCGATTCTTCAAGAAAGGGGC
GGCTAGCGTGTGTGGAGTTGTGTCTATTGTCAACGAGCTACACGATGGCTCAATGCACAAAGGCAGGCTGATTCCGAGAACGTTGGACCTGCCTATCTACAGCGTTGCAGCATTCAATGA
CAAG
AGGTGAACGAAGACTGTGGACTTTTGGCGATCGGTAATAGTGTTTGGCGTGCTTCGCGCGGCAGGGTGTGCGCTTCATTGAACTGCTTGAATCGCATTAAACGTGGAGGCATCGAC
AACGGGTTTCAAGCTCTATTCAGTCTTTTCGTAGTACGCTCGGTCAAGAACAGTTCGCGGAGTCGGTTTTTCTTTACCGTGCAAAGGGTGTCGTTCAGAATAAATCTCCCGCTCAACACC
TATTGTCCTTGGCCTAAGTTTGTTTCCAGTCAACGTCTCGACGCCATGAACCTCTAGAAGGAAACGTCTCCCACCAATTCGGAAAGGATCGTAGACAACATGTGCCAAATGCAACCCATA
TGTAGCATGCACCTCCTAGGTCCCGCAAGATACGCAATAAGTCGAAACGATCTCCCAACCTCCACCATGTCGCCATTCTCTTGGAGTCTCACGCGGACCATAACCAACACCACACACCGT
GCCTGATGTAAACGTGTCTTGCGGCCCAAAACTTACCTGCCTTCCGACCAACCTCTCTTCGATCAGGTGTTGAGCGGCCAGTCCTGGTTGATGGTGGATGAAGCAGTTCTGGATGTGTCC
GACTTTGCGCAGAGGCATCCAGGCGGTCGACGCCTCATCCTCAACGCCTTGGGCACCGATGTTACTCAGGAGCTGATCGGACAAGAGAACTCTGTGGGGCATGCCATGTCCTTCCCGCCT
CACGTGCACACCGGG
GGGTGAGTGCGCGCTTGACAGAACCAAATTTCCCTCCTGTTGAAAATGTGTAGAACAGACGAATGTTAGGCGTATCGTTCTGATGGCCCTCGAACAGCACAAGAA
GATGGTGTGACTGCCGACCACCTCAACCATCACACACAGACCTCTTGGAAACCATTGGATGGCAACGCAAGGGCCAAATCCTGGACAACCCCCTCCGCAAATTGTCGTCAGGGGGGTGGA
TGATTGGATGATGCGATATATTTCGTAGCATTGCAGTTATTATCGCCACCAACAAGGCTCCTGACCCACCTGTTGCGTCTACACCTTTTCTTACTCGCTGGCGGCAGAGTGCATGGCGAA
TCATTCGTTCGCTTGTAGTCGGCTACATCGAGGAGAAGGACGCTGCGGAACCTACGGCTGCCTTGGAAGATGACCAGGAGCAGGAGGGAGAAGAGAAGGTCGATACCACGACCGGTGATA
TTCCCGTTCCTGACGCAAACAACCGAAGGTTTCGTGTCGCGGGCAAGGCGGTAATGCTGAACAACCGACTGGCCTTGGGCGACGATACTCTG
TGGCAAGTGACGATCCACGTCACGATAC
TCATGTGTCTCGAACAATGACCTCACAGCCAACGAATTGCATGTATTCGACAGGCGTTCCGAAGCATCCACATCATGATGCATGAGTGTGCACCTGAGTCGAAATCAACAATTTGGATGT
GAGATACTTTGTGGGCCGGCCGTTGAGAATTTCCACACCAAGCTCAGCTACTTGGTCACGTAGGTGCTTTCTCGACAACACGCTGTACACACCGCACCAGAAAACTCTCCCCACTTTCCC
TCGATTTTGGGATCGCCTGCTGAGATGTGCCTTTCCGTATCAAACCATGCCGATAGGTCGTGAGGGGCGGCATGGAACCGATACTTGTTGTTCCCTTTGTTTCGTGGTATGCAACACTTC
CTCCAGGCAACCAAGGCCATGCGGCTGAACGACTTGGCTGTCATTCCCGCTCCAACCAGGCGCCCCTCGAGAACGGCAGCGTCGAACAACCCAGCAAGCGTTGTGGGCAACCCGCCCATA
GAAATGGACGTC
TCGTAAGTCTCCAAGTCTCCCAAGTTTTTGTCTCGTGGACTGTGCAGACGTGCCGAGACCTTGTTTCTCAGTACCCGGCATAGTCTAGTGCTTGTTTGATGGACGCCC
TGTTCCCAACTGAAAGATGGGCAGATGACCATGTAGTTGCACGTTCAGATTCCCCAGCTGTTGGTTTTGTGTGCTGCTGCTTTTTCCCGCGAATTTCAATCGGGGTCAATAACACACCAC
GTCGAAATCCCCCAGAGGGAACACCCGGCGTTGCCGGAGGGAAACACCAGCATTGTTTAGCACCTACAGTATGAGCGAAGCTCAAGGACAACATAAGAGCTGATACCCATGTTATTTTCC
GTTGAACTGGAACGGAACGTTTGATACATCGAAGGCTCAGCGCGCCGATGAGAACGACGGCGGATGGGGAAGCAGCACGGACTTGCTCGAGCGATTTCAAGTATGCCCCCTGCTCTTCCG
AGAGAGGATGGGAGCCGCCAGCGCCGTGGGCCGCGGTCACTTACCCAGCAAGCGCCCGGTCTACAGATACATTTTTTCTTGTCCCGCTAAGGCGCAAGCCCAG
AGGTATGCACACCCTCG
TTTCCCTTCATTATTATAGTGTTTTGCGTCAATCCTCCCCTTTGGCATGCTCCGCTGTAGACACGAGGACCCGCCAGCCAACGTAGCTCTCTTTTTTCCGCTTGCTTTGCATTCTGAACA
GTCGGTCGGGCCTGTCTGTAGACAGAGAACATAGACAAAAAGTGGAAAACGTAGGTATCAAATGTCGTCGAAAGCATACGTTGACTATGACACCCATGACGGAATTGGAGTCCATGAAGT
CAATGTCTCTTCCTGTGGTCCCGCCGTGCACGTCTAAACAGCGCGGAACCCTATCCACGCCGCGGGACACAAAACAAACGTGTTGGTCCAGAGGTCTTTTGTGCTGATGTCGCCTAGCTC
TCGCATCACACGGGTTTTTCTGTTCTCTGTTGCGCGGTATACGTTTTCGACCACGTCTTTGGTCAACCAGCAGTTCATGTCATTTTCATGATTTTTTCGTTAGTTGCCTTGTGGTTTACA
CGGGAAAATTCTTCGGCCCCTATCAATTTCGGTACAAAAGGCGCAGGCGGTTTCAGGGGTCTGCTACTTCAACATGCGCGCCCAGGAAGAGGGAAAGAGGTGAGGCTTGAAACGAACAGA
GTCGTGGTGAATGCTCTGTGGGATCGCCGTTCTGCGCCAGGTACATGCCTACCTTTTCGTGCGCTACCACGGGTTGTTGTGACATCGTTGTCGCGTGCCCCGCCCTAGCGATCACGTGAC
TGAACACAACGCCCCTACGCTCGGCTGTATTGTGCGGGGTTGGTTTTCCTCTCGCCCAAGAGGCGATACTGCATTCTGACAAGTGACCTTTTCGGGTCTGTTGAACCATATCAATGGTCT
AGATTGACTTGGCTCCCGCTGGAATGTTTGTGTTGGCCGTATTGATGGAATTTTCATCCAATCTGCCACAAAGGAGGTCTCCCAATGTCCTCTAGAAAACGAATGCCGTCCGTGCTGCAA
TGGTTGCTGTTGTGGACTGCTGCTACTGGCATACGTCCGGAATAGTTATAGGAGAGCTGGATGTGGCATGGCAGGGACCCCTACAGGTCTCAATCTCTCGCACTACCTACAGTTTCGCTA
AACCGCCCATGGGACACCCATGGGTCTCCCATGGGTCGGCTATGGGCACTTCATGGGCCGGCCATAGGCAGCCCATAAACTGCCCAACGTGGGCACTTGAACCTAGAACTTCGCCACCGG
GATATCAACCTAGAACTTAGCCACCAGCGCCAACCTAACACGTTCGTGTGGGCTTGGGGTTAGGGTTACGGTTAGGGTTATTAGCCACTATTGTACCCGTTCCAACGCGGACAATCTGTT
GCTTAGGAGTACACGGGTTCGGTTATAGGGCTCCCATTGGCAGCCCATGGGCTTCCCAAGGGTCGCCCATCGGCTGCCCATGGACCTAACGGGGCGCCCATGGGCGGCCCATGGGTCACC
CATGGGTCACCCATGGGAGTCCCATTGGTCGTTTGGGAAAACTGTGGAAAGTGTGGGAGACCCACAAGCCAACCCTACAGCATGGCTATCTGTGTCTCTCACTGAGATGCGAGGATGCGA
GGTTTGAGCACACACGGTCAAGAACATTCTCTGTCGACCTGATCAGTGACTTGAGATTTCCTACGGAGTATCGAAAATCGGCATGGATTTGTTTTTCGTCTTGTCGTCAGCGGCATTGTT
CGTGATCTCAACGTCGTTTGCGTCTTCACAACAGGGTGTTGTACAGCGGCCGTACAATGCCTTCGCGGTGAGGTTGCTGGACGTCGAGCCACCCACACCGGGCGGAAGGGTGGCCTGGAG
CAAGACTTCTGCGAAGTTGCCTAAG
AGGTCAGCACGCTTGAGACAAATCGAGCGAAAGAAGAAAATAATTCGATGTAGACGGCATACTCTAATCTGCTCCTCGTTCGCTACACACACCTG
CACCTGAAGGTTGATGCAAACTGTAGCCTCCATGGACATAGTCCGAACGACATGTTCAACCTGTGCCACAAGGCTCTCGTGCGAGCCCAGCACCTGTCCCATCGTTAACGTTAACGCACG
GTGCCTAACTCGCGTGACGGTTTTTCCTCTGTGTGGAAGGTTGTGCCGGCAGAGGAGACTACGGAGGGCGTACTGTGCATCGAGATGCGTATCAGGATGTACCACGACGGGGCCATGAGC
AAGCTACTAGAAAAACTCTCCAAG
AGGTGAGCTCTACGATTGATTCCGCAACATGATGTTGTGCAATGGTTTCGCGAAGTTGCTCTGGCAGTTTTCTTACCATCCTCCTCAAGCCGAAAT
ATTTTCGAGACTGAGGCCGTTTTCGTTTATCAAATATTTTGCCTCATGTGTGCTAACCTTCGGCGTAAGGTGCCGGTACACCGACCACTACCATTTGTTTTAATTGTCACAATTACACTG
CTTGTGTTACGGGTAACAAAATCTCCTTCAAGAGTCGAGCATCAAAATGTGACAGATGATAGTTTTATCACCTTGAGTGGGGCGTTGTGGATGAAAGGGGACTTCGTGAACATTTGCGGA
GTCTAATACTCAAATTCTAGCTGCTGTTTGACCGAGGGAACCTGTAGACTCAGCTCTCGATTTTATTCCACGAATATCCGTCAGATAATCGTTTTTTTCTGCCAAGGCCAAGATCACAAA
CAATATTTGCTTTTCCACGCCATGCTCCGCCTAGATGCTAAGTAACCGGATTAGCGAATTTACGTTTACTCTTGATAAACGTCCATACAACTCGTCTCCAGGATACGGACAACGTGGCAG
TCCAGCTGCAAGGACCATTCCTCGTCAACAAGCTCGCTCCGCCACCGGCCCACCGCAACGTCATCATGATTGCGGCGGGCACAGGCGTCAACCCAA
AAGTGCGCAAAGATGGTTACTGAT
GGATGTGTTGTCTCTACAATAGCGTATTCGCTTCCGGTGTGTGTGACGAAGACAACGTCGCCGAGTGCCCAGTATGCCGCGGATGTTCGCTCCGCGCCCTTGGAGCCAAAAACGGAAATG
CTTTGAATTTGGGTATTGAAACGTCTTCGAAGGTTTAACGACAGGGCTAAGGTTAGGCTAAGGGTTACTCTAACACTACCTGACCTTGCCGTGCAATATATGTCAGAAATTGGCCGTATA
ATACCTTAAGTACCTCAGGAATCTGTGCAGACCCATGTCGCAGAGGCCAATGTGCATGACCGCAGGAAATCGACGGCCTTTCCTATATATCGTGTCAGATAGTTCAGGGCCACCACCTTT
GTCGTACTGTCTTATCCCCGCCGCCAGCGATGTCGTACTAGACCCCGGCCACCACATCAAACGAAGGCGTGCTCTGTGACGCAGTCAACGAAGTGTCGCAGCCGACTGTCGCCGTATCTT
GACGTCCCCGCGCAACCTATGTATGCCCCCCCCCGCGTATCTCCTTCATTCGCGGCGTGACGTCCCTACTGGTGGTCTTCGACGGTCGAACAAATGACCGACGGCAATGGCAGTGGTCCA
GCAGATTCGGAACTATCTCAACGTTTCCAGGTGGGCGGCAAATCGTACGTGGGATGGCCATCATCAGTACTTTCACGACGTTTCCGAAACCCTCTGGCGTGCCGACGACTCGCATGAACG
GCAGAAGAAACTCGCGCGTAAAGCGGGATCCCTCCCTTCGTATTTTCAATGACCCCAGGGACACCACGTTTTCCACGAGGTCGCGGCTGGTCCTCGTGTGGCAGAGCACGACGGAGGCTG
ACTTCTACGGTACCGACGAAATCACGGCCATGCAG
AGGTAAGTCACCTTTGCATGGATCAGTCTGGATATGCAGCAGGCATCTCCCATCGCTAACTCGGTGATATGGCGTGGCCGGCTCT
TTGCCATTCTGGTGTCGCTCCCACTGTTGGACACGTCCCGAACATTGATCGGTGCAACATCCACCCCGCCCTGGCTTCTTGAATTGTTATTATCCAGGAGAAAAGCAACGGACTCCTCGA
AGTGACTGCTCTCACCAGTGAAGGAGGTCGTCGTCGGAATGCTCCCGGTGCAGCCTTCCGCAAGGCCCGTGACATGGCCTGGAATGTCTTCGGTTCACCCGCGTCTTCAAACACCAGCTC
CGCTAACCCCTTCACTCCTGCGGGGTACGGTGCCACGATGATTTCGGGGTCATCAAATTATCCCCCGAAGACGCAGTGGGAACTGTTGGTTGATCGCTCTTTCGTCGCCATTTGAACGCG
TAGGCGTACCATATTTTTCAGTATTCCGCGCTTGAAATCCGGGATTTGATCTGTCGGTCCGGCTTCTCCGCGGGTCCAGCCTTGTGTGTGCTCCGTTAAAATTGGTCGGCGAGGTTCCTG
CGAAGGTGCGCTTGAACAAGTAGCGGGCGCAACAAAATTATTATCTACCGCCTTTGCTCCATTTCCGTTGGAGCGCTTGCCTCACCTTTGAACCTCGAAAATCACTTGCTCTAGATCAGA
TGCCGGCACGCACTTGTGGTTTCAGAGCACACCTCTCTCAGAACCAGGGTACGACGAGACCCCCACTGCCAAGCGGCCCATGTGCAATGACGTTGCTACGAATACTGCCGTGTTTTACCC
GTCGCCAGGTCCAACCTCGACAGCAGCAGCTCCGGGAAACCGAAACCTCCACCCTATTCCACGCTGGCCCGAGACGGTGGAAGGGACGAGCAGCCACCCCCGTACCGCACCCCGGGTCGT
GAAGAGAAGAAGCGCTCTGCTGCAGATGCAGGGGCCAAGAAAGAAAGGGCCCCGACTGGCGTTGGGAAGCTACTGACCACGAGG
GGGTGAGACGTACCACGTGGCAGGGACACATTTGCA
ATATGCGTACGTACGTGACACCTACGCCCAGACCTAGTAACACGGATGTAGTTCTGGCGTTACCCAGGAAGTTGAATAGTGAGCGCAGGACGCATCATTTTGTCTGTGGCTAAATAACCT
GGTCTCAGGGATGCTCGCACCCAAAACGAGGCAGCTACGCCCAACAAGGACCACCAGTGGTCGCTCTCTTAGCTGATTGGTCGTCTCGATTACCTCGCAGCAGATTGCCGTCCTTCTCTC
ATAATCGCCCCGCTGAAAATTCAGTGTGAGGAAAGTTTCTGGAGAACAGTGACAGGCAGACGCAGAATCCTTGCGTTCCATGCGAAAAGGTAGATGTGACGTCCAGACCAGTACGCTCCA
AGACTTCTGGTTGTTCGAGGGAATGGGCAAATTGGCAGAATTGAACGAGTCGTTCGCACGATCGCAACGAGATGGCTTGCCGAACCAACCAATGACAGTGCAGCTCCCGAAAGCGCTATC
GTACGAGAAAAGCCCGAGTACTCTCCGAGGGGTACTGTTGGATGAGCAAAGTGGGCGGGGCGGGGCATCTTCGAGTGGTGGATGACTACTGAGCAATAGGAGTGCCGTGTGTCCCGGGGA
CTTGCCTGCCTCACTACGGAAGCTATGGGGATGTTCTAACCACTGCATCATGTATGTCGACCGTATGTTTCGTGATGCTCGTGGCGCCGTGCGTGTTTCGATGGACGGCTGTTGTTCATG
CTGCTGCAGACGTGGCGCAACTCGAAGTGGAATGAGAGCCCAACGACAAGCGGTCGCCGTCGCGCACCTATGCAGGACACCAGCGCGTGAGCTTCAGACATGTGCATGAGTTAGGAGTAG
CGACCGACAATCAAGTTTGAAATGCCAGAAAAGGGCTCGTTGAGAGTTAAGGCTCCACATCGTGAGAAGGTTGGAATCTGCATCATCTGCGAGGCGTGGTCTCAGGTGAACCTACTCCCA
TTATCTCCCTCTCGCTGTTCTGGTTTGTCATCGCCAGAACTATCAAGTGGGAGATGGGCTGGTTCGAGGCAGGGTGAACCGGGAGATCCTCGAGACGGTCTTCGGCGAAGCCCTGATTTC
ATCAATCGCGGCGTACAACAGGCAGCGAGCGCTGAACAGCCTTGCTTGTAGCGACAGCGACGTCGGGGACAACAAAGAAGGGGAAGGCGAGGATGATCGCGACCTCATAGGCACCGACCA
AACTGCCGGGAAGCTTCAG
AGGTGGCCATCACGAGTAGTCGTAGTTTCTTGTCGTTAACAATTCATATAAACGATGATGAAAACCTCGCCACAAACCTGAGAGCTAGGGTGGCTATGGTT
GTGACGATAAAGTGTAATCCAATAGAAACGGTTCTGCCGTTGGTTGTGAAGAAGGCAACAGCAATAACAGCACGAGCAACGTCTAGTCTAGAGGCCTATCGTTTTTCGAGGATTTCTTTC
GATGGAGCACGTCGAGTGTGTGGCTACCTCTTCGTGCCAGGGAGAACGGCGAGTTTCAGACTCGCTTTTGACCTGTTACTCTCCCGAAGTGAACATTTCCTCCTCGTGGACAGTCGGTCC
AATGGCGCAATCTTGGTGGGATACTTCTGGCAGTATCGACTATTCGATGCTGGACAGTGAATGGTCAGGTATTCGTACATGACCGCCTGGAGAAACACGACGAACGTGTCCCGTCGAATC
CCCTGTGCTCTGATGTCAGGTGGTCGTTTCTGGACCTACCGCCTTTGTGGCCAACGTGAAGCAGCTCCTTACCGAAATGGGGGTCCCTGCCGGCTCCACAGTCTTGCTTGACTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAGACAATCCTACTTGGGGCGCTGCGGGGACAACCCAGGGAGTTGATGACGGACGTGTTCCTGTCGTCGCCACCGTCCGTTTCTGATATGAATGCGGTTTCCATCGCTTGGGGACAG
CTGCTGCTTCTCGATTTGTCCTACACCGTCGACAACAGCTCGGAACCCTTCGAAATCGCTTGCGATGACGGAGGGGGCTCCGTTGACGTTTGGTGTCCTCTGGGGGAGGCATCTGATCCA
ATCCCTTTTTTCAGATCCCAAGCTACGGTGACTGATTCTGTCCGAAACCCGATCAACTACGCCTCCTCGTTTATCGACCTGGACTTCGTGTATGGAAGGAGCAAGGACGCTGCCGATGCC
CTTCGTACCTTTGATGGCGGAATGCTGAGTATGGCAGATGACAACATGCCCATCAAGAACAGCGACGGGACTTGGTTG
ATCGCGGATCAACGGACAGCCAGATTTCCGCTGACGTTCGCC
CTTCACGTCGTTCTCCTCCTCGAGCACAACCGCTGCTGCGTTGACGTCGCGCCAGCTCTGAACTACACGAGTGATGAG
GATATGTATCAAGCTTGCCGAGGCTGGACGATCGCCACATTT
CAGCACATAACTGAAGATGAGTTCTTGATTCTACTAATGGGAAGAAGCATCGGCGACTGGACCGTGTACCAAGATGACGATGGCGGCAGCAGACGCAGGCTATCCCCAGAACAGCGGAGG
GAGCTGTTGTTTACTTTCGACTACTACGACGACCTCTTGAACCCGTCCGCCGACGTCTTCGTGACGGTGGCCATGACCGCCGCGTTCGAGTCGGCGCTGCCTTCTACCCTCCGCATTGTG
AGCGAGGGGTATGTTGCCACTGACTACGATCACCTTGAGCTCACCGTTGCGGCAGAAGATATCACGGGCCTGTTCGAGCACAGCGCGATCGGGGACATCCTGCGTGGAGCGGTTTTGTCT
CCCGCGATGGCGGTATGGCCACACTTCGCTTCGGCGGTGTCGAACGCCTCGCCGTTGTTCAAACTCCCAGTTGACATGGTGCAGCGGGCTCGTGATCATGGGGTGCCATCGTACAACGAC
GTCCGGGAG
GCATATGAACTTTCCAAAGCCACGGCCTTTTCGGATGTCTCGGCGGACGACGACGTGGTGCAACTCCTATATGCTGCGTACGGTGGAGAAATCGAAAACCTCGACGCGTGC
GTCGGGGCCTTGGCGGAGGAGAAGGAGGCGAGCCTAGGTGGCAATTTCGGTGACCTGCTGCACACAGCATGGGTGAACCAGTTGTACAGAACATTCTTCGGGGACCGGTACCACCACCTT
CACTCGAGGCCGATCGAGAACGTGTCACTCGCGTCTATCTCAGGACTAATCAATCAAACGCTCGGCGTGACTGACCTGCCGGCATCGGGTTTCACGGTGCCGGAAGTTACCGTGTGCACC
GGGGAATGCGAAGCTGCGGGAATATCGGGGGTCTCGTTGGCAGAGCGCTACGCCATGTCGTGGGAG
GTGATTGATGATCAAACGATATCGATTTCCTTGAGTGTCCTGGGTATCGGCGAC
AGCGGGATGATGGGCATCGGCTTTGGAGGATTGTCCATGACGGATGCCCAG
GATTTCATTATCTGTGAGGTCTTCTCAACGGGCGGTGCCGAATGCATTGACCGGTCTCCCACTGGAGGG
CGATCGGAGCCGCAGCCGGACACTTTGCAATCGGGCCTCCAAGTTACCAATGTAACGACGGACAAGACCTGGACTACCGTGACGTTCTCGAGGGAACGGGCGACGCTGGATGCAGAAGAC
TACGACCTCTTTGAG
GATATCGAAAACGAAGAGGACACCCTCGTCATCTACGCGTTCAAGAAGGGAGAGGGCGTGGGCCAACATCCAAACACCAATCGTGGAGCGGCCACGATAAACTTC
GTCACGGGAGACGTAGACACTCAGTGCGACGGCGAGACCAACTTCGTTTCGTTGCACGGCGCGCTGATGCTCATCGCGTGGATGCTCATCGCACCGTGGGGCATCTACTACGCGAG
GTAC
CGCAAGGGCGACGCGATTAAGTGGGCTGGACGCGAATGGTACGAGATGCACGAGGACATCATGATCGTTGCCTCCGAAGCCGTGCTCCCCCTCGGGATCACCGCTGTATTTGCTTCCAGG
GGCAGGACGTCGGAAGCACACGCGCACTGGGGGTACTACATGATCGCCGCGGTCGCAATGCAGATCTTCACGGGCTGGATGAGGACCAAGGGATTAGAGGCCAAACACTCGAACTTTTCC
TTGCTTCACAGG
TTCAACAAGCACTTCCACATCTGGGCCGGACGATTCGCATACGCAGCCGGAGTGGTGCAGTGCTACCGAGGGCTGGAGCTCGTGTCCTCGGACGATGAGCTCATATTC
TCCGCAGGCGATGGCCTTGACCTGCAG
CTCGGCAGCTTCGGTTGGGTCAAGGACTACCTGTTCCCTGCCTGGTTTGCGCTAGTCGCCGGCGGCTTTCTTGTCTTAGAAGCACAAAAGCAA
TACCAGCGATTCTTCAAGAAAGGGGCGGCTAGCGTGTGTGGAGTTGTGTCTATTGTCAACGAGCTACACGATGGCTCAATGCACAAAGGCAGGCTGATTCCGAGAACGTTGGACCTGCCT
ATCTACAGCGTTGCAGCATTCAATGACAAG
GTGTTGAGCGGCCAGTCCTGGTTGATGGTGGATGAAGCAGTTCTGGATGTGTCCGACTTTGCGCAGAGGCATCCAGGCGGTCGACGCCTC
ATCCTCAACGCCTTGGGCACCGATGTTACTCAGGAGCTGATCGGACAAGAGAACTCTGTGGGGCATGCCATGTCCTTCCCGCCTCACGTGCACACCGGG
AGTGCATGGCGAATCATTCGT
TCGCTTGTAGTCGGCTACATCGAGGAGAAGGACGCTGCGGAACCTACGGCTGCCTTGGAAGATGACCAGGAGCAGGAGGGAGAAGAGAAGGTCGATACCACGACCGGTGATATTCCCGTT
CCTGACGCAAACAACCGAAGGTTTCGTGTCGCGGGCAAGGCGGTAATGCTGAACAACCGACTGGCCTTGGGCGACGATACTCTG
GCAACCAAGGCCATGCGGCTGAACGACTTGGCTGTC
ATTCCCGCTCCAACCAGGCGCCCCTCGAGAACGGCAGCGTCGAACAACCCAGCAAGCGTTGTGGGCAACCCGCCCATAGAAATGGACGTC
GCTCAGCGCGCCGATGAGAACGACGGCGGA
TGGGGAAGCAGCACGGACTTGCTCGAGCGATTTCAAGTATGCCCCCTGCTCTTCCGAGAGAGGATGGGAGCCGCCAGCGCCGTGGGCCGCGGTCACTTACCCAGCAAGCGCCCGGTCTAC
AGATACATTTTTTCTTGTCCCGCTAAGGCGCAAGCCCAG
GCGCAGGCGGTTTCAGGGGTCTGCTACTTCAACATGCGCGCCCAGGAAGAGGGAAAGGGTGTTGTACAGCGGCCGTACAAT
GCCTTCGCGGTGAGGTTGCTGGACGTCGAGCCACCCACACCGGGCGGAAGGGTGGCCTGGAGCAAGACTTCTGCGAAGTTGCCTAAG
GTTGTGCCGGCAGAGGAGACTACGGAGGGCGTA
CTGTGCATCGAGATGCGTATCAGGATGTACCACGACGGGGCCATGAGCAAGCTACTAGAAAAACTCTCCAAG
GATACGGACAACGTGGCAGTCCAGCTGCAAGGACCATTCCTCGTCAAC
AAGCTCGCTCCGCCACCGGCCCACCGCAACGTCATCATGATTGCGGCGGGCACAGGCGTCAACCCAA
GGACACCACGTTTTCCACGAGGTCGCGGCTGGTCCTCGTGTGGCAGAGCACGA
CGGAGGCTGACTTCTACGGTACCGACGAAATCACGGCCATGCAG
GTCCAACCTCGACAGCAGCAGCTCCGGGAAACCGAAACCTCCACCCTATTCCACGCTGGCCCGAGACGGTGGAAGG
GACGAGCAGCCACCCCCGTACCGCACCCCGGGTCGTGAAGAGAAGAAGCGCTCTGCTGCAGATGCAGGGGCCAAGAAAGAAAGGGCCCCGACTGGCGTTGGGAAGCTACTGACCACGAGG
ACGTGGCGCAACTCGAAGTGGAATGAGAGCCCAACGACAAGCGGTCGCCGTCGCGCACCTATGCAGGACACCAGCAACTATCAAGTGGGAGATGGGCTGGTTCGAGGCAGGGTGAACCGG
GAGATCCTCGAGACGGTCTTCGGCGAAGCCCTGATTTCATCAATCGCGGCGTACAACAGGCAGCGAGCGCTGAACAGCCTTGCTTGTAGCGACAGCGACGTCGGGGACAACAAAGAAGGG
GAAGGCGAGGATGATCGCGACCTCATAGGCACCGACCAAACTGCCGGGAAGCTTCAG
GTGGTCGTTTCTGGACCTACCGCCTTTGTGGCCAACGTGAAGCAGCTCCTTACCGAAATGGGG
GTCCCTGCCGGCTCCACAGTCTTGCTTGACTGA
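The CDS and the protein sequence of this entry can be checked against each other by translation. The following is a minimal sketch, assuming the standard genetic code; it translates only the first 45 nucleotides of the CDS as an illustration, and the helper name `translate` is hypothetical.

```python
# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# First 45 nucleotides of the CDS of this entry.
CDS_START = "ATGAAGACAATCCTACTTGGGGCGCTGCGGGGACAACCCAGGGAG"


def translate(dna):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


protein_start = translate(CDS_START)
print(protein_start)  # MKTILLGALRGQPRE
```

The result matches the first 15 residues of the protein sequence listed above. Applying the same function to the full CDS should give the complete protein followed by `*` for the terminal TGA stop codon.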
