Entry information : EsilPxd02 (Esi_0083_0090)
Entry ID 16971
Creation 2021-02-04 (Christophe Dunand)
Last sequence changes 2021-02-04 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Peroxidase information: EsilPxd02 (Esi_0083_0090)
Name (synonym) EsilPxd02 (Esi_0083_0090)
Class Peroxidasin    [Orthogroup: Pxd003]
Taxonomy Eukaryota Phaeophyceae Ectocarpaceae Ectocarpus
Organism Ectocarpus siliculosus    [TaxId: 2880 ]
Cellular localisation N/D
Tissue type
Inducer
Repressor
Best BLASTp hits
Perox score E-value EsilPxd02
start..stop
S start..stop
EsilPxd03 2540 0 1..1489 1..1490
EsilPxd01 1052 0 282..1268 356..1344
EsilPxd01 296 8.17e-82 14..220 86..302
EsilPxd01 62 0.00000000213 1332..1489 1445..1627
EsilPxd04 823 0 17..885 1..805
EsilPxd04 256 1.29e-69 1007..1268 849..1095
EsilPxd05 513 2.2e-164 14..678 91..763
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '16971' 'join(501712..501752,502283..502679,502949..503068,503624..504154,504484..504900,505194..505295,506981..507184,507452..507672,508088..508343,508684..508818,509213..509461,509966..510154,510529..510756,511210..511347,511662..511850,512422..512478,513295..513405,513652..513756,514191..514305,515025..515121,515951..516125,516975..517049,517241..517462,517779..517874)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 501712..501752 39 N° 2 502283..502679 395 N° 3 502949..503068 118 N° 4 503624..504154 529
N° 5 504484..504900 415 N° 6 505194..505295 100 N° 7 506981..507184 202 N° 8 507452..507672 219
N° 9 508088..508343 254 N° 10 508684..508818 133 N° 11 509213..509461 247 N° 12 509966..510154 187
N° 13 510529..510756 226 N° 14 511210..511347 136 N° 15 511662..511850 187 N° 16 512422..512478 55
N° 17 513295..513405 109 N° 18 513652..513756 103 N° 19 514191..514305 113 N° 20 515025..515121 95
N° 21 515951..516125 173 N° 22 516975..517049 73 N° 23 517241..517462 220 N° 24 517779..517874 94
join(501712..501752,502283..502679,502949..503068,503624..504154,504484..504900, 505194..505295,506981..507184,507452..507672,508088..508343,508684..508818,50921 3..509461,509966..510154,510529..510756,511210..511347,511662..511850,512422..51 2478,513295..513405,513652..513756,514191..514305,515025..515121,515951..516125, 516975..517049,517241..517462,517779..517874)


exon

Literature and cross-references EsilPxd02 (Esi_0083_0090)
Protein ref. GenBank:   CBJ27733.1
DNA ref. GenBank:   FN649064 .1 (501712..517874)
Protein sequence: EsilPxd02 (Esi_0083_0090)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1489 (312)
PWM (Da):   %s   162486.93 (33571.4) Transmb domain:   %s   o654-676i729-746o
PI (pH):   %s   4.78 (5.12) Peptide Signal:   %s   cut: 26 range:26-337
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MKTILLGALQGQPRELMTDVFLSSPPSVSDMNAVAIAWGQLLLLD
LSYTVDNSSEPFDIACDDGGGSVDVWCPLGEASDPIPFFRSEATVTDSVRNPINYASSFIDLDFVYGRSEDAADALRTFEDGMLSMAD
DNMPIKNDDGTW
IADQRTARFPLTFALHVVLLLEHNRCCVDVAPGENYTGDEDIYQACRGWTIATFQHITEDELMILLMGRSIGDWTVYQDDDDGGRRRLSPEQRRE
LLFTFDYYNDTVDPSADVFVTVAMTAAFESALPSTLRIVSEGYVATDYDHLELTVAAEDITGLFEHSAIGDILRGAVLSPAMAVGAHYASAVSNASPLFKLPVDMVQRGRDHGVPSYNDV
RG
AYGLPEATDFSDVSSDGDVVQLLDAAYGGEIDNLDACTGALAEDKEASLGGIFGYLLHTAWVDQLYRSLFGDRYHHLHSRPIENVSLVSISQLLNRTLGLTALPESGFTVPEVTVCTG
QCEATGTSGVSLAERYGISWE
VEDETLLISLSVLGIGDSGMIGIGFGGLSMTDAQDFIICEVFSTGGAECTDRSPTGGRSEPQPDTFQLGLEVTNVTTGGGWTTVKFSRERATLDAEDYD
LFE
DIENEADTLVIYSFKKGEGVGQHPNTNRGAATINFVTGDVDTQCDGETSFVSLHGALMLIAWMIIAPWGIYYARYRKGDAIKWAGREWYEMHEEIMIVASEAVLPLGITAVFASRGR
TSEAHARWGYYMIAAVAMQIFTGWMRTKGLEAKHSNFSLFH
FNKFFHIWAGRFAYAAGVVQCYRGLELVSSDDELIFSAGDGLDLQLGSFGWVKDILFPAWFALIAGSFLILETQKQYHR
FFKKGAANVCGVVSIVNELHDTSIRNNGGRLIPRTLDLPIYSISAFNDK
VLSGQTWLMVDEAVLDVSDFAQRHPGGRRLILNALGTDVTQELLGQENSVGHAMSFPPHVHTGSAWRIIRS
LVVGYIEEKDVGEPAAALEDQQEQEEAEEKVEPSTGDPACSGTNNRKIRVAGRAVMLTNRLALGDDNL
ATKAMRLNDLAAIPACIPAPTRRPTNLVATSSSASRFGNQPKEIDAPQRVDD
KEGVLGSNTDLFERFQVCPLLFRERMGAASAVGRGHLPSKRPVYRYIFSCPANGQAQ
AQAVSGVCYFNMRAQEEGKGVVQRAYNAFAVRLLDVEPPTPGGRVAWTKGFAKLPKIVPAGET
TEGILCIEMRIRMYHDGAMSKLLEKLSQ
DTDNAAVQLQGPFLINKLAPPPVYRNVIMIAAGTGVNPRTPRSPRGRGWSSCGRARPRPTSTVPMKSRPCKFNLDSSSGKPPPYRTPSRTGH
GDEQPPPYRTPGRDGSRPADGKEKRTTTGVGKLLPK
KTWRNLRRSDSSTPGAHRRPPIQDSSTYQVGDGLVRGKVNREILETVFGEALTTPIAADNRQREVKSLLRSDSDISDEKEGQDE
DDGDLTSTDQTSRKL
QVVVSGPSEFVANVWQLLDQMGVPSGCMVLLD

Retrieve as FASTA  
Remarks
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAGACAATCCTACTTGGGGCGCTGCAGGGACAACCCAGAGGTGCGGCGACAATCAACAACAATGCGTCCATAACCTACCACATTTGCCCCTTGTTATCGCATGTCTATCCTAGAGTT
GCACATTGCAGGGATCGATGATCATCGTGCTCCTCCTCGAGAGGAACATTACCAGAGGTTGGGGTTTGTCAACCCTGTTTCCGAATCTAGCCAAGTCATGGTCCTTCTACCTACCCACCT
CGCGTTCGGCTGCACCTGCCTTCTCTTGAAATTTCCCCGCGGCCTATTTTGCAGATCCGTGCGGAAGCTACTGATTTATATGCGGACGCTGCATTCACCCCGCTGGAGGAATTTAGGCCG
ACTGCGAGGCAAGTACGGCCGTAAGAGTATTTTGTTTCATGTGAGGTGTCATGCCAAGAACCAGTTGTTTCTAACTTTGGTTTGTTGGCCCTACCCTAACTCGGCGTTCGGTTTGGAAGG
CGTCGTGGCATTTCTCCGCTATTTGCACCTCACCCAACCATTACCACCCCTTCTCTGCTCCACCCACCACTTTTGTCCCACAACCGCTGACAGGGAGCTGATGACGGATGTATTTCTGTC
GTCTCCACCATCCGTTTCTGACATGAACGCGGTTGCCATCGCTTGGGGACAGCTGCTGCTTCTCGATTTGTCCTACACCGTCGACAACAGCTCGGAACCCTTCGATATCGCTTGCGATGA
CGGAGGGGGCTCCGTCGATGTTTGGTGCCCTCTGGGGGAGGCATCTGATCCAATCCCTTTTTTCAGATCCGAAGCTACGGTGACTGATTCTGTGCGAAACCCGATCAATTATGCGTCCTC
GTTTATCGACCTGGACTTCGTCTATGGAAGGAGCGAGGACGCTGCCGATGCACTTCGTACCTTTGAAGACGGAATGCTGAGTATGGCAGATGACAACATGCCCATCAAGAACGACGACGG
GACTTGGTTG
TGGTAGGTTGACCAGGCCCGTGTACTAGTAGAAGTCTCGAACAGCAGTGTCGAGCGCGCTGCAGAAGATTATCCAAGTCGCCAGAACCAACATGGCCTATGCGTCGTGTC
TGCGACGAGGAACGAAATACATCGGCGCTGCGGTGAATAAGACGTACCGACGTTATTCTATTGCCGGGGGCCATCCACCGATAATTTGCTAGCCAGCTACCCCCTCTATTGACCGCCAAG
TGTTCTTCATCTTGCCGCTACCACCTTGTCACCGACGGTAGATCGCGGATCAACGTACGGCGAGATTTCCACTGACGTTCGCTCTTCACGTCGTTCTCCTTCTCGAGCACAACCGCTGCT
GCGTTGACGTCGCGCCCGGCGAGAACTACACGGGCGATGAG
AGGTTAGTCCTGCTAGAGACAGCCTCTTCTGAGGAACCATGCGGTCGTCCGAGTGACATGTGATTGCTTGTTTGCCGCG
GTGTGATAGAGCGCCGTCATGTTGGCTTTGAGCCTGGATAGGGCGGTTGTCTGTGAAGCAATTGGTGCCACGGCGATTTTTTCCGAGAAAGTGGTGTGATGTGAGTTGCCTTACCCACCC
CTCCCTCCATTCCTAACAGCAACTATTGTCAACAGTGGCGGTGGTCAGTTGTTCCTTGTGGTCGTATCAGATGATGTGGCCATGCCCCACTGACGAATCGTGTCGATGACAACATTGGAA
AGCCCATAGTACGTTTTCGACGAAGACCGCCAGATAGAGTAGCAACTGTACGACCCAGCAAGCCCCCGTCTGAAATCGCCAGTATTGCGGTTGAATGCTGAGTATTGCCCCTTAAACAAA
GCCCACGAACACTTTGCAAAAAGAACCGCTGATTCCTCTCTTTCAGAGTTTCGTTCGCAGTACCAGGTAGGACGAACCCTCTTGTACGCACACAACGATCCCCTTTTGGGCCATGCAGGA
TATTTACCAGGCTTGTCGAGGCTGGACCATTGCAACATTCCAGCACATAACTGAAGACGAGCTTATGATTCTACTCATGGGAAGAAGCATCGGCGATTGGACCGTGTACCAAGATGACGA
TGACGGCGGCAGACGAAGACTATCCCCAGAACAGCGGAGGGAGCTGTTGTTCACTTTCGACTACTACAACGACACCGTGGACCCGTCCGCCGACGTCTTCGTGACGGTGGCCATGACCGC
CGCGTTCGAGTCGGCGCTGCCTTCTACCCTCCGCATTGTGAGCGAGGGGTATGTTGCCACTGACTACGACCACCTTGAGCTCACCGTTGCGGCGGAAGATATCACGGGCCTGTTCGAGCA
CAGCGCGATCGGGGACATTCTGCGTGGGGCGGTTTTGTCGCCTGCGATGGCGGTGGGGGCGCACTACGCTTCGGCGGTGTCGAACGCCTCGCCGCTGTTCAAACTCCCTGTGGACATGGT
GCAGCGGGGCCGTGATCATGGGGTGCCATCGTACAACGACGTCCGGGGG
GGGTGAGTCCTAATTTTGCTCAAAACGTTTGTGAGGCACCGGTCCCACCAACCTCGCCGTTTGCAACAGCC
AAGGTGAGGGTGAAGAGTGAGCGCTCGGTCGTACCCGTCCGTCCGTGGGATCTTGACTATCCCCTGTGTCGAGGACCTGACTACCACACAAAGGGCAGAATGGCTCGCGCATCGACCGGG
GGGAAACGTTGTAGTGCAAACCCAAGTCGTCAGTGCCGCAAAGAAGCCCTCCCACCCAAAACACGTGTTCCCTGTAGTACTGCGGGGGGAAATGACTTTTTCGGAACACTTGCTGTCGGC
GTTCGCGCGCGCCACAACAGGCGTATGGTCTTCCGGAAGCCACGGACTTTTCAGATGTATCGTCGGATGGAGACGTGGTGCAACTCCTAGATGCTGCGTACGGTGGAGAAATCGACAACC
TCGACGCGTGCACAGGAGCCTTGGCGGAGGATAAGGAGGCGAGCCTAGGTGGTATATTCGGCTACCTGCTGCATACCGCATGGGTGGACCAGTTATACAGATCTCTCTTCGGGGACCGGT
ACCACCACCTTCACTCGAGGCCGATCGAGAACGTGTCACTCGTGTCTATCTCGCAGCTACTCAACCGCACGCTCGGCTTGACCGCCTTGCCGGAGTCGGGGTTCACGGTGCCGGAAGTCA
CTGTTTGCACCGGGCAATGCGAAGCCACGGGTACATCGGGGGTCTCGTTGGCTGAGCGCTACGGCATATCGTGGGAG
AGGTACGCGACGTCGTGGCCGGGGCTTCACCTTTATGTGTCGA
CCACCCATACTTGACCAGAGTACCTTATGATGATGGGGATGGTTACGCAGAGGAAATTCTCCGCCATTTTTTGGCAAGCTGTCGTCATCGTCGCGAAAGGGGTTCACCTGTTTCCTGTTC
GTCGGTTGCTTTACAGTGTACTTTCCAGCGGATTTTCTCCAGGTTGCTAAGAAGCGTCAGTCATAGCGAATGAGGAGCTCTGGGCGCGACTCCCTCGGCCATGTACCGCTTGAAACCCTC
ATCCTTGTACAGGTGGAAGATGAAACACTACTGATTTCTTTGAGTGTCCTGGGCATCGGCGACAGCGGGATGATAGGAATCGGCTTTGGAGGCTTGTCCATGACGGACGCCCAGAGGTGG
GAGAAGCAGGCTGCAACATACATGTCGATAATGTCTGAACATAAGTCGCTGCCATCATCCCCCCCCTCACCCCGTTGAGCTGCTCGCCCCGTCCTCCCCGTAATCGTGCCGGGATATCGT
CAAAGCTTTGAAAGGGTACCTCGGCCAATGTGTCTGTTCTTCAGAGGGAACCTATTTTCATGTCACAACCGAACGAGGCGTTTGTCAGTGTTGAGATCAGCACATGTCGTTGCCGTTGTG
TTTCATGTCGTGGCTGAAAATCAGTGCTGTGCAATGAAGTCGACTGCAGGGATTGACAGAAATTCTGCGCAAATAGGTTTGTTTTCCAAGCGGGAGGGCATTTTTTCCCGTCCGGGATGT
ACACGGATGTTTGATGCTCCCCGTTTTGTCGGTGTGTCGTGGAGGCAAGCCAAACCTACCCACATGTCGTGCGACGTGCCTCCCCAAAACGGCCAGGTCAACCCTGGTTTCGACATGAAT
ACCGTAATTGTCCGTGTTATGGCACCGAAATGCGATAGTTTTTTAAATTATTCCCATTTTGGTTTTAATAGCACCCCGGTGCCATAATGCCGATGGGTGCCATACCAGCGATACCTGTGC
GGAAATGGAATTATTTAGTTCTAGATTTCCAACGCACCCCCGGTGTGATGACACCCATGGGTGCGATACGGAAAAAATCACGTGAGTGGCCATTTCACGGTGCTGAGTGAACGCACCCCG
GTGCGATGACATCGATAGGTGCGATACTCAAGAAACGAGTACAGCCGTTTCCTGGTACGAGTACCAAAATTTTGCGTGTTACGGTACTTAACAAATATCAAGCGGCTGATGCGAGTTTGG
GGTAGTAGCATGTGCCAAATATAACAGACGCTGTGGGGAAATTATTGCGGTACGGTATCGTTTTTCTTTCAACCTCTTTGTTGGTCGCAAGCCCCTCATTTTTTGGCTGATGCTTCCATC
TGTCTTTGTCTGCCTCAAGGAGCACGGTACCGGTACAATTGTAGTGCCGTTTACAAACTCCTTCTTGTTTGGTTGGTTGGAGGTGCGGGATGTGTTCTGTGTGCTTGGCTGAGTGAGTGA
GTGCATTCGATGTACGGATAACCGTAGAAAAAAACATGAATCATTTCTACCGCACACCTCGACGGATACATACATACATGCATGGCGCCCCAATTTTTACCTATCGGACCCTCTCAGCCG
TTCGCTACATGCAGTTCGACACGTGGCCTGGTACCTTTCGGAAGCACACAAATATACGTCCTAAGCAGATTTTATTTCGTTGATATCGCACCGGGTGCGTTACACAGACACCGGTGCGGA
AATGCAATCATATCACGCAGTTCTCGATATTTTTGCGCCCCCCGGTGCCATAACGCCGATGGGTGGAAAATCAACGATGGGTCCCATAACACGCACACGGCATTGTCTGGTCATTACGTC
GGTGTTTGCGATTTTTCCGACAACGGACTTTCGTCTCTTTGACGACCGTACACTCAATCGTACTCCGTGCACGATGGGAAATAGCCTGAAGTACGAAGTCATCAATCAGGTTGTCTACCA
GCGCGCGGCAGCTCTCTAATCACGTCTTGCAGCATCCTGTTCAACACAATAGTCTACTTGCCGCCGCCTGCTCCGCTAACGCTCACCACTGGTGTGTTGATGCATATGGTGCCCTCCAAA
GGATTTCATTATCTGTGAGGTCTTCTCAACGGGCGGTGCCGAATGCACTGACCGGTCTCCCACTGGAGGGCGATCGGAGCCGCAGCCGGACACTTTTCAATTGGGCCTCGAAGTTACCAA
TGTAACGACGGGGGGAGGCTGGACTACAGTGAAGTTCTCGAGGGAACGGGCGACGCTGGATGCAGAAGACTACGACCTCTTTGAG
AGGTGGGTACTACACCTGCTGTGCGTTTTATCCTT
TGTGTTGACGCGCCCGTTGATGACCGTCTTGTCTGTTGGGGAGTTCGTAACGCGGGTTCCGGTATTCCCTACCTACCGCCAGTGATGCGGCCGCTGTGAATACATCGCCGATGTAAGGAA
GAGCGTGCCCGGCCTATGTCAACCTGCATGCATGAACGTTTTGATGTCAGTACAAACGAGATCCGTTTTGCGGCATGCATGTCGCCCCGCATGCTGTATGTTCGTCGCTTATAGGATATC
GAAAACGAAGCAGACACCCTCGTCATCTACTCGTTCAAGAAGGGAGAGGGCGTGGGCCAACATCCCAACACCAATCGTGGAGCGGCCACGATAAACTTCGTCACGGGAGACGTAGACACT
CAGTGCGACGGCGAGACCAGCTTCGTTTCGTTGCACGGCGCGCTGATGCTTATCGCGTGGATGATCATCGCACCGTGGGGTATCTACTACGCGAG
AGGTGAGCTGTATCGATACGATTTA
CCTGCCTTTGATTTTGTGGTAGTCACTGTCTGAGGCCAGATCGCGACCTGAAAGCATGCCATAGCATCCAAATCTTCCCGTTCTCGTAGCTGGCTAGGTGAACTGCAACATCACCTTTCC
AGCCGGGAGTGGGCAGTATTGGGTGACTGGGCAAGACAACTCAGGTCGAACACATTTATAGCTTGCATATTAGGGACTGACCCGCCATACAACATATGCACCACTTACCCAATCATGGCG
TGTTTCCCCGGCGTACCGATCGCTCAGCTGTAGTTGTATCGCGCGTGCTCTCGAACTGAAAACCACGGGGGGCGGTTGCCTCACGCACAACTATTTTCACTGAGACCACTCTTGTCCGAA
TGTTATTCATCATGCCGTACACATCATCCCAGGTACCGCAAGGGCGACGCGATTAAGTGGGCTGGACGTGAGTGGTACGAGATGCACGAGGAAATCATGATCGTTGCCTCCGAAGCCGTG
CTCCCCCTCGGGATCACCGCAGTGTTTGCTTCAAGGGGCAGGACCTCGGAAGCACACGCACGCTGGGGGTACTACATGATCGCCGCGGTAGCAATGCAGATCTTTACGGGCTGGATGAGG
ACCAAGGGGTTAGAGGCCAAACACTCGAACTTTTCTCTCTTCCACAGG
GGGTGAGATGAACACGACATGGCGGTTGTACGCAAGGCGTGTGCACAATTTTGGTTCAACGCGTCGAGCCTG
ATGCGATTGCGCACACCAACAACGTCGAGGCTATTCTGCTGCGTAGTTAGATCGACTCGGACTAGTCGGGTACCTTGTTGGAGGAGAATCTGTCAAGTTTCTGGCAGCCGCCGTACGGTT
CGACTGATGCTTTTGCTAGTCAGATCGGCCTTGCTGCAGTGCTTGAAGATACGCCCTTTTCTTCTGAATACCACACCAAGGCTAGGCAACGTTAGACACGTTGGAGGGCATTGGCAACAA
GTCGAAACTTCTCCTTACTAACTTGCACAGTTCAACAAGTTCTTCCACATCTGGGCTGGACGGTTCGCATACGCAGCCGGCGTGGTGCAGTGCTACCGAGGGCTGGAGCTCGTGTCCTCG
GACGATGAGCTCATTTTCTCAGCAGGCGATGGCCTTGACTTGCAG
AGGTGAAGCCATGGGAGCGACTCGTCAACCGCCGTCACCAACAGTGCAAGACGCCCTCCTTGTACCTGCAGTGAA
ATACCTGCGGCGACTTTGGGTGTTCCCAGAATGCCGGGGTATGCTCCAATGGTATTCTAGGGTGTGTGGGACAACTTGTGTCGTTGAGAGGTACTGCGATTACGTACAAGAGCAAAGGAC
ATGGCAGCGACCTCGTTCGCTGGAAATATTCACCTGCTTCAGTATCGTAGAAGCGTGTCAAGCACATTTTTGTTTGCCTTTTTGGCATTCATGGATATTGTAACAATCTCCTATCGGATC
ATGCTGGTGGGGTCACAAGCACTTTCAACTTCACTCAACTCCTGCTGAAGTTCTGTTGCCCCTGATTCGACCGTCTAACAGCTCGGCAGCTTCGGTTGGGTCAAGGACATCTTGTTCCCA
GCCTGGTTTGCGCTCATCGCGGGCAGCTTTCTTATCCTAGAGACACAAAAGCAGTACCACCGGTTCTTCAAGAAGGGAGCTGCTAACGTGTGTGGCGTTGTGTCCATTGTCAACGAGCTG
CACGACACATCTATTCGCAACAATGGTGGCCGGCTAATTCCGAGGACTTTAGACCTGCCAATATACAGCATATCGGCGTTCAATGACAAG
AGGTGAGCGATGACGATGGAAGTGTGACGT
GCGGTTGTATTGGGAGGCGTGCTTCGAATCGCTGGGTGTATGATTCGAAGAGTTGGTTTGATTCCGATGAACATGTTGAGACATCGACAATGGTAACAAGTTTGCGAAGAAGGGTTTCAC
AGTGCGCCCGGCTATAACAAGGGCCGTACGTGGGGATATTTATCGCGGTGTCGCGTGTCGTGCCCTACCAAAATTGTTTCGGTGTATGTCGCGACGCAATGAACCTCAAGAGCGAAACCC
ATCCCATCGTTCCAAACAAGATGGTAGAAAGCCCCTTGACCAATGCTACCCACAGCATGCTTGTCTTTGAGCTTTCGAGACGCGCAATGACTTGAAACGATCCCATCGTCCAGTATCGAA
ACTTTCCTTAGGGATTTCATACGGACCCCAGCTCTACCACTACATACCCTGTGTGGTGCACATATCTCTTGGAGCCCGAAGCCCACCTGCCTTCGTGACCACCTCTTTCCGGCCAGGTGC
TGAGCGGCCAGACCTGGTTGATGGTGGATGAGGCAGTTCTGGATGTGTCCGACTTCGCACAGAGGCACCCAGGGGGCAGGCGACTCATCCTGAACGCCCTGGGAACGGACGTTACACAGG
AGCTGTTAGGACAAGAGAACTCCGTGGGGCATGCCATGTCCTTCCCGCCCCACGTGCACACCGGG
GGGTAAGTGAGCGCTTGACAAAGCCAGGGTACCCTGGTCTTGAATGTGCGTACAA
GCTTAGCGTCAGGTGCGTCGTTGTGTTACCCCCCACCCAACAGCACCAGAAGATGCTGTACGTGGCGAAGCACTACGAACCGACTCGATCATTGCACCAAATGTCCACCAGAGCGTGAGA
CCAAGCCTATAGCAAAACATCCACCAAGCGACCACCTTCGCCTCCCTCTATCCCGGCAGTGATGTCTTGTTCATGGCGCAAGGGGGTCGGATGCTGCGATATCTCTTGGACGAAGTAGAA
GGTATTTTTGTCGCCAACCAAGAAGTGGGCCCTCTCGCTCACCTGTTTCGCCCACACCTTGCCGTGCCTTCTGACCGGCAGAGTGCATGGCGAATCATTCGGTCGCTAGTAGTCGGCTAC
ATCGAGGAGAAGGACGTCGGGGAACCTGCGGCAGCCCTGGAAGATCAACAGGAGCAGGAGGAGGCAGAAGAGAAAGTCGAACCCTCGACGGGAGATCCTGCCTGTTCTGGCACCAACAAC
CGCAAGATCCGTGTCGCGGGCAGGGCGGTAATGCTGACCAACCGGTTGGCCCTGGGCGACGACAATCTG
TGGTAAGTCACGAATCTACGTTATGGATCGTTGATCATTGGACAGCAGAGA
CTGAGAAGTGCATATGGCCGACTCATGTCTCGGAAGATCTGCGTGTTTATCGTTTTCTATTATTGTGACTGGGATGTTGTGAGCCGACGGGAAGCTGAATACGCCCAGCCCATTGTTCAC
ATTCCGTAGGTGCTTACGCGACAATTAAGTCTTATGGGTGAAAATCTAGGAAACGCGTATCGAAAACAGTTGTTGACGTGTTTCTATTTCGACGACCAGTGCGTGAGGGGGGGGGCAGCA
ACTAGGTCCCTGTGACATATACTGGCAGACGCAGCTATCGCCCACGAATTGATGTTGATTTTTTCCAATAACACATGGAACGAACTAGTTATTTACCCGTTTGCACGCGGCCGACCCGGC
CGTGGGGGTTGGCACGGAAACATGCTCTGCAACGTCTTTTCCAGGCAACCAAGGCGATGCGGCTGAACGACTTGGCGGCTATTCCGGCATGCATTCCCGCTCCAACCAGGCGCCCCACGA
ATTTGGTGGCGACGAGTAGCTCGGCAAGCCGCTTCGGCAACCAGCCCAAAGAGATCGACGCT
CTGTAAGTCTCCCAGTCGCTCTTGTTTTCGGTTTCATTGACTGTAGTCGCACTCTGAT
CCCTATCAAATGTAGGGCAGTCAAGGATGATGAATGGTGATGTAGTTGTACATTCAGATTGCCCCGCTGGTTCTGCGGCGGCTTCCCCCCCGACGTTTAGCCAACACATCGTGTGGAAAA
ATCCGCGTGGGAACACGCGGCGTTGCTCGAGAGAAACGCCCGCAGGTTATTTGCACCTGCCATTCAAGTTGAAGGCTGACGTTACCCATGTGTTTTGTCTTTCATAACTGAAACGATTGT
ATTTGTAATGTTTCAAAGCCTCAGCGCGTCGATGACAAGGAAGGCGTTCTGGGAAGCAACACGGACTTGTTCGAGCGGTTTCAAGTATGCCCCCTGCTCTTCCGAGAGAGGATGGGAGCC
GCCAGTGCCGTGGGCCGTGGTCACTTGCCCAGCAAGCGCCCGGTCTACCGATACATTTTCTCGTGTCCCGCCAACGGACAAGCCCAG
AGGTATGCACACCTTCGTTTCCCTTCAAGCGGA
CGTGCAGTCTTTTATGTTTAGCGTCCAAGCTCCCCTCCGGCATGCTTACTATACGAGGTCACGCCCGCCAAGGCAGAATTTGTTTCAGCCGCTCCCTGTGCCCTCTGAAGAGCCGACCCG
CTAGCGCAAAGACGCTCGAAGGTACGGTCGTCGAGCCTGTCTATGGACAGATAATCGACAACAGTATGGAGAAATCGACTATAATATTAGGGTCTGTCGAACGCATGCCTTTGAGTACGA
CACGCATGACATGTGTGGGGTCGGACTATCAAGTCCTGCCGTTTGCGAACCAAACGGCGTCGAATCCCATCAACATGTGACAAAAAAAATGCGCTTTTCTCCATGGCCTTTTGCGCTGGT
GTCGTTTTTTCTTGCAAGACATACGTATACGGTCTCTGATGCTCTCTTGCGCGGTATTCATGTTGAGCATGCCCCTGGCCAAGAGAAAGTTCATGCACGGCTCAGGTGGTTTGGAAGCTT
GCCTTCGGCGAAGTACATGTCGAATCAACACCCTTTGCCCCATTGTTTCCGGTGCAACAGGCACAGGCGGTTTCAGGAGTCTGCTATTTCAACATGCGCGCCCAGGAAGAGGGGAAGAGG
TGAGAAATGGATTATCTCAGGATGAAGCTCTGCAGCAGCACTCCCGTAGACCTGTACTGCTTTCGGATGAATTGTGGCAATATTGCTACTTGCACCGGCTAGCTGACCACACGATTGGAG
ACGATGCCTTCAGTGCTGGCCTTATTTTTATATTTTCTTTGAGCTGCGTTCTGCGTCGCAAGGCAGCGATGCTAAACTTTGAAAGGTCAACAGTGTTCGGGTCCGTTCAACCGCAAGGCT
GTCTGCCAAGAATGTTTCCCTGTTTGCCGTATGAATGCTCAAGACTCCGCTACGGATACTTGCCTCCACAAAGTTCTTCGAGCAACACATGAGTGCCACCCGTGTTACAGCGGGCAGTTG
TGTACCACCATCCAGGAAACGATGAGGGTTGCCTGGCGTTTCCTGTGCGTGAAATATCAATGGAAGTTCCTCAATAACCAAGCATATTGTGGGTGGCCGGAGTTACGGACCAAGATAATC
GATTTTGGTCGTTGGGGGCAGCCTCGTGTGATATGGAGAAAATCACAGAGAGCTCCCGAAGTTGAAAGTAGACAAGCGGATTGACGACTGCATTTGCTGCATCAGTGAGTGAGGCATCCA
GTTATTTTTTGCCCAAAAACAGCGAGTGGCGGAGCCCGAAGCTTCATGGAAGAATCTTCTTTGAACCGTGTTGTTCGAGCACTACAGTTACTTTCCTTCTTGGGTCCAAGCCCAACTGCC
CGATCGAGTATGTGTGATTCGCTTTCTATTCAGCGGCCTTGTCTTTGATCGAGTACAAGCAACTAACCAAGGTGTTGTTTACATCTCCGCAACAGGGTGTAGTACAGCGGGCGTACAATG
CCTTTGCTGTCAGATTGCTGGACGTCGAGCCTCCCACACCAGGCGGAAGGGTGGCCTGGACCAAGGGCTTTGCCAAGCTGCCGAAG
AGGTATGCACAAGCGAGACGAAATTACCATCATA
TTCGGGGTCGATGAAGCATCTCGCTCTTTGAGTGGTTCGGTAGAAGACCTGGACCTGAAGTTTGTTGCACAATCGATGGGCACGCCATGAACATCTGCACCGCAGGACAAGCGACGAAAT
GTGTGTCAATAAGCTCTCGACTTGTTTGAGCCCAGCATCGATACCGAGGTTTAACGCGTGTTGCACTCGTGACGATTTCTCCTTTATTTGGAAGATTGTACCGGCGGGAGAGACTACGGA
GGGCATACTGTGCATCGAGATGCGCATCAGGATGTACCACGACGGGGCCATGAGCAAACTGCTAGAAAAACTCTCACAG
AGGTGAATGGGGTTCGCGCGAAGTTGTCTTTTTTTGCTGAT
AGTTTTTGCCGCCCAAATTCATATGGTTGGGACTTTTTCGCGTTGCATAGGCTATTTTTATTGGAGGCGCGTGTTTTTCCTGATACATGTTCAAGAAATTTAGGAGACCCCAAAGATGTT
CGGTACTGTTGTTGCATCGTCTTCTATGTTAACGAAACGGCATCTTTACTACGGGTGTAACGGGTACCAACAAATCTCCGTCATGGTCGTAGATGAAGATGCGCCACCAGATTAACTCCC
GTAAACTTCACCCTATTCTGGATAGAACGGGTTGTAATCCGAACATGTGTCAAGAAGCTACTCCAGCACATTGTTTTTTCCCGTTAGATGAAGAGGGTAGCCGACCCATGTCATGCTTCT
TTTTCGTCCTTTCGTCGGCTTCAACTCTTGTCCAGGATACGGACAACGCCGCCGTCCAGCTGCAAGGACCATTCCTCATCAACAAGCTTGCCCCACCACCCGTCTATCGCAACGTTATCA
TGATTGCGGCAGGCACGGGCGTCAACCCGA
GAGTGAGTACGATAGTTTCGGACTCATGGATTGTCTCGTGCATACTTTTTCCCAACTTTCGTGTGCAAGTAATGCCGCTCCATTGGAGCT
TACAAACCATCATGGTTGTTTTTTGGCAATCAGAACGTATGTCTTTCGCTTGCATCGTGTTTCAGAGCTTGGTGTGCGATACGCGAAACTTGCCGAGGAGTTAGTGCACGCCCATGCGTT
GCAAAGGGTGCCGGACATCGCCTCCTTCCTCGACATGTTCTCGTGGAACGTGTTCGTCAGTGCAAAATTAACAACTTGCGCCTTCGGTACACATCGATTTGCCCCCCACGACATCAACCG
CGCTACCACCGCCCGACACGTGAAACGACCTGGGTGCCAGAGGTGCTCTCACGATGCTGGCGACGACCCTCCGCAGCCGATTGCTGCCCTAACCCGACGCCCCATTGCAACCTTCGCCTT
GTCTCTCGTTTCTCCTACGTGGACTCACTCTCGATGGAACTTCCCTTTACTGGTGGTCTTCGACGGTCGAACAAACGGCCCACGGCATCGACAGTGGTCCAGCAGATTCGGAATTATCTA
AGTATTCCCAGGTGGGCGGCAAATCGTAAACAAAGGCCAGGGCTAGCACTATTCCGGCGTTTCCGAAAGCCTCTGGGTTGCGGATGTGTCGCATGAAGCAACTCGCACGTTAACGCGGGA
TCCCCTCCCTTCGTATTTTCAATTACCCCAGGGACACCACGTTCTCCACGAGGTCGCGGCTGGTCCTCGTGTGGCAGAGCACGACCGAGGCCGACTTCTACGGTACCGATGAAATCACGG
CCATGCAA
AAGTGAGTTGGTTGCCCTTGCAATCATCAGTCTGGATATGTAGGGGACATCTCGAATCGTCAAACTTGGTGATAAGGACATGGCCGATCCATCGGCATTACTTCACAACAAG
CAAAACGTCTCAATACCTTCAAACCGTTGCTGTATCCGCTCGACCCTTGCTTCTTGACACTGCGACACCAGGAGAAGAGCGATGGACTCCTCGAAGTGACTGCTCTCGTTAGCGAAGGAG
GTCGTGCGCGGAATGCTCCCGGTGCAGCCTTCCGTAAAGCGCGTGACATGGCCCGGAACGTCTTCGCTTCACCAGCGCATTCTAACAGCAGCACGCTCAATCCCTTCACTCCTATGGGGT
ATGACACCACGATGAGTTTGGAGTCATCCGATTCTCTTCCCCAGACCCAGCGCTCACGTTTGTTGCGAATTTCGACCTAAACCTCTCTTTCGCCATTTGCAAGGTTAGGCGTATCAAAGA
CAAAGAAACCGATCTTGGGAGCTTGAAATCCGAGATAATGTCTTGGCCCTCCGGTTTTTGCACGTGTCCAGCCTTCCCTTGCGTGCTCACTCTTGTACTCCGAAGTTCCTGTGAGCAGAT
GCCAGAACGCCTTGTGGCCGCGGTGTGATTCTGTGCCGCCGTGTTCGATGCTTGATTGCCATTTTATGCGCTTGTTTCGTTGGTGTATTTCATACTTACAAGGATTTGCCATGGAATGAT
TCTAGCAGATTCCTCCCTCAAAGCCAGGTTACGGCAGCAATCCCAGTCGCAAACGGCAGATACACAACTACGCAGCAGTCCCACGAATGCTGCCGCTACCTTTTCACAACCGTCGCCAGG
TTCAACTTGGACAGCAGCTCCGGGAAACCTCCGCCTTACAGGACGCCGTCCAGAACTGGTCACGGGGATGAGCAGCCCCCCCCGTACCGCACCCCGGGTCGTGACGGGAGCAGGCCTGCA
GATGGGAAGGAAAAAAGGACCACGACTGGCGTTGGCAAGCTATTGCCCAAGAAG
AGGTGAGTCGTGAAAAGCACGGCCGGAAGCAGGCACATGGGTGCATTTGTGGAATTGCCCCAAATT
GATTAACGTGAGGGCAGGATGGTCATGTCGACCTTGACTGACAACCACTCGGCGTTGCTCGCACCCTCAAAAACGCAGCTACACCAGATGGAAACCACCAGCCGCATCTCGTTGGGCTGA
TTGGTTGCCTCGAATACCTTTTGGCACACGGCCGTTGTTCGTCGTAGCCCTGCTGAAAGTGCAGTGAGGACCAATTCTGAAGAACGGCGACAGCGGGGGGTCGTTTGGTGTGGTTTATGG
CAGGTGTGATGATCATGCCAGTTCGCAACAAAACGTCTGGCTGCGTCGAGGGCATGAGTATGTTTGCTGGATTGAATGAGGTATTCGCACGAGCGCGGCGGGATGGGTTGCCGGACGAAC
GTGAAGGAGACGCCTACCGGCAGCTGTAACGCATGGGCGCAGCCCGAGTATTCTCCGCGGGGCATTGCTGATCGTCTGGGGCGTGCCATGTGTACCACAACTAGAGGGGGCTCCGTACAG
TATTTGGTACTCGAAACGTATGTGGGCCGCGTGTCCATTTCTAGATGGCTTATTTGGCTCCATCCTCACACATATATCTATCTACCCCTGCTTCTTACTCTCCGCCCGTTCTCTTTGGCG
CTTCGAGCGAAATCATTAAAAAAAATTGTTGGGCCTCTTATGTATGGGAGGCGAGCTGTTTTCTAAGCCTGCGAAAGTACTGATATTGTCGGCATCATTTCAACCTGCGCATAGACCACA
TGTTGGACTTGATGCCTGTGGCTCGTGCATCTTCCGATGGCCTGTTGTTCCCCATGCTGCTGCAGACCTGGCGCAACCTGCGGCGGAGTGACAGCTCAACGCCAGGTGCTCACCGTCGTC
CACCTATTCAAGACAGCAGC
GCGTAAGTCCGTCGACATGTACAGGAGTGCGAAGAAACAGACGACAATAGATGCAAATGCCAGAGAGGAGGCTTCTTGTAAGGTTTTGTGGGACCACTGG
GATCTGCATCACCTGCGTAGTAGAAGGTTTGTCACCATTGAACATACAACGCGTCCGTCCAACTTGTTCGCTGCTCTGGCCTGTGATTGCCAGACCTATCAAGTGGGAGATGGCCTCGTC
CGAGGCAAAGTGAACAGGGAGATCCTCGAGACGGTCTTCGGCGAAGCGCTGACGACCCCGATAGCGGCGGATAACCGCCAGCGAGAGGTGAAAAGCCTTCTTCGAAGCGACAGCGACATC
TCGGACGAGAAGGAAGGACAAGACGAGGATGATGGGGATCTTACAAGCACCGACCAGACCTCACGGAAACTTCAG
AGGTGACAGTGAAAAATTGTTTGTTGGATCGAGCGTATCTTTCTA
CGCTCCTTTCGATGGAAGGTGTGCTGCTGCTCCTTTGTCGTGCATGCAGGCATGGGGATGGCGAGTACGTGCAACGCGTTTGAACAGGTGCTTTCCTGAGAAGGAATTTTCCTCCTCGCC
GGCAGTCCCTCGAAAGGCACAGTGGTGGTCAGTATTTCTATGGCTTTATCGGATATTCGAGTCGGACAGCGGATAGTCAGGTTGAGACAGCTGCCAGCAGCAAGACTGACGAATGCTTAC
ACCCCCGTCATGTAATACGTGCTCTGCCGTCAGGTGGTTGTATCAGGGCCTAGCGAGTTTGTGGCCAACGTGTGGCAGCTCCTCGACCAAATGGGGGTCCCCTCTGGCTGCATGGTGTTG
CTTGACTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAGACAATCCTACTTGGGGCGCTGCAGGGACAACCCAGGGAGCTGATGACGGATGTATTTCTGTCGTCTCCACCATCCGTTTCTGACATGAACGCGGTTGCCATCGCTTGGGGACAG
CTGCTGCTTCTCGATTTGTCCTACACCGTCGACAACAGCTCGGAACCCTTCGATATCGCTTGCGATGACGGAGGGGGCTCCGTCGATGTTTGGTGCCCTCTGGGGGAGGCATCTGATCCA
ATCCCTTTTTTCAGATCCGAAGCTACGGTGACTGATTCTGTGCGAAACCCGATCAATTATGCGTCCTCGTTTATCGACCTGGACTTCGTCTATGGAAGGAGCGAGGACGCTGCCGATGCA
CTTCGTACCTTTGAAGACGGAATGCTGAGTATGGCAGATGACAACATGCCCATCAAGAACGACGACGGGACTTGGTTG
ATCGCGGATCAACGTACGGCGAGATTTCCACTGACGTTCGCT
CTTCACGTCGTTCTCCTTCTCGAGCACAACCGCTGCTGCGTTGACGTCGCGCCCGGCGAGAACTACACGGGCGATGAG
GATATTTACCAGGCTTGTCGAGGCTGGACCATTGCAACATTC
CAGCACATAACTGAAGACGAGCTTATGATTCTACTCATGGGAAGAAGCATCGGCGATTGGACCGTGTACCAAGATGACGATGACGGCGGCAGACGAAGACTATCCCCAGAACAGCGGAGG
GAGCTGTTGTTCACTTTCGACTACTACAACGACACCGTGGACCCGTCCGCCGACGTCTTCGTGACGGTGGCCATGACCGCCGCGTTCGAGTCGGCGCTGCCTTCTACCCTCCGCATTGTG
AGCGAGGGGTATGTTGCCACTGACTACGACCACCTTGAGCTCACCGTTGCGGCGGAAGATATCACGGGCCTGTTCGAGCACAGCGCGATCGGGGACATTCTGCGTGGGGCGGTTTTGTCG
CCTGCGATGGCGGTGGGGGCGCACTACGCTTCGGCGGTGTCGAACGCCTCGCCGCTGTTCAAACTCCCTGTGGACATGGTGCAGCGGGGCCGTGATCATGGGGTGCCATCGTACAACGAC
GTCCGGGGG
GCGTATGGTCTTCCGGAAGCCACGGACTTTTCAGATGTATCGTCGGATGGAGACGTGGTGCAACTCCTAGATGCTGCGTACGGTGGAGAAATCGACAACCTCGACGCGTGC
ACAGGAGCCTTGGCGGAGGATAAGGAGGCGAGCCTAGGTGGTATATTCGGCTACCTGCTGCATACCGCATGGGTGGACCAGTTATACAGATCTCTCTTCGGGGACCGGTACCACCACCTT
CACTCGAGGCCGATCGAGAACGTGTCACTCGTGTCTATCTCGCAGCTACTCAACCGCACGCTCGGCTTGACCGCCTTGCCGGAGTCGGGGTTCACGGTGCCGGAAGTCACTGTTTGCACC
GGGCAATGCGAAGCCACGGGTACATCGGGGGTCTCGTTGGCTGAGCGCTACGGCATATCGTGGGAG
GTGGAAGATGAAACACTACTGATTTCTTTGAGTGTCCTGGGCATCGGCGACAGC
GGGATGATAGGAATCGGCTTTGGAGGCTTGTCCATGACGGACGCCCAG
GATTTCATTATCTGTGAGGTCTTCTCAACGGGCGGTGCCGAATGCACTGACCGGTCTCCCACTGGAGGGCGA
TCGGAGCCGCAGCCGGACACTTTTCAATTGGGCCTCGAAGTTACCAATGTAACGACGGGGGGAGGCTGGACTACAGTGAAGTTCTCGAGGGAACGGGCGACGCTGGATGCAGAAGACTAC
GACCTCTTTGAG
GATATCGAAAACGAAGCAGACACCCTCGTCATCTACTCGTTCAAGAAGGGAGAGGGCGTGGGCCAACATCCCAACACCAATCGTGGAGCGGCCACGATAAACTTCGTC
ACGGGAGACGTAGACACTCAGTGCGACGGCGAGACCAGCTTCGTTTCGTTGCACGGCGCGCTGATGCTTATCGCGTGGATGATCATCGCACCGTGGGGTATCTACTACGCGAG
GTACCGC
AAGGGCGACGCGATTAAGTGGGCTGGACGTGAGTGGTACGAGATGCACGAGGAAATCATGATCGTTGCCTCCGAAGCCGTGCTCCCCCTCGGGATCACCGCAGTGTTTGCTTCAAGGGGC
AGGACCTCGGAAGCACACGCACGCTGGGGGTACTACATGATCGCCGCGGTAGCAATGCAGATCTTTACGGGCTGGATGAGGACCAAGGGGTTAGAGGCCAAACACTCGAACTTTTCTCTC
TTCCACAGG
TTCAACAAGTTCTTCCACATCTGGGCTGGACGGTTCGCATACGCAGCCGGCGTGGTGCAGTGCTACCGAGGGCTGGAGCTCGTGTCCTCGGACGATGAGCTCATTTTCTCA
GCAGGCGATGGCCTTGACTTGCAG
CTCGGCAGCTTCGGTTGGGTCAAGGACATCTTGTTCCCAGCCTGGTTTGCGCTCATCGCGGGCAGCTTTCTTATCCTAGAGACACAAAAGCAGTAC
CACCGGTTCTTCAAGAAGGGAGCTGCTAACGTGTGTGGCGTTGTGTCCATTGTCAACGAGCTGCACGACACATCTATTCGCAACAATGGTGGCCGGCTAATTCCGAGGACTTTAGACCTG
CCAATATACAGCATATCGGCGTTCAATGACAAG
GTGCTGAGCGGCCAGACCTGGTTGATGGTGGATGAGGCAGTTCTGGATGTGTCCGACTTCGCACAGAGGCACCCAGGGGGCAGGCGA
CTCATCCTGAACGCCCTGGGAACGGACGTTACACAGGAGCTGTTAGGACAAGAGAACTCCGTGGGGCATGCCATGTCCTTCCCGCCCCACGTGCACACCGGG
AGTGCATGGCGAATCATT
CGGTCGCTAGTAGTCGGCTACATCGAGGAGAAGGACGTCGGGGAACCTGCGGCAGCCCTGGAAGATCAACAGGAGCAGGAGGAGGCAGAAGAGAAAGTCGAACCCTCGACGGGAGATCCT
GCCTGTTCTGGCACCAACAACCGCAAGATCCGTGTCGCGGGCAGGGCGGTAATGCTGACCAACCGGTTGGCCCTGGGCGACGACAATCTG
GCAACCAAGGCGATGCGGCTGAACGACTTG
GCGGCTATTCCGGCATGCATTCCCGCTCCAACCAGGCGCCCCACGAATTTGGTGGCGACGAGTAGCTCGGCAAGCCGCTTCGGCAACCAGCCCAAAGAGATCGACGCT
CCTCAGCGCGTC
GATGACAAGGAAGGCGTTCTGGGAAGCAACACGGACTTGTTCGAGCGGTTTCAAGTATGCCCCCTGCTCTTCCGAGAGAGGATGGGAGCCGCCAGTGCCGTGGGCCGTGGTCACTTGCCC
AGCAAGCGCCCGGTCTACCGATACATTTTCTCGTGTCCCGCCAACGGACAAGCCCAG
GCACAGGCGGTTTCAGGAGTCTGCTATTTCAACATGCGCGCCCAGGAAGAGGGGAAGGGTGTA
GTACAGCGGGCGTACAATGCCTTTGCTGTCAGATTGCTGGACGTCGAGCCTCCCACACCAGGCGGAAGGGTGGCCTGGACCAAGGGCTTTGCCAAGCTGCCGAAG
ATTGTACCGGCGGGA
GAGACTACGGAGGGCATACTGTGCATCGAGATGCGCATCAGGATGTACCACGACGGGGCCATGAGCAAACTGCTAGAAAAACTCTCACAG
GATACGGACAACGCCGCCGTCCAGCTGCAA
GGACCATTCCTCATCAACAAGCTTGCCCCACCACCCGTCTATCGCAACGTTATCATGATTGCGGCAGGCACGGGCGTCAACCCGA
GGACACCACGTTCTCCACGAGGTCGCGGCTGGTCC
TCGTGTGGCAGAGCACGACCGAGGCCGACTTCTACGGTACCGATGAAATCACGGCCATGCAA
GTTCAACTTGGACAGCAGCTCCGGGAAACCTCCGCCTTACAGGACGCCGTCCAGAACT
GGTCACGGGGATGAGCAGCCCCCCCCGTACCGCACCCCGGGTCGTGACGGGAGCAGGCCTGCAGATGGGAAGGAAAAAAGGACCACGACTGGCGTTGGCAAGCTATTGCCCAAGAAG
ACC
TGGCGCAACCTGCGGCGGAGTGACAGCTCAACGCCAGGTGCTCACCGTCGTCCACCTATTCAAGACAGCAGC
ACCTATCAAGTGGGAGATGGCCTCGTCCGAGGCAAAGTGAACAGGGAG
ATCCTCGAGACGGTCTTCGGCGAAGCGCTGACGACCCCGATAGCGGCGGATAACCGCCAGCGAGAGGTGAAAAGCCTTCTTCGAAGCGACAGCGACATCTCGGACGAGAAGGAAGGACAA
GACGAGGATGATGGGGATCTTACAAGCACCGACCAGACCTCACGGAAACTTCAG
GTGGTTGTATCAGGGCCTAGCGAGTTTGTGGCCAACGTGTGGCAGCTCCTCGACCAAATGGGGGTC
CCCTCTGGCTGCATGGTGTTGCTTGACTGA

Retrieve as FASTA