Entry information : DyaPxd01
Entry ID 7659
Creation 2010-10-25 (Marcel Zamocky)
Last sequence changes 2010-10-25 (Christophe Dunand)
Sequence status complete
Reviewer Marcel Zamocky
Last annotation changes 2010-10-25 (Marcel Zamocky)
Peroxidase information: DyaPxd01
Name DyaPxd01
Class Peroxidasin    [Orthogroup: Pxd001]
Taxonomy Eukaryota Metazoa Arthropoda Insecta Drosophilidae Drosophila
Organism Drosophila yakuba    [TaxId: 7245]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox       Score   E-value   DyaPxd01 start..stop   S start..stop
DmPxd-A     3079    0         1..1528                1..1527
DerPxd01    2998    0         1..1528                1..1526
DsiPxd01    2970    0         17..1528               18..1528
DmPxd       2911    0         17..1525               18..1526
Gene structure
Exons
Exon    Start..End            Size (bp)
N° 1    9307651..9307817      167
N° 2    9325640..9325855      216
N° 3    9325921..9325992      72
N° 4    9326079..9326320      242
N° 5    9326481..9327152      672
N° 6    9329090..9329296      207
N° 7    9329392..9329570      179
N° 8    9329855..9330036      182
N° 9    9334879..9337528      2650
join(9307651..9307817,9325640..9325855,9325921..9325992,9326079..9326320,9326481..9327152,9329090..9329296,9329392..9329570,9329855..9330036,9334879..9337528)
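The exon coordinates above can be cross-checked against the protein length given in this entry (1528 aa). The following is a minimal sketch, not part of the database itself: it assumes the coordinates are inclusive (so each exon spans end - start + 1 bases) and that the coding sequence ends with a single stop codon. The names `parse_join` and `exon_lengths` are illustrative only.

```python
import re

# Location string copied from the gene structure field of this entry.
LOCATION = (
    "join(9307651..9307817,9325640..9325855,9325921..9325992,"
    "9326079..9326320,9326481..9327152,9329090..9329296,"
    "9329392..9329570,9329855..9330036,9334879..9337528)"
)

def parse_join(location):
    """Return (start, end) pairs from a GenBank-style join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def exon_lengths(exons):
    """Inclusive length of each interval: end - start + 1 (assumption)."""
    return [end - start + 1 for start, end in exons]

exons = parse_join(LOCATION)
lengths = exon_lengths(exons)
cds_length = sum(lengths)        # total coding bases across all exons
intron_count = len(exons) - 1    # introns lie between consecutive exons

print(lengths)
print(cds_length, intron_count)
```

Under these assumptions the summed exon length equals 3 x (1528 + 1) bases, that is, 1528 codons plus one stop codon, and the intron count matches the 8 introns noted in the remarks.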



Literature and cross-references DyaPxd01
Literature Drosophila 12 genomes consortium (2007) Evolution of genes and genomes on the Drosophila phylogeny. Nature 450:203-218.
Protein ref. UniProtKB:   B4PDD3
DNA ref. GenBank:   CM000159.2 (9307651..9337528)
mRNA ref. GenBank:   XM_002094165.1
Protein sequence: DyaPxd01
Sequence Properties (first value: protein; second value, in parentheses: mature protein)
Length (aa):   1528 (1506)
PWM (Da):   170349.52 (168035.1)
PI (pH):   6.25 (6.21)
Transmb domain:   i12-34o
Peptide Signal:   cut: 23   range: 23-1528
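The relationship between the precursor length, the signal peptide cut position, and the mature protein length can be checked with a short sketch. It assumes that "cut: 23" is the 1-based position of the first residue of the mature protein, which is consistent with the stated range 23-1528. The constant and function names are hypothetical, and only the first 45 residues of the sequence listed in this entry are used.

```python
# First 45 residues of DyaPxd01, copied from the protein sequence in this entry.
N_TERMINUS = "MRVPLLLLQLLGLLLLSGGVQSIYCPAGCTCLERTVRCIRAKLTA"
FULL_LENGTH = 1528  # precursor length (aa) from the sequence properties
CUT = 23            # assumed: 1-based position of the first mature residue

def split_signal_peptide(seq, cut):
    """Split a sequence into (signal peptide, remainder) at a 1-based cut position."""
    return seq[:cut - 1], seq[cut - 1:]

signal, mature_start = split_signal_peptide(N_TERMINUS, CUT)
mature_length = FULL_LENGTH - (CUT - 1)  # residues left after removing the signal peptide

print(signal, len(signal))
print(mature_start[:7], mature_length)
```

With this reading, the signal peptide covers residues 1-22 and the remaining length agrees with the mature-protein value of 1506 aa given above.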
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MRVPLLLLQLLGLLLLSGGVQSIYCPAGCTCLERTVRCIRAKLTA
VPKLPQDTQT
LDLRFNHIEELPANAFSGLAQLTTLFLNDNELAYLQDGALNGLTALRFLYLNNNRLSRLPAAIFQRLPRLEAIFLENNDIWQLPAGLFDNLPRLNRLIMYNNKLSQLPVDGFNRLNNLKR
LRLDGNNIDCNCGVYSLWRRWHLDVQRQLVSISLTCAAPQLLQNQGFSSLGEHHFK
AKPQFLVAAQDAQAAAGEQVELSCEVTGLPRPQITWMHNTQEVGLEEQARAEILPSGSLLIRSV
EPSDMGIYQCIARNEMGELHSQPVRLVVNGGNHPLDSPLDARSNQVWADAGTPTHGATPSPSSTPLPSPPHFTHQPHDQIVALHGSGHVLLDCAASGWPQPDIQWFVNGRQLLQSTPSLQ
LQANGSLILLQPTQLSAGTYRCEARNSLGSVQATARIEVK
ELPEILTAPQSQTIKLGKAFVLECDADGNPLPTIDWQFNGVPLPGNTPDLQLENENTELLVGAARHEHAGVYRCTARNEN
GETSMEATIKVERSQSPPQLAIEPSNLVAITGTTIELPCQADQPEDGL
QITWRHDGRLIDPNVQLAEKYQISGAGSLFVKNVTIPDGGRYECQLKNQFGRISASALVTRNNVDLAPGDRY
VRIAFAEAAKEIDLAINNTLDMLFSNRSDKAPPNYGELLRVFRFPTGEARQLARAAEIYERTLVNIRKHVQEGDNLTMKSEEYEFRDLLSREHLHLVAELSGCMEHREMPNCTDMCFHSR
YRSIDGTCNNLQHPTWGASLTAFRRLAPPIYENGFSMPVGWTKGMLYSGHAKPSARLVSTSLVATKEITPDARITHMVMQWGQFLDHDLDHAIPSVSSESWDGIDCKKSCEMAPPCYPIE
VPPNDPRVRNRRCIDVVRSSAICGSGMTSLFFDSVQHREQINQLTSYIDASQMYGYSTAFAQELRNLTSQDGLLRVGVHFPRQKDMLPFAAPQDGMDCRRNLDENTMSCFVSGDIRVNEQ
VGLLAMHTVWMREHNRIASKLKQINGHWDGDTLYQEARKIVGAQMQHITFKQWLPLIIGESGMKMMGEYSGYNPQVNPSIANEFATAALRFGHTIINPILHRLNETFQPIPQGHLLLHKA
FFAPWRLAYEGGVDPLMRGFLAVPAKLKTPDQNLNTELTEKLFQTAHAVALDLAAINIQRGRDHGMPGYNVYRKLCNLTVAQDFEDLADEISNAEIRQKMKELYGHPDNVDVWLGGILED
QVEGGKVGPLFQCLLVEQFRRLRDGDRLYYENPGVFSPEQLTQIKQANFGRVLCDVGDNFDQVTENVFILAKHQGGYKKCEDIAGINLYLWQECGRCNSPPAIFDSYIPQTYTKRSNRQK
RDLGKVDEEVATAESYDSPLESLYDVNEERVSGLEELIGSFQKELKKLHKKLRKLEDSCNSADVEPVAQVVQLAAAPPQVVTKPKRSHCVDDKGTTRLNNEVWSPDVCTKCNCFHGQVNC
LRERCGEVSCPPGVDPLTPPEACCPHCPMVK

Remarks Complete sequence from genomic DNA (chromosome 3L, 8 introns); no EST found. Strain: Tai18E2.
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCGAGTTCCGCTGCTGCTGCTGCAGCTGCTCGGCTTGCTGCTCCTCTCCGGCGGAGTCCAGTCCATCTACTGCCCGGCCGGATGCACCTGCTTAGAGCGCACTGTGCGCTGCATCCGC
GCCAAGCTGACCGCTGTGCCCAAACTGCCCCAGGATACCCAAACGCT
CTGTAAGTACTCTATGCCATATACTTAAAACTCAACAAGCAACATGCATCGATTCAATACTTAAATAATTAAA
GATAATCAATCAATTTAAAGAACTTTTATGGTGCTCCTGGAATTCCAGTTTCACAGGCTTAAGTTCGAAATGGACTTTTAATAGTTATTCTTGTCCAAGTTCGGCTTCCAGTTGCCTTTC
GTCGAGTGTACATAAATCAACGCGGGAGTTGGCTTGCAAATCCATAAATTAGTTTTTAATGTTTTCTGTGCACGCTGCGAAAAGTTGACGGCGTTAATTTGACGCCCCCCTTTTCTGCTC
CTCCTCCTTCCATGGAGTCCTACTACTCCTCCTCCTCCTCCTTCCATGGAGTCCTCCTACTCCTCCTCCTCCTCCTCCCAAGGGAGCCAGCTCCAGCTGAATCCATGAATCTTCCGCCAA
GCGGCGCTATGTTGACTCAATTAAACGGATGTCTAACCGGCAAAAGCTGACTGCGATTTGTATATTTATCACGCTGCCTGTCGCGTATTTGCATGCATGTGCAATATTTCTGACGTTGCA
TTTATTTTCGAAAATGTCAGCAGTTTTCGCATGAATTTCGGGCACCACAACCATTCCGCATCCAACGCATACAACTTGGCAAATGTTTGCGGAGCATATGCATATTTATGTAGGCCGTGG
CGTGTCCCAAGCCGTAGGAACAACCTCCCAAACTCGCATCCTGGCAGAGTGCAGCAGCTGCTGCAGCCGATGGCAAGTTATCAGCAGACAGAAAAACAGAAAACAGTCTCCACTGGGGGA
CAACAAGATATCCGGACAGACGGAGGAGCACGGTTTTCGTGGTATTTACGGAGAATTCAATTTCCAGCGGGGTTGACGGCGCTTGAGCCGCGTTTGATGGCCGGGATTCGCTGATTCGCA
GCGGGGAAAAGCTGCAAAGGATTTTCACACACACACACCCGTCAGGTGGAAAGCCAGGACCATCGTGGTCCTGGGAATAGGTGAAGCCACTCACTTCACGTGTCAACGAATTTGAAAGCC
TTGAACAAATCGATTATATGGTGCTTCAAAAACTATAAAGTGCTCTCTCAACTGGGGCACATTATTCGCTGATTCGCCGTTTCCGAAGCACTTTTAAGTTGAATCTGATACATTTTAAAT
CTGAAAAAAATCATTGGATAATTGAATGTTCCTGAGTTGGAAAAAACGGTTTTAAATAGCTTCATTAAATTATACAAGCCGAAGGAAAATGCTCCAGCCAAGCGGCGGATTTCAAAACTC
CTGACAGGTGGATAAATTCCACCTGTCCGTTGAATGGGTGAAACCAATTACTGCAGCTGTCATCGAATTTTGGGTCCCCAAATACATGGATTTATTATTTCATTTTCGCCATTGAACGAG
TGAGTTATGGGAAAGTACAGTTTGGTTCATTTGGCATTTTAAAGACTGCTTCCAAATTTAAATGAATATTTTAGCGACATCAAAAGCTGGTATTAGATTATGAATTTAAGGTCGCCCGAC
AGGTGACAAAAGCATGAATGGTTGGCAAAAAACAAAATTCCAAAACTGAAAAATAATATGGAACTAAAAACTTCTCTGACCGCGGCAGAGTTAATAATTGAGTTAACCAACATATCAGAC
CAAAGCAAAACGGCAAAACTACAAAACTATACAACTATGTGCAACCAACAATGCCTTTTGTATTTAAATATTCTATTGTGCAATCCGCAACGCCATCGAATCCACATCCATTTGCAATCT
CAACAGGAGGATCACATTCTTTGCCATTTGTTTGGCTTGCCGCACAACAATTTCTCAAATGGGTGAAAGCTGTGGAAAACTGGTAGTGGGAGCCAACAGCATTTGGCAGTACTTAAATTA
ATCTTCATTTTAAATAAGGTGATGGCTGCATAATTTATCATGCCTCGAGAGCAACAGCATCCGCACACAACAATAGAACAAAAAAAAACTGAAAAATATTAAAAAACACAGAAAAACACT
GAGAAAGACTGAACAAGACTGCAAAAAAAGTGAGCAACTTTTGCGTTGTCATTGCGATTCGCTTTTCATTTATTACGCAGCCGACCCGTTGCACACTTTTAATTAATTTGTTTCATTTAT
TACTGCATGTGTCTAAAAGTTGCTCTTGTTTTCCCCCATACAAGCGCATGTTTGTGCATTTCGCTCTTGTCTGAGGCTGTCGACTCAGATTCCAGCTTTCTTTCCATAATTCTCGAAATG
TTTCTCGGCAGTCTGTGAAAAATATGAGCACACATTTTCCCAACCAGAGAGACACTATTATTTTATAGCTCACACATGCCGACATTTCGGTTTTCGGTCTTCATGAAAATATTCTTGATT
TAATTTCCTTAATGGCGAAAATTTTGTTTACAATTTCTGTCGGAAGGTCGCTGCAGATAAATGTCGAGCAAACATAAAAGAAACTACGTCGAAAAACTACGTAAATACAGACAAATTTTA
GTAGAGGTAAAAGCAAAGTCAATTTCTAGTTAAATTGGACACAGAATTAAAGTCAAAAAGGCAGAAAAGATGTGTAGAATTACTGACTATATATGCTTATAATCAAATAGGTGCACAAGC
ATTTTTAGGGAAATATTAAAATAATGAATAGTAAGTTCGATAGCATAACGATCTTCAAGATCACCTCCTAGAGGTTTCCATTCAAGCATTTCCTGGCAAACTTACAAAAGCCTTTTCAGT
GAGCTCTCTCCTATTTTGCACACTTCAGCTGCCATTCCTCTTCAATTTAATGCTGCTTACAGCTCCAGTCTTCTCAAGGCCTAAGCCACATTTCACTTCTTTCAAACTGTCCATCCATCA
GGCATAAAGTCGCTGCCTGCGCAGCCATTGAAACTTCTCGCCGGGATGGGGCCATAAAAAACGGATGCAGCAGCTCCACATCCACATCCACGAAACATGCCTCTCACATCCTTGACTTGC
CTCGAACAAAGGCCCAAGCATTTTTTGGCCCGACCCGTGTGTATTTAAACTGGCAACTAAATGGAAGTCGAAACTGTGTAACATACGGCATCATTCATTTTGCCACACCCCCACCTCTTA
CGCCTCCAAACGCCCACTCACAAGGCAGCTGCCAAAGGAGGGGGTGGGGCTCTCTTTCCTTCAGAGCTAAATAAATGCACTTGATTTCATTTCATGATGATGCATTTTTAGCTGACCTCC
CATAAGCACACATCTCGACTTGCTCCGCCGTGGAGGCAGTTGCTTTGTCATTATTCCCCGGATCGTGGAGCTCGTCCAGTGGATTTACCCCATCCATTCCGGACTCGTGAGCCAGTACTT
ACTTTTAAGGACCGAGAGAAATTGCTTTTCGATAGAAGAAAAAGTGGTGAGGTTGAAATTTATGTTGCCTAGGTCAAGCTTATTGGATTTAAAACAATTAGGCAATTTAAAGTGTAGAAT
AGCTTTTATTTATAACAGAAATTGAGAGTCTTAAAAGAAAGGGATTTACTTGAAATTAGTTACGAAATATATAAATATAAATATATAAAATATGTATAATAAATACATATCTAAGCAATT
TCATATTTCTTTGCTGTCTATGCCCCCTTTTCGCTCACTGTATCCTCAGTTGAAAAGACGAGCACATAAAACGCACATGCAGCCAAGACATATGAACTAAATCTGCTCGCCGGCAGCCAC
TCGGCCACACATGCTATCATCTAATAACTCGGGCATTATGCACATATGAAATATATCTCATAATGCCGCACCGACTTGCCACGCCCCCCGCATGGCCACTGCCACCCGCTGCCGCCCACT
TTGCGGCGGTGTGTCAAAACAAAAGTCGCCGCACGTTGCATGCATGGCATATTAAAAAGTTTGATTAGCGTCCAGTTCGCTGCCTCTGCTTCTGCCACGCCCACTTGCCACCCGCCCACT
GGCCGACAATATGCCACACCGTCCGGCAAATACACACAAACACACACACACAGAAATATGAGCATGAACATGAATATGAATACGTATACATATATGGACAATATTTGGTTCGGTTTTTTT
TGCGAATCAACTATTTGCGAAAACTATTTTGCTAGCAAAAATGTTCCAACTCCCCTGCTTGCTCCCGCTGCTGTTTCACTTCGACCCCCAAATCATCCATCCTCTTTAGGCTACCCCTAT
ATATATATATGTATATGTATGTATAGAGGGACAAGCCCCCGGAAAAAGCCCGATTTTGCCGTCATACTTTCCTGAATTTCTCATCTCGTCGGGTGCATTTATGCGTGAGAAATGCCTTGT
ATTTTGTAGCTAGTCGTCGTGGCCATATCCTTGTTCCTGCTGCTGTTCCATATCCAGTTGCTTCTGCCATGTTGTTATTAAATATGCATAAATTTCCTTTTTGCACAGGACTTTTGCATT
TACATTTGCTGTTTATTTGTCGGTGGGAATATTGAAATTGTTACCAATGGATAGCAGATTCCAGCCCAGAGAACTGGTTTCCGGTGGGCATGGATTCGCATGGATTCCAAACTCCAGACT
CACCTGTGCATGAATATCTACCTAGTAGGATGGGAGGTGGGAGTTGGGAGTTGGAGGGGTATATACCTAGATCTATCTATCTGGGAATATATGCAGGATATGCAGCCTGGATGCGGATAA
TTATAATTATAGTAGTCGGCAAGTTGTCGGTTCAAGTTTCTGGCCGATTTGTGAGTATTTCGATTCGCCTGTGTCGAGTTATGGTGGGTTTATGAAGCTCGTACTATTGCAAATAACTTG
CTAATGAGATTTATAATTGATAAGCAGAGAACTTTCAACGTTTTCTCAGAATCTCTGTGTATTTGCAAACTAGTTCTATACATTTTATGCAGTTAAGTATATGTTTCATTTAGACCACAT
TTAGATAAATCAAATTTATTAAATGCTGCTTATTTCCTCTGGCTAAATAATTATTAATAATAGGAAATATGTTTTTGTGGACAGTTTTATTCAGTGACTTGTAACTATTAAAAATTATTT
ATGACATTTTTGGCTTTATGCTGCTTTTAATTAAAATGTGCCGTAAACAAACAATATTCTTAGTAATAACAACAAATTAGATGTGTGTACAAAAGAATTCAGATTGCAATGCTAAAAAAT
GGAATAATGTAAAAGTGATTATAACGAACAAAAATAATAATTTACATAATTTCTAGGTGAATGAAACAAAAAGGATTAATTAAAATTTATGTCAGTTTTCTTATGTTCCATTTAAAATGG
TTATATTTATTTTAAAATTAGTTATGTGTTTTTACAATGCGTTATTTAATTTAATTGCTTACCTTCATTGCATTTAATTTCCTTAAATGGCGCTTGCAATTGAAAAACTCATTATCCTAA
TATCGCAGTGCCATCAATCACAGCTGTCTGCGTTTGTCTAGTGATAAATCCTCTTACTGCATTCCGCATCCAATTACCAATCAATCAAAGAGCATCCAATCGTCGCTGTCTCATCCGTCT
TCCTTTTATGTAGACTGCCACGCCCCCAAAAAAGGCTCCGCCCCCGTTTGTGAAATATAATATTATTGTTGGTACGCTTTTTGCCCTTGCAGTTCAGTCTGGTTTGACTTTTCAATTGTT
GCCATATTTTCGCAGACCGAAAAAATTGACTACATGGAAATACAATTAAATTGTATGTATGGTATATGTATGGAATTCCTACCATTCCGCCACCCAATCAGTAATATGATTTGTAATTAT
GCAAATGAAGCGGTAGACCGTAGACCAGCGAACGCACTTGAATGGCTGAATGCAGTTCTCATTGATTAAAGCTGAACGTTTGGGAAATGTTCACTTTTTACATTGGCGCACCTCAAAGGG
AAATCCCCGAAGGCAAAGTTTTTGCCAATCAAAGCGTATGCATTTTTAATTCACTGTCACAAGCAGAAGTGGAAATGGAGCTGCAATATCGACTGGCATCTCGGGGTGGGGCATTCCCGA
TTCGAGATTTTATATTCCAGTTTCCAGGCTCAAGACTCGAGGGTCGAGACTGGGTACACGAGATTACAGATTCCGCCAAGGAATCTGTCTGCCAGTCAGCCGAGCTTCGCTTCACTTTGA
CATAGAAAAATCTTCTATTGATTTCCCAAAACTCCCCCTCAATTTGCATTTAAAGTTGGACGCTTTCATTTCTCCCATTTCTTGCTGCTCTCACTCGCGCCCCTCTTTAATGCTACAAAG
TGCCACTTGGTTCATTTGCCAGCAGTTCTCCTGCTCCTTTGGATCATGCCACTGCCCCCGCCCCTGCCACTGCCACGCCCACCAGCTTTGTCATTTCGCATGCTCGTTAGTGCGGAGCAA
AGAATGCAAAGTAGCAAGATAGCAAGAGCTGTGTGGAAAATACAGCAGCAGCAGCAACATCATCGCTGCACAAACACATGCCACAATTTATAAGCAAAGTTATTCTGCCACCGAGAAACG
GTTGGTGGCAAGTCGGCCAGGGGGGCGGGGCTGTGGCAACTAGCAACATGTCAAAGTGGAGCAATCGGCATGTGCAACGGACACAGAACAGGACACTGAACACGGAAAACAGGCGTGCAC
TGCGGGAAAAAACAGGCAAAAGTTGCCGATATTAAGATGTTCTGGCTGCCACAAAAGGTTTTCTATCAATCAATACAGATGTTGCAAAATGACTCAACCTTGAAACTGAATACCACCTAG
AGAGCAAGTACATAGAGAAAAGAAATATACTTCATCACATCTAGTTTTCAATAAGTGTGCGTTTTCAGCAAGGCGAGCTCCTTTTAATGTAATGATTTAGCAAATGGAATTCCACTGAAA
TTCGCCTCATAAACCCTCTGTCCATAAAGAAGTGATAGAGTTGAGAATAGATTGCAGTGCAGTCCTGGCAACTGGCGGTCAAGCATAAGAAACGGCAGCATCAGCAGCAGAAGAAGAATT
GCAGCAGCAGAAGAGGAGCTGCAGCCCATAAGTATGCTATGGCTATGGCAACGGAGCGAAGCCCAAAAGCGAAGGAATAAGTGCGCAAATAAAAGCGTTGCAGGGCGACTGCTGGATGCG
GAAACCTCCTCCTCGTCAGCGCCGCCTGGCCAGGTGAGATGCCACCGCCATGTGGCACGAACTACAGTCAACAGCAGGCGATGGCCGAGTCAACAAAAAAACAACAAGACAGCAAGACCA
ATGTGGCTTAGTTAAATGGGAAATACTCTTACACTGAATCAAATACAGATGACATGTGCTTACTCTTAAAGTCAGTATGGCCTACAAAATCATAGATATTCAAATGAAATATATCATCTA
TAAAATAACAATTAGGCAAAGACTTCATCGGCTTATTTGTGACACCATATCAGTTGGTCAGGACCTCCTTAAAGGGTATTTCCGGACGCCAAAGGAAAAGAAACTCACTGCTCAACCAGG
CGGCAAACCAAGTGAGTGAGTGGGTTTGGGGTCCAAGGAGTCCGAGTCCGAGTCCCAGCCAAGTGGCACGCCACGTAGCCAGCAGATAGCAAATACAAGATTGCTGCTCCGGGTACGGAA
CTTGCCCCAGACAGATGCATAATAATGTTAATGTAAGCGCCTGCGACTTTGGGCTGTGGGGTCGAGTATGGGGCAACCACGGCGTATGCGTAATATGCCAGCAAATTTGAAGCTATTCTC
GGTCGTGTAAGTTGAAGGCCACATCGTAAAATGAGCATTAGATATTTGTAAGCTAGGTAGTTAAAATTAGCTTTTACCTTCTTTCAAAACTAAACAGCTATTTTATATCCACCTTTGCTT
TATTCCCTTTGGCTACAAACACTTGAACACTTTCATACATTTAGGAAATAGTTCGTAATTTAATTTCCTCGGCATATTTATCAATTTCCATCCACCTAGGCCAAATACATTTATCCCTCG
GAAAGCCGAAATAGAACACTGCATCAGAAGGGCTTTGGGTAATATTCGCCTTTTATACGCATATTAAACCGTTTCCACTTGAATGGCGAACTGATTTCATTGCATTTTTATAACGCATAT
ATGTTGCATGGCTGCATGATTGTATGGGTGGTGTGTGTGGGCTGAGTGTGTGCCTCCTCAAATAAAAACGAAGAGCTGCTCATGCACCCAGCTCCATGCTCGACATATAAATCTGCTGTA
TAATTGAATTTCACACGTTGCAAGGAAAAAAAATAATACCAAAATGGTGCAGAAAAAATACGAAAAAAAAAAAAAGAAAGTGCATAACCAGCGGCAGCGAAAGTTTAAATAAAATTTATT
TCAACAAAAATACACACACTCGTTGGGCAAATGTGTGTGTTCAGAGATGGCAGTTGCCTGAAAATGTGAGCATTATATATTGAAGGGCGCACGAGTGGGGCGATGGGAGATGGTGCTGCT
CCTCCAGGTGGCCACTTTTAACCATGAAGGTGTTTGCCTGGCCAAACTGGAAGCAGCTTAAGCATGCCACAACGAGCAGCCGGGATGATGAGGGTGGTCTCCTTGCGTTGGTCTCCTTGC
GTTCGCAGGACGACGAGGAGCACGAGGACTCCCATAGACAGGCGGCAATTGCATGTGTGTAGAGTGCACTTGTGAGACATTTACACCTGTAGAAACGTCGATGGGAGGGGCGGCTTTACG
AAGAGCATTCTGAGCTGGGATATGAGGCACGGCTGAAGGAGCTTTTGAGGCACAACACATTAGCAGTCAGCACAACAGAACAGAGGAGTGACGCTGCAAATGCAACATAAGAGCAGTCGT
CACGAACCAAAAATTAAACTTTGTAAGACATAACTATTTCGTTTTTATATTTTACCTTTATTATTAGAGATGCTTACAGCATTTTTATGCTCATGTTCACATGAGTGACAAAGAAAGGAA
GCAGCTCATAGCCAACAGAAATAAATTGAGTTACAAAGGTGTCTGAAAGTATGCAACCTTTTTAATTATTTTCTTCTTTAAACAGCTCGTTTTGATTAGAAGCTCTGATAAAATACACCA
ATCTGGGTGATAGGTTTTGGGTTTTCCTGGAAAAACTTGAGAGAGCGGGTCAGTCAAAAAACGGCGAAGTGAGACAGCAATTTTAAGTAAGAGGAAATTTGTAGCCCGGGCATTTCTTTG
GCCTTCTAACATTTCGTTTTTTCTATGCTCCTAGGACTTTTGCACCCTTTTCCTCTCCGCTCTTTGATTTTCCAACGCTGTCAGGAGTTTTTCTTTCGCTGGCAGCCCAAAAGTATGCAA
TGCAAGTTTAGATGCCTCTTGTTCTTGCAGCGTCCTTTTTTGCTCAAGACCGTGAGTGCCAGTGTGTGTGTGTGTGTGAGTGTATGAGTGTGAGTGTGTGTGTGTGTGGGACGGAAGTGA
ATTTTCTGCACGTGGCTTAACTTCTGCTGCCTGCTCTGGCGTGTCATGCAGTCAGCAATCTTTATCCAACCCAAACGCAAGCAGCTCGAGTGGGCGCCTTTCATTCTTTTCTTGCGGATT
CCGCTCAAGGATTGCATTTAACGCAGGAACAGGAAGGAAGGAAATGCCATGTCGACGCTAAAACTGGGTTAGCTTACATAACGACAACATTAACGACAACGCCTTCCTTCCGGTGGGCGG
CGTTCGGGTGGGCGGCGCATGGGTGGTATGGCGTTGGCTTCACGCTTCGTCGCTTATTGGACATAATTACTGAATATAACCATTCTGTGGTGGCTCCACAAAACGGGGCACTCAGAAAAA
TGCCCGCCAGGAATTTTAAGAATTTCTCTCATATTTTTGTGAACAATTCTATATTACATAACATTCTACTGGTAAATTCTTTCTTTAATCAAAATTCGAATAACTTCTTCACTGCCAAAC
TATTTCATTTAAACTGCAAGGTGTTTAGAAGTATGCAGTGAATTTAAGCTGTCAAAAAATGTGCTTGCATACCCTTAGAGCACCTTCGATATATTCTTGAGATGCATCCTTGATTTCGCC
GAGTGCATATCCTTTCGCCAAAGCCACTTGTGCGTAACGGCAGCGCCAGATAAGTGCAACAAAAGCCACGGCGGCGGATGCAGCCAAGCCAGGCCACATATCAATTCCATGTGAGCGGAC
TAACTGCTCCTGGTGGTGCAGTCCTAGTTCCGGAATGTCCTGCTTGCTGCTGGTCAGTTGGAGGAGGGGGATTTTGGTTCCCTTATCGCTTACACTTGGCAAAAGGAGAGTCACCCTCAC
CCGCACACAGTTCACTATTCGCACTGATTTTTTGCTCGCCTGCCAAATCCTTAAAGTAGCACAGCTTTGGCCCAAGGATTTGTCGTAGCAAAGTTTCGTTCTTTGTCAGTTGGGGAAAAC
AATGTTAATTGCTTTGCAGTTTACTTTGCGCAATCCGGTTGACTGGCTCAACTGTTGGACTTTCTTGGTACAGGTGACATCCGCTAAGCCGTAAACTGCGGCTCTCAAGTGCTTACAAAT
CCGCTGTTAACTTTATGGCTTAAATGTACATTTACTCCTTCTTGAGATTTAAGTGCATAGCAATTTAAATAATTTATTTAATGATTTTAATGATCTTAAAGGAACATGTTCTCATACTAG
AGATTAAACTACATAGTTAATTACCAATTTAAATAGTTTATTAAAAGGAACATTTACTCATACAAGAGATTTAACTACATAGTTACATAACTACCCATTTAAATAGTTTTGTAAATGATT
TGATTTTCCCGAGATATTCACTTACACACTAAGGATTCAAATTACTCTGCAGCTTCCTTTTGCTAATGTCATTGCCACTTTCCCGGGCAGGAGAAATTCATTAAGCCCCGATCTACAGCT
CTAAACTCTTTGGCCAAAATCTTTTGTTGTTCCATCTGGGCTAAACAAAGCGTTTTATGCATGCACACTCCGAGCCAATTACGCAAAGTGCTCTAAGTATCTCAAACAGTCCTAACTGGA
CACAATTACATTAACAGTTTAGCACACACACGTATGTATGTATGCGTGTAGTTGGCAGATATCGGGGATTTCCGGAGGATAACGGGTTAGATATAGCCATCATCGTCAGAAGTGAGCGAA
ATTGGGGGTAGGCAAACAGGACGAAAATTAAATCACAGCCACAGCTGCAAAGGCTCGAAAATAAGTCCTGGCGAAAAGGAGCGGAGGCAATGAAACTGCTGCTGCTACTGCAACTGCTGC
CACTACGGTAACTAACTTTAACCCCGGAATCTTGTAGTCCGTAACGCCCATCTCCATTAACCCACACACATACAGACACTCATACACTCGCACACACACACACACACTGTCAGGCACACA
GACAAAGTGACTGGCTGACTGGCAGCTGAAGTTTTACGCTCACAGATAAACAGTGTCAAGGGTTAACAGCAATGGGATGGAGGGGCGGTGGGAGTGGCCTTTGTTTAATAATAATAAAAT
GTCACTGGGCAAAAATAAACGGCTGAGGAGCCGAGCGGCTGACGGATTACATTGCCAAGTGCTCCCCACAAGCACACACACATAGTAACACACACACAGTAACACATAGTAACACACACA
CAGTAACACACACACACACTCATGCAGCAAGTCAGTCAGAGCAATTGTCATGCATCCTTTGGACTATCGTCACTGGCAAATCACCTGGGCAACCCGACAGCACACAGGCGGATACTTGGC
TGCGAGAATGTGAGTCCTGGCATCCAAGGATGCGCCCGCCTTGTGGGTGCAGGCACTTTTATTCCTTTGGTTGTTCTTGCTGGCCATGTTGCTGCTGCTATTTGTGCTGCTGCTGCCGGC
GTGTTTGTAAGCCATGTTATTACAATAATGTGCAATCTGAGTCTGAGATGCAGCACTGCAAGAAACATCTGTAGGGAAAATGAATGTGTGTCTTTAATAGGTGTCTTAGTGTATATTGCG
CATTCTGGAGGAAAGCAAGCCACTTGAAAGCTATCTATCTATCTATTATGACAACCACACAACACGAAACTATTTGCATCAGCATAGGAGTACATCCTTAAATGTATAATCTTTTTCTTG
CTCTGCTGGCGAATGCGGTCTCTGTGGTGGCTTGTTGCGAGTGTTTGTGTGTGTGTGAGTGTGAGTGTGCGGCAGGCCAAAGTTAAATCATGTTTTTATTGTCACATTTGCCATAAAGGC
AGCCGAGGGACCGTCAGACCGTAAAAGGACTTATACTGGCACAGGAGCAGGTTTAGGATCAGGAGTAGGAGTAGAATCAGGAGAAGGAGTAGAAGCAGGAGCAAGAGGCGTCATAAGATG
CTGCCTAAGCTGCAGGCAAAGGTGGGGTGGCCAGGTGGCCAGGTGGGCAGGCAGGTGGGTAAGCCAGACAAATCATAATCAGAGCACAAGGCAACAAGCTCAACAAAGCCGGAATCACCT
TAACAACCAGGTGGTGGGCTGCTCTTAACCGGGGGTGTTGGTTTGGGGGCGTGGCCGATGCCACTGATAAGGCAGCGCATAGGTGGCCTTTTCCTTCCTTCTTTCAAACGCGCTTATGGC
CTTGTTTACGTTGCATACTTTATGGCGCAAAGCGAAAGGGGGAGAAGTGGGTGGTCCTGCGGAAAAGCGAAGGAAAACGTGTGTGTGCGGAACGCGGAAAAAAGCGAAGCGTAAGCGTTA
ACTCTAATAAAAATGCAGTGAACTGGAAAATATACTGACTGCATGATAAAAACTAACTGGCGCTGAAAGCTGGCAGAAACCAATTCCAGCGAGAAAATGAAACCTTTAATTCATTTAGAA
CACCCACAAAAGGTCCAAAACGTTTTCAAAATCAGCAATCAAAGTGACTTGTATCAAAGAGTTTTGATCGTGGAGGCTAATGAAAATAGATAACTAGGTGTGGATTAAACAGCATTTAAT
TTGTATTTATTTGTGCATTACAAATGGGTTTACTTCACGATGTCTGTCATGATTTTTCACTTAGGGAAATTGCGAAATTCCAGTTGACTGACATTGCTCGTTAATAAGGCTCTCAAAAAC
CGCAATCGTGGCCAAATAATCAAACTGAAATCTCAAAGGGGACACCGCAAACATACACTTGTCGAAATTGCATGGCATGTACCCCCAGAAAATGCGCACTAATTTACTTGCTGCGGTTTT
TCCTTCACTAATTGTGACAGCTAATTGAAATCAAATTAGTGAAGCCACTAAACAGCTTAATGCTCGCTGAGCACGAACCAGATTTTCCACCTCGGCGATGAGGGGGGAAATGGGTTAACC
ATAAACCCTACACAATGCTAATTGTTTTCATTCAATTTACTGTCTGCGCGCCGCTTTAAAATATCGCTCTTGGCTTCAGTTTTGGCTAACCATAAATTTTCGTCATGGACATCAAAGCGT
ATACGCAATATGCGAGTGTGTGTGCGAGTGGCATGTCCTGTTTGCAAGAATGTGTGTGTGGAGTGGTGTGGGTGTGCAGACATGCGTGAATAGACAAACATGCCTTTGTCAATAATATGC
ATTAATTAAGTGCAGACATGGATGCCAGAGGAATCAAACCAAGGATATCCGAGGTGCATCAATGGATTCCCAATCCCGCGCCACAGGACATACATACATACGTACATGCGGACACACAGG
CATACACACGTACATACGTACACTCACTGACACGCAGGACTCAACACCAACACGGCGGCATTGCAAATTTAATTGAAAATATCGAATAATGCGAAAATGCAAATGTCATTATCAAAATTC
AGGCCGGCGGAAGTGTCACAAGCGCTGAAAAGAAGTTTCAAGATTCAATTTCCGTTTCCGCACACATCACGTGCCTCGCAGCAGGTGGGCATTAGGGCGTGGCCGTAGGGGGCGTGGCAG
CGGGGCGGTGCGCGGATACCGCCCCATCAGCAACCCAGTTATTTATGCCTCCGTTTGGTTCCCATTTTTTTTTTTAGCAGCTCCGTAAATAGTTGCGCAACTGACACGACGCCATGAGAT
GAGGCCATTGCATCGGCGCCCCGCTGAATCCCCCATTTAGCCGGAGAGCAACTCCAGCACGCACTTGGTCAGGAGTGGGTTCTGTTCCGGCACATAAGCACTGAGAGAAATTGTGGCTAA
CACATGATCCATCATTCAAAAGAATGCCTTGCAAACAAGTGTTGGAGAAAAACTCTCTTTAAGAAGAACTAATTTGCAGGATGACTTACTGCTCTAAAGTCAAGGAAAGATGGGAAGACC
TACTGCCATTTTCTCACAGTGCACCACATTTTGGGTAGGCTCGATGAGGCATTGGTCCGGGGAAGCCGGCTGTCCGGAATCCACAGTCCGAAGTCCGGAACCCGGACACGCACTCAGCGT
GTGTGCACATTGAAGTGGACCATCGTTTGTCACTCGAGGTGGGTGTGGAAGTGGGTGCAGCAGGGGCAGTGGATGTAGAAGTGGCAGAGGAAGAGGAAGAAGAGGAGGACACGGGCACAG
TGGTGCAAAGTGGTGGAGAGGTGTTGGCTATGCAGAAAGAGAATCGGCAAGTGGCTGGGAATTTTAAAGCACTCTACGGCTGGGCAAATACGATAGCGGTGTGTGTGGGTCAAAGGGCAG
AAAAGACCGCTTCCTTGTTGATTCAAATTAGAGCGATCCAACTGGCGGCACCACTCGGCTTGACCCACTGTGAATCTGAATCTGAATCTCAATCTGAGTGTGAGTGTGGCTGTGGCTGTG
GTTGTGGTGCAGGGCTCTGGATGGTGGCCCTATTCCTTGTTGACTGTTGGTGGCGGGGCTAATGGAGTTTGCAAGAAGTGGGCAAAAATGTGGATCAAGCAGCCGTACTTGCACACAAAG
ACCCCACCAATCTTCGTTCCGCATTCTCCGGCTACTCCGGTGCTCCTTCCTGGCGGGAGAAATGCGAGCAACTTTGGCAAATTGTTGCCAGCGTATCGCTGCCCCACTTCCTTGGATTCG
CCGCAGCTGCATGAGGTTGCCAAGTTGGTCCACCAGGAAAAATATTTACCAGAACTTTGAAGGCACTACAAAAATTTTCGGGATTACATTTCAATTAACTGAAGTGCAACGTGGCTATGG
AAAACTAGTTTGCGGACTTGAGCTTATTAAAAACAAAAAACAAATAAAAACATATTTTGTAGCGAACTTTGGAAAATTCCTTCATAATAAAGTTTGCTGTAATAGTCTTTATGGCACAAA
GAACTTTGAAAAAATCTACAATAAAGGAGGGTAGTCGTAATATTTTTTTGAGTGCAGCTCTGAGAGCGGGGTTACCACATGGACACCAGTGGGTCGACCTGCCTCGGGCCGTGAGTCTTT
CATCAGACCAGAGCCGTAGTCATAGCCATAGCCATAGCCATAGCCGTAGCCATAGCCGTAGCCGGAGTATAAGAGCCGGAGCCAAGACCGAAACCAAAACCCGGAGTGGAATTTCATGGA
TGGGCGGAGGAGGGGGGTGCTGCTGCTTTGTTTATGTGGCTCCATTTGGGCATATGCATGACCCACTCAACCACCCATCAGTGGCACCCACCACCATCCTGTGGTTGCTGCTATTGTTGG
GCTGGGGCCTCAAGCTCCACGTGAGAGTGGTTTTAGTGTTGTTTTGGATTCGCGTACCTCCTAGTTGCATAGCTAAAGGGTATATTCGTTTTAAGCAAATGTAGGAATAGATGGACCAAT
TAAAGGTGTTGCTGAAACTAAAGATTAAAAATCTATATTAAGTTATAGATTTATTCTAAAGGCATGCTGTGTACTTACTGTCTAAAAATGCATATTAAAAGTAATAAATAGCTAGCTTAG
GTATGCAATACATCAAATAACAAAGCATGTACTACAGGGTATCATCTAGTCGCACCCCTCGACTTAAGTGCTCTATCTTGTTCTGGATTTTTAAGTTGTTTTCGTTGTCGTAGCTTGTTT
GTGTTGCCTTTTTTGTTGTCCTGTGTGCGAGGTTTTGCCTTTTTCCCGTGTGTTGCATCCTCGTCTTTAGTTGCAATTTGAGCACAGACACAGCAACAGCCAGTATTGCACTTGGCCAAC
ACACACTCACGCACACACTCACACACTCATACTCACACTCACACTCACACTTACATTCACAGTGAAATGCTGACACCAATTAGGGTGCAGGTTGTTGGTGTTGCTGCCGTTAGTTGCTGT
TGGTAGTTAGTTAGTTCGGTTGCACTTTGTGCGTTGCATGTTTACAGTGGCCACAAAACATAATTTCTGTTTAGACAACTTTAGCCCCAGCTCCAATTGCAGTGGCTTCTGGCCAAAAGC
TTTGCGAGCAGGGCTGGAAAAAAAAAATACAAAACAAAAAGGCGGAAAACAAAACAGGGTCATAATTATTGCGCCAACAAAGACAGCAAAGCAGCTGCCACATTGGCCAGCGGTAAACGA
TGTTGCCAGCAGGCAGAGGCACCACCAGCTGAATTTTCGGTATAATTTTCACTTTTGCCAAGTGAAACTTTTTCGCTGCCGCATTGAATTTGCAGCCAGCGAGCAGCTAAAACCCGAGCA
ATAGCAAAAGCAAAAGCAACAGCAACAAAGTTCATGAGTTGATTAGAAAGTTTTTTAACCGCTGTTGCAGCTACATCCTTAACCCGCTGAAATCCTACAGCCATGCCCCCCAGCACATCA
TGCACTCAGATAAACTACTATGGCTTTTACCGCATGCAAGGTAATAATACTTATGTATGAATATGTAACCAGATACTCCTTCACTCAATTTCTCATCGTTGTCCTGGGGAATTTCTTTAA
GTGAACCGTTGCTAATGGCAGCTCGTGAGCTTGGCATATGCGAAAAGACCTCAATCCGTGGCTTAGGCTCCCAACTCCGCACTGCATGCCACATGTTATATGCCACATGGGTAGATGGGT
AGATGGGTAGTACGAGTAGTGGTGGCTTATCGCTTGGCTGCTTTGTCGCCGGGCAGGGGCATAAACTATGCGAGCGCGAGCATCAACTTTTTGTAATTAATTTTTAATGTGTGCAAAATG
TTTGAGTAAACGAGCATGTTGTTGTTTGCCATAGCGAGCGCCGCCATATCAAGCGGATGTCGTTAGAACGGTTTATCAAGCGCTCTAAGAGGCGAATAAGCTGCGGCTATATATACTATA
TACTCTATACTCTATACTCTATACTATTTAAGTTTGTGCTTAGCTGGTGTAAATAATTGGATTGCGTTTCAAAACGGGCGGTTGCCAGAAGCCAAGACTGCCAGGCTTCTGGTTACCCAA
ACGGAATTTCTTTGCACTAATTGCATATGACAATGCTGCAAAACATCCATACATCCCCACCTGACGCCCCCAAAAAACCACCTTACACCCCTACACCCTTGCAACACCCACCCATGGTGT
GCGAGCGAGAGGGAGGAAGGAGCAACGCAGACCGCATGACGAGAATTGGCAACTTGCTGCAATTTGCTAGAACCAAAAATTGATTTACCCTCAACTGCAAGTCTTCCCGCTTCGCACAGC
TTTCAAGGCACTTGAAAAAAATGCTGTTCGCTGGCAATTTGCATCTCGAACTTCGGAATTCAGCAGAATTTGCAGCACTGGCCAGCTTGCGTATTATGCAGAGTTGGCATCATTGACTGC
CACGCATACTGGGCATCATTGATTTCATTGTAGAGCTTGAAATATAATATATATAATACTTTAAATGCTGAAATGGTTATTGGAATTTAAGAGTGTGACGTTTGACACCTTTGAGAGTCC
TTTTAATTTGCCATCATGCACTGAAATACAAAACTTAAGAGTAGTTTTTCCCGCAGTGTTGAGCCCAACTCCCCTTTCAATCTTTAACCGACTCGCATCCATCCATCAGATAAACGTTTT
GCTATCTGCAATCTGCGAATCGCATTCTGTCTCTTTGTCCATGCCATGCCATCCGCATCCACTCACGCTGTGTCCGTTCTTGTTCTTGTTTTTCCGGCAACATTTTTGCAGAGATTTGCG
TTTCAATCACATCGAGGAGCTGCCGGCCAACGCGTTCAGTGGTCTCGCCCAGCTGACTACGCTTTTCCTGAACGACAACGAGCTGGCTTACCTGCAGGATGGGGCACTCAATGGACTGAC
GGCGCTGAGGTTCCTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAGCGGTTGCCACGCCTGGAGGCGAT
ATGTAAGTGAACCCTCCTTAATTAAGGATCCCC
GCGTGCAATCGCCTTACTCAATATCCTTCTGCAGTTTCCTCGAGAACAATGACATTTGGCAGCTGCCCGCCGGCCTCTTTGACAACTTGCCGCGCCTGAACCGCCTCTGTAAGTCGATGA
TGGTTCGATTCCACTTTTCGTCTTTTTTTTTTGTGAGGAGACTAACATTCTGTCCTCCTTCCTCCACCATCCAGGATCATGTACAACAACAAGCTGAGCCAACTGCCAGTGGACGGATTT
AATCGGCTGAACAACCTGAAGCGCCTGCGACTGGACGGCAACAACATCGACTGCAATTGTGGCGTCTACTCGCTCTGGCGCCGCTGGCACCTGGATGTGCAGCGCCAGCTGGTGTCCATC
TCGCTCACCTGCGCTGCTCCTCAGCTGCTCCAGAACCAGGGCTTTTCCAGCCTGGGAGAGCATCACTTCAAATGCG
CGGTAAGTTTTCAATTTCCATTTAAGTCACACACGGAACATGAA
GCAACTTTATGCTTAAGTACGCTTGCCATCAAATGGATTGAAAGCACGATATCTGATGAGCGCCGCAGATGCATTTTAATACCGAATTGATGGCCACTCCACTTGAAATTACTTGCAGCA
AAGCCGCAGTTCCTGGTGGCAGCCCAAGATGCCCAAGCCGCTGCCGGCGAGCAGGTGGAGCTGAGCTGCGAGGTCACCGGCCTGCCCCGTCCGCAGATCACGTGGATGCACAACACGCAG
GAGGTGGGCCTGGAGGAGCAGGCGCGGGCAGAGATCCTGCCCAGCGGCAGTCTGCTCATCCGCAGCGTGGAGCCCAGCGACATGGGCATCTACCAGTGCATTGCGCGCAACGAGATGGGC
GAACTGCATTCCCAGCCCGTACGCCTCGTAGTTAATGGCGGTAACCACCCACTCGATTCGCCCCTCGATGCCCGCAGCAATCAGGTGTGGGCGGATGCCGGAACACCCACGCACGGAGCA
ACGCCATCGCCATCGTCAACGCCATTGCCATCGCCACCGCACTTCACCCACCAGCCGCATGACCAAATTGTGGCCCTTCACGGCTCTGGACACGTGCTGCTCGATTGCGCCGCCTCCGGC
TGGCCACAGCCGGACATACAATGGTTCGTCAATGGCCGCCAACTCTTGCAGTCGACCCCCAGTCTCCAGCTGCAGGCCAATGGCAGCCTCATCCTGCTGCAGCCCACCCAGCTCTCTGCC
GGCACGTATCGCTGCGAGGCTCGCAATTCTTTGGGCAGCGTCCAGGCCACCGCCCGCATCGAAGTGAAGG
GGGTGAGTAAGTGCCAGAGATGCTCCATGTTGAGTGGGAAACTTTGTTTC
CAGTCCAGGTACAGATGTGTCTATCTATGAATCTATATATTTTTGCTTTTATCGCAAACGCACGGCGTAATGTTGAACAACCTTATGAATTATGGTGTTGTTATCCTTAGTGCCTCATAT
ATTTACCCATATGTGCACCCAGCTACTGACCATCTCTGGTGCAGTACCCACACAGAGTCAGTGTTTATGCCACACACCATCGCCCTGTGACTGCCACACAGTTGCTGCTGCAGGAATAAG
AGTCACAACTGTCGCGCTTTACTCAATTTGCAGTCGATTTAATTTGTTTTCGTCTGGGAAAAGGGCAAATGGGGAGTGGCATTTGAGACTGACCAGGATGCAGCTGCCTGGCGAGGCCAA
CTCAACTGAACTGATATCGGGGCGTTGGGACTTTGCAGATGAGTTTGGCCGGCGGGAAAAGGGCCTCACTGTCCGTTAGAAATATTGCGACAACTTTGACAGCAGGTGGTGCCGGTGATT
GGATGCAACTTCATATAGACAAATCAAATCACAAAAGTTTGTGTTTTTGCACTCGCCAAATTTAACAATTTGTTCAAGCAAAAAAAAGTTTGGGATTCCGTTTTTTCGGAAAAATATTGT
TTCCCACGAAAAGTTCACGGCAATGTTTGCAATAATTAACTTTTAAAGGTCAACAGCATATATTGTGATTTTTGAAATTGTACAAAAAATTAAAAAAATGAGAGAAGCTTTAATTTTAGA
CTTACTATATTCTTACTATTATTATAGGTAGCTGTGGCCGCTTAGGTTTTTTTCCAGTGCACTGACGTTATGCGGTGTGTAAGTAAATGGCTCTGGGTTTGATTGGCGCCCATTAAGCGC
TCTCGCTTCGCAGCTTCTTGCCCATTAAATAACCGATGAAGTGCATCCCGGCAGGAGGTTTAACCCGAAACAACTTGGCACTTGAGCTTTTTATGTTTTTCCATTCGTTATTGCCGTATT
TCGAGCGCTTGTACGTGACTGTACTTTATTGCGCAATAGAAACGGCCAACGGAAACGGGGAGTGAAATTTATAAATATATATATATATATATCCACTGGGGAATGGGGATCACAGACGGC
GAAAAGTGCATCCACCGCCTTGGCTGCAGCACAATGGCTCATTATAATTTGTGGTTGTCGGCACAAAATGAAGTGCGAAATTTATGCATTATTTTGCTGAATTAAGTGCAAAAATTATAA
ACTATTATGTGCGGCGGGGGCACGCACAAATGGCGCGCTGTGGCTTTAAAAATGTGCACTGCGGCTTAGGGGGTATCACGGGGTATGCGGGGCACACGGGTACCACCAGGCGGATGAACT
GTTTAAGCAGTCAGGCATGGCGCCTGGTCAGCATAATTAACCCACCAAACGCATAAACAGACACACGGGCAAACACTCAACCAAATACAAACACACACACTCGTACACAAGTAAATATGG
CGGCCACGCCGAAAAGTATGCAGCAGAAATTTCATATTTTATATTACGCATACGCCCCATGGGCATTGACCAACGCACAGTGGTGTTAGCAACGGCAACACGTGCAGATTGTGCTCAATA
AGATATGTTTAAAATGCAGCTTCTAAGCACTGTGTATCATAAATATAGTTTGTTGTTGGACATATAAGATTTGCTTTTCAAATGTATATATTAGCTGGCATATCAGATTTTAAACATGAA
TTCCCACTGCGCAGCGCTTAAAGCGGGTCAAAAGGCGGGGGGTTTTTGGGGAGTGGACTTTTGGTGTTGCTGCGGCAACAAAAGACTCATAGAAACCGAACCGAAAAGCCAGGCCTCGCT
TTAGTCGCTTGTTGCTGCCATTTGGAGCAGTAAAATTGGAGCTCGAAAATCACTCATTCGCCCCGTTGCACCGCTTGCCACTCTTGCAGAACTGCCCGAAATTTTAACAGCACCGCAAAG
CCAAACAATCAAACTTGGCAAGGCCTTTGTGCTGGAGTGTGATGCCGATGGCAACCCGCTGCCCACCATCGACTGGCAGTTCAATGGCGTTCCCCTGCCCGGAAATACGCCCGACTTGCA
ATTGGAGAACGAGAACACCGAGCTGCTGGTGGGTGCCGCACGGCATGAGCATGCCG
CGGTAAGTAGGAGAAGTGATTGCCAAAACGGTTTACATTGCATTTTCATTCGATACCTTGGACA
TAATACCGTGAATCCTGCCCCCCTCCATCGCAGGTGTCTATCGCTGCACGGCGCGCAACGAAAATGGAGAGACGAGCATGGAGGCCACCATCAAGGTGGAACGGTCGCAGTCGCCACCGC
AACTCGCCATCGAGCCAAGCAATTTGGTGGCCATTACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGACCAGCCGGAGGATGGACTGCAG
AGGTAAACTCACTTACCGGGATTGGCCA
CGGAATCGGAAACTTTTTATTTACCCGCCACTTGTTGACGTTCCCTTTTGGCCTCATTTTCCGCTTGGCCACCGGAAAAAATGAACCCAAAAATTGCATACATTTAAAGTTTGCTTTTGA
AGAAAGTAGAAATTGTAATATAGACAACTTTACACATTATTTAAACACACGCGTTGTGTTTTTCTTACAGAATAACATATTTTTAATGGAAATTATAATATAAAATATTATTATATTTTT
TTCACTGCAACTCCTCAGATTACGTGGCGCCACGATGGGCGACTCATTGATCCGAATGTGCAGCTGGCGGAGAAATATCAAATAAGCGGCGCCGGCAGTCTGTTCGTCAAGAATGTGACC
ATCCCGGATGGCGGACGCTACGAGTGTCAGCTGAAGAACCAGTTTGGTCGCATTTCCGCCTCCGCACTGGTTACCATCAG
AGGTGAGCGGAAAGGGGGGGATGCTGAATCAAAATCAAAT
AGCTCAACTTTGGGCGACGACCGCAACTTTAATTGCGCTGTCGGCGTTGAGAGGAGGGTGGATTGCATAAAACTCCATTAAAATCAAAGGGAAAACTTTTGCTCAAAGGCGAAAGAACTG
TGGGTAGCTGGCAAGAAAAGGCCTCTGCCAACAGAGGATGCACTTAAAATGCGTTTAAGTGCGCGGTAAAGTTGCTTAATTAATTGAAAAACAGTTGGTCTGCAGAAAAACTGCACATTA
ACTTAACGACGCTGCGATGCCGGAAAGAAGTTGTCGTATATCTTGACATTTTTATATAAAGACCTAAAGTCATTTTAAAAACCCCGGGGGTAGCGTACATCATTATGTACTTCAAATTGT
GTTTAATGAAATCAAAATGCTTTACAATTATTGCCAGCAGACTTGAACCTTGCACGGCTGAAATGATTTTCGAGTCCTTGCAGAGTTCCAGTGAAGTAAATTGCTTGGAAACCACTGCCG
CAAACTATGAAAAGAACATAAAACCATAAAGTAAAGCTGCTGGCTAAACTACAGCACAGTGGCCAGAAAACGAGGAAAAATTCTGGGAAAATCTAGGGGATAAACATCAAACTGCGACTG
GCTGTTATCAAAAGCAGTTAGCAGCTGGGCAAACTTGCTAATCCTCTCCAAGTTGCCAAGGCACTGGGATCCACTGTGTTGTCAGCTAAACTGCAAATAAAATATGCGCCGAGGCTCCAG
CAAGTGCAACATGAACATCAAAAGGCGGCGAAAGAAAGCACAAAAGTTGTCGCCGCCATCGCCAGTTGCCAGTTGCCAGTTACCAGCTACCAACTACCAGCTACCAGTTACCAGTTTTCA
GTTACCAGCTTTCTGTTGCCAGTTGCCAAACTGGAAAACGTGCTAAGCCCGGAAAAGCTACGTGCCACACTCACACTTACATCGGTCGGGGAAAATGGCAAAAGACACAGGCAGTGGAAA
TTCCTCACTGAGGCAACCGCACAAGTTATCCAAATTGCCCAAATTGAAATTGAGTTGACAAGTTGCTGGCTGCGACAAGAAAAAGAAGGGCTACACACTAAATAAAATGTATGTCTTCTT
TGAAGGGAAATTTAATGCGGAATTATTAACTATTACCTCAGAATATTACTTTCTTTTAAAGCTGGATTTAGATTACTTAGCAAACAAATACAAATAGGTTTCCTTCGAACAATTGTAATG
CATGAACCTAAATATCTATAAATCGAACAATTTATCTGCACCTAAAACTCAGCTACTTCTAATTGCAATTGCCTATGAATTTTGGTTATTTTTCTAAAGGTGCACAGAAACTGAAACTAG
CGCCCATCTTGTACCTCAACTTTTACACTCCACTTGCGGCCTTAGAAGTTTTCAAATTGTTTGTACCGAAAAAAAGCGACACGACGTGAATTTATTGAGCAAATGGTCTCTGCATCATCC
GCCACAAGCCGTAAATCAATTTGAGGCGACAAAAACGGAGTCGTGGAGGATGCGGACTTGGCCATTAACCACCGACTGGCGAACTGGAGACCAATGATGGGCCACAAAAGAACCGCCGGA
GCGGCTTACTTTCTGCTTACACAGCCATTTTCCCTGATGCATAATGCAGCAGAAAAGCGGGTGGATTTCCATGTGTGGCACCCACGGAGCATTTAGTGCCAGGAATGGGGGGCACGGAAG
CTGAATCGGCTTACAGTTCTGTGCCGGCAGCATTAGAAACTTTACTCATTACGCCGGGTTTTCCCAGTTTTCCCCGTTTTCCAGTCGCCCACTCGCCCAGACCGCAGCAAAGCAGCGGGG
GCGGCAAATTATAAAGAAAAAATCCAGAAAATGGGTGCGAGAAAGCCGGGCGCCAAAAGTGAAAAACTATGTTGGAAGAATGCAGGAGTCGCAGGAGCAGCCATCCTGCCCCTTGAAGAG
TGTGAGTGAAAACCGCTGAAAGACCCCGACGAAACCAGTGCAACAAAAATGGCAAAAATAAAAAAAAAACAAGAGAGAACGCTATAGTCGAGTTCCCCGACTATCTGATACCCGTTACTC
AGCTAGTGGAAGGGAGAAGGAGAATCTTAAACACAGTTTTTGCCGGTTTGTAGGCGTTATAGTGGGCGTGGCAGAAAGTTTTTTGGCAAATCGATAGAAGTTTACAAGACCAATACAAAA
ATGAAAAAATATCAAAACATTTTTCAAAAGTGTGGGCGTAGCAGCTTTGGGCGGTTTGTGGGCGTTAGAGTGGGCGTGGCAAAAAGTTTGTTGGCAAATCGATAGAAATTTACAAGACCA
ATACAAAAATGAAAAAATATCAAAACATTTTTCAAAAGTGTGGGCGTGACAGCTTTGGGCGGTTTGTGGGCGTTAGAGTGGGCGTGGCAAAAAGTTTTTTTGNNNNNNNNNNGATAGAAG
TTTTACAAGACCAATACAAAAATGAAAAAATATCAAAACATTTTTCAAAAGTGTGGGCGTAGCAGCTTTGGGCGGTTTGTGGGCGTTAGAGTGGGCGTGGCAAAAAGTTTGTTGGCAAAT
CGATAGAAATTTACAAGACCAATACAAAAATGAAAAAATATCAAAACATTTTTCAAAAGTGTGGGCGTGACAGCTTTGGGCGGTTTGTGGGCGTTAGAGTGGGCGTGGCAAAAAGTTTTT
TTGCAAATCGATAGAAATTTACAAGACCAATACAAAAATGAAAAAATATTAAAACATTTTCCAAAAATGTGGGCGTGGCAGTTTTGGGCGGTTTGTGGGCGTTAGAGTGGGCGTGGCAAC
CTGAATCGACAAACTTGCGCTGCGTCTATGTCCCTGGAGTCTGTATACTTAATCTCAACTTTCTAGCTTTTGTAGTTCCTGAGATCTCGACGTTCATACGGACAGACGGACAGACGGACA
GACGGACAGACGGACGGACAGACGGACATGGCCAGATCGACTCGGCTACTGATCCTGATCAAGAATATATATACTTTATATGGTCGGAAACGCTTCCTTCTGCCTGTTACATACTTTTCA
ACGAATCTAGTATACCCTTTTACTCTACGAGTAACGGGTATAAATATACCAAAACACCCAGAAATCCGCCTAAGTTGTGGGTAATGCTGGGCGTAAAACAAAAGCTTATCCTGTTACGGC
CACTTTTGTTGTTGCCGCCCGACTCCTGTGATCCTGCTGCTGGCAAAAAGGCTAACCACGGAAATACATTGCCGTGCGCCACAGCGTGTGGCAAGTGAGGAGTGAGTGAGACGGCAGTTG
GGTGCCACAGAGAAACTTTACACCAAATGGGGCACTCGCAAGTGCTGACTGCAGGATACCCTTACTATGGAAAAAGGATTTATAATAAATTAATTTCAAAGAGCAAAACTATAGTTCATT
TTATAAAGATAGAAAGTTTTATTTATAAGCAAGGCAATGACTTAAAAATATTAATTTCTTAATGGTATAAACTTAATTATAAACCTGAAAAAATTTAAAAGACTTATATACCACTAAGAG
AAATTATTTTTAAAACCAGGCTAATTAGTTTTCTGGTAGCCTCGTTGTACTTCTTCCCCTTTTCATTGCACTTTGTGCAAACAATGAATTTGTATTAAACTTAATTGAGTAAACTATTAC
TGCAAAGAATAAGAAACAAGTATAATTAATATCTTTTGCCAGAGATAACGTTCATAATTCTCCAGAGGTAACTCTACCCTACGAACTTTTACACTGTGACTTTTCTACTTTTAAGCACTC
TTTGAGGCACCCCGAAAAGATGCATTATCTGCTGAAATAATTGTCCGAGTGTTGCACAAACTTTTTCCAAAAACTATCAATGTCCAAAATACCGTGGGCAATTGTCAGCTATAGAATAGA
ACAACGGACACCCCATCGAGTGGCCCGGGGAAAAGGACACTGCCAGGAGGGCAGCCACAGAATGGCAGCGTGGAAATATGTAGTTCCTGGCGCCGCAATATCCACTGACTGGAGCCGGAG
CACCAGAAACAGGAGCAGTGGCTGTGGCAGCTGGCAACTGGCAACCGATAGTGGCAGACACTGGTGCAACAGGGTGTATTATGGTCGTTAAGTTATTACCTACAGCAGCGCCACCGCCAG
CGGCACACAATGCCAAAGTTTACACAGTCGAAAGTTTCCTAAAGGAGCATTCCTTGCCAGCAGAATGCCAGGCATGAATTGTTGTTAATGCTGCCCTCCACATCCGCTCCTCCATCTGCT
CCTCCATTCAGATTCAAGAAAGGCGAAAAAAGCAAGGCGCGGATTTTTCCTATGACTTTTCTCCGCTGCAAACAGAATGCAGCATAATTTCGTGAGGCAGAAAAAAAAAGGGGAAAATGG
AGTGGCTGGCATTTTTATTAGACAACGCGGCCATTTGGGCGACTCCTGCCACTGGCATTATGGTAATTAAAATGTGCAAATTCCCTGCGATATCTTGATTGTTTCCGCGCTTTGTGTTCT
TTATGTGCGTGTCAACTGTTGCCTCTGGGCATTAAATAAAGATTTATAAGAAAGTAACCCGACAACTTGGGGAGTAATGCACGGTATACCATCTTTTGGGTCACCAAATGCTATGCCCTG
GGTCACCAAACTTCCTTTGCTTATTTAAAGCTATCAGCTTATGTTAACTAATCGCCCTTTGTTTCGTTTCCACATTAGTTTTATAAGAAAATAAGCCGACAACTTGGGGACTTATGCACT
GTATGGCGCACAAACCAAGTCTTGGGGTCACCAACTACTTAGCCTGGGTCACCAAACTTCCTTTGACTATTCAAAACCATCAGCTTATGTTAACTTATCGCCCTTTATTTCGTTTCCACA
TTAGAAACAACGTGGATTTGGCTCCAGGGGATCGGTATGTGCGCATCGCCTTCGCCGAGGCAGCCAAGGAGATTGACCTGGCCATCAACAACACCCTGGACATGCTCTTCTCCAACCGAT
CCGACAAGGCGCCGCCCAACTATGGCGAACTGCTGCGCGTCTTCCGCTTCCCCACCGGAGAAGCTAGGCAGCTGGCCCGCGCGGCGGAGATCTACGAGCGGACGCTGGTCAATATACGGA
AACACGTCCAGGAGGGCGACAACCTGACCATGAAGAGCGAGGAGTACGAGTTCCGGGATCTGCTGTCACGCGAGCATTTGCATCTGGTGGCGGAGCTGTCGGGCTGCATGGAACACCGCG
AGATGCCCAACTGCACGGACATGTGCTTCCACTCGCGGTACAGGAGCATCGATGGCACCTGCAACAACCTGCAGCATCCCACCTGGGGTGCGTCACTCACCGCCTTCCGCAGATTGGCGC
CACCGATTTACGAGAACGGTTTCAGTATGCCCGTGGGCTGGACGAAGGGCATGTTGTACTCGGGTCATGCCAAGCCCAGTGCGAGATTGGTGTCCACCTCACTGGTGGCCACCAAGGAGA
TCACACCGGATGCCCGGATAACACACATGGTGATGCAGTGGGGTCAGTTCCTGGATCACGATCTGGATCATGCCATACCATCCGTGAGTTCGGAAAGCTGGGACGGCATCGATTGCAAGA
AGAGCTGCGAGATGGCACCACCATGTTATCCCATTGAAGTGCCCCCAAATGATCCGCGCGTCCGAAACCGTCGTTGCATTGATGTGGTGCGTTCCAGCGCCATCTGTGGCTCCGGCATGA
CCTCCCTCTTCTTCGACAGCGTCCAGCATCGCGAGCAGATCAACCAACTGACCTCCTACATAGATGCTTCCCAGATGTATGGCTACAGCACTGCATTCGCCCAGGAGCTGCGCAATCTGA
CATCCCAGGACGGACTACTCCGTGTGGGTGTCCACTTTCCGCGGCAGAAGGACATGCTGCCCTTTGCTGCTCCCCAGGATGGCATGGATTGCCGCAGGAATCTCGACGAGAACACCATGA
GCTGCTTTGTTTCGGGCGATATCCGGGTGAACGAGCAGGTGGGTCTTCTGGCCATGCACACCGTTTGGATGAGGGAGCACAACAGGATCGCCAGCAAACTGAAGCAAATCAATGGCCATT
GGGATGGGGACACCTTGTACCAGGAAGCTCGTAAGATAGTGGGTGCCCAGATGCAGCACATCACCTTCAAGCAGTGGTTGCCCCTGATTATTGGCGAAAGTGGCATGAAGATGATGGGCG
AGTACTCGGGTTACAATCCCCAAGTGAATCCCAGCATAGCCAACGAGTTTGCCACAGCTGCCCTGCGATTTGGCCACACCATCATCAACCCAATACTCCATCGCTTGAACGAGACCTTCC
AGCCGATACCCCAGGGCCATCTGCTCCTCCACAAGGCCTTCTTTGCTCCCTGGCGTCTGGCCTACGAGGGTGGAGTGGATCCCCTGATGCGCGGCTTCCTGGCCGTGCCCGCCAAGCTAA
AGACCCCCGATCAGAATCTCAACACTGAGCTCACAGAGAAACTGTTCCAGACGGCCCATGCTGTGGCACTGGATCTGGCTGCCATCAACATTCAGAGGGGCAGGGATCACGGCATGCCCG
GCTATAATGTCTACAGGAAGCTATGCAATCTCACAGTGGCCCAGGACTTTGAGGATCTAGCCGACGAGATCAGCAATGCCGAGATACGCCAGAAGATGAAAGAACTGTATGGGCATCCGG
ATAACGTGGATGTTTGGTTGGGCGGCATTCTCGAGGATCAGGTGGAGGGCGGCAAGGTGGGTCCCCTATTCCAGTGCTTGCTGGTGGAACAGTTCCGCAGACTTCGGGATGGCGATCGCC
TGTACTACGAGAATCCCGGTGTCTTCTCACCCGAGCAACTGACCCAAATCAAGCAGGCAAACTTCGGTCGAGTGCTCTGCGATGTGGGTGACAACTTCGACCAGGTCACCGAGAACGTGT
TCATCTTGGCCAAGCACCAGGGTGGCTACAAGAAGTGCGAAGATATAGCTGGCATTAATTTGTATTTGTGGCAGGAGTGTGGCAGGTGCAACAGTCCACCGGCCATCTTCGATTCGTACA
TACCGCAAACGTACACCAAGCGGAGCAACAGGCAGAAGAGAGATCTCGGCAAGGTGGACGAGGAGGTGGCCACCGCCGAGAGCTATGACAGTCCGCTGGAATCCCTCTACGATGTGAACG
AAGAAAGGGTCAGTGGTCTGGAGGAGCTGATTGGCAGCTTCCAGAAGGAACTGAAAAAGCTGCACAAGAAGCTGCGCAAGCTGGAGGACTCCTGCAATTCCGCCGACGTCGAGCCGGTGG
CTCAGGTGGTACAGTTGGCGGCGGCACCGCCCCAGGTGGTTACGAAGCCCAAGAGGAGCCACTGCGTCGACGACAAGGGCACCACCCGGCTGAACAACGAGGTCTGGTCTCCGGACGTCT
GCACCAAGTGCAACTGCTTCCACGGCCAAGTCAACTGCCTGCGGGAGCGGTGCGGCGAGGTCAGCTGTCCGCCGGGAGTGGACCCACTGACGCCTCCGGAGGCCTGCTGCCCACACTGCC
CGATGGTCAAGTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCGAGTTCCGCTGCTGCTGCTGCAGCTGCTCGGCTTGCTGCTCCTCTCCGGCGGAGTCCAGTCCATCTACTGCCCGGCCGGATGCACCTGCTTAGAGCGCACTGTGCGCTGCATCCGC
GCCAAGCTGACCGCTGTGCCCAAACTGCCCCAGGATACCCAAACGCT
AGATTTGCGTTTCAATCACATCGAGGAGCTGCCGGCCAACGCGTTCAGTGGTCTCGCCCAGCTGACTACGCTT
TTCCTGAACGACAACGAGCTGGCTTACCTGCAGGATGGGGCACTCAATGGACTGACGGCGCTGAGGTTCCTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAG
CGGTTGCCACGCCTGGAGGCGAT
TTTCCTCGAGAACAATGACATTTGGCAGCTGCCCGCCGGCCTCTTTGACAACTTGCCGCGCCTGAACCGCCTGATCATGTACAACAACAAGCTGAGC
CAACTGCCAGTGGACGGATTTAATCGGCTGAACAACCTGAAGCGCCTGCGACTGGACGGCAACAACATCGACTGCAATTGTGGCGTCTACTCGCTCTGGCGCCGCTGGCACCTGGATGTG
CAGCGCCAGCTGGTGTCCATCTCGCTCACCTGCGCTGCTCCTCAGCTGCTCCAGAACCAGGGCTTTTCCAGCCTGGGAGAGCATCACTTCAAATGCG
CAAAGCCGCAGTTCCTGGTGGCA
GCCCAAGATGCCCAAGCCGCTGCCGGCGAGCAGGTGGAGCTGAGCTGCGAGGTCACCGGCCTGCCCCGTCCGCAGATCACGTGGATGCACAACACGCAGGAGGTGGGCCTGGAGGAGCAG
GCGCGGGCAGAGATCCTGCCCAGCGGCAGTCTGCTCATCCGCAGCGTGGAGCCCAGCGACATGGGCATCTACCAGTGCATTGCGCGCAACGAGATGGGCGAACTGCATTCCCAGCCCGTA
CGCCTCGTAGTTAATGGCGGTAACCACCCACTCGATTCGCCCCTCGATGCCCGCAGCAATCAGGTGTGGGCGGATGCCGGAACACCCACGCACGGAGCAACGCCATCGCCATCGTCAACG
CCATTGCCATCGCCACCGCACTTCACCCACCAGCCGCATGACCAAATTGTGGCCCTTCACGGCTCTGGACACGTGCTGCTCGATTGCGCCGCCTCCGGCTGGCCACAGCCGGACATACAA
TGGTTCGTCAATGGCCGCCAACTCTTGCAGTCGACCCCCAGTCTCCAGCTGCAGGCCAATGGCAGCCTCATCCTGCTGCAGCCCACCCAGCTCTCTGCCGGCACGTATCGCTGCGAGGCT
CGCAATTCTTTGGGCAGCGTCCAGGCCACCGCCCGCATCGAAGTGAAGG
AACTGCCCGAAATTTTAACAGCACCGCAAAGCCAAACAATCAAACTTGGCAAGGCCTTTGTGCTGGAGTGT
GATGCCGATGGCAACCCGCTGCCCACCATCGACTGGCAGTTCAATGGCGTTCCCCTGCCCGGAAATACGCCCGACTTGCAATTGGAGAACGAGAACACCGAGCTGCTGGTGGGTGCCGCA
CGGCATGAGCATGCCG
GTGTCTATCGCTGCACGGCGCGCAACGAAAATGGAGAGACGAGCATGGAGGCCACCATCAAGGTGGAACGGTCGCAGTCGCCACCGCAACTCGCCATCGAGCCA
AGCAATTTGGTGGCCATTACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGACCAGCCGGAGGATGGACTGCAG
ATTACGTGGCGCCACGATGGGCGACTCATTGATCCGAATGTGCAG
CTGGCGGAGAAATATCAAATAAGCGGCGCCGGCAGTCTGTTCGTCAAGAATGTGACCATCCCGGATGGCGGACGCTACGAGTGTCAGCTGAAGAACCAGTTTGGTCGCATTTCCGCCTCC
GCACTGGTTACCATCAG
AAACAACGTGGATTTGGCTCCAGGGGATCGGTATGTGCGCATCGCCTTCGCCGAGGCAGCCAAGGAGATTGACCTGGCCATCAACAACACCCTGGACATGCTC
TTCTCCAACCGATCCGACAAGGCGCCGCCCAACTATGGCGAACTGCTGCGCGTCTTCCGCTTCCCCACCGGAGAAGCTAGGCAGCTGGCCCGCGCGGCGGAGATCTACGAGCGGACGCTG
GTCAATATACGGAAACACGTCCAGGAGGGCGACAACCTGACCATGAAGAGCGAGGAGTACGAGTTCCGGGATCTGCTGTCACGCGAGCATTTGCATCTGGTGGCGGAGCTGTCGGGCTGC
ATGGAACACCGCGAGATGCCCAACTGCACGGACATGTGCTTCCACTCGCGGTACAGGAGCATCGATGGCACCTGCAACAACCTGCAGCATCCCACCTGGGGTGCGTCACTCACCGCCTTC
CGCAGATTGGCGCCACCGATTTACGAGAACGGTTTCAGTATGCCCGTGGGCTGGACGAAGGGCATGTTGTACTCGGGTCATGCCAAGCCCAGTGCGAGATTGGTGTCCACCTCACTGGTG
GCCACCAAGGAGATCACACCGGATGCCCGGATAACACACATGGTGATGCAGTGGGGTCAGTTCCTGGATCACGATCTGGATCATGCCATACCATCCGTGAGTTCGGAAAGCTGGGACGGC
ATCGATTGCAAGAAGAGCTGCGAGATGGCACCACCATGTTATCCCATTGAAGTGCCCCCAAATGATCCGCGCGTCCGAAACCGTCGTTGCATTGATGTGGTGCGTTCCAGCGCCATCTGT
GGCTCCGGCATGACCTCCCTCTTCTTCGACAGCGTCCAGCATCGCGAGCAGATCAACCAACTGACCTCCTACATAGATGCTTCCCAGATGTATGGCTACAGCACTGCATTCGCCCAGGAG
CTGCGCAATCTGACATCCCAGGACGGACTACTCCGTGTGGGTGTCCACTTTCCGCGGCAGAAGGACATGCTGCCCTTTGCTGCTCCCCAGGATGGCATGGATTGCCGCAGGAATCTCGAC
GAGAACACCATGAGCTGCTTTGTTTCGGGCGATATCCGGGTGAACGAGCAGGTGGGTCTTCTGGCCATGCACACCGTTTGGATGAGGGAGCACAACAGGATCGCCAGCAAACTGAAGCAA
ATCAATGGCCATTGGGATGGGGACACCTTGTACCAGGAAGCTCGTAAGATAGTGGGTGCCCAGATGCAGCACATCACCTTCAAGCAGTGGTTGCCCCTGATTATTGGCGAAAGTGGCATG
AAGATGATGGGCGAGTACTCGGGTTACAATCCCCAAGTGAATCCCAGCATAGCCAACGAGTTTGCCACAGCTGCCCTGCGATTTGGCCACACCATCATCAACCCAATACTCCATCGCTTG
AACGAGACCTTCCAGCCGATACCCCAGGGCCATCTGCTCCTCCACAAGGCCTTCTTTGCTCCCTGGCGTCTGGCCTACGAGGGTGGAGTGGATCCCCTGATGCGCGGCTTCCTGGCCGTG
CCCGCCAAGCTAAAGACCCCCGATCAGAATCTCAACACTGAGCTCACAGAGAAACTGTTCCAGACGGCCCATGCTGTGGCACTGGATCTGGCTGCCATCAACATTCAGAGGGGCAGGGAT
CACGGCATGCCCGGCTATAATGTCTACAGGAAGCTATGCAATCTCACAGTGGCCCAGGACTTTGAGGATCTAGCCGACGAGATCAGCAATGCCGAGATACGCCAGAAGATGAAAGAACTG
TATGGGCATCCGGATAACGTGGATGTTTGGTTGGGCGGCATTCTCGAGGATCAGGTGGAGGGCGGCAAGGTGGGTCCCCTATTCCAGTGCTTGCTGGTGGAACAGTTCCGCAGACTTCGG
GATGGCGATCGCCTGTACTACGAGAATCCCGGTGTCTTCTCACCCGAGCAACTGACCCAAATCAAGCAGGCAAACTTCGGTCGAGTGCTCTGCGATGTGGGTGACAACTTCGACCAGGTC
ACCGAGAACGTGTTCATCTTGGCCAAGCACCAGGGTGGCTACAAGAAGTGCGAAGATATAGCTGGCATTAATTTGTATTTGTGGCAGGAGTGTGGCAGGTGCAACAGTCCACCGGCCATC
TTCGATTCGTACATACCGCAAACGTACACCAAGCGGAGCAACAGGCAGAAGAGAGATCTCGGCAAGGTGGACGAGGAGGTGGCCACCGCCGAGAGCTATGACAGTCCGCTGGAATCCCTC
TACGATGTGAACGAAGAAAGGGTCAGTGGTCTGGAGGAGCTGATTGGCAGCTTCCAGAAGGAACTGAAAAAGCTGCACAAGAAGCTGCGCAAGCTGGAGGACTCCTGCAATTCCGCCGAC
GTCGAGCCGGTGGCTCAGGTGGTACAGTTGGCGGCGGCACCGCCCCAGGTGGTTACGAAGCCCAAGAGGAGCCACTGCGTCGACGACAAGGGCACCACCCGGCTGAACAACGAGGTCTGG
TCTCCGGACGTCTGCACCAAGTGCAACTGCTTCCACGGCCAAGTCAACTGCCTGCGGGAGCGGTGCGGCGAGGTCAGCTGTCCGCCGGGAGTGGACCCACTGACGCCTCCGGAGGCCTGC
TGCCCACACTGCCCGATGGTCAAGTGA
