Entry information : MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Entry ID 3341
Creation 2010-04-28 (Christophe Dunand)
Last sequence changes 2016-02-16 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-16 (Achraf Jemmat)
Peroxidase information: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Name (synonym) MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Muridae Mus
Organism Mus musculus (house mouse)    [TaxId: 10090 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value MmDuOx02-4
start..stop
S start..stop
RnoDuOx02-A 2934 0 1..1517 1..1517
HsDuOx02 2653 0 1..1517 1..1548
EcabDuOx02 2639 0 1..1517 1..1517
CfaDuOx02 2619 0 1..1517 1..1571
Gene structure Fichier Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 578..647 70 N° 2 931..1020 90 N° 3 1203..1367 165 N° 4 1874..2061 188
N° 5 2224..2425 202 N° 6 2582..2748 167 N° 7 3389..3449 61 N° 8 3562..3658 97
N° 9 4064..4154 91 N° 10 4650..4752 103 N° 11 5276..5439 164 N° 12 5976..6151 176
N° 13 6366..6484 119 N° 14 6794..6931 138 N° 15 7189..7302 114 N° 16 7477..7679 203
N° 17 7962..8147 186 N° 18 9050..9275 226 N° 19 9358..9451 94 N° 20 11546..11733 188
N° 21 12191..12260 70 N° 22 13404..13582 179 N° 23 13765..13995 231 N° 24 14071..14170 100
N° 25 14845..14894 50 N° 26 15166..15293 128 N° 27 15529..15682 154 N° 28 16794..17026 233
N° 29 17213..17371 159 N° 30 17674..17829 156 N° 31 17924..18052 129 N° 32 18184..18306 123
join(578..647,931..1020,1203..1367,1874..2061,2224..2425,2582..2748,3389..3449,3 562..3658,4064..4154,4650..4752,5276..5439,5976..6151,6366..6484,6794..6931,7189 ..7302,7477..7679,7962..8147,9050..9275,9358..9451,11546..11733,12191..12260,134 04..13582,13765..13995,14071..14170,14845..14894,15166..15293,15529..15682,16794 ..17026,17213..17371,17674..17829,17924..18052,18184..18306)


exon

Literature and cross-references MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Protein ref. UniProtKB:   A2AQ99
DNA ref. GenBank:   NC_000068.6 (122298742..122279246)
Cluster/Prediction ref. Genebank:   214593
Protein sequence: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1517 (1492)
PWM (Da):   %s   171369.4 (169014.0) Transmb domain:   %s   o600-622i1010-1032o1047-1069i1116-1138o1153-1175i1188-1210o (o575-597i985-1007o1022-1044i1091-1113o1128-1150i1163-1185o)
PI (pH):   %s   7.73 (7.65) Peptide Signal:   %s   cut: 26 range:26-1517
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLPTSPKTLVLLGALLTGPLGPAGGQDAPSLPWEVQRYDGWFNNLKYHQRGAAGSRLRRLIPANYADGVYQALEEPLLPNPRRLSDAVAKGKAGLPSVHNRTVLGVFFGYHVLSDLVSVETPGCPAEFLNIYIPRGDPVFDPDKRGNVVLPFQRSRWDHNTGQSPSNPRDSNQVTGWLDGSAIYGSSHSWSDTLRSFSGGQLASGPDPAFPRNSQSSLLMWMAPDPSTGRGGPQGVYAFGAQRGNREPFLQALGLLWFRYHNLCARKLAQEHPHWGDEELFQHARKRVIATYNIALYQWLPSFLQKTPPEYSGYRPFMDPSISPEFVVASEQFLSTMVPPGVYMRNSSCHFRKFPKEGSDSSPALRVCNSYWIRENPNLKTAQDVDQLLLGMASQISELEDRIVIEDLDYWPGPERFSRTDYVASSIQRGRDMGLPSYSQALLALGLEPPKNWSALNPQVEPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSNIILDQFVRLRDGDRYWFENTRGLFSKEEIAEIRNTTLRDVLVAVSNVDPSALQPNVFFWQGAPCPQPRQLTTDGLPQCAPVTVIDYFEGSGAGYGVTLVAVCCFPLVSLIVAGVVAHFRNREHKMLLKKGKESLKKQPASDGVPAMEWPGPKEKSYPVTLQLLPDRSLQVLDKRFTVLRTIQLQSPQQVNLILSSNSGRRTLLLKIPKEYDLVLMFNSEEDRGAFVRLLQDLCICCTPGLHIAEVDEKELLRKAVTKQQRAGILEIFFRQLFAQVLDINQADAGTLPLDSSQQVREALTCELSRAEFADSLGLKPQDMFVESMFSLADKDGNGYISFREFLDILVVFMGSSEDKSRLMFTMYDLDGNGFLSKDEFFTMMRSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSDLRFTQLCVKGGAGTKDIFKQSSACRVSFINRTPGNRVMGPSPRLYTEALQEKKQSGFLAQKFKQYKRFVENYRRHIVCVTIFSAICIGLFADRAYYGFASPPTDIEETTYVGIILSRGTAASISFMFSYILLTMCRNLITFLRETFLNRYIPFDAAVDFHRWIAMAAVVLAVLHSAGHAVNVYIFSVSPLSLMACVFPNVFVNDGSKFPPKYYWWFFETVGMTGVLLLLVLAIMYVFASHHFRRHSFRGFWLTHHLYVVLYVIIIHGSYALIQLPSFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPSGVTYLQFQRPKTFEYKSGQWVRIACLDLGTNEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSPPVGGTCARYPLYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSMGSQMLCKKIYFIWVTRTQRQFEWLADIIREVEENDCQDLVSVHIYITQLAEKFDLRTTMLYICERHFQKALNRSLFTGLRSITHFGRPPFELFFNSLQEVHPQVRKIGVFSCGPPGMTKNVEKACQLINRQDRAHFVHHYENF*

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 2, 31 introns).
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGGTACGTAGTAGGGGCCTCTTGCTGAAGGGGCTAGCCAGTGTCCCACAGGA
GTGTGGAAAGAGAAGGGAAGGCAGATGGCGCCTGGAAGCTGGAAGGGTTGAGCCATCTCTTATTGAGCTATTGAGTTGGAAGACTTCTGACCGCCTCGGCTCAGCGGCCAAAGGATTGGA
GATGGCGTTCATTGGCAGGAGTTAGCCAGCTGAGGAAGGAGCCTAAAGCCCATCGGGTCATTCAGGGCATTCTGGAAGGTTAATGTTAAGTGGGAGGCTTTCTGCTCCCACAGGTGGCCA
GGACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGCTGGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
GTAAGCCCCGGGCTCTGTAGGGAGGGCGTGCAGTTGG
AGTGTGGGGAGAGGAGCCGGGGCGGCCTGCGGGATTGCATTCTCACCACATTCTTATCCATTCCTGGGGGTAGGAAATGGGAGGGGGCATGTGGGGCTTCTTGGGGACCCAGAGCGTTGA
TCCAGATCAACTCTCTGCGCTCCAGGCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAACCCTCGCCGGCTAAGC
GATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
GTGAGTATAGGAACAGGAAACCAGGGTGGATCTGGAGCTTGCTGAGTGTC
CAGGGAAAGGACCACCGCAACTTGCAGAAAAAAGAACTTTCCAAGAAGTGCTTCTGTTATCATCTTTGAGGGTTTGGGTGTGATGCGGGGATCTTTCTCTTACTTTACTTCAAACCAACA
CGTACATTGGGTCCCCACTAGAACTCTCAGAAACTTATGCCCTGACCACTGCTGTCCTCAAGACTCTCATGACTGCCACACAGAGGTGAAGGGAGCTTGTTTTCAAATCGTATCCGCCTC
CCCCCCTCCCACCCCCGAAACGATATTGGCATGTTCACAAGGGGATGCTACGTTTTGTTTGCATCTTAGAAGATCAAGTCTGGCGGGGGGGGGGGGATAAGCTGAAGAGACAGCGTTGAA
GCTTGTGCTCTGGCCTCTGTGTCCTGATAATTCAGAGGCTCTCCTAAGGAGAGCGGGACCTGGTGATTTATCTACCACTTTGCTCTCTTAACCCAGGCTACCACGTGCTCTCAGACCTGG
TGAGTGTGGAAACACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGAGCCGCT
GGGACCACAACACTGGACAGAGCCCCAGCAACCCCCGGGACCAG
GTGAGGCCAGGCAGCCAAAGGCAGGAGGGCAGAGAGGCGGGAAGGGGGTCTGAGGCTGGATCCGCTGGAGGCCTGG
GATCTGGCTGGGAAGACCAGACACGGTGAGGCTCTCCCTACCAGCAAGGACCTGCCTGGTCCTCCTGACTCTGCTCATCCCTGCAGAGCAACCAGGTGACCGGCTGGCTAGATGGCAGCG
CCATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGAGGACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCTCATGTGGA
TGGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
GTGAGGCTGCGGGGGTCCTGGCAGGAGGTGTAGGGTTAGAGGCACCAGGCTGCTTGGAGGCCCTGTACTGGA
GGAGGGGGGGACTGCCTGTGGGTCTGGGCTCCCCTGACTCAAGCTGTTGTCCATCTCCTGTCCTGCTTCCCCACATGGATGCAGCCTTTGGGGCCCAGCGCGGGAACAGGGAGCCCTTTC
TGCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAACTGTTCCAGCATGCTCGCAAGAGGGTCATTG
CCACCTACCAG
GTGAGTCGTGCCGCCTGCCTGGAGTCCACCTGTGTGTAGGGTCCAGAGAGACTCTGCACCCCCAACAAAGCTTCCGGGGTCCTGGACAATGCCTCTTCCCTATATACTC
TTTTCTAGGAGGGCCCTTCTTCAGGGAGAGGTAATGTGTAGGGAAAACTACTGTACATTTTAGAGAGGAAACCCTTTATTTATGCCAGATAATTTGGGATTTGATCATTATTTAACATAT
CTCTCTTCTCAATAAATTCTTTGAAGCATCTAGCACTTGATGCTGTTGGGTTTTTTATGCTCTTTCTCCCATGATGTGCCCAAGGTCCAGGGAGACTCGGCTCCCCCAGGAAAGCTTTCC
CAGTCCTGGATAGCGCCTCTCCCCTCCCTAGTTGTGGCCCTTATTCCCAGAAGGGCTAATGGGAATACACAAACACACACACACACACACACACACACACACACACACACACACACACAC
ACACACAGAGAGACCCTGTGAATGTTTACATGGGTGATTTTGCTTGTGATTATAATTTAATTGCCTGCCCCTCCTTCCCCTACAACTAGCCTTCCCTGGTCCTCATCTCTGCACCCAAGT
CTTCCTTCCCCAAAGGCTCAACCCAACCCCTCCTCCCCTTCTAATCCTCAGAACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTATTCAGGTAATGGG
GAGGGGTTGTGGAAGGTGGGGAGACCTAAGTGGAAGATCCACAGACAAAAGAGAGTTAGATTGTTTGATGCGGGGGTGTGAGGTAAGGCACTGTCTTGAAACAGGGTACCGCCCTTTCAT
GGACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
GTAAGCGAAGACGAAGCCGGTTAGAGCTGCATAGAATTG
GAGGGGGGTGGGGTGAGTGGGCTTGGAGCTAGGGGCTTGCATCCCTCCAGTTTCATTCAGAAGAGAAGCAGAATTGTTGAGGGGATGCTGAAGCAGGATCCTGGGGTGGGAAGTTTGGGA
TCCATAGGAAATCTTCCAGCTCTAATAGAAGAATTTTGGTTGCTGAGCGTGAGGTGTGTCTGGCAGCTGGGGCGCCTTCTGACCTCCCACCATAGAGATCACTCTATCTGTAAATCCAAG
CCAATGTTCAAGAAAACTCATAAGTCATCCTTCACTCTCGTCTAGCCCAGAGAAATGCTCACCTCACTCACCACCTTAGTTTCTGCCCCCCAATTCCCTCATTGTATTTTTGTCACCCTA
TTTCAGAAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGTTCAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAGGTGAGTGTGAGATTGGGGTCAGG
GAGTGGGGGTGGAGTCAAATTCACGTAGCACAAATGCTTGGAAAAGTAGAAGTTTGGCTCTTGAGCAGCAGTGTAGCGAGTAAAAGGCAGAGAAAAGTATCTGGGGAGAATTGTTCGAGA
GCTGGAAATGCCTGGGCATGGGCGATGTGCATGCTGTCCTAGCTCTTCAGAAGCTCAGGCAGGACTACTGCTGTATACCAGGTGCTCACACAATCTACCAAGATAGAGAGGGAAGGAAAG
GAAGTGGAAGAAGGCTAGGGAGAAGGAGGGAGAAGGGAGGGGCAGAGGAAGGGGAAGAGGAGAAGAGAGTGAGAGACAATGAGAGATCAAGCAAGAGAGTGAGAGAGAGAGAGAGAGAGA
GGAAGAGAGAGAGAATATGAATGGAGGTCTGAGAGTGAGTGGGGGGGCCTGTTTTCCCAAGGGGCTACTGGTGGTCCAGGGTCAAAATCTTGAGTGTAGTGTTTACTTCCAGAACCCCAA
TCTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAGCTGGAGGACAGGATAGTGATTGAAGACCTGAGAG
GTGAGCTCAGAGGGTGGGGTGTGAG
AGCCTGGAAGTCTGAGGCGCCTTCTGAGTTAGTCAACAAGCAGCCCTGGGGTACCCATGATACAGACACACAGAGCTACACCCTCAGACTTCAAATAAATGTCAGAAGAAGTTGAAATTT
GAGCTCCCTGTCTCAATTATAGTATTGTGTTTGTGTTACTTTTATTTTGACTACGCCCGAATATGGTGGACTTGAGGGTGGTAGAGCAACCGAACACCCTCTCTGGTGTTTACTTAGGTC
TTAAGGGTTCTCATCCAGTCCTGTTTTAAAGTTGGGACATGAAGAGAAGGGAATTTCTGTCTGGTATCAAGAATTATCTTGAGTCTTAGAAAAAGTGCCCCTGACCACACATGGTGACAT
GGGCTCCACAGCCAACAAGGTCGCTGATGTCATGTAGAGGAGTGCTGGGGACTGGGGCTGAGGAGAGCTCTAGACAGCTGGGTGTAAGGCTGCAATGCACGGGGATTGGAACTGACTCAA
AGTCTGGTCTCTTTTCAGATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCCCAGTTATAGCCAGGCTCTGC
TGGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
GTTAATATAAATAATAATAATAACAATGCAATAACAATGACAGTTGGAGGGGGGGAGA
TGGTTCAGTTGGTAACATGCTTGTCACACAAGCATGGAGACCTGAATTTGAATATTCAGCACCTACCTAAAACTCTGAGATGTTTTGGGGATCCCAGTTCTAGGGAGGCAGAGATAGGAC
TCCTGACCCTTACTAGACAGCCAATCCAACCCAGCCAAAACAGTGATACCCAGGTTTAGGGAGAGACTCTGTTTCAAAGAATATGGTAGAGAGTGATTAAGGAAGACTCCCTGTGTCAAC
CAGTGGCATCTATATGCACATATTCATGTACAGAAACATGTACTCCCACACACATATACACACAAGATAATGGCAGCCAAAACTTTGGATGGCAGTGCTGTTCCAAATATCTCTACACAG
ATGAACTCACTCGATCTGCATAAGAATTTATGTGGTAGTTACTGTCAGTGTTGCAGATTTACTGAGGACATTGTGGGTGAGGTCACGTGTACTTTCGTTACCTTCCATGATGCTCCAGGT
GCTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCCCAGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTATTCAGCAACATCATTCTTGACCA
GTTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAGAACACTAGGAATGG
GTAAGGCCTGGCTGGGCCCTGCTTCTGACTTCACCTTAGCGTGGGGCTCCAGACTCTCTGTCTGGC
CTTAGACAGCCTCCACAGGTCTTGATGCCAGGGGCCTACCACACTCTCGTCCACCCCAGTCTCTTTCTTCACATGAATCTTTGGGCCTGAGGTCACTGGAGGACTGAAATTCCCTTCCTA
TCCCAGCAATGGCCTTCCCCCCATTTAGGCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCTCCAATGTGGACCCCAGTGCCTTGC
AACCCAACGTTTTCTTCTGGCAGGAAG
GTGAGTATCCAGGGAGAGCCGACAAGCAATCATATGGGGGCAGAGTCATCTGTGCTGTGCCATGGGCTTTTTGTGTTGGGCTGCCTTCCATGC
CGCTATGGTCTGGGCCTGCCCAAGAGCTATACCTAGCAATCAGGCAGGGTAGATGCTGAGAAATTAGATATGGAGGTCTTTTGAGAGGAACAAGTTCCAGAGGGGTAGGTTTGAGAATGG
GAGAAGATATTTGGTCCTTCCTAAGACAGCGGGAGAGGCTAACCGGAAGAGTAAGGATTCTCGGACTCAGACGTCTGTGACTTACTTATCCCGCAGGTGCACCCTGCCCACAGCCTCGGC
AACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTCACGCTTGTAGCTGTCTGCTGCTTTCCATTAG
GTAAGC
GCTTTAGCTCTCCCTACCTCTTTTCTGACCTCCCCCTCTTCACAGACTGGCCCGGTTCATTCCTCCTCCCACCCCACCCCACTGCCCTCCACCACCCCACTACATGCTGGTTTTCAGGCT
AGCTGCTTTGCATAGCTTGAGCCAGACTCAGGGAGCTCCTGGGTAGCTAAGGAGCTCAACCTGTAGGTCCCCTTGAAGCTATGAGCAAGGGTTCTCCTTCCTTCTCACTAGTCTCTTCTG
GTCCCCTTCAGTGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTCTGAAGAAACAACCAGCCAGTGATGGGGT
ACCAG
GTGAGAAGCCTGAGGACCAGGGGGAGGGGACGGGAGCCAGGGCCTAAGGAGAAGAAACGTGTTACAGAGTGGGATGGAAGCAGGACTCACTAGAACTGCATGCTCAGATCAGAGA
ATCACATGGGCTCAAAGGCCTCTGTTGAGTCACCCACCACGCAATTGTCCTTTCCTCAGCAATGGAGTGGCCGGGCCCCAAGGAGAAGAGCTATCCAGTCACTCTCCAGTTGCTTCCAGA
CAGAAGTCTGCAGGTCCTTGACAAACGGTTCACTGTGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAGCAACAGTGGACGTCGCACCCTGCTGCTCAA
GATCCCCAAGGAGTATGACCTG
GTACAGCTCATCCTGCCTTCCTCTGTGGGCTGTTTTCATACGTATCCCCTCTCATAGGCGAGTCTTCTCAGCTACAGGATTTCCCTGTGAAGGCTGAT
GCTGGAGAAGGAGGCTCCTTTCAGGAGCTTTTTGACTTACATGACCCCCTGAGGTCACTCTCTGACCACAGCCTGGACGGTCACAGAAAAGACTTCTCTTCTTCCCTCAGCCTGAAGTCC
TTTGAAGCAGGACTAGATCAGAAAAGCCAGCTCCAGACATGATGAGGCTTTTTCTCCAACCTAGGTGCTGATGTTTAACTCTGAAGAGGACCGGGGTGCCTTCGTGCGGCTGTTGCAAGA
CCTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACCAAGCAGCAACGGGCAGGCATCTTGGAGATCTTCTTCAGACAGCT
TTTTGCTCAG
GTGCCATGAGTTATACCTGACACAGGGGATGGAGCCAACCTGAGCTGGTTGCGTTCTTGCAGGCGAATTAAGATTTCACACTGTAGGCTAGAGACAGACGGTCTTGATTG
GATCCTTAGGCAATTCATGTATTTCTGTCTCTGTATTCCTCACCTCACCATCTAATGATCAGTGACTAAAGCTGTGTCCACTGCATTGCTGAGGGGGGCCGGTGAGGTCATCTGCATGAC
AGATTCTCTAATGGTTCCTCTGGATCAGCAGCATCAGCTGACTTGGAAATGTATTAGAAACTCTGGTTCTCGGGATCCTCCCCAACAGATTGAATGAGGAGCTCTGGGGGTGGTACCCAG
AAAGCTTCATTTTAACAAATCTTTCTAAGTGATGCTGATATAGACTTGGGTTTGAAAATCACCATTTTATAGACAGTGTTTAGCCTAAGTACTTGGTACATTGTGGGCATAACAAATGGT
CGTGATGTGTTAGTTTTAAATTATCTGTACACTGATAGAAGGCAGATTCTAGGCACAAGGCAGTGCTATCTGAATTAGCATGACTATATCGGCTCAACTCTGTCATGCTATAGCCAATAT
CCACTGCCTCTGAAACAGGCTCTTTCACTCTTCATCCCCCAACTTTGCTCCCTGCATCTTACTTTCCCCTTTTCTCCCCATCGATATTATCCTTGCTGGATGGAGCGTAGGTCCAGAGAG
AAGCAGAAAGCGTGTAAAGATTCAGTTGACCCCTCCTATAGGAGAGTAGAGACCCCAATCAGTATGTTTAGGAGTGGGTTTTCTGAACTGCACACAGGGGAGACATGAAGTCATTGAACT
GCTTCCATCCCAGATAGGTCAGCTAGGCCTGACCTCCTTCTCCTCTCCCTCTCTGTGACTGCCCAAGTGCAGGTGCTGGACATCAACCAGGCTGATGCAGGGACTCTGCCCCTGGACTCA
TCCCAGCAAGTGCGTGAGGCTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTTTCTCTGGCTGACAAGGAT
GGCAATGGCTACATATCCTTCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GTGGTGGTGGGGTCTAGCAGAGCATCTGAGGAATCAGGAGTTTGTTAGCAAGGAGGTGACCT
ATATCCTCTTTCTCCCTCAGGCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCTCTCCAAGGACGAGTTCTTCACCATGATGCGGTATAG
GCTGGGCCTTCCCAATCCTGGAGTACCTTAATATTTTAAACACAAAAGCAATTCAGCTCAGAGGAGGGCAGGCAAGGTGGCTTAGTGGCCATCCATGTTGCATGGTACACACTATATCCT
GAATTCTCAAGACATCCAACTTCAAAGCATCACCCTGGCATGTCTCTGCCTTTAACAGGGTCCATAGCCACACTATCAAAAACTACCAGAAACATTGCTCTAGTTTTTTTTTTTTTTTAT
TATATGTAAGTACACTGTAGCTGTCCTCAGACACTCCAGAAGAGAGAGTCAGATCTCCTTACGGATAGTTATGAGCCACCATGTGGTTGCTGGGATTTGAACTCCTGACCTTCGGAAGAG
CAGTCGGGTGCTCTTACCCACTGAGCCATCTCACCAGCCCTGCTCTAGTTTTAAAATGTCAATAAAGACCAGCCGTGAGCACCCTCAACCCCTCATTCCCAAGGACAGAGCCTCAAGCAC
CACGAGTCATATCTCTTCACTTCTTCTTAAATCTGCTTTTTTTTTCCATGCTGAGATTAACATGTAGGCATTTGGTACATAGTCCACTGAAAAAGACAAAGAGAAATATATGAGGGTAAG
TAGACAGAGCACAGCAGCCCAGGTTCTTTTAAGGTGTCTTTGTGCCTTCCCATTGCTCTTAACATTCCCTTTCCCTGACAATCCCAACCCAGACCACTTCCTCCTCTTCCTCTTCTTTAG
AGGCTTTGCTTACTTATTTTTATTTTATGTGCATTTGTGTTTTGCCTGCATATATGTCTGTGTGAGGGTGTCAGATCCCCTGGAACTGGAATTACAAACAGTTGTGAGCTGCCATGTGAG
TGCTGAAAACTAAACCTGGGTCTTCTGGAAGAACAACTAGTGCTCTTAACTTCTGAGCAACTTTCCAGTCCCTAACATAAACTTCTTTTATTATTATTATTATATATTTTCTTTTTTTTT
TTTTTTTTTTGGTTTTTCGGGACAGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCACTTTGTACACCAGGCTAGCCTCGAACTCAGAAATCTGCCTGCCTCTGCCTCCCGAGTGCT
GAGATTAAAGACGTGTGCCACCATGCCCTGCTCTAGATATTTTCTTCATTTACATTTCAAATGCTATCCCCAAAGCCCCCTATATCCTCCCCCTGCCCTGCTCCCCAACCCACCCACTCC
TGCTTCCTGGCCCTGTCATCCCCCTGTACTGGGGTATATGATCTTCACAAAACCAAGGGCCTCTTCTCCTATTGATGGTCAACTAGGCCATCCTCTGCTACATATGCAACTGGAGACACA
TAGACTTCTAAAACAGTAAAAAAGACTACAACCTGTATCTCATCAGCATGTCACTGCAGTCCTGTCCTGGCTACATTTAGGAAATCCAGGAGGAGGAAGGACATCCCATTAGCAATGACA
GCTCACATTTTTTTCAGTCTATAATAAAATGTGTGCTCCTTGCTTTGCCTCGATGGCATTATGCTTTTTAGCATGGTGATGGTGCTTGCTTGTCCTTGCACACACACACACTTGTTTTAG
TCCCCGTGCACAACATAAGAAGAGTGGCCGCTGCAGCTGTTACCGTGGGTGCCACTGGACTTGGGCCTGAGTGGGTCCTCCACATGCCCTAGGGTATTGGCGTAAGGAGGCAGCCACCAT
TTTAGCCACTCCCCTCCCACACTCGAGTGTCCTCTTTGGATGGTGAAAGTGGAATCTCAAAGGTGAATGGGTGTGGGTGGGTTTCCACGAAAGAGAGTCAGCAGCAAGCCCATCACAAGA
GAGGGCCATGGTGTATCCTGGACCCCATCATCTGCAACAAGTGTTAGTCTCTCCCATTTGTTTCCATTTCTTGGGGTGAGGGGCAGAGGGGGAGACACACTGCTTTTCTAAGCTGTCAAG
ATCCTTTGAGCCTGCGGCCATCACCTTGCCAGCCTTAGTGTCAGCTTTAGGCTGGATGACATTGCCCAGTGCCATTTCCCAGGCTGATGGTGACCACATTAGGTAATCTGGAGACCTGCC
CAAGCCTGACCTTGCTGGATGATAGCTACCATTTCTCTACTCCTCTAGGTCCTTCATTGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCG
GGAGTCTGGGTTCCAGGACAAGGAGGAGCTGACCTGGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGATTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
GTGT
GTGAGTGTGGAAGGCGAGGAGTCTTCCAAACAGAGTTTCTGGTAGACATAACACAGGTCTTCTTTTCTTTCCTCCGAGTCCTTGGTGGTTAGTACGGAGCTCAGCATGGATATTCAGAGA
CGAGTTGTACATTGTGCAGAGTTGTTCACTAAGCCAAGGGGCGAGCCATGGGCATTGGCTTGGGATGTATCTGGCCAGAGCAGGGGACCTTGTTTTCTTTTTCTCAAAGGACATGTAGAG
TTCTATCTTCCACATGGTAGCCTCTTTCTTCTTTGTGCACACTGACTGCCCTGGGTATAGGGAGGGAAATGGGGCTGAAGCCCCAGATCTGGACATGACTAAGTCACTATCGAATGCTGA
GCAGCCTTTCCAGGGCCCAGGAGGAGAACAGTCTCCCATCCCACCCTCTGCCCTTCCAGAGAAGCCAAGCTGAGCTCTTGTTCTCTTTTCTAGGCACCAAGGACATCTTTAAACAAAGCA
GTGCCTGTCGAGTCTCGTTCATCAACCGGACTCCTGGGAACAG
GTGTGTGGGGAGGGAGGGACTGGGTGGTGTCCTTGGCGAGATCCATTGAGGGAAACAAAGGAAGGAAGCCGAGCTAG
CTGACAGCCACTGAGCAGATGTTTGCCTTGGCAAAGTCAGTGTCAGCCCAAACTCCAGCTGCTTCCTACATACAGCCTATGTTGTTCAAGAGCTTTGGCTCTCTCCCTACCACTCCAGGC
CTGAGGTGCCCTAAGCACAGACGAGCGGATTGTAGCTGCTCATCCTGCTCTGTCCTTTCCTGCTCTTGGGGACCTTTCCTGGACTCCACCCCAACTTTGATATGGCAGTTCTTCTCCTGG
CTGCAACAGTCAAAAGAAAACAGGTTCCAGGACTTTGATTTGTCTCATTTTTTCCAGTTCCTGCCCCCCAGAAACGGCACTCTCTGACATGGAAACCCCAGAACTGGGAAGTACTGGCCT
AAAGAAGAGGTTTGGCAAAAAGTGAGTATCTCCTAGATTCCTGAATTCTCAGGATTCTTGGAGAGAAGTCTCAGGATCTCTGAGCCCTGACGCACCACCTCAAGTCTCCTTCCATTCGAC
ACACATGATGTGGCACTAGTAGCTCTCCAGGAAACACCTGGACTGCTGTAGCTCCTGGGTCAGGAAACCTGCTGCCTCTTCTCCTGTAGAGTGCACCATAAAGGAGGGGCCCATGGCCCT
TAGCAGGAGAGACAGGACACTCTGAGAAAGTGGCTCCTTGAGCTTCCCTCGGGGCTGCCCACTAGAGGGTTGGAGCTTCCAGTGACCCACTCTTCACTCAGTCTCTAAACGCATCTTCTG
GCAAGATGCCTAGCTCATAGCTCCACCACAATGGCCATAAGAGGCCTCCTGGCTTCTGCTGCCTTGGGTATAGAAGTAGGCCAGTGTAGAGATCAAAGGATGGGGGTGCATAACGAGAGA
GACAGAGACAGAAAGAGAAGAGAAGGTAGACTAGGTGTGGAGAGAGAAAGGAAGATGTCTGGAACAAGTAAGAACTGGGGTAAGGGTCGTGGGAGCAGGGGCTGCTTGCCTGCTTTCCAC
TGTTATGACACATCCTTCGTGGTTGCTAGATCTTCAATTATGGGTAGTACTGAGGGAAGGCCAAGGAACCCCTTTCTCACCAGTGCCCTGTCTATCACTACTGCAGGGTAATGGGGCCCT
CTCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTACAAGCGATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAA
TCTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
GTAAGAATGCCAGGTTGTGGGCAGTGGGCAGGGAGCATGCCATATACCTTAGTGGGGAGTGCAGAGTCCTCTGGC
TGCCCTTCCTAGTCCTCGGGGTCTGGACAACAGAGCCTGGGCATTCTGACTCAGGCTCAGGGTCCTGTCCTGTATGCAATGCTGAGCTGCCTCAACTCTGGCTCCAGACTATGGCTTTGC
CTCACCACCCACGGACATCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCCTTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCAT
CACCTTCTTGCGGGAGACCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCTATGGCTGCAGTTGTCCTAGCTG
GTATGTGGCTTCTGGGTTGGGA
GCTGGGAGTGGTGTTGGTCCAGGTTAATCTCTCTGACATGCTGTCTCTTTCAGTTCTGCACAGTGCTGGACATGCAGTCAATGTGTACATTTTCTCAGTCAGTCCCCTCAGCCTGATGGC
CTGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GTCAGTTCGGGGGGAAAGGCACCCCTAGGGTCCTCTGGGTGGGCCTAAGGTTGTAACAGGAAAGAAATAGATGGCCACAGAAGCACA
GCTTCTCGGTGCGCGAAGGCACAACGCTCTGTCCCTCATGGAGAAGCTGTAAGGTGACTGATTCTCCTTCAGAGAGGAAGCTGTCCGTGACTAGCCAGGAAAGGACCTTAGAAGCTGACA
TCAGAGCCTCTTTCCCATTTGACACGTAGTTTATCATCTCAGTTCAATATTCACTCATTGGACAAATATTTATACACTGTGTACTGGGTGTTTTAGTCGAGTGCCAGAGAGCAGGGCTAT
TTCAGATAGCTGCAGTTGTTTAACACTTTTGCTATTCGCTTGATGTTGCTGTGCACAGAACGCAAGTCCCTGCCCTCACTAATCTCACAAAGTAATGGACCAAGTACTCCAGGAGGCAAG
CAAAGGCCAACTGCAGGAGGAGAATGTAAGATAGAATAGCAGGAAAAGCTGCCGTGGAGGAGGGGTAGAACCATGGCAGTGGACACCAGGTTCCTTGGAAGGATCCTGTAGCCATCTTAG
TGGAGTCCTAGTGACCTCAGGAGTTGGGGGCAGCATCAGGCTGTGGTGTGGGAGTGACAAACCCCCTTCTGCCTCAGCCAGAGGCTCACTGTGCCTTTACCCTCCAGGTCCAAGTTTCCC
CCAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAG
GTAGGAAATGGAGCTGGGTGTGGGATCTGAGCACCACAACTGTGTATCTCCTCCTTTTTCTCCTCTTCCTGAATACACATTGG
TCTATAATAGCCATAGGTCCCTTTTTGACAGGCCAATCCTGGCTCCAGCCCCTACAGATATGCTATTCTGAAGCCCTGAAGGAATAGAGTGGGAGTCAGGGAGAACGGTGTGGGGAGATG
TCCTGTTGTATTTTCAGCTCAGAATGAGGTTCCGAGGATAGCCACTGACCCTGTCCTCCCACCCCCAGGTATGACAGGAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGC
CTCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTATGTCCTG
GTGAGTACCTTCCCTGGGCAGGGGTGAACCTTGGAGGGGACACT
TGGGATGGAAAGGGACGGTAAAGTAGGGAAGATGGAGAATTAGGCCCAAGGACTGTGATCCTGACCTCTGTGACCCCACCCTGGTGTCTATGCTGGCCCATCAGAGCCAGTACCTGGGTA
GGCCATGCCTGACAAGTAGGAGCCACTCGGGACCTGGTGGGCGGAGCTATACTAACTGGCCCTGTTTCCAGATCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACA
TCTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
GTACAAGCCTGATCT
GACCCAGGCATGAGAACTAGTCTGCTCTTCAGACACGGTGGGGGCTTGTCGGGTTCCCTAACCCCTACCTCTGACAGTCTCTGGAGAAACCCTGAACCCTGAAGATTTTCGGATAGAAAT
GACACCCCAAGAAGGGCTATGCTTAGTTCCTGGTGAGACTTGCAGACAAGTTGGCTAGGGAGGTAACTGAGCAGTTAAGAGCACCTTCTGCTCTTTCAAAGGACCTGAGGTCAGTTCCCA
GCAGCCATGTCAGGGAATACAGTGCCCTCTTCTGTCCTTCATAGGCACCTGCACTCACGTGTGCATACACCTACACACACATAAACATATATATACGTAATTAAAAATAAAGAAAAAGAC
CCGCAGACAGTGAGAAGCGATGCTGGGAAAGAACAGGTGAAAACAACGTTCCAGGGGGAGGAAGGTGCCAGAGTGCAGCCTGGTTCAGAGAGAGGCAAAGAGCCCTGACAGCCAGACTGC
TGTTGTGTCTGAATTGGCCGTGGTGGAGGATTATATCACTACAGAGGGTGGCGGGAGGTGATTTAGGTATCATAATGTCGTCCAACCTCTAGGCTTAACATTGCAGAGAACTCAATTTCA
TATCCGTTTTGTTTCAGTTAAGATTACGTGAGGATAAAGGTAAAGTAGCCTTTCATACTCTCCAAAACTTGTTAACCTCCTTTTTCAACAAATATAAGTAAACCTGAGGCCCAGCTGTGG
ACAGGCAATAAACACTATCTAAGATTTGGTAAGATATGTTGCTTGTAAGGCATGCAGATATTTGTGAAAGTTCCCTTTGATAAGCATCTGGGTATCTGAAGAGTCCTTCACCCTGTAGGC
TGTTATTTCGGCAGTGCTATGAGCCTGCACCCTCTGCAGGATGAAGTGAGAGAGATTGGGGCAGAGGTGCCCAACTCTTATGTTACCCAGGAAACAGAGGAGCCTCAACTAACTACTCTC
AGCCCCAGCACACCACCTGTCTAGACCATGTGACTGTGTGAGACCCTCTGGGCCTCAGCAGTGTAGAAAGAACAGACACCTCTCAGTCTTCAAGGACAGTTTCTGATCTAGGCTCTTCTC
CCACTCTGACCTCTAGGGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCAGGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGTACCAATGAGTATCACCCC
TTCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGGACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTGGGTGGCACCTGTGCCAGA
TACCCAAAG
GTACCCACCCTTGGGCTTGTCCCTGCTCCCTGTACCCTGACACTAGATGCCCCAGTATCTTTCATGACAAAGCTAGTCTGAAAGAGTACTGAGGTGCGCTGGCCCCTTCCT
CTGTTACTCTTACCCATTTCTGGCCTCAGAGTTGGGGCAGGGCCTGGCTCATCTCACTTCTCCTTCTCCTGTCAGCTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCAT
AAGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
GTGAGC
ATCCTTTCTCCACTCTGTGTGGGAAATGTGTGGCCCTATGGATCCCTGCCACCACAACAGTGCACTCCTAGCTTCCAACCGTAGGGTCTGTGGGGACCATGACACACAGCAGTTCAAATA
TCAAATCCTGCGTGTGGATAGGAGAGTGGCCATAAGAATCTCTGCTGGCCTTTGTCCATTGGAGGCTAAGACCAGAACATCGCTCTCAACAGGATATCTGAGGCTTCCTAGCATCTCAGA
ATGTAGGGAGGGGAGCCAGACCTGAGCTGGCCCTGAGCCCCAGTGTGTGTCCTCAGATCTACTTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGACATCATCCGGG
AGGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTG
GTAGGTCAGGGCAAGCCAGCCATGGCAG
GAGAGCCACTCCCAGGGCCAAGGGTTGTTCCTAGCAATGGCCTTCCTCCCTGATTTTGTTCTCCAGTACATCTGTGAGAGGCACTTCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGC
CTGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAGGAAGTCCATCCACAG
GTGGGTCCATCCCCTCCCACCCTAGGACCATATTTAATTGTCTTA
TTTTGGTCTGATTATTGTGGTCAGGCTGCAGAAGCTGATCTTCCTGGGTTGTGGGCATCTCAGTACACATATCTCTCCATGGACAGGTACGTAAGATTGGAGTCTTCAGCTGTGGTCCTC
CAGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGCCCACTTTGTGCATCATTATGAGAACTTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGGTGGCCAGGACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGC
TGGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
GCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAAC
CCTCGCCGGCTAAGCGATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
GCTACCACGTGCTCTCAGACCTGGTGAGTGTGGAA
ACACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGAGCCGCTGGGACCACAAC
ACTGGACAGAGCCCCAGCAACCCCCGGGACCAG
AGCAACCAGGTGACCGGCTGGCTAGATGGCAGCGCCATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGA
GGACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCTCATGTGGATGGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
CCTTT
GGGGCCCAGCGCGGGAACAGGGAGCCCTTTCTGCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAA
CTGTTCCAGCATGCTCGCAAGAGGGTCATTGCCACCTACCAG
AACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTATTCAGGGTACCGCCCTTTCATG
GACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
AAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGT
TCAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAG
AACCCCAATCTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAG
CTGGAGGACAGGATAGTGATTGAAGACCTGAGAG
ATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCCCAGT
TATAGCCAGGCTCTGCTGGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
GTGCTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCC
CAGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTATTCAGCAACATCATTCTTGACCAGTTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAG
AACACTAGGAATGG
GCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCTCCAATGTGGACCCCAGTGCCTTGCAACCCAACGTTTTC
TTCTGGCAGGAAG
GTGCACCCTGCCCACAGCCTCGGCAACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTC
ACGCTTGTAGCTGTCTGCTGCTTTCCATTAG
TGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTCTGAAGAAA
CAACCAGCCAGTGATGGGGTACCAG
CAATGGAGTGGCCGGGCCCCAAGGAGAAGAGCTATCCAGTCACTCTCCAGTTGCTTCCAGACAGAAGTCTGCAGGTCCTTGACAAACGGTTCACT
GTGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAGCAACAGTGGACGTCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTGCTGATGTTT
AACTCTGAAGAGGACCGGGGTGCCTTCGTGCGGCTGTTGCAAGACCTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACC
AAGCAGCAACGGGCAGGCATCTTGGAGATCTTCTTCAGACAGCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCTGATGCAGGGACTCTGCCCCTGGACTCATCCCAGCAAGTGCGTGAG
GCTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTTTCTCTGGCTGACAAGGATGGCAATGGCTACATATCC
TTCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCTCTCCAAGGACGAGTTC
TTCACCATGATGCG
GTCCTTCATTGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCGGGAGTCTGGGTTCCAGGACAAGGAGGAGCTGACC
TGGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGATTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
GCACCAAGGACATCTTTAAACAAAGCAGTGCCTGTCGA
GTCTCGTTCATCAACCGGACTCCTGGGAACAG
GGTAATGGGGCCCTCTCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTAC
AAGCGATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAATCTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
ACTATGGCTTTGCCTCACCACCCACGGAC
ATCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCCTTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCTTGCGGGAG
ACCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCTATGGCTGCAGTTGTCCTAGCTG
TTCTGCACAGTGCTGGACATGCAGTCAATGTGTACATT
TTCTCAGTCAGTCCCCTCAGCCTGATGGCCTGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GTCCAAGTTTCCCCCAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAGGTATGACA
GGAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGCCTCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTATGTCCTG
ATCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACATCTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTGGAG
ATCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
GGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCAGGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGT
ACCAATGAGTATCACCCCTTCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGGACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTG
GGTGGCACCTGTGCCAGATACCCAAAG
CTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCATAAGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCC
TTTGCCTCCATCCTCAAAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
ATCTACTTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGAC
ATCATCCGGGAGGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTG
TACATCTGTGAGAGGCAC
TTCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGCCTGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAGGAAGTCCATCCACAG
GTACGTAAG
ATTGGAGTCTTCAGCTGTGGTCCTCCAGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGCCCACTTTGTGCATCATTATGAGAACTTCTGA

Retrieve as FASTA