Entry information : MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Entry ID 3341
Creation 2010-04-28 (Christophe Dunand)
Last sequence changes 2016-02-16 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-16 (Achraf Jemmat)
Peroxidase information: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Name (synonym) MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Muridae Mus
Organism Mus musculus (house mouse)    [TaxId: 10090 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox         Score   E-value   MmDuOx02-4 start..stop   S start..stop
RnoDuOx02-A   2934    0         1..1517                  1..1517
HsDuOx02      2653    0         1..1517                  1..1548
EcabDuOx02    2639    0         1..1517                  1..1517
CfaDuOx02     2619    0         1..1517                  1..1571
Gene structure File Exons
Exon    Start..End     Size   Exon    Start..End     Size   Exon    Start..End     Size   Exon    Start..End     Size
N° 1    578..647       70     N° 2    931..1020      90     N° 3    1203..1367     165    N° 4    1874..2061     188
N° 5    2224..2425     202    N° 6    2582..2748     167    N° 7    3389..3449     61     N° 8    3562..3658     97
N° 9    4064..4154     91     N° 10   4650..4752     103    N° 11   5276..5439     164    N° 12   5976..6151     176
N° 13   6366..6484     119    N° 14   6794..6931     138    N° 15   7189..7302     114    N° 16   7477..7679     203
N° 17   7962..8147     186    N° 18   9050..9275     226    N° 19   9358..9451     94     N° 20   11546..11733   188
N° 21   12191..12260   70     N° 22   13404..13582   179    N° 23   13765..13995   231    N° 24   14071..14170   100
N° 25   14845..14894   50     N° 26   15166..15293   128    N° 27   15529..15682   154    N° 28   16794..17026   233
N° 29   17213..17371   159    N° 30   17674..17829   156    N° 31   17924..18052   129    N° 32   18184..18306   123
join(578..647,931..1020,1203..1367,1874..2061,2224..2425,2582..2748,3389..3449,3562..3658,4064..4154,4650..4752,5276..5439,5976..6151,6366..6484,6794..6931,7189..7302,7477..7679,7962..8147,9050..9275,9358..9451,11546..11733,12191..12260,13404..13582,13765..13995,14071..14170,14845..14894,15166..15293,15529..15682,16794..17026,17213..17371,17674..17829,17924..18052,18184..18306)
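The exon coordinates above can be cross-checked against the protein length with a short script. The following is a minimal sketch, not part of the database entry: the helper name `parse_join` and the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last exon includes the stop codon.

```python
import re

# Exon coordinates copied from the join(...) statement of this entry.
LOCATION = (
    "join(578..647,931..1020,1203..1367,1874..2061,2224..2425,2582..2748,"
    "3389..3449,3562..3658,4064..4154,4650..4752,5276..5439,5976..6151,"
    "6366..6484,6794..6931,7189..7302,7477..7679,7962..8147,9050..9275,"
    "9358..9451,11546..11733,12191..12260,13404..13582,13765..13995,"
    "14071..14170,14845..14894,15166..15293,15529..15682,16794..17026,"
    "17213..17371,17674..17829,17924..18052,18184..18306)"
)

PROTEIN_LENGTH = 1517  # "Length (aa)" reported for the full protein


def parse_join(location):
    """Return a list of (start, end) pairs from a join(...) location string.

    Hypothetical helper: handles only plain 'start..end' spans.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(LOCATION)

# Coordinates are assumed 1-based and inclusive, so size = end - start + 1.
sizes = [end - start + 1 for start, end in exons]

# An intron is the gap between the end of one exon and the start of the next.
introns = [
    (exons[i][1] + 1, exons[i + 1][0] - 1) for i in range(len(exons) - 1)
]

cds_length = sum(sizes)

print(len(exons), "exons,", len(introns), "introns")
print("CDS length:", cds_length, "nt")
print("Codons (including stop):", cds_length // 3)
```

Under these assumptions the script finds 32 exons and 31 introns, and a spliced length of 4554 nt, which is 3 × (1517 + 1), that is, 1517 codons plus one stop codon.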



Literature and cross-references MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Protein ref. UniProtKB:   A2AQ99
DNA ref. GenBank:   NC_000068.6 (122298742..122279246)
Cluster/Prediction ref. GenBank:   214593
Protein sequence: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Sequence Properties (first value: protein; second value, in parentheses: mature protein)
Length (aa):      1517 (1492)
PWM (Da):         171369.4 (169014.0)
Transmb domain:   o600-622i1010-1032o1047-1069i1116-1138o1153-1175i1188-1210o (o575-597i985-1007o1022-1044i1091-1113o1128-1150i1163-1185o)
PI (pH):          7.73 (7.65)
Peptide Signal:   cut: 26 range: 26-1517
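The two sets of values above are related by the signal peptide: the mature protein spans residues 26-1517, so its coordinates are the full-protein coordinates shifted by 25. The sketch below illustrates that arithmetic on the transmembrane topology string. The function name `shift_topology` is hypothetical, and the reading of `o`/`i` as outside/inside markers between helix ranges is an assumption about the notation, not something stated in the entry.

```python
import re

FULL_LENGTH = 1517          # length of the full protein (aa)
MATURE_START = 26           # first residue of the mature protein ("range: 26-1517")
OFFSET = MATURE_START - 1   # residues removed with the signal peptide

# Topology strings copied from the entry (full protein, then mature protein).
FULL_TOPOLOGY = "o600-622i1010-1032o1047-1069i1116-1138o1153-1175i1188-1210o"
MATURE_TOPOLOGY = "o575-597i985-1007o1022-1044i1091-1113o1128-1150i1163-1185o"


def shift_topology(topology, offset):
    """Subtract `offset` from every residue number in a topology string.

    Hypothetical helper: the o/i letters and dashes are left untouched.
    """
    return re.sub(r"\d+", lambda m: str(int(m.group()) - offset), topology)


mature_length = FULL_LENGTH - OFFSET
shifted = shift_topology(FULL_TOPOLOGY, OFFSET)

print("Mature length:", mature_length)
print("Shifted topology:", shifted)
print("Matches entry:", shifted == MATURE_TOPOLOGY)
```

With an offset of 25 the shifted string reproduces the mature-protein topology listed in the entry, and the mature length comes out as 1492, in agreement with the "Length (aa)" line.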
Sequence 1558
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLPTSPKTLVLLGALLTGPLGPAGGQDAPSLPWEVQRYDGWFNNLKYHQRGAAGSRLRRLIPANYADGVYQALEEPLLPNPRRLSDAVAKGKAGLPSVHNRTVLGVFFGYHVLSDLVSVETPGCPAEFLNIYIPRGDPVFDPDKRGNVVLPFQRSRWDHNTGQSPSNPRDQSNQVTGWLDGSAIYGSSHSWSDTLRSFSGGQLASGPDPAFPRNSQSSLLMWMAPDPSTGRGGPQGVYAFGAQRGNREPFLQALGLLWFRYHNLCARKLAQEHPHWGDEELFQHARKRVIATYQNIALYQWLPSFLQKTPPEYSGYRPFMDPSISPEFVVASEQFLSTMVPPGVYMRNSSCHFRKFPKEGSDSSPALRVCNSYWIRENPNLKTAQDVDQLLLGMASQISELEDRIVIEDLRDYWPGPERFSRTDYVASSIQRGRDMGLPSYSQALLALGLEPPKNWSALNPQVEPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSNIILDQFVRLRDGDRYWFENTRNGLFSKEEIAEIRNTTLRDVLVAVSNVDPSALQPNVFFWQEGAPCPQPRQLTTDGLPQCAPVTVIDYFEGSGAGYGVTLVAVCCFPLVSLIVAGVVAHFRNREHKMLLKKGKESLKKQPASDGVPAMEWPGPKEKSYPVTLQLLPDRSLQVLDKRFTVLRTIQLQSPQQVNLILSSNSGRRTLLLKIPKEYDLVLMFNSEEDRGAFVRLLQDLCICCTPGLHIAEVDEKELLRKAVTKQQRAGILEIFFRQLFAQVLDINQADAGTLPLDSSQQVREALTCELSRAEFADSLGLKPQDMFVESMFSLADKDGNGYISFREFLDILVVFMKGSSEDKSRLMFTMYDLDGNGFLSKDEFFTMMRSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSDLRFTQLCVKGGAGGTKDIFKQSSACRVSFINRTPGNRVMGPSPRLYTEALQEKKQSGFLAQKFKQYKRFVENYRRHIVCVTIFSAICIGLFADRAYYYGFASPPTDIEETTYVGIILSRGTAASISFMFSYILLTMCRNLITFLRETFLNRYIPFDAAVDFHRWIAMAAVVLAVLHSAGHAVNVYIFSVSPLSLMACVFPNVFVNDGSKFPPKYYWWFFETVPGMTGVLLLLVLAIMYVFASHHFRRHSFRGFWLTHHLYVVLYVLIIIHGSYALIQLPSFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPSGVTYLQFQRPKTFEYKSGQWVRIACLDLGTNEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSPPVGGTCARYPKLYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSMGSQMLCKKIYFIWVTRTQRQFEWLADIIREVEENDCQDLVSVHIYITQLAEKFDLRTTMLYICERHFQKALNRSLFTGLRSITHFGRPPFELFFNSLQEVHPQVRKIGVFSCGPPGMTKNVEKACQLINRQDRAHFVHHYENF

Remarks Complete sequence from genomic DNA (chromosome 2, 31 introns).
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGGTACGTAGTAGGGGCCTCTTGCTGAAGGGGCTAGCCAGTGTCCCACAGGAG
TGTGGAAAGAGAAGGGAAGGCAGATGGCGCCTGGAAGCTGGAAGGGTTGAGCCATCTCTTATTGAGCTATTGAGTTGGAAGACTTCTGACCGCCTCGGCTCAGCGGCCAAAGGATTGGAG
ATGGCGTTCATTGGCAGGAGTTAGCCAGCTGAGGAAGGAGCCTAAAGCCCATCGGGTCATTCAGGGCATTCTGGAAGGTTAATGTTAAGTGGGAGGCTTTCTGCTCCCACAGGTGGCCAG
GACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGCTGGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
GTAAGCCCCGGGCTCTGTAGGGAGGGCGTGCAGTTGGA
GTGTGGGGAGAGGAGCCGGGGCGGCCTGCGGGATTGCATTCTCACCACATTCTTATCCATTCCTGGGGGTAGGAAATGGGAGGGGGCATGTGGGGCTTCTTGGGGACCCAGAGCGTTGAT
CCAGATCAACTCTCTGCGCTCCAGGCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAACCCTCGCCGGCTAAGCG
ATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
GTGAGTATAGGAACAGGAAACCAGGGTGGATCTGGAGCTTGCTGAGTGTCC
AGGGAAAGGACCACCGCAACTTGCAGAAAAAAGAACTTTCCAAGAAGTGCTTCTGTTATCATCTTTGAGGGTTTGGGTGTGATGCGGGGATCTTTCTCTTACTTTACTTCAAACCAACAC
GTACATTGGGTCCCCACTAGAACTCTCAGAAACTTATGCCCTGACCACTGCTGTCCTCAAGACTCTCATGACTGCCACACAGAGGTGAAGGGAGCTTGTTTTCAAATCGTATCCGCCTCC
CCCCCTCCCACCCCCGAAACGATATTGGCATGTTCACAAGGGGATGCTACGTTTTGTTTGCATCTTAGAAGATCAAGTCTGGCGGGGGGGGGGGGATAAGCTGAAGAGACAGCGTTGAAG
CTTGTGCTCTGGCCTCTGTGTCCTGATAATTCAGAGGCTCTCCTAAGGAGAGCGGGACCTGGTGATTTATCTACCACTTTGCTCTCTTAACCCAGGCTACCACGTGCTCTCAGACCTGGT
GAGTGTGGAAACACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGAGCCGCTG
GGACCACAACACTGGACAGAGCCCCAGCAACCCCCGGGACCAG
GTGAGGCCAGGCAGCCAAAGGCAGGAGGGCAGAGAGGCGGGAAGGGGGTCTGAGGCTGGATCCGCTGGAGGCCTGGG
ATCTGGCTGGGAAGACCAGACACGGTGAGGCTCTCCCTACCAGCAAGGACCTGCCTGGTCCTCCTGACTCTGCTCATCCCTGCAGAGCAACCAGGTGACCGGCTGGCTAGATGGCAGCGC
CATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGAGGACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCTCATGTGGAT
GGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
GTGAGGCTGCGGGGGTCCTGGCAGGAGGTGTAGGGTTAGAGGCACCAGGCTGCTTGGAGGCCCTGTACTGGAG
GAGGGGGGGACTGCCTGTGGGTCTGGGCTCCCCTGACTCAAGCTGTTGTCCATCTCCTGTCCTGCTTCCCCACATGGATGCAGCCTTTGGGGCCCAGCGCGGGAACAGGGAGCCCTTTCT
GCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAACTGTTCCAGCATGCTCGCAAGAGGGTCATTGC
CACCTACCAG
GTGAGTCGTGCCGCCTGCCTGGAGTCCACCTGTGTGTAGGGTCCAGAGAGACTCTGCACCCCCAACAAAGCTTCCGGGGTCCTGGACAATGCCTCTTCCCTATATACTCT
TTTCTAGGAGGGCCCTTCTTCAGGGAGAGGTAATGTGTAGGGAAAACTACTGTACATTTTAGAGAGGAAACCCTTTATTTATGCCAGATAATTTGGGATTTGATCATTATTTAACATATC
TCTCTTCTCAATAAATTCTTTGAAGCATCTAGCACTTGATGCTGTTGGGTTTTTTATGCTCTTTCTCCCATGATGTGCCCAAGGTCCAGGGAGACTCGGCTCCCCCAGGAAAGCTTTCCC
AGTCCTGGATAGCGCCTCTCCCCTCCCTAGTTGTGGCCCTTATTCCCAGAAGGGCTAATGGGAATACACAAACACACACACACACACACACACACACACACACACACACACACACACACA
CACACAGAGAGACCCTGTGAATGTTTACATGGGTGATTTTGCTTGTGATTATAATTTAATTGCCTGCCCCTCCTTCCCCTACAACTAGCCTTCCCTGGTCCTCATCTCTGCACCCAAGTC
TTCCTTCCCCAAAGGCTCAACCCAACCCCTCCTCCCCTTCTAATCCTCAGAACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTATTCAGGTAATGGGG
AGGGGTTGTGGAAGGTGGGGAGACCTAAGTGGAAGATCCACAGACAAAAGAGAGTTAGATTGTTTGATGCGGGGGTGTGAGGTAAGGCACTGTCTTGAAACAGGGTACCGCCCTTTCATG
GACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
GTAAGCGAAGACGAAGCCGGTTAGAGCTGCATAGAATTGG
AGGGGGGTGGGGTGAGTGGGCTTGGAGCTAGGGGCTTGCATCCCTCCAGTTTCATTCAGAAGAGAAGCAGAATTGTTGAGGGGATGCTGAAGCAGGATCCTGGGGTGGGAAGTTTGGGAT
CCATAGGAAATCTTCCAGCTCTAATAGAAGAATTTTGGTTGCTGAGCGTGAGGTGTGTCTGGCAGCTGGGGCGCCTTCTGACCTCCCACCATAGAGATCACTCTATCTGTAAATCCAAGC
CAATGTTCAAGAAAACTCATAAGTCATCCTTCACTCTCGTCTAGCCCAGAGAAATGCTCACCTCACTCACCACCTTAGTTTCTGCCCCCCAATTCCCTCATTGTATTTTTGTCACCCTAT
TTCAGAAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGTTCAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAGGTGAGTGTGAGATTGGGGTCAGGG
AGTGGGGGTGGAGTCAAATTCACGTAGCACAAATGCTTGGAAAAGTAGAAGTTTGGCTCTTGAGCAGCAGTGTAGCGAGTAAAAGGCAGAGAAAAGTATCTGGGGAGAATTGTTCGAGAG
CTGGAAATGCCTGGGCATGGGCGATGTGCATGCTGTCCTAGCTCTTCAGAAGCTCAGGCAGGACTACTGCTGTATACCAGGTGCTCACACAATCTACCAAGATAGAGAGGGAAGGAAAGG
AAGTGGAAGAAGGCTAGGGAGAAGGAGGGAGAAGGGAGGGGCAGAGGAAGGGGAAGAGGAGAAGAGAGTGAGAGACAATGAGAGATCAAGCAAGAGAGTGAGAGAGAGAGAGAGAGAGAG
GAAGAGAGAGAGAATATGAATGGAGGTCTGAGAGTGAGTGGGGGGGCCTGTTTTCCCAAGGGGCTACTGGTGGTCCAGGGTCAAAATCTTGAGTGTAGTGTTTACTTCCAGAACCCCAAT
CTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAGCTGGAGGACAGGATAGTGATTGAAGACCTGAGAG
GTGAGCTCAGAGGGTGGGGTGTGAGA
GCCTGGAAGTCTGAGGCGCCTTCTGAGTTAGTCAACAAGCAGCCCTGGGGTACCCATGATACAGACACACAGAGCTACACCCTCAGACTTCAAATAAATGTCAGAAGAAGTTGAAATTTG
AGCTCCCTGTCTCAATTATAGTATTGTGTTTGTGTTACTTTTATTTTGACTACGCCCGAATATGGTGGACTTGAGGGTGGTAGAGCAACCGAACACCCTCTCTGGTGTTTACTTAGGTCT
TAAGGGTTCTCATCCAGTCCTGTTTTAAAGTTGGGACATGAAGAGAAGGGAATTTCTGTCTGGTATCAAGAATTATCTTGAGTCTTAGAAAAAGTGCCCCTGACCACACATGGTGACATG
GGCTCCACAGCCAACAAGGTCGCTGATGTCATGTAGAGGAGTGCTGGGGACTGGGGCTGAGGAGAGCTCTAGACAGCTGGGTGTAAGGCTGCAATGCACGGGGATTGGAACTGACTCAAA
GTCTGGTCTCTTTTCAGATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCCCAGTTATAGCCAGGCTCTGCT
GGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
GTTAATATAAATAATAATAATAACAATGCAATAACAATGACAGTTGGAGGGGGGGAGAT
GGTTCAGTTGGTAACATGCTTGTCACACAAGCATGGAGACCTGAATTTGAATATTCAGCACCTACCTAAAACTCTGAGATGTTTTGGGGATCCCAGTTCTAGGGAGGCAGAGATAGGACT
CCTGACCCTTACTAGACAGCCAATCCAACCCAGCCAAAACAGTGATACCCAGGTTTAGGGAGAGACTCTGTTTCAAAGAATATGGTAGAGAGTGATTAAGGAAGACTCCCTGTGTCAACC
AGTGGCATCTATATGCACATATTCATGTACAGAAACATGTACTCCCACACACATATACACACAAGATAATGGCAGCCAAAACTTTGGATGGCAGTGCTGTTCCAAATATCTCTACACAGA
TGAACTCACTCGATCTGCATAAGAATTTATGTGGTAGTTACTGTCAGTGTTGCAGATTTACTGAGGACATTGTGGGTGAGGTCACGTGTACTTTCGTTACCTTCCATGATGCTCCAGGTG
CTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCCCAGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTATTCAGCAACATCATTCTTGACCAG
TTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAGAACACTAGGAATGG
GTAAGGCCTGGCTGGGCCCTGCTTCTGACTTCACCTTAGCGTGGGGCTCCAGACTCTCTGTCTGGCC
TTAGACAGCCTCCACAGGTCTTGATGCCAGGGGCCTACCACACTCTCGTCCACCCCAGTCTCTTTCTTCACATGAATCTTTGGGCCTGAGGTCACTGGAGGACTGAAATTCCCTTCCTAT
CCCAGCAATGGCCTTCCCCCCATTTAGGCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCTCCAATGTGGACCCCAGTGCCTTGCA
ACCCAACGTTTTCTTCTGGCAGGAAG
GTGAGTATCCAGGGAGAGCCGACAAGCAATCATATGGGGGCAGAGTCATCTGTGCTGTGCCATGGGCTTTTTGTGTTGGGCTGCCTTCCATGCC
GCTATGGTCTGGGCCTGCCCAAGAGCTATACCTAGCAATCAGGCAGGGTAGATGCTGAGAAATTAGATATGGAGGTCTTTTGAGAGGAACAAGTTCCAGAGGGGTAGGTTTGAGAATGGG
AGAAGATATTTGGTCCTTCCTAAGACAGCGGGAGAGGCTAACCGGAAGAGTAAGGATTCTCGGACTCAGACGTCTGTGACTTACTTATCCCGCAGGTGCACCCTGCCCACAGCCTCGGCA
ACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTCACGCTTGTAGCTGTCTGCTGCTTTCCATTAG
GTAAGCG
CTTTAGCTCTCCCTACCTCTTTTCTGACCTCCCCCTCTTCACAGACTGGCCCGGTTCATTCCTCCTCCCACCCCACCCCACTGCCCTCCACCACCCCACTACATGCTGGTTTTCAGGCTA
GCTGCTTTGCATAGCTTGAGCCAGACTCAGGGAGCTCCTGGGTAGCTAAGGAGCTCAACCTGTAGGTCCCCTTGAAGCTATGAGCAAGGGTTCTCCTTCCTTCTCACTAGTCTCTTCTGG
TCCCCTTCAGTGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTCTGAAGAAACAACCAGCCAGTGATGGGGTA
CCAG
GTGAGAAGCCTGAGGACCAGGGGGAGGGGACGGGAGCCAGGGCCTAAGGAGAAGAAACGTGTTACAGAGTGGGATGGAAGCAGGACTCACTAGAACTGCATGCTCAGATCAGAGAA
TCACATGGGCTCAAAGGCCTCTGTTGAGTCACCCACCACGCAATTGTCCTTTCCTCAGCAATGGAGTGGCCGGGCCCCAAGGAGAAGAGCTATCCAGTCACTCTCCAGTTGCTTCCAGAC
AGAAGTCTGCAGGTCCTTGACAAACGGTTCACTGTGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAGCAACAGTGGACGTCGCACCCTGCTGCTCAAG
ATCCCCAAGGAGTATGACCTG
GTACAGCTCATCCTGCCTTCCTCTGTGGGCTGTTTTCATACGTATCCCCTCTCATAGGCGAGTCTTCTCAGCTACAGGATTTCCCTGTGAAGGCTGATG
CTGGAGAAGGAGGCTCCTTTCAGGAGCTTTTTGACTTACATGACCCCCTGAGGTCACTCTCTGACCACAGCCTGGACGGTCACAGAAAAGACTTCTCTTCTTCCCTCAGCCTGAAGTCCT
TTGAAGCAGGACTAGATCAGAAAAGCCAGCTCCAGACATGATGAGGCTTTTTCTCCAACCTAGGTGCTGATGTTTAACTCTGAAGAGGACCGGGGTGCCTTCGTGCGGCTGTTGCAAGAC
CTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACCAAGCAGCAACGGGCAGGCATCTTGGAGATCTTCTTCAGACAGCTT
TTTGCTCAG
GTGCCATGAGTTATACCTGACACAGGGGATGGAGCCAACCTGAGCTGGTTGCGTTCTTGCAGGCGAATTAAGATTTCACACTGTAGGCTAGAGACAGACGGTCTTGATTGG
ATCCTTAGGCAATTCATGTATTTCTGTCTCTGTATTCCTCACCTCACCATCTAATGATCAGTGACTAAAGCTGTGTCCACTGCATTGCTGAGGGGGGCCGGTGAGGTCATCTGCATGACA
GATTCTCTAATGGTTCCTCTGGATCAGCAGCATCAGCTGACTTGGAAATGTATTAGAAACTCTGGTTCTCGGGATCCTCCCCAACAGATTGAATGAGGAGCTCTGGGGGTGGTACCCAGA
AAGCTTCATTTTAACAAATCTTTCTAAGTGATGCTGATATAGACTTGGGTTTGAAAATCACCATTTTATAGACAGTGTTTAGCCTAAGTACTTGGTACATTGTGGGCATAACAAATGGTC
GTGATGTGTTAGTTTTAAATTATCTGTACACTGATAGAAGGCAGATTCTAGGCACAAGGCAGTGCTATCTGAATTAGCATGACTATATCGGCTCAACTCTGTCATGCTATAGCCAATATC
CACTGCCTCTGAAACAGGCTCTTTCACTCTTCATCCCCCAACTTTGCTCCCTGCATCTTACTTTCCCCTTTTCTCCCCATCGATATTATCCTTGCTGGATGGAGCGTAGGTCCAGAGAGA
AGCAGAAAGCGTGTAAAGATTCAGTTGACCCCTCCTATAGGAGAGTAGAGACCCCAATCAGTATGTTTAGGAGTGGGTTTTCTGAACTGCACACAGGGGAGACATGAAGTCATTGAACTG
CTTCCATCCCAGATAGGTCAGCTAGGCCTGACCTCCTTCTCCTCTCCCTCTCTGTGACTGCCCAAGTGCAGGTGCTGGACATCAACCAGGCTGATGCAGGGACTCTGCCCCTGGACTCAT
CCCAGCAAGTGCGTGAGGCTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTTTCTCTGGCTGACAAGGATG
GCAATGGCTACATATCCTTCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GTGGTGGTGGGGTCTAGCAGAGCATCTGAGGAATCAGGAGTTTGTTAGCAAGGAGGTGACCTA
TATCCTCTTTCTCCCTCAGGCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCTCTCCAAGGACGAGTTCTTCACCATGATGCGGTATAGG
CTGGGCCTTCCCAATCCTGGAGTACCTTAATATTTTAAACACAAAAGCAATTCAGCTCAGAGGAGGGCAGGCAAGGTGGCTTAGTGGCCATCCATGTTGCATGGTACACACTATATCCTG
AATTCTCAAGACATCCAACTTCAAAGCATCACCCTGGCATGTCTCTGCCTTTAACAGGGTCCATAGCCACACTATCAAAAACTACCAGAAACATTGCTCTAGTTTTTTTTTTTTTTTATT
ATATGTAAGTACACTGTAGCTGTCCTCAGACACTCCAGAAGAGAGAGTCAGATCTCCTTACGGATAGTTATGAGCCACCATGTGGTTGCTGGGATTTGAACTCCTGACCTTCGGAAGAGC
AGTCGGGTGCTCTTACCCACTGAGCCATCTCACCAGCCCTGCTCTAGTTTTAAAATGTCAATAAAGACCAGCCGTGAGCACCCTCAACCCCTCATTCCCAAGGACAGAGCCTCAAGCACC
ACGAGTCATATCTCTTCACTTCTTCTTAAATCTGCTTTTTTTTTCCATGCTGAGATTAACATGTAGGCATTTGGTACATAGTCCACTGAAAAAGACAAAGAGAAATATATGAGGGTAAGT
AGACAGAGCACAGCAGCCCAGGTTCTTTTAAGGTGTCTTTGTGCCTTCCCATTGCTCTTAACATTCCCTTTCCCTGACAATCCCAACCCAGACCACTTCCTCCTCTTCCTCTTCTTTAGA
GGCTTTGCTTACTTATTTTTATTTTATGTGCATTTGTGTTTTGCCTGCATATATGTCTGTGTGAGGGTGTCAGATCCCCTGGAACTGGAATTACAAACAGTTGTGAGCTGCCATGTGAGT
GCTGAAAACTAAACCTGGGTCTTCTGGAAGAACAACTAGTGCTCTTAACTTCTGAGCAACTTTCCAGTCCCTAACATAAACTTCTTTTATTATTATTATTATATATTTTCTTTTTTTTTT
TTTTTTTTTGGTTTTTCGGGACAGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCACTTTGTACACCAGGCTAGCCTCGAACTCAGAAATCTGCCTGCCTCTGCCTCCCGAGTGCTG
AGATTAAAGACGTGTGCCACCATGCCCTGCTCTAGATATTTTCTTCATTTACATTTCAAATGCTATCCCCAAAGCCCCCTATATCCTCCCCCTGCCCTGCTCCCCAACCCACCCACTCCT
GCTTCCTGGCCCTGTCATCCCCCTGTACTGGGGTATATGATCTTCACAAAACCAAGGGCCTCTTCTCCTATTGATGGTCAACTAGGCCATCCTCTGCTACATATGCAACTGGAGACACAT
AGACTTCTAAAACAGTAAAAAAGACTACAACCTGTATCTCATCAGCATGTCACTGCAGTCCTGTCCTGGCTACATTTAGGAAATCCAGGAGGAGGAAGGACATCCCATTAGCAATGACAG
CTCACATTTTTTTCAGTCTATAATAAAATGTGTGCTCCTTGCTTTGCCTCGATGGCATTATGCTTTTTAGCATGGTGATGGTGCTTGCTTGTCCTTGCACACACACACACTTGTTTTAGT
CCCCGTGCACAACATAAGAAGAGTGGCCGCTGCAGCTGTTACCGTGGGTGCCACTGGACTTGGGCCTGAGTGGGTCCTCCACATGCCCTAGGGTATTGGCGTAAGGAGGCAGCCACCATT
TTAGCCACTCCCCTCCCACACTCGAGTGTCCTCTTTGGATGGTGAAAGTGGAATCTCAAAGGTGAATGGGTGTGGGTGGGTTTCCACGAAAGAGAGTCAGCAGCAAGCCCATCACAAGAG
AGGGCCATGGTGTATCCTGGACCCCATCATCTGCAACAAGTGTTAGTCTCTCCCATTTGTTTCCATTTCTTGGGGTGAGGGGCAGAGGGGGAGACACACTGCTTTTCTAAGCTGTCAAGA
TCCTTTGAGCCTGCGGCCATCACCTTGCCAGCCTTAGTGTCAGCTTTAGGCTGGATGACATTGCCCAGTGCCATTTCCCAGGCTGATGGTGACCACATTAGGTAATCTGGAGACCTGCCC
AAGCCTGACCTTGCTGGATGATAGCTACCATTTCTCTACTCCTCTAGGTCCTTCATTGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCGG
GAGTCTGGGTTCCAGGACAAGGAGGAGCTGACCTGGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGATTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
GTGTG
TGAGTGTGGAAGGCGAGGAGTCTTCCAAACAGAGTTTCTGGTAGACATAACACAGGTCTTCTTTTCTTTCCTCCGAGTCCTTGGTGGTTAGTACGGAGCTCAGCATGGATATTCAGAGAC
GAGTTGTACATTGTGCAGAGTTGTTCACTAAGCCAAGGGGCGAGCCATGGGCATTGGCTTGGGATGTATCTGGCCAGAGCAGGGGACCTTGTTTTCTTTTTCTCAAAGGACATGTAGAGT
TCTATCTTCCACATGGTAGCCTCTTTCTTCTTTGTGCACACTGACTGCCCTGGGTATAGGGAGGGAAATGGGGCTGAAGCCCCAGATCTGGACATGACTAAGTCACTATCGAATGCTGAG
CAGCCTTTCCAGGGCCCAGGAGGAGAACAGTCTCCCATCCCACCCTCTGCCCTTCCAGAGAAGCCAAGCTGAGCTCTTGTTCTCTTTTCTAGGCACCAAGGACATCTTTAAACAAAGCAG
TGCCTGTCGAGTCTCGTTCATCAACCGGACTCCTGGGAACAG
GTGTGTGGGGAGGGAGGGACTGGGTGGTGTCCTTGGCGAGATCCATTGAGGGAAACAAAGGAAGGAAGCCGAGCTAGC
TGACAGCCACTGAGCAGATGTTTGCCTTGGCAAAGTCAGTGTCAGCCCAAACTCCAGCTGCTTCCTACATACAGCCTATGTTGTTCAAGAGCTTTGGCTCTCTCCCTACCACTCCAGGCC
TGAGGTGCCCTAAGCACAGACGAGCGGATTGTAGCTGCTCATCCTGCTCTGTCCTTTCCTGCTCTTGGGGACCTTTCCTGGACTCCACCCCAACTTTGATATGGCAGTTCTTCTCCTGGC
TGCAACAGTCAAAAGAAAACAGGTTCCAGGACTTTGATTTGTCTCATTTTTTCCAGTTCCTGCCCCCCAGAAACGGCACTCTCTGACATGGAAACCCCAGAACTGGGAAGTACTGGCCTA
AAGAAGAGGTTTGGCAAAAAGTGAGTATCTCCTAGATTCCTGAATTCTCAGGATTCTTGGAGAGAAGTCTCAGGATCTCTGAGCCCTGACGCACCACCTCAAGTCTCCTTCCATTCGACA
CACATGATGTGGCACTAGTAGCTCTCCAGGAAACACCTGGACTGCTGTAGCTCCTGGGTCAGGAAACCTGCTGCCTCTTCTCCTGTAGAGTGCACCATAAAGGAGGGGCCCATGGCCCTT
AGCAGGAGAGACAGGACACTCTGAGAAAGTGGCTCCTTGAGCTTCCCTCGGGGCTGCCCACTAGAGGGTTGGAGCTTCCAGTGACCCACTCTTCACTCAGTCTCTAAACGCATCTTCTGG
CAAGATGCCTAGCTCATAGCTCCACCACAATGGCCATAAGAGGCCTCCTGGCTTCTGCTGCCTTGGGTATAGAAGTAGGCCAGTGTAGAGATCAAAGGATGGGGGTGCATAACGAGAGAG
ACAGAGACAGAAAGAGAAGAGAAGGTAGACTAGGTGTGGAGAGAGAAAGGAAGATGTCTGGAACAAGTAAGAACTGGGGTAAGGGTCGTGGGAGCAGGGGCTGCTTGCCTGCTTTCCACT
GTTATGACACATCCTTCGTGGTTGCTAGATCTTCAATTATGGGTAGTACTGAGGGAAGGCCAAGGAACCCCTTTCTCACCAGTGCCCTGTCTATCACTACTGCAGGGTAATGGGGCCCTC
TCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTACAAGCGATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAAT
CTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
GTAAGAATGCCAGGTTGTGGGCAGTGGGCAGGGAGCATGCCATATACCTTAGTGGGGAGTGCAGAGTCCTCTGGCT
GCCCTTCCTAGTCCTCGGGGTCTGGACAACAGAGCCTGGGCATTCTGACTCAGGCTCAGGGTCCTGTCCTGTATGCAATGCTGAGCTGCCTCAACTCTGGCTCCAGACTATGGCTTTGCC
TCACCACCCACGGACATCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCCTTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCATC
ACCTTCTTGCGGGAGACCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCTATGGCTGCAGTTGTCCTAGCTG
GTATGTGGCTTCTGGGTTGGGAG
CTGGGAGTGGTGTTGGTCCAGGTTAATCTCTCTGACATGCTGTCTCTTTCAGTTCTGCACAGTGCTGGACATGCAGTCAATGTGTACATTTTCTCAGTCAGTCCCCTCAGCCTGATGGCC
TGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GTCAGTTCGGGGGGAAAGGCACCCCTAGGGTCCTCTGGGTGGGCCTAAGGTTGTAACAGGAAAGAAATAGATGGCCACAGAAGCACAG
CTTCTCGGTGCGCGAAGGCACAACGCTCTGTCCCTCATGGAGAAGCTGTAAGGTGACTGATTCTCCTTCAGAGAGGAAGCTGTCCGTGACTAGCCAGGAAAGGACCTTAGAAGCTGACAT
CAGAGCCTCTTTCCCATTTGACACGTAGTTTATCATCTCAGTTCAATATTCACTCATTGGACAAATATTTATACACTGTGTACTGGGTGTTTTAGTCGAGTGCCAGAGAGCAGGGCTATT
TCAGATAGCTGCAGTTGTTTAACACTTTTGCTATTCGCTTGATGTTGCTGTGCACAGAACGCAAGTCCCTGCCCTCACTAATCTCACAAAGTAATGGACCAAGTACTCCAGGAGGCAAGC
AAAGGCCAACTGCAGGAGGAGAATGTAAGATAGAATAGCAGGAAAAGCTGCCGTGGAGGAGGGGTAGAACCATGGCAGTGGACACCAGGTTCCTTGGAAGGATCCTGTAGCCATCTTAGT
GGAGTCCTAGTGACCTCAGGAGTTGGGGGCAGCATCAGGCTGTGGTGTGGGAGTGACAAACCCCCTTCTGCCTCAGCCAGAGGCTCACTGTGCCTTTACCCTCCAGGTCCAAGTTTCCCC
CAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAG
GTAGGAAATGGAGCTGGGTGTGGGATCTGAGCACCACAACTGTGTATCTCCTCCTTTTTCTCCTCTTCCTGAATACACATTGGT
CTATAATAGCCATAGGTCCCTTTTTGACAGGCCAATCCTGGCTCCAGCCCCTACAGATATGCTATTCTGAAGCCCTGAAGGAATAGAGTGGGAGTCAGGGAGAACGGTGTGGGGAGATGT
CCTGTTGTATTTTCAGCTCAGAATGAGGTTCCGAGGATAGCCACTGACCCTGTCCTCCCACCCCCAGGTATGACAGGAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGCC
TCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTATGTCCTG
GTGAGTACCTTCCCTGGGCAGGGGTGAACCTTGGAGGGGACACTT
GGGATGGAAAGGGACGGTAAAGTAGGGAAGATGGAGAATTAGGCCCAAGGACTGTGATCCTGACCTCTGTGACCCCACCCTGGTGTCTATGCTGGCCCATCAGAGCCAGTACCTGGGTAG
GCCATGCCTGACAAGTAGGAGCCACTCGGGACCTGGTGGGCGGAGCTATACTAACTGGCCCTGTTTCCAGATCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACAT
CTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
GTACAAGCCTGATCTG
ACCCAGGCATGAGAACTAGTCTGCTCTTCAGACACGGTGGGGGCTTGTCGGGTTCCCTAACCCCTACCTCTGACAGTCTCTGGAGAAACCCTGAACCCTGAAGATTTTCGGATAGAAATG
ACACCCCAAGAAGGGCTATGCTTAGTTCCTGGTGAGACTTGCAGACAAGTTGGCTAGGGAGGTAACTGAGCAGTTAAGAGCACCTTCTGCTCTTTCAAAGGACCTGAGGTCAGTTCCCAG
CAGCCATGTCAGGGAATACAGTGCCCTCTTCTGTCCTTCATAGGCACCTGCACTCACGTGTGCATACACCTACACACACATAAACATATATATACGTAATTAAAAATAAAGAAAAAGACC
CGCAGACAGTGAGAAGCGATGCTGGGAAAGAACAGGTGAAAACAACGTTCCAGGGGGAGGAAGGTGCCAGAGTGCAGCCTGGTTCAGAGAGAGGCAAAGAGCCCTGACAGCCAGACTGCT
GTTGTGTCTGAATTGGCCGTGGTGGAGGATTATATCACTACAGAGGGTGGCGGGAGGTGATTTAGGTATCATAATGTCGTCCAACCTCTAGGCTTAACATTGCAGAGAACTCAATTTCAT
ATCCGTTTTGTTTCAGTTAAGATTACGTGAGGATAAAGGTAAAGTAGCCTTTCATACTCTCCAAAACTTGTTAACCTCCTTTTTCAACAAATATAAGTAAACCTGAGGCCCAGCTGTGGA
CAGGCAATAAACACTATCTAAGATTTGGTAAGATATGTTGCTTGTAAGGCATGCAGATATTTGTGAAAGTTCCCTTTGATAAGCATCTGGGTATCTGAAGAGTCCTTCACCCTGTAGGCT
GTTATTTCGGCAGTGCTATGAGCCTGCACCCTCTGCAGGATGAAGTGAGAGAGATTGGGGCAGAGGTGCCCAACTCTTATGTTACCCAGGAAACAGAGGAGCCTCAACTAACTACTCTCA
GCCCCAGCACACCACCTGTCTAGACCATGTGACTGTGTGAGACCCTCTGGGCCTCAGCAGTGTAGAAAGAACAGACACCTCTCAGTCTTCAAGGACAGTTTCTGATCTAGGCTCTTCTCC
CACTCTGACCTCTAGGGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCAGGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGTACCAATGAGTATCACCCCT
TCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGGACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTGGGTGGCACCTGTGCCAGAT
ACCCAAAG
GTACCCACCCTTGGGCTTGTCCCTGCTCCCTGTACCCTGACACTAGATGCCCCAGTATCTTTCATGACAAAGCTAGTCTGAAAGAGTACTGAGGTGCGCTGGCCCCTTCCTC
TGTTACTCTTACCCATTTCTGGCCTCAGAGTTGGGGCAGGGCCTGGCTCATCTCACTTCTCCTTCTCCTGTCAGCTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCATA
AGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
GTGAGCA
TCCTTTCTCCACTCTGTGTGGGAAATGTGTGGCCCTATGGATCCCTGCCACCACAACAGTGCACTCCTAGCTTCCAACCGTAGGGTCTGTGGGGACCATGACACACAGCAGTTCAAATAT
CAAATCCTGCGTGTGGATAGGAGAGTGGCCATAAGAATCTCTGCTGGCCTTTGTCCATTGGAGGCTAAGACCAGAACATCGCTCTCAACAGGATATCTGAGGCTTCCTAGCATCTCAGAA
TGTAGGGAGGGGAGCCAGACCTGAGCTGGCCCTGAGCCCCAGTGTGTGTCCTCAGATCTACTTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGACATCATCCGGGA
GGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTG
GTAGGTCAGGGCAAGCCAGCCATGGCAGG
AGAGCCACTCCCAGGGCCAAGGGTTGTTCCTAGCAATGGCCTTCCTCCCTGATTTTGTTCTCCAGTACATCTGTGAGAGGCACTTCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGCC
TGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAGGAAGTCCATCCACAG
GTGGGTCCATCCCCTCCCACCCTAGGACCATATTTAATTGTCTTAT
TTTGGTCTGATTATTGTGGTCAGGCTGCAGAAGCTGATCTTCCTGGGTTGTGGGCATCTCAGTACACATATCTCTCCATGGACAGGTACGTAAGATTGGAGTCTTCAGCTGTGGTCCTCC
AGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGCCCACTTTGTGCATCATTATGAGAACTTCTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGGTGGCCAGGACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGCT
GGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
GCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAACC
CTCGCCGGCTAAGCGATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
GCTACCACGTGCTCTCAGACCTGGTGAGTGTGGAAA
CACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGAGCCGCTGGGACCACAACA
CTGGACAGAGCCCCAGCAACCCCCGGGACCAG
AGCAACCAGGTGACCGGCTGGCTAGATGGCAGCGCCATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGAG
GACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCTCATGTGGATGGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
CCTTTG
GGGCCCAGCGCGGGAACAGGGAGCCCTTTCTGCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAAC
TGTTCCAGCATGCTCGCAAGAGGGTCATTGCCACCTACCAG
AACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTATTCAGGGTACCGCCCTTTCATGG
ACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
AAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGTT
CAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAG
AACCCCAATCTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAGC
TGGAGGACAGGATAGTGATTGAAGACCTGAGAG
ATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCCCAGTT
ATAGCCAGGCTCTGCTGGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
GTGCTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCCC
AGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTATTCAGCAACATCATTCTTGACCAGTTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAGA
ACACTAGGAATGG
GCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCTCCAATGTGGACCCCAGTGCCTTGCAACCCAACGTTTTCT
TCTGGCAGGAAG
GTGCACCCTGCCCACAGCCTCGGCAACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTCA
CGCTTGTAGCTGTCTGCTGCTTTCCATTAG
TGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTCTGAAGAAAC
AACCAGCCAGTGATGGGGTACCAG
CAATGGAGTGGCCGGGCCCCAAGGAGAAGAGCTATCCAGTCACTCTCCAGTTGCTTCCAGACAGAAGTCTGCAGGTCCTTGACAAACGGTTCACTG
TGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAGCAACAGTGGACGTCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTGCTGATGTTTA
ACTCTGAAGAGGACCGGGGTGCCTTCGTGCGGCTGTTGCAAGACCTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACCA
AGCAGCAACGGGCAGGCATCTTGGAGATCTTCTTCAGACAGCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCTGATGCAGGGACTCTGCCCCTGGACTCATCCCAGCAAGTGCGTGAGG
CTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTTTCTCTGGCTGACAAGGATGGCAATGGCTACATATCCT
TCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCTCTCCAAGGACGAGTTCT
TCACCATGATGCG
GTCCTTCATTGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCGGGAGTCTGGGTTCCAGGACAAGGAGGAGCTGACCT
GGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGATTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
GCACCAAGGACATCTTTAAACAAAGCAGTGCCTGTCGAG
TCTCGTTCATCAACCGGACTCCTGGGAACAG
GGTAATGGGGCCCTCTCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTACA
AGCGATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAATCTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
ACTATGGCTTTGCCTCACCACCCACGGACA
TCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCCTTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCTTGCGGGAGA
CCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCTATGGCTGCAGTTGTCCTAGCTG
TTCTGCACAGTGCTGGACATGCAGTCAATGTGTACATTT
TCTCAGTCAGTCCCCTCAGCCTGATGGCCTGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GTCCAAGTTTCCCCCAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAGGTATGACAG
GAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGCCTCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTATGTCCTG
A
TCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACATCTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTGGAGA
TCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
GGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCAGGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGTA
CCAATGAGTATCACCCCTTCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGGACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTGG
GTGGCACCTGTGCCAGATACCCAAAG
CTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCATAAGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCCT
TTGCCTCCATCCTCAAAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
ATCTACTTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGACA
TCATCCGGGAGGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTG
TACATCTGTGAGAGGCACT
TCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGCCTGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAGGAAGTCCATCCACAG
GTACGTAAGA
TTGGAGTCTTCAGCTGTGGTCCTCCAGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGCCCACTTTGTGCATCATTATGAGAACTTCTGA
