Entry information : MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Entry ID 3341
Creation 2010-04-28 (Christophe Dunand)
Last sequence changes 2016-02-16 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-16 (Achraf Jemmat)
Peroxidase information: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Name (synonym) MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Muridae Mus
Organism Mus musculus (house mouse)    [TaxId: 10090 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value MmDuOx02-4
start..stop
S start..stop
RnoDuOx02-A 2934 0 1..1517 1..1517
HsDuOx02 2653 0 1..1517 1..1548
EcabDuOx02 2639 0 1..1517 1..1517
CfaDuOx02 2619 0 1..1517 1..1571
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '3341' 'join(578..647,931..1020,1203..1367,1874..2061,2224..2425,2582..2748,3389..3449,3562..3658,4064..4154,4650..4752,5276..5439,5976..6151,6366..6484,6794..6931,7189..7302,7477..7679,7962..8147,9050..9275,9358..9451,11546..11733,12191..12260,13404..13582,13765..13995,14071..14170,14845..14894,15166..15293,15529..15682,16794..17026,17213..17371,17674..17829,17924..18052,18184..18306)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 578..647 68 N° 2 931..1020 88 N° 3 1203..1367 163 N° 4 1874..2061 186
N° 5 2224..2425 200 N° 6 2582..2748 165 N° 7 3389..3449 59 N° 8 3562..3658 95
N° 9 4064..4154 89 N° 10 4650..4752 101 N° 11 5276..5439 162 N° 12 5976..6151 174
N° 13 6366..6484 117 N° 14 6794..6931 136 N° 15 7189..7302 112 N° 16 7477..7679 201
N° 17 7962..8147 184 N° 18 9050..9275 224 N° 19 9358..9451 92 N° 20 11546..11733 186
N° 21 12191..12260 68 N° 22 13404..13582 177 N° 23 13765..13995 229 N° 24 14071..14170 98
N° 25 14845..14894 48 N° 26 15166..15293 126 N° 27 15529..15682 152 N° 28 16794..17026 231
N° 29 17213..17371 157 N° 30 17674..17829 154 N° 31 17924..18052 127 N° 32 18184..18306 121
join(578..647,931..1020,1203..1367,1874..2061,2224..2425,2582..2748,3389..3449,3 562..3658,4064..4154,4650..4752,5276..5439,5976..6151,6366..6484,6794..6931,7189 ..7302,7477..7679,7962..8147,9050..9275,9358..9451,11546..11733,12191..12260,134 04..13582,13765..13995,14071..14170,14845..14894,15166..15293,15529..15682,16794 ..17026,17213..17371,17674..17829,17924..18052,18184..18306)


exon

Literature and cross-references MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Protein ref. UniProtKB:   A2AQ99
DNA ref. GenBank:   NC_000068.6 (122298742..122279246)
Cluster/Prediction ref. Genebank:   214593
Protein sequence: MmDuOx02-4 ( DUOX2 / / MmDuOx02-4)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1517 (1492)
PWM (Da):   %s   171369.4 (169014.0) Transmb domain:   %s   o600-622i1010-1032o1047-1069i1116-1138o1153-1175i1188-1210o (o575-597i985-1007o1022-1044i1091-1113o1128-1150i1163-1185o)
PI (pH):   %s   7.73 (7.65) Peptide Signal:   %s   cut: 26 range:26-1517
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLPTSPKTLVLLGALLTGPLGPAGGQDAPSLPWEVQRYDGWFNNLKYHQpan>RGAAGSRLRRLIPANYADGVYQALEEPLLPNPRRLSDAVAKGKAGLPSVHNRTVLGVFFGYHVLSDLVSVETPGCPAEFLNIYIPRGDPVFDPDKRGNVVL
PFQRSRWDHNTGQSPSNPRD
SNQVTGWLDGSAIYGSSHSWSDTLRSFSGGQLASGPDPAFPRNSQSSLLMWMAPDPSTGRGGPQGVYAFGAQRGNREPFLQALGLLWFRYHNLCARKLAQ
EHPHWGDEELFQHARKRVIATY
NIALYQWLPSFLQKTPPEYSGYRPFMDPSISPEFVVASEQFLSTMVPPGVYMRNSSCHFRKFPKEGSDSSPALRVCNSYWIRENPNLKTAQDVDQLLL
GMASQISELEDRIVIEDL
DYWPGPERFSRTDYVASSIQRGRDMGLPSYSQALLALGLEPPKNWSALNPQVEPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSNIILDQFVRLRD
GDRYWFENTR
GLFSKEEIAEIRNTTLRDVLVAVSNVDPSALQPNVFFWQGAPCPQPRQLTTDGLPQCAPVTVIDYFEGSGAGYGVTLVAVCCFPLVSLIVAGVVAHFRNREHKMLLKKGK
ESLKKQPASDGVP
AMEWPGPKEKSYPVTLQLLPDRSLQVLDKRFTVLRTIQLQSPQQVNLILSSNSGRRTLLLKIPKEYDLVLMFNSEEDRGAFVRLLQDLCICCTPGLHIAEVDEKELL
RKAVTKQQRAGILEIFFRQLFA
QVLDINQADAGTLPLDSSQQVREALTCELSRAEFADSLGLKPQDMFVESMFSLADKDGNGYISFREFLDILVVFMGSSEDKSRLMFTMYDLDGNGFLS
KDEFFTMM
RSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSDLRFTQLCVKGGAGTKDIFKQSSACRVSFINRTPGNRVMGPSPRLYTEALQEKKQSGFLAQKF
KQYKRFVENYRRHIVCVTIFSAICIGLFADRA
YYGFASPPTDIEETTYVGIILSRGTAASISFMFSYILLTMCRNLITFLRETFLNRYIPFDAAVDFHRWIAMAAVVLAVLHSAGHAVNV
YIFSVSPLSLMACVFPNVFVND
GSKFPPKYYWWFFETVGMTGVLLLLVLAIMYVFASHHFRRHSFRGFWLTHHLYVVLYVIIIHGSYALIQLPSFHIYFLVPAIIYGGDKLVSLSRKKVE
ISVVKAELLPS
GVTYLQFQRPKTFEYKSGQWVRIACLDLGTNEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSPPVGGTCARYPLYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPF
ASILKDLVFKSSMGSQMLCKK
IYFIWVTRTQRQFEWLADIIREVEENDCQDLVSVHIYITQLAEKFDLRTTMLYICERHFQKALNRSLFTGLRSITHFGRPPFELFFNSLQEVHPQVRKI
GVFSCGPPGMTKNVEKACQLINRQDRAHFVHHYENF*

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 2, 31 introns).
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGAGGTACGTAGTAGGGGCCTCTTGCTGAAGGGGCTAGCCAGTGTCCCACAG
GAGTGTGGAAAGAGAAGGGAAGGCAGATGGCGCCTGGAAGCTGGAAGGGTTGAGCCATCTCTTATTGAGCTATTGAGTTGGAAGACTTCTGACCGCCTCGGCTCAGCGGCCAAAGGATTG
GAGATGGCGTTCATTGGCAGGAGTTAGCCAGCTGAGGAAGGAGCCTAAAGCCCATCGGGTCATTCAGGGCATTCTGGAAGGTTAATGTTAAGTGGGAGGCTTTCTGCTCCCACAGGTGGC
CAGGACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGCTGGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
TGGTAAGCCCCGGGCTCTGTAGGGAGGGCGTGCAG
TTGGAGTGTGGGGAGAGGAGCCGGGGCGGCCTGCGGGATTGCATTCTCACCACATTCTTATCCATTCCTGGGGGTAGGAAATGGGAGGGGGCATGTGGGGCTTCTTGGGGACCCAGAGCG
TTGATCCAGATCAACTCTCTGCGCTCCAGGCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAACCCTCGCCGGCT
AAGCGATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
TGGTGAGTATAGGAACAGGAAACCAGGGTGGATCTGGAGCTTGCTG
AGTGTCCAGGGAAAGGACCACCGCAACTTGCAGAAAAAAGAACTTTCCAAGAAGTGCTTCTGTTATCATCTTTGAGGGTTTGGGTGTGATGCGGGGATCTTTCTCTTACTTTACTTCAAA
CCAACACGTACATTGGGTCCCCACTAGAACTCTCAGAAACTTATGCCCTGACCACTGCTGTCCTCAAGACTCTCATGACTGCCACACAGAGGTGAAGGGAGCTTGTTTTCAAATCGTATC
CGCCTCCCCCCCTCCCACCCCCGAAACGATATTGGCATGTTCACAAGGGGATGCTACGTTTTGTTTGCATCTTAGAAGATCAAGTCTGGCGGGGGGGGGGGGATAAGCTGAAGAGACAGC
GTTGAAGCTTGTGCTCTGGCCTCTGTGTCCTGATAATTCAGAGGCTCTCCTAAGGAGAGCGGGACCTGGTGATTTATCTACCACTTTGCTCTCTTAACCCAGGCTACCACGTGCTCTCAG
ACCTGGTGAGTGTGGAAACACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGA
GCCGCTGGGACCACAACACTGGACAGAGCCCCAGCAACCCCCGGGACCAG
AGGTGAGGCCAGGCAGCCAAAGGCAGGAGGGCAGAGAGGCGGGAAGGGGGTCTGAGGCTGGATCCGCTGG
AGGCCTGGGATCTGGCTGGGAAGACCAGACACGGTGAGGCTCTCCCTACCAGCAAGGACCTGCCTGGTCCTCCTGACTCTGCTCATCCCTGCAGAGCAACCAGGTGACCGGCTGGCTAGA
TGGCAGCGCCATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGAGGACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCT
CATGTGGATGGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
TGGTGAGGCTGCGGGGGTCCTGGCAGGAGGTGTAGGGTTAGAGGCACCAGGCTGCTTGGAGGCC
CTGTACTGGAGGAGGGGGGGACTGCCTGTGGGTCTGGGCTCCCCTGACTCAAGCTGTTGTCCATCTCCTGTCCTGCTTCCCCACATGGATGCAGCCTTTGGGGCCCAGCGCGGGAACAGG
GAGCCCTTTCTGCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAACTGTTCCAGCATGCTCGCAAG
AGGGTCATTGCCACCTACCAG
AGGTGAGTCGTGCCGCCTGCCTGGAGTCCACCTGTGTGTAGGGTCCAGAGAGACTCTGCACCCCCAACAAAGCTTCCGGGGTCCTGGACAATGCCTCTT
CCCTATATACTCTTTTCTAGGAGGGCCCTTCTTCAGGGAGAGGTAATGTGTAGGGAAAACTACTGTACATTTTAGAGAGGAAACCCTTTATTTATGCCAGATAATTTGGGATTTGATCAT
TATTTAACATATCTCTCTTCTCAATAAATTCTTTGAAGCATCTAGCACTTGATGCTGTTGGGTTTTTTATGCTCTTTCTCCCATGATGTGCCCAAGGTCCAGGGAGACTCGGCTCCCCCA
GGAAAGCTTTCCCAGTCCTGGATAGCGCCTCTCCCCTCCCTAGTTGTGGCCCTTATTCCCAGAAGGGCTAATGGGAATACACAAACACACACACACACACACACACACACACACACACAC
ACACACACACACACACACAGAGAGACCCTGTGAATGTTTACATGGGTGATTTTGCTTGTGATTATAATTTAATTGCCTGCCCCTCCTTCCCCTACAACTAGCCTTCCCTGGTCCTCATCT
CTGCACCCAAGTCTTCCTTCCCCAAAGGCTCAACCCAACCCCTCCTCCCCTTCTAATCCTCAGAACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTAT
TCAG
AGGTAATGGGGAGGGGTTGTGGAAGGTGGGGAGACCTAAGTGGAAGATCCACAGACAAAAGAGAGTTAGATTGTTTGATGCGGGGGTGTGAGGTAAGGCACTGTCTTGAAACAGGG
TACCGCCCTTTCATGGACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
AGGTAAGCGAAGACGAAGCCGGTTA
GAGCTGCATAGAATTGGAGGGGGGTGGGGTGAGTGGGCTTGGAGCTAGGGGCTTGCATCCCTCCAGTTTCATTCAGAAGAGAAGCAGAATTGTTGAGGGGATGCTGAAGCAGGATCCTGG
GGTGGGAAGTTTGGGATCCATAGGAAATCTTCCAGCTCTAATAGAAGAATTTTGGTTGCTGAGCGTGAGGTGTGTCTGGCAGCTGGGGCGCCTTCTGACCTCCCACCATAGAGATCACTC
TATCTGTAAATCCAAGCCAATGTTCAAGAAAACTCATAAGTCATCCTTCACTCTCGTCTAGCCCAGAGAAATGCTCACCTCACTCACCACCTTAGTTTCTGCCCCCCAATTCCCTCATTG
TATTTTTGTCACCCTATTTCAGAAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGTTCAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAGAGGTGAG
TGTGAGATTGGGGTCAGGGAGTGGGGGTGGAGTCAAATTCACGTAGCACAAATGCTTGGAAAAGTAGAAGTTTGGCTCTTGAGCAGCAGTGTAGCGAGTAAAAGGCAGAGAAAAGTATCT
GGGGAGAATTGTTCGAGAGCTGGAAATGCCTGGGCATGGGCGATGTGCATGCTGTCCTAGCTCTTCAGAAGCTCAGGCAGGACTACTGCTGTATACCAGGTGCTCACACAATCTACCAAG
ATAGAGAGGGAAGGAAAGGAAGTGGAAGAAGGCTAGGGAGAAGGAGGGAGAAGGGAGGGGCAGAGGAAGGGGAAGAGGAGAAGAGAGTGAGAGACAATGAGAGATCAAGCAAGAGAGTGA
GAGAGAGAGAGAGAGAGAGGAAGAGAGAGAGAATATGAATGGAGGTCTGAGAGTGAGTGGGGGGGCCTGTTTTCCCAAGGGGCTACTGGTGGTCCAGGGTCAAAATCTTGAGTGTAGTGT
TTACTTCCAGAACCCCAATCTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAGCTGGAGGACAGGATAGTGATTGAAGACCTGAGAGAGGTGAG
CTCAGAGGGTGGGGTGTGAGAGCCTGGAAGTCTGAGGCGCCTTCTGAGTTAGTCAACAAGCAGCCCTGGGGTACCCATGATACAGACACACAGAGCTACACCCTCAGACTTCAAATAAAT
GTCAGAAGAAGTTGAAATTTGAGCTCCCTGTCTCAATTATAGTATTGTGTTTGTGTTACTTTTATTTTGACTACGCCCGAATATGGTGGACTTGAGGGTGGTAGAGCAACCGAACACCCT
CTCTGGTGTTTACTTAGGTCTTAAGGGTTCTCATCCAGTCCTGTTTTAAAGTTGGGACATGAAGAGAAGGGAATTTCTGTCTGGTATCAAGAATTATCTTGAGTCTTAGAAAAAGTGCCC
CTGACCACACATGGTGACATGGGCTCCACAGCCAACAAGGTCGCTGATGTCATGTAGAGGAGTGCTGGGGACTGGGGCTGAGGAGAGCTCTAGACAGCTGGGTGTAAGGCTGCAATGCAC
GGGGATTGGAACTGACTCAAAGTCTGGTCTCTTTTCAGATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCC
CAGTTATAGCCAGGCTCTGCTGGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
AGGTTAATATAAATAATAATAATAACAATGCAATAACA
ATGACAGTTGGAGGGGGGGAGATGGTTCAGTTGGTAACATGCTTGTCACACAAGCATGGAGACCTGAATTTGAATATTCAGCACCTACCTAAAACTCTGAGATGTTTTGGGGATCCCAGT
TCTAGGGAGGCAGAGATAGGACTCCTGACCCTTACTAGACAGCCAATCCAACCCAGCCAAAACAGTGATACCCAGGTTTAGGGAGAGACTCTGTTTCAAAGAATATGGTAGAGAGTGATT
AAGGAAGACTCCCTGTGTCAACCAGTGGCATCTATATGCACATATTCATGTACAGAAACATGTACTCCCACACACATATACACACAAGATAATGGCAGCCAAAACTTTGGATGGCAGTGC
TGTTCCAAATATCTCTACACAGATGAACTCACTCGATCTGCATAAGAATTTATGTGGTAGTTACTGTCAGTGTTGCAGATTTACTGAGGACATTGTGGGTGAGGTCACGTGTACTTTCGT
TACCTTCCATGATGCTCCAGGTGCTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCCCAGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTAT
TCAGCAACATCATTCTTGACCAGTTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAGAACACTAGGAATGG
GGGTAAGGCCTGGCTGGGCCCTGCTTCTGACTTCACCTTAGCGT
GGGGCTCCAGACTCTCTGTCTGGCCTTAGACAGCCTCCACAGGTCTTGATGCCAGGGGCCTACCACACTCTCGTCCACCCCAGTCTCTTTCTTCACATGAATCTTTGGGCCTGAGGTCAC
TGGAGGACTGAAATTCCCTTCCTATCCCAGCAATGGCCTTCCCCCCATTTAGGCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCT
CCAATGTGGACCCCAGTGCCTTGCAACCCAACGTTTTCTTCTGGCAGGAAG
AGGTGAGTATCCAGGGAGAGCCGACAAGCAATCATATGGGGGCAGAGTCATCTGTGCTGTGCCATGGGC
TTTTTGTGTTGGGCTGCCTTCCATGCCGCTATGGTCTGGGCCTGCCCAAGAGCTATACCTAGCAATCAGGCAGGGTAGATGCTGAGAAATTAGATATGGAGGTCTTTTGAGAGGAACAAG
TTCCAGAGGGGTAGGTTTGAGAATGGGAGAAGATATTTGGTCCTTCCTAAGACAGCGGGAGAGGCTAACCGGAAGAGTAAGGATTCTCGGACTCAGACGTCTGTGACTTACTTATCCCGC
AGGTGCACCCTGCCCACAGCCTCGGCAACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTCACGCTTGTAGC
TGTCTGCTGCTTTCCATTAG
AGGTAAGCGCTTTAGCTCTCCCTACCTCTTTTCTGACCTCCCCCTCTTCACAGACTGGCCCGGTTCATTCCTCCTCCCACCCCACCCCACTGCCCTCCAC
CACCCCACTACATGCTGGTTTTCAGGCTAGCTGCTTTGCATAGCTTGAGCCAGACTCAGGGAGCTCCTGGGTAGCTAAGGAGCTCAACCTGTAGGTCCCCTTGAAGCTATGAGCAAGGGT
TCTCCTTCCTTCTCACTAGTCTCTTCTGGTCCCCTTCAGTGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTC
TGAAGAAACAACCAGCCAGTGATGGGGTACCAG
AGGTGAGAAGCCTGAGGACCAGGGGGAGGGGACGGGAGCCAGGGCCTAAGGAGAAGAAACGTGTTACAGAGTGGGATGGAAGCAGGA
CTCACTAGAACTGCATGCTCAGATCAGAGAATCACATGGGCTCAAAGGCCTCTGTTGAGTCACCCACCACGCAATTGTCCTTTCCTCAGCAATGGAGTGGCCGGGCCCCAAGGAGAAGAG
CTATCCAGTCACTCTCCAGTTGCTTCCAGACAGAAGTCTGCAGGTCCTTGACAAACGGTTCACTGTGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAG
CAACAGTGGACGTCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
TGGTACAGCTCATCCTGCCTTCCTCTGTGGGCTGTTTTCATACGTATCCCCTCTCATAGGCGAGTCTT
CTCAGCTACAGGATTTCCCTGTGAAGGCTGATGCTGGAGAAGGAGGCTCCTTTCAGGAGCTTTTTGACTTACATGACCCCCTGAGGTCACTCTCTGACCACAGCCTGGACGGTCACAGAA
AAGACTTCTCTTCTTCCCTCAGCCTGAAGTCCTTTGAAGCAGGACTAGATCAGAAAAGCCAGCTCCAGACATGATGAGGCTTTTTCTCCAACCTAGGTGCTGATGTTTAACTCTGAAGAG
GACCGGGGTGCCTTCGTGCGGCTGTTGCAAGACCTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACCAAGCAGCAACGG
GCAGGCATCTTGGAGATCTTCTTCAGACAGCTTTTTGCTCAG
AGGTGCCATGAGTTATACCTGACACAGGGGATGGAGCCAACCTGAGCTGGTTGCGTTCTTGCAGGCGAATTAAGATTT
CACACTGTAGGCTAGAGACAGACGGTCTTGATTGGATCCTTAGGCAATTCATGTATTTCTGTCTCTGTATTCCTCACCTCACCATCTAATGATCAGTGACTAAAGCTGTGTCCACTGCAT
TGCTGAGGGGGGCCGGTGAGGTCATCTGCATGACAGATTCTCTAATGGTTCCTCTGGATCAGCAGCATCAGCTGACTTGGAAATGTATTAGAAACTCTGGTTCTCGGGATCCTCCCCAAC
AGATTGAATGAGGAGCTCTGGGGGTGGTACCCAGAAAGCTTCATTTTAACAAATCTTTCTAAGTGATGCTGATATAGACTTGGGTTTGAAAATCACCATTTTATAGACAGTGTTTAGCCT
AAGTACTTGGTACATTGTGGGCATAACAAATGGTCGTGATGTGTTAGTTTTAAATTATCTGTACACTGATAGAAGGCAGATTCTAGGCACAAGGCAGTGCTATCTGAATTAGCATGACTA
TATCGGCTCAACTCTGTCATGCTATAGCCAATATCCACTGCCTCTGAAACAGGCTCTTTCACTCTTCATCCCCCAACTTTGCTCCCTGCATCTTACTTTCCCCTTTTCTCCCCATCGATA
TTATCCTTGCTGGATGGAGCGTAGGTCCAGAGAGAAGCAGAAAGCGTGTAAAGATTCAGTTGACCCCTCCTATAGGAGAGTAGAGACCCCAATCAGTATGTTTAGGAGTGGGTTTTCTGA
ACTGCACACAGGGGAGACATGAAGTCATTGAACTGCTTCCATCCCAGATAGGTCAGCTAGGCCTGACCTCCTTCTCCTCTCCCTCTCTGTGACTGCCCAAGTGCAGGTGCTGGACATCAA
CCAGGCTGATGCAGGGACTCTGCCCCTGGACTCATCCCAGCAAGTGCGTGAGGCTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTT
TGTGGAGTCCATGTTTTCTCTGGCTGACAAGGATGGCAATGGCTACATATCCTTCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
AGGTGGTGGTGGGGTCTAGCAGAGCATC
TGAGGAATCAGGAGTTTGTTAGCAAGGAGGTGACCTATATCCTCTTTCTCCCTCAGGCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCT
CTCCAAGGACGAGTTCTTCACCATGATGCG
CGGTATAGGCTGGGCCTTCCCAATCCTGGAGTACCTTAATATTTTAAACACAAAAGCAATTCAGCTCAGAGGAGGGCAGGCAAGGTGGCT
TAGTGGCCATCCATGTTGCATGGTACACACTATATCCTGAATTCTCAAGACATCCAACTTCAAAGCATCACCCTGGCATGTCTCTGCCTTTAACAGGGTCCATAGCCACACTATCAAAAA
CTACCAGAAACATTGCTCTAGTTTTTTTTTTTTTTTATTATATGTAAGTACACTGTAGCTGTCCTCAGACACTCCAGAAGAGAGAGTCAGATCTCCTTACGGATAGTTATGAGCCACCAT
GTGGTTGCTGGGATTTGAACTCCTGACCTTCGGAAGAGCAGTCGGGTGCTCTTACCCACTGAGCCATCTCACCAGCCCTGCTCTAGTTTTAAAATGTCAATAAAGACCAGCCGTGAGCAC
CCTCAACCCCTCATTCCCAAGGACAGAGCCTCAAGCACCACGAGTCATATCTCTTCACTTCTTCTTAAATCTGCTTTTTTTTTCCATGCTGAGATTAACATGTAGGCATTTGGTACATAG
TCCACTGAAAAAGACAAAGAGAAATATATGAGGGTAAGTAGACAGAGCACAGCAGCCCAGGTTCTTTTAAGGTGTCTTTGTGCCTTCCCATTGCTCTTAACATTCCCTTTCCCTGACAAT
CCCAACCCAGACCACTTCCTCCTCTTCCTCTTCTTTAGAGGCTTTGCTTACTTATTTTTATTTTATGTGCATTTGTGTTTTGCCTGCATATATGTCTGTGTGAGGGTGTCAGATCCCCTG
GAACTGGAATTACAAACAGTTGTGAGCTGCCATGTGAGTGCTGAAAACTAAACCTGGGTCTTCTGGAAGAACAACTAGTGCTCTTAACTTCTGAGCAACTTTCCAGTCCCTAACATAAAC
TTCTTTTATTATTATTATTATATATTTTCTTTTTTTTTTTTTTTTTTTGGTTTTTCGGGACAGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCACTTTGTACACCAGGCTAGCCTC
GAACTCAGAAATCTGCCTGCCTCTGCCTCCCGAGTGCTGAGATTAAAGACGTGTGCCACCATGCCCTGCTCTAGATATTTTCTTCATTTACATTTCAAATGCTATCCCCAAAGCCCCCTA
TATCCTCCCCCTGCCCTGCTCCCCAACCCACCCACTCCTGCTTCCTGGCCCTGTCATCCCCCTGTACTGGGGTATATGATCTTCACAAAACCAAGGGCCTCTTCTCCTATTGATGGTCAA
CTAGGCCATCCTCTGCTACATATGCAACTGGAGACACATAGACTTCTAAAACAGTAAAAAAGACTACAACCTGTATCTCATCAGCATGTCACTGCAGTCCTGTCCTGGCTACATTTAGGA
AATCCAGGAGGAGGAAGGACATCCCATTAGCAATGACAGCTCACATTTTTTTCAGTCTATAATAAAATGTGTGCTCCTTGCTTTGCCTCGATGGCATTATGCTTTTTAGCATGGTGATGG
TGCTTGCTTGTCCTTGCACACACACACACTTGTTTTAGTCCCCGTGCACAACATAAGAAGAGTGGCCGCTGCAGCTGTTACCGTGGGTGCCACTGGACTTGGGCCTGAGTGGGTCCTCCA
CATGCCCTAGGGTATTGGCGTAAGGAGGCAGCCACCATTTTAGCCACTCCCCTCCCACACTCGAGTGTCCTCTTTGGATGGTGAAAGTGGAATCTCAAAGGTGAATGGGTGTGGGTGGGT
TTCCACGAAAGAGAGTCAGCAGCAAGCCCATCACAAGAGAGGGCCATGGTGTATCCTGGACCCCATCATCTGCAACAAGTGTTAGTCTCTCCCATTTGTTTCCATTTCTTGGGGTGAGGG
GCAGAGGGGGAGACACACTGCTTTTCTAAGCTGTCAAGATCCTTTGAGCCTGCGGCCATCACCTTGCCAGCCTTAGTGTCAGCTTTAGGCTGGATGACATTGCCCAGTGCCATTTCCCAG
GCTGATGGTGACCACATTAGGTAATCTGGAGACCTGCCCAAGCCTGACCTTGCTGGATGATAGCTACCATTTCTCTACTCCTCTAGGTCCTTCATTGAGATCTCCAACAACTGCCTGTCC
AAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCGGGAGTCTGGGTTCCAGGACAAGGAGGAGCTGACCTGGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGA
TTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
AGGTGTGTGAGTGTGGAAGGCGAGGAGTCTTCCAAACAGAGTTTCTGGTAGACATAACACAGGTCTTCTTTTCTTTCCTCCGAGTC
CTTGGTGGTTAGTACGGAGCTCAGCATGGATATTCAGAGACGAGTTGTACATTGTGCAGAGTTGTTCACTAAGCCAAGGGGCGAGCCATGGGCATTGGCTTGGGATGTATCTGGCCAGAG
CAGGGGACCTTGTTTTCTTTTTCTCAAAGGACATGTAGAGTTCTATCTTCCACATGGTAGCCTCTTTCTTCTTTGTGCACACTGACTGCCCTGGGTATAGGGAGGGAAATGGGGCTGAAG
CCCCAGATCTGGACATGACTAAGTCACTATCGAATGCTGAGCAGCCTTTCCAGGGCCCAGGAGGAGAACAGTCTCCCATCCCACCCTCTGCCCTTCCAGAGAAGCCAAGCTGAGCTCTTG
TTCTCTTTTCTAGGCACCAAGGACATCTTTAAACAAAGCAGTGCCTGTCGAGTCTCGTTCATCAACCGGACTCCTGGGAACAGAGGTGTGTGGGGAGGGAGGGACTGGGTGGTGTCCTTG
GCGAGATCCATTGAGGGAAACAAAGGAAGGAAGCCGAGCTAGCTGACAGCCACTGAGCAGATGTTTGCCTTGGCAAAGTCAGTGTCAGCCCAAACTCCAGCTGCTTCCTACATACAGCCT
ATGTTGTTCAAGAGCTTTGGCTCTCTCCCTACCACTCCAGGCCTGAGGTGCCCTAAGCACAGACGAGCGGATTGTAGCTGCTCATCCTGCTCTGTCCTTTCCTGCTCTTGGGGACCTTTC
CTGGACTCCACCCCAACTTTGATATGGCAGTTCTTCTCCTGGCTGCAACAGTCAAAAGAAAACAGGTTCCAGGACTTTGATTTGTCTCATTTTTTCCAGTTCCTGCCCCCCAGAAACGGC
ACTCTCTGACATGGAAACCCCAGAACTGGGAAGTACTGGCCTAAAGAAGAGGTTTGGCAAAAAGTGAGTATCTCCTAGATTCCTGAATTCTCAGGATTCTTGGAGAGAAGTCTCAGGATC
TCTGAGCCCTGACGCACCACCTCAAGTCTCCTTCCATTCGACACACATGATGTGGCACTAGTAGCTCTCCAGGAAACACCTGGACTGCTGTAGCTCCTGGGTCAGGAAACCTGCTGCCTC
TTCTCCTGTAGAGTGCACCATAAAGGAGGGGCCCATGGCCCTTAGCAGGAGAGACAGGACACTCTGAGAAAGTGGCTCCTTGAGCTTCCCTCGGGGCTGCCCACTAGAGGGTTGGAGCTT
CCAGTGACCCACTCTTCACTCAGTCTCTAAACGCATCTTCTGGCAAGATGCCTAGCTCATAGCTCCACCACAATGGCCATAAGAGGCCTCCTGGCTTCTGCTGCCTTGGGTATAGAAGTA
GGCCAGTGTAGAGATCAAAGGATGGGGGTGCATAACGAGAGAGACAGAGACAGAAAGAGAAGAGAAGGTAGACTAGGTGTGGAGAGAGAAAGGAAGATGTCTGGAACAAGTAAGAACTGG
GGTAAGGGTCGTGGGAGCAGGGGCTGCTTGCCTGCTTTCCACTGTTATGACACATCCTTCGTGGTTGCTAGATCTTCAATTATGGGTAGTACTGAGGGAAGGCCAAGGAACCCCTTTCTC
ACCAGTGCCCTGTCTATCACTACTGCAGGGTAATGGGGCCCTCTCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTACAAGC
GATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAATCTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
CTGTAAGAATGCCAGGTTGTGGGCAGTGGGCAG
GGAGCATGCCATATACCTTAGTGGGGAGTGCAGAGTCCTCTGGCTGCCCTTCCTAGTCCTCGGGGTCTGGACAACAGAGCCTGGGCATTCTGACTCAGGCTCAGGGTCCTGTCCTGTATG
CAATGCTGAGCTGCCTCAACTCTGGCTCCAGACTATGGCTTTGCCTCACCACCCACGGACATCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCC
TTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCTTGCGGGAGACCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCT
ATGGCTGCAGTTGTCCTAGCTG
TGGTATGTGGCTTCTGGGTTGGGAGCTGGGAGTGGTGTTGGTCCAGGTTAATCTCTCTGACATGCTGTCTCTTTCAGTTCTGCACAGTGCTGGACATG
CAGTCAATGTGTACATTTTCTCAGTCAGTCCCCTCAGCCTGATGGCCTGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GGGTCAGTTCGGGGGGAAAGGCACCCCTAGGGTCCTCTGGG
TGGGCCTAAGGTTGTAACAGGAAAGAAATAGATGGCCACAGAAGCACAGCTTCTCGGTGCGCGAAGGCACAACGCTCTGTCCCTCATGGAGAAGCTGTAAGGTGACTGATTCTCCTTCAG
AGAGGAAGCTGTCCGTGACTAGCCAGGAAAGGACCTTAGAAGCTGACATCAGAGCCTCTTTCCCATTTGACACGTAGTTTATCATCTCAGTTCAATATTCACTCATTGGACAAATATTTA
TACACTGTGTACTGGGTGTTTTAGTCGAGTGCCAGAGAGCAGGGCTATTTCAGATAGCTGCAGTTGTTTAACACTTTTGCTATTCGCTTGATGTTGCTGTGCACAGAACGCAAGTCCCTG
CCCTCACTAATCTCACAAAGTAATGGACCAAGTACTCCAGGAGGCAAGCAAAGGCCAACTGCAGGAGGAGAATGTAAGATAGAATAGCAGGAAAAGCTGCCGTGGAGGAGGGGTAGAACC
ATGGCAGTGGACACCAGGTTCCTTGGAAGGATCCTGTAGCCATCTTAGTGGAGTCCTAGTGACCTCAGGAGTTGGGGGCAGCATCAGGCTGTGGTGTGGGAGTGACAAACCCCCTTCTGC
CTCAGCCAGAGGCTCACTGTGCCTTTACCCTCCAGGTCCAAGTTTCCCCCAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAGAGGTAGGAAATGGAGCTGGGTGTGGGATCTGAGCA
CCACAACTGTGTATCTCCTCCTTTTTCTCCTCTTCCTGAATACACATTGGTCTATAATAGCCATAGGTCCCTTTTTGACAGGCCAATCCTGGCTCCAGCCCCTACAGATATGCTATTCTG
AAGCCCTGAAGGAATAGAGTGGGAGTCAGGGAGAACGGTGTGGGGAGATGTCCTGTTGTATTTTCAGCTCAGAATGAGGTTCCGAGGATAGCCACTGACCCTGTCCTCCCACCCCCAGGT
ATGACAGGAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGCCTCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTAT
GTCCTG
TGGTGAGTACCTTCCCTGGGCAGGGGTGAACCTTGGAGGGGACACTTGGGATGGAAAGGGACGGTAAAGTAGGGAAGATGGAGAATTAGGCCCAAGGACTGTGATCCTGACCTC
TGTGACCCCACCCTGGTGTCTATGCTGGCCCATCAGAGCCAGTACCTGGGTAGGCCATGCCTGACAAGTAGGAGCCACTCGGGACCTGGTGGGCGGAGCTATACTAACTGGCCCTGTTTC
CAGATCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACATCTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTG
GAGATCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
AGGTACAAGCCTGATCTGACCCAGGCATGAGAACTAGTCTGCTCTTCAGACACGGTGGGGGCTTGTCGGGTTCCCTAACCCCT
ACCTCTGACAGTCTCTGGAGAAACCCTGAACCCTGAAGATTTTCGGATAGAAATGACACCCCAAGAAGGGCTATGCTTAGTTCCTGGTGAGACTTGCAGACAAGTTGGCTAGGGAGGTAA
CTGAGCAGTTAAGAGCACCTTCTGCTCTTTCAAAGGACCTGAGGTCAGTTCCCAGCAGCCATGTCAGGGAATACAGTGCCCTCTTCTGTCCTTCATAGGCACCTGCACTCACGTGTGCAT
ACACCTACACACACATAAACATATATATACGTAATTAAAAATAAAGAAAAAGACCCGCAGACAGTGAGAAGCGATGCTGGGAAAGAACAGGTGAAAACAACGTTCCAGGGGGAGGAAGGT
GCCAGAGTGCAGCCTGGTTCAGAGAGAGGCAAAGAGCCCTGACAGCCAGACTGCTGTTGTGTCTGAATTGGCCGTGGTGGAGGATTATATCACTACAGAGGGTGGCGGGAGGTGATTTAG
GTATCATAATGTCGTCCAACCTCTAGGCTTAACATTGCAGAGAACTCAATTTCATATCCGTTTTGTTTCAGTTAAGATTACGTGAGGATAAAGGTAAAGTAGCCTTTCATACTCTCCAAA
ACTTGTTAACCTCCTTTTTCAACAAATATAAGTAAACCTGAGGCCCAGCTGTGGACAGGCAATAAACACTATCTAAGATTTGGTAAGATATGTTGCTTGTAAGGCATGCAGATATTTGTG
AAAGTTCCCTTTGATAAGCATCTGGGTATCTGAAGAGTCCTTCACCCTGTAGGCTGTTATTTCGGCAGTGCTATGAGCCTGCACCCTCTGCAGGATGAAGTGAGAGAGATTGGGGCAGAG
GTGCCCAACTCTTATGTTACCCAGGAAACAGAGGAGCCTCAACTAACTACTCTCAGCCCCAGCACACCACCTGTCTAGACCATGTGACTGTGTGAGACCCTCTGGGCCTCAGCAGTGTAG
AAAGAACAGACACCTCTCAGTCTTCAAGGACAGTTTCTGATCTAGGCTCTTCTCCCACTCTGACCTCTAGGGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCA
GGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGTACCAATGAGTATCACCCCTTCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGG
ACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTGGGTGGCACCTGTGCCAGATACCCAAAG
AGGTACCCACCCTTGGGCTTGTCCCTGCTCCCTGTACCCTGACACTAGATGCCCCAG
TATCTTTCATGACAAAGCTAGTCTGAAAGAGTACTGAGGTGCGCTGGCCCCTTCCTCTGTTACTCTTACCCATTTCTGGCCTCAGAGTTGGGGCAGGGCCTGGCTCATCTCACTTCTCCT
TCTCCTGTCAGCTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCATAAGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCCTTTGCCTCCATCCTCA
AAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
AGGTGAGCATCCTTTCTCCACTCTGTGTGGGAAATGTGTGGCCCTATGGATCCCTGCCACCACAACAGTG
CACTCCTAGCTTCCAACCGTAGGGTCTGTGGGGACCATGACACACAGCAGTTCAAATATCAAATCCTGCGTGTGGATAGGAGAGTGGCCATAAGAATCTCTGCTGGCCTTTGTCCATTGG
AGGCTAAGACCAGAACATCGCTCTCAACAGGATATCTGAGGCTTCCTAGCATCTCAGAATGTAGGGAGGGGAGCCAGACCTGAGCTGGCCCTGAGCCCCAGTGTGTGTCCTCAGATCTAC
TTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGACATCATCCGGGAGGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCT
GAGAAGTTCGACCTCAGGACCACCATGCTG
TGGTAGGTCAGGGCAAGCCAGCCATGGCAGGAGAGCCACTCCCAGGGCCAAGGGTTGTTCCTAGCAATGGCCTTCCTCCCTGATTTTGTT
CTCCAGTACATCTGTGAGAGGCACTTCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGCCTGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAG
GAAGTCCATCCACAG
AGGTGGGTCCATCCCCTCCCACCCTAGGACCATATTTAATTGTCTTATTTTGGTCTGATTATTGTGGTCAGGCTGCAGAAGCTGATCTTCCTGGGTTGTGGGCAT
CTCAGTACACATATCTCTCCATGGACAGGTACGTAAGATTGGAGTCTTCAGCTGTGGTCCTCCAGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGC
CCACTTTGTGCATCATTATGAGAACTTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTTCCAACAAGTCCCAAGACTCTAGTTCTCCTGGGCGCTCTGCTGACTGGACCCCTGGGCCCAGCAGGTGGCCAGGACGCACCCTCACTGCCCTGGGAAGTGCAGCGCTACGACGGC
TGGTTTAATAATCTGAAGTACCACCAGCGCGGTGCGGCTG
GCTCGCGGCTGCGTCGCCTAATACCGGCTAATTATGCTGACGGTGTTTATCAGGCTCTGGAAGAGCCGCTACTGCCCAAC
CCTCGCCGGCTAAGCGATGCTGTCGCTAAGGGCAAAGCAGGGCTGCCCTCGGTCCACAACCGCACAGTGCTGGGGGTCTTCTTTG
GCTACCACGTGCTCTCAGACCTGGTGAGTGTGGAA
ACACCAGGCTGCCCCGCAGAGTTCCTCAACATTTACATCCCACGTGGGGACCCGGTGTTCGACCCTGACAAGCGCGGGAACGTGGTGCTGCCCTTTCAAAGGAGCCGCTGGGACCACAAC
ACTGGACAGAGCCCCAGCAACCCCCGGGACCAG
AGCAACCAGGTGACCGGCTGGCTAGATGGCAGCGCCATCTATGGCTCCTCTCATTCCTGGAGTGACACTCTGAGGAGCTTCTCTGGA
GGACAGCTGGCTTCTGGGCCTGACCCTGCTTTCCCGCGGAACTCCCAGAGCTCTCTGCTCATGTGGATGGCGCCGGACCCCTCCACGGGGCGGGGCGGGCCACAAGGGGTGTATG
CCTTT
GGGGCCCAGCGCGGGAACAGGGAGCCCTTTCTGCAGGCTCTGGGCTTGTTGTGGTTTCGCTACCACAACCTGTGTGCCAGGAAGCTGGCGCAGGAGCACCCGCACTGGGGGGATGAGGAA
CTGTTCCAGCATGCTCGCAAGAGGGTCATTGCCACCTACCAG
AACATTGCTCTATACCAATGGCTGCCCAGCTTCCTGCAGAAAACTCCTCCAGAGTATTCAGGGTACCGCCCTTTCATG
GACCCCAGCATCTCCCCGGAGTTCGTGGTGGCCTCTGAGCAGTTCCTCTCTACTATGGTGCCCCCTGGGGTCTACATGAG
AAACTCCAGCTGTCATTTCCGGAAATTCCCGAAGGAAGGT
TCAGACAGCTCTCCAGCTCTCAGAGTCTGCAACAGCTACTGGATTCGGGAG
AACCCCAATCTGAAGACTGCTCAAGATGTGGATCAGTTGCTGCTGGGAATGGCTTCCCAGATCTCAGAG
CTGGAGGACAGGATAGTGATTGAAGACCTGAGAG
ATTACTGGCCTGGCCCAGAGAGATTCTCTCGCACAGACTACGTGGCTAGCAGTATCCAGCGTGGCCGAGATATGGGGCTCCCCAGT
TATAGCCAGGCTCTGCTGGCCTTGGGACTGGAGCCTCCTAAGAACTGGAGTGCTCTCAACCCCCAAGTAGAACCCCAG
GTGCTGGAGGCCACAGCTGCTCTGTACAACCAGGACCTGTCC
CAGTTGGAACTACTCCTGGGTGGACTCCTGGAGAGCCATGGGGACCCTGGACCTCTATTCAGCAACATCATTCTTGACCAGTTTGTGAGGCTCCGGGATGGTGATCGCTACTGGTTTGAG
AACACTAGGAATGG
GCTGTTCTCCAAAGAGGAGATTGCAGAAATCAGAAACACCACCTTGCGGGATGTACTGGTAGCTGTCTCCAATGTGGACCCCAGTGCCTTGCAACCCAACGTTTTC
TTCTGGCAGGAAG
GTGCACCCTGCCCACAGCCTCGGCAACTCACAACGGACGGCTTGCCCCAGTGTGCGCCTGTTACTGTGATTGACTACTTTGAGGGCAGTGGTGCTGGCTATGGTGTC
ACGCTTGTAGCTGTCTGCTGCTTTCCATTAG
TGAGTCTGATTGTCGCTGGGGTGGTGGCTCATTTCCGGAACCGAGAACACAAGATGCTACTAAAGAAAGGCAAAGAGAGTCTGAAGAAA
CAACCAGCCAGTGATGGGGTACCAG
CAATGGAGTGGCCGGGCCCCAAGGAGAAGAGCTATCCAGTCACTCTCCAGTTGCTTCCAGACAGAAGTCTGCAGGTCCTTGACAAACGGTTCACT
GTGCTCCGGACCATCCAACTGCAGTCCCCACAGCAGGTTAACCTCATCCTGTCCAGCAACAGTGGACGTCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTGCTGATGTTT
AACTCTGAAGAGGACCGGGGTGCCTTCGTGCGGCTGTTGCAAGACCTCTGTATCTGCTGCACTCCCGGCCTCCACATAGCTGAGGTGGATGAGAAGGAGCTATTGAGAAAGGCTGTGACC
AAGCAGCAACGGGCAGGCATCTTGGAGATCTTCTTCAGACAGCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCTGATGCAGGGACTCTGCCCCTGGACTCATCCCAGCAAGTGCGTGAG
GCTCTGACCTGTGAGCTGAGCAGAGCTGAGTTTGCTGACTCCCTAGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTTTCTCTGGCTGACAAGGATGGCAATGGCTACATATCC
TTCCGGGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCTCAGAGGATAAGTCCCGCCTGATGTTTACCATGTATGACCTGGATGGGAATGGCTTCCTCTCCAAGGACGAGTTC
TTCACCATGATGCG
GTCCTTCATTGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTTGAGTCTATGTTCCGGGAGTCTGGGTTCCAGGACAAGGAGGAGCTGACC
TGGGAGGACTTCCACTTCATGCTGCGGGACCACGACAGTGATCTCCGATTCACACAGCTCTGTGTCAAAGGTGGAGCTGGAG
GCACCAAGGACATCTTTAAACAAAGCAGTGCCTGTCGA
GTCTCGTTCATCAACCGGACTCCTGGGAACAG
GGTAATGGGGCCCTCTCCCCGGCTGTACACGGAGGCACTGCAGGAGAAAAAACAGAGTGGCTTCCTGGCCCAGAAGTTCAAGCAGTAC
AAGCGATTTGTGGAAAACTACCGGCGCCACATTGTGTGTGTTACAATCTTCTCAGCCATCTGCATAGGCCTGTTTGCAGACCGTGCCTACT
ACTATGGCTTTGCCTCACCACCCACGGAC
ATCGAAGAAACCACCTATGTGGGCATCATCCTGTCCCGGGGCACGGCGGCCAGCATCTCCTTCATGTTCTCCTACATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCTTGCGGGAG
ACCTTCCTCAACCGCTACATCCCCTTTGATGCCGCTGTGGACTTCCATCGCTGGATTGCTATGGCTGCAGTTGTCCTAGCTG
TTCTGCACAGTGCTGGACATGCAGTCAATGTGTACATT
TTCTCAGTCAGTCCCCTCAGCCTGATGGCCTGCGTCTTCCCTAACGTCTTTGTGAATGACGG
GTCCAAGTTTCCCCCAAAGTACTACTGGTGGTTCTTTGAGACAGTTCCAGGTATGACA
GGAGTCCTCCTGCTCCTGGTCCTGGCCATCATGTACGTCTTCGCCTCCCACCACTTCCGCCGCCACAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTATGTTGTGCTTTATGTCCTG
ATCATCATCCATGGCAGCTATGCCCTCATCCAATTACCCAGCTTCCACATCTACTTCCTGGTCCCAGCAATTATCTATGGAGGGGACAAGCTAGTGAGCCTGAGCCGGAAGAAGGTGGAG
ATCAGTGTGGTGAAGGCGGAGCTGCTGCCTTCAG
GGGTGACCTACTTGCAGTTCCAGAGACCCAAGACATTTGAGTACAAATCAGGGCAGTGGGTGCGAATCGCGTGCCTGGATCTGGGT
ACCAATGAGTATCACCCCTTCACGCTGACCTCTGCACCCCATGAGGACACACTCAGCCTGCACATCAGGGCTGTGGGACCTTGGACTACTCGCCTCAGAGAGATCTACTCACCCCCAGTG
GGTGGCACCTGTGCCAGATACCCAAAG
CTGTACCTCGATGGACCATTTGGAGAGGGCCATCAGGAGTGGCATAAGTTTGAGGTGTCAGTGCTGGTAGGAGGGGGCATTGGAGTCACCCCC
TTTGCCTCCATCCTCAAAGACCTGGTCTTCAAATCGTCCATGGGCAGCCAGATGCTCTGTAAGAAG
ATCTACTTCATCTGGGTGACAAGGACTCAGAGGCAGTTTGAGTGGCTGGCTGAC
ATCATCCGGGAGGTGGAGGAGAATGACTGCCAGGACCTGGTGTCTGTGCACATCTACATTACTCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTG
TACATCTGTGAGAGGCAC
TTCCAGAAGGCGCTGAACAGGAGTTTGTTCACGGGCCTGCGTTCCATCACCCACTTTGGTCGCCCTCCCTTTGAGCTCTTCTTCAACTCTCTACAGGAAGTCCATCCACAG
GTACGTAAG
ATTGGAGTCTTCAGCTGTGGTCCTCCAGGGATGACCAAGAATGTGGAGAAGGCCTGCCAGCTCATCAACAGGCAGGACCGGGCCCACTTTGTGCATCATTATGAGAACTTCTGA

Retrieve as FASTA