Entry information : HsDuOx02 (DUOX2 / LNOX2 / THOX2)
Entry ID 3338
Creation 2006-07-26 (Christophe Dunand)
Last sequence changes 2010-10-18 (Myriam Duval (Scipio))
Sequence status complete
Reviewer Catherine Mathe
Last annotation changes 2015-12-10 (Catherine Mathe)
Peroxidase information: HsDuOx02 (DUOX2 / LNOX2 / THOX2)
Name (synonym) HsDuOx02 (DUOX2 / LNOX2 / THOX2)
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Homo
Organism Homo sapiens (human)    [TaxId: 9606 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value HsDuOx02
start..stop
S start..stop
PtroDuOx02 3152 0 1..1548 1..1547
CfaDuOx02 2775 0 1..1548 1..1571
BtDuOx02 2728 0 1..1548 1..1545
SscDuOx02 2724 0 1..1548 1..1545
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '3338' 'complement(join(45386348..45386470,45386761..45386889,45387134..45387289,45387635..45387793,45388026..45388258,45389436..45389589,45389812..45389939,45390207..45390256,45391581..45391680,45391860..45392090,45392248..45392426,45392953..45393036,45393403..45393472,45393991..45394187,45396158..45396251,45396338..45396563,45397841..45398026,45398323..45398525,45398726..45398839,45399030..45399167,45399543..45399661,45400245..45400420,45400987..45401150,45401722..45401824,45402088..45402178,45402626..45402722,45402848..45402908,45403309..45403475,45403582..45403783,45403966..45404153,45404752..45404916,45405185..45405274,45405540..45405609))' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 45405540..45405609 68 N° 2 45405185..45405274 88 N° 3 45404752..45404916 163 N° 4 45403966..45404153 186
N° 5 45403582..45403783 200 N° 6 45403309..45403475 165 N° 7 45402848..45402908 59 N° 8 45402626..45402722 95
N° 9 45402088..45402178 89 N° 10 45401722..45401824 101 N° 11 45400987..45401150 162 N° 12 45400245..45400420 174
N° 13 45399543..45399661 117 N° 14 45399030..45399167 136 N° 15 45398726..45398839 112 N° 16 45398323..45398525 201
N° 17 45397841..45398026 184 N° 18 45396338..45396563 224 N° 19 45396158..45396251 92 N° 20 45393991..45394187 195
N° 21 45393403..45393472 68 N° 22 45392953..45393036 82 N° 23 45392248..45392426 177 N° 24 45391860..45392090 229
N° 25 45391581..45391680 98 N° 26 45390207..45390256 48 N° 27 45389812..45389939 126 N° 28 45389436..45389589 152
N° 29 45388026..45388258 231 N° 30 45387635..45387793 157 N° 31 45387134..45387289 154 N° 32 45386761..45386889 127
N° 33 45386348..45386470 121  
complement(join(45386348..45386470,45386761..45386889,45387134..45387289,4538763 5..45387793,45388026..45388258,45389436..45389589,45389812..45389939,45390207..4 5390256,45391581..45391680,45391860..45392090,45392248..45392426,45392953..45393 036,45393403..45393472,45393991..45394187,45396158..45396251,45396338..45396563, 45397841..45398026,45398323..45398525,45398726..45398839,45399030..45399167,4539 9543..45399661,45400245..45400420,45400987..45401150,45401722..45401824,45402088 ..45402178,45402626..45402722,45402848..45402908,45403309..45403475,45403582..45 403783,45403966..45404153,45404752..45404916,45405185..45405274,45405540..454056 09))


exon

Literature and cross-references HsDuOx02 (DUOX2 / LNOX2 / THOX2)
Literature De Deken X., Wang D., Many M.-C., Costagliola S., Libert F., Vassart G., Dumont J.E., Miot F.Cloning of two human thyroid cDNAs encoding new members of the NADPH oxidase family. J. Biol. Chem. 275:23227-23233(2000).
Protein ref. UniProtKB:   Q9NRD8
DNA ref. GenBank:   NC_000015.9 (45405609..45386348)
Cluster/Prediction ref. UniGene:   Hs.71377
Protein sequence: HsDuOx02 (DUOX2 / LNOX2 / THOX2)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1548 (1523)
PWM (Da):   %s   175078.87 (172586.5) Transmb domain:   %s   o600-622i1041-1063o1078-1100i1147-1169o1184-1206i1219-1241o (o575-597i1016-1038o1053-1075i1122-1144o1159-1181i1194-1216o)
PI (pH):   %s   7.9 (7.83) Peptide Signal:   %s   cut: 26 range:26-1548
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLRARPEALMLLGALLTGSLGPSGSQDALSLPWEVQRYDGWFNNLRHHEpan>RGAVGCRLQRRVPANYADGVYQALEEPQLPNPRRLSNAATRGIAGLPSLHNRTVLGVFFGYHVLSDVVSVETPGCPAEFLNIRIPPGDPVFDPDQRGDVVL
PFQRSRWDPETGRSPSNPRD
ANQVTGWLDGSAIYGSSHSWSDALRSFSGGQLASGPDPAFPRDSQNPLLMWAAPDPATGQNGPRGLYAFGAERGNREPFLQALGLLWFRYHNLWAQRLAR
QHPDWEDEELFQHARKRVIATY
NIAVYEWLPSFLQKTLPEYTGYRPFLDPSISPEFVVASEQFFSTMVPPGVYMRNASCHFRKVLNKGFQSSQALRVCNNYWIRENPNLNSTQEVNELLL
GMASQISELEDNIVVEDL
DYWPGPGKFSRTDYVASSIQRGRDMGLPSYSQALLAFGLDIPRNWSDLNPNVDPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSAIVLDQFVRLRD
GDRYWFENTR
GLFSKKEIEDIRNTTLRDVLVAVINIDPSALQPNVFVWHGAPCPQPKQLTTDGLPQCAPLTVLDFFEGSSPGFAITIIALCCLPLVSLLLSGVVAYFRGREHKKLQKKLK
ESVKKEAAKDGVP
AMEWPGPKERSSPIIIQLLSDRCLQVLNRHLTVLRVVQLQPLQQVNLILSNNRGCRTLLLKIPKEYDLVLLFSSEEERGAFVQQLWDFCVRWALGLHVAEMSEKELF
RKAVTKQQRERILEIFFRHLFA
QVLDINQADAGTLPLDSSQKVREALTCELSRAEFAESLGLKPQDMFVESMFSLADKDGNGYLSFREFLDILVVFMGSPEDKSRLMFTMYDLDENGFLS
KDEFFTMM
RSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSELRFTQLCVKGGGGGGGIRDIFKQNISCRVSFITRTPGERSHPQGLGPPAPEAPELGGPGLKK
RFGK
KAAVPTPRLYTEALQEKMQRGFLAQKLQQYKRFVENYRRHIVCVAIFSAICVGVFADRAYYGFASPPSDIAQTTLVGIILSRGTAASVSFMFSYILLTMCRNLITFLRETFLNRYV
PFDAAVDFHRWIAMAAVVLA
ILHSAGHAVNVYIFSVSPLSLLACIFPNVFVNDGSKLPQKFYWWFFQTVGMTGVLLLLVLAIMYVFASHHFRRRSFRGFWLTHHLYILLYALIIHGSYAL
IQLPTFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPS
GVTYLQFQRPQGFEYKSGQWVRIACLALGTTEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSSPKGNGCAGYPL
YLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSLGSQMLCKK
IYFIWVTRTQRQFEWLADIIQEVEENDHQDLVSVHIYVTQLAEKFDLRTTMLYICERHFQKVLNRSLF
TGLRSITHFGRPPFEPFFNSLQEVHPQ
VRKIGVFSCGPPGMTKNVEKACQLVNRQDRAHFMHHYENF*

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 15, 32 introns), 5 cDNA and 60 ESTs.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGGTCCATCGGGCAAGTATCAGGCTCCTCTAGCGGCGGGGTGTTCCCCAGGATCCCTGGGA
GGTGGGGCGGGGAGAGGTGCGGCAAGCGGCTCCCTGAGACTGGAAGGTCATTTCGCCGTGCAGCTCAGCGGGATGGGAAACTTCCCAGTGCGGCCCGACACTTGGGTCCGGTTAGGGGCG
CTCCGCGAGCTGGGGAAGGACTGGCCAAGGCCTTCGTTGCTCGGGAGGGGTAGCTGGGAGCGTAGTGCTGAGGAGGCCCTTCTCTGTGCCCACAGGCGCAGTCAGGACGCACTCTCACTG
CCCTGGGAAGTGCAGCGCTATGACGGCTGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GTGCGTTCTGGGGGCCCGGGCGTGCTGGGGCCGTGGCTCGCGAAGGGCCGGGG
CGCGAAAGGCCCTGAGCGGGGAATCTGCGGGGAACACGCGCCCAGCAGCTCCGCTGCCTACACAGCGCAATCTTATGCGCTCCCGGGGCCAAGAGACCCTTGAGGGAAGGTTCTGTCAGT
GAAGTGGGATGGGGGTTGAGGGAGGCTTAGGGAGAGGTTTGGGGGATCCTAGGGGATGGAGTGCTTAGACAGAGCCCCGCTCCCTGCCTCCGCAGGCGCTGCCGGTTGCAGCGCCGCGTA
CCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAACCCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCGC
ACCGTACTGGGGGTCTTCTTTG
GTGAGGGCAAAGGGGGAGACCAGTGGGGTTGATCTGGCGCTCTGCTCAGCCTGGGGGAGGGGCCAGATCCCGTCTGCGAGTCCACAGGAGACCCATCC
GACTCCCAACCACTTCCTCTCTACGCAGCACTTCGAGACTGCCTTCATCTCGGAGAGATTTTGGGATGTTGATACAGAGATATTTGCTCTGTATCTAACCTTTCTCTTACGCCTTATTCC
AACCTAGGGGTGTCACTGGACCCCCATTATAGCTCTTGGGAACTGAGCTCCCCAGCCACCGCTCTCCTCACCGTGTGTTTGTAACCATTTTACCTCCCCCTAGCCCAGAGGGAGGAGGAC
TGACTTGGGGTACCCCTACATAAATTATATCATTTTGATTCTCACAACAGCTTTATGAATTGAGTAGGAAGGGAACTCACTATAGTCTTACTTTGCAGATTAGAAAATTGAGGCTACCTG
GGGCTAACGTGCAGAGCTGGTGGCGGAGCTGGCACTGGGACCTCTGTCTTTTGGCTCCTGAGGACACTTGGAGGCCGCCCTGGCCGTGGGGAGGGCGCAATACGGACGGTTTGTCACGTA
TTGGCGCCCCATGCCCGCAGGCGCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAAACGCCCGGTTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGAC
CCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAGACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GTGAGGCGGGGAAGGCGGCGGGAAGGGACC
GCACCCCAGCCAGGTGGGGCCTGGGCTTCGGGCCTGGCAGGGCCTGGAGGGGAGAGGCGCCCACTCCCCAGCCGCGGACACCCGCCGGGCCCCGGCCTTCCCTGGCCCGCCGCCGCCTCT
CGACCCGGGCTCACCCGCCGCGTGCCCCGCAGGCGCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGGG
GGGACAGCTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAACGGGCCCCGGGGGCTGTACG
GTGA
GGCCACAGGGGCGGGACGGGGCCGGCTGGGGGTCTGCGAGTGTGGGCTCCCCCGATCACGCTACCGCTCGTCTCCTCCCCTGCGCCCCCACGTCGGATGCAGCCCCTTCGGGGCAGAGAG
AGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTGTTCCAGCA
CGCACGCAAGAGGGTCATCGCCACCTACCAG
GTCAGCCGTCCGCGCCCCGCGACGTCCTCCCTTCCGCGTGCAAGCCCACGGGAGACTCCGCTGCCCCATGGAGCTCCCCATCTGTGGAC
AACCGCCACCCAGAAACCCCTCCCCAGACAGCCGAGGTCTAGGGAAGCCCCTGTAAATGATAGGGAGGCACGCGCTGTTTATAGGAGAAATCTGGCTGGTGATGATTATTTATCACCTCC
CCAACCCCCACTCCCTCAAATCCCCTGGTTCCTTGTGGGGACAGGCCTCACACTGCTCCTGTCTGAGTTGCTTCTCCCATGATTGACCCTTCCTGGTCTCATCTCCACACGGAAGCCGTC
CTTGGGCTCAGACCCTTCCAGGCCCCACACCATCCAACTTGTGCCTCCCCTCGCCCCTCTCTGCCCCTCAGAAAACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACT
CCCGGAGTATACAG
GTGAGGGAGCGGGGAAGGAGGACACCTGTGCGGAGAATCCTGAGGGGAAGGAGACAGGTGCCTGTGATGGGAGGATGTGGAGGCAAGAAGCCTGTCTCCCCATCAT
CACCGTCTCCTTCCTGCAGGAGATACCGTCCTTTCCTAGACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTACATGAGGT
GAGGGAGGGGTTGGCAGAGAGGGGCACCACACTAAGAAAGGTGCAGAATGAGCTGCCTTGGGGGCTGGGGCCTTTCACACTCCTTCGCAGTTTCACTAGAGAAGGGGAAGCAAAAATTTG
GGGCCCTGAAACAGAACCCTGGGGTAAGATGTGTAGGCTTAGTAGGGAAATCTCTCCAGCTCTCCTAAGGGCTGAAATTTGGTGGCTGGGTGTAGGATTTGTCTAGCAGCTGGGTCATTC
CCTTCCCTCCTCTCCCCACCCTACCTGGACTAGGAGCGCACTCTATCTTCAGTAAACGCACATCACCAAATCTCTGCCGTGTTCAAGGAAGTTCCTGGGCCACTGCTCAATCCTAGTGAA
CCCCCACTGAGTCCCTCAGCCCACTCAACCCCATCTTTGATTCTTCTCCAAATTCCCTCACCACATCCTTTGTTCTCAATTTCAGAAAAATGCCAGCTGTCATTTCCGGAAGGTCCTGAA
CAAGGGTTTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
GTCAGACTGGGGTCAGGGTCAGGGGAAGATGGGTCAAGGTCAGTCTCTTCACACAGGCTGGG
AAAAGCAACAATCCCAGTTCTTGAGTGTTGCTGCCCCAGGTTCATGGGAGATGAAGGGTAGAGGAAATTATCCGGGGACAACAGTTCAAGAGAGTCTGGGACTGGCCAAGGGCCCTTTGT
CCTGGGGTACTAAAGTGGTCCAGGCTGAGAGAGACCGAGTTTTGGGTAGAGGCCTATCTTGAGTGCATTGTTTACTTCCAGAAAACCCCAATCTGAACAGTACCCAGGAGGTGAATGAGC
TGCTGCTGGGAATGGCCTCCCAGATTTCGGAGTTGGAGGACAACATAGTGGTTGAAGATCTGAGGG
GTGAGCTCAGAGCCAGAAGGGGTGGATGGTAAGGGACCAGGAAGCCTGAGGATC
CCTCTGGGTTCATCAATAGCAGACCTAGGGCGCTCACGATGCAGGAACACAGATACAAACACAGACTTCAAGGAGCCATAAGAAGAAAATGAACTTTGAACTTTTTTTTTTTTTTTGCTT
AATTTACACTTTTGTGTTGCTTTTATTTTAAATTAACTATGTCTGAGTATTGGGGAGGGGTGGTACCACAATCTCTTTGGTGCTTAATTAGGTCTCTAAAGGTTTTCATTCAGCCCTGTC
TTGAAACTAGGGAGGCATAGGACAGGGAATTTACTGCTTGGTGACTAGAACAGTCTTGAGTCTTAGAGGAAGGGTCTTACTGGAAAAACTGGCTCTGATCACCCATGGTGACATTGGCCT
TGCAGCAAGGCAAGGTCAGCATGGGCACAGATCTCATGTGGATCACTGGGGGTAGCCAGGAGGGAAGAACCGTAGTGCCAATAGTCAGATACAAGGCTGCAGGGCAAAAAAGGATGGAGA
GGACAAAGCCCATAGTCTAGTCTTCTCTCCTCTTCAGATATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGGGGCTGC
CCAGCTATAGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTCAGGAATAGTAATGATAATAATGGCAGCTAAAGCT
TTCCTATGGCAATACTGTTCCAAACCCTTTACATATGTTGACTCATTTAATCTACATAATAATATTGTAAGGTATGTATAATCATTTTCCCCATTTTACTGATGACATAGTTGGTAAATC
ATAGAGGAGGGACTTAAATTCAGCCATCTGATTCCAGAATATATTCCTAACCACTGCATTGTACCATTCCTGCAGGGTGGCTTCTGGGTTGGGTGCCATTGTCCTGTTGCTGCAGGGTCC
CACCCCAAGGGCTGTGTGCCCCTGAAGCTGCTAATCATTGAGGTCAGGCAGGCTGGTGATGGTCACAGGATATGGTCCTAGGGCACCTGACCGTGGTGTTACCGTGGGTAGGGACACACT
GATCCTTCCACCAGACTTGTCCTGCCTGAGGGGGCTTGCCTAAGAAGAGGAAATCAGGCCTGAGCAGCAGCCAGGCAGCGGCTGAGGTTCTGTGCCCGAGGGATGGGGCAACAGTGGCTG
CCCTCCGCAGCAAACATACGCTCACCCCTTATCTCCTGTGGTGCCCCAGGTGTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAGCTAGAGCTGCTCCTTGGGGGGCTC
CTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAGAACACCAGGAATGG
GTAAGGCTTGCCT
GGGCCCCCACCTCAGACTGCTCCTCAGCCTGAGCCCCAGACCCTCTGTCTAGCCTTAGACAGCCCCTATGAGCCCTTGATTCCCAGTCAGCCCACCACACCCTTCCCAACCCCTCTGGGT
CTCTCTTTTTTTCTTCTCTTCTCTTTTTCTTTTTCTTTTTCTTTTTTTTTTTTTTTTTTTTTTGAGGCAGAGTCTAGCTTTGTCACCCAGGCTGGAGTGCAGTGGCGTGATCTTGGCTCA
ATGCACCCTCTACCTCCCAGGTTCAAGTGATTCTCCTGCATCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCACCAACATGCCCGGCTAATTTTTTTAAAAAATATTTTTAGTAGAGA
TGGGGTTTCACTATGCTGGTCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAAGTGTTGGGATTACAGGCATGAGCCACTGCACCCAGCCCCTCTGGG
TCTCTTTTCTCACCTGGGTCCTTGGGCCTGGGGTTGCTGGAGGCCTGCATCCCCTTCCCATCCCAGTGACTTCTACTTCCTCCAACTTAGGCGCTGTTCTCCAAGAAGGAGATTGAAGAC
ATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTTGTCTGGCATAAAG
GTGAGTGCCGTGGGAGAACACAAGTGAGT
GACAGTGGCCAGAGAAGGATCAAGATTGAGGGTGCGGGGGAATCACTTGGTGCTGTCCAGGGAGCCAGGCACCTTCTGTGTTGGGCTAGGAGGCCTGCATTTGGCTGGCTCCCACAGCAG
GGACCTCAACTAGCACACAAGCTACACCCTACAGTCAAGAAGGGGTGGATGGGGTAGATGCCAAGAGACAGGAAATGAATGGGGACTTTTTGAGGGAGACAGTTTCAGGGAGGTGGGCCT
GGGGAAGACAGATGATACCTTGGTCCTTTATAGGATAGAGGGGAAAGAGGTCTGGCCACATAGCGGGATCCTCAGACTTTGAGGTCTTCCCTGCCCTCTCCCTCAGGTGTGCACCCTGCC
CTCAACCTAAGCAGCTCACAACTGACGGCCTGCCCCAGTGTGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATCACCATCATTGCTCTCTGCTGCCTTC
CCTTAG
GTGAGCTCTTAGGCAGCCTCTCTGCAGACTGGCCCTGCCCCTCATTTCCTGCTGGCCTGAGGGGCTGGCTATTTGGTACCGTTTGAGACCAGGCTCAAGGAACCTCTGGAAGGG
AGGGGCCATAGCCTAAGCCACAGTGGAGCTCTAGGTGAGGGGCTCCCTCCTCACTGTTCCTTCTGATCCACTTCAGTGTGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCC
GAGAACACAAGAAGCTACAAAAGAAACTCAAAGAGAGCGTGAAGAAGGAAGCAGCCAAAGATGGAGTGCCAG
GTGAGCAGGGGCTGGGCAGAGGAGGGAGGAGGGACGGAGGAGGGGAGA
GACAGGAGTCTGGGAGAAAGAACCAAGTTACAGAGTGAGAGGAAAGCCAAGGCACCTTTAGGGCGCCTGCTCAGACTCACAGAGGAATTGACCTGAAGGCGGGGACCTGGGGACATCTGC
TGAACTACCCGGCCCAATTATCCCTTCCCCAGCGCGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCTGCAGGTCCTGAACAGG
CATCTCACTGTGCTCCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAAGGAGTATGACCTG
GTA
TGGCTCGTCCTGCCTCCCCAGCCTGGGCTGCCCTCACACGACTCCATTATCACAAGCGAGGCCACCCTATCCTCAGCTACAGAGCTCAACTATGACAGCTGATGCTGGGGAGAGGGGCTC
CTTTCAGAGGCCCCCAGACACAACCTGACCCCCTTCGTCCACACACCTGGCCCCAGCCTGGATGGATGGGGAGGAGTTTTCTCTCCTCCCCTCAACCCAAGATCCATTGAGGGGAGGCTG
AAGCAGAAGGTCCAGCGAGCTCCCTGCGTCAGTGCCGCCTTCCTCCCACCCAGGTGTGCTGCTGTTTAGTTCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGCTATGGGACTTCTGCGT
GCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGCGAGAAGGAGCTATTTAGGAAGGCTGTGACAAAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCA
G
GTGCCATGACCTGTGCCTTTTGGAGATGGGTCCAGCCCCAGAAATGGAGGAAACCTGGGCTGCATAGAACGCCCCTGTGGGTGAACTAAGCTTCCGCTCTATGGCCTGGAGAGAAATAG
CCTTGTTTGAATCCTGGCATTGCCACTTTACTTAGCTCTGTGACCTTAGGCAAGTCACATTATCACTGTTCTGTATCTCTGTTTCCTCATCTATAAAACAGTGATGAAAACTGTATCCAT
CCCATTGCATTGTTGTGAGGATTCGGTGAGATCGTCTACATGAGTGGTACACAGAGGTTGGCCTCTGGACCCGAGAGCATCAGCCTCACCTGGGAACATGTTAGAAATGCACCTACCCAG
TTAGACTGAACCAGGAACTCTGTGGGTGGGGCCTGGCAATCTGTGTTTTAACAAGCTCCCCAGATGATTCAGATACACTCTAATGTTTGAAAAACATTGTTTTATGTACAGTGCTTATTG
GCCCAAGTGCCAGGTATGTTGCAGGCATTTAACAAACGGTTGTGGCCAGGCGCAGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTAAGGTCAGGAG
TTCGAGACTAGCATGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGACGTGGTGGCTCACACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATC
GCTTGAACCCGGGTGGCGGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAACAAACAAAAAACAATAAATGGTTGTTAC
ATGCGACTTTTAAACTTTTTGTGCAATGGGCAAATCATAGGCACATGGCAGCCTTATCTGAATTGGCAAGAGAGCACAGCCCCAGCCCCTTCCTGCCTGTCTACCACCATGTCTCTACAT
CTTCTGTCCCCAGTATAGGCTCTCTCACTTTCCATTCCCCTTAACTTCGCCCTTCCCCTTCCCTACCCCAGCACCATGCCCACTGCATGAAGTTCCCGGTTCTTGGGCCCAGGGAGAAAT
GGGCAGGCTGCTAGAGATTTGATTCCCCCGTCTATAGGACAACAGAGGCCCCAGTCAGTATATCTAAGGATCAGGAGAACCATCAGAGTTTAGCCTTTCTGATTTGGACTTTGGGGAGAT
ATGAAGGGTCACTGAACTGCTTCCAGCATAGGCTTCACCTCCTTCTCTTTCCCTCCCTCTGCTGCTGCCCGAGTGCAGGTGTGCTGGACATCAACCAGGCCGACGCAGGGACCCTGCCCC
TGGACTCCTCCCAGAAGGTGCGGGAGGCCCTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCCGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTG
ACAAGGATGGCAATGGCTACCTGTCCTTCCGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GTAGGGGGCTGGGAGGTGGCAGGCTATCCAAGAATCCAGGGGTCTTTCAGCAAG
GAGATGACCTGCATTCCCTTTTTTCTTCCCAGGCGCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTCTTCACC
ATGATGCG
GTATGGGGTGTGCCTTTCTAATCCTGAGACTTCCTGGTGTGTTTCAAACAGGAAAACAGGTCCAGTCAGAGGAGGGCTGGCAAAGAGGCTCTGTGGTCATCTGTGCTGAGAG
GTGGCCTAAACACTACATCCTAAACTCTCAGAGCATCCACCTTCAAATATTTACCTGACTGGCTCCTGCCTCTGGGAGAGTCTCTGTCTGGACTGTCAACACCAGCCAGAAAAGCCTCCC
TAGTTAAAAAACGAAAAAAAAAAACCCAACACCAATATGGCCAACGACAAAAATCCACTAATCCCTTTTGGATGCCCTTGGATCTTTGTGAACTATTTTACGGCACGCCCAACACCGTGC
TCTACCCAGTGAAACAGTGAATGGATGACCTTGGTTGCTGCTCCGCTATTCATCACCATGATAGTCAGGAAGAGAAACGTGGAAGGCTTCTCCATATTCCAACCATTCTTTCTTCCTGCA
TCTTAAGCCCTTTCTGGTTTTGTTGTGCCGGTAAAAAAAAACAGCTTTGTGCTTCTCATTCCTGAAGACAATGAATGCGTCAGTAACACAGCTCCCCTCCATGCCATAAGGGCAGGGCTT
GTTCCCTGTTGAATCCAGCTTCTCTACACTGTGATTGGCACAGGGCAGGCATTTCATACATAACTAACTGAGTAAGACAAAATGAAATAAGTGAGCAAATGAATACAAAGTATAGATGTA
ACAGCCCACATTATTTCAATTTTTCTATCCTGTTAAGCCTTAACATTGCTTTAAGCATTCCCCTTAACTGCTACATTCCTCATATGGTCCAGATACCCCAACTGGACAAGGGCTTCTGAA
AGGGCAAAGCTATTGCAGTCTGTACCTCACTGGGTATGTCACTGCAGGCCCAGCCCGAGGTGAGGCTCAAGGGAATCTAGGAGAGGGTCCCTGCCTCCAAGGGCTGAGTTCCCACCTTCT
TTTTTTGGTTTGTTTGTTTGTTTGTTTTGAGACGGAGTCACTCTGTCACCCAGGCTGGAGTGCAGTGCCACGATCTCGGCTCACTGCAACCTCCACTGCCCAGGTTCAAGTGATTCTCCT
GCCTCAGCCTCCCAAGTAGCTGGAATTACAGGCATGTGCCACCACGTCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTAGTCTCGAACTCCTGACC
TCAGGGGATCAGCCCGCCTCAGCCTCCTAAAGTGCTAGGGTTACAGGCGTGAGCCACCACACCTGGTGAATTCCCACCTTCTTGTCCTGTAACTCAGCGTGTATCTTGCTTACTGTCGGT
GGGACGATGTTTTTAATGTGATGGTTGTGCTTGCTGTGTTTTATGGGCCCAGCATGGCACAGCATTGCTGCCGGACCTACACAAACTTGCATAGTCTGTCTTTTCTGTCCCGAGGCACAA
CCTATGAATAGAGCTTGGCTACTGCAGGTGCCACTGTGGGTGCTATCAGGTTGGGCATGGAGACGCTCCCGCCTGTGCCCCGGGGTGTTGGCACAAGGAAGCAGCAGCATTGCAGCTAGT
TCCCCTCCCTGGCACCTGGCTGCCTGGTGCCCCCACTGGACTATGAAAGGGGGAATCCAGGGGTGATGTGGGAGGCATCAACAGAAGAGAGTGGACAGAGAGCCTGCCACGAGAGAGGGC
CATGCACACCCTGGACACCCCTGCACTCAGTGGACTATCTTCTCAGTTGTAGATGCCCCCTGTTTGAGGGCTGCTTTCTCTGATTGGTCAAGGTCACTTTCAATTCTGTTCTGCCTTTTG
AGTCCATGGCTACCCCACCCAGCTTAGTGTCAGCGGCAGGCTGGATGAGCACCTTCTCAGTGCCATTTCCCGGGTCACTGGTAACCACATTAGATAATCTGGGGCCTGCCCCACCCTGCA
CCTACCCAAGCCTGACCTTGCTGGGTGACAGGCTGCTGTGTCTCTGGTCCTCCTCCAGATATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGA
GTCTATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACATGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGG
AGGTGGAGGTGGAAATG
GTGAGTGTGTGAGGAATGGTTGGCGTCAGGGAGGGGGGGCGTGTCCTCAAAATGAGAGTTCCTGGTGGGCAGGACCCAGGTCTTACTCTTTTCTGAGTCCTTG
GTACCTAGTATAGAACCAAGCACATGTAATTGGGATGGCACATGTAGGTGTGCAAGTTGCCTAATGCACAATGATACCCACTGAGGTCAAGGAGCAGGTTGAAATCTATCCTACACACTG
CTCAAAGCCACCGGCATTGCTCTAAGATGTATCTGGCCAGAGGAAGAGGCATTGTTTCTTTTACACAAAGGACCTGCAGGGTTGCATCCACCTAAGAGGATGTCCCCTTTCTTGTGCAAA
GTTGCCACATTGTCTGCCCTGTGTACAGTGAGTGCTTAGTCTAGGGAATGAGGGAAACAGGACTCGAGTCAGAGATCTGGACATGACTTTCCCAGAAGGGAGGAGGGCACAGTCTCCCAT
CCTACCCCACTGCCCTTGTGGGGAAGCCAGTCCTGCCTCTTGTTCTTTTCTCTAGGTGTATTAGAGATATCTTTAAACAAAACATCAGCTGTCGAGTCTCGTTCATCACTCGGACACCTG
GGGAGCG
GTGAGCAGGAATGGGGCTCTGGCAGGTTGGCCTGGCTGAGCCCCCTGCAGAGAAATGAAGGGAGTAGGACTGGCTGATCAGCCCCTGGTAAAATCAGGCATTTGCCCTTTGAA
AGTAGCTCATGGTAGCACAAACATTCCAGCTGCCTCTCTCACCCTATGCTGCTCGGATGCTTGGCTCTCTCCCTGCTGCTCCAGGCCAGAATCATTCTACAAAACAAATCATGAGATCCT
ATTAATTCATTTTGTGCCTTGCCCCCTGCCTGGTACCAGGAGCCACTCCCTACCTCTACCCCATGCTCTGCCCAGGGAGTTGTTCTCCTGGCTGCAAAGACAAGGGGAGAACAGCCCCAT
TTCTTTTTCTCAGCTCTCCCACCCCCAGGGACTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGACTGAAGAAGAGGTTTGGCAAAAAGTGAGTGTCTCCCAAATCCCT
GGGCCCAAAGAGACATGGAGAGAAGTCTTAGGGTCCCTAGGCCCCACCCACATATCCTTGACATATAAGCGACCATCCTGAGTCTCATTCCATTTGTTCCTGACCTGACTGAGAGGTTAC
AGTGTTGAATGACTTTTCATCCTCTTCCAGCCTCTGCACCCCATTCTTCAGGCAAGGGTCCTGGCTCAACAGGATGATAGTAAGGGGTCTCCTGGCTCCTGCCTGCTTTGGGCACAGCCT
TGAGGCCTGTGCTGGGATCAGGAAGAAAGAAGGATAAAACAGACAGGAGGAGGGGGAGTAGCAGGGAGACAGTGAGTGGGTGGATGGAGCAAAGACAGAAGTAAAGGGTTGGAGGAGGAA
GAAGCCCCCAGATTGCTTTTTTCCATTCATCTGTTGTGGCCCATCCTGATGCCTGCCAGATCCCCAGGTCACCTTTCATGGAGTGGCATTAAGGGAAGGCCAGAGGGCCCTTACCACACT
GCTGCCCTGCCTCCCCTTGCTATAGGGGGCAGCAGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAGCAGTACAAGCG
CTTCGTGGAGAACTACCGGAGGCACATCGTGTGTGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
GTAAGAGTTCCAGGCTGTGGGCAGTGGGTAGGGA
GCAGGCTCTGACCCTTGGAGAGGAGTGGAAAGCCCTCTGATCCTAAGAGTCTGCATGGGAGAGCCCAGGGCTCGGGACCTTGGCCACCTGTGCCAAGCTGATGTAACCTCACTCCGGCCC
CAGACACTATGGCTTTGCCTCGCCACCCTCGGACATTGCACAGACCACCCTCGTGGGCATCATCCTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTATATCTTGCTCAC
CATGTGCCGCAACCTCATAACCTTCCTGCGAGAGACTTTCCTCAACCGCTATGTGCCTTTTGATGCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCCA
GTAC
GTGACTCCCAGGCTTCTTCCTCTTTGCTGCAGCACCCTGGGTCTAGTTGGGGGAAACAGTGGGGAGATGGAACTCCTTATACCTCCATCTCTCCTCCCTATGCCTCCTCTCTCCCTCAGG
ATCGGAGGTAGAGTCTGTCCTGGTTGGCATCTCTAACAGGGTCTGTCTCTTCCAGTTTTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCACTCAGCCTGC
TGGCCTGCATATTCCCCAACGTCTTTGTGAATGATGG
GTCAGTTCTGGGGATGGTTTCTCCTGGGACTCATAGGGTGGGCCCAAGGGTATAATAGAAAAAGAAATAGGCAGGCCGGGCGC
GGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAA
AATTAGCCGGGCGTGGTGGTGGGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCATGAACCCAAGAGGCGGAGCTTGCAGTGAGCCGGGATAGCGCCACTGCAGTC
CAGCTTGGGCGAAAGAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAGAAAGAAATAGGCAGGGCACAATGGCTCACACCTGTAAGCCCAACACTTTGGAAGTCTGAGGCAGGAGGATTGC
TTGAGGCCAGGAGTTCCAGACAAGCCTGAACAACAAAGTGAGACTCCATCTGTACAAAAAGTAAAAAGATTAGCGGGGCATTGTGGTACACATCTGTAGTCCCAGCTATTCAGGAGGCTG
AGGCAGGAGGATTGCTTGAGCCCAGGAGTTTTAGGTTGCAGTGAGCCATGATCAGTACCACTGCATTCCAGCCTGGGCGACAGAGCAAGACTCTGTCTTGAAAAAAAAAAAAAGGAAAGA
AATAGACAGTCCCAGACACTCAGCAAGAAGCTCAGTGCTAAGCTAGTCCCCTGGGGGAAGCTGAAAGGTAAATTTCTTGCCTTCAAGAAGGAAGCTGGCTGTGATTGGCCAGGAAAGGTG
TTTGGGAGATGAAGTCAGAGACTCTTTCTCATAGATACCTGACACACAGCTTGCCATCTCTGCCTTCTATCCATTCACTGGACAGACATTTATGCAGCATGTCCCATGTGTCTACCCAGA
TGCCAGAGTAGGGATGTCAAGATACATGCAGTCATTCAACAACTACTTACCGAGATTTGCTGTGTGCCTTGTATTGTTCTAAGCCTAGGGATAGAGCAGTGAATGAAACAAAAATCCCTG
CCCTCATGGAGCTTACAGAATAATGAACCAGGGACTCAAGGAAGCAGGGGTCAACCACAGTAAGAGAGTTCAGGATAACACAGAAGGAAAAATTGCTGAGTACAGAGGGTGGAACCATGG
AAGCCTGGGTGCCAAGTTCCTTGGAGGGATCCCTGGCCAACTGGGTGGAGGCCCCATACCCTCAGCAGTCAGGGCCAGCAGGAAAGGAGTCATGCTGTGTTGTGACAGTGCTGGAGCCCC
TCCTGCCCCAGTCAGAGGCTCAAGGTGCCTTTGCCCCCCAGGTGTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTCCAGACCGTCCCAGGTAGGAAACGTGGGACCTGGGGGTTCT
GTCTGAGGACTTCTGCTTTTGTCTCCATCTCTCTGTGAATACTCACTTGTCTATACTGGCCATGGGGTCTGTTTCTAGCCTTCAGGACAAGCCCCAGCTCAAATCCCTCCAGGCAGGCTT
TTCTGGGGGCCCCGGAGGGATGAGAGAGGAAGGAGGGAAGGATAGGGGAATGTGCTGTTGTCTTTTCAGCCCAAGCTGAAGTCCTGAGACTACTCACTGGCCCTGTCTTCCTGCCCCCAG
GTGTATGACAGGTGTGCTTCTGCTCCTGGTCCTGGCCATCATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCT
CTATGCCCTG
GTGAGGGACTTCCCTGGGCCAGCCCATGGAGCAGGGAGCTCAGGATGGGACAGGAAGGTGAAAGAGGGAGAATTGGATCCAAGATCTCAGAATGAGACTTTGAGATTTAA
GACCCCAGACCTCAGCCCTATCTCCCCGGCACAGGCCTAGTGCCTGGGCAAGAGGGGATGCCGGGCAGGGGCCTGGCTGGGCCTGAGTTGTACTAACTGGCCGTGTCTCCAGCTCTCATC
ATCCATGGCAGCTATGCTCTGATCCAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGC
GTGGTGAAGGCGGAGCTGCTGCCCTCAG
GTATCAGGCCCAGCCTGACCTGGGTCGGGAGCGACAGAGGCCAAATCTTCAGACATGGGGAGACAGTTCACCAGGTCTCCTGACCCCATCAC
TGCCTCTGACTCTGTCTCCAAAAACAACAACAACAAAAAACCACTCTCGGGGGTTCCTGAAGGTTTCCTGATAGAAGTAGACCCAGAAAGGGCTGTGCTTAGCTCCCAGGAGACTTGCAG
ATGGTGAGAAGTGACCTGAGAAGAGGTGGCTGAACGTGCATACAGAGGGGTTTGAGGATGGGAAAGGGCCCCACATGTGTGGCCTGGGTGCAGGGGAAGTGCAGGGCAGGAAGCCACATA
TGCCTGTCCCATTCCTTCTCTCAGAGACAGGCAAATGCCCAGATTGCCAGTCTGGGTTGTTGAGAGTCAGTGCTGGCCAAAGTCGGGATTGGTATCATCATAGAGGGTGGCAAAAGATGA
TTTTATGTATCTCAGGACTGTCTAACCTTCAGGACTCATGTTGTAAAAAAATTAATCTCATTGCAATGTTATTTCAATTGAGATTACTTAAGGACAAAATCTCAGAGTGGTGTTAGTATG
CCTTTCTACTCTCCAGCACTTGCTGATCTCCTTTTTCAATGAAGAGATCAGGCCTGAGGCTCAGCTATGGTACAAACAGTATCCAGCTAAAATTTGGTAACAAAATATGTTGTCTTCTAT
GTAGGACACATGATACTGGTTTTCCACTTACCTTAGCAATAAAGTTCCCTGCCAAGATTAATTGAATTGGAAAAGTTAGTCAATTTAAAGAAAAGTGTTAGATAAATGATAGTGCAGGAG
GTGTGTAGAAAGGTAACCCTCAAATGGAGGTCTGGTAGCCACTGAGATGGTCCCTGGTGAGCCTGAGTCCCTTTACCCAGCAGGTTGGTATTCAAGGCAGTGGTATGAGGCCTGGGCCTT
CCATGGGAAGAGGGAGTAGAGAGGAGGAGAGGGGCTGGAACAGGGGAGCAGAGAAACCCAGCTCTTGCATTACCTGGGAAACAGAGAAGGGGACCCTCCTACTCCCAGCCCCAGGAGCCC
CGCTTGCTCAGACCATGTGACTACTCCCCAGGCCTCAGGGGTGTGAAGAGGACAGGTGCCTATCAGTCCTCAGGTACCAGGAGCAGGCCTTCTCATCTGTGCTTTTCCCTGTCTATGACC
TCCAGGAGAGTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGATCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTG
ACCTCCGCGCCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCATCCCCAAAGGGCAATGGCTGTGCTGGATACCCAAAG
GTGCCCGTCACTGGGAACCCTGCTTCCGGGCCTCTGGCACTGGCAGAGGATCTCTGCCCTTCCCTATCCTGAGACTAGAAGCTCCAGCCGTCCCAAAGCCAGCCTGGGAGAGGACCGGGG
TGCCTCAGAAAAGACTAGGATGTTCTGTATCCTCCCTCTGCCTGTGTCTCCGTTTCTGGTCTCAGAGCTGGGGCAGGGTCAGGCTCATTTCATCTCCCCCCTCTCTTGGCAGCTCTGTAC
CTTGATGGACCGTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCATTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAG
TCATCCTTGGGCAGCCAAATGCTGTGTAAGAAG
GTGAGCATCCCTCCCTCATTCATCAAATGGGGCATAGGTGGCCGAATTGTGACCCGCATCAAGTGGTGGAGCATGAGAGAAAGCTCC
TGGCTCCAGGAACTGAGTCTGAAGGGGTCATTCTTACCCAGTGGTTGAGATGCCAAACTTGGAGGGAGGTTGGTGGTATAGCCAGAAGGGCCTCTGCTGGGACCTGTCAGTTGGAAGCCT
GGGATCAGGCTGGTGGGTCCTGCCACAGCTTTGGTGTCTGCAGGTGGTCTGGGGCTTCCCAGCCTCTCAGGTGAAGGCACCTGGGATCTAGGGAGGCTGAACTGAGCTGGGTCCTGATCT
CCAGCCCTGTGTCCCCAGATATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGCTGGCTGACATCATCCAAGAGGTGGAGGAGAACGACCACCAGGACCTGGTGTCTG
TGCACATTTATGTCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
GTATGTCAGGGCCCGCCAGGCAGGGCAACTTGGCGGGCAGATGGATTGGCAGCGTAAGGCAGGA
TGGCCAGGGCAGGTGGGTGGACGGCCAGGCTGAGCTGGCAGGAGGCACAGAGCTGATGGCCTGATCCTCAGCCTCCAGCTCCCTCCCCTCCCCATTCTCTGTCTCTTGGGCTATGTGGGC
TGGCTCGGGCTGAGTGCTGGCCCTGACTGTCTTTGGTCTGACCTGCCCCTGTGCCCCCAGTATACATCTGCGAGCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTGTTCACGGGCCTGC
GCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTCAGTCCCACTCCCTCCCACCCTGGGACTCTGGCCTTCTCCTGCCAGG
ACATCCTGGCCCTGAAGCACCCTGCCACTCTGTTCTGAGCAGAGAACTCCACCCGATTGCCTGGCCCCAGGATGAGGTCAGCTGTTAAAGGGGGACTTCCACCCCCTCCACGTTAAGCCT
CCTCCTCAAGGCCTGGGCTTGAAGCCTTAGTCATTCCAGCCAGGCTCAGGAAGCAGCTTTTCCCAAGGAGAGTGAGCACCTTTAGGCTGCAGGCCCCTCTCTCTCTCCAATCTCCTGACA
GGTGTGCGCAAGATCGGGGTGTTCAGCTGCGGCCCTCCAGGAATGACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAAC
TTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGGTCCATCGGGCAGTCAGGACGCACTCTCACTGCCCTGGGAAGTGCAGCGCTATGACGGC
TGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GCTGCCGGTTGCAGCGCCGCGTACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAAC
CCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCGCACCGTACTGGGGGTCTTCTTTG
GCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAA
ACGCCCGGTTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAG
ACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGGGG
GGACAGCTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAACGGGCCCCGGGGGCTGTACG
CCTTC
GGGGCAGAGAGAGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAG
CTGTTCCAGCACGCACGCAAGAGGGTCATCGCCACCTACCAG
AACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACTCCCGGAGTATACAGGATACCGTCCTTTCCTA
GACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTACATGAG
AAATGCCAGCTGTCATTTCCGGAAGGTCCTGAACAAGGGT
TTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
AACCCCAATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAG
TTGGAGGACAACATAGTGGTTGAAGATCTGAGGG
ATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGGGGCTGCCCAGC
TATAGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCC
CAGCTAGAGCTGCTCCTTGGGGGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAG
AACACCAGGAATGG
GCTGTTCTCCAAGAAGGAGATTGAAGACATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTT
GTCTGGCATAAAG
GTGCACCCTGCCCTCAACCTAAGCAGCTCACAACTGACGGCCTGCCCCAGTGTGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATC
ACCATCATTGCTCTCTGCTGCCTTCCCTTAG
TGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACACAAGAAGCTACAAAAGAAACTCAAAGAGAGCGTGAAGAAG
GAAGCAGCCAAAGATGGAGTGCCAG
CGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCTGCAGGTCCTGAACAGGCATCTCACT
GTGCTCCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAAGGAGTATGACCTG
GTGCTGCTGTTT
AGTTCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGCTATGGGACTTCTGCGTGCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGCGAGAAGGAGCTATTTAGGAAGGCTGTGACA
AAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCCGACGCAGGGACCCTGCCCCTGGACTCCTCCCAGAAGGTGCGGGAG
GCCCTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCCGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCC
TTCCGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTC
TTCACCATGATGCG
ATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCTATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACA
TGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGAAATG
GTATTAGAGATATCTTTAAACAAAACATC
AGCTGTCGAGTCTCGTTCATCACTCGGACACCTGGGGAGCG
CTCCCACCCCCAGGGACTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGACTGAAGAAGAGGTTTGGC
AAAAA
GGCAGCAGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAGAACTACCGGAGG
CACATCGTGTGTGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
ACTATGGCTTTGCCTCGCCACCCTCGGACATTGCACAGACCACCCTCGTGGGCATC
ATCCTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTATATCTTGCTCACCATGTGCCGCAACCTCATAACCTTCCTGCGAGAGACTTTCCTCAACCGCTATGTGCCTTTT
GATGCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCCA
TTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCACTCAGCCTGCTG
GCCTGCATATTCCCCAACGTCTTTGTGAATGATGG
GTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTCCAGACCGTCCCAGGTATGACAGGTGTGCTTCTGCTCCTGGTCCTGGCC
ATCATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
CTCATCATCCATGGCAGCTATGCTCTG
ATCCAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCGGAGCTGCTG
CCCTCAG
GAGTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGATCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTG
ACCTCCGCGCCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCATCCCCAAAGGGCAATGGCTGTGCTGGATACCCAAAG
CTGTACCTTGATGGACCGTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCATTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTC
TTCAAGTCATCCTTGGGCAGCCAAATGCTGTGTAAGAAG
ATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGCTGGCTGACATCATCCAAGAGGTGGAGGAGAACGAC
CACCAGGACCTGGTGTCTGTGCACATTTATGTCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
TACATCTGCGAGCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTG
TTCACGGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTGCGCAAGATCGGGGTGTTCAGCTGCGGCCCTCCA
GGAATGACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAACTTCTGA

Retrieve as FASTA