Entry information: PtroDuOx02 (DUOX2)
Entry ID: 5853
Creation: 2007-10-10 (Marcel Zamocky)
Last sequence changes: 2010-11-23 (Myriam Duval (Scipio))
Sequence status: complete
Reviewer: Christophe Dunand
Last annotation changes: 2016-02-11 (Christophe Dunand)
Peroxidase information: PtroDuOx02 (DUOX2)
Name (synonym): PtroDuOx02 (DUOX2)
Class: Dual oxidase [Orthogroup: DuOx001]
Taxonomy: Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism: Pan troglodytes (chimpanzee) [TaxId: 9598]
Cellular localisation: N/D
Tissue type: N/D
Inducer: N/D
Repressor: N/D
Best BLASTp hits
Perox       Score   E-value   PtroDuOx02 start..stop   S start..stop
HsDuOx02    3078    0         1..1547                  1..1548
CfaDuOx02   2756    0         1..1547                  1..1571
BtDuOx02    2714    0         1..1547                  1..1545
SscDuOx02   2702    0         1..1547                  1..1545
Gene structure
Exons
Exon  Start..End     Size
1     20196..20269   74
2     19844..19929   86
3     19411..19575   165
4     18625..18812   188
5     18244..18442   199
6     17482..17648   167
7     17020..17080   61
8     16798..16894   97
9     16259..16349   91
10    15893..15995   103
11    15161..15324   164
12    14419..14594   176
13    13707..13825   119
14    13194..13331   138
15    12890..13003   114
16    12487..12689   203
17    12005..12190   186
18    10502..10727   226
19    10322..10415   94
20    8160..8356     197
21    7572..7641     70
22    7122..7205     84
23    6417..6595     179
24    6029..6259     231
25    5736..5835     100
26    4657..4706     50
27    4262..4389     128
28    3886..4039     154
29    1679..1911     233
30    1288..1446     159
31    787..942       156
32    414..542       129
33    1..123         123
complement(join(1..123,414..542,787..942,1288..1446,1679..1911,3886..4039,4262..4389,4657..4706,5736..5835,6029..6259,6417..6595,7122..7205,7572..7641,8160..8356,10322..10415,10502..10727,12005..12190,12487..12689,12890..13003,13194..13331,13707..13825,14419..14594,15161..15324,15893..15995,16259..16349,16798..16894,17020..17080,17482..17648,18244..18442,18625..18812,19411..19575,19844..19929,20196..20269))
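The exon coordinates and the location string above can be cross-checked with a short script. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and the protein length of 1547 aa is taken from the Sequence Properties section of this entry. It parses the `complement(join(...))` string, derives the exon sizes, and checks that the summed exon length equals three nucleotides per residue plus one stop codon.

```python
import re

# Location string as given in the record (whitespace is stripped before parsing,
# so a copy with stray spaces inside the numbers would also work).
LOCATION = (
    "complement(join(1..123,414..542,787..942,1288..1446,1679..1911,"
    "3886..4039,4262..4389,4657..4706,5736..5835,6029..6259,6417..6595,"
    "7122..7205,7572..7641,8160..8356,10322..10415,10502..10727,"
    "12005..12190,12487..12689,12890..13003,13194..13331,13707..13825,"
    "14419..14594,15161..15324,15893..15995,16259..16349,16798..16894,"
    "17020..17080,17482..17648,18244..18442,18625..18812,19411..19575,"
    "19844..19929,20196..20269))"
)

PROTEIN_LENGTH = 1547  # aa, from the Sequence Properties section


def parse_intervals(location: str) -> list[tuple[int, int]]:
    """Extract (start, end) pairs from a join()-style location string."""
    cleaned = re.sub(r"\s+", "", location)
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", cleaned)]


def exons_in_transcript_order(location: str) -> list[tuple[int, int]]:
    """Order exons 5'->3'; on the complement strand exon 1 has the highest coordinates."""
    on_complement = location.strip().startswith("complement(")
    return sorted(parse_intervals(location), reverse=on_complement)


exons = exons_in_transcript_order(LOCATION)
sizes = [end - start + 1 for start, end in exons]
cds_length = sum(sizes)

print(f"exons: {len(exons)}")
print(f"first exon: {exons[0]} size {sizes[0]}")
print(f"CDS length: {cds_length} nt")
print(f"matches protein length: {cds_length == 3 * (PROTEIN_LENGTH + 1)}")
```

With the coordinates in this record, the sizes computed this way agree with the Size column of the exon table, and the total corresponds to 1547 codons plus a stop codon.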


Literature and cross-references: PtroDuOx02 (DUOX2)
Literature: unpublished
Protein ref. GenBank: XP_510367.2
DNA ref. GenBank: NC_006482.2 (42285634..42265366)
mRNA ref. GenBank: XM_510367
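The DNA reference span can be related to the local exon coordinates with simple arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and the mapping assumes that local position 1 corresponds to the lower chromosomal coordinate of the span (the record does not state this explicitly).

```python
# Span endpoints as written in the DNA ref. line (descending order,
# consistent with a gene annotated on the complement strand).
REF_FIRST, REF_LAST = 42285634, 42265366


def span_length(first: int, last: int) -> int:
    """Number of bases covered by an inclusive coordinate span."""
    return abs(first - last) + 1


def local_to_chromosome(local_pos: int) -> int:
    """Map a 1-based local position to a chromosomal coordinate.

    Assumption: local position 1 sits at the lower end of the span.
    """
    return min(REF_FIRST, REF_LAST) + local_pos - 1


print(span_length(REF_FIRST, REF_LAST))  # bases in the referenced region
print(local_to_chromosome(1), local_to_chromosome(20269))
```

Under that assumption, the span length equals the highest exon coordinate in the gene structure (20269), so the local numbering covers the referenced region exactly.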
Protein sequence: PtroDuOx02 (DUOX2)
Sequence Properties
First value: protein; second value (in parentheses): mature protein
Length (aa): 1547 (1525)
PWM (Da): 175116.97 (172797.7)
Transmb domain: o599-621i1040-1062o1077-1099i1146-1168o1183-1205i1218-1240o (o577-599i1018-1040o1055-1077i1124-1146o1161-1183i1196-1218o)
PI (pH): 8.2 (8.20)
Peptide Signal: cut: 23 range: 23-1547
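The two sets of values above are linked by the signal peptide cut: the mature protein starts at residue 23, so every mature coordinate is the full-length coordinate minus 22. The following is a minimal sketch of that conversion, with hypothetical function names; it assumes the topology strings use a TMHMM-like notation in which `o` and `i` mark outside and inside loops and `start-end` pairs mark transmembrane helices.

```python
import re

FULL_TOPOLOGY = "o599-621i1040-1062o1077-1099i1146-1168o1183-1205i1218-1240o"
MATURE_TOPOLOGY = "o577-599i1018-1040o1055-1077i1124-1146o1161-1183i1196-1218o"
FULL_LENGTH = 1547
MATURE_START = 23  # first residue of the mature protein (range 23-1547)


def helices(topology: str) -> list[tuple[int, int]]:
    """Return the (start, end) residue pairs of the predicted helices."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)-(\d+)", topology)]


def shift_topology(topology: str, offset: int) -> str:
    """Subtract `offset` from every residue number, keeping the o/i markers."""
    def shift(match: re.Match) -> str:
        start, end = int(match.group(1)), int(match.group(2))
        return f"{start - offset}-{end - offset}"
    return re.sub(r"(\d+)-(\d+)", shift, topology)


offset = MATURE_START - 1              # residues removed with the signal peptide
mature_length = FULL_LENGTH - offset   # length of the mature protein

print(mature_length)
print(shift_topology(FULL_TOPOLOGY, offset))
print(len(helices(FULL_TOPOLOGY)), "predicted transmembrane segments")
```

Shifting the full-length topology by this offset reproduces the mature-protein topology listed in the record, and the mature length agrees with the value in parentheses.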
Sequence 2393
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLRARPEALMLLGALLTGSLDPSGNQDALSLPWEVQRYDGWFNNLRHHERGAGCRLQRRVPANYADGVYQALEEPQLPNPRRLSNAATRGIAGLPSLHNRTVLGVFFGYHVLSDVVSVETPGCPAEFLNIRIPPGDPVFDPDQRGDVVLPFQRSRWDPETGRSPSNPRDANQVTGWLDGSAIYGSSHSWSDALRSFSGGQLASGPDPAFPRDSQNPLLMWAAPRPPPPGQNGPRGPFGAERGNREPFLQALGLLWFRYHNLWAQRLARQHPDWEDEELFQHARKRVIATYNIAVYEWLPSFLQKTLPEYTGYRPFLDPSISPEFVVASEQFFSTMVPPGVYMRNASCHFRKVLNKGFQSSQALRVCNNYWIRENPNLNSTQEVNELLLGMASQISELEDNIVVEDLDYWPGPGKFSRTDYVASSIQRGRDMGLPSYSQALLAFGLDIPRNWSDLNPNVDPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSAIVLDQFVRLRDGDRYWFENTRGLFSKKEIEDIRNTTLRDVLVAVINIDPSALQPNVFVWHGAPCPQPKQLTTDGLPQCAPLTVLDFFEGSSPGFAITIIALCCLPLVSLLLSGVVAYFRGRERKKLQKKVKESVKKEAAKDGVPAMEWPGPKERSSPIIIQLLSDRCLQVLNRRLTVLRVVQLQPLQQVNLILSNNRGCRTLLLKIPKEYDLVLLFSSEEERGAFVQQLRDFCMRWALGLHVAEMSEKELFRKAVTKQQRERILEIFFRHLFAQVLDINQADAGTLPLDSCQKVREALTCELSRAEFAESLGLKPQDMFVESMFSLADKDGNGYLSFREFLDILVVFMGSPEDKSRLMFTMYDLDENGFLSKDEFFTMMRSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSELRFTQLCVKGGGGGGGIRDIFKQNISCRVSFITRTPGERSHPQGLGPPAPEAPELGGPGLKKRFGKKAAVPTPRLYTEALQEKMQRGFLAQKLQQYKRFVENYRRHIVCVAIFSAICVGVFADRAYYGFASPPSDIAQTTLVGIILSRGTAASVSFMFSYILLTMCRNLITFLRETFLNRYVPFDAAVDFHRWIAMAAVVLAILHSAGHAVNVYIFSVSPLSLLACVFPNVFVNDGSKLPQKFYWWFFQTVGMTGVLLLLVLAIMYVFASHHFRRRSFRGFWLTHHLYILLYALIIHGSYALIQLPTFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPSGVTYLQFQRPQGFEYKSGQWVRIACLALGTTEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSSPKGNACAGYPLYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSLGSQMLCKKIYFIWVTRTQRQFEWLADIIREVEENDHQDLVSVHIYVTQLAEKFDLRTTMLYICERHFQKVLNRSLFTGLRSITHFGRPPFEPFFNSLQEVHPQVRKIGVFSCGPPGMTKNVEKACQLVNRQDRAHFMHHYENF*

Remarks: Complete sequence from genomic DNA (chromosome 15, 30 introns). No EST. Isolate = "Yerkes chimp pedigree #C0471 (Clint)". Isoform 2.
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGATCCATCGGGCAAGTATCAGGCTCCTCTAGCGGCGGGGTGTTCCCCAGGATCCTCTGGG
AGGTGGGGCGGGGAGAGGTGCGGCAAGCGGCTCCCTGAGACTGGAAGGTCATTTCGCCGTGCAGCTCAGCGGGATGGGAAACTTCCCATTGCGGCCCGACACTTGGGTCCGGTTAGGGGC
GCTCCGCGAGCTGGGGAAGGACTGGCCAAGGCCTTCGTTGCTCGGGAGGGGTAGCTGGGAGCGTAGTGCTGAGGAGGCCCTTCTCTGTGCCCACAGGCAGTCAGGACGCACTCTCACTGC
CCTGGGAAGTGCAGCGCTATGACGGCTGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GTGCGTTCTGGGGGCCCGGGCGTGCTGGGGCCGTGGCTCGCGAAGGGCCGGGGC
GCGAAAGGCCCTGAGCGGGGAATCTGCGGGGAACACGCGCCCAGCAGCTCCGCTGCCTACACAGCTCAATCTTATGCGCTCCCGGGGCCAAGAGACCCTTGAGGGAAGGTTCTGTCAGTG
AAGTGGGATGGGGGTTGAGGGAGGCTTAGGGCGAGGTTTGGGGGATCCTAGGGGATGGAGTGCTTAGACAGAGCCCCGCTCCCTGCCTCCGCAGGCTGCCGGTTGCAGCGCCGCGTACCA
GCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAACCCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCGCACC
GTACTGGGGGTCTTCTTTG
GTGAGGGCAAAGGGGGAGACCAGTGGGGTTGATGTGGCGCTCTGCTCAGCCTGGGGGAGGGGCCAGATCCCGTCTGCGAGTCCACAGGAGACTCATCCGAC
TCCCAACCACTTCCTCTCTAAGCAGCACTTCGAGACTGCCTTCATCTCGGAGAGATTTTGGGATGTTGATACAGAGATATTTGCTCTGTATCTAACCTTTCTCTTACGCCTTACTCCAAA
CTAGGGGTGTCACTGGACCCCCATTATAGCTCTTGCGAACTGAGCTCCCCAGCCACCGCTCTCCTCACCGTGTGTTTGTAACCATTTTACCTCCCCCTAGCCCAGAGGGAGGAGGACTGA
CTTGGGGTACCCCTACCTAAATTATATCATTTTGATTCTCACAACAGCTTTATGAATTGAGTAGGAAGGGAACTCACTATAGTCTTACTTTGCAGATTAGAAAATTGAGGCTCCCTGGGG
CTAACGTGCAGAGCTGGTGGCGGAGCTGGCACTGGGACCTCTGTCTTTTGGCTCCTGAGGACACTTGGAGGCCGCCCTGGCCGTGGGGAGGGCGCAATACGGACGGTTTGTCACCTATTT
GCGCCCCATGCCCGCAGGCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAAACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGACCCCGA
CCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAGACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GTGAGGCGGGGAAGGCGGCGGGAAGGGGCCGCACC
CCAGCCAGGTGGGGCCTGGGCTTCGGGCCTGGCAGGGCCTGGAGGGGAGAGGCGCCCACTCCCCAGCCGCGGACACCCGCCGGGCCCCGGCCTTCCCTGGCCCGCCGCCGCCCATCGGCC
CGGGCTCACCCGCCGCGTGCCCCGCAGGCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGGGGGGACAG
CTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCCGACCCCCGCCACCGGGGCAGAACGGGCCCCGGGGGC
GTACGGGAGGCCAC
AGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTGTACGGTGAGC
CCCCAGGGACGGGACGGGGCCGGCTGGGGGTCTGCGAGTGTGGACTCCCCCGATCACGCTACCGCTCATCTCCTCCCCCGCGCCCCCCACGTCGGATGCAGCCTTCGGGGCAGAGAGAGG
GAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTGTTCCAGCACGC
ACGCAAGAGGGTCATCGCCACCTACCAG
GTCAGCCGTCCGCGCCCCGCGACGTCCTCCCTTCCGCGTGCAAGCCCACGGGAGACTCCGCTGCCCCACGGAGCTCCCCATCTGTGGACAAC
CGCCACCCAGAAACCCCTCCCCAGACAGCCGAGGTCTAGGGAAGCCCCTGTAAATGATAGGGAGGCACGCGCTGTTTATAGGAGAAATCTGGCTGGTGATGACTATTTATCACCTCCCCA
CCCCCCACTCCCTCAAATCCCCTGGTTCCTTATGGGGACAGGCCTCACACTGCTCCTGTCTGAGTTGCTTCTCCCATGATTGACCCTTCCTGGTCCTCATCTCCACACGGAAGCCGTCCT
TGGGCTCAGACCCTTCCAGGCCCCACGCCATCCAACTCGTGCCTCCCCTCGCCCCTCTCTGCCCCTCAGAACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACTCCCG
GAGTATACAG
GTGAGGGAGCGGGGAAGGAGGACACCTGTGCGGAGAATCCTGCGGGGAAGGAGACAGGTGCCTGTGATGGGAGGATGTGGAGGCAAGGAGCCTGTCTCCCCATCATCACC
GTCTCCTTCCTGCAGGATACCGTCCTTTCCTAGACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTACATGAGGTGAGGGA
GGGGTTGGCAGAGAGGGGGCACCACACTAAGAAAGGTGCAGAATGAGCTGCCTTGGGGGCTGGGGCCTTTCACACTCCTTCGCAGTTTCACTAGAGAAGGGGAAGCAAAAATTTGGGGCC
CTGAAACAGAACCCTGGGGTAAGATGTGTAGGCTTAGTAGGGAAATCTCCCCAGCTCTCCTAAGGGCTGAAATTTGGTGGCTGGGTGTAGGATTTGTCTAGCAGCTGGGTCATTCCCTTC
CCTCCTCTCCCCACCCTACCTGGACTAGGAGCGCACTCTATCTTCAGTAAACGCACATCGCCAAATCTCTGCCGTGTTCAAGGAAGTTCCTGGGCCACTGCTCAATCCTAGTGAACCCCC
ACTGAGTCCCTCAGCCCACTCAACCCCATCTTTGATTCTTCTCCAAATTCCCTCACCACATCCTTTGTTCTCAATTTCAGAAATGCCAGCTGTCATTTCCGGAAGGTCCTGAACAAGGGT
TTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
GTCAGACTGGGGTCAGGGTCAGGGGAAGATGGGTCAAGGTCAGTCTCTTCACACAGGCTGGGAAAAGCA
ACAATTCCAGTTCTTGAGTGTTGCTGCCCCAGGTTCATGGGAGATGAAGGGTAGAGGAAATTATCCGGGGACAACAGCTCAAGAGAGTCTGGGACTGGCCAAGGGCCCTTTGTCCTGGGG
TACTAAAGTGGTCCAGGCTGAGAGAGACCGAGTTTTGGGTAGAGGCCTATCTTGAGTGCATTGTTTACTTCCAGAATCCCAATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGG
GAATGGCCTCCCAGATTTCGGAGCTGGAGGACAACATCGTGGTTGAAGATCTGAGGG
GTGAGCTCAGAGCCAGAAGGGGTGGATGGTAAGGGACCAGGAAGCCTGAGGATCCCTCTGGGT
TCATCAGTAGCAGACCTAGGGCACTCACGATGCAGGAATACAGATACAAACACAGACTTCAAGGAGCCATAAGAAGAAAATGAACTTTGAACTTTTTTTTTTTTGCTTAATTTACACTTT
TGTGTTGCTTTTATTTTAAATTAACTATGTCTGAGTATTGGGGAGGGGTGGTACCACAATCTCTTTGGTGCTTAGTTAGGTCTCTAAAGGTTTTCATTCAGCCCTGTCTTGAAACTAGGG
AGGCATAGGACAGGGAATTTACTGCTTGGTGACTAGAACAGTCTTGAGTCTTAGAGGAAGGGTCTTACTGGAAAAACTGGCTCTGATCACACATGGTGACATTGGCCTTGCAGCAAGGCA
AGGTCAGCATGGGCACAGATCTCATGTGGATCACTGGGGGTAGCCAGGAGGGAAGAACCGTAGTGCCAATAGTCAGATACAAGGCTGCAGGGCAAAAAAGGATGGAGAGGACAAAGCCCA
TAGTCTAGTCTTCTCTCCTCTTCAGATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGGGGCTGCCCAGCTATAGCCAG
GCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTCAGGAATAGTAATGATAATAATGGCAGCTAAAGCTTTCCTATGGCAATA
CTGTTCCAAACCCTTTACATATGTTGACTCATTTAATCTACATAATAATATTGTAAGGTATGTATAATAATTTTCCCCATTTTACTGATGACATAGTTGGTAAATCATAGAGGAGGGACT
TAAATTCAGCCATCTGATTCCAGAATATATTCCTAACCACTGCATTGTACCATTCCTGCAGGGTGGCTTCTGGGTTGGGTGCCATTGTCCTGTTGCTGCAGGGTCCCACCCCAAGGGCTG
TGTGCCCCTGAAGCTGCTAATCATTGAGGTCAGGCAGGCCGGTGATGGTCACAGGATATGGTCCTAGGGCACCTGACCGTGGTCTTACCGTGGGTAGGGACACACTGATCCTTCCACCAG
ACTTGTCCTGCCTGAGGGGGCTTGCCTAAGAAGAGGAAATCAGGCCTGAGCAGCAAGCCAGGCAGCGGCTGGGGTCCTGTGTCTGAGGGATGGGGCAACAGTGGCTGCCCTCCGCAGCAA
ACATACGCTCACCCCTTACTCCTGTGGTGCCCCAGGTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAGCTAGAGCTGCTCCTTGGGGGGCTCCTGGAGAGCCATGGGG
ACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAGAACACCAGGAATGG
GTAAGGCTTGCCTGGGCCCCCACCTCAGA
CTGCTCCTCAGCCTGAGCCCCAGACCCTCTGTCTGGCCTTAGACAGCCCCTATGAGCCCTTGATTCCCAGTCAGCCCACCACACCCTTCCCAACCCCTCTGGGTCTCTCTTTTTTTCTTC
TCTTTTCTTTTCTTTTTCTTTTTCTTTTTTCTTTTCTTTTTTTTTTTTTTTTTTTTTGAGGCAGAGTCTAGCTTTGTCACCCAGGCTGGAGTGCAGTGGCGTGATCTTGGCTCAATGCAC
CCTCCACCTCCCAGGTTCAAGTGATTCTCCTGCATCAGCCTCCCGAGTAGCTAGGATTAGAGGCATGCACCAACATGCCCAGCTAATTTTTTTTAAAAATATTTTTAGTAGAGATGGGGT
TTCACCATGCTGGTCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAAGTGTTGGGATTACAGGCATGAGCCACTGCACCCAGCCCCTCTGGGTCTCTT
TTCTCACCTGGGTCCTTGGGCCTGGGGTTGCTGGAGGCCTGCATCCCCTTCCCATCCCAGTGACTTCTACTTCCTCCAACTTAGGCTGTTCTCCAAGAAGGAGATTGAAGACATCCGAAA
TACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTTGTCTGGCATAAAG
GTGAGTGCCCTGGGAGAACACAAGTGAGTGACAGTGG
CCAGAGAAGGATCAAGATTGAGGGTGCGGGGGAATCACTTGGTGCTGTCCAGGGAGGCAGGCACCTTCTGTGTTGGGCTAGGAGGCCTGCATTTGGCTGGCTCCCACAGCAGGGACCTCA
ACTAGCACACAAGCTACACCCTACAGTCAAGAAGGGGTGGATGGGGTAGATGCCAAGAGACAGGAAATGAATGGGGACTTTTTGAGGGAGACAGTTTCAGGGAGGTGGGCCTGGGGAAGA
CAGATGATATCTTGGTCCTTTATAGGATAGAGGGGAAAGAGGTCTGGCCACATAGCGGGATCCTCAGACTTTGAGGTCTTCCCTGCCCTCTCCCTCAGGTGCACCCTGCCCTCAACCTAA
GCAGCTCACAACTGACGGCCTGCCCCAGTGCGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATCACCATCATTGCTCTCTGCTGCCTTCCCTTAG
GTGA
GCTCTTAGGCAGCCTCTCTGCAGACTGGCCCTGCCCCTCATTTCCTGCTGGCCTGAGGGGCTGGCTATTTGGTACCGTTTGAGACCAGGCTCAAGGAACCTCTGGAAGGGAGGGGCCATA
GCCTAAGCCACAGTGAAGCTCTAGGCGAGGGGCTCCCTCCTCACTGTTCCTTCTGATCCGCTTCAGTGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACGCAAGA
AGCTACAAAAGAAAGTCAAAGAGAGCGTGAAGAAGGAAGCAGCCAAAGATGGAGTGCCAG
GTGAGAAGGGGCTGGGCAGAGGAGGGAGGAGGGACGGAGGAGGGGAGAGACAGGAGTCTG
GGAAAAAGAACCAAGTTACAGAGTGAGAGGAAAGCCAAGGCACCTTTAGGGCGCCTGCTCAGACTCACAGAGGAATTGACCTGAAGGCGGGGACCTGGGGACATCTGCTGAACTACCCGG
CCCAATTATCCCTTCCCCAGCGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCTGCAGGTCCTGAACAGGCGTCTCACTGTGCT
CCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAAGGAGTACGACCTG
GTATGGCTCGTCCTGCC
TCCCCAGCCTGGGCTGCCCTCACACGACTCCATTATCACAAGCGAGGCCACCCTATCCTCAGCTACAGAGCTCACCTATGACAGCTGATGCTGGGGAGAGGGGCTCCTTTCAGAGGCCCC
CAGACACAACCTGACCCCCTTCGTCCACACACCTGGCCCCAGCCTGGATGGATGGGGAGGAGTTTTCTCTCCTCCCCTCAACCCAAGATCCATTGAGGGGAGGCTGAAGCAGAAGGTCCA
GCGAGCTCCCTGCATCAGTGCCGCCTTCCTCCCACCCAGGTGCTGCTGTTTAGTTCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGCTACGGGACTTCTGCATGCGCTGGGCTCTGGGC
CTCCATGTGGCTGAGATGAGTGAGAAGGAGCTATTTAGGAAGGCTGTGACAAAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCCATGATCTGTG
CCTTTTGGAGATGGGTCCAGCCCCAGAAATGGAGGAAACCTGGGCTGCATAGAACGCCCCTGTGGGTGAACTAAGCTTCCGCTCTATGGCCTGGAGAGAAATAGCCTTGTTTGAATCCTG
GCATTGCCACTTTACTTAGCTCTGTGACCTTAGGCAAGTCACATTATCGCTGTTCTGTATCTCTGTTTCCTCATCTATAAAACAGTGATGAAAACTGTATCCATCCCATTGCATTGTTGT
GAGGATTCGGTGAGATCGTCTACATGAGTGGTACACAGAGGTTGGCCTCTGGACCCGAGAGCATCAGCCTCACCTGGGAACATGTTAGAAATGCACCTACCCAGTTAGACTGAACCAGGA
ACTCTGTGGGTGGGGCCTGGCAATCTGTGTTTTAACAAGCTCCCCAGATGATTCGGATACACTCTAATGTTTGAAAAACATTGTTTTATGTACAGTGCTTATTGGCCCAAGTGCCAGGTA
TGTTGCAGACATTTAACAAACGGTTGTGGCCAGGCGCAGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTAAGGTCAGGAGTTCGAGACTAGCATGG
CCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGACATGGTGGCTCACACCTGTAATCCCAGCTACTTGGGAGGCTAAGGCAGGAGAATCGCTTGAACCCGGGTGG
CAGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAACAAACAAAAAACAATAAATGGTTGTTACATGTGACTTTTAAACT
TTTTGTGCAATGGGCAAATCATAGGCACATGGCAGCCTTATCTGAATTGGCAAGAGAGCACAGCCCCAGCACCTTCCTGCCTGTCTACCACCATGTCTCTACATCTTCTGTCCCCAGTAT
AGGCTCTCTCACTTTCCATTCCCCTTAACTTTGCCCTTCCCCTTCCCTACCCCAGCACCATGCCCACTGCATGAAGTTCCCGGTTCTTGGGCCCAGGGAGAAATGGGCAGGCTGCTAGAG
ATTTGATTCCCCCGTCTATAGGACAACAGAGGCCCCAGTCAGTATATCTAAGGATCAGGAGAACCATCAGAGTTTAGCCTTTCTGATTTGGACTTTGGGGAGATATGAAGGGTCACTGAA
CTGCTTCCAGCATAGGCTTCACCTCCTTCTCTTTCCCTCCCTCTGCTGCTGCCCGAGTGCAGGTGCTGGACATCAACCAGGCTGATGCAGGGACCCTGCCCCTGGACTCCTGCCAGAAGG
TGCGGGAGGCCCTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCTGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCT
ACCTGTCCTTCCGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GTAGGGGGCTGGGAGGTGGCAGGCTATCCAAGAATCCAGGGGTCTTTCAGCAAGGAGATGACCTGCATTCCC
TTTTTTCTTCCCAGGCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTCTTCACCATGATGAGGTATGGGGTGTG
CCTTTCTAATCCTGAGATTTCCTGGTGTGTTTCAAACAGGAAAACAGGTCCAGTCAGAGGAGGGCTGGCAAACAGGCTATGCGGTCATCTGTGCTGAGAGGTGGCCTAAACACTACATCC
TAAACTCTCAGAGCATCCACCTTCAAATATTTACCTGACTGGCTCCTGCCTCTGGGAGAGTCTCTGTCTGGACTGTCAACACCAGCCAGAAAAGCCTCCCTAGTTAAAAAACGAAAAAAA
AAAAAACCCAACACCAATATGGCCAACGACAAAAATCCACTAATCCCTTTTGGATGCCCTTGGATCTTTGTGAACTATTTTACGGCATGCCCAACACCGTGCTCTACCCAGTGAAACAGT
GAATGGATGACCTTGGTTGCTGCTCCGATATTCATCACCATGATAGTCAGGAAGAGAAACGTGGAAGGCTTCTCCATATTCCAACCATTCTTTCTTCCTGCATCTTAAGCCCTTTCTGGT
TTTGTTGTGCCGGTAAAAAAAACAGCTTTGTGCTTCTCATTCCTGAAGACAATGAATGCGTCAGTAACACAGCTCCCCTCCATGCCATAAGGGCAGGGCTTGTTCCCTGTTGAATCCAGC
TTCTCTACACTGTGATTGGCACAGGGCAGGCATTTCATACATAACTAACTGAGTAAGACAAAATGAAATAAGTGAGCAAATGAATACAAAGTATAGATGTAACAGCCCACATTATTTCAA
TTTTTCTATCCTGTTAAGCCTTAACATTGCTTTAAGCATTCCCCTTAACTGCTACATTCCTCATATGGTCCAGATACCCCAACTGGACAAGGGCTTCTGAAAGGGCAAAGCTATTGTAGT
CTGTACCTCACTGGGTATGTCATTGCAGGCCCAGCCCGAGGTGAGGCTCAAGGGAATCTAGGAGAGGGTCTCTGCCTCCAAGGGCTGAGTTCCCACCTTCTTTTTTTGGTTTGTTTGTTT
TGAGACGGAGTCACTCTGTCACCCAAGCTGGAGTGCAGTGCCACGATCTCGGCTCACTGCAACCTCCACCGCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGAATT
ACAGGCATGTGCCACCACGTCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTAGTCTCAAACTCCTGACCTCAGGGGATCAGCCCGCCTCAGCCTCC
TAAAGTGCTAGGGTTACAGGCGTGAGCCACCACACCTGGTGAATTCCCACCTTCTTGTCCTGTAACTCAGCGTGTATCTTGCTTACTGTCGGTGGGACGATGTTTTTAATGTGATGGCTG
TGCTTGCTGTGTTTTATGGGCCCAGCATGGCACAGCATTGCTGCTGGACCTACACAAACTTGCATAGTCTGTCTTTTCTGTCCTGAGGCACAACCTATGAATAGAGCTTGGCTACTGCAG
GTGCCACTGTGGGTGCTATCAGGTTGGGCATGGAGACGCTCCCGCCTGTGCCCCGGGGTGTTGGCACAAGGAAGCAGCAGCATTGCAGCTAGTTCCCCTCCCTGGCACCTGGCTGCCTGG
TGCCCCCACTGGACTATGAAAGGGGGAACCCAGGGGTGATATGGGAGGCATCAACAGAAGAGAGTGGACAGAGAGCCTGCCACGAGAGAGGGCCATGCACACCCTGGACACACCCCTGCA
CTCAGTGGACTATCTTCTCAGTTGTAGATGCCCCCTGTTTGAGGGCTGCTTTCTCTGATTGGTCAAGGTCACTTTCAATTCTGTTCTGCCTTTTGAGTCCATGGCTACCCCACCCAGCTT
AGTGTCAGCTGCAGGCTGGATGAGCACCTTCTCAGTGCCATTTCCCGGGTCACTGGTAACCACATTAGATAATCTGGGGCCTGCCCCACCCTGCACCTACCCAAGCCTGACCTTGCTGGG
TGACAGGCTGCTGTGTCTCTGGTCCTCCTCCAGATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCA
GGACAAGGAGGAGCTGACATGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGAAGTG
GTGAGTGTGT
GAGGAATGGTTGGTGTCAGGGAGGGGGGGCGTGTCCTCAAAATGAGAGTTCCTGGTGGGCAGGACCCAGGTCTTACTCTTTTCTGAGTCCTTGGTACCTAGTATAGAACCAAGCACGTGT
AATTGGGATGGCACATGTAGGTGTGCAAGTTGTCTAATGCACAATGATACCCACTGAGGTCAAGGAGCAGGTTGAAATCTATCCTACACACTGCTCAAAGCCACCGGCATTGGTCTAAGA
TATATCTGGCCAGAGGAAGAGGCATTGTTTCTTTTACACAAAGGACCTGCAGGGTTGCATCCACCTAAGAGGACGTCCCCTTTCTTGTGCAAAGTTGCCACATTGTCTGCCCTGTGTACA
GTGAGTGCTTAGCCTAGGGAATGAGGGAAACAGGACTCGAGTCAGAGATCTGGACATGACTTTCCCAGAAGGGAGGAGGGCACAGTCTCCCATCCTACCCCACTGCCCTTGTGAGGAAGC
CAGTCCTGCCTCTTGTTCTCTTCTCTAGGTATTAGAGATATCTTTAAACAAAACATCAGCTGTCGAGTCTCGTTCATCACTCGGACACCTGGGGAGCGGTGAGCAGGAATGGGGCTCTGG
CAGGTTGGCCTGGCTGAGCCCCCTGCAGAGAAATGAAGGGAGTAGGACTGGCTGATCAGCCCCTGGTAAAATCAGGCATTTGCCCTTTGAAAGTAGCTCTTGGTAGCACAAACATTCCAG
CTGCCTCTCTCACCCTATGCTGCTCGGATGCTTGGCTCTCTCCCTGCTGCTCCAGGCCAGAATCATTCTACAAAACAAATCATGAGATCCTATTAATTCATTTTGCGCCTTGCCCCCTGC
CTGGTACCAGGAGCCACTCCCTACCTCTACCCCATGCTCTGCCCAGGGAGTTGTTCTCCTGGCTGCAAAGACAAGGGGAGAACAGCCCCATTTCTTTTTCTCAGCTCCCACCCCCAGGGA
CTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGGCTGAAGAAGAGGTTTGGCAAAAA
GTGAGTGTCTCCCAAATCCCTGGGCCCAAAGAGACATGGAGAGAAGTCTTAG
GGTCCCTAGGCCCCACCCGCATATCCTTGACATATAAGCGACCATCCCGAGTCTCATTCCATTTGTTCCTGACCTGACTGAGAGGTTATAGTGTTGAATGACTTTTCATCCTCTTCCAAC
CTCTGCACCCCATTCTTCAGGCAAGGGTCCTGGCTCAACAGGATGATAGTAAGGAGTCTCCTGGCTCCTGCCTGCTTTGGGCACAGCCTTGAGGCCTGTGCTGGGATCAGGAAGAAAGAA
GGTTAAAACAGACAGGAGGAGGGGGAGTAGCAGGGAGACAGTGAGTGGGTGGATGGAGCAAAGACAGAAGTAAAGGGTTGGAGGAGGAAGAAGCCCCCAGATTGCTTTTTTCCCTTCATC
TGTTGTGGCCCATCCTGATGCCTGCCAGATCCCCAGGTCACCTTTCATGGAGTGGCATTAAGGGAAGGCCAGAGGGCCCTTACCACACTGCTGCCCTGCCTCCCCTTGCTATAGGGCAGC
AGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAGAACTACCGGAGGCACATCGTGTG
TGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
GTAAGAGTTCCAGGCTGTGGGCAGTGGGTAGGGAGCAGGCTCTGACCCTTGGAGAGGAGTGGAAAGC
CCTCTGATCCTAAGAGTCTGCATGGGAGAGCCCAGGGCTCGGGACCTTGGCCACCTGTGCCAAGCTGATGTAACCTCACTCCGGCCCCAGACTATGGCTTTGCCTCGCCACCCTCGGACA
TTGCACAGACCACCCTCGTGGGCATCATCCTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTATATCTTGCTCACCATGTGCCGCAACCTCATAACCTTCCTGCGAGAGA
CTTTCCTCAACCGCTATGTGCCTTTTGATGCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCCA
GTACGTGACTCCCAGGCTTCTTCCTCTTTGCTTCTTCCT
CTTTGCTGCAGCACCCTGGGTCTAGTTGGGGGAAACAGTGGGGAGATGGAACTCCTTATACCTCCATCTCTCCTCCCTATGCCTCCTCTCTCCCTCAGGATCGGAGGTAGAGTCTGTCCT
GGTTGGCATCTCTAACAGGGTCTGTCTCTTCCAGTTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCACTCAGCCTGCTGGCCTGTGTATTCCCCAACGTC
TTTGTGAATGATGG
GTCAGTTCTGGGGAAGGTTTCTCCTGGGACTCATAGGGTGGGCCCAAGGGTATAATAGAAAAAGAAATAGGCAGGGCACAATGGCTCACACGTGTAAGCCCAACAC
TTTGGAAGTCTGAGGCAGGAGGATTGCTTGAGGCCAGGAGTTCCAGACAAGCCTGAATAACAAAGTGAGACTCCATCTGTACAAAAAGTAAAAAGATTAGCGGGGTATTGTGGTACACAT
CTGTAGTCCCAGCTATTCAGGAGGCTGAGGCAGGAGGATTGCTTGAGCCCAGGAGTTTTAGGTTGCAGTGAGCCATGATCAGTACCACTGCATTCCAGCCTGGGTGACAGAGCAAGACTC
TGTCTTGAAAGAAAAAAAAAAAGAAAAGAAAAAGAAAAAAGAAATAGACAGTCCCAGACACTCAGCAAGAAGCTCAGTGCTAAGCTAGTCCCCTGGGGGAAGCTGAAAGGTAAATTTCTT
GCCCTCAAGAAGGAAGCTGGCTGTGATTGGCCAGGAAAGGTGTTTGGGAGATGAAGTCAGAGACTCTTTCTCATAGATACCTGACACACAGCTTGCCATCTCTGCCTTCTATCCATTCAC
TGGACAGACATTTATGCAGCATGTCCCATGTGTCTACCCAGATGCCAGAGTAGGGATGTCAAGATACATGCAGTCATTCAACAACTACTTACCGAGATTTGCTGTGTGCCTAGTATTGTT
CTAAGCCTAGAGATAGAGCAGTGAATGAAACAAAAATCCCTGCCCTCATGGAGCTTACAGAATAATGAACCAGGGACTCAAGGAAGCAGGGGTCAACCACAGTAAGAGAGTTCAGGATAA
CACAGAAGGAAAAATTGCTGAGTACAGAGGGTGGAACCATGGAAGCCTGGGTGCCGAGTTCCTTGGAGGGATCCCTGGCCAACTGGGTGGAGGCCCCATACCCTCAGCAGTCAGGGCCAG
CAGGAAAGGAGTCATGCTGTGTTGTGACAGTGCTGGAGCCCCTCCTGCCCCAGCCAGAGGCTCAAGGTGTCTTTGCCCCCCAGGTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTC
CAGACCGTCCCAG
GTAGGAAACGTGGGACCTGGGGGTTCTGTCTGAGGACTTCTGCTTTTGTCTCCATCTCTCTGTGAATACTCACTTGTCTATACTGGCCATGGGGTCTGTTTCTAGCC
TTCAGGACAAGCCCCAGCTCCAATCCCTCCAGGCAGGCTTTTCTGGGGGCCCTGGAGGGATGAGAGAGGAAGGAGGGAAGCATAGGGGAATGTGCTGTTGTCTTTTCAGCCCAAGCTGAA
GTCCTGAGACTACTCACTGGCCCTGTCCTCCTGCCCCCAGGTATGACAGGTGTGCTTCTGCTCCTGGTCCTGGCCATCATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTC
CGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
GTGAGGGACTTCCCTGGGCCAGCCCATGGAACAGGGAGCTCAGGATGGGACAGGAAGGTGAAAGAGGGAGAA
TTGGATCCAAGATCTCAGAATGAGACTTTGAGATTTAAGACCCCAGACCTCAGCCCTATCTCCCCAGCACAGGCCTAGTGCCTGGGCAAGAGGGGATGCCGGGCAGGGGCCTGGCTGGGC
CTGAGTTGTACTAACTGGCCGTGTCTCCAGCTCATCATCCACGGCAGCTATGCTCTGATCCAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAG
CTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCGGAGCTGCTGCCCTCAG
GTATCAGGCCCAGCCTGATCTGGGTCGGGAGCGACAGAGGCCAAATCTTCAGACAT
GAGGAGACAGTTCACCAGGCCTCCTGACCCCATCACTGCCTCTGACTCTGTCTCCAAAAACAACAAGAAAAAAAAACCACTCTCGGGGGTTCCTGAAGGTTTCCTGATAGAAGTAGACCC
AGAAAGGGCTGTGCTTAGCTCCCAGGAGACTTGCAGATGGTGAGAAGTGACCCTGAGAAGAGGTGGCTGAACGTGCATACAGAGGGGTTTGAGGATGGGAAAGGGCCCCACATGTGTGGC
TTGGGTGCAGGGGAAGTGCAGGGCAGGAAGCCACATATGCCTGTCCCATTCCTTCTCTCAGAGACAGGCAAATGCCCAGATTGCCAGTCTGTTGTTGAGAGTCAGTGTTGGCCAAAGTCG
GGATTGGTATCATCATAGAGGGTGGCAAAAGATGATTTTATGTATCGCAGGACTGTCTAACCTTCAGGACTCATGTTGTAAAAAAATTAATCTCATTGCAATGTTATTTCAATTGAGATT
ACTTAAGGACAAAATCTCAGAGTGGTGTTAGTATGCCTTTCTACTCTCCAGCACTTGCTGATCTCCTTTTTCAATGAAGAGATCAGGCCTGAGGCTCAGCTATGGTACAAACAGTATCCA
GCTAAAATTTGGTAACAAAATATGTTGTCTTCTATGTAGGACACATGATACTGGTTTTCCACTTACCTTAGCAATAAAGTTCCCTGCCAAGATAAATTGAATTGGAAAAGTTAGTCAATT
TAAAGAAAAGTGTTAGATAAATGATAGTGCAGGAGGTGTGTAGAAAGGTAACCCTCAAATGGAGGTCTGGTAGCCACTGAGATGGTCCCTGGTGAGCCTGAGTCCCTTTACCCAGCAGGT
TGGTATTCAAGGCAGTGGTATGAGGCCTGCGCCTTCCATGGGAAGAGGGAGTAGAGAGGAGGAGAGGGGCTGGAACAGGGGAGCAGAGACACCCAGCTCTTGCATTACCTGGGAAACAGA
GAAGGCGACCCTCCTACTCCCAGCCCCAGGAGCCCCGCTTGCTCAGAACATGTGACTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCCCCAGGAGCCCCGCTTGC
TCAGACCATGTGACTACTCCCCAGGCCTCAGGGGTGTGAAGAGGACAGGTGCCTGTCAGTCCTCAGGTACCAGGAGCAGGCCTTCTCATCTGTGCTTTTCCCTGTCTATGACCTCCAGGA
GTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGATCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTGACCTCCGCG
CCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCATCCCCAAAAGGCAATGCCTGTGCTGGATACCCAAAG
GTGCCCGTC
ACTGGGAACCCTGCTTCCGGGCCTCTGGCACTGGCAGAGGATCTCTGCCCTTCCCTATCCTGAGACTAGAAGCTCCAGCCGTCCCAAAGCCAGCCTGGGAGAGGACCGGGGTGCCTCAGA
AAAGACTAGGATGTTCTGTATCCTCCCTCTGCCTGTGTCTCCGTTTCTGGTCTCAGAGCTGGGGCAGGGTCAGGCTCATTTCATCTCCCCCCTCTCTTGGCAGCTGTACCTTGATGGACC
GTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCATTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAGTCATCCTTGGG
CAGCCAAATGCTGTGTAAGAAG
GTGAGCATCCCTTCCTCATTCATCAAATGGGGCATAGGTGGCCGAATTGTGACCCGCATCAAGTGGTGGATCATGAGAGAAAGCTCCTGGCTCCAGGA
ACTGAGTCTGAAGGGGTCATTCTTACCCAGTGGTTGAGATGCCAAACTTGGAGGGAAGTTGGTGGTATAGCCAGAAGGGCCTCTGCTGGGACCTGTCAGTTGGAAGCCTGGGATCAGGCT
GGTGGGTCCTGCCACAGCTTTGGTGTCTGCAGGTGGTCTGGGGCTTCCCAGCCTCTCAGGTGAAGGCACCTGGGATCTAGGGAGGCTGAACTGAGCTGGGTCCTGATCTCCAGCCCTGTG
TCCCCAGATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGTTGGCTGACATCATCCGAGAGGTGGAGGAGAACGACCACCAGGACCTGGTGTCTGTGCACATTTATGT
CACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
GTATGTCAGGGCCCGCCAGGCAGGGCAACTTGGTGGGCAGATGGATTGGCAGCGTAAGGCAGCATAGCCAGGGCAGG
TGGGTGGACGGCCAGGCTGAGCTGGCAGGAGGCACAGAGCTGATGGCCTGATCCTCAGCCTCCAGCTCCCTCCCCTCCCCATTCTCTGTCTCTTGGGCTATGTGGGCTGGCTCGGGCTGA
GTGCTGGCCCTGACTGTCTTTGGTCTGACCTGCCCCTGTGCCCCCAGTACATCTGCGAGCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTGTTCACGGGCCTGCGCTCCATCACCCACT
TTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTCAGTCCCACTCCCTCCCACCCTGGGACTCTGGCCTTCTCCTGCCAGGACATCCTGGCCCTGA
AGCACCCTGCCGCTCTTTTCTGAGCAGAGAACTCCACCCGCTTGCCTGGCCCCAGGATGAGGTCAGCTGTTAAAGGGGGACTTCCACCCCCTCCACGTTAAGCCTCTTCCTCAAGGCCTG
GGCTTGAAGCCCTAGTCATTCCAGCCAGGCTCAGGAAGCAGCTTTTCCCAAGGAGAGTGAGCACCTTTAGGCTGCAGGCCCCTCTCTCTCTCTAATCTCCTGACAGGTGCGCAAGATCGG
GGTGTTCAGCTGCGGCCCTCCAGGAATGACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAACTTCTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGATCCATCGGGCAATCAGGACGCACTCTCACTGCCCTGGGAAGTGCAGCGCTATGACGGC
TGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GCTGCCGGTTGCAGCGCCGCGTACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAAC
CCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCGCACCGTACTGGGGGTCTTCTTTG
GCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAA
ACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAG
ACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGGGG
GGACAGCTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCCGACCCCCGCCACCGGGGCAGAACGGGCCCCGGGGGC
CCTTCGGG
GCAGAGAGAGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTG
TTCCAGCACGCACGCAAGAGGGTCATCGCCACCTACCAG
AACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACTCCCGGAGTATACAGGATACCGTCCTTTCCTAGAC
CCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTACATGAG
AAATGCCAGCTGTCATTTCCGGAAGGTCCTGAACAAGGGTTTT
CAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
AATCCCAATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAGCTG
GAGGACAACATCGTGGTTGAAGATCTGAGGG
ATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGGGGCTGCCCAGCTAT
AGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAG
CTAGAGCTGCTCCTTGGGGGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAGAAC
ACCAGGAATGG
GCTGTTCTCCAAGAAGGAGATTGAAGACATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTTGTC
TGGCATAAAG
GTGCACCCTGCCCTCAACCTAAGCAGCTCACAACTGACGGCCTGCCCCAGTGCGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATCACC
ATCATTGCTCTCTGCTGCCTTCCCTTAG
TGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACGCAAGAAGCTACAAAAGAAAGTCAAAGAGAGCGTGAAGAAGGAA
GCAGCCAAAGATGGAGTGCCAG
CGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCTGCAGGTCCTGAACAGGCGTCTCACTGTG
CTCCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAAGGAGTACGACCTG
GTGCTGCTGTTTAGT
TCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGCTACGGGACTTCTGCATGCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGTGAGAAGGAGCTATTTAGGAAGGCTGTGACAAAG
CAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCTGATGCAGGGACCCTGCCCCTGGACTCCTGCCAGAAGGTGCGGGAGGCC
CTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCTGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTC
CGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTCTTC
ACCATGATGAG
ATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACATGG
GAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGAAGTG
GTATTAGAGATATCTTTAAACAAAACATCAGC
TGTCGAGTCTCGTTCATCACTCGGACACCTGGGGAGCG
CTCCCACCCCCAGGGACTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGGCTGAAGAAGAGGTTTGGCAAA
AA
GGCAGCAGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAGAACTACCGGAGGCAC
ATCGTGTGTGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
ACTATGGCTTTGCCTCGCCACCCTCGGACATTGCACAGACCACCCTCGTGGGCATCATC
CTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTATATCTTGCTCACCATGTGCCGCAACCTCATAACCTTCCTGCGAGAGACTTTCCTCAACCGCTATGTGCCTTTTGAT
GCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCCA
TTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCACTCAGCCTGCTGGCC
TGTGTATTCCCCAACGTCTTTGTGAATGATGG
GTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTCCAGACCGTCCCAGGTATGACAGGTGTGCTTCTGCTCCTGGTCCTGGCCATC
ATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
CTCATCATCCACGGCAGCTATGCTCTGATC
CAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCGGAGCTGCTGCCC
TCAG
GAGTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGATCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTGACC
TCCGCGCCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCATCCCCAAAAGGCAATGCCTGTGCTGGATACCCAAAG
CTG
TACCTTGATGGACCGTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCATTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTC
AAGTCATCCTTGGGCAGCCAAATGCTGTGTAAGAAG
ATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGTTGGCTGACATCATCCGAGAGGTGGAGGAGAACGACCAC
CAGGACCTGGTGTCTGTGCACATTTATGTCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
TACATCTGCGAGCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTGTTC
ACGGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTGCGCAAGATCGGGGTGTTCAGCTGCGGCCCTCCAGGA
ATGACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAACTTCTGA

cDNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TTGGAAGTCG CGCGGGACCC CTTTTATAGC AGCGTGGGCG ACGTGCCACA CGGGTGTCCC AGCCCAGGGG CTGGTCTGAG CTGGAAGAGG TTATGCAAAT AAGGGCCCCA CCTCCACAGC  AGGAGGGTGA GCCCTAGGTC CAGATGCTCA CACTGGCGCA GGTCTGTCCT GAGCCGACAC CTGCACAGTG GCGAGACCAA GGACCCAGAG AGAAAGGTGA GAGTGCAGCC GGGGAGGCTA  AGGATCGGCG GAGCTGGAAG AGTGAGGGTG AAGGCAAGAA GTAGAGCACA GAAGCAAAGA TTTTAAGAGG AAAGAAGACA TCTGAACCCA ACACCACCCT AAACTACAGG CTGCAGGGTT  GGCATGCTCC GTGCAAGACC AGAGGCACTG ATGCTCCTGG GAGCTCTTCT GACTGGATCC CTGGATCCAT CGGGCAATCA GGACGCACTC TCACTGCCCT GGGAAGTGCA GCGCTATGAC  GGCTGGTTTA ACAACCTGAG GCACCACGAG CGTGGTGCTG TTGGCTGCCG GTTGCAGCGC CGCGTACCAG CCAATTACGC CGACGGTGTG TATCAGGCTC TGGAGGAGCC GCAGCTGCCC  AACCCGCGCC GGCTCAGCAA CGCAGCCACG CGGGGCATAG CCGGCCTGCC GTCGCTCCAC AACCGCACCG TACTGGGGGT CTTCTTTGGC TACCATGTTC TTTCCGACGT GGTGAGCGTG  GAAACGCCCG GCTGCCCCGC CGAGTTCCTC AACATCCGCA TCCCACCTGG AGACCCCGTG TTCGACCCCG ACCAGCGCGG GGACGTGGTG CTGCCCTTCC AGAGGAGCCG CTGGGACCCC  GAGACCGGAC GGAGTCCCAG CAACCCCCGG GACCTGGCCA ACCAGGTGAC GGGCTGGCTG GACGGCAGCG CCATCTATGG CTCCTCGCAC TCCTGGAGCG ACGCGCTGCG GAGCTTCTCG  GGGGGACAGC TGGCGTCGGG GCCCGACCCC GCTTTCCCCC GAGACTCGCA GAACCCCCTG CTCATGTGGG CGGCGCCCCG ACCCCCGCCA CCGGGGCAGA ACGGGCCCCG GGGGCCCTTC  GGGGCAGAGA GAGGGAACCG GGAACCCTTC CTGCAGGCGC TGGGCCTGCT CTGGTTCCGC TACCACAACC TGTGGGCGCA GAGGCTGGCC CGCCAGCACC CAGACTGGGA GGACGAGGAG  CTGTTCCAGC ACGCACGCAA GAGGGTCATC GCCACCTACC AGAACATCGC TGTGTATGAG TGGCTGCCCA GCTTCCTGCA GAAAACACTC CCGGAGTATA CAGGATACCG TCCTTTCCTA  GACCCCAGCA TCTCCCCGGA ATTTGTGGTG GCCTCTGAGC AGTTCTTCTC TACCATGGTG CCCCCTGGTG TCTACATGAG AAATGCCAGC TGTCATTTCC GGAAGGTCCT GAACAAGGGT  TTTCAAAGCT CCCAAGCTCT CAGGGTCTGC AACAACTACT GGATTCGGGA GAATCCCAAT CTGAACAGTA CCCAGGAGGT GAATGAGCTG CTGCTGGGAA TGGCCTCCCA GATTTCGGAG  CTGGAGGACA ACATCGTGGT TGAAGATCTG AGGGATTACT GGCCTGGCCC TGGCAAATTC TCCCGTACAG ACTATGTGGC CAGCAGCATC CAACGTGGCC GAGATATGGG GCTGCCCAGC  TATAGCCAGG CCCTGCTGGC CTTTGGGCTG GACATCCCAA GGAACTGGAG TGATCTCAAC CCTAATGTGG ACCCCCAGGT GCTGGAGGCC ACAGCTGCCC TGTACAACCA GGACCTATCC  
CAGCTAGAGC TGCTCCTTGG GGGGCTCCTG GAGAGCCATG GGGACCCTGG ACCCCTGTTC AGTGCCATTG TCCTCGACCA GTTTGTACGG CTGCGGGATG GTGACCGCTA CTGGTTTGAG  AACACCAGGA ATGGGCTGTT CTCCAAGAAG GAGATTGAAG ACATCCGAAA TACCACCCTG CGGGACGTGC TGGTCGCTGT TATCAACATT GACCCCAGTG CCCTGCAGCC CAATGTCTTT  GTCTGGCATA AAGGTGCACC CTGCCCTCAA CCTAAGCAGC TCACAACTGA CGGCCTGCCC CAGTGCGCAC CCCTGACTGT GCTTGACTTC TTTGAAGGCA GCAGCCCTGG TTTTGCCATC  ACCATCATTG CTCTCTGCTG CCTTCCCTTA GTGAGTCTGC TTCTCTCTGG AGTGGTGGCC TATTTCCGGG GCCGAGAACG CAAGAAGCTA CAAAAGAAAG TCAAAGAGAG CGTGAAGAAG  GAAGCAGCCA AAGATGGAGT GCCAGCGATG GAGTGGCCAG GCCCCAAGGA GAGGAGCAGT CCCATCATCA TCCAGCTGCT GTCAGACAGG TGTCTGCAGG TCCTGAACAG GCGTCTCACT  GTGCTCCGTG TGGTCCAGCT GCAGCCTCTG CAGCAGGTCA ACCTCATCCT GTCCAACAAC CGAGGATGCC GCACCCTGCT GCTCAAGATC CCTAAGGAGT ACGACCTGGT GCTGCTGTTT  AGTTCTGAAG AGGAACGGGG CGCCTTTGTG CAGCAGCTAC GGGACTTCTG CATGCGCTGG GCTCTGGGCC TCCATGTGGC TGAGATGAGT GAGAAGGAGC TATTTAGGAA GGCTGTGACA  AAGCAGCAGC GGGAACGCAT CCTGGAGATC TTCTTCAGAC ACCTTTTTGC TCAGGTGCTG GACATCAACC AGGCTGATGC AGGGACCCTG CCCCTGGACT CCTGCCAGAA GGTGCGGGAG  GCCCTGACCT GCGAGCTGAG CAGGGCCGAG TTTGCTGAGT CCCTGGGCCT CAAGCCCCAG GACATGTTTG TGGAGTCCAT GTTCTCTCTG GCTGACAAGG ATGGCAATGG CTACCTGTCC  TTCCGAGAGT TCCTGGACAT CCTGGTGGTC TTCATGAAAG GCTCCCCAGA GGATAAGTCC CGTCTAATGT TTACCATGTA TGACCTGGAT GAGAATGGCT TCCTCTCCAA GGACGAATTC  TTCACCATGA TGAGATCCTT CATCGAGATC TCCAACAACT GCCTGTCCAA GGCCCAGCTG GCCGAGGTGG TGGAGTCCAT GTTCCGGGAG TCGGGATTCC AGGACAAGGA GGAGCTGACA  TGGGAGGATT TTCACTTCAT GCTGCGGGAC CATGACAGCG AGCTCCGCTT CACGCAGCTC TGTGTCAAAG GTGGAGGTGG AGGTGGAAGT GGTATTAGAG ATATCTTTAA ACAAAACATC  AGCTGTCGAG TCTCGTTCAT CACTCGGACA CCTGGGGAGC GCTCCCACCC CCAGGGACTG GGGCCCCCTG CCCCAGAAGC CCCAGAGCTG GGAGGCCCTG GGCTGAAGAA GAGGTTTGGC  AAAAAGGCAG CAGTGCCCAC TCCCCGGCTG TACACAGAGG CGCTGCAAGA GAAGATGCAG CGAGGCTTCC TAGCCCAAAA GCTGCAGCAG TACAAGCGCT TCGTGGAGAA CTACCGGAGG  CACATCGTGT GTGTGGCAAT CTTCTCGGCC ATCTGTGTTG GCGTGTTTGC AGATCGTGCT TACTACTATG GCTTTGCCTC GCCACCCTCG GACATTGCAC AGACCACCCT CGTGGGCATC  
ATCCTGTCAC GAGGCACGGC GGCCAGCGTC TCCTTCATGT TCTCTTATAT CTTGCTCACC ATGTGCCGCA ACCTCATAAC CTTCCTGCGA GAGACTTTCC TCAACCGCTA TGTGCCTTTT  GATGCCGCAG TGGACTTCCA CCGCTGGATC GCCATGGCTG CTGTTGTCCT GGCCATTTTG CACAGTGCTG GCCACGCAGT CAATGTCTAC ATCTTCTCAG TCAGCCCACT CAGCCTGCTG  GCCTGTGTAT TCCCCAACGT CTTTGTGAAT GATGGGTCCA AGCTTCCCCA GAAGTTCTAT TGGTGGTTCT TCCAGACCGT CCCAGGTATG ACAGGTGTGC TTCTGCTCCT GGTCCTGGCC  ATCATGTATG TCTTCGCCTC CCACCACTTC CGCCGCCGCA GCTTCCGGGG CTTCTGGCTG ACCCACCACC TCTACATCCT GCTCTATGCC CTGCTCATCA TCCACGGCAG CTATGCTCTG  ATCCAGCTGC CCACTTTCCA CATCTACTTC CTGGTCCCGG CAATCATCTA TGGAGGTGAC AAGCTGGTGA GCCTGAGCCG GAAGAAGGTG GAGATCAGCG TGGTGAAGGC GGAGCTGCTG  CCCTCAGGAG TGACCTACCT GCAATTCCAG AGGCCCCAAG GCTTTGAGTA CAAGTCAGGA CAGTGGGTGC GGATCGCCTG CCTGGCTCTG GGGACCACCG AGTACCACCC CTTCACACTG  ACCTCCGCGC CCCATGAGGA CACACTCAGC CTGCACATCC GGGCAGTGGG GCCCTGGACC ACTCGCCTCA GGGAGATCTA CTCATCCCCA AAAGGCAATG CCTGTGCTGG ATACCCAAAG  CTGTACCTTG ATGGACCGTT TGGAGAGGGC CATCAGGAGT GGCATAAATT TGAGGTGTCA GTGTTGGTGG GAGGGGGCAT TGGGGTCACC CCCTTTGCCT CCATCCTCAA AGACCTGGTC  TTCAAGTCAT CCTTGGGCAG CCAAATGCTG TGTAAGAAGA TCTACTTCAT CTGGGTGACA CGGACCCAGC GTCAGTTTGA GTGGTTGGCT GACATCATCC GAGAGGTGGA GGAGAACGAC  CACCAGGACC TGGTGTCTGT GCACATTTAT GTCACCCAGC TGGCTGAGAA GTTCGACCTC AGGACCACCA TGCTATACAT CTGCGAGCGG CACTTCCAGA AAGTGCTGAA CCGGAGTCTG  TTCACGGGCC TGCGCTCCAT CACCCACTTT GGCCGTCCCC CCTTCGAGCC CTTCTTCAAC TCCCTGCAGG AGGTCCACCC ACAGGTGCGC AAGATCGGGG TGTTCAGCTG CGGCCCTCCA  GGAATGACCA AGAATGTAGA GAAGGCCTGT CAGCTCGTCA ACAGGCAGGA CCGAGCCCAC TTCATGCACC ACTATGAGAA CTTCTGAGCC TGTCCTCCCT GGCTGCTGCT TCCAGTATCC  TGCCTTCCCT TCTGTGCACC TAAGTTGCCC AGCCCTGCTG GCAATCTCTC CATCAGAATC CACGTTGGGC CTCAGCTGGA GGGCTGCAGA GCCCCTCCCA ATATTGGGAG AATATTGACC  CAGACAATTA TACAAATGAG AAAAGGCAGG AGACTATGTT CTACAATTGC AGTGCATGAT GATTATAAGT CCACCTATTT ATCAACAGCA CCGTTCCTGC AGCCCTCCAG CCTTCCTGCC  CTTAGCAAGT GCACAACCAG TCAGGATCTC CCAAAGAAGA TAAAGACCAC TCCTCACCCC AGCTCAAGCC ATGGCAGGCG TGGCAAGCAA AGTGGGGAGG AGACAGTCCC TGCTTGTGAC  
AAGTGTGGAG GTGAAAAGGT ACAATAGTGC TTGTCTCCAA TAGCTCCCCA CATCTCTAAT TGACTTCCAC AAAATCGATG CATTGCTTTG GTATTTGCTT GGCCTGACAT TTGAGGGAGG  AGGAGGCCGG GATCCTCTGG CTGAGAATCT CCTCAGAGCC CAGTGCAGAA GCTGTGATGC TTAGAACCTG GACAGCCCGA CTGCCTCAAC TCTGTCTCCA GGTCTATTCC CTCTGGCTCC  AAAAGGAGCA GCCCTACTTC CACCCCTTCC CGTCCCCAAA GTGTCAGCAA CTTTGAGGAG GGCACCAGGA AACAAAGATG CCTCCCCAAT CCTGATATTC TTGATGTCAC CAGTGATACC  CACTGCCCTG ACCCCTGGGC AGGCCCCTCT CTGCATCTAC TGGAGTGGTC CCTGGGCTCT GGGGCTGAAG GATTCCAGCC TCTCTGCCAG ATATTCAGAA CTCGCTCTCA ATTCACCTCT  TCCACAAGAG TTGGGTGACC AGCTGTCCTA GTTTGCCCAG GACTCTCCCT GTTTTAGCAC TGAAAGTCTC TTGCCCCAGG AAACCCCATC AGTCCCAGGC AGATTGGGAC AGCTGGTCAC  CTTATGCAAG AGCCAGCCTG AAACATCCCC TCCATACTCA GCTCTTTAAC TTTTCCTTTT TCATTGGGCT CTTTCCTAAA AAGCTAAGCT GTAAAATATT TTACATCGAG GTATAATAAA  TAATCATGTA CATGTTTTAC CACCACCCAG GTCAAGACAT AGAACATTTC AACATTTCCA TCACCCCAGA AACTCCCCTT GTACCCCCTT CCACTTCGTC TCCCCTAGCT CCTAGAAGCA  ACCACTGATG TGATTTCTAC CAAATCCAGT TTTGGTCCTA CTAAATGTAC TCTTTTGAGA CTGGCCTCTT TCACTCGCCA TAATGCCTTT GTAATTCATC CATGCTGTTG TGTGTATCAG  CAGTTTGTTC CTTTTCATTG CTGAGTAGTA TTCCATTGTA GAGATGTACC ACAGTTTGTT TATTCTTCTG TTGATGGACA TTTGGGTTGT TTCTAATTTT GAATGATTAT AAATAAAAAT 
TCTGTGAGTG TTCTTGTACC TA 
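
The listing above is laid out in 10-base blocks under a column ruler, so it has to be flattened before most tools will accept it. The following is a minimal sketch of one way to do that in Python; the function name `blocks_to_fasta`, the 60-column wrap width, and the use of the entry name as the FASTA header are illustrative assumptions and not part of this record.

```python
import re


def blocks_to_fasta(raw: str, header: str, width: int = 60) -> str:
    """Flatten a block-formatted nucleotide listing into a FASTA record.

    Hypothetical helper: `raw` is the pasted listing, `header` is whatever
    identifier the caller wants after '>' (for example the entry name).
    """
    kept = []
    for line in raw.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines made only of dots and digits.
        if not stripped or re.fullmatch(r"[.\d]+", stripped):
            continue
        kept.append(stripped)

    # Remove the spaces between blocks and normalise case.
    seq = re.sub(r"\s+", "", "".join(kept)).upper()

    # Refuse anything that is not a plain nucleotide letter, so stray page
    # text pasted along with the sequence is caught instead of silently kept.
    unexpected = set(seq) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters in sequence: {sorted(unexpected)}")

    body = "\n".join(seq[i:i + width] for i in range(0, len(seq), width))
    return f">{header}\n{body}\n"


if __name__ == "__main__":
    sample = (
        ".........1.........2\n"
        "TTGGAAGTCG CGCGGGACCC  AGGAGGGTGA\n"
        "TCTGTGAGTG TA \n"
    )
    print(blocks_to_fasta(sample, "PtroDuOx02", width=20), end="")
```

The validation step is deliberately strict: only A, C, G, T and N are accepted, so any non-sequence text copied from the page raises an error and must be removed before conversion.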
