Entry information : HsPGHS01 (PGHS-1 / COX-1)
Entry ID 3320
Creation 2006-07-19 (Christophe Dunand)
Last sequence changes 2010-10-18 (Myriam Duval (Scipio))
Sequence status complete
Reviewer Filippo Passardi
Last annotation changes 2010-12-03 (Myriam Duval (Scipio))
Peroxidase information: HsPGHS01 (PGHS-1 / COX-1)
Name (synonym) HsPGHS01 (PGHS-1 / COX-1)
Class H synthase    [Orthogroup: PGHS001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Homo
Organism Homo sapiens (human)    [TaxId: 9606 ]
Cellular localisation Apoplastic
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value HsPGHS01
start..stop
S start..stop
PtroPGHS01 1180 0 29..599 4..574
EcabPGHS01 1137 0 24..599 24..599
OarPGHS01 1129 0 21..599 22..600
BtPGHS01 1124 0 24..599 25..600
Gene structure Fichier Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 125133364..125133370 7 N° 2 125133465..125133551 87 N° 3 125140178..125140294 117 N° 4 125140712..125140852 141
N° 5 125141054..125141197 144 N° 6 125143650..125143831 182 N° 7 125143943..125144026 84 N° 8 125145788..125146034 247
N° 9 125148725..125149011 287 N° 10 125152477..125152624 148 N° 11 125154468..125154823 356  
join(125133364..125133370,125133465..125133551,125140178..125140294,125140712..1 25140852,125141054..125141197,125143650..125143831,125143943..125144026,12514578 8..125146034,125148725..125149011,125152477..125152624,125154468..125154823)


exon

Literature and cross-references HsPGHS01 (PGHS-1 / COX-1)
Literature Funk C.D., Funk L.B., Kennedy M.E., Pong A.S., Fitzgerald G.A. Human platelet/erythroleukemia cell prostaglandin G/H synthase: cDNA cloning, expression, and gene chromosomal assignment. FASEB J. 5:2304-2312(1991).
Protein ref. UniProtKB:   P23219
DNA ref. GenBank:   NC_000009.1 (125133364..125154823)
mRNA ref. GenBank:   NM_000962.2
Cluster/Prediction ref. UniGene:   Hs.201978
Protein sequence: HsPGHS01 (PGHS-1 / COX-1)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   599 (575)
PWM (Da):   %s   68528.62 (65768.9) Transmb domain:   %s   i5-24o
PI (pH):   %s   7.39 (7.25) Peptide Signal:   %s   cut: 25 range:25-599
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MSRSLLLWFLLFLLLLPPLPVLLADPGAPTPVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFVNATFIREMLMRLVLTVRSNLIPSPPTYNSAHDYISWESFSNVSYYTRILPSVPKDCPTPMGTKGKKQLPDAQLLARRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATLWLREHNRVCDLLKAEHPTWGDEQLFQTTRLILGETIKIVIEEYVQQLSGYFLQLKFDPELLFGVQFQYRNRIAMEFNHLYHWHPLMPDSFKVGSQEYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGIGGGRNMDHHILHVAVDVIRESREMRLQPFNEYRKRFGMKPYTSFQELVGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEIGAPFSLKGLLGNPICSPEYWKPSTFGGEVGFNIVKTATLKKLVCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 9, 10 introns) and 220 ESTs. Various splicing variants, need to be enter in the base
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGCCGTGAGTGCGACCCCGGTGCCCGGTGGGGAATTTTCTTGGCCTCCTGGTGGAGCCTTGAATGCCAGGCTCAGCCCCTCATCTCTCTCCTCTGCAGGGAGTCTCTTGCTCTGGTT
CTTGCTGTTCCTGCTCCTGCTCCCGCCGCTCCCCGTCCTGCTCGCGGACCCAGGGGCGCCCACGCCAG
GTAGGCGGCCCCATCCCTCCCCAAGGGAATCCCCGGTCTTGCGCCCCTGGCC
TGGTTTCAACCCCCTCCTTTCCCCTCCAGCGGGCCCAGCTTCCCCTTTCTGCTCGCGGTGCTGAGAAAGACTGAGGCTGAGTCTTTTGGTGGGATGGGGGCTCCCTGAAGCCCCCCCGGC
GGTGTGGCCTTGGCTAATGGGCTATCTAGTTCTTTCAGGGGAAACAGCAGACTGGGATCTGGTGCCAACTTGGGGAGAAGGGACAGTCCCTATCCATCCCCCTCACCTGTTCTGGGCCCC
AGATGTCTAAGCAGCCTCTGCACCCAACAACCCCGCTGTTTCCTATAGGGGCCTCTTTGGGAGGAAGCCGCAGGCACCAAGGGAAATGAGTTCCCTTTCTCCAGCCTCTAACCGTCTGGG
AACCCATCCTGATTCCCATTGCCAGTGGAGAAGGTCTCCCCTGGTGAAGACTTCGGGAGAACATGGGAGATGGAAATACATTTAGGAGCCGGGATGCTTCATCTGGGGTTTAAGAGATCC
CCATTGAGCAAATGAGGAAACCGAGGCTCAGTAGGTGCCATGATTCCCCAAGCTCACAAAATACATGGTGGGCCCAGGATCTGAACTCAGGTCTGTCTGCGTCCACACACTTCGCAAGGT
CTTTCCACAGAAAGCTGCTTCTACCCACAATGGACCCGGAGCCACTCTGGGTTTCATGGGGGAGGTTGATGGGGAAATTCCGCCCTTCCCTCCCCCATGCATGTGCCAGGTTCCAGGTGA
AGGAATTCAGCTCCTCGCTCAGCCAACCTTCCTCCCCCAGCTCGCTGCTTGTGTGTGTGCGTGTGTGTGAGCATGTGTCCCAGTTGGATATAGTTTCCTCCACCACCTAGCTGTGTAAAA
TCTCTAAAATGAGGCCCAGAGAGGGCAGGAGGGAGCAGAGGGAGTCCGCACTTTCTGCTGTGATGCTCTAAGGATCTGGTGCGTGGGGGAAAGAGGAAAGATATGTACATGGATTTTGGA
TCAGACCAAGTTTCCTGAAACGGGCTTTGGGTGGGTGGCATTTAGGGGACCCAAAGAGAGGACGGAGGGTGCTCTAGGAAAGGGATTGCGTTGAATAAAAGTCTGGTTTATGGGAAGAGG
AGGAGGAAGACTCATGTGGCCAGCACTTTAAAGTTTACAAAGGTTGGTTGTGGACATCCACCTCTTTGGTGCTCTCAACAGCCCTGCCAGAGAGATGCGTTAGACCCAGTCTACAGAAGG
GGACTCTGAGGCCCAGAGGGAGTGAAGCCACTTGTCTGAATCCTCACAGTGAGCTGGTGGTGGGGAGTAGAAGGGGCTCTTCAGGTTGCCTCGCTGACCCATCTTGTCAGTCCCTGGATC
AGCCCAGCACTTCCTCTGAGGCCCCATCTGGCCTCATCCAGGCACTCTTGGTATACAGGTCCCACCCGTCTGTATTCTTTTTTAGTCCATTCACTAATGCACAATGAGCCATGGTCTGTG
CTATTTTGTTTTTTGCCTTTTCTGTTTCCTTTTGTATAATAAATGTTTACACGGAATTGTTTTTTTTTATCCCCAAGGGTTATTTATGCCAGTCTTTGGCTGGGGCCTGGAGACACGGAG
CTGGGAGGAGCTGACAGGCTGATGGTGGAGCCAGAAATGCCAACAGACTGTCATCCAAGGGGACCAGGGCTGTGCGGAGCTCATGGGAGGAAGCAGGGGTATAGGGAGCCCAGGGAAGGG
AGCAGGGGTGTGCGGAGCTCAGGGGAGGGAGCAGCTGCCTCTCAGTGCGAAGAATTGAGGGAGGCTTCTTGGATCTGATGACTTTTGGACTGGACCTCGAAGGAGGTGGCGGGTGGGGGA
AACACTAAGAGACTCGACCAGGAAACACCGTAAGGGTAATGTGGAGCAGGAGGCTGAGGTTGGATGCGTAATGTGGGGATGAGACTGGAGGCCAGAAAACTGGTACCGAGGCTCCTGCAA
GGATCTAGGAAAGAACGAATGACTGGTGGAGTGCAGGGGCAGAAGGTCTGGGTTGCCCTCATATTTCCAAGTCTGCGAGTAAACTGAGGGCCAAGCTACTTGGGCATCTGCCCCAGGAAG
CCAAAGCCTGTCTTTTACCTCTAGAGGCAAAGGAGATGAAGAGATGAGGGCTGCACTCACCTAAGAGGGCAGGCGGAGGGAGAGAATGCTGCTTTTGCCTCTCTGGTGGGTGAGGAGATG
GAGGCCAGCCTGGAGGCAAGGCTGTGCCGGGGGCGCCCTGAATGCTCAAGTACTTCCCCTCTAGTCTCTGGACTTGACTCAGTGCCCATCTTCTAGATGGGGATAGGGCATGACCTGGCA
ACCAGAGCTGCATGCCAGGCAGAGGAGCCCCAGGCCAGCTGACCCGGGCACATGGGGCAAGATGCCAGCTGTGCCAAGGCTGCCCGGGTGCCTGCACCTGCTCTCGCCAGCACCAGTTGG
GTCTGTGGGCCGGGAGAGGGCTTCCCCACCAGCAGGCTCTTCCAGCTGCTGGGCCCAGTTGTCTGGATCTGGTGTGGTGGTGCTTAGGTGTGGGGACTGGGGCCCAGGCCAAACCAGTCA
GGGTTCCCCACCTTTTTTTTTTTTCTCTCTGGCTGCCAGGTCCCCAGAGACTCTCTAAATATTACTCTTTAGGGGATTTGGGTCCTGCCTCTTCACTGGGCAATCTTGCAGCCTGACACC
TCTGCATCCTGGAGGACTTGGTGTTCCCTGGCTGTGCCTGTCCCCTAGTCACGCTCAGTCCTGGACCTGGAGCCCTGACTTTTGGGCTAGGGGGTGTTCCTTGCCCAAACCCTTGGAAAG
GTCCTGGAACCAGGCAGTTCTGGACAGTCCTCATCTGTGGGGATGGGGACTAGCAGCTTGGCCTTTGGGCTGCGTGTGTCTGGCATGCAGCGTGGTGTGCTGATCATCTCTGCCTCCGCG
TGTCTTGGGGACTCAGTGAGAGCAAATGGGAAGGTTCTCTGCTGATGGGGCAGTTGTGGGAGGGTTGTATTGATACAAATAATGGAGAGGGACAGATTGGCTCTACCCAGAGTAGAGGGC
CTGGACGTCGAAAGGAAGGCCTCCTTTAGCCCGCCTGCTAAACCCCCTTCATGGGATATACAGAAAAATGAGGCTCCAGAGGTGGGAGGGACTCACCCAAGAATGCACAGCCAGTAAGAG
CAGCCACTCTCTAATACAGCTCTCCCAAGCAACTACTTGGTGCCTGGCCCTGTGCTGAGTATGGGGGACTCAAAGTGAATTAGACTTGGTACCTGCATCTCCTTGGTTTTCTTTCTTGAT
GAACAGGGGTGTCCTCCACTGAGCATAGAACTGAGTTGGCATGAAAACATGCAGGATTGAAGTTAGACCGAAGGAAGGCCTTCCTGTACGAATACTTCCTAAAGATCTGGTTGTGGCAGG
AGATGGTGATCCCCCTGGGGTCCTGGTTTCATTCGTTCTGGATTTTCAAGGTTCACTTTGAAGGAATGTGACAAATCCAACTCAACTGGGTCATCCTCAGTCTGCATCAAAGCGACTCTC
CCTTTCCATCCCTTCCCCAAATCCAGTCCTGGGTACCTCCAGGAGTAGGAGGAGGAGGGAGATTAAGTCTTGGCTCCGCAACTTCAAAGCAGTGGGGTCTGGGCAAGCCTCCTGCTTTCT
CTGGGCCCTACATTCCTCATTATGCAATGGGGAGTCTCACCTACCTCTCTTAGGGAGTGCTGGGGAGATAAATGAGCGTGGGTGTGGAAGGTTGTGGACTTGGTCGGCTGGAGCCCTGGC
TTTCTGGCTGGGCTGTGATGGTCAGGCTGTAGTCTAGTGGCACCCAGTCTCCTCTCCGCCCACCCTGACACCTTGGGGCACCAGCGGCAGGCCCACTGTTGTCCTTCCTGGGAGAACTTT
TCCTGCCCCAGAGCAGTGGTGTTTATGGAAAGAACGTGGCGGGGGAGGCGGGAGGGGGGAAGCTGGATGGGCCGGACTGGGAGGGAGGAGCCTCAGCTCCCGCACAGCCTCTCTTGGCAG
GGAGGGTGTGGGCAGCCGTTCCAGCTTCAGCGTCTGGAGCTTGTGGCTCTTCTGCCTGCCGAGGCAGAGCTCTAGGGCCATGGCGGGAGGCCTTCCGGCTCCACCTCAGACAGCTGTTGA
GGGCCTGGAAGATGAGGGACTCCTTTTGGTCAGGCTGGAGGTGACCAATCTCCTCACTTCATTTTATTTATTCCACAAACATTCGTTAAGCTCTGGCTACCACATCTGGATCCCAGACCT
GGTTAAGCCCCAACTGTGTGCATAGCCTGTGCTGGGGGAAGAGAAAGCCTTAAGGGAAGCGGGGGTCCCTTCTCCTGGAAGCTTATGGTGGAGAAGAGCAGGTTGAATTGGAACCCGGAG
CCAGGGCAAGGGAGAGTGGGAGGAGGGAGTGTGGACATTTGGGGAAGACTTCTTGGGGAGGCAAGGCGTGGGCTCAGCTGATTTGGAGGAAATGTATTAAGGAGGCTAACAGCCTGGATG
AAGGATGGAGATGCTGAGGGTTTGGGGCTAAGGGAGGGGCTGGCGCAGAATGTTGGAGCTGCACGGACCTCTGGAGTTCATTTTGAGATCCCTGCTGTGGCAGGGATGGGGAAATTGAGC
CTAGGGGAGGCCTAACCCCTGTGCCCTTAGAGTGGTAGAACTAGACCGTGAGTCCTAGCGGACCTTGTGTGCCAGGGTTGGGAGCTGGGCAGTGGGTGCTACAGGGGCTCCTGGGCAGGT
GAGAGAGCCAGGCCAGCAGGAGGGGCAAGAAGAAGAGTTCTAGCAGGGTGACAAGTCCTTTCTTGGGGGAACCAGACCTCTGAGCCAGCACGGGGGCTGCTCCAGAGGCTTCTTGGCCTT
TCTGGGCCTGTGGCTGGGGGAGGCGGGGGCTGGGGTACTGCCAGGCCCTTTCCCTGGATCCTGCCTGCCAGGTTTGCCTGTGGCTGCCTGATAAAGCCTTTTTGTCCAGCCTTCTTCCCC
TGACCCCTCCCTCCTACACCCCTGCTGGGGCCATTGTGGGGCTGTGGGGAGTTGCCAGGGCTTAAGCAAAACTATTTGCATGATTCTTGTGGTCCTTGCTTGTGTGTGTGTGTGTGTGTG
TGTGTGTGTGTGTGTGTGTGTGTGTATTTGAGGTGGTGTTGCCGTCTTCGAAGTGGGGACCAAGAGGACTCTGCTGCCCAGGTCAAGCCCTGTGCTGTGAAGTGAGGAGTGAGGAGACTG
AGACCAGGGCAGGGGATAGGGGGCAGGTCTTGCCCAGGGTCATGCTCTCTGATCCTAGCACCCTGAGATGACAGCATTTCACCCTCAGCTGCTCTGGCCTTTCACTACCCTTTTGACCTG
CAATAACCAGTTGCACAAAGAATCCATATTCCCTGTAACGTTGGCTCTGAGCCCAGATAAGGAAGTATGTTTGCAGAAAGGAGCCCGGCGTTTTCTGTGTAAGGTCAGAGATGGAGCCTC
CTGCCAGGCGCTAGGAGTTTTTGGTCTGCCCAGCTCTGTTCTGTTTCCTTGGCCAAGTTATCACCTGTCCCTGAACCTCAGTTTCCTTCACTGAACAGGGTGGTGGGGGTGATGGTGGAT
CCTACTGAACCTGAGGAGTCTGTAACTTTGCTGCTAAAGAATCACTGAGGCAGATTTAGCAAGGGGGTGGTGGTCTGAGGGATCTGGGTTAGATGCCAAGAGGGATTTCCTTATAGAGTT
TCTCCAGGAGGCAGCAGGGCGGATGCCAGAGCTCCATGCACCCTCCCAGGAGATTTTCTAGCCCTATCTCTCCTATTCCCTTGGTCTGTGACCTTCTTCTGGATCCTAGAGCCCAGACCC
CACCCTGATGTGGGCCAAGTCTGATTCCATCAGGAATGGGGGCTGAGGCAGCTGAGACCTTGGAGTCCCCCTCCTCCTGCACACCCACCCTCTGTCCCCAGGGTTCTGGAAAACTTCTTT
CCTTAGACTTGGCTGCTGAGGCTGTGGGGGTGGAGTGGAAGTGAAGGGATGGGCATTTGGTCTGGTGAGACTTGTGGGGTGATAGGTCTGGCTTTCCCTTTTTCTGGCTTCAATTTCCCC
ATCTGCAATGTGGACATTATGATACCTGCTCTTCTCTACCTTCTGCAGAAATGGAGGAATGGTGAGGGATCTCGTTTTCCTAGTGTTCATGGGGTGGTGGTCTGGGAGAAGTCCCCGTCC
TCGTCCTCGTCCCTTTTCTGACCGCCCCCCCACCCCCCGCCCCGGCCTGCTCTTGCTACTGAACTCTGACTAGGCAGGAAGTGAACGCTCTAGAAGCCCGTGGCTGAGGAGTTTTATCAG
CTCTTTCTTAACTTTCAGTGTCACAGCCGGGGAATCTTCTCCTGCCGCCGTTTGGTGGCAATAGGGAGGGAGGGGGCGCTTCCCCTGGGGGCCTGATGTGGGCTAGGCTGGAGTTCCAGA
GCAGGGCCTGGAAATGTCAGGATGGGTGGTTTATGTTCACAGAGGTGGGAACAGGGGTCCCCTTGGGGGAACCCTGAAGCCCTGGCACCCAGTGATTCAGGACGGAGCTGCGACTTAAGT
CCATGCCTCTGGCCCCTCATTCCCCCATCAGGGGCTAGATGGGGGTCAGGAGGCCACCCTAGCATGGTCTCTGACCTCCATTTCTCACCCACAGTGAATCCCTGTTGTTACTATCCATGC
CAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGCCCCAACTGCACCATCC
GTGAGCTGGGCCTTCAGCCCTCACTCCTT
CCGTCTTGAGCCCTTCTGCTCCCCGGGCCCTTTCTCCTAGACCCTAACTTCCTACCCTCCTCTCTGACCATGGCCCTGTTCTCCTTCCTTGCCTGGTTCTGCCCCTCTCCCTGACCTGGC
TTCAGCATGAGCTCTCGCTCTTGGTCCACCCTCACTCCCTGCTTCTGAGTTCCATGGTGAGTCTTCACCACATGCCCTGGCCCCTGTCCTCACGCCCGGTTCTGTCTCTGTCATGTGTCA
TTGCTCCCAAGGTTCCATCCTTACCCACTTCCCCATAGGTGCTACTCTGTTCCACCCTGGCCCTTGTCCTCAGTGCCCCATCTTCCACCCTGGCTACTTCTGGTTCTGGTAGGAGGGACC
AACTGAGTGACTGCCATTGCCCCTGCAGCTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCTCTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTT
GTCAATGCCACCTTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
GTGGGTGTGGGGCAGGGCCCCCTGACCTGGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGATTCTTGGCAT
CTGATAAGGGTAGTGGGTGGGGAGAGTCTATGATGCCTGATAAAATAAGCCCCAACCCAGGAGGAGGCAAGAACTGGGATGGAGCTGGGGGTGGAAACACCCTTGTCACCGTTATTTTTG
CTCTCTGCAGTGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTACATCAGCTGGGAGTCTTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTG
CCTAAAGATTGCCCCACACCCATGGGAACCAAAG
GTAAAATGGGGTGAGGAGCTGGGCCTGGGGATTACAGGAGGTGCTCAGTTCTTCTCTTTGGGAAAAATCAGGCGAAGAACAATTAT
TGACCCAATTCTGCAGATGGCTAGACCAAGGCAAGACATATGACATGTCCAGAGCCTGGGCTGAGAACAGGCAAGGGCAGCAGAGGGTCTTGCCTGAGGTCACCTAGAGTCAGACCAATG
TTTCCATAGTTCCAGGGTGCCTCTTTGCTTGATCCTTTTCTAATGATCAGTTGGGTCCTGCCGGGGTGGAAGTGACTTAGAAGTTGAGATGTAGGAAAGAATAGTGAGCTATTTATTGGG
TGCTGTCTCTGTGTTTGGGTCTTTACAGATGTAAATAGTTTTACATGCTTCACCAGTGTAAGGTACAAATAAGCCACATTTTTTTCTTTCGTTACTCAAGACTTCACTTAGCCACACTGG
CAGGGGTCTTCCTTGTAAGACCTTCCCATGCCACCTGTAATTATCCAAAAACCTGGGATATTATTCATTTCAGACCCATCAGTTCAGCATCAAGTACAGAGAGGACAAGAGAGGCTCATC
AGTCCACTGCTGCTATACTCCAGTTCCTGCCACATGGTGGCACTGTTGAATGCCAGTCCTGTGCAGCCTCATGGTTATGTGCTTTTTTGGGTTCAAAACCTTGCACCATGTCCCTGCTTG
GGTCTCAAGAGCACTACTGTGACGGTTTTCCATCATATGGTTAGCTGCTTTTCCCAAAGCATGACTATTCTACCAGGATAGCTGAGTCTTGCCAACTTTGCTGAAATCATGCTTGCCCAG
TGTCAGTGAGTGATGATTCCAAATTACGGTTGACAGATCACTCCCTCTAACTTCCCTTTTGGTGGATTTTCTTTAGGGGTACTTGATATTTTTTCCTGCCAGAGGAACCCAGTCAGCCAT
CTTAATCCAAGTAATTTCCATTGATCTTGAACCTTCAACATCAGGGCTCACATCTTGATGCATTGTGAAGAGATGCCTTTACCAGAACTCAAAAAATTCTATTCCTTTCTGTGAGGGCAA
TTGGGTGACAACTCATTTGACACTGACATAATTAAGGAAGGCCTCTCAATACTTATTCTAAGGATGACTTGTCTTTATGCCAGACATAGAAAGATTGGCATCATTTAAAATAGGTTGACA
CACCTATTTTAAGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGACTCTTGGCATCTGATAAGGGTAGTGGGTGGGGAGAGTCTATGATGCCTGATAGGTGGTGGTGTGGTGGCACACGTC
TGTAATCCCAGCACTTTGGGAGGTTGGGGTGGGTGGATTGCTTGAGCTCAGGAGTTCAAGTCCAACCTGGATGACATGGTGAGACACCTTGTCTACAAAAGAATACAAAAGTTAGCTGGG
TGAAGTTGTGTGTGCCTGTAGTCCCAGCTACTCAGGAGGCTAAGGTGGGTGGATCAATTGAGCCCAGGAGGTCGAGGCTGTAGTGAGCCATAATTGCGGCACTGCACTCTGGCCTGGGCA
ATAGAGTGAGACCCTGTCTCAAAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAGGTTGGCAAACTACAGCCTGCAGGCCAAATTCGGCCCACCTCTCCATTTTTGT
AAATAAAGTTTTATTGGAACACAGCCACACACATTTGTTTATGTGTCATCTGTGACTGCTTTCCCGCTACAATAGCAAAAACTGAATAGATGTGACATGTCTGTATGGTCTGCAAAGCCT
AAAATGTTTACTATCTGGCTTTTTCCAGAAAAAGCTTGCTGACCTATGATTTAAAAAATTCTACCCACTGGTTCTCTGCCATGCTTACATTATCTCTCGCAGTCCTCAGATTCACTGTAT
CAAATGGGTATCATTTGCCCCTTTTTAAAGATGGAGAAATTGACACCCAGAGAGATGAGATGATTTATCTGTATTCACACAGCTAGCAAATAGCATAGTCAGTTGCAAACACAGGCTACC
TTGACTCAGGGCAAGGGAGTTCATGTTTGTTTTTTTCTGTTTCTTTATTTCTTTTTGGTGAAATGTTTCATTATGGAAAAATTGCAAAGATACACAAAAGTTGAGAGAAAAGCAGAATGA
ACTATGTACCCATCTTTCAGTTTCAACATTTACCCACAGTTTCTTCATCTTATTTCATTTCTCCCCTCTCATATTTTTATAAAGTATTTTAAATCAAATTCTAAAAATCATGCCACTTAA
AATTCTAAAAATCATGCCACTTCACCCATAAATACTTCTAGGGTCTTTTGGTAGAGAGTGGGTTACTTGGTGGTGGTGGGGAGGTGGTCCTGAGGAGGCCACTCTGGGCTTCCTGCTTGG
GCCAGTTTGCCTGGTGAGCCCAGATGTCCCCAGGGCAGCAAGATCCAGATAGGAGAAGCTACTGCTGTTTCCTACCCCCCAACCAGGGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGC
CCGCCGCTTCCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTTGCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCC
TGGCTTCACCAAGGCCTTGGGCCATGGG
GTGAGTACCTAGGAGGGGCTCAGGACTGCTCTGGACCTAATTTGGCACGCGTATGTCATCGACAGTGGGCCGGCACCCTGGTGACCTGAGGG
AACCCCTCTCTGTCCACAGGTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAGTATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAGGTAGTGCTGGGCCAGGG
GGTAGGGCAGAGGGAGGGGTCTCCCATGGTCTTCCCTGGCAAAGACTGCTTGGGGCGGGGGTCTGGGTCATGTCCTGAGAGGGCCAACCACGGGAGTGGGAAGCTTGTGCCAGGAGCGAC
AGTATACGCTGGGAGGAGGCAGCAGGTATGAGAAGCCAGGGAGGAGCAGACGTGGCCTCCCATGTCAGCCAGGGAGGTGGATTTTGGAGCTCAACAGGAGATGAACATTTGTAGTCTGCA
TTTATTTATTTAATTATTCCACAATATTAATTGGACACGAGAATATACCTGGCACAGGTGATTGAGTAGTGACCATTATAGTTAATGGGCCCTGCCCTCAAAAAACCACAGGCAAACTCT
AGGATGTTCATTCTATGAGGGCTTTTGTTTAAATCAGAACGGTCCAACAGAAATATAATGTGAACCACATACATAGTTAAAATTTTCTAATGTCCATATTAAAAGAGGGAAAAAGAAACA
GGTGAAATGATTTTAATAATACATTTTACTTAATGCAATATGTCCAGGTAATTAGCATTTTAGCATGTAATCAATACATTATTAATAAAATATGTCACATTCTTTTTTCATGCTGAAGCT
TCAAAATCTGGTGTATATTTCACACTCACAGGGCATCTCAATTTGGATGCTCTATTTTCACTGGAGTGATCTAATCTGTATTAAGATTTCATAAAATGTACAGCTGAATAAGTAGAGTGA
CATGTCCGACTTGTTCCACGCATACTTAAAGGTTTTCCAATAGCTGAAGTATCAGTTTTAAAATTCAAATAGAAATTAAGATAAACCTAAATAAAATAAATTAAGTAACATTCAGTTCTT
CATTCACACTAGCCAAATTTCTAGTGCTCAGTAGCTACACGTGGCTAGTGGCTACCATATTGGATCGTACAAATCTTAGGCAAGCACCAAAACAAAGTTTGATGCTGAATCTTTAGGGCT
AAGAACAGTACTTGGCATGTAGTAGTCTCTTGGCATGTATTTACTGAATGAATGAAGAAGCTGCCATATAATTAGGTACACTTGTAGCTGCCACCAAGGAGAAGCTGTGAGTGCCACTAG
AGTGTTTGGATGATGGGTAAAACTTCCCTAGGAAGTTACAAATAAACCCAGAGTTGCATAAAGGATGAGGAGGAGTTAGGGATGCTAAGAATGGGAGAGGGCTTTCCAGGTAGAGGGTTT
AGCAAGTACAAAAGCTTAGAGGTGGAGAACAGCTTGGTGACTTGGAGGGAGTGTAAAAATGGGAGCGTTTGCTGAGCCTAGTGATGGAGCGTAAGAATGACTTTACAGGAAGGTGGAGAT
GTCTGTGGGGACCGTGTTAAGGAATTCTACTTTCTGCCAAGAGCAGAGAGAGTATTTGGAAAGGTTTTAAGTCAGCTCATGATGCAAGATTTGTTGTTTTTTTTTTTTTAAGTTTTTTTT
TTCTTTTTTTTTTTTTTCTTTTTTGCTTGGGTGCAGTGCAGAGACTGGCCTGTCTGGGGATAGGAGTGGAAATAGGGAGCCCACTTAGGGAAAGGAGTCAGAGTGGAAGATGAGCGGGGG
TTGCTTGAGGTAGTGGTGGTAGAGATGGAGAGCAGGGGACTGATTTTAAGAGACATTTTGGAGGTGGAATCAACAGCCTTGGCTGAGGGGAACTGGCAGCTGGAGGCAGGAGTGGGAGGG
AGTTGGTTGTGGGCAGCTGTGGGTGACCCCCAACCCCAGGTTGCCAGGTGGCCCCATCCCACAGGTGCTGGATGGAGAAATGTACCCGCCCTCGGTAGAAGAGGCGCCTGTGTTGATGCA
CTACCCCCGAGGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCCACGCTCTGGCTACGTGAGCACAACCGTGTGTGTGA
CCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
GTGAGGACTCCAGACCTGCCCTGCCCTGGAAGGTCATTCCCTCCATCCT
GAGAAGTTGGGGGCGGGGGGGTACTTAGAGGTGGAGGCTGGGATTAGAATCCTCACCCTTCTGCTTAGTGGCTAGAAGACCTTGAGCAGGCCCCTCAACCTCTGTGAGCCTCGGTCTCCA
AATCTGTACGTTGGGGTGAACGATGATTGTAAGGATTTATTGAGATCATGAATGGGAAGGCAACTAGCACATAGCAGGCCTTCAGAAAATAGATGACTGTGATGGTTGATTATTAGCTGG
GTGACTCAGCACATTTGATTAGCCACCCTGAGTCTCGGTTTCCCCTGTGTGCTGCGCCCGGATGCATACTTGTGGTCCCCAGCACGTGCAGGGTCAAACTGTAACTTGCCCGGCCTTCAG
GAATCTTCATGTTCCCTTCCCTGCATGTATCTACCTTCCTGCAGTTTGGTAGCTTCTAGGTGACTCAGGGACAGGATATTTTTGTGTTCCCTATGGGGGCGAGTCTGCAACCTAAAATGT
CAGATGGTTTCCTGCCTGGGAGCTTGGCCCCTGACATCCCTGTCCAGACCATGTTCGCTCTGAGTCACCAGTAACTCCCTTCCCCCACCTCTGGCACCACTGGGCATGGCTGGTCCCAAT
TATAGAGCCTAATTCACTGGAGCCATGAAGAGCCAGGCATTGGGAATAGGAACAGTCATTAGAGGGAAGAGGGGCTGGTTCTGAAGTTTCAGCGTTGCAAAGACCTTGACCTGAGAGAGC
TGGAGGCTGTCCAGCACACTGGCTGGAGATGAAGCTGTGACCAAAGGCAGGACCCTAGGGCACCATTTAACTCCCCTACAACCTCATGAGCCGGGTATGATTACCCTGTGAAAATCAAGA
TTCAGAGAGGTGAAGTGAACTCTCCAGGGACACTCAGCAGATGGAGACTTGAGGTCTGGTTGCCACCAAAATCTATGCTACTTCCACTCCATCACAAAGGGGGCTCTTCTTGAATGGGAA
GGGGTTGCAAACCTGAGTCTGATCTGTGACATGTGAGTATTGGAAAGGGATTCCCCCTCGCGTCTACACTTAGTATTCCTACTTTTGGCTGACATATATGGACACCCCGTCTTATGCCAG
GCACTGTGCCAGCAGTTTTGCTGTATTCATTGTCTACTTCTCTCAACAACCCTAATATAGTATTGCATTGATGTTATTATTATTATTATTTGAGATGGAGTCTTGTTCTGTTGCCCACAC
AGTAGTGCAATGGCGTGATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCGAGCAATTCTCGTGTCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCCCGCCACCATGCCTGGCTAA
TTTTTGTATTTTTAGTAGAGACAGGGTTTTGCCATGTTGGCCAGGATGATCTTGAACTGCTGACCTCAGGTGATCCACCCGCCTTGGCATCCCAAAGTTCTGGGTTTATAGGCATGAGCC
ACAGCGCCTGGCCGCACTGATATTATTATAACACACATTTTACAGTTAAGGAAAGTGAGATTCTGAGAGATTAAGTAACTCAGCCCAAGCTCAGGTGGCTGGAAATGGTAAAGCCAAGAT
TTGAATCCAGGTCTGCTGATCCCACAGTCTGTAGCTCCAGGTCGATTTCCAAAAGCCAATTTGTCTAATGGCTAATTGGCCTGCATATCAGTTTCTTTCAATATCTAGTTGCCCAGTTTT
AATTTTGTTACAGAATGTTCAATTTGCATTTTCCCTGGCTTTTTGCTTTCTAGCTGGTTATAGCTACAAATGGGGCAGAGGAAAAACTTATTTATAAGAATCCTGTTATATAAGAACATA
TAGGAAATATGTTTTTTGAAATATATTTAGGGCTAGGCTCCCCTTCTGTCCTCAGTATTCTCTTTTGCCACTGCAGCAGCTCTGAGTGGTCTCCTTGAAGTCCCCCTCTATCACCGAGGG
TGTCAGCATGATGACAGCTCTCACCAGTAAATCCTCCACTTTTCCATCTTTTGTAGTCTCTCCACCTTCTTTTAATGGGCTTCAGGAGGGAGGTCATAAGACCAATTCTTGGACACCTCC
TTTGCATGTCCTGCTTAGCTGGGCCTAGAACCTCCCTCTGAAATGTGGGAGTAGCTGGTGCTCTTGTCTTGAGACCTCAGTCAAAATAGACCAAAAGTTCTATTTTCACATCCTGTTACA
AAGAGACAAAATGGAAGAGGCCAAACAAAATTAAACATCAACAGCAAGGCCAGGTGTGGTGGCTCACACTTGCAATTCCAGGGCTTTTGGGAGGCTGAGGTGAGAGAAGTGCTTGAGACT
TGCAGTTCAAGACCAGCCTGGATAACATAGTGAGACCCCATCTCTTAAAAAAAAAAAAAAAGAAAGAAAGAAAGCTGGGTGTGGTGGTACACCTGTGGTCCCAGCTACTTGGGAGGCTGA
GGTGGGAGGATTGCTTGAGCCCGGGAAAGTTCAGGCTGAAGTGAGCTGTAATTATACCATTGCGCTCCAGCTTGGGTGACAGACCAAGACCTTGTCTGTAAAAATAAAAATAAACATCAA
CAGCAACAACAACAATAAAGAGACCAAAAGCAAGCACCTCTCATGAACTGGCCTGCTTTCCAAGCTGGGCATCTAAATCACTGTGCTTGGCTGACCCTATTTCCAATCCTGCCCTGCCCA
GGGGAGACCATCAAGATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGCTGTTCGGTGTCCAGTTCCAATACCGCAACCGCATT
GCCATGGAGTTCAACCATCTCTACCACTGGCACCCCCTCATGCCTGACTCCTTCAAGGTGGGCTCCCAGGAGTACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTAT
GGGGTTGAGGCCCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGG
GTAAGCCCCAGAGGAGTGCTGGTGAGGGCAGGTGGGCTGAGGGATCCAGCAGACCTGGGTCCAAATTCCAGG
TTCTTCTTCTGTAAAATGGGGCTGATGTCACTTCTACAGGGCAGTTGTAAGCATTCCTGTGTGAGTTCATTGGTTCATTTGTCCATTCCACAATACCAGACATTACTCCAGGTACTGGAG
ATGTAGTGGGAACAAGACTTTTGTGGTTCTTGGCTCATCTTTTAGTGCTCCCACCCTAGAAAGTGATGGCAGTCATACGACAGCTGACAGCATTAGGGCCCTTACTGCGTGTCAGGCACT
GTTCTAAGAGCTTCTCCTATGTTATCAGAATTCTAAGAGTATGTTATAAACCATAGTAGAAAAGCCCACCATGCTCTGGGAGTCAGGAAGGGACATCTGACCCAGAGTTGCGGGTTATGG
GAAAAGAAGGGATGTCCAAGCAGAAATTGGAAGGAGGGATAGAGATTTCCCAGGGTAAGAGGTGGTGTTAGTGGCAGGGGATGGCTGTGTTCCAGATAGAGAGGACGGCATGGGTGAAAG
GCATGGAAGTCAGAGGACATGGCAGTTTGAGGAACAGAAGGACATTCAGGGCATGGTAATGTATGTAACAGTGCCCGCCTATGGTGTTTATTAAATCATAAGCCTCCGCTCTGGGCTGAA
TTGTGGCTCTTTTAAAGTTCGTATGTTGAAATCCTAACCGCCAGAACCTTAGTATGTGACTGCATTTGGAGACAAGGTCTTTAAAGAGGTAATTAAGTTTAATTGAGGTCATTAGGCTGG
GCCCTAATCCAGTGTGACTTATAAGAAGAAGAGATTAGGTCACACACACAGAGGGAAGGCCACATGAAGATGTAGGGAGAAGAAAGCCATCTACAAGCCAAGGAAAGAGGCCTCAGGAGA
TACCAACCCTGCTGACACCTTGATCTTGAACCTCTGGTCTCCAGACAGAGGAAATAATTTCTATTGTTTGAGCCACTCATTCTGTGGTACATTGTCATGGCAGCCCTAGCAAACAAACAC
ATTCTCTTTCCCTGGAATTCCCAGCCAACGCCTTCCTCAATCTCCCCTTCTCCACATTCGGAAGCTCCCATCTGCTTCATCGCAGTCTCTGGCTCCCCTGTTGCCTCACAGTCCTCTGCT
TCTCTCTAATCCTTGTCCCTAAACCCTGTCATGAAGCTGTGGCACACATGGATTTCCATTTCCTTCTGGTAATTTGACTGAAATTAGCATTTGCTGCCCCGGTGGGCAGCTGCTGGCTGC
TTTATGGCCTCTTTGTCGGTTTCTTTATGGTTCTTTGTGGGGACACAAGACATGAACAGAGACAATAGCCTTTGTGTGAGGCTGGATGGTTTTCAGAACGTTTTCAAGGAATGACCATGA
TGATGTACGTGAAAAGCCCCGGCGTCGTACCTGGCACAAGGCAGAAAGGCCGCAGAGATGTATGGACTGTCAAGATTTTTTTTCTTTTTTCTTTTTTTTTAAATAGAGATGGGGTTTTGC
CATATTGCCCAGGCTGGTCTTGAACTCCTGGGCTCAAGCGATCTGCCCGCCTAGGCCTCTCAAGGTGCTGGGATTATAGGCGACTCTCAGGATATTAAGAAGAGTGAGTGATGATAAGAC
AGGGCTTCCCCTGATAACCATTGTCCATGGCTACCCTCTCAGGGGTTCCCATGTGACCAATACTGAGATAACAGCTTTGCATAGTTTATCCCATTTAAATTGACCACAGCCATATCAGGT
AGATGCTCTTCCCTTCCCCATTTTTACATATGCGGGAACTCAAACTTAGCTTGAATAGCTGCCCAAGGTCCCTATATTAGTTTCCTCAGGCTGCTGCAACAAAGTACCACAAACTGGGAA
GCTCAGAGCAACAGATGGAGGCTAGAAGTCTGAATTCAAGGTGTCAGCAGTGCCATGTTCTCTCTCAAGGCTCTATGCCTCCTTGGCATCTGGTGGTGGCCGACAGTCCCTGGCATTCCT
CAGCTTGCAGAGGCATCGCTCTAGTCTCTGCCTTTATCATCCTGTGGTGCTCTCCCTGAACTTGTCTTTCCTCATTTTATAAAGACACCAGTTATTGGATTGGAGCCCACCCTAATCCAG
TAGGATCTCATCTTAACTTGATTACTTTTGCAAAGACTTCATTTCCAAATAAGGATGCATTCACGGATGCAGAGAGTTAGGGCTTCAACACATATTTAATATTTTAGGGAACACAATTCA
ACCCTCAACAGCCCCACAGCTTGTAAAGCTGTAATTGGCACCATCCTCTGCTTGCTTTGCTCATATTATTTCATCCAACCATGCCTTTCTTTTTTGCCCAATTATAGTTGTTGATCAAAA
TGACTATCTCTTAAGCATGAACGTTATTATTTCATCCTAAGCACCAGAGCTTTCTTTTCTTTCCTTTTTTTTTTTTTTTTTTTTGAGATACAATCTTGCTCTCTTGCCTGGGCTGGAGTG
CAGTGGTGTGATTTTGGCTCACTGTAACCTCTATCTCCTGGGTTCAAGTGATTCACCTGCCTCAGTCTCCCAAATAGCTGGGATTACTGGCACCTGCCACCACGCCCAGCTAATTTTTGT
ATTTTTAGTAGAGACGAGTCTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGGCCTCAAGTGATCCACCCGCCTCGGCCTCCTAAGGTGCTGGGATTACAGGCACAAGCCACTGTGC
CTGGTCAGCACCAGAGCTTTCTTTAGACTAATGCGCCTTAACTGTTAGTAATCAGAATCTTTTTCTGCAGCTTTTTTATCTTGCCAGGTTACCTGGGCTTTGAGTTCTATTTTCCCTCCC
TTCACAGGGGGTTACACTACCTCTTGGTGTAATTCAACGGTCTGGGGAGCAGTGTGAGCCACAAAAGAGGTCCTCTTGGCCCATTCCACACTTCAAAGATGAGGAAGCTGCCGGATGTGG
TGGTTCATGCCTGTAATCCCGGCAGTTTGGGAGGCTGTGGTGGGAGGATCATTTGAGACCAGGAGTTTGAGACCAGCCTGGGCAATATAGTGAGACCTCACCTCTACAAAAATAAAATAA
AACAAAATTAGCCAGGCATGGTGGCACGTGTCTGTAGTCCTAGCTACTTGAGAAGCTGAGGTGGGAGGATTGCTTGAGCCTGAGGAAGTTGAGGCTACACTGAGTCAAGATCATGCCACT
GCACTCCAGCCAGGGCAACAGAGTGAGACAGTGTCTAAAAAAAAACCCACAAAAAAAAAATGAGGAAGCTGAGACTCAGAGGGGCTTCCTGAGAGAGCATGACAGCAGCAGGCCCGGGGC
AGGTCTGCAGCATCACAACTGCTAGGCTGCCCAACACTCTCCATCCTAGCTCAGAAGGGACTCCCACTGGAAGCTCTTGTCCCAGGAACTTACCCAGGCTCCAGGACAGCCTGGCCTGGC
TCCCAGACCACTGCTGTGCTTCTCTCTCGGCAGATCGGTGGGGGCAGGAACATGGACCACCACATCCTGCATGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCC
TTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTAG
GTGAGCAGCTGTTTCCTGGATGCAGTCCCTGCCCTTGAGGGACTGGCAGCAAAGTCAGG
GAGACATCAAGGAAATAGAACGGGACAATACATGCGGCAATGTGTAACAACCAGACTTATAATGGGCGTGGAAGTGCTGTGCCAGGGTGGTAAATAAGCCTGCTTGGGGAGAGAAGGTGA
CTTTTCAGCTGGGTTTGGAGAACAAATGGCATTTTCAGTGGGAGAAGAGAGGGAGGAGTGTTTTAGGCAGAGCAATAGAAAGTACAAAGGCTGCCGGGCAAGGTGGCTCGCGCCTGTAAT
CCTAGCACTTTGGGAGGCTCAGGCGGGTGGATCACGAGGTCAGGAAATCGAGACCATCCTGGCTAACACGATGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCATGGAGG
CAGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCTGGGAGGCAGAGCTTGCAGTGAGCTGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGAGCG
AGACACCATCTCAAAAAAAAGAAAGTACAAAGGCATAGAGGTCAGACAGCAGGTATCTAGGGAAACAGTCATAGATGTGGCTGGGGCAAGGAGCTTATGTTGTGGGAATGATTAGTGATG
GGAGGTTGGGGCCAAACCGTGTAGAATCTTGGTTTTCATACTAAATATTTTGAGTTTTATTTGGTCTAGTGCTTTCTAAGTACCCACTTACGGGCCAGCTGCATAAAAATCATTTAGAAA
GCTGATTAAGATACCAAATCTCAGATGCCTCTCCTACGGATTCTGATTCAACCAGCCTGGGTGGTGCCCAAGAATCTGCGTATTCATATGAATAAAGTATATATACATATATATGAATAT
ATATATGAATAGAGTATATATACATATATATCAATATATACATATACATATCAATACATATATACATATACACACACATATATATGTGTGTGTATATATATACTATATATATACATATAT
ATATATACATATATATATATACATATATATATACATATATATATACATATATATATATATATACATATATATATATATATATTTCCCCCAATATCCTCAGATGTTGGGCAGCCAAGTTTC
CAGCCCTTGTCACATACAATGGGGAGCCACTGAAGCAGGTGTGACATGGCCCATGAGAGTTCCCAGTGGAGGATGGATTTGAGGAACTGGGCAGGGAAGCAGGGAAGCTCCATATTTGTC
TCCCCTTGGCCACATGAACATGTGGATATTGCAGAGTGGAGCAGATTCTGTGCATGAGCTCCTTGTGATCCTGGAACAGCATCTTATTCTTTACTCTCCCATGACAAATGGTCCCCGGGG
GCAGAAGGAACACTGCCACTGAATTTCTGTGTGAGCTCGGACATGTTACATCTTTGAGCTCTAGTAGTAAAAGGGCTTGGCCCAGGCTAAGGTCTTGAAATCTGTCTGTGAAGGAAGCCA
GATGGGGAGCATTCTCCCTTCTGTGATGATCAGGTAATTAGGGCCCAGAGTTGCTCAAGGTTATAGGCTGTTTGGTGGCAGATCTAGGCCCCTGACTTTCTCTTTAGTAGCATTTTCCTT
CCCTAGACCCAGTCCCTGAGGAGGGGCAATTTGTGCTTTTCCTCTTGACCCTTTTCCCCAGTGCCAACCATGCCAAATTCTAGGGCACATGCTCAGTTGCTCAAGTTAGTCTCCTGGAGT
CCCTATTATCCCCAGAAAAAGGTGGACCTGGAAGGGTCCCGCCCCAGGTTGACCTTAATGGCATCATGGATCTGATGCTAGCATTTCCCCTTATCTCCTTGTAGGAGAGAAGGAGATGGC
AGCAGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTGGAGTTCTACCCTGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATGATAGAGATTGGGGCTCC
CTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAGCCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATTGTCAAGACGGCCACACTGAAGAAGCTGGT
CTGCCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGCCGGAGTCTCTTGCTCTGGTTCTTGCTGTTCCTGCTCCTGCTCCCGCCGCTCCCCGTCCTGCTCGCGGACCCAGGGGCGCCCACGCCAGTGAATCCCTGTTGTTACTATCCATGC
CAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGCCCCAACTGCACCATCC
CTGGCCTGTGGACCTGGCTCCGGAATTCA
CTGCGGCCCAGCCCCTCTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACCTTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
TGCGCTCC
AACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTACATCAGCTGGGAGTCTTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACA
CCCATGGGAACCAAAG
GGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTT
GCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAGGCCTTGGGCCATGGG
GTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAG
TATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAG
GTGCTGGATGGAGAAATGTACCCGCCCTCGGTAGAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCC
CAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCCACGCTCTGGCTACGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCC
ACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
GGGAGACCATCAAGATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTT
GACCCAGAGCTGCTGTTCGGTGTCCAGTTCCAATACCGCAACCGCATTGCCATGGAGTTCAACCATCTCTACCACTGGCACCCCCTCATGCCTGACTCCTTCAAGGTGGGCTCCCAGGAG
TACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTATGGGGTTGAGGCCCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGG
ATCGGTGGGGGCAGGAACATGGAC
CACCACATCCTGCATGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCCTTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTC
GTAG
GAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTGGAGTTCTACCCTGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGT
ATGATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAGCCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATTGTCAAGACG
GCCACACTGAAGAAGCTGGTCTGCCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA

Retrieve as FASTA