Entry information : HaDiOx04 (HanXRQChr14g0449991)
Entry ID 13773
Creation 2016-06-01 (Christophe Dunand)
Last sequence changes 2017-12-07 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Last annotation changes 2017-12-07 (Christophe Dunand)
Peroxidase information: HaDiOx04 (HanXRQChr14g0449991)
Name (synonym) HaDiOx04 (HanXRQChr14g0449991)
Class Alpha-dioxygenase    [Orthogroup: DiOx001]
Taxonomy Eukaryota Viridiplantae Streptophyta Asteraceae Helianthus
Organism Helianthus annuus (Sunflower)    [TaxId: 4232 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value HaDiOx04
start..stop
S start..stop
HaDiOx06 1285 0 1..643 1..643
HaDiOx05 1245 0 1..643 1..629
HaDiOx07 1130 0 1..643 1..626
NatDiOx01 1020 0 1..642 1..641
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '13773' 'join(140332353..140332472,140332552..140332818,140348036..140348185,140348295..140348428,140348587..140348800,140348893..140349038,140349124..140349351,140349467..140349669,140349756..140349999,140350097..140350322)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 140332353..140332472 118 N° 2 140332552..140332818 265 N° 3 140348036..140348185 148 N° 4 140348295..140348428 132
N° 5 140348587..140348800 212 N° 6 140348893..140349038 144 N° 7 140349124..140349351 226 N° 8 140349467..140349669 201
N° 9 140349756..140349999 242 N° 10 140350097..140350322 224  
join(140332353..140332472,140332552..140332818,140348036..140348185,140348295..1 40348428,140348587..140348800,140348893..140349038,140349124..140349351,14034946 7..140349669,140349756..140349999,140350097..140350322)


exon

Literature and cross-references HaDiOx04 (HanXRQChr14g0449991)
DNA ref. HanXRQ genome:   HanXRQChr14 (140332353..140350322)
Cluster/Prediction ref. HanXRQ:   HanXRQChr14g0449991
Protein sequence: HaDiOx04 (HanXRQChr14g0449991)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   643
PWM (Da):   %s   73499.72  
PI (pH):   %s   6.67
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MKTFMSFAKQQLLSPFKHFIHADFHELFERMTLIDKFLFLIIHGVDKSGIGWHRFPVFLGLTYLAIRRILHDKYNLLSVGKTRVGVRFDTDEVDFRTANGKFNDPLNESAGSEGTFFGRN
MPPVDQRDK
LLKPDPMVVATKLLARKQLIDTGKQFNMIAASWIQFMIHDWIDHLEETNQIELRAPAEVASECPLKSFRFFETKEIDTGLSDIKKGHRNIRTPWWDASAVYGSNLNAARHI
RTFIDGKLKIAKDGLLQHDNDGLPIAGDIRNSWIGVSTLQALFIHEHNAVCDTL
KEYPYLDDEDLYRHARLVTSAVIAKVHTIDWTIELLKTDMLVAGMRANWYGLLGKRFKDTFGHVGG
SILGGLVGLKKPENHGVPYSLTEEFTSVYRMHSLLPDQLVIRDLNSTPGPNKSPKITK
EIDMINLIGKNGEKELSKIGFTAQMVSMGHQACGALELFNYPVWLRDIVPQNVDGTDRPDHI
DLASL
IYRDRERKVARYNEFRRSLFLIPISKWEDLTEDKEAIATLREVYGDDVEELDLLIGMMAEKKINGFAISETAFVIFLAMASRRLQADRFFTSDFNEDVYTKKGFEWVNTTESLKD
VLDRHYPEMTDRWMNSTSAFSVWDAAPEPHNPVPIYFRLPK

Retrieve as FASTA  
Remarks Very long intron 2, but prediction confirmed with 1 EST from Helianthus petiolaris.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAAACATTCATGTCTTTTGCAAAACAACAACTTCTTTCACCCTTCAAACACTTCATCCATGCCGACTTTCATGAACTCTTTGAAAGGATGACACTCATTGACAAGTTTCTCTTCTTG
TGGTATGTTAATGCTGCATGCATGTAATTAAACTTTTAATTTACTATAGCTCATTTTGTGATTATAATTATACTTTGGCAGATCATTCATGGTGTCGATAAATCCGGGATAGGATGGCAC
CGTTTTCCTGTGTTCTTAGGTCTTACTTACTTGGCCATTCGCCGGATTCTTCATGATAAGTATAATCTGCTCAGTGTTGGGAAAACTCGGGTGGGGGTTCGATTTGATACCGATGAGGTT
GATTTCAGGACCGCCAATGGAAAGTTTAACGATCCTTTAAATGAAAGTGCCGGCAGTGAAGGAACTTTCTTTGGCCGTAACATGCCCCCGGTTGATCAGAGAGATAAG
AGGTATTGAAGA
GGTATGGTCCCAATTCCCTGCCTACATGGCAGGGACCACACCACTTTTTGTGCACACAAGGCATCCACATCATCTGGATCTTCTAGAAGCTTCCAGAAGACAACCGGATAAGGCCCGGCG
GGCATCAAAGATGCCTCGCCCAACATCAAGGACAATACGTGGCACCATACACAAGAAGCCACATGGGCCTGCGACAATCCAGGGAGCAGGGCTGACATGACCAGTACGAGGTGCCCAACA
CCGTCGAATGCGGGGACGCCACGTGTACCACTACGCCGTCCCTGACAGAGCAGACCAAGAGGATATTCCCCTTGGTCGGACAGCTGGCGCACATCCACAGCTGGCACAGCTACTTCCTCC
TTCTTCACCCTCCGGCTATAAATAGAACCCTTCATCATTCAGGTTAAGGATCTTGGCTCTCTTTACTCACTCTATACACACACTGTTTTATTCATCTCGGAACAGTACTTATTCTCACGC
CGGAGCCTGGTTAAGAGGGAAAACCTCTCTTTCCCCTCTTAACGAGACTAACGGTGTTTACTGTTTTGCAGATCTCGAGCCTTGGATACGAGCAAGAGAGGAGGTTGAACCCTATAAGTG
AAACGACCCCCTTGGTTATCCCTTGTGTTAACCATTGTTTCAACATTGGCGCCATCCGCTTTTTTGCAAGACCACTCTCACCTCTTTTTCTCTTTTAGAAAAACTCGTAAAAATGGCAGA
CCAGAATCATTCACACCCAGCTGACGGAGAAATCTCTTCTTTTGAACTCGTTTCGGACACGGCACACGTCCAACGCAGTCAAAGGAACGAGACCATCCAGGAAAGCCAGCTGAACAACGA
ATTCCCGTCCATTTTTGGAAGTGCGTCTAGGGCTGCTAGCCAGACCACAACTGGACCCATTTTCCAAACACCAACAAGAATTATCACTCAGACCACAAACGGGGCTGCTCTCCAACCTCC
AACGGGGATGTTACATCAAACCCCACCGCCGACGCAGACTGTAGGCCATGGGCCAGGCCCTTCGGCACCATCGGAACAAGCACAACTCAACTATTCTGCACTTTTAGGGCTACCCGAAGG
AAAAACTCTGGCTTCCTGGTATGCCGAACAGATGGCGTCTATAAACCTTGTCTATACGCAGCTCAGCGCACAACAAGCCTTACTCCAAGCACAGGCTAACCAATCAGCGTTCGTAACTCC
ACAACCAAGGTCTCTGAGTACACACACGGCTCAGCAGACAAACGCGTGGAATTTACGACCAGAAAGAGAACCAGTGCAACAGGTCAGAAGACCCAGCATACAAGACACGCGCGATACCTA
TGCTGAAACAGAGAGCAACTTCGTCCAAACTTCCAATCAGCAACGAAGACCGATCCAAACCCGCTTGGGCGCGCGAAACATGAATACAGAATGGGAAGAGGAGGAAGACGACCCAACGTA
CAAGGCAGAATCCACAGTGTTTAGCAGACTTCCTCCAGAGCATGAGGCTTACAAACCAACCAAGCGCGCGGGGTACAACCCCAAAGCAGAACACGACTTCACCTTAAGCTATCGTCCTGA
GGACATGGCTGAAAATTCAAAATTTATTCCAGAAATCGCGTGCGCGGCCATCGACAAAACAAAGTTACCGCACAACGTAGGTAAATACAATGGGTTGACGGATCCAGATGATCACCTCCA
GGTGTTCAAAGGCGCAGGAGCAACAGGTGGTTGGAACCTACCAACATGGTGTCACTTGTTTGCTCAAACTTTCGTTGGTGCGGCACGCATCTGGTTCGACAATTTACCAGCTGGAAAAAT
CAAGTCATGGGTCGACTTCCGAGAAAAATTCTTAGCACACTTTTCTCAACAGCGAAGACACGCCAGAGACCCAGGTGATTGTCTGAACATATACCGAAAAGACTACGAAAGCGTGGAGGA
TTTTATTACGAGGTACAACAAAGAATGTCTGGAAATTGGAGACATACCGGAAAAAATGATGCGCGCACACTTCATGCGAGCAGTTAAATGCGACGATCTGGTTAAAAGAATCAAAGGGCG
TGACGGAGGACCCAAAGACTGGGAAACCTTCATTGAAGCAGCCAAAACCATTGCGCAGACAGATAGGCAACTGACCGGTGACGATCACCGTCAGCGCGCACACAACCACCACGATCGAAA
CAACAGAAGGGGTAGAAATCAACCCTGGAGGGCTTCCGGGAACAGAGAAAGAAGTCCCCCACGGGACGACGCACGCCATACGATCAATCAGATAGCCCATCGAAAAGAAGTAAAGCGCGA
AAATAGAGAAAAGCAGTGGACTCCACTAACTAAAACACCTTCTGAAGTTTTAGCTACAGAGAACCATCAATTCAAGCCACCTTTGCAGATGCGCAACAAAAGGGGTCAAGACCCAAATCT
CTTCTGTGAATTCCACAAAGACACGGGCCACCTGACCGATGATTGCTTCAGCTTGAAACAAGAAATCGAAAGAGCTCTAAGAGACGGCAAGCTCGGTCACTTAGTCAAAGGAGGAAAGCG
CGATTACCGCCAGATACAACGAAGAGACGAAGGTCCAGACAACAAGAAGCTCAGAAAGCTAGAAACCCATATGGTGCAAGGAGGACCACGGCGACCAAGAAAAAACTACAACAAACGCGC
GCAGGATGATTCATGGCGCGAGAAGCAAGTAGTATTCCCAGTTGTCAGGGGAGGTCCAAGAGAAAAGCGGCCAATAGTCATTCCAGGGGTGATCGGCCACTACCAAACAGATTACATCTT
TATTGATCCAGGAAGCACCGCAGACATCATATATGAACAGTGCTTCAATCAATTCGACCAAGAGGATAAGGCGCGCCTGGAACCAGTTGACTACCCACTAACTGGATTCTGCAATGAGGC
CGTCTTTCCCCTAGGACAAATATCTTTCCCAGTATTACTTTCTGATGGGAGAAATTCAAGAACTGAAGAAGTCACATTTATGGTGCTACCGGCACATTCAAGACATGACATCCTTTTAGG
ACGAGAATCCCAAGGAGATTTCAGCATGATCTGTTCCGCACCACATTCTGCCATAGGTTTTCCAACCGAAACAGGCGTTGCGTTGATATACGCAAGCAAGGAAGTGCTAGCAACAGACGA
AATCAGGCCGGCAAAAGCAAGCAAGCCCGCACCGCGCAGAGAGGCAGAAAAATGGGTATTGAACAGTGCATACCCAGAACAAACGGTCACTCTGGGACCCGCAATGTCTGACCTAACGCG
TGCGGCGTTAAAGAAATTACTGCATGAAAACATGGATGTGTTCGCCTGGACACCAGCCGATATGGTTGGCGTTCCACGACACATTGCGGAACATCGGTTAAACGTCTCAGAGGATGCAAA
GCCAGTAGTGCATGCTAAACGACACCTGGGGGACATCAAACATGATGCAATGAAGGAACAAGTGTTAGAACTGCTAAACGCAGGAATCATCAGGGAAGTCCGGTACCAAACGTGGGTGGC
AAGCCCAGTCATGGTGAAGAAACCGAATGGTAGTTGGCGAATGTGCGTCGACTACAAGGATCTGAACAAAGCATGCCCCCGTGACTGCTATGCGTTGCCCGACATAGACGAGAAAATAGA
TTCTTTGGCAACGTTTCGGTGGAAATGCTTTCTGGATTGCTACAAGGGATACCACCAGGTCCAGATGGCTGTTCAAGACGAGGATAAAACCGCTTTCCGCACGCCAACGGGGCTATACTG
CTACACCAAGATGCCGTTCGGCTTAAAGAATGCCGGTGCTACGTATCAACGATTGATGAACGAAACATTTAGCGACGCCATCGGTAAATACATCGAGGTATACATGGACGATCTGGTAAT
CATGAGCAGGGAGGAGAGCGCAATGCTGGTAAATATCCAGAAAACCTTCAACACGCTGCGAAGCGTGAGCATCAAACTGAATCCAGCAAAATGCTCATTTGGAATGGAGGAAGGAAAGTT
TCTGGGATTCATAGTCACCAAAGACGGTTTTAAGGTGAACCCAGAAAAGGTCCAGGCCATAGAGAGGATGCCTTCACCAGCAAGCATCAAAGATATGCAAAAGCTCGCAGGACGATTGGC
AGCCCTCAATCGATTCCTAGCAAATCACGCGGCAAAATCTTTCCCATTCATCAAGACCTTACGAAACTGCATGAAGAAAACCCAATTCCAATGGACTCCGGAAGCAGAAAGCGCGTTCCG
CGAGATGAAAGACTGTCTCATCAAGCTGCCAACTCTAACCGCACCTAACAAAGGAGAACCTTTGGTTTTGTACCTCTCAGCTTCCGACAGGGCCGTCGGTGCCGTATTGCTGGTTGATCG
ACAAGGTGTCCAAACACCTGTGTATTATGTGTCCAGAACCCTAACCGACCCAGAAACAAGATACGCAATCATGGAAAAGCTTGTCCTTGCACTGATTCACGCGTCAAGAAGGCTACGCCG
ATATTTCGCCAATCACGTCATCCACGTGTTAACAAATTACAATATTGGGAATATCCTAGCAAGGCCAGAAATATCAGGAAGGTTGGCCAAATGGGCGATAGAGCTAGGGGGACTCAACGT
AGTCTTCAGACCACGACCGTCGATAAAAGGCCAAGTTTTGGCAGACTTCATGACGGAAGTCCCCGATGACAAAGACAGAGAATGCAAGGCGATGGAGAAGGCGGAGAAAAAACAAATCGA
AGAACCATGGATGTTGTATACGGACGGCGCGTCCAACGAAGATGGGGCAGGTGCGGGGTTGCGGCTAGTGAGCCCAGACAAAAACGAGTTCACCTACGCCATACGTCTAGACTTCAAGAG
CACAAATAACGAGGCAGAGTATGAAGCCTTTCTGGCCGGCTTGCGCTTAGCAATCAAAATGGGAGTCCGACATATTGAGGCACATGTGGACTCCATGCTAGTGGCAGGACAAATCAACGG
TCAATACGAAGCCAAGGGCGACATCATGGCACTCTATCTCAACCAGGCAAAGACGTTGCTGCAAACTTTCTATTCTTACAAGGTGCACCACATAAACCGCAGCGAAAACAAGCCAGCAGA
CGCCTTAAGCAAACTTGCGTCAACAAGTTTTCAGCACCTAGCCAAAGACGTACGAATAGAAGTCTTAAGCAACCCGTCTGTGCCACTCCGCGAAGTCAGCGTCATCCAAACAGGAACCAC
GTCATGGATGACACCCATCATCATGTACTTACAGTCCGGAATACTCCCAGAAAATAAAGCCGAGGCGCGGAAAATCCAATATAAGTCAGAACATTATCAAATGGCGGATGGGATATTGTA
CCGAAAGTCATATCTCGGCCCTCTGTTGAGATGTGTCGACGCCGACGACGCAAATTATCTGATCCGGGAAGTACATGAGGGCATCTGCGGCATCCACGCCAGGCCACGCATGGTAGTGGC
TAAAGTAATGAACGCCGGGTACTACTGGCCCGGAATGCACCTCGATGCCGTGAAAGAACTAAGGAAATGCAGCGGCTGCCAACGGCATGCACCAAAAACCATGCGGCCAAAAAATGCGTT
GGTGCCTGTAACAACCGCATGGCCCTTTCAGCAATGGGGCATAGACATGGTGGGCCCCTTTCCAGAAGCTCCGGGGGCAGTCAAGTTTATCATCGTCGCGGTCGATTACTTCACCAAGTG
GGTAGAAGCAAAAGCACTTGCGTCAACCACGTCAGCAGTCGTTAAACGCTTTATCTGGGAACAAATCATATGCCGTTTCGGCCTGCCACTCCGAATCATCACCGACAATGGTACAAACTT
TGCAGCAGATGATCTCGAACGATGGTTCAAGGAACTAAACATCGAACATACCTTCTCGTCGGTCGCACATCCGCAAGGGAATGGGCAAGTTGAAGCGGTCAACAAGAGCATCGTCGATGG
CATCAAAGCAAGGCTCGGTGAAAAAAGACGAGGGTGGGTCGACGAGCTACCAAGCATATTATGGGCCCATAGAACAATGCCCAAAACAAGCAATGGAGAAACACCCTTCAGCTTGGTCTA
TGGGTCCGAAGCTGTGATCCCAGCAGAAATCGGACTCCCATCTCCAAGAATGCTCTCCATGAATCTGATCAACAACGAAGAAGAACGGAGGATCGATCTAGACCTCCTAGAAGAACGAAG
AGAGATGGCTGCAATCAATGAAGCCAAGTACAAATCAAAGCTTGAAAAGTATTACAATTCCCGAGTCCGGATCTGCACCTTCAACCCAGGCGATTATGTCTTAAGGGACAATGAAGCATC
CAACACAGAAAAACCAGGGAAACTGGCTCCCAAATGGGAAGGCCCATACATCATTGATGCGATCCTCGGCAAGGGAGCATACAAGCTACGCACCATGAACGACAAAGAGGTCCCGCGAAC
CTGGAACGCCCAACAGTTACGAAAATGCTACATATAAACATGTAATCGTATAAGGCGAATCGCCGACACATTTACTTAATACAAGAAGCGTTTGGCTACCTCTATTTCATGCAAAATTTT
TGTCACAACTGCATTTATTACTTCGGGCGTACGCGAAAACATCAATGGCTCGACCATAGGAAACGTTGTAGACCTCCAAGGCTCGTCACAACCAAGTGAACAGCCGGGTCAGAAACACAA
CCAAAGCCTACAAAACGCCAAAAAACATAACTTCGTGCCCACATAAACACGAAAACATCGATAGCAAGGCAACTAAACATTGTAACGCTCCCAAGGCTGCAATCCCAGCCGAGCTCACAC
AATAACCTGCAACTCGATCACGTCGTATATCACATACAAGCGCGCATACTTAAACGACAGGATAAAAACGTTGTCATAACACAACATCACCTGTCAGCGCCATCATTCATTCGCGACACC
TAATCAAAGTAATTAAAACATGCACATATTATGACGATTTCAAAATTTGCATAAACACAATCAGAGCAACAAACAATCGTCAAAGTACACAAAAGTACAAAGTAAGCAAATTGTCTTGGA
ACATCATTACAAGCCCATACAAGGCTGTACGCGGCGCACAGGCCGGAAATCCCAAACTAAGTGCGGCTGGAACCAGCACCACCCGCACCGCTAGCATCATCTCCTCCCAATGCCTTCTTC
AACAAAGCTACCCCATCCGCTTTGAGGGCCAACTTACCAGCAGCCTTAACTACGGCAAAATCAAGCATTGCAAACTCTCGCCGTTTCTTCGCATACGCATTATCGCAATCAGCTTTATAC
AGCTCAAACTTATAATCCTTCTCATTGTTTGCAGCAGCACACCTGCCTTCAGCATATCCATTCTTGCGGCCACTATTATAACCTGCCCCACCCAACTCAAACATGTATTGGGCAAGCTCA
GGAGAATTCAAGATACGATCAGCAAGCTACATGAAAAACAAGGTTGAATAAGATAGCAGAAAAAACTACACAAAAGTAGAACAGGCAAAGAACATTACCAATGGAACACCACGAGATAGC
AGCCACCTACTATCAGCTGAAGCTTCCTCAGCAAGGAGCTCAGCGGCACGGCACTCCGCCGCCTTCTCATCAAGAAGACGCTGAGCAGCCTCACATTCCGAAGTCTTCGCCTGCACAAGA
AGCTCCAGCTTATCTATCCGCGCAATGTACTCCTTTTCCTTCTTCTCCGCCGCAAGACGCGTTTGATGAGCCTCTTCCGCGAGAGCAGTCTTCTCACCAGACAGCCGCACAATAGCATCA
CGCTGCGACTGCATCTCCGCGTTCGTACGAGCACAGGCCGACTCCCAATCCTTTTGTTTTTGCTCGTTTAACTGTTTCTGCTCTTCAAATTTCCGCTCAGCCTCAGTAACCCGCCATTGA
AGACCCTTCTGCTCTGTATTAAATTTTTCCTTCTCTTCAGCAAATGCCTTCTTGGCCTCCTCAAACGCCATAGTTTCTTCTCCCATAGAACGCCACTCACGAAGAATCTCCTGAGAGGTG
GCAAAGAAGTTAACCCCAGCCGTAACATGATCATCTATAAGGGCAAATCGGCTGCGGTTTTTCTGAAACATCCTCTCGGCAGGAGGAAGGGATAATGAGAAGAACTCCTGACAGTTCTGC
AAGTCATTCATGCGGGAGCCCTGAGTAAGGCCCCAACGGGGGACGTGCGGCAAATCATCGCAAGCCGGGTTACCCCAGGAATCACCAGGATTTTGTTCAAAGTGCGGACTCCGCACAAAG
CCAGAAGTGGCTCCACCTTCAACGGGAGGACGTTTTGTGTAAATGGTTTGTTGTGGAGTAGTCTCACTAGACTCCATCTCCACCTCAATACCTTTACCCTTATCAACTCCACCAGCATCA
CCACCAACATCACCACCAGCAGCGTCACCTCCCTCAAATATACCTTCAAACCCTTCACCGCCCTTCACAGTTACATCAGCATCATCTCGAGGAGGGGTCAAGCCAACAGTCCTCGAAGGA
GGAGAGGTAGGTGGCGTAATCTGCGAAATATCCACTGGCCGTGGCTTCTTGCTAGTCTTAGATTCTGCCACAAAATTACAATTAGCTAAAAAGACAAAATAAAAGATGACAAAAACATAC
ACAGAAACTAAGTTACCAGGCCGGGGAGCAGAAGCCTCAAATATCTTCTCCAACAAATTCCCTCGACCCCCACTAAATACACCCAAATCAACCTCAGATTCAGAAGGGGCTGGGGGGGCA
TCCTTTTGAAGCTTGGCCCTCTTGGCCGCAAGAAGGGCAGCAGCTTGTTCATCAAGATGACGCTTATGATCATCAAGAGCCTTCTTCTTCATATGCTCAGTTAAAGTGGCTTTATCATCA
AGATCCGACCCTTGACGCACCTCCCCAGGGCCTAAGCCAGACAACGTATCAGACACTACTACATAATCTAGATAAGAAAGAATGGAGGGTCGCTTACGAGAAGATCCGGCAGCTCCACCC
TCAACTCCCTCTCCTTTCCTTCCAGACCTATCAATCCGAGCTTTCTTCCTCGTCTCCAACTGAGCAGACGGATCAATAGACGCTTCCACGTCATCATCATCATCTCCAACAACCTCGTGT
ACAGGTTCAGCATTGCCACCAGGCACGGTACCTGCACGCGCGGAACGAGACGTTAGATCCCCACCACTCCGACCAGAACTCCCACTGGAAACAACGATGACATCCTCTTGATCAGAGTTA
ACAAAGTCACCCAAAACATCCTCACCAAGAACCTCAGCAGCATATCGGTTCAAACTTTGCTCGGTAGGGTGCAAGAAGCGATCTTGTATTTGATCCAGCCACGTCGGGTTACCATCAGCC
TGGATGGCTTCAACCATGGCACCAGCAGCCTTGGGGTCCAAAGCATTCAACAAGCTATAACCAACTGCAGAAAAACAACAAAGATCAAAGCACAAAACAAGCCTAAAGGGAAGTAAACAA
CCGACAAAATAAGAAGGAATCATACATTTGCCCTTATGGCTGTATACAGGCTGACCCAAAGGATGCTTAGGCACCCATAACATGCTCATCCCAGCACCGACCAAGGCCATTTCATCCAGT
TGAGAAATAGCAGTTGCCTTGGCCGTTATCCTCCCATACCATTCTTGGGCAGCAAAATTCTCCAACGGTTCCACCCTTGGAATCCCTTGATCGACCTGCCTGTACGTCATCTCCGACGGA
ATCACTCCACGACGAATGTAGAAAAACTTCTGTTTCCAGTCATGAATACCCTTTATTGGGGCAGAACACACTGGGACAGCCCCCGTCCTGGCTTGAAAAGAATAAAAACCACTAGAATAT
GTCACACTATAAAAAGTGTTGAACATCTGAAACGTTGGTTCAACCCGATAAGACCGGCAAACAAACTCGAAGTGAGTAATCCGGGGAAGCCCAATGGCATTTATCTGAGAGATATGAAGA
CCATAACCCCTCAAAACACCGGCAACAAACTTAGTAATGGGCAACCTAAAATTGCCCTCTCGGAAAAAAGCAGCATATAAAGTGATAAAACCGGGGGGGGGCATCCAGTGCAGTAGATCC
AGACGGAGGATATCGAGCCCCCCACTCAGGTCGAAAGTTATGCCCCCTAACGAGGTTATGAAATTCCTCCTCTTTCCAGTTAATGACGGCCTGATCCGGGCCAGGAGGACCCCTACTCCG
TTGTTTCCTTTTCTTCTGAGAAGGATTTTTCTCCGCCATTAAGAAAAATCAAGAGTAAGATACTTCGGATTTGAAAAAATATGAAGGACAAGAAGAACAAAGGAGAAAGGAAGGGAGAGA
ATCACTTAGAAATTTCGAAGATATGGGAGAAGTGAATAACCGCTCTACTATTTATACTCATCGCATTAAATGCGAGAGATAGTGGAACGTCAGGAGGCAGGAAAAGTAACCAATAGATGG
CCGACACGTCAGAGGAAAGAGAAGACGTCGGCTCATTTTTCGGGAAGCATGGGCAACAGGGTCTGCTGATGTGACAGAAAGTCACTGCAAAAGCAACTGTTCACCTCCGAAAAGACAGGG
ATGTCAGACGGTCACACCAAACGCTCCCCCATAACCACCAAACTTCAAACAGAAGAAAGCACGAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCA
AAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATACGTCCATT
TGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCA
AAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATT
TGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGTGTCCATTTGGCTCACTTTTTTCAAAGAGCCA
AAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATT
TGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCA
AAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGTGTCCATT
TGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCA
AAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAAGGCCAAGACTCACGCGGCATGCGTCCACTCGGCTCAATTTATATTCACCAATTCCAACGCGATGCATAAAAAACCAC
AACTCTCATAAGTCCAGAATCCCTCCTAGACGATCAACTCAACTAGAAGGCACACTGGACTGGGGGGACTTGAAGAGGTATGGTCCCAATTCCCTGCCTACATGGCAGGGACCACACCAC
TTTTTGTGCACACAAGGCATCCACATCATCTGGATCTTCTAGAAGCTTCCAGAAGACAACCGGATAAGGCCCGGCGGGCATCAAAGATGCCTCGCCCAACATCAAGGACAATACGTGGCA
CCATACACAAGAAGCCACATGGGCCTGCGACAATCCAGGGAGCAGGGCTGACATGACCAGTACGAGGTGCCCAACACCGTCGAATGCGGGGACGCCACGTGTACCACTACGCCGTCCCTG
ACAGAGCAGACCAAGAGGATATTCCCCTTGGTCGGACAGCTGGCGCACATCCACAGCTGGCACAGCTACTTCCTCCTTCTTCACCCTCCGGCTATAAATAGAACCCTTCATCATTCAGGT
TAAGGATCTTGGCTCTCTTTACTCACTCTATACACACACTGTTTTATTCATCTCGGAACAGTACTTATTCTCACGCCGGAGCCTGGTTAAGAGGGAAAACCTCTCTTTCCCCTCTTAACG
AGACTAACGGTGTTTACTGTTTTGCAGATCTCGAGCCTTGGATACGAGCAAGAGAGGAGGTTGAACCCTATAAGTGAAACGACCCCCTTGGTTATCCCTTGTGTTAACCATTGTTTCAAC
AGGTATATATATTAATCGATGAACTTCAAATAACACATATTGACATATATATCACTAGCTAGTTATCTTTTCCAACTATTGTTCTAAAATTGTATTGATGTTGAACTGGTCATTTTTTAT
AACCTTGACTGGTTAGGGTTTCTTTTAGGTTTTGATTGACAAAGAGCAAAAGAGAAATTTTACGACGTCTTTGCTTGAATTCTGACGCTCTCTACAGTACATGAAATTTTCCCAAAGTTT
TATTTGTAATGTTTAGGATGATATTTAAAAAGATATAATACATGTATCTTTTTAAATCCTAGCACAAATCTATAGATATATCCTTAAATTTACGTAACATGTTTTGCAAAATAAATTAGT
GATGAGTAGAGATAGTCGACTGTCTCAACCCAACTTAATATTTAGAGCTTTGTATATCTATATGCTAATTAGAACCCCGTATATTATACGGGCTGAATAAATGTGAATTTATATACTAAA
TAATACAATAATATATCTTTAAAAACTTTATTTATTACATGGATTGAATACATATAATTTTATATACTAAAAAAATAAAAAGTTATATCTTTAAAAACCCCTTGTATTGTTGTATAAGTT
GAATAAATATAATTTTATACATCAAATAATGAAAAAAAATACTAAATAATAAATTATTTATATATTTAAAAAACCCCGTGTATTATATGGGTTGAATAAATCTAATTTTATATATCAAAT
AATAAAAAGGTTATATTTTTAAAAACTCATGTATTACACAGGTTGTGTAAATGTAATTTTGTATAGTAATTTATAAAAAAAGTTATATCTTTAAAAACCCTCGTGTATTACACGGGTTGA
ATTTATATATCAAATAATAAAAAAATATATTTTTAAATACCTCATGTATTACACGGGTTGGATAAATGTAATTTTGTATACTAAATGATAAAAAAAAGTTGTATATTAAAAAACCCACGT
GTGTTACACAGGTTGAATAAATGTAATTTTATATAGTAAATAGTTAAAACTTATATCTTTAAAAACCTCGTGTAATACACGAGTTGAGTTATGATTAATTTCAGATTTTGATTTCCTGCA
TTTTCTCTCCTATTAAATAATAATAATTTTTCTCTCCTTAAAATTACATCTTAAGTCATATTCTTATTTTTATAAAACATATTTAAAATATATATAATATTTTATTTAAAATAAATATAT
TTAAAAAATTTACTGATTATAAAAAATTAAATATATTTAAAACATATTACTGATTATAAAAAGTCATAGACAATTTAAATCTGTTACCTAAATTTGGATGTAAAAGTCTAAACGTAAATA
TTAAAATAAAATCTCTCTACGATATAATAAAACTATAATGTAACCTATTCCTATATCTAAGGAAATATATTATAAAAATAATAATTTAATATTAATAGTATATTTTCTAAATAAGTTTTT
AACCTATACTATATCTCTACGTATATCTCATAATTATATATATTAAATGGTATATCTCTACGTATATCTCATAAATAATTATTTAATTTTAAATAATTATTTAATTTTAATAGGTTAAAA
GATAGATGACTTCAATGAATGATATGTATCTCCAAAGTGGTTTCTTTTATTATATAGTATAGATTCATACTAAGTAATTGTAAATATAAATCTCAAGGTCAGATAATTTTGACACACGTG
ATTATTTAAAATTTTAATTATCATCAAGAGAATTATGTCCATCTTAAAAGTTAGACTAAAAATTAGATGTTTATTACTCTTGTAGATAAAACACAAGTCATATATATACCGATCGATCCA
CTGTCATTGTTCAAGATTTGTTATTCTTTCTTTCTTTTTTTTTTTTTTTTTTTTTTCATTTGTCATGTAGTTATTGGCAGTACTTGATATTCTAACTGCCACTCCGCTATCCCATTAAAA
CATCACGTATGGTCCACACCAACATTTATATAATAAAATTAAAACTTTTTTAGTCTAAATACTATTCTTTTTATATAATATGTCTAATGACAAAGCTGACTTCCTTTTGTTAGATGAGAA
TAGCAGTTTTAACCGACATAATTATCAATGAAATTAAAGTGAGATACGATATACACGAAATCCCTTAATTTATTTTTTTAATAATAATTTTTTACAACTAAATATCACCTAATTTTTTAC
TTACAAAAATTTAAACTTATATTTTCATGAGAAACAAATAACATGCCACTAGATCAATAGATCATAAGTACAAATCCTGTTTATTGATCACATGTTTATCTAATTACTACATACTCCATC
CGTCCCATTAAAAACGTCATATTTTGAATTTTCAAAGTCTTTATTTATGAACTTTGACCTTAAATAATTTTGTTTGTGTTAAATAATACTTGATGAAAGTTATATGATTTGAATGTGTTT
TACAAGTGTTTTTATCGGGTTAATTTTTATCAAGTTTTATATAACACAAAAAATATATAATTAAAGTCGAACTTTATAAATAAAGAAAAATTCAAAATAGGACACTTTTAATGGGACGGA
GGGAGTATTAAATAAGCAGGTTTCGAAGATTTCATTCTTATTATACCTCATAAATTTTATAATCATTTTTATACGAGTGTCATCCTTTGCTTGTAGTCTTGTCTGGAGCTGGATTTTGAT
TTATCAAATTTAAAGCAAGTAAATAACTTCTAGATAAAAATGTAACATGCAAAAAAGTTCGTTCATCTAATAGTAGTTATTACTTTTCAAAAAAATCTAATAGTAGTTATTCAATGCAAC
TCACAATAAACAACAGAAATTTGAAATGGTAGTCGAGCTTTGAATGTAGGGTGAGACAATTGTACTAAAACGTATTTCTGTGTGCAGCTATTGAAGCCAGATCCGATGGTGGTAGCAACA
AAGCTTCTAGCACGTAAACAACTTATAGACACCGGAAAGCAGTTCAACATGATTGCTGCTTCATGGATTCAGTTCATGATTCATGATTGGATTGATCACCTCGAAGAAACAAACCAG
AGG
TTCGATTAATCAAGATATTATATAATCCCGTGCCATAAGAAGCTAGGATGAACTTTTGTACTAACACATATATAATATGTTACCATGGATTATATATTATGATAACAGATTGAGCTTAGG
GCGCCAGCAGAAGTAGCAAGTGAATGCCCTCTCAAATCTTTTAGGTTCTTCGAGACTAAAGAAATCGACACCGGTCTTTCTGACATTAAGAAAGGTCATCGTAACATCAGAACTCCTTGG
TG
TGGTATGATTTCTATAGTTTAATTAATCATATTTTATAACCTTGCATTGTTAAAAACTCTCATTAATTAATAGTTATAAAAAAAGAATTGAAAAAGACAATCAGTATATATTCAAATA
AATAGTATGAAGTTAAGAGTGGGATATATATGATATGTACAGGGACGCGAGTGCGGTATATGGAAGCAACTTAAATGCTGCTCGTCACATAAGAACGTTCATTGACGGAAAGCTCAAGAT
TGCAAAAGATGGTCTCCTTCAACACGACAACGATGGATTGCCCATAGCGGGAGACATTCGTAATAGTTGGATTGGGGTGTCAACTTTGCAAGCCCTCTTTATCCACGAGCACAATGCGGT
TTGTGACACCTTAAAG
AGGTATGTGTGTATATGGTAATTGGTAAACATGTAGCCAGTTATTTTTAGTTCATCAGCTAGCTAGTAGCTTACAATATACTTATGTCTTTCAGAAAGAATATC
CTTATTTGGACGATGAAGATCTGTATCGCCACGCAAGACTAGTAACTTCTGCGGTGATCGCAAAGGTCCACACCATTGATTGGACCATTGAGCTTCTCAAAACTGACATGCTTGTTGCCG
GAATGAGAGCTAACTG
TGGTAATTACTTATTTTAAAATAAATAAACTTCATTCTTGAAATGAAGCATATTTACTAATATTTTGTTGAAATTTGTTATGTCAAGGTATGGGCTATTGGGGA
AAAGATTCAAGGACACATTTGGGCATGTTGGAGGGTCTATTTTGGGAGGACTAGTAGGGCTAAAGAAACCCGAAAACCACGGGGTACCCTACTCGCTAACAGAAGAGTTTACAAGTGTTT
ATCGAATGCATTCTCTCTTACCTGATCAACTTGTCATAAGGGATCTTAATTCCACACCAGGCCCTAACAAGTCCCCAAAGATTACTAAGGA
GAGTACGTAAATCACACCTAAGTTATTTT
TTCTTAATTTGAATAACAATTTACTTCTTTGTATCTTCATGTAATAACCAACAAATAAGTATGCTCATAGTTATTAATTTACATTCAGGATTGACATGATCAATTTGATTGGAAAGAATG
GAGAAAAGGAATTATCAAAAATTGGATTTACCGCACAAATGGTATCCATGGGACATCAAGCCTGTGGGGCGCTTGAGCTATTTAACTATCCAGTCTGGCTTAGGGACATTGTGCCTCAAA
ACGTGGATGGGACTGATCGCCCCGACCACATTGATTTAGCATCACTTGAGA
GAGTAAGCTCCTTTTGTCTTTCAATAAAGAATGTGTATATATATACATGATTAATTTTGATCCTAATAT
ATGATCATGATGCGTGCAGTTTATAGGGATAGGGAGAGGAAGGTAGCAAGATATAATGAGTTCCGTAGATCACTTTTCTTGATCCCAATCTCCAAATGGGAAGATCTAACAGAGGACAAA
GAAGCTATTGCCACATTGCGTGAAGTGTACGGTGATGATGTCGAAGAGCTTGATCTGTTGATAGGAATGATGGCCGAGAAAAAGATCAATGGGTTCGCCATTAGCGAAACCGCTTTTGTT
ATCTTTCTAGCCATGGCCTCAAG
AGGTATATATACAAATCACCAAAATAAGTGAATCAAAACTGTAAATAAACAAACTAATGGATAAATTTTTATACAAACTAAATATTGGTTCATGTGC
AGGCGACTCCAAGCGGATAGATTCTTCACTAGCGATTTTAACGAAGATGTGTACACAAAAAAAGGGTTCGAATGGGTGAACACAACAGAGAGTCTGAAAGATGTGTTGGACCGACACTAT
CCAGAGATGACCGATAGATGGATGAACTCAACAAGTGCTTTCTCGGTGTGGGATGCTGCCCCTGAGCCTCATAATCCCGTACCAATTTATTTCCGTCTCCCCAAGTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAAACATTCATGTCTTTTGCAAAACAACAACTTCTTTCACCCTTCAAACACTTCATCCATGCCGACTTTCATGAACTCTTTGAAAGGATGACACTCATTGACAAGTTTCTCTTCTTG
ATCATTCATGGTGTCGATAAATCCGGGATAGGATGGCACCGTTTTCCTGTGTTCTTAGGTCTTACTTACTTGGCCATTCGCCGGATTCTTCATGATAAGTATAATCTGCTCAGTGTTGGG
AAAACTCGGGTGGGGGTTCGATTTGATACCGATGAGGTTGATTTCAGGACCGCCAATGGAAAGTTTAACGATCCTTTAAATGAAAGTGCCGGCAGTGAAGGAACTTTCTTTGGCCGTAAC
ATGCCCCCGGTTGATCAGAGAGATAAG
CTATTGAAGCCAGATCCGATGGTGGTAGCAACAAAGCTTCTAGCACGTAAACAACTTATAGACACCGGAAAGCAGTTCAACATGATTGCTGCT
TCATGGATTCAGTTCATGATTCATGATTGGATTGATCACCTCGAAGAAACAAACCAG
ATTGAGCTTAGGGCGCCAGCAGAAGTAGCAAGTGAATGCCCTCTCAAATCTTTTAGGTTCTTC
GAGACTAAAGAAATCGACACCGGTCTTTCTGACATTAAGAAAGGTCATCGTAACATCAGAACTCCTTGGTG
GGACGCGAGTGCGGTATATGGAAGCAACTTAAATGCTGCTCGTCACATA
AGAACGTTCATTGACGGAAAGCTCAAGATTGCAAAAGATGGTCTCCTTCAACACGACAACGATGGATTGCCCATAGCGGGAGACATTCGTAATAGTTGGATTGGGGTGTCAACTTTGCAA
GCCCTCTTTATCCACGAGCACAATGCGGTTTGTGACACCTTAAAG
AAAGAATATCCTTATTTGGACGATGAAGATCTGTATCGCCACGCAAGACTAGTAACTTCTGCGGTGATCGCAAAG
GTCCACACCATTGATTGGACCATTGAGCTTCTCAAAACTGACATGCTTGTTGCCGGAATGAGAGCTAACTG
GTATGGGCTATTGGGGAAAAGATTCAAGGACACATTTGGGCATGTTGGA
GGGTCTATTTTGGGAGGACTAGTAGGGCTAAAGAAACCCGAAAACCACGGGGTACCCTACTCGCTAACAGAAGAGTTTACAAGTGTTTATCGAATGCATTCTCTCTTACCTGATCAACTT
GTCATAAGGGATCTTAATTCCACACCAGGCCCTAACAAGTCCCCAAAGATTACTAAGGA
GATTGACATGATCAATTTGATTGGAAAGAATGGAGAAAAGGAATTATCAAAAATTGGATTT
ACCGCACAAATGGTATCCATGGGACATCAAGCCTGTGGGGCGCTTGAGCTATTTAACTATCCAGTCTGGCTTAGGGACATTGTGCCTCAAAACGTGGATGGGACTGATCGCCCCGACCAC
ATTGATTTAGCATCACTTGAGA
TTTATAGGGATAGGGAGAGGAAGGTAGCAAGATATAATGAGTTCCGTAGATCACTTTTCTTGATCCCAATCTCCAAATGGGAAGATCTAACAGAGGAC
AAAGAAGCTATTGCCACATTGCGTGAAGTGTACGGTGATGATGTCGAAGAGCTTGATCTGTTGATAGGAATGATGGCCGAGAAAAAGATCAATGGGTTCGCCATTAGCGAAACCGCTTTT
GTTATCTTTCTAGCCATGGCCTCAAG
GCGACTCCAAGCGGATAGATTCTTCACTAGCGATTTTAACGAAGATGTGTACACAAAAAAAGGGTTCGAATGGGTGAACACAACAGAGAGTCTG
AAAGATGTGTTGGACCGACACTATCCAGAGATGACCGATAGATGGATGAACTCAACAAGTGCTTTCTCGGTGTGGGATGCTGCCCCTGAGCCTCATAATCCCGTACCAATTTATTTCCGT
CTCCCCAAGTGA

Retrieve as FASTA  
cDNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TTCATCAATA CCCAAAATGA AAACATTCAT GTCTTTTGCA AAACAACAAC TTCTTTCACC CTTCAAACAC TTCATCCATG CCGACTTTCA TGAACTCTTT GAAAGGATGA CACTCATTGA  CAAGTTTCTC TTCTTGATCA TTCATGGTGT CGATAAATCC GGGATAGGAT GGCACCGTTT TCCTGTGTTC TTAGGTCTTA CTTACTTGGC CATTCGCCGG ATTCTTCATG ATAAGTATAA  TCTGCTCAGT GTTGGGAAAA CTCGGGTGGG GGTTCGATTT GATACCGATG AGGTTGATTT CAGGACCGCC AATGGAAAGT TTAACGATCC TTTAAATGAA AGTGCCGGCA GTGAAGGAAC  TTTCTTTGGC CGTAACATGC CCCCGGTTGA TCAGAGAGAT AAGCTATTGA AGCCAGATCC GATGGTGGTA GCAACAAAGC TTCTAGCACG TAAACAACTT ATAGACACCG GAAAGCAGTT  CAACATGATT GCTGCTTCAT GGATTCAGTT CATGATTCAT GATTGGATTG ATCACCTCGA AGAAACAAAC CAGATTGAGC TTAGGGCGCC AGCAGAAGTA GCAAGTGAAT GCCCTCTCAA  ATCTTTTAGG TTCTTCGAGA CTAAAGAAAT CGACACCGGT CTTTCTGACA TTAAGAAAGG TCATCGTAAC ATCAGAACTC CTTGGTGGGA CGCGAGTGCG GTATATGGAA GCAACTTAAA  TGCTGCTCGT CACATAAGAA CGTTCATTGA CGGAAAGCTC AAGATTGCAA AAGATGGTCT CCTTCAACAC GACAACGATG GATTGCCCAT AGCGGGAGAC ATTCGTAATA GTTGGATTGG  GGTGTCAACT TTGCAAGCCC TCTTTATCCA CGAGCACAAT GCGGTTTGTG ACACCTTAAA GAAAGAATAT CCTTATTTGG ACGATGAAGA TCTGTATCGC CACGCAAGAC TAGTAACTTC  TGCGGTGATC GCAAAGGTCC ACACCATTGA TTGGACCATT GAGCTTCTCA AAACTGACAT GCTTGTTGCC GGAATGAGAG CTAACTGGTA TGGGCTATTG GGGAAAAGAT TCAAGGACAC  ATTTGGGCAT GTTGGAGGGT CTATTTTGGG AGGACTAGTA GGGCTAAAGA AACCCGAAAA CCACGGGGTA CCCTACTCGC TAACAGAAGA GTTTACAAGT GTTTATCGAA TGCATTCTCT  CTTACCTGAT CAACTTGTCA TAAGGGATCT TAATTCCACA CCAGGCCCTA ACAAGTCCCC AAAGATTACT AAGGAGATTG ACATGATCAA TTTGATTGGA AAGAATGGAG AAAAGGAATT  ATCAAAAATT GGATTTACCG CACAAATGGT ATCCATGGGA CATCAAGCCT GTGGGGCGCT TGAGCTATTT AACTATCCAG TCTGGCTTAG GGACATTGTG CCTCAAAACG TGGATGGGAC  TGATCGCCCC GACCACATTG ATTTAGCATC ACTTGAGATT TATAGGGATA GGGAGAGGAA GGTAGCAAGA TATAATGAGT TCCGTAGATC ACTTTTCTTG ATCCCAATCT CCAAATGGGA  AGATCTAACA GAGGACAAAG AAGCTATTGC CACATTGCGT GAAGTGTACG GTGATGATGT CGAAGAGCTT GATCTGTTGA TAGGAATGAT GGCCGAGAAA AAGATCAATG GGTTCGCCAT  TAGCGAAACC GCTTTTGTTA TCTTTCTAGC CATGGCCTCA AGGCGACTCC AAGCGGATAG ATTCTTCACT AGCGATTTTA ACGAAGATGT GTACACAAAA AAAGGGTTCG AATGGGTGAA  CACAACAGAG AGTCTGAAAG ATGTGTTGGA CCGACACTAT CCAGAGATGA CCGATAGATG GATGAACTCA ACAAGTGCTT TCTCGGTGTG GGATGCTGCC CCTGAGCCTC ATAATCCCGT  ACCAATTTAT TTCCGTCTCC CCAAGTGATA ATGATTTGAT GGAAAAGAAT ATATTACATG TTTGGTATAT ATATATATAT ATATGTATAG GTATTGCATC TGTTGGGGTG CCTTATTCAA 
TATTTATTTG CTTAGTATAT ATGCGTTATT TAATTTTCTT ATGGAATAAA AGAATATGTG ATGTATATTG TATGTATCTT GATGAAATAA AAGGCAATTG GATTATTATA TTGTTA 

Retrieve as FASTA