Entry information: HaDiOx04 (HanXRQChr14g0449991)
Entry ID 13773
Creation 2016-06-01 (Christophe Dunand)
Last sequence changes 2017-12-07 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Last annotation changes 2017-12-07 (Christophe Dunand)
Peroxidase information: HaDiOx04 (HanXRQChr14g0449991)
Name (synonym) HaDiOx04 (HanXRQChr14g0449991)
Class Alpha-dioxygenase    [Orthogroup: DiOx001]
Taxonomy Eukaryota Viridiplantae Streptophyta Asteraceae Helianthus
Organism Helianthus annuus (Sunflower)    [TaxId: 4232]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox       Score   E-value   HaDiOx04 start..stop   Subject start..stop
HaDiOx06    1285    0         1..643                 1..643
HaDiOx05    1245    0         1..643                 1..629
HaDiOx07    1130    0         1..643                 1..626
NatDiOx01   1020    0         1..642                 1..641
Gene structure
Exons
Exon   Start..End             Size
1      140332353..140332472   120
2      140332552..140332818   267
3      140348036..140348185   150
4      140348295..140348428   134
5      140348587..140348800   214
6      140348893..140349038   146
7      140349124..140349351   228
8      140349467..140349669   203
9      140349756..140349999   244
10     140350097..140350322   226
join(140332353..140332472,140332552..140332818,140348036..140348185,140348295..140348428,140348587..140348800,140348893..140349038,140349124..140349351,140349467..140349669,140349756..140349999,140350097..140350322)
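The exon table and the join(...) location carry the same information, so one can be checked against the other. The following is a minimal sketch, not part of the database entry: it parses the location string, recomputes each exon size, and confirms that the summed exon length agrees with the 643 aa protein length listed in this entry plus one stop codon. The function names are hypothetical, chosen for illustration only.

```python
import re

# Location string copied from this entry (line-wrap whitespace removed).
LOCATION = (
    "join(140332353..140332472,140332552..140332818,"
    "140348036..140348185,140348295..140348428,"
    "140348587..140348800,140348893..140349038,"
    "140349124..140349351,140349467..140349669,"
    "140349756..140349999,140350097..140350322)"
)


def parse_join(location):
    """Return exon intervals as (start, end) tuples, 1-based and inclusive."""
    cleaned = re.sub(r"\s+", "", location)  # tolerate stray spaces from wrapping
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", cleaned)]


def exon_sizes(exons):
    """Size of each exon in base pairs (inclusive coordinates)."""
    return [end - start + 1 for start, end in exons]


exons = parse_join(LOCATION)
sizes = exon_sizes(exons)
cds_length = sum(sizes)

# Number of codons minus the terminal stop codon gives the protein length.
protein_length = cds_length // 3 - 1

print(sizes)
print(cds_length, protein_length)
```

The recomputed sizes match the Size column above, and the total of 1932 nt corresponds to 644 codons, that is 643 residues plus the stop codon.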


Literature and cross-references HaDiOx04 (HanXRQChr14g0449991)
DNA ref. HanXRQ genome:   HanXRQChr14 (140332353..140350322)
Cluster/Prediction ref. HanXRQ:   HanXRQChr14g0449991
Protein sequence: HaDiOx04 (HanXRQChr14g0449991)
Sequence Properties (protein; no mature-protein values are given)
Length (aa):   643
PWM (Da):   73499.72
PI (pH):   6.67
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MKTFMSFAKQQLLSPFKHFIHADFHELFERMTLIDKFLFLIIHGVDKSGIGWHRFPVFLGLTYLAIRRILHDKYNLLSVGKTRVGVRFDTDEVDFRTANGKFNDPLNESAGSEGTFFGRNMPPVDQRDKLLKPDPMVVATKLLARKQLIDTGKQFNMIAASWIQFMIHDWIDHLEETNQIELRAPAEVASECPLKSFRFFETKEIDTGLSDIKKGHRNIRTPWWDASAVYGSNLNAARHIRTFIDGKLKIAKDGLLQHDNDGLPIAGDIRNSWIGVSTLQALFIHEHNAVCDTLKEYPYLDDEDLYRHARLVTSAVIAKVHTIDWTIELLKTDMLVAGMRANWYGLLGKRFKDTFGHVGGSILGGLVGLKKPENHGVPYSLTEEFTSVYRMHSLLPDQLVIRDLNSTPGPNKSPKITKEIDMINLIGKNGEKELSKIGFTAQMVSMGHQACGALELFNYPVWLRDIVPQNVDGTDRPDHIDLASLIYRDRERKVARYNEFRRSLFLIPISKWEDLTEDKEAIATLREVYGDDVEELDLLIGMMAEKKINGFAISETAFVIFLAMASRRLQADRFFTSDFNEDVYTKKGFEWVNTTESLKDVLDRHYPEMTDRWMNSTSAFSVWDAAPEPHNPVPIYFRLPK

Remarks Intron 2 is very long (15,217 bp, between exons 2 and 3), but the gene prediction is confirmed by one EST from Helianthus petiolaris.
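The remark about intron 2 can be checked directly from the exon coordinates in the gene structure section. The sketch below is illustrative only (the helper name is hypothetical): it derives each intron length as the gap between consecutive exons and reports the longest one.

```python
# Exon coordinates from this entry (1-based, inclusive, on HanXRQChr14).
EXONS = [
    (140332353, 140332472),
    (140332552, 140332818),
    (140348036, 140348185),
    (140348295, 140348428),
    (140348587, 140348800),
    (140348893, 140349038),
    (140349124, 140349351),
    (140349467, 140349669),
    (140349756, 140349999),
    (140350097, 140350322),
]


def intron_lengths(exons):
    """Length of each intron: bases strictly between consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


introns = intron_lengths(EXONS)
longest = max(range(len(introns)), key=introns.__getitem__)

for number, length in enumerate(introns, start=1):
    print(f"intron {number}: {length} bp")
print(f"longest: intron {longest + 1}")
```

With these coordinates intron 2 is 15217 bp, while every other intron is shorter than 160 bp.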
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAAACATTCATGTCTTTTGCAAAACAACAACTTCTTTCACCCTTCAAACACTTCATCCATGCCGACTTTCATGAACTCTTTGAAAGGATGACACTCATTGACAAGTTTCTCTTCTTG
GTATGTTAATGCTGCATGCATGTAATTAAACTTTTAATTTACTATAGCTCATTTTGTGATTATAATTATACTTTGGCAGATCATTCATGGTGTCGATAAATCCGGGATAGGATGGCACCG
TTTTCCTGTGTTCTTAGGTCTTACTTACTTGGCCATTCGCCGGATTCTTCATGATAAGTATAATCTGCTCAGTGTTGGGAAAACTCGGGTGGGGGTTCGATTTGATACCGATGAGGTTGA
TTTCAGGACCGCCAATGGAAAGTTTAACGATCCTTTAAATGAAAGTGCCGGCAGTGAAGGAACTTTCTTTGGCCGTAACATGCCCCCGGTTGATCAGAGAGATAAG
GTATTGAAGAGGTA
TGGTCCCAATTCCCTGCCTACATGGCAGGGACCACACCACTTTTTGTGCACACAAGGCATCCACATCATCTGGATCTTCTAGAAGCTTCCAGAAGACAACCGGATAAGGCCCGGCGGGCA
TCAAAGATGCCTCGCCCAACATCAAGGACAATACGTGGCACCATACACAAGAAGCCACATGGGCCTGCGACAATCCAGGGAGCAGGGCTGACATGACCAGTACGAGGTGCCCAACACCGT
CGAATGCGGGGACGCCACGTGTACCACTACGCCGTCCCTGACAGAGCAGACCAAGAGGATATTCCCCTTGGTCGGACAGCTGGCGCACATCCACAGCTGGCACAGCTACTTCCTCCTTCT
TCACCCTCCGGCTATAAATAGAACCCTTCATCATTCAGGTTAAGGATCTTGGCTCTCTTTACTCACTCTATACACACACTGTTTTATTCATCTCGGAACAGTACTTATTCTCACGCCGGA
GCCTGGTTAAGAGGGAAAACCTCTCTTTCCCCTCTTAACGAGACTAACGGTGTTTACTGTTTTGCAGATCTCGAGCCTTGGATACGAGCAAGAGAGGAGGTTGAACCCTATAAGTGAAAC
GACCCCCTTGGTTATCCCTTGTGTTAACCATTGTTTCAACATTGGCGCCATCCGCTTTTTTGCAAGACCACTCTCACCTCTTTTTCTCTTTTAGAAAAACTCGTAAAAATGGCAGACCAG
AATCATTCACACCCAGCTGACGGAGAAATCTCTTCTTTTGAACTCGTTTCGGACACGGCACACGTCCAACGCAGTCAAAGGAACGAGACCATCCAGGAAAGCCAGCTGAACAACGAATTC
CCGTCCATTTTTGGAAGTGCGTCTAGGGCTGCTAGCCAGACCACAACTGGACCCATTTTCCAAACACCAACAAGAATTATCACTCAGACCACAAACGGGGCTGCTCTCCAACCTCCAACG
GGGATGTTACATCAAACCCCACCGCCGACGCAGACTGTAGGCCATGGGCCAGGCCCTTCGGCACCATCGGAACAAGCACAACTCAACTATTCTGCACTTTTAGGGCTACCCGAAGGAAAA
ACTCTGGCTTCCTGGTATGCCGAACAGATGGCGTCTATAAACCTTGTCTATACGCAGCTCAGCGCACAACAAGCCTTACTCCAAGCACAGGCTAACCAATCAGCGTTCGTAACTCCACAA
CCAAGGTCTCTGAGTACACACACGGCTCAGCAGACAAACGCGTGGAATTTACGACCAGAAAGAGAACCAGTGCAACAGGTCAGAAGACCCAGCATACAAGACACGCGCGATACCTATGCT
GAAACAGAGAGCAACTTCGTCCAAACTTCCAATCAGCAACGAAGACCGATCCAAACCCGCTTGGGCGCGCGAAACATGAATACAGAATGGGAAGAGGAGGAAGACGACCCAACGTACAAG
GCAGAATCCACAGTGTTTAGCAGACTTCCTCCAGAGCATGAGGCTTACAAACCAACCAAGCGCGCGGGGTACAACCCCAAAGCAGAACACGACTTCACCTTAAGCTATCGTCCTGAGGAC
ATGGCTGAAAATTCAAAATTTATTCCAGAAATCGCGTGCGCGGCCATCGACAAAACAAAGTTACCGCACAACGTAGGTAAATACAATGGGTTGACGGATCCAGATGATCACCTCCAGGTG
TTCAAAGGCGCAGGAGCAACAGGTGGTTGGAACCTACCAACATGGTGTCACTTGTTTGCTCAAACTTTCGTTGGTGCGGCACGCATCTGGTTCGACAATTTACCAGCTGGAAAAATCAAG
TCATGGGTCGACTTCCGAGAAAAATTCTTAGCACACTTTTCTCAACAGCGAAGACACGCCAGAGACCCAGGTGATTGTCTGAACATATACCGAAAAGACTACGAAAGCGTGGAGGATTTT
ATTACGAGGTACAACAAAGAATGTCTGGAAATTGGAGACATACCGGAAAAAATGATGCGCGCACACTTCATGCGAGCAGTTAAATGCGACGATCTGGTTAAAAGAATCAAAGGGCGTGAC
GGAGGACCCAAAGACTGGGAAACCTTCATTGAAGCAGCCAAAACCATTGCGCAGACAGATAGGCAACTGACCGGTGACGATCACCGTCAGCGCGCACACAACCACCACGATCGAAACAAC
AGAAGGGGTAGAAATCAACCCTGGAGGGCTTCCGGGAACAGAGAAAGAAGTCCCCCACGGGACGACGCACGCCATACGATCAATCAGATAGCCCATCGAAAAGAAGTAAAGCGCGAAAAT
AGAGAAAAGCAGTGGACTCCACTAACTAAAACACCTTCTGAAGTTTTAGCTACAGAGAACCATCAATTCAAGCCACCTTTGCAGATGCGCAACAAAAGGGGTCAAGACCCAAATCTCTTC
TGTGAATTCCACAAAGACACGGGCCACCTGACCGATGATTGCTTCAGCTTGAAACAAGAAATCGAAAGAGCTCTAAGAGACGGCAAGCTCGGTCACTTAGTCAAAGGAGGAAAGCGCGAT
TACCGCCAGATACAACGAAGAGACGAAGGTCCAGACAACAAGAAGCTCAGAAAGCTAGAAACCCATATGGTGCAAGGAGGACCACGGCGACCAAGAAAAAACTACAACAAACGCGCGCAG
GATGATTCATGGCGCGAGAAGCAAGTAGTATTCCCAGTTGTCAGGGGAGGTCCAAGAGAAAAGCGGCCAATAGTCATTCCAGGGGTGATCGGCCACTACCAAACAGATTACATCTTTATT
GATCCAGGAAGCACCGCAGACATCATATATGAACAGTGCTTCAATCAATTCGACCAAGAGGATAAGGCGCGCCTGGAACCAGTTGACTACCCACTAACTGGATTCTGCAATGAGGCCGTC
TTTCCCCTAGGACAAATATCTTTCCCAGTATTACTTTCTGATGGGAGAAATTCAAGAACTGAAGAAGTCACATTTATGGTGCTACCGGCACATTCAAGACATGACATCCTTTTAGGACGA
GAATCCCAAGGAGATTTCAGCATGATCTGTTCCGCACCACATTCTGCCATAGGTTTTCCAACCGAAACAGGCGTTGCGTTGATATACGCAAGCAAGGAAGTGCTAGCAACAGACGAAATC
AGGCCGGCAAAAGCAAGCAAGCCCGCACCGCGCAGAGAGGCAGAAAAATGGGTATTGAACAGTGCATACCCAGAACAAACGGTCACTCTGGGACCCGCAATGTCTGACCTAACGCGTGCG
GCGTTAAAGAAATTACTGCATGAAAACATGGATGTGTTCGCCTGGACACCAGCCGATATGGTTGGCGTTCCACGACACATTGCGGAACATCGGTTAAACGTCTCAGAGGATGCAAAGCCA
GTAGTGCATGCTAAACGACACCTGGGGGACATCAAACATGATGCAATGAAGGAACAAGTGTTAGAACTGCTAAACGCAGGAATCATCAGGGAAGTCCGGTACCAAACGTGGGTGGCAAGC
CCAGTCATGGTGAAGAAACCGAATGGTAGTTGGCGAATGTGCGTCGACTACAAGGATCTGAACAAAGCATGCCCCCGTGACTGCTATGCGTTGCCCGACATAGACGAGAAAATAGATTCT
TTGGCAACGTTTCGGTGGAAATGCTTTCTGGATTGCTACAAGGGATACCACCAGGTCCAGATGGCTGTTCAAGACGAGGATAAAACCGCTTTCCGCACGCCAACGGGGCTATACTGCTAC
ACCAAGATGCCGTTCGGCTTAAAGAATGCCGGTGCTACGTATCAACGATTGATGAACGAAACATTTAGCGACGCCATCGGTAAATACATCGAGGTATACATGGACGATCTGGTAATCATG
AGCAGGGAGGAGAGCGCAATGCTGGTAAATATCCAGAAAACCTTCAACACGCTGCGAAGCGTGAGCATCAAACTGAATCCAGCAAAATGCTCATTTGGAATGGAGGAAGGAAAGTTTCTG
GGATTCATAGTCACCAAAGACGGTTTTAAGGTGAACCCAGAAAAGGTCCAGGCCATAGAGAGGATGCCTTCACCAGCAAGCATCAAAGATATGCAAAAGCTCGCAGGACGATTGGCAGCC
CTCAATCGATTCCTAGCAAATCACGCGGCAAAATCTTTCCCATTCATCAAGACCTTACGAAACTGCATGAAGAAAACCCAATTCCAATGGACTCCGGAAGCAGAAAGCGCGTTCCGCGAG
ATGAAAGACTGTCTCATCAAGCTGCCAACTCTAACCGCACCTAACAAAGGAGAACCTTTGGTTTTGTACCTCTCAGCTTCCGACAGGGCCGTCGGTGCCGTATTGCTGGTTGATCGACAA
GGTGTCCAAACACCTGTGTATTATGTGTCCAGAACCCTAACCGACCCAGAAACAAGATACGCAATCATGGAAAAGCTTGTCCTTGCACTGATTCACGCGTCAAGAAGGCTACGCCGATAT
TTCGCCAATCACGTCATCCACGTGTTAACAAATTACAATATTGGGAATATCCTAGCAAGGCCAGAAATATCAGGAAGGTTGGCCAAATGGGCGATAGAGCTAGGGGGACTCAACGTAGTC
TTCAGACCACGACCGTCGATAAAAGGCCAAGTTTTGGCAGACTTCATGACGGAAGTCCCCGATGACAAAGACAGAGAATGCAAGGCGATGGAGAAGGCGGAGAAAAAACAAATCGAAGAA
CCATGGATGTTGTATACGGACGGCGCGTCCAACGAAGATGGGGCAGGTGCGGGGTTGCGGCTAGTGAGCCCAGACAAAAACGAGTTCACCTACGCCATACGTCTAGACTTCAAGAGCACA
AATAACGAGGCAGAGTATGAAGCCTTTCTGGCCGGCTTGCGCTTAGCAATCAAAATGGGAGTCCGACATATTGAGGCACATGTGGACTCCATGCTAGTGGCAGGACAAATCAACGGTCAA
TACGAAGCCAAGGGCGACATCATGGCACTCTATCTCAACCAGGCAAAGACGTTGCTGCAAACTTTCTATTCTTACAAGGTGCACCACATAAACCGCAGCGAAAACAAGCCAGCAGACGCC
TTAAGCAAACTTGCGTCAACAAGTTTTCAGCACCTAGCCAAAGACGTACGAATAGAAGTCTTAAGCAACCCGTCTGTGCCACTCCGCGAAGTCAGCGTCATCCAAACAGGAACCACGTCA
TGGATGACACCCATCATCATGTACTTACAGTCCGGAATACTCCCAGAAAATAAAGCCGAGGCGCGGAAAATCCAATATAAGTCAGAACATTATCAAATGGCGGATGGGATATTGTACCGA
AAGTCATATCTCGGCCCTCTGTTGAGATGTGTCGACGCCGACGACGCAAATTATCTGATCCGGGAAGTACATGAGGGCATCTGCGGCATCCACGCCAGGCCACGCATGGTAGTGGCTAAA
GTAATGAACGCCGGGTACTACTGGCCCGGAATGCACCTCGATGCCGTGAAAGAACTAAGGAAATGCAGCGGCTGCCAACGGCATGCACCAAAAACCATGCGGCCAAAAAATGCGTTGGTG
CCTGTAACAACCGCATGGCCCTTTCAGCAATGGGGCATAGACATGGTGGGCCCCTTTCCAGAAGCTCCGGGGGCAGTCAAGTTTATCATCGTCGCGGTCGATTACTTCACCAAGTGGGTA
GAAGCAAAAGCACTTGCGTCAACCACGTCAGCAGTCGTTAAACGCTTTATCTGGGAACAAATCATATGCCGTTTCGGCCTGCCACTCCGAATCATCACCGACAATGGTACAAACTTTGCA
GCAGATGATCTCGAACGATGGTTCAAGGAACTAAACATCGAACATACCTTCTCGTCGGTCGCACATCCGCAAGGGAATGGGCAAGTTGAAGCGGTCAACAAGAGCATCGTCGATGGCATC
AAAGCAAGGCTCGGTGAAAAAAGACGAGGGTGGGTCGACGAGCTACCAAGCATATTATGGGCCCATAGAACAATGCCCAAAACAAGCAATGGAGAAACACCCTTCAGCTTGGTCTATGGG
TCCGAAGCTGTGATCCCAGCAGAAATCGGACTCCCATCTCCAAGAATGCTCTCCATGAATCTGATCAACAACGAAGAAGAACGGAGGATCGATCTAGACCTCCTAGAAGAACGAAGAGAG
ATGGCTGCAATCAATGAAGCCAAGTACAAATCAAAGCTTGAAAAGTATTACAATTCCCGAGTCCGGATCTGCACCTTCAACCCAGGCGATTATGTCTTAAGGGACAATGAAGCATCCAAC
ACAGAAAAACCAGGGAAACTGGCTCCCAAATGGGAAGGCCCATACATCATTGATGCGATCCTCGGCAAGGGAGCATACAAGCTACGCACCATGAACGACAAAGAGGTCCCGCGAACCTGG
AACGCCCAACAGTTACGAAAATGCTACATATAAACATGTAATCGTATAAGGCGAATCGCCGACACATTTACTTAATACAAGAAGCGTTTGGCTACCTCTATTTCATGCAAAATTTTTGTC
ACAACTGCATTTATTACTTCGGGCGTACGCGAAAACATCAATGGCTCGACCATAGGAAACGTTGTAGACCTCCAAGGCTCGTCACAACCAAGTGAACAGCCGGGTCAGAAACACAACCAA
AGCCTACAAAACGCCAAAAAACATAACTTCGTGCCCACATAAACACGAAAACATCGATAGCAAGGCAACTAAACATTGTAACGCTCCCAAGGCTGCAATCCCAGCCGAGCTCACACAATA
ACCTGCAACTCGATCACGTCGTATATCACATACAAGCGCGCATACTTAAACGACAGGATAAAAACGTTGTCATAACACAACATCACCTGTCAGCGCCATCATTCATTCGCGACACCTAAT
CAAAGTAATTAAAACATGCACATATTATGACGATTTCAAAATTTGCATAAACACAATCAGAGCAACAAACAATCGTCAAAGTACACAAAAGTACAAAGTAAGCAAATTGTCTTGGAACAT
CATTACAAGCCCATACAAGGCTGTACGCGGCGCACAGGCCGGAAATCCCAAACTAAGTGCGGCTGGAACCAGCACCACCCGCACCGCTAGCATCATCTCCTCCCAATGCCTTCTTCAACA
AAGCTACCCCATCCGCTTTGAGGGCCAACTTACCAGCAGCCTTAACTACGGCAAAATCAAGCATTGCAAACTCTCGCCGTTTCTTCGCATACGCATTATCGCAATCAGCTTTATACAGCT
CAAACTTATAATCCTTCTCATTGTTTGCAGCAGCACACCTGCCTTCAGCATATCCATTCTTGCGGCCACTATTATAACCTGCCCCACCCAACTCAAACATGTATTGGGCAAGCTCAGGAG
AATTCAAGATACGATCAGCAAGCTACATGAAAAACAAGGTTGAATAAGATAGCAGAAAAAACTACACAAAAGTAGAACAGGCAAAGAACATTACCAATGGAACACCACGAGATAGCAGCC
ACCTACTATCAGCTGAAGCTTCCTCAGCAAGGAGCTCAGCGGCACGGCACTCCGCCGCCTTCTCATCAAGAAGACGCTGAGCAGCCTCACATTCCGAAGTCTTCGCCTGCACAAGAAGCT
CCAGCTTATCTATCCGCGCAATGTACTCCTTTTCCTTCTTCTCCGCCGCAAGACGCGTTTGATGAGCCTCTTCCGCGAGAGCAGTCTTCTCACCAGACAGCCGCACAATAGCATCACGCT
GCGACTGCATCTCCGCGTTCGTACGAGCACAGGCCGACTCCCAATCCTTTTGTTTTTGCTCGTTTAACTGTTTCTGCTCTTCAAATTTCCGCTCAGCCTCAGTAACCCGCCATTGAAGAC
CCTTCTGCTCTGTATTAAATTTTTCCTTCTCTTCAGCAAATGCCTTCTTGGCCTCCTCAAACGCCATAGTTTCTTCTCCCATAGAACGCCACTCACGAAGAATCTCCTGAGAGGTGGCAA
AGAAGTTAACCCCAGCCGTAACATGATCATCTATAAGGGCAAATCGGCTGCGGTTTTTCTGAAACATCCTCTCGGCAGGAGGAAGGGATAATGAGAAGAACTCCTGACAGTTCTGCAAGT
CATTCATGCGGGAGCCCTGAGTAAGGCCCCAACGGGGGACGTGCGGCAAATCATCGCAAGCCGGGTTACCCCAGGAATCACCAGGATTTTGTTCAAAGTGCGGACTCCGCACAAAGCCAG
AAGTGGCTCCACCTTCAACGGGAGGACGTTTTGTGTAAATGGTTTGTTGTGGAGTAGTCTCACTAGACTCCATCTCCACCTCAATACCTTTACCCTTATCAACTCCACCAGCATCACCAC
CAACATCACCACCAGCAGCGTCACCTCCCTCAAATATACCTTCAAACCCTTCACCGCCCTTCACAGTTACATCAGCATCATCTCGAGGAGGGGTCAAGCCAACAGTCCTCGAAGGAGGAG
AGGTAGGTGGCGTAATCTGCGAAATATCCACTGGCCGTGGCTTCTTGCTAGTCTTAGATTCTGCCACAAAATTACAATTAGCTAAAAAGACAAAATAAAAGATGACAAAAACATACACAG
AAACTAAGTTACCAGGCCGGGGAGCAGAAGCCTCAAATATCTTCTCCAACAAATTCCCTCGACCCCCACTAAATACACCCAAATCAACCTCAGATTCAGAAGGGGCTGGGGGGGCATCCT
TTTGAAGCTTGGCCCTCTTGGCCGCAAGAAGGGCAGCAGCTTGTTCATCAAGATGACGCTTATGATCATCAAGAGCCTTCTTCTTCATATGCTCAGTTAAAGTGGCTTTATCATCAAGAT
CCGACCCTTGACGCACCTCCCCAGGGCCTAAGCCAGACAACGTATCAGACACTACTACATAATCTAGATAAGAAAGAATGGAGGGTCGCTTACGAGAAGATCCGGCAGCTCCACCCTCAA
CTCCCTCTCCTTTCCTTCCAGACCTATCAATCCGAGCTTTCTTCCTCGTCTCCAACTGAGCAGACGGATCAATAGACGCTTCCACGTCATCATCATCATCTCCAACAACCTCGTGTACAG
GTTCAGCATTGCCACCAGGCACGGTACCTGCACGCGCGGAACGAGACGTTAGATCCCCACCACTCCGACCAGAACTCCCACTGGAAACAACGATGACATCCTCTTGATCAGAGTTAACAA
AGTCACCCAAAACATCCTCACCAAGAACCTCAGCAGCATATCGGTTCAAACTTTGCTCGGTAGGGTGCAAGAAGCGATCTTGTATTTGATCCAGCCACGTCGGGTTACCATCAGCCTGGA
TGGCTTCAACCATGGCACCAGCAGCCTTGGGGTCCAAAGCATTCAACAAGCTATAACCAACTGCAGAAAAACAACAAAGATCAAAGCACAAAACAAGCCTAAAGGGAAGTAAACAACCGA
CAAAATAAGAAGGAATCATACATTTGCCCTTATGGCTGTATACAGGCTGACCCAAAGGATGCTTAGGCACCCATAACATGCTCATCCCAGCACCGACCAAGGCCATTTCATCCAGTTGAG
AAATAGCAGTTGCCTTGGCCGTTATCCTCCCATACCATTCTTGGGCAGCAAAATTCTCCAACGGTTCCACCCTTGGAATCCCTTGATCGACCTGCCTGTACGTCATCTCCGACGGAATCA
CTCCACGACGAATGTAGAAAAACTTCTGTTTCCAGTCATGAATACCCTTTATTGGGGCAGAACACACTGGGACAGCCCCCGTCCTGGCTTGAAAAGAATAAAAACCACTAGAATATGTCA
CACTATAAAAAGTGTTGAACATCTGAAACGTTGGTTCAACCCGATAAGACCGGCAAACAAACTCGAAGTGAGTAATCCGGGGAAGCCCAATGGCATTTATCTGAGAGATATGAAGACCAT
AACCCCTCAAAACACCGGCAACAAACTTAGTAATGGGCAACCTAAAATTGCCCTCTCGGAAAAAAGCAGCATATAAAGTGATAAAACCGGGGGGGGGCATCCAGTGCAGTAGATCCAGAC
GGAGGATATCGAGCCCCCCACTCAGGTCGAAAGTTATGCCCCCTAACGAGGTTATGAAATTCCTCCTCTTTCCAGTTAATGACGGCCTGATCCGGGCCAGGAGGACCCCTACTCCGTTGT
TTCCTTTTCTTCTGAGAAGGATTTTTCTCCGCCATTAAGAAAAATCAAGAGTAAGATACTTCGGATTTGAAAAAATATGAAGGACAAGAAGAACAAAGGAGAAAGGAAGGGAGAGAATCA
CTTAGAAATTTCGAAGATATGGGAGAAGTGAATAACCGCTCTACTATTTATACTCATCGCATTAAATGCGAGAGATAGTGGAACGTCAGGAGGCAGGAAAAGTAACCAATAGATGGCCGA
CACGTCAGAGGAAAGAGAAGACGTCGGCTCATTTTTCGGGAAGCATGGGCAACAGGGTCTGCTGATGTGACAGAAAGTCACTGCAAAAGCAACTGTTCACCTCCGAAAAGACAGGGATGT
CAGACGGTCACACCAAACGCTCCCCCATAACCACCAAACTTCAAACAGAAGAAAGCACGAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAAC
CCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATACGTCCATTTGGC
TCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAAC
CCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGC
TCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGTGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAAC
CCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGC
TCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAAC
CCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGTGTCCATTTGGC
TCACTTTTTTCAAAGAGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGGGCCAAAACCCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAGAGCCAAAAC
CCATGCGGCATGCGTCCATTTGGCTCACTTTTTTCAAAAGGCCAAGACTCACGCGGCATGCGTCCACTCGGCTCAATTTATATTCACCAATTCCAACGCGATGCATAAAAAACCACAACT
CTCATAAGTCCAGAATCCCTCCTAGACGATCAACTCAACTAGAAGGCACACTGGACTGGGGGGACTTGAAGAGGTATGGTCCCAATTCCCTGCCTACATGGCAGGGACCACACCACTTTT
TGTGCACACAAGGCATCCACATCATCTGGATCTTCTAGAAGCTTCCAGAAGACAACCGGATAAGGCCCGGCGGGCATCAAAGATGCCTCGCCCAACATCAAGGACAATACGTGGCACCAT
ACACAAGAAGCCACATGGGCCTGCGACAATCCAGGGAGCAGGGCTGACATGACCAGTACGAGGTGCCCAACACCGTCGAATGCGGGGACGCCACGTGTACCACTACGCCGTCCCTGACAG
AGCAGACCAAGAGGATATTCCCCTTGGTCGGACAGCTGGCGCACATCCACAGCTGGCACAGCTACTTCCTCCTTCTTCACCCTCCGGCTATAAATAGAACCCTTCATCATTCAGGTTAAG
GATCTTGGCTCTCTTTACTCACTCTATACACACACTGTTTTATTCATCTCGGAACAGTACTTATTCTCACGCCGGAGCCTGGTTAAGAGGGAAAACCTCTCTTTCCCCTCTTAACGAGAC
TAACGGTGTTTACTGTTTTGCAGATCTCGAGCCTTGGATACGAGCAAGAGAGGAGGTTGAACCCTATAAGTGAAACGACCCCCTTGGTTATCCCTTGTGTTAACCATTGTTTCAACAGGT
ATATATATTAATCGATGAACTTCAAATAACACATATTGACATATATATCACTAGCTAGTTATCTTTTCCAACTATTGTTCTAAAATTGTATTGATGTTGAACTGGTCATTTTTTATAACC
TTGACTGGTTAGGGTTTCTTTTAGGTTTTGATTGACAAAGAGCAAAAGAGAAATTTTACGACGTCTTTGCTTGAATTCTGACGCTCTCTACAGTACATGAAATTTTCCCAAAGTTTTATT
TGTAATGTTTAGGATGATATTTAAAAAGATATAATACATGTATCTTTTTAAATCCTAGCACAAATCTATAGATATATCCTTAAATTTACGTAACATGTTTTGCAAAATAAATTAGTGATG
AGTAGAGATAGTCGACTGTCTCAACCCAACTTAATATTTAGAGCTTTGTATATCTATATGCTAATTAGAACCCCGTATATTATACGGGCTGAATAAATGTGAATTTATATACTAAATAAT
ACAATAATATATCTTTAAAAACTTTATTTATTACATGGATTGAATACATATAATTTTATATACTAAAAAAATAAAAAGTTATATCTTTAAAAACCCCTTGTATTGTTGTATAAGTTGAAT
AAATATAATTTTATACATCAAATAATGAAAAAAAATACTAAATAATAAATTATTTATATATTTAAAAAACCCCGTGTATTATATGGGTTGAATAAATCTAATTTTATATATCAAATAATA
AAAAGGTTATATTTTTAAAAACTCATGTATTACACAGGTTGTGTAAATGTAATTTTGTATAGTAATTTATAAAAAAAGTTATATCTTTAAAAACCCTCGTGTATTACACGGGTTGAATTT
ATATATCAAATAATAAAAAAATATATTTTTAAATACCTCATGTATTACACGGGTTGGATAAATGTAATTTTGTATACTAAATGATAAAAAAAAGTTGTATATTAAAAAACCCACGTGTGT
TACACAGGTTGAATAAATGTAATTTTATATAGTAAATAGTTAAAACTTATATCTTTAAAAACCTCGTGTAATACACGAGTTGAGTTATGATTAATTTCAGATTTTGATTTCCTGCATTTT
CTCTCCTATTAAATAATAATAATTTTTCTCTCCTTAAAATTACATCTTAAGTCATATTCTTATTTTTATAAAACATATTTAAAATATATATAATATTTTATTTAAAATAAATATATTTAA
AAAATTTACTGATTATAAAAAATTAAATATATTTAAAACATATTACTGATTATAAAAAGTCATAGACAATTTAAATCTGTTACCTAAATTTGGATGTAAAAGTCTAAACGTAAATATTAA
AATAAAATCTCTCTACGATATAATAAAACTATAATGTAACCTATTCCTATATCTAAGGAAATATATTATAAAAATAATAATTTAATATTAATAGTATATTTTCTAAATAAGTTTTTAACC
TATACTATATCTCTACGTATATCTCATAATTATATATATTAAATGGTATATCTCTACGTATATCTCATAAATAATTATTTAATTTTAAATAATTATTTAATTTTAATAGGTTAAAAGATA
GATGACTTCAATGAATGATATGTATCTCCAAAGTGGTTTCTTTTATTATATAGTATAGATTCATACTAAGTAATTGTAAATATAAATCTCAAGGTCAGATAATTTTGACACACGTGATTA
TTTAAAATTTTAATTATCATCAAGAGAATTATGTCCATCTTAAAAGTTAGACTAAAAATTAGATGTTTATTACTCTTGTAGATAAAACACAAGTCATATATATACCGATCGATCCACTGT
CATTGTTCAAGATTTGTTATTCTTTCTTTCTTTTTTTTTTTTTTTTTTTTTTCATTTGTCATGTAGTTATTGGCAGTACTTGATATTCTAACTGCCACTCCGCTATCCCATTAAAACATC
ACGTATGGTCCACACCAACATTTATATAATAAAATTAAAACTTTTTTAGTCTAAATACTATTCTTTTTATATAATATGTCTAATGACAAAGCTGACTTCCTTTTGTTAGATGAGAATAGC
AGTTTTAACCGACATAATTATCAATGAAATTAAAGTGAGATACGATATACACGAAATCCCTTAATTTATTTTTTTAATAATAATTTTTTACAACTAAATATCACCTAATTTTTTACTTAC
AAAAATTTAAACTTATATTTTCATGAGAAACAAATAACATGCCACTAGATCAATAGATCATAAGTACAAATCCTGTTTATTGATCACATGTTTATCTAATTACTACATACTCCATCCGTC
CCATTAAAAACGTCATATTTTGAATTTTCAAAGTCTTTATTTATGAACTTTGACCTTAAATAATTTTGTTTGTGTTAAATAATACTTGATGAAAGTTATATGATTTGAATGTGTTTTACA
AGTGTTTTTATCGGGTTAATTTTTATCAAGTTTTATATAACACAAAAAATATATAATTAAAGTCGAACTTTATAAATAAAGAAAAATTCAAAATAGGACACTTTTAATGGGACGGAGGGA
GTATTAAATAAGCAGGTTTCGAAGATTTCATTCTTATTATACCTCATAAATTTTATAATCATTTTTATACGAGTGTCATCCTTTGCTTGTAGTCTTGTCTGGAGCTGGATTTTGATTTAT
CAAATTTAAAGCAAGTAAATAACTTCTAGATAAAAATGTAACATGCAAAAAAGTTCGTTCATCTAATAGTAGTTATTACTTTTCAAAAAAATCTAATAGTAGTTATTCAATGCAACTCAC
AATAAACAACAGAAATTTGAAATGGTAGTCGAGCTTTGAATGTAGGGTGAGACAATTGTACTAAAACGTATTTCTGTGTGCAGCTATTGAAGCCAGATCCGATGGTGGTAGCAACAAAGC
TTCTAGCACGTAAACAACTTATAGACACCGGAAAGCAGTTCAACATGATTGCTGCTTCATGGATTCAGTTCATGATTCATGATTGGATTGATCACCTCGAAGAAACAAACCAG
GTTCGAT
TAATCAAGATATTATATAATCCCGTGCCATAAGAAGCTAGGATGAACTTTTGTACTAACACATATATAATATGTTACCATGGATTATATATTATGATAACAGATTGAGCTTAGGGCGCCA
GCAGAAGTAGCAAGTGAATGCCCTCTCAAATCTTTTAGGTTCTTCGAGACTAAAGAAATCGACACCGGTCTTTCTGACATTAAGAAAGGTCATCGTAACATCAGAACTCCTTGGTG
GTAT
GATTTCTATAGTTTAATTAATCATATTTTATAACCTTGCATTGTTAAAAACTCTCATTAATTAATAGTTATAAAAAAAGAATTGAAAAAGACAATCAGTATATATTCAAATAAATAGTAT
GAAGTTAAGAGTGGGATATATATGATATGTACAGGGACGCGAGTGCGGTATATGGAAGCAACTTAAATGCTGCTCGTCACATAAGAACGTTCATTGACGGAAAGCTCAAGATTGCAAAAG
ATGGTCTCCTTCAACACGACAACGATGGATTGCCCATAGCGGGAGACATTCGTAATAGTTGGATTGGGGTGTCAACTTTGCAAGCCCTCTTTATCCACGAGCACAATGCGGTTTGTGACA
CCTTAAAG
GTATGTGTGTATATGGTAATTGGTAAACATGTAGCCAGTTATTTTTAGTTCATCAGCTAGCTAGTAGCTTACAATATACTTATGTCTTTCAGAAAGAATATCCTTATTTGGA
CGATGAAGATCTGTATCGCCACGCAAGACTAGTAACTTCTGCGGTGATCGCAAAGGTCCACACCATTGATTGGACCATTGAGCTTCTCAAAACTGACATGCTTGTTGCCGGAATGAGAGC
TAACTG
GTAATTACTTATTTTAAAATAAATAAACTTCATTCTTGAAATGAAGCATATTTACTAATATTTTGTTGAAATTTGTTATGTCAAGGTATGGGCTATTGGGGAAAAGATTCAAGG
ACACATTTGGGCATGTTGGAGGGTCTATTTTGGGAGGACTAGTAGGGCTAAAGAAACCCGAAAACCACGGGGTACCCTACTCGCTAACAGAAGAGTTTACAAGTGTTTATCGAATGCATT
CTCTCTTACCTGATCAACTTGTCATAAGGGATCTTAATTCCACACCAGGCCCTAACAAGTCCCCAAAGATTACTAAGGA
GTACGTAAATCACACCTAAGTTATTTTTTCTTAATTTGAAT
AACAATTTACTTCTTTGTATCTTCATGTAATAACCAACAAATAAGTATGCTCATAGTTATTAATTTACATTCAGGATTGACATGATCAATTTGATTGGAAAGAATGGAGAAAAGGAATTA
TCAAAAATTGGATTTACCGCACAAATGGTATCCATGGGACATCAAGCCTGTGGGGCGCTTGAGCTATTTAACTATCCAGTCTGGCTTAGGGACATTGTGCCTCAAAACGTGGATGGGACT
GATCGCCCCGACCACATTGATTTAGCATCACTTGAGA
GTAAGCTCCTTTTGTCTTTCAATAAAGAATGTGTATATATATACATGATTAATTTTGATCCTAATATATGATCATGATGCGTG
CAGTTTATAGGGATAGGGAGAGGAAGGTAGCAAGATATAATGAGTTCCGTAGATCACTTTTCTTGATCCCAATCTCCAAATGGGAAGATCTAACAGAGGACAAAGAAGCTATTGCCACAT
TGCGTGAAGTGTACGGTGATGATGTCGAAGAGCTTGATCTGTTGATAGGAATGATGGCCGAGAAAAAGATCAATGGGTTCGCCATTAGCGAAACCGCTTTTGTTATCTTTCTAGCCATGG
CCTCAAG
GTATATATACAAATCACCAAAATAAGTGAATCAAAACTGTAAATAAACAAACTAATGGATAAATTTTTATACAAACTAAATATTGGTTCATGTGCAGGCGACTCCAAGCGGAT
AGATTCTTCACTAGCGATTTTAACGAAGATGTGTACACAAAAAAAGGGTTCGAATGGGTGAACACAACAGAGAGTCTGAAAGATGTGTTGGACCGACACTATCCAGAGATGACCGATAGA
TGGATGAACTCAACAAGTGCTTTCTCGGTGTGGGATGCTGCCCCTGAGCCTCATAATCCCGTACCAATTTATTTCCGTCTCCCCAAGTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAAAACATTCATGTCTTTTGCAAAACAACAACTTCTTTCACCCTTCAAACACTTCATCCATGCCGACTTTCATGAACTCTTTGAAAGGATGACACTCATTGACAAGTTTCTCTTCTTG
ATCATTCATGGTGTCGATAAATCCGGGATAGGATGGCACCGTTTTCCTGTGTTCTTAGGTCTTACTTACTTGGCCATTCGCCGGATTCTTCATGATAAGTATAATCTGCTCAGTGTTGGG
AAAACTCGGGTGGGGGTTCGATTTGATACCGATGAGGTTGATTTCAGGACCGCCAATGGAAAGTTTAACGATCCTTTAAATGAAAGTGCCGGCAGTGAAGGAACTTTCTTTGGCCGTAAC
ATGCCCCCGGTTGATCAGAGAGATAAG
CTATTGAAGCCAGATCCGATGGTGGTAGCAACAAAGCTTCTAGCACGTAAACAACTTATAGACACCGGAAAGCAGTTCAACATGATTGCTGCT
TCATGGATTCAGTTCATGATTCATGATTGGATTGATCACCTCGAAGAAACAAACCAG
ATTGAGCTTAGGGCGCCAGCAGAAGTAGCAAGTGAATGCCCTCTCAAATCTTTTAGGTTCTTC
GAGACTAAAGAAATCGACACCGGTCTTTCTGACATTAAGAAAGGTCATCGTAACATCAGAACTCCTTGGTG
GGACGCGAGTGCGGTATATGGAAGCAACTTAAATGCTGCTCGTCACATA
AGAACGTTCATTGACGGAAAGCTCAAGATTGCAAAAGATGGTCTCCTTCAACACGACAACGATGGATTGCCCATAGCGGGAGACATTCGTAATAGTTGGATTGGGGTGTCAACTTTGCAA
GCCCTCTTTATCCACGAGCACAATGCGGTTTGTGACACCTTAAAG
AAAGAATATCCTTATTTGGACGATGAAGATCTGTATCGCCACGCAAGACTAGTAACTTCTGCGGTGATCGCAAAG
GTCCACACCATTGATTGGACCATTGAGCTTCTCAAAACTGACATGCTTGTTGCCGGAATGAGAGCTAACTG
GTATGGGCTATTGGGGAAAAGATTCAAGGACACATTTGGGCATGTTGGA
GGGTCTATTTTGGGAGGACTAGTAGGGCTAAAGAAACCCGAAAACCACGGGGTACCCTACTCGCTAACAGAAGAGTTTACAAGTGTTTATCGAATGCATTCTCTCTTACCTGATCAACTT
GTCATAAGGGATCTTAATTCCACACCAGGCCCTAACAAGTCCCCAAAGATTACTAAGGA
GATTGACATGATCAATTTGATTGGAAAGAATGGAGAAAAGGAATTATCAAAAATTGGATTT
ACCGCACAAATGGTATCCATGGGACATCAAGCCTGTGGGGCGCTTGAGCTATTTAACTATCCAGTCTGGCTTAGGGACATTGTGCCTCAAAACGTGGATGGGACTGATCGCCCCGACCAC
ATTGATTTAGCATCACTTGAGA
TTTATAGGGATAGGGAGAGGAAGGTAGCAAGATATAATGAGTTCCGTAGATCACTTTTCTTGATCCCAATCTCCAAATGGGAAGATCTAACAGAGGAC
AAAGAAGCTATTGCCACATTGCGTGAAGTGTACGGTGATGATGTCGAAGAGCTTGATCTGTTGATAGGAATGATGGCCGAGAAAAAGATCAATGGGTTCGCCATTAGCGAAACCGCTTTT
GTTATCTTTCTAGCCATGGCCTCAAG
GCGACTCCAAGCGGATAGATTCTTCACTAGCGATTTTAACGAAGATGTGTACACAAAAAAAGGGTTCGAATGGGTGAACACAACAGAGAGTCTG
AAAGATGTGTTGGACCGACACTATCCAGAGATGACCGATAGATGGATGAACTCAACAAGTGCTTTCTCGGTGTGGGATGCTGCCCCTGAGCCTCATAATCCCGTACCAATTTATTTCCGT
CTCCCCAAGTGA
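As a consistency check between the CDS and the protein sequence given in this entry, the CDS can be translated with the standard genetic code. The following is a minimal sketch under that assumption (standard code, translation table 1); the function name is hypothetical. For brevity it is applied only to the first and last codons of the CDS rather than to the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last four codons of the CDS shown in this entry.
print(translate("ATGAAAACATTCATGTCTTTTGCA"))
print(translate("CTCCCCAAGTGA"))
```

The two fragments translate to the start (MKTFMSFA) and the end (LPK followed by a stop) of the protein sequence listed above. Concatenating the CDS lines and passing them to `translate` would extend the same check to the whole sequence.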

cDNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TTCATCAATA CCCAAAATGA AAACATTCAT GTCTTTTGCA AAACAACAAC TTCTTTCACC CTTCAAACAC TTCATCCATG CCGACTTTCA TGAACTCTTT GAAAGGATGA CACTCATTGA  CAAGTTTCTC TTCTTGATCA TTCATGGTGT CGATAAATCC GGGATAGGAT GGCACCGTTT TCCTGTGTTC TTAGGTCTTA CTTACTTGGC CATTCGCCGG ATTCTTCATG ATAAGTATAA  TCTGCTCAGT GTTGGGAAAA CTCGGGTGGG GGTTCGATTT GATACCGATG AGGTTGATTT CAGGACCGCC AATGGAAAGT TTAACGATCC TTTAAATGAA AGTGCCGGCA GTGAAGGAAC  TTTCTTTGGC CGTAACATGC CCCCGGTTGA TCAGAGAGAT AAGCTATTGA AGCCAGATCC GATGGTGGTA GCAACAAAGC TTCTAGCACG TAAACAACTT ATAGACACCG GAAAGCAGTT  CAACATGATT GCTGCTTCAT GGATTCAGTT CATGATTCAT GATTGGATTG ATCACCTCGA AGAAACAAAC CAGATTGAGC TTAGGGCGCC AGCAGAAGTA GCAAGTGAAT GCCCTCTCAA  ATCTTTTAGG TTCTTCGAGA CTAAAGAAAT CGACACCGGT CTTTCTGACA TTAAGAAAGG TCATCGTAAC ATCAGAACTC CTTGGTGGGA CGCGAGTGCG GTATATGGAA GCAACTTAAA  TGCTGCTCGT CACATAAGAA CGTTCATTGA CGGAAAGCTC AAGATTGCAA AAGATGGTCT CCTTCAACAC GACAACGATG GATTGCCCAT AGCGGGAGAC ATTCGTAATA GTTGGATTGG  GGTGTCAACT TTGCAAGCCC TCTTTATCCA CGAGCACAAT GCGGTTTGTG ACACCTTAAA GAAAGAATAT CCTTATTTGG ACGATGAAGA TCTGTATCGC CACGCAAGAC TAGTAACTTC  TGCGGTGATC GCAAAGGTCC ACACCATTGA TTGGACCATT GAGCTTCTCA AAACTGACAT GCTTGTTGCC GGAATGAGAG CTAACTGGTA TGGGCTATTG GGGAAAAGAT TCAAGGACAC  ATTTGGGCAT GTTGGAGGGT CTATTTTGGG AGGACTAGTA GGGCTAAAGA AACCCGAAAA CCACGGGGTA CCCTACTCGC TAACAGAAGA GTTTACAAGT GTTTATCGAA TGCATTCTCT  CTTACCTGAT CAACTTGTCA TAAGGGATCT TAATTCCACA CCAGGCCCTA ACAAGTCCCC AAAGATTACT AAGGAGATTG ACATGATCAA TTTGATTGGA AAGAATGGAG AAAAGGAATT  ATCAAAAATT GGATTTACCG CACAAATGGT ATCCATGGGA CATCAAGCCT GTGGGGCGCT TGAGCTATTT AACTATCCAG TCTGGCTTAG GGACATTGTG CCTCAAAACG TGGATGGGAC  TGATCGCCCC GACCACATTG ATTTAGCATC ACTTGAGATT TATAGGGATA GGGAGAGGAA GGTAGCAAGA TATAATGAGT TCCGTAGATC ACTTTTCTTG ATCCCAATCT CCAAATGGGA  AGATCTAACA GAGGACAAAG AAGCTATTGC CACATTGCGT GAAGTGTACG GTGATGATGT CGAAGAGCTT GATCTGTTGA TAGGAATGAT GGCCGAGAAA AAGATCAATG GGTTCGCCAT  TAGCGAAACC GCTTTTGTTA TCTTTCTAGC CATGGCCTCA AGGCGACTCC AAGCGGATAG ATTCTTCACT AGCGATTTTA ACGAAGATGT GTACACAAAA AAAGGGTTCG AATGGGTGAA  
CACAACAGAG AGTCTGAAAG ATGTGTTGGA CCGACACTAT CCAGAGATGA CCGATAGATG GATGAACTCA ACAAGTGCTT TCTCGGTGTG GGATGCTGCC CCTGAGCCTC ATAATCCCGT  ACCAATTTAT TTCCGTCTCC CCAAGTGATA ATGATTTGAT GGAAAAGAAT ATATTACATG TTTGGTATAT ATATATATAT ATATGTATAG GTATTGCATC TGTTGGGGTG CCTTATTCAA 
TATTTATTTG CTTAGTATAT ATGCGTTATT TAATTTTCTT ATGGAATAAA AGAATATGTG ATGTATATTG TATGTATCTT GATGAAATAA AAGGCAATTG GATTATTATA TTGTTA 
