Entry information : HsEPO (EPX)
Entry ID 3317
Creation 2006-07-14 (Christophe Dunand)
Last sequence changes 2010-12-21 (Myriam Duval)
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2010-12-21 (Christophe Dunand)
Peroxidase information: HsEPO (EPX)
Name (synonym) HsEPO (EPX)
Class Eosinophil peroxidase    [Orthogroup: EPO001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Homo
Organism Homo sapiens (human)    [TaxId: 9606 ]
Cellular localisation Apoplastic
Tissue types Blood, Bone marrow, Mixed tissues, Thymus
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox     Score   E-value   HsEPO start..stop   S start..stop
GgoEPO    1456    0         1..715              1..715
PtroEPO   1456    0         1..715              1..715
MmulEPO   1415    0         1..714              1..714
CjaEPO    1383    0         1..715              1..715
Gene structure join(56270208..56270283,56270399..56270492,56270732..56270907,56271075..56271192,56271324..56271453,56272325..56272531,56274300..56274618,56276401..56276561,56276900..56277155,56277586..56277756,56280442..56280679,56281583..56281784)
Exons
Exon   Start..End           Size
1      56270208..56270283   74
2      56270399..56270492   92
3      56270732..56270907   174
4      56271075..56271192   116
5      56271324..56271453   128
6      56272325..56272531   205
7      56274300..56274618   317
8      56276401..56276561   159
9      56276900..56277155   254
10     56277586..56277756   169
11     56280442..56280679   236
12     56281583..56281784   200
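The exon coordinates above can be checked against the protein length with a short script. The sketch below is illustrative only and is not part of the database's own tooling; the helper name `parse_join` is hypothetical. It parses the `join(...)` location string, computes inclusive exon lengths (end - start + 1), and sums them. Note that the inclusive lengths are each 2 larger than the values in the Size column; the convention behind that column is not stated in this entry, so the script does not try to reproduce it.

```python
import re

# Location string copied from the "Gene structure" field of this entry.
LOCATION = (
    "join(56270208..56270283,56270399..56270492,56270732..56270907,"
    "56271075..56271192,56271324..56271453,56272325..56272531,"
    "56274300..56274618,56276401..56276561,56276900..56277155,"
    "56277586..56277756,56280442..56280679,56281583..56281784)"
)

def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string.

    Hypothetical helper for illustration; handles only simple
    'start..end' segments, as used in this entry.
    """
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]

exons = parse_join(LOCATION)

# Inclusive length of each exon segment (both endpoints counted).
exon_lengths = [end - start + 1 for start, end in exons]

# Total coding length in nucleotides, and the number of codons it holds.
cds_length = sum(exon_lengths)
codons = cds_length // 3

# Number of introns is one fewer than the number of exons.
introns = len(exons) - 1

print(len(exons), "exons,", introns, "introns")
print("CDS length:", cds_length, "nt =", codons, "codons")
```

With these coordinates the total is 2148 nt, i.e. 716 codons, which is consistent with a 715-residue protein plus one stop codon and with the 11 introns mentioned in the Remarks field.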
Literature and cross-references HsEPO (EPX)
Literature Sakamaki K., Tomonaga M., Tsukui K., Nagata S. Molecular cloning and characterization of a chromosomal gene for human eosinophil peroxidase. J. Biol. Chem. 264:16828-16836(1989).
Protein ref. UniProtKB:   P11678
DNA ref. GenBank:   NC_000017.1 (56270208..56281784)
mRNA ref. GenBank:   X14346
Cluster/Prediction ref. UniGene:   Hs.279259
Protein sequence: HsEPO (EPX)
Sequence properties (first value: protein; value in parentheses: mature protein)
Length (aa): 715 (698)
PWM (Da): 80888.09 (79204.1)
PI (pH): 10.81 (10.81)
Peptide Signal: cut: 23 range: 18-715
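The relationship between the two length values can be reproduced from the reported range. This is a minimal sketch using only the numbers listed in this entry; the variable names are illustrative. It assumes the "range" field gives the first and last residue of the mature protein. The "cut: 23" value is left as reported, since the entry does not explain how it relates to the range.

```python
# Values copied from the sequence properties of this entry.
full_length = 715            # residues in the complete protein
mature_range = "18-715"      # reported range of the mature protein

# Assumption: the range is inclusive on both ends.
start, end = (int(x) for x in mature_range.split("-"))

mature_length = end - start + 1   # residues kept in the mature protein
signal_length = start - 1         # residues removed from the N-terminus

print("mature length:", mature_length)
print("signal peptide length:", signal_length)
```

Under that assumption the mature protein has 698 residues, matching the value given in parentheses, and the removed N-terminal segment is 17 residues long.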
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MHLLPALAGVLATLVLAQPCEGTDPASPGAVETSVLRDCIAEAKLLVDAAYNWTQKSIKQRLRSGSASPMDLLSYFKQPVAATRTVVRAADYMHVALGLLEEKLQPQRSGPFNVDVLTEPQLRLLSQASGCALRDQAERCSDKYR
TITGRCNN
KRRPLLGASNQALARWLPAEYEDGLSLPFGWTPSRRRNGFLLPVRAVSNQIVRFPNERLTSDRGRALMFMQWGQFIDHDLDFSPESPARVAFTAGVDCERTCAQLPPCFPIKIPPNDPRIKNQRDCIPFFRSAPSCPQNKNRVRNQINALTSFVDASMVYGSEVSLSLRLRNRTNYLGLLAINQRFQDNGRALLPFDNLHDDPCLLTNRSARIPCFLAGDTRSTETPKLAAM
HTLFMREHNRLATELRRLNPRWNGDKLYNEARKIMGAMV
IITYRDFLPLVLGKARARRTLGHYRGYCSNVDPRVANVFTLAFRFGHTMLQPFMFRLDSQYRASAPNSHVPLSSAFFASWR
IVYE
GGIDPILRGLMATPAKLNRQDAMLVDELRDRLFRQVRRIGLDLAALNMQRSRDHGLPGYNAWRRFCGLSQPRNLAQLSRVLKNQDLARKFLNLYGTPDNIDIWIGAIAEPLLPGAR
VGPLLACLFENQFRRARDGD
RFWWQKRGVFTKRQRKALSRISLSRIICDNTGITTVSRDIFRANIYPRGFVNCSRIPRLNLSAWRGT

Remarks Complete sequence from genomic DNA (chromosome 17, 11 introns), 2 mRNAs and 17 ESTs.
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCATCTGCTCCCAGCCCTGGCAGGGGTCCTGGCCACACTCGTCCTCGCCCAGCCCTGTGAGGGCACTGACCCAGAGGTAATAGTCCCCTAGACAGGCAAGGAGGAGGGAGGGGAAATG
GAAGGGGAAGCACTTGGGTCTTGGAGGGGGTCTTGTGGCTTGCTGAACCCTGAGTCCCCATCTCTTTGAACAGCCTCCCCTGGGGCAGTGGAGACCTCGGTCCTGCGAGACTGCATAGCA
GAGGCCAAGTTGCTGGTGGATGCTGCCTACAATTGGACCCAGAAGAG
AGGTGGACTTGGGTCTGGGGGCTGCATGGGCCTGGGAGGATCAGTGAGGGGCATGGGCCCCAGCCAAGTTCAT
CTCACTCATCAGCCACCTCTGGACCCCATGAACATTTCCTGTTGGTAGAGCCTCCCTTCCATCCATCCTTCTGTCCGCTTGCCCTGTCCTGACTGTGCCCCAGGACTGGGTCTCTGCTGG
GTGGGTCTGCACCCTCTCTCCAGCCCTCACTCCTCCTCTCCTGGGCAGCATCAAGCAGCGGCTTCGCAGCGGTTCAGCCAGCCCCATGGACCTCCTGTCCTACTTCAAACAACCGGTAGC
AGCCACCAGGACAGTTGTTCGGGCCGCAGATTATATGCATGTGGCTTTGGGGCTGCTTGAAGAGAAGTTACAACCCCAGCGGTCCGGACCCTTCAATGTCACTG
TGGTACTCTGATCCCC
ACTGAGCCCGCTGGGCCTACCCTGGCCTGGAGTAGAAGGAATCCAGGAGAGGGAGGCAGGGTGCACAGGTTTGGGGTGCTGGGAGGAGAGAGGGTAAAGGGATGGGAGGTACAGAGCAGG
CCAGCTCAGGTCTGCCCATTTGCCTTCCCACAGATGTGCTAACAGAACCACAGCTGCGGCTGCTGTCCCAGGCCAGTGGCTGTGCTCTCCGGGACCAGGCCGAGCGCTGCAGCGACAAGT
ACCGCACCATCACTGGACGGTGCAACAACAA
AAGTGCGTGCGGGGCGGCAGGAGGGGCTGCCCCTGCCTGGGGGACCTCTCCCTTCCTGCACCCACCCTCTCCCTCCATGCTGAGCCATC
TCCAGGCCCTGCCCCCTGCTAACCTATCCCACCCATGGCTGCAGGAGGAGACCCTTGCTAGGGGCCTCCAACCAGGCTCTGGCTCGCTGGCTGCCCGCCGAGTATGAGGATGGGCTGTCG
CTCCCCTTCGGCTGGACCCCCAGCAGGAGGCGCAATGGCTTCCTTCTCCCTCTT
TTGTGAGTTGGGGCTGAGGGTTTGGGAGGTTGCTTGATCTCTTAAATGCGGGGAGTAAAACACAGC
CCAGAGTCACGCAGGCAGGGCTTGAACCCAGGCCCTTCCACTGACCAGTGCAGTGACTTGAGGCAAATAACACAGCTTTCTGAGGCTTAGTTTCCTCATCTGTAAAATGGCCCTAAAACC
TACCTCGTAGAGCCTGTGTGGATATTGAAGTCCTTGGTATAAATAGCTATAGAGGACATGATCTATTCATGCTAACCATTACGGTGTTGTACGGTTTTTTCTGAGAAAATTAGTGTCAAA
GTCTTGCTAAATATAAACAGCTCTATACCTTAATCTGGCTTTATCTATTGATCTATTTCACATTTATCAACTCATCAGTTTTCTATTGATATATCGATAGAGCAGTCTATCATCTACCCA
TCTATCAACATCTATTAATTAATTTTCTTCTATCCACCCTTCCATCTGCTGACCCCGTACTCCCTACGTGTATTATCCATCACCTGGCAACCAGCCCATGCACCTGTGAATCCGTGCTAT
CATCTGGTTAGCTATCAGTGTACAAATTGATCTATCTATTTGATCTCCCTTCCCCTCTGGGAGAGCTGGTGAGGTCTGAGCCAGTCAACCTAGCCCCTCTCCTTCTCTTTACCACCGGAA
TCCTCAGGAGCCCAGCCAGAAACCATCCTTCTAGGAATGAGAGCAGGAGGTGGCTACGCCTCCAGGGACAAAAGGGGCATGGAGGGCAGAAGAGGAGAGGCTGTCAATTCCAGCAGGGGA
GCTGCTGCTCCCTGAGTCCTGGGTTGGCTCTAATACCTTGTGGGGTCAGGGAGCCCATGTCCCGTGCTGATGTTATTTCCCCACCAGGTCCGGGCTGTCTCCAACCAGATTGTGCGCTTC
CCCAATGAGAGACTGACCTCCGACCGTGGCCGAGCCCTCATGTTCATGCAGTGGGGCCAGTTCATTGACCATGACCTGGACTTCTCCCCGGAGTCCCCGGCCAGAGTGGCCTTCACTGCA
GGCGTTGACTGTGAGAGGACCTGCGCCCAGCTGCCCCCCTGCTTTCCCATCAAG
AGGTACCTACCCTCAGCCAATCTCCCATGCCCTTGTGTGGCCTCCCCCAAAGGCAAGGTGCTGGGG
GTGGGGATCTGGAAGACTGGAGCACCATCCTTAAGGAGCTGCCTGTGGAGCTAGGGTATGAGACAGAGACACAAGAAACACAGCTGAGCAGAGACCCCGCGCCGTGTGTGTTTGAGAGGT
GGGGGTAGGGCAATCTGCCAGGAGGCTCAGGTCAGGCTTCATGGAGTGGGAAGCCCTTGACACATGCCTTGACCCATGGGTCTGAATCCAACAGGGGAAGCCTCTGGGCCCCTGCTTTTG
GCAACCTAAGGGCCTCTTAGCTCTTGCCCTTCTCTCCTTCCCAGAGTAGTCCAAGGGTCCTGCGGCTCTTGCCAGCTTCTTGGGCTTGGGCTGTAAGTAGGGTCCTAACCTTCAGGCTAG
AAGCCAAGGACAGTGTGGGGCACTAGGGGAGGAAAGATAGAATCCAGGGAGCAGAGTCTCTGCTAGGAACATAGGGTTGACACACGTGCACACACACACTCTCTCACACACACACACGTG
CACACACACGCTCCCTCTCACACACACATGTACACACACAGTCACCCTTAATAGGTGGGGCCACATTCATGAGGGATGACATGTGAGGGCCTCTGGATGGCCTCTCCCTGTTGCAACCTT
ATCTATTCCTCAATCCCTGGCTTAGGGGCACCTCCTCTGGGGAGCCTTCTCAGATCTTCCTATTACACTTTTAACATATGTCCTCCCCTCCCCTCTCCTCTCTCTCCTTCCTTCCTTCTT
CCTTCCTCTTTCTTTCTTTCTTCTTTCTTTCCTTTCCTTCCTTCCTTCCCTTCCTTCCTTCATTTCCTTCCTTCCTTTCTTTCTTTCTTTTTCTTTCTTTTTCTTTCTTCCGTCCTTTCT
TTCTTTATCTTTCCTTCTTTCTTTCTTCTTTATTATTATTATTATTGTTTTGAGACAGAATCTCACTCTGTTGTCCAGACTGGAGTACAGAGGGGCAATCTCAGCTCACTGCAACCTCTG
CCTCCTGGGTTCAAGCAGTTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGTGCCAACAAGCCTGGCTAATTTTTGTATTTTTAGTAGAGATTGGGTTTCACCATGTTGA
CAAGGCTGATCTCAAACTTCTGGCCTCATGTAATCCACCTGCCTCGGCCTCCCAAAGTGCTAGGATTATAGGCGTGAGCCACCACACCTGGCCCACATATGGCTTTTCTATCAAGCACAG
CAGATGCCACTTCAACTTAGTTTCTTAACTCTATGTGTGTGTGTCTTTTTCCACTGAACTGCCAGCTTCCTGCAACCTTTCCTGTGTTTTTTTAACCTTCATAATCTCCAAAGCAGCAAG
CAAGAAAGGGGCTTAGAAAAGGTCTGTCGAATTGAACAGAATTAAAGAAGCTCAGGCCTAAGAGTCAAGAGAACTAAGTTCTAATCCCACCTTTCCTGCTAACTGGTTGTCACTTCCCCT
CTCTGAGGCTGTTTTCCTTGCTGTAAAATGAGGATCGAGATTGTTTCAAAAGCCTCATTCATTCCTAACCTTTTGAGACTCTATGAAACAAAATGTTAAGGACACTCAGACTGACGGGGT
GAGCAGCATGAGCCTGGGTGAGTCAAGGAGGGCTTCCTGGAGGAAGGGAGTTTTAAGCAAGGTTTTGAGGAGGTGGAAGATGACGGAGAGGAGTGAGTCTGCTATTGAGGGGGCCCCATG
TCACTGTCTCCTCTTCCATCTCAGATCCCACCCAATGACCCCCGCATCAAGAACCAGCGTGACTGCATCCCTTTCTTCCGCTCGGCACCCTCATGCCCCCAAAACAAGAACAGAGTCCGC
AACCAGATCAACGCGCTCACCTCCTTTGTGGACGCCAGCATGGTGTATGGCAGTGAGGTCTCCCTCTCGCTGCGGCTCCGCAACCGGACCAACTACCTGGGGCTGCTGGCCATCAACCAG
CGCTTTCAAGACAACGGCCGGGCCCTGCTGCCCTTCGACAACCTGCACGATGACCCCTGTCTCCTCACCAACCGCTCGGCGCGCATCCCCTGCTTCCTGGCAG
AGGTCAGACAGGGAGGA
AGGTGGTGTCTTCCCAGGAAACAGCCATCCCTGGGGTCCCAACTGGGAAGCAATGGTGGGATGTGGTGAAGGTACATGGTTTGGGACCTCAGTATTAGGCACACCATAAGCATGGATCTG
TGCACAGCCATCATAGAATCAGAACGTTGAGTCCCTTTGCAAGCTCCCTGTGTGTGGCCAGAGATAACTGCTTAACCGTCTCCAGTGACAAGGCTCTCACCGCCTCTATGGCACCCCAGC
TCAGCCACAGGCAGCTCCGACTGTTAGAAAGTCTCCTTATTTTGATCACATATTCACCTATTCCCCTTAATCATGCTTCTGGAACCTCAGAAAAGACACTGTCCCTCTTACACGCAGGAA
TACCTCAAAGTAGCTGTTAACTCCATCTCCATTTTTTTTCTCTTCTCTGGGATAAATTTCCGTAGTTCCTCCCTGTCCCCAGCTGTTGCTAGACAGTTAATACCCTCTTCTGGAAGCATA
ATTTGAACTTTCCTCAGGGTCTTTTGTCCCATGTACATGGTTACAATAATAATGGTAGTCATTTCTCATTAAGATATTTATCTTATTATTATTATTATTTTAAGATGGAGTCTCACTCTG
TCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCAGCTCACTGCCACCTCTGCCTCCTGGGTTCAAGTGATTCCCCTGCCTCAGCCTCCCGAGTAGCTGGAATTACAGGTGCCTGCCACCAC
GTTTGGCTAATTTTTGTATATTTAGTGGAGATAGAGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATTCACCTGCCTTGGCCTCCCAAAGTGTTGGGATTACA
GGTGTGAGCCACCACGCCCAACCTTATCAAGATATTTAAATGTGCCATCTCTAAACCTTGCAACAGCCCTGAAAGGAGGGAATTATTATTGTTCTCATTTTACGATAAGAAACTGAGACT
CAGGGAAAGTGAATCATGCGCCCAAGAGCACAGAGTTAGTTTCAAATGACAGAGTCCACTTGAGAACACAGTCAGTGTAGTATCAAGGTCTCTGCCCAGTCCACTGTATTCCACTGTTTT
CCTCACAGAGAGTCGGCCCCAACCCCTCTTTTGAAGGATTGTACATTTGTGTGTCCTGGAAAGAGCAGCTTGTGCCTTCGGGAGAGGTCAACTGAACTAAGACCTTGTGCAAAAGGGTTT
AGTTAAAACCAAGCAAATGATGGAATTAACTTTCTTTCCCTGCCTGTAGTAGGTCTGTTCGTAAGGACAGTCTCTGCCCAACCCTCCCTCTCTTCCTCTTTCCCTCTGGTTCACGGGGGA
GAGCAGTATTATACCAAGCCCAAAGATAGGAAGAGGTACACCAGCCCTTGGCAGAGAACACGCAGGCAGATGCAGCTCATGGACAGGTGGGCTCTCAGTCCCAAAGTCAAGGTGCCCTTG
CCCATAGCTGGTCCAGGCTATGGATCTGGTTGTCTTGTAGAGTAGAATCCTTAGGCAAAGATGCCCCTGCCTGAGACCAGCAGCTGCCCTGTGCTTTCCCACCTCACCCAAGCCTGCTAG
GAACCCCCGGTCCTTCCGCTCCTGGCCTGTGGTGTTAGGGAGGAGGGTGGCGATGAGGAGCCTCAGCACAGCTGAAGAGATGGAGGTCCAGTGAGGGCCAGGAGTTTGGCCCACCCCGTC
TCTCCCATCCCCAGCCCTGGGTCTACCCTGGTAGAAAGACATTTCTCTGGGAAAGGCTGCAGTAAATCTGAGCTTGGGGTTTTCAAGGTGACACCCGATCAACGGAAACCCCCAAACTGG
CAGCCATGCACACCCTCTTTATGCGAGAGCACAACCGGCTGGCCACCGAGCTGAGACGCCTGAATCCCCGGTGGAATGGAGACAAACTGTACAATGAGGCTCGGAAGATCATGGGGGCCA
TGGTCCAG
AGGTAAGGAGCTCTGCATCCCAGCATCCCCCAGATGACAAGCTTGGCATGAGAAGCAGTCCTTAACACATCCTTGCGGATGTGCCTAAAACCAGCTGGGTCTGGGCAACTGG
CGGAGCACCTGGACCTGTCCTCTGGCCCCGATTCTGCCTCTAACTCCCGTGTGACCCTGTCTGTGTCACTCACCCTCTCTGGGCTTTGTATCTCCACCCACCAATAGTAAATTAATGTTG
TCACATTTGACGTGATGACAATAAAGAATATGTCTGAGCCACCCTTTGAAAAGGCAAGGGTATGGGTGAGTAGCCTCTGGGGAATGTTCCTCCTGTCTTCCCTTCCAGATCATCACCTAC
CGAGACTTTCTGCCCCTGGTTCTGGGCAAGGCCCGGGCCAGGAGAACCCTGGGGCACTACAGGGGGTACTGCTCCAATGTGGACCCACGGGTGGCCAATGTCTTCACCCTGGCCTTCCGC
TTTGGCCACACAATGCTCCAGCCCTTCATGTTCCGCTTGGACAGTCAGTACCGGGCCTCCGCACCCAACTCGCATGTCCCACTTAGCTCTGCCTTCTTTGCCAGCTGGCGGATCGTGTAT
GAAG
AGGTGACCAGGTTTTCCAGGGGGCAAATGGGGGTGAGGGTGGGGAGCATGCCCTCCCCTAGGTGGGCCAAGCTTACTGCCAGGAAGCCAGGCTGCTGCAGAGGCCACTGCTAATAT
CTCCCCAGGACAGTGGAAACAAGGCAGGTGCCAGCAAGACCCTCAGTCACGGGCCTCCCATCCTGTGTGGAATGAGAGGATTTTTTAAAGGGGTGGAGACTAATGTCAGATTGATGGGGA
GCTCACCTTCCATTCCTTAAGAAGTACCTCCCAGCTCCAGCTGCTTCATGTCTCTCCAGAACTCTGTTTCCTGACAAACGTTACTAACATACCCGACTGGCTTGTCCAGCTCTGGGCTAG
CTTGGCATCATGTGATAACCCAAGTAGCTTCCCAGAGGCTGGTCCAATCTGTGCTGCTCACATTCCCTGCCACCAGGGGGCATCGACCCCATCCTCCGGGGCCTCATGGCCACCCCTGCC
AAGCTGAACCGTCAGGATGCCATGTTAGTGGATGAGCTCCGGGACCGGCTGTTTCGGCAAGTGAGGAGGATTGGGCTGGACCTGGCAGCTCTCAACATGCAACGAAGCCGGGACCACGGC
CTTCCAG
AGGTGAGGGGGCTGTCCACCTCTTCTCCCAGCTTTGCTCGGGCCAGGCTGCTCAAGGGGTTCTGGGAAGACCCTGGTACCTCCTTTCTGACTGGGACTGTCTACAGGATGTGC
AGGAGTGCAGAGGCATGCAAGGCCAAGGTCGATGTCCCAAAGCACTCCTGGAACACCGCTTGCTGTCCTGCCATGGCTCCCCCATCCACTGTAGGGGCTCCAGCTTTAGGTTCACCACAG
GACCAGCACTTAGGCTCAGGAGCTGAAAAAGGGAGGCTCGAACTTTCCAAGTAGAAAAAAAGCAATAGGCTTAAAAGGCAGGGAAGAACTGCTCCATTCCCAACACGCTGGCAAGTTCCA
AGGAGGAAGGCTAAAATTCCAGAGCTACAGAAACGCTCCTTGCTCTAAGTGGAGAAAGGCAAAGAGCCTTAGAGGCAGGGAATAGGTGGAATAAAACTAAGGCTGCAGTGGCTGTTTGGT
GAGGGCACAAATGACTTTATCCCAGGAAAGAAGAAAATATAGGCACTTTGTTAGTCACTTTCATGTATATTAATTTATTTTGTTCTCCCAATCACCTTGAGAAGCAAATGATTACAGCCA
TTTTATAGATGAGAAACTATGTTCAGAGACAGCTAGTAATTTGCCCAAGATCACATAAATAAGAAGTAGTAAAATCTGAATTTAAATTTACAATTTCTCCTTTCTTCCCCCTGTTCAACC
CCAGTAGGGGAAGGTTTTGGCCACTGCCTTGCTCCTCTCTCCTTCCCACCTCCTACCTCCCTGTCTAGAAAGAACCTGAAGGCCCATCTGCCCATGTATGAAGCAGTGGTGGCACCTCCT
ACTGCCCACCCCACTTCACACAGCGCCTGGTAGAGCACAGCAGGCACTCCGGAAACCTGAGCTTCTTTCCTTTCTTAAACTTGGGCGCTCCTGGCAGCTAAGGAGCTGTGGGCACCCCCT
GCCGGAGAGCTGAGATCGCTAGCCACCCAAGTTTCTATTAGGTTTTGAAGGGAAGGCAAGGGTTAAGGAAAGACACAAAGAGAGAGGGCGGCTGAACAGCCAATGCAGGTATTTATTTCC
AGCATAAAACCTACAGAGGTGGGGGACCAGCTTATTGCCAGAACCTGCCGCCGCCTACAGCTGGGGTACTGGGTATGGGTATGGGTATGGGCAGGAGGGGTCTGGGTCGTATGGCTTGCT
ACCCAGCAGGATATTGATAAGATGTACCTATGATCAGGTCCTTTTTTCAGTGGGATGTGGTAAGATGTTCCTTGGACCTTTGCCCAGCAGGATATGATAGGGATGTTCCTTCAGTTGGGC
CTTTGCCCATTAGGGTATGTTATGTTTTTCATGGCCCGACCTCCTGTGGAACGTTTCACTTTGACCAAGGTCTGGGAAATGGTGGGGGGCTTAGAAAATGGTGCAGTTTGGACTAACATT
CTTGCCTTCTACTTTGGTATAAAAGGAAGAGCTGGAGAGCTGAGATCGCTAGCCACCCAAGTTTCCACATTGCGAGCTTCTGCTCCCCGCAATACCTGTGCCCTCCAGAAATATTATCCT
GACACTAGCTCTTGCAATCTCCTCTAATCCCTTCCCAAGTCCAACATCAAACTGGGAACTGGACAACCTGGTTTGAGCTCCACTTAACAGTAAAGTGACCCTGAAGCCTATGGTTGCTCA
ACTTTAGTAAACATACCAGATATAGTTAGCATGCCATAAAAGCACCTCAAAAAGGTGCTGTGAGGGTCCTGAGATATTCACCCAATAGTGCTCAGTAATGGTAAAATACTGTGTGGTTTT
CATGGGTTCCGCTGGGGGATGGTGTGTGTGAGCTGTAGTTCAATGGATGACGTTGGGCTGGTGTCCCTTCGGGACATGGCAAAGCTATCTCCTACCTTGATTGTCACATCTTATGATGTG
GCTTGGCCCTGCGTGGGATCCTGATGGCTGACAAATTGGTATGGCTTGTCTTTGGAAGTTTCTGAAAACTTTTTAAAAACTTCTTTATTGAGATATACCATAAAATTCACCCATATGAAA
TAGACAATTCAATGATTTTAGTGTATTTAGAGTTATGCACCCATCACCATGATCTAATATCACACCATTTTCACTCCCCAAAGAAACCTTGTAAGAACAGTCAATCCCCATCCCCACCCC
AGCCCCAATCACCTCTAATATACTTTGTGTCTCTATATTGACCTATGCAGGACCTCTCATATCAACAGAATCATAGTGTGCCGTCCTTTTCTGAGTAGATGAATTTTATCATTGCTCTCC
ATTTGTTTGCTTGTTCATTCATTTATTCATTTGCTCACTCATTCATTCTACAAACCTGATGGAGAGTCTTTTATATTCTGGGTACTGTCTTACTTGTGTCTAATGAATGCAGTCGAGCCC
AGTAATATCACTAAGTGTCGGATTTGTGGTTATGAGTTTTGGGTTAAAACAGCTTTTTAGGTTAGGACACACTCCTCAGTCTGATGCACACTGAGCACAGTTGTCCACTCACATTTTCAA
TCAGAGGACAGAAACCTAGTCTTTGGAAGCTCTAGGGACTCTTCTGCCAGCAGGATCTTCTGGTTCTGCCCAATATTGACTGGCCACAGCTTCCCCCCAGAGGACTGGTGGAGAAAAACA
GAAGCTAATGGGAGATCAGCAAGACTGAAGCTGCTTCTCCCCGTTCCCCTGCAGGGTACAATGCTTGGAGGCGCTTCTGTGGGCTCTCCCAGCCCCGGAATTTGGCACAGCTTAGCCGGG
TGCTGAAAAACCAGGACTTGGCAAGGAAGTTCCTGAATTTGTATGGAACACCTGACAACATTGACATCTGGATTGGGGCCATCGCTGAGCCTCTTTTGCCGGGGGCTCGAGTGGGGCCTC
TTCTGGCTTGTCTGTTCGAGAACCAGTTCAGAAGAGCCCGAGACGGAGACAG
AGGTAAGTGACCCTATCATAAAAGACATCAGCACCAGAGGCAGAGCAGAAAAACACTAGCATTTCAAG
ACTAAACATTGAAGAACACTGCTCTTTTTAGTATCATTTCTTCCAAGTTCACAGGATCTGAAATCAGGAGGCTCCTCTCTGAAAAGCTGGGTCAAAGGAAGAGAGACACTGACCCAGGCA
AGGCCCATATTGCCTGAGCTGGAGTCATCTTAAACCCAGAGGTTGCCTGGCAGCCTCACTATTCTGGGTTGGTGGATCCCAGAAAACATGGGCAGAAAGGGCTAAATCTGGTTTCCCTCC
AATACATTTGTGATTTAAGACCTGACGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCCGGCTA
AAACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTAGTGGCGGGCGCCTGTAGTCCCGGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGCG
GAGCTTGCAGTGAGCCGAGATCCTGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACCTGTCTCCTTA
TTTGGAGTTGGTATTGCCCGAGTTCAAATCCAGCCCTGGCCACTTAAATTACTGTGCACTTAATACTGGGCAATTTATTTGGCCACTGAATTTGTTTCATTTCTGAGATTCCAATCTTGC
AGGAATTTTGTGAGAATTGAATGGAATAATATATGTAAAGTACCTGGCACACAACAGGTGCTCATTATAAGGTAATTCCTCCCCAGCCTTCACCCACATCTCTCGACTGCCTGGTAGGTT
CTGGTGGCAGAAACGAGGTGTTTTCACCAAAAGACAGCGCAAGGCCCTGAGCAGAATTTCCTTGTCTCGAATTATATGTGACAATACCGGTATCACCACGGTTTCAAGGGACATCTTCAG
AGCCAACATCTACCCTCGGGGCTTTGTGAACTGCAGCCGTATCCCCAGGTTGAACCTATCAGCCTGGCGAGGGACATGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCATCTGCTCCCAGCCCTGGCAGGGGTCCTGGCCACACTCGTCCTCGCCCAGCCCTGTGAGGGCACTGACCCAGCCTCCCCTGGGGCAGTGGAGACCTCGGTCCTGCGAGACTGCATA
GCAGAGGCCAAGTTGCTGGTGGATGCTGCCTACAATTGGACCCAGAAGAG
CATCAAGCAGCGGCTTCGCAGCGGTTCAGCCAGCCCCATGGACCTCCTGTCCTACTTCAAACAACCGGTA
GCAGCCACCAGGACAGTTGTTCGGGCCGCAGATTATATGCATGTGGCTTTGGGGCTGCTTGAAGAGAAGTTACAACCCCAGCGGTCCGGACCCTTCAATGTCACTG
ATGTGCTAACAGAA
CCACAGCTGCGGCTGCTGTCCCAGGCCAGTGGCTGTGCTCTCCGGGACCAGGCCGAGCGCTGCAGCGACAAGTACCGCACCATCACTGGACGGTGCAACAACAA
GAGGAGACCCTTGCTA
GGGGCCTCCAACCAGGCTCTGGCTCGCTGGCTGCCCGCCGAGTATGAGGATGGGCTGTCGCTCCCCTTCGGCTGGACCCCCAGCAGGAGGCGCAATGGCTTCCTTCTCCCTCTT
GTCCGG
GCTGTCTCCAACCAGATTGTGCGCTTCCCCAATGAGAGACTGACCTCCGACCGTGGCCGAGCCCTCATGTTCATGCAGTGGGGCCAGTTCATTGACCATGACCTGGACTTCTCCCCGGAG
TCCCCGGCCAGAGTGGCCTTCACTGCAGGCGTTGACTGTGAGAGGACCTGCGCCCAGCTGCCCCCCTGCTTTCCCATCAAG
ATCCCACCCAATGACCCCCGCATCAAGAACCAGCGTGAC
TGCATCCCTTTCTTCCGCTCGGCACCCTCATGCCCCCAAAACAAGAACAGAGTCCGCAACCAGATCAACGCGCTCACCTCCTTTGTGGACGCCAGCATGGTGTATGGCAGTGAGGTCTCC
CTCTCGCTGCGGCTCCGCAACCGGACCAACTACCTGGGGCTGCTGGCCATCAACCAGCGCTTTCAAGACAACGGCCGGGCCCTGCTGCCCTTCGACAACCTGCACGATGACCCCTGTCTC
CTCACCAACCGCTCGGCGCGCATCCCCTGCTTCCTGGCAG
GTGACACCCGATCAACGGAAACCCCCAAACTGGCAGCCATGCACACCCTCTTTATGCGAGAGCACAACCGGCTGGCCACC
GAGCTGAGACGCCTGAATCCCCGGTGGAATGGAGACAAACTGTACAATGAGGCTCGGAAGATCATGGGGGCCATGGTCCAG
ATCATCACCTACCGAGACTTTCTGCCCCTGGTTCTGGGC
AAGGCCCGGGCCAGGAGAACCCTGGGGCACTACAGGGGGTACTGCTCCAATGTGGACCCACGGGTGGCCAATGTCTTCACCCTGGCCTTCCGCTTTGGCCACACAATGCTCCAGCCCTTC
ATGTTCCGCTTGGACAGTCAGTACCGGGCCTCCGCACCCAACTCGCATGTCCCACTTAGCTCTGCCTTCTTTGCCAGCTGGCGGATCGTGTATGAAG
GGGGCATCGACCCCATCCTCCGG
GGCCTCATGGCCACCCCTGCCAAGCTGAACCGTCAGGATGCCATGTTAGTGGATGAGCTCCGGGACCGGCTGTTTCGGCAAGTGAGGAGGATTGGGCTGGACCTGGCAGCTCTCAACATG
CAACGAAGCCGGGACCACGGCCTTCCAG
GGTACAATGCTTGGAGGCGCTTCTGTGGGCTCTCCCAGCCCCGGAATTTGGCACAGCTTAGCCGGGTGCTGAAAAACCAGGACTTGGCAAGG
AAGTTCCTGAATTTGTATGGAACACCTGACAACATTGACATCTGGATTGGGGCCATCGCTGAGCCTCTTTTGCCGGGGGCTCGAGTGGGGCCTCTTCTGGCTTGTCTGTTCGAGAACCAG
TTCAGAAGAGCCCGAGACGGAGACAG
GTTCTGGTGGCAGAAACGAGGTGTTTTCACCAAAAGACAGCGCAAGGCCCTGAGCAGAATTTCCTTGTCTCGAATTATATGTGACAATACCGGT
ATCACCACGGTTTCAAGGGACATCTTCAGAGCCAACATCTACCCTCGGGGCTTTGTGAACTGCAGCCGTATCCCCAGGTTGAACCTATCAGCCTGGCGAGGGACATGA
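As a consistency check between the CDS and the protein sequence, the first and last codons can be translated with the standard genetic code. The sketch below is illustrative and not the database's own procedure; the function name `translate` is hypothetical, and the two short fragments are copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop).

    Hypothetical helper for illustration; trailing bases that do not
    fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 24 nucleotides of the CDS (8 codons).
cds_start = "ATGCATCTGCTCCCAGCCCTGGCA"
# Last 21 nucleotides of the CDS (7 codons, including the stop codon).
cds_end = "TCAGCCTGGCGAGGGACATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment translates to MHLLPALA, the opening residues of the protein sequence, and the last fragment to SAWRGT followed by a stop, matching the protein's C-terminus.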
