Entry information : PtroPGHS01
Entry ID 5833
Creation 2007-10-09 (Marcel Zamocky)
Last sequence changes 2010-11-23 (Myriam Duval)
Sequence status complete
Reviewer Myriam Duval
Last annotation changes 2010-11-23 (Myriam Duval)
Peroxidase information: PtroPGHS01
Name PtroPGHS01
Class H synthase    [Orthogroup: PGHS001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism Pan troglodytes (chimpanzee)    [TaxId: 9598 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value PtroPGHS01
start..stop
S start..stop
HsPGHS01 1180 0 4..574 29..599
EcabPGHS01 1130 0 2..574 27..599
OarPGHS01 1110 0 4..574 30..600
BtPGHS01 1106 0 4..574 30..600
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '5833' 'join(122026281..122026416,122026834..122026974,122027176..122027319,122029765..122029946,122030058..122030141,122031891..122032137,122034825..122035111,122038589..122038736,122040526..122040881)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 122026281..122026416 134 N° 2 122026834..122026974 139 N° 3 122027176..122027319 142 N° 4 122029765..122029946 180
N° 5 122030058..122030141 82 N° 6 122031891..122032137 245 N° 7 122034825..122035111 285 N° 8 122038589..122038736 146
N° 9 122040526..122040881 354  
join(122026281..122026416,122026834..122026974,122027176..122027319,122029765..1 22029946,122030058..122030141,122031891..122032137,122034825..122035111,12203858 9..122038736,122040526..122040881)


exon

Literature and cross-references PtroPGHS01
Literature Chimpanzee Sequencing and Analysis Consortium Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437 (7055), 69-87 (2005).
Protein ref. GenBank:   XP_520238.2
DNA ref. GenBank:   NC_006476.2 (122026281..122040881)
mRNA ref. GenBank:   XM_520238
Protein sequence: PtroPGHS01
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   574 (202)
PWM (Da):   %s   65977.05 (22742.5)  
PI (pH):   %s   7.86 (6.52) Peptide Signal:   %s   cut: 25 range:25-226
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MRKPRLMNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFVNATFIREMLMRLVLTVRSNLIPSPPTYNSAHDFISWESFSNVSYYTRILPSVPKDCPTPMGTKGKKQLPDAQLLARRFLLRRKFIPDP
QGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGH
VDLGHIYGDNLERQYQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATLWLREHNRV
CDLLKAEHPTWGDEQLFQTTRLILI
GETIRIVIEEYVQQLSGYFLQLKFDPELLFGVQFQYRNRIAMEFNHLYHWHPLMPDSFKMGSQEYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGIGGGRNMDHHVLHVAVDVIRESREMRLQPFNEYRKRFGMKPYTSFQELVGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEIGAPFSLKGLLGNPICSPEYWKPSTFGGE
VGFNIVKTATLKKLVCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 9). Three spilicing variants are prediction with no EST confirmation.
5' end is probably false/missing..
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGAAACCGAGGCTCATGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGC
CCCAACTGCACCATCC
CCGTGAGCTGGGCCTTCAGCCCTCAATCCTTCCATCTTGAGCCCTTCTGCTCCCCGGGCCCTTTCTCCTAGACCCTAACTTCCTACCCTCCTCTCTGACCATGG
CCCTGTTCTCCTTCCTTGCCTGGTTCTGCCCCTCTCCCTGACCTGGCTTCAGCATGAGCTCTCGCTCTTGGTCCACCCTCACTCCCTTCTTCTGAGTTCCATGGTGAGTCTTCACCATAT
GCCCTGGCCCCTGTCCTCACGCCCGGTTCTGTCTCTGTCATGTGTCATTGCTCCCAAGGTTCCATCCTTACCCACTTCCCCATAGGTGCTACTCTGTTCTATCCTGGCCCTTGTCCTCAG
TGTCCCATCTTCCACCCTGGCTACTTCTGGTTCTGGTAGGAGGGACCAACTGAGTGACTCCCATTGCCCCTGCAGCTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCT
CTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACCTTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
AGGTGGGTGTGGGGCAGGGCCCCC
TGACCTGGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGATTCTTGGCATCTGATAAGGGTAGTGGGTGGGGAGAGTCTATGATGCCTGATAAAATAAGCCCCAACCCAGGAGGAGGCAAG
AACTGGGGTGGAGCTGGGGGTGGAAACACCCTTGTCACCATTATTTTTGCTCTCTGCAGTGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTTCATCAGCTG
GGAGTCTTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACACCCATGGGAACCAAAG
AGGTAAAATGGGGTGAGGAGCTGGGCCTGGGGATTAC
AGGAGGTGCTCAGTTCTTCTCTTTGGGAAAAATCAGGCAGAAGAACAATTATTGACCCAATTCTGCAGATGGCTAGACCAAGGCAAGACATATGACATGTCCAGAGCCTGGGCTGAGAAC
AGGCAAGGGCAGCAGAGGGTCTTGCCTGAGGTCACCTAGAGTCAGACCAATGTTTCCATAGTTCCAGGGTGCCTCTTTGCTTGATCCTTTTCTAATGATCAGTTGGGTCCTGCTGGGGTG
GAAGTGACTTAGAAGTTGAGATGTAGGAAAGAATAGTGAGCTATTTATTGGGTGCTGTCTCTGTGTTTGGGTCTTTACAGATGTAAATAGTTTTACATGCTTCACCAGTGTAAGGTACAA
ATAAGTCACATTTTTTTCTTTCGTTGCTCAAGACTTCACTTAGCCACACTGGCAGGGGTCTTCCTTGTAAGACCTTCCCATGCCACCTGTAATTATCCAAAAACCTGGGATATTATTCAT
TTCAGACCCATCAGTTCAGCATTAAGTACAGAGAGGACAAGAGAGGCTCATCAGTCCACTGCTGCTATACTCCAGTTCCTGCCACATGGTGGCACTGTTGAATGCCAGTCCTGTGCAGCC
TCATGGTTATGTGCTTTTTTGGGTTCAAAACCTTGCACCATGTCCCTGCTTGGGTCTCAAGAGCACTACTGTGACGGTTTTCCATCATATGGTTAGCTGCTTCTCCCAAAGCATGACTAT
TCTACCAGGATAGCTGAGTCTTGCCAACTTTGCTGAAATCATGCTTGCCCAGTGTCAGTGAGTGATGATTCCAAATTACGGTTGACAGATCACTCCCTCTAACTTCCCTTTTGGTGGATT
TTCTTTAGGGGTACTTGATATTTTTTCCTGCCAGAGGAACCCAGTCAGCCATCTTAATCCAAGTAATTTCCATTGATCTTGAACCTTCAACATCAGGGCTCACATCTTGATGCATTGTGA
AGAGATGCCTTTACCAGAACTCAAAAAATTCTATTCCTTTCTGTGAGGGCAATTGGGTGACAACTCATTTGACACTGACATAATTAAGGAAGACCTCTCAATACTTATTCTAAGGATGAC
TTGTCTTTATGCCAGACATAGAAAGATTGGCATCATTTAAAATAGGTTGATACACCTATTTTAAGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGACTCTTGGCATCTGATAAGGGTAGT
GGGTGGGGAGAGTCTATGATGGCTGATAGGTGGTGGTGTGGTGGCACATGTCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATTGCTTGAGCTCAGGAGTTCAAGTCCAACCT
GGATGACATGGTGAGACACCTTGTCTACAAAAGAATACAAAAGTTAGCTGGGTGAAGTTGTGTGTGCCTGTAGTCCCAGCTACTCAGGACGCTAAGGTGGGTGGATCAATTGAGCCCAGG
AGGTCAAGGCTGTAGTGAGCCATAATTGCGGCACTGCACTCTGGCCTGGGCAATAGAGTGAGACCCTGTCTCAAAAATAAATAAATAAATAAATAAATAAATAAATAAATAGGTTGGCAA
ACTACAGCCTGCAGGCCAAATTCGGCCCACCTCTCCATTTTTGTAAATAAAGTTTTATTGGAACACAGCCACACACATTTGTTTATGTGTCATCTATGACTGCTTTCCCGCTACAATAGC
AAAAACTGAATAGATGTGACATGTCTGTATGGTCTGCAAAGCCTAAAATGTTTACTATCTGGCTTTTTCCAGAAAAAGCTTGCTGACCTTTGATTTAAAAAATTCTACCCACTGGTTCTC
TGCCATGCTTACATTATCTCTTGCAGTCCTCAGATTCACTGTATCAAATGGGTATCATTTGCCCCTTTTTAAAGATGGAGAAATTGACACCCAGAGAGATGAGATGACTTATCTGTATTC
ACACAGCTAGCAAATAGCATAGTCAGTTGCAAACACAGGCTACCTTGACTCAGGGCAAGGGAGTTCATGTTTGTTTTTTTCTGTTTCTTTATTTCTTTTTGGTGAAATGTTTCATTATGG
AAAAATTGCAAAGATACACAAAAGTTGAGAGAAAAGCAGAATGAACTATGTACCCATCTTTCAGTTTCAACATTTACCCACGGTTTCTTCATCTTATTTCATTTCTCCCCTCTCATATTT
TTATAAAGTATTTTAAATCAAATTCTAAAAATCATGCCACTTAAAATTCTAAAAATCATGCCACTTCACCCATAAATACTTCTAGGGTCTTTTGGTAGAGAGTGGGTTACTTGGTGGTGG
TGGGGAGGTGGTCCTGAGGAGGCCACTCTGGGCTTCCTGCTTGGGCCAGTTTGCCTGGTGAGCCCAGATGTCCCCAGGGCAGCAAGATCCAGACAGGACAAGCTACTGCTGTTTCCTACC
CCCCAACCAGGGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTTGCACAA
CACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAGGCCTTGGGCCATGGG
GGGTGAGTACCTAGGAGGGGCTCAGGACTGCTCTGGACCTAATTTGGC
ACGCGTATGTCATCGACAGTGGGCCGGCACCCTGGTGACCCGAGGGAACCCCTCTCTGTCCACAGGTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAGTATCAACTGCGGC
TCTTTAAGGATGGGAAACTCAAGTACCAG
AGGTAGTGCTGGGCCAGGGGGTAGGGCAGAGGGAGGGGTCTCCCATGGTCTTCCCTGGCAAAGACTGCTTGGGGCGGGGGTCTGGGTCATG
TCCTGAGAGGGCCAACCACGGGAGTGGGAGGCTTGTGCCAGGAGCGACAGTATACGCTGGGAGGAGGCAGCAGGTATGAGAAGCCAGGGAGGAGCAGACGTGGCCTCCCATGTCAGCCAG
GGAGGTGGATTTTGGAGCTCAACAGGAGATGAACATTTGTAGTCTGCATTTATTTATTTAATTATTCCACAATATTAATTGGACACGAGAATATACCTGGCACAGGTGATTGAGTAGTGA
CCATTATAGTTAATGGGCCCTGCCCTCAAAAAACCACAGGCAAACTCTAGGATGCTCATTCTATGAGGGCTTTTGTTTAAATCAGAACGGTCCAACAGAAATATAATGTGAACCACATAC
ATAGTTAAAATTTTCTAATGTCCATATTAAAAGAGGGAAAAAGAAACAGGTGAAATGATTTTAAGAATACATTTTACTTAATGCAATATGTCCAGGTAATTAGCATTTTAGCATGTAATC
AATACATTATTAATAAAATATGTCACATTCTTTTTTCATGCTGAAGCTTCAAAATCTGGTGTATATTTCACACTCACAGGACATCTCAATTTGGATGCTCCGTTTTCACTGGAGTGATCT
AATCTGTATTAAGATTTCATAAAATGTACAGCTGAATAAGTAGAGTGACATGTCCGACTTGTTCCACGCATACTTAAAGGTTTTCCAATAGCTGAAGTATCAGTTTTAAAATTCAAATAG
AAATTAAGATAAACCTAAATAAAATAAATTAAGTAACATTCAGTTCTTCATTCACACTAGCCAAATTTCTAGTGCTCAGTAGCCACACGTGGCTAGTGGCTACCATATTGGATCGTACAA
ATCTTAGGCAAGCACCAAAACAAAGTTTGATGCTGAATCTTTAGGGCTAAGAACAGTACTTGGCATGTAGTAGTCTCTTGGCATGTATTTACTGAATGAATGAAGAAGCTGCCATATAAT
TAGGTACACTTGTAGCTGCCACCAAGGAGAAGCTGTGAGTGCCACTAGAGTGTTTGGATGATGGGTAAAACTTCCCTAGGAAGTTACAAATAAACCCAGAGTTGCATAAAGGATGAGGAG
GAGTTAGGGATGCTAAGAATGGGAGAGGGCTTTCCAGGTAGAGGGTTTAGCAAGTACAAAAGCTTAGAGGTGGAGAACAGCTTGGTGACTTGGAGGGAGTGTAAAAATGGGAGCGTTTGC
TGAGCCTAGTGATGGAGCGTAAGAATGACTTTACAGGAAGGTGGAGATGTCTGTGGGGACCGTGTTAAGGAATTCTACTTTCTGCCAAGAGCAGAGAGAGTATTTGGAAAGGTTTTAAGT
CAGCTCATGATGCAAGATTTGTTTTTTGTTTTTTTTTTTTAAGTTTTTTTTCTCTTTTTTGCTTGGGTGCAGTGCAGAGACTGGCCTGTCTGGGGATAGGAGTGGAAATAGGGAGCCCAC
TTAGGGAAAGGAGTCAGAGTGGAAGATGAGTGGGGGTTGCTTGAGGTAGTGGTGGTAGAGATGGAGAGCAGGGGACTGATTTTAAGAGACATTTTGGAGGTGGAATCAACAGCCTTGGCT
GAGAGGAACTGGCAGCTGGAGGCAGGAGTGGGAGGGAGTTGGTTGTGGGCAGCTGCGGGTGACCCCCAACCCCAGGTTGCCAGGTGGCCCCATCCCACAGGTGCTGGACGGAGAAATGTA
CCCGCCCTCGGTAGAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGC
CACGCTCTGGCTACGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
AGGTGAGGATTCC
AGACCTGCCCTGCCCTGGAAGGTCATTCCCTCCATCCTGAGAAGTTGGGGGCGGGGGGTACTTAGAGGTGGAGGCTGGGATTAGAATCCTCACCCTTCTGCTTAGTGGCTAGAAGACCCT
GAGCAGGCCCCTCAACCTCTGTGAGCCTCGGTCTCCAAATCTGTACATTGGGGTGAACGATGATTGTAAGGATTTATTGAGATCATGAATGGGAAGGCAACTAGCACATAGCAGGCCTTC
AGAAAATAGATGAGTGTGATGGTTAATTATTAGCTGGGTGACTCAGCACATTTGATTAGCCACCCTGAGTCTCGGTTTCCCCCGTGTGCTGCGCCCGGATGCATACTTGTGGTCCCCAGC
ACTTGCAGGGTCAAACTGTAGCTTGCCCGGCCTTCAGGAATCTTCATGTTCCCTTCCCTGCATGTATCTACCTTCCTGCAGTTTGGTAGCTTCTAGGTGACTCAGGGACAGGATATTTTT
GTGTTCCCTATGGGGGCGAGTCTGCAACCTAAAATGTCAGATGGTTTCCTGCCTGGGAGCTTGGCCCCTGACATCCCTGTCCAGACCATGTTCGCTCTGAGTCACCAGCAACTCCCTTCC
CCCACCTCTGGCACCACTGGGCATGGCTGGTCCCAATTATAGAGCCTAATTCACTGGAGCCATGAAGAGCCAGGCATTGGGAATAGGAACAGTCATTAGAAGGAAGAGGGGCTGGTTCTG
AAGTTTCAGCGTTGCAAAGACCTTGACCTGAGAGAGCTGGAGGCTGTCCAGCACACTGGCTGGAGATGAAGCTGTGACCAAAGGCAGGACCCTAGGGCACCATTTAACTCCCCTACAACC
TCATGAGCCGGGTATGATTACCCTGTGAAAATCAAGATTCAGAGAGGTGAAGTGAATTCTCCAGGGACACTTAGCAGATGGAGACTTGAGGTCTGGTTGCCACCAAAATTTATGCTACTT
CCACTCCATCACAAAGGGGGCTCTTCTTGAATGGGAAGGGGTTGCAAACCTGAGTCTGATCTGTGACATGTGAGTATTGGAAAGGGATTCCCCCTCGCGTCTACACTTAGTATTCCTACT
TTTGGCTGACATATATGGACACCCCGTCTTATGCCAGGCACTGTGCCAGCAGTTTTGCTGTATTCATTGTCTACTTCTCTCAACAACCCTAATATAGTATTGCATTGATGTTATTATTAT
TATTATTTGAGATGGAGTCTTGTTCTGTTGCCCACACAGTAGTGCAATGGCGTGATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCGAGCAATTCTCGTGTCTCAGCCTCCCGAGTA
GCTGGGATTACAGGTGCCCGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTGCCATGTTGGCCAGGCTGATCTTGAACTCCTGACCTCAGGTGATCCACCCGCC
TTGGCATCCCAAAGTTCTGGGTTTATAGGCATGAGCCACAGCGCCCGGCCGCACTGATATTATTATAACACACATTTTACGTTAAGGAAAGTGAGATTCTGAGAGATTAAGTAACTCAGC
CCAAGCTCAGGTGGCTGGAAATGGTAAAGCCAAGATTTGAATCCAGGTCTGCTGATCCCACAGTCTGTAGCTCCAGGTCAATTTCCAAAAGCCAATTTGTCTAATGGCTAATTGGCCTGA
ATATCAGTTTCTTTCAATATCTAGTTGCCCAGTTTTAATTTTGTTACAGAATGTTCAATTTGCATTTTCCCTGGCTTTTTGCTTTCTAGCTGGTTATAGCTACAAATGGGGCAGAGGAAA
AACTTATTTATAAGAATCCTGTTATATAAGAACATATAGGAAATATGTTTTTTGAAATATATTTAGGGCTAGGCTCCCCTTCTGTCCTCAGTATTCTCTTTTGCCACTGCAGCAGCTCTG
AGTGGTCTCCTTGAAGTCCCCCTCTATCACCGAGGGTGTCAGCATGATGACAGCTCTCACCAGTAAATCCTCCACTTTTCCATCTTTTGTAGTCTCTCCACCTTCTTTTAATGGGCTTCA
GGAGGGAGGTCATAAGACCAATTCTTGGACACCTCCTTTGCATGTCCTGCTTAGCTGGGCCTAGAACCTCCCTCTGAAATGTGGGAGTAGCTGGTGCTCTTGTCTTGAGACCTCAGTCAA
AATAGACCAAAAGTTCTATTTTCACATCCTGTTACAAAGAGACAAAATGGAAGAGGCCAAACAAAATTAAACATCAACCGCAAGGCCAGGTGTGGTGGCTCACACTTGCAATTCCAGGGC
TTTTGGGAGGCTGAGGTGAGAGGAGTGCTTGAGACTTGCAGTTCAAGACCAGCCTGGATAACATAGTGAGACCCCATCTCTTAAAAAAAAAAAAAAGAAAGAAAGAAAGCTGGGTGTGGT
GGTACACCTGTGGTCCCAGCTACTTGGGAGGCTGAGGTGGGAGGATTGCTTGAGCCCGGGAAAGTTCAGGCTGAAGTGAGCTGTAATTATACCATTGCGCTCCAGCTTGGGTGACAGACC
AAGACCTTGTCTGTAAAAATAAAAATAAACATCAACAGCAACAACAACAATAAAGAGACCAAAAGCAAGCACCTCTCATGAACTGGCCTGCTTTCCAAGCTGGGCATCTAAATCACTGTG
CCTGGCTGACCCTATTTCCAATCCTGCCCTGCCCAGGGGAGACCATCAGGATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGC
TGTTCGGTGTCCAGTTCCAATACCGCAACCGCATTGCCATGGAGTTCAACCATCTCTACCACTGGCACCCCCTCATGCCTGACTCCTTCAAGATGGGCTCCCAGGAGTACAGCTACGAGC
AGTTCTTGTTCAACACCTCCATGTTGGTGGACTATGGGGTCGAGGCCCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGG
GGGTAAGCCCCAGAGGAGTGCTGGTGAGGGCAGGTGG
GCTGAGGGATCCAGCAGACCTGGGTCCAAATTCCAGGTTCTTCTTCTGTAAAATGGGGCTGATGTCACTTCTACAGGGCAGTTGTAAGCATTCCTGCGTGAATTCATTGGTTCATTTGTC
CATTCCACAATACCAGACATTACTCCAGGTACTGGAGATGTAGTGGGAACAAGACTTTTGTGGTTCTTGGCTCATCTTTTAGTGCTCACACCCTAGAAAGTGATGGCAGTCATACGACAG
CTAACAGCATTAGGGCCCTTACTGCGTGTCAGGCACTGTTCTAAGAGCTTCTCCTATGTTATCAGAATTCTAAGAGTATGTTATAAACCATAGTAGAAAAGCCCACCATGCTCTGGGAGT
CAGGGAGGGACATCTGACCCAGAGTTGCGGGTTATGGGAAAAGAAGGGATGTCGGAGCAGAAATTGGAAGGAGGGATAGAGATCTCCCAGGGTAAGAGGTGGTGTTAGTGGCAGGGGATG
GCTGTGTTCCAGATAGAGATGACGGCATGGGTGAAAGGCATGGAAGTCAGAGGACATGGCAGTTTGAGGAACAGAAGGACATTCAGGGCATGGTAATGTATGTAACAGTGCCTGCCTATG
GTGTTTATTAAATCATAAGCCTCCGCTCTGGGCTGAATTGTGTCTCTTTTAAAGTTCGTATGTTGAAATCCTAACCGCCAGAACCTTAGTATGTGACTGCATTTGGAGACAAGGTCTTTA
AAGAGGTAATTAAGTTTAATTGAGGTCATTAGGCTGGGCCCTAATCCAGTGTGACTGGTGTGCTTATAAGAAGAAGAGATTAGGTCACACACACAGAGGGAAGGCCACATGAAGATGCAG
GGAGAAGAAAGCCATCTACAAGCCAAGGAAAGAGGCCTCAGGAGATACCAACCGTGCTGACACCTTGATCTTGAACCTCCGGTCTCCAGACGGAGGAAATAATTTCTATTGTTTGAGCCA
CTCAGTCTGTGGTACATTGTCATGGCAGCCCTAGCAAACAAACACATTCTCCTTCCCTGGAATTCCCAGCCAACGCCTTCCTCAATCTCCCCTTCTCCACATTCAGAAGCTCCCATCTGC
TTCATCGCAGTCTCTGGCTCCCCTGTTGCCTCACAGTCCTCTGCTTCTCTCTAATCCTTGTCCCTAAACCCTGTCATGAAGCTGTGGCACACATGGATTTCCATTTCCTTCTGGTAATTT
GACTGAAATTAGCATTTGCTGCCCCGGTGGGCAGCTGCTGGCTGCTTTATGGCCTCTTTGTCGGTTTCTTTATGGTTCTTTGTGGGGACACAAGACATGAACAGAGACAATAGCCTTTGT
GTGAGGCTGGATGGTTTTCAGAACGTTTTCAAGGAATGACCATGATGATGTACGTGAAAAGCCCCGGCATCGTACCTGGCACAAGGCAGAAAGGCCGCAGAGATGTATGGACTGTCAAGA
TTTTTTTTCTTTTTTCTTTTCTTTTTTAATAGAGATGGGGTTTTGCCATGTTGCCCAAGCTGGTCTTGAACTCCTGGGCTCAAGTGATCTGCCCGCCTAGGCCTCTCAAGGTGCTGGGAT
TATAGGCGACTCTCAGGATATTAAGAAGAGTGAGTGATGATAAGACAGGGCTTCCCCTGATAACCACTGTCCATGGCTACCCTCTCAGGGGTTCCCATGTGACCAATTCTGAGATAACAG
CTTTGCATAGTTTATCCCATTTAAATTGACCACAGCCATATCAGGTAGATGCTCTTCCCTTCCCCATTTTTACATATGCGGGAACTCAAACTTAGCTTGAATAGCTGCCCAAGGTCCCTA
TATTAGTTTCCTCAGGCTGCTGCAACGAAGTACCACCAACTGGGAAGCTCGGAGCAACAGATGGAGGCTAGAAGTCTGAATTCAAGGTGTCGGCAGTGCCATGTTCTCTCTCAAGGCTCT
ATGCCTCCTTGGCATCTGGTGGTGGCCGACAGTCCCTGGCATTCCTCAGCTTGCAGAGGCATCGCTCCAGTCTCTGCCTTTATCATCCTGTGGTGCTCTCCCTGAACTGGTCTTTCCTCA
TTTTATGAAGACACCAGTTATTGGATTGGAGCCCACCGTAATCCAGTATGAGCTCATCTTAACTTGATTACTTTTGCAAAGACTTCATTTCCAAAGAAGGATGCATTCACGGATGCAGAG
AGTTAGGGCTTCAACACATATTTAATATTTTAGGGAACACAGTTCAACCCTCAACAGCCCCACAGCTTGTAAAGCTGTAACTGGCACCATCCTCTGCTTGCTTTGCTCATATGATTTCAT
CCAACCATGGCTTTCTTTTTTGCCCAATTATAGTTGTTGATCAAAATGACTCTCTTAAGCATGAACGTTATTATTTCATCCTAAGCACCAGAGCTTTCTTTTCTTTCCTTTTTTTTTTTT
TTTTTTTTTTTTTTGAGATACAGTCTTGCTCTCTTGCCTGGGCTGGAGTGCAGTGGTGTGATTTTGGCTCACTGTAACCTCTATCTCCTGGGTTCAAGTGATTCACCTGCCTCAGCCTCC
CAAATAGCTGGGATTACAGGCACCTGCCACCACGCTCAGCTAATTTTTGTATTTTTAGTAGAGACGAGTCTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGGACTCAAGTGATCCA
CCCGCCTCGGCCTCCTAAGGTGCTGGGATTACAGGCACGGGCCACTGTGCCTGGTCAGCACCAGAGCTTTCTTTAGACTAATGCGCCTTAACTGTTAGTAATCAGAATCTTTTTCTGCAG
CCTTTTTATCTTGCCAGGTTACCTGGGCTTTGAGGTCTATTTTCCCTCCCTTCACAGGGGGTTACACTACCTCTTGGTGTAATTCAACGGACTGGGGAGCAGTGTGAGCCACAAAAGAGG
TCCTCTTGGCCCATTCCACACTTCAAAGATGAGGAAGCTGCCGGATGTGGTGGCTCATGCCTGTAATCCCGGCAGTTTGGGAGGCCGTGGTGGGAGGATCATTTGAGACCAGGAGTTTGA
GACCAGCCTGGGCAATATAGTGAGTCCTCACCTCTACAAATATAGAATAAAACAAAATTAGCCAGGCATGGTGGCACGTGCCTGTAGTCCTAGCTACTTGAGAAGCTGAGGTGGGAGGAT
CGCTTGAGCCTGAGGAAGTTGAGGCTACACTGAGTCAAGATCGTGCCACTGCACTCCAGCCAGGGCAACAGAGTGAGACAGTGTCTAAAAAAAAACCCACAAAAAAAAATGAGGAAGCTG
AGACTCAGAGGGGCTTCCTGAGAGAGCATGACAGCAGCAGGCCCGGGGCTGGTCTGCAGCATCACAACTGCTAGGCTGCCCAACACTCTCCATCCTAGCTCAGAAGGGACTCCCACTGGA
AGCTCTTGTCCCAGGAACTTACCCAGGCTCCAGGACAGCCTGGCCTGGCTCCCAGACCACTGCTGTGCTTCTCTCCCGGCAGATCGGTGGGGGCAGGAACATGGACCACCATGTCCTGCA
CGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCCTTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTAG
AGGTGAGCAG
CTGTTTCCTGGATGCAGTCCCTGCCCTTGAGGGACTGGCAGCAAAGTCAGGGAGACATCAAGGAAATAGAACGGGACAATACATGCGGCAATGTGTAACAACCAGACTTATAATGGGCGT
GGAAGTGCTGTGCCAGGGTGGTAAATAAGCCTGCTTGGGGAGAGAAGGTGACTTTTCAGCTGGGTTTGGAGAACAAATGGCATTTTCAGTGGGAGAAGAGAGGGAGGAGTGTTTTAGGCG
GAGCAATAGAAAGTACAAAGGCTGCCGGGCAAGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTCAGGCGGGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACAC
GATGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGGCATGGAGGCAGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCTGGGAGGCAGAGC
TTGCAGTGAGCCGAGATTGCGCCACTGCTCTCCAGCCTGGGCGACAGAGCGAGACTCCATCTCAAAAAAAAGAAAGTACAAAGGCATAGAGGTCACACAGCAGGTATCTAGGGAAACAGT
CATAGATGTGGCTGGGGCAAGGAGCTTATGTTGTGGGGATGATTGGTGATGGGAGGTTGGGGCCAAACCGTGTAGAATCTTGGTTTTCATACTAAATATTTTGAGTTTTATTTGGTCTAG
TGCTTTCTAAGTACCCACTTATGGGCCAGCTGCATAAAAATCATTTAGAAAGCTGATTAAGATACCAAATCTCAGATGCCTCTCCTACAGATTCTGATTCAACCAGCCTGGGTAGTGCCC
AAGAATCTGCGTATTCATATGAATAAAGTATATATACATATATATGAATATATATATGAATAGAGTATATATACATATATATCAATATATACATATACATATCAATACATATATACATAT
ACACACACATATATATGTGTGTGTGTATATATATAAATATATATGTGTGTGTGTGTATATATACATATATATATATATTTCCCCCAATATCCTCAGATGTTGGGCAGCCAAGTTGCCAGC
CCTTGTCACATACAATGGGGAGCCACTGAAGCAGGGTGTGACATGGCCCATGAGAGTTCCCAATGGAGGATGGATTTGAGGAACTGGGTAGGGAAGCAGGGAAGCTCCATATTTGTCTCC
CCTTGGCCACATGAACATGTGGGTATTGCAGAGTGGAGCAGATTCTGTGCATGAGCTCCTTGTGATCCTGGAACAGCATCTTATTCTTTACTCTCCCATGACAAATGGTCCCCAGGGGCA
GAAGGAACACTGCCACTGAATTTCTGTGTGAGCTCGGACATGTTACATCTTTGAGCTCTAGTAATAAAAGGGCTTGGCCCAGGCTAAGGTCTTGAAATCTGTCTGTGAAGGAAGCCAGAT
GGGGAGCATTCTCCCTTCTGTGATGATCAGGTAATTAGGGCCCAGAGTTGCTCAAGGTTATAGGCTGTTTGGTGGCAGATCTAGGCCCCTGACTTTCTCTTTAGTAGCATTTTCCTTCCC
TAGACCCAGTCCCTGAGGAGGGGCAATTTGTGCTTTTCCTCTTGACCCTTTTCCCCAGTGCCAACCAGGCCAAATTCTAGGGCACATGCTCAGTTGCTCAAGTTAGTCTCCTGGAGTCCC
TATTATCCCCAGAAAAAGGTGGACCTGGAAGGGTCCCGCCCCAGGTTGACCTTAATGGCATCATGGATCTGACGCTAGCGTTTCCCCTTATCTCCTTGTAGGAGAGAAGGAGATGGCAGC
AGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTGGAGTTCTACCCCGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATGATAGAGATTGGGGCTCCCTT
TTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAGCCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATCGTCAAGACGGCCACACTGAAGAAGCTGGTCTG
CCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGAAACCGAGGCTCATGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGC
CCCAACTGCACCATCC
CTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCTCTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACC
TTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
TGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTTCATCAGCTGGGAGTCTTTCTCCAACGTGAGC
TATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACACCCATGGGAACCAAAG
GGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAG
TTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTTGCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAGGCCTTGGGCCAT
GGG
GTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAGTATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAGGTGCTGGACGGAGAAATGTACCCGCCCTCGGTA
GAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCCACGCTCTGGCTA
CGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
GGGAGACCATCAGGATTGTCATCGAG
GAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGCTGTTCGGTGTCCAGTTCCAATACCGCAACCGCATTGCCATGGAGTTCAACCATCTCTACCAC
TGGCACCCCCTCATGCCTGACTCCTTCAAGATGGGCTCCCAGGAGTACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTATGGGGTCGAGGCCCTGGTGGATGCCTTC
TCTCGCCAGATTGCTGGCCGG
ATCGGTGGGGGCAGGAACATGGACCACCATGTCCTGCACGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCCTTCAATGAGTAC
CGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTAG
GAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTGGAGTTCTACCCCGGA
CTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATGATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAG
CCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATCGTCAAGACGGCCACACTGAAGAAGCTGGTCTGCCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGT
CAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA

Retrieve as FASTA