Entry information : PtroPGHS01
Entry ID 5833
Creation 2007-10-09 (Marcel Zamocky)
Last sequence changes 2010-11-23 (Myriam Duval)
Sequence status complete
Reviewer Myriam Duval
Last annotation changes 2010-11-23 (Myriam Duval)
Peroxidase information: PtroPGHS01
Name PtroPGHS01
Class H synthase    [Orthogroup: PGHS001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism Pan troglodytes (chimpanzee)    [TaxId: 9598]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox        Score   E-value   PtroPGHS01 start..stop   S start..stop
HsPGHS01     1180    0         4..574                   29..599
EcabPGHS01   1130    0         2..574                   27..599
OarPGHS01    1110    0         4..574                   30..600
BtPGHS01     1106    0         4..574                   30..600
Gene structure: exons
Exon   Start..End             Size
N° 1   122026281..122026416   136
N° 2   122026834..122026974   141
N° 3   122027176..122027319   144
N° 4   122029765..122029946   182
N° 5   122030058..122030141   84
N° 6   122031891..122032137   247
N° 7   122034825..122035111   287
N° 8   122038589..122038736   148
N° 9   122040526..122040881   356
join(122026281..122026416,122026834..122026974,122027176..122027319,122029765..122029946,122030058..122030141,122031891..122032137,122034825..122035111,122038589..122038736,122040526..122040881)
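As a quick consistency check, the exon coordinates in the join(...) expression above can be parsed to recompute each exon size and the total coding length. The Python sketch below is only an illustration: the function name parse_join and the cleaned-up location string (stray spaces inside two coordinates removed, on the assumption that they are page-wrapping artifacts) are choices made for this example, not part of the database record.

```python
import re

# Location string from the entry. Assumption: the spaces that appear inside
# two of the coordinates on the page are wrapping artifacts and are removed.
LOCATION = (
    "join(122026281..122026416,122026834..122026974,122027176..122027319,"
    "122029765..122029946,122030058..122030141,122031891..122032137,"
    "122034825..122035111,122038589..122038736,122040526..122040881)"
)


def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(LOCATION)
# Coordinates are inclusive, so the size of an exon is end - start + 1.
sizes = [end - start + 1 for start, end in exons]
cds_length = sum(sizes)

for number, ((start, end), size) in enumerate(zip(exons, sizes), start=1):
    print(f"Exon {number}: {start}..{end} ({size} bp)")
print(f"Total CDS length: {cds_length} bp")
print(f"Codons before the stop codon: {cds_length // 3 - 1}")
```

The recomputed sizes agree with the exon table, and the total of 1725 bp corresponds to 574 codons plus a stop codon, which is consistent with the protein length reported for this entry.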



Literature and cross-references PtroPGHS01
Literature Chimpanzee Sequencing and Analysis Consortium. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437 (7055), 69-87 (2005).
Protein ref. GenBank:   XP_520238.2
DNA ref. GenBank:   NC_006476.2 (122026281..122040881)
mRNA ref. GenBank:   XM_520238
Protein sequence: PtroPGHS01
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   574
PWM (Da):   65977.05
PI (pH):   7.86
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MRKPRLMNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFVNATFIREMLMRLVLTVRSNLIPSPPTYNSAHDFISWESFSNVSYYTRILPSVPKDCPTPMGTKGKKQLPDAQLLARRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHVDLGHIYGDNLERQYQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATLWLREHNRVCDLLKAEHPTWGDEQLFQTTRLILIGETIRIVIEEYVQQLSGYFLQLKFDPELLFGVQFQYRNRIAMEFNHLYHWHPLMPDSFKMGSQEYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGIGGGRNMDHHVLHVAVDVIRESREMRLQPFNEYRKRFGMKPYTSFQELVGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEIGAPFSLKGLLGNPICSPEYWKPSTFGGEVGFNIVKTATLKKLVCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL

Remarks Complete sequence from genomic DNA (chromosome 9). Three splicing variants are predicted, with no EST confirmation. The 5' end is probably incorrect or missing.
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGAAACCGAGGCTCATGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGC
CCCAACTGCACCATCC
GTGAGCTGGGCCTTCAGCCCTCAATCCTTCCATCTTGAGCCCTTCTGCTCCCCGGGCCCTTTCTCCTAGACCCTAACTTCCTACCCTCCTCTCTGACCATGGCC
CTGTTCTCCTTCCTTGCCTGGTTCTGCCCCTCTCCCTGACCTGGCTTCAGCATGAGCTCTCGCTCTTGGTCCACCCTCACTCCCTTCTTCTGAGTTCCATGGTGAGTCTTCACCATATGC
CCTGGCCCCTGTCCTCACGCCCGGTTCTGTCTCTGTCATGTGTCATTGCTCCCAAGGTTCCATCCTTACCCACTTCCCCATAGGTGCTACTCTGTTCTATCCTGGCCCTTGTCCTCAGTG
TCCCATCTTCCACCCTGGCTACTTCTGGTTCTGGTAGGAGGGACCAACTGAGTGACTCCCATTGCCCCTGCAGCTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCTCT
TTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACCTTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
GTGGGTGTGGGGCAGGGCCCCCTGAC
CTGGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGATTCTTGGCATCTGATAAGGGTAGTGGGTGGGGAGAGTCTATGATGCCTGATAAAATAAGCCCCAACCCAGGAGGAGGCAAGAACT
GGGGTGGAGCTGGGGGTGGAAACACCCTTGTCACCATTATTTTTGCTCTCTGCAGTGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTTCATCAGCTGGGAG
TCTTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACACCCATGGGAACCAAAG
GTAAAATGGGGTGAGGAGCTGGGCCTGGGGATTACAGGAGG
TGCTCAGTTCTTCTCTTTGGGAAAAATCAGGCAGAAGAACAATTATTGACCCAATTCTGCAGATGGCTAGACCAAGGCAAGACATATGACATGTCCAGAGCCTGGGCTGAGAACAGGCAA
GGGCAGCAGAGGGTCTTGCCTGAGGTCACCTAGAGTCAGACCAATGTTTCCATAGTTCCAGGGTGCCTCTTTGCTTGATCCTTTTCTAATGATCAGTTGGGTCCTGCTGGGGTGGAAGTG
ACTTAGAAGTTGAGATGTAGGAAAGAATAGTGAGCTATTTATTGGGTGCTGTCTCTGTGTTTGGGTCTTTACAGATGTAAATAGTTTTACATGCTTCACCAGTGTAAGGTACAAATAAGT
CACATTTTTTTCTTTCGTTGCTCAAGACTTCACTTAGCCACACTGGCAGGGGTCTTCCTTGTAAGACCTTCCCATGCCACCTGTAATTATCCAAAAACCTGGGATATTATTCATTTCAGA
CCCATCAGTTCAGCATTAAGTACAGAGAGGACAAGAGAGGCTCATCAGTCCACTGCTGCTATACTCCAGTTCCTGCCACATGGTGGCACTGTTGAATGCCAGTCCTGTGCAGCCTCATGG
TTATGTGCTTTTTTGGGTTCAAAACCTTGCACCATGTCCCTGCTTGGGTCTCAAGAGCACTACTGTGACGGTTTTCCATCATATGGTTAGCTGCTTCTCCCAAAGCATGACTATTCTACC
AGGATAGCTGAGTCTTGCCAACTTTGCTGAAATCATGCTTGCCCAGTGTCAGTGAGTGATGATTCCAAATTACGGTTGACAGATCACTCCCTCTAACTTCCCTTTTGGTGGATTTTCTTT
AGGGGTACTTGATATTTTTTCCTGCCAGAGGAACCCAGTCAGCCATCTTAATCCAAGTAATTTCCATTGATCTTGAACCTTCAACATCAGGGCTCACATCTTGATGCATTGTGAAGAGAT
GCCTTTACCAGAACTCAAAAAATTCTATTCCTTTCTGTGAGGGCAATTGGGTGACAACTCATTTGACACTGACATAATTAAGGAAGACCTCTCAATACTTATTCTAAGGATGACTTGTCT
TTATGCCAGACATAGAAAGATTGGCATCATTTAAAATAGGTTGATACACCTATTTTAAGGGGAGCAAGCAAGCCTGCTAGTCCTTTTGGACTCTTGGCATCTGATAAGGGTAGTGGGTGG
GGAGAGTCTATGATGGCTGATAGGTGGTGGTGTGGTGGCACATGTCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATTGCTTGAGCTCAGGAGTTCAAGTCCAACCTGGATGA
CATGGTGAGACACCTTGTCTACAAAAGAATACAAAAGTTAGCTGGGTGAAGTTGTGTGTGCCTGTAGTCCCAGCTACTCAGGACGCTAAGGTGGGTGGATCAATTGAGCCCAGGAGGTCA
AGGCTGTAGTGAGCCATAATTGCGGCACTGCACTCTGGCCTGGGCAATAGAGTGAGACCCTGTCTCAAAAATAAATAAATAAATAAATAAATAAATAAATAAATAGGTTGGCAAACTACA
GCCTGCAGGCCAAATTCGGCCCACCTCTCCATTTTTGTAAATAAAGTTTTATTGGAACACAGCCACACACATTTGTTTATGTGTCATCTATGACTGCTTTCCCGCTACAATAGCAAAAAC
TGAATAGATGTGACATGTCTGTATGGTCTGCAAAGCCTAAAATGTTTACTATCTGGCTTTTTCCAGAAAAAGCTTGCTGACCTTTGATTTAAAAAATTCTACCCACTGGTTCTCTGCCAT
GCTTACATTATCTCTTGCAGTCCTCAGATTCACTGTATCAAATGGGTATCATTTGCCCCTTTTTAAAGATGGAGAAATTGACACCCAGAGAGATGAGATGACTTATCTGTATTCACACAG
CTAGCAAATAGCATAGTCAGTTGCAAACACAGGCTACCTTGACTCAGGGCAAGGGAGTTCATGTTTGTTTTTTTCTGTTTCTTTATTTCTTTTTGGTGAAATGTTTCATTATGGAAAAAT
TGCAAAGATACACAAAAGTTGAGAGAAAAGCAGAATGAACTATGTACCCATCTTTCAGTTTCAACATTTACCCACGGTTTCTTCATCTTATTTCATTTCTCCCCTCTCATATTTTTATAA
AGTATTTTAAATCAAATTCTAAAAATCATGCCACTTAAAATTCTAAAAATCATGCCACTTCACCCATAAATACTTCTAGGGTCTTTTGGTAGAGAGTGGGTTACTTGGTGGTGGTGGGGA
GGTGGTCCTGAGGAGGCCACTCTGGGCTTCCTGCTTGGGCCAGTTTGCCTGGTGAGCCCAGATGTCCCCAGGGCAGCAAGATCCAGACAGGACAAGCTACTGCTGTTTCCTACCCCCCAA
CCAGGGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTTGCACAACACTTC
ACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAGGCCTTGGGCCATGGG
GTGAGTACCTAGGAGGGGCTCAGGACTGCTCTGGACCTAATTTGGCACGCGTAT
GTCATCGACAGTGGGCCGGCACCCTGGTGACCCGAGGGAACCCCTCTCTGTCCACAGGTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAGTATCAACTGCGGCTCTTTAAG
GATGGGAAACTCAAGTACCAG
GTAGTGCTGGGCCAGGGGGTAGGGCAGAGGGAGGGGTCTCCCATGGTCTTCCCTGGCAAAGACTGCTTGGGGCGGGGGTCTGGGTCATGTCCTGAGAGG
GCCAACCACGGGAGTGGGAGGCTTGTGCCAGGAGCGACAGTATACGCTGGGAGGAGGCAGCAGGTATGAGAAGCCAGGGAGGAGCAGACGTGGCCTCCCATGTCAGCCAGGGAGGTGGAT
TTTGGAGCTCAACAGGAGATGAACATTTGTAGTCTGCATTTATTTATTTAATTATTCCACAATATTAATTGGACACGAGAATATACCTGGCACAGGTGATTGAGTAGTGACCATTATAGT
TAATGGGCCCTGCCCTCAAAAAACCACAGGCAAACTCTAGGATGCTCATTCTATGAGGGCTTTTGTTTAAATCAGAACGGTCCAACAGAAATATAATGTGAACCACATACATAGTTAAAA
TTTTCTAATGTCCATATTAAAAGAGGGAAAAAGAAACAGGTGAAATGATTTTAAGAATACATTTTACTTAATGCAATATGTCCAGGTAATTAGCATTTTAGCATGTAATCAATACATTAT
TAATAAAATATGTCACATTCTTTTTTCATGCTGAAGCTTCAAAATCTGGTGTATATTTCACACTCACAGGACATCTCAATTTGGATGCTCCGTTTTCACTGGAGTGATCTAATCTGTATT
AAGATTTCATAAAATGTACAGCTGAATAAGTAGAGTGACATGTCCGACTTGTTCCACGCATACTTAAAGGTTTTCCAATAGCTGAAGTATCAGTTTTAAAATTCAAATAGAAATTAAGAT
AAACCTAAATAAAATAAATTAAGTAACATTCAGTTCTTCATTCACACTAGCCAAATTTCTAGTGCTCAGTAGCCACACGTGGCTAGTGGCTACCATATTGGATCGTACAAATCTTAGGCA
AGCACCAAAACAAAGTTTGATGCTGAATCTTTAGGGCTAAGAACAGTACTTGGCATGTAGTAGTCTCTTGGCATGTATTTACTGAATGAATGAAGAAGCTGCCATATAATTAGGTACACT
TGTAGCTGCCACCAAGGAGAAGCTGTGAGTGCCACTAGAGTGTTTGGATGATGGGTAAAACTTCCCTAGGAAGTTACAAATAAACCCAGAGTTGCATAAAGGATGAGGAGGAGTTAGGGA
TGCTAAGAATGGGAGAGGGCTTTCCAGGTAGAGGGTTTAGCAAGTACAAAAGCTTAGAGGTGGAGAACAGCTTGGTGACTTGGAGGGAGTGTAAAAATGGGAGCGTTTGCTGAGCCTAGT
GATGGAGCGTAAGAATGACTTTACAGGAAGGTGGAGATGTCTGTGGGGACCGTGTTAAGGAATTCTACTTTCTGCCAAGAGCAGAGAGAGTATTTGGAAAGGTTTTAAGTCAGCTCATGA
TGCAAGATTTGTTTTTTGTTTTTTTTTTTTAAGTTTTTTTTCTCTTTTTTGCTTGGGTGCAGTGCAGAGACTGGCCTGTCTGGGGATAGGAGTGGAAATAGGGAGCCCACTTAGGGAAAG
GAGTCAGAGTGGAAGATGAGTGGGGGTTGCTTGAGGTAGTGGTGGTAGAGATGGAGAGCAGGGGACTGATTTTAAGAGACATTTTGGAGGTGGAATCAACAGCCTTGGCTGAGAGGAACT
GGCAGCTGGAGGCAGGAGTGGGAGGGAGTTGGTTGTGGGCAGCTGCGGGTGACCCCCAACCCCAGGTTGCCAGGTGGCCCCATCCCACAGGTGCTGGACGGAGAAATGTACCCGCCCTCG
GTAGAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCCACGCTCTGG
CTACGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
GTGAGGATTCCAGACCTGCCCTG
CCCTGGAAGGTCATTCCCTCCATCCTGAGAAGTTGGGGGCGGGGGGTACTTAGAGGTGGAGGCTGGGATTAGAATCCTCACCCTTCTGCTTAGTGGCTAGAAGACCCTGAGCAGGCCCCT
CAACCTCTGTGAGCCTCGGTCTCCAAATCTGTACATTGGGGTGAACGATGATTGTAAGGATTTATTGAGATCATGAATGGGAAGGCAACTAGCACATAGCAGGCCTTCAGAAAATAGATG
AGTGTGATGGTTAATTATTAGCTGGGTGACTCAGCACATTTGATTAGCCACCCTGAGTCTCGGTTTCCCCCGTGTGCTGCGCCCGGATGCATACTTGTGGTCCCCAGCACTTGCAGGGTC
AAACTGTAGCTTGCCCGGCCTTCAGGAATCTTCATGTTCCCTTCCCTGCATGTATCTACCTTCCTGCAGTTTGGTAGCTTCTAGGTGACTCAGGGACAGGATATTTTTGTGTTCCCTATG
GGGGCGAGTCTGCAACCTAAAATGTCAGATGGTTTCCTGCCTGGGAGCTTGGCCCCTGACATCCCTGTCCAGACCATGTTCGCTCTGAGTCACCAGCAACTCCCTTCCCCCACCTCTGGC
ACCACTGGGCATGGCTGGTCCCAATTATAGAGCCTAATTCACTGGAGCCATGAAGAGCCAGGCATTGGGAATAGGAACAGTCATTAGAAGGAAGAGGGGCTGGTTCTGAAGTTTCAGCGT
TGCAAAGACCTTGACCTGAGAGAGCTGGAGGCTGTCCAGCACACTGGCTGGAGATGAAGCTGTGACCAAAGGCAGGACCCTAGGGCACCATTTAACTCCCCTACAACCTCATGAGCCGGG
TATGATTACCCTGTGAAAATCAAGATTCAGAGAGGTGAAGTGAATTCTCCAGGGACACTTAGCAGATGGAGACTTGAGGTCTGGTTGCCACCAAAATTTATGCTACTTCCACTCCATCAC
AAAGGGGGCTCTTCTTGAATGGGAAGGGGTTGCAAACCTGAGTCTGATCTGTGACATGTGAGTATTGGAAAGGGATTCCCCCTCGCGTCTACACTTAGTATTCCTACTTTTGGCTGACAT
ATATGGACACCCCGTCTTATGCCAGGCACTGTGCCAGCAGTTTTGCTGTATTCATTGTCTACTTCTCTCAACAACCCTAATATAGTATTGCATTGATGTTATTATTATTATTATTTGAGA
TGGAGTCTTGTTCTGTTGCCCACACAGTAGTGCAATGGCGTGATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCGAGCAATTCTCGTGTCTCAGCCTCCCGAGTAGCTGGGATTACA
GGTGCCCGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTGCCATGTTGGCCAGGCTGATCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTTGGCATCCCAA
AGTTCTGGGTTTATAGGCATGAGCCACAGCGCCCGGCCGCACTGATATTATTATAACACACATTTTACGTTAAGGAAAGTGAGATTCTGAGAGATTAAGTAACTCAGCCCAAGCTCAGGT
GGCTGGAAATGGTAAAGCCAAGATTTGAATCCAGGTCTGCTGATCCCACAGTCTGTAGCTCCAGGTCAATTTCCAAAAGCCAATTTGTCTAATGGCTAATTGGCCTGAATATCAGTTTCT
TTCAATATCTAGTTGCCCAGTTTTAATTTTGTTACAGAATGTTCAATTTGCATTTTCCCTGGCTTTTTGCTTTCTAGCTGGTTATAGCTACAAATGGGGCAGAGGAAAAACTTATTTATA
AGAATCCTGTTATATAAGAACATATAGGAAATATGTTTTTTGAAATATATTTAGGGCTAGGCTCCCCTTCTGTCCTCAGTATTCTCTTTTGCCACTGCAGCAGCTCTGAGTGGTCTCCTT
GAAGTCCCCCTCTATCACCGAGGGTGTCAGCATGATGACAGCTCTCACCAGTAAATCCTCCACTTTTCCATCTTTTGTAGTCTCTCCACCTTCTTTTAATGGGCTTCAGGAGGGAGGTCA
TAAGACCAATTCTTGGACACCTCCTTTGCATGTCCTGCTTAGCTGGGCCTAGAACCTCCCTCTGAAATGTGGGAGTAGCTGGTGCTCTTGTCTTGAGACCTCAGTCAAAATAGACCAAAA
GTTCTATTTTCACATCCTGTTACAAAGAGACAAAATGGAAGAGGCCAAACAAAATTAAACATCAACCGCAAGGCCAGGTGTGGTGGCTCACACTTGCAATTCCAGGGCTTTTGGGAGGCT
GAGGTGAGAGGAGTGCTTGAGACTTGCAGTTCAAGACCAGCCTGGATAACATAGTGAGACCCCATCTCTTAAAAAAAAAAAAAAGAAAGAAAGAAAGCTGGGTGTGGTGGTACACCTGTG
GTCCCAGCTACTTGGGAGGCTGAGGTGGGAGGATTGCTTGAGCCCGGGAAAGTTCAGGCTGAAGTGAGCTGTAATTATACCATTGCGCTCCAGCTTGGGTGACAGACCAAGACCTTGTCT
GTAAAAATAAAAATAAACATCAACAGCAACAACAACAATAAAGAGACCAAAAGCAAGCACCTCTCATGAACTGGCCTGCTTTCCAAGCTGGGCATCTAAATCACTGTGCCTGGCTGACCC
TATTTCCAATCCTGCCCTGCCCAGGGGAGACCATCAGGATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGCTGTTCGGTGTCC
AGTTCCAATACCGCAACCGCATTGCCATGGAGTTCAACCATCTCTACCACTGGCACCCCCTCATGCCTGACTCCTTCAAGATGGGCTCCCAGGAGTACAGCTACGAGCAGTTCTTGTTCA
ACACCTCCATGTTGGTGGACTATGGGGTCGAGGCCCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGG
GTAAGCCCCAGAGGAGTGCTGGTGAGGGCAGGTGGGCTGAGGGATCCAG
CAGACCTGGGTCCAAATTCCAGGTTCTTCTTCTGTAAAATGGGGCTGATGTCACTTCTACAGGGCAGTTGTAAGCATTCCTGCGTGAATTCATTGGTTCATTTGTCCATTCCACAATACC
AGACATTACTCCAGGTACTGGAGATGTAGTGGGAACAAGACTTTTGTGGTTCTTGGCTCATCTTTTAGTGCTCACACCCTAGAAAGTGATGGCAGTCATACGACAGCTAACAGCATTAGG
GCCCTTACTGCGTGTCAGGCACTGTTCTAAGAGCTTCTCCTATGTTATCAGAATTCTAAGAGTATGTTATAAACCATAGTAGAAAAGCCCACCATGCTCTGGGAGTCAGGGAGGGACATC
TGACCCAGAGTTGCGGGTTATGGGAAAAGAAGGGATGTCGGAGCAGAAATTGGAAGGAGGGATAGAGATCTCCCAGGGTAAGAGGTGGTGTTAGTGGCAGGGGATGGCTGTGTTCCAGAT
AGAGATGACGGCATGGGTGAAAGGCATGGAAGTCAGAGGACATGGCAGTTTGAGGAACAGAAGGACATTCAGGGCATGGTAATGTATGTAACAGTGCCTGCCTATGGTGTTTATTAAATC
ATAAGCCTCCGCTCTGGGCTGAATTGTGTCTCTTTTAAAGTTCGTATGTTGAAATCCTAACCGCCAGAACCTTAGTATGTGACTGCATTTGGAGACAAGGTCTTTAAAGAGGTAATTAAG
TTTAATTGAGGTCATTAGGCTGGGCCCTAATCCAGTGTGACTGGTGTGCTTATAAGAAGAAGAGATTAGGTCACACACACAGAGGGAAGGCCACATGAAGATGCAGGGAGAAGAAAGCCA
TCTACAAGCCAAGGAAAGAGGCCTCAGGAGATACCAACCGTGCTGACACCTTGATCTTGAACCTCCGGTCTCCAGACGGAGGAAATAATTTCTATTGTTTGAGCCACTCAGTCTGTGGTA
CATTGTCATGGCAGCCCTAGCAAACAAACACATTCTCCTTCCCTGGAATTCCCAGCCAACGCCTTCCTCAATCTCCCCTTCTCCACATTCAGAAGCTCCCATCTGCTTCATCGCAGTCTC
TGGCTCCCCTGTTGCCTCACAGTCCTCTGCTTCTCTCTAATCCTTGTCCCTAAACCCTGTCATGAAGCTGTGGCACACATGGATTTCCATTTCCTTCTGGTAATTTGACTGAAATTAGCA
TTTGCTGCCCCGGTGGGCAGCTGCTGGCTGCTTTATGGCCTCTTTGTCGGTTTCTTTATGGTTCTTTGTGGGGACACAAGACATGAACAGAGACAATAGCCTTTGTGTGAGGCTGGATGG
TTTTCAGAACGTTTTCAAGGAATGACCATGATGATGTACGTGAAAAGCCCCGGCATCGTACCTGGCACAAGGCAGAAAGGCCGCAGAGATGTATGGACTGTCAAGATTTTTTTTCTTTTT
TCTTTTCTTTTTTAATAGAGATGGGGTTTTGCCATGTTGCCCAAGCTGGTCTTGAACTCCTGGGCTCAAGTGATCTGCCCGCCTAGGCCTCTCAAGGTGCTGGGATTATAGGCGACTCTC
AGGATATTAAGAAGAGTGAGTGATGATAAGACAGGGCTTCCCCTGATAACCACTGTCCATGGCTACCCTCTCAGGGGTTCCCATGTGACCAATTCTGAGATAACAGCTTTGCATAGTTTA
TCCCATTTAAATTGACCACAGCCATATCAGGTAGATGCTCTTCCCTTCCCCATTTTTACATATGCGGGAACTCAAACTTAGCTTGAATAGCTGCCCAAGGTCCCTATATTAGTTTCCTCA
GGCTGCTGCAACGAAGTACCACCAACTGGGAAGCTCGGAGCAACAGATGGAGGCTAGAAGTCTGAATTCAAGGTGTCGGCAGTGCCATGTTCTCTCTCAAGGCTCTATGCCTCCTTGGCA
TCTGGTGGTGGCCGACAGTCCCTGGCATTCCTCAGCTTGCAGAGGCATCGCTCCAGTCTCTGCCTTTATCATCCTGTGGTGCTCTCCCTGAACTGGTCTTTCCTCATTTTATGAAGACAC
CAGTTATTGGATTGGAGCCCACCGTAATCCAGTATGAGCTCATCTTAACTTGATTACTTTTGCAAAGACTTCATTTCCAAAGAAGGATGCATTCACGGATGCAGAGAGTTAGGGCTTCAA
CACATATTTAATATTTTAGGGAACACAGTTCAACCCTCAACAGCCCCACAGCTTGTAAAGCTGTAACTGGCACCATCCTCTGCTTGCTTTGCTCATATGATTTCATCCAACCATGGCTTT
CTTTTTTGCCCAATTATAGTTGTTGATCAAAATGACTCTCTTAAGCATGAACGTTATTATTTCATCCTAAGCACCAGAGCTTTCTTTTCTTTCCTTTTTTTTTTTTTTTTTTTTTTTTTT
GAGATACAGTCTTGCTCTCTTGCCTGGGCTGGAGTGCAGTGGTGTGATTTTGGCTCACTGTAACCTCTATCTCCTGGGTTCAAGTGATTCACCTGCCTCAGCCTCCCAAATAGCTGGGAT
TACAGGCACCTGCCACCACGCTCAGCTAATTTTTGTATTTTTAGTAGAGACGAGTCTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGGACTCAAGTGATCCACCCGCCTCGGCCTC
CTAAGGTGCTGGGATTACAGGCACGGGCCACTGTGCCTGGTCAGCACCAGAGCTTTCTTTAGACTAATGCGCCTTAACTGTTAGTAATCAGAATCTTTTTCTGCAGCCTTTTTATCTTGC
CAGGTTACCTGGGCTTTGAGGTCTATTTTCCCTCCCTTCACAGGGGGTTACACTACCTCTTGGTGTAATTCAACGGACTGGGGAGCAGTGTGAGCCACAAAAGAGGTCCTCTTGGCCCAT
TCCACACTTCAAAGATGAGGAAGCTGCCGGATGTGGTGGCTCATGCCTGTAATCCCGGCAGTTTGGGAGGCCGTGGTGGGAGGATCATTTGAGACCAGGAGTTTGAGACCAGCCTGGGCA
ATATAGTGAGTCCTCACCTCTACAAATATAGAATAAAACAAAATTAGCCAGGCATGGTGGCACGTGCCTGTAGTCCTAGCTACTTGAGAAGCTGAGGTGGGAGGATCGCTTGAGCCTGAG
GAAGTTGAGGCTACACTGAGTCAAGATCGTGCCACTGCACTCCAGCCAGGGCAACAGAGTGAGACAGTGTCTAAAAAAAAACCCACAAAAAAAAATGAGGAAGCTGAGACTCAGAGGGGC
TTCCTGAGAGAGCATGACAGCAGCAGGCCCGGGGCTGGTCTGCAGCATCACAACTGCTAGGCTGCCCAACACTCTCCATCCTAGCTCAGAAGGGACTCCCACTGGAAGCTCTTGTCCCAG
GAACTTACCCAGGCTCCAGGACAGCCTGGCCTGGCTCCCAGACCACTGCTGTGCTTCTCTCCCGGCAGATCGGTGGGGGCAGGAACATGGACCACCATGTCCTGCACGTGGCTGTGGATG
TCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCCTTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTAG
GTGAGCAGCTGTTTCCTGGATGCA
GTCCCTGCCCTTGAGGGACTGGCAGCAAAGTCAGGGAGACATCAAGGAAATAGAACGGGACAATACATGCGGCAATGTGTAACAACCAGACTTATAATGGGCGTGGAAGTGCTGTGCCAG
GGTGGTAAATAAGCCTGCTTGGGGAGAGAAGGTGACTTTTCAGCTGGGTTTGGAGAACAAATGGCATTTTCAGTGGGAGAAGAGAGGGAGGAGTGTTTTAGGCGGAGCAATAGAAAGTAC
AAAGGCTGCCGGGCAAGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTCAGGCGGGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGATGAAACCCTGTCTC
TACTAAAAATACAAAAAATTAGCCGGGCATGGAGGCAGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCTGGGAGGCAGAGCTTGCAGTGAGCCGAGA
TTGCGCCACTGCTCTCCAGCCTGGGCGACAGAGCGAGACTCCATCTCAAAAAAAAGAAAGTACAAAGGCATAGAGGTCACACAGCAGGTATCTAGGGAAACAGTCATAGATGTGGCTGGG
GCAAGGAGCTTATGTTGTGGGGATGATTGGTGATGGGAGGTTGGGGCCAAACCGTGTAGAATCTTGGTTTTCATACTAAATATTTTGAGTTTTATTTGGTCTAGTGCTTTCTAAGTACCC
ACTTATGGGCCAGCTGCATAAAAATCATTTAGAAAGCTGATTAAGATACCAAATCTCAGATGCCTCTCCTACAGATTCTGATTCAACCAGCCTGGGTAGTGCCCAAGAATCTGCGTATTC
ATATGAATAAAGTATATATACATATATATGAATATATATATGAATAGAGTATATATACATATATATCAATATATACATATACATATCAATACATATATACATATACACACACATATATAT
GTGTGTGTGTATATATATAAATATATATGTGTGTGTGTGTATATATACATATATATATATATTTCCCCCAATATCCTCAGATGTTGGGCAGCCAAGTTGCCAGCCCTTGTCACATACAAT
GGGGAGCCACTGAAGCAGGGTGTGACATGGCCCATGAGAGTTCCCAATGGAGGATGGATTTGAGGAACTGGGTAGGGAAGCAGGGAAGCTCCATATTTGTCTCCCCTTGGCCACATGAAC
ATGTGGGTATTGCAGAGTGGAGCAGATTCTGTGCATGAGCTCCTTGTGATCCTGGAACAGCATCTTATTCTTTACTCTCCCATGACAAATGGTCCCCAGGGGCAGAAGGAACACTGCCAC
TGAATTTCTGTGTGAGCTCGGACATGTTACATCTTTGAGCTCTAGTAATAAAAGGGCTTGGCCCAGGCTAAGGTCTTGAAATCTGTCTGTGAAGGAAGCCAGATGGGGAGCATTCTCCCT
TCTGTGATGATCAGGTAATTAGGGCCCAGAGTTGCTCAAGGTTATAGGCTGTTTGGTGGCAGATCTAGGCCCCTGACTTTCTCTTTAGTAGCATTTTCCTTCCCTAGACCCAGTCCCTGA
GGAGGGGCAATTTGTGCTTTTCCTCTTGACCCTTTTCCCCAGTGCCAACCAGGCCAAATTCTAGGGCACATGCTCAGTTGCTCAAGTTAGTCTCCTGGAGTCCCTATTATCCCCAGAAAA
AGGTGGACCTGGAAGGGTCCCGCCCCAGGTTGACCTTAATGGCATCATGGATCTGACGCTAGCGTTTCCCCTTATCTCCTTGTAGGAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTG
TATGGAGACATTGATGCGTTGGAGTTCTACCCCGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATGATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTC
CTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAGCCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATCGTCAAGACGGCCACACTGAAGAAGCTGGTCTGCCTCAACACCAAGACC
TGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGAAACCGAGGCTCATGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGC
CCCAACTGCACCATCC
CTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCTCTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACC
TTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAG
TGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTTCATCAGCTGGGAGTCTTTCTCCAACGTGAGC
TATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACACCCATGGGAACCAAAG
GGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAG
TTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTTGCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAGGCCTTGGGCCAT
GGG
GTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAGTATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAGGTGCTGGACGGAGAAATGTACCCGCCCTCGGTA
GAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCCACGCTCTGGCTA
CGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAG
GGGAGACCATCAGGATTGTCATCGAG
GAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGCTGTTCGGTGTCCAGTTCCAATACCGCAACCGCATTGCCATGGAGTTCAACCATCTCTACCAC
TGGCACCCCCTCATGCCTGACTCCTTCAAGATGGGCTCCCAGGAGTACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTATGGGGTCGAGGCCCTGGTGGATGCCTTC
TCTCGCCAGATTGCTGGCCGG
ATCGGTGGGGGCAGGAACATGGACCACCATGTCCTGCACGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAGCCCTTCAATGAGTAC
CGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTAG
GAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTGGAGTTCTACCCCGGA
CTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATGATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAG
CCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATCGTCAAGACGGCCACACTGAAGAAGCTGGTCTGCCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGT
CAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA
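The relationship between the CDS and the protein sequence can be illustrated by translating the first 120 nucleotides of the CDS. The Python sketch below assumes that the standard genetic code applies; the helper name translate and the compact way of building the codon table are choices made for this example only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 120 nucleotides of the CDS, copied from the first sequence line above.
CDS_START = (
    "ATGAGGAAACCGAGGCTCATGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATC"
    "TGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGC"
)

peptide = translate(CDS_START)
print(peptide)
```

The resulting 40 residues match the beginning of the protein sequence listed in this entry (MRKPRLMNPC...), which is a simple way to confirm that the CDS and protein records are in the same reading frame.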
