Entry information : PabeTPO
Entry ID 6592
Creation 2009-02-17 (Christophe Dunand)
Last sequence changes 2011-01-26 (Christophe Dunand)
Sequence status partial
Reviewer Not yet reviewed
Last annotation changes 2011-01-26 (Christophe Dunand)
Peroxidase information: PabeTPO
Name PabeTPO
Class Thyroid peroxidase     [Orthogroup: TPO001]*
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pongo
Organism Pongo abelii (Sumatran orangutan)    [TaxId: 9601 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value PabeTPO
start..stop
S start..stop
PpyTPO 1655 0 1..795 1..795
HsTPO01 1569 0 1..795 1..795
GgoTPO 1541 0 1..795 1..793
MmulTPO01 1528 0 1..795 1..795
Gene structure Fichier Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 90582..90675 94 N° 2 82898..82982 85 N° 3 74256..74425 170 N° 4 71356..71488 133
N° 5 44644..44773 130 N° 6 42182..42388 207 N° 7 20824..21342 519 N° 8 12940..13198 259
N° 9 9790..9960 171 N° 10 2672..2909 238 N° 11 589..797 209 N° 12 1..191 191
complement(join(1..191,589..797,2672..2909,9790..9960,12940..13198,20824..21342, 42182..42388,44644..44773,71356..71488,74256..74425,82898..82982,90582..90675))


exon

Literature and cross-references PabeTPO
DNA ref. GenBank:   NC_012592.1 (111523636..111432037) [Fragment]
mRNA ref. GenBank:   XM_002812340.1 [Fragment]
Protein sequence: PabeTPO
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   801 (783)
PWM (Da):   %s   88815.19 (86955.2)  
PI (pH):   %s   6.29 (6.29) Peptide Signal:   %s   cut: 19 range:19-801
Sequence
Send to BLAST
Send to Peroxiscan
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MRALAVLSVT LVMACTEAFF PFISRGKELL WGKPEESHVA SILEESKRLV DTAMYATMQR NLKKRGILSP AQLLSFSKLP EPTSGVIARA AEIMETSIQA MKRNVNLKTQ QSQHPTDALS  EDLLSIIANM SGCLPYMLPP KCPNTCLANK YRLITGACNN RDHPRWGASN TALARWLPPV YEDGFSQPRG WNPGFLYNGF PLPPVREVTR HVIQVSNEVV TDDDHYSDLL MAWGQYIDHD  IAFTPQSTSK AAFGGGADCQ MTCENQNPCF PIQLPEEARP AAGTACLPFY RSSAACGTGD QGALFGNLST ANPRQQMNGL TSFLDASTVY GSSPALERQL RNWTSAEGLL RVHASLRDSG  RAYLPFAPPR APAACAPEPG VPGETRGPCF LAGDGRASEV PSLTALHTLW LREHNRLAAA LKTLNAHWSA DAVYQEARKV VGALHQIITL RDYIPRILGP EAFQQYVGPY EGYDSTANPT  VSNVFSTAAF RFGHATIHPL VRRLDASFQE HPDLPGLWLH QAFFSPWTLL RGGGLDPLIR GLLARPAKLQ VQDQLMNEEL TERLFVLSNS STLDLASINL QRGRDHGLPG YNEWREFCGL  PRLETPADLS TAIASRSVAD KILDLYKHPD NIDVWLGGLA ENFLPRARTG PLFACLIGKQ MKALRDGDWF WWENSHVFTD AQRHELEKHS LSRVICDNTG LTRVPVDAFQ VGKFPEDFES 
CDSIPGMNLE AWRETFPQDD KCGFPENVEN GDFVHCEESG RRMLVYSCWH GYELQGREQL TCTQEGWDFQ PPLCKGQSFL Q 

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 2a, 12 introns). 3'end is missing
CDS
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGAGAGCGC TTGCTGTCCT GTCTGTCACG CTGGTTATGG CCTGCACAGA AGCCTTCTTC CCTTTCATCT CGAGAGGGAA AGAACTCCTT TGGGGAAAGC CTGAGGAGTC TCATGTCGCC  AGCATCTTGG AGGAAAGCAA GCGCCTGGTG GACACCGCCA TGTACGCCAC GATGCAGAGA AACCTCAAAA AAAGAGGAAT CCTTTCTCCA GCTCAGCTTC TGTCTTTTTC CAAACTTCCT  GAGCCAACAA GTGGAGTGAT TGCCCGAGCA GCAGAGATAA TGGAAACATC AATACAAGCG ATGAAAAGAA ACGTCAACCT GAAAACTCAA CAATCACAGC ATCCAACGGA TGCTTTATCA  GAAGATCTGC TGAGCATCAT TGCAAACATG TCCGGATGTC TCCCTTACAT GCTGCCCCCA AAATGCCCAA ACACTTGCCT GGCAAACAAA TACAGGCTCA TCACGGGAGC TTGCAACAAC  AGAGACCACC CCAGATGGGG CGCCTCCAAC ACGGCCCTGG CACGATGGCT CCCTCCAGTC TACGAGGATG GCTTCAGTCA GCCCCGAGGC TGGAACCCCG GCTTCTTGTA CAACGGGTTC  CCACTGCCCC CGGTCCGGGA GGTGACAAGA CATGTCATTC AAGTTTCAAA TGAGGTTGTC ACAGATGATG ACCACTATTC TGACCTCCTG ATGGCATGGG GACAATACAT CGACCACGAC  ATCGCGTTCA CACCACAGAG CACCAGCAAA GCTGCCTTCG GGGGAGGGGC TGACTGCCAG ATGACTTGTG AGAACCAAAA CCCATGTTTT CCCATACAAC TCCCGGAGGA GGCTCGGCCG  GCCGCGGGCA CCGCCTGTCT GCCCTTCTAC CGCTCTTCGG CCGCCTGCGG CACCGGGGAC CAAGGCGCGC TCTTTGGGAA CCTGTCCACG GCCAACCCGC GGCAGCAGAT GAACGGGTTG  ACCTCATTCC TGGACGCGTC CACCGTGTAT GGCAGCTCCC CGGCCCTAGA GAGGCAGCTG CGGAACTGGA CCAGTGCCGA AGGGCTGCTC CGCGTCCACG CGAGCCTCCG GGACTCCGGC  CGCGCCTACC TGCCCTTCGC GCCGCCACGC GCGCCTGCGG CCTGTGCGCC CGAGCCCGGC GTCCCCGGAG AGACCCGCGG GCCCTGCTTC CTGGCCGGAG ACGGCCGCGC CAGCGAGGTC  CCCTCCCTGA CGGCGCTGCA CACGCTGTGG CTGCGCGAGC ACAACCGCCT GGCCGCGGCG CTCAAGACCC TCAATGCGCA CTGGAGCGCG GACGCCGTGT ACCAGGAGGC GCGCAAGGTC  GTGGGCGCTC TGCACCAGAT CATCACCCTG AGGGATTACA TCCCCAGAAT CCTGGGACCC GAGGCCTTCC AGCAGTACGT GGGTCCCTAT GAAGGCTATG ACTCCACGGC CAACCCCACT  GTGTCCAACG TGTTCTCCAC AGCCGCCTTC CGCTTCGGCC ACGCCACGAT CCACCCACTG GTGAGGAGGC TGGACGCCAG CTTCCAAGAG CACCCCGACC TGCCCGGGCT GTGGCTGCAC  CAGGCCTTCT TCAGCCCATG GACACTCCTC CGTGGAGGTG GTTTGGACCC ACTAATACGA GGCCTTCTTG CAAGACCAGC CAAACTGCAG GTGCAGGATC AGCTCATGAA CGAGGAGCTG  ACGGAAAGGC TGTTTGTGCT GTCCAATTCT AGCACCTTGG ATCTGGCGTC CATCAACCTG CAGAGGGGCC GGGACCACGG GCTGCCAGGT TACAATGAGT GGAGGGAGTT CTGTGGCCTG  CCTCGCCTGG AGACCCCCGC TGACCTGAGC ACAGCCATTG CCAGCAGGAG CGTGGCCGAC AAGATCCTGG ACTTGTACAA GCATCCTGAC AACATCGATG TCTGGCTGGG AGGCTTAGCT  GAAAACTTCC TCCCCAGGGC TCGGACAGGG CCCCTGTTTG CCTGTCTCAT TGGGAAGCAG ATGAAGGCTC TGCGGGACGG TGACTGGTTT TGGTGGGAGA ACAGCCACGT CTTCACGGAT  GCACAGAGGC ATGAGCTGGA GAAGCACTCC CTGTCTCGGG TCATCTGTGA CAACACCGGC CTCACCAGGG TGCCCGTGGA CGCCTTCCAA GTTGGAAAAT TCCCCGAAGA CTTTGAGTCT  TGTGACAGCA TCCCTGGCAT GAACCTGGAG GCCTGGAGGG AAACCTTTCC TCAAGACGAC AAGTGTGGCT TCCCAGAGAA CGTGGAGAAT GGGGACTTTG TGCACTGTGA GGAGTCTGGG  AGGCGCATGC TGGTGTATTC CTGCTGGCAT GGGTATGAGC TCCAAGGCCG GGAGCAGCTC ACTTGCACCC AGGAAGGATG GGATTTCCAG CCTCCCCTCT GCAAAGGTCA GTCCTTTCTT 
CAATGA 

Retrieve as FASTA