Entry information : OcarPxDo
Entry ID 7249
Creation 2010-02-23 (Marcel Zamocky)
Last sequence changes 2010-11-17 (Marcel Zamocky)
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2010-11-18 (Christophe Dunand)
Peroxidase information: OcarPxDo
Name OcarPxDo
Class Short peroxidockerin    [Orthogroup: PxDo001]
Taxonomy Bacteria Proteobacteria Alphaproteobacteria Rhodobacteraceae Octadecabacter
Organism Octadecabacter arcticus    [TaxId: 53946 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value OcarPxDo
start..stop
S start..stop
PlmaPxDo01 311 3e-93 46..581 144..678
BmaPxDo01 283 3e-84 24..496 47..513
LaaPxDo 281 2e-83 20..588 61..636
LspPxDo01 272 2e-80 24..494 169..660
Literature and cross-references OcarPxDo
Literature Brinkhoff T., Wawrik B., Simon M., Ferriera S., Johnson J., Kravitz S., Beeson K., Sutton G., Rogers Y.-H., Friedman R., Frazier M., Venter J.C. Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases
Protein ref. GenBank:   EDY91945.1 UniProtKB:   B5K8K1
DNA ref. GenBank:   DS990628.1 (4664670..4667426)
Protein sequence: OcarPxDo
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   918
PWM (Da):   %s   98118.66  
PI (pH):   %s   3.94
Sequence
Send to BLAST
Send to Peroxiscan
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MGRGYGGLMA SSCPFLANIQ AQERLTGARY DDDISETFSG GADPVQVSMV VFDQDGDNPN SAGLSTLFTT FGQFLDHDMV LTPEDHDAGT LNLVGMPHDI ARSQVAEEIG DGETIAPTNA  VTWQIDGSQV YGSTEARMED LRSFEGGKLR MQDDTTSASD MLPDADEDSF MAGDISGDDP VYLAGDIRAN ENPNLLSLQT LFVREHNHWA DKLAQEHPDW SDEQLYDAAR SIVEYELQQI  TYNEWLPHLI GDAVGEDTGF DASVSGESSV ELSTAAFRFG HTLVSSSIDL VGEDGTDAGS VALMDAFFNH SAVENNGIEA IMRGQLSATA QELDTEIVDD LNFFLETPDG VSGFSLAALN  LARGLDHGLD SYIEVRAQLI GDIDPATLDP QDFSIITSDE DVQARLAAVY DDVFQVDLWV GGLAEDAIAG TQMGPLFTHI ITDQFTRTRA ADDTFSDLVS ALGDDIIAEV KASSFATIIA  RNTDVDMVQD DVFLASDRSL TEIEAVETSW RADVIDLAAK SLNGSLYTGS GDDILRLSGG TVISGNVHME QGNDTLIATS GVITGSVDMG LGDDTVTLTG TADVLGDVST YNGGGTVTLA  DMARVGGSVL TGHGDGTVVL SGRATIDGDL RTGHGQDNIS LGARTTVSGV VDAGKGDDII RLEAGANVEN INGGQGLDTL TRSENTRIEY DEDPTNGTVF YLDDAGNDTG ESVDFQSIER  ITCFTLGTLI ITERGKQAIE TLQVGDRVWT LDNGLRPIAW IGRATVAATG DLAPILIRKG AMHNARDLLV SPQHRMMLDG WRVEMHCGTD EVLAPAKALT NDQTIRRVEG GTVTYVHIAF 
DNHEIVMAEG IASESFSPGA EALNALDDAA RSEILTLFPQ WRCPVHRPTT ARPVVTTREL NDYRLTPVGS LFECNSRY 

Retrieve as FASTA  
Remarks Complete sequence from genomic, strain="238" intein mediated protein splicing (rare).
DNA
Send to BLAST
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
GTGGGGCGTG GTTATGGTGG ACTGATGGCA AGTAGCTGTC CTTTTCTTGC CAATATTCAA GCACAAGAAC GTCTCACTGG GGCGCGATAT GACGATGACA TCTCTGAGAC GTTCAGCGGT  GGCGCGGATC CGGTGCAGGT GTCGATGGTC GTGTTTGATC AAGACGGCGA CAACCCCAAC AGCGCCGGAT TAAGCACTCT GTTTACCACA TTCGGCCAGT TTCTCGATCA CGATATGGTT  TTGACGCCAG AGGATCACGA CGCGGGAACA CTGAACCTCG TCGGGATGCC GCATGACATC GCGCGCTCGC AAGTCGCAGA GGAGATCGGC GACGGAGAAA CCATTGCACC GACCAACGCG  GTGACTTGGC AGATCGACGG TAGCCAAGTT TACGGGTCAA CTGAGGCGCG GATGGAGGAT CTGCGTAGTT TTGAGGGTGG CAAACTGCGG ATGCAGGACG ATACGACATC CGCGTCCGAT  ATGCTGCCTG ATGCCGACGA AGACAGCTTT ATGGCGGGCG ACATCAGCGG CGATGATCCG GTGTATCTTG CCGGTGACAT CCGCGCCAAT GAGAACCCTA ACCTGCTGTC GTTGCAGACA  TTATTTGTGC GTGAACACAA TCATTGGGCG GATAAGCTTG CGCAGGAACA CCCCGATTGG TCCGACGAAC AACTTTACGA CGCCGCGCGA TCCATTGTCG AATACGAACT GCAACAGATC  ACCTACAACG AATGGTTGCC GCATCTGATT GGCGATGCTG TGGGGGAAGA TACAGGGTTT GACGCCAGTG TTTCGGGCGA ATCCTCGGTA GAACTTTCCA CCGCCGCGTT CCGTTTTGGT  CATACGCTTG TGTCGTCCAG CATTGATCTG GTTGGCGAAG ACGGAACCGA TGCAGGTTCG GTCGCTTTGA TGGACGCATT CTTCAACCAC TCGGCCGTGG AAAACAACGG CATCGAAGCG  ATCATGCGCG GCCAGTTGAG CGCGACTGCA CAAGAACTCG ACACTGAAAT TGTCGATGAT TTGAACTTTT TCCTTGAGAC ACCGGACGGC GTATCGGGTT TCTCCCTCGC TGCGCTGAAC  CTCGCGCGGG GCCTCGATCA TGGGCTCGAC AGCTATATCG AGGTGCGCGC GCAATTGATT GGTGACATTG ATCCTGCGAC GCTGGACCCG CAAGATTTCT CAATCATAAC TAGCGACGAA  GACGTGCAAG CGCGCCTTGC GGCGGTTTAT GACGATGTCT TTCAGGTCGA TCTGTGGGTC GGCGGACTGG CCGAAGACGC AATTGCGGGC ACCCAGATGG GCCCCCTGTT CACCCACATC  ATCACCGACC AATTCACCCG CACCCGCGCG GCGGACGACA CGTTCAGCGA TCTTGTTTCC GCATTGGGCG ACGACATCAT TGCGGAAGTG AAGGCCAGTT CTTTCGCCAC GATAATCGCG  CGTAACACCG ACGTGGACAT GGTGCAGGAC GACGTATTCC TCGCCTCTGA CCGTAGCCTG ACAGAGATTG AGGCGGTCGA GACCTCGTGG CGCGCAGACG TGATCGACTT GGCCGCAAAA  TCGCTGAACG GGTCGCTTTA TACTGGGTCC GGCGATGACA TCTTGAGGCT GTCCGGCGGC ACGGTGATCA GCGGCAATGT ACACATGGAA CAGGGCAACG ACACGCTGAT CGCGACCAGT  GGCGTCATCA CCGGCAGCGT CGATATGGGG CTTGGCGATG ACACCGTCAC GCTGACGGGC ACCGCAGATG TGCTGGGTGA CGTCAGCACT TACAACGGCG GTGGCACTGT GACTTTGGCG  GACATGGCGC GTGTTGGTGG CAGCGTTTTG ACGGGCCACG GCGACGGCAC TGTGGTGTTG TCAGGCCGTG CGACGATTGA CGGTGATCTG CGCACAGGGC ACGGCCAAGA CAACATTTCC  CTTGGCGCGC GAACCACGGT CAGCGGCGTG GTTGATGCCG GCAAAGGTGA CGATATTATC CGGCTTGAGG CGGGTGCAAA CGTCGAAAAC ATCAATGGTG GTCAGGGATT AGACACCTTA  ACGCGGTCGG AAAACACCCG CATCGAATAT GACGAAGACC CGACCAATGG CACAGTTTTC TATCTGGATG ATGCTGGCAA TGACACCGGC GAAAGTGTTG ATTTCCAATC CATCGAACGC  ATCACCTGCT TCACCCTCGG CACGTTGATT ATCACTGAAC GCGGCAAGCA AGCGATAGAG ACATTGCAGG TTGGTGATCG TGTCTGGACC CTCGACAACG GGCTGCGACC CATCGCATGG  ATTGGCCGCG CGACGGTTGC GGCCACGGGC GACCTTGCGC CGATCCTGAT CCGCAAAGGG GCAATGCACA ACGCGCGTGA TCTGCTGGTG TCACCGCAGC ACCGCATGAT GCTGGATGGA  TGGCGTGTGG AAATGCACTG TGGCACGGAT GAGGTTTTGG CCCCAGCAAA GGCGCTTACC AACGATCAAA CGATCCGCCG CGTTGAGGGT GGCACTGTGA CCTATGTGCA CATCGCGTTC  GACAACCATG AAATCGTTAT GGCCGAAGGC ATTGCATCCG AAAGCTTTTC CCCCGGCGCT GAGGCGCTCA ACGCGTTGGA CGACGCAGCA CGCAGCGAAA TCTTGACGCT TTTCCCGCAA 
TGGCGCTGCC CTGTACACCG CCCGACGACA GCGCGGCCAG TGGTTACCAC GCGTGAACTG AACGACTACC GGCTGACGCC GGTAGGTTCC CTTTTTGAAT GTAATTCAAG ATACTGA 

Retrieve as FASTA