BLASTX nr result
ID: Catharanthus23_contig00040431
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00040431 (298 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY19148.1| Pentatricopeptide repeat (PPR) superfamily protei... 161 1e-37 gb|EOY19146.1| Pentatricopeptide repeat (PPR) superfamily protei... 161 1e-37 ref|XP_004246883.1| PREDICTED: putative pentatricopeptide repeat... 155 7e-36 ref|XP_006341113.1| PREDICTED: putative pentatricopeptide repeat... 154 9e-36 gb|EMJ08254.1| hypothetical protein PRUPE_ppb002198mg [Prunus pe... 154 2e-35 ref|XP_004172298.1| PREDICTED: putative pentatricopeptide repeat... 154 2e-35 ref|XP_004137894.1| PREDICTED: putative pentatricopeptide repeat... 154 2e-35 ref|XP_004305093.1| PREDICTED: putative pentatricopeptide repeat... 153 3e-35 gb|EXB41949.1| hypothetical protein L484_002200 [Morus notabilis] 152 3e-35 emb|CBI18084.3| unnamed protein product [Vitis vinifera] 149 3e-34 ref|XP_002265522.1| PREDICTED: putative pentatricopeptide repeat... 149 3e-34 ref|XP_006429514.1| hypothetical protein CICLE_v10011209mg [Citr... 149 5e-34 ref|XP_003539649.2| PREDICTED: putative pentatricopeptide repeat... 147 1e-33 ref|XP_002309169.1| pentatricopeptide repeat-containing family p... 147 1e-33 ref|XP_006299039.1| hypothetical protein CARUB_v10015176mg [Caps... 147 2e-33 gb|EPS63455.1| hypothetical protein M569_11327 [Genlisea aurea] 146 2e-33 ref|XP_002323645.2| pentatricopeptide repeat-containing family p... 142 4e-32 gb|ESW05840.1| hypothetical protein PHAVU_011G213900g [Phaseolus... 142 5e-32 ref|NP_187494.1| pentatricopeptide repeat-containing protein [Ar... 141 1e-31 gb|EMS45170.1| hypothetical protein TRIUR3_26201 [Triticum urartu] 140 1e-31 >gb|EOY19148.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3 [Theobroma cacao] gi|508727252|gb|EOY19149.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3 [Theobroma cacao] Length = 503 Score = 161 bits (407), Expect = 1e-37 Identities = 75/98 (76%), Positives = 84/98 (85%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL KE +AAGYVP+TE+VLFDIEEEEKEH L HSEKLA+AFGLIST P DVIR+ KNLR Sbjct: 402 ELAKELKAAGYVPTTEYVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPTDVIRVVKNLR 461 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ S++TGREIIVRD NRFHHF+ GSCSC Sbjct: 462 VCGDCHEVIKLFSRVTGREIIVRDNNRFHHFIGGSCSC 499 >gb|EOY19146.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508727250|gb|EOY19147.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao] Length = 688 Score = 161 bits (407), Expect = 1e-37 Identities = 75/98 (76%), Positives = 84/98 (85%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL KE +AAGYVP+TE+VLFDIEEEEKEH L HSEKLA+AFGLIST P DVIR+ KNLR Sbjct: 587 ELAKELKAAGYVPTTEYVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPTDVIRVVKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ S++TGREIIVRD NRFHHF+ GSCSC Sbjct: 647 VCGDCHEVIKLFSRVTGREIIVRDNNRFHHFIGGSCSC 684 >ref|XP_004246883.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Solanum lycopersicum] Length = 688 Score = 155 bits (391), Expect = 7e-36 Identities = 72/98 (73%), Positives = 83/98 (84%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL+KE R GYVP TE+VLFDIEEEEKEH + HSEKLALA+GL+ST+P D IRI KNLR Sbjct: 587 ELSKELREVGYVPRTEYVLFDIEEEEKEHFVGCHSEKLALAYGLLSTKPGDGIRIIKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 +CGDCH FFK++S ITGREII+RD NRFH F+ GSCSC Sbjct: 647 ICGDCHTFFKLVSMITGREIILRDNNRFHCFLEGSCSC 684 >ref|XP_006341113.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Solanum tuberosum] Length = 687 Score = 154 bits (390), Expect = 9e-36 Identities = 73/98 (74%), Positives = 82/98 (83%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL+KE R GYVP TE+VLFDIEEEEKEH + HSEKLALAFGL+ST+ DVIRI KNLR Sbjct: 586 ELSKELREVGYVPKTEYVLFDIEEEEKEHFVGCHSEKLALAFGLLSTKHSDVIRIIKNLR 645 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 +CGDCH FFK++SKIT REII+RD NRFH F GSCSC Sbjct: 646 ICGDCHTFFKLVSKITEREIILRDNNRFHCFFKGSCSC 683 >gb|EMJ08254.1| hypothetical protein PRUPE_ppb002198mg [Prunus persica] Length = 636 Score = 154 bits (388), Expect = 2e-35 Identities = 73/98 (74%), Positives = 82/98 (83%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL KE +AAGYVP+T+FVLFDIEEEEKEH L HSEKLA+AFGLIST P D IR+ KNLR Sbjct: 535 ELAKELKAAGYVPTTDFVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPKDTIRVVKNLR 594 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKIT R+II+RD NRFH F+ GSCSC Sbjct: 595 VCGDCHEAIKLISKITERQIIIRDNNRFHCFIDGSCSC 632 >ref|XP_004172298.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Cucumis sativus] Length = 688 Score = 154 bits (388), Expect = 2e-35 Identities = 72/98 (73%), Positives = 84/98 (85%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL +E +A G+VP+TEFVLFDIEEEEKEH L YHSEKLA+AFGLI++ P+ VIR+ KNLR Sbjct: 587 ELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKIT REII+RDTNRFH F+ GSCSC Sbjct: 647 VCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSC 684 >ref|XP_004137894.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Cucumis sativus] Length = 688 Score = 154 bits (388), Expect = 2e-35 Identities = 72/98 (73%), Positives = 84/98 (85%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL +E +A G+VP+TEFVLFDIEEEEKEH L YHSEKLA+AFGLI++ P+ VIR+ KNLR Sbjct: 587 ELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKIT REII+RDTNRFH F+ GSCSC Sbjct: 647 VCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSC 684 >ref|XP_004305093.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Fragaria vesca subsp. vesca] Length = 688 Score = 153 bits (386), Expect = 3e-35 Identities = 70/99 (70%), Positives = 83/99 (83%) Frame = +1 Query: 1 SELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNL 180 +EL K+ +AAGY+P+T+FVLFDIEEEEKEH L HSEKLA+AF LI+T P D IR++KNL Sbjct: 586 NELAKDLKAAGYIPTTDFVLFDIEEEEKEHFLGCHSEKLAIAFALIATAPQDTIRVSKNL 645 Query: 181 RVCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 RVCGDCH K+ISKIT REIIVRD NRFH F+ G+CSC Sbjct: 646 RVCGDCHQAIKLISKITSREIIVRDNNRFHRFIDGTCSC 684 >gb|EXB41949.1| hypothetical protein L484_002200 [Morus notabilis] Length = 688 Score = 152 bits (385), Expect = 3e-35 Identities = 72/98 (73%), Positives = 82/98 (83%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL+KE +AAGYVP+T+F+LFDIEEEEKEH L HSEKLA+AF LIST D IR+ KNLR Sbjct: 587 ELDKELKAAGYVPTTDFMLFDIEEEEKEHFLRCHSEKLAVAFALISTASKDAIRVVKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH F K++SKITGREII+RD NRFH FV G CSC Sbjct: 647 VCGDCHEFIKLVSKITGREIIIRDNNRFHCFVDGFCSC 684 >emb|CBI18084.3| unnamed protein product [Vitis vinifera] Length = 496 Score = 149 bits (377), Expect = 3e-34 Identities = 72/98 (73%), Positives = 79/98 (80%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL K+ + AGYVP+T+FVLFDIEEEEKEH L HSEKLA+AFGLIS P VIR+ KNLR Sbjct: 395 ELTKKMKVAGYVPTTDFVLFDIEEEEKEHFLGCHSEKLAIAFGLISATPTAVIRVVKNLR 454 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCHM K+IS ITGREI VRD NRFH F GSCSC Sbjct: 455 VCGDCHMAIKLISSITGREITVRDNNRFHCFREGSCSC 492 >ref|XP_002265522.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Vitis vinifera] Length = 686 Score = 149 bits (377), Expect = 3e-34 Identities = 72/98 (73%), Positives = 79/98 (80%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL K+ + AGYVP+T+FVLFDIEEEEKEH L HSEKLA+AFGLIS P VIR+ KNLR Sbjct: 585 ELTKKMKVAGYVPTTDFVLFDIEEEEKEHFLGCHSEKLAIAFGLISATPTAVIRVVKNLR 644 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCHM K+IS ITGREI VRD NRFH F GSCSC Sbjct: 645 VCGDCHMAIKLISSITGREITVRDNNRFHCFREGSCSC 682 >ref|XP_006429514.1| hypothetical protein CICLE_v10011209mg [Citrus clementina] gi|568855070|ref|XP_006481133.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Citrus sinensis] gi|557531571|gb|ESR42754.1| hypothetical protein CICLE_v10011209mg [Citrus clementina] Length = 688 Score = 149 bits (375), Expect = 5e-34 Identities = 72/98 (73%), Positives = 83/98 (84%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL + +AAG+VP+T+ VLFDIEEEEK++ L+ HSEKLALAFGLI+T P DVIRI KNLR Sbjct: 587 ELATKLKAAGFVPTTDHVLFDIEEEEKQYFLACHSEKLALAFGLITTAPKDVIRIAKNLR 646 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKITGREIIVRD NRFH F+ GSCSC Sbjct: 647 VCGDCHEAIKLISKITGREIIVRDNNRFHCFIEGSCSC 684 >ref|XP_003539649.2| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like isoform X1 [Glycine max] gi|571494895|ref|XP_006592973.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like isoform X2 [Glycine max] Length = 681 Score = 147 bits (372), Expect = 1e-33 Identities = 70/97 (72%), Positives = 79/97 (81%) Frame = +1 Query: 7 LNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLRV 186 L K+ R AGY P+TEFVLFD+EEEEKE+ L HSEKLA+AF LIST DVIR+ KNLRV Sbjct: 581 LFKDLREAGYNPTTEFVLFDVEEEEKEYFLGCHSEKLAVAFALISTGAKDVIRVVKNLRV 640 Query: 187 CGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 CGDCH K++SK+TGREIIVRD NRFHHF GSCSC Sbjct: 641 CGDCHEAIKLVSKVTGREIIVRDNNRFHHFTEGSCSC 677 >ref|XP_002309169.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222855145|gb|EEE92692.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 619 Score = 147 bits (371), Expect = 1e-33 Identities = 70/98 (71%), Positives = 81/98 (82%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL K+ +A+GYVP+T++VLFDIEEEEKEH + HSEKLA+AFGLIST P+D IR+ KNLR Sbjct: 518 ELVKDLKASGYVPTTDYVLFDIEEEEKEHFIGCHSEKLAIAFGLISTAPNDKIRVVKNLR 577 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K IS+ TGREIIVRD NRFH F GSCSC Sbjct: 578 VCGDCHEAIKHISRFTGREIIVRDNNRFHCFNDGSCSC 615 >ref|XP_006299039.1| hypothetical protein CARUB_v10015176mg [Capsella rubella] gi|482567748|gb|EOA31937.1| hypothetical protein CARUB_v10015176mg [Capsella rubella] Length = 685 Score = 147 bits (370), Expect = 2e-33 Identities = 69/98 (70%), Positives = 78/98 (79%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 +L E R G+VP+TEFV+FD+EEEEKE L YHSEKLA+AFGLIST D VIR+ KNLR Sbjct: 584 DLGNEMRLMGFVPTTEFVMFDVEEEEKERVLGYHSEKLAVAFGLISTGHDQVIRVVKNLR 643 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKIT REI+VRD NRFH F GSCSC Sbjct: 644 VCGDCHEVMKLISKITRREIVVRDNNRFHCFTNGSCSC 681 >gb|EPS63455.1| hypothetical protein M569_11327 [Genlisea aurea] Length = 685 Score = 146 bits (369), Expect = 2e-33 Identities = 70/99 (70%), Positives = 79/99 (79%), Gaps = 1/99 (1%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDV-IRITKNL 180 E+ KE R AGYVP+TE V FD+EEEEKE L YHSEKLALAFGL++T IRI KNL Sbjct: 583 EIEKEMREAGYVPTTELVHFDVEEEEKEQSLGYHSEKLALAFGLLTTTTTTATIRIAKNL 642 Query: 181 RVCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 RVCGDCH FK++S++TGREIIVRDTNRFHHF G CSC Sbjct: 643 RVCGDCHAAFKLVSRLTGREIIVRDTNRFHHFSEGKCSC 681 >ref|XP_002323645.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550321449|gb|EEF05406.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 682 Score = 142 bits (359), Expect = 4e-32 Identities = 69/98 (70%), Positives = 78/98 (79%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL K+ +AAGYVP+T+ VLFDIEEEEKEH + HSEKLA+AFGLIST P+D I + KNLR Sbjct: 581 ELAKDLKAAGYVPTTDHVLFDIEEEEKEHFIGCHSEKLAVAFGLISTAPNDKILVVKNLR 640 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K IS+I GREIIVRD NRFH F G CSC Sbjct: 641 VCGDCHEAIKHISRIAGREIIVRDNNRFHCFTDGLCSC 678 >gb|ESW05840.1| hypothetical protein PHAVU_011G213900g [Phaseolus vulgaris] Length = 690 Score = 142 bits (358), Expect = 5e-32 Identities = 68/97 (70%), Positives = 77/97 (79%) Frame = +1 Query: 7 LNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLRV 186 L K+ R AGY P+TEFVLFD+EEEEKE+ L HSEKLA+AF LIST DVIR+ KNLRV Sbjct: 590 LFKDLREAGYSPTTEFVLFDVEEEEKEYFLGCHSEKLAVAFALISTSAKDVIRVVKNLRV 649 Query: 187 CGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 CGDCH K+ISK+T REII+RD NRFHH GSCSC Sbjct: 650 CGDCHEAIKLISKVTHREIIIRDNNRFHHLSEGSCSC 686 >ref|NP_187494.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75207322|sp|Q9SR82.1|PP219_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At3g08820 gi|6403507|gb|AAF07847.1|AC010871_23 unknown protein [Arabidopsis thaliana] gi|12322725|gb|AAG51349.1|AC012562_10 unknown protein; 90102-88045 [Arabidopsis thaliana] gi|332641162|gb|AEE74683.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 685 Score = 141 bits (355), Expect = 1e-31 Identities = 67/98 (68%), Positives = 75/98 (76%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 +L E R G+VP+TEFV FD+EEEEKE L YHSEKLA+A GLIST VIR+ KNLR Sbjct: 584 DLGNEMRLMGFVPTTEFVFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLR 643 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 VCGDCH K+ISKIT REI+VRD NRFH F GSCSC Sbjct: 644 VCGDCHEVMKLISKITRREIVVRDNNRFHCFTNGSCSC 681 >gb|EMS45170.1| hypothetical protein TRIUR3_26201 [Triticum urartu] Length = 284 Score = 140 bits (354), Expect = 1e-31 Identities = 64/98 (65%), Positives = 77/98 (78%) Frame = +1 Query: 4 ELNKESRAAGYVPSTEFVLFDIEEEEKEHCLSYHSEKLALAFGLISTRPDDVIRITKNLR 183 EL++E RAAGYVP T FVL DI+E KE L YHSE+LA+AFGL+ST P +R+ KNLR Sbjct: 183 ELHEEIRAAGYVPDTRFVLHDIDEAAKERALMYHSERLAIAFGLVSTPPGTPLRVMKNLR 242 Query: 184 VCGDCHMFFKVISKITGREIIVRDTNRFHHFVTGSCSC 297 +CGDCH K+I+K+TGREIIVRD RFHHF G+CSC Sbjct: 243 ICGDCHTAVKLIAKVTGREIIVRDNKRFHHFKDGACSC 280