BLASTX nr result
ID: Catharanthus23_contig00029836
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00029836
(570 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006348971.1| PREDICTED: putative pentatricopeptide repeat...  147   2e-33
ref|XP_002269319.2| PREDICTED: pentatricopeptide repeat-containi...  146   3e-33
ref|XP_004243818.1| PREDICTED: putative pentatricopeptide repeat...  145   9e-33
gb|EMJ00840.1| hypothetical protein PRUPE_ppa021613mg [Prunus pe...  135   6e-30
ref|XP_004309732.1| PREDICTED: pentatricopeptide repeat-containi...  135   7e-30
gb|EOX98404.1| Pentatricopeptide repeat-containing protein [Theo...  134   1e-29
ref|XP_004514248.1| PREDICTED: pentatricopeptide repeat-containi...  130   2e-28
ref|XP_003535694.1| PREDICTED: pentatricopeptide repeat-containi...  130   3e-28
gb|ESW15077.1| hypothetical protein PHAVU_007G042100g [Phaseolus...  122   5e-26
gb|EPS71925.1| hypothetical protein M569_02834, partial [Genlise...  102   9e-20
ref|XP_003539649.2| PREDICTED: putative pentatricopeptide repeat...  101   1e-19
gb|EMJ08254.1| hypothetical protein PRUPE_ppb002198mg [Prunus pe...  101   2e-19
ref|XP_006838870.1| hypothetical protein AMTR_s00002p00266930 [A...   99   6e-19
gb|EXC23679.1| hypothetical protein L484_015589 [Morus notabilis]     99   1e-18
gb|EOY27823.1| Pentatricopeptide repeat (PPR) superfamily protei...   99   1e-18
gb|EOY19148.1| Pentatricopeptide repeat (PPR) superfamily protei...   99   1e-18
gb|EOY19146.1| Pentatricopeptide repeat (PPR) superfamily protei...   99   1e-18
ref|XP_006429514.1| hypothetical protein CICLE_v10011209mg [Citr...   97   2e-18
ref|XP_004305093.1| PREDICTED: putative pentatricopeptide repeat...   97   3e-18
ref|XP_002309169.1| pentatricopeptide repeat-containing family p...   97   4e-18

>ref|XP_006348971.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Solanum tuberosum]
Length = 627

Score = 147 bits (370), Expect = 2e-33
Identities = 67/103 (65%), Positives = 87/103 (84%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC+LHNRLEL+R I+ +LV+V+P+NSAGYV+ SN A+DHHWG +S LR SM++ GV K
Sbjct: 523 GGCVLHNRLELARIISSILVEVDPNNSAGYVMLSNTYAIDHHWGAISRLRLSMKEKGVVK 582

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLA 260
Q GCSWI+++GVVHEF+AGS+SHPQ+E + L+ LLKEMKLA
Sbjct: 583 QPGCSWINIDGVVHEFLAGSSSHPQNERLHSELQVLLKEMKLA 625

>ref|XP_002269319.2| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Vitis vinifera]
Length = 532

Score = 146 bits (369), Expect = 3e-33
Identities = 68/104 (65%), Positives = 84/104 (80%)
Frame = -3

Query: 565 GCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSKQ 386
GC LH+RLEL++ +++ LVKV+P NSAGYV++SNALA D WG+VSGLRW MR+ GV K
Sbjct: 334 GCRLHSRLELAQDVSQKLVKVDPENSAGYVMFSNALASDQQWGEVSGLRWLMREKGVRKH 393

Query: 385 SGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGS 254
GCSWISVN VVHEF+AGS SHPQ ++I H L GL+KEMK+ S
Sbjct: 394 PGCSWISVNRVVHEFLAGSLSHPQIDSIYHTLNGLVKEMKVFAS 437

>ref|XP_004243818.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Solanum lycopersicum]
Length = 627

Score = 145 bits (365), Expect = 9e-33
Identities = 67/103 (65%), Positives = 86/103 (83%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC+LHNRLEL+R I+ +LV+V+P+NSAGYV+ SN A+DHHWG +S LR SM++ GV K
Sbjct: 523 GGCMLHNRLELARIISSILVEVDPNNSAGYVMLSNTYAIDHHWGAISRLRLSMKEKGVVK 582

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLA 260
Q GCSWI+V+GVVHEF+AGS+SHPQ+E + L+ L KEMKLA
Sbjct: 583 QPGCSWINVDGVVHEFLAGSSSHPQNERLHTELQVLQKEMKLA 625

>gb|EMJ00840.1| hypothetical protein PRUPE_ppa021613mg [Prunus persica]
Length = 643
Score = 135 bits (341), Expect = 6e-30
Identities = 62/106 (58%), Positives = 83/106 (78%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGCLLH+R++L++ ++ LV+ +P NS GY++ +NA A D WGDVS LRW MR+ GV+K
Sbjct: 539 GGCLLHSRVDLAQYVSNKLVRSDPDNSGGYIMLANAFASDRRWGDVSALRWVMREKGVNK 598

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
Q GCSWIS++GVVHEF+ G SHPQ E+I + L GL+KEMK+ GSV
Sbjct: 599 QPGCSWISIDGVVHEFLVGCPSHPQIESIYNTLVGLVKEMKI-GSV 643

>ref|XP_004309732.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Fragaria vesca subsp. vesca]
Length = 435

Score = 135 bits (340), Expect = 7e-30
Identities = 59/103 (57%), Positives = 80/103 (77%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGCLLH+RL+L+ ++ LV+ +P N+ GYV+ +NA A DH WGDVS LR MR+ GV+K
Sbjct: 329 GGCLLHSRLDLAEYVSDKLVQSDPDNTGGYVMLANAFASDHRWGDVSSLRRFMREKGVTK 388

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLA 260
+ GCSWIS+NGVVHEF+ G +SHPQ ++IC L G++K MK+A
Sbjct: 389 KPGCSWISINGVVHEFLVGCSSHPQSDSICSLLNGMVKHMKIA 431

>gb|EOX98404.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
Length = 647

Score = 134 bits (338), Expect = 1e-29
Identities = 61/102 (59%), Positives = 79/102 (77%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC+LH+R +L++ + K LV+V+P NS GYV+ +N LAVDH W DVS LRW MR+ GV K
Sbjct: 543 GGCVLHSRADLAQKVYKKLVEVDPQNSGGYVMLANTLAVDHRWNDVSVLRWLMREKGVKK 602

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKL 263
Q G SWIS++GVVHEF+AGS SHP+ E+I H L GL+ MK+
Sbjct: 603 QPGHSWISIDGVVHEFLAGSPSHPKMESIYHTLNGLVNVMKV 644

>ref|XP_004514248.1| PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-like [Cicer arietinum]
Length = 1334

Score = 130 bits (327), Expect = 2e-28
Identities = 59/101 (58%), Positives = 79/101 (78%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGCLLH+R+EL++ ++K LV+++P+NSAGYV+ +NALA D W DVS LR MR+ G+ K
Sbjct: 1223 GGCLLHSRVELAQEVSKRLVQIDPNNSAGYVMLANALASDSQWSDVSALRLEMREKGIKK 1282

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMK 266
Q G SWISV+GVVHEF+ G SHPQ ++I + L GL+K MK
Sbjct: 1283 QPGSSWISVDGVVHEFLVGCLSHPQMDSIYYTLTGLVKHMK 1323

>ref|XP_003535694.1| PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Glycine max]
Length = 634

Score = 130 bits (326), Expect = 3e-28
Identities = 60/103 (58%), Positives = 79/103 (76%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGCLLH+R+EL++ +++ LV+V+P NSAGYV+ +NALA D+ W DVSGLR M++ GV K
Sbjct: 523 GGCLLHSRVELAQEVSRRLVEVDPDNSAGYVMLANALASDNQWSDVSGLRLEMKEKGVKK 582

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLA 260
Q G SWI V+G VHEF+ G SHP+ E I H L GL+K MK+A
Sbjct: 583 QPGSSWIIVDGAVHEFLVGCLSHPEIEGIYHTLAGLVKNMKVA 625

>gb|ESW15077.1| hypothetical protein PHAVU_007G042100g [Phaseolus vulgaris]
Length = 632

Score = 122 bits (307), Expect = 5e-26
Identities = 57/103 (55%), Positives = 79/103 (76%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGCLLH+R+EL++ +++ LV+V+P NSAGYV+ +NALA ++ W DVS LR M++ G+ K
Sbjct: 521 GGCLLHSRVELAQEVSRRLVEVDPDNSAGYVMLANALASENQWSDVSELRLEMKEKGIKK 580

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLA 260
Q G SWI V+GVVHEF+ G SHP+ E+I L GL+K MK+A
Sbjct: 581 QPGSSWIVVDGVVHEFLVGCLSHPKIESIHITLAGLVKHMKVA 623

>gb|EPS71925.1| hypothetical protein M569_02834, partial [Genlisea aurea]
Length = 583

Score = 102 bits (253), Expect = 9e-20
Identities = 46/92 (50%), Positives = 67/92 (72%)
Frame = -3

Query: 565 GCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSKQ 386
GC++H RLE++ ++A MLV V+P NS GYVL SN A D W V+ LR +MR++GV K+
Sbjct: 492 GCVVHGRLEIAESVASMLVDVDPDNSGGYVLLSNTFASDRKWRRVAELRRAMRETGVRKE 551

Query: 385 SGCSWISVNGVVHEFVAGSASHPQHETICHAL 290
+G SWIS++G VHEF+AGSA++ + + +L
Sbjct: 552 AGRSWISIDGAVHEFIAGSATNADADKVIESL 583

>ref|XP_003539649.2| PREDICTED: putative
pentatricopeptide repeat-containing protein At3g08820-like isoform X1 [Glycine max]
gi|571494895|ref|XP_006592973.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like isoform X2 [Glycine max]
Length = 681

Score = 101 bits (252), Expect = 1e-19
Identities = 45/104 (43%), Positives = 63/104 (60%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH +L+ + K L+++EP NS YVL SN + H W + +R S+ + G+ K
Sbjct: 486 GGCRLHKDTQLAEHVLKQLIELEPWNSGHYVLLSNIYSASHRWDEAEKIRSSLNQKGMQK 545

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAG 257
GCSW+ V+GVVHEF+ G SHP I LE L K+++ AG
Sbjct: 546 LPGCSWVEVDGVVHEFLVGDTSHPLSHKIYEKLESLFKDLREAG 589

>gb|EMJ08254.1| hypothetical protein PRUPE_ppb002198mg [Prunus persica]
Length = 636

Score = 101 bits (251), Expect = 2e-19
Identities = 49/106 (46%), Positives = 64/106 (60%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH + +L+ + K L+++EP NSA YVL SN + H W + + R M + G+ K
Sbjct: 441 GGCRLHRQTQLAELVLKQLIELEPWNSAHYVLLSNIYSASHKWDEAADTRSRMNEQGMKK 500

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
GCSWI VNGVV EF+ G SH E I L+ L KE+K AG V
Sbjct: 501 IPGCSWIEVNGVVQEFLVGDKSHALSEKIYAKLDELAKELKAAGYV 546

>ref|XP_006838870.1| hypothetical protein AMTR_s00002p00266930 [Amborella trichopoda]
gi|548841376|gb|ERN01439.1| hypothetical protein AMTR_s00002p00266930 [Amborella trichopoda]
Length = 646

Score = 99.4 bits (246), Expect = 6e-19
Identities = 50/102 (49%), Positives = 59/102 (57%)
Frame = -3

Query: 562 CLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSKQS 383
C +H +EL+ AK L K+EP NS YVL SN A + W +VS LR R+ GV K
Sbjct: 538 CRVHCNVELAEVAAKHLFKIEPDNSGNYVLLSNVYAAKNQWENVSKLRAMRRERGVRKNR 597

Query: 382 GCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAG 257
GCSWI VN VHEF+ HP +I LEGLL EM L G
Sbjct: 598 GCSWIEVNSCVHEFIMEDRRHPDSNSIYEVLEGLLGEMMLIG 639

>gb|EXC23679.1| hypothetical protein L484_015589 [Morus notabilis]
Length = 652

Score = 98.6 bits (244), Expect = 1e-18
Identities = 47/106 (44%), Positives = 66/106 (62%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
G C +H ++L + +A+ L++++P NS YVL SN A WGDV +R MR+ GV K
Sbjct: 513 GACKIHRDIDLGKYVAEKLLEIDPTNSGPYVLLSNMYAELGRWGDVVKVRKLMRQRGVIK 572

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
Q GCSWI + G VH F+ HP+ + IC ++ LLK+MK AG V
Sbjct: 573 QPGCSWIELKGRVHVFLVKDKRHPKRKEICSVVKSLLKQMKRAGYV 618

>gb|EOY27823.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
Length = 717

Score = 98.6 bits (244), Expect = 1e-18
Identities = 49/106 (46%), Positives = 66/106 (62%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
G C +H+ LE++ AK ++ +EPH SA YVL SN A W DVS +R M++ G+ K
Sbjct: 522 GACRMHSNLEVAERAAKSILDLEPHCSAAYVLLSNLYASAGRWSDVSRMRVRMKQRGIVK 581

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
Q GCSW++V GV HEFV+G SHP E I L+ L ++K G V
Sbjct: 582 QPGCSWVTVRGVRHEFVSGDKSHPFSEEIYQKLDWLGGKLKEFGYV 627

>gb|EOY19148.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3 [Theobroma cacao]
gi|508727252|gb|EOY19149.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3 [Theobroma cacao]
Length = 503

Score = 98.6 bits (244), Expect = 1e-18
Identities = 49/106 (46%), Positives = 61/106 (57%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH +L + K L+++EP NS YVL SN + H W D + +R M + G+ K
Sbjct: 308 GGCRLHKDTQLVEHVLKKLIELEPWNSGNYVLLSNIYSASHKWDDAAKIRSIMNERGIQK 367

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
G SWI VNG VHEF+ G SHP E I L L KE+K AG V
Sbjct: 368 VPGYSWIEVNGFVHEFLVGDKSHPLSEMIYTKLGELAKELKAAGYV 413

>gb|EOY19146.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao]
gi|508727250|gb|EOY19147.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 [Theobroma cacao]
Length = 688

Score = 98.6 bits (244), Expect = 1e-18
Identities = 49/106 (46%), Positives = 61/106 (57%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH +L + K L+++EP NS YVL SN + H W D + +R M + G+ K
Sbjct: 493 GGCRLHKDTQLVEHVLKKLIELEPWNSGNYVLLSNIYSASHKWDDAAKIRSIMNERGIQK 552

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
G SWI VNG VHEF+ G SHP E I L L KE+K AG V
Sbjct: 553 VPGYSWIEVNGFVHEFLVGDKSHPLSEMIYTKLGELAKELKAAGYV 598

>ref|XP_006429514.1| hypothetical protein CICLE_v10011209mg [Citrus clementina]
gi|568855070|ref|XP_006481133.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Citrus sinensis]
gi|557531571|gb|ESR42754.1| hypothetical protein CICLE_v10011209mg [Citrus clementina]
Length = 688

Score = 97.4 bits (241), Expect = 2e-18
Identities = 45/105 (42%), Positives = 61/105 (58%)
Frame = -3

Query: 565 GCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSKQ 386
GC LH + +L+ + L+ +EP NS YVL SN + H W D + +R M G+ K
Sbjct: 494 GCRLHKKTDLAEHVLNQLIALEPWNSGNYVLLSNIYSASHKWNDAAKIRSMMGDKGIQKI 553

Query: 385 SGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
GCSW+ V+GVVHEF+ G SHP E I L+ L ++K AG V
Sbjct: 554 RGCSWVEVDGVVHEFLVGDNSHPLSEKIYSKLDELATKLKAAGFV 598

>ref|XP_004305093.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820-like [Fragaria vesca subsp. vesca]
Length = 688

Score = 97.1 bits (240), Expect = 3e-18
Identities = 46/106 (43%), Positives = 62/106 (58%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH +L+ K L+++EP NSA YVL SN + W + + +R M + G+ K
Sbjct: 493 GGCRLHRDTQLAELALKQLIELEPWNSAHYVLLSNVYSASQKWDEAANIRSRMNEQGLQK 552

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
GCSWI VNGVVHEF+ G SH + I L L K++K AG +
Sbjct: 553 IPGCSWIEVNGVVHEFLVGDKSHELSDKIYAKLNELAKDLKAAGYI 598

>ref|XP_002309169.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa]
gi|222855145|gb|EEE92692.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa]
Length = 619

Score = 96.7 bits (239), Expect = 4e-18
Identities = 47/106 (44%), Positives = 62/106 (58%)
Frame = -3

Query: 568 GGCLLHNRLELSRTIAKMLVKVEPHNSAGYVLWSNALAVDHHWGDVSGLRWSMRKSGVSK 389
GGC LH +L + K L+ +EP NS YVL SN + H W D + +R M + G+ K
Sbjct: 424 GGCRLHRDTQLVEGVLKQLIALEPSNSGNYVLLSNIYSASHKWEDAAKIRSIMSERGIKK 483

Query: 388 QSGCSWISVNGVVHEFVAGSASHPQHETICHALEGLLKEMKLAGSV 251
G SWI V+GVVHEF+ G SHP E I L L+K++K +G V
Sbjct: 484 VPGYSWIEVDGVVHEFLVGDTSHPLSEKIYAKLGELVKDLKASGYV 529