BLASTX nr result
ID: Catharanthus23_contig00032900
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00032900 (340 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EXB77045.1| hypothetical protein L484_014171 [Morus notabilis] 134 2e-29 gb|EOY28770.1| Tetratricopeptide repeat-like superfamily protein... 133 2e-29 ref|XP_004233783.1| PREDICTED: pentatricopeptide repeat-containi... 129 5e-28 ref|XP_006348147.1| PREDICTED: pentatricopeptide repeat-containi... 128 9e-28 ref|XP_004486315.1| PREDICTED: pentatricopeptide repeat-containi... 125 4e-27 ref|XP_003632946.1| PREDICTED: pentatricopeptide repeat-containi... 125 6e-27 emb|CBI23204.3| unnamed protein product [Vitis vinifera] 125 6e-27 gb|ESW19690.1| hypothetical protein PHAVU_006G147000g [Phaseolus... 117 1e-24 ref|XP_004295288.1| PREDICTED: pentatricopeptide repeat-containi... 115 8e-24 ref|XP_003547263.1| PREDICTED: pentatricopeptide repeat-containi... 115 8e-24 ref|XP_002874797.1| pentatricopeptide repeat-containing protein ... 113 2e-23 ref|XP_006289487.1| hypothetical protein CARUB_v10003020mg [Caps... 112 4e-23 ref|XP_004168223.1| PREDICTED: pentatricopeptide repeat-containi... 111 9e-23 ref|XP_004139152.1| PREDICTED: pentatricopeptide repeat-containi... 111 9e-23 ref|XP_003534717.1| PREDICTED: pentatricopeptide repeat-containi... 110 3e-22 ref|NP_192346.1| pentatricopeptide repeat-containing protein [Ar... 108 6e-22 ref|XP_006396695.1| hypothetical protein EUTSA_v10028467mg [Eutr... 105 6e-21 gb|EPS71960.1| hypothetical protein M569_02796 [Genlisea aurea] 103 3e-20 gb|EMJ14151.1| hypothetical protein PRUPE_ppa022121mg [Prunus pe... 87 3e-15 ref|XP_004308640.1| PREDICTED: pentatricopeptide repeat-containi... 78 1e-12 >gb|EXB77045.1| hypothetical protein L484_014171 [Morus notabilis] Length = 746 Score = 134 bits (336), Expect = 2e-29 Identities = 65/107 (60%), Positives = 81/107 (75%) Frame = -3 Query: 323 LPATSPPSNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKAC 144 L A P A +T+SFN+IINRLSSQ +HHEV +T++SML SNT PD+ T+PSL KAC Sbjct: 8 LNALKPSQTAKTTSTRSFNAIINRLSSQASHHEVLITFSSMLQSNTPPDTHTFPSLFKAC 67 Query: 143 TSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 SL LF LGI HQ VV+ GFSSDPY+ASSL++FY++FG A+KV Sbjct: 68 ASLHLFPLGISLHQCVVVNGFSSDPYVASSLVSFYAKFGCVGNARKV 114 >gb|EOY28770.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao] Length = 1250 Score = 133 bits (335), Expect = 2e-29 Identities = 64/95 (67%), Positives = 82/95 (86%) Frame = -3 Query: 287 AATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILF 108 ++TKSFNSIIN LSSQG++HEV +TYT+ML S+T PDS+T+PSL+KACTSL LFS+G+ Sbjct: 10 SSTKSFNSIINNLSSQGSYHEVLVTYTTMLNSST-PDSYTFPSLLKACTSLNLFSIGLSV 68 Query: 107 HQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 HQ+V++ GFSSD Y ASSLI FYS+FG T +A+KV Sbjct: 69 HQQVILRGFSSDSYTASSLINFYSKFGHTKHARKV 103 >ref|XP_004233783.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Solanum lycopersicum] Length = 753 Score = 129 bits (323), Expect = 5e-28 Identities = 65/109 (59%), Positives = 78/109 (71%), Gaps = 3/109 (2%) Frame = -3 Query: 320 PATSPP---SNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIK 150 P PP S AA A TKSFN+ ++RLSS+GAHH LTY SML S+ PD FT+P+L+K Sbjct: 12 PLKQPPFLHSAAAAATTKSFNATLHRLSSEGAHHHALLTYDSMLKSSVRPDPFTFPTLLK 71 Query: 149 ACTSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 AC SL L G+L HQ VV+ GFSSDPYI SSLI+FYS FG T +A K+ Sbjct: 72 ACISLNLLPHGLLLHQHVVVNGFSSDPYIGSSLISFYSSFGLTEHAHKM 120 >ref|XP_006348147.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like isoform X1 [Solanum tuberosum] gi|565362832|ref|XP_006348148.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like isoform X2 [Solanum tuberosum] gi|565362834|ref|XP_006348149.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like isoform X3 [Solanum tuberosum] Length = 753 Score = 128 bits (321), Expect = 9e-28 Identities = 66/109 (60%), Positives = 78/109 (71%), Gaps = 3/109 (2%) Frame = -3 Query: 320 PATSPP---SNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIK 150 P PP S AA AATKSFN+ ++RLSS+G HH LTY SML S+ PD FT+P+L+K Sbjct: 12 PLKQPPFRHSAAAAAATKSFNATLHRLSSEGFHHHALLTYNSMLKSSVPPDPFTFPTLLK 71 Query: 149 ACTSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 AC SL L G+L HQ VV+ GFSSD YI SSLI+FYS FG T +AQKV Sbjct: 72 ACISLNLLRHGLLLHQHVVVNGFSSDSYIGSSLISFYSSFGLTEHAQKV 120 >ref|XP_004486315.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Cicer arietinum] Length = 740 Score = 125 bits (315), Expect = 4e-27 Identities = 60/107 (56%), Positives = 80/107 (74%) Frame = -3 Query: 323 LPATSPPSNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKAC 144 LP P S A A T SFN+IINR S+QGAH +V +TY+SML S+ D++T+PSL+KAC Sbjct: 4 LPTPHPSSPPASATTNSFNAIINRHSTQGAHRQVLITYSSMLNSHIPSDAYTFPSLLKAC 63 Query: 143 TSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 +SL LF LG+ HQR+++ G S+D YIASSLI FY +FG + +A+KV Sbjct: 64 SSLNLFHLGLTLHQRILVNGLSTDSYIASSLINFYVKFGYSYFARKV 110 >ref|XP_003632946.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Vitis vinifera] Length = 732 Score = 125 bits (314), Expect = 6e-27 Identities = 60/94 (63%), Positives = 79/94 (84%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 ATKS+N+IINRLS+ GA +V LTY+SML+++T PD+ T+PSL+KACTSL LFS G+ FH Sbjct: 12 ATKSYNAIINRLSTAGAFCDVLLTYSSMLSTDTPPDAHTFPSLVKACTSLDLFSHGLSFH 71 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 QRV+++G+SSD YIA+SLI FYS+FG A+KV Sbjct: 72 QRVIVDGYSSDSYIATSLINFYSKFGHNQSARKV 105 >emb|CBI23204.3| unnamed protein product [Vitis vinifera] Length = 907 Score = 125 bits (314), Expect = 6e-27 Identities = 60/94 (63%), Positives = 79/94 (84%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 ATKS+N+IINRLS+ GA +V LTY+SML+++T PD+ T+PSL+KACTSL LFS G+ FH Sbjct: 12 ATKSYNAIINRLSTAGAFCDVLLTYSSMLSTDTPPDAHTFPSLVKACTSLDLFSHGLSFH 71 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 QRV+++G+SSD YIA+SLI FYS+FG A+KV Sbjct: 72 QRVIVDGYSSDSYIATSLINFYSKFGHNQSARKV 105 >gb|ESW19690.1| hypothetical protein PHAVU_006G147000g [Phaseolus vulgaris] Length = 764 Score = 117 bits (294), Expect = 1e-24 Identities = 56/98 (57%), Positives = 75/98 (76%) Frame = -3 Query: 296 AAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLG 117 +A A T S N+IIN SSQGAH +V +TY SML ++ D++T+PSL++AC+SL LFSLG Sbjct: 29 SASATTNSLNAIINHYSSQGAHRQVLVTYASMLKTHVPSDAYTFPSLLRACSSLNLFSLG 88 Query: 116 ILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 + HQRV+++G S DPYIASSLI FY++FG A+KV Sbjct: 89 LSLHQRVLVKGLSIDPYIASSLINFYAKFGCADVARKV 126 >ref|XP_004295288.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Fragaria vesca subsp. vesca] Length = 748 Score = 115 bits (287), Expect = 8e-24 Identities = 57/108 (52%), Positives = 75/108 (69%) Frame = -3 Query: 326 PLPATSPPSNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKA 147 PL + + A T++FN+IINRLSSQ +HHEV ++SML ++ PD+ T+PSL+KA Sbjct: 11 PLKPQTQLTPTATTTTRAFNAIINRLSSQRSHHEVLAAFSSMLKAHIPPDTHTFPSLLKA 70 Query: 146 CTSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 CTSL LF G+ HQRVV+ GF SD YIASSLI Y++ G A+KV Sbjct: 71 CTSLNLFGHGVSLHQRVVVNGFFSDAYIASSLINLYAKLGHVQNARKV 118 >ref|XP_003547263.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Glycine max] Length = 764 Score = 115 bits (287), Expect = 8e-24 Identities = 56/102 (54%), Positives = 74/102 (72%) Frame = -3 Query: 308 PPSNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKL 129 P ++A A SFN+IIN SSQGAH +V TY SML ++ D++T+PSL+KAC+SL L Sbjct: 25 PHPSSASATINSFNAIINHHSSQGAHRQVLATYASMLKTHVPSDAYTFPSLLKACSSLNL 84 Query: 128 FSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 FSLG+ HQR+++ G S D YIASSLI FY++FG A+KV Sbjct: 85 FSLGLSLHQRILVSGLSLDAYIASSLINFYAKFGFADVARKV 126 >ref|XP_002874797.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297320634|gb|EFH51056.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 748 Score = 113 bits (283), Expect = 2e-23 Identities = 53/94 (56%), Positives = 72/94 (76%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 +TK FNS IN LSS G H +V T++SML + LPD+FT+PSL+KACTSL+L S G+ H Sbjct: 10 STKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACTSLQLLSFGLSIH 69 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 Q+V++ GFSSD YI+SSL+ Y++FG +A+KV Sbjct: 70 QKVLVNGFSSDSYISSSLVNLYAKFGLLGHARKV 103 >ref|XP_006289487.1| hypothetical protein CARUB_v10003020mg [Capsella rubella] gi|482558193|gb|EOA22385.1| hypothetical protein CARUB_v10003020mg [Capsella rubella] Length = 748 Score = 112 bits (281), Expect = 4e-23 Identities = 52/94 (55%), Positives = 72/94 (76%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 +TK FNS+IN SS G + +V T++SML LPD+FT+PSL+KAC SL+LFS G+ H Sbjct: 10 STKYFNSLINHFSSHGEYKQVLATFSSMLAKRLLPDTFTFPSLLKACASLQLFSFGLSIH 69 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 Q+V++ GFSSD YI+SSL+ FY++FG +A+KV Sbjct: 70 QKVLVYGFSSDFYISSSLVNFYAKFGVLGHARKV 103 >ref|XP_004168223.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Cucumis sativus] Length = 743 Score = 111 bits (278), Expect = 9e-23 Identities = 54/97 (55%), Positives = 70/97 (72%) Frame = -3 Query: 293 AWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGI 114 A TKSFNS+++RLS QGAHH+V TY SM ++T D++T+PSL KACT+L LFS G+ Sbjct: 10 AHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGL 69 Query: 113 LFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 HQ VV+ G S D YI SSLI+FY++FG +KV Sbjct: 70 SLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKV 106 >ref|XP_004139152.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Cucumis sativus] Length = 743 Score = 111 bits (278), Expect = 9e-23 Identities = 54/97 (55%), Positives = 70/97 (72%) Frame = -3 Query: 293 AWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGI 114 A TKSFNS+++RLS QGAHH+V TY SM ++T D++T+PSL KACT+L LFS G+ Sbjct: 10 AHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGL 69 Query: 113 LFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 HQ VV+ G S D YI SSLI+FY++FG +KV Sbjct: 70 SLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKV 106 >ref|XP_003534717.1| PREDICTED: pentatricopeptide repeat-containing protein At4g04370-like [Glycine max] Length = 755 Score = 110 bits (274), Expect = 3e-22 Identities = 52/95 (54%), Positives = 69/95 (72%) Frame = -3 Query: 287 AATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILF 108 A T S N+ IN S+QGAHH+V TY SML ++ D++T+PSL+KAC+ L LFSLG+ Sbjct: 24 ATTNSVNATINHHSTQGAHHQVLATYASMLKTHVPSDAYTFPSLLKACSFLNLFSLGLTL 83 Query: 107 HQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 HQR+++ G S D YIASSLI FY++FG A+KV Sbjct: 84 HQRILVSGLSLDAYIASSLINFYAKFGFADVARKV 118 >ref|NP_192346.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75214457|sp|Q9XE98.1|PP303_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g04370 gi|4982476|gb|AAD36944.1|AF069441_4 hypothetical protein [Arabidopsis thaliana] gi|7267194|emb|CAB77905.1| hypothetical protein [Arabidopsis thaliana] gi|332656985|gb|AEE82385.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 729 Score = 108 bits (271), Expect = 6e-22 Identities = 51/94 (54%), Positives = 70/94 (74%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 +TK FNS IN LSS G H +V T++SML + LPD+FT+PSL+KAC SL+ S G+ H Sbjct: 10 STKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACASLQRLSFGLSIH 69 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 Q+V++ GFSSD YI+SSL+ Y++FG +A+KV Sbjct: 70 QQVLVNGFSSDFYISSSLVNLYAKFGLLAHARKV 103 >ref|XP_006396695.1| hypothetical protein EUTSA_v10028467mg [Eutrema salsugineum] gi|557097712|gb|ESQ38148.1| hypothetical protein EUTSA_v10028467mg [Eutrema salsugineum] Length = 726 Score = 105 bits (262), Expect = 6e-21 Identities = 48/94 (51%), Positives = 70/94 (74%) Frame = -3 Query: 284 ATKSFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFH 105 +TK+FNS+IN SS G H +V T++SML + PD+FT+PSL+KAC SL+L S G+ H Sbjct: 10 STKTFNSLINHFSSHGDHLQVLSTFSSMLANRFQPDAFTFPSLLKACASLQLLSFGLSIH 69 Query: 104 QRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 Q+V++ GFSSD Y +SSL+ Y++FG +A+K+ Sbjct: 70 QQVLVNGFSSDCYTSSSLVNLYAKFGILGHARKL 103 >gb|EPS71960.1| hypothetical protein M569_02796 [Genlisea aurea] Length = 751 Score = 103 bits (256), Expect = 3e-20 Identities = 49/102 (48%), Positives = 69/102 (67%), Gaps = 1/102 (0%) Frame = -3 Query: 305 PSNAAWAATKSFNSIINRLSSQGAHHEVFLTYTSMLTS-NTLPDSFTYPSLIKACTSLKL 129 P+ A + +S N+ + RLSS+G H EV L + SML S T PD FTYPS +KAC +L+ Sbjct: 15 PATIAASGLRSLNATVGRLSSEGFHREVLLAFASMLKSPETTPDFFTYPSALKACIALRF 74 Query: 128 FSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 SL + HQR+++ G++SD YI+SSLI+ Y F + YA+KV Sbjct: 75 LSLSLSIHQRIIVSGYASDSYISSSLISLYGNFKNVDYARKV 116 Score = 62.0 bits (149), Expect = 8e-08 Identities = 34/83 (40%), Positives = 46/83 (55%) Frame = -3 Query: 275 SFNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFHQRV 96 S+NS+IN S G HEV + M T N PD T+ SL+ A S G + H +V Sbjct: 226 SWNSLINAYSIVGDVHEVLKLFRKMTTENIEPDQQTFGSLVSAIASQGNLQAGRVVHGKV 285 Query: 95 VIEGFSSDPYIASSLITFYSRFG 27 + GF+S ++ +SLITF SR G Sbjct: 286 ITYGFASHKHVETSLITFLSRCG 308 >gb|EMJ14151.1| hypothetical protein PRUPE_ppa022121mg [Prunus persica] Length = 701 Score = 86.7 bits (213), Expect = 3e-15 Identities = 40/67 (59%), Positives = 52/67 (77%) Frame = -3 Query: 203 MLTSNTLPDSFTYPSLIKACTSLKLFSLGILFHQRVVIEGFSSDPYIASSLITFYSRFGS 24 ML +NT PD++T+P+L+KACTSL LF G+ FHQ +V+ GFS D YIASSLI FY++FG Sbjct: 1 MLKTNTPPDTYTFPNLLKACTSLNLFPFGLSFHQCLVVNGFSLDAYIASSLINFYAKFGH 60 Query: 23 TVYAQKV 3 A+KV Sbjct: 61 AQNARKV 67 >ref|XP_004308640.1| PREDICTED: pentatricopeptide repeat-containing protein At3g03580-like [Fragaria vesca subsp. vesca] Length = 764 Score = 78.2 bits (191), Expect = 1e-12 Identities = 37/90 (41%), Positives = 56/90 (62%) Frame = -3 Query: 272 FNSIINRLSSQGAHHEVFLTYTSMLTSNTLPDSFTYPSLIKACTSLKLFSLGILFHQRVV 93 +NSII L+ G H E Y +ML +N PDS T+PS+I AC +L +G++ H+RV Sbjct: 85 WNSIIRALTHNGLHSEALRHYNAMLHTNVRPDSHTFPSVINACAALCDLEMGLVIHRRVS 144 Query: 92 IEGFSSDPYIASSLITFYSRFGSTVYAQKV 3 GF +D Y+ ++LI Y+R G +A++V Sbjct: 145 ETGFGTDLYVCNALIDMYARLGELGHARQV 174