BLASTX nr result
ID: Catharanthus23_contig00003011
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00003011
         (1217 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containi...  361  3e-97
ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containi...  361  3e-97
ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containi...  360  8e-97
ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containi...  355  2e-95
gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus pe...  355  2e-95
ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containi...  353  6e-95
emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]  353  6e-95
ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Popu...  353  8e-95
ref|XP_002331436.1| predicted protein [Populus trichocarpa] gi|5...  353  8e-95
ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citr...  352  2e-94
gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, put...  349  1e-93
ref|XP_002511816.1| pentatricopeptide repeat-containing protein,...  348  2e-93
ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [A...  345  2e-92
ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containi...  343  6e-92
gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus...  342  2e-91
gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus...  342  2e-91
gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus...  337  8e-90
ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containi...
336 1e-89 ref|XP_004489844.1| PREDICTED: pentatricopeptide repeat-containi... 331 4e-88 ref|XP_006307633.1| hypothetical protein CARUB_v10009261mg [Caps... 318 3e-84 >ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X3 [Solanum tuberosum] Length = 409 Score = 361 bits (927), Expect = 3e-97 Identities = 174/236 (73%), Positives = 205/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 SMGF VDS TGNA+VIYYS FG ++EME AY RL+ SRILIEEEAIR++S AY+++ KFY Sbjct: 174 SMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKMSRILIEEEAIRSISLAYLKKEKFY 233 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 SLG+FV +VGL RRNVGNLLWNL+LLSYAANFKMKSLQREFVRMVE+GF PDL TFNIRA Sbjct: 234 SLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVESGFFPDLNTFNIRA 293 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFSKMSLFWDLHV+LEHMKH+ V+PDLVTYG VVDAYL+R L RNL+FAL K++INDCV Sbjct: 294 LAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNLDFALRKLNINDCV 353 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++ T+ +VFEA+GKGDFH S++ LEF +NWTY+ELI +LKK +R NQ FWNY Sbjct: 354 IVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYEELITTYLKKYFRRNQIFWNY 409 >ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X1 [Solanum tuberosum] gi|565389826|ref|XP_006360649.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X2 [Solanum tuberosum] Length = 416 Score = 361 bits (927), Expect = 3e-97 Identities = 174/236 (73%), Positives = 205/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 SMGF VDS TGNA+VIYYS FG ++EME AY RL+ SRILIEEEAIR++S AY+++ KFY Sbjct: 181 SMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKMSRILIEEEAIRSISLAYLKKEKFY 240 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 SLG+FV +VGL RRNVGNLLWNL+LLSYAANFKMKSLQREFVRMVE+GF PDL TFNIRA Sbjct: 241 
SLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVESGFFPDLNTFNIRA 300 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFSKMSLFWDLHV+LEHMKH+ V+PDLVTYG VVDAYL+R L RNL+FAL K++INDCV Sbjct: 301 LAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNLDFALRKLNINDCV 360 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++ T+ +VFEA+GKGDFH S++ LEF +NWTY+ELI +LKK +R NQ FWNY Sbjct: 361 IVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYEELITTYLKKYFRRNQIFWNY 416 >ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Solanum lycopersicum] Length = 381 Score = 360 bits (923), Expect = 8e-97 Identities = 175/236 (74%), Positives = 203/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 SMGF VDS TGNA+VIYYS FG+++EME AY RL+ SRILIEEEAIR++S AY+++ KFY Sbjct: 146 SMGFPVDSTTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSISLAYLKKEKFY 205 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 SLG+FV +VGL RRNVGNLLWNL+LLSYAANFKMKSLQREFVRMVE+GF PDL TFNIRA Sbjct: 206 SLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVESGFFPDLNTFNIRA 265 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFSKMSLFWDLHV+LEHMKH+ V+PDLVTYG VVDAYL+R L RNL+FAL K++ NDCV Sbjct: 266 LAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNLDFALRKLNTNDCV 325 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 + T+ +VFEAMGKGDFH S+E LEF K NWTY+ LI +LKK +R NQ FWNY Sbjct: 326 TVATEPLVFEAMGKGDFHLSSEARLEFSKKTNWTYEVLITTYLKKYFRRNQIFWNY 381 >ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Fragaria vesca subsp. 
vesca] Length = 424 Score = 355 bits (912), Expect = 2e-95 Identities = 167/236 (70%), Positives = 205/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGN F+ YYSIFGS+ EME AY RL+ SR LIEEE IRA+S AY+++RKFY Sbjct: 189 SRGFPVDSATGNVFIRYYSIFGSLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFY 248 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 SL +F+++VGLGRRN+GNLLWNL+LLSYAANFKMK+LQREF+RMVEAGFHPDLTTFNIRA Sbjct: 249 SLAEFLKSVGLGRRNLGNLLWNLLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRA 308 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSL WDLH++LEHMKH V+PDLVT GC+VDAYL+RRL RNL FAL+KM+++D Sbjct: 309 LAFSRMSLLWDLHLTLEHMKHVKVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSP 368 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 V+LTD VFE +GKGDFH+S+E LEF+ ++ WTY++LI ++LKK+YR +Q FWNY Sbjct: 369 VVLTDPFVFEVLGKGDFHASSEAFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424 >gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica] Length = 423 Score = 355 bits (911), Expect = 2e-95 Identities = 168/236 (71%), Positives = 203/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF +DS TGNAF+ YYSIFGS+ EME AY RL+ SR LIEEE IRA+S AY+++RKFY Sbjct: 188 SRGFPLDSATGNAFIRYYSIFGSLTEMETAYGRLKRSRFLIEEEGIRAMSFAYLKKRKFY 247 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 L + ++NVGLGRRN+GNL WNL+LLSYAA+FKMKSLQREF+RMVEAGFHPDLTTFNIRA Sbjct: 248 RLAELLKNVGLGRRNLGNLSWNLLLLSYAADFKMKSLQREFLRMVEAGFHPDLTTFNIRA 307 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSL WDLH+SLEHMKH+ V PDLVT GCVVDAYLERRL +N+ FAL+KM+++D Sbjct: 308 LAFSRMSLLWDLHLSLEHMKHEKVFPDLVTCGCVVDAYLERRLGKNMYFALNKMNLDDSP 367 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD VFE +GKGDFH+S+E LEF+S+R WTY+ LI ++LKK+YR NQ FWNY Sbjct: 368 LILTDPFVFEVLGKGDFHASSEAFLEFQSQREWTYRRLISVYLKKQYRRNQIFWNY 423 
>ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630 [Vitis vinifera] gi|297736023|emb|CBI24061.3| unnamed protein product [Vitis vinifera] Length = 423 Score = 353 bits (907), Expect = 6e-95 Identities = 171/236 (72%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGNAF+ YYSIFGS+ EME AY RL+ SRILIEEE IRA+S AYI+E+K+Y Sbjct: 188 SRGFPVDSATGNAFIRYYSIFGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYY 247 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+NVGNLLWNL+LLSYAANFKMKSLQREF+ MVEAGF PDLTTFNIRA Sbjct: 248 RLGQFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRA 307 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSLFWDLH+SLEHM+H V+ DLVTYGCVVDAYL+RRL +NL+FAL KM+++D Sbjct: 308 LAFSRMSLFWDLHLSLEHMQHVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSP 367 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++ TD VFE +GKGDFHSS+E LE K WTY++LI +LKKKYRSNQ FWNY Sbjct: 368 LVSTDHFVFEVLGKGDFHSSSEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 423 >emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera] Length = 446 Score = 353 bits (907), Expect = 6e-95 Identities = 171/236 (72%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGNAF+ YYSIFGS+ EME AY RL+ SRILIEEE IRA+S AYI+E+K+Y Sbjct: 211 SRGFPVDSATGNAFIRYYSIFGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYY 270 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+NVGNLLWNL+LLSYAANFKMKSLQREF+ MVEAGF PDLTTFNIRA Sbjct: 271 RLGQFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRA 330 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSLFWDLH+SLEHM+H V+ DLVTYGCVVDAYL+RRL +NL+FAL KM+++D Sbjct: 331 LAFSRMSLFWDLHLSLEHMQHVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSP 390 Query: 677 
VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++ TD VFE +GKGDFHSS+E LE K WTY++LI +LKKKYRSNQ FWNY Sbjct: 391 LVSTDHFVFEVLGKGDFHSSSEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 446 >ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa] gi|550318749|gb|ERP50015.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa] Length = 392 Score = 353 bits (906), Expect = 8e-95 Identities = 166/236 (70%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGNAFV+YYS+ GS+AEME AY RL+ SR+LIE E IRA+S AYI+ERKFY Sbjct: 157 SKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGIRAMSFAYIKERKFY 216 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 L +F+ +VGLGR+N+GNL+WNL+LLSY+ANFKMK+LQREF+ M+EAGFHPDLTTFNIRA Sbjct: 217 GLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLEAGFHPDLTTFNIRA 276 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSL WDLH+ LEHMKH V PDLVTYGC+VDAYL+RRLVRNLEFAL KM +++ Sbjct: 277 LAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRNLEFALSKMHVDNSP 336 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 V+ TD VFE GKGDFHSS+E +EFK +R WTY+ELI+I+L+K++RS FWNY Sbjct: 337 VLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQHRSKHIFWNY 392 >ref|XP_002331436.1| predicted protein [Populus trichocarpa] gi|566215849|ref|XP_006372219.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550318750|gb|ERP50016.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 428 Score = 353 bits (906), Expect = 8e-95 Identities = 166/236 (70%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGNAFV+YYS+ GS+AEME AY RL+ SR+LIE E IRA+S AYI+ERKFY Sbjct: 193 SKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGIRAMSFAYIKERKFY 252 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 L +F+ 
+VGLGR+N+GNL+WNL+LLSY+ANFKMK+LQREF+ M+EAGFHPDLTTFNIRA Sbjct: 253 GLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLEAGFHPDLTTFNIRA 312 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+MSL WDLH+ LEHMKH V PDLVTYGC+VDAYL+RRLVRNLEFAL KM +++ Sbjct: 313 LAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRNLEFALSKMHVDNSP 372 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 V+ TD VFE GKGDFHSS+E +EFK +R WTY+ELI+I+L+K++RS FWNY Sbjct: 373 VLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQHRSKHIFWNY 428 >ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citrus clementina] gi|568840749|ref|XP_006474328.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X1 [Citrus sinensis] gi|568840751|ref|XP_006474329.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X2 [Citrus sinensis] gi|568840753|ref|XP_006474330.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X3 [Citrus sinensis] gi|568840755|ref|XP_006474331.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X4 [Citrus sinensis] gi|568840757|ref|XP_006474332.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X5 [Citrus sinensis] gi|568840759|ref|XP_006474333.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X6 [Citrus sinensis] gi|557556412|gb|ESR66426.1| hypothetical protein CICLE_v10010414mg [Citrus clementina] Length = 412 Score = 352 bits (903), Expect = 2e-94 Identities = 167/236 (70%), Positives = 203/236 (86%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GFSVDS TGNAF+IYYS FGS+ EME AY RL+ SR LI++E IRA+S Y++ERKF+ Sbjct: 177 SRGFSVDSATGNAFIIYYSRFGSLTEMETAYGRLKRSRHLIDKEGIRAVSFTYLKERKFF 236 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+++GNLLWNL+LLSYA NFKMKSLQREF+RM EAGFHPDLTTFNIRA Sbjct: 237 
MLGEFLRDVGLGRKDLGNLLWNLLLLSYAGNFKMKSLQREFMRMSEAGFHPDLTTFNIRA 296 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 +AFS+MS+FWDLH+SLEHMKH+SV PDLVTYGCVVDAYL++RL RNL+F L KM+++D Sbjct: 297 VAFSRMSMFWDLHLSLEHMKHESVGPDLVTYGCVVDAYLDKRLGRNLDFGLSKMNLDDSP 356 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 V+ TD VFEA GKGDFHSS+E LEFK +R WTY++LI ++LKK+ R NQ FWNY Sbjct: 357 VVSTDPYVFEAFGKGDFHSSSEAFLEFKRQRKWTYRKLIAVYLKKQLRRNQIFWNY 412 >gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 429 Score = 349 bits (895), Expect = 1e-93 Identities = 168/237 (70%), Positives = 206/237 (86%), Gaps = 1/237 (0%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G VDS TGNAFV YYSIFGS++EME AYARL+ SR LIEEE IRA+SSAYI+E KFY Sbjct: 193 SRGLPVDSATGNAFVRYYSIFGSLSEMEIAYARLKRSRHLIEEEGIRAMSSAYIKEGKFY 252 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ ++GLGRRN+GNLLWNL+LLSYAANFKMK++QR F++M+++GF PDLTTFNIRA Sbjct: 253 RLGEFLNDLGLGRRNLGNLLWNLLLLSYAANFKMKTMQRLFLKMMDSGFRPDLTTFNIRA 312 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 AFS+MS+FWDLH+SLEHMKH+SV+ DLVTYGCVVDAYL+RRL RNL+FAL+ M+ +D Sbjct: 313 WAFSRMSMFWDLHLSLEHMKHESVVSDLVTYGCVVDAYLDRRLARNLDFALNHMNADDSP 372 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFK-SKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD +VFEA+GKGDFHSSAE LEFK K+ WTY++LI ++LKK+ R NQ FWNY Sbjct: 373 LVLTDPLVFEALGKGDFHSSAEAFLEFKRQKKKWTYRQLIAVYLKKQLRRNQIFWNY 429 >ref|XP_002511816.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548996|gb|EEF50485.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 427 Score = 348 bits (894), Expect = 2e-93 Identities = 160/234 (68%), Positives = 202/234 (86%) Frame = -1 Query: 1211 GFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFYSL 1032 GF VD TGNAF+ YYSI GS+ +ME AY+RL+ SR L++ E IRA+S AY++ERKFY L Sbjct: 194 
GFPVDYATGNAFIRYYSIHGSLTDMESAYSRLKRSRHLVDREGIRAVSLAYVKERKFYRL 253 Query: 1031 GKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRALA 852 G+F+ +VGLGR++VGNL+WN +LLS+AANFKMKSLQREF+RM+EAGFHPD+TTFNIRALA Sbjct: 254 GEFLRDVGLGRKDVGNLIWNFLLLSFAANFKMKSLQREFLRMLEAGFHPDVTTFNIRALA 313 Query: 851 FSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCVVM 672 FS+MSL WDLH++LEHMKH+ V PD+VTYGC+VDAYL+RRL +NL+FA+ KM+++ V+ Sbjct: 314 FSRMSLLWDLHLTLEHMKHEKVSPDIVTYGCIVDAYLDRRLGKNLDFAIKKMNLDGSPVL 373 Query: 671 LTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 LTD VFE +GKGDFHSSAE LEFK +R WTY+EL+ I+L+K+YRSNQ FWNY Sbjct: 374 LTDPFVFEVLGKGDFHSSAEAFLEFKRQRKWTYRELVSIYLRKQYRSNQIFWNY 427 >ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda] gi|548854641|gb|ERN12551.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda] Length = 354 Score = 345 bits (886), Expect = 2e-92 Identities = 164/236 (69%), Positives = 198/236 (83%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF VDS TGNAF+IYYS FGS+AEME AY RL+ SRILIE EAIRA++SAYIRERKF+ Sbjct: 119 SRGFKVDSNTGNAFIIYYSSFGSLAEMEIAYGRLKCSRILIEREAIRAMASAYIRERKFF 178 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 +G+F+ +VGLGRRN GNLLWNL+LLSYAANFKMKSLQR F+ M+EAGF PD+TTFNIR Sbjct: 179 KMGEFLRDVGLGRRNSGNLLWNLLLLSYAANFKMKSLQRTFLGMLEAGFSPDITTFNIRT 238 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M +FWDLH+S+EHM+H +VIPDLVTYGC+VDAY+ERR RNL F L M+++ Sbjct: 239 LAFSRMCMFWDLHLSIEHMRHMNVIPDLVTYGCIVDAYVERRFGRNLGFGLKCMNLDSSP 298 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD IV+E GKGDFHSS+E LE K K+ WTY +L+ +LKK+YRSNQ FWNY Sbjct: 299 LILTDPIVYEVFGKGDFHSSSEALLELKWKKEWTYSKLVAFYLKKRYRSNQIFWNY 354 >ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis sativus] gi|449507537|ref|XP_004163059.1| PREDICTED: pentatricopeptide 
repeat-containing protein At3g42630-like [Cucumis sativus] Length = 388 Score = 343 bits (881), Expect = 6e-92 Identities = 161/236 (68%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF+V+S TGN+F+IYYS+FGS+ EME AY RL+ SR LIE++ I A++ AYIR+RKFY Sbjct: 153 SSGFTVNSATGNSFIIYYSMFGSLVEMETAYGRLKRSRFLIEKKGIMAMAFAYIRKRKFY 212 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+NVGNLLWNL+LLSYAANFKMKSLQREF++MV+AGF+PDLTTFNIRA Sbjct: 213 RLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLQMVDAGFNPDLTTFNIRA 272 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M L WDLH+SLEHMKH ++ PDLVTYGCVVDAY++RRL RNLEF L KM+ + Sbjct: 273 LAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYGCVVDAYVDRRLGRNLEFILSKMNPDQPP 332 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 V LTD VFEA+GKGDFH S+E ++F+ ++ WTY+ELI ++LKK +R NQ FWNY Sbjct: 333 VSLTDSFVFEALGKGDFHMSSEAFMQFRKQKKWTYRELISLYLKKHHRRNQVFWNY 388 >gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007757|gb|ESW06706.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] Length = 423 Score = 342 bits (877), Expect = 2e-91 Identities = 160/236 (67%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G V S TGNAFV+YYSIFGS+ +ME+AY RL+ SR LIE E IRA++SAY RER+FY Sbjct: 188 SRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMASAYTRERQFY 247 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+++GNLLWNLMLLSYA NFKMKSLQ+EF++MVE+GF PD+TTFNIRA Sbjct: 248 ELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRPDITTFNIRA 307 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M+LFWDLH+S+EHM+H++VIPDLVT+GCVVDAYL+R L RNL FAL+KM+++D Sbjct: 308 LAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFALNKMNLDDSP 367 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 
510 ++LTD V+EA+GKGDF S+E EFK+ R WTY+ LI+ +LKK YR NQ FWNY Sbjct: 368 MLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQIFWNY 423 >gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007756|gb|ESW06705.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] Length = 372 Score = 342 bits (877), Expect = 2e-91 Identities = 160/236 (67%), Positives = 201/236 (85%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G V S TGNAFV+YYSIFGS+ +ME+AY RL+ SR LIE E IRA++SAY RER+FY Sbjct: 137 SRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMASAYTRERQFY 196 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGLGR+++GNLLWNLMLLSYA NFKMKSLQ+EF++MVE+GF PD+TTFNIRA Sbjct: 197 ELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRPDITTFNIRA 256 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M+LFWDLH+S+EHM+H++VIPDLVT+GCVVDAYL+R L RNL FAL+KM+++D Sbjct: 257 LAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFALNKMNLDDSP 316 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD V+EA+GKGDF S+E EFK+ R WTY+ LI+ +LKK YR NQ FWNY Sbjct: 317 MLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQIFWNY 372 >gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] gi|561005327|gb|ESW04321.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] Length = 411 Score = 337 bits (863), Expect = 8e-90 Identities = 158/236 (66%), Positives = 200/236 (84%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G + S T NAFV+YYSIFGS+ +ME+AY RL+ SR LIE E IRA++SAY RER+FY Sbjct: 176 SRGVHISSKTANAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMASAYTRERQFY 235 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGL R++VGNLLWNLMLLSYAANFKMKSLQ+EF++MVE+GF PD+TTFNIRA Sbjct: 236 ELGEFLRDVGLVRKDVGNLLWNLMLLSYAANFKMKSLQKEFLQMVESGFRPDITTFNIRA 295 Query: 857 
LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M+LFWDLH+S+EHM+H++VIPDLVT+GCVVDAYL+R L +NL FAL+KM+++D Sbjct: 296 LAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGKNLNFALNKMNLDDSP 355 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD V+EA+GKGDF S+E EFK+ R WTY+ LI+ +LKK YR NQ FWNY Sbjct: 356 MLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQIFWNY 411 >ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Glycine max] Length = 415 Score = 336 bits (862), Expect = 1e-89 Identities = 157/236 (66%), Positives = 199/236 (84%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G + S T NAF++YYS+FG++ EME+ Y RL+ SR LIE+E IRA++SAYI+ERKFY Sbjct: 180 SSGVHIYSRTANAFLLYYSLFGTLEEMENTYGRLKKSRFLIEKEVIRAVASAYIKERKFY 239 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+F+ +VGL R+NVGNLLWNLMLLSYAANFKMKSLQREF+ MVE+GF PD+TTFNIRA Sbjct: 240 ELGEFLRDVGLRRKNVGNLLWNLMLLSYAANFKMKSLQREFIGMVESGFRPDITTFNIRA 299 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M+LFWDLH+S+EHM+H +IPDLVT+GCVVDAYL+RRL RNL+FAL+KM+++D Sbjct: 300 LAFSRMALFWDLHLSIEHMEHTKIIPDLVTFGCVVDAYLDRRLGRNLDFALNKMNLDDSP 359 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 +LTD V+EA+GKG F S+E E+K++R WTY+ LI+ +LKK YR NQ FWNY Sbjct: 360 RLLTDPFVYEALGKGGFQMSSEAFFEYKTQRKWTYRSLIQKYLKKHYRKNQIFWNY 415 >ref|XP_004489844.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cicer arietinum] Length = 398 Score = 331 bits (848), Expect = 4e-88 Identities = 154/236 (65%), Positives = 196/236 (83%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S GF +DS TGN + Y++FGS+ +ME+ Y RL+ SR I+ E IRA++ AY+++RKFY Sbjct: 163 SNGFHIDSKTGNYLIRCYAVFGSLNQMENTYGRLKRSRFSIDGETIRAVAFAYLKKRKFY 222 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 LG+FV 
+VGLGRRN+GNLLWNL+LLSYAANFKMKSLQREFVRMV+AGF PDLT+FNIR Sbjct: 223 ELGEFVRDVGLGRRNLGNLLWNLLLLSYAANFKMKSLQREFVRMVQAGFRPDLTSFNIRV 282 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M LFWDLH+S+EHM+ + V+PDLVTYGCVVD YL+R+L RNLEF L+KMS++DC Sbjct: 283 LAFSRMDLFWDLHLSIEHMRDEMVVPDLVTYGCVVDGYLDRKLGRNLEFVLNKMSVDDCP 342 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 +LTD VFE +GKGDFH S+E LE+++ +NW+Y+ LI+ +LKK YR NQ FWNY Sbjct: 343 RLLTDPFVFEVLGKGDFHLSSEAFLEYETGQNWSYRVLIKKYLKKHYRRNQIFWNY 398 >ref|XP_006307633.1| hypothetical protein CARUB_v10009261mg [Capsella rubella] gi|482576344|gb|EOA40531.1| hypothetical protein CARUB_v10009261mg [Capsella rubella] Length = 419 Score = 318 bits (815), Expect = 3e-84 Identities = 147/236 (62%), Positives = 194/236 (82%) Frame = -1 Query: 1217 SMGFSVDSVTGNAFVIYYSIFGSVAEMEDAYARLRSSRILIEEEAIRALSSAYIRERKFY 1038 S G S+DS T NA V YYS+FG++ ++E AY RL I+IEEE IRA+ AY++ERKFY Sbjct: 184 SKGMSLDSGTSNAIVRYYSVFGTLKKIEQAYGRLWKFGIVIEEEEIRAVLLAYLKERKFY 243 Query: 1037 SLGKFVENVGLGRRNVGNLLWNLMLLSYAANFKMKSLQREFVRMVEAGFHPDLTTFNIRA 858 L +F+ +VGLGRRN+GNLLWN +LLSYAA+FKMKSLQREFV M++AGF PDLTTFNIRA Sbjct: 244 RLREFLSDVGLGRRNLGNLLWNSVLLSYAADFKMKSLQREFVGMLDAGFSPDLTTFNIRA 303 Query: 857 LAFSKMSLFWDLHVSLEHMKHKSVIPDLVTYGCVVDAYLERRLVRNLEFALDKMSINDCV 678 LAFS+M+LFWDLH++LEHM+H +++PDLVT+GCVVDAY++RRL RNLEF ++M+ +D Sbjct: 304 LAFSRMALFWDLHLTLEHMRHLNIVPDLVTFGCVVDAYMDRRLARNLEFFYNQMNFDDSP 363 Query: 677 VMLTDEIVFEAMGKGDFHSSAEPPLEFKSKRNWTYKELIEIHLKKKYRSNQAFWNY 510 ++LTD + +E +GKGDFH S+E LEF ++NWTY++L+ ++ KKK R +Q FWNY Sbjct: 364 IVLTDPLAYEVLGKGDFHLSSEAVLEFSPRKNWTYRKLLGVYFKKKLRRDQIFWNY 419
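
Each alignment in the report above opens with a statistics line of the form "Score = 361 bits (927), Expect = 3e-97 Identities = 174/236 (73%), Positives = 205/236 (86%)". The following is a minimal sketch of how those fields could be extracted and the percentages recomputed. It assumes the legacy text layout shown in this report; `HIT_STATS` and `parse_hit_stats` are hypothetical names introduced for illustration and are not part of BLAST. The percentages printed in the report appear to be truncated rather than rounded (174/236 is about 73.7%, shown as 73%), so the sketch keeps the unrounded values.

```python
import re

# Hypothetical pattern for the per-hit statistics line of a legacy BLAST
# text report. Whitespace is matched loosely so that it also works on
# reports whose line breaks have been flattened into spaces.
HIT_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
)


def parse_hit_stats(report):
    """Return one dict of statistics per alignment found in `report`."""
    hits = []
    for m in HIT_STATS.finditer(report):
        ident = int(m["ident"])
        length = int(m["length"])
        pos = int(m["pos"])
        evalue_text = m["evalue"]
        # Legacy BLAST may print very small E-values as "e-100"
        # (no leading digit); normalise so that float() accepts it.
        if evalue_text.lower().startswith("e"):
            evalue_text = "1" + evalue_text
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue_text),
            "identity_pct": 100.0 * ident / length,
            "positive_pct": 100.0 * pos / length,
        })
    return hits


# Statistics line of the top hit (ref|XP_006360650.1|), copied from the report.
sample = (
    "Score = 361 bits (927), Expect = 3e-97 "
    "Identities = 174/236 (73%), Positives = 205/236 (86%) Frame = -1"
)
stats = parse_hit_stats(sample)
print(stats[0])
```

Passing the full report text to `parse_hit_stats` instead of `sample` would yield one entry per alignment; the optional "Gaps = ..." field that some hits carry follows the matched fields and is simply ignored by this pattern.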