BLASTX nr result

ID: Atropa21_contig00007536 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00007536
         (2441 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containi...   693   0.0  
ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containi...   692   0.0  
ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containi...   685   0.0  
ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containi...   508   e-141
ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containi...   488   e-135
ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citr...   483   e-133
gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus pe...   481   e-133
ref|XP_002331436.1| predicted protein [Populus trichocarpa] gi|5...   463   e-127
ref|XP_002511816.1| pentatricopeptide repeat-containing protein,...   451   e-124
gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, put...   446   e-122
ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containi...   442   e-121
ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containi...   439   e-120
gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus...   437   e-120
emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]   437   e-120
gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus...   432   e-118
gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus...   432   e-118
ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Popu...   427   e-117
gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis are...   423   e-115
ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [A...   421   e-115
ref|XP_006307633.1| hypothetical protein CARUB_v10009261mg [Caps...   414   e-112

>ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X1 [Solanum tuberosum]
            gi|565389826|ref|XP_006360649.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Solanum tuberosum]
          Length = 416

 Score =  693 bits (1789), Expect = 0.0
 Identities = 345/386 (89%), Positives = 361/386 (93%)
 Frame = +2

Query: 197  QRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLM 376
            +R W  KQGGNI P GNYADC SLIQGLSRKKLPVAAERLVL+MKSEGFVPD+STLS LM
Sbjct: 31   KRRWRMKQGGNIDPRGNYADCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALM 90

Query: 377  LCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSD 556
            LCYASNGLF KAL AWDEI+NSSFLPDV VI ELIDI  C GYLDVAVRILHQIQLKDS+
Sbjct: 91   LCYASNGLFYKALAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSN 150

Query: 557  LLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVA 736
            LLRDVYA+ ISRFGKKGQLELME+MLKEMVSMGFPVDS TGNAYVIYYSNFG LSEMEVA
Sbjct: 151  LLRDVYAQAISRFGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVA 210

Query: 737  YGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAA 916
            YGRLKMSRILIEEEAIRS+S AYLK++KFYSLGQFVRDVGLCRRNVGNLLWN+L LSYAA
Sbjct: 211  YGRLKMSRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAA 270

Query: 917  NFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVT 1096
            NFKMKSLQREFVRM+ESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVT
Sbjct: 271  NFKMKSLQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVT 330

Query: 1097 YGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKK 1276
            YGSVVDAYLD+ LGRNLDFALRKLN NDCV +ATEPLVFEA+GKGDFHLSS+ARLEFSK 
Sbjct: 331  YGSVVDAYLDRGLGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKN 390

Query: 1277 KNWTYEELIAIYLKKYFRRNQIFWNY 1354
            KNWTYEELI  YLKKYFRRNQIFWNY
Sbjct: 391  KNWTYEELITTYLKKYFRRNQIFWNY 416


>ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Solanum lycopersicum]
          Length = 381

 Score =  692 bits (1786), Expect = 0.0
 Identities = 344/380 (90%), Positives = 359/380 (94%)
 Frame = +2

Query: 215  KQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASN 394
            KQGGNI P  NY DC SLIQGLSRKKLPVAAERLVL+MKSEGFVPD+STLS LMLCYA+N
Sbjct: 2    KQGGNIDPRINYRDCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYATN 61

Query: 395  GLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVY 574
            GLFCKAL AWDEI+NSSFLPDV VI ELIDI GC GYLDVAVRILHQIQLKDS+LLRDVY
Sbjct: 62   GLFCKALAAWDEIMNSSFLPDVHVIAELIDIYGCKGYLDVAVRILHQIQLKDSNLLRDVY 121

Query: 575  ARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKM 754
            A+ ISRFGKKGQLELME+ML+EMVSMGFPVDS TGNAYVIYYSNFGTLSEMEVAYGRLKM
Sbjct: 122  AQAISRFGKKGQLELMEVMLEEMVSMGFPVDSTTGNAYVIYYSNFGTLSEMEVAYGRLKM 181

Query: 755  SRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKS 934
            SRILIEEEAIRS+S AYLK++KFYSLGQFVRDVGLCRRNVGNLLWN+L LSYAANFKMKS
Sbjct: 182  SRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKS 241

Query: 935  LQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD 1114
            LQREFVRM+ESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD
Sbjct: 242  LQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD 301

Query: 1115 AYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYE 1294
            AYLD+ LGRNLDFALRKLN NDCVT+ATEPLVFEAMGKGDFHLSSEARLEFSKK NWTYE
Sbjct: 302  AYLDRGLGRNLDFALRKLNTNDCVTVATEPLVFEAMGKGDFHLSSEARLEFSKKTNWTYE 361

Query: 1295 ELIAIYLKKYFRRNQIFWNY 1354
             LI  YLKKYFRRNQIFWNY
Sbjct: 362  VLITTYLKKYFRRNQIFWNY 381


>ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X3 [Solanum tuberosum]
          Length = 409

 Score =  685 bits (1767), Expect = 0.0
 Identities = 341/380 (89%), Positives = 358/380 (94%)
 Frame = +2

Query: 215  KQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASN 394
            ++GGNI P GNYADC SLIQGLSRKKLPVAAERLVL+MKSEGFVPD+STLS LMLCYASN
Sbjct: 30   QKGGNIDPRGNYADCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYASN 89

Query: 395  GLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVY 574
            GLF KAL AWDEI+NSSFLPDV VI ELIDI  C GYLDVAVRILHQIQLKDS+LLRDVY
Sbjct: 90   GLFYKALAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSNLLRDVY 149

Query: 575  ARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKM 754
            A+ ISRFGKKGQLELME+MLKEMVSMGFPVDS TGNAYVIYYSNFG LSEMEVAYGRLKM
Sbjct: 150  AQAISRFGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKM 209

Query: 755  SRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKS 934
            SRILIEEEAIRS+S AYLK++KFYSLGQFVRDVGLCRRNVGNLLWN+L LSYAANFKMKS
Sbjct: 210  SRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKS 269

Query: 935  LQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD 1114
            LQREFVRM+ESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD
Sbjct: 270  LQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVD 329

Query: 1115 AYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYE 1294
            AYLD+ LGRNLDFALRKLN NDCV +ATEPLVFEA+GKGDFHLSS+ARLEFSK KNWTYE
Sbjct: 330  AYLDRGLGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYE 389

Query: 1295 ELIAIYLKKYFRRNQIFWNY 1354
            ELI  YLKKYFRRNQIFWNY
Sbjct: 390  ELITTYLKKYFRRNQIFWNY 409


>ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630
            [Vitis vinifera] gi|297736023|emb|CBI24061.3| unnamed
            protein product [Vitis vinifera]
          Length = 423

 Score =  508 bits (1309), Expect = e-141
 Identities = 255/400 (63%), Positives = 307/400 (76%)
 Frame = +2

Query: 155  NIALTRKVVCCN*SQRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKS 334
            N AL RK+         WH KQ  ++    NY D   LIQ LSRK+LP  A+ L+ +MKS
Sbjct: 32   NRALARKLF--------WHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKS 83

Query: 335  EGFVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDV 514
            EGF+P+NSTLS LMLCYA NGLF KA   WDEIINSSF P++ ++ +LID  G  G+   
Sbjct: 84   EGFLPNNSTLSALMLCYADNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGE 143

Query: 515  AVRILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVI 694
              RILHQ+  +D + + +VY+  IS FGK GQLE+ME  LKEMVS GFPVDS TGNA++ 
Sbjct: 144  VTRILHQVSSRDFNFMHEVYSLAISCFGKGGQLEMMENALKEMVSRGFPVDSATGNAFIR 203

Query: 695  YYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNV 874
            YYS FG+L+EME AY RLK SRILIEEE IR+MS AY+KE+K+Y LGQF+RDVGL R+NV
Sbjct: 204  YYSIFGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNV 263

Query: 875  GNLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTL 1054
            GNLLWN+L LSYAANFKMKSLQREF+ M+E+GF PDL TFNIRALAFS+MSLFWDLH++L
Sbjct: 264  GNLLWNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSL 323

Query: 1055 EHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGD 1234
            EHM+H KVV DLVTYG VVDAYLD+ LG+NLDFAL+K+N +D   ++T+  VFE +GKGD
Sbjct: 324  EHMQHVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGD 383

Query: 1235 FHLSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
            FH SSEA LE  +   WTY +LIA YLKK +R NQIFWNY
Sbjct: 384  FHSSSEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 423


>ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Fragaria vesca subsp. vesca]
          Length = 424

 Score =  488 bits (1255), Expect = e-135
 Identities = 247/400 (61%), Positives = 303/400 (75%)
 Frame = +2

Query: 155  NIALTRKVVCCN*SQRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKS 334
            N AL RK+V      R W +++         Y DCV LIQ LSR+K+P  A+ ++L MKS
Sbjct: 33   NRALARKIV------RTWKQEECSR--GKDCYVDCVPLIQSLSRQKMPHVAQEVLLVMKS 84

Query: 335  EGFVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDV 514
            EG +P NSTLS +MLC+A NGL  +A   WDE++NSSF+P + V+ EL D+ G  G    
Sbjct: 85   EGLIPSNSTLSAVMLCHAKNGLLPQAEAIWDEMLNSSFVPGIQVVSELFDVYGNVGSFGK 144

Query: 515  AVRILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVI 694
               I+ QI+ ++  LL  VY+  IS FGK GQLELME  LKEMVS GFPVDS TGN ++ 
Sbjct: 145  VNEIVGQIRSRNLSLLPQVYSLAISCFGKGGQLELMEDTLKEMVSRGFPVDSATGNVFIR 204

Query: 695  YYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNV 874
            YYS FG+L+EME AY RLK SR LIEEE IR+MS AYLK++KFYSL +F++ VGL RRN+
Sbjct: 205  YYSIFGSLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFYSLAEFLKSVGLGRRNL 264

Query: 875  GNLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTL 1054
            GNLLWN+L LSYAANFKMK+LQREF+RM+E+GF PDL TFNIRALAFS+MSL WDLH+TL
Sbjct: 265  GNLLWNLLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRALAFSRMSLLWDLHLTL 324

Query: 1055 EHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGD 1234
            EHMKH KVVPDLVT G +VDAYLD+ LGRNL FAL K+N +D   + T+P VFE +GKGD
Sbjct: 325  EHMKHVKVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSPVVLTDPFVFEVLGKGD 384

Query: 1235 FHLSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
            FH SSEA LEF K+K WTY++LI++YLKK +RR+QIFWNY
Sbjct: 385  FHASSEAFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424


>ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citrus clementina]
            gi|568840749|ref|XP_006474328.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X1 [Citrus sinensis]
            gi|568840751|ref|XP_006474329.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Citrus sinensis]
            gi|568840753|ref|XP_006474330.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X3 [Citrus sinensis]
            gi|568840755|ref|XP_006474331.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X4 [Citrus sinensis]
            gi|568840757|ref|XP_006474332.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X5 [Citrus sinensis]
            gi|568840759|ref|XP_006474333.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X6 [Citrus sinensis]
            gi|557556412|gb|ESR66426.1| hypothetical protein
            CICLE_v10010414mg [Citrus clementina]
          Length = 412

 Score =  483 bits (1244), Expect = e-133
 Identities = 234/369 (63%), Positives = 293/369 (79%)
 Frame = +2

Query: 248  YADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWD 427
            + DC SL++ L RKK P  A +LV  +KSEG +PDNSTL  LMLCYA+NG   +A   W+
Sbjct: 44   FVDCASLVEDLGRKKKPHLAHQLVNTVKSEGLLPDNSTLCALMLCYANNGFVLEAQVVWE 103

Query: 428  EIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKG 607
            E+++SSF+  V V+ +L+D  G  G  +  + I+ Q+  +++DLL +VY+R IS FGK+G
Sbjct: 104  ELLSSSFVLSVQVLSDLMDAYGRIGCFNEIISIIDQVSCRNADLLPEVYSRAISCFGKQG 163

Query: 608  QLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIR 787
            QLELME  LKEMVS GF VDS TGNA++IYYS FG+L+EME AYGRLK SR LI++E IR
Sbjct: 164  QLELMENTLKEMVSRGFSVDSATGNAFIIYYSRFGSLTEMETAYGRLKRSRHLIDKEGIR 223

Query: 788  SMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIES 967
            ++S  YLKE+KF+ LG+F+RDVGL R+++GNLLWN+L LSYA NFKMKSLQREF+RM E+
Sbjct: 224  AVSFTYLKERKFFMLGEFLRDVGLGRKDLGNLLWNLLLLSYAGNFKMKSLQREFMRMSEA 283

Query: 968  GFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNL 1147
            GF PDL TFNIRA+AFS+MS+FWDLH++LEHMKHE V PDLVTYG VVDAYLDK LGRNL
Sbjct: 284  GFHPDLTTFNIRAVAFSRMSMFWDLHLSLEHMKHESVGPDLVTYGCVVDAYLDKRLGRNL 343

Query: 1148 DFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYF 1327
            DF L K+N +D   ++T+P VFEA GKGDFH SSEA LEF +++ WTY +LIA+YLKK  
Sbjct: 344  DFGLSKMNLDDSPVVSTDPYVFEAFGKGDFHSSSEAFLEFKRQRKWTYRKLIAVYLKKQL 403

Query: 1328 RRNQIFWNY 1354
            RRNQIFWNY
Sbjct: 404  RRNQIFWNY 412


>gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica]
          Length = 423

 Score =  481 bits (1239), Expect = e-133
 Identities = 243/398 (61%), Positives = 302/398 (75%)
 Frame = +2

Query: 161  ALTRKVVCCN*SQRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEG 340
            AL RK++      R W  KQ       G Y DCV LI+ LSR+K+P  A+ LVL+MKS+G
Sbjct: 34   ALARKII------RKW--KQEECFDGKGIYVDCVPLIRSLSRQKMPHVAQELVLEMKSDG 85

Query: 341  FVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAV 520
             +P NSTLS LMLC+A+NGLF +A   WDE+++SSF+P + V+ EL D  G  G  +   
Sbjct: 86   LLPSNSTLSALMLCHANNGLFPQAEAIWDEMLHSSFVPSIQVVSELFDAYGNVGCFEKVN 145

Query: 521  RILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYY 700
             IL QI+ ++  L  +VY+  IS FGK GQLELME  LKEM+S GFP+DS TGNA++ YY
Sbjct: 146  EILAQIRSRNLSLFPEVYSLAISCFGKGGQLELMEGTLKEMISRGFPLDSATGNAFIRYY 205

Query: 701  SNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGN 880
            S FG+L+EME AYGRLK SR LIEEE IR+MS AYLK++KFY L + +++VGL RRN+GN
Sbjct: 206  SIFGSLTEMETAYGRLKRSRFLIEEEGIRAMSFAYLKKRKFYRLAELLKNVGLGRRNLGN 265

Query: 881  LLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEH 1060
            L WN+L LSYAA+FKMKSLQREF+RM+E+GF PDL TFNIRALAFS+MSL WDLH++LEH
Sbjct: 266  LSWNLLLLSYAADFKMKSLQREFLRMVEAGFHPDLTTFNIRALAFSRMSLLWDLHLSLEH 325

Query: 1061 MKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFH 1240
            MKHEKV PDLVT G VVDAYL++ LG+N+ FAL K+N +D   I T+P VFE +GKGDFH
Sbjct: 326  MKHEKVFPDLVTCGCVVDAYLERRLGKNMYFALNKMNLDDSPLILTDPFVFEVLGKGDFH 385

Query: 1241 LSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
             SSEA LEF  ++ WTY  LI++YLKK +RRNQIFWNY
Sbjct: 386  ASSEAFLEFQSQREWTYRRLISVYLKKQYRRNQIFWNY 423


>ref|XP_002331436.1| predicted protein [Populus trichocarpa]
            gi|566215849|ref|XP_006372219.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
            gi|550318750|gb|ERP50016.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
          Length = 428

 Score =  463 bits (1191), Expect = e-127
 Identities = 233/399 (58%), Positives = 297/399 (74%), Gaps = 1/399 (0%)
 Frame = +2

Query: 161  ALTRKVVCCN*SQRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEG 340
            AL +K++      R W + QG  ++     ADC SLIQ L + + P  AE L+L++K EG
Sbjct: 38   ALAQKMI------RQWKRDQG--VFGKETCADCASLIQTLCKHRRPHLAEELLLELKCEG 89

Query: 341  FVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAV 520
            F+PDN TLS +MLCYA +GL  +A   W+E++ SSF+P V VI +LIDI   +G  D  +
Sbjct: 90   FLPDNRTLSAMMLCYADSGLLPQAQAIWEEMLYSSFVPSVQVISDLIDIYAKSGLFDEVI 149

Query: 521  RILHQIQ-LKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIY 697
            +IL Q+  L+  D L  VY+  IS FGK GQLELME  LK+MVS GF VDS TGNA+V+Y
Sbjct: 150  KILDQLSSLRTFDFLPQVYSLAISCFGKGGQLELMEDTLKKMVSKGFWVDSATGNAFVVY 209

Query: 698  YSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVG 877
            YS  G+L+EME AY RLK SR+LIE E IR+MS AY+KE+KFY L +F+RDVGL R+N+G
Sbjct: 210  YSLHGSLAEMEAAYDRLKRSRLLIEREGIRAMSFAYIKERKFYGLSEFLRDVGLGRKNLG 269

Query: 878  NLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLE 1057
            NL+WN+L LSY+ANFKMK+LQREF+ M+E+GF PDL TFNIRALAFS+MSL WDLH+ LE
Sbjct: 270  NLIWNLLLLSYSANFKMKTLQREFLNMLEAGFHPDLTTFNIRALAFSRMSLLWDLHLGLE 329

Query: 1058 HMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDF 1237
            HMKH+KV PDLVTYG +VDAYLD+ L RNL+FAL K++ ++   ++T+P VFE  GKGDF
Sbjct: 330  HMKHDKVAPDLVTYGCIVDAYLDRRLVRNLEFALSKMHVDNSPVLSTDPFVFEVFGKGDF 389

Query: 1238 HLSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
            H SSEA +EF +++ WTY ELI IYL+K  R   IFWNY
Sbjct: 390  HSSSEAFMEFKRQRKWTYRELIKIYLRKQHRSKHIFWNY 428


>ref|XP_002511816.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548996|gb|EEF50485.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 427

 Score =  451 bits (1160), Expect = e-124
 Identities = 212/367 (57%), Positives = 280/367 (76%)
 Frame = +2

Query: 254  DCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEI 433
            DC SL+Q L  K+ P  A+ ++L+MKS+G+V +N TLS ++LCYA NGL  +A   W  +
Sbjct: 61   DCASLVQNLHSKRTPHLAQEILLEMKSQGYVLNNPTLSAILLCYADNGLLPQAQAIWKHM 120

Query: 434  INSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQL 613
            +N SF P + ++  LID     G+ +  + IL Q+   +  LL + Y+  IS FGK GQL
Sbjct: 121  LNGSFTPSIQIVSRLIDAYSKKGHFNEVMNILDQLSYSNFSLLHEAYSLAISCFGKGGQL 180

Query: 614  ELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSM 793
            +LME  LK+MV  GFPVD  TGNA++ YYS  G+L++ME AY RLK SR L++ E IR++
Sbjct: 181  QLMENALKDMVLRGFPVDYATGNAFIRYYSIHGSLTDMESAYSRLKRSRHLVDREGIRAV 240

Query: 794  SSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGF 973
            S AY+KE+KFY LG+F+RDVGL R++VGNL+WN L LS+AANFKMKSLQREF+RM+E+GF
Sbjct: 241  SLAYVKERKFYRLGEFLRDVGLGRKDVGNLIWNFLLLSFAANFKMKSLQREFLRMLEAGF 300

Query: 974  FPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDF 1153
             PD+ TFNIRALAFS+MSL WDLH+TLEHMKHEKV PD+VTYG +VDAYLD+ LG+NLDF
Sbjct: 301  HPDVTTFNIRALAFSRMSLLWDLHLTLEHMKHEKVSPDIVTYGCIVDAYLDRRLGKNLDF 360

Query: 1154 ALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRR 1333
            A++K+N +    + T+P VFE +GKGDFH S+EA LEF +++ WTY EL++IYL+K +R 
Sbjct: 361  AIKKMNLDGSPVLLTDPFVFEVLGKGDFHSSAEAFLEFKRQRKWTYRELVSIYLRKQYRS 420

Query: 1334 NQIFWNY 1354
            NQIFWNY
Sbjct: 421  NQIFWNY 427


>gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 429

 Score =  446 bits (1148), Expect = e-122
 Identities = 236/405 (58%), Positives = 299/405 (73%), Gaps = 4/405 (0%)
 Frame = +2

Query: 152  HNIALTRKVVCCN*SQRPWHKKQGGNIYPPG--NYADCVSLIQGLSRKKLPVAAERLVLD 325
            +N+ L R+ +      R W  K+ G+I   G  N+ D  SL+Q L+ KK+P     +V  
Sbjct: 34   NNLPLARRQII-----RLW--KRDGSILGVGRDNFVDFDSLLQTLASKKMP--QPHVVHH 84

Query: 326  MKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEIINS-SFLPDVCVIVELIDICGCNG 502
            +  +G +P+NSTLS +ML YA NGLF +A   W+E++N+ SF P + V+ + +D  G  G
Sbjct: 85   LLLQGLIPNNSTLSEIMLWYADNGLFPQAQAIWEEMLNTTSFTPSIQVVSKFMDAYGKMG 144

Query: 503  YLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGN 682
            +     +IL ++ L   +LL +VY   IS FGK G+L+LME  LKEMVS G PVDS TGN
Sbjct: 145  HFHKVHKILDRVILLRVNLLPEVYPVAISCFGKHGRLDLMENTLKEMVSRGLPVDSATGN 204

Query: 683  AYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLC 862
            A+V YYS FG+LSEME+AY RLK SR LIEEE IR+MSSAY+KE KFY LG+F+ D+GL 
Sbjct: 205  AFVRYYSIFGSLSEMEIAYARLKRSRHLIEEEGIRAMSSAYIKEGKFYRLGEFLNDLGLG 264

Query: 863  RRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDL 1042
            RRN+GNLLWN+L LSYAANFKMK++QR F++M++SGF PDL TFNIRA AFS+MS+FWDL
Sbjct: 265  RRNLGNLLWNLLLLSYAANFKMKTMQRLFLKMMDSGFRPDLTTFNIRAWAFSRMSMFWDL 324

Query: 1043 HVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAM 1222
            H++LEHMKHE VV DLVTYG VVDAYLD+ L RNLDFAL  +N +D   + T+PLVFEA+
Sbjct: 325  HLSLEHMKHESVVSDLVTYGCVVDAYLDRRLARNLDFALNHMNADDSPLVLTDPLVFEAL 384

Query: 1223 GKGDFHLSSEARLEFSK-KKNWTYEELIAIYLKKYFRRNQIFWNY 1354
            GKGDFH S+EA LEF + KK WTY +LIA+YLKK  RRNQIFWNY
Sbjct: 385  GKGDFHSSAEAFLEFKRQKKKWTYRQLIAVYLKKQLRRNQIFWNY 429


>ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Glycine max]
          Length = 415

 Score =  442 bits (1136), Expect = e-121
 Identities = 214/383 (55%), Positives = 282/383 (73%)
 Frame = +2

Query: 206  WHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCY 385
            W + + G I    N  DC SL Q  SRK++   ++  + D+K EG++P  ++L V ML Y
Sbjct: 33   WWQNEKGVIGGKDNSVDCSSLAQNSSRKRMIHQSDGSLHDIKVEGYMPKQTSLCVSMLYY 92

Query: 386  ASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLR 565
              NG F +A T W++++NSSF+P V  I  L D    +   DV + IL  + +++  +L 
Sbjct: 93   TENGFFPQAQTLWEQLVNSSFVPSVQFISRLFDAYAKHRKFDVVIDILRYVDMRNFSILP 152

Query: 566  DVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGR 745
            DVY   IS FG++GQLELME M  EM S G  + S T NA+++YYS FGTL EME  YGR
Sbjct: 153  DVYWLAISCFGREGQLELMEDMANEMASSGVHIYSRTANAFLLYYSLFGTLEEMENTYGR 212

Query: 746  LKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFK 925
            LK SR LIE+E IR+++SAY+KE+KFY LG+F+RDVGL R+NVGNLLWN++ LSYAANFK
Sbjct: 213  LKKSRFLIEKEVIRAVASAYIKERKFYELGEFLRDVGLRRKNVGNLLWNLMLLSYAANFK 272

Query: 926  MKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGS 1105
            MKSLQREF+ M+ESGF PD+ TFNIRALAFS+M+LFWDLH+++EHM+H K++PDLVT+G 
Sbjct: 273  MKSLQREFIGMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHTKIIPDLVTFGC 332

Query: 1106 VVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNW 1285
            VVDAYLD+ LGRNLDFAL K+N +D   + T+P V+EA+GKG F +SSEA  E+  ++ W
Sbjct: 333  VVDAYLDRRLGRNLDFALNKMNLDDSPRLLTDPFVYEALGKGGFQMSSEAFFEYKTQRKW 392

Query: 1286 TYEELIAIYLKKYFRRNQIFWNY 1354
            TY  LI  YLKK++R+NQIFWNY
Sbjct: 393  TYRSLIQKYLKKHYRKNQIFWNY 415


>ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Cucumis sativus] gi|449507537|ref|XP_004163059.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g42630-like [Cucumis sativus]
          Length = 388

 Score =  439 bits (1130), Expect = e-120
 Identities = 212/363 (58%), Positives = 282/363 (77%)
 Frame = +2

Query: 266  LIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSS 445
            +I+ LSR+++P+ A+ + L++KSEGF  +NSTLS +M+ Y  +G   +A   W+E++NS 
Sbjct: 26   VIKKLSRRRMPILAKEIFLELKSEGFPLNNSTLSTIMVHYIDDGSPLQAQAMWEEMLNSC 85

Query: 446  FLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQLELME 625
            F P V VI +L +  G  G+ D   ++L Q++L+ S LL + Y+  IS FGK  QLELME
Sbjct: 86   FEPSVQVISKLFNAYGKMGHFDYITKVLDQVKLRYSHLLPEAYSLAISCFGKHKQLELME 145

Query: 626  IMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAY 805
              L+EMVS GF V+S TGN+++IYYS FG+L EME AYGRLK SR LIE++ I +M+ AY
Sbjct: 146  STLREMVSSGFTVNSATGNSFIIYYSMFGSLVEMETAYGRLKRSRFLIEKKGIMAMAFAY 205

Query: 806  LKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDL 985
            ++++KFY LG+F+RDVGL R+NVGNLLWN+L LSYAANFKMKSLQREF++M+++GF PDL
Sbjct: 206  IRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLQMVDAGFNPDL 265

Query: 986  NTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRK 1165
             TFNIRALAFS+M L WDLH++LEHMKH  + PDLVTYG VVDAY+D+ LGRNL+F L K
Sbjct: 266  TTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYGCVVDAYVDRRLGRNLEFILSK 325

Query: 1166 LNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIF 1345
            +N +      T+  VFEA+GKGDFH+SSEA ++F K+K WTY ELI++YLKK+ RRNQ+F
Sbjct: 326  MNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKKWTYRELISLYLKKHHRRNQVF 385

Query: 1346 WNY 1354
            WNY
Sbjct: 386  WNY 388


>gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris]
            gi|561005327|gb|ESW04321.1| hypothetical protein
            PHAVU_011G085400g [Phaseolus vulgaris]
          Length = 411

 Score =  437 bits (1125), Expect = e-120
 Identities = 212/386 (54%), Positives = 284/386 (73%)
 Frame = +2

Query: 197  QRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLM 376
            Q  W + + G      +  D  SL+Q  SRK++   ++ +  D K +G++P  ++L VLM
Sbjct: 26   QMIWWRNEKGAFGGMHSSVDSSSLVQNNSRKRMFPQSDGVFHDTKDDGYMPKQTSLCVLM 85

Query: 377  LCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSD 556
            L Y  NGLF +A T W++++ SSF+P V  I  L D    +G  D  V IL  + +++  
Sbjct: 86   LYYTENGLFPQAQTTWEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFS 145

Query: 557  LLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVA 736
            +L +VY+  IS FG++GQLELME M KEM S G  + S T NA+V+YYS FG+L +ME A
Sbjct: 146  ILPNVYSLAISCFGREGQLELMEDMAKEMASRGVHISSKTANAFVLYYSIFGSLKDMENA 205

Query: 737  YGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAA 916
            YGRLK SR LIE E IR+M+SAY +E++FY LG+F+RDVGL R++VGNLLWN++ LSYAA
Sbjct: 206  YGRLKKSRFLIEREVIRAMASAYTRERQFYELGEFLRDVGLVRKDVGNLLWNLMLLSYAA 265

Query: 917  NFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVT 1096
            NFKMKSLQ+EF++M+ESGF PD+ TFNIRALAFS+M+LFWDLH+++EHM+HE V+PDLVT
Sbjct: 266  NFKMKSLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVT 325

Query: 1097 YGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKK 1276
            +G VVDAYLD+ LG+NL+FAL K+N +D   + T+P V+EA+GKGDF +SSEA  EF   
Sbjct: 326  FGCVVDAYLDRGLGKNLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTH 385

Query: 1277 KNWTYEELIAIYLKKYFRRNQIFWNY 1354
            + WTY  LI  YLKK++RRNQIFWNY
Sbjct: 386  RKWTYRALIQKYLKKHYRRNQIFWNY 411


>emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]
          Length = 446

 Score =  437 bits (1125), Expect = e-120
 Identities = 228/383 (59%), Positives = 271/383 (70%)
 Frame = +2

Query: 206  WHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCY 385
            WH KQ  ++    NY D   LIQ LSRK+LP  A+ L+ +MKSE                
Sbjct: 103  WHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKSE---------------- 146

Query: 386  ASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLR 565
              NGLF KA   WDEIINSSF P++ ++ +LID  G  G+     RILHQ          
Sbjct: 147  -DNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEVTRILHQ---------- 195

Query: 566  DVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGR 745
                         GQLE+ME  LKEMVS GFPVDS TGNA++ YYS FG+L+EME AY R
Sbjct: 196  ------------GGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSIFGSLTEMEAAYDR 243

Query: 746  LKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFK 925
            LK SRILIEEE IR+MS AY+KE+K+Y LGQF+RDVGL R+NVGNLLWN+L LSYAANFK
Sbjct: 244  LKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLLWNLLLLSYAANFK 303

Query: 926  MKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGS 1105
            MKSLQREF+ M+E+GF PDL TFNIRALAFS+MSLFWDLH++LEHM+H KVV DLVTYG 
Sbjct: 304  MKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQHVKVVADLVTYGC 363

Query: 1106 VVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNW 1285
            VVDAYLD+ LG+NLDFAL+K+N +D   ++T+  VFE +GKGDFH SSEA LE  +   W
Sbjct: 364  VVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSSSEAFLESKRNGKW 423

Query: 1286 TYEELIAIYLKKYFRRNQIFWNY 1354
            TY +LIA YLKK +R NQIFWNY
Sbjct: 424  TYRKLIATYLKKKYRSNQIFWNY 446


>gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007757|gb|ESW06706.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 423

 Score =  432 bits (1111), Expect = e-118
 Identities = 209/367 (56%), Positives = 276/367 (75%)
 Frame = +2

Query: 254  DCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEI 433
            D  SL+Q  SRK++   ++ +  D K +G++P  ++L VLML Y  NGLF  A T W+++
Sbjct: 57   DSSSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQL 116

Query: 434  INSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQL 613
            + SSF+P V  I  L D    +G  D  V IL  + +++  +L +VY+  I  FG++GQL
Sbjct: 117  LYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQL 176

Query: 614  ELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSM 793
            ELME M KEM S G  V S TGNA+V+YYS FG+L +ME AYGRLK SR LIE E IR+M
Sbjct: 177  ELMEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAM 236

Query: 794  SSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGF 973
            +SAY +E++FY LG+F+RDVGL R+++GNLLWN++ LSYA NFKMKSLQ+EF++M+ESGF
Sbjct: 237  ASAYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGF 296

Query: 974  FPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDF 1153
             PD+ TFNIRALAFS+M+LFWDLH+++EHM+HE V+PDLVT+G VVDAYLD+ LGRNL+F
Sbjct: 297  RPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNF 356

Query: 1154 ALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRR 1333
            AL K+N +D   + T+P V+EA+GKGDF +SSEA  EF   + WTY  LI  YLKK++RR
Sbjct: 357  ALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRR 416

Query: 1334 NQIFWNY 1354
            NQIFWNY
Sbjct: 417  NQIFWNY 423


>gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007756|gb|ESW06705.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 372

 Score =  432 bits (1111), Expect = e-118
 Identities = 209/367 (56%), Positives = 276/367 (75%)
 Frame = +2

Query: 254  DCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEI 433
            D  SL+Q  SRK++   ++ +  D K +G++P  ++L VLML Y  NGLF  A T W+++
Sbjct: 6    DSSSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQL 65

Query: 434  INSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQL 613
            + SSF+P V  I  L D    +G  D  V IL  + +++  +L +VY+  I  FG++GQL
Sbjct: 66   LYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQL 125

Query: 614  ELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSM 793
            ELME M KEM S G  V S TGNA+V+YYS FG+L +ME AYGRLK SR LIE E IR+M
Sbjct: 126  ELMEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAM 185

Query: 794  SSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGF 973
            +SAY +E++FY LG+F+RDVGL R+++GNLLWN++ LSYA NFKMKSLQ+EF++M+ESGF
Sbjct: 186  ASAYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGF 245

Query: 974  FPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDF 1153
             PD+ TFNIRALAFS+M+LFWDLH+++EHM+HE V+PDLVT+G VVDAYLD+ LGRNL+F
Sbjct: 246  RPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNF 305

Query: 1154 ALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRR 1333
            AL K+N +D   + T+P V+EA+GKGDF +SSEA  EF   + WTY  LI  YLKK++RR
Sbjct: 306  ALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRR 365

Query: 1334 NQIFWNY 1354
            NQIFWNY
Sbjct: 366  NQIFWNY 372


>ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa]
            gi|550318749|gb|ERP50015.1| hypothetical protein
            POPTR_0018s14360g [Populus trichocarpa]
          Length = 392

 Score =  427 bits (1099), Expect = e-117
 Identities = 219/398 (55%), Positives = 277/398 (69%)
 Frame = +2

Query: 161  ALTRKVVCCN*SQRPWHKKQGGNIYPPGNYADCVSLIQGLSRKKLPVAAERLVLDMKSEG 340
            AL +K++      R W + QG  ++     ADC SLIQ L + + P  AE L+L++K EG
Sbjct: 38   ALAQKMI------RQWKRDQG--VFGKETCADCASLIQTLCKHRRPHLAEELLLELKCEG 89

Query: 341  FVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIVELIDICGCNGYLDVAV 520
            F+PDN TLS +MLCYA +GL  +A   W+E++ SSF+P V                    
Sbjct: 90   FLPDNRTLSAMMLCYADSGLLPQAQAIWEEMLYSSFVPSV-------------------- 129

Query: 521  RILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSMGFPVDSPTGNAYVIYY 700
                            VY+  IS FGK GQLELME  LK+MVS GF VDS TGNA+V+YY
Sbjct: 130  ---------------QVYSLAISCFGKGGQLELMEDTLKKMVSKGFWVDSATGNAFVVYY 174

Query: 701  SNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSLGQFVRDVGLCRRNVGN 880
            S  G+L+EME AY RLK SR+LIE E IR+MS AY+KE+KFY L +F+RDVGL R+N+GN
Sbjct: 175  SLHGSLAEMEAAYDRLKRSRLLIEREGIRAMSFAYIKERKFYGLSEFLRDVGLGRKNLGN 234

Query: 881  LLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEH 1060
            L+WN+L LSY+ANFKMK+LQREF+ M+E+GF PDL TFNIRALAFS+MSL WDLH+ LEH
Sbjct: 235  LIWNLLLLSYSANFKMKTLQREFLNMLEAGFHPDLTTFNIRALAFSRMSLLWDLHLGLEH 294

Query: 1061 MKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTIATEPLVFEAMGKGDFH 1240
            MKH+KV PDLVTYG +VDAYLD+ L RNL+FAL K++ ++   ++T+P VFE  GKGDFH
Sbjct: 295  MKHDKVAPDLVTYGCIVDAYLDRRLVRNLEFALSKMHVDNSPVLSTDPFVFEVFGKGDFH 354

Query: 1241 LSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
             SSEA +EF +++ WTY ELI IYL+K  R   IFWNY
Sbjct: 355  SSSEAFMEFKRQRKWTYRELIKIYLRKQHRSKHIFWNY 392


>gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis arenosa]
          Length = 419

 Score =  423 bits (1087), Expect = e-115
 Identities = 209/367 (56%), Positives = 268/367 (73%)
 Frame = +2

Query: 254  DCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEI 433
            D   L+Q LS+++LP  A  + +  KS   +P+  TL  LMLC+A NG   +A T WDEI
Sbjct: 53   DYAPLVQTLSQRRLPDVAHEIFIQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEI 112

Query: 434  INSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQL 613
            +NSSF+PDV V+ +LI      G+ D   +I   +  + S LL  VY+  IS FGK GQL
Sbjct: 113  LNSSFVPDVFVVSKLISAYEQLGFFDEVAKITKDVAARHSTLLPVVYSLAISCFGKNGQL 172

Query: 614  ELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSM 793
            ELME +++EM S G  +DS T NA V Y+S FGTL ++E AYGRLK   I+IEEE IR++
Sbjct: 173  ELMEGVIEEMDSKGMSLDSATANAIVRYFSFFGTLDKIEHAYGRLKKFGIVIEEEEIRAV 232

Query: 794  SSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGF 973
              AYLK++KFY L +F+ DVGL RRN+GN+LWN + LSYAA FKMKSLQREF+ M+++GF
Sbjct: 233  LLAYLKQRKFYRLREFLSDVGLGRRNLGNMLWNSVLLSYAAEFKMKSLQREFIEMLDAGF 292

Query: 974  FPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDF 1153
             PDL TFNIRALAFS+M+LFWDLH+TLEHM+   +VPDLVT+G VVDAY+DK L RNL+F
Sbjct: 293  SPDLTTFNIRALAFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEF 352

Query: 1154 ALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRR 1333
               ++N +D   + T+PL FE +GKGDFHLSSEA LEFS +KNWTY +LI +Y+KK  RR
Sbjct: 353  VYNQMNLDDSPVVLTDPLAFEVLGKGDFHLSSEAVLEFSTEKNWTYRKLIGVYVKKKLRR 412

Query: 1334 NQIFWNY 1354
            +QIFWNY
Sbjct: 413  DQIFWNY 419


>ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda]
            gi|548854641|gb|ERN12551.1| hypothetical protein
            AMTR_s00025p00206120 [Amborella trichopoda]
          Length = 354

 Score =  421 bits (1082), Expect = e-115
 Identities = 204/354 (57%), Positives = 265/354 (74%)
 Frame = +2

Query: 293  LPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEIINSSFLPDVCVIV 472
            +P   +RL  +++S+ F    +TLS LM+C A NGLF  +   W EIINSSF  D+ V+ 
Sbjct: 1    MPHVVQRLFTEIESQNFRTGCTTLSALMICCAENGLFSLSNAIWTEIINSSFELDIGVVS 60

Query: 473  ELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQLELMEIMLKEMVSM 652
            EL+   G     D   R+L++   ++ +L  ++Y   IS FGK  QLELME  +KEMVS 
Sbjct: 61   ELMHAYGKANLYDEVYRMLNEAISREFNLCPEIYTVAISCFGKGAQLELMEATIKEMVSR 120

Query: 653  GFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSMSSAYLKEQKFYSL 832
            GF VDS TGNA++IYYS+FG+L+EME+AYGRLK SRILIE EAIR+M+SAY++E+KF+ +
Sbjct: 121  GFKVDSNTGNAFIIYYSSFGSLAEMEIAYGRLKCSRILIEREAIRAMASAYIRERKFFKM 180

Query: 833  GQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGFFPDLNTFNIRALA 1012
            G+F+RDVGL RRN GNLLWN+L LSYAANFKMKSLQR F+ M+E+GF PD+ TFNIR LA
Sbjct: 181  GEFLRDVGLGRRNSGNLLWNLLLLSYAANFKMKSLQRTFLGMLEAGFSPDITTFNIRTLA 240

Query: 1013 FSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDFALRKLNKNDCVTI 1192
            FS+M +FWDLH+++EHM+H  V+PDLVTYG +VDAY+++  GRNL F L+ +N +    I
Sbjct: 241  FSRMCMFWDLHLSIEHMRHMNVIPDLVTYGCIVDAYVERRFGRNLGFGLKCMNLDSSPLI 300

Query: 1193 ATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRRNQIFWNY 1354
             T+P+V+E  GKGDFH SSEA LE   KK WTY +L+A YLKK +R NQIFWNY
Sbjct: 301  LTDPIVYEVFGKGDFHSSSEALLELKWKKEWTYSKLVAFYLKKRYRSNQIFWNY 354


>ref|XP_006307633.1| hypothetical protein CARUB_v10009261mg [Capsella rubella]
            gi|482576344|gb|EOA40531.1| hypothetical protein
            CARUB_v10009261mg [Capsella rubella]
          Length = 419

 Score =  414 bits (1063), Expect = e-112
 Identities = 208/367 (56%), Positives = 268/367 (73%)
 Frame = +2

Query: 254  DCVSLIQGLSRKKLPVAAERLVLDMKSEGFVPDNSTLSVLMLCYASNGLFCKALTAWDEI 433
            D  SL+Q LS+K+LP  A  + +  KS+  +P+  TL  LMLC+A NG   +A T WDEI
Sbjct: 53   DYASLVQTLSQKRLPKVAYEIFIQTKSDNLLPNYRTLCALMLCFAENGFVLRARTIWDEI 112

Query: 434  INSSFLPDVCVIVELIDICGCNGYLDVAVRILHQIQLKDSDLLRDVYARVISRFGKKGQL 613
            +NSSF+ D+ VI +L+      G  D    I   + ++ S LLR V +  IS FGK GQL
Sbjct: 113  LNSSFVLDLVVISKLMSAYEKIGCFDEIFVITKDVSVRHSKLLRVVCSLAISCFGKNGQL 172

Query: 614  ELMEIMLKEMVSMGFPVDSPTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIRSM 793
            +LME +++E+ S G  +DS T NA V YYS FGTL ++E AYGRL    I+IEEE IR++
Sbjct: 173  QLMEGVIEEVDSKGMSLDSGTSNAIVRYYSVFGTLKKIEQAYGRLWKFGIVIEEEEIRAV 232

Query: 794  SSAYLKEQKFYSLGQFVRDVGLCRRNVGNLLWNMLFLSYAANFKMKSLQREFVRMIESGF 973
              AYLKE+KFY L +F+ DVGL RRN+GNLLWN + LSYAA+FKMKSLQREFV M+++GF
Sbjct: 233  LLAYLKERKFYRLREFLSDVGLGRRNLGNLLWNSVLLSYAADFKMKSLQREFVGMLDAGF 292

Query: 974  FPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDKNLGRNLDF 1153
             PDL TFNIRALAFS+M+LFWDLH+TLEHM+H  +VPDLVT+G VVDAY+D+ L RNL+F
Sbjct: 293  SPDLTTFNIRALAFSRMALFWDLHLTLEHMRHLNIVPDLVTFGCVVDAYMDRRLARNLEF 352

Query: 1154 ALRKLNKNDCVTIATEPLVFEAMGKGDFHLSSEARLEFSKKKNWTYEELIAIYLKKYFRR 1333
               ++N +D   + T+PL +E +GKGDFHLSSEA LEFS +KNWTY +L+ +Y KK  RR
Sbjct: 353  FYNQMNFDDSPIVLTDPLAYEVLGKGDFHLSSEAVLEFSPRKNWTYRKLLGVYFKKKLRR 412

Query: 1334 NQIFWNY 1354
            +QIFWNY
Sbjct: 413  DQIFWNY 419

