BLASTX nr result

ID: Rehmannia22_contig00000241 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00000241
         (2710 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containi...   442   e-121
ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containi...   434   e-118
ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containi...   433   e-118
ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-116
gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus pe...   409   e-111
ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-110
ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citr...   392   e-106
ref|XP_002331436.1| predicted protein [Populus trichocarpa] gi|5...   392   e-106
ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containi...   388   e-105
ref|XP_002511816.1| pentatricopeptide repeat-containing protein,...   385   e-104
gb|EPS70737.1| hypothetical protein M569_04022 [Genlisea aurea]       380   e-102
gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, put...   377   e-101
gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus...   369   5e-99
ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containi...   368   9e-99
gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus...   365   4e-98
gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus...   365   4e-98
gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis are...   362   5e-97
ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [A...   361   1e-96
emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]   360   2e-96
ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Popu...   358   5e-96

>ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Solanum lycopersicum]
          Length = 381

 Score =  442 bits (1136), Expect = e-121
 Identities = 228/382 (59%), Positives = 278/382 (72%)
 Frame = +1

Query: 835  GKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYA 1014
            G  D  ++ RD     SLI G S  R++LP+   + V E KSEG  PD S LS L+L YA
Sbjct: 5    GNIDPRINYRDCA---SLIQGLS--RKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYA 59

Query: 1015 SNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHD 1194
            +NGLF KA   WDE++NSSF+PD  V++ELI IYG     D+  RIL Q+QLKDS +L D
Sbjct: 60   TNGLFCKALAAWDEIMNSSFLPDVHVIAELIDIYGCKGYLDVAVRILHQIQLKDSNLLRD 119

Query: 1195 IFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRL 1374
            +++ AIS FGK+GQL+ ME+M+++MVSMG+ VDS+TGNAYV+Y S FG+L+EME AYGRL
Sbjct: 120  VYAQAISRFGKKGQLELMEVMLEEMVSMGFPVDSTTGNAYVIYYSNFGTLSEMEVAYGRL 179

Query: 1375 KXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKM 1554
            K                AY+K+ +FY+LG+FV D                  SYAANFKM
Sbjct: 180  KMSRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKM 239

Query: 1555 KSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCV 1734
            KSLQREFVRMVE+GF PDLNTFNIRALAFSKMSL WDLH++LEHMKHE VVPDLVTYG V
Sbjct: 240  KSLQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSV 299

Query: 1735 VDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWT 1914
            VDAYLDR LGRNL FAL+KL+ +D V++ T+PLVFE MGKGDFH +SE  +E+ KK NWT
Sbjct: 300  VDAYLDRGLGRNLDFALRKLNTNDCVTVATEPLVFEAMGKGDFHLSSEARLEFSKKTNWT 359

Query: 1915 YKMLISIYLKKKFRSNQIFWNY 1980
            Y++LI+ YLKK FR NQIFWNY
Sbjct: 360  YEVLITTYLKKYFRRNQIFWNY 381


>ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X3 [Solanum tuberosum]
          Length = 409

 Score =  434 bits (1115), Expect = e-118
 Identities = 227/386 (58%), Positives = 275/386 (71%)
 Frame = +1

Query: 823  QSLQGKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLL 1002
            QS   K  +   R +  +  SLI G S  R++LP+   + V E KSEG  PD S LS L+
Sbjct: 26   QSSAQKGGNIDPRGNYADCASLIQGLS--RKKLPVAAERLVLEMKSEGFVPDSSTLSALM 83

Query: 1003 LHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSE 1182
            L YASNGLF KA   WDE++NSSF+PD  V++ELI IY      D+  RIL Q+QLKDS 
Sbjct: 84   LCYASNGLFYKALAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSN 143

Query: 1183 MLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAA 1362
            +L D+++ AIS FGK+GQL+ ME+M+K+MVSMG+ VDS+TGNAYV+Y S FG L+EME A
Sbjct: 144  LLRDVYAQAISRFGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVA 203

Query: 1363 YGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAA 1542
            YGRLK                AY+K+ +FY+LG+FV D                  SYAA
Sbjct: 204  YGRLKMSRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAA 263

Query: 1543 NFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVT 1722
            NFKMKSLQREFVRMVE+GF PDLNTFNIRALAFSKMSL WDLH++LEHMKHE VVPDLVT
Sbjct: 264  NFKMKSLQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVT 323

Query: 1723 YGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKK 1902
            YG VVDAYLDR LGRNL FAL+KL+ +D V + T+PLVFE +GKGDFH +S+  +E+ K 
Sbjct: 324  YGSVVDAYLDRGLGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKN 383

Query: 1903 KNWTYKMLISIYLKKKFRSNQIFWNY 1980
            KNWTY+ LI+ YLKK FR NQIFWNY
Sbjct: 384  KNWTYEELITTYLKKYFRRNQIFWNY 409


>ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X1 [Solanum tuberosum]
            gi|565389826|ref|XP_006360649.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Solanum tuberosum]
          Length = 416

 Score =  433 bits (1114), Expect = e-118
 Identities = 224/374 (59%), Positives = 271/374 (72%)
 Frame = +1

Query: 859  RRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFDKA 1038
            R +  +  SLI G S  R++LP+   + V E KSEG  PD S LS L+L YASNGLF KA
Sbjct: 45   RGNYADCASLIQGLS--RKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYASNGLFYKA 102

Query: 1039 HGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISC 1218
               WDE++NSSF+PD  V++ELI IY      D+  RIL Q+QLKDS +L D+++ AIS 
Sbjct: 103  LAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSNLLRDVYAQAISR 162

Query: 1219 FGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXX 1398
            FGK+GQL+ ME+M+K+MVSMG+ VDS+TGNAYV+Y S FG L+EME AYGRLK       
Sbjct: 163  FGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKMSRILIE 222

Query: 1399 XXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFV 1578
                     AY+K+ +FY+LG+FV D                  SYAANFKMKSLQREFV
Sbjct: 223  EEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFV 282

Query: 1579 RMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRR 1758
            RMVE+GF PDLNTFNIRALAFSKMSL WDLH++LEHMKHE VVPDLVTYG VVDAYLDR 
Sbjct: 283  RMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRG 342

Query: 1759 LGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIY 1938
            LGRNL FAL+KL+ +D V + T+PLVFE +GKGDFH +S+  +E+ K KNWTY+ LI+ Y
Sbjct: 343  LGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYEELITTY 402

Query: 1939 LKKKFRSNQIFWNY 1980
            LKK FR NQIFWNY
Sbjct: 403  LKKYFRRNQIFWNY 416


>ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630
            [Vitis vinifera] gi|297736023|emb|CBI24061.3| unnamed
            protein product [Vitis vinifera]
          Length = 423

 Score =  427 bits (1098), Expect = e-116
 Identities = 225/396 (56%), Positives = 276/396 (69%)
 Frame = +1

Query: 793  WKENLW*CLEQSLQGKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLF 972
            WK+      E+S+ GK D+YVD         LI   S  R+RLP    + ++E KSEG  
Sbjct: 43   WKQ------ERSVDGK-DNYVDYTP------LIQALS--RKRLPHVAQELLFEMKSEGFL 87

Query: 973  PDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRI 1152
            P++S LS L+L YA NGLF KA  +WDE++NSSF P+ ++VS+LI  YG M  F  VTRI
Sbjct: 88   PNNSTLSALMLCYADNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEVTRI 147

Query: 1153 LLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSI 1332
            L Q+  +D   +H+++SLAISCFGK GQL+ ME  +K+MVS G+ VDS+TGNA++ Y SI
Sbjct: 148  LHQVSSRDFNFMHEVYSLAISCFGKGGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSI 207

Query: 1333 FGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXX 1512
            FGSL EMEAAY RLK               FAYIKE ++Y LG+F+ D            
Sbjct: 208  FGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLL 267

Query: 1513 XXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMK 1692
                  SYAANFKMKSLQREF+ MVEAGF+PDL TFNIRALAFS+MSL WDLHLSLEHM+
Sbjct: 268  WNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQ 327

Query: 1693 HEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSN 1872
            H  VV DLVTYGCVVDAYLDRRLG+NL FAL+K++ DDS  + TD  VFEV+GKGDFHS+
Sbjct: 328  HVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSS 387

Query: 1873 SEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            SE  +E  +   WTY+ LI+ YLKKK+RSNQIFWNY
Sbjct: 388  SEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 423


>gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica]
          Length = 423

 Score =  409 bits (1052), Expect = e-111
 Identities = 203/357 (56%), Positives = 259/357 (72%)
 Frame = +1

Query: 910  RERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAK 1089
            R+++P    + V E KS+GL P +S LS L+L +A+NGLF +A  IWDEML+SSFVP  +
Sbjct: 67   RQKMPHVAQELVLEMKSDGLLPSNSTLSALMLCHANNGLFPQAEAIWDEMLHSSFVPSIQ 126

Query: 1090 VVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKM 1269
            VVSEL   YG++  F+ V  IL Q++ ++  +  +++SLAISCFGK GQL+ ME  +K+M
Sbjct: 127  VVSELFDAYGNVGCFEKVNEILAQIRSRNLSLFPEVYSLAISCFGKGGQLELMEGTLKEM 186

Query: 1270 VSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRF 1449
            +S G+ +DS+TGNA++ Y SIFGSL EME AYGRLK               FAY+K+ +F
Sbjct: 187  ISRGFPLDSATGNAFIRYYSIFGSLTEMETAYGRLKRSRFLIEEEGIRAMSFAYLKKRKF 246

Query: 1450 YALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIR 1629
            Y L   + +                  SYAA+FKMKSLQREF+RMVEAGF PDL TFNIR
Sbjct: 247  YRLAELLKNVGLGRRNLGNLSWNLLLLSYAADFKMKSLQREFLRMVEAGFHPDLTTFNIR 306

Query: 1630 ALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDS 1809
            ALAFS+MSLLWDLHLSLEHMKHE V PDLVT GCVVDAYL+RRLG+N+ FAL K++ DDS
Sbjct: 307  ALAFSRMSLLWDLHLSLEHMKHEKVFPDLVTCGCVVDAYLERRLGKNMYFALNKMNLDDS 366

Query: 1810 VSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
              ILTDP VFEV+GKGDFH++SE  +E+  ++ WTY+ LIS+YLKK++R NQIFWNY
Sbjct: 367  PLILTDPFVFEVLGKGDFHASSEAFLEFQSQREWTYRRLISVYLKKQYRRNQIFWNY 423


>ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Fragaria vesca subsp. vesca]
          Length = 424

 Score =  405 bits (1041), Expect = e-110
 Identities = 214/396 (54%), Positives = 270/396 (68%)
 Frame = +1

Query: 793  WKENLW*CLEQSLQGKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLF 972
            WK+      E+  +GK D YVD         LI   S  R+++P    + +   KSEGL 
Sbjct: 44   WKQ------EECSRGK-DCYVD------CVPLIQSLS--RQKMPHVAQEVLLVMKSEGLI 88

Query: 973  PDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRI 1152
            P +S LS ++L +A NGL  +A  IWDEMLNSSFVP  +VVSEL  +YG++  F  V  I
Sbjct: 89   PSNSTLSAVMLCHAKNGLLPQAEAIWDEMLNSSFVPGIQVVSELFDVYGNVGSFGKVNEI 148

Query: 1153 LLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSI 1332
            + Q++ ++  +L  ++SLAISCFGK GQL+ ME  +K+MVS G+ VDS+TGN ++ Y SI
Sbjct: 149  VGQIRSRNLSLLPQVYSLAISCFGKGGQLELMEDTLKEMVSRGFPVDSATGNVFIRYYSI 208

Query: 1333 FGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXX 1512
            FGSL EME AY RLK                AY+K+ +FY+L  F+              
Sbjct: 209  FGSLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFYSLAEFLKSVGLGRRNLGNLL 268

Query: 1513 XXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMK 1692
                  SYAANFKMK+LQREF+RMVEAGF PDL TFNIRALAFS+MSLLWDLHL+LEHMK
Sbjct: 269  WNLLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRALAFSRMSLLWDLHLTLEHMK 328

Query: 1693 HEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSN 1872
            H  VVPDLVT GC+VDAYLDRRLGRNL FAL K++ DDS  +LTDP VFEV+GKGDFH++
Sbjct: 329  HVKVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSPVVLTDPFVFEVLGKGDFHAS 388

Query: 1873 SEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            SE  +E+ K+K WTY+ LIS+YLKK++R +QIFWNY
Sbjct: 389  SEAFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424


>ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citrus clementina]
            gi|568840749|ref|XP_006474328.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X1 [Citrus sinensis]
            gi|568840751|ref|XP_006474329.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Citrus sinensis]
            gi|568840753|ref|XP_006474330.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X3 [Citrus sinensis]
            gi|568840755|ref|XP_006474331.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X4 [Citrus sinensis]
            gi|568840757|ref|XP_006474332.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X5 [Citrus sinensis]
            gi|568840759|ref|XP_006474333.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X6 [Citrus sinensis]
            gi|557556412|gb|ESR66426.1| hypothetical protein
            CICLE_v10010414mg [Citrus clementina]
          Length = 412

 Score =  392 bits (1008), Expect = e-106
 Identities = 193/357 (54%), Positives = 253/357 (70%)
 Frame = +1

Query: 910  RERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAK 1089
            R++ P    + V   KSEGL PD+S L  L+L YA+NG   +A  +W+E+L+SSFV   +
Sbjct: 56   RKKKPHLAHQLVNTVKSEGLLPDNSTLCALMLCYANNGFVLEAQVVWEELLSSSFVLSVQ 115

Query: 1090 VVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKM 1269
            V+S+L+  YG +  F+ +  I+ Q+  +++++L +++S AISCFGK+GQL+ ME  +K+M
Sbjct: 116  VLSDLMDAYGRIGCFNEIISIIDQVSCRNADLLPEVYSRAISCFGKQGQLELMENTLKEM 175

Query: 1270 VSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRF 1449
            VS G+SVDS+TGNA+++Y S FGSL EME AYGRLK               F Y+KE +F
Sbjct: 176  VSRGFSVDSATGNAFIIYYSRFGSLTEMETAYGRLKRSRHLIDKEGIRAVSFTYLKERKF 235

Query: 1450 YALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIR 1629
            + LG F+ D                  SYA NFKMKSLQREF+RM EAGF PDL TFNIR
Sbjct: 236  FMLGEFLRDVGLGRKDLGNLLWNLLLLSYAGNFKMKSLQREFMRMSEAGFHPDLTTFNIR 295

Query: 1630 ALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDS 1809
            A+AFS+MS+ WDLHLSLEHMKHE+V PDLVTYGCVVDAYLD+RLGRNL F L K++ DDS
Sbjct: 296  AVAFSRMSMFWDLHLSLEHMKHESVGPDLVTYGCVVDAYLDKRLGRNLDFGLSKMNLDDS 355

Query: 1810 VSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
              + TDP VFE  GKGDFHS+SE  +E+ +++ WTY+ LI++YLKK+ R NQIFWNY
Sbjct: 356  PVVSTDPYVFEAFGKGDFHSSSEAFLEFKRQRKWTYRKLIAVYLKKQLRRNQIFWNY 412


>ref|XP_002331436.1| predicted protein [Populus trichocarpa]
            gi|566215849|ref|XP_006372219.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
            gi|550318750|gb|ERP50016.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
          Length = 428

 Score =  392 bits (1008), Expect = e-106
 Identities = 209/385 (54%), Positives = 258/385 (67%), Gaps = 2/385 (0%)
 Frame = +1

Query: 832  QGKCDDYVDRRDAC-NSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLLLH 1008
            Q K D  V  ++ C +  SLI      + R P    + + E K EG  PD+  LS ++L 
Sbjct: 46   QWKRDQGVFGKETCADCASLIQTLC--KHRRPHLAEELLLELKCEGFLPDNRTLSAMMLC 103

Query: 1009 YASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQ-LKDSEM 1185
            YA +GL  +A  IW+EML SSFVP  +V+S+LI IY     FD V +IL Q+  L+  + 
Sbjct: 104  YADSGLLPQAQAIWEEMLYSSFVPSVQVISDLIDIYAKSGLFDEVIKILDQLSSLRTFDF 163

Query: 1186 LHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAY 1365
            L  ++SLAISCFGK GQL+ ME  +KKMVS G+ VDS+TGNA+V+Y S+ GSLAEMEAAY
Sbjct: 164  LPQVYSLAISCFGKGGQLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAY 223

Query: 1366 GRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAAN 1545
             RLK               FAYIKE +FY L  F+ D                  SY+AN
Sbjct: 224  DRLKRSRLLIEREGIRAMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSAN 283

Query: 1546 FKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTY 1725
            FKMK+LQREF+ M+EAGF PDL TFNIRALAFS+MSLLWDLHL LEHMKH+ V PDLVTY
Sbjct: 284  FKMKTLQREFLNMLEAGFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTY 343

Query: 1726 GCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKK 1905
            GC+VDAYLDRRL RNL+FAL K+  D+S  + TDP VFEV GKGDFHS+SE  ME+ +++
Sbjct: 344  GCIVDAYLDRRLVRNLEFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQR 403

Query: 1906 NWTYKMLISIYLKKKFRSNQIFWNY 1980
             WTY+ LI IYL+K+ RS  IFWNY
Sbjct: 404  KWTYRELIKIYLRKQHRSKHIFWNY 428


>ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Cucumis sativus] gi|449507537|ref|XP_004163059.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g42630-like [Cucumis sativus]
          Length = 388

 Score =  388 bits (996), Expect = e-105
 Identities = 196/376 (52%), Positives = 257/376 (68%)
 Frame = +1

Query: 853  VDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFD 1032
            VD     N+  +I   S  R R+P+   +   E KSEG   ++S LS +++HY  +G   
Sbjct: 15   VDDSFNINNSQVIKKLS--RRRMPILAKEIFLELKSEGFPLNNSTLSTIMVHYIDDGSPL 72

Query: 1033 KAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAI 1212
            +A  +W+EMLNS F P  +V+S+L   YG M  FD +T++L Q++L+ S +L + +SLAI
Sbjct: 73   QAQAMWEEMLNSCFEPSVQVISKLFNAYGKMGHFDYITKVLDQVKLRYSHLLPEAYSLAI 132

Query: 1213 SCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXX 1392
            SCFGK  QL+ ME  +++MVS G++V+S+TGN++++Y S+FGSL EME AYGRLK     
Sbjct: 133  SCFGKHKQLELMESTLREMVSSGFTVNSATGNSFIIYYSMFGSLVEMETAYGRLKRSRFL 192

Query: 1393 XXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQRE 1572
                      FAYI++ +FY LG F+ D                  SYAANFKMKSLQRE
Sbjct: 193  IEKKGIMAMAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQRE 252

Query: 1573 FVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLD 1752
            F++MV+AGF+PDL TFNIRALAFS+M LLWDLHLSLEHMKH  + PDLVTYGCVVDAY+D
Sbjct: 253  FLQMVDAGFNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYGCVVDAYVD 312

Query: 1753 RRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLIS 1932
            RRLGRNL+F L K++ D     LTD  VFE +GKGDFH +SE  M++ K+K WTY+ LIS
Sbjct: 313  RRLGRNLEFILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKKWTYRELIS 372

Query: 1933 IYLKKKFRSNQIFWNY 1980
            +YLKK  R NQ+FWNY
Sbjct: 373  LYLKKHHRRNQVFWNY 388


>ref|XP_002511816.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548996|gb|EEF50485.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 427

 Score =  385 bits (989), Expect = e-104
 Identities = 191/356 (53%), Positives = 244/356 (68%)
 Frame = +1

Query: 913  ERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKV 1092
            +R P    + + E KS+G   ++  LS +LL YA NGL  +A  IW  MLN SF P  ++
Sbjct: 72   KRTPHLAQEILLEMKSQGYVLNNPTLSAILLCYADNGLLPQAQAIWKHMLNGSFTPSIQI 131

Query: 1093 VSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMV 1272
            VS LI  Y     F+ V  IL Q+   +  +LH+ +SLAISCFGK GQL  ME  +K MV
Sbjct: 132  VSRLIDAYSKKGHFNEVMNILDQLSYSNFSLLHEAYSLAISCFGKGGQLQLMENALKDMV 191

Query: 1273 SMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFY 1452
              G+ VD +TGNA++ Y SI GSL +ME+AY RLK                AY+KE +FY
Sbjct: 192  LRGFPVDYATGNAFIRYYSIHGSLTDMESAYSRLKRSRHLVDREGIRAVSLAYVKERKFY 251

Query: 1453 ALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRA 1632
             LG F+ D                  S+AANFKMKSLQREF+RM+EAGF PD+ TFNIRA
Sbjct: 252  RLGEFLRDVGLGRKDVGNLIWNFLLLSFAANFKMKSLQREFLRMLEAGFHPDVTTFNIRA 311

Query: 1633 LAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSV 1812
            LAFS+MSLLWDLHL+LEHMKHE V PD+VTYGC+VDAYLDRRLG+NL FA++K++ D S 
Sbjct: 312  LAFSRMSLLWDLHLTLEHMKHEKVSPDIVTYGCIVDAYLDRRLGKNLDFAIKKMNLDGSP 371

Query: 1813 SILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
             +LTDP VFEV+GKGDFHS++E  +E+ +++ WTY+ L+SIYL+K++RSNQIFWNY
Sbjct: 372  VLLTDPFVFEVLGKGDFHSSAEAFLEFKRQRKWTYRELVSIYLRKQYRSNQIFWNY 427


>gb|EPS70737.1| hypothetical protein M569_04022 [Genlisea aurea]
          Length = 375

 Score =  380 bits (977), Expect = e-102
 Identities = 210/386 (54%), Positives = 257/386 (66%), Gaps = 2/386 (0%)
 Frame = +1

Query: 829  LQGKCDDYVDRRDACNSGSLIHGFSGERE-RLPLFPGKNVYEKKSEGLFPDDSALSVLLL 1005
            LQ K +DY  RR+A  S S   G  G RE RL L           +G  P   A S  LL
Sbjct: 5    LQCKYEDYGLRRNARVSNSQ-KGTHGSREVRLNL-----------KGHLP---AYSSSLL 49

Query: 1006 HYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMR-DFDMVTRILLQMQLKDSE 1182
            +YA +G   KA  IW  ML+S    D  +V  LIFIYG+M+ DFDMV+RIL  MQ KD+E
Sbjct: 50   YYACHGPSCKALEIWQNMLSSFIAVDTCLVVRLIFIYGNMQQDFDMVSRILHDMQAKDAE 109

Query: 1183 MLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAA 1362
             L  I++LA+S FG+ G L CME M+KKMVSMGY VDS+TGNAY++Y   FGS+ EME  
Sbjct: 110  SLPGIYALAVSSFGEIGDLKCMEYMVKKMVSMGYCVDSATGNAYLMYYGSFGSITEMERI 169

Query: 1363 YGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAA 1542
            YGRLK                AYIKE++FY+L  FVHD                   YAA
Sbjct: 170  YGRLKRSRIVIEEEAIRAVSLAYIKESKFYSLCGFVHDLGVGRSDVGNLLWNLLLLCYAA 229

Query: 1543 NFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVT 1722
             FKMKSLQREFVRM+E GF PD++TFNIR++AFS+MSLLWDL +SLE MKH+ VV DLVT
Sbjct: 230  RFKMKSLQREFVRMIEWGFKPDIDTFNIRSIAFSRMSLLWDLEVSLEQMKHDGVVGDLVT 289

Query: 1723 YGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKK 1902
            YGCV+DAY+DR+LG NL+F L+ LDW D V + TDP+VFE MGKG+FHS+SE +MEY K+
Sbjct: 290  YGCVIDAYMDRKLGGNLEFGLRGLDWSDGVRVWTDPMVFEAMGKGEFHSSSEKLMEYWKE 349

Query: 1903 KNWTYKMLISIYLKKKFRSNQIFWNY 1980
              W+Y+ LI +YL+KK RS+ IFWNY
Sbjct: 350  GGWSYRKLIHVYLRKKSRSDHIFWNY 375


>gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 429

 Score =  377 bits (969), Expect = e-101
 Identities = 192/342 (56%), Positives = 243/342 (71%), Gaps = 2/342 (0%)
 Frame = +1

Query: 961  EGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNS-SFVPDAKVVSELIFIYGSMRDFD 1137
            +GL P++S LS ++L YA NGLF +A  IW+EMLN+ SF P  +VVS+ +  YG M  F 
Sbjct: 88   QGLIPNNSTLSEIMLWYADNGLFPQAQAIWEEMLNTTSFTPSIQVVSKFMDAYGKMGHFH 147

Query: 1138 MVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYV 1317
             V +IL ++ L    +L +++ +AISCFGK G+LD ME  +K+MVS G  VDS+TGNA+V
Sbjct: 148  KVHKILDRVILLRVNLLPEVYPVAISCFGKHGRLDLMENTLKEMVSRGLPVDSATGNAFV 207

Query: 1318 LYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXX 1497
             Y SIFGSL+EME AY RLK                AYIKE +FY LG F++D       
Sbjct: 208  RYYSIFGSLSEMEIAYARLKRSRHLIEEEGIRAMSSAYIKEGKFYRLGEFLNDLGLGRRN 267

Query: 1498 XXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLS 1677
                       SYAANFKMK++QR F++M+++GF PDL TFNIRA AFS+MS+ WDLHLS
Sbjct: 268  LGNLLWNLLLLSYAANFKMKTMQRLFLKMMDSGFRPDLTTFNIRAWAFSRMSMFWDLHLS 327

Query: 1678 LEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKG 1857
            LEHMKHE+VV DLVTYGCVVDAYLDRRL RNL FAL  ++ DDS  +LTDPLVFE +GKG
Sbjct: 328  LEHMKHESVVSDLVTYGCVVDAYLDRRLARNLDFALNHMNADDSPLVLTDPLVFEALGKG 387

Query: 1858 DFHSNSEVVMEYVK-KKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            DFHS++E  +E+ + KK WTY+ LI++YLKK+ R NQIFWNY
Sbjct: 388  DFHSSAEAFLEFKRQKKKWTYRQLIAVYLKKQLRRNQIFWNY 429


>gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris]
            gi|561005327|gb|ESW04321.1| hypothetical protein
            PHAVU_011G085400g [Phaseolus vulgaris]
          Length = 411

 Score =  369 bits (946), Expect = 5e-99
 Identities = 186/371 (50%), Positives = 243/371 (65%), Gaps = 2/371 (0%)
 Frame = +1

Query: 874  NSGSLIHGFSGERERLPLFPGKN--VYEKKSEGLFPDDSALSVLLLHYASNGLFDKAHGI 1047
            +S SL+   S +R    +FP  +   ++ K +G  P  ++L VL+L+Y  NGLF +A   
Sbjct: 45   DSSSLVQNNSRKR----MFPQSDGVFHDTKDDGYMPKQTSLCVLMLYYTENGLFPQAQTT 100

Query: 1048 WDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISCFGK 1227
            W+++L SSFVP  + +S L   Y     FD V  IL  + +++  +L +++SLAISCFG+
Sbjct: 101  WEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNVYSLAISCFGR 160

Query: 1228 RGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXX 1407
             GQL+ ME M K+M S G  + S T NA+VLY SIFGSL +ME AYGRLK          
Sbjct: 161  EGQLELMEDMAKEMASRGVHISSKTANAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREV 220

Query: 1408 XXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFVRMV 1587
                  AY +E +FY LG F+ D                  SYAANFKMKSLQ+EF++MV
Sbjct: 221  IRAMASAYTRERQFYELGEFLRDVGLVRKDVGNLLWNLMLLSYAANFKMKSLQKEFLQMV 280

Query: 1588 EAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGR 1767
            E+GF PD+ TFNIRALAFS+M+L WDLHLS+EHM+HE V+PDLVT+GCVVDAYLDR LG+
Sbjct: 281  ESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGK 340

Query: 1768 NLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKK 1947
            NL FAL K++ DDS  +LTDP V+E +GKGDF  +SE   E+   + WTY+ LI  YLKK
Sbjct: 341  NLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKK 400

Query: 1948 KFRSNQIFWNY 1980
             +R NQIFWNY
Sbjct: 401  HYRRNQIFWNY 411


>ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Glycine max]
          Length = 415

 Score =  368 bits (944), Expect = 9e-99
 Identities = 187/392 (47%), Positives = 248/392 (63%)
 Frame = +1

Query: 805  LW*CLEQSLQGKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDS 984
            +W   E+ + G  D+ VD      + S        R+R+      ++++ K EG  P  +
Sbjct: 32   IWWQNEKGVIGGKDNSVDCSSLAQNSS--------RKRMIHQSDGSLHDIKVEGYMPKQT 83

Query: 985  ALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQM 1164
            +L V +L+Y  NG F +A  +W++++NSSFVP  + +S L   Y   R FD+V  IL  +
Sbjct: 84   SLCVSMLYYTENGFFPQAQTLWEQLVNSSFVPSVQFISRLFDAYAKHRKFDVVIDILRYV 143

Query: 1165 QLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSL 1344
             +++  +L D++ LAISCFG+ GQL+ ME M  +M S G  + S T NA++LY S+FG+L
Sbjct: 144  DMRNFSILPDVYWLAISCFGREGQLELMEDMANEMASSGVHIYSRTANAFLLYYSLFGTL 203

Query: 1345 AEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXX 1524
             EME  YGRLK                AYIKE +FY LG F+ D                
Sbjct: 204  EEMENTYGRLKKSRFLIEKEVIRAVASAYIKERKFYELGEFLRDVGLRRKNVGNLLWNLM 263

Query: 1525 XXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAV 1704
              SYAANFKMKSLQREF+ MVE+GF PD+ TFNIRALAFS+M+L WDLHLS+EHM+H  +
Sbjct: 264  LLSYAANFKMKSLQREFIGMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHTKI 323

Query: 1705 VPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVV 1884
            +PDLVT+GCVVDAYLDRRLGRNL FAL K++ DDS  +LTDP V+E +GKG F  +SE  
Sbjct: 324  IPDLVTFGCVVDAYLDRRLGRNLDFALNKMNLDDSPRLLTDPFVYEALGKGGFQMSSEAF 383

Query: 1885 MEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
             EY  ++ WTY+ LI  YLKK +R NQIFWNY
Sbjct: 384  FEYKTQRKWTYRSLIQKYLKKHYRKNQIFWNY 415


>gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007757|gb|ESW06706.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 423

 Score =  365 bits (938), Expect = 4e-98
 Identities = 180/344 (52%), Positives = 228/344 (66%)
 Frame = +1

Query: 949  EKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMR 1128
            + K +G  P  ++L VL+L+Y  NGLF  A   W+++L SSFVP  + +S L   Y    
Sbjct: 80   DTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQLLYSSFVPSVEFISRLFDAYAKHG 139

Query: 1129 DFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGN 1308
             FD V  IL  + +++  +L +++SLAI CFG+ GQL+ ME M K+M S G  V S TGN
Sbjct: 140  KFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQLELMEDMAKEMASRGVHVSSKTGN 199

Query: 1309 AYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXX 1488
            A+VLY SIFGSL +ME AYGRLK                AY +E +FY LG F+ D    
Sbjct: 200  AFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMASAYTRERQFYELGEFIRDVGLG 259

Query: 1489 XXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDL 1668
                          SYA NFKMKSLQ+EF++MVE+GF PD+ TFNIRALAFS+M+L WDL
Sbjct: 260  RKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDL 319

Query: 1669 HLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVM 1848
            HLS+EHM+HE V+PDLVT+GCVVDAYLDR LGRNL FAL K++ DDS  +LTDP V+E +
Sbjct: 320  HLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFALNKMNLDDSPMLLTDPFVYEAL 379

Query: 1849 GKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            GKGDF  +SE   E+   + WTY+ LI  YLKK +R NQIFWNY
Sbjct: 380  GKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQIFWNY 423


>gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007756|gb|ESW06705.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 372

 Score =  365 bits (938), Expect = 4e-98
 Identities = 180/344 (52%), Positives = 228/344 (66%)
 Frame = +1

Query: 949  EKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMR 1128
            + K +G  P  ++L VL+L+Y  NGLF  A   W+++L SSFVP  + +S L   Y    
Sbjct: 29   DTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQLLYSSFVPSVEFISRLFDAYAKHG 88

Query: 1129 DFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGN 1308
             FD V  IL  + +++  +L +++SLAI CFG+ GQL+ ME M K+M S G  V S TGN
Sbjct: 89   KFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQLELMEDMAKEMASRGVHVSSKTGN 148

Query: 1309 AYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXX 1488
            A+VLY SIFGSL +ME AYGRLK                AY +E +FY LG F+ D    
Sbjct: 149  AFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMASAYTRERQFYELGEFIRDVGLG 208

Query: 1489 XXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDL 1668
                          SYA NFKMKSLQ+EF++MVE+GF PD+ TFNIRALAFS+M+L WDL
Sbjct: 209  RKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDL 268

Query: 1669 HLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVM 1848
            HLS+EHM+HE V+PDLVT+GCVVDAYLDR LGRNL FAL K++ DDS  +LTDP V+E +
Sbjct: 269  HLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFALNKMNLDDSPMLLTDPFVYEAL 328

Query: 1849 GKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            GKGDF  +SE   E+   + WTY+ LI  YLKK +R NQIFWNY
Sbjct: 329  GKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQIFWNY 372


>gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis arenosa]
          Length = 419

 Score =  362 bits (929), Expect = 5e-97
 Identities = 185/357 (51%), Positives = 239/357 (66%)
 Frame = +1

Query: 910  RERLPLFPGKNVYEKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAK 1089
            + RLP    +   + KS  L P+   L  L+L +A NG   +A  IWDE+LNSSFVPD  
Sbjct: 63   QRRLPDVAHEIFIQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEILNSSFVPDVF 122

Query: 1090 VVSELIFIYGSMRDFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKM 1269
            VVS+LI  Y  +  FD V +I   +  + S +L  ++SLAISCFGK GQL+ ME +I++M
Sbjct: 123  VVSKLISAYEQLGFFDEVAKITKDVAARHSTLLPVVYSLAISCFGKNGQLELMEGVIEEM 182

Query: 1270 VSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRF 1449
             S G S+DS+T NA V Y S FG+L ++E AYGRLK                AY+K+ +F
Sbjct: 183  DSKGMSLDSATANAIVRYFSFFGTLDKIEHAYGRLKKFGIVIEEEEIRAVLLAYLKQRKF 242

Query: 1450 YALGRFVHDXXXXXXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIR 1629
            Y L  F+ D                  SYAA FKMKSLQREF+ M++AGFSPDL TFNIR
Sbjct: 243  YRLREFLSDVGLGRRNLGNMLWNSVLLSYAAEFKMKSLQREFIEMLDAGFSPDLTTFNIR 302

Query: 1630 ALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDS 1809
            ALAFS+M+L WDLHL+LEHM+   +VPDLVT+GCVVDAY+D+RL RNL+F   +++ DDS
Sbjct: 303  ALAFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEFVYNQMNLDDS 362

Query: 1810 VSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
              +LTDPL FEV+GKGDFH +SE V+E+  +KNWTY+ LI +Y+KKK R +QIFWNY
Sbjct: 363  PVVLTDPLAFEVLGKGDFHLSSEAVLEFSTEKNWTYRKLIGVYVKKKLRRDQIFWNY 419


>ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda]
            gi|548854641|gb|ERN12551.1| hypothetical protein
            AMTR_s00025p00206120 [Amborella trichopoda]
          Length = 354

 Score =  361 bits (926), Expect = 1e-96
 Identities = 180/344 (52%), Positives = 235/344 (68%)
 Frame = +1

Query: 949  EKKSEGLFPDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMR 1128
            E +S+      + LS L++  A NGLF  ++ IW E++NSSF  D  VVSEL+  YG   
Sbjct: 11   EIESQNFRTGCTTLSALMICCAENGLFSLSNAIWTEIINSSFELDIGVVSELMHAYGKAN 70

Query: 1129 DFDMVTRILLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGN 1308
             +D V R+L +   ++  +  +I+++AISCFGK  QL+ ME  IK+MVS G+ VDS+TGN
Sbjct: 71   LYDEVYRMLNEAISREFNLCPEIYTVAISCFGKGAQLELMEATIKEMVSRGFKVDSNTGN 130

Query: 1309 AYVLYSSIFGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXX 1488
            A+++Y S FGSLAEME AYGRLK                AYI+E +F+ +G F+ D    
Sbjct: 131  AFIIYYSSFGSLAEMEIAYGRLKCSRILIEREAIRAMASAYIRERKFFKMGEFLRDVGLG 190

Query: 1489 XXXXXXXXXXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDL 1668
                          SYAANFKMKSLQR F+ M+EAGFSPD+ TFNIR LAFS+M + WDL
Sbjct: 191  RRNSGNLLWNLLLLSYAANFKMKSLQRTFLGMLEAGFSPDITTFNIRTLAFSRMCMFWDL 250

Query: 1669 HLSLEHMKHEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVM 1848
            HLS+EHM+H  V+PDLVTYGC+VDAY++RR GRNL F L+ ++ D S  ILTDP+V+EV 
Sbjct: 251  HLSIEHMRHMNVIPDLVTYGCIVDAYVERRFGRNLGFGLKCMNLDSSPLILTDPIVYEVF 310

Query: 1849 GKGDFHSNSEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            GKGDFHS+SE ++E   KK WTY  L++ YLKK++RSNQIFWNY
Sbjct: 311  GKGDFHSSSEALLELKWKKEWTYSKLVAFYLKKRYRSNQIFWNY 354


>emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]
          Length = 446

 Score =  360 bits (923), Expect = 2e-96
 Identities = 205/396 (51%), Positives = 247/396 (62%)
 Frame = +1

Query: 793  WKENLW*CLEQSLQGKCDDYVDRRDACNSGSLIHGFSGERERLPLFPGKNVYEKKSEGLF 972
            WK+      E+S+ GK D+YVD         LI   S  R+RLP    + ++E KSE   
Sbjct: 105  WKQ------ERSVDGK-DNYVDYTP------LIQALS--RKRLPHVAQELLFEMKSE--- 146

Query: 973  PDDSALSVLLLHYASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRI 1152
                           NGLF KA  +WDE++NSSF P+ ++VS+LI  YG M  F  VTRI
Sbjct: 147  --------------DNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEVTRI 192

Query: 1153 LLQMQLKDSEMLHDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSI 1332
            L Q                       GQL+ ME  +K+MVS G+ VDS+TGNA++ Y SI
Sbjct: 193  LHQ----------------------GGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSI 230

Query: 1333 FGSLAEMEAAYGRLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXX 1512
            FGSL EMEAAY RLK               FAYIKE ++Y LG+F+ D            
Sbjct: 231  FGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLL 290

Query: 1513 XXXXXXSYAANFKMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMK 1692
                  SYAANFKMKSLQREF+ MVEAGF+PDL TFNIRALAFS+MSL WDLHLSLEHM+
Sbjct: 291  WNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQ 350

Query: 1693 HEAVVPDLVTYGCVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSN 1872
            H  VV DLVTYGCVVDAYLDRRLG+NL FAL+K++ DDS  + TD  VFEV+GKGDFHS+
Sbjct: 351  HVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSS 410

Query: 1873 SEVVMEYVKKKNWTYKMLISIYLKKKFRSNQIFWNY 1980
            SE  +E  +   WTY+ LI+ YLKKK+RSNQIFWNY
Sbjct: 411  SEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 446


>ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa]
            gi|550318749|gb|ERP50015.1| hypothetical protein
            POPTR_0018s14360g [Populus trichocarpa]
          Length = 392

 Score =  358 bits (920), Expect = 5e-96
 Identities = 196/384 (51%), Positives = 238/384 (61%), Gaps = 1/384 (0%)
 Frame = +1

Query: 832  QGKCDDYVDRRDAC-NSGSLIHGFSGERERLPLFPGKNVYEKKSEGLFPDDSALSVLLLH 1008
            Q K D  V  ++ C +  SLI      + R P    + + E K EG  PD+  LS ++L 
Sbjct: 46   QWKRDQGVFGKETCADCASLIQTLC--KHRRPHLAEELLLELKCEGFLPDNRTLSAMMLC 103

Query: 1009 YASNGLFDKAHGIWDEMLNSSFVPDAKVVSELIFIYGSMRDFDMVTRILLQMQLKDSEML 1188
            YA +GL  +A  IW+EML SSFVP  +V                                
Sbjct: 104  YADSGLLPQAQAIWEEMLYSSFVPSVQV-------------------------------- 131

Query: 1189 HDIFSLAISCFGKRGQLDCMEIMIKKMVSMGYSVDSSTGNAYVLYSSIFGSLAEMEAAYG 1368
               +SLAISCFGK GQL+ ME  +KKMVS G+ VDS+TGNA+V+Y S+ GSLAEMEAAY 
Sbjct: 132  ---YSLAISCFGKGGQLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAYD 188

Query: 1369 RLKXXXXXXXXXXXXXXXFAYIKENRFYALGRFVHDXXXXXXXXXXXXXXXXXXSYAANF 1548
            RLK               FAYIKE +FY L  F+ D                  SY+ANF
Sbjct: 189  RLKRSRLLIEREGIRAMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSANF 248

Query: 1549 KMKSLQREFVRMVEAGFSPDLNTFNIRALAFSKMSLLWDLHLSLEHMKHEAVVPDLVTYG 1728
            KMK+LQREF+ M+EAGF PDL TFNIRALAFS+MSLLWDLHL LEHMKH+ V PDLVTYG
Sbjct: 249  KMKTLQREFLNMLEAGFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYG 308

Query: 1729 CVVDAYLDRRLGRNLKFALQKLDWDDSVSILTDPLVFEVMGKGDFHSNSEVVMEYVKKKN 1908
            C+VDAYLDRRL RNL+FAL K+  D+S  + TDP VFEV GKGDFHS+SE  ME+ +++ 
Sbjct: 309  CIVDAYLDRRLVRNLEFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRK 368

Query: 1909 WTYKMLISIYLKKKFRSNQIFWNY 1980
            WTY+ LI IYL+K+ RS  IFWNY
Sbjct: 369  WTYRELIKIYLRKQHRSKHIFWNY 392

