BLASTX nr result

ID: Sinomenium21_contig00016674 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00016674
         (1815 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containi...   593   e-167
ref|XP_007222454.1| hypothetical protein PRUPE_ppa006191mg [Prun...   555   e-155
ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containi...   545   e-152
ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citr...   525   e-146
ref|XP_006372219.1| pentatricopeptide repeat-containing family p...   523   e-145
ref|XP_002511816.1| pentatricopeptide repeat-containing protein,...   520   e-145
emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]   511   e-142
ref|XP_007014560.1| Pentatricopeptide repeat superfamily protein...   496   e-137
ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [A...   491   e-136
ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containi...   488   e-135
ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containi...   486   e-134
ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containi...   485   e-134
ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Popu...   478   e-132
ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containi...   470   e-130
ref|XP_007134710.1| hypothetical protein PHAVU_010G069800g [Phas...   469   e-129
ref|XP_007134709.1| hypothetical protein PHAVU_010G069800g [Phas...   468   e-129
gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis are...   467   e-129
ref|XP_007132326.1| hypothetical protein PHAVU_011G085400g [Phas...   465   e-128
ref|NP_566863.2| pentatricopeptide repeat-containing protein [Ar...   463   e-127

>ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630
            [Vitis vinifera] gi|297736023|emb|CBI24061.3| unnamed
            protein product [Vitis vinifera]
          Length = 423

 Score =  593 bits (1529), Expect = e-167
 Identities = 290/399 (72%), Positives = 339/399 (84%)
 Frame = +2

Query: 128  HQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSE 307
            HQN  S  R++AR++ WH K++ S  GKD Y+++  +++ L RKR+PH+AQ+L  EM SE
Sbjct: 26   HQNY-SPNRALARKLFWHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKSE 84

Query: 308  GFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAV 487
            GFLP   TLSALMLCYADNGLF  AQ +WDEIINSS+ PNI++VS+LI+AYGKMG F  V
Sbjct: 85   GFLPNNSTLSALMLCYADNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEV 144

Query: 488  SRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQY 667
            +RIL +V+ RDF    EVYS AI CFGKGGQLE+ME  +KEMVS GFPVDSATGNAF++Y
Sbjct: 145  TRILHQVSSRDFNFMHEVYSLAISCFGKGGQLEMMENALKEMVSRGFPVDSATGNAFIRY 204

Query: 668  YSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVG 847
            YS FGSLT MEAAY RLK+S+ILIE+EGIRAM+ AYIKE K+++LG+FLR VGLGRKNVG
Sbjct: 205  YSIFGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVG 264

Query: 848  NLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLE 1027
            NLLWN LLLSYAANFKMKSLQREF  M EAGF+PDL+TFNIRALAFSRM+LFWDLH+SLE
Sbjct: 265  NLLWNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLE 324

Query: 1028 HMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDF 1207
            HM+H KVV DLVTYGCVVDA+LDR++G+NL+FAL KMN +DSPLVSTD  VFEVLGKGDF
Sbjct: 325  HMQHVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDF 384

Query: 1208 HSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            HSSSEAFLE  R   WTYRKLIA YL+KKYRSNQ+FWNY
Sbjct: 385  HSSSEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 423


>ref|XP_007222454.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica]
            gi|462419390|gb|EMJ23653.1| hypothetical protein
            PRUPE_ppa006191mg [Prunus persica]
          Length = 423

 Score =  555 bits (1429), Expect = e-155
 Identities = 277/429 (64%), Positives = 347/429 (80%)
 Frame = +2

Query: 38   MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217
            MG   LL+T+ C S SL    +P+  +  SHQ + S  R++AR+II   K++    GK I
Sbjct: 1    MGGTLLLSTT-CVSSSL----KPQHLSFSSHQPQ-SQSRALARKIIRKWKQEECFDGKGI 54

Query: 218  YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397
            Y++   ++R L R++MPH+AQ+L LEM S+G LP   TLSALMLC+A+NGLF  A+ IWD
Sbjct: 55   YVDCVPLIRSLSRQKMPHVAQELVLEMKSDGLLPSNSTLSALMLCHANNGLFPQAEAIWD 114

Query: 398  EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577
            E+++SS+VP+I+VVSEL +AYG +G F+ V+ IL ++  R+  L PEVYS AI CFGKGG
Sbjct: 115  EMLHSSFVPSIQVVSELFDAYGNVGCFEKVNEILAQIRSRNLSLFPEVYSLAISCFGKGG 174

Query: 578  QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757
            QLELME T+KEM+S GFP+DSATGNAF++YYS FGSLT ME AYGRLKRS+ LIE+EGIR
Sbjct: 175  QLELMEGTLKEMISRGFPLDSATGNAFIRYYSIFGSLTEMETAYGRLKRSRFLIEEEGIR 234

Query: 758  AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937
            AM+ AY+K+ KF++L E L+ VGLGR+N+GNL WN LLLSYAA+FKMKSLQREF RM EA
Sbjct: 235  AMSFAYLKKRKFYRLAELLKNVGLGRRNLGNLSWNLLLLSYAADFKMKSLQREFLRMVEA 294

Query: 938  GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117
            GF PDL+TFNIRALAFSRM+L WDLH+SLEHMKHEKV PDLVT GCVVDA+L+R++G+N+
Sbjct: 295  GFHPDLTTFNIRALAFSRMSLLWDLHLSLEHMKHEKVFPDLVTCGCVVDAYLERRLGKNM 354

Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297
             FALNKMN +DSPL+ TD  VFEVLGKGDFH+SSEAFLE   QR WTYR+LI++YL+K+Y
Sbjct: 355  YFALNKMNLDDSPLILTDPFVFEVLGKGDFHASSEAFLEFQSQREWTYRRLISVYLKKQY 414

Query: 1298 RSNQLFWNY 1324
            R NQ+FWNY
Sbjct: 415  RRNQIFWNY 423


>ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Fragaria vesca subsp. vesca]
          Length = 424

 Score =  545 bits (1405), Expect = e-152
 Identities = 262/409 (64%), Positives = 332/409 (81%)
 Frame = +2

Query: 98   LRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIA 277
            L P   ++ SHQ++ S  R++AR+I+   K++  + GKD Y++   +++ L R++MPH+A
Sbjct: 16   LNPNRLSVLSHQSQRSQNRALARKIVRTWKQEECSRGKDCYVDCVPLIQSLSRQKMPHVA 75

Query: 278  QQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEA 457
            Q++ L M SEG +P   TLSA+MLC+A NGL   A+ IWDE++NSS+VP I+VVSEL + 
Sbjct: 76   QEVLLVMKSEGLIPSNSTLSAVMLCHAKNGLLPQAEAIWDEMLNSSFVPGIQVVSELFDV 135

Query: 458  YGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVD 637
            YG +G F  V+ I+ ++  R+  L P+VYS AI CFGKGGQLELME T+KEMVS GFPVD
Sbjct: 136  YGNVGSFGKVNEIVGQIRSRNLSLLPQVYSLAISCFGKGGQLELMEDTLKEMVSRGFPVD 195

Query: 638  SATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLR 817
            SATGN F++YYS FGSLT ME AY RLKRS+ LIE+EGIRAM+ AY+K+ KF+ L EFL+
Sbjct: 196  SATGNVFIRYYSIFGSLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFYSLAEFLK 255

Query: 818  GVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMA 997
             VGLGR+N+GNLLWN LLLSYAANFKMK+LQREF RM EAGF PDL+TFNIRALAFSRM+
Sbjct: 256  SVGLGRRNLGNLLWNLLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRALAFSRMS 315

Query: 998  LFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQL 1177
            L WDLH++LEHMKH KVVPDLVT GC+VDA+LDR++GRNL FALNKMN +DSP+V TD  
Sbjct: 316  LLWDLHLTLEHMKHVKVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSPVVLTDPF 375

Query: 1178 VFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            VFEVLGKGDFH+SSEAFLE  +Q+ WTY+KLI++YL+K+YR +Q+FWNY
Sbjct: 376  VFEVLGKGDFHASSEAFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424


>ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citrus clementina]
            gi|568840749|ref|XP_006474328.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X1 [Citrus sinensis]
            gi|568840751|ref|XP_006474329.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Citrus sinensis]
            gi|568840753|ref|XP_006474330.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X3 [Citrus sinensis]
            gi|568840755|ref|XP_006474331.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X4 [Citrus sinensis]
            gi|568840757|ref|XP_006474332.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X5 [Citrus sinensis]
            gi|568840759|ref|XP_006474333.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X6 [Citrus sinensis]
            gi|557556412|gb|ESR66426.1| hypothetical protein
            CICLE_v10010414mg [Citrus clementina]
          Length = 412

 Score =  525 bits (1351), Expect = e-146
 Identities = 264/417 (63%), Positives = 330/417 (79%)
 Frame = +2

Query: 74   FSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLG 253
            FSLSL    + K   + SHQ     G  +AR+II ++K++        +++ AS++  LG
Sbjct: 4    FSLSLHGSFKFKRFNVPSHQTHPKNG-DLARKIIRYRKQEG-------FVDCASLVEDLG 55

Query: 254  RKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIK 433
            RK+ PH+A QL   + SEG LP   TL ALMLCYA+NG    AQ +W+E+++SS+V +++
Sbjct: 56   RKKKPHLAHQLVNTVKSEGLLPDNSTLCALMLCYANNGFVLEAQVVWEELLSSSFVLSVQ 115

Query: 434  VVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEM 613
            V+S+L++AYG++G F+ +  I+ +V+ R+  L PEVYS AI CFGK GQLELME T+KEM
Sbjct: 116  VLSDLMDAYGRIGCFNEIISIIDQVSCRNADLLPEVYSRAISCFGKQGQLELMENTLKEM 175

Query: 614  VSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKF 793
            VS GF VDSATGNAF+ YYS+FGSLT ME AYGRLKRS+ LI+KEGIRA++  Y+KE KF
Sbjct: 176  VSRGFSVDSATGNAFIIYYSRFGSLTEMETAYGRLKRSRHLIDKEGIRAVSFTYLKERKF 235

Query: 794  HKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIR 973
              LGEFLR VGLGRK++GNLLWN LLLSYA NFKMKSLQREF RM+EAGF PDL+TFNIR
Sbjct: 236  FMLGEFLRDVGLGRKDLGNLLWNLLLLSYAGNFKMKSLQREFMRMSEAGFHPDLTTFNIR 295

Query: 974  ALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDS 1153
            A+AFSRM++FWDLH+SLEHMKHE V PDLVTYGCVVDA+LD+++GRNL+F L+KMN +DS
Sbjct: 296  AVAFSRMSMFWDLHLSLEHMKHESVGPDLVTYGCVVDAYLDKRLGRNLDFGLSKMNLDDS 355

Query: 1154 PLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            P+VSTD  VFE  GKGDFHSSSEAFLE  RQR WTYRKLIA+YL+K+ R NQ+FWNY
Sbjct: 356  PVVSTDPYVFEAFGKGDFHSSSEAFLEFKRQRKWTYRKLIAVYLKKQLRRNQIFWNY 412


>ref|XP_006372219.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550318750|gb|ERP50016.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 428

 Score =  523 bits (1347), Expect = e-145
 Identities = 261/430 (60%), Positives = 338/430 (78%), Gaps = 1/430 (0%)
 Frame = +2

Query: 38   MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217
            M   +++A + C++ ++    +PK  A++S + +D   R++A+++I   KR     GK+ 
Sbjct: 1    METKTVIAATTCYA-NVIGSYKPKRFAIFSIK-RDPKKRALAQKMIRQWKRDQGVFGKET 58

Query: 218  YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397
              + AS+++ L + R PH+A++L LE+  EGFLP   TLSA+MLCYAD+GL   AQ IW+
Sbjct: 59   CADCASLIQTLCKHRRPHLAEELLLELKCEGFLPDNRTLSAMMLCYADSGLLPQAQAIWE 118

Query: 398  EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVT-LRDFKLCPEVYSSAICCFGKG 574
            E++ SS+VP+++V+S+LI+ Y K G FD V +IL +++ LR F   P+VYS AI CFGKG
Sbjct: 119  EMLYSSFVPSVQVISDLIDIYAKSGLFDEVIKILDQLSSLRTFDFLPQVYSLAISCFGKG 178

Query: 575  GQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGI 754
            GQLELME T+K+MVS GF VDSATGNAF+ YYS  GSL  MEAAY RLKRS++LIE+EGI
Sbjct: 179  GQLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGI 238

Query: 755  RAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAE 934
            RAM+ AYIKE KF+ L EFLR VGLGRKN+GNL+WN LLLSY+ANFKMK+LQREF  M E
Sbjct: 239  RAMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLE 298

Query: 935  AGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRN 1114
            AGF PDL+TFNIRALAFSRM+L WDLH+ LEHMKH+KV PDLVTYGC+VDA+LDR++ RN
Sbjct: 299  AGFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRN 358

Query: 1115 LNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKK 1294
            L FAL+KM+ ++SP++STD  VFEV GKGDFHSSSEAF+E  RQR WTYR+LI +YLRK+
Sbjct: 359  LEFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQ 418

Query: 1295 YRSNQLFWNY 1324
            +RS  +FWNY
Sbjct: 419  HRSKHIFWNY 428


>ref|XP_002511816.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548996|gb|EEF50485.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 427

 Score =  520 bits (1340), Expect = e-145
 Identities = 252/402 (62%), Positives = 326/402 (81%)
 Frame = +2

Query: 119  LYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEM 298
            L+S Q +D   R +AR+II+  K+  S + K++  + AS+++ L  KR PH+AQ++ LEM
Sbjct: 29   LFSSQ-RDPTNRPLARKIIYQWKQDQSFSCKEV--DCASLVQNLHSKRTPHLAQEILLEM 85

Query: 299  NSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRF 478
             S+G++    TLSA++LCYADNGL   AQ IW  ++N S+ P+I++VS LI+AY K G F
Sbjct: 86   KSQGYVLNNPTLSAILLCYADNGLLPQAQAIWKHMLNGSFTPSIQIVSRLIDAYSKKGHF 145

Query: 479  DAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAF 658
            + V  IL +++  +F L  E YS AI CFGKGGQL+LME  +K+MV  GFPVD ATGNAF
Sbjct: 146  NEVMNILDQLSYSNFSLLHEAYSLAISCFGKGGQLQLMENALKDMVLRGFPVDYATGNAF 205

Query: 659  LQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRK 838
            ++YYS  GSLT ME+AY RLKRS+ L+++EGIRA++ AY+KE KF++LGEFLR VGLGRK
Sbjct: 206  IRYYSIHGSLTDMESAYSRLKRSRHLVDREGIRAVSLAYVKERKFYRLGEFLRDVGLGRK 265

Query: 839  NVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHV 1018
            +VGNL+WNFLLLS+AANFKMKSLQREF RM EAGF PD++TFNIRALAFSRM+L WDLH+
Sbjct: 266  DVGNLIWNFLLLSFAANFKMKSLQREFLRMLEAGFHPDVTTFNIRALAFSRMSLLWDLHL 325

Query: 1019 SLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGK 1198
            +LEHMKHEKV PD+VTYGC+VDA+LDR++G+NL+FA+ KMN + SP++ TD  VFEVLGK
Sbjct: 326  TLEHMKHEKVSPDIVTYGCIVDAYLDRRLGKNLDFAIKKMNLDGSPVLLTDPFVFEVLGK 385

Query: 1199 GDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            GDFHSS+EAFLE  RQR WTYR+L+++YLRK+YRSNQ+FWNY
Sbjct: 386  GDFHSSAEAFLEFKRQRKWTYRELVSIYLRKQYRSNQIFWNY 427


>emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera]
          Length = 446

 Score =  511 bits (1315), Expect = e-142
 Identities = 256/386 (66%), Positives = 301/386 (77%)
 Frame = +2

Query: 167  EIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALM 346
            E+ WH K++ S  GKD Y+++  +++ L RKR+PH+AQ+L  EM SE             
Sbjct: 100  ELFWHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKSE------------- 146

Query: 347  LCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFK 526
                DNGLF  AQ +WDEIINSS+ PNI++VS+LI+AYGKMG F  V+RIL +       
Sbjct: 147  ----DNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEVTRILHQ------- 195

Query: 527  LCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAA 706
                           GGQLE+ME  +KEMVS GFPVDSATGNAF++YYS FGSLT MEAA
Sbjct: 196  ---------------GGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSIFGSLTEMEAA 240

Query: 707  YGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAA 886
            Y RLK+S+ILIE+EGIRAM+ AYIKE K+++LG+FLR VGLGRKNVGNLLWN LLLSYAA
Sbjct: 241  YDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLLWNLLLLSYAA 300

Query: 887  NFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVT 1066
            NFKMKSLQREF  M EAGF+PDL+TFNIRALAFSRM+LFWDLH+SLEHM+H KVV DLVT
Sbjct: 301  NFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQHVKVVADLVT 360

Query: 1067 YGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQ 1246
            YGCVVDA+LDR++G+NL+FAL KMN +DSPLVSTD  VFEVLGKGDFHSSSEAFLE  R 
Sbjct: 361  YGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSSSEAFLESKRN 420

Query: 1247 RNWTYRKLIALYLRKKYRSNQLFWNY 1324
              WTYRKLIA YL+KKYRSNQ+FWNY
Sbjct: 421  GKWTYRKLIATYLKKKYRSNQIFWNY 446


>ref|XP_007014560.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508784923|gb|EOY32179.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 429

 Score =  496 bits (1277), Expect = e-137
 Identities = 252/404 (62%), Positives = 313/404 (77%), Gaps = 4/404 (0%)
 Frame = +2

Query: 125  SHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMP--HIAQQLWLEM 298
            S+ N     R + R  +W +       G+D +++F S+L+ L  K+MP  H+   L L+ 
Sbjct: 32   SNNNLPLARRQIIR--LWKRDGSILGVGRDNFVDFDSLLQTLASKKMPQPHVVHHLLLQ- 88

Query: 299  NSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINS-SYVPNIKVVSELIEAYGKMGR 475
               G +P   TLS +ML YADNGLF  AQ IW+E++N+ S+ P+I+VVS+ ++AYGKMG 
Sbjct: 89   ---GLIPNNSTLSEIMLWYADNGLFPQAQAIWEEMLNTTSFTPSIQVVSKFMDAYGKMGH 145

Query: 476  FDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNA 655
            F  V +IL  V L    L PEVY  AI CFGK G+L+LME T+KEMVS G PVDSATGNA
Sbjct: 146  FHKVHKILDRVILLRVNLLPEVYPVAISCFGKHGRLDLMENTLKEMVSRGLPVDSATGNA 205

Query: 656  FLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGR 835
            F++YYS FGSL+ ME AY RLKRS+ LIE+EGIRAM+SAYIKEGKF++LGEFL  +GLGR
Sbjct: 206  FVRYYSIFGSLSEMEIAYARLKRSRHLIEEEGIRAMSSAYIKEGKFYRLGEFLNDLGLGR 265

Query: 836  KNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLH 1015
            +N+GNLLWN LLLSYAANFKMK++QR F +M ++GF PDL+TFNIRA AFSRM++FWDLH
Sbjct: 266  RNLGNLLWNLLLLSYAANFKMKTMQRLFLKMMDSGFRPDLTTFNIRAWAFSRMSMFWDLH 325

Query: 1016 VSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLG 1195
            +SLEHMKHE VV DLVTYGCVVDA+LDR++ RNL+FALN MN +DSPLV TD LVFE LG
Sbjct: 326  LSLEHMKHESVVSDLVTYGCVVDAYLDRRLARNLDFALNHMNADDSPLVLTDPLVFEALG 385

Query: 1196 KGDFHSSSEAFLECNRQ-RNWTYRKLIALYLRKKYRSNQLFWNY 1324
            KGDFHSS+EAFLE  RQ + WTYR+LIA+YL+K+ R NQ+FWNY
Sbjct: 386  KGDFHSSAEAFLEFKRQKKKWTYRQLIAVYLKKQLRRNQIFWNY 429


>ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X1 [Solanum tuberosum]
            gi|565389826|ref|XP_006360649.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g42630-like isoform X2 [Solanum tuberosum]
          Length = 416

 Score =  494 bits (1272), Expect = e-137
 Identities = 251/423 (59%), Positives = 320/423 (75%)
 Frame = +2

Query: 56   LATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFAS 235
            +A     S+++   LRP   +L SHQN+ S     A++  W  K+  +   +  Y + AS
Sbjct: 1    MAAGLVVSIAVTPKLRP--FSLISHQNQSS-----AQKRRWRMKQGGNIDPRGNYADCAS 53

Query: 236  VLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSS 415
            +++ L RK++P  A++L LEM SEGF+P   TLSALMLCYA NGLF  A   WDEI+NSS
Sbjct: 54   LIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYASNGLFYKALAAWDEIMNSS 113

Query: 416  YVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELME 595
            ++P++ V++ELI+ Y   G  D   RIL ++ L+D  L  +VY+ AI  FGK GQLELME
Sbjct: 114  FLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSNLLRDVYAQAISRFGKKGQLELME 173

Query: 596  TTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAY 775
              +KEMVS GFPVDS TGNA++ YYS FG L+ ME AYGRLK S+ILIE+E IR+++ AY
Sbjct: 174  VMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKMSRILIEEEAIRSISLAY 233

Query: 776  IKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDL 955
            +K+ KF+ LG+F+R VGL R+NVGNLLWN LLLSYAANFKMKSLQREF RM E+GF PDL
Sbjct: 234  LKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVESGFFPDL 293

Query: 956  STFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNK 1135
            +TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVPDLVTYG VVDA+LDR +GRNL+FAL K
Sbjct: 294  NTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNLDFALRK 353

Query: 1136 MNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLF 1315
            +N ND  +V+T+ LVFE +GKGDFH SS+A LE ++ +NWTY +LI  YL+K +R NQ+F
Sbjct: 354  LNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYEELITTYLKKYFRRNQIF 413

Query: 1316 WNY 1324
            WNY
Sbjct: 414  WNY 416


>ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda]
            gi|548854641|gb|ERN12551.1| hypothetical protein
            AMTR_s00025p00206120 [Amborella trichopoda]
          Length = 354

 Score =  491 bits (1263), Expect = e-136
 Identities = 237/354 (66%), Positives = 282/354 (79%)
 Frame = +2

Query: 263  MPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVS 442
            MPH+ Q+L+ E+ S+ F     TLSALM+C A+NGLFS +  IW EIINSS+  +I VVS
Sbjct: 1    MPHVVQRLFTEIESQNFRTGCTTLSALMICCAENGLFSLSNAIWTEIINSSFELDIGVVS 60

Query: 443  ELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSN 622
            EL+ AYGK   +D V R+L E   R+F LCPE+Y+ AI CFGKG QLELME T+KEMVS 
Sbjct: 61   ELMHAYGKANLYDEVYRMLNEAISREFNLCPEIYTVAISCFGKGAQLELMEATIKEMVSR 120

Query: 623  GFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKL 802
            GF VDS TGNAF+ YYS FGSL  ME AYGRLK S+ILIE+E IRAMASAYI+E KF K+
Sbjct: 121  GFKVDSNTGNAFIIYYSSFGSLAEMEIAYGRLKCSRILIEREAIRAMASAYIRERKFFKM 180

Query: 803  GEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALA 982
            GEFLR VGLGR+N GNLLWN LLLSYAANFKMKSLQR F  M EAGFSPD++TFNIR LA
Sbjct: 181  GEFLRDVGLGRRNSGNLLWNLLLLSYAANFKMKSLQRTFLGMLEAGFSPDITTFNIRTLA 240

Query: 983  FSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLV 1162
            FSRM +FWDLH+S+EHM+H  V+PDLVTYGC+VDA+++R+ GRNL F L  MN + SPL+
Sbjct: 241  FSRMCMFWDLHLSIEHMRHMNVIPDLVTYGCIVDAYVERRFGRNLGFGLKCMNLDSSPLI 300

Query: 1163 STDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
             TD +V+EV GKGDFHSSSEA LE   ++ WTY KL+A YL+K+YRSNQ+FWNY
Sbjct: 301  LTDPIVYEVFGKGDFHSSSEALLELKWKKEWTYSKLVAFYLKKRYRSNQIFWNY 354


>ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Cucumis sativus] gi|449507537|ref|XP_004163059.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g42630-like [Cucumis sativus]
          Length = 388

 Score =  488 bits (1257), Expect = e-135
 Identities = 236/384 (61%), Positives = 303/384 (78%)
 Frame = +2

Query: 173  IWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLC 352
            I+ Q  + S+      +N + V++ L R+RMP +A++++LE+ SEGF     TLS +M+ 
Sbjct: 5    IFQQHNEGSSVDDSFNINNSQVIKKLSRRRMPILAKEIFLELKSEGFPLNNSTLSTIMVH 64

Query: 353  YADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLC 532
            Y D+G    AQ +W+E++NS + P+++V+S+L  AYGKMG FD ++++L +V LR   L 
Sbjct: 65   YIDDGSPLQAQAMWEEMLNSCFEPSVQVISKLFNAYGKMGHFDYITKVLDQVKLRYSHLL 124

Query: 533  PEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYG 712
            PE YS AI CFGK  QLELME+T++EMVS+GF V+SATGN+F+ YYS FGSL  ME AYG
Sbjct: 125  PEAYSLAISCFGKHKQLELMESTLREMVSSGFTVNSATGNSFIIYYSMFGSLVEMETAYG 184

Query: 713  RLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANF 892
            RLKRS+ LIEK+GI AMA AYI++ KF++LGEFLR VGLGRKNVGNLLWN LLLSYAANF
Sbjct: 185  RLKRSRFLIEKKGIMAMAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANF 244

Query: 893  KMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYG 1072
            KMKSLQREF +M +AGF+PDL+TFNIRALAFSRM L WDLH+SLEHMKH  + PDLVTYG
Sbjct: 245  KMKSLQREFLQMVDAGFNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYG 304

Query: 1073 CVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRN 1252
            CVVDA++DR++GRNL F L+KMN +  P+  TD  VFE LGKGDFH SSEAF++  +Q+ 
Sbjct: 305  CVVDAYVDRRLGRNLEFILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKK 364

Query: 1253 WTYRKLIALYLRKKYRSNQLFWNY 1324
            WTYR+LI+LYL+K +R NQ+FWNY
Sbjct: 365  WTYRELISLYLKKHHRRNQVFWNY 388


>ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            isoform X3 [Solanum tuberosum]
          Length = 409

 Score =  486 bits (1251), Expect = e-134
 Identities = 242/390 (62%), Positives = 305/390 (78%), Gaps = 5/390 (1%)
 Frame = +2

Query: 170  IIWHQKRQASATGKDI-----YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTL 334
            +I HQ + ++  G +I     Y + AS+++ L RK++P  A++L LEM SEGF+P   TL
Sbjct: 20   LISHQNQSSAQKGGNIDPRGNYADCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTL 79

Query: 335  SALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTL 514
            SALMLCYA NGLF  A   WDEI+NSS++P++ V++ELI+ Y   G  D   RIL ++ L
Sbjct: 80   SALMLCYASNGLFYKALAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQL 139

Query: 515  RDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTA 694
            +D  L  +VY+ AI  FGK GQLELME  +KEMVS GFPVDS TGNA++ YYS FG L+ 
Sbjct: 140  KDSNLLRDVYAQAISRFGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSE 199

Query: 695  MEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLL 874
            ME AYGRLK S+ILIE+E IR+++ AY+K+ KF+ LG+F+R VGL R+NVGNLLWN LLL
Sbjct: 200  MEVAYGRLKMSRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLL 259

Query: 875  SYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVP 1054
            SYAANFKMKSLQREF RM E+GF PDL+TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVP
Sbjct: 260  SYAANFKMKSLQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVP 319

Query: 1055 DLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLE 1234
            DLVTYG VVDA+LDR +GRNL+FAL K+N ND  +V+T+ LVFE +GKGDFH SS+A LE
Sbjct: 320  DLVTYGSVVDAYLDRGLGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLE 379

Query: 1235 CNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
             ++ +NWTY +LI  YL+K +R NQ+FWNY
Sbjct: 380  FSKNKNWTYEELITTYLKKYFRRNQIFWNY 409


>ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Solanum lycopersicum]
          Length = 381

 Score =  485 bits (1248), Expect = e-134
 Identities = 239/369 (64%), Positives = 296/369 (80%)
 Frame = +2

Query: 218  YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397
            Y + AS+++ L RK++P  A++L LEM SEGF+P   TLSALMLCYA NGLF  A   WD
Sbjct: 13   YRDCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYATNGLFCKALAAWD 72

Query: 398  EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577
            EI+NSS++P++ V++ELI+ YG  G  D   RIL ++ L+D  L  +VY+ AI  FGK G
Sbjct: 73   EIMNSSFLPDVHVIAELIDIYGCKGYLDVAVRILHQIQLKDSNLLRDVYAQAISRFGKKG 132

Query: 578  QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757
            QLELME  ++EMVS GFPVDS TGNA++ YYS FG+L+ ME AYGRLK S+ILIE+E IR
Sbjct: 133  QLELMEVMLEEMVSMGFPVDSTTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIR 192

Query: 758  AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937
            +++ AY+K+ KF+ LG+F+R VGL R+NVGNLLWN LLLSYAANFKMKSLQREF RM E+
Sbjct: 193  SISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVES 252

Query: 938  GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117
            GF PDL+TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVPDLVTYG VVDA+LDR +GRNL
Sbjct: 253  GFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNL 312

Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297
            +FAL K+NTND   V+T+ LVFE +GKGDFH SSEA LE +++ NWTY  LI  YL+K +
Sbjct: 313  DFALRKLNTNDCVTVATEPLVFEAMGKGDFHLSSEARLEFSKKTNWTYEVLITTYLKKYF 372

Query: 1298 RSNQLFWNY 1324
            R NQ+FWNY
Sbjct: 373  RRNQIFWNY 381


>ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa]
            gi|550318749|gb|ERP50015.1| hypothetical protein
            POPTR_0018s14360g [Populus trichocarpa]
          Length = 392

 Score =  478 bits (1231), Expect = e-132
 Identities = 245/429 (57%), Positives = 314/429 (73%)
 Frame = +2

Query: 38   MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217
            M   +++A + C++ ++    +PK  A++S + +D   R++A+++I   KR     GK+ 
Sbjct: 1    METKTVIAATTCYA-NVIGSYKPKRFAIFSIK-RDPKKRALAQKMIRQWKRDQGVFGKET 58

Query: 218  YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397
              + AS+++ L + R PH+A++L LE+  EGFLP   TLSA+MLCYAD+GL   AQ IW+
Sbjct: 59   CADCASLIQTLCKHRRPHLAEELLLELKCEGFLPDNRTLSAMMLCYADSGLLPQAQAIWE 118

Query: 398  EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577
            E++ SS+VP++                                   +VYS AI CFGKGG
Sbjct: 119  EMLYSSFVPSV-----------------------------------QVYSLAISCFGKGG 143

Query: 578  QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757
            QLELME T+K+MVS GF VDSATGNAF+ YYS  GSL  MEAAY RLKRS++LIE+EGIR
Sbjct: 144  QLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGIR 203

Query: 758  AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937
            AM+ AYIKE KF+ L EFLR VGLGRKN+GNL+WN LLLSY+ANFKMK+LQREF  M EA
Sbjct: 204  AMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLEA 263

Query: 938  GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117
            GF PDL+TFNIRALAFSRM+L WDLH+ LEHMKH+KV PDLVTYGC+VDA+LDR++ RNL
Sbjct: 264  GFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRNL 323

Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297
             FAL+KM+ ++SP++STD  VFEV GKGDFHSSSEAF+E  RQR WTYR+LI +YLRK++
Sbjct: 324  EFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQH 383

Query: 1298 RSNQLFWNY 1324
            RS  +FWNY
Sbjct: 384  RSKHIFWNY 392


>ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like
            [Glycine max]
          Length = 415

 Score =  470 bits (1210), Expect = e-130
 Identities = 240/418 (57%), Positives = 300/418 (71%), Gaps = 2/418 (0%)
 Frame = +2

Query: 77   SLSLFKCLRPKFHALYSHQNKDSVGRSVARE--IIWHQKRQASATGKDIYLNFASVLRLL 250
            SLSL  C  P          KDS   S  ++  +IW Q  +    GKD  ++ +S+ +  
Sbjct: 5    SLSLPSCRMPILL-------KDSHSGSPQQQNKLIWWQNEKGVIGGKDNSVDCSSLAQNS 57

Query: 251  GRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNI 430
             RKRM H +     ++  EG++P   +L   ML Y +NG F  AQT+W++++NSS+VP++
Sbjct: 58   SRKRMIHQSDGSLHDIKVEGYMPKQTSLCVSMLYYTENGFFPQAQTLWEQLVNSSFVPSV 117

Query: 431  KVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKE 610
            + +S L +AY K  +FD V  ILR V +R+F + P+VY  AI CFG+ GQLELME    E
Sbjct: 118  QFISRLFDAYAKHRKFDVVIDILRYVDMRNFSILPDVYWLAISCFGREGQLELMEDMANE 177

Query: 611  MVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGK 790
            M S+G  + S T NAFL YYS FG+L  ME  YGRLK+S+ LIEKE IRA+ASAYIKE K
Sbjct: 178  MASSGVHIYSRTANAFLLYYSLFGTLEEMENTYGRLKKSRFLIEKEVIRAVASAYIKERK 237

Query: 791  FHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNI 970
            F++LGEFLR VGL RKNVGNLLWN +LLSYAANFKMKSLQREF  M E+GF PD++TFNI
Sbjct: 238  FYELGEFLRDVGLRRKNVGNLLWNLMLLSYAANFKMKSLQREFIGMVESGFRPDITTFNI 297

Query: 971  RALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTND 1150
            RALAFSRMALFWDLH+S+EHM+H K++PDLVT+GCVVDA+LDR++GRNL+FALNKMN +D
Sbjct: 298  RALAFSRMALFWDLHLSIEHMEHTKIIPDLVTFGCVVDAYLDRRLGRNLDFALNKMNLDD 357

Query: 1151 SPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            SP + TD  V+E LGKG F  SSEAF E   QR WTYR LI  YL+K YR NQ+FWNY
Sbjct: 358  SPRLLTDPFVYEALGKGGFQMSSEAFFEYKTQRKWTYRSLIQKYLKKHYRKNQIFWNY 415


>ref|XP_007134710.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|593265068|ref|XP_007134712.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007755|gb|ESW06704.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007757|gb|ESW06706.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 423

 Score =  469 bits (1207), Expect = e-129
 Identities = 227/381 (59%), Positives = 290/381 (76%)
 Frame = +2

Query: 182  QKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYAD 361
            +  + ++ G    ++ +S+L+   RKRM   +  ++ +   +G++P   +L  LML Y +
Sbjct: 43   RNEKGASGGMHSSVDSSSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTE 102

Query: 362  NGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEV 541
            NGLF  AQT W++++ SS+VP+++ +S L +AY K G+FD V  ILR V +R+F + P V
Sbjct: 103  NGLFPLAQTTWEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNV 162

Query: 542  YSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLK 721
            YS AICCFG+ GQLELME   KEM S G  V S TGNAF+ YYS FGSL  ME AYGRLK
Sbjct: 163  YSLAICCFGREGQLELMEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLK 222

Query: 722  RSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMK 901
            +S+ LIE+E IRAMASAY +E +F++LGEF+R VGLGRK++GNLLWN +LLSYA NFKMK
Sbjct: 223  KSRFLIEREVIRAMASAYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMK 282

Query: 902  SLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVV 1081
            SLQ+EF +M E+GF PD++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT+GCVV
Sbjct: 283  SLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVV 342

Query: 1082 DAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTY 1261
            DA+LDR +GRNLNFALNKMN +DSP++ TD  V+E LGKGDF  SSEAF E    R WTY
Sbjct: 343  DAYLDRGLGRNLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTY 402

Query: 1262 RKLIALYLRKKYRSNQLFWNY 1324
            R LI  YL+K YR NQ+FWNY
Sbjct: 403  RALIQKYLKKHYRRNQIFWNY 423


>ref|XP_007134709.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris]
            gi|593265066|ref|XP_007134711.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007754|gb|ESW06703.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
            gi|561007756|gb|ESW06705.1| hypothetical protein
            PHAVU_010G069800g [Phaseolus vulgaris]
          Length = 372

 Score =  468 bits (1205), Expect = e-129
 Identities = 226/365 (61%), Positives = 283/365 (77%)
 Frame = +2

Query: 230  ASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIIN 409
            +S+L+   RKRM   +  ++ +   +G++P   +L  LML Y +NGLF  AQT W++++ 
Sbjct: 8    SSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQLLY 67

Query: 410  SSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLEL 589
            SS+VP+++ +S L +AY K G+FD V  ILR V +R+F + P VYS AICCFG+ GQLEL
Sbjct: 68   SSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQLEL 127

Query: 590  METTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMAS 769
            ME   KEM S G  V S TGNAF+ YYS FGSL  ME AYGRLK+S+ LIE+E IRAMAS
Sbjct: 128  MEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMAS 187

Query: 770  AYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSP 949
            AY +E +F++LGEF+R VGLGRK++GNLLWN +LLSYA NFKMKSLQ+EF +M E+GF P
Sbjct: 188  AYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRP 247

Query: 950  DLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFAL 1129
            D++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT+GCVVDA+LDR +GRNLNFAL
Sbjct: 248  DITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFAL 307

Query: 1130 NKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQ 1309
            NKMN +DSP++ TD  V+E LGKGDF  SSEAF E    R WTYR LI  YL+K YR NQ
Sbjct: 308  NKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQ 367

Query: 1310 LFWNY 1324
            +FWNY
Sbjct: 368  IFWNY 372


>gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis arenosa]
          Length = 419

 Score =  467 bits (1201), Expect = e-129
 Identities = 233/416 (56%), Positives = 312/416 (75%)
 Frame = +2

Query: 77   SLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGR 256
            +LS    L+P+   L S    DS   S+AR++I   K     + K   +++A +++ L +
Sbjct: 5    NLSHHLSLKPQHLKLLSCYT-DSSAPSIARKLIKESKLSREFSRKIQIVDYAPLVQTLSQ 63

Query: 257  KRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKV 436
            +R+P +A +++++  S   LP Y TL ALMLC+A+NG    A+TIWDEI+NSS+VP++ V
Sbjct: 64   RRLPDVAHEIFIQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEILNSSFVPDVFV 123

Query: 437  VSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMV 616
            VS+LI AY ++G FD V++I ++V  R   L P VYS AI CFGK GQLELME  ++EM 
Sbjct: 124  VSKLISAYEQLGFFDEVAKITKDVAARHSTLLPVVYSLAISCFGKNGQLELMEGVIEEMD 183

Query: 617  SNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFH 796
            S G  +DSAT NA ++Y+S FG+L  +E AYGRLK+  I+IE+E IRA+  AY+K+ KF+
Sbjct: 184  SKGMSLDSATANAIVRYFSFFGTLDKIEHAYGRLKKFGIVIEEEEIRAVLLAYLKQRKFY 243

Query: 797  KLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRA 976
            +L EFL  VGLGR+N+GN+LWN +LLSYAA FKMKSLQREF  M +AGFSPDL+TFNIRA
Sbjct: 244  RLREFLSDVGLGRRNLGNMLWNSVLLSYAAEFKMKSLQREFIEMLDAGFSPDLTTFNIRA 303

Query: 977  LAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSP 1156
            LAFSRMALFWDLH++LEHM+   +VPDLVT+GCVVDA++D+++ RNL F  N+MN +DSP
Sbjct: 304  LAFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEFVYNQMNLDDSP 363

Query: 1157 LVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            +V TD L FEVLGKGDFH SSEA LE + ++NWTYRKLI +Y++KK R +Q+FWNY
Sbjct: 364  VVLTDPLAFEVLGKGDFHLSSEAVLEFSTEKNWTYRKLIGVYVKKKLRRDQIFWNY 419


>ref|XP_007132326.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris]
            gi|593195390|ref|XP_007132327.1| hypothetical protein
            PHAVU_011G085400g [Phaseolus vulgaris]
            gi|561005326|gb|ESW04320.1| hypothetical protein
            PHAVU_011G085400g [Phaseolus vulgaris]
            gi|561005327|gb|ESW04321.1| hypothetical protein
            PHAVU_011G085400g [Phaseolus vulgaris]
          Length = 411

 Score =  465 bits (1196), Expect = e-128
 Identities = 226/386 (58%), Positives = 291/386 (75%)
 Frame = +2

Query: 167  EIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALM 346
            ++IW +  + +  G    ++ +S+++   RKRM   +  ++ +   +G++P   +L  LM
Sbjct: 26   QMIWWRNEKGAFGGMHSSVDSSSLVQNNSRKRMFPQSDGVFHDTKDDGYMPKQTSLCVLM 85

Query: 347  LCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFK 526
            L Y +NGLF  AQT W++++ SS+VP+++ +S L +AY K G+FD V  ILR V +R+F 
Sbjct: 86   LYYTENGLFPQAQTTWEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFS 145

Query: 527  LCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAA 706
            + P VYS AI CFG+ GQLELME   KEM S G  + S T NAF+ YYS FGSL  ME A
Sbjct: 146  ILPNVYSLAISCFGREGQLELMEDMAKEMASRGVHISSKTANAFVLYYSIFGSLKDMENA 205

Query: 707  YGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAA 886
            YGRLK+S+ LIE+E IRAMASAY +E +F++LGEFLR VGL RK+VGNLLWN +LLSYAA
Sbjct: 206  YGRLKKSRFLIEREVIRAMASAYTRERQFYELGEFLRDVGLVRKDVGNLLWNLMLLSYAA 265

Query: 887  NFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVT 1066
            NFKMKSLQ+EF +M E+GF PD++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT
Sbjct: 266  NFKMKSLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVT 325

Query: 1067 YGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQ 1246
            +GCVVDA+LDR +G+NLNFALNKMN +DSP++ TD  V+E LGKGDF  SSEAF E    
Sbjct: 326  FGCVVDAYLDRGLGKNLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTH 385

Query: 1247 RNWTYRKLIALYLRKKYRSNQLFWNY 1324
            R WTYR LI  YL+K YR NQ+FWNY
Sbjct: 386  RKWTYRALIQKYLKKHYRRNQIFWNY 411


>ref|NP_566863.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546757|sp|Q9M2A1.2|PP263_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g42630 gi|332644221|gb|AEE77742.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 415

 Score =  463 bits (1191), Expect = e-127
 Identities = 234/415 (56%), Positives = 309/415 (74%)
 Frame = +2

Query: 80   LSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRK 259
            LSL   L+P+   L S    DS   S+A+++I   K     + K   +++A +++ L ++
Sbjct: 2    LSLNLSLKPQHLKLLSCYT-DSSAPSIAKKLIKESKLSRDFSQKIQIVDYAPLVQTLSQR 60

Query: 260  RMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVV 439
            R+P +A +++L+  S   LP Y TL ALMLC+A+NG    A+TIWDEIINS +VP++ VV
Sbjct: 61   RLPDVAHEIFLQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEIINSCFVPDVFVV 120

Query: 440  SELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVS 619
            S+LI AY + G FD V++I ++V  R  KL P V S AI CFGK GQLELME  ++EM S
Sbjct: 121  SKLISAYEQFGCFDEVAKITKDVAARHSKLLPVVSSLAISCFGKNGQLELMEGVIEEMDS 180

Query: 620  NGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHK 799
             G  +++ T N  ++YYS FGSL  ME AYGR+K+  I+IE+E IRA+  AY+K+ KF++
Sbjct: 181  KGVLLEAETANVIVRYYSFFGSLDKMEKAYGRVKKFGIVIEEEEIRAVVLAYLKQRKFYR 240

Query: 800  LGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRAL 979
            L EFL  VGLGR+N+GN+LWN +LLSYAA+FKMKSLQREF  M +AGFSPDL+TFNIRAL
Sbjct: 241  LREFLSDVGLGRRNLGNMLWNSVLLSYAADFKMKSLQREFIGMLDAGFSPDLTTFNIRAL 300

Query: 980  AFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPL 1159
            AFSRMALFWDLH++LEHM+   +VPDLVT+GCVVDA++D+++ RNL F  N+MN +DSPL
Sbjct: 301  AFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEFVYNRMNLDDSPL 360

Query: 1160 VSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324
            V TD L FEVLGKGDFH SSEA LE + ++NWTYRKLI +YL+KK R +Q+FWNY
Sbjct: 361  VLTDPLAFEVLGKGDFHLSSEAVLEFSPRKNWTYRKLIGVYLKKKLRRDQIFWNY 415

