BLASTX nr result

ID: Sinomenium22_contig00027226 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00027226
         (2537 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containi...   593   e-167
ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containi...   584   e-164
ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containi...   582   e-163
ref|XP_007012633.1| Pentatricopeptide repeat superfamily protein...   562   e-157
gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]     560   e-156
ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containi...   556   e-155
ref|XP_002512275.1| pentatricopeptide repeat-containing protein,...   556   e-155
ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citr...   554   e-155
ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containi...   553   e-154
gb|EYU18527.1| hypothetical protein MIMGU_mgv1a003955mg [Mimulus...   531   e-148
ref|XP_006381622.1| pentatricopeptide repeat-containing family p...   526   e-146
ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containi...   517   e-144
ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containi...   517   e-143
ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139
ref|XP_002885810.1| pentatricopeptide repeat-containing protein ...   500   e-138
dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]                      499   e-138
ref|NP_178657.1| pentatricopeptide repeat-containing protein [Ar...   499   e-138
ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutr...   499   e-138
ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Caps...   497   e-137
ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containi...   492   e-136

>ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Vitis vinifera]
          Length = 641

 Score =  593 bits (1530), Expect = e-167
 Identities = 301/545 (55%), Positives = 391/545 (71%), Gaps = 4/545 (0%)
 Frame = -3

Query: 2235 VASGTLFVASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFS 2056
            V +  + +A FH+HAV +        +  EVI+N + WIVKV+ TLC    S     L +
Sbjct: 45   VRASKIAIAQFHEHAVGISR------NRPEVIQNPENWIVKVICTLCVRTHS-----LDA 93

Query: 2055 CLEYFSKSFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGL 1876
            CL+YFSK+  PSI F VV  L NP+LAL+FF+  + N   + H  +TY+FL++SL + G 
Sbjct: 94   CLDYFSKTLTPSIAFEVVRGLNNPELALKFFQLSRVNLN-LCHSFRTYSFLLRSLSEMGF 152

Query: 1875 HEAASKLV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNN 1708
            HE+A  +      DG+ PD SV  FLVSS   AGK + A+       +  ++ +  +YN 
Sbjct: 153  HESAKAVYDCMNIDGHSPDASVLGFLVSSATDAGKFNIAR-----TWVDGVEFSLVVYNK 207

Query: 1707 LLNILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSF 1528
            LLN LV  N+V EAVCFF++Q + L    D+CSFNI++RGLCR+GK+D AFELFN M  F
Sbjct: 208  LLNQLVRGNQVDEAVCFFREQ-MGLHGPFDSCSFNILIRGLCRIGKVDKAFELFNEMRGF 266

Query: 1527 NCSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMED 1348
             CSPDV+TYNTLI+GFCR   V+RG++LL ++ SK   SPDVVTYTS+ISGYCKLG+ME 
Sbjct: 267  GCSPDVITYNTLINGFCRVNEVDRGHDLLKELLSKNDLSPDVVTYTSIISGYCKLGKMEK 326

Query: 1347 ASALLDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLID 1168
            AS L + MI+ GI PN+FTFN+LINGFGKVG+M SA  MYE ML  GC PD++TFTSLID
Sbjct: 327  ASILFNNMISSGIKPNAFTFNILINGFGKVGDMVSAENMYEEMLLLGCPPDIITFTSLID 386

Query: 1167 GYCQTGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVP 988
            G+C+TG +E  +KLW+E+  +N+SPN YTF++L NALCK NRL+EAR  LR L WR++V 
Sbjct: 387  GHCRTGKVERSLKLWHELNARNLSPNEYTFAILTNALCKENRLHEARGFLRDLKWRHIVA 446

Query: 987  RPFIYNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLF 808
            +PF+YNPVIDGFCKAGNVDEANVIL EMEEK+C PDK T+T LIIGHCMKGR++EAIS+F
Sbjct: 447  QPFMYNPVIDGFCKAGNVDEANVILAEMEEKRCKPDKITYTILIIGHCMKGRLSEAISIF 506

Query: 807  HKMSSIGCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSM 628
            ++M   GCAPD+IT+++ ISCLLKAGMP EA +IM   + +D + G  S ++      + 
Sbjct: 507  NRMLGTGCAPDSITMTSLISCLLKAGMPNEAYRIM-QIASEDFNLGLKSLKRNVPLRTNT 565

Query: 627  DIPAA 613
            DIP A
Sbjct: 566  DIPVA 570


>ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  584 bits (1505), Expect = e-164
 Identities = 306/539 (56%), Positives = 385/539 (71%), Gaps = 5/539 (0%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A F+  A  V     F   +RE+IR+++AW+VKVV TL        S  L +C  Y S+
Sbjct: 20   IAQFYSLADSVSRARPF--CDREIIRHSEAWLVKVVCTLFF-----RSHSLNACFGYLSR 72

Query: 2034 SFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKL 1855
            + NPSI F V+ R  +P L L+FFEF + +   INH   TY+ L+++LC+ GL+++A K+
Sbjct: 73   NLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLS-INHTFNTYDLLMRNLCKVGLNDSA-KI 130

Query: 1854 VFD-----GYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILV 1690
            VFD     G  PD S+ + LVSS A+ GKLD+AK  L +     IKV+ F+YNNLLN+LV
Sbjct: 131  VFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLV 190

Query: 1689 GNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDV 1510
              N V EAV  F++ +L   F PD  SFNI++RGLCR+G+ID AFE F  MG+F C PD+
Sbjct: 191  KQNLVDEAVLLFRE-HLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDI 249

Query: 1509 VTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLD 1330
            V+YNTLI+GFCR   +++G++LL +     G SPDV+TYTS+ISGYCKLG+M+ AS L D
Sbjct: 250  VSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFD 309

Query: 1329 EMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTG 1150
            EM++ GI PN FTFNVLI+GFGKVGNMRSAM MYE ML  GC+PDVVTFTSLIDGYC+ G
Sbjct: 310  EMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREG 369

Query: 1149 AIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYN 970
             + +G+KLW EM  +N+SPN YT++VLINALCK NR+ EAR+ LR L    VVP+PFIYN
Sbjct: 370  EVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYN 429

Query: 969  PVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSI 790
            PVIDGFCKAG VDEAN I+ EM+EKKC PDK TFT LIIG+CMKGRM EAIS F+KM  I
Sbjct: 430  PVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEI 489

Query: 789  GCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAA 613
             C PD ITI++ ISCLLKAGMP EA+QI     +K L+ G SS     T  KS  +PAA
Sbjct: 490  NCVPDEITINSLISCLLKAGMPNEASQIKQAALQK-LNLGLSSLGSPLT-RKSSRVPAA 546


>ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  582 bits (1501), Expect = e-163
 Identities = 305/539 (56%), Positives = 384/539 (71%), Gaps = 5/539 (0%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A F+  A  V     F   +RE+IR+++AW+VKVV TL        S  L +C  Y S+
Sbjct: 20   IAQFYSLADSVSRARPF--CDREIIRHSEAWLVKVVCTLFF-----RSHSLNACFGYLSR 72

Query: 2034 SFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKL 1855
            + NPSI F V+ R  +P L L+FFEF + +   INH   TY+ L+++LC+ GL+++A K+
Sbjct: 73   NLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLS-INHTFNTYDLLMRNLCKVGLNDSA-KI 130

Query: 1854 VFD-----GYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILV 1690
            VFD     G  PD S+ + LVSS A+ GKLD+AK  L +     IKV+ F+YNNLLN+LV
Sbjct: 131  VFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLV 190

Query: 1689 GNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDV 1510
              N V EAV  F++ +L   F PD  SFNI++RGLCR+G+ID AFE F  MG+F C PD+
Sbjct: 191  KQNLVDEAVLLFRE-HLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDI 249

Query: 1509 VTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLD 1330
            V+YNTLI+GFCR   +++G++LL +     G SPDV+TYTS+ISGYCKLG+M+ AS L D
Sbjct: 250  VSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFD 309

Query: 1329 EMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTG 1150
            EM++ GI PN FTFNVLI+GFGKVGNMRSAM MYE ML  GC+PDVVTFTSLIDGYC+ G
Sbjct: 310  EMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREG 369

Query: 1149 AIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYN 970
             + +G+KLW EM  +N+SPN YT++VLINALCK NR+ EAR+ LR L    VVP+PFIYN
Sbjct: 370  EVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYN 429

Query: 969  PVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSI 790
            PVIDGFCKAG VDEAN I+ EM+EKKC PDK TFT LIIG+CMKGRM EAIS F+KM  I
Sbjct: 430  PVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEI 489

Query: 789  GCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAA 613
             C PD ITI++ ISCLLKAGMP EA+QI     +K L+ G SS     T  KS  +P A
Sbjct: 490  NCVPDEITINSLISCLLKAGMPNEASQIKQAALQK-LNLGLSSLGSPLT-RKSSRVPVA 546


>ref|XP_007012633.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508782996|gb|EOY30252.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 592

 Score =  562 bits (1449), Expect = e-157
 Identities = 293/548 (53%), Positives = 385/548 (70%), Gaps = 7/548 (1%)
 Frame = -3

Query: 2232 ASGTLFVASFHDHAVQVCSQPDFRISNREV--IRNTDAWIVKVVSTLCTLGGSKSSTDLF 2059
            A+  +F+  FH   +Q    P  +  N+EV  I+  +AW VKVV TL     S+   D  
Sbjct: 57   AASKVFIPHFH---IQFHGGPHPQ-GNKEVKAIQKHEAWFVKVVCTLFVY--SQPLDD-- 108

Query: 2058 SCLEYFSKSFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNG 1879
            SCL Y SK+  P I F VV  L NP L L+F EF + N   I H   TYN L++S C  G
Sbjct: 109  SCLSYLSKNLTPLIEFEVVKWLNNPALGLKFLEFSRVNFN-IAHSFWTYNLLMRSFCHMG 167

Query: 1878 LHEAASKLVFD-----GYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMY 1714
            LH++A KLVFD     G+ PD ++  F++SS  +AG+   AK+LL      ++ ++ F  
Sbjct: 168  LHDSA-KLVFDYMRIDGHLPDTTILGFMISSFGRAGEFGMAKKLLADVQSDEVVISIFAL 226

Query: 1713 NNLLNILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMG 1534
            NNLLN++V  N++ EAV  +++ NL   F PD  +FNI++RGLCR+GK+D AFELFN MG
Sbjct: 227  NNLLNMMVKQNKLEEAVSLYKE-NLGSNFYPDAWTFNILIRGLCRVGKVDQAFELFNDMG 285

Query: 1533 SFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEM 1354
            SF C PD+VTYNT+I+G C+   V+RG++LL Q+QS+   SPDVVTYTSVISGYCKLG+M
Sbjct: 286  SFGCFPDIVTYNTIINGLCKVNEVDRGHKLLNQVQSRDDCSPDVVTYTSVISGYCKLGKM 345

Query: 1353 EDASALLDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSL 1174
            ++ASAL  EMI+ G +P   TFNVLI+GFGKVG+M SA +MYE M S GC+ DVVTFTSL
Sbjct: 346  DEASALFHEMISSGTVPTVVTFNVLIDGFGKVGDMVSAKSMYEQMASFGCIADVVTFTSL 405

Query: 1173 IDGYCQTGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNV 994
            IDGYC+ G + + ++LWN M  +++SPN YTF++ INALCK NRL+EAR  LR+L  RN+
Sbjct: 406  IDGYCRIGDVNQSLQLWNTMKGRDLSPNVYTFAITINALCKENRLHEARGFLRELQCRNI 465

Query: 993  VPRPFIYNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAIS 814
            VP+PFI+NPVIDGFCKAGN+DEAN+I+ EMEEK+C+PDK TFT LIIGHCMKGRM EAIS
Sbjct: 466  VPKPFIFNPVIDGFCKAGNLDEANLIVAEMEEKQCHPDKVTFTILIIGHCMKGRMFEAIS 525

Query: 813  LFHKMSSIGCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTK 634
            +F+KM S+GC PD +T+++ ISCLLKAGMP EA++I    + +D+  G S          
Sbjct: 526  IFNKMLSVGCTPDDVTVNSLISCLLKAGMPSEASRI-TKMASEDMKLGSSLLENNSPLRI 584

Query: 633  SMDIPAAA 610
            +  +P AA
Sbjct: 585  NRGVPVAA 592


>gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]
          Length = 570

 Score =  560 bits (1442), Expect = e-156
 Identities = 289/510 (56%), Positives = 365/510 (71%), Gaps = 6/510 (1%)
 Frame = -3

Query: 2166 FRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGN 1987
            F    +EVI  ++AW VKVVSTL        S  L +   Y SK   PSI F V+ RL N
Sbjct: 61   FHHKRKEVISYSEAWFVKVVSTLFV-----RSQSLNTFFGYLSKKLTPSISFEVIKRLNN 115

Query: 1986 -PKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGS 1825
             P L L+FFE  +AN   +NH   TYN L++SLCQ G H++A K VFD     G+ PD S
Sbjct: 116  NPNLGLKFFELSRANLS-VNHSFSTYNLLIRSLCQMGFHDSA-KFVFDCMRIDGHSPDNS 173

Query: 1824 VFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQ 1645
              +FLV   A+ GKLD+ ++LL      +I+ + F+Y++L N+LV NN+V+EAVC F+ Q
Sbjct: 174  TIEFLVCVFAKVGKLDSCEKLL-----EEIRASKFVYSSLFNVLVKNNKVYEAVCLFRKQ 228

Query: 1644 NLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKY 1465
             +   F PDT +FNI++ GLC +G++  AFE FN MG F CSPDVVTYNTLI G CR   
Sbjct: 229  -IGSHFVPDTWTFNILIGGLCGVGEVHSAFEFFNDMGKFRCSPDVVTYNTLISGLCRTNE 287

Query: 1464 VNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFN 1285
            V+RG +LL ++Q +  FSP+V T+TSVI GYCKLG ME+ASAL DEM++ G  P + TFN
Sbjct: 288  VDRGCDLLREVQLRGDFSPNVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTFN 347

Query: 1284 VLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTK 1105
            VLI+ F KVG+M SA+A+YE ML +G  PDVVTFTSLIDGYC+ G +  G+KLW EM  +
Sbjct: 348  VLIDAFSKVGDMASAIALYEKMLFHGYRPDVVTFTSLIDGYCRVGQLNRGLKLWCEMSVR 407

Query: 1104 NISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEA 925
            N+SPN YT+SV+I+ALCK NRL+EARDLLRQL   N+VP+PF+YNPVIDGFCKAGNVDEA
Sbjct: 408  NVSPNGYTYSVVIHALCKVNRLHEARDLLRQLNCTNIVPKPFMYNPVIDGFCKAGNVDEA 467

Query: 924  NVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISC 745
            N+I+ EMEEK+CNPDK TFT LI+G+CMKGRM +AI +F+KM ++GCAPD IT+   +SC
Sbjct: 468  NMIVAEMEEKRCNPDKMTFTILILGNCMKGRMVDAIGVFYKMLAVGCAPDKITVHCLMSC 527

Query: 744  LLKAGMPCEANQIMVTFSKKDLDPGFSSSR 655
            LLKAGMP EA  I  T   K L+ G SS R
Sbjct: 528  LLKAGMPNEAFHIKETV-MKSLNVGMSSLR 556


>ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Fragaria vesca subsp. vesca]
          Length = 583

 Score =  556 bits (1433), Expect = e-155
 Identities = 288/502 (57%), Positives = 367/502 (73%), Gaps = 5/502 (0%)
 Frame = -3

Query: 2151 REVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGNPKLAL 1972
            REV+ N +AW VKVV TL        S  L S + Y SK+  PS+ F V+ RL NPKL L
Sbjct: 75   REVLLNPEAWFVKVVYTLFL-----RSHSLDSYVGYLSKNLTPSLAFEVIKRLNNPKLGL 129

Query: 1971 RFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGSVFDFLV 1807
            RFFE  + +   +NH V TY++L++SLCQ GL ++A KLVFD     G  P+ SV +FLV
Sbjct: 130  RFFELSKFSLN-VNHGVWTYHYLLRSLCQMGLQDSA-KLVFDYMRTDGLSPNESVLEFLV 187

Query: 1806 SSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGF 1627
            SSCAQ G+ D A+++L +   S + ++SF+YNNL N+LV  NRV EAVC F+ + +    
Sbjct: 188  SSCAQMGRSDLAEKILDEVHCSVVGLSSFVYNNLFNVLVKLNRVDEAVCLFR-KYVGSYC 246

Query: 1626 CPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYE 1447
            CPD+ +FNI++RGLCR G +D   E F+ M SF CSP+VVTYNTLI G CRA  V+RG +
Sbjct: 247  CPDSWTFNILIRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCD 306

Query: 1446 LLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGF 1267
            LL ++Q ++  SPDV+T+TSVISGYCKLG ME+ASA+ DEMI  G+ P + TFN LI+G+
Sbjct: 307  LLREVQFRSELSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAVTFNALIDGY 366

Query: 1266 GKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNA 1087
            GK G+M SA ++YE+ML +G   DV+TFTSLIDGYC+ G +  G++LW+EM  KN+SP+A
Sbjct: 367  GKAGDMSSAFSLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKNVSPSA 426

Query: 1086 YTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEANVILVE 907
            YTFSVLINALCK NRL EARDLLR+L   NVVP+ F+YNPVIDG CKAGN+DEAN+I+ E
Sbjct: 427  YTFSVLINALCKGNRLCEARDLLRELKGSNVVPKSFLYNPVIDGLCKAGNIDEANLIVAE 486

Query: 906  MEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGM 727
            MEEKKC PD+ TFT LI+G+ MKGRM+EAI  F KM SIGCAPD ITI + ISCL KAGM
Sbjct: 487  MEEKKCTPDRVTFTILILGNSMKGRMSEAIGNFSKMLSIGCAPDKITIDSLISCLSKAGM 546

Query: 726  PCEANQIMVTFSKKDLDPGFSS 661
            P EA +I    + +DL+ G  S
Sbjct: 547  PSEAGRIK-KIAYEDLNMGAPS 567


>ref|XP_002512275.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548236|gb|EEF49727.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 532

 Score =  556 bits (1433), Expect = e-155
 Identities = 293/542 (54%), Positives = 378/542 (69%), Gaps = 7/542 (1%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVI-RNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFS 2038
            +A FHD+       P    S++EVI +N +AW VKV++ L        +T L     Y S
Sbjct: 1    MAHFHDYTKGGGFHP---FSDKEVIVKNQEAWFVKVIAILFVRSHCSDATSL----GYLS 53

Query: 2037 KSFN-PSIVFGVVTRLGN-PKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAA 1864
            +  N P + F V+ RL N P++ L+F EFC+ N   I H   TY  L++SLCQ GLH+  
Sbjct: 54   EKLNDPLVAFEVIKRLNNNPQVGLKFMEFCRLNFSLI-HCFSTYELLIRSLCQMGLHDLV 112

Query: 1863 SKLV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNI 1696
              ++     DG+  D  V  FLV+S AQAGK D AK+L+++    + +++SF+YN LLN 
Sbjct: 113  EMVIGYMRSDGHLIDSRVLGFLVTSFAQAGKFDLAKKLIIEVQGEEARISSFVYNYLLNE 172

Query: 1695 LVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSP 1516
            LV   +VHEA+  F++ NL     P+T +FNI++RGLCR+G+++  FELFN M SF C P
Sbjct: 173  LVKGGKVHEAIFLFKE-NLAFHSPPNTWTFNILIRGLCRVGEVEKGFELFNAMQSFGCLP 231

Query: 1515 DVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASAL 1336
            DVVTYNTLI G C+A  ++R  +LL ++QS+   SPDV+TYTS+ISG+ KLG++E AS L
Sbjct: 232  DVVTYNTLISGLCKANELDRACDLLKEVQSRNDCSPDVMTYTSIISGFRKLGKLEAASVL 291

Query: 1335 LDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQ 1156
             +EMI  GI P   TFNVLI+GFGK+GNM +A AM+E M S  C+PDVVTFTSLIDGYC+
Sbjct: 292  FEEMIRSGIEPTVVTFNVLIDGFGKIGNMVAAEAMHEKMASYSCIPDVVTFTSLIDGYCR 351

Query: 1155 TGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFI 976
            TG I  G+K+W+ M  +N+SPN YT+SV+INALCK NR++EARDLLRQL   +V P+PFI
Sbjct: 352  TGDIRLGLKVWDVMKARNVSPNIYTYSVIINALCKDNRIHEARDLLRQLKCSDVFPKPFI 411

Query: 975  YNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMS 796
            YNPVIDGFCKAGNVDEANVI+ EMEEK+C PDK TFT LIIGHCMKGRM EA+ +F KM 
Sbjct: 412  YNPVIDGFCKAGNVDEANVIVTEMEEKRCRPDKVTFTILIIGHCMKGRMVEALDIFKKML 471

Query: 795  SIGCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPA 616
            +IGCAPD ITIS+ ++CLLKAG P EA  I+ T S +DL+  FSS RK        DI  
Sbjct: 472  AIGCAPDNITISSLVACLLKAGKPSEAFHIVQTAS-EDLNLSFSSLRKTFPMRVKTDISV 530

Query: 615  AA 610
            AA
Sbjct: 531  AA 532


>ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citrus clementina]
            gi|557556032|gb|ESR66046.1| hypothetical protein
            CICLE_v10007804mg [Citrus clementina]
          Length = 595

 Score =  554 bits (1427), Expect = e-155
 Identities = 287/514 (55%), Positives = 366/514 (71%), Gaps = 5/514 (0%)
 Frame = -3

Query: 2136 NTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGNPKLALRFFEF 1957
            + + W VKVV TL       S T    C  Y  +  +P     V+ RL NPKL L+F EF
Sbjct: 90   SNEFWFVKVVCTLLLRSSYLSDT----CARYLCEKLSPLNSLEVIKRLDNPKLGLKFLEF 145

Query: 1956 CQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGSVFDFLVSSCAQ 1792
             + N   +NH  KTYN +++SLC+ GLH++  ++VFD     G+ P+  + +F VSSC +
Sbjct: 146  SRVNLS-LNHSFKTYNLVMRSLCEMGLHDSV-QVVFDYMRSDGHLPNSPMIEFFVSSCIR 203

Query: 1791 AGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCPDTC 1612
            AGK D AK LL Q    ++ +++FMYN+LLN LV  N   EAV  F++   RL   PDT 
Sbjct: 204  AGKCDAAKGLLSQFRPGEVTMSTFMYNSLLNALVKQNNADEAVYMFKEY-FRLYSQPDTW 262

Query: 1611 SFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQI 1432
            +FNI++RGLCR+G++  AFE F  MGSF CSPD+VTYNTLI G CR   V RG+ELL ++
Sbjct: 263  TFNILIRGLCRIGEVKKAFEFFYDMGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEV 322

Query: 1431 QSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGFGKVGN 1252
            + K+ F PDVVTYTSVISGYCKLG+M+ A+++ +EM + GI P++ TFNVLI+GFGKVGN
Sbjct: 323  KFKSEFLPDVVTYTSVISGYCKLGKMDKATSIYNEMNSCGIKPSAVTFNVLIDGFGKVGN 382

Query: 1251 MRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNAYTFSV 1072
            M SA  M E MLS G +PDVVTF+SLIDGYC+ G + +G+KL +EM  KN+SPN YTF++
Sbjct: 383  MVSAEYMRERMLSLGYLPDVVTFSSLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAI 442

Query: 1071 LINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEANVILVEMEEKK 892
            LINALCK NRLN+AR  L+QL W ++VP+PF+YNPVIDGFCKAGNVDEANVI+ EMEEK+
Sbjct: 443  LINALCKENRLNDARRFLKQLKWNDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKR 502

Query: 891  CNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGMPCEAN 712
            C PDK TFT LIIGHCMKGRM EAIS+F+KM  IGCAPD IT+++ ISCLLK GMP EA 
Sbjct: 503  CKPDKVTFTILIIGHCMKGRMVEAISIFNKMLRIGCAPDDITVNSLISCLLKGGMPNEAF 562

Query: 711  QIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAAA 610
            +IM   S +D +    S +K      + DIP AA
Sbjct: 563  RIMQRAS-EDQNLQLPSWKKAVPLRTNTDIPVAA 595


>ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Citrus sinensis]
            gi|568841566|ref|XP_006474729.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Citrus sinensis]
          Length = 595

 Score =  553 bits (1426), Expect = e-154
 Identities = 287/514 (55%), Positives = 367/514 (71%), Gaps = 5/514 (0%)
 Frame = -3

Query: 2136 NTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGNPKLALRFFEF 1957
            + + W VKVV TL       S T    C  Y  +  +P     V+ RL NPKL L+F EF
Sbjct: 90   SNEFWFVKVVCTLLLRSSYLSDT----CARYLCEKLSPLNSLEVIKRLDNPKLGLKFLEF 145

Query: 1956 CQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGSVFDFLVSSCAQ 1792
             + N   +NH  KTYN +++SLC+ GLH++  ++VFD     G+ P+  + +F VSSC +
Sbjct: 146  SRVNLS-LNHSFKTYNLVMRSLCEMGLHDSV-QVVFDYMRSDGHLPNSPMIEFFVSSCIR 203

Query: 1791 AGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCPDTC 1612
            AGK D AK LL Q    ++ +++FMYN+LLN LV  N   EAV  F++   RL   PDT 
Sbjct: 204  AGKCDAAKGLLSQFRPGEVTMSTFMYNSLLNALVKQNNADEAVYMFKEY-FRLYSQPDTW 262

Query: 1611 SFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQI 1432
            +FNI+++GL R+G++  AFE F  MGSF CSPD+VTYNTLI G CR   V RG+ELL ++
Sbjct: 263  TFNILIQGLSRIGEVKKAFEFFYDMGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEV 322

Query: 1431 QSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGFGKVGN 1252
            + K+ FSPDVVTYTSVISGYCKLG+M+ A+ + +EM + GI P++ TFNVLI+GFGKVGN
Sbjct: 323  KFKSEFSPDVVTYTSVISGYCKLGKMDKATGIYNEMNSCGIKPSAVTFNVLIDGFGKVGN 382

Query: 1251 MRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNAYTFSV 1072
            M SA  M E MLS G +PDVVTF+SLIDGYC+ G + +G+KL +EM  KN+SPN YTF++
Sbjct: 383  MVSAEYMRERMLSFGYLPDVVTFSSLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFTI 442

Query: 1071 LINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEANVILVEMEEKK 892
            LINALCK NRLN+AR  L+QL W ++VP+PF+YNPVIDGFCKAGNVDEANVI+ EMEEK+
Sbjct: 443  LINALCKENRLNDARRFLKQLKWNDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKR 502

Query: 891  CNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGMPCEAN 712
            C PDK TFT LIIGHCMKGRM EAIS+F+KM +IGCAPD IT+++ ISCLLK GMP EA 
Sbjct: 503  CKPDKVTFTILIIGHCMKGRMVEAISIFNKMLTIGCAPDDITVNSLISCLLKGGMPNEAF 562

Query: 711  QIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAAA 610
            +IM   S +DL+    S +K      + DIP AA
Sbjct: 563  RIMQRAS-EDLNLQLPSWKKAVPLRTNTDIPVAA 595


>gb|EYU18527.1| hypothetical protein MIMGU_mgv1a003955mg [Mimulus guttatus]
          Length = 552

 Score =  531 bits (1367), Expect = e-148
 Identities = 286/519 (55%), Positives = 361/519 (69%), Gaps = 15/519 (2%)
 Frame = -3

Query: 2124 WIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVV----TRLGNPKLALRFFEF 1957
            W VKVV TLC     +S +  F   +YF  + NPS+ F VV    +RL NP LA  FF  
Sbjct: 42   WFVKVVCTLCI---RRSPSLAFVETDYFRVNLNPSVAFAVVYHINSRLNNPDLAFTFFR- 97

Query: 1956 CQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVF-----DGYFPDGSVFDFLVSSCAQ 1792
            C      + H   T++ L++SLCQ G H++A +LV+     DG+ PD SV DF+VSS A 
Sbjct: 98   CSRLRLNLIHLEPTFDLLLRSLCQMGRHDSA-ELVYQYMKSDGFLPDSSVLDFVVSSFAN 156

Query: 1791 AGKLDTAKRLLV---QACLSKIK-VNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRL-GF 1627
            AGK   A+ +L+   + C  K + V+SF+YNN L++L   NR+ +AV FF+   LRL  F
Sbjct: 157  AGKFRIAEEILIARAEYCNEKDELVSSFVYNNFLSMLTNKNRIDDAVLFFKSHILRLKSF 216

Query: 1626 CPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYE 1447
            CPDTCSFNIV+RGLCR  K+D AFE F+ M SF+CSPD+VTYNTLI+G CR   V+R  E
Sbjct: 217  CPDTCSFNIVMRGLCRASKVDKAFEFFDVMRSFSCSPDLVTYNTLINGLCRVGKVDRAEE 276

Query: 1446 LLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGF 1267
            LL +I+ ++ FS DVVTYTSVISGYCKLG+ + A+ L +EMIN GI PN FTFN +I+GF
Sbjct: 277  LLREIKVQSEFSADVVTYTSVISGYCKLGKTDAAAFLFEEMINNGIRPNLFTFNAIIDGF 336

Query: 1266 GKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNA 1087
            GK G + SA  MYE M + G  PDVVTFTSLIDG+C+ G + +G+ L NEM  K +SPN 
Sbjct: 337  GKKGEVASASKMYERMTATGFRPDVVTFTSLIDGHCRCGDLGQGIHLLNEMNEKRVSPNV 396

Query: 1086 YTFSVLINALCKANRLNEARDLLRQLMWR-NVVPRPFIYNPVIDGFCKAGNVDEANVILV 910
            +TFSVLI+ALCK NRLNEARDLL QL WR ++VP PF+YNPVIDG+CKAGNVDEAN I+ 
Sbjct: 397  FTFSVLISALCKENRLNEARDLLNQLKWREDIVPPPFVYNPVIDGYCKAGNVDEANAIVA 456

Query: 909  EMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAG 730
            EME K C  DK TFT LI+GHCMKGRM EAI +++KM S+GC PD IT+S+ ISCL KAG
Sbjct: 457  EMEAKGCVHDKMTFTILILGHCMKGRMFEAIGMYNKMLSVGCVPDNITMSSLISCLRKAG 516

Query: 729  MPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAA 613
            M  EAN+I     +K L    SSS++      +M++  A
Sbjct: 517  MAREANEI----EQKALFSVSSSSKRSNPVRNNMNVTVA 551


>ref|XP_006381622.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336330|gb|ERP59419.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 511

 Score =  526 bits (1356), Expect = e-146
 Identities = 262/483 (54%), Positives = 340/483 (70%), Gaps = 5/483 (1%)
 Frame = -3

Query: 2046 YFSKSFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEA 1867
            Y  +   P I F V+ R  NPK+  +F EF + N   +NH   TYN L++SLCQ G H+ 
Sbjct: 33   YPDRQLTPLIAFEVIKRFNNPKVGFKFLEFSRLNLN-VNHCYSTYNLLMRSLCQMGHHDL 91

Query: 1866 ASKLVFD-----GYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLL 1702
             + +VFD     G+ PD  +  FLV+  AQA   D  K+LL +    ++++NSF+YNNLL
Sbjct: 92   VN-IVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLLAEVQGKEVRINSFVYNNLL 150

Query: 1701 NILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNC 1522
            ++LV  N+VHEA+  F++        PDT +FNI++RGLCR+G +D AFE+F  M SF C
Sbjct: 151  SVLVKQNQVHEAIYLFKEYLAMQS--PDTWTFNILIRGLCRVGGVDRAFEVFKDMESFGC 208

Query: 1521 SPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDAS 1342
             PDVVTYNTLI+G C+A  V RG EL  +IQS++  SPD+VTYTS+ISG+CK G+M++AS
Sbjct: 209  LPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKSGKMKEAS 268

Query: 1341 ALLDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGY 1162
             L +EM+  GI PN  TFNVLI+GFGK+GN+  A AMY  M    C  DVVTFTSLIDGY
Sbjct: 269  NLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTFTSLIDGY 328

Query: 1161 CQTGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRP 982
            C+ G +  G+K WN M T+N+SP  YT++VLINALCK NRLNEARD L Q+   +++P+P
Sbjct: 329  CRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIKNSSIIPKP 388

Query: 981  FIYNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHK 802
            F+YNPVIDGFCKAGNVDE NVIL EMEEK+C+PDK TFT LIIGHC+KGRM EAI++F++
Sbjct: 389  FMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMFEAINIFNR 448

Query: 801  MSSIGCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDI 622
            M +  CAPD IT+++ ISCLLKAGMP EA +I    + +D + G SS  K      + DI
Sbjct: 449  MLATRCAPDNITVNSLISCLLKAGMPNEAYRIR-KMALEDRNLGLSSFEKAIPLRTNTDI 507

Query: 621  PAA 613
            P A
Sbjct: 508  PVA 510


>ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Solanum tuberosum]
            gi|565370447|ref|XP_006351832.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Solanum tuberosum]
          Length = 550

 Score =  517 bits (1332), Expect = e-144
 Identities = 286/555 (51%), Positives = 372/555 (67%), Gaps = 15/555 (2%)
 Frame = -3

Query: 2232 ASGTLFVASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSC 2053
            AS  L +A FH       S P +      V      W  KVV  LC       S D+F  
Sbjct: 8    ASNILLIARFHG-LTSSKSIPSYGPGPEAV------WFTKVVCLLCF--HHSQSLDVFGS 58

Query: 2052 LEYFSKSFNPSIVFGVV----TRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQ 1885
             +YF ++ +P I F V+    T L NP+LA RF +  + N   + H + ++N L++SL Q
Sbjct: 59   -DYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLV-HCIGSFNLLLRSLSQ 116

Query: 1884 NGLHEAASKLVF-----DGYFPDGSVFDFLVSSCAQAGKLDTAKRLLV-QACLSKIK--- 1732
             G H++A  LVF     DGY  + S+ + +V + A AGK + AK +L+ QA L + +   
Sbjct: 117  MGFHDSAM-LVFKFMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGRI 175

Query: 1731 VNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLG-FCPDTCSFNIVVRGLCRLGKIDMAF 1555
            V  F++N+LL++L+  +RV EAV FF+   LR     PDTC+FN V+RGLCR+G +D AF
Sbjct: 176  VRPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAF 235

Query: 1554 ELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISG 1375
            E FN MGSF C PD VTYNTLI+G C    VNR   LL  ++ + G SPDVVTYTSVI+G
Sbjct: 236  EFFNDMGSFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYTSVIAG 295

Query: 1374 YCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPD 1195
            YCKLG M++A  L+DEM   GI PN  TFN+LINGFGK+G+M SA+ MY  M + G  PD
Sbjct: 296  YCKLGRMDEAINLMDEMTTYGISPNLVTFNILINGFGKIGDMFSAIQMYGRMCAVGYPPD 355

Query: 1194 VVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLR 1015
            VVTFTSLIDGYC+TG +++G+KLW+EM T+N+SPN YTFS+LI+AL K NRLNEAR+LLR
Sbjct: 356  VVTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKENRLNEARELLR 415

Query: 1014 QLMWR-NVVPRPFIYNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMK 838
            QL  R ++VP+PF+YNPV+DGFCKAGN+ +ANVI  EME + C  DK TFT LI+GHCMK
Sbjct: 416  QLKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITFTILILGHCMK 475

Query: 837  GRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSS 658
            GRM EA+++F KM S+GC PD IT+S   SCLLKAGM  EA ++ +T S KDL+P  SSS
Sbjct: 476  GRMLEAMAIFDKMLSLGCVPDDITVSCLTSCLLKAGMVKEAYKVRLTPS-KDLNPDLSSS 534

Query: 657  RKICTSTKSMDIPAA 613
            ++      S+DIP A
Sbjct: 535  KQSVPFRTSLDIPVA 549


>ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Solanum lycopersicum]
          Length = 550

 Score =  517 bits (1331), Expect = e-143
 Identities = 279/519 (53%), Positives = 358/519 (68%), Gaps = 15/519 (2%)
 Frame = -3

Query: 2124 WIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVV----TRLGNPKLALRFFEF 1957
            W  KVV  LC       S D+F   +YF ++ +P I F V+    T L NP+LA RF + 
Sbjct: 37   WFTKVVCLLCF--HHSQSLDVFGS-DYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQC 93

Query: 1956 CQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVF-----DGYFPDGSVFDFLVSSCAQ 1792
             + N   I H + ++N L++SL Q G H++A  LVF     DGY  + S+ + +V + A 
Sbjct: 94   TRINLNLI-HCIGSFNLLLRSLSQMGFHDSAM-LVFKYMKADGYLLENSILESVVLALAN 151

Query: 1791 AGKLDTAKRLLV-QACLSKIK---VNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLG-F 1627
            AGK + AK +L+ QA L + +   V  F++N+LL++L+  +RV EAV FF+   LR    
Sbjct: 152  AGKFEIAKEILISQAELGREEGSIVRPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERL 211

Query: 1626 CPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYE 1447
             PDTC+FN V+RGLCR+G +D AFE FN MGSF CSPD VTYNTLI+G C    VNR   
Sbjct: 212  FPDTCTFNTVIRGLCRVGGVDKAFEFFNDMGSFGCSPDTVTYNTLINGLCAVGQVNRAQG 271

Query: 1446 LLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGF 1267
            LL  +Q + G SPDVVTYTS+ISGYCKL  M++A  L+DEMI  GI PN  TFN+LINGF
Sbjct: 272  LLGNLQLQDGLSPDVVTYTSLISGYCKLSRMDEAINLMDEMITYGISPNLVTFNILINGF 331

Query: 1266 GKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNA 1087
            GK+G+M SA+ MY  M + G  PDVVTFTSLIDGYC+TG +++G+KLW++M ++N+SPN 
Sbjct: 332  GKIGDMFSAIKMYGKMCAVGYPPDVVTFTSLIDGYCRTGELDQGLKLWDDMNSRNLSPNL 391

Query: 1086 YTFSVLINALCKANRLNEARDLLRQLMWR-NVVPRPFIYNPVIDGFCKAGNVDEANVILV 910
            YTFSVLI+AL K NRLNEAR+LLRQL  R ++VP+PF+YNPV+DGFCKAGN+ EANVI  
Sbjct: 392  YTFSVLISALSKENRLNEARELLRQLKSRDDIVPQPFVYNPVLDGFCKAGNLSEANVIAA 451

Query: 909  EMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAG 730
            EME K C  DK TFT LI+GHCMKGRM EA+++F KM S+GC PD ITIS   SCLLKAG
Sbjct: 452  EMESKGCCHDKITFTILILGHCMKGRMLEALAIFDKMLSLGCVPDDITISCLTSCLLKAG 511

Query: 729  MPCEANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAA 613
            M  EA ++ +    KDL+P  S S+       S+DIP A
Sbjct: 512  MVKEAYKVRL-IPSKDLNPDLSPSKLFIPFRTSLDIPVA 549


>ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Glycine max] gi|571448762|ref|XP_006577947.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Glycine max]
            gi|571448764|ref|XP_006577948.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X3 [Glycine max]
            gi|571448766|ref|XP_006577949.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X4 [Glycine max]
          Length = 510

 Score =  503 bits (1296), Expect = e-139
 Identities = 259/515 (50%), Positives = 347/515 (67%), Gaps = 5/515 (0%)
 Frame = -3

Query: 2142 IRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGNPKLALRFF 1963
            IR  +AW VK+    CT+    +S D F  + YFSK   PS+V+ VV RL  P L  +F 
Sbjct: 4    IRRAEAWFVKIA---CTVFVRSNSLDPF--VGYFSKHLTPSLVYEVVNRLHIPNLGFKFV 58

Query: 1962 EFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGSVFDFLVSSC 1798
            EFC+     ++H   TY+ L++SLC++ LH  A K+V+D     G  PD  +  FLV S 
Sbjct: 59   EFCRHKLH-MSHSYLTYSLLLRSLCRSNLHHTA-KVVYDWMRCDGQIPDNRLLGFLVWSY 116

Query: 1797 AQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCPD 1618
            A  G+LD ++ LL     + + VN+ +YN+L N+L+  N+V +AV  F++  +RL + P 
Sbjct: 117  AIVGRLDVSRELLADVQCNNVGVNAVVYNDLFNVLIRQNKVVDAVVLFREL-IRLRYKPV 175

Query: 1617 TCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLT 1438
            T + NI++RGLCR G+ID AF L N + SF C PDV+TYNTLI G CR   V+R   LL 
Sbjct: 176  TYTVNILMRGLCRAGEIDEAFRLLNDLRSFGCLPDVITYNTLIHGLCRINEVDRARSLLK 235

Query: 1437 QIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGFGKV 1258
            ++     F+PDVV+YT++ISGYCK  +ME+ + L  EMI  G  PN+FTFN LI GFGK+
Sbjct: 236  EVCLNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGKL 295

Query: 1257 GNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNAYTF 1078
            G+M SA+A+YE ML  GCVPDV TFTSLI+GY + G + + M +W++M  KNI    YTF
Sbjct: 296  GDMASALALYEKMLVQGCVPDVATFTSLINGYFRLGQVHQAMDMWHKMNDKNIGATLYTF 355

Query: 1077 SVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEANVILVEMEE 898
            SVL++ LC  NRL++ARD+LR L   ++VP+PFIYNPVIDG+CK+GNVDEAN I+ EME 
Sbjct: 356  SVLVSGLCNNNRLHKARDILRLLNESDIVPQPFIYNPVIDGYCKSGNVDEANKIVAEMEV 415

Query: 897  KKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGMPCE 718
             +C PDK TFT LIIGHCMKGRM EAI +FHKM ++GCAPD IT++   SCLLKAGMP E
Sbjct: 416  NRCKPDKLTFTILIIGHCMKGRMPEAIGIFHKMLAVGCAPDEITVNNLRSCLLKAGMPGE 475

Query: 717  ANQIMVTFSKKDLDPGFSSSRKICTSTKSMDIPAA 613
            A ++    + ++L  G +SS+K    T +  IP A
Sbjct: 476  AARVKKVLA-QNLTLGITSSKKSYHETTNESIPVA 509


>ref|XP_002885810.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297331650|gb|EFH62069.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 536

 Score =  500 bits (1287), Expect = e-138
 Identities = 260/531 (48%), Positives = 351/531 (66%), Gaps = 5/531 (0%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A FH H+         + + RE I   +AW+VK+VSTL       S      C  Y SK
Sbjct: 10   IAHFHTHSHGGAQARPIQNNTREKIHCPEAWLVKIVSTLFVYRVPDSDL----CFCYLSK 65

Query: 2034 SFNPSIVFGVVTRL-GNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASK 1858
            + NP I F VV +L  NP +  RF+EF +     I H   TYN L +SLC+ G+H+ A +
Sbjct: 66   NLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN-IRHSFWTYNLLTRSLCKAGMHDLAGQ 124

Query: 1857 LV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILV 1690
            +      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++    + N+LLN LV
Sbjct: 125  MFECMKSDGISPNSRLLGFLVSSFAEKGKLHCATALLLQSY--EVEGCCMVVNSLLNTLV 182

Query: 1689 GNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDV 1510
              +RV +A+  F++ +LR   C DT +FNI++RGLC +GK + A EL  GM  F C PD+
Sbjct: 183  KLDRVEDAMKLFEE-HLRFQSCNDTKTFNILIRGLCGVGKAEKAVELLGGMSGFGCLPDI 241

Query: 1509 VTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLD 1330
            VTYNTLI GFC++  + +  E+   ++S +G SPDVVTYTS+ISGYCK G+M++AS LLD
Sbjct: 242  VTYNTLIKGFCKSNELKKANEMFDDVKSSSGCSPDVVTYTSMISGYCKAGKMQEASVLLD 301

Query: 1329 EMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTG 1150
            +M+  GI P + TFNVL++G+ K G M +A  +   M+S GC PDVVTFTSLIDGYC+ G
Sbjct: 302  DMLRLGIYPTNVTFNVLVDGYAKAGEMHTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVG 361

Query: 1149 AIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYN 970
             + +G +LW EM  + + PNA+T+S+LINALCK NRL +AR+LL QL  ++++P+PF+YN
Sbjct: 362  QVNQGFRLWEEMNARGMFPNAFTYSILINALCKENRLLKARELLGQLASKDIIPQPFMYN 421

Query: 969  PVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSI 790
            PVIDGFCKAG V+EA VI+ EME+KKC PDK TFT LIIGHCMKGRM EA+S+FHKM +I
Sbjct: 422  PVIDGFCKAGKVNEAIVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAI 481

Query: 789  GCAPDTITISAFISCLLKAGMPCEANQIMVTFSKKDLDPGFSSSRKICTST 637
            GC+PD IT+S+ +SCLLKAGM  EA  +     K  ++ G     K    T
Sbjct: 482  GCSPDKITVSSLLSCLLKAGMAKEAYHLNQIAHKGQINDGAPLETKTANVT 532


>dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]
          Length = 536

 Score =  499 bits (1286), Expect = e-138
 Identities = 261/511 (51%), Positives = 347/511 (67%), Gaps = 8/511 (1%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A FH H+         + + REVI   +AW+VK+VSTL       S      C  Y SK
Sbjct: 10   IAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDL----CFCYLSK 65

Query: 2034 SFNPSIVFGVVTRL-GNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASK 1858
            + NP I F VV +L  NP +  RF+EF +     I H   TYN L +SLC+ GLH+ A +
Sbjct: 66   NLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN-IRHSFWTYNLLTRSLCKAGLHDLAGQ 124

Query: 1857 LV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILV 1690
            +      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++    + N+LLN LV
Sbjct: 125  MFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSF--EVEGCCMVVNSLLNTLV 182

Query: 1689 GNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDV 1510
              +RV +A+  F D++LR   C DT +FNI++RGLC +GK + A EL   M  F C PD+
Sbjct: 183  KLDRVEDAMKLF-DEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCEPDI 241

Query: 1509 VTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLD 1330
            VTYNTLI GFC++  +N+  E+   ++S +  SPDVVTYTS+ISGYCK G+M +AS+LLD
Sbjct: 242  VTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASSLLD 301

Query: 1329 EMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTG 1150
            +M+  GI P + TFNVL++G+ K G M +A  +   M+S GC PDVVTFTSLIDGYC+ G
Sbjct: 302  DMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVG 361

Query: 1149 AIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYN 970
             + +G +LW EM  + + PNA+T+S+LINALC  NRL +AR+LL QL  ++++P+PF+YN
Sbjct: 362  QVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPFMYN 421

Query: 969  PVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSI 790
            PVIDGFCKAG V+EANVI+ EME+KKC PDK TFT LIIGHCMKGRM EA+S+FHKM +I
Sbjct: 422  PVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAI 481

Query: 789  GCAPDTITISAFISCLLKAGMPCEA---NQI 706
            GC+PD IT+S+ +SCLLKAGM  EA   NQI
Sbjct: 482  GCSPDKITVSSLLSCLLKAGMAKEAYHLNQI 512



 Score =  146 bits (369), Expect = 4e-32
 Identities = 105/392 (26%), Positives = 169/392 (43%), Gaps = 33/392 (8%)
 Frame = -3

Query: 1749 CLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGK 1570
            C     +N F+   ++  L  +N  H    F++    +L       ++N++ R LC+ G 
Sbjct: 61   CYLSKNLNPFISFEVVKKL--DNNPHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGL 118

Query: 1569 IDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSP------ 1408
             D+A ++F  M S   SP+      L+  F     ++    LL Q     G         
Sbjct: 119  HDLAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCMVVNSLL 178

Query: 1407 --------------------------DVVTYTSVISGYCKLGEMEDASALLDEMINKGIM 1306
                                      D  T+  +I G C +G+ E A  LL  M   G  
Sbjct: 179  NTLVKLDRVEDAMKLFDEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCE 238

Query: 1305 PNSFTFNVLINGFGKVGNMRSAMAMYENMLSNG-CVPDVVTFTSLIDGYCQTGAIEEGMK 1129
            P+  T+N LI GF K   +  A  M++++ S   C PDVVT+TS+I GYC+ G + E   
Sbjct: 239  PDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASS 298

Query: 1128 LWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFC 949
            L ++M    I P   TF+VL++   KA  +  A ++  +++     P    +  +IDG+C
Sbjct: 299  LLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYC 358

Query: 948  KAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTI 769
            + G V +   +  EM  +   P+ FT++ LI   C + R+ +A  L  +++S    P   
Sbjct: 359  RVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPF 418

Query: 768  TISAFISCLLKAGMPCEANQIMVTFSKKDLDP 673
              +  I    KAG   EAN I+    KK   P
Sbjct: 419  MYNPVIDGFCKAGKVNEANVIVEEMEKKKCKP 450


>ref|NP_178657.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|42570711|ref|NP_973429.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75216767|sp|Q9ZUE9.1|PP149_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g06000 gi|4006835|gb|AAC95177.1| hypothetical protein
            [Arabidopsis thaliana] gi|110736272|dbj|BAF00106.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|330250896|gb|AEC05990.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|330250897|gb|AEC05991.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 536

 Score =  499 bits (1286), Expect = e-138
 Identities = 261/511 (51%), Positives = 347/511 (67%), Gaps = 8/511 (1%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A FH H+         + + REVI   +AW+VK+VSTL       S      C  Y SK
Sbjct: 10   IAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDL----CFCYLSK 65

Query: 2034 SFNPSIVFGVVTRL-GNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASK 1858
            + NP I F VV +L  NP +  RF+EF +     I H   TYN L +SLC+ GLH+ A +
Sbjct: 66   NLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN-IRHSFWTYNLLTRSLCKAGLHDLAGQ 124

Query: 1857 LV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILV 1690
            +      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++    + N+LLN LV
Sbjct: 125  MFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSF--EVEGCCMVVNSLLNTLV 182

Query: 1689 GNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDV 1510
              +RV +A+  F D++LR   C DT +FNI++RGLC +GK + A EL   M  F C PD+
Sbjct: 183  KLDRVEDAMKLF-DEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCEPDI 241

Query: 1509 VTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLD 1330
            VTYNTLI GFC++  +N+  E+   ++S +  SPDVVTYTS+ISGYCK G+M +AS+LLD
Sbjct: 242  VTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASSLLD 301

Query: 1329 EMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTG 1150
            +M+  GI P + TFNVL++G+ K G M +A  +   M+S GC PDVVTFTSLIDGYC+ G
Sbjct: 302  DMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVG 361

Query: 1149 AIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYN 970
             + +G +LW EM  + + PNA+T+S+LINALC  NRL +AR+LL QL  ++++P+PF+YN
Sbjct: 362  QVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPFMYN 421

Query: 969  PVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSI 790
            PVIDGFCKAG V+EANVI+ EME+KKC PDK TFT LIIGHCMKGRM EA+S+FHKM +I
Sbjct: 422  PVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAI 481

Query: 789  GCAPDTITISAFISCLLKAGMPCEA---NQI 706
            GC+PD IT+S+ +SCLLKAGM  EA   NQI
Sbjct: 482  GCSPDKITVSSLLSCLLKAGMAKEAYHLNQI 512



 Score =  146 bits (369), Expect = 4e-32
 Identities = 105/392 (26%), Positives = 169/392 (43%), Gaps = 33/392 (8%)
 Frame = -3

Query: 1749 CLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGK 1570
            C     +N F+   ++  L  +N  H    F++    +L       ++N++ R LC+ G 
Sbjct: 61   CYLSKNLNPFISFEVVKKL--DNNPHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGL 118

Query: 1569 IDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSP------ 1408
             D+A ++F  M S   SP+      L+  F     ++    LL Q     G         
Sbjct: 119  HDLAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCMVVNSLL 178

Query: 1407 --------------------------DVVTYTSVISGYCKLGEMEDASALLDEMINKGIM 1306
                                      D  T+  +I G C +G+ E A  LL  M   G  
Sbjct: 179  NTLVKLDRVEDAMKLFDEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCE 238

Query: 1305 PNSFTFNVLINGFGKVGNMRSAMAMYENMLSNG-CVPDVVTFTSLIDGYCQTGAIEEGMK 1129
            P+  T+N LI GF K   +  A  M++++ S   C PDVVT+TS+I GYC+ G + E   
Sbjct: 239  PDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASS 298

Query: 1128 LWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFC 949
            L ++M    I P   TF+VL++   KA  +  A ++  +++     P    +  +IDG+C
Sbjct: 299  LLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYC 358

Query: 948  KAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTI 769
            + G V +   +  EM  +   P+ FT++ LI   C + R+ +A  L  +++S    P   
Sbjct: 359  RVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPF 418

Query: 768  TISAFISCLLKAGMPCEANQIMVTFSKKDLDP 673
              +  I    KAG   EAN I+    KK   P
Sbjct: 419  MYNPVIDGFCKAGKVNEANVIVEEMEKKKCKP 450


>ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum]
            gi|557096393|gb|ESQ36901.1| hypothetical protein
            EUTSA_v10002477mg [Eutrema salsugineum]
          Length = 535

 Score =  499 bits (1284), Expect = e-138
 Identities = 265/527 (50%), Positives = 349/527 (66%), Gaps = 7/527 (1%)
 Frame = -3

Query: 2232 ASGTLFVASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSC 2053
            AS    +  FH H          + + REVI+  +AW+VK+VSTL         +DL  C
Sbjct: 4    ASFATTIGLFHSHTHGGAQARPLQSNTREVIQCPEAWLVKIVSTLFVY--QVPDSDLCFC 61

Query: 2052 LEYFSKSFNPSIVFGVVTRLGNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLH 1873
              Y SK+ NP I F VV +L NP +  RF+EF +     I H   TYN L +SLC+ GLH
Sbjct: 62   --YLSKNLNPFIAFEVVKKLDNPHIGFRFWEFSRFKLN-IRHSFWTYNLLTRSLCKAGLH 118

Query: 1872 EAASKLV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNL 1705
            + A K+      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++ +S + N+L
Sbjct: 119  DLAGKMFECMKSDGVSPNSRLLGFLVSSFAEKGKLHFATALLLQSY--EVEGSSMVVNSL 176

Query: 1704 LNILVGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFN 1525
            L+ LV  +RV +A+  F D +LR   C DT +FNI+++GLC +GK   A +L   M SF 
Sbjct: 177  LHTLVRLDRVEDAMKLF-DTHLRSQSCNDTRTFNILIQGLCGIGKAHEALKLLGEMSSFG 235

Query: 1524 CSPDVVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDA 1345
             SPD+VTYNTLI GFC++  +N+  E+  +++S+ G   DVVTYTS++SGYCK G+M +A
Sbjct: 236  SSPDIVTYNTLIKGFCKSNELNKANEIFNEVKSRNGCFRDVVTYTSMMSGYCKAGKMREA 295

Query: 1344 SALLDEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDG 1165
            S LLDEM+  G+ P + TFNVL+ G+ K G M SA A+   M S GC PDVVTFT+LIDG
Sbjct: 296  SLLLDEMVGLGMYPTNITFNVLVYGYVKAGEMSSAEAIRRKMDSFGCFPDVVTFTTLIDG 355

Query: 1164 YCQTGAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPR 985
            YC+ G + +G  LW EM  K + PNA+T+S+LINALCK NRL +AR+LL QL   ++VP+
Sbjct: 356  YCRVGQVNKGFSLWEEMSAKGMFPNAFTYSILINALCKENRLLKARELLGQLACMDIVPK 415

Query: 984  PFIYNPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFH 805
            PF+YNP+IDGFCKAG V+EANVI+ EME+ +C PDK TFT LIIGHCMKGRM EAIS+FH
Sbjct: 416  PFLYNPIIDGFCKAGKVNEANVIVAEMEKFRCKPDKITFTILIIGHCMKGRMCEAISIFH 475

Query: 804  KMSSIGCAPDTITISAFISCLLKAGMPCEA---NQIMVTFSKKDLDP 673
            KM +IGC+PD IT+S+  SCLLKAGM  EA   NQ  V     D+ P
Sbjct: 476  KMVAIGCSPDKITVSSLSSCLLKAGMAKEAYQLNQFAVKGQSNDVAP 522


>ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Capsella rubella]
            gi|565479514|ref|XP_006297397.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566105|gb|EOA30294.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566106|gb|EOA30295.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
          Length = 535

 Score =  497 bits (1279), Expect = e-137
 Identities = 261/523 (49%), Positives = 348/523 (66%), Gaps = 9/523 (1%)
 Frame = -3

Query: 2214 VASFHDHAVQVCSQPDFRISNREVIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSK 2035
            +A FH H+           + REV+   +AW++K+VSTL       S      C  Y SK
Sbjct: 10   IAHFHTHSHGGAQARPLHSNKREVMHCPEAWLIKIVSTLFVYRVPDSDL----CFCYLSK 65

Query: 2034 SFNPSIVFGVVTRLGN--PKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAAS 1861
            + NP I F VV +L N  P L  RF+EF +     I H   TYN L +SLC+ G+H+ A 
Sbjct: 66   NLNPFIAFEVVKKLDNNHPHLGFRFWEFSRFKLN-IRHSFWTYNVLTRSLCKAGMHDLAG 124

Query: 1860 KLV----FDGYFPDGSVFDFLVSSCAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNIL 1693
            ++      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++    + N+LLN L
Sbjct: 125  QMFECMRSDGVSPNSRLLGFLVSSFAEKGKLQFATALLLQSY--EVERCCMVVNSLLNTL 182

Query: 1692 VGNNRVHEAVCFFQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPD 1513
            V  +RV +A+  F D++LR   C DT +FNI++RGLC +GK + A EL   M  F CSPD
Sbjct: 183  VKLDRVDDAMKLF-DKHLRFQCCNDTKTFNILIRGLCSVGKGEKALELLGEMSGFGCSPD 241

Query: 1512 VVTYNTLIDGFCRAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALL 1333
            +VTYNTLI GFC++  + +  E+L  ++S +G SPDVVTYTS+ISGYCK G+M++A  LL
Sbjct: 242  IVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCKAGKMQEAYLLL 301

Query: 1332 DEMINKGIMPNSFTFNVLINGFGKVGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQT 1153
            D+M+  GI P + TFNVL++G+ K G M SA  +   M+S GC PDVVTFTSLIDGYC+ 
Sbjct: 302  DDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISFGCFPDVVTFTSLIDGYCRA 361

Query: 1152 GAIEEGMKLWNEMGTKNISPNAYTFSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIY 973
            G + +G +LW EM  K + PN +T+S+LINALCK N L +AR+LL QL  ++++ +PF+Y
Sbjct: 362  GQVNQGFRLWEEMNAKGMLPNEFTYSILINALCKENSLLKARELLGQLASKDIITKPFMY 421

Query: 972  NPVIDGFCKAGNVDEANVILVEMEEKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSS 793
            NPVIDGFCKAG V+EANVI+ EME+KKC PDK TFT LIIGHCMKGRM EA+S+FHKM +
Sbjct: 422  NPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVA 481

Query: 792  IGCAPDTITISAFISCLLKAGMPCEA---NQIMVTFSKKDLDP 673
            IGC+PD IT+++ +SCLLKAGM  EA   NQI       D+ P
Sbjct: 482  IGCSPDKITVNSLLSCLLKAGMAEEAYHLNQIARKAQSNDVAP 524


>ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Glycine max]
          Length = 544

 Score =  492 bits (1267), Expect = e-136
 Identities = 252/488 (51%), Positives = 333/488 (68%), Gaps = 5/488 (1%)
 Frame = -3

Query: 2145 VIRNTDAWIVKVVSTLCTLGGSKSSTDLFSCLEYFSKSFNPSIVFGVVTRLGNPKLALRF 1966
            +I   D+W VK+VSTL     S S  D F  L YF +   PS V  VV R  NP L  +F
Sbjct: 39   IITTPDSWFVKIVSTLFLC--SNSLDDRF--LGYFREHLTPSHVLEVVKRFNNPNLGFKF 94

Query: 1965 FEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD-----GYFPDGSVFDFLVSS 1801
            F F +     ++H   TYN L++SLCQ GLH +A KL++D     G  PD  +  FLVSS
Sbjct: 95   FRFTRERLS-MSHSFWTYNMLLRSLCQAGLHNSA-KLLYDSMRSDGQLPDSRLLGFLVSS 152

Query: 1800 CAQAGKLDTAKRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCFFQDQNLRLGFCP 1621
             A A + D +K LL +A  S ++V+  +YNN LNIL+ +NR+ +A+C F++  +R   C 
Sbjct: 153  FALADRFDVSKELLAEAQCSGVQVDVIVYNNFLNILIKHNRLDDAICLFREL-MRSHSCL 211

Query: 1620 DTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFCRAKYVNRGYELL 1441
            D  +FNI++RGLC  G +D AFEL   MGSF CSPD+VTYN L+ G CR   V+R  +LL
Sbjct: 212  DAFTFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRIDQVDRARDLL 271

Query: 1440 TQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNSFTFNVLINGFGK 1261
             ++  K  F+P+VV+YT+VISGYC+L +M++AS+L  EM+  G  PN FTF+ L++GF K
Sbjct: 272  EEVCLKCEFAPNVVSYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVFTFSALVDGFVK 331

Query: 1260 VGNMRSAMAMYENMLSNGCVPDVVTFTSLIDGYCQTGAIEEGMKLWNEMGTKNISPNAYT 1081
             G+M SA+ M++ +L +GC P+V+T TSLI+GYC+ G +  G+ LW EM  +NI  N YT
Sbjct: 332  AGDMASALGMHKKILFHGCAPNVITLTSLINGYCRAGWVNHGLDLWREMNARNIPANLYT 391

Query: 1080 FSVLINALCKANRLNEARDLLRQLMWRNVVPRPFIYNPVIDGFCKAGNVDEANVILVEME 901
            +SVLI+ALCK+NRL EAR+LLR L   ++VP  F+YNPVIDG+CK+GN+DEAN I+ EME
Sbjct: 392  YSVLISALCKSNRLQEARNLLRILKQSDIVPLAFVYNPVIDGYCKSGNIDEANAIVAEME 451

Query: 900  EKKCNPDKFTFTSLIIGHCMKGRMAEAISLFHKMSSIGCAPDTITISAFISCLLKAGMPC 721
            E KC PDK TFT LIIGHCMKGR  EAI +F+KM + GC PD ITI    SCLLK+GMP 
Sbjct: 452  E-KCKPDKLTFTILIIGHCMKGRTPEAIGIFYKMLASGCTPDDITIRTLSSCLLKSGMPG 510

Query: 720  EANQIMVT 697
            EA +I  T
Sbjct: 511  EAARIKET 518



 Score = 77.0 bits (188), Expect = 4e-11
 Identities = 71/329 (21%), Positives = 141/329 (42%), Gaps = 15/329 (4%)
 Frame = -3

Query: 2151 REVIRN---TDAWIVKV-VSTLCTLGGSKSSTDLFSCLEYFSKS---FNPSIVFGVVTRL 1993
            RE++R+    DA+   + +  LCT G    + +L   +  F  S      +I+   + R+
Sbjct: 202  RELMRSHSCLDAFTFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRI 261

Query: 1992 GNPKLALRFFEFCQANCQEINHFVKTYNFLVKSLCQNGLHEAASKLVFD----GYFPDGS 1825
                 A    E     C+   + V +Y  ++   C+    + AS L ++    G  P+  
Sbjct: 262  DQVDRARDLLEEVCLKCEFAPNVV-SYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVF 320

Query: 1824 VFDFLVSSCAQAGKLDTA----KRLLVQACLSKIKVNSFMYNNLLNILVGNNRVHEAVCF 1657
             F  LV    +AG + +A    K++L   C      N     +L+N       V+  +  
Sbjct: 321  TFSALVDGFVKAGDMASALGMHKKILFHGCAP----NVITLTSLINGYCRAGWVNHGLDL 376

Query: 1656 FQDQNLRLGFCPDTCSFNIVVRGLCRLGKIDMAFELFNGMGSFNCSPDVVTYNTLIDGFC 1477
            +++ N R     +  ++++++  LC+  ++  A  L   +   +  P    YN +IDG+C
Sbjct: 377  WREMNAR-NIPANLYTYSVLISALCKSNRLQEARNLLRILKQSDIVPLAFVYNPVIDGYC 435

Query: 1476 RAKYVNRGYELLTQIQSKTGFSPDVVTYTSVISGYCKLGEMEDASALLDEMINKGIMPNS 1297
            ++  ++    ++ +++ K    PD +T+T +I G+C  G   +A  +  +M+  G  P+ 
Sbjct: 436  KSGNIDEANAIVAEMEEKC--KPDKLTFTILIIGHCMKGRTPEAIGIFYKMLASGCTPDD 493

Query: 1296 FTFNVLINGFGKVGNMRSAMAMYENMLSN 1210
             T   L +   K G    A  + E +  N
Sbjct: 494  ITIRTLSSCLLKSGMPGEAARIKETLFEN 522


Top