BLASTX nr result

ID: Forsythia22_contig00020386 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00020386
         (2192 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011070974.1| PREDICTED: pentatricopeptide repeat-containi...   595   e-167
emb|CDP17783.1| unnamed protein product [Coffea canephora]            548   e-153
ref|XP_009759898.1| PREDICTED: pentatricopeptide repeat-containi...   524   e-145
ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containi...   520   e-144
ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containi...   520   e-144
ref|XP_009604911.1| PREDICTED: pentatricopeptide repeat-containi...   517   e-143
ref|XP_012855399.1| PREDICTED: pentatricopeptide repeat-containi...   513   e-142
ref|XP_010092845.1| hypothetical protein L484_022440 [Morus nota...   496   e-137
ref|XP_002273719.2| PREDICTED: pentatricopeptide repeat-containi...   496   e-137
ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfam...   495   e-137
emb|CBI38862.3| unnamed protein product [Vitis vinifera]              485   e-134
ref|XP_008438151.1| PREDICTED: pentatricopeptide repeat-containi...   483   e-133
ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containi...   482   e-133
ref|XP_002320730.1| hypothetical protein POPTR_0014s06610g [Popu...   481   e-132
ref|XP_012475249.1| PREDICTED: pentatricopeptide repeat-containi...   480   e-132
ref|XP_011034479.1| PREDICTED: pentatricopeptide repeat-containi...   477   e-131
ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citr...   477   e-131
ref|XP_012083560.1| PREDICTED: pentatricopeptide repeat-containi...   475   e-131
gb|KHG08668.1| hypothetical protein F383_35923 [Gossypium arboreum]   474   e-130
ref|XP_003623723.1| Pentatricopeptide repeat-containing protein ...   474   e-130

>ref|XP_011070974.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Sesamum indicum] gi|747049842|ref|XP_011070975.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970 [Sesamum indicum]
          Length = 416

 Score =  595 bits (1533), Expect = e-167
 Identities = 294/417 (70%), Positives = 351/417 (84%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG  A ++F + + IC+  Q LL+ +  PLWKN F  +   FR  K  L  LL++  +N+
Sbjct: 1    MGCLAPNLFSMASTICIHDQKLLSFFHRPLWKNPFFGHNSTFRCKKPHLCQLLILSCINE 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
            GEN   S  +  E  +F+WV+   N+TEEQKQ I+Q P KM NRC+AL+KQIICFS ENG
Sbjct: 61   GEN-RGSSVKGIENAKFKWVRASPNLTEEQKQVISQFPTKMTNRCKALMKQIICFSAENG 119

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+S MLAAWVKS +P+RA+WLSVLKELE+LNHPLYFEV EHA TEESFEANVRDYTK+IH
Sbjct: 120  SLSHMLAAWVKSTSPRRADWLSVLKELERLNHPLYFEVAEHAFTEESFEANVRDYTKIIH 179

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
             YAK+ +L+EAEN L AMK+RGF+CDQVTLTALIHMYSK+ NLKL+E++FEEMKLLG  L
Sbjct: 180  CYAKENRLREAENALTAMKSRGFVCDQVTLTALIHMYSKSGNLKLSEDTFEEMKLLGVLL 239

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            D+RSYGSM+MAYIRAGML R E++L+EMEAQEIYAGREVYKALLRAYSM GDS GAQRVF
Sbjct: 240  DKRSYGSMIMAYIRAGMLARAETILREMEAQEIYAGREVYKALLRAYSMTGDSQGAQRVF 299

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            +AIQLAG+IPDVK+CGLLINAYV++GQ+REA IAF+NLRQAGLEPNDKCVALVL AYEKE
Sbjct: 300  NAIQLAGLIPDVKICGLLINAYVVSGQSREACIAFENLRQAGLEPNDKCVALVLMAYEKE 359

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASRVPEPAL 1552
            N+LK ALD LI+LE+DGVM+ KEAS LL KWF+KLGVVEEVE+VLRD+ASR+P+P +
Sbjct: 360  NRLKEALDLLIELERDGVMLGKEASALLAKWFQKLGVVEEVELVLRDFASRMPQPVI 416


>emb|CDP17783.1| unnamed protein product [Coffea canephora]
          Length = 411

 Score =  548 bits (1413), Expect = e-153
 Identities = 278/414 (67%), Positives = 332/414 (80%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG++   +    + I   +QN L   +  LW  S  R P +F   KS+  P+L+VK+  +
Sbjct: 1    MGFYDCKMLSFTDSIVGPSQNNLEFQRCFLWGKSLWRIPSSFGCLKSNNGPVLIVKNEIE 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             E  E  +E+   KPRFRWVK+G +  E+QKQAIAQ+P KM NRC+AL+KQIICF PE G
Sbjct: 61   TEKFEVKQED---KPRFRWVKVGPDTNEDQKQAIAQLPLKMSNRCKALMKQIICFKPEKG 117

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            ++S +LA WVKSMNP+RA+WL +LKEL +L HPLY E+   AL EESFEA VRDYTK+IH
Sbjct: 118  NLSDLLAVWVKSMNPKRADWLLILKELSRLEHPLYLELAGLALMEESFEACVRDYTKIIH 177

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
            GYAKQ K+QEAENT LAMK  GFICDQVTLTAL+HMYSKA NLKLAE++FEEMKLLG PL
Sbjct: 178  GYAKQKKVQEAENTFLAMKRGGFICDQVTLTALVHMYSKAGNLKLAEDTFEEMKLLGVPL 237

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            DRRSYGSM+MAYIRAG L +GES+LKEMEA+ IYAGREVYKALLRAYSM GDS GAQRVF
Sbjct: 238  DRRSYGSMIMAYIRAGRLSQGESLLKEMEAENIYAGREVYKALLRAYSMNGDSKGAQRVF 297

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            DAIQLAG+IPD KVCGLLINAYV+AGQ+ EA I F+NLR++GL+PNDKCV+LVLA YEK+
Sbjct: 298  DAIQLAGMIPDAKVCGLLINAYVVAGQSSEACIVFENLRRSGLQPNDKCVSLVLAVYEKD 357

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASRVPE 1543
            NKL  ALDFL  LE+DG ++ KEASE+LVKWF++LGVVEEVE +LRDYA R  +
Sbjct: 358  NKLSKALDFLTDLERDGFLLGKEASEVLVKWFQRLGVVEEVEQILRDYALRTAQ 411


>ref|XP_009759898.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Nicotiana sylvestris]
          Length = 410

 Score =  524 bits (1350), Expect = e-145
 Identities = 261/410 (63%), Positives = 326/410 (79%), Gaps = 2/410 (0%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG+ A  +F         +  +   +Q+PL     L+    F   +   +PLL V +V  
Sbjct: 1    MGFAACDVFSYSTQTSPLSNTIRRIHQYPLNGAFSLQKVTIFGYRELGFKPLLAVSNVA- 59

Query: 482  GENVEYSKEEDE--EKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPE 655
               VE    E++  +KPR+RWVKIGS++TEEQKQAI + PPKM NRC+AL++QIIC+SPE
Sbjct: 60   ---VENGSAENQANDKPRYRWVKIGSDITEEQKQAILKFPPKMPNRCKALMQQIICYSPE 116

Query: 656  NGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKM 835
             GS+S +L AWVKSM P+RA+WL VLKEL++LNHPLYFEV E +L EESFEANVRDYTK+
Sbjct: 117  KGSVSLLLEAWVKSMKPERADWLEVLKELDRLNHPLYFEVAEVSLLEESFEANVRDYTKI 176

Query: 836  IHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGT 1015
            IHG+AKQ + +EAE+ LLAMK RGF CDQV LTAL+HMYSKA NLK+AE++FEEM+LLG 
Sbjct: 177  IHGHAKQDRPREAESMLLAMKTRGFTCDQVVLTALVHMYSKAGNLKMAEDTFEEMRLLGV 236

Query: 1016 PLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQR 1195
             LD+RSYGSM+MAY+RAGML  GE++LKEME QEIYAGREVYKA+LRAYSMIGDS GAQR
Sbjct: 237  TLDKRSYGSMIMAYVRAGMLGEGEALLKEMEEQEIYAGREVYKAILRAYSMIGDSKGAQR 296

Query: 1196 VFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYE 1375
            VFDA+QLAGIIPD   CGLL+NAYV+AGQ  EA IAF+NLR+AG+EPNDKC+AL+L+AYE
Sbjct: 297  VFDALQLAGIIPDATFCGLLMNAYVVAGQLSEACIAFENLRRAGIEPNDKCIALLLSAYE 356

Query: 1376 KENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDY 1525
             EN L  ALD L+ LE+DG+++ +EASE+L +WF+KLGVV EVE+VLR++
Sbjct: 357  TENNLGKALDVLMNLERDGIVLGREASEILARWFKKLGVVGEVELVLREF 406


>ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Solanum tuberosum]
          Length = 415

 Score =  520 bits (1340), Expect = e-144
 Identities = 251/410 (61%), Positives = 326/410 (79%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG+ A+ + C      +    +   +Q+PL    FL+    F   +   RPLL V +V  
Sbjct: 1    MGFVASDVLCCSTQTSILNNTIGRIHQYPLKGVFFLQKVAIFENRELGFRPLLAVSNVAV 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             +    ++ +  +KPR++WVKIGS++TEEQ++AI ++PPKM NRC+AL++QIIC+SPE G
Sbjct: 61   HQKGS-AENQVNDKPRYKWVKIGSDVTEEQQRAILKLPPKMINRCKALMQQIICYSPEKG 119

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+S +L AWVKSM P+RA+WL+VLKEL++LNHP+Y EV E +L  ESFEAN+RDYTK+IH
Sbjct: 120  SVSLLLEAWVKSMKPERADWLAVLKELDRLNHPMYLEVAELSLLAESFEANIRDYTKIIH 179

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
            GYAKQ +L+EAE+  L+MK+RGF CDQVTLTAL+HMYSKA NLKLAE++FEEM+LLG PL
Sbjct: 180  GYAKQNRLKEAESVFLSMKSRGFTCDQVTLTALVHMYSKAGNLKLAEDTFEEMRLLGVPL 239

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            D+RS+GS++MAY+RAG L +GE++LKEME QEIYAG EVYKALLRAYSM GDS GAQRVF
Sbjct: 240  DKRSFGSIIMAYVRAGKLGQGEALLKEMEEQEIYAGPEVYKALLRAYSMSGDSKGAQRVF 299

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            D  QLAG+IPD  +CGLL+NAY+MAGQ  EA I F+N+R+ G++PNDKC+ L+L AYE E
Sbjct: 300  DTTQLAGVIPDATICGLLMNAYIMAGQLSEACITFENMRRVGIKPNDKCITLLLKAYETE 359

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYAS 1531
            NKL  ALD L+ LE+DGV++ +EASELL +WF++LGVV EVE+VLRDYAS
Sbjct: 360  NKLSKALDVLMDLERDGVVLGREASELLARWFKRLGVVGEVELVLRDYAS 409


>ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Solanum lycopersicum]
          Length = 415

 Score =  520 bits (1338), Expect = e-144
 Identities = 249/410 (60%), Positives = 326/410 (79%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG+ A+ + C      +++  +    Q+P     FL+   +F   +   +PLL V +V  
Sbjct: 1    MGFVASGVLCCSTQTSIRSNTIGRINQYPSKGVFFLQKVTSFENRELGFKPLLAVSNVAV 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             +    ++ +  +KPR+RWVKIGS++TEEQ++AI ++PPKM NRC+AL++QIIC+SPE G
Sbjct: 61   NQKSS-AENQVNDKPRYRWVKIGSDVTEEQQRAILKLPPKMINRCKALMQQIICYSPEKG 119

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+S +L AWVKSM P RA+WL+VLKEL++LNHP+Y EV E +L  ESFEAN+RDYTK+IH
Sbjct: 120  SVSLLLEAWVKSMKPDRADWLAVLKELDRLNHPMYLEVAELSLLAESFEANIRDYTKIIH 179

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
            GYAKQ +L+EAE+  L+MK+RGF CDQVTLTAL+HMYSKA NLKLAE++FEEM+LLG PL
Sbjct: 180  GYAKQNRLKEAESVFLSMKSRGFTCDQVTLTALVHMYSKASNLKLAEDTFEEMRLLGVPL 239

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            D+RS+GS++MAY+RAG L +GE++LKEME QE YAG EVYKALLRAYSM GDS GAQRVF
Sbjct: 240  DKRSFGSIIMAYVRAGKLGQGEALLKEMEEQETYAGPEVYKALLRAYSMSGDSKGAQRVF 299

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            D IQLAG+IPD  +CGLL+NAY+MAGQ  E  IAF+N+R+ G++PNDKC+ L+L AYE E
Sbjct: 300  DTIQLAGVIPDATICGLLMNAYIMAGQLSETCIAFENMRRVGIKPNDKCITLLLTAYETE 359

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYAS 1531
            NKL  ALD L+ LE+DG+++ +EASELL +WF++LGVV EVE+VLRDYAS
Sbjct: 360  NKLSKALDVLMDLERDGIVLGREASELLARWFKRLGVVGEVELVLRDYAS 409


>ref|XP_009604911.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Nicotiana tomentosiformis]
          Length = 410

 Score =  517 bits (1331), Expect = e-143
 Identities = 256/408 (62%), Positives = 322/408 (78%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG+ A  +F         +  +   + +PL     L+    F   +   +PLL V +V  
Sbjct: 1    MGFTAYGVFSCSIQTSPLSNTIRRIHHYPLNCVFPLQKVTIFGYGELGFKPLLAVSNVAA 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             +    ++  D  KPR++WVKIGS++TE QKQAI ++PPKM NRC+AL++QIIC+S E G
Sbjct: 61   EKGSAENQAND--KPRYKWVKIGSDITEVQKQAILKLPPKMANRCKALMQQIICYSAEKG 118

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+S +L AWVKSM P+RA+WL  LKEL++LNHPLYFEV E +L EESFEANVRDYTK+IH
Sbjct: 119  SVSLLLEAWVKSMKPERADWLEALKELDRLNHPLYFEVAEVSLLEESFEANVRDYTKLIH 178

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
            GYAKQ +L+EAE+ LLAMK RGF CDQV LTAL+HMYSKA NLK+AE++FEEM+LLG  L
Sbjct: 179  GYAKQNRLREAESMLLAMKTRGFTCDQVVLTALVHMYSKAGNLKMAEDTFEEMRLLGVAL 238

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            D+RSYGSM+MAY+RAGML  GE++L+EME QEIYAGREVYKA+LRAYSMIGDS GAQRVF
Sbjct: 239  DKRSYGSMIMAYVRAGMLGEGEALLREMEEQEIYAGREVYKAILRAYSMIGDSKGAQRVF 298

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            D +QLAGIIPD  VCGLL+NAYV+AGQ  EA IAF+NLR+AG+EPNDKC+AL+L+AYE E
Sbjct: 299  DTLQLAGIIPDATVCGLLMNAYVVAGQLSEACIAFENLRRAGIEPNDKCIALLLSAYETE 358

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDY 1525
            N L  ALD L+ LE+DGV++ +EASE+L +WF+KLGVV EVE+VLR++
Sbjct: 359  NNLSKALDVLMNLERDGVVLGREASEILARWFKKLGVVGEVELVLREF 406


>ref|XP_012855399.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Erythranthe guttatus]
          Length = 418

 Score =  513 bits (1321), Expect = e-142
 Identities = 258/406 (63%), Positives = 316/406 (77%), Gaps = 1/406 (0%)
 Frame = +2

Query: 338  NPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLD-KSDLRPLLVVKSVNDGENVEYSKEED 514
            +PIC   Q  L A+  P W+N F R    FR   KS+   +LV   +N+G     +   D
Sbjct: 13   SPICTHAQKSLIAFHNPFWRNPFFRKYPNFRFSSKSNRCKMLVFSCLNEGVGSGANAVID 72

Query: 515  EEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAAWVK 694
             E  +   V+   +++EEQKQAI+Q+P KM NRC+ L+K+IICFSPENGS+  MLAAWVK
Sbjct: 73   AENRKSNRVRTVFDLSEEQKQAISQLPSKMTNRCKLLMKEIICFSPENGSVPLMLAAWVK 132

Query: 695  SMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKLQEA 874
            S NP+R +WLSV KELE LNHPLYFEV EHA  EESFEAN+RDYT +IHGYAKQ +LQEA
Sbjct: 133  STNPRRTDWLSVFKELEALNHPLYFEVAEHAFAEESFEANIRDYTNIIHGYAKQNRLQEA 192

Query: 875  ENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSMVMA 1054
            E TL AMKNRG +CDQV LTALIHMYSK+ NLK A+ SFEEMK+LG  LDRRSYGS++MA
Sbjct: 193  EQTLTAMKNRGLLCDQVILTALIHMYSKSGNLKQAQYSFEEMKMLGVQLDRRSYGSIIMA 252

Query: 1055 YIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGIIPD 1234
            +IRA  L   E++L+EMEAQEIYAGREVYKALLRAYSM GD  GAQR+FDAIQ+AGI PD
Sbjct: 253  HIRAEKLSSAETLLQEMEAQEIYAGREVYKALLRAYSMKGDFSGAQRIFDAIQVAGITPD 312

Query: 1235 VKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALDFLI 1414
             +VCGLLINAYV++G+ +EA +AF N+R++G+E NDKCVALVLAA+E E+KLK ALD LI
Sbjct: 313  ARVCGLLINAYVVSGRAQEACVAFGNMRRSGIEVNDKCVALVLAAFEMEDKLKDALDLLI 372

Query: 1415 KLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASRVPEPAL 1552
            +LE +G+M+ KE S+LLVKWF+KLGVVEEV IVLR+   ++ +P L
Sbjct: 373  ELEGEGIMVGKEGSDLLVKWFQKLGVVEEVAIVLRELGLKMAQPVL 418


>ref|XP_010092845.1| hypothetical protein L484_022440 [Morus notabilis]
            gi|587862878|gb|EXB52663.1| hypothetical protein
            L484_022440 [Morus notabilis]
          Length = 406

 Score =  496 bits (1276), Expect = e-137
 Identities = 247/393 (62%), Positives = 312/393 (79%)
 Frame = +2

Query: 356  TQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVNDGENVEYSKEEDEEKPRFR 535
            T  +   + FP     F   P  F       R  LV  SV + E  E        KP+F+
Sbjct: 12   TNEITKTHFFP---KPFYPTPTNFPSRNLHFRRPLVATSVEETEKAE----NGGGKPKFK 64

Query: 536  WVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAAWVKSMNPQRA 715
            WV++G  +TE QK+AI+Q+ PKM  RCRAL+KQ+ICFS    S++ +LAAWV+ M PQRA
Sbjct: 65   WVEVGPGITESQKEAISQLSPKMTKRCRALMKQLICFSAHKASLNELLAAWVRIMKPQRA 124

Query: 716  NWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKLQEAENTLLAM 895
            +WL+++K+L+ ++HPLYF+V E AL EESFEAN+RDYTK+IH Y KQ +L++AE TLLAM
Sbjct: 125  DWLAIIKQLKIMDHPLYFQVAEVALLEESFEANIRDYTKIIHCYGKQNRLEDAEKTLLAM 184

Query: 896  KNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSMVMAYIRAGML 1075
            K+RGFI DQVTLT  IHMYSKA NLKLAEE+FEE+KLLG PLD+RSYGSM+MAYIRAGM 
Sbjct: 185  KSRGFIRDQVTLTTFIHMYSKAGNLKLAEETFEELKLLGQPLDKRSYGSMIMAYIRAGMP 244

Query: 1076 DRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGIIPDVKVCGLL 1255
            D+GE++L+EM+ +EIYAG EVYKALLRAYSM GD+ GAQRVFDAIQLAGI+PD ++CGLL
Sbjct: 245  DQGENILREMDVEEIYAGSEVYKALLRAYSMTGDAEGAQRVFDAIQLAGILPDPRLCGLL 304

Query: 1256 INAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALDFLIKLEKDGV 1435
            INAYV +GQ+ +A +AF N+R+AGLEP+DKCVALVL AYEKENKL+ ALDFL++LE+ G+
Sbjct: 305  INAYVESGQSEKACVAFGNMRRAGLEPSDKCVALVLCAYEKENKLQRALDFLMELERHGI 364

Query: 1436 MIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
            M+ +EASE LV WFRKLGVV+EV++VLR+YAS+
Sbjct: 365  MVGEEASETLVGWFRKLGVVKEVDLVLREYASK 397


>ref|XP_002273719.2| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Vitis vinifera]
          Length = 417

 Score =  496 bits (1276), Expect = e-137
 Identities = 254/411 (61%), Positives = 319/411 (77%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            MG FA  +     P C     + N       +  F + P      + +  P+LV  +V +
Sbjct: 1    MGSFACIMLSNSYPKCSFGDEITNTLNCHFPEKFFFQTPVNVGHSRLNFGPVLVGSNVEE 60

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
               VE     + EK R++W++IG N+TE QK  I+QI  KM  RC+AL+KQIICFSPE  
Sbjct: 61   KGTVEMG---EGEKKRYKWIEIGPNITEAQKMTISQISLKMTKRCKALVKQIICFSPEER 117

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+S +LAAWVK M P+RA+WLSVLKEL +L+HPL  EV E AL EESFEAN+RDYTK+I 
Sbjct: 118  SLSDLLAAWVKIMKPRRADWLSVLKELGRLDHPLLLEVAELALLEESFEANIRDYTKIID 177

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
            GY KQ +LQ+AENTL AMK RGFICDQVTLTA+I+MYSKA NL+LAE++FEE+KLLG PL
Sbjct: 178  GYGKQNRLQDAENTLSAMKRRGFICDQVTLTAMINMYSKAGNLELAEKTFEEIKLLGHPL 237

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            D+RSYGSM+MAYIRAGM D+GE ++KEMEA+EIYAGREVYKALLRAYS   D+ GAQRVF
Sbjct: 238  DKRSYGSMIMAYIRAGMPDQGEILVKEMEAKEIYAGREVYKALLRAYSNTSDAEGAQRVF 297

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            DAIQ AGI PDVK+C LLINAY +AGQT++A +AF+N+R++GL+PNDK +AL+LAAYEKE
Sbjct: 298  DAIQFAGISPDVKLCALLINAYRVAGQTQKAHVAFENMRRSGLKPNDKSIALMLAAYEKE 357

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
            NKL  ALDFLI LE+DG+++ KEASELL  WF++LGVV+EVE+VLR+Y+++
Sbjct: 358  NKLNKALDFLIDLERDGIVLGKEASELLAAWFQRLGVVKEVELVLREYSAK 408


>ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao] gi|508704296|gb|EOX96192.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein,
            putative [Theobroma cacao]
          Length = 420

 Score =  495 bits (1275), Expect = e-137
 Identities = 237/343 (69%), Positives = 301/343 (87%)
 Frame = +2

Query: 506  EEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAA 685
            E +EEK R++WV+IG ++ EEQKQAI ++P KM  RC+AL+KQIICF PE GS++ +LAA
Sbjct: 69   ETNEEKRRYKWVEIGPDIAEEQKQAITELPFKMTKRCKALMKQIICFCPEKGSLADLLAA 128

Query: 686  WVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKL 865
            WVK M P+RA+WL VLKEL+ + HPLYFEV E AL EESFEAN+RD+TK+IHGY KQ +L
Sbjct: 129  WVKIMKPRRADWLVVLKELKIMEHPLYFEVAELALLEESFEANIRDFTKIIHGYGKQKRL 188

Query: 866  QEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSM 1045
            QEAEN L+AMK RGFICDQVTLT ++HMYSKA NLKLAEE+FEE+KLLG  LD+RSYGSM
Sbjct: 189  QEAENILVAMKRRGFICDQVTLTTMVHMYSKAGNLKLAEETFEEIKLLGQQLDKRSYGSM 248

Query: 1046 VMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGI 1225
            +MAYIR+G  ++GE++L+EM++QEIYAG EVYKALLRAYSM+GD+ GAQRVFD IQLAGI
Sbjct: 249  IMAYIRSGTPEQGEALLREMDSQEIYAGSEVYKALLRAYSMLGDANGAQRVFDTIQLAGI 308

Query: 1226 IPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALD 1405
             PD ++CGLLINAY +AGQ+ +A IAF+N+R+AGLEP+DKCVALV+AAYEK+NKL  ALD
Sbjct: 309  SPDARMCGLLINAYQLAGQSDKAHIAFENMRRAGLEPSDKCVALVVAAYEKQNKLNKALD 368

Query: 1406 FLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
            FL++LE+DG+++ KEAS +L +WF+KLGVVE+VE+VLR++A++
Sbjct: 369  FLMELERDGIVVGKEASGILAQWFKKLGVVEQVELVLREFAAK 411


>emb|CBI38862.3| unnamed protein product [Vitis vinifera]
          Length = 353

 Score =  485 bits (1248), Expect = e-134
 Identities = 239/339 (70%), Positives = 294/339 (86%)
 Frame = +2

Query: 518  EKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAAWVKS 697
            EK R++W++IG N+TE QK  I+QI  KM  RC+AL+KQIICFSPE  S+S +LAAWVK 
Sbjct: 5    EKKRYKWIEIGPNITEAQKMTISQISLKMTKRCKALVKQIICFSPEERSLSDLLAAWVKI 64

Query: 698  MNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKLQEAE 877
            M P+RA+WLSVLKEL +L+HPL  EV E AL EESFEAN+RDYTK+I GY KQ +LQ+AE
Sbjct: 65   MKPRRADWLSVLKELGRLDHPLLLEVAELALLEESFEANIRDYTKIIDGYGKQNRLQDAE 124

Query: 878  NTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSMVMAY 1057
            NTL AMK RGFICDQVTLTA+I+MYSKA NL+LAE++FEE+KLLG PLD+RSYGSM+MAY
Sbjct: 125  NTLSAMKRRGFICDQVTLTAMINMYSKAGNLELAEKTFEEIKLLGHPLDKRSYGSMIMAY 184

Query: 1058 IRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGIIPDV 1237
            IRAGM D+GE ++KEMEA+EIYAGREVYKALLRAYS   D+ GAQRVFDAIQ AGI PDV
Sbjct: 185  IRAGMPDQGEILVKEMEAKEIYAGREVYKALLRAYSNTSDAEGAQRVFDAIQFAGISPDV 244

Query: 1238 KVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALDFLIK 1417
            K+C LLINAY +AGQT++A +AF+N+R++GL+PNDK +AL+LAAYEKENKL  ALDFLI 
Sbjct: 245  KLCALLINAYRVAGQTQKAHVAFENMRRSGLKPNDKSIALMLAAYEKENKLNKALDFLID 304

Query: 1418 LEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
            LE+DG+++ KEASELL  WF++LGVV+EVE+VLR+Y+++
Sbjct: 305  LERDGIVLGKEASELLAAWFQRLGVVKEVELVLREYSAK 343


>ref|XP_008438151.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Cucumis melo] gi|659075453|ref|XP_008438152.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970 [Cucumis melo]
          Length = 404

 Score =  483 bits (1242), Expect = e-133
 Identities = 227/343 (66%), Positives = 289/343 (84%)
 Frame = +2

Query: 506  EEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAA 685
            E + EKPRFRWV++G N+TE QKQAI+Q+PPKM  +C+A++KQIICFSP+ G +S MLAA
Sbjct: 58   ESEREKPRFRWVEVGYNITETQKQAISQLPPKMTKKCKAVMKQIICFSPQKGELSDMLAA 117

Query: 686  WVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKL 865
            WV+ M P+RA+WLSVLK L  LNHPLY +V E AL E +FEAN RDYTK+IH Y KQ +L
Sbjct: 118  WVRIMKPERADWLSVLKHLRILNHPLYIQVAEAALVEITFEANTRDYTKIIHHYGKQNQL 177

Query: 866  QEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSM 1045
            ++AE  LL M+ RGF CDQ+TLT +IH+YSKAD LKLA+++FEE+KLL   LD+RSYG+M
Sbjct: 178  EDAEKVLLTMRERGFACDQITLTTMIHIYSKADKLKLAKQTFEELKLLEQSLDKRSYGAM 237

Query: 1046 VMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGI 1225
            +MAY+RAG+ + GE +LKEM+A++IYAG EVYKALLRAYSM GD+ GAQRVFDAIQLA I
Sbjct: 238  IMAYVRAGLPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMAGDAEGAQRVFDAIQLAAI 297

Query: 1226 IPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALD 1405
             PD K+CGLL+NAY+MAGQ+R+A IAFDN+R+AG+EP+DKC+AL L+AYEKEN+L  AL+
Sbjct: 298  PPDEKLCGLLMNAYLMAGQSRKAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLNAALE 357

Query: 1406 FLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
             LI LEKD VM+ KEAS++L  W ++LGVVEE+EIVLR+Y ++
Sbjct: 358  LLIDLEKDNVMVGKEASQILAAWLKRLGVVEEIEIVLREYTAK 400


>ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Cucumis sativus] gi|778677424|ref|XP_011650789.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970 [Cucumis sativus] gi|700201459|gb|KGN56592.1|
            hypothetical protein Csa_3G126080 [Cucumis sativus]
          Length = 404

 Score =  482 bits (1241), Expect = e-133
 Identities = 227/343 (66%), Positives = 289/343 (84%)
 Frame = +2

Query: 506  EEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENGSISFMLAA 685
            E + EKPRFRWV++G ++TE QKQAI+Q+PPKM  RC+A++KQIICFSP+ G +S MLAA
Sbjct: 58   ESEREKPRFRWVEVGYDITETQKQAISQLPPKMTKRCKAVMKQIICFSPQKGELSDMLAA 117

Query: 686  WVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIHGYAKQTKL 865
            WV+ M P+RA+WL VLK L  LNHPLY +V E AL E +FEAN RDYTK+IH Y KQ +L
Sbjct: 118  WVRIMKPERADWLLVLKHLRILNHPLYIQVAEAALEEITFEANTRDYTKIIHHYGKQNQL 177

Query: 866  QEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPLDRRSYGSM 1045
            ++AE  LL+M+ RGF+CDQ+TLT +IH+YSKAD L LA+++FEE+KLL  PLD+RS+G+M
Sbjct: 178  EDAEKVLLSMRERGFVCDQITLTTMIHIYSKADKLNLAKQTFEELKLLEQPLDKRSFGAM 237

Query: 1046 VMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVFDAIQLAGI 1225
            +MAY+RAG  + GE +LKEM+A++IYAG EVYKALLRAYSM+G++ GAQRVFDAIQLA I
Sbjct: 238  IMAYVRAGFPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMVGNAEGAQRVFDAIQLAAI 297

Query: 1226 IPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKENKLKIALD 1405
             PD K+CGLLINAY+MAGQ+REA IAFDN+R+AG+EP+DKC+AL L+AYEKEN+L  AL+
Sbjct: 298  TPDEKLCGLLINAYLMAGQSREAQIAFDNMRRAGIEPSDKCIALALSAYEKENRLNSALE 357

Query: 1406 FLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYASR 1534
             LI LEKD VM+ KEAS++L  W ++LGVVEEVEIVLR+Y  +
Sbjct: 358  LLIDLEKDNVMVGKEASKILAAWLKRLGVVEEVEIVLREYTEK 400


>ref|XP_002320730.1| hypothetical protein POPTR_0014s06610g [Populus trichocarpa]
            gi|222861503|gb|EEE99045.1| hypothetical protein
            POPTR_0014s06610g [Populus trichocarpa]
          Length = 407

 Score =  481 bits (1238), Expect = e-132
 Identities = 247/410 (60%), Positives = 311/410 (75%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            M  +  +I    +P C                NS  + P      KS ++P+L   ++N 
Sbjct: 1    MATYVINILPFSSPTCPLHSEPKKTSNLHFLGNSLCQQPVTLTSCKSQIQPVLA--AINV 58

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             E VE   E  +EKP+FRWV+IG N+ EEQKQAI+Q+P KM  RC+AL++QIICF+ + G
Sbjct: 59   EEKVE--GEIGKEKPKFRWVEIGPNIPEEQKQAISQLPFKMTKRCKALMRQIICFNDKKG 116

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+  +L+AWVK M P+R +WLS+LKEL K+ HPLY EVVE AL EESFEANVRDYTK+IH
Sbjct: 117  SLRGLLSAWVKIMKPRRKDWLSILKELNKMEHPLYLEVVEIALLEESFEANVRDYTKIIH 176

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
             Y    +L+EAE T LAM+ RGF+ DQVTLTA+IHMYSK  NL LAEE+FEE+KLLG PL
Sbjct: 177  FYGMNNQLEEAERTRLAMEERGFVSDQVTLTAMIHMYSKGGNLTLAEETFEELKLLGQPL 236

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            DRRSYGSM+MAYIRAGM ++GE +L+EM+AQEI AG EVYKALLRAYS+IGD+ GAQRVF
Sbjct: 237  DRRSYGSMIMAYIRAGMPEKGEMILREMDAQEIRAGSEVYKALLRAYSIIGDADGAQRVF 296

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            DAIQLAGI PD + C +L+NAY MAGQ++ A   F+N+ +AG+EP D+CVALVLAAYEKE
Sbjct: 297  DAIQLAGIPPDDRTCAVLLNAYGMAGQSQNAYATFENMWRAGIEPTDRCVALVLAAYEKE 356

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYAS 1531
            NKL  ALDFLI LE++ ++I KEASE+L +WF +LGVV+EVE+VLR+YA+
Sbjct: 357  NKLNQALDFLIGLEREKLIIGKEASEVLAEWFGRLGVVKEVELVLREYAA 406


>ref|XP_012475249.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Gossypium raimondii] gi|763757464|gb|KJB24795.1|
            hypothetical protein B456_004G161000 [Gossypium
            raimondii]
          Length = 400

 Score =  480 bits (1236), Expect = e-132
 Identities = 238/367 (64%), Positives = 299/367 (81%), Gaps = 6/367 (1%)
 Frame = +2

Query: 452  PLLVVKSVNDGENVEYSKE------EDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENR 613
            P L +K      +  +S E      + EEK RF+WV+IG  +TEEQ+QAI ++P KM  R
Sbjct: 34   PSLSLKQAMKPSSCTFSNEPQISFIDAEEKRRFKWVEIGPGITEEQRQAIDKLPFKMTKR 93

Query: 614  CRALLKQIICFSPENGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALT 793
            C+AL+KQIICF+PE GS+  +L AWV  M P+RA+WL VLKEL+ + HPLYF+V E AL 
Sbjct: 94   CKALMKQIICFNPEKGSLEDLLGAWVNVMKPRRADWLVVLKELKIMEHPLYFQVAEIALL 153

Query: 794  EESFEANVRDYTKMIHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLK 973
            EE+FEAN+RDYTK+IHGY KQ +L+EAEN L AMK RGFICDQVTLT ++HMYSKA NLK
Sbjct: 154  EETFEANIRDYTKIIHGYGKQNRLREAENILDAMKRRGFICDQVTLTTMVHMYSKAGNLK 213

Query: 974  LAEESFEEMKLLGTPLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALL 1153
            LAE++FEE+KLLG  LD+RSYG+M+MAYIRAGM ++GE +LKEM+  EIYAG EVYKALL
Sbjct: 214  LAEDTFEEIKLLGQQLDKRSYGAMIMAYIRAGMPEQGEGLLKEMDNLEIYAGSEVYKALL 273

Query: 1154 RAYSMIGDSLGAQRVFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLE 1333
            RAYS  GD+ GAQRVF AIQLAGI PD K+CGLLINAY +AGQ+ EA +AF+N+R+AGLE
Sbjct: 274  RAYSTNGDTDGAQRVFGAIQLAGISPDAKLCGLLINAYQVAGQSEEARVAFENMRRAGLE 333

Query: 1334 PNDKCVALVLAAYEKENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIV 1513
            P+DKCVALVLAAYEK+NKL  AL+FL+ LE+DG+++ KEAS +L +WF+KLGVVE+VE V
Sbjct: 334  PSDKCVALVLAAYEKQNKLNKALEFLMDLERDGIVVGKEASSILAQWFKKLGVVEQVEQV 393

Query: 1514 LRDYASR 1534
            LR++A++
Sbjct: 394  LREFAAK 400


>ref|XP_011034479.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Populus euphratica] gi|743788799|ref|XP_011034485.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970 [Populus euphratica]
          Length = 407

 Score =  477 bits (1228), Expect = e-131
 Identities = 247/410 (60%), Positives = 310/410 (75%)
 Frame = +2

Query: 302  MGYFATSIFCLGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLVVKSVND 481
            M  +  +I    +P C                NS  + P      KS ++P+L   ++N 
Sbjct: 1    MATYVINILPFSSPTCPLYSEPKKTSNLHFLGNSLCQQPVTLTSWKSQIQPVLA--AINV 58

Query: 482  GENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIICFSPENG 661
             E VE   E  +EKP+FRWV+IG N+ EEQKQAI+Q+P KM  RC+AL++QIICF+ + G
Sbjct: 59   EEKVE--GEIGKEKPKFRWVEIGPNIPEEQKQAISQLPFKMTKRCKALMRQIICFNDKKG 116

Query: 662  SISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRDYTKMIH 841
            S+  +L+AWVK M P+R +WLS+LKEL K+ HPLY EV E AL EESFEANVRDYTK+IH
Sbjct: 117  SLPDLLSAWVKIMKPRRKDWLSILKELNKMAHPLYLEVAEIALLEESFEANVRDYTKIIH 176

Query: 842  GYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMKLLGTPL 1021
             Y    +L+EAE T LAM+ RGF+ DQVTLTA+IHMYS A NL LAEE+FEE+KLLG PL
Sbjct: 177  FYGMNNQLEEAERTRLAMEERGFVSDQVTLTAMIHMYSNAGNLTLAEETFEELKLLGQPL 236

Query: 1022 DRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSLGAQRVF 1201
            DRRSYGSM+MAYIRAGM ++GE +L+EM+AQEI AG EVYKALLRAYS+IGD+ GAQRVF
Sbjct: 237  DRRSYGSMIMAYIRAGMPEKGEMILREMDAQEIRAGSEVYKALLRAYSIIGDADGAQRVF 296

Query: 1202 DAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVLAAYEKE 1381
            DAIQLAGI PD + C +L+NAY MAGQ++ A   F+N+ +AG+EP+D+CVALVLAAYEKE
Sbjct: 297  DAIQLAGIPPDDRTCAVLLNAYGMAGQSQNAHATFENMWRAGIEPSDRCVALVLAAYEKE 356

Query: 1382 NKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYAS 1531
            NKL  ALDFLI LE+D + I KEASE+L +WF +LGVV+EVE+VLR+YA+
Sbjct: 357  NKLIQALDFLIGLERDKLTIGKEASEVLAEWFGRLGVVKEVELVLREYAA 406


>ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citrus clementina]
            gi|568875716|ref|XP_006490938.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g01970-like [Citrus sinensis]
            gi|557547498|gb|ESR58476.1| hypothetical protein
            CICLE_v10020287mg [Citrus clementina]
            gi|641867169|gb|KDO85853.1| hypothetical protein
            CISIN_1g014507mg [Citrus sinensis]
          Length = 423

 Score =  477 bits (1228), Expect = e-131
 Identities = 238/369 (64%), Positives = 295/369 (79%), Gaps = 7/369 (1%)
 Frame = +2

Query: 449  RPLLVVKSVNDGENVEYSKEEDEEKPR-------FRWVKIGSNMTEEQKQAIAQIPPKME 607
            +P    +S+  G  +E  K ED  K +       F W++IG N+TEEQKQAI+Q P KM 
Sbjct: 41   KPAKWSRSLRSGPALEAIKAEDMGKTQVKDDTSMFTWIQIGPNITEEQKQAISQFPRKMT 100

Query: 608  NRCRALLKQIICFSPENGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHA 787
             RC+A +KQIIC SPE G++S +LAAWV+ M P+RA+WL+VLK+L+ + HPLY +V E A
Sbjct: 101  KRCKAFVKQIICVSPETGNLSDLLAAWVRFMKPRRADWLAVLKQLKLMEHPLYLQVAELA 160

Query: 788  LTEESFEANVRDYTKMIHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADN 967
            L EESFEAN+RDYTK+IHGY K+ ++Q AENTLLAMK RGFICDQVTLT ++ MYSKA N
Sbjct: 161  LLEESFEANIRDYTKIIHGYGKKMQIQNAENTLLAMKRRGFICDQVTLTVMVVMYSKAGN 220

Query: 968  LKLAEESFEEMKLLGTPLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKA 1147
            LK+AEE+FEE+KLLG PLD+RSYGSMVMAY+RAGMLDRGE +L+EM+AQE+Y G EVYKA
Sbjct: 221  LKMAEETFEEIKLLGEPLDKRSYGSMVMAYVRAGMLDRGEVLLREMDAQEVYVGSEVYKA 280

Query: 1148 LLRAYSMIGDSLGAQRVFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAG 1327
            LLR YSM G+S GAQRVF+AIQ AGI PD ++C LLINAY MAGQ+++A  AF N+R+AG
Sbjct: 281  LLRGYSMNGNSEGAQRVFEAIQFAGITPDARMCALLINAYQMAGQSQKAYTAFQNMRKAG 340

Query: 1328 LEPNDKCVALVLAAYEKENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVE 1507
            LEP+DKCVAL+L+A EKEN+L  AL+FLI LE+DG M+ KEAS  L  WF++LGVVEEVE
Sbjct: 341  LEPSDKCVALILSACEKENQLNRALEFLIDLERDGFMVGKEASCTLAAWFKRLGVVEEVE 400

Query: 1508 IVLRDYASR 1534
             VLR+Y  R
Sbjct: 401  HVLREYGLR 409


>ref|XP_012083560.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970
            [Jatropha curcas] gi|643717117|gb|KDP28743.1|
            hypothetical protein JCGZ_14514 [Jatropha curcas]
          Length = 404

 Score =  475 bits (1222), Expect = e-131
 Identities = 233/362 (64%), Positives = 300/362 (82%), Gaps = 2/362 (0%)
 Frame = +2

Query: 452  PLLVVKSVNDGENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLK 631
            P+L   S  +   VE      EEK  F+WVKI  N+TE QKQA++++PPKM NRC+A++K
Sbjct: 47   PVLAAVSTEEIGRVEVK----EEKSSFKWVKIDPNITEPQKQAVSELPPKMTNRCKAIMK 102

Query: 632  QIICFS--PENGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESF 805
            QIIC+S   +N S+S +L AWV+ M P+R +WLSVL++L+K+ HPLYFEV E AL EESF
Sbjct: 103  QIICYSHQAQNASLSDLLGAWVRLMKPRRTDWLSVLRQLKKMEHPLYFEVAELALLEESF 162

Query: 806  EANVRDYTKMIHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEE 985
            EANVRDYTK+IH Y K+ ++Q AEN LLAM+ RGF+ DQVTLTA+I MY KA NLK AEE
Sbjct: 163  EANVRDYTKVIHCYGKENQIQNAENILLAMRKRGFVIDQVTLTAMISMYGKAGNLKQAEE 222

Query: 986  SFEEMKLLGTPLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYS 1165
            +FEE+KLLG PLD+RSYG+M+M +IRAGM ++GE +L+EM+AQEI AG EVYKALLRAYS
Sbjct: 223  TFEELKLLGYPLDKRSYGAMIMTHIRAGMPEKGEVLLREMDAQEICAGSEVYKALLRAYS 282

Query: 1166 MIGDSLGAQRVFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDK 1345
            M+G++ GAQRVFDAIQ AGI PDVK+CGLLINAY MAG++R+A IAF+N+R+AGLEP+DK
Sbjct: 283  MVGNADGAQRVFDAIQFAGIPPDVKLCGLLINAYQMAGESRKAQIAFENMRRAGLEPSDK 342

Query: 1346 CVALVLAAYEKENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDY 1525
            C+AL+LAAYEKEN L  AL+FL++LE++G+M+ KEASE+L  WFR+LGV++EVE+VLR+Y
Sbjct: 343  CIALLLAAYEKENNLNEALNFLMRLEREGIMVGKEASEILACWFRRLGVLKEVELVLREY 402

Query: 1526 AS 1531
             +
Sbjct: 403  VA 404


>gb|KHG08668.1| hypothetical protein F383_35923 [Gossypium arboreum]
          Length = 400

 Score =  474 bits (1220), Expect = e-130
 Identities = 236/367 (64%), Positives = 295/367 (80%), Gaps = 6/367 (1%)
 Frame = +2

Query: 452  PLLVVKSVNDGENVEYSKE------EDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENR 613
            P L +K      +  +S E      + EEK RF+WV+IG  +TEEQ+QAI ++P KM  R
Sbjct: 34   PSLSLKQATKPSSCTFSNEPQIAFIDAEEKRRFKWVEIGPGITEEQRQAIDKLPFKMTKR 93

Query: 614  CRALLKQIICFSPENGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALT 793
            C+AL+KQIICF+PE GS+  +L  WV  M P+RA+WL VLKEL+   HPLYF+V E AL 
Sbjct: 94   CKALMKQIICFNPEKGSLENLLGTWVNVMKPRRADWLVVLKELKITEHPLYFQVAEIALL 153

Query: 794  EESFEANVRDYTKMIHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLK 973
            EE+FEAN+RDYTK+IHGY KQ +L+EAEN L AMK RGFICDQVTLT ++HMYSKA NLK
Sbjct: 154  EETFEANIRDYTKIIHGYGKQNRLREAENILDAMKRRGFICDQVTLTTMVHMYSKAGNLK 213

Query: 974  LAEESFEEMKLLGTPLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALL 1153
            LAE++FEE+KLLG  LD+RSYG M+MAYIRAGM ++GE +LKEM+  EIYAG EVYKALL
Sbjct: 214  LAEDTFEEIKLLGQQLDKRSYGGMIMAYIRAGMPEQGEGLLKEMDNLEIYAGSEVYKALL 273

Query: 1154 RAYSMIGDSLGAQRVFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLE 1333
            RAYS  G + GAQRVF AIQLAGI PD K+CGLLINAY +AG++ EA +AF+N+R+AGLE
Sbjct: 274  RAYSTNGYTDGAQRVFGAIQLAGISPDAKLCGLLINAYQVAGESEEARVAFENMRRAGLE 333

Query: 1334 PNDKCVALVLAAYEKENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIV 1513
            P+DKCVALVLAAYEK+NKL  AL+FL+ LE+DG+++ KEAS LL +WF+KLGVVE+VE V
Sbjct: 334  PSDKCVALVLAAYEKQNKLNKALEFLMDLERDGIVVGKEASSLLAQWFKKLGVVEQVEQV 393

Query: 1514 LRDYASR 1534
            LR++A++
Sbjct: 394  LREFAAK 400


>ref|XP_003623723.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355498738|gb|AES79941.1| PPR containing plant-like
            protein [Medicago truncatula]
          Length = 426

 Score =  474 bits (1219), Expect = e-130
 Identities = 243/416 (58%), Positives = 318/416 (76%), Gaps = 1/416 (0%)
 Frame = +2

Query: 287  DFHFMMGYFATSIFC-LGNPICMQTQNLLNAYQFPLWKNSFLRNPKAFRLDKSDLRPLLV 463
            D +F++G + +++ C   NP      ++   YQ    +NS  + P    + K     +LV
Sbjct: 12   DPNFIIGTYCSNLVCNFYNP----NYSITKLYQIHNKRNSLSKKPSYLDIHKHHFDSVLV 67

Query: 464  VKSVNDGENVEYSKEEDEEKPRFRWVKIGSNMTEEQKQAIAQIPPKMENRCRALLKQIIC 643
              SV   E VE   E   +K  FRW +I +++TEEQKQAIA++P +ME RC+A+++QIIC
Sbjct: 68   --SVGTEEIVEEVIEGSYKK--FRWNEIRNDITEEQKQAIAKLPFRMEKRCKAVMRQIIC 123

Query: 644  FSPENGSISFMLAAWVKSMNPQRANWLSVLKELEKLNHPLYFEVVEHALTEESFEANVRD 823
            FS E G +  +L AWV+ M P RA+WLSVLKEL+ ++HPLY EV EHAL EESFE N+RD
Sbjct: 124  FSEEKGRLCDVLRAWVEIMKPTRADWLSVLKELKNMDHPLYLEVAEHALVEESFEPNLRD 183

Query: 824  YTKMIHGYAKQTKLQEAENTLLAMKNRGFICDQVTLTALIHMYSKADNLKLAEESFEEMK 1003
            YTK+IH Y+K+ +L+ AEN    MK RGFICDQV LT ++HMYSKA +L  AEE FEE+K
Sbjct: 184  YTKLIHYYSKENQLEAAENIFTLMKQRGFICDQVILTTMVHMYSKAGHLDRAEEYFEEIK 243

Query: 1004 LLGTPLDRRSYGSMVMAYIRAGMLDRGESVLKEMEAQEIYAGREVYKALLRAYSMIGDSL 1183
            LLG PLD+RSYGSM+MAYIRAGM ++GES+L+EM+AQ+IYAG EVYKALLRAYS+IG++ 
Sbjct: 244  LLGEPLDKRSYGSMIMAYIRAGMPEKGESLLEEMDAQDIYAGSEVYKALLRAYSVIGNAE 303

Query: 1184 GAQRVFDAIQLAGIIPDVKVCGLLINAYVMAGQTREAVIAFDNLRQAGLEPNDKCVALVL 1363
            GAQRVFDAIQLAGIIPD K+C LLI AY MAGQ+++A IAF+N+++AG+EP DKC++ VL
Sbjct: 304  GAQRVFDAIQLAGIIPDDKMCSLLIYAYSMAGQSQKARIAFENMKRAGIEPTDKCISSVL 363

Query: 1364 AAYEKENKLKIALDFLIKLEKDGVMIEKEASELLVKWFRKLGVVEEVEIVLRDYAS 1531
             AYEKEN L  AL+FLI+LE+DG+M+++E S +L  WFRKLGVVEEVE+VLRD+A+
Sbjct: 364  VAYEKENMLNTALEFLIELERDGIMVKEETSRILAGWFRKLGVVEEVELVLRDFAT 419

