BLASTX nr result

ID: Sinomenium21_contig00013481 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00013481
         (2727 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI32743.3| unnamed protein product [Vitis vinifera]             1010   0.0  
ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containi...  1010   0.0  
gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]     964   0.0  
ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfam...   958   0.0  
ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   953   0.0  
ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containi...   953   0.0  
ref|XP_002530985.1| pentatricopeptide repeat-containing protein,...   936   0.0  
ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containi...   921   0.0  
ref|XP_002315730.1| pentatricopeptide repeat-containing family p...   920   0.0  
ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containi...   910   0.0  
ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citr...   907   0.0  
ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containi...   905   0.0  
ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containi...   902   0.0  
ref|XP_007225233.1| hypothetical protein PRUPE_ppa001877mg [Prun...   899   0.0  
ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containi...   892   0.0  
ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containi...   892   0.0  
ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutr...   887   0.0  
ref|XP_002881498.1| pentatricopeptide repeat-containing protein ...   882   0.0  
ref|XP_007158766.1| hypothetical protein PHAVU_002G180100g [Phas...   881   0.0  
ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Caps...   880   0.0  

>emb|CBI32743.3| unnamed protein product [Vitis vinifera]
          Length = 772

 Score = 1010 bits (2612), Expect = 0.0
 Identities = 526/737 (71%), Positives = 592/737 (80%), Gaps = 17/737 (2%)
 Frame = +3

Query: 222  WKSRALFHSRVSRITNPSPLYSLHGFCSNTKE------------ENGVSNSPGD----AA 353
            WK R      +S  +NPS L  +  F S  +             E  VS SP +     A
Sbjct: 12   WKPRLF----ISGASNPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTA 67

Query: 354  PSTGKVNSPRRTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVL 533
               G+  SPR TPRGK  NPEK EDII RMM+NRAWTTRLQNSIRSLVPQFDHSLV NVL
Sbjct: 68   AEAGEKASPR-TPRGKLRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVL 126

Query: 534  NGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERD 710
            +G+R  DHALQFFRWVE+ G +RHDR THLKIIEILG AS LNHARCIL DMP+KGVE D
Sbjct: 127  HGSRNSDHALQFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWD 186

Query: 711  EDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFN 890
            EDLFV+LI SYGKAGIVQESVK+FQKMKELGVERTI +YD LFKVILRRGR +MAKRYFN
Sbjct: 187  EDLFVLLIDSYGKAGIVQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFN 246

Query: 891  AMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVR 1070
            AML +GV+PT HT+NIMIWGFFLSLKVETANRFFE+MK R I+PDVVTYN MINGY R++
Sbjct: 247  AMLNEGVMPTCHTYNIMIWGFFLSLKVETANRFFEEMKERRISPDVVTYNTMINGYYRIK 306

Query: 1071 XXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYS 1250
                           ++ PTVISYTTMIKGYVS GRVDDGLRL EEM++  I+PNAVTYS
Sbjct: 307  KMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIKPNAVTYS 366

Query: 1251 TLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVK 1430
            TLLPGLCD EKM EAQ  + EMV+R I PKDNSIF+RLI+CQCK+G+LD A DVLK M++
Sbjct: 367  TLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNSIFMRLITCQCKAGQLDAAADVLKAMIR 426

Query: 1431 LSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYL 1610
            LSIP E  HYGVL+ENFC++G YDRA+KLLD+++  E +L PQ SLEME S YN +IEYL
Sbjct: 427  LSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKLIEKEIILRPQNSLEMESSGYNLIIEYL 486

Query: 1611 CNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADAL 1790
            CN   T+KAET FRQLMK GVQDP +FNNL+ GHSKEG PESA EILKIM RR++P +A 
Sbjct: 487  CNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRGHSKEGAPESAFEILKIMGRREVPREAD 546

Query: 1791 AYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSM 1970
            AYR LI SFLKKGEPADAK ALDGMIE+GH+PDS LFR VMESLFEDGR+QTASRVM +M
Sbjct: 547  AYRLLIESFLKKGEPADAKTALDGMIENGHIPDSSLFRSVMESLFEDGRIQTASRVMNNM 606

Query: 1971 IEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAA 2150
            +EKGVKE+MDLVAKILEALL+RGHVEEALGRI LLM+N C PDFD LL+VLC K KTIAA
Sbjct: 607  VEKGVKENMDLVAKILEALLLRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCAKGKTIAA 666

Query: 2151 LKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRS 2330
            LKLLDFGLERD NISF+SY+ VLD+LL AGKTLNAYSILCK+M+KGG  D S   DLIRS
Sbjct: 667  LKLLDFGLERDYNISFSSYENVLDALLTAGKTLNAYSILCKIMQKGGATDWSSCKDLIRS 726

Query: 2331 LNAQGNTKQADILSRMI 2381
            LN +GNTKQADILSRMI
Sbjct: 727  LNEEGNTKQADILSRMI 743


>ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vitis vinifera]
          Length = 763

 Score = 1010 bits (2612), Expect = 0.0
 Identities = 526/737 (71%), Positives = 592/737 (80%), Gaps = 17/737 (2%)
 Frame = +3

Query: 222  WKSRALFHSRVSRITNPSPLYSLHGFCSNTKE------------ENGVSNSPGD----AA 353
            WK R      +S  +NPS L  +  F S  +             E  VS SP +     A
Sbjct: 12   WKPRLF----ISGASNPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTA 67

Query: 354  PSTGKVNSPRRTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVL 533
               G+  SPR TPRGK  NPEK EDII RMM+NRAWTTRLQNSIRSLVPQFDHSLV NVL
Sbjct: 68   AEAGEKASPR-TPRGKLRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVL 126

Query: 534  NGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERD 710
            +G+R  DHALQFFRWVE+ G +RHDR THLKIIEILG AS LNHARCIL DMP+KGVE D
Sbjct: 127  HGSRNSDHALQFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWD 186

Query: 711  EDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFN 890
            EDLFV+LI SYGKAGIVQESVK+FQKMKELGVERTI +YD LFKVILRRGR +MAKRYFN
Sbjct: 187  EDLFVLLIDSYGKAGIVQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFN 246

Query: 891  AMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVR 1070
            AML +GV+PT HT+NIMIWGFFLSLKVETANRFFE+MK R I+PDVVTYN MINGY R++
Sbjct: 247  AMLNEGVMPTCHTYNIMIWGFFLSLKVETANRFFEEMKERRISPDVVTYNTMINGYYRIK 306

Query: 1071 XXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYS 1250
                           ++ PTVISYTTMIKGYVS GRVDDGLRL EEM++  I+PNAVTYS
Sbjct: 307  KMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIKPNAVTYS 366

Query: 1251 TLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVK 1430
            TLLPGLCD EKM EAQ  + EMV+R I PKDNSIF+RLI+CQCK+G+LD A DVLK M++
Sbjct: 367  TLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNSIFMRLITCQCKAGQLDAAADVLKAMIR 426

Query: 1431 LSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYL 1610
            LSIP E  HYGVL+ENFC++G YDRA+KLLD+++  E +L PQ SLEME S YN +IEYL
Sbjct: 427  LSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKLIEKEIILRPQNSLEMESSGYNLIIEYL 486

Query: 1611 CNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADAL 1790
            CN   T+KAET FRQLMK GVQDP +FNNL+ GHSKEG PESA EILKIM RR++P +A 
Sbjct: 487  CNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRGHSKEGAPESAFEILKIMGRREVPREAD 546

Query: 1791 AYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSM 1970
            AYR LI SFLKKGEPADAK ALDGMIE+GH+PDS LFR VMESLFEDGR+QTASRVM +M
Sbjct: 547  AYRLLIESFLKKGEPADAKTALDGMIENGHIPDSSLFRSVMESLFEDGRIQTASRVMNNM 606

Query: 1971 IEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAA 2150
            +EKGVKE+MDLVAKILEALL+RGHVEEALGRI LLM+N C PDFD LL+VLC K KTIAA
Sbjct: 607  VEKGVKENMDLVAKILEALLLRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCAKGKTIAA 666

Query: 2151 LKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRS 2330
            LKLLDFGLERD NISF+SY+ VLD+LL AGKTLNAYSILCK+M+KGG  D S   DLIRS
Sbjct: 667  LKLLDFGLERDYNISFSSYENVLDALLTAGKTLNAYSILCKIMQKGGATDWSSCKDLIRS 726

Query: 2331 LNAQGNTKQADILSRMI 2381
            LN +GNTKQADILSRMI
Sbjct: 727  LNEEGNTKQADILSRMI 743


>gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]
          Length = 768

 Score =  964 bits (2491), Expect = 0.0
 Identities = 498/746 (66%), Positives = 584/746 (78%), Gaps = 19/746 (2%)
 Frame = +3

Query: 201  SMAQHLLWKSRALFHSRVSRIT-NPSPLYSLHGFCSNTKEENGVSNS--------PGDAA 353
            ++++   W++RAL    + RI+ NPS ++ L  F ++ + E   + +        P    
Sbjct: 5    ALSKRWQWRARAL--PNLPRISHNPSSIHHLRLFTASQEGEEDPAPTTEKSPDPVPNPDC 62

Query: 354  PSTGKVNSPR---------RTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQF 506
            P +   N P+         RTPRGK  NPEK EDII RMM+NRAWTTRLQNSIR LVPQF
Sbjct: 63   PPSESPNPPKSRPENTAIQRTPRGKSRNPEKIEDIICRMMANRAWTTRLQNSIRRLVPQF 122

Query: 507  DHSLVLNVLNGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFD 683
            DHSLV NVL+GAR  DHALQFFRWVE++G + HDR THLKIIEIL  AS LNHARCIL D
Sbjct: 123  DHSLVWNVLHGARNSDHALQFFRWVERSGLFNHDRETHLKIIEILTRASKLNHARCILLD 182

Query: 684  MPQKGVERDEDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGR 863
            MP+K V+ DEDLFV+ I  YGKAGIVQESV++F KMKELGVER++ +YD LFKVILRRGR
Sbjct: 183  MPKKSVQWDEDLFVLFIDGYGKAGIVQESVRMFNKMKELGVERSVKSYDALFKVILRRGR 242

Query: 864  VLMAKRYFNAMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNN 1043
             +MAKRYFNAM+ +G+ PT+HT+NIM+WGFFLSL++ETA RF+EDMK+RG+ PDVVTYN 
Sbjct: 243  YMMAKRYFNAMINEGIEPTKHTYNIMLWGFFLSLRLETAKRFYEDMKNRGVWPDVVTYNT 302

Query: 1044 MINGYCRVRXXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHK 1223
            MINGY R +               ++ PTVISYTTMIKGYVS GRVDDGLRL EEM++  
Sbjct: 303  MINGYNRFKMMDEAEKMFVEMKGRNIAPTVISYTTMIKGYVSIGRVDDGLRLFEEMKSFG 362

Query: 1224 IRPNAVTYSTLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWA 1403
            I+PNAVTY+TLLPGLCDAEKM EA+  L EMV R I PKDNSIFLRL+S QCK G LD A
Sbjct: 363  IKPNAVTYTTLLPGLCDAEKMSEARTMLKEMVDRYIAPKDNSIFLRLLSSQCKVGDLDAA 422

Query: 1404 EDVLKGMVKLSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDS 1583
             DVLK M++LSIP E  HYG+L+ENFC+A  YDRA+KLLD+++  E +L PQ S EME S
Sbjct: 423  ADVLKAMIRLSIPTEAGHYGILIENFCKAAVYDRAVKLLDKLIEKEIVLRPQSSTEMEAS 482

Query: 1584 AYNPVIEYLCNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMS 1763
            AYN +I++LCNH  T KAE FFRQLMK GVQDP +FNNL+ GHSKEGNP+SA EILKIM 
Sbjct: 483  AYNAMIQFLCNHGQTGKAEIFFRQLMKKGVQDPVAFNNLIRGHSKEGNPDSAFEILKIMG 542

Query: 1764 RRKIPADALAYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQ 1943
            RR +  DA +YR LI S+L KGEPADAK ALD MIE+ HLP+S LFR VMESL+EDGR Q
Sbjct: 543  RRGVARDADSYRLLIKSYLSKGEPADAKTALDSMIENDHLPESSLFRSVMESLYEDGRAQ 602

Query: 1944 TASRVMKSMIEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVL 2123
            TASRVMKSMIEKGVKE+MDLVAKILEALL+RGHVEEALGRI LLM + C P+FD LL+VL
Sbjct: 603  TASRVMKSMIEKGVKENMDLVAKILEALLVRGHVEEALGRIDLLMQSGCAPNFDSLLSVL 662

Query: 2124 CEKEKTIAALKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDL 2303
            CEK KTIAALKLLDF LERD  + F+SYDKVLD+LL AGKTLNAYSILCK+M KGG+ D 
Sbjct: 663  CEKGKTIAALKLLDFCLERDYVVDFSSYDKVLDALLAAGKTLNAYSILCKIMGKGGVTDW 722

Query: 2304 SGVGDLIRSLNAQGNTKQADILSRMI 2381
            SG  DLI+SLN +GNTKQADI+SRMI
Sbjct: 723  SGCEDLIKSLNKEGNTKQADIISRMI 748


>ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao] gi|508712488|gb|EOY04385.1| Tetratricopeptide
            repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 743

 Score =  958 bits (2477), Expect = 0.0
 Identities = 496/739 (67%), Positives = 582/739 (78%), Gaps = 3/739 (0%)
 Frame = +3

Query: 174  VSKTFYISPSMAQHLLWKSRALFHSRVSRITNPSPLYSLHGFCSNTKEENGVSNSPGDAA 353
            VSKT+ + P             +H    RI+NP     LH F + +++ +  S    +A 
Sbjct: 6    VSKTYKLKPRF-----------YH----RISNP-----LH-FFTTSQDPSTASQELNNAP 44

Query: 354  PSTG--KVNSPRRTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLN 527
            P     KV + R +PRGK  NPEK ED+I RMM NRAWTTRLQNSIR+LVP+FDH+LV N
Sbjct: 45   PQQEGEKVVTQRTSPRGKTRNPEKVEDVICRMMENRAWTTRLQNSIRALVPEFDHALVYN 104

Query: 528  VLNGARKPDHALQFFRWVEKTGY-RHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVE 704
            VL+GA+  + ALQFFRWVE+ G  RHDR  H+KII+ILG AS LNHARCIL DMP+KGVE
Sbjct: 105  VLHGAKNSEQALQFFRWVERAGLIRHDREAHMKIIQILGRASKLNHARCILLDMPKKGVE 164

Query: 705  RDEDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRY 884
             DEDLFVVLI SYGKAGIVQE+VKIFQKM ELGVERTI +YD  FKVILRRGR +MAKRY
Sbjct: 165  WDEDLFVVLIDSYGKAGIVQEAVKIFQKMNELGVERTIKSYDAFFKVILRRGRYMMAKRY 224

Query: 885  FNAMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCR 1064
            FN ML +G++PTRHT+NIM+WGFFLSL+++TANRF+EDMK+RGI+PDVVTYN MINGY R
Sbjct: 225  FNKMLSEGIVPTRHTYNIMLWGFFLSLRLDTANRFYEDMKTRGISPDVVTYNTMINGYSR 284

Query: 1065 VRXXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVT 1244
             +               +L PTVISYTTMIKGYV+  +VDDGLRL+EEM++  I+PNA T
Sbjct: 285  FKKMEEAEKLFVEMKGKNLAPTVISYTTMIKGYVAVEQVDDGLRLLEEMKSFGIKPNATT 344

Query: 1245 YSTLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGM 1424
            YSTLLPGLCDA KM EA+  L EMV+  I PKDNSIF+ L++ QCKSG LD A DVLK M
Sbjct: 345  YSTLLPGLCDAGKMTEAKSILKEMVEWYIAPKDNSIFINLLNSQCKSGDLDAAADVLKAM 404

Query: 1425 VKLSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIE 1604
            ++LSIP E  HYGVL+ENFC+A  +DRAIKLLD++V  E +L PQ SL+ME SAYN +I+
Sbjct: 405  IRLSIPTEAGHYGVLIENFCKANLFDRAIKLLDKLVEKEIILRPQNSLDMEASAYNAMIQ 464

Query: 1605 YLCNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPAD 1784
            YLC+H  T KAE FFRQLMK GV DP +FNNL+ GH+KEGNP  A EILKIM RR +P D
Sbjct: 465  YLCHHGQTGKAEVFFRQLMKKGVLDPTAFNNLIRGHAKEGNPGLAFEILKIMGRRGVPKD 524

Query: 1785 ALAYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMK 1964
            A AY+ LI S+L+KGEPADAK +LD MIE G LP+S +F+ VMESLFEDGR+QTASRVMK
Sbjct: 525  ADAYKLLIESYLRKGEPADAKTSLDSMIEDGLLPESGIFKSVMESLFEDGRIQTASRVMK 584

Query: 1965 SMIEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTI 2144
            SM+EKGVKEHMDLVAKILEALLMRGHVEEALGRI LLM N C P+ D LL+VL EK KTI
Sbjct: 585  SMVEKGVKEHMDLVAKILEALLMRGHVEEALGRIELLMQNGCAPNLDSLLSVLSEKGKTI 644

Query: 2145 AALKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLI 2324
            AALKLLDFGLERDC+I F+SY+KVLD+LL AGKTLNAYSILCK+MEKGGI + S + DLI
Sbjct: 645  AALKLLDFGLERDCSIDFSSYEKVLDALLAAGKTLNAYSILCKIMEKGGITNWSSLEDLI 704

Query: 2325 RSLNAQGNTKQADILSRMI 2381
            +SLN +GNTKQADILSRMI
Sbjct: 705  KSLNQEGNTKQADILSRMI 723


>ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g37230-like [Cucumis sativus]
          Length = 760

 Score =  953 bits (2464), Expect = 0.0
 Identities = 489/733 (66%), Positives = 577/733 (78%), Gaps = 20/733 (2%)
 Frame = +3

Query: 243  HSRV---SRITNPSPLYSLHGFCS-----NTKEENGVSNSPG---DAA-PSTGK---VNS 377
            H RV   S I+ P+ L SLH F S     +T  +NG  N P    DAA P TG+   VN 
Sbjct: 13   HYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNG 72

Query: 378  PR----RTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGAR 545
             +    R PRG+P +PEK E II +MM+NR WTTRLQNSIRSLVPQFDH+LV NVL+ A+
Sbjct: 73   VQQVKGRIPRGRPRDPEKLEXIICKMMANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAK 132

Query: 546  KPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLF 722
            K +HAL FFRWVE+ G ++HDR TH KIIEILG AS LNHARCIL DMP KGV+ DEDLF
Sbjct: 133  KSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLF 192

Query: 723  VVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLK 902
            VVLI SYGKAGIVQE+VKIFQKMKELGVER++ +YD LFK I+RRGR +MAKRYFNAML 
Sbjct: 193  VVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLN 252

Query: 903  DGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXX 1082
            +G+ P RHT+N+M+WGFFLSL++ETA RF+EDMKSRGI+PDVVTYN MINGYCR +    
Sbjct: 253  EGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEE 312

Query: 1083 XXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLP 1262
                       ++ PTVISYTTMIKGYVS  R DD LRL EEM+A   +PN +TYSTLLP
Sbjct: 313  AEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLP 372

Query: 1263 GLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIP 1442
            GLCDAEK+PEA+K L EMV R+  PKDNSIF+RL+SCQCK G LD A  VLK M++LSIP
Sbjct: 373  GLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIP 432

Query: 1443 AEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHK 1622
             E  HYG+L+EN C+AG YD+A+KLL+ +V  E +L PQ +LEME SAYN +I+YLCNH 
Sbjct: 433  TEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHG 492

Query: 1623 LTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRS 1802
             T KA+TFFRQL+K G+QD  +FNNL+ GH+KEGNP+ A E+LKIM RR +  DA +Y+ 
Sbjct: 493  QTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKL 552

Query: 1803 LIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKG 1982
            LI S+L KGEPADAK ALD MIE+GH PDS LFR VMESLF DGRVQTASRVM SM++KG
Sbjct: 553  LIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKG 612

Query: 1983 VKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLL 2162
            + E++DLVAKILEAL MRGH EEALGRI LLM+ +C PDF+ LL+VLCEK KT +A KLL
Sbjct: 613  ITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLL 672

Query: 2163 DFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQ 2342
            DFGLER+CNI F+SY+KVLD+LL AGKTLNAY+ILCK+MEKGG KD S   DLI+SLN +
Sbjct: 673  DFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQE 732

Query: 2343 GNTKQADILSRMI 2381
            GNTKQADILSRMI
Sbjct: 733  GNTKQADILSRMI 745


>ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Cucumis sativus]
          Length = 760

 Score =  953 bits (2464), Expect = 0.0
 Identities = 489/733 (66%), Positives = 577/733 (78%), Gaps = 20/733 (2%)
 Frame = +3

Query: 243  HSRV---SRITNPSPLYSLHGFCS-----NTKEENGVSNSPG---DAA-PSTGK---VNS 377
            H RV   S I+ P+ L SLH F S     +T  +NG  N P    DAA P TG+   VN 
Sbjct: 13   HYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNG 72

Query: 378  PR----RTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGAR 545
             +    R PRG+P +PEK E II +MM+NR WTTRLQNSIRSLVPQFDH+LV NVL+ A+
Sbjct: 73   VQQVKGRIPRGRPRDPEKLEKIICKMMANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAK 132

Query: 546  KPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLF 722
            K +HAL FFRWVE+ G ++HDR TH KIIEILG AS LNHARCIL DMP KGV+ DEDLF
Sbjct: 133  KSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLF 192

Query: 723  VVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLK 902
            VVLI SYGKAGIVQE+VKIFQKMKELGVER++ +YD LFK I+RRGR +MAKRYFNAML 
Sbjct: 193  VVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLN 252

Query: 903  DGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXX 1082
            +G+ P RHT+N+M+WGFFLSL++ETA RF+EDMKSRGI+PDVVTYN MINGYCR +    
Sbjct: 253  EGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEE 312

Query: 1083 XXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLP 1262
                       ++ PTVISYTTMIKGYVS  R DD LRL EEM+A   +PN +TYSTLLP
Sbjct: 313  AEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLP 372

Query: 1263 GLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIP 1442
            GLCDAEK+PEA+K L EMV R+  PKDNSIF+RL+SCQCK G LD A  VLK M++LSIP
Sbjct: 373  GLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIP 432

Query: 1443 AEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHK 1622
             E  HYG+L+EN C+AG YD+A+KLL+ +V  E +L PQ +LEME SAYN +I+YLCNH 
Sbjct: 433  TEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHG 492

Query: 1623 LTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRS 1802
             T KA+TFFRQL+K G+QD  +FNNL+ GH+KEGNP+ A E+LKIM RR +  DA +Y+ 
Sbjct: 493  QTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKL 552

Query: 1803 LIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKG 1982
            LI S+L KGEPADAK ALD MIE+GH PDS LFR VMESLF DGRVQTASRVM SM++KG
Sbjct: 553  LIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKG 612

Query: 1983 VKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLL 2162
            + E++DLVAKILEAL MRGH EEALGRI LLM+ +C PDF+ LL+VLCEK KT +A KLL
Sbjct: 613  ITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLL 672

Query: 2163 DFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQ 2342
            DFGLER+CNI F+SY+KVLD+LL AGKTLNAY+ILCK+MEKGG KD S   DLI+SLN +
Sbjct: 673  DFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQE 732

Query: 2343 GNTKQADILSRMI 2381
            GNTKQADILSRMI
Sbjct: 733  GNTKQADILSRMI 745


>ref|XP_002530985.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529437|gb|EEF31397.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 753

 Score =  936 bits (2418), Expect = 0.0
 Identities = 481/723 (66%), Positives = 569/723 (78%), Gaps = 14/723 (1%)
 Frame = +3

Query: 255  SRITNPSPLYSLHGFCSNTKE--------ENGVSNSPGDAAPSTG-----KVNSPRRTPR 395
            SR+ +  P  SLH FC+ T++         N  S +  DAA +       +  + +R PR
Sbjct: 12   SRVYHTIPRLSLH-FCTLTQDPIPSVTQISNPQSETLNDAAAAAAATQENQTQTYQRIPR 70

Query: 396  GKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDHALQFFR 575
            GK P+PEK ED I RMM+NR WTTRLQNSIR+LVP FDHSLV NVL+ AR  +HALQFFR
Sbjct: 71   GKRPDPEKVEDTISRMMANRPWTTRLQNSIRNLVPHFDHSLVYNVLHAARNSEHALQFFR 130

Query: 576  WVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVLILSYGKA 752
            WVE+ G +++DR TH+KIIEILG AS LNHARCIL DMP+KGVE DE +FVVLI SYGKA
Sbjct: 131  WVERAGLFKNDRDTHMKIIEILGRASKLNHARCILLDMPKKGVEWDEYMFVVLIESYGKA 190

Query: 753  GIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVLPTRHTF 932
            GIVQE+VKIF KM ELGVER+I +YD LFKVILRRGR +MAKR FN ML DG+ PTRHT+
Sbjct: 191  GIVQEAVKIFNKMNELGVERSIKSYDALFKVILRRGRYMMAKRVFNKMLNDGIQPTRHTY 250

Query: 933  NIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXXXXXXXX 1112
            NIM+WGFFLSL++ETA RF++DMK+RGI+PDVVTYN MING+ R +              
Sbjct: 251  NIMLWGFFLSLRLETAMRFYDDMKNRGISPDVVTYNTMINGFYRFKKMEEAEKLFVEMKG 310

Query: 1113 XDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPE 1292
             ++ PTVISYTTMIKGYV+  RVDDGLRL+EEM++  I+PN  TYSTLLPGLCDA KM E
Sbjct: 311  KNIAPTVISYTTMIKGYVAVDRVDDGLRLLEEMKSFNIKPNVHTYSTLLPGLCDAWKMTE 370

Query: 1293 AQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLM 1472
            A+  L EMV R++ PKDNSIFLRL+SCQCK+G L  AEDVL  M++L IP E  HYGVL+
Sbjct: 371  AKDILIEMVARHLAPKDNSIFLRLLSCQCKAGDLRAAEDVLNTMMRLHIPTEAGHYGVLI 430

Query: 1473 ENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAKAETFFR 1652
            ENFC+A  YDRA+K LD+++  E +L PQ +LE+E +AYNP+I+YLC+H  T KAE FFR
Sbjct: 431  ENFCKAEEYDRAVKYLDKLIEKEIILRPQSTLEIESNAYNPMIQYLCSHGQTGKAEIFFR 490

Query: 1653 QLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMSFLKKGE 1832
            QLMK GVQDP +FNNL+ GH+KEG P+SA EI KIM +R +P DA AYR +I S+L+KGE
Sbjct: 491  QLMKKGVQDPLAFNNLICGHAKEGYPDSAFEIFKIMGKRGVPRDADAYRLIIESYLRKGE 550

Query: 1833 PADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEHMDLVAK 2012
            PADAK ALDGM+E GH+PD  +FR VMESLFEDGRVQTASRVMKSM+EKGVKE+MDLV K
Sbjct: 551  PADAKTALDGMLEDGHVPDPSVFRSVMESLFEDGRVQTASRVMKSMVEKGVKENMDLVGK 610

Query: 2013 ILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGLERDCNI 2192
            ILEALLMRGHVEEALGRI LLM +    +FD LL+VL EK KTIAALKLLDF LERD N+
Sbjct: 611  ILEALLMRGHVEEALGRIELLMQSGFHVNFDDLLSVLSEKGKTIAALKLLDFALERDFNL 670

Query: 2193 SFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTKQADILS 2372
             F SYDKVLD+LL AGKTLNAYSILCK+M+KGG+ D S   DLI+SLN +GNTKQADILS
Sbjct: 671  DFKSYDKVLDALLAAGKTLNAYSILCKIMQKGGVSDWSSSKDLIKSLNQEGNTKQADILS 730

Query: 2373 RMI 2381
            RMI
Sbjct: 731  RMI 733


>ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Fragaria vesca subsp. vesca]
          Length = 763

 Score =  921 bits (2381), Expect = 0.0
 Identities = 470/729 (64%), Positives = 561/729 (76%), Gaps = 21/729 (2%)
 Frame = +3

Query: 258  RITNPSPLYSLHGFCSNTKEENGVSNSPGDAAPSTGKVNSPRRTPRGK--------PP-- 407
            R++NP  L  L  FCS T+  +    S  DA P+     SP     G         PP  
Sbjct: 15   RLSNPQSLPLLRLFCS-TETPSPQPGSASDAPPAETPTGSPPDPQNGSAAAASAPPPPQT 73

Query: 408  ----------NPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDH 557
                      NPEK EDII RMM+NRAWTTRLQNSIR LVP+FDH+LV NVL+GA+  D 
Sbjct: 74   PKPRQLRRARNPEKTEDIICRMMANRAWTTRLQNSIRDLVPEFDHNLVWNVLHGAKTSDQ 133

Query: 558  ALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVLI 734
            ALQFFRWVE++  ++HDR THLKIIEILG AS LNHARCIL DMP+KGV+ DEDLF+ LI
Sbjct: 134  ALQFFRWVERSRLFQHDRETHLKIIEILGRASKLNHARCILLDMPKKGVQWDEDLFIHLI 193

Query: 735  LSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVL 914
             SYGKAGIVQESVK+F +MKELGVER++ +Y+ LFK ILRRGR +M KRYFN ML +G+ 
Sbjct: 194  DSYGKAGIVQESVKLFNQMKELGVERSLKSYEALFKSILRRGRYMMGKRYFNHMLAEGIE 253

Query: 915  PTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXX 1094
            PTRHT+NIMIWGFFLSL++ETA RFFEDMK+RG++PDVVTYN MINGY R +        
Sbjct: 254  PTRHTYNIMIWGFFLSLRLETAKRFFEDMKTRGLSPDVVTYNTMINGYNRFKMMDEAEQL 313

Query: 1095 XXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCD 1274
                   ++ P VISYTTMIKGYVS G+VDDG RL +EM++  I+PN VT+STLLPGLCD
Sbjct: 314  FVELKGKNIQPNVISYTTMIKGYVSVGKVDDGYRLFQEMKSFGIKPNDVTFSTLLPGLCD 373

Query: 1275 AEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPA 1454
            AEK  EAQ  L+EMV+R+I PKDNS+F +L+ CQCKSG LD A +VLK M++L IP E  
Sbjct: 374  AEKKDEAQNLLSEMVERHIAPKDNSVFEKLLYCQCKSGDLDAAANVLKAMIRLHIPTEAG 433

Query: 1455 HYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAK 1634
            HYG+L+ENFC+AG YDRA+ LLD ++  E ++  Q S+E+E SAYNP+IEYLC+H  T K
Sbjct: 434  HYGILIENFCKAGVYDRAVHLLDRLIEKEIIMRSQSSMELEASAYNPMIEYLCDHGQTDK 493

Query: 1635 AETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMS 1814
            AE  FRQLMK GVQD  +FNNL+ GH+KEGN +SA EILKIM RR +P +A +Y+ LI S
Sbjct: 494  AEVLFRQLMKKGVQDSVAFNNLIRGHAKEGNSDSAFEILKIMGRRGVPREADSYKLLIKS 553

Query: 1815 FLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEH 1994
            +L KGEPADAK ALD MIE+GH+P+S LFR VMESLFEDGRVQTASR+MKSM+EKGV E+
Sbjct: 554  YLSKGEPADAKTALDSMIENGHVPESSLFRSVMESLFEDGRVQTASRIMKSMVEKGVNEN 613

Query: 1995 MDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGL 2174
            MDLVAKILEAL +RGHVEEALGRI LLM + C P+FD LL+VL EK KTIAA+KLLDF L
Sbjct: 614  MDLVAKILEALFIRGHVEEALGRIDLLMQSGCAPEFDSLLSVLAEKGKTIAAVKLLDFCL 673

Query: 2175 ERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTK 2354
            ERDC + F SYDKVLD+LL +GKTLNAYSILCK+M+KGG+ D     DLI+SLN +GNTK
Sbjct: 674  ERDCMVDFKSYDKVLDALLESGKTLNAYSILCKIMDKGGVTDWRSTDDLIKSLNLEGNTK 733

Query: 2355 QADILSRMI 2381
            QAD+LSR I
Sbjct: 734  QADVLSRKI 742


>ref|XP_002315730.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222864770|gb|EEF01901.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 760

 Score =  920 bits (2379), Expect = 0.0
 Identities = 465/695 (66%), Positives = 555/695 (79%), Gaps = 4/695 (0%)
 Frame = +3

Query: 309  TKEENGVSNSPGDAAPSTGKVNSPRRTPRGKPPN--PEKPEDIIIRMMSNRAWTTRLQNS 482
            T    G    P    P+  +    +R PR K  +  PEK EDII RMM+NR WTTRLQNS
Sbjct: 46   TTASPGPKPDPKTETPNVAQEKQYQRIPRAKQQHRSPEKLEDIICRMMANRDWTTRLQNS 105

Query: 483  IRSLVPQFDHSLVLNVLNGARKPDHALQFFRWVEKTGY-RHDRSTHLKIIEILGGASMLN 659
            IR+LVP+FDHSLV NVL+GARKPDHALQFFRWVE+ G  +HDR TH+KII+ILG  SMLN
Sbjct: 106  IRALVPEFDHSLVYNVLHGARKPDHALQFFRWVERAGLIQHDRETHMKIIQILGRYSMLN 165

Query: 660  HARCILF-DMPQKGVERDEDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTL 836
            HARCI+  DMP+KG E DED+FV+LI SYGKAGIVQESVK+F KMKELGVER++ +Y+ L
Sbjct: 166  HARCIVLEDMPKKGFELDEDMFVLLIDSYGKAGIVQESVKMFSKMKELGVERSVKSYNAL 225

Query: 837  FKVILRRGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGI 1016
            FKVI+R+GR +MAKR+FN ML +G+ PTRHT+N++IWGFFLS+++ TA RF+EDMK RGI
Sbjct: 226  FKVIVRKGRYMMAKRFFNKMLDEGIGPTRHTYNVLIWGFFLSMRLRTAVRFYEDMKVRGI 285

Query: 1017 APDVVTYNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLR 1196
            +PDVVTYN MINGY R +               D+ PTVISYTTMIKGY +  R++DGLR
Sbjct: 286  SPDVVTYNTMINGYYRHKRMEEAEKLFAEMKAKDIAPTVISYTTMIKGYFAVDRINDGLR 345

Query: 1197 LVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQ 1376
            L+EEM++  I+PN VTY+TLLP LCDA KM EA+  L EMV+R I PKDNSIFL+L++ Q
Sbjct: 346  LLEEMKSVGIKPNNVTYTTLLPDLCDAGKMTEAKDILKEMVRRRIAPKDNSIFLKLLNSQ 405

Query: 1377 CKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNP 1556
            CK+G L  A DVL GM+KLSIP+E  HYGVL+ENFC+A  YD+A+K +D+++ N+ +L P
Sbjct: 406  CKAGDLKAAVDVLDGMIKLSIPSEAGHYGVLIENFCKAEEYDQAVKFVDKLIENDIILRP 465

Query: 1557 QISLEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPES 1736
            Q +LEME  AYNPVI+YLC+H  T KAE  FRQL+K GV+DP +FNNL+ GH+KEG P+S
Sbjct: 466  QSTLEMESGAYNPVIQYLCSHGQTGKAEILFRQLLKKGVEDPLAFNNLICGHAKEGTPDS 525

Query: 1737 ALEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVME 1916
            A EILKIM R+ IP DA AYR LI S+L+KGEPADAK ALD MIE GHLPDS +FR VME
Sbjct: 526  AFEILKIMGRKGIPRDADAYRLLIESYLRKGEPADAKTALDSMIEDGHLPDSSVFRSVME 585

Query: 1917 SLFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIP 2096
            SL+EDGRVQTASRVMKSM+EKGVKE+MDLVAKILEALLMRGH EEALGRI LLM + C  
Sbjct: 586  SLYEDGRVQTASRVMKSMVEKGVKENMDLVAKILEALLMRGHEEEALGRIDLLMSSQCNV 645

Query: 2097 DFDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKV 2276
            +FD LL++L EK KTIAALKLLDFGL+RDC+I F SYDKVLD+LL AGKTLNAYSILCK+
Sbjct: 646  NFDSLLSILSEKGKTIAALKLLDFGLQRDCDIDFKSYDKVLDALLAAGKTLNAYSILCKI 705

Query: 2277 MEKGGIKDLSGVGDLIRSLNAQGNTKQADILSRMI 2381
            MEKGG+       DLI+SLN +GNTKQADILSRMI
Sbjct: 706  MEKGGVTSWRSYEDLIKSLNQEGNTKQADILSRMI 740


>ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Citrus sinensis]
          Length = 751

 Score =  910 bits (2353), Expect = 0.0
 Identities = 466/716 (65%), Positives = 557/716 (77%), Gaps = 3/716 (0%)
 Frame = +3

Query: 243  HSRVSRITNPSPLYSLHGFCS-NTKEENGVSNSPG-DAAPSTGKVNSPRRTPRGKPPNPE 416
            H  +SRI  P  L   H FCS N +++   S +P  D   +  + +  +R PRG   +P 
Sbjct: 17   HPIISRIEAPYSLILPHFFCSINDQQQTQDSPAPNPDPFQADEEPSQRQRIPRGNHRSPV 76

Query: 417  KPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDHALQFFRWVEKTG- 593
            K ED I ++M+ RAWTTRLQN IR+LVPQFDH+LV NVL+GA+  +HALQFFRWVE+ G 
Sbjct: 77   KLEDTICKLMAERAWTTRLQNKIRALVPQFDHNLVYNVLHGAKNSEHALQFFRWVERAGL 136

Query: 594  YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVLILSYGKAGIVQESV 773
            + HDR THLK+IEILG    LNHARCIL DMP+KGV+ DED+F VLI SYGK GIVQESV
Sbjct: 137  FNHDRETHLKMIEILGRVGKLNHARCILLDMPKKGVQWDEDMFEVLIESYGKKGIVQESV 196

Query: 774  KIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGF 953
            KIF  MK+LGVER++ +YD LFK+ILRRGR +MAKRYFN ML +G+ PTRHT+N+M+WGF
Sbjct: 197  KIFDIMKQLGVERSVKSYDALFKLILRRGRYMMAKRYFNKMLSEGIEPTRHTYNVMLWGF 256

Query: 954  FLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTV 1133
            FLSLK+ETA RFFEDMKSRGI+PDVVTYN MINGY R +               ++ PTV
Sbjct: 257  FLSLKLETAIRFFEDMKSRGISPDVVTYNTMINGYNRFKKMDEAEKLFAEMKEKNIEPTV 316

Query: 1134 ISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNE 1313
            ISYTTMIKGYV+  R DD LR+ +EM++  ++PNAVTY+ LLPGLCDA KM E QK L E
Sbjct: 317  ISYTTMIKGYVAVERADDALRIFDEMKSFDVKPNAVTYTALLPGLCDAGKMVEVQKVLRE 376

Query: 1314 MVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAG 1493
            MV+R I PKDNS+F++L+  QCKSG L+ A DVLK M++LSIP E  HYG+L+ENFC+A 
Sbjct: 377  MVERYIPPKDNSVFMKLLGVQCKSGHLNAAADVLKAMIRLSIPTEAGHYGILIENFCKAE 436

Query: 1494 FYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGV 1673
             YDRAIKLLD++V  E +L PQ +L+ME S+YNP+I++LC++  T KAE FFRQLMK GV
Sbjct: 437  MYDRAIKLLDKLVEKEIILRPQSTLDMEASSYNPMIQHLCHNGQTGKAEIFFRQLMKKGV 496

Query: 1674 QDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVA 1853
             DP +FNNL+ GHSKEGNP+SA EI+KIM RR +P DA AY  LI S+L+KGEPADAK A
Sbjct: 497  LDPVAFNNLIRGHSKEGNPDSAFEIVKIMGRRGVPRDADAYICLIESYLRKGEPADAKTA 556

Query: 1854 LDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLM 2033
            LD MIE GH P S LFR VMESLFEDGRVQTASRVMKSM+EKGVKE++DLVAKILEALLM
Sbjct: 557  LDSMIEDGHSPASSLFRSVMESLFEDGRVQTASRVMKSMVEKGVKENLDLVAKILEALLM 616

Query: 2034 RGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDK 2213
            RGHVEEALGRI L+M +  +P+FD LL+VL EK KTIAA+KLLDF L RDC I  ASY+K
Sbjct: 617  RGHVEEALGRIDLMMQSGSVPNFDSLLSVLSEKGKTIAAVKLLDFCLGRDCIIDLASYEK 676

Query: 2214 VLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTKQADILSRMI 2381
            VLD+LL AGKTLNAYSIL K+MEKGG+ D      LI  LN +GNTKQADILSRMI
Sbjct: 677  VLDALLAAGKTLNAYSILFKIMEKGGVTDWKSSDKLIAGLNQEGNTKQADILSRMI 732


>ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citrus clementina]
            gi|557530823|gb|ESR42006.1| hypothetical protein
            CICLE_v10011107mg [Citrus clementina]
          Length = 787

 Score =  907 bits (2345), Expect = 0.0
 Identities = 466/716 (65%), Positives = 556/716 (77%), Gaps = 3/716 (0%)
 Frame = +3

Query: 243  HSRVSRITNPSPLYSLHGFCS-NTKEENGVSNSPG-DAAPSTGKVNSPRRTPRGKPPNPE 416
            H  +SRI  P  L   H FCS N +++   S +P  D   +  + +  +R PRG   +P 
Sbjct: 53   HPIISRIEAPYSLILPHFFCSINDQQQTQDSPAPNPDPFQADEEPSQRQRIPRGNHRSPV 112

Query: 417  KPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDHALQFFRWVEKTG- 593
            K ED I ++M+ RAWTTRLQN IR+LVPQFDH+LV NVL+GA+  +HALQFFRWVE+ G 
Sbjct: 113  KLEDTICKLMAERAWTTRLQNKIRALVPQFDHNLVYNVLHGAKNSEHALQFFRWVERAGL 172

Query: 594  YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVLILSYGKAGIVQESV 773
            + HDR THLK+IEILG    LNHARCIL DMP+KGV+ DEDLF VLI SYGK GIVQESV
Sbjct: 173  FNHDRETHLKMIEILGRVGKLNHARCILLDMPKKGVQWDEDLFEVLIESYGKKGIVQESV 232

Query: 774  KIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGF 953
            KIF  MK+LGVER++ +YD LFK+ILRRGR +MAKRYFN ML +G+ PTRHT+N+M+WGF
Sbjct: 233  KIFDIMKQLGVERSVKSYDALFKLILRRGRYMMAKRYFNKMLSEGIEPTRHTYNVMLWGF 292

Query: 954  FLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTV 1133
            FLSLK+ETA RFFEDMKSRGI+PDVVTYN MINGY R +               ++ PTV
Sbjct: 293  FLSLKLETAIRFFEDMKSRGISPDVVTYNTMINGYNRFKKMDEAEKLFAEMKEKNIEPTV 352

Query: 1134 ISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNE 1313
            ISYTTMIKGYV+  R DD LR+ +EM++  ++PNAVTY+ LLPGLCDA KM E QK L E
Sbjct: 353  ISYTTMIKGYVAVERADDALRIFDEMKSFDVKPNAVTYTALLPGLCDAGKMVEVQKVLRE 412

Query: 1314 MVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAG 1493
            MV+R I PKDNS+F++L+  QCKSG L+ A DVLK M++LSIP E  HYG+L+ENFC+A 
Sbjct: 413  MVERYIPPKDNSVFMKLLDVQCKSGHLNAAADVLKAMIRLSIPTEAGHYGILIENFCKAE 472

Query: 1494 FYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGV 1673
             YDRAIKLLD++V  E +L PQ +L+ME S+YN +I++LC++  T KAE FFRQLMK GV
Sbjct: 473  MYDRAIKLLDKLVEKEIILRPQSTLDMEASSYNLMIQHLCHNGQTGKAEIFFRQLMKKGV 532

Query: 1674 QDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVA 1853
             DP +FNNL+ GHSKEGNP+SA EI+KIM RR +P DA AY  LI S+L+KGEPADAK A
Sbjct: 533  LDPVAFNNLIRGHSKEGNPDSAFEIVKIMGRRGVPRDADAYICLIESYLRKGEPADAKTA 592

Query: 1854 LDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLM 2033
            LD MIE GH P S LFR VMESLFEDGRVQTASRVMKSM+EKGVKE++DLVAKILEALLM
Sbjct: 593  LDSMIEDGHSPASSLFRSVMESLFEDGRVQTASRVMKSMVEKGVKENLDLVAKILEALLM 652

Query: 2034 RGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDK 2213
            RGHVEEALGRI L+M +  +P+FD LL+VL EK KTIAA+KLLDF L RDC I  ASY+K
Sbjct: 653  RGHVEEALGRIDLMMQSGSVPNFDSLLSVLSEKGKTIAAVKLLDFCLGRDCIIDLASYEK 712

Query: 2214 VLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTKQADILSRMI 2381
            VLD+LL AGKTLNAYSIL K+MEKGG+ D      LI  LN +GNTKQADILSRMI
Sbjct: 713  VLDALLAAGKTLNAYSILFKIMEKGGVTDWKSSDKLIAGLNQEGNTKQADILSRMI 768


>ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform X1 [Solanum tuberosum]
          Length = 731

 Score =  905 bits (2339), Expect = 0.0
 Identities = 454/674 (67%), Positives = 540/674 (80%), Gaps = 2/674 (0%)
 Frame = +3

Query: 369  VNSPRRTPRGKPPNP-EKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGAR 545
            +N+  R P+G  P P EK ED+I RMMS RAWTTRLQNSIR++VP FDH LV NVL+ A+
Sbjct: 39   LNNHDRIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAK 98

Query: 546  KPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLF 722
              +HALQFFRWVE++G +RHDR TH KII+ILG A  LNHARCIL DMP KGV+ DEDL+
Sbjct: 99   NSEHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLW 158

Query: 723  VVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLK 902
            V++I SYGKAGIVQESVK+FQKM+ELGVERT+ +Y+ LF VI RRGR +MAKRYFN M+ 
Sbjct: 159  VLMIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVN 218

Query: 903  DGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXX 1082
             G+ PT HT+N++IWGFFLS KV+TA RFFEDMKS+GI PDVVTYN MINGY RV+    
Sbjct: 219  QGIEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKSKGIMPDVVTYNTMINGYIRVKKIEE 278

Query: 1083 XXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLP 1262
                       ++ PTVISYTT+IKGY +  R+DD +RL EEM++  I+PNA+TYSTLLP
Sbjct: 279  AEKYFVEMKARNIEPTVISYTTLIKGYSAVERIDDAVRLFEEMKSFGIKPNAITYSTLLP 338

Query: 1263 GLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIP 1442
            GLCDA+KM EA   L EM  + I PKDNSIF+RLIS QC++G LD A DVLK M++LS+P
Sbjct: 339  GLCDAQKMSEAGAILKEMEDKYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIRLSVP 398

Query: 1443 AEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHK 1622
             E  HYGVL+ENFC+AG YDRA+K LD+++  E +L PQ S  ME SAYN +I+YLCN+ 
Sbjct: 399  TEAGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMEPSAYNLIIDYLCNNG 458

Query: 1623 LTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRS 1802
             T KAETFFRQLMKTGVQDP +FNNL+ GHS+EG P+SA E+LKIM RRK+ +D +A++S
Sbjct: 459  QTGKAETFFRQLMKTGVQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDGIAHKS 518

Query: 1803 LIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKG 1982
            L+ S+LKK EPADAK ALD M+E GH PDS L+R VMESL  DGRVQTASRVMK M+EKG
Sbjct: 519  LVESYLKKREPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKG 578

Query: 1983 VKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLL 2162
            VKEHMDL++ ILEALLMRGHVEEALGRI LL+ N   PD D LL+VLCEK KT AALKLL
Sbjct: 579  VKEHMDLISTILEALLMRGHVEEALGRIELLLHNSLSPDLDGLLSVLCEKGKTSAALKLL 638

Query: 2163 DFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQ 2342
            DF LER+CNI F+SYDKVLDSLL AGKTLNAYSILCK+ME GG+KD     +LI+SLN +
Sbjct: 639  DFILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELIKSLNDE 698

Query: 2343 GNTKQADILSRMIM 2384
            GNTKQADIL RMI+
Sbjct: 699  GNTKQADILRRMIL 712


>ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform 1 [Solanum lycopersicum]
            gi|460413221|ref|XP_004251993.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g37230-like isoform 2 [Solanum lycopersicum]
          Length = 731

 Score =  902 bits (2330), Expect = 0.0
 Identities = 451/678 (66%), Positives = 540/678 (79%), Gaps = 2/678 (0%)
 Frame = +3

Query: 357  STGKVNSPRRTPRGKPPNP-EKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVL 533
            +T  +N+  R P+G  P P EK ED+I RMMS RAWTTRLQNSIR++VP FDH LV NVL
Sbjct: 35   NTESLNNHERIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVL 94

Query: 534  NGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERD 710
            + A+  +HALQFFRWVE++G +RHDR TH KII+ILG A  LNHARCIL DMP KGV+ D
Sbjct: 95   HSAKNSEHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWD 154

Query: 711  EDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFN 890
            EDL+V++I SYGKAGIVQESVK+FQKM+ELGVERT+ +Y+ LF VI RRGR +MAKRYFN
Sbjct: 155  EDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFN 214

Query: 891  AMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVR 1070
             M+  G+ PT HT+N++IWGFFLS KV+TA RFFEDMK +GI PDVVTYN MINGY  V+
Sbjct: 215  RMVNQGIEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKGKGIMPDVVTYNTMINGYNCVK 274

Query: 1071 XXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYS 1250
                           ++ P VISYTT+IKGY +  R+DD L+L EEM++  I+PNA+TYS
Sbjct: 275  KIEEAEKYFVEMKARNIEPNVISYTTLIKGYSAVERIDDALKLFEEMKSFGIKPNAITYS 334

Query: 1251 TLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVK 1430
            TLLPGLCDA+KM EA   L EM +R I PKDNSIF+RLIS QC++G LD A DVLK M++
Sbjct: 335  TLLPGLCDAQKMSEAGTILKEMEERYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIR 394

Query: 1431 LSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYL 1610
            LS+P E  HYGVL+ENFC+AG YDRA+K LD+++  E +L PQ S  ME SAYN +I+YL
Sbjct: 395  LSVPTEAGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMETSAYNLIIDYL 454

Query: 1611 CNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADAL 1790
            CN+  T KAET FRQLMKTG+QDP +FNNL+ GHS+EG P+SA E+LKIM RRK+ +D++
Sbjct: 455  CNNGQTGKAETLFRQLMKTGIQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDSI 514

Query: 1791 AYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSM 1970
            A++SL+ S+LKKGEPADAK ALD M+E GH PDS L+R VMESL  DGRVQTASRVMK M
Sbjct: 515  AHKSLVESYLKKGEPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIM 574

Query: 1971 IEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAA 2150
            +EKGVKEHMDL++ ILEALLMRGHVEEA GRI LL+ N   PD D LL+VLCEK KT AA
Sbjct: 575  LEKGVKEHMDLISTILEALLMRGHVEEAFGRIELLLHNSLSPDLDGLLSVLCEKGKTTAA 634

Query: 2151 LKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRS 2330
            LKLLDF LER+CNI F+SYDKVLDSLL AGKTLNAYSILCK+ME GG+KD     +LI+S
Sbjct: 635  LKLLDFILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELIKS 694

Query: 2331 LNAQGNTKQADILSRMIM 2384
            LN +GNTKQADIL RMI+
Sbjct: 695  LNDEGNTKQADILRRMIL 712


>ref|XP_007225233.1| hypothetical protein PRUPE_ppa001877mg [Prunus persica]
            gi|462422169|gb|EMJ26432.1| hypothetical protein
            PRUPE_ppa001877mg [Prunus persica]
          Length = 749

 Score =  899 bits (2324), Expect = 0.0
 Identities = 467/716 (65%), Positives = 552/716 (77%), Gaps = 8/716 (1%)
 Frame = +3

Query: 258  RITNPSPLYSLHGFCSNTKEENGVSNSPGDAA-PSTGKVNSPRRTPRGKPP---NPEKPE 425
            R +NP  L     F S        + +P +   P  G V +P   P+ +     N EK E
Sbjct: 15   RPSNPQTLTLFRLFSSTEAATGASTEAPTETPNPQDGSV-TPTHVPKARQHRTRNAEKIE 73

Query: 426  DIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDHALQFFRWVEKTG-YRH 602
            DII RMM+NR WTTRLQNSIR+LVP+FDH+LV NVL+GAR  +HALQFFRWVE++G ++H
Sbjct: 74   DIICRMMANRVWTTRLQNSIRNLVPEFDHNLVWNVLHGARSWEHALQFFRWVERSGLFKH 133

Query: 603  DRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVLILSYGKAG---IVQESV 773
            DR THLKIIEIL   S LNHARCIL DMP+KGV+ DEDLF+ LI  YGK+    I+QESV
Sbjct: 134  DRETHLKIIEILSRNSKLNHARCILLDMPKKGVQLDEDLFIGLIDGYGKSDKGCIIQESV 193

Query: 774  KIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGF 953
            K+F KMKELGVER++ +Y+ L+K ILR GR +MAKRYFNAML +G+ PTRHT+N+MIWGF
Sbjct: 194  KLFIKMKELGVERSLKSYEALYKAILRWGRCMMAKRYFNAMLSEGIEPTRHTYNVMIWGF 253

Query: 954  FLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTV 1133
              S K+ETA RFFEDMKSRGI+PD+VTYN MI+GY RV                ++ P V
Sbjct: 254  LKSRKLETAKRFFEDMKSRGISPDLVTYNTMIHGYIRVDKMDESEQLFVELKGRNIEPNV 313

Query: 1134 ISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNE 1313
            ISYTTMIKGYVS GRVDDGLRL  EM++  IRPNAVT+STLLPGLCDAEK   A K L E
Sbjct: 314  ISYTTMIKGYVSVGRVDDGLRLFGEMKSFGIRPNAVTFSTLLPGLCDAEKKDAAHKVLME 373

Query: 1314 MVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAG 1493
            MV + I P DNSIF RL+S QCKSG +D A  VLK M++L IP E  HYG+L+ENFC+AG
Sbjct: 374  MVSKYIAPIDNSIFERLLSLQCKSGDMDAAAYVLKAMIRLRIPTEAGHYGILIENFCKAG 433

Query: 1494 FYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGV 1673
             YD+A+KLLD+++  E +L PQ S+E+E SA+NP+IEYLCNH  T KAE FFRQLMK GV
Sbjct: 434  VYDQAVKLLDKLIEKEIILRPQNSIELEPSAFNPMIEYLCNHGQTGKAEAFFRQLMKKGV 493

Query: 1674 QDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVA 1853
            +D  +FNNLL GH+KEGN +SA EIL+IM+RR IP +A +Y  LI S+L KGEPADAK A
Sbjct: 494  EDSVAFNNLLRGHAKEGNSDSAFEILRIMNRRGIPGEADSYILLIKSYLSKGEPADAKTA 553

Query: 1854 LDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLM 2033
            LD MIE GH+P+S LFR V+ESLFEDGRVQTASRVMKSM+EKGV E+MDLVAKILEAL M
Sbjct: 554  LDSMIEGGHIPESSLFRSVIESLFEDGRVQTASRVMKSMVEKGVMENMDLVAKILEALFM 613

Query: 2034 RGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDK 2213
            RGHVEEALGRI LLM + C   FD LL+VL +K KTIAALKLLDF LERDC++ F+SYDK
Sbjct: 614  RGHVEEALGRIDLLMQSGCALQFDSLLSVLADKGKTIAALKLLDFCLERDCSVDFSSYDK 673

Query: 2214 VLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTKQADILSRMI 2381
            VLD+LL +GKTLNAYSILCK+MEKGGI D S   DLI+SLN +GNTKQADILSRMI
Sbjct: 674  VLDALLASGKTLNAYSILCKLMEKGGITDWSSTEDLIKSLNQEGNTKQADILSRMI 729


>ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 733

 Score =  892 bits (2305), Expect = 0.0
 Identities = 449/664 (67%), Positives = 534/664 (80%), Gaps = 4/664 (0%)
 Frame = +3

Query: 402  PPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPDHALQFFRWV 581
            PP     E  I +MMSNRAWTTRLQNSIRSLVP+FD SLV NVL+GA  P+HALQF+RWV
Sbjct: 50   PPREHNLELTICKMMSNRAWTTRLQNSIRSLVPEFDPSLVYNVLHGAASPEHALQFYRWV 109

Query: 582  EKTG-YRHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVER---DEDLFVVLILSYGK 749
            E+ G + H   T LKI++ILG  S LNHARCILF+  + GV R    ED FV LI SYG+
Sbjct: 110  ERAGLFTHTPETTLKIVQILGRYSKLNHARCILFNDTRGGVSRAAVTEDAFVSLIDSYGR 169

Query: 750  AGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGVLPTRHT 929
            AGIVQESVK+F+KMKELG++RT+ +YD LFKVILRRGR +MAKRY+NAML +GV PTRHT
Sbjct: 170  AGIVQESVKLFKKMKELGLDRTVKSYDALFKVILRRGRYMMAKRYYNAMLLEGVDPTRHT 229

Query: 930  FNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXXXXXXXX 1109
            FNI++WG FLSL+++TA RF+EDMKSRGI PDVVTYN +INGY R +             
Sbjct: 230  FNILLWGMFLSLRLDTAVRFYEDMKSRGILPDVVTYNTLINGYFRFKKVDEAEKLFVEMK 289

Query: 1110 XXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMP 1289
              D++P VIS+TTM+KGYV+AGR+DD L++ EEM+   ++PN VT+STLLPGLCDAEKM 
Sbjct: 290  GRDIVPNVISFTTMLKGYVAAGRIDDALKVFEEMKGCGVKPNVVTFSTLLPGLCDAEKMA 349

Query: 1290 EAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVL 1469
            EA+  L EMV+R I PKDN++F++++SCQCK+G LD A DVLK MV+LSIP E  HYGVL
Sbjct: 350  EARDVLGEMVERYIAPKDNALFMKMMSCQCKAGDLDAAADVLKAMVRLSIPTEAGHYGVL 409

Query: 1470 MENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTAKAETFF 1649
            +E+FC+A  YD+A KLLD+++  E +L PQ   EME SAYN +I YLC H  T KAETFF
Sbjct: 410  IESFCKANVYDKAEKLLDKLIEKEIVLRPQNDSEMEPSAYNLMIGYLCEHGRTGKAETFF 469

Query: 1650 RQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIMSFLKKG 1829
            RQL+K GVQD  +FNNL+ GHSKEGNP+SA EI+KIM RR +  D  +YR LI S+L+KG
Sbjct: 470  RQLLKKGVQDSVAFNNLIRGHSKEGNPDSAFEIMKIMGRRGVARDVDSYRLLIESYLRKG 529

Query: 1830 EPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEKGVKEHMDLVA 2009
            EPADAK ALDGM+ESGHLP+S L+R VMESLF+DGRVQTASRVMKSM+EKG KE+MDLV 
Sbjct: 530  EPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGAKENMDLVL 589

Query: 2010 KILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLDFGLERDCN 2189
            KILEALL+RGHVEEALGRI LLM N C PDFDHLL+VLCEKEKTIAALKLLDF LERDC 
Sbjct: 590  KILEALLLRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLDFVLERDCI 649

Query: 2190 ISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQGNTKQADIL 2369
            I F+ YDKVLD+LL AGKTLNAYSILCK++EKGG  D S   +LI+SLN +GNTKQAD+L
Sbjct: 650  IDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKSLNQEGNTKQADVL 709

Query: 2370 SRMI 2381
            SRMI
Sbjct: 710  SRMI 713


>ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 738

 Score =  892 bits (2304), Expect = 0.0
 Identities = 455/694 (65%), Positives = 546/694 (78%), Gaps = 12/694 (1%)
 Frame = +3

Query: 336  SPGDAAPSTGKVNSPRRTPRGKPPNPEKPEDI---IIRMMSNRAWTTRLQNSIRSLVPQF 506
            S  +A   T   + P   P+  P  P +  ++   I +MMSNRAWTTRLQNSIRSLVP+F
Sbjct: 25   STAEALSETDHPSPPSPQPQPPPIIPPRENNLELTICKMMSNRAWTTRLQNSIRSLVPEF 84

Query: 507  DHSLVLNVLNGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASMLNHARCILFD 683
            D SLV NVL+GA  P+HALQF+RWVE+ G + H   T LKI++ILG  S LNHARCILFD
Sbjct: 85   DPSLVYNVLHGAASPEHALQFYRWVERAGLFTHTPETTLKIVQILGRYSKLNHARCILFD 144

Query: 684  MPQKGVER---DEDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILR 854
              + G  R    ED FV LI SYG+AGIVQESVK+F+KMKELGV+RT+ +YD LFKVILR
Sbjct: 145  DTRGGASRATVTEDAFVSLIDSYGRAGIVQESVKLFKKMKELGVDRTVKSYDALFKVILR 204

Query: 855  RGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVT 1034
            RGR +MAKRY+NAML + V PTRHT+NI++WG FLSL+++TA RF+EDMKSRGI PDVVT
Sbjct: 205  RGRYMMAKRYYNAMLNESVEPTRHTYNILLWGMFLSLRLDTAVRFYEDMKSRGILPDVVT 264

Query: 1035 YNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMR 1214
            YN +INGY R +               D++P VIS+TTM+KGYV+AG++DD L++ EEM+
Sbjct: 265  YNTLINGYFRFKKVEEAEKLFVEMKGRDIVPNVISFTTMLKGYVAAGQIDDALKVFEEMK 324

Query: 1215 AHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKL 1394
               ++PNAVT+STLLPGLCDAEKM EA+  L EMV+R I PKDN++F++L+SCQCK+G L
Sbjct: 325  GCGVKPNAVTFSTLLPGLCDAEKMAEARDVLGEMVERYIAPKDNAVFMKLMSCQCKAGDL 384

Query: 1395 DWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQIS--- 1565
            D A DVLK M++LSIP E  HYGVL+ENFC+A  YD+A KLLD+++  E +L  + +   
Sbjct: 385  DAAGDVLKAMIRLSIPTEAGHYGVLIENFCKANLYDKAEKLLDKMIEKEIVLRQKNAYET 444

Query: 1566 --LEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESA 1739
               EME SAYN +I YLC H  T KAETFFRQLMK GVQD  SFNNL+ GHSKEGNP+SA
Sbjct: 445  ELFEMEPSAYNLMIGYLCEHGRTGKAETFFRQLMKKGVQDSVSFNNLICGHSKEGNPDSA 504

Query: 1740 LEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMES 1919
             EI+KIM RR +  DA +YR LI S+L+KGEPADAK ALDGM+ESGHLP+S L+R VMES
Sbjct: 505  FEIIKIMGRRGVARDADSYRLLIESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMES 564

Query: 1920 LFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPD 2099
            LF+DGRVQTASRVMKSM+EKGVKE+MDLV+K+LEALLMRGHVEEALGRI LLM N C PD
Sbjct: 565  LFDDGRVQTASRVMKSMVEKGVKENMDLVSKVLEALLMRGHVEEALGRIHLLMLNGCEPD 624

Query: 2100 FDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVM 2279
            FDHLL+VLCEKEKTIAALKLLDF LERDC I F+ YDKVLD+LL AGKTLNAYSILCK++
Sbjct: 625  FDHLLSVLCEKEKTIAALKLLDFVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKIL 684

Query: 2280 EKGGIKDLSGVGDLIRSLNAQGNTKQADILSRMI 2381
            EKGG  D S   +LI+SLN +GNTKQAD+LSRMI
Sbjct: 685  EKGGSTDWSSRDELIKSLNQEGNTKQADVLSRMI 718


>ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutrema salsugineum]
            gi|557112072|gb|ESQ52356.1| hypothetical protein
            EUTSA_v10017966mg [Eutrema salsugineum]
          Length = 761

 Score =  887 bits (2291), Expect = 0.0
 Identities = 465/732 (63%), Positives = 556/732 (75%), Gaps = 15/732 (2%)
 Frame = +3

Query: 231  RALFHSRVSRITNPSPLYSLHGFCSNTKEENGVSNSPGDAA----PSTGKVNSPR----- 383
            RA     + R ++ S L     F S  + +N V+N    +A    P T  + S R     
Sbjct: 13   RARVRLSLPRSSDSSFLSVSRLFSSIEETQNPVANPQTQSADAVKPETTNLGSIRPEGRP 72

Query: 384  ---RTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARKPD 554
               R  RGK  N EK ED I RMM NR WTTRLQNSIR LVP++DHSLV NVL+GARK D
Sbjct: 73   LRERFQRGKRQNHEKLEDTICRMMDNREWTTRLQNSIRDLVPEWDHSLVYNVLHGARKLD 132

Query: 555  HALQFFRWVEKTGY-RHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFVVL 731
            HALQFFRW E++G  RHDR TH+K+IE+LG AS LNHARCIL DMP+KG+  DED+FVVL
Sbjct: 133  HALQFFRWSERSGLIRHDRDTHMKMIEMLGQASKLNHARCILLDMPEKGIPWDEDMFVVL 192

Query: 732  ILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKDGV 911
            I SYGKAGIVQESVKIFQKMK+LGVERTI +YDTLFKVILRRGR +MAKRYFN M+ +G+
Sbjct: 193  IESYGKAGIVQESVKIFQKMKDLGVERTIKSYDTLFKVILRRGRYMMAKRYFNKMVSEGI 252

Query: 912  LPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXXXX 1091
             PTRHT+N+M+WGFFLSL++ETA RF+EDM SRGI+PDVVTYN MINGYCR +       
Sbjct: 253  EPTRHTYNLMLWGFFLSLRLETALRFYEDMISRGISPDVVTYNTMINGYCRFKKMDEAEK 312

Query: 1092 XXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPGLC 1271
                    ++ P+V+SYTTMIKGY++  RVDDGLR+ +EMR+  I PNA TYSTLLPGLC
Sbjct: 313  VFVEMKGKNIEPSVVSYTTMIKGYLAVERVDDGLRIFDEMRSFGIEPNATTYSTLLPGLC 372

Query: 1272 DAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPAEP 1451
            DA KM EA+  L  M+ ++I PKDNSIFL+L+  Q K+G +  A +VLK M  L++PAE 
Sbjct: 373  DAGKMVEAKSILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPAEA 432

Query: 1452 AHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKLTA 1631
             HYGVL+EN C+A  ++RAIKLLD +V  E +L  Q +LEME +AYNP+IEYLCN+  T+
Sbjct: 433  GHYGVLIENQCKANAHNRAIKLLDILVEKEIILRHQDTLEMEPNAYNPIIEYLCNNGQTS 492

Query: 1632 KAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSLIM 1811
            KAE  FRQLMK GVQD  + NNL+ GH+KEGNP+S+ EILKIMSRR +P DA AY  LI 
Sbjct: 493  KAEVLFRQLMKRGVQDQEALNNLIRGHAKEGNPDSSYEILKIMSRRGVPRDANAYELLIK 552

Query: 1812 SFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEK--GV 1985
            S++ KGEP DAK ALD M+E GH+PDS LFR V+ESLFEDGRVQTASRVM  MI+K  G+
Sbjct: 553  SYMSKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQTASRVMMIMIDKNVGI 612

Query: 1986 KEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKLLD 2165
            +++MDLVAKILEALLMRGHVEEALGRI LL  N    D D LL+VL EK KTIAALKLLD
Sbjct: 613  EDNMDLVAKILEALLMRGHVEEALGRIDLLNQNGHSADLDSLLSVLSEKGKTIAALKLLD 672

Query: 2166 FGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNAQG 2345
            FGLERD ++ F+SYDKVLD+LL AGKTLNAYS+LCK+M KG + D     DLI+SLN +G
Sbjct: 673  FGLERDLSLDFSSYDKVLDALLGAGKTLNAYSVLCKIMAKGSVTDWKSCDDLIKSLNQEG 732

Query: 2346 NTKQADILSRMI 2381
            NTKQAD+LSRMI
Sbjct: 733  NTKQADVLSRMI 744


>ref|XP_002881498.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327337|gb|EFH57757.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 756

 Score =  882 bits (2280), Expect = 0.0
 Identities = 461/734 (62%), Positives = 556/734 (75%), Gaps = 14/734 (1%)
 Frame = +3

Query: 222  WKSRALFHSRVSRITNPS-----PLYSLHGFCSNTKEENGVSNSPGDAAPSTGKVNSPRR 386
            ++SRA  +  + R ++ S      L+S       + + N  + SP DA P T  + S   
Sbjct: 10   YQSRARVYLSLPRSSDSSFFSFPRLFSSIEETQTSGDANPETQSP-DAKPETKNLGSTET 68

Query: 387  TP------RGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGARK 548
             P      RGK  N EK ED I RMM NRAWTTRLQNSIR LVP++DHSLV NVL+GA+K
Sbjct: 69   RPLRERFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAKK 128

Query: 549  PDHALQFFRWVEKTGY-RHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLFV 725
             +HALQFFRW E++G  RHDR TH+K+I++LG    LNHARCIL DMP+KGV  DED+FV
Sbjct: 129  LEHALQFFRWTERSGLIRHDRDTHMKMIKMLGEVQKLNHARCILLDMPEKGVPWDEDMFV 188

Query: 726  VLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLKD 905
            VLI SYGKAGIVQESVKIFQKMK+LGVERTI +Y+TLFKVILRRGR +MAKRYFN M+ +
Sbjct: 189  VLIESYGKAGIVQESVKIFQKMKDLGVERTIKSYNTLFKVILRRGRYMMAKRYFNKMVSE 248

Query: 906  GVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXXX 1085
            GV PTRHT+N+M+WGFFLSL++ETA RFF+DMK+RGI+PD VTYN +INGYCR +     
Sbjct: 249  GVEPTRHTYNLMLWGFFLSLRLETALRFFDDMKTRGISPDAVTYNTIINGYCRFKKMDEA 308

Query: 1086 XXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLPG 1265
                      +  P+V++YTTMIKGY+S  RVDDGLR+ EEMR+  I PNA TYSTLLPG
Sbjct: 309  EKLFVEMKGNNSEPSVVTYTTMIKGYLSVDRVDDGLRIFEEMRSFGIEPNATTYSTLLPG 368

Query: 1266 LCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIPA 1445
            LCD  KM EA+  L  M+ ++I PKDNSIFL+L+  Q K+G +  A +VLK M  L++PA
Sbjct: 369  LCDVGKMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPA 428

Query: 1446 EPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHKL 1625
            E  HYGVL+EN C+A  Y+RAIKLLD ++  E +L  Q +LEME SAYNP+IEYLCN+  
Sbjct: 429  EAGHYGVLIENQCKASAYNRAIKLLDTLIEKEIILRHQDTLEMEPSAYNPIIEYLCNNGQ 488

Query: 1626 TAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRSL 1805
            TAKAE  FRQLMK GVQD  + NNL+ GH+KEGNPES+ EILKIMSRR +P +A AY  L
Sbjct: 489  TAKAEVLFRQLMKRGVQDQDALNNLIRGHAKEGNPESSYEILKIMSRRGVPREANAYELL 548

Query: 1806 IMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEK-- 1979
            I S++ KGEP DAK ALD M+E GH+PDS LFR V+ESLFEDGRVQTASRVM  MI+K  
Sbjct: 549  IKSYMSKGEPGDAKTALDSMVEDGHVPDSALFRSVIESLFEDGRVQTASRVMMIMIDKNV 608

Query: 1980 GVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALKL 2159
            G++++MDL+AKILEALLMRGHVEEALGRI LL  N    D D LL+VL EK KTIAALKL
Sbjct: 609  GIEDNMDLIAKILEALLMRGHVEEALGRIDLLNQNGHTADLDSLLSVLSEKGKTIAALKL 668

Query: 2160 LDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLNA 2339
            LDFGLERD ++ F+SYDKVLD+LL AGKTLNAYS+LCK+MEKG   D     +LI+SLN 
Sbjct: 669  LDFGLERDLSLDFSSYDKVLDALLGAGKTLNAYSVLCKIMEKGSSTDWKSSDELIKSLNQ 728

Query: 2340 QGNTKQADILSRMI 2381
            +GNTKQAD+LSRMI
Sbjct: 729  EGNTKQADVLSRMI 742


>ref|XP_007158766.1| hypothetical protein PHAVU_002G180100g [Phaseolus vulgaris]
            gi|561032181|gb|ESW30760.1| hypothetical protein
            PHAVU_002G180100g [Phaseolus vulgaris]
          Length = 728

 Score =  881 bits (2277), Expect = 0.0
 Identities = 449/695 (64%), Positives = 543/695 (78%), Gaps = 1/695 (0%)
 Frame = +3

Query: 297  FCSNTKEENGVSNSPGDAAPSTGKVNSPRRTPRGKPPNPEKPEDIIIRMMSNRAWTTRLQ 476
            F SN       +    +  PS G   SP+   +  PP  +  E +I RMM+NRAWTTRLQ
Sbjct: 16   FRSNPFSTVSTAEVLSEPEPSHG---SPQPESQPVPPIDKNLELVICRMMANRAWTTRLQ 72

Query: 477  NSIRSLVPQFDHSLVLNVLNGARKPDHALQFFRWVEKTG-YRHDRSTHLKIIEILGGASM 653
            NSIRSLVP+FD SLV NVL+GA  P+HALQF+RWVE+ G + H   T LKI++ILG  S 
Sbjct: 73   NSIRSLVPRFDPSLVYNVLHGAASPEHALQFYRWVERAGLFAHTPDTTLKIVQILGRYSK 132

Query: 654  LNHARCILFDMPQKGVERDEDLFVVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDT 833
            LNHARCIL D  +      ED FV LI SYG+AGIVQESVK+FQKMKELGVERTI +YD 
Sbjct: 133  LNHARCILLDNTRAREAATEDAFVSLIDSYGRAGIVQESVKLFQKMKELGVERTIKSYDA 192

Query: 834  LFKVILRRGRVLMAKRYFNAMLKDGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRG 1013
            LFKVILRRGR +MAKRY+NAML++GV PTRHT+NI++WG FLSL+++TA RF+E+M SRG
Sbjct: 193  LFKVILRRGRYMMAKRYYNAMLREGVEPTRHTYNILLWGMFLSLRLDTAVRFYEEMNSRG 252

Query: 1014 IAPDVVTYNNMINGYCRVRXXXXXXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGL 1193
            + PDVVTYN +INGY R +               D++P VIS+TTM+KGYV+AGR+DD +
Sbjct: 253  VLPDVVTYNTLINGYFRFKKVEDAEKLFVEMKGRDIVPNVISFTTMLKGYVAAGRIDDAM 312

Query: 1194 RLVEEMRAHKIRPNAVTYSTLLPGLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISC 1373
            ++ E+M+   I+PNAVT+STLLPGLCDAEK  EA+  L EMV+R I PKDNS+F++L+S 
Sbjct: 313  KVFEDMKNCGIKPNAVTFSTLLPGLCDAEKTVEARDVLREMVERYIAPKDNSVFMKLLSV 372

Query: 1374 QCKSGKLDWAEDVLKGMVKLSIPAEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLN 1553
            Q KSG LD A DVLK M++LSIP E  HYGVL+E+FC+A  +D+A KLLD+++  E +  
Sbjct: 373  QSKSGDLDAAADVLKAMIRLSIPTEAGHYGVLIESFCKANEHDKAEKLLDKLIEKEIVSR 432

Query: 1554 PQISLEMEDSAYNPVIEYLCNHKLTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPE 1733
            PQ + EME S+YN +IEYLC+H  T+KAE FFRQL+K GVQD  +FN+L+ GHSKEGNP+
Sbjct: 433  PQNAFEMEASSYNLMIEYLCDHGRTSKAERFFRQLLKKGVQDSVAFNSLIRGHSKEGNPD 492

Query: 1734 SALEILKIMSRRKIPADALAYRSLIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVM 1913
            SA EI+KIM RR +P DA +YR LI S+L+KGEPADAK ALD M+ESGHLP+S L+RLVM
Sbjct: 493  SAFEIIKIMGRRAVPRDADSYRLLIESYLRKGEPADAKTALDSMLESGHLPESSLYRLVM 552

Query: 1914 ESLFEDGRVQTASRVMKSMIEKGVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCI 2093
            ESLF DGRVQTASRVMKSM+EKGVKEHMDLV+KILEALLMRGHVEEALGRI LLM N C 
Sbjct: 553  ESLFNDGRVQTASRVMKSMVEKGVKEHMDLVSKILEALLMRGHVEEALGRIDLLMHNGCE 612

Query: 2094 PDFDHLLAVLCEKEKTIAALKLLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCK 2273
            PDFDHLL++LCEKEKTIAALKLLDF LERDC I F+ YDKVLD+LL  GKTLNAYSILCK
Sbjct: 613  PDFDHLLSILCEKEKTIAALKLLDFVLERDCIIDFSLYDKVLDTLLAVGKTLNAYSILCK 672

Query: 2274 VMEKGGIKDLSGVGDLIRSLNAQGNTKQADILSRM 2378
            ++EK G  D     +LI+SLN +GNTKQAD+LSRM
Sbjct: 673  ILEKRGSTDWRSREELIKSLNHEGNTKQADVLSRM 707


>ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Capsella rubella]
            gi|482564904|gb|EOA29094.1| hypothetical protein
            CARUB_v10025361mg [Capsella rubella]
          Length = 757

 Score =  880 bits (2275), Expect = 0.0
 Identities = 459/735 (62%), Positives = 561/735 (76%), Gaps = 15/735 (2%)
 Frame = +3

Query: 222  WKSRALFHSRVSRITNPSPLYSLHGFCSNTKE--ENGVSN---SPGDAAPSTGKVNSPRR 386
            +++RA  +  + R ++ S L+SL    S+ ++   +G +N      DA P T  + S   
Sbjct: 10   YQARARVYLSLPR-SSDSSLFSLPRLFSSVEDIQTSGDANPETQSADAKPETKNLGSSTE 68

Query: 387  T-------PRGKPPNPEKPEDIIIRMMSNRAWTTRLQNSIRSLVPQFDHSLVLNVLNGAR 545
            T        RGK  N EK ED I RMM NRAWTTRLQNSIR LVP++DHSLV NVL+GA+
Sbjct: 69   TRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAK 128

Query: 546  KPDHALQFFRWVEKTGY-RHDRSTHLKIIEILGGASMLNHARCILFDMPQKGVERDEDLF 722
            K +HALQFFRW E++G  RHDR TH+K+I++LG    +N+ARCIL DMP+KGV  DED+F
Sbjct: 129  KLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGEVQKVNYARCILLDMPEKGVPWDEDMF 188

Query: 723  VVLILSYGKAGIVQESVKIFQKMKELGVERTITTYDTLFKVILRRGRVLMAKRYFNAMLK 902
            VVLI SYGKAGIVQESVKIFQKMK+LGVERTI +Y+TLFKVI+RRGR +MAKRYFN M+ 
Sbjct: 189  VVLIESYGKAGIVQESVKIFQKMKDLGVERTIKSYNTLFKVIMRRGRYMMAKRYFNKMVS 248

Query: 903  DGVLPTRHTFNIMIWGFFLSLKVETANRFFEDMKSRGIAPDVVTYNNMINGYCRVRXXXX 1082
            +GV PTRHT+N+M+WGFFLSL++ETA RFFEDMK+RGI+PD VTYN MINGYCR +    
Sbjct: 249  EGVEPTRHTYNLMLWGFFLSLRLETALRFFEDMKTRGISPDAVTYNTMINGYCRFKKMDE 308

Query: 1083 XXXXXXXXXXXDLIPTVISYTTMIKGYVSAGRVDDGLRLVEEMRAHKIRPNAVTYSTLLP 1262
                       ++ P+V+SYTTMIKGY+S  RVDDGLR+ EEMR+  I PNA TYST+LP
Sbjct: 309  AEKLFVEMKGNNIEPSVVSYTTMIKGYLSVDRVDDGLRIFEEMRSSGIEPNATTYSTVLP 368

Query: 1263 GLCDAEKMPEAQKFLNEMVQRNIVPKDNSIFLRLISCQCKSGKLDWAEDVLKGMVKLSIP 1442
            GLCDA KM EA+  L  M+ ++I PKDNSIFL+L+  Q K+G +  A +VLK M  L++P
Sbjct: 369  GLCDAGKMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVP 428

Query: 1443 AEPAHYGVLMENFCRAGFYDRAIKLLDEVVGNETLLNPQISLEMEDSAYNPVIEYLCNHK 1622
            AE  HYGVL+EN C+A  Y+RAIKLLD ++  E +L  Q +LEME SAYNP+IEYLCN+ 
Sbjct: 429  AEAGHYGVLIENQCKANAYNRAIKLLDTLLEKEIILRHQDTLEMEPSAYNPIIEYLCNNG 488

Query: 1623 LTAKAETFFRQLMKTGVQDPASFNNLLLGHSKEGNPESALEILKIMSRRKIPADALAYRS 1802
             T+KAE  FRQLMK GVQD  + NNL+ GH+KEGNP+S+ EILKIMSRR +P +A AY  
Sbjct: 489  QTSKAEVLFRQLMKRGVQDQDALNNLISGHAKEGNPDSSYEILKIMSRRGVPREANAYEL 548

Query: 1803 LIMSFLKKGEPADAKVALDGMIESGHLPDSELFRLVMESLFEDGRVQTASRVMKSMIEK- 1979
            LI S++ KGEP DAK ALD M+E GH+PDS LFR V+ESLFEDGRVQTASRVM  MI+K 
Sbjct: 549  LIKSYMSKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQTASRVMMIMIDKN 608

Query: 1980 -GVKEHMDLVAKILEALLMRGHVEEALGRIVLLMDNDCIPDFDHLLAVLCEKEKTIAALK 2156
             G++E+MDL+AKILEALLMRGHVEEALGRI LL  N    D D LL+VL EK KTIAALK
Sbjct: 609  VGIEENMDLIAKILEALLMRGHVEEALGRIDLLNQNGHAADLDSLLSVLSEKGKTIAALK 668

Query: 2157 LLDFGLERDCNISFASYDKVLDSLLVAGKTLNAYSILCKVMEKGGIKDLSGVGDLIRSLN 2336
            LLDFGLERD ++ F+SY+KVLD+LL AGKTLNAYS+LCK+MEKG   D     +LI+SLN
Sbjct: 669  LLDFGLERDLSLDFSSYEKVLDALLGAGKTLNAYSVLCKIMEKGSATDWKSSDELIKSLN 728

Query: 2337 AQGNTKQADILSRMI 2381
             +GNTKQAD+LSRMI
Sbjct: 729  QEGNTKQADVLSRMI 743

