BLASTX nr result

ID: Catharanthus23_contig00015560 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00015560
         (2396 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ17652.1| hypothetical protein PRUPE_ppa023564mg [Prunus pe...   962   0.0  
ref|XP_004243267.1| PREDICTED: pentatricopeptide repeat-containi...   949   0.0  
ref|XP_006348896.1| PREDICTED: pentatricopeptide repeat-containi...   946   0.0  
ref|XP_004305453.1| PREDICTED: pentatricopeptide repeat-containi...   936   0.0  
ref|XP_006468978.1| PREDICTED: pentatricopeptide repeat-containi...   928   0.0  
ref|XP_002277494.1| PREDICTED: pentatricopeptide repeat-containi...   927   0.0  
ref|XP_004137032.1| PREDICTED: pentatricopeptide repeat-containi...   908   0.0  
gb|EOX99930.1| Pentatricopeptide repeat-containing protein [Theo...   904   0.0  
gb|EXB92378.1| hypothetical protein L484_021362 [Morus notabilis]     895   0.0  
ref|XP_003532658.1| PREDICTED: pentatricopeptide repeat-containi...   875   0.0  
ref|XP_002322810.2| hypothetical protein POPTR_0016s07590g [Popu...   872   0.0  
ref|XP_004504670.1| PREDICTED: pentatricopeptide repeat-containi...   872   0.0  
ref|XP_006446832.1| hypothetical protein CICLE_v10018004mg [Citr...   855   0.0  
gb|ESW30891.1| hypothetical protein PHAVU_002G191000g [Phaseolus...   854   0.0  
sp|Q8S9M4.2|PP198_ARATH RecName: Full=Pentatricopeptide repeat-c...   793   0.0  
ref|XP_006293459.1| hypothetical protein CARUB_v10022804mg [Caps...   787   0.0  
ref|XP_006411375.1| hypothetical protein EUTSA_v10017967mg [Eutr...   786   0.0  
ref|XP_004970613.1| PREDICTED: pentatricopeptide repeat-containi...   755   0.0  
gb|EMT33444.1| hypothetical protein F775_20071 [Aegilops tauschii]    743   0.0  
ref|NP_850342.1| pentatricopeptide repeat-containing protein [Ar...   741   0.0  

>gb|EMJ17652.1| hypothetical protein PRUPE_ppa023564mg [Prunus persica]
          Length = 670

 Score =  962 bits (2488), Expect = 0.0
 Identities = 469/631 (74%), Positives = 542/631 (85%)
 Frame = +3

Query: 297  EFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRD 476
            + + LCSKGH+KEAF +F S IWS P L S LL+ACI  +SL L KQ+HSLI TSGCS D
Sbjct: 40   QLSSLCSKGHIKEAFESFKSEIWSNPSLFSHLLQACIPRKSLSLGKQLHSLIITSGCSAD 99

Query: 477  KFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVE 656
            KFV+NHLLN Y K+G +  A+ LF  LP++N+MS NILI G++Q GDL+SA K+F+EM E
Sbjct: 100  KFVSNHLLNFYSKVGDLGVALTLFGHLPRRNIMSCNILINGYVQKGDLESAQKVFNEMPE 159

Query: 657  RNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQV 836
            RN+ATWNA++TGLTQF+FN+EGL +FS MHELGFLPD FTLGSVLRGCAGL+ L+ GRQV
Sbjct: 160  RNVATWNALVTGLTQFQFNEEGLGLFSEMHELGFLPDEFTLGSVLRGCAGLRALHAGRQV 219

Query: 837  HSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYS 1016
            H+Y +K  F+ +LVVGSSLAHMYMKSGSL EGE+VI+S+P+ +VVA NT IAG AQNG+S
Sbjct: 220  HTYVMKCRFEFNLVVGSSLAHMYMKSGSLEEGERVIKSLPIRNVVAWNTLIAGKAQNGHS 279

Query: 1017 EVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXX 1196
            E VLDQYNIMKIAGFRPDK+TFVSVISSCSELATLGQGQQIHAE IK GA          
Sbjct: 280  EAVLDQYNIMKIAGFRPDKVTFVSVISSCSELATLGQGQQIHAEAIKAGASTVDAVISSL 339

Query: 1197 XXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEANE 1376
              MYSRCGCL+D+LK F+E  G DVVL S+MI+AYGFHGR +EAI+LF  ME+E LEAN+
Sbjct: 340  ISMYSRCGCLEDSLKAFKESVGGDVVLRSSMISAYGFHGRVEEAIQLFEEMEQEELEAND 399

Query: 1377 VTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIR 1556
            VTFLSLLYACSHCGLK+KG+E F+ M++KY L+P+VEHYTCVVDLLGRSGRLEEAE +IR
Sbjct: 400  VTFLSLLYACSHCGLKEKGIEFFNSMVEKYGLKPRVEHYTCVVDLLGRSGRLEEAESMIR 459

Query: 1557 SMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQD 1736
            SMPVKADAIIWKTLLSACK HKNA++AKRI+E+V+R +P DSASYVLLSNI ASARRWQD
Sbjct: 460  SMPVKADAIIWKTLLSACKIHKNANIAKRISEEVIRRDPQDSASYVLLSNIHASARRWQD 519

Query: 1737 VSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGY 1916
            VSEVRKAMR+R VKKEPG SW E+KNQVHQF +GDKSHPQS+E+D YL++L  ELKL GY
Sbjct: 520  VSEVRKAMRDRKVKKEPGISWLEIKNQVHQFCIGDKSHPQSKELDMYLQELTSELKLHGY 579

Query: 1917 VPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKY 2096
            VPDTGSVLHDMD EEKEYNL HHSEKLAIAFALMNT EGVP+R+MKNLRVC DCH+AIKY
Sbjct: 580  VPDTGSVLHDMDNEEKEYNLAHHSEKLAIAFALMNTPEGVPVRVMKNLRVCIDCHVAIKY 639

Query: 2097 ISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            IS +K REIIVRDASRFHHFK+G CSCGDYW
Sbjct: 640  ISLIKNREIIVRDASRFHHFKNGKCSCGDYW 670


>ref|XP_004243267.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Solanum lycopersicum]
          Length = 658

 Score =  949 bits (2453), Expect = 0.0
 Identities = 466/660 (70%), Positives = 546/660 (82%)
 Frame = +3

Query: 210  MGKYLLKPLAAFARFSHQHRCLCTTTLTAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSL 389
            MG+  L+PL      S   R    +    E + LCS+G++KEAF  FS LIW  P   S 
Sbjct: 1    MGQSCLRPLRFLPLRSANTRRF--SAAGTELSILCSQGYVKEAFNKFSFLIWDNPSHFSY 58

Query: 390  LLKACIQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLPKKN 569
            LL+ACIQ +S FLTKQ+HSLI TSGC RDKFV+NHLLN Y KLG++  AV LF+ LPK+N
Sbjct: 59   LLQACIQEKSFFLTKQLHSLIVTSGCFRDKFVSNHLLNAYSKLGQLDIAVTLFDKLPKRN 118

Query: 570  VMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHE 749
            VMS+NILIGG++Q GDLDSA K+FDEM ERNLA+WNAMITGLTQFEFN+  LS+F+ M+ 
Sbjct: 119  VMSFNILIGGYVQIGDLDSASKVFDEMGERNLASWNAMITGLTQFEFNERALSLFARMYG 178

Query: 750  LGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGE 929
            LG+LPDAFTLGSVLRGCAGLKDLN+GRQVH   +K G +   VV SSLAHMYM+SGSL E
Sbjct: 179  LGYLPDAFTLGSVLRGCAGLKDLNKGRQVHGCGLKLGLEGDFVVASSLAHMYMRSGSLSE 238

Query: 930  GEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSE 1109
            GE VI SMP  ++ A NT IAG AQNG  E  L+ YN++KIAGFRPDKITFVSVISSCSE
Sbjct: 239  GEIVIMSMPDQTMAAWNTLIAGRAQNGCFEGALELYNLVKIAGFRPDKITFVSVISSCSE 298

Query: 1110 LATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAM 1289
            LAT+GQGQQIH++VIK G             MYS+CGCLD+A K+FEE++ AD+VLWSAM
Sbjct: 299  LATIGQGQQIHSDVIKTGVISVVAVVSSLISMYSKCGCLDEAEKIFEERKEADLVLWSAM 358

Query: 1290 IAAYGFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYR 1469
            I+AYGFHGRGK A+ELF+RME+EGL  N +T LSLLYACSH G+KD+GLE FDLM++KY 
Sbjct: 359  ISAYGFHGRGKNAVELFHRMEQEGLAPNHITLLSLLYACSHSGMKDEGLEFFDLMVEKYN 418

Query: 1470 LQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIA 1649
            ++P++ HYTCVVDLLGR+GRL+EAE LIRSMPVK D +IWKTLLSACK HKNADMA+ IA
Sbjct: 419  VEPQLVHYTCVVDLLGRAGRLQEAEALIRSMPVKPDGVIWKTLLSACKIHKNADMARSIA 478

Query: 1650 EQVLRINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQF 1829
            E+VLRI+P DSASYVLL+N+QASA+RW+ VSEVRK+M++R VKKEPG SW ELKNQVH F
Sbjct: 479  EEVLRIDPQDSASYVLLANVQASAKRWKSVSEVRKSMKDRGVKKEPGISWLELKNQVHHF 538

Query: 1830 IMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAF 2009
            I+GDKSHPQS+E+D YLK+L+ ELKL+GYVPDTGSVLHDM+ EEKEYNLVHHSEKLAIAF
Sbjct: 539  IIGDKSHPQSDEVDVYLKELIAELKLEGYVPDTGSVLHDMELEEKEYNLVHHSEKLAIAF 598

Query: 2010 ALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            ALMNT EG PIRIMKNLR+C DCH+AIKYIS MKKREIIVRD+SRFHHFK+G CSCGDYW
Sbjct: 599  ALMNTPEGFPIRIMKNLRICSDCHMAIKYISKMKKREIIVRDSSRFHHFKEGCCSCGDYW 658


>ref|XP_006348896.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Solanum tuberosum]
          Length = 658

 Score =  946 bits (2446), Expect = 0.0
 Identities = 469/663 (70%), Positives = 548/663 (82%), Gaps = 3/663 (0%)
 Frame = +3

Query: 210  MGKYLLKPLAAFARFSHQHRCLCTTTLTAEFTDL---CSKGHLKEAFANFSSLIWSEPPL 380
            MG+  ++PL    RF H  R   T   +A  T+L   CS+G++KEAF  FS LIW  P  
Sbjct: 1    MGQSCVRPL----RFLHL-RSANTRRFSAAATELSILCSQGYVKEAFNKFSFLIWDNPSH 55

Query: 381  CSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLP 560
             S LL+ACIQ +S  LTKQ+HSLI TSGC RDKFV+NHLLN Y KLG++  AV+LF+ LP
Sbjct: 56   FSYLLQACIQEKSFSLTKQLHSLIVTSGCFRDKFVSNHLLNAYSKLGQLDIAVSLFDKLP 115

Query: 561  KKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSS 740
            K+NVMS+NILIGG++Q GDL+SA K+FDEM ERNLA+WNAMITGLTQFEFN+  LS+FS 
Sbjct: 116  KRNVMSFNILIGGYVQIGDLESASKVFDEMGERNLASWNAMITGLTQFEFNERALSLFSQ 175

Query: 741  MHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGS 920
            M+  G+LPDAFTLGSVLRGCAGLKDLN+GRQVH   +K G     VV SSLAHMYM+SGS
Sbjct: 176  MYGFGYLPDAFTLGSVLRGCAGLKDLNKGRQVHGCGLKLGLQGDFVVASSLAHMYMRSGS 235

Query: 921  LGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISS 1100
            L EGE VI SMP  ++ A NT IAG AQNG  E  L+ YN++KIAGFRPDKITFVSVISS
Sbjct: 236  LREGEIVIMSMPDQTMAAWNTLIAGRAQNGCFEGALELYNLVKIAGFRPDKITFVSVISS 295

Query: 1101 CSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLW 1280
            CSELAT+GQGQQIH++VIK GA            MYS+CGCLD+A K+FEE+E AD+VLW
Sbjct: 296  CSELATIGQGQQIHSDVIKTGAISVVAVVSSLISMYSKCGCLDEAEKIFEEREEADIVLW 355

Query: 1281 SAMIAAYGFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMD 1460
            SAMI+AYGFHG GK A+ELF+RME+EGL  N +T LSLLYACSH G+KD+GLE FDLM++
Sbjct: 356  SAMISAYGFHGMGKNAVELFHRMEQEGLAPNHITLLSLLYACSHSGMKDEGLEFFDLMVE 415

Query: 1461 KYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAK 1640
            KY ++P++ HYTCVVDLLGR+G L+EAE LIRSMPVK D +IWKTLLSACK HKNADMA+
Sbjct: 416  KYNVEPQLVHYTCVVDLLGRAGCLQEAEALIRSMPVKPDGVIWKTLLSACKIHKNADMAR 475

Query: 1641 RIAEQVLRINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQV 1820
             IAE+VLRI+P DSASYVLL+N+QASA+RW+ VSEVRK+M++R VKKEPG SW ELKNQV
Sbjct: 476  SIAEEVLRIDPEDSASYVLLANVQASAKRWKSVSEVRKSMKDRGVKKEPGISWLELKNQV 535

Query: 1821 HQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLA 2000
            H FI+GDKSHPQS+E+D YLK+L+ ELKL+GYVPDTGSVLHDM+ EEKEYNLVHHSEKLA
Sbjct: 536  HHFIIGDKSHPQSDEVDVYLKELIAELKLEGYVPDTGSVLHDMELEEKEYNLVHHSEKLA 595

Query: 2001 IAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCG 2180
            IAFALMNT EG PIRIMKNLR+CGDCH+AIKYIS MKKREIIVRD+SRFHHFKDG CSCG
Sbjct: 596  IAFALMNTPEGFPIRIMKNLRICGDCHMAIKYISQMKKREIIVRDSSRFHHFKDGCCSCG 655

Query: 2181 DYW 2189
            DYW
Sbjct: 656  DYW 658


>ref|XP_004305453.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Fragaria vesca subsp. vesca]
          Length = 641

 Score =  936 bits (2420), Expect = 0.0
 Identities = 463/638 (72%), Positives = 531/638 (83%)
 Frame = +3

Query: 276  CTTTLTAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLIT 455
            CT+++  + T LCSKG +K+AF  F S + S+P + S LLKACI  +SL L+KQ+HSL+ 
Sbjct: 5    CTSSIE-QLTTLCSKGLIKQAFDTFKSELLSDPSIFSHLLKACIPTKSLSLSKQLHSLLI 63

Query: 456  TSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMK 635
            TSGCS DKF +NHLLN Y K+G + +A ALF  LP++N+MS NILI GF+Q GDL+SA K
Sbjct: 64   TSGCSSDKFASNHLLNLYSKIGDLQSASALFRHLPRRNIMSGNILINGFVQIGDLESAQK 123

Query: 636  LFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKD 815
            +FDEM ERN+ATWNAM+TGL QFEFN+EGL +F  MHELGF  D FTLGSVLRGCAGL+ 
Sbjct: 124  VFDEMPERNMATWNAMVTGLVQFEFNEEGLELFKGMHELGFSMDVFTLGSVLRGCAGLRV 183

Query: 816  LNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAG 995
            +N G QVH YAVK G + +LVVGSSLAHMYM+SG L EGEKVI+SMP+ +VV+ NT IAG
Sbjct: 184  VNAGCQVHGYAVKCGLEFNLVVGSSLAHMYMRSGRLVEGEKVIKSMPIRNVVSWNTLIAG 243

Query: 996  MAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXX 1175
             AQNG SE VLDQYN+MKIAGFRPDKITFVSV+SSCSELATLGQGQQIHAEVIK G    
Sbjct: 244  KAQNGQSEGVLDQYNMMKIAGFRPDKITFVSVLSSCSELATLGQGQQIHAEVIKAGVSSV 303

Query: 1176 XXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEK 1355
                     MYSRCGCL+DALK F E EGADVVLWS++I+AYGFHGRG+EAI+LF +ME+
Sbjct: 304  VAVISTLITMYSRCGCLEDALKAFWECEGADVVLWSSVISAYGFHGRGEEAIKLFEQMEQ 363

Query: 1356 EGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLE 1535
            EG EAN+VTFLSLLYACSHCG+K+KGLELFDLM+ KY L PK+EHYTCVVDLLGRSG LE
Sbjct: 364  EGFEANDVTFLSLLYACSHCGMKEKGLELFDLMVQKYGLIPKLEHYTCVVDLLGRSGCLE 423

Query: 1536 EAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQA 1715
            EAE +IRSMPVKADAIIW TLLSACK HKNADMA+RI + VLR NP DSA YVLLSNI A
Sbjct: 424  EAEAMIRSMPVKADAIIWITLLSACKIHKNADMARRIGQDVLRQNPEDSALYVLLSNIHA 483

Query: 1716 SARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLME 1895
            SA+RW+ VSEVR AMR+R VKKEPG SW E+KN+V+QF MGD SHPQ   ID YLK+L  
Sbjct: 484  SAKRWEAVSEVRTAMRDRKVKKEPGISWLEIKNKVYQFRMGDNSHPQYMAIDLYLKELRS 543

Query: 1896 ELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGD 2075
            E+KL GYVPDTGSVLHDMD EEKEY+L HHSEKLAIAF LMNT EGVP+R+MKNLRVC D
Sbjct: 544  EMKLHGYVPDTGSVLHDMDNEEKEYDLAHHSEKLAIAFGLMNTPEGVPLRVMKNLRVCID 603

Query: 2076 CHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            CH+AIKYIS +K REIIVRDASRFHHFK+G CSCGDYW
Sbjct: 604  CHVAIKYISQIKNREIIVRDASRFHHFKNGKCSCGDYW 641


>ref|XP_006468978.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            isoform X1 [Citrus sinensis]
            gi|568829336|ref|XP_006468979.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g41080-like isoform X2 [Citrus sinensis]
          Length = 654

 Score =  928 bits (2399), Expect = 0.0
 Identities = 445/633 (70%), Positives = 528/633 (83%)
 Frame = +3

Query: 291  TAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCS 470
            T EF +LCSKGH+KEAF  F S IWS+P L S L+++C   +SL  +KQ+HSLI TSGCS
Sbjct: 22   TEEFINLCSKGHIKEAFNRFKSEIWSDPTLFSHLIQSCTLKKSLSCSKQLHSLIVTSGCS 81

Query: 471  RDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEM 650
             + F+ NHLLN Y K+G++ TAV LF  +P++N+MS NI+I   +QSGDL+SA K+FD M
Sbjct: 82   SNNFICNHLLNMYSKIGQLQTAVTLFGLMPRRNIMSCNIMINANVQSGDLESARKVFDGM 141

Query: 651  VERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGR 830
             +RN+ATWNAM+ GL QFEFN+EGL + S MH++GFLPD FTLGSVLRGCAGL+ L+ GR
Sbjct: 142  TKRNIATWNAMVAGLVQFEFNEEGLRLLSEMHQVGFLPDEFTLGSVLRGCAGLRGLDAGR 201

Query: 831  QVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNG 1010
            Q+H Y +K GF+L LVVGSSLAHMYMKSGSL EGEKVIR MP+ +V+A NT IAG AQNG
Sbjct: 202  QIHCYVMKGGFELDLVVGSSLAHMYMKSGSLVEGEKVIRLMPIRNVIAWNTLIAGKAQNG 261

Query: 1011 YSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXX 1190
             +E VLDQYN+M++ GFRPDKITFVSV+SSCSELATLGQGQQIHAEV+K GA        
Sbjct: 262  LAEDVLDQYNLMRMVGFRPDKITFVSVVSSCSELATLGQGQQIHAEVVKAGASLDVGVIS 321

Query: 1191 XXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEA 1370
                MYSRCGCLDD++K F E E +DVVLWS+MIAAYGFHG+G+EAI LF +ME++  EA
Sbjct: 322  SLISMYSRCGCLDDSMKAFLECEYSDVVLWSSMIAAYGFHGKGEEAINLFEQMEQKEFEA 381

Query: 1371 NEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWL 1550
            N+VTF+SLLYACSHCGLK+KG+E F+LM+ KY L+P++EHYTCVVDLLGR G L+EAE L
Sbjct: 382  NDVTFVSLLYACSHCGLKEKGMEFFNLMVKKYGLKPRLEHYTCVVDLLGRCGCLDEAEAL 441

Query: 1551 IRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRW 1730
            IR+MPVKAD IIWKTLLSACK HK+ DMA RIAE+VL++NP D+A YVL SNI ASA+RW
Sbjct: 442  IRNMPVKADTIIWKTLLSACKIHKSTDMAGRIAEEVLKLNPRDAAPYVLFSNIHASAKRW 501

Query: 1731 QDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLK 1910
            Q VSE+R+AMRERNVKKEPG SW E+KNQVHQF MGDKSHP S EID YL++L  E+KL+
Sbjct: 502  QGVSELREAMRERNVKKEPGVSWLEIKNQVHQFTMGDKSHPSSMEIDLYLEELTSEMKLR 561

Query: 1911 GYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAI 2090
            GYVPDTG+ +HDMD EEKEYNL HHSEKLAIAFALMNT  GVPIR+MKNLRVC DCH+AI
Sbjct: 562  GYVPDTGADMHDMDNEEKEYNLKHHSEKLAIAFALMNTPTGVPIRVMKNLRVCSDCHVAI 621

Query: 2091 KYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            KYIS +K REIIVRDASRFHHF++G CSCGDYW
Sbjct: 622  KYISEIKNREIIVRDASRFHHFRNGKCSCGDYW 654


>ref|XP_002277494.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080
            [Vitis vinifera]
          Length = 657

 Score =  927 bits (2396), Expect = 0.0
 Identities = 456/660 (69%), Positives = 534/660 (80%)
 Frame = +3

Query: 210  MGKYLLKPLAAFARFSHQHRCLCTTTLTAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSL 389
            MGKY L+PL    R          + LTAEFT+LCSKGHLK+AF  FSS IWSEP L S 
Sbjct: 1    MGKYCLRPLT---RRHFSTNPSSGSELTAEFTNLCSKGHLKQAFDRFSSHIWSEPSLFSH 57

Query: 390  LLKACIQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLPKKN 569
            LL++CI   SL L KQ+HSLI TSGCS DKF++NHLLN Y K G++ TA+ LF  +P+KN
Sbjct: 58   LLQSCISENSLSLGKQLHSLIITSGCSSDKFISNHLLNLYSKCGQLDTAITLFGVMPRKN 117

Query: 570  VMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHE 749
            +MS NILI G+ +SGD  +A K+FDEM ERN+ATWNAM+ GL QFEFN+EGL +FS M+E
Sbjct: 118  IMSCNILINGYFRSGDWVTARKMFDEMPERNVATWNAMVAGLIQFEFNEEGLGLFSRMNE 177

Query: 750  LGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGE 929
            LGFLPD F LGSVLRGCAGL+ L  GRQVH Y  K GF+ +LVV SSLAHMYMK GSLGE
Sbjct: 178  LGFLPDEFALGSVLRGCAGLRALVAGRQVHGYVRKCGFEFNLVVVSSLAHMYMKCGSLGE 237

Query: 930  GEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSE 1109
            GE++IR+MP  +VVA NT IAG AQNGY E VLDQYN+MK+AGFRPDKITFVSVISSCSE
Sbjct: 238  GERLIRAMPSQNVVAWNTLIAGRAQNGYPEEVLDQYNMMKMAGFRPDKITFVSVISSCSE 297

Query: 1110 LATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAM 1289
            LATLGQGQQIHAEVIK GA            MYSRCGCL+ +LKVF E E  DVV WS+M
Sbjct: 298  LATLGQGQQIHAEVIKAGASLIVSVISSLISMYSRCGCLEYSLKVFLECENGDVVCWSSM 357

Query: 1290 IAAYGFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYR 1469
            IAAYGFHGRG EAI+LFN+ME+E LEAN+VTFLSLLYACSHCGLK+KG++ FDLM++KY 
Sbjct: 358  IAAYGFHGRGVEAIDLFNQMEQEKLEANDVTFLSLLYACSHCGLKEKGIKFFDLMVEKYG 417

Query: 1470 LQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIA 1649
            ++P++EHYTC+VDLLGR G +EEAE LIRSMPVKAD I WKTLLSACK HK  +MA+RI+
Sbjct: 418  VKPRLEHYTCMVDLLGRYGSVEEAEALIRSMPVKADVITWKTLLSACKIHKKTEMARRIS 477

Query: 1650 EQVLRINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQF 1829
            E+V R++P D   YVLLSNI AS +RW DVS+VRKAMR+R +KKEPG SW E+KNQ+HQF
Sbjct: 478  EEVFRLDPRDPVPYVLLSNIHASDKRWDDVSDVRKAMRDRKLKKEPGISWLEVKNQIHQF 537

Query: 1830 IMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAF 2009
             MGDKSHP+S EI +YL++L  E+K +GYVPD  SVLHDMD E+KEY+LVHHSEKLAIAF
Sbjct: 538  CMGDKSHPKSVEIASYLRELTSEMKKRGYVPDIDSVLHDMDVEDKEYSLVHHSEKLAIAF 597

Query: 2010 ALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            AL+ T  G PIR++KNLRVC DCH+AIKYIS +  REIIVRD+SRFHHFK+G CSCGDYW
Sbjct: 598  ALLYTPVGTPIRVIKNLRVCSDCHVAIKYISEISNREIIVRDSSRFHHFKNGRCSCGDYW 657


>ref|XP_004137032.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Cucumis sativus] gi|449526872|ref|XP_004170437.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g41080-like [Cucumis sativus]
          Length = 667

 Score =  908 bits (2347), Expect = 0.0
 Identities = 445/655 (67%), Positives = 538/655 (82%)
 Frame = +3

Query: 225  LKPLAAFARFSHQHRCLCTTTLTAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKAC 404
            L PL +F   S   +   + +L  EFT LC+ G +K+A+  F+S IWS+P L S LL++C
Sbjct: 14   LNPLYSFTVRSLSMKISSSASLQ-EFTSLCNDGRIKQAYDTFTSEIWSDPSLFSHLLQSC 72

Query: 405  IQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYN 584
            I++ SLF  KQ+HSLI TSG S+DKF++NHLLN Y KLG+  +++ LF  +P++NVMS+N
Sbjct: 73   IKLGSLFGGKQVHSLIITSGGSKDKFISNHLLNFYSKLGQFKSSLVLFSNMPRRNVMSFN 132

Query: 585  ILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLP 764
            ILI G++Q GDL+SA KLFDEM ERN+ATWNAMI GLTQFEFNK+ LS+F  M+ LGFLP
Sbjct: 133  ILINGYLQLGDLESAQKLFDEMSERNIATWNAMIAGLTQFEFNKQALSLFKEMYGLGFLP 192

Query: 765  DAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVI 944
            D FTLGSVLRGCAGL+ L  G++VH+  +K GF+L  VVGSSLAHMY+KSGSL +GEK+I
Sbjct: 193  DEFTLGSVLRGCAGLRSLLAGQEVHACLLKCGFELSSVVGSSLAHMYIKSGSLSDGEKLI 252

Query: 945  RSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLG 1124
            +SMP+ +VVA NT IAG AQNG  E VL+QYN+MK+AGFRPDKITFVSV+S+CSELATLG
Sbjct: 253  KSMPIRTVVAWNTLIAGKAQNGCPEEVLNQYNMMKMAGFRPDKITFVSVLSACSELATLG 312

Query: 1125 QGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYG 1304
            QGQQIHAEVIK GA            MYSR GCL+D++K F ++E  DVVLWS+MIAAYG
Sbjct: 313  QGQQIHAEVIKAGASSVLAVVSSLISMYSRSGCLEDSIKAFVDRENFDVVLWSSMIAAYG 372

Query: 1305 FHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKV 1484
            FHGRG+EA+ELF++ME   +EANEVTFLSLLYACSH GLK+KG E FDLM+ KY+L+P++
Sbjct: 373  FHGRGEEALELFHQMEDLKMEANEVTFLSLLYACSHSGLKEKGTEYFDLMVKKYKLKPRI 432

Query: 1485 EHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLR 1664
            EHYTCVVDLLGR+GRLEEAE +IRSMPV+ D IIWKTLL+ACK HK A+MA+RI+E++++
Sbjct: 433  EHYTCVVDLLGRAGRLEEAEGMIRSMPVQPDGIIWKTLLAACKLHKEAEMAERISEEIIK 492

Query: 1665 INPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDK 1844
            ++P D+ASYVLLSNI ASAR W +VS++RKAMR+R+V+KEPG SW ELKN VHQF MGDK
Sbjct: 493  LDPLDAASYVLLSNIHASARNWLNVSQIRKAMRDRSVRKEPGISWLELKNLVHQFSMGDK 552

Query: 1845 SHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNT 2024
            SHPQ  EID YLK+LM ELK  GYVP+ GSVLHDMD EEKEYNL HHSEK AIAFALMNT
Sbjct: 553  SHPQYFEIDLYLKELMSELKQHGYVPELGSVLHDMDNEEKEYNLAHHSEKFAIAFALMNT 612

Query: 2025 VEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
             E VPIR+MKNLRVC DCH AIK IS ++ REIIVRDASRFHHFKDG CSCG+YW
Sbjct: 613  SENVPIRVMKNLRVCDDCHNAIKCISRIRNREIIVRDASRFHHFKDGECSCGNYW 667


>gb|EOX99930.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
          Length = 672

 Score =  904 bits (2336), Expect = 0.0
 Identities = 447/668 (66%), Positives = 526/668 (78%), Gaps = 7/668 (1%)
 Frame = +3

Query: 207  CMGKYLLKPLAAFARFSHQHR-------CLCTTTLTAEFTDLCSKGHLKEAFANFSSLIW 365
            CMG Y      +F  FS   R       C   +  T+E T LCSKG  K+AF  F   IW
Sbjct: 8    CMGWYCP---GSFLSFSSSSRFLSAIAACESASNFTSELTHLCSKGLAKQAFDRFHPQIW 64

Query: 366  SEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVAL 545
            ++P L S L+++CI   SL L KQ+HSL+ TSG S+D+F++NHLLN Y K G + TAV+L
Sbjct: 65   ADPSLFSHLIQSCIPQNSLSLGKQLHSLVITSGSSKDRFISNHLLNMYSKFGNLRTAVSL 124

Query: 546  FETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGL 725
            +  + +KN+MS NILI G +Q GDL+ A KLF EM  RNLATWNAM+ G  +FEFN+EGL
Sbjct: 125  YGVMLRKNIMSCNILINGHVQVGDLEGARKLFGEMPLRNLATWNAMVGGFIEFEFNEEGL 184

Query: 726  SMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMY 905
             +F  MH LGF+PD FTL +VLRGCAGLK L  GRQVH Y +K GF+ HLVVG+SLAHMY
Sbjct: 185  RLFKEMHFLGFMPDDFTLSTVLRGCAGLKALLEGRQVHCYVMKCGFEFHLVVGNSLAHMY 244

Query: 906  MKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFV 1085
            MKSG LGEGE+V++S+P+ +VVA NT IAG A NGYSE VL+ Y +M +AG RPDKITFV
Sbjct: 245  MKSGRLGEGERVMKSLPIQNVVAWNTLIAGNAHNGYSESVLNLYCMMNMAGVRPDKITFV 304

Query: 1086 SVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGA 1265
            SVISSCSELATLGQGQQIHA+V+K GA            MYSRCGCL D++K+F E E  
Sbjct: 305  SVISSCSELATLGQGQQIHADVVKTGASSVVGVISSLISMYSRCGCLGDSIKIFLECEEP 364

Query: 1266 DVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELF 1445
            D+V+WS+MIAAYGFHGRG EA+ELF ++E+E L  N+VTFLSLLYACSHCG KDKGLE F
Sbjct: 365  DLVVWSSMIAAYGFHGRGVEAVELFEQIEQEELGPNDVTFLSLLYACSHCGFKDKGLEFF 424

Query: 1446 DLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKN 1625
            +LM +KY ++P++EHYTCVVDLLGR G L+EAE +IRS+P+KADAIIWKTLLSACK HKN
Sbjct: 425  NLMTEKYGVKPRLEHYTCVVDLLGRFGGLDEAEAMIRSIPMKADAIIWKTLLSACKIHKN 484

Query: 1626 ADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFE 1805
            ADMA+RIAE+VL+++P DSASYVLLSNI ASA RWQDVSEVRKAMR++ VKKEPG SW E
Sbjct: 485  ADMARRIAEEVLKLDPQDSASYVLLSNIHASAERWQDVSEVRKAMRDKGVKKEPGISWLE 544

Query: 1806 LKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHH 1985
            +KNQVHQF MGDKSHPQSEEID YLK+L  E+KL GYVPDTGSVLHDM  EEKEYNL HH
Sbjct: 545  IKNQVHQFSMGDKSHPQSEEIDIYLKELTAEMKLHGYVPDTGSVLHDMANEEKEYNLTHH 604

Query: 1986 SEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDG 2165
            SEK+AIAFAL NT  G PIR+MKNLRVC DCH+AIK IS +K REIIVRDASRFHHFK+G
Sbjct: 605  SEKMAIAFALKNTPAGAPIRVMKNLRVCSDCHVAIKIISEIKNREIIVRDASRFHHFKNG 664

Query: 2166 HCSCGDYW 2189
             CSC DYW
Sbjct: 665  KCSCSDYW 672


>gb|EXB92378.1| hypothetical protein L484_021362 [Morus notabilis]
          Length = 673

 Score =  895 bits (2314), Expect = 0.0
 Identities = 443/674 (65%), Positives = 532/674 (78%), Gaps = 13/674 (1%)
 Frame = +3

Query: 207  CMGKYLLKPLAAFARFSHQHRCLCT-------------TTLTAEFTDLCSKGHLKEAFAN 347
            CMGK  L  +   + F+ Q  C+ T             +T   EFT LCSKGH+KEAF +
Sbjct: 3    CMGKSCLNHVRLCSLFNTQ--CIKTRHFISTSTSKTGASTSIEEFTALCSKGHVKEAFKS 60

Query: 348  FSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVNNHLLNTYCKLGRV 527
            F S IWS+  L   L++ACI  +SL + KQ+HSL  TSGC  +KF +NHLL+ Y KL   
Sbjct: 61   FRSEIWSDTSLFCHLVQACILRKSLPMGKQLHSLTITSGCL-NKFFSNHLLSMYSKLRES 119

Query: 528  STAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFE 707
             TA+ LF+ +P +N+MS NI+I  ++QSGDLDSA  +FDEM +RN+ATWNAM++GL QFE
Sbjct: 120  QTAITLFDHMPWRNIMSCNIMINCYVQSGDLDSARNVFDEMPQRNVATWNAMVSGLIQFE 179

Query: 708  FNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGS 887
            FN +GL +FS MHELGFLPD +TLGSVLRGCAGL+ L  G+QVH+Y +KSGF   LVVGS
Sbjct: 180  FNGDGLCLFSEMHELGFLPDEYTLGSVLRGCAGLRSLRAGKQVHAYVMKSGFKFDLVVGS 239

Query: 888  SLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRP 1067
            SLAHMYMKSGSL EGEKVI SMP+ +VVA NT IAG AQ+G+ E VLD YNIMK+AG RP
Sbjct: 240  SLAHMYMKSGSLEEGEKVIDSMPIRNVVAWNTLIAGKAQSGHPEEVLDNYNIMKLAGLRP 299

Query: 1068 DKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVF 1247
            DKITFVSVISSCS+LATLGQGQQ HAE IK GA            MYSRCGCL+D++KVF
Sbjct: 300  DKITFVSVISSCSDLATLGQGQQTHAEAIKAGACSVVDLTSTLVSMYSRCGCLEDSVKVF 359

Query: 1248 EEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKD 1427
             E E  D VLWS+MIAAYGFHGRG+EAI+LF RME+EG+EA++V FLSLLYACSHCGL++
Sbjct: 360  VESESMDPVLWSSMIAAYGFHGRGEEAIKLFERMEEEGMEADDVAFLSLLYACSHCGLRE 419

Query: 1428 KGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSA 1607
            KGLE FDLM+ +Y L+P+ EHY C+VDLL R G LEEAE +IRSMP+KADAIIWK LL+A
Sbjct: 420  KGLEFFDLMVGRYGLKPRREHYACIVDLLSRYGCLEEAEAMIRSMPIKADAIIWKILLAA 479

Query: 1608 CKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEP 1787
            CK HKNAD+A R+AE+VL+++P DSASYVLLSN+ ASA+RW+DVS VRK MR++N+KKEP
Sbjct: 480  CKIHKNADVASRVAEEVLKVDPQDSASYVLLSNVHASAKRWEDVSAVRKMMRDKNLKKEP 539

Query: 1788 GTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKE 1967
            G SW E+KNQVHQF  GD+SHP+S+EID YL +L  E+K +GY P+T +VLHDMD EEKE
Sbjct: 540  GVSWVEIKNQVHQFSRGDRSHPKSKEIDLYLNELTTEMKFRGYAPNTSAVLHDMDVEEKE 599

Query: 1968 YNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRF 2147
             +L HHSEKLAIAFALMNT  GVP+RIMKNLRVC DCH+AIKYIS  K REIIVRD+SRF
Sbjct: 600  DSLAHHSEKLAIAFALMNTPGGVPLRIMKNLRVCEDCHLAIKYISETKNREIIVRDSSRF 659

Query: 2148 HHFKDGHCSCGDYW 2189
            HHF++G CSCGDYW
Sbjct: 660  HHFRNGGCSCGDYW 673


>ref|XP_003532658.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Glycine max]
          Length = 674

 Score =  875 bits (2260), Expect = 0.0
 Identities = 426/631 (67%), Positives = 519/631 (82%)
 Frame = +3

Query: 297  EFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRD 476
            +F  LCSKGH++EAF +F S IW+EP L S LL+ACI ++S+ L KQ+HSLI TSGCS D
Sbjct: 44   QFATLCSKGHIREAFESFLSEIWAEPRLFSNLLQACIPLKSVSLGKQLHSLIFTSGCSSD 103

Query: 477  KFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVE 656
            KF++NHLLN Y K G +  AVALF+ +P++N+MS NI+I  ++  G+L+SA  LFDEM +
Sbjct: 104  KFISNHLLNLYSKFGELQAAVALFDRMPRRNIMSCNIMIKAYLGMGNLESAKNLFDEMPD 163

Query: 657  RNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQV 836
            RN+ATWNAM+TGLT+FE N+E L +FS M+EL F+PD ++LGSVLRGCA L  L  G+QV
Sbjct: 164  RNVATWNAMVTGLTKFEMNEEALLLFSRMNELSFMPDEYSLGSVLRGCAHLGALLAGQQV 223

Query: 837  HSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYS 1016
            H+Y +K GF+ +LVVG SLAHMYMK+GS+ +GE+VI  MP  S+VA NT ++G AQ GY 
Sbjct: 224  HAYVMKCGFECNLVVGCSLAHMYMKAGSMHDGERVINWMPDCSLVAWNTLMSGKAQKGYF 283

Query: 1017 EVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXX 1196
            E VLDQY +MK+AGFRPDKITFVSVISSCSELA L QG+QIHAE +K GA          
Sbjct: 284  EGVLDQYCMMKMAGFRPDKITFVSVISSCSELAILCQGKQIHAEAVKAGASSEVSVVSSL 343

Query: 1197 XXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEANE 1376
              MYSRCGCL D++K F E +  DVVLWS+MIAAYGFHG+G+EAI+LFN ME+E L  NE
Sbjct: 344  VSMYSRCGCLQDSIKTFLECKERDVVLWSSMIAAYGFHGQGEEAIKLFNEMEQENLPGNE 403

Query: 1377 VTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIR 1556
            +TFLSLLYACSHCGLKDKGL LFD+M+ KY L+ +++HYTC+VDLLGRSG LEEAE +IR
Sbjct: 404  ITFLSLLYACSHCGLKDKGLGLFDMMVKKYGLKARLQHYTCLVDLLGRSGCLEEAEAMIR 463

Query: 1557 SMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQD 1736
            SMPVKADAIIWKTLLSACK HKNA++A+R+A++VLRI+P DSASYVLL+NI +SA RWQ+
Sbjct: 464  SMPVKADAIIWKTLLSACKIHKNAEIARRVADEVLRIDPQDSASYVLLANIYSSANRWQN 523

Query: 1737 VSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGY 1916
            VSEVR+AM+++ VKKEPG SW E+KNQVHQF MGD+ HP+  EI+ YL++L  E+K +GY
Sbjct: 524  VSEVRRAMKDKMVKKEPGISWVEVKNQVHQFHMGDECHPKHVEINQYLEELTSEIKRQGY 583

Query: 1917 VPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKY 2096
            VPDT SVLHDMD EEKE  L HHSEKLAIAFALMNT EGVPIR+MKNLRVC DCH+AIKY
Sbjct: 584  VPDTSSVLHDMDNEEKEQILRHHSEKLAIAFALMNTPEGVPIRVMKNLRVCSDCHVAIKY 643

Query: 2097 ISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            IS +KK EIIVRD+SRFHHFK+G CSCGDYW
Sbjct: 644  ISEIKKLEIIVRDSSRFHHFKNGTCSCGDYW 674


>ref|XP_002322810.2| hypothetical protein POPTR_0016s07590g [Populus trichocarpa]
            gi|550321057|gb|EEF04571.2| hypothetical protein
            POPTR_0016s07590g [Populus trichocarpa]
          Length = 670

 Score =  872 bits (2253), Expect = 0.0
 Identities = 426/656 (64%), Positives = 519/656 (79%), Gaps = 10/656 (1%)
 Frame = +3

Query: 252  FSHQHRCLCTTTLTA---------EFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKAC 404
            F H HR   T+T  A         +F  LCS G +KEAF  +++ IW++  L S L+++ 
Sbjct: 15   FCHLHRFFSTSTENAASSISDIEGKFKSLCSAGRIKEAFKTYNAEIWTDQHLFSYLIQSF 74

Query: 405  IQIQSLFLTKQIHSLITTSGCS-RDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSY 581
            I  +SL + KQ+HSL  TSG   +DKFV NHLLN Y K+G +  A+A F  +P +N+MS+
Sbjct: 75   IPQKSLLIAKQLHSLAITSGYYFKDKFVRNHLLNMYFKMGEIQEAIAFFNAMPMRNIMSH 134

Query: 582  NILIGGFIQSGDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFL 761
            NILI G +Q GDLDSA+K+FDEM+ERN+ATWNAM++GL QFEFN+ GL +F  MHELGFL
Sbjct: 135  NILINGHVQHGDLDSAIKVFDEMLERNVATWNAMVSGLIQFEFNENGLFLFREMHELGFL 194

Query: 762  PDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKV 941
            PD FTLGSVLRGCAGL+    G+QVH+Y +K G++ +LVVGSSLAHMYMKSGSLGEGEKV
Sbjct: 195  PDEFTLGSVLRGCAGLRASYAGKQVHAYVLKYGYEFNLVVGSSLAHMYMKSGSLGEGEKV 254

Query: 942  IRSMPVHSVVACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATL 1121
            I++MP+ +VVA NT IAG AQNG+ E VLD YN+MK++G RPDKIT VSVISS +ELATL
Sbjct: 255  IKAMPIRNVVAWNTLIAGNAQNGHFEGVLDLYNMMKMSGLRPDKITLVSVISSSAELATL 314

Query: 1122 GQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAY 1301
             QGQQIHAE IK GA            MYS+CGCL+D++K   + E  D VLWS+MIAAY
Sbjct: 315  FQGQQIHAEAIKAGANSAVAVLSSLISMYSKCGCLEDSMKALLDCEHPDSVLWSSMIAAY 374

Query: 1302 GFHGRGKEAIELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPK 1481
            GFHGRG+EA+ LF +ME+EGL  N+VTFLSLLYACSH GLK+KG+  F LM++KY L+P+
Sbjct: 375  GFHGRGEEAVHLFEQMEQEGLGGNDVTFLSLLYACSHNGLKEKGMGFFKLMVEKYGLKPR 434

Query: 1482 VEHYTCVVDLLGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVL 1661
            +EHYTCVVDLLGRSG L+EAE +IRSMP++AD +IWKTLLSAC+ H+NADMA R AE++L
Sbjct: 435  LEHYTCVVDLLGRSGCLDEAEAMIRSMPLEADVVIWKTLLSACRIHRNADMATRTAEEIL 494

Query: 1662 RINPHDSASYVLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGD 1841
            R+NP DSA+YVLLSNI ASA+RW+DVS+VR  MR+RNVKKEPG SW E+KN+V QF MGD
Sbjct: 495  RLNPQDSATYVLLSNIHASAKRWKDVSKVRTTMRDRNVKKEPGVSWLEVKNRVFQFSMGD 554

Query: 1842 KSHPQSEEIDAYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMN 2021
            KSHP SEEID YLK+LMEE+KL+GYVPDT +V HD D+EEKE +LV+HSEKLAIAF LMN
Sbjct: 555  KSHPMSEEIDLYLKELMEEMKLRGYVPDTATVFHDTDSEEKENSLVNHSEKLAIAFGLMN 614

Query: 2022 TVEGVPIRIMKNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
               G PIR+MKNLR+C DCH+AIK IS +  REIIVRD SRFHHFK G CSCGDYW
Sbjct: 615  IPPGSPIRVMKNLRICSDCHVAIKLISDINNREIIVRDTSRFHHFKHGKCSCGDYW 670


>ref|XP_004504670.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Cicer arietinum]
          Length = 683

 Score =  872 bits (2253), Expect = 0.0
 Identities = 425/627 (67%), Positives = 513/627 (81%)
 Frame = +3

Query: 309  LCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVN 488
            LCSKGH+KEAF +F   IW EP L S LL+ACI   S+F  KQ+HSLI TSGCS DKF++
Sbjct: 57   LCSKGHIKEAFESFVYEIWEEPRLFSNLLQACIPTNSVFAGKQLHSLILTSGCSSDKFIS 116

Query: 489  NHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLA 668
            NHLLN Y K G +   V LF+ +P++N+MS NI+I  +++ G+ ++A KLFDEM ERN+A
Sbjct: 117  NHLLNLYSKFGELHAVVKLFDGMPRRNIMSCNIMIKAYLEIGNYENAKKLFDEMPERNVA 176

Query: 669  TWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYA 848
            TWNAM+TGLT+F  N+E L  FS M+ LGF+PD ++ GSVLRGCA L+ L  G+QVH+Y 
Sbjct: 177  TWNAMVTGLTKFGANEESLFFFSQMNALGFVPDEYSFGSVLRGCAHLRALFAGQQVHAYV 236

Query: 849  VKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVL 1028
            VK GF+ + VVG SLAHMYMK+GSL +GE+VI+ MP  +VVA NT +AG AQNGYSE VL
Sbjct: 237  VKCGFEFNSVVGCSLAHMYMKAGSLLDGERVIKWMPNCNVVAWNTLMAGKAQNGYSEGVL 296

Query: 1029 DQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMY 1208
            D Y++MK+AGFRPD+ITFVSVISSCSELATLGQG+QIHAEVIK GA            MY
Sbjct: 297  DHYSMMKMAGFRPDRITFVSVISSCSELATLGQGKQIHAEVIKAGASSVVSVISSLVSMY 356

Query: 1209 SRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEANEVTFL 1388
            SRCG L+D++K F E E  DVVLWS+MIAAYG HG+G++AI+LFN ME+E L  NEVTFL
Sbjct: 357  SRCGSLEDSIKAFLECEERDVVLWSSMIAAYGCHGQGEKAIKLFNEMEQENLAGNEVTFL 416

Query: 1389 SLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPV 1568
            SLLYACSHCGLKDKGL+ FD+M+ K  L+ ++EHYTCVVDLLGRSG LEEAE +IRSMPV
Sbjct: 417  SLLYACSHCGLKDKGLDFFDMMVKKCGLKARLEHYTCVVDLLGRSGCLEEAEAMIRSMPV 476

Query: 1569 KADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSEV 1748
            +ADAIIWKTLLSACK H+N +MA+R+AE+VLRI+P DSASYVLL+ I ASA+RWQ+VSEV
Sbjct: 477  RADAIIWKTLLSACKIHRNEEMARRVAEEVLRIDPQDSASYVLLAGIHASAKRWQNVSEV 536

Query: 1749 RKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDT 1928
            R+AM+++ VKKEPG SW E+KN+VHQF MGD+ HP+S EI+ YL++L  E+K++GYVPD 
Sbjct: 537  RRAMKDKMVKKEPGVSWVEVKNRVHQFRMGDECHPKSVEINLYLEELTSEMKMRGYVPDI 596

Query: 1929 GSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSM 2108
             SVLHDMD EEKEYNL HHSEKLAIAFALM   +G PIR+MKNLRVCGDCHIAIKYIS M
Sbjct: 597  SSVLHDMDIEEKEYNLTHHSEKLAIAFALMTIPKGEPIRVMKNLRVCGDCHIAIKYISEM 656

Query: 2109 KKREIIVRDASRFHHFKDGHCSCGDYW 2189
            K REIIVRD+SRFHHF+DG CSCGDYW
Sbjct: 657  KNREIIVRDSSRFHHFRDGVCSCGDYW 683


>ref|XP_006446832.1| hypothetical protein CICLE_v10018004mg [Citrus clementina]
            gi|557549443|gb|ESR60072.1| hypothetical protein
            CICLE_v10018004mg [Citrus clementina]
          Length = 632

 Score =  855 bits (2210), Expect = 0.0
 Identities = 418/634 (65%), Positives = 501/634 (79%), Gaps = 1/634 (0%)
 Frame = +3

Query: 291  TAEFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCS 470
            T EF +LCSKGH+KEAF  F S IWS+P L S L++ C   +SL  +KQ+HSLI TSGCS
Sbjct: 22   TEEFINLCSKGHIKEAFNRFKSEIWSDPTLFSHLIQWCTLKKSLSCSKQLHSLIVTSGCS 81

Query: 471  RDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEM 650
             + F+ NHLLN Y K+G++ TAV LF  +P++N+MS NI+I  ++QSGDL+ A K+FD M
Sbjct: 82   SNSFICNHLLNMYSKIGQLQTAVTLFGLMPRRNIMSCNIMINAYVQSGDLERARKVFDGM 141

Query: 651  VERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGR 830
             +RN+ATWNAM+ GL QFEFN+EGLS+ S MH++GFLPD FTLGSVLRGCAGL+ L+ GR
Sbjct: 142  TKRNIATWNAMVAGLVQFEFNEEGLSLLSEMHQVGFLPDEFTLGSVLRGCAGLRGLDAGR 201

Query: 831  QVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPV-HSVVACNTFIAGMAQN 1007
            Q+H Y  +                        +  +VIR   +  +V+  NT IAG AQN
Sbjct: 202  QIHCYVNER-----------------------KERRVIRLNALSRNVIGWNTLIAGKAQN 238

Query: 1008 GYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXX 1187
            G +E VLDQYN+M++ GFRPDKITFVSVISSCSELATLGQGQQIHAEV+K GA       
Sbjct: 239  GLAEDVLDQYNLMRMVGFRPDKITFVSVISSCSELATLGQGQQIHAEVVKAGASLDVGVI 298

Query: 1188 XXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLE 1367
                 MYSRCGCLDD++K F E E +DVVLWS+MIAAYGFHG+G+EAI LF +ME++  E
Sbjct: 299  SSLISMYSRCGCLDDSMKAFLECEYSDVVLWSSMIAAYGFHGKGEEAINLFEQMEQKEFE 358

Query: 1368 ANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEW 1547
            AN+VTF+SLLYACSHCGLK+KG+E FDLM+ KY L+P++EHYTCVVDLLGR G L+EAE 
Sbjct: 359  ANDVTFVSLLYACSHCGLKEKGMEFFDLMVKKYGLKPRLEHYTCVVDLLGRCGCLDEAEA 418

Query: 1548 LIRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARR 1727
            LIR+MPVKAD IIWKTLLSACK HK+ DMA RIAE+VL++NP D+A YVLLSNI ASA+R
Sbjct: 419  LIRNMPVKADTIIWKTLLSACKIHKSTDMAGRIAEEVLKLNPQDAAPYVLLSNIHASAKR 478

Query: 1728 WQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKL 1907
            WQ VSE+R+AMRERNVKKEPG SW E+KNQVHQF MGDKSHP S EID YL++L  E+KL
Sbjct: 479  WQGVSELREAMRERNVKKEPGVSWLEIKNQVHQFTMGDKSHPSSMEIDLYLEELASEMKL 538

Query: 1908 KGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIA 2087
            +GYVPDTG+ +HDMD EEKEYNL HHSEKLAIA ALMNT  GVPIR+MKNLRVC DCH+A
Sbjct: 539  RGYVPDTGADMHDMDNEEKEYNLKHHSEKLAIALALMNTPAGVPIRVMKNLRVCSDCHVA 598

Query: 2088 IKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            IKYIS +K REIIVRD+SRFHHF++G CSCGDYW
Sbjct: 599  IKYISEIKNREIIVRDSSRFHHFRNGKCSCGDYW 632


>gb|ESW30891.1| hypothetical protein PHAVU_002G191000g [Phaseolus vulgaris]
          Length = 673

 Score =  854 bits (2207), Expect = 0.0
 Identities = 421/646 (65%), Positives = 513/646 (79%), Gaps = 2/646 (0%)
 Frame = +3

Query: 258  HQHRCLCTTTLTA--EFTDLCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLT 431
            H+  C  TT   A  +F  LCSKGH++EAF +F S IW EP L S LL+AC++++S+ L 
Sbjct: 28   HKPTCKMTTFRIAKEQFATLCSKGHVREAFESFVSEIWEEPHLFSNLLQACVRLKSVSLG 87

Query: 432  KQIHSLITTSGCSRDKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQS 611
            KQIHSLI TSGCS DKF++NHLLN Y K G +  +VALF+ +P+KN+MS NI+I  +++ 
Sbjct: 88   KQIHSLILTSGCSSDKFISNHLLNLYSKFGELRASVALFDRMPRKNIMSCNIMIKAYLEM 147

Query: 612  GDLDSAMKLFDEMVERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVL 791
            G+++SA  LFD M ERN+ATWNAM+TGL +FE N+E L +FS M+ELG +PD ++LGSVL
Sbjct: 148  GNIESARNLFDAMPERNIATWNAMVTGLAKFEMNEESLIIFSRMNELGLVPDEYSLGSVL 207

Query: 792  RGCAGLKDLNRGRQVHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVV 971
            RGCA L  L  G+QVH+Y +K GF+ +LVVG SLAHMYMK+ S+ +GE+VI  MP +++V
Sbjct: 208  RGCAHLGALFAGQQVHAYVMKCGFEFNLVVGCSLAHMYMKARSMDDGERVINCMPAYNLV 267

Query: 972  ACNTFIAGMAQNGYSEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEV 1151
            A NT +AG AQ G  E VLDQY  MK AGFRPDKITFVSVISSCSELA LGQG+QIHAE 
Sbjct: 268  AWNTLMAGKAQKGSFEGVLDQYCKMKKAGFRPDKITFVSVISSCSELAILGQGKQIHAEA 327

Query: 1152 IKFGAXXXXXXXXXXXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAI 1331
            IK GA            MYSRCGCL ++ K F E +  DVVLWS+MIAAYGFHG+G+EAI
Sbjct: 328  IKAGASYEVSVVSSLVSMYSRCGCLQESFKSFLECKERDVVLWSSMIAAYGFHGQGEEAI 387

Query: 1332 ELFNRMEKEGLEANEVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDL 1511
            +LFN+ME+E    NEVTFLSLLYACSHCGLKDKGL+ FD+M+ KY L  +++HYTCVVDL
Sbjct: 388  KLFNQMEQENQPVNEVTFLSLLYACSHCGLKDKGLDFFDMMVKKYGLGARLKHYTCVVDL 447

Query: 1512 LGRSGRLEEAEWLIRSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASY 1691
            LGRSG LEEAE +IRSMPVKADAIIWKTLLSACK HKNA++ +R+A +VL I+P DSASY
Sbjct: 448  LGRSGCLEEAEAMIRSMPVKADAIIWKTLLSACKLHKNAEIGRRVAAEVLTIDPQDSASY 507

Query: 1692 VLLSNIQASARRWQDVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEID 1871
            VLL+NI +SA+RW +VSEVR AM+++ VKKEPG SW E+KNQVHQF MG + HP+  EI+
Sbjct: 508  VLLANIYSSAKRWHNVSEVRTAMKDKMVKKEPGVSWVEVKNQVHQFHMGGECHPKLVEIN 567

Query: 1872 AYLKDLMEELKLKGYVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIM 2051
             YL+ L  E+K +GYVPDT SVLHDMD EEKE NL HHSEKLAI+FALM+T  GVPIR+M
Sbjct: 568  QYLEQLTSEMKKRGYVPDTNSVLHDMDNEEKEQNLRHHSEKLAISFALMSTPVGVPIRVM 627

Query: 2052 KNLRVCGDCHIAIKYISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            KNLRVC DCH+AIKYIS +K  EIIVRD+SRFHHFK+G CSCGDYW
Sbjct: 628  KNLRVCSDCHVAIKYISEIKNVEIIVRDSSRFHHFKNGTCSCGDYW 673


>sp|Q8S9M4.2|PP198_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g41080
          Length = 650

 Score =  793 bits (2049), Expect = 0.0
 Identities = 390/628 (62%), Positives = 486/628 (77%), Gaps = 1/628 (0%)
 Frame = +3

Query: 309  LCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVN 488
            LCSKG+L+EAF  F   I++   L +  +++C   QSL   KQ+H L+  SG S DKF+ 
Sbjct: 23   LCSKGNLREAFQRFRLNIFTNTSLFTPFIQSCTTRQSLPSGKQLHCLLVVSGFSSDKFIC 82

Query: 489  NHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLA 668
            NHL++ Y KLG   +AVA++  + KKN MS NILI G++++GDL +A K+FDEM +R L 
Sbjct: 83   NHLMSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYVRAGDLVNARKVFDEMPDRKLT 142

Query: 669  TWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYA 848
            TWNAMI GL QFEFN+EGLS+F  MH LGF PD +TLGSV  G AGL+ ++ G+Q+H Y 
Sbjct: 143  TWNAMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGSVFSGSAGLRSVSIGQQIHGYT 202

Query: 849  VKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVL 1028
            +K G +L LVV SSLAHMYM++G L +GE VIRSMPV ++VA NT I G AQNG  E VL
Sbjct: 203  IKYGLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVL 262

Query: 1029 DQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMY 1208
              Y +MKI+G RP+KITFV+V+SSCS+LA  GQGQQIHAE IK GA            MY
Sbjct: 263  YLYKMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMY 322

Query: 1209 SRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRM-EKEGLEANEVTF 1385
            S+CGCL DA K F E+E  D V+WS+MI+AYGFHG+G EAIELFN M E+  +E NEV F
Sbjct: 323  SKCGCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDEAIELFNTMAEQTNMEINEVAF 382

Query: 1386 LSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMP 1565
            L+LLYACSH GLKDKGLELFD+M++KY  +P ++HYTCVVDLLGR+G L++AE +IRSMP
Sbjct: 383  LNLLYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGCLDQAEAIIRSMP 442

Query: 1566 VKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSE 1745
            +K D +IWKTLLSAC  HKNA+MA+R+ +++L+I+P+DSA YVLL+N+ ASA+RW+DVSE
Sbjct: 443  IKTDIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDSACYVLLANVHASAKRWRDVSE 502

Query: 1746 VRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPD 1925
            VRK+MR++NVKKE G SWFE K +VHQF MGD+S  +S+EI +YLK+L  E+KLKGY PD
Sbjct: 503  VRKSMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPD 562

Query: 1926 TGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISS 2105
            T SVLHDMD EEKE +LV HSEKLA+AFALM   EG PIRI+KNLRVC DCH+A KYIS 
Sbjct: 563  TASVLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPIRIIKNLRVCSDCHVAFKYISV 622

Query: 2106 MKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            +K REI +RD SRFHHF +G CSCGDYW
Sbjct: 623  IKNREITLRDGSRFHHFINGKCSCGDYW 650


>ref|XP_006293459.1| hypothetical protein CARUB_v10022804mg [Capsella rubella]
            gi|482562167|gb|EOA26357.1| hypothetical protein
            CARUB_v10022804mg [Capsella rubella]
          Length = 650

 Score =  787 bits (2033), Expect = 0.0
 Identities = 385/628 (61%), Positives = 484/628 (77%), Gaps = 1/628 (0%)
 Frame = +3

Query: 309  LCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVN 488
            LCSKG+L+EAF  F   I++   L +  +++C   QSL   KQ+H L+  SG S DKF+ 
Sbjct: 23   LCSKGNLREAFQRFRLNIFTNTSLFTPFIQSCTTSQSLPSGKQLHGLLVVSGFSSDKFIC 82

Query: 489  NHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLA 668
            NHL++ Y K+G   +AVAL+  +PKKN MS NILI G++++GDL SA K+FDEM +R L 
Sbjct: 83   NHLMSMYSKIGDFPSAVALYGRMPKKNYMSSNILIYGYVRAGDLPSARKVFDEMPDRKLT 142

Query: 669  TWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYA 848
            TWNAMI GL   E+N+EGLS+F  MH LGF PD +TLGSV  G AGL+ ++ G+Q+H Y 
Sbjct: 143  TWNAMIAGLIHSEYNEEGLSLFREMHGLGFCPDEYTLGSVFSGSAGLRSVSIGQQIHGYT 202

Query: 849  VKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVL 1028
            +K G +L LVV SSLAHMYM++G L +GE VIRSMPV ++VA NT I G AQNG  E VL
Sbjct: 203  IKYGLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVL 262

Query: 1029 DQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMY 1208
              Y IMKI+G RP+KITFV+V+SSCS+LA  GQGQQIHAE IK GA            MY
Sbjct: 263  YLYKIMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMY 322

Query: 1209 SRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRM-EKEGLEANEVTF 1385
            S+CGCL+DA K F E+   D V+WS+MI+AYGFHG G EAI+LFN M E+  +E NEV F
Sbjct: 323  SKCGCLEDAAKAFSERIDEDEVMWSSMISAYGFHGHGDEAIKLFNTMVEQTEMEINEVAF 382

Query: 1386 LSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMP 1565
            L+LLYACSH GL+DKGLELFD+M++KY  +P ++HYTCVVDLLGR+G L++AE  IRSMP
Sbjct: 383  LNLLYACSHSGLRDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGSLDQAEAKIRSMP 442

Query: 1566 VKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSE 1745
            +K D IIWKTLLSAC  HKN +MA+R+ +++L+I+P+DSA YVLL+N+ ASA+RW+DVSE
Sbjct: 443  IKPDTIIWKTLLSACNIHKNTEMAQRVFQEILQIDPNDSACYVLLANVHASAKRWRDVSE 502

Query: 1746 VRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPD 1925
            VRK+MR++NVKKE G SWFE K +VH+F MGD+S P+S+EI +YLK+L  E+KLKGY PD
Sbjct: 503  VRKSMRDKNVKKEAGISWFEHKGEVHRFKMGDRSQPKSKEIYSYLKELTLEMKLKGYKPD 562

Query: 1926 TGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISS 2105
            T SVLHDMD EEKE +LV HSEKLA+A+ALM   EGVPIRI+KNLRVC DCH+A +YIS 
Sbjct: 563  TASVLHDMDEEEKESDLVQHSEKLAVAYALMILPEGVPIRIIKNLRVCSDCHVAFRYISV 622

Query: 2106 MKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            +K REI +RD SRFHHF++G CSC DYW
Sbjct: 623  IKNREITLRDGSRFHHFRNGKCSCADYW 650


>ref|XP_006411375.1| hypothetical protein EUTSA_v10017967mg [Eutrema salsugineum]
            gi|557112544|gb|ESQ52828.1| hypothetical protein
            EUTSA_v10017967mg [Eutrema salsugineum]
          Length = 650

 Score =  786 bits (2030), Expect = 0.0
 Identities = 382/628 (60%), Positives = 483/628 (76%), Gaps = 1/628 (0%)
 Frame = +3

Query: 309  LCSKGHLKEAFANFSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSRDKFVN 488
            LCSKG+L+EAF  F   I+++  L +  +K+C   +SL   KQ+H L+  SG S DKF+ 
Sbjct: 23   LCSKGNLREAFQRFRFNIFTDTSLFTHFIKSCATTKSLPSGKQLHCLLVVSGFSSDKFIC 82

Query: 489  NHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLA 668
            NHL++ Y KL    +AVAL+  +PKKN MS NILI G++ +GDL SA+K+F EM ++ L 
Sbjct: 83   NHLMSMYSKLKDFPSAVALYRLMPKKNFMSSNILINGYVCAGDLTSALKVFGEMTDKKLT 142

Query: 669  TWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYA 848
            TWNAMI+GL QFE N+EGLS+F  MH LGF PD +TLGSV  GCAGL+ L+ G+Q+H Y 
Sbjct: 143  TWNAMISGLIQFEHNEEGLSLFRDMHALGFSPDEYTLGSVFSGCAGLRSLSIGQQIHGYT 202

Query: 849  VKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVL 1028
            +K G +L  VV +S+AHMYM+SG L +GE VIR MPV ++VA N  IAG AQNG  E+VL
Sbjct: 203  IKYGLELDSVVNNSVAHMYMRSGILQDGENVIRLMPVRNLVAWNILIAGNAQNGCPEIVL 262

Query: 1029 DQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMY 1208
             QY  MKI GFRP++ITFV+V+SSCS+LA  GQGQQIHAE IK GA            MY
Sbjct: 263  FQYKKMKIEGFRPNQITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMY 322

Query: 1209 SRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRM-EKEGLEANEVTF 1385
            S+CGCL+DA K F E+E  D V+WS+MI+AYGFHG+G EA++LF+ M EK  +E NEV F
Sbjct: 323  SKCGCLEDAAKAFSEREDEDEVMWSSMISAYGFHGQGGEAVKLFDTMVEKTDMEINEVAF 382

Query: 1386 LSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMP 1565
            L+LLYACSH GLKDKGLELFD+M++KY  +P ++HYTCVVDLLGR+G L++AE  IRSMP
Sbjct: 383  LNLLYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGSLDQAEAKIRSMP 442

Query: 1566 VKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSE 1745
            +K D ++WKTLLSAC  HKNA++A+R  +++L+I+P+DS  YVLL+N+ ASA+RW DVSE
Sbjct: 443  IKPDTVLWKTLLSACNIHKNAEVAQRAFKEILQIDPNDSTCYVLLANVHASAKRWNDVSE 502

Query: 1746 VRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPD 1925
            VR++MR++NVKKEPG SWFE K ++HQF MGD+S  +S+EI +YLK+L  E+KLKGY PD
Sbjct: 503  VRRSMRDKNVKKEPGISWFEHKGELHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPD 562

Query: 1926 TGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISS 2105
            T SVLHDMD EEKE +L  HSEKLA+AFALM   EGVPIRI+KNLRVC DCH+A KYIS 
Sbjct: 563  TASVLHDMDEEEKESDLAQHSEKLAVAFALMILPEGVPIRIIKNLRVCSDCHVAFKYISL 622

Query: 2106 MKKREIIVRDASRFHHFKDGHCSCGDYW 2189
            +K REI +RD SRFHHF +G CSC DYW
Sbjct: 623  IKNREITLRDGSRFHHFINGKCSCADYW 650


>ref|XP_004970613.1| PREDICTED: pentatricopeptide repeat-containing protein At2g41080-like
            [Setaria italica]
          Length = 647

 Score =  755 bits (1949), Expect = 0.0
 Identities = 371/632 (58%), Positives = 463/632 (73%), Gaps = 1/632 (0%)
 Frame = +3

Query: 297  EFTDLCSKGHLKEAFAN-FSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSR 473
            E   LCS+G LK+A  + F  ++WSEP L S L +AC   ++L L +Q+H+   TSG + 
Sbjct: 19   EIIRLCSRGRLKDALHHRFREVLWSEPDLFSHLFRAC---RALPLLRQLHAFAATSGAAT 75

Query: 474  DKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMV 653
            D+F  NHLL  Y  LG   TA  LFE +PK+NVMS+NILIGG+I++GDL++A KLFDEM 
Sbjct: 76   DRFTANHLLLAYADLGDFPTARCLFERIPKRNVMSWNILIGGYIKNGDLETARKLFDEMP 135

Query: 654  ERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQ 833
             RN+ATWNAM+ GLT    N+E L  F +M   G  PD F LGS  R CAGL+D+  GRQ
Sbjct: 136  SRNVATWNAMVAGLTNSGLNEESLGFFLAMRREGMQPDEFGLGSFFRSCAGLRDVVSGRQ 195

Query: 834  VHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGY 1013
            VH+Y V+SG D  + VGSSLAHMY++ G L EGE V+R +P  ++VACNT IAG  QNG 
Sbjct: 196  VHAYVVRSGLDRDMCVGSSLAHMYLRCGFLEEGEAVLRVLPSLNIVACNTIIAGRTQNGD 255

Query: 1014 SEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXX 1193
            SE  L+ + +M+  G     +T+V+ ISSCS+LA L QGQQ+HA+ +K G          
Sbjct: 256  SEGALEYFCMMRGVGVEASAVTYVTAISSCSDLAALAQGQQVHAQAMKAGVDKVVPVMTS 315

Query: 1194 XXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEAN 1373
               MYSRCGCL D+  VF    G D+VL SAMI+AYGFHG G++A++LF RM   G E N
Sbjct: 316  LVHMYSRCGCLGDSEGVFSGYSGTDLVLCSAMISAYGFHGHGQKAVDLFKRMMAGGAEPN 375

Query: 1374 EVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLI 1553
            E+TFL+LLYACSH GLKD+G++ F+LM   YRLQP V+HYTC+VDLLGRSGRL EAE LI
Sbjct: 376  EITFLTLLYACSHSGLKDEGMDCFELMTKTYRLQPSVKHYTCIVDLLGRSGRLNEAEALI 435

Query: 1554 RSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQ 1733
             SMPV+ D IIWKTLLSACK  KN DMA+RIAE+V+ ++PHDSASYVLLSNI+A++ RW+
Sbjct: 436  LSMPVRPDGIIWKTLLSACKIQKNFDMAERIAERVIELDPHDSASYVLLSNIRATSSRWE 495

Query: 1734 DVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKG 1913
            DVS VR+ MR++NV+KEPG SW E K Q+HQF  GDKSH +  EID  L+++M +++  G
Sbjct: 496  DVSTVRETMRKQNVRKEPGVSWVEFKGQIHQFCTGDKSHSRQLEIDECLEEMMAKIRQCG 555

Query: 1914 YVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIK 2093
            Y PD   VLHDM+ EEKE +L HHSEKLAIAFA ++  EGVPIRIMKNLRVC DCH+AIK
Sbjct: 556  YAPDMSMVLHDMEDEEKEVSLAHHSEKLAIAFAFLSLPEGVPIRIMKNLRVCDDCHVAIK 615

Query: 2094 YISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
             +S +  REI+VRD SRFHHFKDG CSCGDYW
Sbjct: 616  LMSKVTGREIVVRDVSRFHHFKDGKCSCGDYW 647


>gb|EMT33444.1| hypothetical protein F775_20071 [Aegilops tauschii]
          Length = 647

 Score =  743 bits (1919), Expect = 0.0
 Identities = 367/632 (58%), Positives = 466/632 (73%), Gaps = 1/632 (0%)
 Frame = +3

Query: 297  EFTDLCSKGHLKEAFAN-FSSLIWSEPPLCSLLLKACIQIQSLFLTKQIHSLITTSGCSR 473
            EF  LCS G LK+A  + F  ++WSEP L + + +AC   +++ L +QIH+   T G + 
Sbjct: 19   EFIRLCSSGRLKDALHHPFRDVLWSEPTLFAHVFRAC---RAIPLLRQIHAFAATCGAAA 75

Query: 474  DKFVNNHLLNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMV 653
            D+F  N+L+  Y  LG + TA +LFE +PK NVMS+NILIGG+I++GDL SA KLFDEM 
Sbjct: 76   DRFTTNNLMLAYADLGDLPTACSLFERIPKPNVMSWNILIGGYIKNGDLGSARKLFDEMP 135

Query: 654  ERNLATWNAMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQ 833
             RN+ATWNAM+ GLT    +++ L  F +M   G  PD F LGSV R CAGL DL  GRQ
Sbjct: 136  MRNVATWNAMVAGLTNAGHDEDSLGFFLAMRREGLHPDEFGLGSVFRCCAGLSDLVSGRQ 195

Query: 834  VHSYAVKSGFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGY 1013
            VH+Y ++ G D+ + VG+SLAHMYM+ G L EGE V++++P  +VV+ NT IAG AQ+G 
Sbjct: 196  VHAYVLRCGMDIDMCVGNSLAHMYMRCGCLAEGEAVLKALPSLTVVSFNTTIAGRAQHGD 255

Query: 1014 SEVVLDQYNIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXX 1193
            SE  L+ +++M+  G   D +TFVS+I+ CS+LA L QGQQ+HA+VIK G          
Sbjct: 256  SEGALEYFSMMRGVGIAADVVTFVSIITCCSDLAALAQGQQVHAQVIKAGVDKVVPVITC 315

Query: 1194 XXXMYSRCGCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRMEKEGLEAN 1373
               MYSRCGCL D+ +V+    G+D+ L SAMI+A GFHG+G +A+ELF +M   G   N
Sbjct: 316  LVHMYSRCGCLGDSERVYSGYCGSDLFLLSAMISACGFHGQGHKAVELFKQMMNAGARPN 375

Query: 1374 EVTFLSLLYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLI 1553
            EVTFL+LLYACSH GLKD+GLE F+LM   Y LQP V+HYTC+VDLLGRSG L+EAE LI
Sbjct: 376  EVTFLALLYACSHSGLKDEGLEFFELMTKTYGLQPSVKHYTCIVDLLGRSGCLDEAEALI 435

Query: 1554 RSMPVKADAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQ 1733
             SMPV+AD +IWKTLLSACKT KN DMA+RIAE+V+  +P DSA YVLLSNI+A++RRW 
Sbjct: 436  LSMPVRADGVIWKTLLSACKTQKNFDMAERIAERVIEFDPRDSAPYVLLSNIRATSRRWG 495

Query: 1734 DVSEVRKAMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKG 1913
            DVSE+RK MRE+NV+KEPG SW ELK QVHQF  GDKSHP+  EI+ YL+++M +++  G
Sbjct: 496  DVSELRKNMREKNVRKEPGVSWVELKGQVHQFCTGDKSHPRQGEINEYLEEMMAKIRQCG 555

Query: 1914 YVPDTGSVLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIK 2093
            Y PD   V HDM+ EEKE +L HHSEKLAIAFA ++  EGVPIRIMKNLRVC DCH+AIK
Sbjct: 556  YAPDMSMVFHDMEDEEKEVSLTHHSEKLAIAFAFLSLPEGVPIRIMKNLRVCDDCHVAIK 615

Query: 2094 YISSMKKREIIVRDASRFHHFKDGHCSCGDYW 2189
             +S +  REI+VRD SRFHHF+DG CSCGDYW
Sbjct: 616  LMSQVTGREIVVRDVSRFHHFRDGKCSCGDYW 647


>ref|NP_850342.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|18491267|gb|AAL69458.1| At2g41080/T3K9.15 [Arabidopsis
            thaliana] gi|330254831|gb|AEC09925.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 565

 Score =  741 bits (1912), Expect = 0.0
 Identities = 361/565 (63%), Positives = 446/565 (78%), Gaps = 1/565 (0%)
 Frame = +3

Query: 498  LNTYCKLGRVSTAVALFETLPKKNVMSYNILIGGFIQSGDLDSAMKLFDEMVERNLATWN 677
            ++ Y KLG   +AVA++  + KKN MS NILI G++++GDL +A K+FDEM +R L TWN
Sbjct: 1    MSMYSKLGDFPSAVAVYGRMRKKNYMSSNILINGYVRAGDLVNARKVFDEMPDRKLTTWN 60

Query: 678  AMITGLTQFEFNKEGLSMFSSMHELGFLPDAFTLGSVLRGCAGLKDLNRGRQVHSYAVKS 857
            AMI GL QFEFN+EGLS+F  MH LGF PD +TLGSV  G AGL+ ++ G+Q+H Y +K 
Sbjct: 61   AMIAGLIQFEFNEEGLSLFREMHGLGFSPDEYTLGSVFSGSAGLRSVSIGQQIHGYTIKY 120

Query: 858  GFDLHLVVGSSLAHMYMKSGSLGEGEKVIRSMPVHSVVACNTFIAGMAQNGYSEVVLDQY 1037
            G +L LVV SSLAHMYM++G L +GE VIRSMPV ++VA NT I G AQNG  E VL  Y
Sbjct: 121  GLELDLVVNSSLAHMYMRNGKLQDGEIVIRSMPVRNLVAWNTLIMGNAQNGCPETVLYLY 180

Query: 1038 NIMKIAGFRPDKITFVSVISSCSELATLGQGQQIHAEVIKFGAXXXXXXXXXXXXMYSRC 1217
             +MKI+G RP+KITFV+V+SSCS+LA  GQGQQIHAE IK GA            MYS+C
Sbjct: 181  KMMKISGCRPNKITFVTVLSSCSDLAIRGQGQQIHAEAIKIGASSVVAVVSSLISMYSKC 240

Query: 1218 GCLDDALKVFEEKEGADVVLWSAMIAAYGFHGRGKEAIELFNRM-EKEGLEANEVTFLSL 1394
            GCL DA K F E+E  D V+WS+MI+AYGFHG+G EAIELFN M E+  +E NEV FL+L
Sbjct: 241  GCLGDAAKAFSEREDEDEVMWSSMISAYGFHGQGDEAIELFNTMAEQTNMEINEVAFLNL 300

Query: 1395 LYACSHCGLKDKGLELFDLMMDKYRLQPKVEHYTCVVDLLGRSGRLEEAEWLIRSMPVKA 1574
            LYACSH GLKDKGLELFD+M++KY  +P ++HYTCVVDLLGR+G L++AE +IRSMP+K 
Sbjct: 301  LYACSHSGLKDKGLELFDMMVEKYGFKPGLKHYTCVVDLLGRAGCLDQAEAIIRSMPIKT 360

Query: 1575 DAIIWKTLLSACKTHKNADMAKRIAEQVLRINPHDSASYVLLSNIQASARRWQDVSEVRK 1754
            D +IWKTLLSAC  HKNA+MA+R+ +++L+I+P+DSA YVLL+N+ ASA+RW+DVSEVRK
Sbjct: 361  DIVIWKTLLSACNIHKNAEMAQRVFKEILQIDPNDSACYVLLANVHASAKRWRDVSEVRK 420

Query: 1755 AMRERNVKKEPGTSWFELKNQVHQFIMGDKSHPQSEEIDAYLKDLMEELKLKGYVPDTGS 1934
            +MR++NVKKE G SWFE K +VHQF MGD+S  +S+EI +YLK+L  E+KLKGY PDT S
Sbjct: 421  SMRDKNVKKEAGISWFEHKGEVHQFKMGDRSQSKSKEIYSYLKELTLEMKLKGYKPDTAS 480

Query: 1935 VLHDMDAEEKEYNLVHHSEKLAIAFALMNTVEGVPIRIMKNLRVCGDCHIAIKYISSMKK 2114
            VLHDMD EEKE +LV HSEKLA+AFALM   EG PIRI+KNLRVC DCH+A KYIS +K 
Sbjct: 481  VLHDMDEEEKESDLVQHSEKLAVAFALMILPEGAPIRIIKNLRVCSDCHVAFKYISVIKN 540

Query: 2115 REIIVRDASRFHHFKDGHCSCGDYW 2189
            REI +RD SRFHHF +G CSCGDYW
Sbjct: 541  REITLRDGSRFHHFINGKCSCGDYW 565

