BLASTX nr result

ID: Paeonia24_contig00016533 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00016533
         (2990 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containi...  1164   0.0  
ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containi...  1089   0.0  
emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera]  1071   0.0  
ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prun...  1031   0.0  
ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containi...  1024   0.0  
ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily p...  1015   0.0  
ref|XP_002301973.2| pentatricopeptide repeat-containing family p...   980   0.0  
gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus...   974   0.0  
ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containi...   973   0.0  
gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]     972   0.0  
gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus...   964   0.0  
ref|XP_002890375.1| pentatricopeptide repeat-containing protein ...   946   0.0  
ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Caps...   944   0.0  
ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutr...   943   0.0  
ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containi...   939   0.0  
ref|NP_173449.1| pentatricopeptide repeat-containing protein [Ar...   929   0.0  
gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]       906   0.0  
ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phas...   857   0.0  
ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containi...   850   0.0  
gb|AAF79892.1|AC022472_1 Contains similarity to an unknown prote...   835   0.0  

>ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230
            [Vitis vinifera]
          Length = 758

 Score = 1164 bits (3010), Expect = 0.0
 Identities = 556/757 (73%), Positives = 653/757 (86%)
 Frame = -2

Query: 2959 AMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANH 2780
            +++ QAL LL+S  H   N   ST+ASL Q RQAH HILKTGL N+TH  TKLLS YAN+
Sbjct: 2    SLSAQALALLDSVQHTIFNCLNSTTASLSQTRQAHAHILKTGLFNDTHLATKLLSHYANN 61

Query: 2779 LCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAI 2600
            +CF DA+ VLD + +P++FSFS LI+A +K H+++HAL  FS+ML+ GL PD  V+PSA+
Sbjct: 62   MCFADATLVLDLVPEPNVFSFSTLIYAFSKFHQFHHALSTFSQMLTRGLMPDNRVLPSAV 121

Query: 2599 KACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVV 2420
            KACAGLSAL+  ++VHG+ SVSG  SD FVQ+SLVH+Y+KC +IR+AH+VFD M +PDVV
Sbjct: 122  KACAGLSALKPARQVHGIASVSGFDSDSFVQSSLVHMYIKCNQIRDAHRVFDRMFEPDVV 181

Query: 2419 TCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKM 2240
            + SALVA +AR G V EA RLF  MGDSG++ N +SWNGMIAGFNHS LY+EAVLMF  M
Sbjct: 182  SWSALVAAYARQGCVDEAKRLFSEMGDSGVQPNLISWNGMIAGFNHSGLYSEAVLMFLDM 241

Query: 2239 HSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACA 2060
            H +GF+PD T+ SSVLPAVGDLEDL +GI IH Y+IK GL SDKCV SALIDMYGKC+C 
Sbjct: 242  HLRGFEPDGTTISSVLPAVGDLEDLVMGILIHGYVIKQGLVSDKCVSSALIDMYGKCSCT 301

Query: 2059 KEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACC 1880
             EMSQVFD+MD +D+G+CNA + GLSRNG  E++L +F++++ QG+ELNVVSWTSMIACC
Sbjct: 302  SEMSQVFDQMDHMDVGSCNAFIFGLSRNGQVESSLRLFRQLKDQGMELNVVSWTSMIACC 361

Query: 1879 SQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDV 1700
            SQNG+DIEALELFR+MQIAGV PNSVTIPCLLPACGN+AALMHGKAAHCFSLRRG S DV
Sbjct: 362  SQNGRDIEALELFREMQIAGVKPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISTDV 421

Query: 1699 YVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRS 1520
            YVGSALIDMYAKCG+I++S+ CFDG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRS
Sbjct: 422  YVGSALIDMYAKCGRIQASRICFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRS 481

Query: 1519 GQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKE 1340
            GQKPD+I+FT VLSACSQSGLTEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++
Sbjct: 482  GQKPDIISFTCVLSACSQSGLTEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQ 541

Query: 1339 AYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYAS 1160
            AY+MI++MP  PDACVWGALLSSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS
Sbjct: 542  AYAMIRRMPVNPDACVWGALLSSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYAS 601

Query: 1159 NGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLE 980
             GMWN+V+ VR MMK+ GLRKNPGCSWIE+KNKVHMLLAGDKSHP M QIIEKL+KLS+E
Sbjct: 602  KGMWNEVNRVRDMMKNKGLRKNPGCSWIEVKNKVHMLLAGDKSHPQMTQIIEKLDKLSME 661

Query: 979  MKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDC 800
            MKK G+FP  NFVLQDVEEQDKEQ LCGHSEKLAVVFGLLNT  G PL+VIKNLRICGDC
Sbjct: 662  MKKLGYFPEINFVLQDVEEQDKEQILCGHSEKLAVVFGLLNTPPGYPLQVIKNLRICGDC 721

Query: 799  HAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            H  IKF+SS+E REIFVRDTN FHHFK+GACSCGD+W
Sbjct: 722  HVVIKFISSFERREIFVRDTNRFHHFKEGACSCGDYW 758


>ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Fragaria vesca subsp. vesca]
          Length = 755

 Score = 1089 bits (2817), Expect = 0.0
 Identities = 532/756 (70%), Positives = 618/756 (81%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 2777
            MTRQ L+L +   H  L+ F++ S+SL QA QAH  ILKTGLSN T+  TKLLSLYAN L
Sbjct: 1    MTRQVLNLSDHLLHKLLS-FLNPSSSLSQAHQAHAQILKTGLSNHTNLTTKLLSLYANSL 59

Query: 2776 CFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 2597
            CF +A  VL SI  P++FSFS LIHA  K + + +AL +FS+MLS GLAPD  + PS +K
Sbjct: 60   CFVEAKLVLHSIPHPNLFSFSTLIHAFAKLNSFGNALSLFSQMLSRGLAPDSFLFPSVVK 119

Query: 2596 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 2417
            ACAGL + ++ ++VH +   SG A D FVQ+SLVH+Y+KC +I +A KVFD +P+ DV+ 
Sbjct: 120  ACAGLQSSQSARQVHAISFSSGFALDSFVQSSLVHMYIKCDRIGDARKVFDRVPERDVII 179

Query: 2416 CSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 2237
             SAL++G++R G V EA RL   M   G   N V WNGMIAGF+ S+LYA  V +F+KMH
Sbjct: 180  YSALISGYSRRGCVDEAMRLLGEMRGLGFVPNVVLWNGMIAGFSQSKLYASTVGVFQKMH 239

Query: 2236 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 2057
            SQGF+PD +S SSVLPAVG+LEDL +G+QIH  +IK GL SDKCVVSAL+DMYGKCAC  
Sbjct: 240  SQGFEPDGSSISSVLPAVGELEDLDIGVQIHGQVIKRGLKSDKCVVSALVDMYGKCACTL 299

Query: 2056 EMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCS 1877
            EMS+V  EMD++D+GACNALV GL+RNG  +NAL VF + +GQGVELN VSWTS+IA CS
Sbjct: 300  EMSRVVGEMDELDVGACNALVTGLARNGLVDNALEVFMQFKGQGVELNTVSWTSIIASCS 359

Query: 1876 QNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVY 1697
            QNGKD+EALELFR+MQI GV PNS+TI CLLPACGN+AAL HGKAAHCF+ RRG  +DVY
Sbjct: 360  QNGKDMEALELFREMQIEGVEPNSMTISCLLPACGNIAALTHGKAAHCFAFRRGMLSDVY 419

Query: 1696 VGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSG 1517
            VGSALIDMYAKCG+I+ S+ CFD MP RNLVCWNA++ GYAMHGK KE +EIFHMMQRSG
Sbjct: 420  VGSALIDMYAKCGKIQLSRLCFDKMPTRNLVCWNAVMSGYAMHGKAKETMEIFHMMQRSG 479

Query: 1516 QKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEA 1337
             KPD+I+FT VLSACSQ+GLTEEGWY+FNSMS EHGI+AR+EHYACMV LLGRAGKL EA
Sbjct: 480  LKPDIISFTCVLSACSQNGLTEEGWYYFNSMSKEHGIEARIEHYACMVTLLGRAGKLDEA 539

Query: 1336 YSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASN 1157
            YSMI+KMPFEPDACVWGALLSSCRVHNN+ LGE  AK+LF LEP NPGNYILLSNIYAS 
Sbjct: 540  YSMIKKMPFEPDACVWGALLSSCRVHNNVTLGESTAKKLFNLEPGNPGNYILLSNIYASK 599

Query: 1156 GMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEM 977
            GMW +VD VR  MKS+GLRKNPGCSWIE KN VHMLLAGDK+HP M +I EKLN LS EM
Sbjct: 600  GMWTEVDRVRDTMKSLGLRKNPGCSWIEFKNNVHMLLAGDKTHPQMNKITEKLNTLSSEM 659

Query: 976  KKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCH 797
            KKSG+ P T+FVLQDVEEQ+KEQ LCGHSEKLAVV GLLNT  GS LRVIKNLRICGDCH
Sbjct: 660  KKSGYLPSTHFVLQDVEEQEKEQILCGHSEKLAVVLGLLNTPPGSSLRVIKNLRICGDCH 719

Query: 796  AFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            + IKF+SS EGREI VRDTN FHHFKDG CSCGD+W
Sbjct: 720  SVIKFISSLEGREISVRDTNRFHHFKDGVCSCGDYW 755


>emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera]
          Length = 760

 Score = 1071 bits (2769), Expect = 0.0
 Identities = 515/709 (72%), Positives = 609/709 (85%)
 Frame = -2

Query: 2959 AMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANH 2780
            +++ QAL LL+S  H  LN   ST+ASL Q RQAH HILKTGL N+TH  TKLLS YAN+
Sbjct: 2    SLSAQALALLDSVQHTILNCLNSTTASLSQTRQAHAHILKTGLFNDTHLATKLLSHYANN 61

Query: 2779 LCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAI 2600
            +CF DA+ VLD + +P++FSFS LI+A +K H+++HAL  FS+ML+ GL PD  V+PSA+
Sbjct: 62   MCFADATLVLDLVPEPNVFSFSTLIYAFSKFHQFHHALSTFSQMLTRGLMPDNRVLPSAV 121

Query: 2599 KACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVV 2420
            KACAGLSAL+  ++VHG+ SVSG  SD FVQ+SLVH+Y+KC +IR+AH+VFD M +PDVV
Sbjct: 122  KACAGLSALKPARQVHGIASVSGFDSDSFVQSSLVHMYIKCNQIRDAHRVFDRMFEPDVV 181

Query: 2419 TCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKM 2240
            + SALVA +AR G V EA RLF  MGDSG++ N +SWNGMIAGFNHS LY+EAVLMF  M
Sbjct: 182  SWSALVAAYARQGCVDEAKRLFSEMGDSGVQPNLISWNGMIAGFNHSGLYSEAVLMFLDM 241

Query: 2239 HSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACA 2060
            H +GF+PD T+ SSVLPAVGDLEDL +GI IH Y+IK GL SDKCV SALIDMYGKC+C 
Sbjct: 242  HLRGFEPDGTTISSVLPAVGDLEDLVMGILIHGYVIKQGLVSDKCVSSALIDMYGKCSCT 301

Query: 2059 KEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACC 1880
             EMSQVFD+MD +D+G+CNA + GLSRNG  E++L +F++++ QG+ELNVVSWTSMIACC
Sbjct: 302  SEMSQVFDQMDHMDVGSCNAFIFGLSRNGQVESSLRLFRQLKDQGMELNVVSWTSMIACC 361

Query: 1879 SQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDV 1700
            SQNG+D+EALELFR+MQIAGV PNSVTIPCLLPACGN+AALMHGKAAHCFSLRRG S DV
Sbjct: 362  SQNGRDMEALELFREMQIAGVKPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISTDV 421

Query: 1699 YVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRS 1520
            YVGSALIDMYAKCG+I++S+ CFDG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRS
Sbjct: 422  YVGSALIDMYAKCGRIQASRICFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRS 481

Query: 1519 GQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKE 1340
            GQKPD+I+FT VLSACSQSGLTEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++
Sbjct: 482  GQKPDIISFTCVLSACSQSGLTEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQ 541

Query: 1339 AYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYAS 1160
            AY+MI++MP  PDACVWGALLSSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS
Sbjct: 542  AYAMIRRMPVNPDACVWGALLSSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYAS 601

Query: 1159 NGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLE 980
             GMWN+V+ VR MMK+ GLRKNPGCSWIE+KNKVHMLLAGDKSHP M QIIE L+KLS+E
Sbjct: 602  KGMWNEVNRVRDMMKNKGLRKNPGCSWIEVKNKVHMLLAGDKSHPQMTQIIENLDKLSME 661

Query: 979  MKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLR 833
            MKK G+FP  NFVLQDVEEQDKEQ LCGHSEKLAVVFGLLNT  G PL+
Sbjct: 662  MKKLGYFPEINFVLQDVEEQDKEQILCGHSEKLAVVFGLLNTPPGYPLQ 710


>ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prunus persica]
            gi|462424139|gb|EMJ28402.1| hypothetical protein
            PRUPE_ppa019251mg [Prunus persica]
          Length = 654

 Score = 1031 bits (2666), Expect = 0.0
 Identities = 489/654 (74%), Positives = 563/654 (86%)
 Frame = -2

Query: 2650 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 2471
            MLS GL PD  + PS +KACAGL A + GK+VH + SVSGLASD FVQ+SLVH+Y+KC +
Sbjct: 1    MLSRGLVPDSFLFPSVVKACAGLPASKAGKQVHAIASVSGLASDSFVQSSLVHMYIKCDQ 60

Query: 2470 IREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 2291
            IR+A K+FD +PQ DV+ CSAL++G++R G V EA +L   M    LE N V WNGMIAG
Sbjct: 61   IRDARKLFDRVPQRDVIICSALISGYSRRGCVDEAMQLLSEMRGMCLEPNVVLWNGMIAG 120

Query: 2290 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 2111
            FN S+LYA+ V + +KMHS+GFQPD +S SS LPAVG LEDL +GIQIH Y++K GLGSD
Sbjct: 121  FNQSKLYADTVAVLQKMHSEGFQPDGSSISSALPAVGHLEDLGMGIQIHGYVVKQGLGSD 180

Query: 2110 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1931
            KCVVSALIDMYGKCAC+ E SQVF EMD++D+GACNALV GLSRNG  +NAL VF++ + 
Sbjct: 181  KCVVSALIDMYGKCACSFETSQVFHEMDQMDVGACNALVTGLSRNGLVDNALKVFRQFKD 240

Query: 1930 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1751
            QG+ELN+VSWTS+IA CSQNGKD+EALELFR+MQ+ GV PNSVTIPCLLPACGN+AALMH
Sbjct: 241  QGMELNIVSWTSIIASCSQNGKDMEALELFREMQVEGVEPNSVTIPCLLPACGNIAALMH 300

Query: 1750 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1571
            GKAAHCFSLRRG SNDVYVGS+LIDMYAKCG+IR S+ CFD MP RNLVCWNA++GGYAM
Sbjct: 301  GKAAHCFSLRRGISNDVYVGSSLIDMYAKCGKIRLSRLCFDEMPTRNLVCWNAVMGGYAM 360

Query: 1570 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1391
            HGK  E +E+F +MQRSGQKPD I+FT VLSACSQ GLT+EGWY+FNSMS EHG++ARVE
Sbjct: 361  HGKANETMEVFRLMQRSGQKPDFISFTCVLSACSQKGLTDEGWYYFNSMSKEHGLEARVE 420

Query: 1390 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1211
            HYACMV LL R+GKL+EAYSMI++MPFEPDACVWGALLSSCRVH+N+ LG+  AK+LF L
Sbjct: 421  HYACMVTLLSRSGKLEEAYSMIKQMPFEPDACVWGALLSSCRVHSNVTLGKYVAKKLFNL 480

Query: 1210 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 1031
            EPKNPGNYILLSNIYAS GMW++VD VR  MKS+GLRKNPGCSWIE+KNKVHMLLAGDK+
Sbjct: 481  EPKNPGNYILLSNIYASKGMWSEVDKVRDKMKSLGLRKNPGCSWIEVKNKVHMLLAGDKA 540

Query: 1030 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 851
            HP M QIIEKLNKLS EMKK G+FP T+FVLQDVEEQDKEQ LCGHSEKLAVV GLLN+ 
Sbjct: 541  HPQMNQIIEKLNKLSSEMKKLGYFPNTHFVLQDVEEQDKEQILCGHSEKLAVVLGLLNSP 600

Query: 850  QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
             GS LRVIKNLRICGDCHA IKF+SS+EGREI VRDTNLFHHFKDG CSC D+W
Sbjct: 601  PGSSLRVIKNLRICGDCHAVIKFISSFEGREISVRDTNLFHHFKDGVCSCEDYW 654



 Score =  187 bits (474), Expect = 3e-44
 Identities = 106/328 (32%), Positives = 174/328 (53%), Gaps = 4/328 (1%)
 Frame = -2

Query: 2803 LLSLYANHLCFDDASHVLDSI----LQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHG 2636
            L+S Y+   C D+A  +L  +    L+P++  ++ +I    +   Y   + +  +M S G
Sbjct: 82   LISGYSRRGCVDEAMQLLSEMRGMCLEPNVVLWNGMIAGFNQSKLYADTVAVLQKMHSEG 141

Query: 2635 LAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAH 2456
              PD   + SA+ A   L  L  G ++HG V   GL SD  V ++L+ +Y KC    E  
Sbjct: 142  FQPDGSSISSALPAVGHLEDLGMGIQIHGYVVKQGLGSDKCVVSALIDMYGKCACSFETS 201

Query: 2455 KVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSR 2276
            +VF  M Q DV  C+ALV G +R+G V  A ++F    D G+ELN VSW  +IA  + + 
Sbjct: 202  QVFHEMDQMDVGACNALVTGLSRNGLVDNALKVFRQFKDQGMELNIVSWTSIIASCSQNG 261

Query: 2275 LYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVS 2096
               EA+ +F++M  +G +P+  +   +LPA G++  L  G   H + ++ G+ +D  V S
Sbjct: 262  KDMEALELFREMQVEGVEPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISNDVYVGS 321

Query: 2095 ALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVEL 1916
            +LIDMY KC   +     FDEM   ++   NA++ G + +G A   + VF+ ++  G + 
Sbjct: 322  SLIDMYAKCGKIRLSRLCFDEMPTRNLVCWNAVMGGYAMHGKANETMEVFRLMQRSGQKP 381

Query: 1915 NVVSWTSMIACCSQNGKDIEALELFRDM 1832
            + +S+T +++ CSQ G   E    F  M
Sbjct: 382  DFISFTCVLSACSQKGLTDEGWYYFNSM 409



 Score = 99.8 bits (247), Expect = 7e-18
 Identities = 62/246 (25%), Positives = 114/246 (46%), Gaps = 35/246 (14%)
 Frame = -2

Query: 2863 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSALIHASTKHH 2684
            Q HG+++K GL ++   V+ L+ +Y    C  + S V   + Q D+ + +AL+   +++ 
Sbjct: 167  QIHGYVVKQGLGSDKCVVSALIDMYGKCACSFETSQVFHEMDQMDVGACNALVTGLSRNG 226

Query: 2683 RYNHALRIFSR-----------------------------------MLSHGLAPDCHVVP 2609
              ++AL++F +                                   M   G+ P+   +P
Sbjct: 227  LVDNALKVFRQFKDQGMELNIVSWTSIIASCSQNGKDMEALELFREMQVEGVEPNSVTIP 286

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
              + AC  ++AL  GK  H      G+++D +V +SL+ +Y KCGKIR +   FD MP  
Sbjct: 287  CLLPACGNIAALMHGKAAHCFSLRRGISNDVYVGSSLIDMYAKCGKIRLSRLCFDEMPTR 346

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            ++V  +A++ G+A HG   E   +F  M  SG + + +S+  +++  +   L  E    F
Sbjct: 347  NLVCWNAVMGGYAMHGKANETMEVFRLMQRSGQKPDFISFTCVLSACSQKGLTDEGWYYF 406

Query: 2248 KKMHSQ 2231
              M  +
Sbjct: 407  NSMSKE 412


>ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Solanum lycopersicum]
          Length = 828

 Score = 1024 bits (2648), Expect = 0.0
 Identities = 501/764 (65%), Positives = 604/764 (79%)
 Frame = -2

Query: 2980 KLLPQSQAMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKL 2801
            +LL    A   Q+L +L+S    T+ + I+ S+SL Q +Q H HILKTG S++THF  K+
Sbjct: 65   ELLNSMNARQAQSLRVLDSLMPNTILSLIARSSSLSQTQQVHAHILKTGHSSDTHFTNKV 124

Query: 2800 LSLYANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDC 2621
            LSLYAN  CF +A  +L S+  P+IFSF +LIHAS+K + +++ L +FSR+LS  + PD 
Sbjct: 125  LSLYANFNCFANAESLLHSLPNPNIFSFKSLIHASSKSNLFSYTLVLFSRLLSKCILPDV 184

Query: 2620 HVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDT 2441
            HV+PSAIKACAGLSA   GK+VHG    +GLA D FV+ SLVH+YVKC +++ A K+FD 
Sbjct: 185  HVLPSAIKACAGLSASEVGKQVHGYGLTTGLALDSFVEASLVHMYVKCDQLKCARKMFDK 244

Query: 2440 MPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEA 2261
            M +PDVV+ SAL  G+A+ G V  A  +FD  G  G+E N VSWNGMIAGFN S  Y EA
Sbjct: 245  MREPDVVSWSALSGGYAKKGDVFNAKMVFDEGGKLGIEPNLVSWNGMIAGFNQSGCYLEA 304

Query: 2260 VLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDM 2081
            VLMF++M+S GF+ D TS SSVLPAV DLEDL +G+Q+H+++IK G  SD C++SAL+DM
Sbjct: 305  VLMFQRMNSDGFRSDGTSISSVLPAVSDLEDLKMGVQVHSHVIKTGFESDNCIISALVDM 364

Query: 2080 YGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSW 1901
            YGKC C  EMS+VF+  ++ID+G  NALVAGLSRNG  + A  VFKK + +  ELNVVSW
Sbjct: 365  YGKCRCTSEMSRVFEGAEEIDLGGFNALVAGLSRNGLVDEAFKVFKKFKLKVKELNVVSW 424

Query: 1900 TSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLR 1721
            TSMI+ CSQ+GKD+EALE+FR+MQ+A V PNSVTI CLLPACGN+AAL+HGKA HCFSLR
Sbjct: 425  TSMISSCSQHGKDLEALEIFREMQLAKVRPNSVTISCLLPACGNIAALVHGKATHCFSLR 484

Query: 1720 RGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEI 1541
              FS+DVYV SALIDMYA CG+I+ ++  FD MP RNLVCWNA+  GYAMHGK KEA+EI
Sbjct: 485  NWFSDDVYVSSALIDMYANCGRIQLARVIFDRMPVRNLVCWNAMTSGYAMHGKAKEAIEI 544

Query: 1540 FHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLG 1361
            F  M+RSGQKPD I+FTSVLSACSQ+GLTE+G ++F+ MS  HG++ARVEHYACMV+LLG
Sbjct: 545  FDSMRRSGQKPDFISFTSVLSACSQAGLTEQGQHYFDCMSRIHGLEARVEHYACMVSLLG 604

Query: 1360 RAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYIL 1181
            R GKLKEAY MI  MP EPDACVWGALLSSCR H N+ LGEIAA +LFELEPKNPGNYIL
Sbjct: 605  RTGKLKEAYDMISTMPIEPDACVWGALLSSCRTHRNMSLGEIAADKLFELEPKNPGNYIL 664

Query: 1180 LSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEK 1001
            LSNIYASN  WN+VD VR MMK +GL KNPGCSWIEIKNKVHMLLAGD  HP M QI+EK
Sbjct: 665  LSNIYASNNRWNEVDKVRDMMKHVGLSKNPGCSWIEIKNKVHMLLAGDDLHPQMPQIMEK 724

Query: 1000 LNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKN 821
            L KLS++MK +G    T  VLQDVEEQDKE  LCGHSEKLAVV G+LNT+ G+ LRVIKN
Sbjct: 725  LRKLSMDMKNTGVSHDTELVLQDVEEQDKELILCGHSEKLAVVLGILNTNPGTSLRVIKN 784

Query: 820  LRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            LRICGDCH FIKF+SS+EGREI+VRD N +HHF +G CSCGD+W
Sbjct: 785  LRICGDCHTFIKFISSFEGREIYVRDANRYHHFNEGICSCGDYW 828


>ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
            gi|508723216|gb|EOY15113.1| Pentatricopeptide repeat
            (PPR) superfamily protein [Theobroma cacao]
          Length = 758

 Score = 1015 bits (2625), Expect = 0.0
 Identities = 493/758 (65%), Positives = 605/758 (79%), Gaps = 2/758 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 2777
            MT QAL      +   L    S  ASL Q  QAH +ILK+G+  +T   TKL+S YAN  
Sbjct: 1    MTVQALPFFEILNRSILPCLNSAVASLSQTSQAHAYILKSGVCIDTLISTKLISQYANRH 60

Query: 2776 CFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 2597
            CF +A  VL+SI +P + SFSALI+A  K++ +  +L +FSRMLS G+ PD  V+P+ +K
Sbjct: 61   CFAEAELVLNSISEPLVSSFSALIYALNKYNLFTQSLYVFSRMLSRGILPDNRVLPNVVK 120

Query: 2596 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 2417
            AC  LSA + GKEVHG+V   G  SD  VQ SLVHLY+K  +I++A  VF+ +P+ DVVT
Sbjct: 121  ACGKLSAFKLGKEVHGIVVKYGFDSDSVVQASLVHLYLKGDRIQDAKNVFERLPERDVVT 180

Query: 2416 CSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 2237
            C AL++ +AR G V EA  +F GM   G+  N VSWNGMI GFN S  Y EAV+MFK+MH
Sbjct: 181  CGALLSAYARKGCVNEAKEIFYGMQSFGVGPNLVSWNGMITGFNQSEQYNEAVVMFKEMH 240

Query: 2236 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 2057
            S+GF PD+ + SSV  AVGDLE L +GIQ+  Y+IK GL   K V+SAL+DM+GKCACA 
Sbjct: 241  SEGFLPDDITISSVFSAVGDLERLNIGIQVLCYVIKLGLLHCKFVISALMDMFGKCACAG 300

Query: 2056 EMSQVFDEMDK--IDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIAC 1883
            E+ + F+E+D+  +D GA NAL+ GLSRNG  + AL  F++   QG ELNVVSWTS+IA 
Sbjct: 301  ELMKAFEEVDEEIMDTGALNALITGLSRNGLVDVALETFQRFRVQGRELNVVSWTSIIAG 360

Query: 1882 CSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSND 1703
            CSQNGKDIEALELFR+MQ A + PNSVTIPCLLPACGN+AAL+HGKAAH F++R G +ND
Sbjct: 361  CSQNGKDIEALELFREMQSARLKPNSVTIPCLLPACGNIAALIHGKAAHGFAIRTGIAND 420

Query: 1702 VYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQR 1523
            V+VGSAL+DMYAKCG+I  S+ CFD +P++N VCWNAI+GGYAMHGK KEA++IFHMMQR
Sbjct: 421  VHVGSALVDMYAKCGRIHLSRLCFDRIPSKNSVCWNAIMGGYAMHGKAKEAIDIFHMMQR 480

Query: 1522 SGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLK 1343
             GQKPD I+F+ VLSACSQ GLTEEGW+ FNSMS +HG+KA++EHY+CMVNLLGR+GKL+
Sbjct: 481  RGQKPDFISFSCVLSACSQGGLTEEGWHFFNSMSRDHGVKAKMEHYSCMVNLLGRSGKLE 540

Query: 1342 EAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYA 1163
            +AY++IQ+MPFEPDACVWGALLSSCR+HNN+ LGEIAA+ LF+LEP NPGNYILLSNIYA
Sbjct: 541  QAYALIQQMPFEPDACVWGALLSSCRLHNNISLGEIAAQNLFKLEPSNPGNYILLSNIYA 600

Query: 1162 SNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSL 983
            S GMW++VD VR +M+S G++KNPGCSWIEIKN+VHMLLAGDKSHP M +IIEK+ KLS+
Sbjct: 601  SKGMWDEVDAVRDVMRSRGMKKNPGCSWIEIKNQVHMLLAGDKSHPQMTEIIEKIYKLSM 660

Query: 982  EMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGD 803
            +MKK+G+ P T+FVLQDV+EQDKEQ LCGHSEKLAV FGLLNT  GSPL++IKNLRICGD
Sbjct: 661  DMKKAGYLPNTDFVLQDVDEQDKEQILCGHSEKLAVAFGLLNTPPGSPLQIIKNLRICGD 720

Query: 802  CHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            CHA IKF+S +EGREI+VRDTN FHHFKDG CSC D+W
Sbjct: 721  CHAVIKFISGFEGREIYVRDTNRFHHFKDGVCSCRDYW 758


>ref|XP_002301973.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344115|gb|EEE81246.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 724

 Score =  980 bits (2533), Expect = 0.0
 Identities = 493/756 (65%), Positives = 580/756 (76%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 2777
            M RQAL L  +  H   +   +T ASL QA   H HILKTG+S                 
Sbjct: 13   MARQALPLFENFSHCLCS---ATKASLSQA---HAHILKTGIS----------------- 49

Query: 2776 CFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 2597
                        L   I  FS L H       + H +R+FS ML+ G+ PD  V+P+ IK
Sbjct: 50   ------------LPETIQIFSKLNH-------FGHVIRVFSYMLTQGIVPDSRVLPTVIK 90

Query: 2596 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 2417
             CA LSAL+TGK++H    VSGL  D  V +SL+H+YV+   +++A  VFD +PQP VVT
Sbjct: 91   TCAALSALQTGKQMHCFALVSGLGLDSVVLSSLLHMYVQFDHLKDARNVFDKLPQPGVVT 150

Query: 2416 CSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 2237
             SAL++ FAR G VKE   LF    D G+ELN VSWNGMI+GFN S  Y +AVLMF+ MH
Sbjct: 151  SSALISRFARKGRVKETKELFYQTRDLGVELNLVSWNGMISGFNRSGSYLDAVLMFQNMH 210

Query: 2236 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 2057
             +G +PD TS SSVLPAVGDL+   +GIQIH Y+IK GLG DK VVSALIDMYGKCACA 
Sbjct: 211  LEGLKPDGTSVSSVLPAVGDLDMPLMGIQIHCYVIKQGLGPDKFVVSALIDMYGKCACAS 270

Query: 2056 EMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCS 1877
            EMS VF+EMD++D+GACNALV GLSRNG  +NAL VFK+ +G  ++LNVVSWTSMIA CS
Sbjct: 271  EMSGVFNEMDEVDVGACNALVTGLSRNGLVDNALEVFKQFKG--MDLNVVSWTSMIASCS 328

Query: 1876 QNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVY 1697
            QNGKD+EALELFR+MQI GV PNSVTIPCLLPACGN+AAL+HGKAAHCFSLR G  NDVY
Sbjct: 329  QNGKDMEALELFREMQIEGVKPNSVTIPCLLPACGNIAALLHGKAAHCFSLRNGIFNDVY 388

Query: 1696 VGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSG 1517
            VGSALIDMYAKCG++ +S+ CFD MP RNLV WN+++ GYAMHGK  EA+ IF +MQR G
Sbjct: 389  VGSALIDMYAKCGRMLASRLCFDMMPNRNLVSWNSLMAGYAMHGKTFEAINIFELMQRCG 448

Query: 1516 QKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEA 1337
            QKPD ++FT VLSAC+Q GLTEEGW++F+SMS  HG++AR+EHY+CMV LLGR+G+L+EA
Sbjct: 449  QKPDHVSFTCVLSACTQGGLTEEGWFYFDSMSRNHGVEARMEHYSCMVTLLGRSGRLEEA 508

Query: 1336 YSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASN 1157
            Y+MI++MPFEPD+CVWGALLSSCRVHN + LGEIAAK +FELEP+NPGNYILLSNIYAS 
Sbjct: 509  YAMIKQMPFEPDSCVWGALLSSCRVHNRVDLGEIAAKRVFELEPRNPGNYILLSNIYASK 568

Query: 1156 GMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEM 977
             MW +VD+VR MM+S GL+KNPG SWIEIKNKVHMLLAGD SHP M QIIEKL KL++EM
Sbjct: 569  AMWVEVDMVRDMMRSRGLKKNPGYSWIEIKNKVHMLLAGDSSHPQMPQIIEKLAKLTVEM 628

Query: 976  KKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCH 797
            KKSG+ P T+FVLQDVEEQDKEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCH
Sbjct: 629  KKSGYVPHTDFVLQDVEEQDKEQILCGHSEKLAVVLGLLNTKPGFPLQVIKNLRICRDCH 688

Query: 796  AFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            A IKF+S +E REIFVRDTN FH FK G CSCGD+W
Sbjct: 689  AVIKFISDFEKREIFVRDTNRFHQFKGGVCSCGDYW 724


>gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus guttatus]
          Length = 654

 Score =  974 bits (2517), Expect = 0.0
 Identities = 458/654 (70%), Positives = 551/654 (84%)
 Frame = -2

Query: 2650 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 2471
            ML HGL PD HV+PS IKACAGL A+  GK+VHG    SG++ D FVQ+SLVH YVKC +
Sbjct: 1    MLKHGLFPDAHVLPSVIKACAGLLAVNIGKQVHGFSLASGISLDSFVQSSLVHFYVKCDE 60

Query: 2470 IREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 2291
            + +AHK+FD M + DVV+ SAL AG+AR G    A ++F+ + + G + N VSWNGMIAG
Sbjct: 61   LVDAHKLFDNMVERDVVSWSALAAGYARKGDRVNARKVFNEVKNLGFQPNTVSWNGMIAG 120

Query: 2290 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 2111
            FN S  + +AVLMF++MH  GF+ D TS SSVLPA+GDL  L  G Q+H Y+IK+G   D
Sbjct: 121  FNQSGCFLDAVLMFQQMHKHGFKSDGTSISSVLPAIGDLGYLSTGTQVHGYVIKNGFAVD 180

Query: 2110 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1931
            KC+VSALIDMYGKC CA EMSQV ++M ++++GACNAL+ GL+R+G  + AL VFK+++G
Sbjct: 181  KCIVSALIDMYGKCGCALEMSQVLEDMGQVEVGACNALITGLARHGLVDKALRVFKELQG 240

Query: 1930 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1751
            Q +ELNVVSWTS+IACCSQ+GKDIEALELFR+MQ AGV PN+VTIPCLLPACGN+AALMH
Sbjct: 241  QQMELNVVSWTSVIACCSQHGKDIEALELFREMQSAGVKPNAVTIPCLLPACGNIAALMH 300

Query: 1750 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1571
            GKAAHCFSLRRG S DVYVGSALIDMYA CG+I+ ++CCFD MP RNLVCWNA++GGYAM
Sbjct: 301  GKAAHCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMPVRNLVCWNAMLGGYAM 360

Query: 1570 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1391
            HGK  EA+E F +MQRSGQKPD ++ TS+LSACSQSGLTEEG  +F+ M+ +HGIK RVE
Sbjct: 361  HGKANEAIEFFLLMQRSGQKPDSVSLTSLLSACSQSGLTEEGHRYFDRMTTDHGIKPRVE 420

Query: 1390 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1211
            HYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGALLSSCRVH+N+ LGE+AA++LFEL
Sbjct: 421  HYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGALLSSCRVHHNMSLGEVAARKLFEL 480

Query: 1210 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 1031
            EP NPGNYIL+SNIYAS G + +VD +R +M+  GLRKNPGCSWIE+KNKVHMLLAGDKS
Sbjct: 481  EPMNPGNYILMSNIYASKGRYKEVDKIRDIMRDKGLRKNPGCSWIEVKNKVHMLLAGDKS 540

Query: 1030 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 851
             P MAQI++KLN+LS+EMKK+G+ P T++VLQDVEEQ+KE  LCGHSEKLAVVFG+LNT 
Sbjct: 541  LPQMAQIMDKLNRLSIEMKKAGYSPNTDYVLQDVEEQEKEHILCGHSEKLAVVFGILNTS 600

Query: 850  QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
             GSPLRV KNLRICGDCHA IKF+S +E REIFVRDTN +HHFKDG CSCGD+W
Sbjct: 601  PGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDTNRYHHFKDGDCSCGDYW 654



 Score =  181 bits (458), Expect = 2e-42
 Identities = 138/492 (28%), Positives = 224/492 (45%), Gaps = 42/492 (8%)
 Frame = -2

Query: 2866 RQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSALIHA-STK 2690
            +Q HG  L +G+S ++   + L+  Y       DA  + D++++ D+ S+SAL    + K
Sbjct: 30   KQVHGFSLASGISLDSFVQSSLVHFYVKCDELVDAHKLFDNMVERDVVSWSALAAGYARK 89

Query: 2689 HHRYN----------------------------------HALRIFSRMLSHGLAPDCHVV 2612
              R N                                   A+ +F +M  HG   D   +
Sbjct: 90   GDRVNARKVFNEVKNLGFQPNTVSWNGMIAGFNQSGCFLDAVLMFQQMHKHGFKSDGTSI 149

Query: 2611 PSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQ 2432
             S + A   L  L TG +VHG V  +G A D  + ++L+ +Y KCG   E  +V + M Q
Sbjct: 150  SSVLPAIGDLGYLSTGTQVHGYVIKNGFAVDKCIVSALIDMYGKCGCALEMSQVLEDMGQ 209

Query: 2431 PDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLM 2252
             +V  C+AL+ G ARHG V +A R+F  +    +ELN VSW  +IA  +      EA+ +
Sbjct: 210  VEVGACNALITGLARHGLVDKALRVFKELQGQQMELNVVSWTSVIACCSQHGKDIEALEL 269

Query: 2251 FKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGK 2072
            F++M S G +P+  +   +LPA G++  L  G   H + ++ G+  D  V SALIDMY  
Sbjct: 270  FREMQSAGVKPNAVTIPCLLPACGNIAALMHGKAAHCFSLRRGISGDVYVGSALIDMYAN 329

Query: 2071 CACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSM 1892
            C   +     FD M   ++   NA++ G + +G A  A+  F  ++  G + + VS TS+
Sbjct: 330  CGKIQLARCCFDRMPVRNLVCWNAMLGGYAMHGKANEAIEFFLLMQRSGQKPDSVSLTSL 389

Query: 1891 IACCSQNGKDIEALELFRDMQI-AGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRG 1715
            ++ CSQ+G   E    F  M    G+ P      C++   G    L   + A+    +  
Sbjct: 390  LSACSQSGLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKL---EEAYSMIEKMP 446

Query: 1714 FSNDVYVGSALIDM-----YAKCGQIRSSQCC-FDGMPARNLVCWNAIIGGYAMHGKVKE 1553
            F  D  V  AL+           G++ + +    + M   N +  + I   YA  G+ KE
Sbjct: 447  FEPDACVWGALLSSCRVHHNMSLGEVAARKLFELEPMNPGNYILMSNI---YASKGRYKE 503

Query: 1552 ALEIFHMMQRSG 1517
              +I  +M+  G
Sbjct: 504  VDKIRDIMRDKG 515



 Score =  108 bits (270), Expect = 1e-20
 Identities = 82/303 (27%), Positives = 133/303 (43%), Gaps = 39/303 (12%)
 Frame = -2

Query: 2878 LPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSALIHA 2699
            L    Q HG+++K G + +   V+ L+ +Y    C  + S VL+ + Q ++ + +ALI  
Sbjct: 162  LSTGTQVHGYVIKNGFAVDKCIVSALIDMYGKCGCALEMSQVLEDMGQVEVGACNALITG 221

Query: 2698 STKHHRYNHALRIFS-----------------------------------RMLSHGLAPD 2624
              +H   + ALR+F                                     M S G+ P+
Sbjct: 222  LARHGLVDKALRVFKELQGQQMELNVVSWTSVIACCSQHGKDIEALELFREMQSAGVKPN 281

Query: 2623 CHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFD 2444
               +P  + AC  ++AL  GK  H      G++ D +V ++L+ +Y  CGKI+ A   FD
Sbjct: 282  AVTIPCLLPACGNIAALMHGKAAHCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFD 341

Query: 2443 TMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAE 2264
             MP  ++V  +A++ G+A HG   EA   F  M  SG + + VS   +++  + S L  E
Sbjct: 342  RMPVRNLVCWNAMLGGYAMHGKANEAIEFFLLMQRSGQKPDSVSLTSLLSACSQSGLTEE 401

Query: 2263 AVLMFKKMHS-QGFQP---DETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVS 2096
                F +M +  G +P         S+L   G LE+ Y  I+      K     D CV  
Sbjct: 402  GHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKLEEAYSMIE------KMPFEPDACVWG 455

Query: 2095 ALI 2087
            AL+
Sbjct: 456  ALL 458


>ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            isoform X1 [Glycine max]
          Length = 748

 Score =  973 bits (2516), Expect = 0.0
 Identities = 477/746 (63%), Positives = 579/746 (77%), Gaps = 3/746 (0%)
 Frame = -2

Query: 2917 HITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFD--DASHVLDS 2744
            H       S++ASL QARQAH  IL+  L ++T   T LLS YAN L       S  L S
Sbjct: 3    HALSQCLSSSTASLSQARQAHALILRLNLFSDTQLTTSLLSFYANALSLSTPQLSLTLSS 62

Query: 2743 IL-QPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRT 2567
             L  P +FSFS+LIHA  + H + H L  FS +    L PD  ++PSAIK+CA L AL  
Sbjct: 63   HLPHPTLFSFSSLIHAFARSHHFPHVLTTFSHLHPLRLIPDAFLLPSAIKSCASLRALDP 122

Query: 2566 GKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFAR 2387
            G+++H   + SG  +D  V +SL H+Y+KC +I +A K+FD MP  DVV  SA++AG++R
Sbjct: 123  GQQLHAFAAASGFLTDSIVASSLTHMYLKCDRILDARKLFDRMPDRDVVVWSAMIAGYSR 182

Query: 2386 HGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETS 2207
             G V+EA  LF  M   G+E N VSWNGM+AGF ++  Y EAV MF+ M  QGF PD ++
Sbjct: 183  LGLVEEAKELFGEMRSGGVEPNLVSWNGMLAGFGNNGFYDEAVGMFRMMLVQGFWPDGST 242

Query: 2206 TSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMD 2027
             S VLPAVG LED+ +G Q+H Y+IK GLGSDK VVSA++DMYGKC C KEMS+VFDE++
Sbjct: 243  VSCVLPAVGCLEDVVVGAQVHGYVIKQGLGSDKFVVSAMLDMYGKCGCVKEMSRVFDEVE 302

Query: 2026 KIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALE 1847
            +++IG+ NA + GLSRNG  + AL VF K + Q +ELNVV+WTS+IA CSQNGKD+EALE
Sbjct: 303  EMEIGSLNAFLTGLSRNGMVDTALEVFNKFKDQKMELNVVTWTSIIASCSQNGKDLEALE 362

Query: 1846 LFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYA 1667
            LFRDMQ  GV PN+VTIP L+PACGN++ALMHGK  HCFSLRRG  +DVYVGSALIDMYA
Sbjct: 363  LFRDMQAYGVEPNAVTIPSLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYA 422

Query: 1666 KCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTS 1487
            KCG+I+ ++ CFD M A NLV WNA++ GYAMHGK KE +E+FHMM +SGQKPDL+TFT 
Sbjct: 423  KCGRIQLARRCFDKMSALNLVSWNAVMKGYAMHGKAKETMEMFHMMLQSGQKPDLVTFTC 482

Query: 1486 VLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFE 1307
            VLSAC+Q+GLTEEGW  +NSMS EHGI+ ++EHYAC+V LL R GKL+EAYS+I++MPFE
Sbjct: 483  VLSACAQNGLTEEGWRCYNSMSEEHGIEPKMEHYACLVTLLSRVGKLEEAYSIIKEMPFE 542

Query: 1306 PDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVR 1127
            PDACVWGALLSSCRVHNNL LGEIAA++LF LEP NPGNYILLSNIYAS G+W++ + +R
Sbjct: 543  PDACVWGALLSSCRVHNNLSLGEIAAEKLFFLEPTNPGNYILLSNIYASKGLWDEENRIR 602

Query: 1126 GMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTN 947
             +MKS GLRKNPG SWIE+ +KVHMLLAGD+SHP M  I+EKL+KL+++MKKSG+ P TN
Sbjct: 603  EVMKSKGLRKNPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMQMKKSGYLPKTN 662

Query: 946  FVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYE 767
            FVLQDVEEQDKEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  E
Sbjct: 663  FVLQDVEEQDKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLE 722

Query: 766  GREIFVRDTNLFHHFKDGACSCGDFW 689
            GREI+VRDTN FHHFKDG CSCGDFW
Sbjct: 723  GREIYVRDTNRFHHFKDGVCSCGDFW 748


>gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]
          Length = 728

 Score =  973 bits (2514), Expect = 0.0
 Identities = 476/737 (64%), Positives = 575/737 (78%), Gaps = 2/737 (0%)
 Frame = -2

Query: 2893 STSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFS 2714
            ST  SL   RQ H ++LK+  S +    TKLLSLYAN+LCF +A+ VLDSI  PD+F FS
Sbjct: 20   STPPSL--TRQLHAYLLKSN-SAQLSTTTKLLSLYANNLCFFEANLVLDSIPNPDLFCFS 76

Query: 2713 ALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVS 2534
             LIHAS+K  R++ +LR+FSRMLS  + PD  + PS +KA +GL +L  GK++H    + 
Sbjct: 77   TLIHASSKLGRFSFSLRLFSRMLSRQIFPDAFLFPSLVKASSGLPSLEVGKQLHSFAFLF 136

Query: 2533 GLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLF 2354
            G  SD FVQ+SL+H+Y+KC  I +A K+FD MPQ D+V  SAL++G++  G V+EA  LF
Sbjct: 137  GFCSDSFVQSSLLHMYLKCDHIWDARKLFDGMPQRDLVAWSALISGYSSRGLVEEAKGLF 196

Query: 2353 DGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDL 2174
              MG  GLE N V+WNGMI+GF+ S   +EAV MF++MHS+G  PD +S SSVLPA+GDL
Sbjct: 197  YDMGMGGLEPNVVTWNGMISGFSRSGSCSEAVDMFRRMHSEGVPPDGSSVSSVLPAIGDL 256

Query: 2173 EDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALV 1994
            EDL +GIQ+H Y++K G GSDKCV SALIDMYGK +                        
Sbjct: 257  EDLNVGIQVHGYVVKRGFGSDKCVTSALIDMYGKSSW----------------------- 293

Query: 1993 AGLSRNGFAENALAVFKKI--EGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAG 1820
              LSRNGF E+AL VF+K   + Q ++LN+VSWTS+IACCSQNGKD++ALELFR+MQ+ G
Sbjct: 294  --LSRNGFVEDALEVFRKFKRQQQAMQLNIVSWTSVIACCSQNGKDMDALELFREMQLEG 351

Query: 1819 VMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQ 1640
              PNSVTIPC+LPACGN+AAL +GKAAHCFSLR G  +++YVGSALIDMY  CG++  S+
Sbjct: 352  FKPNSVTIPCMLPACGNIAALTYGKAAHCFSLRMGIFDNLYVGSALIDMYGNCGKLHLSR 411

Query: 1639 CCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSG 1460
             CFD +P RNLVCWNAI+ GYAMHGK +E +EIF MMQ+SGQKPD I+FT VLSACSQ+G
Sbjct: 412  LCFDQLPVRNLVCWNAIMSGYAMHGKARETIEIFQMMQKSGQKPDFISFTCVLSACSQNG 471

Query: 1459 LTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGAL 1280
            LT+EGW++F+SMS EHGI+AR+EHYACMV LLGR+GKL+EAYS+I KMP EPDACVWG+L
Sbjct: 472  LTDEGWHYFSSMSKEHGIEARLEHYACMVTLLGRSGKLEEAYSLINKMPMEPDACVWGSL 531

Query: 1279 LSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLR 1100
            LSSCRVHNN+ LGE+AA++LFELEP+NPGNY++LSNIY S GMW+ VD VR MM   GLR
Sbjct: 532  LSSCRVHNNVSLGEVAAEKLFELEPRNPGNYVILSNIYGSKGMWSQVDRVRDMMNQKGLR 591

Query: 1099 KNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQ 920
            KNPGCSWIE+KN+VHMLLAGDKSHP   QII KLNKLS+EMK SG+FP   FVLQDVEEQ
Sbjct: 592  KNPGCSWIEVKNEVHMLLAGDKSHPQRIQIIGKLNKLSMEMKNSGYFPNFTFVLQDVEEQ 651

Query: 919  DKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDT 740
            DK   LCGHSEKLAV FGLLNT  GS LRVIKNLRICGDCH  IKF+SS+E REIFVRDT
Sbjct: 652  DKVHILCGHSEKLAVAFGLLNTPPGSSLRVIKNLRICGDCHVVIKFISSFEQREIFVRDT 711

Query: 739  NLFHHFKDGACSCGDFW 689
            N FHHFKDG CSCGD+W
Sbjct: 712  NRFHHFKDGHCSCGDYW 728


>gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus guttatus]
          Length = 654

 Score =  964 bits (2492), Expect = 0.0
 Identities = 456/654 (69%), Positives = 548/654 (83%)
 Frame = -2

Query: 2650 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 2471
            ML  GL PD HV+PS IKACAGL A++ GK+VHG    SG++ D FVQ+SLVH YVKC +
Sbjct: 1    MLKQGLFPDAHVLPSVIKACAGLLAVKIGKQVHGFSLASGISLDSFVQSSLVHFYVKCDE 60

Query: 2470 IREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 2291
            + +AHK+FD M + DVV+ SAL AG+AR G    A ++F+ + + G + N VSWNGMIAG
Sbjct: 61   LVDAHKLFDNMVERDVVSWSALAAGYARKGDAVNARKVFNEVKNLGFQPNTVSWNGMIAG 120

Query: 2290 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 2111
            FN S  + +AVLMF++MH  GF+ D TS SSVLPA+GDL  L  G Q+H Y+IK+G   D
Sbjct: 121  FNRSGCFLDAVLMFQQMHKHGFKSDGTSISSVLPAIGDLGYLSSGTQVHGYVIKNGFAVD 180

Query: 2110 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1931
            KC+VSALIDMYGKC  A EMSQV ++M ++++GACNAL+ GL+R+G  + AL VFK+++G
Sbjct: 181  KCIVSALIDMYGKCGYALEMSQVLEDMGQVEVGACNALITGLARHGLVDKALGVFKELQG 240

Query: 1930 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1751
            Q +ELNVVSWTS+IACCSQ+GKDIEALELFR+MQ +GV PN+VTIPCLLPACGN+AALMH
Sbjct: 241  QQMELNVVSWTSVIACCSQHGKDIEALELFREMQASGVKPNAVTIPCLLPACGNIAALMH 300

Query: 1750 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1571
            GKAAHCFSLRRG S DVYVGSALIDMYA CG+I+ ++CCFD M  RNLVCWNA++GGYAM
Sbjct: 301  GKAAHCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMSVRNLVCWNAMLGGYAM 360

Query: 1570 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1391
            HGK KEA+E F +MQRSGQKPD ++ TS+LSACSQSGLTEEG  +F+ M+ +HGIK RVE
Sbjct: 361  HGKAKEAIEFFLLMQRSGQKPDSVSLTSLLSACSQSGLTEEGHRYFDRMTTDHGIKPRVE 420

Query: 1390 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1211
            HYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGALLSSCRVH+N+ LG +AA++LFEL
Sbjct: 421  HYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGALLSSCRVHHNMSLGGVAARKLFEL 480

Query: 1210 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 1031
            EPKNPGNYILLSNIYAS G + +VD +R +M   GLRKNPGCSWIE+KNKVHMLLAGDKS
Sbjct: 481  EPKNPGNYILLSNIYASKGRYKEVDKIRDIMGDKGLRKNPGCSWIEVKNKVHMLLAGDKS 540

Query: 1030 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 851
             P MAQI+EKLN+LS+EMKK+G+ P T++VLQDVEEQ+KE  LCGHSEKLAVVFG+LN  
Sbjct: 541  LPQMAQIMEKLNRLSIEMKKAGYSPNTDYVLQDVEEQEKEHILCGHSEKLAVVFGILNMS 600

Query: 850  QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
             GSPLRV KNLRICGDCHA IKF+S +E REIFVRDTN +HHFKDG CSCGD+W
Sbjct: 601  PGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDTNRYHHFKDGDCSCGDYW 654



 Score =  174 bits (442), Expect = 2e-40
 Identities = 121/432 (28%), Positives = 200/432 (46%), Gaps = 36/432 (8%)
 Frame = -2

Query: 2866 RQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSAL------- 2708
            +Q HG  L +G+S ++   + L+  Y       DA  + D++++ D+ S+SAL       
Sbjct: 30   KQVHGFSLASGISLDSFVQSSLVHFYVKCDELVDAHKLFDNMVERDVVSWSALAAGYARK 89

Query: 2707 ----------------------------IHASTKHHRYNHALRIFSRMLSHGLAPDCHVV 2612
                                        I    +   +  A+ +F +M  HG   D   +
Sbjct: 90   GDAVNARKVFNEVKNLGFQPNTVSWNGMIAGFNRSGCFLDAVLMFQQMHKHGFKSDGTSI 149

Query: 2611 PSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQ 2432
             S + A   L  L +G +VHG V  +G A D  + ++L+ +Y KCG   E  +V + M Q
Sbjct: 150  SSVLPAIGDLGYLSSGTQVHGYVIKNGFAVDKCIVSALIDMYGKCGYALEMSQVLEDMGQ 209

Query: 2431 PDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLM 2252
             +V  C+AL+ G ARHG V +A  +F  +    +ELN VSW  +IA  +      EA+ +
Sbjct: 210  VEVGACNALITGLARHGLVDKALGVFKELQGQQMELNVVSWTSVIACCSQHGKDIEALEL 269

Query: 2251 FKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGK 2072
            F++M + G +P+  +   +LPA G++  L  G   H + ++ G+  D  V SALIDMY  
Sbjct: 270  FREMQASGVKPNAVTIPCLLPACGNIAALMHGKAAHCFSLRRGISGDVYVGSALIDMYAN 329

Query: 2071 CACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSM 1892
            C   +     FD M   ++   NA++ G + +G A+ A+  F  ++  G + + VS TS+
Sbjct: 330  CGKIQLARCCFDRMSVRNLVCWNAMLGGYAMHGKAKEAIEFFLLMQRSGQKPDSVSLTSL 389

Query: 1891 IACCSQNGKDIEALELFRDMQI-AGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRG 1715
            ++ CSQ+G   E    F  M    G+ P      C++   G    L   + A+    +  
Sbjct: 390  LSACSQSGLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKL---EEAYSMIEKMP 446

Query: 1714 FSNDVYVGSALI 1679
            F  D  V  AL+
Sbjct: 447  FEPDACVWGALL 458


>ref|XP_002890375.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297336217|gb|EFH66634.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 760

 Score =  946 bits (2444), Expect = 0.0
 Identities = 451/760 (59%), Positives = 584/760 (76%), Gaps = 4/760 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTSA----SLPQARQAHGHILKTGLSNETHFVTKLLSLY 2789
            MT+Q L L+       L    S+S+    SL +  QAH  ILK+G  N+ +   KL++ Y
Sbjct: 1    MTKQVLPLIEKIPQTILGILESSSSLWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60

Query: 2788 ANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 2609
            +N+ CF+DA  +L SI  P ++SFS+LI+A TK   ++ ++ +FSRM SHGL PD HV+P
Sbjct: 61   SNYNCFNDADLILQSIPDPTVYSFSSLIYALTKAKLFSQSIGVFSRMFSHGLIPDTHVLP 120

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            +  K CA LSA + GK++H V  VSGL  D FVQ SL H+Y++CG++ +A KVFD M + 
Sbjct: 121  NLFKVCAELSAFKAGKQIHCVACVSGLDMDAFVQGSLFHMYMRCGRMGDARKVFDRMSEK 180

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            DVVTCSAL+ G+AR G ++E  R+   M  SG+E N VSWNG+++GFN S  + EAV+MF
Sbjct: 181  DVVTCSALLCGYARKGCLEEVVRILSEMEKSGIEPNIVSWNGILSGFNRSGYHKEAVIMF 240

Query: 2248 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 2069
            +KMH  GF PD+ + SSVLP+VGD E+L +G QIH Y+IK GL  DKCV+SA++DMYGK 
Sbjct: 241  QKMHHLGFCPDQVTVSSVLPSVGDSENLNMGRQIHGYVIKQGLLKDKCVISAMLDMYGKS 300

Query: 2068 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1889
                 + ++FDE + ++ G CNA + GLSRNG  + AL +F   + Q +ELNVVSWTS+I
Sbjct: 301  GHVYGIIKLFDEFEMMETGVCNAYITGLSRNGLVDKALEMFGLFKEQKMELNVVSWTSII 360

Query: 1888 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1709
            A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R    
Sbjct: 361  AGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420

Query: 1708 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1529
            +DV+VGSALIDMYAKCG+I+ SQ  F+ MP +NLVCWN+++ GY+MHGK KE + IF  +
Sbjct: 421  DDVHVGSALIDMYAKCGRIKMSQIVFNMMPTKNLVCWNSLMNGYSMHGKAKEVMSIFESL 480

Query: 1528 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1349
             R+  KPD I+FTS+LSAC Q GLT+EGW +FN MS E+GIK R+EHY+CMVNLLGRAGK
Sbjct: 481  MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFNMMSEEYGIKPRLEHYSCMVNLLGRAGK 540

Query: 1348 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1169
            L+EAY +I+++PFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+L+SNI
Sbjct: 541  LQEAYDLIKEIPFEPDSCVWGALLNSCRLQNNVDLAEIAAQKLFHLEPENPGTYVLMSNI 600

Query: 1168 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 989
            YA+ GMW +VD +R  M+S+GL+KNPGCSWI++KNKV+ LLA DKSHP + QI EK++++
Sbjct: 601  YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNKVYTLLACDKSHPQIDQITEKMDEI 660

Query: 988  SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 809
            S EM+KSG  P  +F LQDVEEQ++EQ L GHSEKLAVVFGLLNT  G+PL+VIKNLRIC
Sbjct: 661  SEEMRKSGHRPNLDFALQDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720

Query: 808  GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            GDCHA IKF+SSY GREIF+RDTN FHHFKDG CSCGDFW
Sbjct: 721  GDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760


>ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Capsella rubella]
            gi|482575552|gb|EOA39739.1| hypothetical protein
            CARUB_v10008385mg [Capsella rubella]
          Length = 760

 Score =  944 bits (2439), Expect = 0.0
 Identities = 452/760 (59%), Positives = 586/760 (77%), Gaps = 4/760 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTSA----SLPQARQAHGHILKTGLSNETHFVTKLLSLY 2789
            MT+Q L L+       +    S+S+    SL +  QAH  ILK+G  N+ +   KL++ Y
Sbjct: 1    MTKQVLPLIVQIPQSIVGFLESSSSIWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60

Query: 2788 ANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 2609
            +N+ CFDDA  VL SI  P ++SFS+LI+A TK   ++ ++ +FSRM SHGL PD HV+P
Sbjct: 61   SNYSCFDDADLVLQSIPDPTVYSFSSLIYALTKAKLFSQSIGVFSRMFSHGLIPDSHVLP 120

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            +  K CA LSA + GK++H V  VSGL  D FVQ SL H+Y++CG++ +A KVFD M + 
Sbjct: 121  NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSLFHMYMRCGRMGDARKVFDRMFEK 180

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            DVVTCSAL+ G+AR G ++E  R+  GM +SG+E N VSWNG+++GFN S  + EAV+MF
Sbjct: 181  DVVTCSALLCGYARKGCLEEVVRILSGMENSGIEPNIVSWNGILSGFNRSGYHREAVIMF 240

Query: 2248 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 2069
            +KMH  GF PD+ + SSVLP+VGD E L +G QIH Y+IK GL  DKCV+SA++DMYGK 
Sbjct: 241  QKMHLCGFSPDQVTVSSVLPSVGDSEMLNMGRQIHGYVIKQGLLKDKCVISAMLDMYGKS 300

Query: 2068 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1889
                 + ++FDE + ++ G CNA + GLSRNG  + AL +F+  + Q VELNVVSWTS+I
Sbjct: 301  GHVYGIIKLFDEFEMMETGVCNAYITGLSRNGLVDKALEMFELFKEQKVELNVVSWTSII 360

Query: 1888 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1709
            A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R    
Sbjct: 361  AGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHGRSTHGFAVRVHLW 420

Query: 1708 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1529
            +DV+VGSALIDMYAKCG+I  SQ  F+ MP +NLVCWN+++ GY+MHGK KE + IF  +
Sbjct: 421  DDVHVGSALIDMYAKCGRINMSQFVFNMMPTKNLVCWNSLMNGYSMHGKAKEVMSIFESL 480

Query: 1528 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1349
             R+  KPD I+FTS+L++C Q GLT+EGW +F+ MS E+GIK R+EHY+CMVNLLGRAGK
Sbjct: 481  LRTRLKPDFISFTSLLASCGQVGLTDEGWKYFSMMSEEYGIKPRLEHYSCMVNLLGRAGK 540

Query: 1348 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1169
            L+EAY +I++MPFEPD+CVWGALL+SCR+ +N+ L EIAA +LF+LEP+NPG Y+LLSNI
Sbjct: 541  LQEAYELIKEMPFEPDSCVWGALLNSCRLQSNVDLAEIAADKLFDLEPENPGTYVLLSNI 600

Query: 1168 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 989
            YA+ GMW +VD +R  M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++
Sbjct: 601  YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660

Query: 988  SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 809
            S EM+KSG  P  +F LQDVEEQ++EQ L GHSEKLAVVFGLLNT  G+PL+VIKNLRIC
Sbjct: 661  SEEMRKSGHRPNLDFALQDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720

Query: 808  GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            GDCH+ IKF+SSY GREIFVRDTN FHHFKDG CSCGDFW
Sbjct: 721  GDCHSVIKFISSYAGREIFVRDTNRFHHFKDGICSCGDFW 760


>ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutrema salsugineum]
            gi|557094189|gb|ESQ34771.1| hypothetical protein
            EUTSA_v10009574mg [Eutrema salsugineum]
          Length = 760

 Score =  943 bits (2437), Expect = 0.0
 Identities = 452/760 (59%), Positives = 584/760 (76%), Gaps = 4/760 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFIST----SASLPQARQAHGHILKTGLSNETHFVTKLLSLY 2789
            MT+Q L L+       L     +    S+SL +  QAH  ILK+G  N+ +  +KL++ Y
Sbjct: 1    MTKQVLPLIEKIPQSILGFLEFSPSCWSSSLTKTTQAHARILKSGAQNDGYISSKLIASY 60

Query: 2788 ANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 2609
            +N+ CFDDA+ +L SI  P ++SFS+LI+A TK   ++ +L +FSRM SHGL PD HV+P
Sbjct: 61   SNYSCFDDANLILQSIPDPSVYSFSSLIYALTKAKLFSQSLGVFSRMFSHGLIPDTHVLP 120

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            +  K CA LSA + GK++H V    GL  D FVQ SL H+Y++CG++ +A KVFD M + 
Sbjct: 121  NLFKVCAELSAFKAGKQIHCVSCTLGLDEDAFVQGSLFHMYMRCGRMGDARKVFDRMSEK 180

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            DVVTCSAL+ G+AR G +++  R+   M  SG+E N VSWNG+++GFN S  + EAV+MF
Sbjct: 181  DVVTCSALLCGYARKGCLEDVVRILSEMEKSGIEPNIVSWNGILSGFNRSGYHEEAVIMF 240

Query: 2248 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 2069
            +KMH  GF PDE + SSVLP+VGD E L +G QIH Y+IK GL  DKCV SA+IDMYGK 
Sbjct: 241  QKMHHLGFFPDEVAVSSVLPSVGDSEKLDMGRQIHGYVIKQGLLKDKCVTSAMIDMYGKS 300

Query: 2068 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1889
                 + ++F++++ ++ G CNA + GLSRNG  + AL +F+  + Q +ELNVVSWTS+I
Sbjct: 301  GQVYGIIKLFEQVELMETGVCNACITGLSRNGLIDKALEMFELFKEQNIELNVVSWTSII 360

Query: 1888 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1709
            A C+QNGKDIEALELFR+MQ+A V PN VTIP +LPACGN+AAL+HG++AH F++R    
Sbjct: 361  AGCAQNGKDIEALELFREMQVARVKPNRVTIPSMLPACGNIAALVHGRSAHGFAVRVHLL 420

Query: 1708 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1529
            +DV+VGSALIDMYAKCG+I  SQ  FD MP RNLVCWN+++ GY+MHGK KE + IF  +
Sbjct: 421  DDVHVGSALIDMYAKCGRINMSQMVFDMMPTRNLVCWNSLMSGYSMHGKAKEVMSIFDSL 480

Query: 1528 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1349
             R+  KPD I+FTS+LSACSQ GLT+EGW +F  M+ E+GIK R+EHY+CMV+LLGRAGK
Sbjct: 481  VRTRLKPDFISFTSLLSACSQVGLTDEGWKYFGMMTEEYGIKPRLEHYSCMVSLLGRAGK 540

Query: 1348 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1169
            L+EAY +I+++PFEPD+CVWGALL+SCR+ NN+ L EIAA++LF+LEP+NPG Y+LLSNI
Sbjct: 541  LQEAYDLIKEIPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFDLEPENPGTYVLLSNI 600

Query: 1168 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 989
            YA+ GMW +VD VR  M+S+GL+KNPGCSWI++KNKV+ LLAGDKSHP + QI EK++++
Sbjct: 601  YAAKGMWAEVDSVRNKMESLGLKKNPGCSWIQVKNKVYTLLAGDKSHPQIEQITEKMDEI 660

Query: 988  SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 809
            S EM+KSG  P  +F LQDVEEQ+KEQ L GHSEKLAVVFGLLNT  G+PL+VIKNLRIC
Sbjct: 661  SKEMRKSGHRPNLDFALQDVEEQEKEQILLGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720

Query: 808  GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            GDCH+ IKF+S Y GREIFVRDTN FHHFKDG CSCGDFW
Sbjct: 721  GDCHSVIKFISGYAGREIFVRDTNRFHHFKDGICSCGDFW 760


>ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Cicer arietinum]
          Length = 730

 Score =  939 bits (2427), Expect = 0.0
 Identities = 450/735 (61%), Positives = 567/735 (77%)
 Frame = -2

Query: 2893 STSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFS 2714
            ST+++L  ARQAH H LK GL  +T   T LLSLY+++L F     VL S+ QP +FSFS
Sbjct: 11   STTSTLFHARQAHAHFLKFGLFFDTQLTTSLLSLYSHYLPFTQLKLVLSSLPQPTLFSFS 70

Query: 2713 ALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVS 2534
            ++I++  +   +NH L +FS+M S GL PD +++PSAIKAC+ L AL+ G++VHG   VS
Sbjct: 71   SIINSFARSRHFNHVLGVFSQMGSLGLVPDSYLLPSAIKACSALKALKLGRQVHGFAYVS 130

Query: 2533 GLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLF 2354
            G  SD  + +SLVH+Y+KC  I +A K+FD+M + DVV  SA++AG++R G V  A  LF
Sbjct: 131  GFGSDSILISSLVHMYLKCKTIEDAQKLFDSMSERDVVVWSAMIAGYSRLGLVDRAKELF 190

Query: 2353 DGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDL 2174
              M + G+E N VSWNGMIAGF ++  Y EA ++F+ M S+GF PD ++ S VLP +G+L
Sbjct: 191  SEMRNEGVEPNLVSWNGMIAGFGNAGSYGEAAMLFRGMISEGFLPDGSAVSCVLPGIGNL 250

Query: 2173 EDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALV 1994
            ED+ +G Q+H Y+IK GL SD  V+SAL+DMYGKC C  EMS+VFDE+D+ +IG+ NA +
Sbjct: 251  EDVLMGKQVHGYVIKQGLDSDNFVISALLDMYGKCGCTSEMSRVFDEIDQTEIGSLNAFL 310

Query: 1993 AGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVM 1814
             GLSRNG  + AL +FKK + Q +ELNVV+WTS+IA C+Q+GKD+EALE FRDMQ  GV 
Sbjct: 311  TGLSRNGLVDTALEMFKKFKAQEIELNVVTWTSIIASCTQHGKDMEALEFFRDMQADGVE 370

Query: 1813 PNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCC 1634
            P +VTIP L+PACGN++AL HGK  HCFSLR+G  +DVYVGSALIDMYAKCG+I+ S+ C
Sbjct: 371  PTAVTIPSLIPACGNVSALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRHC 430

Query: 1633 FDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLT 1454
            FD MPA+NLV WN+++ GYAMHGK +E +E+F+MM +SGQKPDLITFT VLSAC+Q+GL 
Sbjct: 431  FDIMPAKNLVSWNSVMSGYAMHGKARETIEMFNMMLQSGQKPDLITFTCVLSACTQNGLI 490

Query: 1453 EEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLS 1274
            EEGW +FNSMS EH ++ R+EHY               AYS++++MPFEPDACVWG+LLS
Sbjct: 491  EEGWNYFNSMSKEHDVEPRMEHY---------------AYSIVKEMPFEPDACVWGSLLS 535

Query: 1273 SCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKN 1094
            SCRVH NL LGEIAA++LF LEP NPGNY+LLSNIYAS GMW + + +R MMK+ GLRKN
Sbjct: 536  SCRVHKNLSLGEIAAEKLFVLEPDNPGNYVLLSNIYASKGMWGEENRIRNMMKNKGLRKN 595

Query: 1093 PGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDK 914
            PGCSWIEI  +VH LL+GDKSHP M +I+EK +KLS+E+KKSG+ P+TN VLQDVEEQDK
Sbjct: 596  PGCSWIEIGRRVHTLLSGDKSHPQMKEILEKSDKLSIEIKKSGYLPMTNTVLQDVEEQDK 655

Query: 913  EQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNL 734
            EQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  E REI+VRDTN 
Sbjct: 656  EQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEAREIYVRDTNR 715

Query: 733  FHHFKDGACSCGDFW 689
            FHHFKDG CSC DFW
Sbjct: 716  FHHFKDGVCSCEDFW 730


>ref|NP_173449.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806503|sp|Q9LNU6.2|PPR53_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g20230 gi|332191832|gb|AEE29953.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 760

 Score =  929 bits (2401), Expect = 0.0
 Identities = 445/760 (58%), Positives = 577/760 (75%), Gaps = 4/760 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTS----ASLPQARQAHGHILKTGLSNETHFVTKLLSLY 2789
            MT+Q L L+       +    S+S    +SL +  QAH  ILK+G  N+ +   KL++ Y
Sbjct: 1    MTKQVLPLIEKIPQSIVGFLESSSYHWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60

Query: 2788 ANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 2609
            +N+ CF+DA  VL SI  P I+SFS+LI+A TK   +  ++ +FSRM SHGL PD HV+P
Sbjct: 61   SNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLP 120

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            +  K CA LSA + GK++H V  VSGL  D FVQ S+ H+Y++CG++ +A KVFD M   
Sbjct: 121  NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDK 180

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            DVVTCSAL+  +AR G ++E  R+   M  SG+E N VSWNG+++GFN S  + EAV+MF
Sbjct: 181  DVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMF 240

Query: 2248 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 2069
            +K+H  GF PD+ + SSVLP+VGD E L +G  IH Y+IK GL  DKCV+SA+IDMYGK 
Sbjct: 241  QKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKS 300

Query: 2068 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1889
                 +  +F++ + ++ G CNA + GLSRNG  + AL +F+  + Q +ELNVVSWTS+I
Sbjct: 301  GHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSII 360

Query: 1888 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1709
            A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R    
Sbjct: 361  AGCAQNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420

Query: 1708 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1529
            ++V+VGSALIDMYAKCG+I  SQ  F+ MP +NLVCWN+++ G++MHGK KE + IF  +
Sbjct: 421  DNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESL 480

Query: 1528 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1349
             R+  KPD I+FTS+LSAC Q GLT+EGW +F  MS E+GIK R+EHY+CMVNLLGRAGK
Sbjct: 481  MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGK 540

Query: 1348 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1169
            L+EAY +I++MPFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+LLSNI
Sbjct: 541  LQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNI 600

Query: 1168 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 989
            YA+ GMW +VD +R  M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++
Sbjct: 601  YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660

Query: 988  SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 809
            S EM+KSG  P  +F L DVEEQ++EQ L GHSEKLAVVFGLLNT  G+PL+VIKNLRIC
Sbjct: 661  SKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720

Query: 808  GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 689
            GDCHA IKF+SSY GREIF+RDTN FHHFKDG CSCGDFW
Sbjct: 721  GDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760


>gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]
          Length = 1063

 Score =  906 bits (2342), Expect = 0.0
 Identities = 440/742 (59%), Positives = 560/742 (75%), Gaps = 2/742 (0%)
 Frame = -2

Query: 2908 LNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPD 2729
            L+      ASL Q RQAH  +L+TGL   + +   +LSLYA H    DA  +L S+L PD
Sbjct: 323  LSNLSKIGASLSQIRQAHAQLLRTGLFELSQYSNNILSLYARHQYLSDAKRLLRSLLTPD 382

Query: 2728 IFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHG 2549
              +F+ LI A +K       L + S  L  GL PD +V+PS I+ACAGL A + GK+ HG
Sbjct: 383  SAAFTVLITACSKSSDLKSTLILVSEFLRSGLTPDVYVLPSIIRACAGLFAFKIGKQAHG 442

Query: 2548 VVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHGYVKE 2369
               VSG   DPF+++SLVH Y+KCG++  A KVF +M + D+V+ SAL A +AR G V  
Sbjct: 443  FSIVSGFVLDPFIESSLVHFYLKCGELAGARKVFYSMDEKDIVSWSALSAAYARKGDVLN 502

Query: 2368 ANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLP 2189
            A +LF  +   G E N VSWNGMIAGFN S+ + +AVLMF++MHS GF  D  + SS LP
Sbjct: 503  AKKLFFSVRGFGFEPNAVSWNGMIAGFNQSKHFLDAVLMFQQMHSCGFPSDGINISSALP 562

Query: 2188 AVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGA 2009
            AV DL  L LG Q+H ++IK G   DKC+VSALIDMYGK   A E+  VF++M ++D+  
Sbjct: 563  AVSDLGSLKLGTQVHGHVIKIGFAGDKCIVSALIDMYGKLGNASEILLVFEDMHQLDVVV 622

Query: 2008 CNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQ 1829
            CNAL++GLSR+G  + +L++F+K+   G+E N+VSWTS I+CCSQ+G+D+EAL LFR+MQ
Sbjct: 623  CNALISGLSRHGLVDESLSMFEKLRSSGIE-NLVSWTSAISCCSQHGRDMEALGLFREMQ 681

Query: 1828 IAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIR 1649
             +GV PN+VTIP LLPACGN+AAL +GKA HCFSLR    NDVYVGSALIDMYA CG+I+
Sbjct: 682  FSGVKPNAVTIPSLLPACGNIAALSYGKAVHCFSLRNNICNDVYVGSALIDMYANCGKIK 741

Query: 1648 SSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACS 1469
            +++C F+ MP RNLVCWNA++G Y+MHG+ KEA+ +F  MQR GQKPD ++FTS+LSACS
Sbjct: 742  AARCLFERMPVRNLVCWNAMLGAYSMHGEAKEAIGLFQSMQRCGQKPDSVSFTSLLSACS 801

Query: 1468 QSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVW 1289
            QSGL EEG  +F SM  +HG++ R+EHYAC+V LLGRAGKL EAY+ I++MPFE DACVW
Sbjct: 802  QSGLAEEGRRYFESMFEDHGLEPRLEHYACIVGLLGRAGKLDEAYAKIKRMPFEADACVW 861

Query: 1288 GALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSM 1109
            GALLSSC +HNN  LGE+AA++LFELE  N GNYILLSNIYAS+  W +V  +R MM   
Sbjct: 862  GALLSSCALHNNEFLGEVAAEKLFELELGNSGNYILLSNIYASSRKWKEVRRIRDMMSLK 921

Query: 1108 GLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMK-KSGFFPVTNFVLQD 932
            G++KNPGCSWIE+KNKVHM+LAGDK+ P +++I+E+L +L+ EMK   G+FP TN+VLQD
Sbjct: 922  GMKKNPGCSWIEVKNKVHMILAGDKALPQVSKIMERLKRLNQEMKGAGGYFPNTNYVLQD 981

Query: 931  VEEQ-DKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREI 755
            VEEQ ++E  LCGHSEKLAVVFG+LNT +GSP+RV KNLRICGDCHA IKF+S +EGREI
Sbjct: 982  VEEQEEREGILCGHSEKLAVVFGILNTSRGSPIRVTKNLRICGDCHAVIKFISGFEGREI 1041

Query: 754  FVRDTNLFHHFKDGACSCGDFW 689
             VRDTN +HHFKDG CSCGD+W
Sbjct: 1042 SVRDTNRYHHFKDGICSCGDYW 1063


>ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phaseolus vulgaris]
            gi|561025916|gb|ESW24601.1| hypothetical protein
            PHAVU_004G144300g [Phaseolus vulgaris]
          Length = 601

 Score =  857 bits (2214), Expect = 0.0
 Identities = 409/601 (68%), Positives = 493/601 (82%)
 Frame = -2

Query: 2491 LYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVS 2312
            +Y+KC +I  A K+FD MP+ DVV  SA++AG++R G V EA  LF  M   G+E N V+
Sbjct: 1    MYLKCDRIVGARKLFDRMPERDVVVWSAMIAGYSRLGLVDEARGLFGEMRSCGVEPNLVT 60

Query: 2311 WNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLI 2132
            WNGM+AGF ++ LY EAV MF+ M  +GF PD ++ S VLP+VG LED+ +G Q+H Y+ 
Sbjct: 61   WNGMLAGFGNNGLYDEAVGMFRVMLLEGFWPDGSTVSCVLPSVGCLEDVVMGAQVHGYVT 120

Query: 2131 KHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALA 1952
            K GL  DK VVSAL+DMYGKC   KEMS+VFDE+++++IG+ NA + GLSRNG  + AL 
Sbjct: 121  KQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNGMVDAALE 180

Query: 1951 VFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACG 1772
            VF +++ Q VELNVV+WTS+IA CSQNGKD EALELFRDMQ  GV PN+VTIP L+PACG
Sbjct: 181  VFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIPSLIPACG 240

Query: 1771 NMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNA 1592
            N++AL HGK  HCFSLR+G  +DVYVGSALIDMYAKCG+I+ S+ CFD M A NLV WNA
Sbjct: 241  NISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAPNLVSWNA 300

Query: 1591 IIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEH 1412
            +I GYAMHGK KE +E+FHMMQ+SGQKPD ITFT +LSAC+Q+GLTEEGW+++NSMS EH
Sbjct: 301  VISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYYNSMSKEH 360

Query: 1411 GIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIA 1232
            GI+ ++EHYACMV LL R GKL+EAYS+I++MPFEPDACVWGALLSSCRVHNNL LGEIA
Sbjct: 361  GIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALLSSCRVHNNLSLGEIA 420

Query: 1231 AKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHM 1052
            A++LF LEP NPGNY+LLSNIYAS G+W++ + +R MMKS GLRKNPG SWIE+ +KVHM
Sbjct: 421  AEKLFPLEPANPGNYVLLSNIYASKGLWDEENRIREMMKSKGLRKNPGYSWIEVGHKVHM 480

Query: 1051 LLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVV 872
            LLAGD+SHP M  I+EKL+KL++EMKKSG+ P TNFVLQDVEEQDKEQ LCGHSEKLAVV
Sbjct: 481  LLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKTNFVLQDVEEQDKEQILCGHSEKLAVV 540

Query: 871  FGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDF 692
             GLLNT  G PL+VIKNLRIC DCHA IK +S  EGREI++RDTN FHH KDG CSCGDF
Sbjct: 541  LGLLNTSPGQPLQVIKNLRICDDCHAVIKAISRLEGREIYIRDTNRFHHIKDGVCSCGDF 600

Query: 691  W 689
            W
Sbjct: 601  W 601



 Score =  172 bits (436), Expect = 8e-40
 Identities = 98/355 (27%), Positives = 189/355 (53%), Gaps = 1/355 (0%)
 Frame = -2

Query: 2740 LQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGK 2561
            ++P++ +++ ++     +  Y+ A+ +F  ML  G  PD   V   + +   L  +  G 
Sbjct: 54   VEPNLVTWNGMLAGFGNNGLYDEAVGMFRVMLLEGFWPDGSTVSCVLPSVGCLEDVVMGA 113

Query: 2560 EVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHG 2381
            +VHG V+  GL  D FV ++L+ +Y KCG ++E  +VFD + + ++ + +A + G +R+G
Sbjct: 114  QVHGYVTKQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173

Query: 2380 YVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTS 2201
             V  A  +F+ + D  +ELN V+W  +IA  + +    EA+ +F+ M + G +P+  +  
Sbjct: 174  MVDAALEVFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIP 233

Query: 2200 SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKI 2021
            S++PA G++  L  G +IH + ++ G+  D  V SALIDMY KC   +   + FD M   
Sbjct: 234  SLIPACGNISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAP 293

Query: 2020 DIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELF 1841
            ++ + NA+++G + +G A+  + +F  ++  G + + +++T +++ C+QNG   E    +
Sbjct: 294  NLVSWNAVISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYY 353

Query: 1840 RDM-QIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALI 1679
              M +  G+ P      C++     +  L   + A+       F  D  V  AL+
Sbjct: 354  NSMSKEHGIEPKMEHYACMVTLLSRVGKL---EEAYSIIKEMPFEPDACVWGALL 405



 Score =  111 bits (277), Expect = 2e-21
 Identities = 76/298 (25%), Positives = 139/298 (46%), Gaps = 39/298 (13%)
 Frame = -2

Query: 2863 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSALIHASTKHH 2684
            Q HG++ K GL  +   V+ LL +Y       + S V D + + +I S +A +   +++ 
Sbjct: 114  QVHGYVTKQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173

Query: 2683 RYNHALRIFSR-----------------------------------MLSHGLAPDCHVVP 2609
              + AL +F+R                                   M ++G+ P+   +P
Sbjct: 174  MVDAALEVFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIP 233

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            S I AC  +SAL  GKE+H      G+  D +V ++L+ +Y KCG+I+ + + FD M  P
Sbjct: 234  SLIPACGNISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAP 293

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            ++V+ +A+++G+A HG  KE   +F  M  SG + + +++  +++    + L  E    +
Sbjct: 294  NLVSWNAVISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYY 353

Query: 2248 KKMHSQ-GFQPDETSTS---SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALI 2087
              M  + G +P     +   ++L  VG LE+ Y      + + +     D CV  AL+
Sbjct: 354  NSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAY------SIIKEMPFEPDACVWGALL 405


>ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Glycine max]
          Length = 601

 Score =  850 bits (2195), Expect = 0.0
 Identities = 403/601 (67%), Positives = 489/601 (81%)
 Frame = -2

Query: 2491 LYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVS 2312
            +Y+KC +IR+A K+FD MP+ DVV  SA+VAG++R G V EA   F  M   G+  N VS
Sbjct: 1    MYLKCDRIRDARKLFDMMPERDVVVWSAMVAGYSRLGLVDEAKEFFGEMRSGGMAPNLVS 60

Query: 2311 WNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLI 2132
            WNGM+AGF ++ LY  A+ MF+ M   GF PD ++ S VLP+VG LED  +G Q+H Y+I
Sbjct: 61   WNGMLAGFGNNGLYDVALGMFRMMLVDGFWPDGSTVSCVLPSVGCLEDAVVGAQVHGYVI 120

Query: 2131 KHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALA 1952
            K GLG DK VVSA++DMYGKC C KEMS+VFDE+++++IG+ NA + GLSRNG  + AL 
Sbjct: 121  KQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNGMVDAALE 180

Query: 1951 VFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACG 1772
            VF K + + +ELNVV+WTS+IA CSQNGKD+EALELFRDMQ  GV PN+VTIP L+PACG
Sbjct: 181  VFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIPSLIPACG 240

Query: 1771 NMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNA 1592
            N++ALMHGK  HCFSLRRG  +DVYVGSALIDMYAKCG+I+ S+CCFD M A NLV WNA
Sbjct: 241  NISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAPNLVSWNA 300

Query: 1591 IIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEH 1412
            ++ GYAMHGK KE +E+FHMM +SGQKP+L+TFT VLSAC+Q+GLTEEGW ++NSMS EH
Sbjct: 301  VMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYYNSMSEEH 360

Query: 1411 GIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIA 1232
            G + ++EHYACMV LL R GKL+EAYS+I++MPFEPDACV GALLSSCRVHNNL LGEI 
Sbjct: 361  GFEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVRGALLSSCRVHNNLSLGEIT 420

Query: 1231 AKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHM 1052
            A++LF LEP NPGNYI+LSNIYAS G+W++ + +R +MKS GLRKNPG SWIE+ +K+HM
Sbjct: 421  AEKLFLLEPTNPGNYIILSNIYASKGLWDEENRIREVMKSKGLRKNPGYSWIEVGHKIHM 480

Query: 1051 LLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVV 872
            LLAGD+SHP M  I+EKL+KL++EMKKSG+ P +NFV QDVEE DKEQ LCGHSEKLAVV
Sbjct: 481  LLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKSNFVWQDVEEHDKEQILCGHSEKLAVV 540

Query: 871  FGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDF 692
             GLLNT  G PL+VIKNLRIC DCHA IK +S  EGREI+VRDTN  HHFKDG CSCGDF
Sbjct: 541  LGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDTNRLHHFKDGVCSCGDF 600

Query: 691  W 689
            W
Sbjct: 601  W 601



 Score =  172 bits (435), Expect = 1e-39
 Identities = 101/355 (28%), Positives = 184/355 (51%), Gaps = 1/355 (0%)
 Frame = -2

Query: 2740 LQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGK 2561
            + P++ S++ ++     +  Y+ AL +F  ML  G  PD   V   + +   L     G 
Sbjct: 54   MAPNLVSWNGMLAGFGNNGLYDVALGMFRMMLVDGFWPDGSTVSCVLPSVGCLEDAVVGA 113

Query: 2560 EVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTCSALVAGFARHG 2381
            +VHG V   GL  D FV ++++ +Y KCG ++E  +VFD + + ++ + +A + G +R+G
Sbjct: 114  QVHGYVIKQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173

Query: 2380 YVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTS 2201
             V  A  +F+   D  +ELN V+W  +IA  + +    EA+ +F+ M + G +P+  +  
Sbjct: 174  MVDAALEVFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIP 233

Query: 2200 SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKI 2021
            S++PA G++  L  G +IH + ++ G+  D  V SALIDMY KC   +     FD+M   
Sbjct: 234  SLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAP 293

Query: 2020 DIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELF 1841
            ++ + NA+++G + +G A+  + +F  +   G + N+V++T +++ C+QNG   E    +
Sbjct: 294  NLVSWNAVMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYY 353

Query: 1840 RDM-QIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALI 1679
              M +  G  P      C++     +  L   + A+       F  D  V  AL+
Sbjct: 354  NSMSEEHGFEPKMEHYACMVTLLSRVGKL---EEAYSIIKEMPFEPDACVRGALL 405



 Score =  115 bits (287), Expect = 2e-22
 Identities = 78/298 (26%), Positives = 140/298 (46%), Gaps = 39/298 (13%)
 Frame = -2

Query: 2863 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSALIHASTKHH 2684
            Q HG+++K GL  +   V+ +L +Y    C  + S V D + + +I S +A +   +++ 
Sbjct: 114  QVHGYVIKQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173

Query: 2683 RYNHALRIFSR-----------------------------------MLSHGLAPDCHVVP 2609
              + AL +F++                                   M + G+ P+   +P
Sbjct: 174  MVDAALEVFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIP 233

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            S I AC  +SAL  GKE+H      G+  D +V ++L+ +Y KCG+I+ +   FD M  P
Sbjct: 234  SLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAP 293

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            ++V+ +A+++G+A HG  KE   +F  M  SG + N V++  +++    + L  E    +
Sbjct: 294  NLVSWNAVMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYY 353

Query: 2248 KKMHSQ-GFQPDETSTS---SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALI 2087
              M  + GF+P     +   ++L  VG LE+ Y      + + +     D CV  AL+
Sbjct: 354  NSMSEEHGFEPKMEHYACMVTLLSRVGKLEEAY------SIIKEMPFEPDACVRGALL 405


>gb|AAF79892.1|AC022472_1 Contains similarity to an unknown protein F28A21.160 gi|7486269 from
            Arabidopsis thaliana BAC F28A21 gi|T04867 and contains
            multiple PPR PF|01535 repeats. EST gb|AI999742 comes from
            this gene. This gene may be cut off, partial [Arabidopsis
            thaliana]
          Length = 757

 Score =  835 bits (2157), Expect(2) = 0.0
 Identities = 404/713 (56%), Positives = 534/713 (74%), Gaps = 4/713 (0%)
 Frame = -2

Query: 2956 MTRQALHLLNSSHHITLNTFISTS----ASLPQARQAHGHILKTGLSNETHFVTKLLSLY 2789
            MT+Q L L+       +    S+S    +SL +  QAH  ILK+G  N+ +   KL++ Y
Sbjct: 1    MTKQVLPLIEKIPQSIVGFLESSSYHWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60

Query: 2788 ANHLCFDDASHVLDSILQPDIFSFSALIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 2609
            +N+ CF+DA  VL SI  P I+SFS+LI+A TK   +  ++ +FSRM SHGL PD HV+P
Sbjct: 61   SNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLP 120

Query: 2608 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 2429
            +  K CA LSA + GK++H V  VSGL  D FVQ S+ H+Y++CG++ +A KVFD M   
Sbjct: 121  NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDK 180

Query: 2428 DVVTCSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 2249
            DVVTCSAL+  +AR G ++E  R+   M  SG+E N VSWNG+++GFN S  + EAV+MF
Sbjct: 181  DVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMF 240

Query: 2248 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 2069
            +K+H  GF PD+ + SSVLP+VGD E L +G  IH Y+IK GL  DKCV+SA+IDMYGK 
Sbjct: 241  QKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKS 300

Query: 2068 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1889
                 +  +F++ + ++ G CNA + GLSRNG  + AL +F+  + Q +ELNVVSWTS+I
Sbjct: 301  GHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSII 360

Query: 1888 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1709
            A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R    
Sbjct: 361  AGCAQNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420

Query: 1708 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1529
            ++V+VGSALIDMYAKCG+I  SQ  F+ MP +NLVCWN+++ G++MHGK KE + IF  +
Sbjct: 421  DNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESL 480

Query: 1528 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1349
             R+  KPD I+FTS+LSAC Q GLT+EGW +F  MS E+GIK R+EHY+CMVNLLGRAGK
Sbjct: 481  MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGK 540

Query: 1348 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1169
            L+EAY +I++MPFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+LLSNI
Sbjct: 541  LQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNI 600

Query: 1168 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 989
            YA+ GMW +VD +R  M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++
Sbjct: 601  YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660

Query: 988  SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRV 830
            S EM+KSG  P  +F L DVEEQ++EQ L GHSEKLAVVFGLLNT  G+PL+V
Sbjct: 661  SKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQV 713



 Score = 29.3 bits (64), Expect(2) = 0.0
 Identities = 15/28 (53%), Positives = 18/28 (64%)
 Frame = -3

Query: 762 EKFLSETQISFTILKTELVLVGIFGELR 679
           E+F  E QI F ILKTE V V I G+ +
Sbjct: 714 ERFSLEIQIGFIILKTEFVPVEISGDTK 741


Top