BLASTX nr result

ID: Mentha29_contig00010370 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00010370
         (1809 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus...   830   0.0  
ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   754   0.0  
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   725   0.0  
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   717   0.0  
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   713   0.0  
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   713   0.0  
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   709   0.0  
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   707   0.0  
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   702   0.0  
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   701   0.0  
ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily p...   691   0.0  
ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein...   691   0.0  
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   689   0.0  
ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prun...   687   0.0  
ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr...   660   0.0  
ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arab...   659   0.0  
ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phas...   658   0.0  
ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar...   656   0.0  
ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containi...   653   0.0  
ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containi...   648   0.0  

>gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus guttatus]
          Length = 663

 Score =  830 bits (2143), Expect = 0.0
 Identities = 424/588 (72%), Positives = 480/588 (81%), Gaps = 14/588 (2%)
 Frame = -3

Query: 1723 GNGFQPVFSFHSTPSINNSAQIPATSQLGRVTSPLSSIA-----NATHTQVWSKSGKTPV 1559
            GNGFQPV    +T     S  IPAT+       P   +A     N T     +    T V
Sbjct: 8    GNGFQPVSCLSTTRPAKISKHIPATTLATPPPPPPQRVATRAPPNGTEKHSGAPRNPTNV 67

Query: 1558 ---------ASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDV 1406
                     ASR+ +ESAI DIQ S+ L +AL RSGE+LK QDLNIVLRHFGKL RW D+
Sbjct: 68   TRNIPNNLSASRKARESAITDIQDSTELASALSRSGEVLKAQDLNIVLRHFGKLYRWKDL 127

Query: 1405 CQLFDWMQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCL 1226
             QLF+WM+QHGKTNIASYSSYIKFVGRDSN+ KA+EIYNSIKDDS ++N SVCNSTL CL
Sbjct: 128  SQLFNWMRQHGKTNIASYSSYIKFVGRDSNATKAVEIYNSIKDDSTKTNVSVCNSTLYCL 187

Query: 1225 IKCGKFHNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMD 1046
            IK GKF + LKLFNQMKQAGL PDIVTYSTLL GC KVKGGY KAMELV+E+K R L MD
Sbjct: 188  IKSGKFESGLKLFNQMKQAGLEPDIVTYSTLLSGCTKVKGGYIKAMELVQEIKCRKLQMD 247

Query: 1045 DVLYGTLISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQ 866
             V+YGTLISVCASN+Q +EAEKYF+EMKSEGHSPNVFHYSSLLNAYAIDG+YKKAD LI+
Sbjct: 248  TVIYGTLISVCASNNQREEAEKYFNEMKSEGHSPNVFHYSSLLNAYAIDGSYKKADALIE 307

Query: 865  EMGSAGLTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSG 686
            EM SAG+ LNK+ILTT LKVYV+ GLF+KSRELLD+LQ LGYAEDEMPYCLLMDGLAKSG
Sbjct: 308  EMRSAGIELNKIILTTQLKVYVKGGLFDKSRELLDQLQALGYAEDEMPYCLLMDGLAKSG 367

Query: 685  NLEEAKSVFDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILN 506
             + EAKS+FDEMR KEVKNDG+SYSIMISALCRS  I+EAK LA EFE KYDKYDVVILN
Sbjct: 368  KVPEAKSLFDEMRQKEVKNDGFSYSIMISALCRSGLIEEAKMLACEFETKYDKYDVVILN 427

Query: 505  SMLCAYCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKK 326
            SMLCAYCRSGEMENVMK M+KMDES+ISPDW+T+ ILIKYFCKE+LYLLAYRTM DMHKK
Sbjct: 428  SMLCAYCRSGEMENVMKTMKKMDESSISPDWNTFHILIKYFCKEKLYLLAYRTMVDMHKK 487

Query: 325  GHHTPEDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDA 146
            GH   EDL V L+  LG  G+++EAFSVY+MLKYSKRT+NK LH KILH L++GGL KDA
Sbjct: 488  GHQLEEDLCVFLIHHLGKTGAHAEAFSVYSMLKYSKRTINKTLHEKILHTLLAGGLFKDA 547

Query: 145  YVVVKDNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            YV+VKDNA++IS+ AI++F  +FMR GNINL+NDVIKSIH+S Y IDQ
Sbjct: 548  YVLVKDNAKYISESAIRKFTTTFMRKGNINLINDVIKSIHSSSYKIDQ 595


>ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic [Vitis vinifera]
            gi|298204537|emb|CBI23812.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  754 bits (1947), Expect = 0.0
 Identities = 385/583 (66%), Positives = 464/583 (79%), Gaps = 9/583 (1%)
 Frame = -3

Query: 1723 GNGFQPVFSFHST-PSINNSAQ-IPAT-----SQLGRVTSPL--SSIANATHTQVWSKSG 1571
            G GF+ V +  S  PS+++ A  +P+T     S L   T PL   S    TH  V  +  
Sbjct: 7    GGGFKQVITRLSPLPSLSSPASPLPSTTRAKSSHLTSATPPLHKESQIEPTHVSVTPRKR 66

Query: 1570 KTPVASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFD 1391
               V  +  ++SAI ++Q SS+L +AL R G++LKVQDLN++LRHFGKL RW D+ QLFD
Sbjct: 67   CHSVGYK-ARQSAILEVQQSSDLGSALARLGDMLKVQDLNVILRHFGKLCRWQDLSQLFD 125

Query: 1390 WMQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGK 1211
            WMQ+H K   +SYS+YIKF+G+  N  KALEIYNSI+D+S+R+N SVCNS L CLI+ GK
Sbjct: 126  WMQKHEKITFSSYSTYIKFMGKSLNPIKALEIYNSIQDESVRNNVSVCNSVLSCLIRNGK 185

Query: 1210 FHNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYG 1031
            F NSLKLF+QMKQ GL PD VTYSTLL GC KVK GY KA+ELV+EM+   L MD V+YG
Sbjct: 186  FENSLKLFHQMKQDGLRPDAVTYSTLLAGCMKVKHGYSKALELVQEMERSRLPMDSVIYG 245

Query: 1030 TLISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSA 851
            TL++VCASN++C EAE YF++MK EGH PNVFHYSSLLNAY+ DG+YKKAD L+Q+M SA
Sbjct: 246  TLLAVCASNNRCKEAENYFNQMKDEGHLPNVFHYSSLLNAYSADGDYKKADMLVQDMKSA 305

Query: 850  GLTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEA 671
            GL  NKVILTTLLKVYVR GLFEKSRELL EL+DLGYAEDEMPYCLLMDGLAKS  + EA
Sbjct: 306  GLVPNKVILTTLLKVYVRGGLFEKSRELLAELEDLGYAEDEMPYCLLMDGLAKSRRILEA 365

Query: 670  KSVFDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCA 491
            KS+F+EM+ K+VK+DGY YSIMISA CRS  + EAKQLA +FE  YDKYD+V+LN+MLCA
Sbjct: 366  KSIFEEMKKKQVKSDGYCYSIMISAFCRSGLLKEAKQLARDFEATYDKYDLVMLNTMLCA 425

Query: 490  YCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTP 311
            YCR+GEME+VM+MMRKMDE AISPDW+T+ ILIKYFCKE+LYLLAYRTMEDMH KGH   
Sbjct: 426  YCRAGEMESVMQMMRKMDELAISPDWNTFHILIKYFCKEKLYLLAYRTMEDMHNKGHQPE 485

Query: 310  EDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVK 131
            E+L  SL+  LG + ++S+AFSVYNML+YSKRTM KALH KILHILV+G LLKDAYVVVK
Sbjct: 486  EELCSSLISHLGKIRAHSQAFSVYNMLRYSKRTMCKALHEKILHILVAGRLLKDAYVVVK 545

Query: 130  DNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            DN   ISKP+IK+FA +FM+ GN+NL+NDV+K+IH S Y IDQ
Sbjct: 546  DNEGLISKPSIKKFATAFMKFGNVNLINDVMKAIHGSGYKIDQ 588


>ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Solanum lycopersicum]
          Length = 642

 Score =  725 bits (1872), Expect = 0.0
 Identities = 353/519 (68%), Positives = 435/519 (83%)
 Frame = -3

Query: 1558 ASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQ 1379
            ASR  ++S I  IQ SS+L +AL R G+ LKVQD+N++LR+FGKLNR  ++CQ+F+WMQQ
Sbjct: 57   ASRTDRQSTILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLNRRPELCQVFEWMQQ 116

Query: 1378 HGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNS 1199
            + K N+ASYSSY+KF+G+  +   A+E+Y  IKD SI+ N SVCN+ L  LIK GK  +S
Sbjct: 117  NQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKFNVSVCNAFLSSLIKNGKSESS 176

Query: 1198 LKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLIS 1019
            LKLF QMK+ GLVPD+ TYSTLL GCAKV GGY+KA+ELV+EM S GL MD V YG+L+S
Sbjct: 177  LKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQEMMSNGLEMDSVTYGSLLS 236

Query: 1018 VCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTL 839
            VCAS+ +C+EA KYF +MK EGHSPNV+HYSSLLNAY+ D NY+KA+ LI+EM SAGL L
Sbjct: 237  VCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEALIEEMRSAGLVL 296

Query: 838  NKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVF 659
            NKVI TTLLKVYV+ GLFEKS+ELL EL+ LGYA+DEMP+CLLMDGLAKSG+L EAKSVF
Sbjct: 297  NKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVF 356

Query: 658  DEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRS 479
            DEM  K+VK DGYSYSIMISA CR   +++AK+LASEFE KYDKYD+VILN+ML AYCR+
Sbjct: 357  DEMMEKQVKTDGYSYSIMISAFCRRGLLEDAKKLASEFEEKYDKYDIVILNAMLSAYCRA 416

Query: 478  GEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLA 299
            G+MENVM MM+KMD+SAISPDW+T+ ILI+YFCKE+LYLLAYRTMEDMH KGH   E L 
Sbjct: 417  GKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLC 476

Query: 298  VSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAE 119
             SL+  LG  G++SEAFSVYNML+YSKRT++ ALH  ILHIL++G LLKDAYVVVKDNA 
Sbjct: 477  SSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHENILHILIAGRLLKDAYVVVKDNAG 536

Query: 118  FISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            FIS+PAIK+F+++FMR+GN+NL+NDV+ ++H+S + IDQ
Sbjct: 537  FISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQ 575


>ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 651

 Score =  717 bits (1852), Expect = 0.0
 Identities = 353/525 (67%), Positives = 437/525 (83%)
 Frame = -3

Query: 1576 SGKTPVASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQL 1397
            S + PV SR  ++SAI  IQ SS+L +AL R G+ LKVQD+N++LR+FGKL+R  ++ Q 
Sbjct: 52   SNQFPV-SRTDRQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQA 110

Query: 1396 FDWMQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKC 1217
            F+WMQQ+ K N+ASYSSY+KF+G+  +   A+E+Y  IKD SI+ N SVCN+ L  LIK 
Sbjct: 111  FEWMQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKN 170

Query: 1216 GKFHNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVL 1037
            GK  +SLKLF QMK+ GLVPD+ TYSTLL GCAKV GGY+KA+ELV+E+ S GL MD V 
Sbjct: 171  GKSESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVT 230

Query: 1036 YGTLISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMG 857
            YG+L+SVCAS+ +C+EA KYF +MK EGHSPNV+HYSSLLNAY+ D NY+KA+ LI+EM 
Sbjct: 231  YGSLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMR 290

Query: 856  SAGLTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLE 677
            SAGL LNKVI TTLLKVYV+ GLFEKS+ELL EL+ LGYA+DEMP+CLLMDGLAKSG+L 
Sbjct: 291  SAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLL 350

Query: 676  EAKSVFDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSML 497
            EAKSVFDEM  K VK DGYSYSIMISA CRS  +++AK++ASEFE KYDKYD+VILN+ML
Sbjct: 351  EAKSVFDEMMEKHVKTDGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAML 410

Query: 496  CAYCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHH 317
             AYCR+G+MENVM MM+KMD+SAISPDW+T+ ILI+YFCKE+LYLLAYRTMEDMH KGH 
Sbjct: 411  SAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQ 470

Query: 316  TPEDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVV 137
              E L  SL+  LG  G++SEAFSVYNML+YSKRT++ ALH  ILHIL++G LLKDAYVV
Sbjct: 471  PEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVV 530

Query: 136  VKDNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            VKDNA FIS+PAIK+F+++FMR+GN+NL+NDV+ ++H+S + IDQ
Sbjct: 531  VKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQ 575


>ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 646

 Score =  713 bits (1840), Expect = 0.0
 Identities = 353/526 (67%), Positives = 437/526 (83%), Gaps = 1/526 (0%)
 Frame = -3

Query: 1576 SGKTPVASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQL 1397
            S + PV SR  ++SAI  IQ SS+L +AL R G+ LKVQD+N++LR+FGKL+R  ++ Q 
Sbjct: 52   SNQFPV-SRTDRQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQA 110

Query: 1396 FDWMQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKC 1217
            F+WMQQ+ K N+ASYSSY+KF+G+  +   A+E+Y  IKD SI+ N SVCN+ L  LIK 
Sbjct: 111  FEWMQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKN 170

Query: 1216 GKFHNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVL 1037
            GK  +SLKLF QMK+ GLVPD+ TYSTLL GCAKV GGY+KA+ELV+E+ S GL MD V 
Sbjct: 171  GKSESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVT 230

Query: 1036 YGTLISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMG 857
            YG+L+SVCAS+ +C+EA KYF +MK EGHSPNV+HYSSLLNAY+ D NY+KA+ LI+EM 
Sbjct: 231  YGSLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMR 290

Query: 856  SAGLTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLE 677
            SAGL LNKVI TTLLKVYV+ GLFEKS+ELL EL+ LGYA+DEMP+CLLMDGLAKSG+L 
Sbjct: 291  SAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLL 350

Query: 676  EAKSVFDEMRLKEVKN-DGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSM 500
            EAKSVFDEM  K VK  DGYSYSIMISA CRS  +++AK++ASEFE KYDKYD+VILN+M
Sbjct: 351  EAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAM 410

Query: 499  LCAYCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGH 320
            L AYCR+G+MENVM MM+KMD+SAISPDW+T+ ILI+YFCKE+LYLLAYRTMEDMH KGH
Sbjct: 411  LSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGH 470

Query: 319  HTPEDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYV 140
               E L  SL+  LG  G++SEAFSVYNML+YSKRT++ ALH  ILHIL++G LLKDAYV
Sbjct: 471  QPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYV 530

Query: 139  VVKDNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            VVKDNA FIS+PAIK+F+++FMR+GN+NL+NDV+ ++H+S + IDQ
Sbjct: 531  VVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQ 576


>ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 652

 Score =  713 bits (1840), Expect = 0.0
 Identities = 353/526 (67%), Positives = 437/526 (83%), Gaps = 1/526 (0%)
 Frame = -3

Query: 1576 SGKTPVASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQL 1397
            S + PV SR  ++SAI  IQ SS+L +AL R G+ LKVQD+N++LR+FGKL+R  ++ Q 
Sbjct: 52   SNQFPV-SRTDRQSAILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQA 110

Query: 1396 FDWMQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKC 1217
            F+WMQQ+ K N+ASYSSY+KF+G+  +   A+E+Y  IKD SI+ N SVCN+ L  LIK 
Sbjct: 111  FEWMQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKN 170

Query: 1216 GKFHNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVL 1037
            GK  +SLKLF QMK+ GLVPD+ TYSTLL GCAKV GGY+KA+ELV+E+ S GL MD V 
Sbjct: 171  GKSESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVT 230

Query: 1036 YGTLISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMG 857
            YG+L+SVCAS+ +C+EA KYF +MK EGHSPNV+HYSSLLNAY+ D NY+KA+ LI+EM 
Sbjct: 231  YGSLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMR 290

Query: 856  SAGLTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLE 677
            SAGL LNKVI TTLLKVYV+ GLFEKS+ELL EL+ LGYA+DEMP+CLLMDGLAKSG+L 
Sbjct: 291  SAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLL 350

Query: 676  EAKSVFDEMRLKEVKN-DGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSM 500
            EAKSVFDEM  K VK  DGYSYSIMISA CRS  +++AK++ASEFE KYDKYD+VILN+M
Sbjct: 351  EAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAM 410

Query: 499  LCAYCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGH 320
            L AYCR+G+MENVM MM+KMD+SAISPDW+T+ ILI+YFCKE+LYLLAYRTMEDMH KGH
Sbjct: 411  LSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGH 470

Query: 319  HTPEDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYV 140
               E L  SL+  LG  G++SEAFSVYNML+YSKRT++ ALH  ILHIL++G LLKDAYV
Sbjct: 471  QPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYV 530

Query: 139  VVKDNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            VVKDNA FIS+PAIK+F+++FMR+GN+NL+NDV+ ++H+S + IDQ
Sbjct: 531  VVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQ 576


>ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina]
            gi|557534005|gb|ESR45123.1| hypothetical protein
            CICLE_v10000525mg [Citrus clementina]
          Length = 660

 Score =  709 bits (1831), Expect = 0.0
 Identities = 359/580 (61%), Positives = 452/580 (77%), Gaps = 9/580 (1%)
 Frame = -3

Query: 1714 FQPVFSFHSTPSINNSAQIPATSQLGRVTSPLSSIANATHTQ---------VWSKSGKTP 1562
            F P+ S   T +++N A   AT+++         +   THT+           +  GK  
Sbjct: 16   FHPLSSSFPTATVSNWASATATARV--------VVKEQTHTEPAQQPPPAAAAAARGKRQ 67

Query: 1561 VASRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQ 1382
             +S   ++SAI ++Q SS+L ++L R G +LKV DLN +LRHFG L R  DV QLF+WMQ
Sbjct: 68   SSSYLARKSAILEVQQSSDLTSSLERLGGILKVPDLNAILRHFGDLGRGRDVLQLFEWMQ 127

Query: 1381 QHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHN 1202
            QHGKT+I+SYSSYIKF+G+  NS KALEIYNSI D+S + N  +CNS L CL++ GKF +
Sbjct: 128  QHGKTSISSYSSYIKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFES 187

Query: 1201 SLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLI 1022
            SLKLF++MKQ+GL PD VTY+TLL GC K K GY KA+ELV+E+K  G  MD+V+YG L+
Sbjct: 188  SLKLFDKMKQSGLTPDAVTYNTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYGILL 247

Query: 1021 SVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLT 842
            ++CASN+ C +A+ YF++MK EGHSPNV+HYSSLLNAY+  G+Y KAD+LIQ+M S+GL 
Sbjct: 248  AICASNNLCAKAQSYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSSGLV 307

Query: 841  LNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSV 662
             NKVILTTLLKVYVR GLFEKSRELL EL  LGYAE+EMPYCLLMDGL+K+G L+EA+ V
Sbjct: 308  PNKVILTTLLKVYVRGGLFEKSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEARVV 367

Query: 661  FDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCR 482
            F+EM+ K VK+DGY++SIMISA CR    +EAKQLA +FE KYDKYDVV+LNSMLCAYCR
Sbjct: 368  FNEMQEKCVKSDGYAHSIMISAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCAYCR 427

Query: 481  SGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDL 302
            +G+ME+VM +MRK+DE AISPD++T+ ILIKYFCKE++Y+LAYRTM DMH+KGH   E+L
Sbjct: 428  TGDMESVMHVMRKLDELAISPDYNTFHILIKYFCKEKMYILAYRTMVDMHRKGHQPEEEL 487

Query: 301  AVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNA 122
              SL+  LG M ++SEA SVYNML+YSKR+M KALH KILHIL+SG LLKDAYVVVKDN+
Sbjct: 488  CSSLIFHLGKMRAHSEALSVYNMLRYSKRSMCKALHEKILHILISGKLLKDAYVVVKDNS 547

Query: 121  EFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            E IS P IK+FA +F+R GNINLVNDV+K+IH + Y IDQ
Sbjct: 548  ESISHPVIKKFASAFVRLGNINLVNDVMKAIHTTGYRIDQ 587


>ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223531275|gb|EEF33118.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 672

 Score =  707 bits (1826), Expect = 0.0
 Identities = 348/563 (61%), Positives = 442/563 (78%), Gaps = 5/563 (0%)
 Frame = -3

Query: 1675 NNSAQIPATSQLGRVTSPLSSIANATHTQVWSKSGKTPVASRQTK-----ESAIHDIQHS 1511
            NNS +  A   L   T+  + +    H      +G+  V  R +K     ++AI ++Q S
Sbjct: 39   NNSQRSSAV--LSTNTTTETPLLKQPHNNEQPPNGQFHVQRRHSKSYLARQAAILEVQQS 96

Query: 1510 SNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTNIASYSSYIKFV 1331
             +L++AL R G +LK QDLN++LR+ GK +RW D+ +LFDWMQQH K +++SY+SY+KF+
Sbjct: 97   PDLDSALRRLGAILKAQDLNVILRNLGKQSRWQDLSKLFDWMQQHSKISVSSYTSYMKFM 156

Query: 1330 GRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFNQMKQAGLVPDI 1151
            G+  N AKALEIYNSI D+S+++N  +CNS L CL++ GKF  SLKLF++MKQ GL PD 
Sbjct: 157  GKSLNPAKALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLFHKMKQNGLTPDT 216

Query: 1150 VTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASNHQCDEAEKYFD 971
            +TYSTLL GC K K GY K ++ V+E+K  GL MD V+YGT+++VCAS+++C+EAE YF 
Sbjct: 217  ITYSTLLSGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCASHNRCEEAESYFS 276

Query: 970  EMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVILTTLLKVYVRAG 791
            +MK+EGH PNVFHYSSLLNAYA  GNYKKA++L+Q+M S GL  NKVI TTLLKVYVR G
Sbjct: 277  QMKNEGHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSLGLVPNKVIWTTLLKVYVRGG 336

Query: 790  LFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRLKEVKNDGYSYS 611
            LFEKS++LL EL+ LGYAEDEMPYCLLMDGL+K+G ++EA+S FDEM+ K VK+DGY+YS
Sbjct: 337  LFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNVKSDGYAYS 396

Query: 610  IMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMENVMKMMRKMDES 431
            IMISA CR + ++EAKQLA EFE KYDKYDVVILN+MLCAYCR+G+ME+VM+ MRKMDE 
Sbjct: 397  IMISAYCRGRLLEEAKQLAKEFEAKYDKYDVVILNTMLCAYCRAGDMESVMQTMRKMDEL 456

Query: 430  AISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMKSLGSMGSYSEA 251
            AISP + T+ ILIKYFCK++LYLLAY+TMEDMH+KGH   E+L   L+  LG   +Y+EA
Sbjct: 457  AISPSYCTFHILIKYFCKQKLYLLAYQTMEDMHRKGHQPEEELCSMLIFHLGKAKAYTEA 516

Query: 250  FSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKPAIKRFAISFMR 71
            FSVY MLKY KRTM KALH KILH+L+ G LLKDAYVVVKDNAE IS+ AIK+FA +FM+
Sbjct: 517  FSVYTMLKYGKRTMCKALHEKILHVLLGGQLLKDAYVVVKDNAELISQAAIKKFANAFMK 576

Query: 70   NGNINLVNDVIKSIHNSDYNIDQ 2
             GNINL+NDV+K IH+S Y IDQ
Sbjct: 577  LGNINLINDVMKVIHSSGYKIDQ 599


>ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa]
            gi|550347847|gb|EEE84472.2| hypothetical protein
            POPTR_0001s21880g [Populus trichocarpa]
          Length = 673

 Score =  702 bits (1813), Expect = 0.0
 Identities = 350/580 (60%), Positives = 450/580 (77%), Gaps = 14/580 (2%)
 Frame = -3

Query: 1699 SFHSTPSINNSAQIPATSQLGRVTSPLSSIAN---------ATHTQVWSKSGKTPVASRQ 1547
            SF   PS++ ++    TS      +  +S+ N         AT T+   K G      RQ
Sbjct: 23   SFPLIPSVSTTSSCQRTSAAAAAAAATTSLVNEPCNDDSQPATTTRRRPKGGAVDAQRRQ 82

Query: 1546 TK-----ESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQ 1382
            +K     ++AI ++Q S +L++AL R G +LKVQDLNI+LR+FG+  RW D+ QLFDWMQ
Sbjct: 83   SKSYMSRKAAILEVQQSPHLDSALQRLGGMLKVQDLNIILRNFGEQCRWQDLSQLFDWMQ 142

Query: 1381 QHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHN 1202
            +H K + +SYSSYIKF+G   N AKALEIY+SI D+S ++N  +CNS L CL++  KF +
Sbjct: 143  RHNKISASSYSSYIKFMGTSLNPAKALEIYHSIPDESKKTNVFICNSLLRCLVRNTKFDS 202

Query: 1201 SLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLI 1022
            S+K F++MK  GL PD +TYSTLL GC K+K GY KA++LV+E+   GL MD ++YGTL+
Sbjct: 203  SMKFFHKMKNNGLTPDAITYSTLLAGCMKIKDGYSKALDLVQELNYNGLQMDSIMYGTLL 262

Query: 1021 SVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLT 842
            +VCASN++C+EA+ YF++MK EGHSPN+FHYSSLLNAY+ DGNYKKA++L+Q+M S+GL 
Sbjct: 263  AVCASNNRCEEAQSYFNQMKDEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSSGLV 322

Query: 841  LNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSV 662
             NKVILTTLLKVYVR GLFEKSR+LL EL  LG+A++EMPYCLLMDGLAK+G L+EA+SV
Sbjct: 323  PNKVILTTLLKVYVRGGLFEKSRDLLVELDTLGFAKNEMPYCLLMDGLAKNGLLDEARSV 382

Query: 661  FDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCR 482
            F+EM+ K VK+ GYSYSIMIS+ CR    +EAK+LA EFE KYDKYDVVILN++LCAYCR
Sbjct: 383  FNEMKEKRVKSGGYSYSIMISSFCRGGLFEEAKELAEEFEAKYDKYDVVILNTILCAYCR 442

Query: 481  SGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDL 302
            +GE E+VM+ MRKMDE AISPD++T+ ILIKYFCKE+LY+LAY+TMEDMH+KGH   E+L
Sbjct: 443  TGEKESVMRTMRKMDELAISPDYNTFHILIKYFCKEKLYMLAYQTMEDMHRKGHQPMEEL 502

Query: 301  AVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNA 122
              SL+  LG + +++EAFSVY+MLK SKRTM+KA H  ILHIL++G LLKDAYVVVKDNA
Sbjct: 503  CSSLILHLGKIKAHAEAFSVYSMLKSSKRTMSKAFHEDILHILIAGRLLKDAYVVVKDNA 562

Query: 121  EFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            E IS  AIK+FA SF++ G+INL+NDV+K IH S Y IDQ
Sbjct: 563  ELISPAAIKKFASSFVKLGDINLINDVMKVIHGSGYKIDQ 602


>ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cucumis sativus]
          Length = 668

 Score =  701 bits (1810), Expect = 0.0
 Identities = 336/514 (65%), Positives = 429/514 (83%)
 Frame = -3

Query: 1543 KESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTN 1364
            ++SAI  ++  S L  AL R G LLK QDLN++LRHFG L+RW D+ QLF+WMQ+ GKTN
Sbjct: 77   RQSAIAQVKDCSELAPALARYGGLLKAQDLNVILRHFGMLSRWKDLSQLFEWMQETGKTN 136

Query: 1363 IASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFN 1184
            ++SYSSYIKF+GR  N  KALE+YN+I++ SI+++  +CNS L CL++ GKF  S+KLF+
Sbjct: 137  VSSYSSYIKFMGRGLNPLKALEVYNNIEEVSIKNSIFICNSILNCLVRNGKFDTSVKLFH 196

Query: 1183 QMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASN 1004
            QMK  GL PD VTYST+L GC +VK GY KAMEL++E++  GL MD V YGTLI++CAS+
Sbjct: 197  QMKNDGLCPDTVTYSTMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASH 256

Query: 1003 HQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVIL 824
            ++ ++AE++F++M++EGHSPN+FHY SLLNAY+I+G+YKKAD+LI++M   GL  NKVIL
Sbjct: 257  NRLEDAERFFNQMRAEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVIL 316

Query: 823  TTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRL 644
            TTLLKVYVR GLFEKSR+LL EL+ LGY E+EMPYCLLMDGLAK+G++ EAK+VFDEM+ 
Sbjct: 317  TTLLKVYVRGGLFEKSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREAKTVFDEMKA 376

Query: 643  KEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMEN 464
            K VK DGY++SIMISA CR   ++EAK LA +FE  YD+YD+VILN+MLCAYCR+GEME+
Sbjct: 377  KNVKTDGYAHSIMISAFCRGGLLEEAKLLAKDFEATYDRYDIVILNTMLCAYCRAGEMES 436

Query: 463  VMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMK 284
            VM+M+RKMD+ AISPD++T+ ILIKYF KE+LYLL YRT+EDMH+KGH   E+L  SL+ 
Sbjct: 437  VMQMLRKMDDLAISPDYNTFHILIKYFFKEKLYLLCYRTLEDMHRKGHQPEEELCSSLIL 496

Query: 283  SLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKP 104
            SLG++ +YSEAFSVYN+LKYSKRTM KALH KILHIL++G LLKDAYVVVKDNA  ISKP
Sbjct: 497  SLGNIRAYSEAFSVYNILKYSKRTMCKALHEKILHILIAGRLLKDAYVVVKDNAGVISKP 556

Query: 103  AIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            AI++FA  FM+ GN+NL+NDV+K+IH S Y IDQ
Sbjct: 557  AIRKFAFGFMKFGNVNLINDVMKAIHGSGYKIDQ 590


>ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao] gi|508706163|gb|EOX98059.1|
            Pentatricopeptide repeat (PPR) superfamily protein
            isoform 2 [Theobroma cacao]
          Length = 649

 Score =  691 bits (1783), Expect = 0.0
 Identities = 349/581 (60%), Positives = 451/581 (77%), Gaps = 7/581 (1%)
 Frame = -3

Query: 1723 GNGFQPVFSFHSTPSINNSAQIPATSQLGRVTSPLSSIAN---ATHTQVWSKSGKTPVA- 1556
            G GF       + PS ++  +IP+ S      +P +SI+N   AT T         P   
Sbjct: 8    GTGFH--LQILTFPSPSSFPRIPSLSP----PTPRASISNLNSATSTTATPVKEPNPTRP 61

Query: 1555 ---SRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWM 1385
               S   ++SA+ ++Q SS+L +AL   G +LK QDLN+++RHFGKL +W  + +LF WM
Sbjct: 62   HSKSYLQRKSALLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGKWHHLSELFAWM 121

Query: 1384 QQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFH 1205
            QQHGKTN +SYSSYIK +G+  +  KALEIYNSI D+S R N  +CNS L  L++ GKF 
Sbjct: 122  QQHGKTNGSSYSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFE 181

Query: 1204 NSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTL 1025
            + +KLF++MKQ GL PD VTY+TLL GC K+K G+ KA+EL++E+K  GL MD V+YGTL
Sbjct: 182  SGIKLFDKMKQDGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNGLKMDSVMYGTL 241

Query: 1024 ISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGL 845
            ++VCAS+   +EA+ YF++M+ EGHSPN++HYSSLLNAY+ DGNY KAD+L+++M S+GL
Sbjct: 242  LAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSSGL 301

Query: 844  TLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKS 665
              NKVILTTLLKVYVR GLFEKS +LL EL+ LGYAEDEMP+CLLMDGL+K+G L+EA+S
Sbjct: 302  VPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEARS 361

Query: 664  VFDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYC 485
            VF EM+ K VK+DGYS+SIMISALCR+   +EAK+LA +FE +Y+KYD+V+LN+MLCAYC
Sbjct: 362  VFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCAYC 421

Query: 484  RSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPED 305
            R+GEME+VM+ M+KMDE AISPD++T+ ILIKYFCKE+LYLLAY+TMEDMH KG+H  E+
Sbjct: 422  RAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPEEE 481

Query: 304  LAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDN 125
            L  SL+  LG M ++ EAFSVYNML+YSKRTM KALH KILHIL++G LLKDAYVVVKDN
Sbjct: 482  LCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVKDN 541

Query: 124  AEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            AE IS+PAI +FA +FM+ GNIN++NDV+K +H S Y IDQ
Sbjct: 542  AELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQ 582


>ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508706162|gb|EOX98058.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 717

 Score =  691 bits (1783), Expect = 0.0
 Identities = 349/581 (60%), Positives = 451/581 (77%), Gaps = 7/581 (1%)
 Frame = -3

Query: 1723 GNGFQPVFSFHSTPSINNSAQIPATSQLGRVTSPLSSIAN---ATHTQVWSKSGKTPVA- 1556
            G GF       + PS ++  +IP+ S      +P +SI+N   AT T         P   
Sbjct: 8    GTGFH--LQILTFPSPSSFPRIPSLSP----PTPRASISNLNSATSTTATPVKEPNPTRP 61

Query: 1555 ---SRQTKESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWM 1385
               S   ++SA+ ++Q SS+L +AL   G +LK QDLN+++RHFGKL +W  + +LF WM
Sbjct: 62   HSKSYLQRKSALLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGKWHHLSELFAWM 121

Query: 1384 QQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFH 1205
            QQHGKTN +SYSSYIK +G+  +  KALEIYNSI D+S R N  +CNS L  L++ GKF 
Sbjct: 122  QQHGKTNGSSYSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFE 181

Query: 1204 NSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTL 1025
            + +KLF++MKQ GL PD VTY+TLL GC K+K G+ KA+EL++E+K  GL MD V+YGTL
Sbjct: 182  SGIKLFDKMKQDGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNGLKMDSVMYGTL 241

Query: 1024 ISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGL 845
            ++VCAS+   +EA+ YF++M+ EGHSPN++HYSSLLNAY+ DGNY KAD+L+++M S+GL
Sbjct: 242  LAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSSGL 301

Query: 844  TLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKS 665
              NKVILTTLLKVYVR GLFEKS +LL EL+ LGYAEDEMP+CLLMDGL+K+G L+EA+S
Sbjct: 302  VPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEARS 361

Query: 664  VFDEMRLKEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYC 485
            VF EM+ K VK+DGYS+SIMISALCR+   +EAK+LA +FE +Y+KYD+V+LN+MLCAYC
Sbjct: 362  VFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCAYC 421

Query: 484  RSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPED 305
            R+GEME+VM+ M+KMDE AISPD++T+ ILIKYFCKE+LYLLAY+TMEDMH KG+H  E+
Sbjct: 422  RAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPEEE 481

Query: 304  LAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDN 125
            L  SL+  LG M ++ EAFSVYNML+YSKRTM KALH KILHIL++G LLKDAYVVVKDN
Sbjct: 482  LCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVKDN 541

Query: 124  AEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            AE IS+PAI +FA +FM+ GNIN++NDV+K +H S Y IDQ
Sbjct: 542  AELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQ 582


>ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 642

 Score =  689 bits (1777), Expect = 0.0
 Identities = 334/514 (64%), Positives = 415/514 (80%)
 Frame = -3

Query: 1543 KESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTN 1364
            ++SAI  +QHSS+LE+AL R G  L VQDLN ++RHFG L RW D+ QLF+WMQQ+GK +
Sbjct: 58   RQSAILQVQHSSDLESALTRLGGSLNVQDLNAIIRHFGMLKRWHDLSQLFEWMQQNGKVS 117

Query: 1363 IASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFN 1184
             +SYSSYIKF+G+  N  KALEIYNSI+D+S + N  +CNS L  L++ GKF  S+KLF+
Sbjct: 118  ASSYSSYIKFMGKSLNPVKALEIYNSIQDESTKKNVHICNSVLGSLVRSGKFDGSIKLFH 177

Query: 1183 QMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASN 1004
            QMKQ GL PD VTYSTLL GC K K GY KA+ELV+E+++  L MD V+YGTL+++CASN
Sbjct: 178  QMKQDGLTPDAVTYSTLLAGCIKFKHGYSKALELVQELQNNELQMDSVIYGTLLAICASN 237

Query: 1003 HQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVIL 824
            ++ +EAE YF +MK EGH PN FHYSSLLNAY+I GNYKKAD ++Q+M SAGL  NKV L
Sbjct: 238  NKWEEAESYFKQMKDEGHLPNEFHYSSLLNAYSISGNYKKADDVVQDMKSAGLVPNKVTL 297

Query: 823  TTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRL 644
            TTLLK YVR GLFEKSRELL EL+ LGYAEDEMPYC+LMD  AK+G +E+AK VFDE++ 
Sbjct: 298  TTLLKAYVRGGLFEKSRELLTELEALGYAEDEMPYCILMDAFAKAGRIEDAKLVFDEIKE 357

Query: 643  KEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMEN 464
            K V++DGYSYSIMISA CR   +D+AKQLA +FE  YDKYD+V+LN+M+CAYCR+GEM++
Sbjct: 358  KSVRSDGYSYSIMISAFCRGGLVDDAKQLAKDFERTYDKYDLVMLNTMICAYCRAGEMDS 417

Query: 463  VMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMK 284
            VM+M+RKMDE  I+PD +T+ ILIKYFCKE+LY+LAY+TMEDMH KG+   E+L  SLM 
Sbjct: 418  VMEMLRKMDELKITPDNNTFHILIKYFCKEKLYMLAYKTMEDMHNKGYPPDEELCSSLMF 477

Query: 283  SLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKP 104
             LG + +YSEA+S+YN+L+YSKRTM KALH KILHILV+G LLKDAYVVVKDN   ISK 
Sbjct: 478  HLGKIRAYSEAYSIYNILRYSKRTMCKALHEKILHILVAGRLLKDAYVVVKDNPRLISKA 537

Query: 103  AIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            A  +FA +FM+ GNINL+NDV+K+I  S   IDQ
Sbjct: 538  ATMKFATAFMKLGNINLINDVLKAIDGSGCKIDQ 571


>ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica]
            gi|462422086|gb|EMJ26349.1| hypothetical protein
            PRUPE_ppa002505mg [Prunus persica]
          Length = 664

 Score =  687 bits (1772), Expect = 0.0
 Identities = 335/514 (65%), Positives = 421/514 (81%)
 Frame = -3

Query: 1543 KESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTN 1364
            ++SAI ++Q SS+L++AL R G  LKVQDLN ++RHFG L RW D+ QLF+WMQQ+GK +
Sbjct: 74   RQSAILEVQESSDLDSALTRLGGSLKVQDLNAIIRHFGILKRWHDLSQLFEWMQQNGKIS 133

Query: 1363 IASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFN 1184
             +SYSSYIKF+G+  N  KALEIYN+I+D S + N  +CNS L  LI+ GKF  S KLF+
Sbjct: 134  ASSYSSYIKFMGKSLNPVKALEIYNNIQDASTKKNVHICNSVLGSLIRSGKFDGSFKLFH 193

Query: 1183 QMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASN 1004
            QMKQ GL PD VTYSTLL GC KVK GY KA+ELV+E++   L MD V+YGTL++VCASN
Sbjct: 194  QMKQDGLTPDAVTYSTLLAGCNKVKHGYSKALELVQELQRNELQMDSVIYGTLLAVCASN 253

Query: 1003 HQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVIL 824
            ++ +EAE YF +MK+EG+ PNVFHYS++LNAY+I GNYK+AD L+Q+M SAGL  NKVIL
Sbjct: 254  NKLEEAEGYFKQMKNEGYLPNVFHYSAMLNAYSISGNYKEADDLVQDMKSAGLVPNKVIL 313

Query: 823  TTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRL 644
            TTLLKVYVR GLFEKSRELL EL+ LGYAEDEMPYCLLMD LAK+G + EAK VFDEM+ 
Sbjct: 314  TTLLKVYVRGGLFEKSRELLAELEALGYAEDEMPYCLLMDALAKAGRIHEAKLVFDEMKE 373

Query: 643  KEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMEN 464
            K ++++GYSYSIMISA CR   +++AKQL+ + E  +DK+D+V+LN+M+CAYCR+GEM++
Sbjct: 374  KSIRSNGYSYSIMISAFCRGGLLEDAKQLSKDVERTHDKFDLVMLNTMICAYCRAGEMDS 433

Query: 463  VMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMK 284
            VM+MMRKMDE  I+PD++T+ ILIKYFCKE+LYLLAY+TMEDMH KGH   E+L  SLM 
Sbjct: 434  VMEMMRKMDEQKITPDYNTFHILIKYFCKEKLYLLAYQTMEDMHNKGHQPDEELCSSLMF 493

Query: 283  SLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKP 104
             LG + +YSEA+SVYN+L+YSKRTM KALH KILHIL++G LLKDAYVVVKDNA  ISKP
Sbjct: 494  LLGKIRAYSEAYSVYNILRYSKRTMCKALHEKILHILLAGQLLKDAYVVVKDNAGLISKP 553

Query: 103  AIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            A+K+F+ +F++ GNINL+NDV+K I  S   IDQ
Sbjct: 554  AVKKFSTAFLKLGNINLINDVLKVIDASGCKIDQ 587


>ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum]
            gi|557095175|gb|ESQ35757.1| hypothetical protein
            EUTSA_v10007006mg [Eutrema salsugineum]
          Length = 666

 Score =  660 bits (1704), Expect = 0.0
 Identities = 329/574 (57%), Positives = 437/574 (76%), Gaps = 6/574 (1%)
 Frame = -3

Query: 1705 VFSFHSTPSINNSAQIPA------TSQLGRVTSPLSSIANATHTQVWSKSGKTPVASRQT 1544
            +F+ +S PS  +S  +PA      T+      + +S++A +  T   +   K    S  T
Sbjct: 15   LFTTNSRPS--SSFSLPAVSLRTLTAATATSAAAVSTVAESPATVAEASRSKRHSKSYLT 72

Query: 1543 KESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTN 1364
            ++SAI +++ S +  ++L R   +LKVQDLN++LR FG   RW D+ QLFDWMQQ GK +
Sbjct: 73   RKSAISEVERSPDFLSSLQRLAGVLKVQDLNVILRDFGISGRWQDLIQLFDWMQQQGKIS 132

Query: 1363 IASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFN 1184
            +++YSS IKFVG  S S KALEIY SI D+S + N  +CNS L CL+K GK  +  KLF+
Sbjct: 133  VSTYSSCIKFVGAKSVS-KALEIYQSIPDESTKINVYICNSILSCLVKNGKLESCFKLFD 191

Query: 1183 QMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASN 1004
            QMK+ GL PD++TY+TLL GC KVK GY KAMELV E+   G+ MD V+YGT++++CASN
Sbjct: 192  QMKRDGLKPDVITYNTLLAGCIKVKNGYSKAMELVGELPHNGIQMDGVMYGTVLAICASN 251

Query: 1003 HQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVIL 824
             +C+EAE +  +MK +GHSPN++HYSSLLN+Y+  G+YKKAD+L+ EM S G+  NKV++
Sbjct: 252  GRCEEAESFIQQMKVKGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSVGIVPNKVMM 311

Query: 823  TTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRL 644
            TTLLKVY+R GLFE+SRELL EL+  GYAE+EMPYC+LMDGL+K+G  EEA+S+FDEM+ 
Sbjct: 312  TTLLKVYIRGGLFERSRELLSELESAGYAENEMPYCMLMDGLSKAGKFEEARSIFDEMKG 371

Query: 643  KEVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMEN 464
            K VK+DGY+ SIMISALCRSK  +EAKQLA + E  Y+K D+V+LN+MLCAYCR+GEME+
Sbjct: 372  KGVKSDGYANSIMISALCRSKRFEEAKQLARDSESTYEKCDLVMLNTMLCAYCRAGEMES 431

Query: 463  VMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMK 284
            VM+MM+KMDE A+SPD++T+ ILIKYF KE+L+LLAY+T+ DMH KGH   E+L  SL+ 
Sbjct: 432  VMRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTLLDMHSKGHRLEEELCSSLIY 491

Query: 283  SLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKP 104
             LG + ++SEAFSVY+ML+YSKRT+ K LH KILHIL+ G LLKDAYVVVKDNA+ IS+P
Sbjct: 492  HLGKIRAHSEAFSVYSMLRYSKRTICKDLHEKILHILIHGKLLKDAYVVVKDNAKMISQP 551

Query: 103  AIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
             +KRF  +FM +GN+NLVNDV+K +H S + IDQ
Sbjct: 552  TLKRFGRAFMNSGNVNLVNDVLKVLHGSGHKIDQ 585


>ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arabidopsis lyrata subsp.
            lyrata] gi|297335683|gb|EFH66100.1| hypothetical protein
            ARALYDRAFT_888388 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  659 bits (1701), Expect = 0.0
 Identities = 330/572 (57%), Positives = 439/572 (76%), Gaps = 4/572 (0%)
 Frame = -3

Query: 1705 VFSFHSTPSINNSAQIPATSQLGRVTSPLSSIANAT----HTQVWSKSGKTPVASRQTKE 1538
            +F+ HS PS  +S  +PA S L  +TS  ++ + A      T   +   K    S  T++
Sbjct: 18   LFNTHSRPS--SSLSLPALS-LRILTSTAATTSTAVVESPATVAGAPRSKRHSNSYLTRK 74

Query: 1537 SAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTNIA 1358
            SAI ++Q SS+  ++L R   +LKVQDLN++LR FG   RW D+ QLFDWMQQHGK +++
Sbjct: 75   SAISEVQRSSDFLSSLHRLERVLKVQDLNVILRDFGISGRWQDLIQLFDWMQQHGKISVS 134

Query: 1357 SYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFNQM 1178
            +YSS IKFVG   N +KALEIY SI D+S + N  +CNS L CL+K GK  + +KLF+QM
Sbjct: 135  TYSSCIKFVGA-KNVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQM 193

Query: 1177 KQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASNHQ 998
            K+ GL PD++TY+TLL GC KVK GY KA+EL+ E+   G+ MD V+YGT++++CASN +
Sbjct: 194  KRGGLKPDVITYNTLLAGCIKVKNGYPKAVELIGELPHNGIQMDSVMYGTVLAICASNGR 253

Query: 997  CDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVILTT 818
            C+EAE +  +MK+EGHSPN++HYSSLLN+Y+  G+YKKAD+L+ EM S GL  NKV++TT
Sbjct: 254  CEEAENFIQQMKAEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTT 313

Query: 817  LLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRLKE 638
            LLKVY++ GLF++SRELL EL+  GYAE+EMPYC+LMDGL+K+G LEEA+S+FD+M+ K 
Sbjct: 314  LLKVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKG 373

Query: 637  VKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMENVM 458
            VK+DGY+ SIMISALCRSK  +EAK+L+ + E  Y+K D+V+LN+MLCAYCR+GEME+VM
Sbjct: 374  VKSDGYANSIMISALCRSKRFEEAKELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVM 433

Query: 457  KMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMKSL 278
            +MM+KMDE AI PD++T+ ILIKYF KE+L+LLAY+T  DMH KGH   E+L  SL+  L
Sbjct: 434  RMMKKMDEQAIIPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHL 493

Query: 277  GSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKPAI 98
            G + + SEAFSVYNML+YSKRT+ K LH KILHIL+ G LLKDAY+VVKDNA+ IS+P +
Sbjct: 494  GKIRAPSEAFSVYNMLRYSKRTICKELHEKILHILIHGDLLKDAYIVVKDNAKMISQPTL 553

Query: 97   KRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            K+F  +FM +GNINLVNDV+K +H S + IDQ
Sbjct: 554  KKFGRAFMISGNINLVNDVLKVLHGSGHKIDQ 585


>ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris]
            gi|561021735|gb|ESW20506.1| hypothetical protein
            PHAVU_006G214900g [Phaseolus vulgaris]
          Length = 639

 Score =  658 bits (1697), Expect = 0.0
 Identities = 327/552 (59%), Positives = 431/552 (78%), Gaps = 8/552 (1%)
 Frame = -3

Query: 1633 VTSPLSSIANATHTQVWSKSG--KTPVASRQTK------ESAIHDIQHSSNLEAALLRSG 1478
            + SPL+  A+A+  +  S++   +  V  R +K      +SA  +IQ SS+L +AL R G
Sbjct: 16   IPSPLAIPASASTAEPLSQTPPHRNSVKLRSSKPFPSARKSATLEIQRSSDLPSALARLG 75

Query: 1477 ELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTNIASYSSYIKFVGRDSNSAKALE 1298
            E L V+DLN  L HF   N++  + QLF WMQ++ K +++SYS Y++F+  + ++A+ L+
Sbjct: 76   ETLTVKDLNAALYHFKNSNKFNHISQLFKWMQENNKLDVSSYSHYMRFMANNLDAAEMLQ 135

Query: 1297 IYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFNQMKQAGLVPDIVTYSTLLLGCA 1118
            +Y+SI+D+S R N  VCNS L CLIK GKF + +KLF QM+  GLVPD VTYSTLL GC 
Sbjct: 136  LYHSIQDESARKNILVCNSVLGCLIKKGKFDSGMKLFRQMQLDGLVPDPVTYSTLLAGCI 195

Query: 1117 KVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASNHQCDEAEKYFDEMKSEGHSPNV 938
            K++ GY KA+EL++E++   L MD V+YGT+++VCASN + +EAEKYF++MK EGHS NV
Sbjct: 196  KIENGYPKALELIQELQHSKLQMDGVIYGTILAVCASNGKWEEAEKYFNQMKDEGHSRNV 255

Query: 937  FHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVILTTLLKVYVRAGLFEKSRELLDE 758
            +HYSSLLNAY+  GNYKKAD L Q+M S GL  NKVILTTLLKVYV+ GLF+KSRELL E
Sbjct: 256  YHYSSLLNAYSTCGNYKKADILFQDMKSEGLVPNKVILTTLLKVYVKGGLFDKSRELLAE 315

Query: 757  LQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRLKEVKNDGYSYSIMISALCRSKF 578
            L+ LGYAEDEMPYC+LMDGLAK+G + EAK +FDEM    V++DGY++SIMISALCRSK 
Sbjct: 316  LKSLGYAEDEMPYCILMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISALCRSKL 375

Query: 577  IDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMENVMKMMRKMDESAISPDWSTYKI 398
              EAKQLA +FE   +KYD+VILNSMLCA+CR GEME+VM+ ++KMDE AISP ++T+ I
Sbjct: 376  FREAKQLAKDFETTSNKYDIVILNSMLCAFCRVGEMESVMETLKKMDELAISPSYNTFHI 435

Query: 397  LIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMKSLGSMGSYSEAFSVYNMLKYSK 218
            LIKYFC+E++YLLAYRTM+DMH KGH   E+L  +L+  LG + +YSEAFSVYNML+Y K
Sbjct: 436  LIKYFCREKMYLLAYRTMKDMHSKGHQPGEELCSTLISHLGQVNAYSEAFSVYNMLRYGK 495

Query: 217  RTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKPAIKRFAISFMRNGNINLVNDVI 38
            RTM K+LH KIL+IL++G LLKDAYVVVKDNA++IS+P  K+FAI+FM++GNIN +NDV+
Sbjct: 496  RTMCKSLHEKILYILLAGHLLKDAYVVVKDNAKYISRPPTKKFAIAFMKSGNINYINDVL 555

Query: 37   KSIHNSDYNIDQ 2
            K++H+S Y +DQ
Sbjct: 556  KTLHDSGYKLDQ 567


>ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g10910, chloroplastic; Flags: Precursor
            gi|110741600|dbj|BAE98748.1| membrane-associated
            salt-inducible protein isolog [Arabidopsis thaliana]
            gi|332190541|gb|AEE28662.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 664

 Score =  656 bits (1692), Expect = 0.0
 Identities = 324/573 (56%), Positives = 440/573 (76%), Gaps = 5/573 (0%)
 Frame = -3

Query: 1705 VFSFHSTPSINNSAQIPATSQLGRVTSPLSSIANATHTQVWSKSGKTPVASRQT-----K 1541
            +F+ HS PS  +S  IPA S   R+ +P ++  ++   ++ +   + P + R +     +
Sbjct: 17   LFNTHSRPS--SSLSIPALSL--RILTPTAATTSSAVIELPANVAEAPRSKRHSNSYLAR 72

Query: 1540 ESAIHDIQHSSNLEAALLRSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTNI 1361
            +SAI ++Q SS+  ++L R   +LKVQDLN++LR FG   RW D+ QLF+WMQQHGK ++
Sbjct: 73   KSAISEVQRSSDFLSSLQRLATVLKVQDLNVILRDFGISGRWQDLIQLFEWMQQHGKISV 132

Query: 1360 ASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFNQ 1181
            ++YSS IKFVG   N +KALEIY SI D+S + N  +CNS L CL+K GK  + +KLF+Q
Sbjct: 133  STYSSCIKFVGA-KNVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQ 191

Query: 1180 MKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASNH 1001
            MK+ GL PD+VTY+TLL GC KVK GY KA+EL+ E+   G+ MD V+YGT++++CASN 
Sbjct: 192  MKRDGLKPDVVTYNTLLAGCIKVKNGYPKAIELIGELPHNGIQMDSVMYGTVLAICASNG 251

Query: 1000 QCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVILT 821
            + +EAE +  +MK EGHSPN++HYSSLLN+Y+  G+YKKAD+L+ EM S GL  NKV++T
Sbjct: 252  RSEEAENFIQQMKVEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMT 311

Query: 820  TLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRLK 641
            TLLKVY++ GLF++SRELL EL+  GYAE+EMPYC+LMDGL+K+G LEEA+S+FD+M+ K
Sbjct: 312  TLLKVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGK 371

Query: 640  EVKNDGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMENV 461
             V++DGY+ SIMISALCRSK   EAK+L+ + E  Y+K D+V+LN+MLCAYCR+GEME+V
Sbjct: 372  GVRSDGYANSIMISALCRSKRFKEAKELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESV 431

Query: 460  MKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMKS 281
            M+MM+KMDE A+SPD++T+ ILIKYF KE+L+LLAY+T  DMH KGH   E+L  SL+  
Sbjct: 432  MRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYH 491

Query: 280  LGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKPA 101
            LG + + +EAFSVYNML+YSKRT+ K LH KILHIL+ G LLKDAY+VVKDNA+ IS+P 
Sbjct: 492  LGKIRAQAEAFSVYNMLRYSKRTICKELHEKILHILIQGNLLKDAYIVVKDNAKMISQPT 551

Query: 100  IKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            +K+F  +FM +GNINLVNDV+K +H S + IDQ
Sbjct: 552  LKKFGRAFMISGNINLVNDVLKVLHGSGHKIDQ 584


>ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X4 [Solanum tuberosum]
          Length = 539

 Score =  653 bits (1684), Expect = 0.0
 Identities = 320/463 (69%), Positives = 390/463 (84%), Gaps = 1/463 (0%)
 Frame = -3

Query: 1387 MQQHGKTNIASYSSYIKFVGRDSNSAKALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKF 1208
            MQQ+ K N+ASYSSY+KF+G+  +   A+E+Y  IKD SI+ N SVCN+ L  LIK GK 
Sbjct: 1    MQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKS 60

Query: 1207 HNSLKLFNQMKQAGLVPDIVTYSTLLLGCAKVKGGYFKAMELVREMKSRGLSMDDVLYGT 1028
             +SLKLF QMK+ GLVPD+ TYSTLL GCAKV GGY+KA+ELV+E+ S GL MD V YG+
Sbjct: 61   ESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGS 120

Query: 1027 LISVCASNHQCDEAEKYFDEMKSEGHSPNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAG 848
            L+SVCAS+ +C+EA KYF +MK EGHSPNV+HYSSLLNAY+ D NY+KA+ LI+EM SAG
Sbjct: 121  LLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAG 180

Query: 847  LTLNKVILTTLLKVYVRAGLFEKSRELLDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAK 668
            L LNKVI TTLLKVYV+ GLFEKS+ELL EL+ LGYA+DEMP+CLLMDGLAKSG+L EAK
Sbjct: 181  LVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAK 240

Query: 667  SVFDEMRLKEVKN-DGYSYSIMISALCRSKFIDEAKQLASEFEMKYDKYDVVILNSMLCA 491
            SVFDEM  K VK  DGYSYSIMISA CRS  +++AK++ASEFE KYDKYD+VILN+ML A
Sbjct: 241  SVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSA 300

Query: 490  YCRSGEMENVMKMMRKMDESAISPDWSTYKILIKYFCKERLYLLAYRTMEDMHKKGHHTP 311
            YCR+G+MENVM MM+KMD+SAISPDW+T+ ILI+YFCKE+LYLLAYRTMEDMH KGH   
Sbjct: 301  YCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPE 360

Query: 310  EDLAVSLMKSLGSMGSYSEAFSVYNMLKYSKRTMNKALHGKILHILVSGGLLKDAYVVVK 131
            E L  SL+  LG  G++SEAFSVYNML+YSKRT++ ALH  ILHIL++G LLKDAYVVVK
Sbjct: 361  EGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVK 420

Query: 130  DNAEFISKPAIKRFAISFMRNGNINLVNDVIKSIHNSDYNIDQ 2
            DNA FIS+PAIK+F+++FMR+GN+NL+NDV+ ++H+S + IDQ
Sbjct: 421  DNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQ 463


>ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cicer arietinum]
          Length = 642

 Score =  648 bits (1672), Expect = 0.0
 Identities = 319/555 (57%), Positives = 423/555 (76%)
 Frame = -3

Query: 1666 AQIPATSQLGRVTSPLSSIANATHTQVWSKSGKTPVASRQTKESAIHDIQHSSNLEAALL 1487
            + I A++ +    +P     + T   +   + K P ++R++ +  +H    +S+L + L 
Sbjct: 20   SSISASASITEPPTPTPPSPSQTKKSIKFVNSK-PFSARKSAKLQLH---RASDLNSVLS 75

Query: 1486 RSGELLKVQDLNIVLRHFGKLNRWMDVCQLFDWMQQHGKTNIASYSSYIKFVGRDSNSAK 1307
            + G+ L V++LN  L HFG  N++  + QLF WMQ++ K ++ SYS+YIKF+    +++ 
Sbjct: 76   KVGKTLTVKELNSTLHHFGNSNKFNHISQLFLWMQENKKLDVYSYSNYIKFMANKLDAST 135

Query: 1306 ALEIYNSIKDDSIRSNASVCNSTLCCLIKCGKFHNSLKLFNQMKQAGLVPDIVTYSTLLL 1127
             L++YN+I+D+S + N  VCNS L CLIK GKF  ++KLF+QMKQ GLVPD+VTYS L+ 
Sbjct: 136  VLKLYNNIQDESAKDNVYVCNSVLSCLIKKGKFDTAIKLFHQMKQDGLVPDLVTYSMLIA 195

Query: 1126 GCAKVKGGYFKAMELVREMKSRGLSMDDVLYGTLISVCASNHQCDEAEKYFDEMKSEGHS 947
            GC KVK GY KA++L++E++   L MD+V+YG +++VCASN + +EAE YF+ MK+EGHS
Sbjct: 196  GCVKVKDGYSKALQLIQELQDNKLRMDNVIYGAILAVCASNGKWEEAEHYFNGMKNEGHS 255

Query: 946  PNVFHYSSLLNAYAIDGNYKKADKLIQEMGSAGLTLNKVILTTLLKVYVRAGLFEKSREL 767
            PNV+HYSSLLNAY+  GN+KKAD LIQ+M S GL  NKVILTTLLKVYVR GL EKSREL
Sbjct: 256  PNVYHYSSLLNAYSASGNFKKADSLIQDMKSEGLVPNKVILTTLLKVYVRGGLLEKSREL 315

Query: 766  LDELQDLGYAEDEMPYCLLMDGLAKSGNLEEAKSVFDEMRLKEVKNDGYSYSIMISALCR 587
            L +L+ L YAEDEMPYC+LMDGLAK+G + EAK VFDEM  K V++DGY++SIMISA CR
Sbjct: 316  LTKLESLSYAEDEMPYCVLMDGLAKAGQVHEAKIVFDEMMKKHVRSDGYAHSIMISAFCR 375

Query: 586  SKFIDEAKQLASEFEMKYDKYDVVILNSMLCAYCRSGEMENVMKMMRKMDESAISPDWST 407
            +K  +EAKQLA  F+  ++KYDVVI+NSMLCA+CR+GEME+VM+ +RKMDE AISPD++T
Sbjct: 376  AKLFEEAKQLAKNFQTTFNKYDVVIMNSMLCAFCRAGEMESVMETLRKMDELAISPDYNT 435

Query: 406  YKILIKYFCKERLYLLAYRTMEDMHKKGHHTPEDLAVSLMKSLGSMGSYSEAFSVYNMLK 227
            + ILIKYFC++ +YLLAY+TMEDMH KG+   E+L  SL+  LG   +YSEAFSVYNMLK
Sbjct: 436  FNILIKYFCRQNMYLLAYQTMEDMHSKGYQPVEELCSSLIYHLGQANAYSEAFSVYNMLK 495

Query: 226  YSKRTMNKALHGKILHILVSGGLLKDAYVVVKDNAEFISKPAIKRFAISFMRNGNINLVN 47
            YSKRT+ K LH KILHIL++G LLKDAYVV KDNA FIS    K+FA +FM+ GNINL+N
Sbjct: 496  YSKRTIRKTLHEKILHILLAGKLLKDAYVVFKDNATFISGHTTKKFASAFMKLGNINLIN 555

Query: 46   DVIKSIHNSDYNIDQ 2
            DV+K++HN  Y IDQ
Sbjct: 556  DVMKTLHNCGYKIDQ 570

