BLASTX nr result

ID: Mentha28_contig00025507

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00025507
         (773 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus...   354   2e-95
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   265   1e-68
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   264   2e-68
ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prun...   244   2e-62
ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfam...   243   4e-62
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   233   4e-59
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   228   2e-57
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   219   7e-55
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   219   7e-55
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   216   6e-54
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   209   1e-51
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   196   1e-47
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   196   1e-47
ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [A...   194   3e-47
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   191   2e-46
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     190   4e-46
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                186   6e-45
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   186   6e-45
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   171   3e-40
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   170   5e-40

>gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus guttatus]
          Length = 847

 Score =  354 bits (909), Expect = 2e-95
 Identities = 176/239 (73%), Positives = 209/239 (87%), Gaps = 1/239 (0%)
 Frame = -2

Query: 715 NPKH-KPQFPASSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKM 539
           NPKH KPQFP  SPLLSS R DNRQSL Y ++LASKLAEDG FE+FLMI+ESVVASGVK 
Sbjct: 44  NPKHNKPQFPTYSPLLSSYRRDNRQSLTYNTELASKLAEDGMFEDFLMISESVVASGVKP 103

Query: 538 SEFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECR 359
           SEFLALL ++ +  G+ RVL +G   SVV MLFNG+ K+G++PV++FDAV+ ESL+ ECR
Sbjct: 104 SEFLALLNAKCVAIGVARVLDEGNLHSVVKMLFNGLEKIGIDPVQMFDAVSTESLRRECR 163

Query: 358 RLLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVL 179
           RLLK GEVE+L+SFME  AGF+F+I+ELVEPS+ I+LCI++RDP AAIRYAQNFPH E++
Sbjct: 164 RLLKRGEVEQLVSFMETLAGFKFQIRELVEPSDVISLCISQRDPTAAIRYAQNFPHMEIM 223

Query: 178 FCSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           FCSI+LEFGKKRDLASAL AFEA+KQN S+PNMHAYR+IIDVCG+CGDYLKSRTIYEGL
Sbjct: 224 FCSIILEFGKKRDLASALTAFEAAKQNTSTPNMHAYRTIIDVCGLCGDYLKSRTIYEGL 282


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  265 bits (678), Expect = 1e-68
 Identities = 134/247 (54%), Positives = 178/247 (72%), Gaps = 10/247 (4%)
 Frame = -2

Query: 712 PKHKPQFPASS------PLLSSSRWDNRQS----LAYYSQLASKLAEDGRFEEFLMIAES 563
           P H P    SS      PLLS+ RWD+       L YY++LASKLA+DGRF++ LMIAES
Sbjct: 41  PTHSPSHFTSSITTPQSPLLSTLRWDSASGSCNGLKYYAELASKLAQDGRFDDSLMIAES 100

Query: 562 VVASGVKMSEFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAV 383
           VV SGV  +EF ALL  + +  GIVR+L + K  SVV++L NG ++LG++P++L D  A+
Sbjct: 101 VVVSGVNAAEFAALLNVKLVSGGIVRLLEERKVGSVVELL-NGAQQLGIDPLKLLDGDAL 159

Query: 382 ESLKVECRRLLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQ 203
            +L  ECRR + CGE+E ++S ME   G    IK+LV+PSE + LC+++R P AA+RYA 
Sbjct: 160 NALSRECRRTMGCGEIEEVVSLMETLKGCGMPIKDLVKPSEILRLCVSQRKPNAAVRYAH 219

Query: 202 NFPHAEVLFCSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKS 23
            FPH +++FC+I+LEFGKK DL SAL  FEASKQN  +PN++ YR+ IDVCG+CGDYLKS
Sbjct: 220 IFPHVDIMFCTIILEFGKKGDLVSALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYLKS 279

Query: 22  RTIYEGL 2
           R+IYEGL
Sbjct: 280 RSIYEGL 286


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  264 bits (675), Expect = 2e-68
 Identities = 134/249 (53%), Positives = 178/249 (71%), Gaps = 12/249 (4%)
 Frame = -2

Query: 712 PKHKPQFPASS------PLLSSSRWDNRQS------LAYYSQLASKLAEDGRFEEFLMIA 569
           P H P    SS      PLLSS RWD+  +      L YY++LASKLA+DGRF++ LMIA
Sbjct: 41  PTHSPSHFTSSITTPQSPLLSSLRWDSASASGSCNGLKYYAELASKLAQDGRFDDSLMIA 100

Query: 568 ESVVASGVKMSEFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAV 389
           ESVV SGV   EF ALL  + +  GIVR+L + K  SVV++L NG ++LG++P +L D  
Sbjct: 101 ESVVVSGVNAEEFTALLNVKLVSGGIVRLLEERKVGSVVELL-NGAQQLGIDPSKLLDED 159

Query: 388 AVESLKVECRRLLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRY 209
           ++ +L  ECRR ++C E+E ++S ME   G    IK+LV+PSE + LC+++R P AA+RY
Sbjct: 160 SINALSRECRRTMQCSEIEEVVSLMETLRGCGMPIKDLVKPSEILRLCVSQRKPNAAVRY 219

Query: 208 AQNFPHAEVLFCSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYL 29
           A  FPH +++FC+I+LEFGKK DLASAL  FEASKQN  +PN++ YR+ IDVCG+CGDYL
Sbjct: 220 AHIFPHVDIMFCTIILEFGKKGDLASALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYL 279

Query: 28  KSRTIYEGL 2
           KSR+IYEGL
Sbjct: 280 KSRSIYEGL 288


>ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
           gi|462403723|gb|EMJ09280.1| hypothetical protein
           PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  244 bits (623), Expect = 2e-62
 Identities = 125/227 (55%), Positives = 166/227 (73%), Gaps = 1/227 (0%)
 Frame = -2

Query: 679 PLLSSSRWD-NRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALLKSEHL 503
           P L + RWD  +  L+Y++ LASKLA DG+F++F M+ ESVV SGV+ SEF A LK E +
Sbjct: 60  PPLFAVRWDPTKTHLSYFADLASKLARDGKFQDFAMVVESVVLSGVRGSEFTAALKLELV 119

Query: 502 VSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVERLL 323
             GI  +L++GK  SVV++L   + +LGV P++LFD  A+E L  +C RLLKC +V+ L+
Sbjct: 120 AKGISGLLKEGKVRSVVEVL-GKVNELGVPPLKLFDGYAMELLGRQCSRLLKCKQVQELV 178

Query: 322 SFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGKKR 143
             MEA AG++F IKEL+EPSE I LC+ K  P  AIRYA  FPHA +LFC+I+ EFGK++
Sbjct: 179 ELMEALAGYRFPIKELLEPSEVIKLCVDKCCPKLAIRYACIFPHAHILFCNIIYEFGKRK 238

Query: 142 DLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
            L  AL A+EASK+NL+  NM+ YR+IIDVCG+C DY+KSR IYE L
Sbjct: 239 ALEPALAAYEASKENLNGSNMYVYRTIIDVCGLCKDYMKSRYIYEDL 285


>ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           [Theobroma cacao] gi|508715815|gb|EOY07712.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative [Theobroma cacao]
          Length = 858

 Score =  243 bits (621), Expect = 4e-62
 Identities = 128/232 (55%), Positives = 168/232 (72%), Gaps = 6/232 (2%)
 Frame = -2

Query: 679 PLLSSS--RWD--NRQS--LAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALL 518
           PLLSSS  RWD  +R+S  L YY+ LASKLAEDGR E+F MI E +VASGV     +++L
Sbjct: 64  PLLSSSSVRWDPTSRRSSLLKYYADLASKLAEDGRLEDFAMIVEMLVASGVNAPRIVSML 123

Query: 517 KSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGE 338
             + +  G+   +++GK  SVV++L   + KLG+ P +L D   + S+K E +R++  GE
Sbjct: 124 SVQFVSKGVASNVQEGKVKSVVEVL-KKVEKLGIAPSKLVDGFGLVSMKREFQRIVGSGE 182

Query: 337 VERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLE 158
           VE+ +  +EA  GFQF IKELV+PS  I +C+ KR+P  A+RYA   PHA++LFCSI+ E
Sbjct: 183 VEQAVDLLEALRGFQFTIKELVDPSYIIKVCVDKRNPNLAVRYACLLPHAKILFCSIISE 242

Query: 157 FGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           FGKKRDLASAL A+EASK+NLS PNM+ YR+IID CG+CGDYLKSR IYE L
Sbjct: 243 FGKKRDLASALTAYEASKKNLSGPNMYLYRAIIDACGLCGDYLKSRNIYEDL 294


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  233 bits (595), Expect = 4e-59
 Identities = 121/227 (53%), Positives = 158/227 (69%), Gaps = 1/227 (0%)
 Frame = -2

Query: 679 PLLSSSRWDNRQS-LAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALLKSEHL 503
           PL + +RWD   + L+Y++ LASKLA DG+  +F M+ ESVV SGVK S+F A L+ + +
Sbjct: 61  PLFAGTRWDPHHTHLSYFADLASKLARDGKLHDFSMLLESVVLSGVKPSQFTAALQLDMV 120

Query: 502 VSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVERLL 323
             GI  +L+DGK   +V++L   + +LGV PV LFD  A+E L   C RLLK  +V+ L+
Sbjct: 121 SRGISGILKDGKVGGLVEVLVK-VAELGVRPVELFDGYAMELLGAHCLRLLKFKQVQELV 179

Query: 322 SFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGKKR 143
             ME   G  F I+ELV+PSE I  C+ KR P  AIRYA  FPH+ +LFC+I+ EFGKKR
Sbjct: 180 ELMEVLYGLHFPIRELVDPSEVIKACVEKRRPKLAIRYACIFPHSHMLFCNIMYEFGKKR 239

Query: 142 DLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
            LASAL A+EASK+ LS  NM+ YR+IIDVCG+C DY+KSR IYE L
Sbjct: 240 ALASALTAYEASKEKLSGSNMYIYRTIIDVCGVCKDYMKSRYIYEDL 286


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic [Vitis vinifera]
           gi|297741486|emb|CBI32618.3| unnamed protein product
           [Vitis vinifera]
          Length = 842

 Score =  228 bits (580), Expect = 2e-57
 Identities = 115/226 (50%), Positives = 159/226 (70%)
 Frame = -2

Query: 679 PLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALLKSEHLV 500
           PLLS  RWD    L  YS LA+KL +DGRF++F  +AE+++ SGV++S+ + L+ +    
Sbjct: 56  PLLSDVRWD----LNNYSDLATKLVQDGRFDDFSTMAETLILSGVELSQLVELVSA---- 107

Query: 499 SGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVERLLS 320
            GI  +LR+G+   VV++L   + KLG+ P+ LFD   +E L  ECRR+L CG+VE ++ 
Sbjct: 108 -GISGLLREGRVYCVVEVL-RKVDKLGICPLELFDGSTLELLSKECRRILNCGQVEEVVE 165

Query: 319 FMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGKKRD 140
            +E   GF F +K+L+EP +FI +C+ KR+P  A+RYA   PHA++LFC+I+ EFGKKRD
Sbjct: 166 LIEILDGFHFPVKKLLEPLDFIKICVNKRNPNLAVRYACILPHAQILFCTIIHEFGKKRD 225

Query: 139 LASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           L SAL AFEASKQ L  PNM+ YR++IDVCG+C  Y KSR IYE L
Sbjct: 226 LGSALTAFEASKQKLIGPNMYCYRTMIDVCGLCSHYQKSRYIYEEL 271


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  219 bits (559), Expect = 7e-55
 Identities = 115/238 (48%), Positives = 160/238 (67%), Gaps = 2/238 (0%)
 Frame = -2

Query: 709 KHKPQFPASSPLL--SSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMS 536
           +H P    SS  L  + +    R  + +Y+ +ASKLAE G+ E+F M+ ESVV +GV+ S
Sbjct: 51  RHSPPALLSSVELDIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPS 110

Query: 535 EFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRR 356
           +F A+L  E +  GI R LR+GK  SVV +L   + +LG+  + L D  AVESL+ +CRR
Sbjct: 111 QFGAMLAVELVAKGISRCLREGKVWSVVQVL-RKVEELGISVLELCDEPAVESLRRDCRR 169

Query: 355 LLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLF 176
           + K GE+E L+  ME  +GF F ++E+++PSE I LC+  R+P  AIRYA   PHA++LF
Sbjct: 170 MAKSGELEELVELMEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILF 229

Query: 175 CSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           C+ + EFGKKRDL SA  A+  SK N++  NM+ YR+IIDVCG+CGDY KSR IY+ L
Sbjct: 230 CTTINEFGKKRDLKSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDL 287


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  219 bits (559), Expect = 7e-55
 Identities = 115/238 (48%), Positives = 160/238 (67%), Gaps = 2/238 (0%)
 Frame = -2

Query: 709 KHKPQFPASSPLL--SSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMS 536
           +H P    SS  L  + +    R  + +Y+ +ASKLAE G+ E+F M+ ESVV +GV+ S
Sbjct: 51  RHSPPALLSSVELDIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPS 110

Query: 535 EFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRR 356
           +F A+L  E +  GI R LR+GK  SVV +L   + +LG+  + L D  AVESL+ +CRR
Sbjct: 111 QFGAMLAVELVAKGISRCLREGKVWSVVQVL-RKVEELGISVLELCDEPAVESLRRDCRR 169

Query: 355 LLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLF 176
           + K GE+E L+  ME  +GF F ++E+++PSE I LC+  R+P  AIRYA   PHA++LF
Sbjct: 170 MAKSGELEELVELMEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILF 229

Query: 175 CSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           C+ + EFGKKRDL SA  A+  SK N++  NM+ YR+IIDVCG+CGDY KSR IY+ L
Sbjct: 230 CTTINEFGKKRDLKSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDL 287


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
           gi|568853887|ref|XP_006480569.1| PREDICTED:
           pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Citrus sinensis]
           gi|557530964|gb|ESR42147.1| hypothetical protein
           CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  216 bits (551), Expect = 6e-54
 Identities = 116/229 (50%), Positives = 153/229 (66%)
 Frame = -2

Query: 688 ASSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALLKSE 509
           + + LLS+ R D      YY+ +ASKLA+DGR EEF MI ESVV S   +S+F ++L  E
Sbjct: 55  SQTALLSTVRRDLSSRNDYYADMASKLAKDGRLEEFAMIVESVVVSEGNVSKFASMLSLE 114

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            + SGIV+ + +G+   VV +L   + +LGV P+ LF     + LK EC+RLL  GEVE 
Sbjct: 115 MVASGIVKSIGEGRIDCVVGVL-KKLNELGVAPLELFHGSGFKLLKNECQRLLDSGEVEM 173

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME    F+  +KEL E    + LC+ K D   AIRYA   P A++LFC+ V EFGK
Sbjct: 174 FVGLMEVLEEFRLPVKELDEEFRIVQLCVNKPDVNLAIRYACIVPRADILFCNFVREFGK 233

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           KRDL SAL+A+EASK++LSSPNM+  R+IIDVCG+CGDY+KSR IYE L
Sbjct: 234 KRDLVSALRAYEASKKHLSSPNMYICRTIIDVCGLCGDYMKSRAIYEDL 282


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550336211|gb|ERP59304.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 828

 Score =  209 bits (531), Expect = 1e-51
 Identities = 117/257 (45%), Positives = 162/257 (63%), Gaps = 19/257 (7%)
 Frame = -2

Query: 715 NPKHKPQFPA--------------SSPLLSS----SRWDNRQSLAYYSQLASKLAEDGRF 590
           +PK KP+ P+              S PLLS+       ++   L Y++ LASKLAEDGR 
Sbjct: 28  SPKPKPKTPSLHAPSKPIPAVHSRSPPLLSTIPFRQNHNSSSLLDYHANLASKLAEDGRL 87

Query: 589 EEFLMIAESVVASGVKMSEFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEP 410
           ++F+MIAESV+ASGV+ S F+A L    +  GI + L+ G    VV  L     +LGV  
Sbjct: 88  QDFVMIAESVIASGVEPSSFVAALSVGPVAKGISKNLQQGNVDCVVRFL-KKTEELGVST 146

Query: 409 VRLFDAVAVESLKVECRRLLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRD 230
           ++  D VA++ LK E  R++ CG+VE+++  ME  AGF F  KELV+PS  I +C+ K +
Sbjct: 147 LKFLDGVAIDLLKKEFIRIVNCGDVEQVVYIMETLAGFCFSFKELVDPSYIIKICVDKLN 206

Query: 229 PIAAIRYAQNFP-HAEVLFCSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDV 53
           P  A+RYA  FP    +LFC+I+ EFG+K  L SAL A++ +K  LS PNM+ +R+IIDV
Sbjct: 207 PKMAVRYAAIFPGEGRILFCNIISEFGRKGHLDSALVAYDEAKHKLSVPNMYLHRTIIDV 266

Query: 52  CGICGDYLKSRTIYEGL 2
           CG+CGDY+KSR IYE L
Sbjct: 267 CGLCGDYMKSRYIYEDL 283


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
           gi|557099830|gb|ESQ40193.1| hypothetical protein
           EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  196 bits (497), Expect = 1e-47
 Identities = 105/229 (45%), Positives = 147/229 (64%), Gaps = 1/229 (0%)
 Frame = -2

Query: 685 SSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVA-SGVKMSEFLALLKSE 509
           SS    + RW    S+ YY+  ASKLAEDGR ++  +IAE++ A SG  ++ F +++ S+
Sbjct: 64  SSHFSDAVRWIPDGSVEYYADFASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSD 123

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            L  GI   LR GK  SVV  L   I K+G+ P+ L D  +V+ ++   R +    +VE+
Sbjct: 124 LLSKGISLNLRQGKIESVVYTL-QRIEKVGIAPLDLVDESSVKLMRKHFRAMANSVQVEK 182

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME  AGF+FKIKELV+P + + +C+   +P  AIRYA   PH E+L C I+  FGK
Sbjct: 183 AIDLMEILAGFRFKIKELVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGK 242

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           K D+ S L A+EA KQ L +PNM+ YR++IDVCG+CGDY+KSR IYE L
Sbjct: 243 KGDMVSVLTAYEACKQILDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDL 291


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
           gi|557099829|gb|ESQ40192.1| hypothetical protein
           EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  196 bits (497), Expect = 1e-47
 Identities = 105/229 (45%), Positives = 147/229 (64%), Gaps = 1/229 (0%)
 Frame = -2

Query: 685 SSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVA-SGVKMSEFLALLKSE 509
           SS    + RW    S+ YY+  ASKLAEDGR ++  +IAE++ A SG  ++ F +++ S+
Sbjct: 64  SSHFSDAVRWIPDGSVEYYADFASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSD 123

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            L  GI   LR GK  SVV  L   I K+G+ P+ L D  +V+ ++   R +    +VE+
Sbjct: 124 LLSKGISLNLRQGKIESVVYTL-QRIEKVGIAPLDLVDESSVKLMRKHFRAMANSVQVEK 182

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME  AGF+FKIKELV+P + + +C+   +P  AIRYA   PH E+L C I+  FGK
Sbjct: 183 AIDLMEILAGFRFKIKELVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGK 242

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           K D+ S L A+EA KQ L +PNM+ YR++IDVCG+CGDY+KSR IYE L
Sbjct: 243 KGDMVSVLTAYEACKQILDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDL 291


>ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [Amborella trichopoda]
           gi|548832949|gb|ERM95718.1| hypothetical protein
           AMTR_s00023p00232870 [Amborella trichopoda]
          Length = 855

 Score =  194 bits (493), Expect = 3e-47
 Identities = 104/236 (44%), Positives = 149/236 (63%), Gaps = 4/236 (1%)
 Frame = -2

Query: 697 QFPASSPLLSSSRWD----NRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEF 530
           ++ +S+PLLS  R D    N  SL +Y+ +ASKLAE+GR +EF M+AES + SG+    F
Sbjct: 45  KYLSSTPLLSDIRPDLGLQNPSSLKFYASMASKLAENGRLDEFSMLAESFIGSGMAPGHF 104

Query: 529 LALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLL 350
           +  L  +H+ +G    L++G+  +V+ ++     KLG+ P  +FD  A   L   CRR+L
Sbjct: 105 VEALSIKHVSAGFALCLKNGEFDTVLGVM-EKFDKLGICPSLIFDGSARRLLLSACRRVL 163

Query: 349 KCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCS 170
               +   +  +E FAG++F +K++V+P+  +  CI + DP  A RYA   PHA+V F  
Sbjct: 164 DGDNIGEFVRLVEIFAGYRFSVKDVVKPTFILQACIDRHDPFMAGRYASILPHADVWFNF 223

Query: 169 IVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           ++ EFGKK+DL SAL AFE SK    SPNM+ YRSIID CG CGD LKSR+I+E L
Sbjct: 224 LICEFGKKKDLQSALVAFEVSKGKSVSPNMYIYRSIIDACGYCGDSLKSRSIFEDL 279


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
           gi|482555757|gb|EOA19949.1| hypothetical protein
           CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  191 bits (486), Expect = 2e-46
 Identities = 104/229 (45%), Positives = 145/229 (63%), Gaps = 1/229 (0%)
 Frame = -2

Query: 685 SSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVA-SGVKMSEFLALLKSE 509
           SS   +  RW    SL YY+  ASKLAEDGR E+  +IAE++ A SG  ++ F +++  +
Sbjct: 66  SSHFSNVVRWLPDGSLEYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDFD 125

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            L  GI   LR GK  SVV  L   I K+G+ P+ L D  +V+ ++ + R +    +VE+
Sbjct: 126 LLSKGISSNLRQGKIESVVYTL-KRIEKVGIAPLDLVDESSVKLMRKQFRAMANSVQVEK 184

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME  AG +FKIKELV+P + +  C+   +P  AIRYA   PH E+L C I+L FGK
Sbjct: 185 AIDLMEILAGLRFKIKELVDPFDIVKSCVDISNPELAIRYACLLPHTEILLCRIILGFGK 244

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           K D+ S + A+EA KQ L +PNM+  R++IDVCG+CGDY+KSR IYE L
Sbjct: 245 KGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDL 293


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  190 bits (483), Expect = 4e-46
 Identities = 106/232 (45%), Positives = 144/232 (62%), Gaps = 2/232 (0%)
 Frame = -2

Query: 691 PASSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVASGVKMSEFLALLKS 512
           P++ P  SS+    R  L +++  A     D +  +  ++ ES+  SGV  S   + L++
Sbjct: 39  PSNLPSRSSAV---RSDLRHFADFAG----DAKLRDLSVVVESLAVSGVDASRLRSALRA 91

Query: 511 E--HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGE 338
           E      GI  VLRDGK  S   +L   + +LG  PV +FD  A+E ++ ECRR+L+C +
Sbjct: 92  ELASAEKGISAVLRDGKVRSFARLL-GKLDELGFPPVEIFDGWALELIRRECRRILRCEQ 150

Query: 337 VERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLE 158
           VE L+   E  +G+ F IKELV+PS+ I +C+ KR+P  AIRYA   PHA ++FC  V E
Sbjct: 151 VEELVELFEVLSGYGFSIKELVKPSDVIKICVEKRNPKMAIRYACTLPHAHIIFCDAVYE 210

Query: 157 FGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           FGKK DL SAL A EASK+N +S NM+ YR+IIDVCG C DY KSR IYE L
Sbjct: 211 FGKKGDLVSALIAHEASKKNSTSTNMYLYRTIIDVCGRCHDYQKSRYIYEDL 262


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  186 bits (473), Expect = 6e-45
 Identities = 103/229 (44%), Positives = 143/229 (62%), Gaps = 1/229 (0%)
 Frame = -2

Query: 685 SSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVA-SGVKMSEFLALLKSE 509
           SS   +  RW    SL YY+  ASKLAEDGR E+  +IAE++ A SG  ++ F +++  +
Sbjct: 66  SSHFSNVVRWIPDGSLEYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYD 125

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            L  GI   LR GK  SVV  L   I K+G+ P+ L D  +V+ ++ + R +    +VE+
Sbjct: 126 LLSKGISSNLRQGKIESVVYTL-KRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEK 184

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME  AG  FKIKELV+P + +  C+   +P  AIRYA   PH E+L C I+  FGK
Sbjct: 185 AIDLMEILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGK 244

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           K D+ S + A+EA KQ L +PNM+  R++IDVCG+CGDY+KSR IYE L
Sbjct: 245 KGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDL 293


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g02830, chloroplastic; Flags: Precursor
           gi|332003140|gb|AED90523.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  186 bits (473), Expect = 6e-45
 Identities = 103/229 (44%), Positives = 143/229 (62%), Gaps = 1/229 (0%)
 Frame = -2

Query: 685 SSPLLSSSRWDNRQSLAYYSQLASKLAEDGRFEEFLMIAESVVA-SGVKMSEFLALLKSE 509
           SS   +  RW    SL YY+  ASKLAEDGR E+  +IAE++ A SG  ++ F +++  +
Sbjct: 66  SSHFSNVVRWIPDGSLEYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYD 125

Query: 508 HLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVER 329
            L  GI   LR GK  SVV  L   I K+G+ P+ L D  +V+ ++ + R +    +VE+
Sbjct: 126 LLSKGISSNLRQGKIESVVYTL-KRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEK 184

Query: 328 LLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGK 149
            +  ME  AG  FKIKELV+P + +  C+   +P  AIRYA   PH E+L C I+  FGK
Sbjct: 185 AIDLMEILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGK 244

Query: 148 KRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           K D+ S + A+EA KQ L +PNM+  R++IDVCG+CGDY+KSR IYE L
Sbjct: 245 KGDMVSVMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDL 293


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  171 bits (433), Expect = 3e-40
 Identities = 85/168 (50%), Positives = 118/168 (70%)
 Frame = -2

Query: 505 LVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAVAVESLKVECRRLLKCGEVERL 326
           L  GI + LR+    SVVD L N   +LG+ P +LFDA +++ LK EC R++  G +E +
Sbjct: 49  LAKGISKNLRERNVDSVVDAL-NTADQLGLPPSQLFDAASMDLLKTECLRIVNFGRLEDI 107

Query: 325 LSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVLFCSIVLEFGKK 146
           +  ME  AG+ F IKELVEPS  I LC+ +R+P  A+RYA+ FPH  +L CSIV +FGKK
Sbjct: 108 ILLMETLAGYSFSIKELVEPSRVIKLCVHQRNPHLAVRYARLFPHEGILMCSIVKQFGKK 167

Query: 145 RDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
            DL SAL A+EA  Q+ + P+M+ YR++IDVCG+CGDY++SR I+E +
Sbjct: 168 GDLDSALAAYEAYMQHSTVPDMYLYRALIDVCGLCGDYMQSRYIFEDI 215


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Glycine max]
          Length = 811

 Score =  170 bits (431), Expect = 5e-40
 Identities = 100/239 (41%), Positives = 142/239 (59%), Gaps = 2/239 (0%)
 Frame = -2

Query: 712 PKHKPQFPASSPLLSSSRWDNRQSLAYYSQLA-SKLAEDGRFEEFLMIAESVVASGVKMS 536
           P HKP  P  +P   SS W+   S A  + LA S  A+    +EF ++ E  + SGV  +
Sbjct: 31  PPHKPSLPKLAPF--SSNWNI--SCALQAPLALSHCADSKLVQEFEVVFEDFIDSGVVDA 86

Query: 535 EFLALLKSEHLVSGIVRVLRDGKPSSVVDMLFNGIRKLGVEPVRLFDAV-AVESLKVECR 359
           E LA +        ++  +R  K  SV+    + + K+    + L   +   + +  EC 
Sbjct: 87  ELLAKV--------VLLGIRGKKVRSVI----HALNKVQGRRISLSTHLNGSDIIAKECC 134

Query: 358 RLLKCGEVERLLSFMEAFAGFQFKIKELVEPSEFINLCITKRDPIAAIRYAQNFPHAEVL 179
           RL+ C  VE  +  ME  A FQ  I+ELV+PS+ I  C+  R+PI A+RYA   PHA +L
Sbjct: 135 RLVTCSHVEEAVELMEVLARFQISIRELVQPSDIIKRCVLSRNPILAVRYACLLPHAHIL 194

Query: 178 FCSIVLEFGKKRDLASALKAFEASKQNLSSPNMHAYRSIIDVCGICGDYLKSRTIYEGL 2
           FC+I+ EFGK+RDL SALKA+EASK++L++PNM+ YR+ ID CG+C DY+KSR IYE L
Sbjct: 195 FCNIISEFGKRRDLVSALKAYEASKKHLNTPNMYIYRATIDTCGLCRDYMKSRYIYEDL 253

