BLASTX nr result

ID: Mentha25_contig00015058 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00015058
         (1632 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22729.1| hypothetical protein MIMGU_mgv1a019695mg, partial...   474   e-131
gb|EPS72967.1| hypothetical protein M569_01792 [Genlisea aurea]       350   8e-94
ref|XP_003634944.1| PREDICTED: pentatricopeptide repeat-containi...   318   4e-84
ref|XP_007019869.1| Pentatricopeptide repeat superfamily protein...   297   1e-77
ref|XP_004243688.1| PREDICTED: pentatricopeptide repeat-containi...   292   3e-76
ref|XP_006353779.1| PREDICTED: pentatricopeptide repeat-containi...   288   5e-75
ref|XP_006441650.1| hypothetical protein CICLE_v10019950mg [Citr...   285   5e-74
ref|XP_004141361.1| PREDICTED: pentatricopeptide repeat-containi...   281   8e-73
gb|EXC01127.1| hypothetical protein L484_025500 [Morus notabilis]     280   1e-72
ref|XP_002525536.1| pentatricopeptide repeat-containing protein,...   277   1e-71
ref|XP_002325952.1| hypothetical protein POPTR_0019s10460g [Popu...   267   9e-69
ref|XP_007153232.1| hypothetical protein PHAVU_003G017800g [Phas...   226   2e-56
ref|XP_006854920.1| hypothetical protein AMTR_s00052p00103370 [A...   225   4e-56
ref|XP_006603965.1| PREDICTED: pentatricopeptide repeat-containi...   224   1e-55
ref|XP_004513354.1| PREDICTED: pentatricopeptide repeat-containi...   220   1e-54
ref|XP_007222181.1| hypothetical protein PRUPE_ppa009631mg [Prun...   216   3e-53
ref|XP_004301448.1| PREDICTED: pentatricopeptide repeat-containi...   214   9e-53
ref|NP_001167893.1| hypothetical protein [Zea mays] gi|223944699...   214   9e-53
ref|XP_004985951.1| PREDICTED: pentatricopeptide repeat-containi...   209   2e-51
ref|XP_002468625.1| hypothetical protein SORBIDRAFT_01g049260 [S...   203   2e-49

>gb|EYU22729.1| hypothetical protein MIMGU_mgv1a019695mg, partial [Mimulus guttatus]
          Length = 531

 Score =  474 bits (1220), Expect = e-131
 Identities = 254/461 (55%), Positives = 317/461 (68%), Gaps = 27/461 (5%)
 Frame = +3

Query: 75   SLC----GLPLKLK---------IDIWTLSSHISPNVCAGFSWLRCGKMARCHTVSSSDK 215
            SLC    GLP K++          D  TL    S   C+    LR G   RC T++SSD+
Sbjct: 85   SLCYFQSGLPSKIRSSTHSVLHYFDSSTLLKRESHYRCSNSERLRFG---RCFTIASSDE 141

Query: 216  MLGSDTVVESEVQ--------------RKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPR 353
               S  V+E+EV+              R     K VLE+V+++RNNG DLESRL+ LH  
Sbjct: 142  TFTSSPVMENEVRSVANICSDQQKFEKRNKFPQKFVLEIVDILRNNGADLESRLSMLHSN 201

Query: 354  LNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTL 533
            L+ YSITEIFEVLN++R+ GLR  EW+WSN P+LHKNA +CSLIIDN GRL+DY  +   
Sbjct: 202  LSVYSITEIFEVLNSRRISGLRLVEWIWSNKPQLHKNAHICSLIIDNFGRLDDYENISVW 261

Query: 534  FRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFC 713
            F+KF++E I LTY+AF FLPVLA  ++SL+ES  RV+DLLN++G           +EMFC
Sbjct: 262  FKKFSSEKICLTYEAFAFLPVLAPENSSLRESATRVVDLLNKIG-----------VEMFC 310

Query: 714  KFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIEDAHDIIREMREAHCAPTTTVYNYL 893
            K  LFEMAK+VIK+  SK +YYC+LIREKCR+GLIEDAH IIREM  A+C P TT+YNYL
Sbjct: 311  KLDLFEMAKYVIKITESKNAYYCILIREKCRSGLIEDAHSIIREMGNANCVPNTTIYNYL 370

Query: 894  LGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGL 1073
            LG LWK+ R+ +AS+LLDEMKE  IP D +TFEILI+  C  G MD VH LL ++ +QG+
Sbjct: 371  LGSLWKNGRMDKASALLDEMKEIGIPRDEITFEILINFVCRFGEMDEVHHLLDEMTSQGI 430

Query: 1074 QPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARS 1253
            +PR+STHA ++K LF           V D +   KTSSNM+YSLMA LY EKG IMSA++
Sbjct: 431  EPRISTHACIVKTLFAAEKYEAAHKYVVDFSVIYKTSSNMMYSLMANLYWEKGDIMSAKN 490

Query: 1254 TLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHS 1376
             LV+MMEKGLKP+FSIYLKIVK +RRSG  NLARDLE+ HS
Sbjct: 491  ILVEMMEKGLKPNFSIYLKIVKRIRRSGSTNLARDLESLHS 531


>gb|EPS72967.1| hypothetical protein M569_01792 [Genlisea aurea]
          Length = 447

 Score =  350 bits (899), Expect = 8e-94
 Identities = 180/383 (46%), Positives = 269/383 (70%), Gaps = 4/383 (1%)
 Frame = +3

Query: 246  EVQRKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFF 425
            + ++K    K V E+V+++RN+G+DLESRL KL PRL+ Y+ITEIFE LNTQRV GL+ F
Sbjct: 61   DTRKKRVQCKFVCEIVSILRNDGKDLESRLIKLAPRLSLYTITEIFEALNTQRVSGLKLF 120

Query: 426  EWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLAS 605
             W+ +N PKLHK+A VCSL+IDNLGRL  Y TM  + ++F++ NI LTY+AFGFLPV AS
Sbjct: 121  IWIRNNSPKLHKSARVCSLLIDNLGRLGAYDTMLLMLKEFSSHNICLTYEAFGFLPVSAS 180

Query: 606  TD-ASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYC 782
            T+ +SL ES KRV+D LN+ GGSCR+SG+ AL+EMF    +F MA++V+K+   +  Y+ 
Sbjct: 181  TESSSLAESTKRVVDFLNQAGGSCRNSGLYALVEMFSALDMFHMARYVMKITEIRRVYFT 240

Query: 783  LLIREKCRNGLIEDAHDIIREMREA-HCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKE 959
            ++IRE C+  L EDA  +I+EM E+    P   +YN++LG L + SRI EAS +   M+E
Sbjct: 241  VMIREMCKRDLFEDAIRLIKEMEESTRFFPDANIYNHILGSLLRTSRIDEASKIFSRMRE 300

Query: 960  NDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLF-XXXXXX 1136
             D+  D +T+EILI+S CSLG ++   +LL ++ + G++PR+ THA +IK +F       
Sbjct: 301  LDVKPDGITYEILINSHCSLGRLEDAKRLLDEMGSIGIEPRIETHALIIKAMFATGEEYE 360

Query: 1137 XXXXXVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMME-KGLKPDFSIYLKI 1313
                 V + +   +TS+N +++LMA L+C++G ++ A   L +MM+ + LKPDF++Y+ +
Sbjct: 361  GLRKHVVEYSGVYRTSANTMHTLMADLHCKRGDVLRAAEVLSEMMKCRNLKPDFNVYMDV 420

Query: 1314 VKLLRRSGRGNLARDLENRHSKF 1382
            V  L+R+ R +LA DL++ +S+F
Sbjct: 421  VMKLKRARRVDLAWDLQSMYSRF 443


>ref|XP_003634944.1| PREDICTED: pentatricopeptide repeat-containing protein At5g16420,
            mitochondrial-like [Vitis vinifera]
          Length = 582

 Score =  318 bits (815), Expect = 4e-84
 Identities = 172/427 (40%), Positives = 259/427 (60%), Gaps = 14/427 (3%)
 Frame = +3

Query: 147  CAGFSWLRCGKMA------------RCHTVSSSDKMLGSDTVVESEVQR--KHSLGKIVL 284
            C+   W  C ++             R  ++SS D   GS       +     +   K + 
Sbjct: 148  CSSILWNTCARLNVNSLKLSTQTSFRLLSISSFDNCSGSFEFGNKGIDELVPNMSPKRIS 207

Query: 285  EVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 464
            E++ ++R++  D+E +LN ++ RL+  S+TEIF VLN +R+  +RFFEW+  +   L +N
Sbjct: 208  EIIKVIRSDEIDMEVKLNLMNLRLSVASVTEIFRVLNLERLSAMRFFEWISHSRSGLSRN 267

Query: 465  ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVL 644
              +CSLIIDN GRL DY TM  L + F ++ + LT  AFGF+PV   + AS+ + +++++
Sbjct: 268  YDICSLIIDNCGRLGDYETMRCLLKDFNSKRVCLTSKAFGFVPVFTLSKASIMDFVRKLI 327

Query: 645  DLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIED 824
            ++L++VGG CR SG+  LIEMF     FEMAK V+++   KTSYY +L+RE CR    ++
Sbjct: 328  EVLDDVGGVCRRSGLFGLIEMFSVSGSFEMAKFVMEITERKTSYYNILVREMCRKCNFKE 387

Query: 825  AHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEILID 1004
            A D++ EMR   C P    YNYLL  L K++R  EA ++L+EM+E   P DA+TFEI I 
Sbjct: 388  ARDLLDEMRLFGCRPNAKTYNYLLSSLCKNNRDDEACNVLEEMQEAGCPPDALTFEIFIY 447

Query: 1005 SACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKTS 1184
                LG +D   + L Q+V++GL+PRL+THA  IK  F           V DS    K  
Sbjct: 448  YTYRLGKLDFAIKFLDQMVSRGLEPRLTTHAAFIKGYFHSRRYEEAYEYVVDSGVTYKWP 507

Query: 1185 SNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLE 1364
            SNMIYSL+A L+   G+++SA+  L++M+EKGLKP+FS+Y ++++ L +SGR +LA DL 
Sbjct: 508  SNMIYSLLASLHQRNGNLISAQKILIEMIEKGLKPNFSVYKRVLEHLDKSGREDLAGDLR 567

Query: 1365 NRHSKFT 1385
            +R S  +
Sbjct: 568  SRFSSLS 574


>ref|XP_007019869.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508725197|gb|EOY17094.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 494

 Score =  297 bits (760), Expect = 1e-77
 Identities = 151/374 (40%), Positives = 237/374 (63%)
 Frame = +3

Query: 273  KIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPK 452
            K  LEVV+++R+   DLES+L+ ++  L+  S+  IF +LN ++V  LRFF W+  + P+
Sbjct: 116  KQALEVVSLIRSGQNDLESKLDGMNVSLSEASLNTIFRILNNEKVSALRFFYWIRESHPQ 175

Query: 453  LHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESI 632
             + N+ +CSL+IDN GRL+D+ +  +L   F    I L + AFGF+PV+ S+ A+ K+SI
Sbjct: 176  FYHNSDICSLVIDNCGRLDDFDSAASLLNDFKLHGIRLNHRAFGFVPVMISSKAATKKSI 235

Query: 633  KRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNG 812
             +V+++LN +GGSC  SG+ ALIEM C    FEMAK+VI     + S Y +LIR +CR G
Sbjct: 236  CKVVEVLNRIGGSCSVSGIHALIEMLCALESFEMAKYVIAKAEKRLSNYNILIRGQCRKG 295

Query: 813  LIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFE 992
              E A +I+  M +  C P +  +N +L  L K+ ++ EA  LL++M E+  PLDA+TFE
Sbjct: 296  DFEGAREILDWMIKVGCNPNSQTFNNILSCLCKNDKVAEACQLLEQMLESGCPLDALTFE 355

Query: 993  ILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTAN 1172
            I I   C LG +D   + L+++ + G++PR++THA  +K  F           V   +  
Sbjct: 356  IFICYYCGLGRLDMAFEWLNKMDSSGIEPRITTHAAFVKGYFKLQQYEEAHNYVVVCSDK 415

Query: 1173 DKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLA 1352
             K +SN++YSL+A L+ ++G  + A+S L +M+EKGLKP+F++Y+ + K L++SGR +LA
Sbjct: 416  YKQASNIVYSLLASLHRKRGKPVIAQSILSEMIEKGLKPNFAVYMTVTKQLQKSGREDLA 475

Query: 1353 RDLENRHSKFTSNP 1394
             +L +  S   S P
Sbjct: 476  GNLRSSFSSLISQP 489


>ref|XP_004243688.1| PREDICTED: pentatricopeptide repeat-containing protein At1g12775,
            mitochondrial-like [Solanum lycopersicum]
          Length = 525

 Score =  292 bits (748), Expect = 3e-76
 Identities = 158/368 (42%), Positives = 229/368 (62%)
 Frame = +3

Query: 279  VLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 458
            VLE+V ++R  G+++  +L  +  +L+   + EIF++LN QR+ GL+FF W+  + P+ H
Sbjct: 142  VLEIVEIIRGGGQNVRQQLILVASKLSFKCVVEIFDLLNEQRISGLKFFNWLRDSHPEFH 201

Query: 459  KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 638
            ++A V SLII N G L+DY TM +L  +F  E   LT  AFGFL V  S   SL  S K+
Sbjct: 202  RSAYVNSLIICNCGWLDDYKTMFSLLEEFKTEQTCLTDKAFGFLTVFGSCKDSLMNSTKK 261

Query: 639  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLI 818
            V+D+L EVGGSC  SGV  LIEMFC   LFEMA  VI++     S Y +LIR++CR G I
Sbjct: 262  VVDMLIEVGGSCCGSGVYGLIEMFCSLDLFEMATFVIEITERTASRYNILIRKRCRAGQI 321

Query: 819  EDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 998
            E A  II EM E  C+P T  YNYLLG L K+ ++ +   +L+EM+   +  DA+TFE L
Sbjct: 322  EKARAIIEEMSEFGCSPNTKSYNYLLGSLCKNDKLEDVRIVLEEMRNKGLNPDAITFETL 381

Query: 999  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDK 1178
            +    S G ++   + ++ +V   ++PR +THA  +K L            V D +A   
Sbjct: 382  VYHLSSRGQVEFASEFMNLMVNVNVKPRSTTHAAFLKVLLEAGEREKAYKYVIDMSAKYN 441

Query: 1179 TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 1358
             S N +YSL+ +L  +KG IM+A++ L +M++KGLKPDF I++K VK L ++ R +LARD
Sbjct: 442  HSVNTLYSLLVRLNQKKGDIMAAQNILNEMIDKGLKPDFGIFIKFVKQLGKTRRKSLARD 501

Query: 1359 LENRHSKF 1382
            L+ ++S F
Sbjct: 502  LKMKYSVF 509


>ref|XP_006353779.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            isoform X1 [Solanum tuberosum]
            gi|565374458|ref|XP_006353780.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g09900-like isoform X2 [Solanum tuberosum]
            gi|565374460|ref|XP_006353781.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g09900-like isoform X3 [Solanum tuberosum]
          Length = 537

 Score =  288 bits (737), Expect = 5e-75
 Identities = 155/368 (42%), Positives = 227/368 (61%)
 Frame = +3

Query: 279  VLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 458
            +LE++ +++  GEDL  +L  +  +L+   +  IF++LN QR+ GL FF W+  + P+ H
Sbjct: 151  ILEIIEIIKGGGEDLRQQLILVASKLSFKCVIGIFDLLNEQRISGLNFFNWLRDSHPEFH 210

Query: 459  KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 638
             +A V SLII N G L+DY TM +L  +F  E   LT  AFGFL V  S   SL  S K+
Sbjct: 211  CSAYVNSLIICNCGWLDDYKTMFSLLEEFKAEQTCLTDKAFGFLTVFGSCKDSLMNSTKK 270

Query: 639  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLI 818
            V+D+LNEVGGSC  SGV  LIEM+C   LFEMA  VI++     S Y +LIR++CR G I
Sbjct: 271  VVDMLNEVGGSCCGSGVHGLIEMYCCLDLFEMATFVIEITERTASRYNILIRKRCRAGQI 330

Query: 819  EDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 998
            E A  II+EM E  C+P T  YNYLLG L K+ ++     +L+EM+   +  DA+TFE L
Sbjct: 331  EKARAIIKEMSEFGCSPNTKSYNYLLGSLCKNDKLEYVRIVLEEMRNKGLNPDAITFETL 390

Query: 999  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDK 1178
            +  + S G ++   + ++ ++   ++PR +THA  +K L            V D +A   
Sbjct: 391  VYHSSSRGQVEFASEFMNLMINVNVEPRSTTHAAFLKVLLEAGEREKAYKYVIDMSAKYN 450

Query: 1179 TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 1358
               N +YSL+ +L  +KG IM+A++ L +M++KGLKPDF I++KIVK L ++ + +LARD
Sbjct: 451  HCGNTLYSLLVRLNQKKGDIMAAQNILSEMIDKGLKPDFGIFIKIVKQLGKTRKKSLARD 510

Query: 1359 LENRHSKF 1382
            L  ++S F
Sbjct: 511  LRMKYSVF 518


>ref|XP_006441650.1| hypothetical protein CICLE_v10019950mg [Citrus clementina]
            gi|557543912|gb|ESR54890.1| hypothetical protein
            CICLE_v10019950mg [Citrus clementina]
          Length = 478

 Score =  285 bits (728), Expect = 5e-74
 Identities = 154/379 (40%), Positives = 232/379 (61%)
 Frame = +3

Query: 249  VQRKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFE 428
            V   ++  K V E++ ++R+   + ES+L  +   L+  S+ EI  VLN+++V  L F +
Sbjct: 92   VNGPYATPKQVSEIIELLRSGDSETESKLLSMSVSLSNASVIEILRVLNSEKVSALCFLK 151

Query: 429  WVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLAST 608
            ++   IP+ +KN+ +CSL+IDN GRL+DY TM  L  +F    + L   AFGFLPVL S+
Sbjct: 152  YMREIIPEFYKNSDICSLVIDNCGRLDDYETMRQLLNEFNVYQVCLNEKAFGFLPVLISS 211

Query: 609  DASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLL 788
             A  K+ I RV+++LN+V GSC  SGV ALIEMF    L+EMAK+VIK    K SYY +L
Sbjct: 212  KALTKKCIWRVVEVLNQVEGSCLVSGVRALIEMFSVLGLYEMAKYVIKKTERKVSYYNIL 271

Query: 789  IREKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDI 968
            I+E CR    +   D++ EMR+  C P T  YNY+LG L K+ +  +A  LL+EM   + 
Sbjct: 272  IKEMCRRCDFKGPRDLLVEMRQVGCEPITLTYNYVLGVLCKNGQDADACELLEEMLGRNC 331

Query: 969  PLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXX 1148
              DA+T+EI I  +C +G  D      +Q+V +GLQPRL+THA  IK  F          
Sbjct: 332  HPDAITYEIFIVYSCRVGKFDVAFNFFNQMVKRGLQPRLTTHAAFIKGYFSCYRYEDAYK 391

Query: 1149 XVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLR 1328
             V  S    K+SSNM+YSL+A L+ +  + + A++ L +MM+ GL+P+ S+Y +++K L 
Sbjct: 392  YVVLSADKYKSSSNMLYSLLASLHDKNNNPVMAKNVLSEMMKIGLRPNVSVYRRVLKHLH 451

Query: 1329 RSGRGNLARDLENRHSKFT 1385
             S + ++A+ L +R+S  +
Sbjct: 452  TSRQEHMAKCLSSRYSSLS 470


>ref|XP_004141361.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05670,
            mitochondrial-like [Cucumis sativus]
            gi|449498723|ref|XP_004160616.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g05670,
            mitochondrial-like [Cucumis sativus]
          Length = 494

 Score =  281 bits (718), Expect = 8e-73
 Identities = 148/376 (39%), Positives = 229/376 (60%)
 Frame = +3

Query: 249  VQRKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFE 428
            V +++     +  ++N++R N EDLES+L+  + RL    + +I E+LN  ++   RFF 
Sbjct: 109  VSKRNVTSNQLSNIINIIRENQEDLESKLDSPNVRLTNVLVGQILEMLNKHKISASRFFN 168

Query: 429  WVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLAST 608
            WV     K   N+ V SL+IDN GRL+DY  +  +  +F  + I L + AFGFL  L S 
Sbjct: 169  WVSVQSCKFPCNSDVYSLLIDNFGRLDDYEGILPVLIEFGLKGIELNHKAFGFLLPL-SN 227

Query: 609  DASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLL 788
            + S+K S+ +++ LLNE GG+CR SG+ ALIEMFC    F MAK VI++   ++S+Y ++
Sbjct: 228  EHSMKLSVVKLVKLLNEAGGTCRLSGIMALIEMFCSLGSFGMAKFVIEITEKRSSFYYII 287

Query: 789  IREKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDI 968
            +REKC+    E A   + EMR+  C P   + NYLL  L K+ + GEA +LL+EM E + 
Sbjct: 288  VREKCKQKDFEGARCTLDEMRQVGCIPDAGILNYLLSSLCKNDKFGEAHNLLEEMLEQNC 347

Query: 969  PLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXX 1148
              +++TFEI+I   C +GN+++    L  +VA GL PRLSTHA  +K+ F          
Sbjct: 348  SPNSLTFEIIICHLCKIGNIESALGYLDMMVAGGLMPRLSTHAAFVKSYFSSQRYEEAYQ 407

Query: 1149 XVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLR 1328
               DS+    T+ N  YSL+A L+ ++G+++ A+  L ++M+ GLKP F +Y +++K L+
Sbjct: 408  YAVDSSLKYVTTQNATYSLLATLHEKRGNLVDAQKILSELMDAGLKPHFHVYTRLLKKLQ 467

Query: 1329 RSGRGNLARDLENRHS 1376
              GRG+LA DL+ + S
Sbjct: 468  VQGRGDLANDLKRKIS 483


>gb|EXC01127.1| hypothetical protein L484_025500 [Morus notabilis]
          Length = 502

 Score =  280 bits (717), Expect = 1e-72
 Identities = 155/369 (42%), Positives = 224/369 (60%), Gaps = 1/369 (0%)
 Frame = +3

Query: 285  EVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 464
            EVV+M+R N   LE  LN     L    I  IFE LN+++V  LRFF WV  + P L +N
Sbjct: 122  EVVDMIRRNEISLECELNLSEVWLTVAYINRIFEALNSEKVSALRFFNWVRVSKPGLRRN 181

Query: 465  ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVL 644
            + +CSL+IDNLGRL DY +M  +  +F  E I LT +AF FLP L     SL  S+   +
Sbjct: 182  SDICSLMIDNLGRLNDYESMTCILNEFREEQICLTKNAFRFLPDLMLNKDSLMNSVTEAV 241

Query: 645  DLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIED 824
             +L  VGGSC ++GV +LIE+F    L EMA+ V+++    TSYY ++IREKCR   +E 
Sbjct: 242  KILKGVGGSCGATGVRSLIELFSSLGLLEMARFVMQLTEKNTSYYNIMIREKCRKHDLEG 301

Query: 825  AHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGE-ASSLLDEMKENDIPLDAVTFEILI 1001
            A  ++ EMR+A C P +T YN +L  L+K   IGE A  LL EMK+     D  TFEIL+
Sbjct: 302  ARGLLNEMRQAGCEPNSTSYNLVLSILYK---IGESAEVLLKEMKDMGCSPDETTFEILV 358

Query: 1002 DSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKT 1181
              +C  G+ D    ++ ++++ GL+PRL+TH  ++K  F           V DS+   + 
Sbjct: 359  LQSCKHGHFDFALGIVDEMLSFGLEPRLTTHVAIVKGYFASQRYEEAHKYVVDSSLKHRQ 418

Query: 1182 SSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDL 1361
            SSN++YSL+A LY +K   ++A + + +M+EKGL+P F +Y +++  L+RS   +LARDL
Sbjct: 419  SSNVLYSLLAGLYLKKDDTVNALNIISEMIEKGLRPHFRVYNELLGCLQRSDGTDLARDL 478

Query: 1362 ENRHSKFTS 1388
            E + S+  S
Sbjct: 479  EIKFSRLCS 487


>ref|XP_002525536.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535215|gb|EEF36894.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 430

 Score =  277 bits (708), Expect = 1e-71
 Identities = 152/405 (37%), Positives = 244/405 (60%)
 Frame = +3

Query: 156  FSWLRCGKMARCHTVSSSDKMLGSDTVVESEVQRKHSLGKIVLEVVNMVRNNGEDLESRL 335
            FS+   G + R ++VSSSD  L     + ++  R  +  K V  ++ ++     +LE++L
Sbjct: 23   FSYPGYGSL-RLYSVSSSDASLYKK--LHTDELRPKATRKQVSVIIGLLITEDNELETKL 79

Query: 336  NKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDY 515
            N L  RL+  S+  +F+VLN ++   L+FF W+    P+L  N+ +CSL+IDN G L+DY
Sbjct: 80   NSLGVRLSIGSVRWVFQVLNREKKSALQFFHWIRRWQPELEGNSDICSLVIDNCGHLDDY 139

Query: 516  ATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCA 695
              M  L   F+ + + LT  AF +L + +S +  LK++ + V+D+L E+GG+   +GV +
Sbjct: 140  KAMRCLLDGFSLQRLFLTKKAFEYLQLTSSKEELLKKATQNVVDILQEIGGTSYGTGVPS 199

Query: 696  LIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIEDAHDIIREMREAHCAPTT 875
            LIEMF     F+MAK VI+  G K SYY +LIRE CR G  + A D++ E+ +  C P+ 
Sbjct: 200  LIEMFSDLGSFDMAKFVIEKTGRKLSYYNVLIRELCRRGDFKAARDLMDEIGKEGCNPSA 259

Query: 876  TVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQ 1055
              YNY++  L K+ +  +A  +  EM++ND P DA+TFEI I ++C+ G +D   +    
Sbjct: 260  HTYNYIISSLLKNGKNADACEVFQEMQDNDCPPDALTFEIFIYNSCNEGKLDNAFEFFDD 319

Query: 1056 LVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKTSSNMIYSLMAKLYCEKGH 1235
            +VA+GL+PRL THA  IK  F           V  S  +DK SSN+ YSL+A L+ ++G+
Sbjct: 320  MVARGLEPRLLTHAAFIKGFFNSQQYEKAYKYVVGS--DDKYSSNVNYSLLANLHQKQGN 377

Query: 1236 IMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENR 1370
            ++ A + L +M++KGL+P F++++K+ K L +SG   LA  L+ +
Sbjct: 378  LVDAENILSEMIKKGLRPHFNVFMKVKKHLMKSGNEELATSLQKK 422


>ref|XP_002325952.1| hypothetical protein POPTR_0019s10460g [Populus trichocarpa]
            gi|222862827|gb|EEF00334.1| hypothetical protein
            POPTR_0019s10460g [Populus trichocarpa]
          Length = 432

 Score =  267 bits (683), Expect = 9e-69
 Identities = 151/379 (39%), Positives = 223/379 (58%), Gaps = 4/379 (1%)
 Frame = +3

Query: 255  RKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWV 434
            R  ++ K V  ++++++ +  DLE +L  L  +L+  S+T +F VLN+++V  LRFF W+
Sbjct: 54   RPKAMQKQVAYIIDLIKRDEYDLEYKLGSLSVKLSIASVTLVFHVLNSEKVSALRFFRWI 113

Query: 435  WSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDA 614
                P+L  N+ +CSL+IDN GRL+DY  M +L  +F    + LT  AF FL V+  T+ 
Sbjct: 114  RHWQPELRCNSDICSLVIDNCGRLDDYDAMRSLLNEFNENQLCLTKKAFEFLHVMNVTNE 173

Query: 615  SLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIR 794
            SL ES +RV+ LL EV GSC    V +LIEMF     F+M + V+K    K SYY + IR
Sbjct: 174  SLVESTQRVIVLLLEVRGSCYG-WVSSLIEMFSVLGSFDMVEFVMKKTERKISYYYIFIR 232

Query: 795  EKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPL 974
            E CR    +   DI  EMR+        +YNYL+  L K+    +A  +L EM++ D P 
Sbjct: 233  EMCRRCDFKGVRDIQDEMRKEGFELNARIYNYLISCLLKNGEYADACKVLTEMQDKDCPP 292

Query: 975  DAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXV 1154
            DA+TFEI I   C+ G  +       ++VA+GL+PRLSTHA  IK  F           V
Sbjct: 293  DALTFEIFIYYCCNNGKTEIACHYFDEIVARGLEPRLSTHAAFIKGFFNSEQYEEAYKYV 352

Query: 1155 ADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRS 1334
             DS    K +S M YSL+A+L+ ++G+++ A++ L +M++KGL+P F +Y+K+   L +S
Sbjct: 353  VDSDKKYKCTSCMNYSLLARLHQKRGNLVIAQNILSEMIKKGLRPYFKVYMKVFNCLNKS 412

Query: 1335 GRGNLARDLENR----HSK 1379
            GR  LA DL+ +    HSK
Sbjct: 413  GRETLATDLQEQFHQLHSK 431


>ref|XP_007153232.1| hypothetical protein PHAVU_003G017800g [Phaseolus vulgaris]
            gi|561026586|gb|ESW25226.1| hypothetical protein
            PHAVU_003G017800g [Phaseolus vulgaris]
          Length = 480

 Score =  226 bits (576), Expect = 2e-56
 Identities = 127/365 (34%), Positives = 209/365 (57%)
 Frame = +3

Query: 279  VLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 458
            V +++ ++R NG+DL  +LN ++  L+  S+ +IF++L ++RV  L+FF+W+  + P + 
Sbjct: 110  VAQIIALIRENGDDLGCKLNSMNVSLSDASVVDIFQILASERVSALQFFDWLKGSDPDIC 169

Query: 459  KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 638
             ++ + SL ++N G L +Y  M  + R F+ + + L   AFGFL  L    AS  E +K+
Sbjct: 170  CDSDLGSLFVNNCGLLGNYEAMVPVLRGFSLKGVFLGVKAFGFLLDLGLDKASSIERVKK 229

Query: 639  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLI 818
            ++ + NEVGG  +S GV  L+EMF     FE+A+ VI+  G K   Y +L++  C+ G  
Sbjct: 230  IMAVFNEVGGVYQSCGVQLLVEMFGLSGSFEIAEFVIRAAGRKVKNYHVLMKIMCKRGDC 289

Query: 819  EDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 998
            +   D+++EM  +      + YN LL  L K  +I EA  +L+ M++N    D  +F+IL
Sbjct: 290  KRVGDLVKEMERSGIDVNASTYNLLLSCLCKSGKIDEACQVLEAMEKNYGLTDVHSFDIL 349

Query: 999  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDK 1178
            I++ C     D V +LL ++  +G++P + THA VIK+ F           V  S     
Sbjct: 350  INTFCKQHQFDLVLKLLDKMTLKGIEPSILTHAAVIKSYFESGKYEEAHEYVIGSADKLS 409

Query: 1179 TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 1358
             SSN  YSL+A L+ + G+++ A   L +MM+KGLKP+FS Y KI   L + G  +L+ +
Sbjct: 410  YSSNANYSLLATLHLKNGNVLLASKVLSEMMDKGLKPNFSAYKKIRIHLEKKGEKDLSME 469

Query: 1359 LENRH 1373
            L  R+
Sbjct: 470  LSRRY 474


>ref|XP_006854920.1| hypothetical protein AMTR_s00052p00103370 [Amborella trichopoda]
            gi|548858645|gb|ERN16387.1| hypothetical protein
            AMTR_s00052p00103370 [Amborella trichopoda]
          Length = 367

 Score =  225 bits (574), Expect = 4e-56
 Identities = 129/355 (36%), Positives = 200/355 (56%)
 Frame = +3

Query: 321  LESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLG 500
            +E +LN ++ +L+   +T+I    +T     L FF W  +  P  + N++  +L I   G
Sbjct: 12   MEEKLNHMNLKLSNKVVTDILR--STPNRGALMFFNWAKTR-PGFNPNSTNYNLAISISG 68

Query: 501  RLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRS 680
             LE++  M  L    +++   LT  AF FL    S   S++ S+K +L ++  VGG C  
Sbjct: 69   LLENFELMLLLMEGLSSKGHCLTVTAFSFL----SRSPSIQNSVKEILSIIRRVGGPCLK 124

Query: 681  SGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIEDAHDIIREMREAH 860
            SGV  LI   C  + FE+A  V++  G KTSYY +LI  KCRNG  E+A   + EM+  H
Sbjct: 125  SGVYYLISSLCDLNCFELAILVMEEMGKKTSYYNVLIAAKCRNGEFEEAKVALDEMKGLH 184

Query: 861  CAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVH 1040
                T  +NYLLG L K  R+ EA  LL+ M++     D +TFE++   AC +G MD+  
Sbjct: 185  YGINTGSFNYLLGSLCKKGRVAEACQLLEAMEDLGCYPDEITFEVMAYHACRMGKMDSAL 244

Query: 1041 QLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKTSSNMIYSLMAKLY 1220
            + L++++ +GL+PR +T+A  IK  F           V + +  D  S+NM YSL++ L 
Sbjct: 245  EFLNKMILEGLKPRFTTYAAFIKGYFFVGEVQNAHKFVLEMSEKDNCSANMNYSLLSSLL 304

Query: 1221 CEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHSKFT 1385
             + G I+ A + LV+M++KGLKP+F +++K+VK L  +G   +  DL+ R SKFT
Sbjct: 305  RKSGKIVEAHAILVEMIDKGLKPNFPVFIKVVKDLSHAGFREMGLDLKCRFSKFT 359


>ref|XP_006603965.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like isoform X2 [Glycine max]
            gi|571554333|ref|XP_003554979.2| PREDICTED:
            pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 465

 Score =  224 bits (570), Expect = 1e-55
 Identities = 127/374 (33%), Positives = 217/374 (58%)
 Frame = +3

Query: 255  RKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWV 434
            R H+  +   +++ ++R + ++L S+LN ++  L+  S+ +IF++L +++V  L+FF+++
Sbjct: 87   RPHATSEQFYQIIALIREDVDELGSKLNSMNVSLSDASVVDIFQILASEKVSALQFFDFL 146

Query: 435  WSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDA 614
              + P+L  +  + SL I+N G L +Y  M  +   F++  + L   AFGFL  L    A
Sbjct: 147  KGSDPELCCDPDIGSLFINNCGLLGNYEAMVPVLSGFSHRRVFLGMKAFGFLLDLGLDKA 206

Query: 615  SLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIR 794
            S  E +++V+ + N+VGG  +S GV  LIEMF     FE+A+ VI+  G K  +Y +L+R
Sbjct: 207  SSMECVRKVMAVFNKVGGMYQSCGVQLLIEMFGLSGSFEIAEFVIRTAGRKVKHYHVLMR 266

Query: 795  EKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPL 974
              C++G  +   D+++EM+ + C    + YN LL  L K+ +I EA  LL+ M++N    
Sbjct: 267  ILCKSGDCKRVSDLVKEMKRSGCDMDVSTYNLLLSCLCKNGKIDEAWQLLEAMEKNYGLT 326

Query: 975  DAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXV 1154
            +A +F+ILI+  C     D+V +LL ++  +G++P + THA +IK+ F           V
Sbjct: 327  NAHSFDILINFLCKRRQFDSVLKLLDKMFLKGIEPSILTHAAIIKSYFESGKYEEAHEYV 386

Query: 1155 ADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRS 1334
              S      SSN  Y L+A L  + G+++ A   L +MM+KGLKP+FS+Y KI K L + 
Sbjct: 387  IGSANRLSYSSNANYGLLATLQLKNGNVLLACKVLSEMMDKGLKPNFSVYKKIRKHLEKK 446

Query: 1335 GRGNLARDLENRHS 1376
               +L+ +L  R+S
Sbjct: 447  DEKDLSLELLRRYS 460


>ref|XP_004513354.1| PREDICTED: pentatricopeptide repeat-containing protein At1g62670,
            mitochondrial-like isoform X1 [Cicer arietinum]
            gi|502164974|ref|XP_004513355.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g62670,
            mitochondrial-like isoform X2 [Cicer arietinum]
          Length = 471

 Score =  220 bits (561), Expect = 1e-54
 Identities = 122/373 (32%), Positives = 211/373 (56%)
 Frame = +3

Query: 255  RKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWV 434
            + ++  K V E++ ++     DL+ RLN ++  L+  S+  IF+ L ++RV  L FF+W+
Sbjct: 93   KPYATSKQVSEIIRLICEGVNDLDYRLNMMNVSLSMSSVIYIFDKLASERVSALLFFDWL 152

Query: 435  WSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDA 614
              +  +L  +  +  LI++N G + ++  M  +  +F  + + L   AF FL VL     
Sbjct: 153  NVSHTELCCDPEIGGLIVENCGLVGNFDAMVAILNEFNRKKMCLGRRAFRFLVVLRLDKD 212

Query: 615  SLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIR 794
            S  E ++RV+D+LN+VGG CR+SGV  LIE+FC    F+MA+ VI+  G K ++Y  L+R
Sbjct: 213  SSMECVRRVIDVLNKVGGVCRNSGVQLLIEIFCFSGSFDMAEFVIEEAGRKVNHYNFLLR 272

Query: 795  EKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKENDIPL 974
              C+ G  E   D++++M+ +   P  + Y+ L+  L+          +++ M+++D   
Sbjct: 273  MMCKRGDFERVCDLVKKMKRSGAEPNGSTYSLLVSCLFNIDNFVGTCQVIETMEKDDGLP 332

Query: 975  DAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXV 1154
            D  TF+ LI  +C  G +D   + L ++  +G++P   THA VIK  F           V
Sbjct: 333  DEFTFDTLIRLSCKHGQIDLALKFLDKMTLKGIEPCSLTHAAVIKFYFESGKYDAAYEYV 392

Query: 1155 ADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRS 1334
            ADS      SSN  Y+L+A L+ +KG+++ ++  L +MM+KGLKP++S+Y K+ K L + 
Sbjct: 393  ADSAGKYSYSSNENYTLLASLHLKKGNVLLSQRILYEMMDKGLKPNYSVYTKVRKRLEKK 452

Query: 1335 GRGNLARDLENRH 1373
             R +L+ +L  R+
Sbjct: 453  NRKDLSLELSRRY 465


>ref|XP_007222181.1| hypothetical protein PRUPE_ppa009631mg [Prunus persica]
            gi|462419117|gb|EMJ23380.1| hypothetical protein
            PRUPE_ppa009631mg [Prunus persica]
          Length = 284

 Score =  216 bits (549), Expect = 3e-53
 Identities = 112/280 (40%), Positives = 173/280 (61%)
 Frame = +3

Query: 531  LFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMF 710
            +   F +  I LT +AF F+    S  +S K S+ +V+++LNEVGGSCR  G+ +LIEM 
Sbjct: 4    IMNDFRSAGICLTRNAFEFI----SVSSSKKASVIKVVEVLNEVGGSCRPVGLLSLIEML 59

Query: 711  CKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIEDAHDIIREMREAHCAPTTTVYNY 890
                 F+MA+ V+K+   K SYY ++IRE CR      A D++ EMR+  C P +  YNY
Sbjct: 60   SVKGSFKMAEFVMKITERKRSYYNIMIRESCRRRNFGRAIDMLDEMRQVGCDPDSKTYNY 119

Query: 891  LLGRLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQG 1070
            +L  L+K+ +   A+ L ++M E +   D +T+EILI  +C +GN D   +LL  +V +G
Sbjct: 120  ILSSLYKNYKSAVATKLFEQMLEMNCSPDEITYEILICYSCKVGNFDFARKLLDSMVLKG 179

Query: 1071 LQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTANDKTSSNMIYSLMAKLYCEKGHIMSAR 1250
            ++PRL++HA  +K  F           V DS+      SN +YSL+A+LY  +G+++ A+
Sbjct: 180  IKPRLTSHAAFVKGYFNLRRYKEAYEHVVDSSVKYSCFSNSVYSLLARLYMNEGNVVIAQ 239

Query: 1251 STLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENR 1370
            + L+DM+ KGLKPDF++Y K++K L ++GR  LA DL +R
Sbjct: 240  NILIDMINKGLKPDFAVYTKVLKELSKTGRTGLAEDLSSR 279


>ref|XP_004301448.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like
            [Fragaria vesca subsp. vesca]
          Length = 503

 Score =  214 bits (545), Expect = 9e-53
 Identities = 129/401 (32%), Positives = 216/401 (53%), Gaps = 1/401 (0%)
 Frame = +3

Query: 189  CHTVSSSDKMLGSDTVVESEVQRKHSLGKIVLEVVNMVRNNGEDLESRLNKLHPRLNTYS 368
            C+  SSS    G   V  +E+ R  +  K V +++ M+R    DLES++  ++  LN   
Sbjct: 106  CYGTSSSVNQCGFRNVGVNEL-RYFASHKQVRDILGMIRRKDNDLESKVRSMNVSLNLKM 164

Query: 369  ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFA 548
             +  FE LN Q+   L   +W+    P    +  V S++IDNLGRL +Y  M  +  +FA
Sbjct: 165  CSRFFEELNNQKRDALAVCDWIRYAHPSCRND--VHSMVIDNLGRLGNYDAMARVLSEFA 222

Query: 549  NENISLTYDAFGFLPVLASTDASLKES-IKRVLDLLNEVGGSCRSSGVCALIEMFCKFHL 725
             + + L   AF F+ +     + LKE+ + +V+D+L  V    R SG+C+LI+MF     
Sbjct: 223  KKKVRLVPLAFEFVSL-----SPLKEATVMKVVDVLKAVEEPTRGSGLCSLIKMFSAVGS 277

Query: 726  FEMAKHVIKVEGSKTSYYCLLIREKCRNGLIEDAHDIIREMREAHCAPTTTVYNYLLGRL 905
            F+MA+ V+++   K ++Y +++ EKC  G  E A D++  MR+    P   +YNY+L  L
Sbjct: 278  FDMAELVMQLSERKATFYKIMVVEKCGKGDFEGAADLVEVMRKHGLKPEAKIYNYVLSTL 337

Query: 906  WKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRL 1085
             KH +  EA +L +EM  ++   D +T+EI +  +C  GN D   +LL ++ AQG++PR+
Sbjct: 338  CKHDKSAEAGALFEEMLASECAPDPITYEIFVCHSCKAGNFDLARKLLDRMNAQGIEPRV 397

Query: 1086 STHAYVIKNLFXXXXXXXXXXXVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVD 1265
            S H  ++K  F           V     +       I S +A+LY +  +++ A + L++
Sbjct: 398  SMHGVILKGYFNLKRFEEAYEYV--MACDRYICFPAICSTLARLYVKADNVIVAHNLLLE 455

Query: 1266 MMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHSKFTS 1388
            ++++GL+PD S+Y  + + L  +GR  LA DL +R S   S
Sbjct: 456  LIDRGLRPDQSVYTNVFRRLIDTGRTALAEDLRSRLSSIGS 496


>ref|NP_001167893.1| hypothetical protein [Zea mays] gi|223944699|gb|ACN26433.1| unknown
            [Zea mays] gi|414864420|tpg|DAA42977.1| TPA: hypothetical
            protein ZEAMMB73_690405 [Zea mays]
          Length = 430

 Score =  214 bits (545), Expect = 9e-53
 Identities = 128/374 (34%), Positives = 200/374 (53%), Gaps = 6/374 (1%)
 Frame = +3

Query: 288  VVNMVRNNGEDLESRLNKLHPRLNTYS-ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 464
            +V +V   G  LE+ L++L   + ++  ++ +   L  + VP  RFF W  S        
Sbjct: 45   IVCLVVAGGGGLEADLDRLFSAVLSHGLVSSVLRALTDRGVPAERFFAWASSLGRGFSPG 104

Query: 465  ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLA--STDASLKESIKR 638
                +L+++N GRL+DY  M       +   +SLT  AF FL   +  S   S++++ + 
Sbjct: 105  PRAYNLLVENAGRLDDYGAMSRALALMSERRLSLTDRAFAFLAPSSGSSRSGSVEDAARA 164

Query: 639  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLI 818
            VL  L+ VGG CR+SGV +L++       F+ A  VI+    K  YY +L+  KC+ G  
Sbjct: 165  VLRALDGVGGPCRASGVFSLVKALASIGEFDAAVLVIEETARKVRYYNVLVAAKCKAGDF 224

Query: 819  EDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEM---KENDIPLDAVTF 989
              A ++  EMR +   P    +NYLLG L K  R+ EA  L++ M   K ++IP  ++T+
Sbjct: 225  VGAREVFDEMRRSGSDPDANTWNYLLGCLLKKGRLAEACGLVEAMERLKRSEIP-SSLTY 283

Query: 990  EILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTA 1169
            EIL   AC  G MD+  Q+L Q+ ++ L PR++ H+  IK  F           V+D + 
Sbjct: 284  EILTYHACKAGKMDSAMQILDQMFSENLTPRITIHSAFIKGYFYAGRIEDACKYVSDMST 343

Query: 1170 NDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNL 1349
             D+ S N  YSL+AKL  + G  + A   L ++MEKGL+PD S Y+K+ K L + G+GNL
Sbjct: 344  RDRHSVNRNYSLLAKLLWKSGRTIDAGRVLYELMEKGLRPDHSAYVKVAKDLHKMGKGNL 403

Query: 1350 ARDLENRHSKFTSN 1391
            A +L+    +F+ N
Sbjct: 404  ACELKMMFQRFSVN 417


>ref|XP_004985951.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Setaria italica]
          Length = 419

 Score =  209 bits (533), Expect = 2e-51
 Identities = 125/370 (33%), Positives = 197/370 (53%), Gaps = 4/370 (1%)
 Frame = +3

Query: 288  VVNMVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNA 467
            +V +V   G  LE+  ++L P L+   +      L    VP  RFF W  S       +A
Sbjct: 45   IVRLVAAGGSSLEADFDRLDPALSHALVARTLRALTDSGVPAERFFAWA-SLRRGFSPSA 103

Query: 468  SVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFL-PVLASTDASLKESIKRVL 644
               +L+I+N G+L DY  M       +   + LT  AF FL P  +S  + ++++ + VL
Sbjct: 104  HAHNLLIENAGKLADYRAMSRALALMSQRRLPLTDRAFAFLAPSGSSRSSCVEDAARAVL 163

Query: 645  DLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLIED 824
             +L++VGG CR+SGV +L++       F+ A  VI+       Y+ +++  KC+ G    
Sbjct: 164  RVLDDVGGPCRASGVFSLVKALASTGEFDAAVSVIEETRRMARYFNVVVAAKCKAGNFVG 223

Query: 825  AHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEM---KENDIPLDAVTFEI 995
            A ++  EMR++  AP    +N LLG L K+ R+ EA  L++ M   K  ++P D++T+EI
Sbjct: 224  AREVFDEMRKSGSAPNANTWNCLLGCLLKNGRLAEACGLVESMERSKPGEVP-DSLTYEI 282

Query: 996  LIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTAND 1175
            L   AC  G MD+  Q+L Q+ +  L PR++ H+  IK  F           V D +  D
Sbjct: 283  LTYHACKAGKMDSAMQILDQMFSANLTPRITIHSAFIKGYFYAGRIEDAQKYVDDMSTRD 342

Query: 1176 KTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLAR 1355
            + S N  YSL+AKL  + G  + A   L ++MEKGL+PD S Y+K+ K L + GRG+LA 
Sbjct: 343  RHSVNRNYSLLAKLLRKSGRTIDAGRVLYELMEKGLRPDHSAYVKVAKDLYKMGRGDLAS 402

Query: 1356 DLENRHSKFT 1385
            +L+    +F+
Sbjct: 403  ELKLMFQRFS 412


>ref|XP_002468625.1| hypothetical protein SORBIDRAFT_01g049260 [Sorghum bicolor]
            gi|241922479|gb|EER95623.1| hypothetical protein
            SORBIDRAFT_01g049260 [Sorghum bicolor]
          Length = 422

 Score =  203 bits (516), Expect = 2e-49
 Identities = 124/372 (33%), Positives = 196/372 (52%), Gaps = 6/372 (1%)
 Frame = +3

Query: 288  VVNMVRNNGEDLESRLNKLHPRLNTYS-ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 464
            +V +V   G  LE+ L++L     ++  ++     L    VP  RFF W  S        
Sbjct: 45   IVRLVAAGGGGLEADLDRLFAATLSHGLVSSALRALTDSGVPAERFFAWASSLGRGFSPG 104

Query: 465  ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLA--STDASLKESIKR 638
                +L+++N GRL D   M       +   + LT  AF FL + +  S   S+++S   
Sbjct: 105  PRAHNLLVENTGRLGDCGAMSRALALMSERMLPLTDRAFAFLALSSGSSRSGSVEDSTTS 164

Query: 639  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCRNGLI 818
            VL  L+ VGG CR+SGV +L++       F+ A  VI+    K  YY +L+  KC+ G  
Sbjct: 165  VLRALDGVGGPCRASGVFSLVKALASIGEFDAAVSVIEETTRKVRYYNVLVAAKCKAGDF 224

Query: 819  EDAHDIIREMREAHCAPTTTVYNYLLGRLWKHSRIGEASSLLDEMKE---NDIPLDAVTF 989
              A ++  EMR++   P    +NYLLG L K+ R+ EA  L++ M+    ++IP +++T+
Sbjct: 225  VGAREVFDEMRKSGSDPDANTWNYLLGCLLKNGRLAEACGLVEAMERLKCSEIP-NSLTY 283

Query: 990  EILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXXVADSTA 1169
            EIL   AC  G MD+  Q+L+Q+ ++ L PR++ H+  IK  F           V D + 
Sbjct: 284  EILTYHACKAGKMDSAMQILNQMFSENLTPRITIHSAFIKGYFYAGRIEDACKYVNDMST 343

Query: 1170 NDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNL 1349
             D+ S N  YSL+AKL  + G  + A   L ++M+KGL+PD S Y+K+ K L + GRG+L
Sbjct: 344  RDRHSVNRNYSLLAKLLRKSGRTVDAGRVLYELMDKGLRPDHSAYVKVAKDLHKMGRGDL 403

Query: 1350 ARDLENRHSKFT 1385
            A +L+    +F+
Sbjct: 404  ASELKMMFQRFS 415


Top