BLASTX nr result

ID: Mentha28_contig00024509 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00024509
         (1890 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22729.1| hypothetical protein MIMGU_mgv1a019695mg, partial...   473   e-130
gb|EPS72967.1| hypothetical protein M569_01792 [Genlisea aurea]       352   4e-94
ref|XP_003634944.1| PREDICTED: pentatricopeptide repeat-containi...   322   4e-85
ref|XP_007019869.1| Pentatricopeptide repeat superfamily protein...   294   8e-77
ref|XP_004243688.1| PREDICTED: pentatricopeptide repeat-containi...   292   3e-76
ref|XP_006353779.1| PREDICTED: pentatricopeptide repeat-containi...   288   6e-75
ref|XP_006441650.1| hypothetical protein CICLE_v10019950mg [Citr...   281   6e-73
ref|XP_004141361.1| PREDICTED: pentatricopeptide repeat-containi...   279   3e-72
ref|XP_002525536.1| pentatricopeptide repeat-containing protein,...   278   8e-72
gb|EXC01127.1| hypothetical protein L484_025500 [Morus notabilis]     275   4e-71
ref|XP_002325952.1| hypothetical protein POPTR_0019s10460g [Popu...   266   3e-68
ref|XP_007153232.1| hypothetical protein PHAVU_003G017800g [Phas...   226   3e-56
ref|XP_006854920.1| hypothetical protein AMTR_s00052p00103370 [A...   225   6e-56
ref|XP_006603965.1| PREDICTED: pentatricopeptide repeat-containi...   224   1e-55
ref|XP_004513354.1| PREDICTED: pentatricopeptide repeat-containi...   219   3e-54
ref|XP_007222181.1| hypothetical protein PRUPE_ppa009631mg [Prun...   215   5e-53
ref|NP_001167893.1| hypothetical protein [Zea mays] gi|223944699...   214   8e-53
ref|XP_004301448.1| PREDICTED: pentatricopeptide repeat-containi...   214   1e-52
ref|XP_004985951.1| PREDICTED: pentatricopeptide repeat-containi...   210   2e-51
ref|XP_002468625.1| hypothetical protein SORBIDRAFT_01g049260 [S...   202   4e-49
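
Note on reading the summary table above: each row ends with a bit score and an E-value, and this BLAST version abbreviates 1e-130 as "e-130", which most numeric parsers reject. The following is a minimal sketch, not part of the BLAST output, of how one row could be split into fields. The names SUMMARY_LINE, parse_evalue and parse_summary_line are hypothetical, and the regular expression assumes the row layout seen in this report (identifier, description, integer bit score, E-value).

```python
import re

# Assumed row layout: identifier, free-text description, integer bit score, E-value.
SUMMARY_LINE = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d*\.?\d*e-\d+|\d+\.?\d*)\s*$"
)


def parse_evalue(text):
    """Convert an E-value string to float, restoring the leading 1 in forms like 'e-130'."""
    return float("1" + text if text.startswith("e") else text)


def parse_summary_line(line):
    """Return a dict for one summary row, or None if the line is not a hit row."""
    match = SUMMARY_LINE.match(line)
    if match is None:
        return None
    seq_id, description, bits, evalue = match.groups()
    return {
        "id": seq_id,
        "description": description,
        "bits": int(bits),
        "evalue": parse_evalue(evalue),
    }
```

Descriptions ending in "..." are truncated by BLAST in this table; the full description appears on the ">" line of the corresponding alignment.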

>gb|EYU22729.1| hypothetical protein MIMGU_mgv1a019695mg, partial [Mimulus guttatus]
          Length = 531

 Score =  473 bits (1218), Expect = e-130
 Identities = 255/461 (55%), Positives = 317/461 (68%), Gaps = 27/461 (5%)
 Frame = -2

Query: 1460 SLC----GLPLKLK---------IDIWTLSSHISPNVCAGFSWLRCGKMARCYTVSSSDK 1320
            SLC    GLP K++          D  TL    S   C+    LR G   RC+T++SSD+
Sbjct: 85   SLCYFQSGLPSKIRSSTHSVLHYFDSSTLLKRESHYRCSNSERLRFG---RCFTIASSDE 141

Query: 1319 MLGSDTVVESEAQ--------------RKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPR 1182
               S  V+E+E +              R     K VLE+V I+RNNG DLESRL+ LH  
Sbjct: 142  TFTSSPVMENEVRSVANICSDQQKFEKRNKFPQKFVLEIVDILRNNGADLESRLSMLHSN 201

Query: 1181 LNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTL 1002
            L+ YSITEIFEVLN++R+ GLR  EW+WSN P+LHKNA +CSLIIDN GRL+DY  +   
Sbjct: 202  LSVYSITEIFEVLNSRRISGLRLVEWIWSNKPQLHKNAHICSLIIDNFGRLDDYENISVW 261

Query: 1001 FRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFC 822
            F+KF++E I LTY+AF FLPVLA  ++SL+ES  RV+DLLN++G           +EMFC
Sbjct: 262  FKKFSSEKICLTYEAFAFLPVLAPENSSLRESATRVVDLLNKIG-----------VEMFC 310

Query: 821  KFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTTTVYNYL 642
            K  LFEMAK+VIK+  SK +YYC+LIREKC +GLIEDAH IIREM  A+C P TT+YNYL
Sbjct: 311  KLDLFEMAKYVIKITESKNAYYCILIREKCRSGLIEDAHSIIREMGNANCVPNTTIYNYL 370

Query: 641  LGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGL 462
            LGSLWK+ R+ +AS+LLDEMKE  IP D +TFEILI+  C  G MD VH LL ++ +QG+
Sbjct: 371  LGSLWKNGRMDKASALLDEMKEIGIPRDEITFEILINFVCRFGEMDEVHHLLDEMTSQGI 430

Query: 461  QPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARS 282
            +PR+STHA ++K LF          YV D +   KTSSNM+YSLMA LY EKG IMSA++
Sbjct: 431  EPRISTHACIVKTLFAAEKYEAAHKYVVDFSVIYKTSSNMMYSLMANLYWEKGDIMSAKN 490

Query: 281  TLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHS 159
             LV+MMEKGLKP+FSIYLKIVK +RRSG  NLARDLE+ HS
Sbjct: 491  ILVEMMEKGLKPNFSIYLKIVKRIRRSGSTNLARDLESLHS 531
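
Note on the coordinates in the alignment above: with Frame = -2 the query is translated from the reverse strand, so Query positions are nucleotide positions that count down, while Sbjct positions are amino-acid positions that count up. Each non-gap residue on a Query line accounts for three nucleotides. The sketch below, not part of the BLAST output, checks that relationship for one alignment row; the function names are hypothetical.

```python
def query_nt_span(aligned_query):
    """Nucleotides covered by one aligned Query row: 3 per residue, gaps ('-') excluded."""
    residues = sum(1 for ch in aligned_query if ch != "-")
    return 3 * residues


def coords_consistent(start, aligned_query, end):
    """True if the printed start/end coordinates match the residue count of the row."""
    return abs(start - end) + 1 == query_nt_span(aligned_query)
```

For the last Query row of this alignment, 281 down to 159 is 123 nucleotides, which matches its 41 residues.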


>gb|EPS72967.1| hypothetical protein M569_01792 [Genlisea aurea]
          Length = 447

 Score =  352 bits (902), Expect = 4e-94
 Identities = 182/383 (47%), Positives = 269/383 (70%), Gaps = 4/383 (1%)
 Frame = -2

Query: 1289 EAQRKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFF 1110
            + ++K    K V E+V I+RN+G+DLESRL KL PRL+ Y+ITEIFE LNTQRV GL+ F
Sbjct: 61   DTRKKRVQCKFVCEIVSILRNDGKDLESRLIKLAPRLSLYTITEIFEALNTQRVSGLKLF 120

Query: 1109 EWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLAS 930
             W+ +N PKLHK+A VCSL+IDNLGRL  Y TM  + ++F++ NI LTY+AFGFLPV AS
Sbjct: 121  IWIRNNSPKLHKSARVCSLLIDNLGRLGAYDTMLLMLKEFSSHNICLTYEAFGFLPVSAS 180

Query: 929  TD-ASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYC 753
            T+ +SL ES KRV+D LN+ GGSCR+SG+ AL+EMF    +F MA++V+K+   +  Y+ 
Sbjct: 181  TESSSLAESTKRVVDFLNQAGGSCRNSGLYALVEMFSALDMFHMARYVMKITEIRRVYFT 240

Query: 752  LLIREKCSNGLIEDAHDIIREMREA-HCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKE 576
            ++IRE C   L EDA  +I+EM E+    P   +YN++LGSL + SRI EAS +   M+E
Sbjct: 241  VMIREMCKRDLFEDAIRLIKEMEESTRFFPDANIYNHILGSLLRTSRIDEASKIFSRMRE 300

Query: 575  NDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLF-XXXXXX 399
             D+  D +T+EILI+S CSLG ++   +LL ++ + G++PR+ THA +IK +F       
Sbjct: 301  LDVKPDGITYEILINSHCSLGRLEDAKRLLDEMGSIGIEPRIETHALIIKAMFATGEEYE 360

Query: 398  XXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMME-KGLKPDFSIYLKI 222
                +V + +   +TS+N +++LMA L+C++G ++ A   L +MM+ + LKPDF++Y+ +
Sbjct: 361  GLRKHVVEYSGVYRTSANTMHTLMADLHCKRGDVLRAAEVLSEMMKCRNLKPDFNVYMDV 420

Query: 221  VKLLRRSGRGNLARDLENRHSKF 153
            V  L+R+ R +LA DL++ +S+F
Sbjct: 421  VMKLKRARRVDLAWDLQSMYSRF 443


>ref|XP_003634944.1| PREDICTED: pentatricopeptide repeat-containing protein At5g16420,
            mitochondrial-like [Vitis vinifera]
          Length = 582

 Score =  322 bits (825), Expect = 4e-85
 Identities = 198/532 (37%), Positives = 298/532 (56%), Gaps = 31/532 (5%)
 Frame = -2

Query: 1652 SSVESRPLPKNPVLS-----VTSSGSCSILHSRLPSFRIKLXXXXXXXXXXXXXXSIPFL 1488
            +SVES  L KNP  +     + SSGS +++ S  PS  +                    L
Sbjct: 50   TSVESNQLYKNPASAYQFRLILSSGS-ALISSPKPSLCLSQGYSYLCADRDVYHSRRVVL 108

Query: 1487 -PVKNHFLGHSLCGLPLKLKIDIWTLSSHISPNV-----CAGFSWLRCGKMA-------- 1350
             P KN    ++  G    +     + S   S  V     C+   W  C ++         
Sbjct: 109  NPEKNRIQWNTTSGSQFSIMFSSGSSSFFQSQKVSHFYACSSILWNTCARLNVNSLKLST 168

Query: 1349 ----RCYTVSSSDKMLGS--------DTVVESEAQRKHSLGKIVLEVVYIVRNNGEDLES 1206
                R  ++SS D   GS        D +V + + ++ S      E++ ++R++  D+E 
Sbjct: 169  QTSFRLLSISSFDNCSGSFEFGNKGIDELVPNMSPKRIS------EIIKVIRSDEIDMEV 222

Query: 1205 RLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLE 1026
            +LN ++ RL+  S+TEIF VLN +R+  +RFFEW+  +   L +N  +CSLIIDN GRL 
Sbjct: 223  KLNLMNLRLSVASVTEIFRVLNLERLSAMRFFEWISHSRSGLSRNYDICSLIIDNCGRLG 282

Query: 1025 DYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGV 846
            DY TM  L + F ++ + LT  AFGF+PV   + AS+ + +++++++L++VGG CR SG+
Sbjct: 283  DYETMRCLLKDFNSKRVCLTSKAFGFVPVFTLSKASIMDFVRKLIEVLDDVGGVCRRSGL 342

Query: 845  CALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAP 666
              LIEMF     FEMAK V+++   KTSYY +L+RE C     ++A D++ EMR   C P
Sbjct: 343  FGLIEMFSVSGSFEMAKFVMEITERKTSYYNILVREMCRKCNFKEARDLLDEMRLFGCRP 402

Query: 665  TTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLL 486
                YNYLL SL K++R  EA ++L+EM+E   P DA+TFEI I     LG +D   + L
Sbjct: 403  NAKTYNYLLSSLCKNNRDDEACNVLEEMQEAGCPPDALTFEIFIYYTYRLGKLDFAIKFL 462

Query: 485  HQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEK 306
             Q+V++GL+PRL+THA  IK  F          YV DS    K  SNMIYSL+A L+   
Sbjct: 463  DQMVSRGLEPRLTTHAAFIKGYFHSRRYEEAYEYVVDSGVTYKWPSNMIYSLLASLHQRN 522

Query: 305  GHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHSKFT 150
            G+++SA+  L++M+EKGLKP+FS+Y ++++ L +SGR +LA DL +R S  +
Sbjct: 523  GNLISAQKILIEMIEKGLKPNFSVYKRVLEHLDKSGREDLAGDLRSRFSSLS 574


>ref|XP_007019869.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508725197|gb|EOY17094.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 494

 Score =  294 bits (753), Expect = 8e-77
 Identities = 151/374 (40%), Positives = 236/374 (63%)
 Frame = -2

Query: 1262 KIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPK 1083
            K  LEVV ++R+   DLES+L+ ++  L+  S+  IF +LN ++V  LRFF W+  + P+
Sbjct: 116  KQALEVVSLIRSGQNDLESKLDGMNVSLSEASLNTIFRILNNEKVSALRFFYWIRESHPQ 175

Query: 1082 LHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESI 903
             + N+ +CSL+IDN GRL+D+ +  +L   F    I L + AFGF+PV+ S+ A+ K+SI
Sbjct: 176  FYHNSDICSLVIDNCGRLDDFDSAASLLNDFKLHGIRLNHRAFGFVPVMISSKAATKKSI 235

Query: 902  KRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNG 723
             +V+++LN +GGSC  SG+ ALIEM C    FEMAK+VI     + S Y +LIR +C  G
Sbjct: 236  CKVVEVLNRIGGSCSVSGIHALIEMLCALESFEMAKYVIAKAEKRLSNYNILIRGQCRKG 295

Query: 722  LIEDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFE 543
              E A +I+  M +  C P +  +N +L  L K+ ++ EA  LL++M E+  PLDA+TFE
Sbjct: 296  DFEGAREILDWMIKVGCNPNSQTFNNILSCLCKNDKVAEACQLLEQMLESGCPLDALTFE 355

Query: 542  ILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTAN 363
            I I   C LG +D   + L+++ + G++PR++THA  +K  F          YV   +  
Sbjct: 356  IFICYYCGLGRLDMAFEWLNKMDSSGIEPRITTHAAFVKGYFKLQQYEEAHNYVVVCSDK 415

Query: 362  DKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLA 183
             K +SN++YSL+A L+ ++G  + A+S L +M+EKGLKP+F++Y+ + K L++SGR +LA
Sbjct: 416  YKQASNIVYSLLASLHRKRGKPVIAQSILSEMIEKGLKPNFAVYMTVTKQLQKSGREDLA 475

Query: 182  RDLENRHSKFTSNP 141
             +L +  S   S P
Sbjct: 476  GNLRSSFSSLISQP 489


>ref|XP_004243688.1| PREDICTED: pentatricopeptide repeat-containing protein At1g12775,
            mitochondrial-like [Solanum lycopersicum]
          Length = 525

 Score =  292 bits (748), Expect = 3e-76
 Identities = 160/368 (43%), Positives = 230/368 (62%)
 Frame = -2

Query: 1256 VLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 1077
            VLE+V I+R  G+++  +L  +  +L+   + EIF++LN QR+ GL+FF W+  + P+ H
Sbjct: 142  VLEIVEIIRGGGQNVRQQLILVASKLSFKCVVEIFDLLNEQRISGLKFFNWLRDSHPEFH 201

Query: 1076 KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 897
            ++A V SLII N G L+DY TM +L  +F  E   LT  AFGFL V  S   SL  S K+
Sbjct: 202  RSAYVNSLIICNCGWLDDYKTMFSLLEEFKTEQTCLTDKAFGFLTVFGSCKDSLMNSTKK 261

Query: 896  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLI 717
            V+D+L EVGGSC  SGV  LIEMFC   LFEMA  VI++     S Y +LIR++C  G I
Sbjct: 262  VVDMLIEVGGSCCGSGVYGLIEMFCSLDLFEMATFVIEITERTASRYNILIRKRCRAGQI 321

Query: 716  EDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 537
            E A  II EM E  C+P T  YNYLLGSL K+ ++ +   +L+EM+   +  DA+TFE L
Sbjct: 322  EKARAIIEEMSEFGCSPNTKSYNYLLGSLCKNDKLEDVRIVLEEMRNKGLNPDAITFETL 381

Query: 536  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDK 357
            +    S G ++   + ++ +V   ++PR +THA  +K L           YV D +A   
Sbjct: 382  VYHLSSRGQVEFASEFMNLMVNVNVKPRSTTHAAFLKVLLEAGEREKAYKYVIDMSAKYN 441

Query: 356  TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 177
             S N +YSL+ +L  +KG IM+A++ L +M++KGLKPDF I++K VK L ++ R +LARD
Sbjct: 442  HSVNTLYSLLVRLNQKKGDIMAAQNILNEMIDKGLKPDFGIFIKFVKQLGKTRRKSLARD 501

Query: 176  LENRHSKF 153
            L+ ++S F
Sbjct: 502  LKMKYSVF 509


>ref|XP_006353779.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            isoform X1 [Solanum tuberosum]
            gi|565374458|ref|XP_006353780.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g09900-like isoform X2 [Solanum tuberosum]
            gi|565374460|ref|XP_006353781.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g09900-like isoform X3 [Solanum tuberosum]
          Length = 537

 Score =  288 bits (737), Expect = 6e-75
 Identities = 157/368 (42%), Positives = 228/368 (61%)
 Frame = -2

Query: 1256 VLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 1077
            +LE++ I++  GEDL  +L  +  +L+   +  IF++LN QR+ GL FF W+  + P+ H
Sbjct: 151  ILEIIEIIKGGGEDLRQQLILVASKLSFKCVIGIFDLLNEQRISGLNFFNWLRDSHPEFH 210

Query: 1076 KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 897
             +A V SLII N G L+DY TM +L  +F  E   LT  AFGFL V  S   SL  S K+
Sbjct: 211  CSAYVNSLIICNCGWLDDYKTMFSLLEEFKAEQTCLTDKAFGFLTVFGSCKDSLMNSTKK 270

Query: 896  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLI 717
            V+D+LNEVGGSC  SGV  LIEM+C   LFEMA  VI++     S Y +LIR++C  G I
Sbjct: 271  VVDMLNEVGGSCCGSGVHGLIEMYCCLDLFEMATFVIEITERTASRYNILIRKRCRAGQI 330

Query: 716  EDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 537
            E A  II+EM E  C+P T  YNYLLGSL K+ ++     +L+EM+   +  DA+TFE L
Sbjct: 331  EKARAIIKEMSEFGCSPNTKSYNYLLGSLCKNDKLEYVRIVLEEMRNKGLNPDAITFETL 390

Query: 536  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDK 357
            +  + S G ++   + ++ ++   ++PR +THA  +K L           YV D +A   
Sbjct: 391  VYHSSSRGQVEFASEFMNLMINVNVEPRSTTHAAFLKVLLEAGEREKAYKYVIDMSAKYN 450

Query: 356  TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 177
               N +YSL+ +L  +KG IM+A++ L +M++KGLKPDF I++KIVK L ++ + +LARD
Sbjct: 451  HCGNTLYSLLVRLNQKKGDIMAAQNILSEMIDKGLKPDFGIFIKIVKQLGKTRKKSLARD 510

Query: 176  LENRHSKF 153
            L  ++S F
Sbjct: 511  LRMKYSVF 518


>ref|XP_006441650.1| hypothetical protein CICLE_v10019950mg [Citrus clementina]
            gi|557543912|gb|ESR54890.1| hypothetical protein
            CICLE_v10019950mg [Citrus clementina]
          Length = 478

 Score =  281 bits (720), Expect = 6e-73
 Identities = 153/371 (41%), Positives = 229/371 (61%)
 Frame = -2

Query: 1262 KIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPK 1083
            K V E++ ++R+   + ES+L  +   L+  S+ EI  VLN+++V  L F +++   IP+
Sbjct: 100  KQVSEIIELLRSGDSETESKLLSMSVSLSNASVIEILRVLNSEKVSALCFLKYMREIIPE 159

Query: 1082 LHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESI 903
             +KN+ +CSL+IDN GRL+DY TM  L  +F    + L   AFGFLPVL S+ A  K+ I
Sbjct: 160  FYKNSDICSLVIDNCGRLDDYETMRQLLNEFNVYQVCLNEKAFGFLPVLISSKALTKKCI 219

Query: 902  KRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNG 723
             RV+++LN+V GSC  SGV ALIEMF    L+EMAK+VIK    K SYY +LI+E C   
Sbjct: 220  WRVVEVLNQVEGSCLVSGVRALIEMFSVLGLYEMAKYVIKKTERKVSYYNILIKEMCRRC 279

Query: 722  LIEDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFE 543
              +   D++ EMR+  C P T  YNY+LG L K+ +  +A  LL+EM   +   DA+T+E
Sbjct: 280  DFKGPRDLLVEMRQVGCEPITLTYNYVLGVLCKNGQDADACELLEEMLGRNCHPDAITYE 339

Query: 542  ILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTAN 363
            I I  +C +G  D      +Q+V +GLQPRL+THA  IK  F          YV  S   
Sbjct: 340  IFIVYSCRVGKFDVAFNFFNQMVKRGLQPRLTTHAAFIKGYFSCYRYEDAYKYVVLSADK 399

Query: 362  DKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLA 183
             K+SSNM+YSL+A L+ +  + + A++ L +MM+ GL+P+ S+Y +++K L  S + ++A
Sbjct: 400  YKSSSNMLYSLLASLHDKNNNPVMAKNVLSEMMKIGLRPNVSVYRRVLKHLHTSRQEHMA 459

Query: 182  RDLENRHSKFT 150
            + L +R+S  +
Sbjct: 460  KCLSSRYSSLS 470


>ref|XP_004141361.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05670,
            mitochondrial-like [Cucumis sativus]
            gi|449498723|ref|XP_004160616.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g05670,
            mitochondrial-like [Cucumis sativus]
          Length = 494

 Score =  279 bits (714), Expect = 3e-72
 Identities = 149/363 (41%), Positives = 224/363 (61%)
 Frame = -2

Query: 1247 VVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNA 1068
            ++ I+R N EDLES+L+  + RL    + +I E+LN  ++   RFF WV     K   N+
Sbjct: 122  IINIIRENQEDLESKLDSPNVRLTNVLVGQILEMLNKHKISASRFFNWVSVQSCKFPCNS 181

Query: 1067 SVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLD 888
             V SL+IDN GRL+DY  +  +  +F  + I L + AFGFL  L S + S+K S+ +++ 
Sbjct: 182  DVYSLLIDNFGRLDDYEGILPVLIEFGLKGIELNHKAFGFLLPL-SNEHSMKLSVVKLVK 240

Query: 887  LLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDA 708
            LLNE GG+CR SG+ ALIEMFC    F MAK VI++   ++S+Y +++REKC     E A
Sbjct: 241  LLNEAGGTCRLSGIMALIEMFCSLGSFGMAKFVIEITEKRSSFYYIIVREKCKQKDFEGA 300

Query: 707  HDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDS 528
               + EMR+  C P   + NYLL SL K+ + GEA +LL+EM E +   +++TFEI+I  
Sbjct: 301  RCTLDEMRQVGCIPDAGILNYLLSSLCKNDKFGEAHNLLEEMLEQNCSPNSLTFEIIICH 360

Query: 527  ACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSS 348
             C +GN+++    L  +VA GL PRLSTHA  +K+ F          Y  DS+    T+ 
Sbjct: 361  LCKIGNIESALGYLDMMVAGGLMPRLSTHAAFVKSYFSSQRYEEAYQYAVDSSLKYVTTQ 420

Query: 347  NMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLEN 168
            N  YSL+A L+ ++G+++ A+  L ++M+ GLKP F +Y +++K L+  GRG+LA DL+ 
Sbjct: 421  NATYSLLATLHEKRGNLVDAQKILSELMDAGLKPHFHVYTRLLKKLQVQGRGDLANDLKR 480

Query: 167  RHS 159
            + S
Sbjct: 481  KIS 483


>ref|XP_002525536.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535215|gb|EEF36894.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 430

 Score =  278 bits (710), Expect = 8e-72
 Identities = 154/405 (38%), Positives = 245/405 (60%)
 Frame = -2

Query: 1379 FSWLRCGKMARCYTVSSSDKMLGSDTVVESEAQRKHSLGKIVLEVVYIVRNNGEDLESRL 1200
            FS+   G + R Y+VSSSD  L     + ++  R  +  K V  ++ ++     +LE++L
Sbjct: 23   FSYPGYGSL-RLYSVSSSDASLYKK--LHTDELRPKATRKQVSVIIGLLITEDNELETKL 79

Query: 1199 NKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDY 1020
            N L  RL+  S+  +F+VLN ++   L+FF W+    P+L  N+ +CSL+IDN G L+DY
Sbjct: 80   NSLGVRLSIGSVRWVFQVLNREKKSALQFFHWIRRWQPELEGNSDICSLVIDNCGHLDDY 139

Query: 1019 ATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCA 840
              M  L   F+ + + LT  AF +L + +S +  LK++ + V+D+L E+GG+   +GV +
Sbjct: 140  KAMRCLLDGFSLQRLFLTKKAFEYLQLTSSKEELLKKATQNVVDILQEIGGTSYGTGVPS 199

Query: 839  LIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTT 660
            LIEMF     F+MAK VI+  G K SYY +LIRE C  G  + A D++ E+ +  C P+ 
Sbjct: 200  LIEMFSDLGSFDMAKFVIEKTGRKLSYYNVLIRELCRRGDFKAARDLMDEIGKEGCNPSA 259

Query: 659  TVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQ 480
              YNY++ SL K+ +  +A  +  EM++ND P DA+TFEI I ++C+ G +D   +    
Sbjct: 260  HTYNYIISSLLKNGKNADACEVFQEMQDNDCPPDALTFEIFIYNSCNEGKLDNAFEFFDD 319

Query: 479  LVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGH 300
            +VA+GL+PRL THA  IK  F          YV  S  +DK SSN+ YSL+A L+ ++G+
Sbjct: 320  MVARGLEPRLLTHAAFIKGFFNSQQYEKAYKYVVGS--DDKYSSNVNYSLLANLHQKQGN 377

Query: 299  IMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENR 165
            ++ A + L +M++KGL+P F++++K+ K L +SG   LA  L+ +
Sbjct: 378  LVDAENILSEMIKKGLRPHFNVFMKVKKHLMKSGNEELATSLQKK 422


>gb|EXC01127.1| hypothetical protein L484_025500 [Morus notabilis]
          Length = 502

 Score =  275 bits (704), Expect = 4e-71
 Identities = 154/369 (41%), Positives = 223/369 (60%), Gaps = 1/369 (0%)
 Frame = -2

Query: 1250 EVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 1071
            EVV ++R N   LE  LN     L    I  IFE LN+++V  LRFF WV  + P L +N
Sbjct: 122  EVVDMIRRNEISLECELNLSEVWLTVAYINRIFEALNSEKVSALRFFNWVRVSKPGLRRN 181

Query: 1070 ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVL 891
            + +CSL+IDNLGRL DY +M  +  +F  E I LT +AF FLP L     SL  S+   +
Sbjct: 182  SDICSLMIDNLGRLNDYESMTCILNEFREEQICLTKNAFRFLPDLMLNKDSLMNSVTEAV 241

Query: 890  DLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIED 711
             +L  VGGSC ++GV +LIE+F    L EMA+ V+++    TSYY ++IREKC    +E 
Sbjct: 242  KILKGVGGSCGATGVRSLIELFSSLGLLEMARFVMQLTEKNTSYYNIMIREKCRKHDLEG 301

Query: 710  AHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGE-ASSLLDEMKENDIPLDAVTFEILI 534
            A  ++ EMR+A C P +T YN +L  L+K   IGE A  LL EMK+     D  TFEIL+
Sbjct: 302  ARGLLNEMRQAGCEPNSTSYNLVLSILYK---IGESAEVLLKEMKDMGCSPDETTFEILV 358

Query: 533  DSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKT 354
              +C  G+ D    ++ ++++ GL+PRL+TH  ++K  F          YV DS+   + 
Sbjct: 359  LQSCKHGHFDFALGIVDEMLSFGLEPRLTTHVAIVKGYFASQRYEEAHKYVVDSSLKHRQ 418

Query: 353  SSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDL 174
            SSN++YSL+A LY +K   ++A + + +M+EKGL+P F +Y +++  L+RS   +LARDL
Sbjct: 419  SSNVLYSLLAGLYLKKDDTVNALNIISEMIEKGLRPHFRVYNELLGCLQRSDGTDLARDL 478

Query: 173  ENRHSKFTS 147
            E + S+  S
Sbjct: 479  EIKFSRLCS 487


>ref|XP_002325952.1| hypothetical protein POPTR_0019s10460g [Populus trichocarpa]
            gi|222862827|gb|EEF00334.1| hypothetical protein
            POPTR_0019s10460g [Populus trichocarpa]
          Length = 432

 Score =  266 bits (679), Expect = 3e-68
 Identities = 159/404 (39%), Positives = 232/404 (57%), Gaps = 6/404 (1%)
 Frame = -2

Query: 1349 RCYTVSSSD--KMLGSDTVVESEAQRKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLN 1176
            R Y  SS D  K +G+  V      R  ++ K V  ++ +++ +  DLE +L  L  +L+
Sbjct: 35   RGYKTSSFDVYKEMGTGRV------RPKAMQKQVAYIIDLIKRDEYDLEYKLGSLSVKLS 88

Query: 1175 TYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFR 996
              S+T +F VLN+++V  LRFF W+    P+L  N+ +CSL+IDN GRL+DY  M +L  
Sbjct: 89   IASVTLVFHVLNSEKVSALRFFRWIRHWQPELRCNSDICSLVIDNCGRLDDYDAMRSLLN 148

Query: 995  KFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKF 816
            +F    + LT  AF FL V+  T+ SL ES +RV+ LL EV GSC    V +LIEMF   
Sbjct: 149  EFNENQLCLTKKAFEFLHVMNVTNESLVESTQRVIVLLLEVRGSCYG-WVSSLIEMFSVL 207

Query: 815  HLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTTTVYNYLLG 636
              F+M + V+K    K SYY + IRE C     +   DI  EMR+        +YNYL+ 
Sbjct: 208  GSFDMVEFVMKKTERKISYYYIFIREMCRRCDFKGVRDIQDEMRKEGFELNARIYNYLIS 267

Query: 635  SLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQP 456
             L K+    +A  +L EM++ D P DA+TFEI I   C+ G  +       ++VA+GL+P
Sbjct: 268  CLLKNGEYADACKVLTEMQDKDCPPDALTFEIFIYYCCNNGKTEIACHYFDEIVARGLEP 327

Query: 455  RLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTL 276
            RLSTHA  IK  F          YV DS    K +S M YSL+A+L+ ++G+++ A++ L
Sbjct: 328  RLSTHAAFIKGFFNSEQYEEAYKYVVDSDKKYKCTSCMNYSLLARLHQKRGNLVIAQNIL 387

Query: 275  VDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENR----HSK 156
             +M++KGL+P F +Y+K+   L +SGR  LA DL+ +    HSK
Sbjct: 388  SEMIKKGLRPYFKVYMKVFNCLNKSGRETLATDLQEQFHQLHSK 431


>ref|XP_007153232.1| hypothetical protein PHAVU_003G017800g [Phaseolus vulgaris]
            gi|561026586|gb|ESW25226.1| hypothetical protein
            PHAVU_003G017800g [Phaseolus vulgaris]
          Length = 480

 Score =  226 bits (576), Expect = 3e-56
 Identities = 128/365 (35%), Positives = 209/365 (57%)
 Frame = -2

Query: 1256 VLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLH 1077
            V +++ ++R NG+DL  +LN ++  L+  S+ +IF++L ++RV  L+FF+W+  + P + 
Sbjct: 110  VAQIIALIRENGDDLGCKLNSMNVSLSDASVVDIFQILASERVSALQFFDWLKGSDPDIC 169

Query: 1076 KNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKR 897
             ++ + SL ++N G L +Y  M  + R F+ + + L   AFGFL  L    AS  E +K+
Sbjct: 170  CDSDLGSLFVNNCGLLGNYEAMVPVLRGFSLKGVFLGVKAFGFLLDLGLDKASSIERVKK 229

Query: 896  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLI 717
            ++ + NEVGG  +S GV  L+EMF     FE+A+ VI+  G K   Y +L++  C  G  
Sbjct: 230  IMAVFNEVGGVYQSCGVQLLVEMFGLSGSFEIAEFVIRAAGRKVKNYHVLMKIMCKRGDC 289

Query: 716  EDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEIL 537
            +   D+++EM  +      + YN LL  L K  +I EA  +L+ M++N    D  +F+IL
Sbjct: 290  KRVGDLVKEMERSGIDVNASTYNLLLSCLCKSGKIDEACQVLEAMEKNYGLTDVHSFDIL 349

Query: 536  IDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDK 357
            I++ C     D V +LL ++  +G++P + THA VIK+ F          YV  S     
Sbjct: 350  INTFCKQHQFDLVLKLLDKMTLKGIEPSILTHAAVIKSYFESGKYEEAHEYVIGSADKLS 409

Query: 356  TSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARD 177
             SSN  YSL+A L+ + G+++ A   L +MM+KGLKP+FS Y KI   L + G  +L+ +
Sbjct: 410  YSSNANYSLLATLHLKNGNVLLASKVLSEMMDKGLKPNFSAYKKIRIHLEKKGEKDLSME 469

Query: 176  LENRH 162
            L  R+
Sbjct: 470  LSRRY 474


>ref|XP_006854920.1| hypothetical protein AMTR_s00052p00103370 [Amborella trichopoda]
            gi|548858645|gb|ERN16387.1| hypothetical protein
            AMTR_s00052p00103370 [Amborella trichopoda]
          Length = 367

 Score =  225 bits (573), Expect = 6e-56
 Identities = 129/355 (36%), Positives = 201/355 (56%)
 Frame = -2

Query: 1214 LESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLG 1035
            +E +LN ++ +L+   +T+I    +T     L FF W  +  P  + N++  +L I   G
Sbjct: 12   MEEKLNHMNLKLSNKVVTDILR--STPNRGALMFFNWAKTR-PGFNPNSTNYNLAISISG 68

Query: 1034 RLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRS 855
             LE++  M  L    +++   LT  AF FL    S   S++ S+K +L ++  VGG C  
Sbjct: 69   LLENFELMLLLMEGLSSKGHCLTVTAFSFL----SRSPSIQNSVKEILSIIRRVGGPCLK 124

Query: 854  SGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAH 675
            SGV  LI   C  + FE+A  V++  G KTSYY +LI  KC NG  E+A   + EM+  H
Sbjct: 125  SGVYYLISSLCDLNCFELAILVMEEMGKKTSYYNVLIAAKCRNGEFEEAKVALDEMKGLH 184

Query: 674  CAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVH 495
                T  +NYLLGSL K  R+ EA  LL+ M++     D +TFE++   AC +G MD+  
Sbjct: 185  YGINTGSFNYLLGSLCKKGRVAEACQLLEAMEDLGCYPDEITFEVMAYHACRMGKMDSAL 244

Query: 494  QLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLY 315
            + L++++ +GL+PR +T+A  IK  F          +V + +  D  S+NM YSL++ L 
Sbjct: 245  EFLNKMILEGLKPRFTTYAAFIKGYFFVGEVQNAHKFVLEMSEKDNCSANMNYSLLSSLL 304

Query: 314  CEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHSKFT 150
             + G I+ A + LV+M++KGLKP+F +++K+VK L  +G   +  DL+ R SKFT
Sbjct: 305  RKSGKIVEAHAILVEMIDKGLKPNFPVFIKVVKDLSHAGFREMGLDLKCRFSKFT 359


>ref|XP_006603965.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like isoform X2 [Glycine max]
            gi|571554333|ref|XP_003554979.2| PREDICTED:
            pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 465

 Score =  224 bits (570), Expect = 1e-55
 Identities = 128/374 (34%), Positives = 217/374 (58%)
 Frame = -2

Query: 1280 RKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWV 1101
            R H+  +   +++ ++R + ++L S+LN ++  L+  S+ +IF++L +++V  L+FF+++
Sbjct: 87   RPHATSEQFYQIIALIREDVDELGSKLNSMNVSLSDASVVDIFQILASEKVSALQFFDFL 146

Query: 1100 WSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDA 921
              + P+L  +  + SL I+N G L +Y  M  +   F++  + L   AFGFL  L    A
Sbjct: 147  KGSDPELCCDPDIGSLFINNCGLLGNYEAMVPVLSGFSHRRVFLGMKAFGFLLDLGLDKA 206

Query: 920  SLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIR 741
            S  E +++V+ + N+VGG  +S GV  LIEMF     FE+A+ VI+  G K  +Y +L+R
Sbjct: 207  SSMECVRKVMAVFNKVGGMYQSCGVQLLIEMFGLSGSFEIAEFVIRTAGRKVKHYHVLMR 266

Query: 740  EKCSNGLIEDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPL 561
              C +G  +   D+++EM+ + C    + YN LL  L K+ +I EA  LL+ M++N    
Sbjct: 267  ILCKSGDCKRVSDLVKEMKRSGCDMDVSTYNLLLSCLCKNGKIDEAWQLLEAMEKNYGLT 326

Query: 560  DAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYV 381
            +A +F+ILI+  C     D+V +LL ++  +G++P + THA +IK+ F          YV
Sbjct: 327  NAHSFDILINFLCKRRQFDSVLKLLDKMFLKGIEPSILTHAAIIKSYFESGKYEEAHEYV 386

Query: 380  ADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRS 201
              S      SSN  Y L+A L  + G+++ A   L +MM+KGLKP+FS+Y KI K L + 
Sbjct: 387  IGSANRLSYSSNANYGLLATLQLKNGNVLLACKVLSEMMDKGLKPNFSVYKKIRKHLEKK 446

Query: 200  GRGNLARDLENRHS 159
               +L+ +L  R+S
Sbjct: 447  DEKDLSLELLRRYS 460


>ref|XP_004513354.1| PREDICTED: pentatricopeptide repeat-containing protein At1g62670,
            mitochondrial-like isoform X1 [Cicer arietinum]
            gi|502164974|ref|XP_004513355.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g62670,
            mitochondrial-like isoform X2 [Cicer arietinum]
          Length = 471

 Score =  219 bits (559), Expect = 3e-54
 Identities = 123/373 (32%), Positives = 211/373 (56%)
 Frame = -2

Query: 1280 RKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQRVPGLRFFEWV 1101
            + ++  K V E++ ++     DL+ RLN ++  L+  S+  IF+ L ++RV  L FF+W+
Sbjct: 93   KPYATSKQVSEIIRLICEGVNDLDYRLNMMNVSLSMSSVIYIFDKLASERVSALLFFDWL 152

Query: 1100 WSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLASTDA 921
              +  +L  +  +  LI++N G + ++  M  +  +F  + + L   AF FL VL     
Sbjct: 153  NVSHTELCCDPEIGGLIVENCGLVGNFDAMVAILNEFNRKKMCLGRRAFRFLVVLRLDKD 212

Query: 920  SLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIR 741
            S  E ++RV+D+LN+VGG CR+SGV  LIE+FC    F+MA+ VI+  G K ++Y  L+R
Sbjct: 213  SSMECVRRVIDVLNKVGGVCRNSGVQLLIEIFCFSGSFDMAEFVIEEAGRKVNHYNFLLR 272

Query: 740  EKCSNGLIEDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKENDIPL 561
              C  G  E   D++++M+ +   P  + Y+ L+  L+          +++ M+++D   
Sbjct: 273  MMCKRGDFERVCDLVKKMKRSGAEPNGSTYSLLVSCLFNIDNFVGTCQVIETMEKDDGLP 332

Query: 560  DAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYV 381
            D  TF+ LI  +C  G +D   + L ++  +G++P   THA VIK  F          YV
Sbjct: 333  DEFTFDTLIRLSCKHGQIDLALKFLDKMTLKGIEPCSLTHAAVIKFYFESGKYDAAYEYV 392

Query: 380  ADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRS 201
            ADS      SSN  Y+L+A L+ +KG+++ ++  L +MM+KGLKP++S+Y K+ K L + 
Sbjct: 393  ADSAGKYSYSSNENYTLLASLHLKKGNVLLSQRILYEMMDKGLKPNYSVYTKVRKRLEKK 452

Query: 200  GRGNLARDLENRH 162
             R +L+ +L  R+
Sbjct: 453  NRKDLSLELSRRY 465


>ref|XP_007222181.1| hypothetical protein PRUPE_ppa009631mg [Prunus persica]
            gi|462419117|gb|EMJ23380.1| hypothetical protein
            PRUPE_ppa009631mg [Prunus persica]
          Length = 284

 Score =  215 bits (548), Expect = 5e-53
 Identities = 112/280 (40%), Positives = 174/280 (62%)
 Frame = -2

Query: 1004 LFRKFANENISLTYDAFGFLPVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMF 825
            +   F +  I LT +AF F+    S  +S K S+ +V+++LNEVGGSCR  G+ +LIEM 
Sbjct: 4    IMNDFRSAGICLTRNAFEFI----SVSSSKKASVIKVVEVLNEVGGSCRPVGLLSLIEML 59

Query: 824  CKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTTTVYNY 645
                 F+MA+ V+K+   K SYY ++IRE C       A D++ EMR+  C P +  YNY
Sbjct: 60   SVKGSFKMAEFVMKITERKRSYYNIMIRESCRRRNFGRAIDMLDEMRQVGCDPDSKTYNY 119

Query: 644  LLGSLWKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQG 465
            +L SL+K+ +   A+ L ++M E +   D +T+EILI  +C +GN D   +LL  +V +G
Sbjct: 120  ILSSLYKNYKSAVATKLFEQMLEMNCSPDEITYEILICYSCKVGNFDFARKLLDSMVLKG 179

Query: 464  LQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSAR 285
            ++PRL++HA  +K  F          +V DS+      SN +YSL+A+LY  +G+++ A+
Sbjct: 180  IKPRLTSHAAFVKGYFNLRRYKEAYEHVVDSSVKYSCFSNSVYSLLARLYMNEGNVVIAQ 239

Query: 284  STLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENR 165
            + L+DM+ KGLKPDF++Y K++K L ++GR  LA DL +R
Sbjct: 240  NILIDMINKGLKPDFAVYTKVLKELSKTGRTGLAEDLSSR 279


>ref|NP_001167893.1| hypothetical protein [Zea mays] gi|223944699|gb|ACN26433.1| unknown
            [Zea mays] gi|414864420|tpg|DAA42977.1| TPA: hypothetical
            protein ZEAMMB73_690405 [Zea mays]
          Length = 430

 Score =  214 bits (546), Expect = 8e-53
 Identities = 129/374 (34%), Positives = 200/374 (53%), Gaps = 6/374 (1%)
 Frame = -2

Query: 1247 VVYIVRNNGEDLESRLNKLHPRLNTYS-ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 1071
            +V +V   G  LE+ L++L   + ++  ++ +   L  + VP  RFF W  S        
Sbjct: 45   IVCLVVAGGGGLEADLDRLFSAVLSHGLVSSVLRALTDRGVPAERFFAWASSLGRGFSPG 104

Query: 1070 ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLA--STDASLKESIKR 897
                +L+++N GRL+DY  M       +   +SLT  AF FL   +  S   S++++ + 
Sbjct: 105  PRAYNLLVENAGRLDDYGAMSRALALMSERRLSLTDRAFAFLAPSSGSSRSGSVEDAARA 164

Query: 896  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLI 717
            VL  L+ VGG CR+SGV +L++       F+ A  VI+    K  YY +L+  KC  G  
Sbjct: 165  VLRALDGVGGPCRASGVFSLVKALASIGEFDAAVLVIEETARKVRYYNVLVAAKCKAGDF 224

Query: 716  EDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEM---KENDIPLDAVTF 546
              A ++  EMR +   P    +NYLLG L K  R+ EA  L++ M   K ++IP  ++T+
Sbjct: 225  VGAREVFDEMRRSGSDPDANTWNYLLGCLLKKGRLAEACGLVEAMERLKRSEIP-SSLTY 283

Query: 545  EILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTA 366
            EIL   AC  G MD+  Q+L Q+ ++ L PR++ H+  IK  F          YV+D + 
Sbjct: 284  EILTYHACKAGKMDSAMQILDQMFSENLTPRITIHSAFIKGYFYAGRIEDACKYVSDMST 343

Query: 365  NDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNL 186
             D+ S N  YSL+AKL  + G  + A   L ++MEKGL+PD S Y+K+ K L + G+GNL
Sbjct: 344  RDRHSVNRNYSLLAKLLWKSGRTIDAGRVLYELMEKGLRPDHSAYVKVAKDLHKMGKGNL 403

Query: 185  ARDLENRHSKFTSN 144
            A +L+    +F+ N
Sbjct: 404  ACELKMMFQRFSVN 417


>ref|XP_004301448.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like
            [Fragaria vesca subsp. vesca]
          Length = 503

 Score =  214 bits (545), Expect = 1e-52
 Identities = 130/401 (32%), Positives = 217/401 (54%), Gaps = 1/401 (0%)
 Frame = -2

Query: 1346 CYTVSSSDKMLGSDTVVESEAQRKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLNTYS 1167
            CY  SSS    G   V  +E  R  +  K V +++ ++R    DLES++  ++  LN   
Sbjct: 106  CYGTSSSVNQCGFRNVGVNEL-RYFASHKQVRDILGMIRRKDNDLESKVRSMNVSLNLKM 164

Query: 1166 ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFA 987
             +  FE LN Q+   L   +W+    P    +  V S++IDNLGRL +Y  M  +  +FA
Sbjct: 165  CSRFFEELNNQKRDALAVCDWIRYAHPSCRND--VHSMVIDNLGRLGNYDAMARVLSEFA 222

Query: 986  NENISLTYDAFGFLPVLASTDASLKES-IKRVLDLLNEVGGSCRSSGVCALIEMFCKFHL 810
             + + L   AF F+ +     + LKE+ + +V+D+L  V    R SG+C+LI+MF     
Sbjct: 223  KKKVRLVPLAFEFVSL-----SPLKEATVMKVVDVLKAVEEPTRGSGLCSLIKMFSAVGS 277

Query: 809  FEMAKHVIKVEGSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTTTVYNYLLGSL 630
            F+MA+ V+++   K ++Y +++ EKC  G  E A D++  MR+    P   +YNY+L +L
Sbjct: 278  FDMAELVMQLSERKATFYKIMVVEKCGKGDFEGAADLVEVMRKHGLKPEAKIYNYVLSTL 337

Query: 629  WKHSRIGEASSLLDEMKENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRL 450
             KH +  EA +L +EM  ++   D +T+EI +  +C  GN D   +LL ++ AQG++PR+
Sbjct: 338  CKHDKSAEAGALFEEMLASECAPDPITYEIFVCHSCKAGNFDLARKLLDRMNAQGIEPRV 397

Query: 449  STHAYVIKNLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVD 270
            S H  ++K  F          YV     +       I S +A+LY +  +++ A + L++
Sbjct: 398  SMHGVILKGYFNLKRFEEAYEYV--MACDRYICFPAICSTLARLYVKADNVIVAHNLLLE 455

Query: 269  MMEKGLKPDFSIYLKIVKLLRRSGRGNLARDLENRHSKFTS 147
            ++++GL+PD S+Y  + + L  +GR  LA DL +R S   S
Sbjct: 456  LIDRGLRPDQSVYTNVFRRLIDTGRTALAEDLRSRLSSIGS 496


>ref|XP_004985951.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Setaria italica]
          Length = 419

 Score =  210 bits (534), Expect = 2e-51
 Identities = 131/392 (33%), Positives = 204/392 (52%), Gaps = 4/392 (1%)
 Frame = -2

Query: 1313 GSDTVVESEAQRKHSLGKIVLEVVYIVRNNGEDLESRLNKLHPRLNTYSITEIFEVLNTQ 1134
            GSD    S+A    +       +V +V   G  LE+  ++L P L+   +      L   
Sbjct: 30   GSDADANSDAAASDA-------IVRLVAAGGSSLEADFDRLDPALSHALVARTLRALTDS 82

Query: 1133 RVPGLRFFEWVWSNIPKLHKNASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAF 954
             VP  RFF W  S       +A   +L+I+N G+L DY  M       +   + LT  AF
Sbjct: 83   GVPAERFFAWA-SLRRGFSPSAHAHNLLIENAGKLADYRAMSRALALMSQRRLPLTDRAF 141

Query: 953  GFL-PVLASTDASLKESIKRVLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVE 777
             FL P  +S  + ++++ + VL +L++VGG CR+SGV +L++       F+ A  VI+  
Sbjct: 142  AFLAPSGSSRSSCVEDAARAVLRVLDDVGGPCRASGVFSLVKALASTGEFDAAVSVIEET 201

Query: 776  GSKTSYYCLLIREKCSNGLIEDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASS 597
                 Y+ +++  KC  G    A ++  EMR++  AP    +N LLG L K+ R+ EA  
Sbjct: 202  RRMARYFNVVVAAKCKAGNFVGAREVFDEMRKSGSAPNANTWNCLLGCLLKNGRLAEACG 261

Query: 596  LLDEM---KENDIPLDAVTFEILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIK 426
            L++ M   K  ++P D++T+EIL   AC  G MD+  Q+L Q+ +  L PR++ H+  IK
Sbjct: 262  LVESMERSKPGEVP-DSLTYEILTYHACKAGKMDSAMQILDQMFSANLTPRITIHSAFIK 320

Query: 425  NLFXXXXXXXXXXYVADSTANDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKP 246
              F          YV D +  D+ S N  YSL+AKL  + G  + A   L ++MEKGL+P
Sbjct: 321  GYFYAGRIEDAQKYVDDMSTRDRHSVNRNYSLLAKLLRKSGRTIDAGRVLYELMEKGLRP 380

Query: 245  DFSIYLKIVKLLRRSGRGNLARDLENRHSKFT 150
            D S Y+K+ K L + GRG+LA +L+    +F+
Sbjct: 381  DHSAYVKVAKDLYKMGRGDLASELKLMFQRFS 412


>ref|XP_002468625.1| hypothetical protein SORBIDRAFT_01g049260 [Sorghum bicolor]
            gi|241922479|gb|EER95623.1| hypothetical protein
            SORBIDRAFT_01g049260 [Sorghum bicolor]
          Length = 422

 Score =  202 bits (514), Expect = 4e-49
 Identities = 125/372 (33%), Positives = 196/372 (52%), Gaps = 6/372 (1%)
 Frame = -2

Query: 1247 VVYIVRNNGEDLESRLNKLHPRLNTYS-ITEIFEVLNTQRVPGLRFFEWVWSNIPKLHKN 1071
            +V +V   G  LE+ L++L     ++  ++     L    VP  RFF W  S        
Sbjct: 45   IVRLVAAGGGGLEADLDRLFAATLSHGLVSSALRALTDSGVPAERFFAWASSLGRGFSPG 104

Query: 1070 ASVCSLIIDNLGRLEDYATMHTLFRKFANENISLTYDAFGFLPVLA--STDASLKESIKR 897
                +L+++N GRL D   M       +   + LT  AF FL + +  S   S+++S   
Sbjct: 105  PRAHNLLVENTGRLGDCGAMSRALALMSERMLPLTDRAFAFLALSSGSSRSGSVEDSTTS 164

Query: 896  VLDLLNEVGGSCRSSGVCALIEMFCKFHLFEMAKHVIKVEGSKTSYYCLLIREKCSNGLI 717
            VL  L+ VGG CR+SGV +L++       F+ A  VI+    K  YY +L+  KC  G  
Sbjct: 165  VLRALDGVGGPCRASGVFSLVKALASIGEFDAAVSVIEETTRKVRYYNVLVAAKCKAGDF 224

Query: 716  EDAHDIIREMREAHCAPTTTVYNYLLGSLWKHSRIGEASSLLDEMKE---NDIPLDAVTF 546
              A ++  EMR++   P    +NYLLG L K+ R+ EA  L++ M+    ++IP +++T+
Sbjct: 225  VGAREVFDEMRKSGSDPDANTWNYLLGCLLKNGRLAEACGLVEAMERLKCSEIP-NSLTY 283

Query: 545  EILIDSACSLGNMDAVHQLLHQLVAQGLQPRLSTHAYVIKNLFXXXXXXXXXXYVADSTA 366
            EIL   AC  G MD+  Q+L+Q+ ++ L PR++ H+  IK  F          YV D + 
Sbjct: 284  EILTYHACKAGKMDSAMQILNQMFSENLTPRITIHSAFIKGYFYAGRIEDACKYVNDMST 343

Query: 365  NDKTSSNMIYSLMAKLYCEKGHIMSARSTLVDMMEKGLKPDFSIYLKIVKLLRRSGRGNL 186
             D+ S N  YSL+AKL  + G  + A   L ++M+KGL+PD S Y+K+ K L + GRG+L
Sbjct: 344  RDRHSVNRNYSLLAKLLRKSGRTVDAGRVLYELMDKGLRPDHSAYVKVAKDLHKMGRGDL 403

Query: 185  ARDLENRHSKFT 150
            A +L+    +F+
Sbjct: 404  ASELKMMFQRFS 415

