BLASTX nr result

ID: Mentha27_contig00016072 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00016072
         (1275 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial...   356   1e-95
emb|CBI19618.3| unnamed protein product [Vitis vinifera]              331   5e-88
ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Popu...   304   5e-80
ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycin...   253   1e-64
ref|XP_003620888.1| Pentatricopeptide repeat-containing protein ...   249   2e-63
ref|XP_003620912.1| Pentatricopeptide repeat-containing protein ...   248   3e-63
ref|XP_006345330.1| PREDICTED: pentatricopeptide repeat-containi...   245   3e-62
ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containi...   236   1e-59
ref|XP_007152924.1| hypothetical protein PHAVU_004G171700g [Phas...   232   3e-58
ref|XP_004505212.1| PREDICTED: pentatricopeptide repeat-containi...   227   7e-57
ref|XP_004505209.1| PREDICTED: pentatricopeptide repeat-containi...   225   3e-56
gb|AFK42502.1| unknown [Medicago truncatula]                          224   6e-56
ref|XP_003607988.1| Pentatricopeptide repeat-containing protein ...   224   6e-56
ref|XP_006451427.1| hypothetical protein CICLE_v10008172mg [Citr...   224   8e-56
gb|EPS72112.1| hypothetical protein M569_02645, partial [Genlise...   223   2e-55
ref|XP_002279824.1| PREDICTED: pentatricopeptide repeat-containi...   212   2e-52
ref|XP_007202366.1| hypothetical protein PRUPE_ppa009223mg [Prun...   208   4e-51
emb|CAN64107.1| hypothetical protein VITISV_013147 [Vitis vinifera]   208   5e-51
gb|EXB38821.1| Serine acetyltransferase 5 [Morus notabilis]           204   5e-50
ref|XP_007012889.1| Tetratricopeptide repeat-like superfamily pr...   204   5e-50

>gb|EYU45090.1| hypothetical protein MIMGU_mgv1a021870mg, partial [Mimulus guttatus]
          Length = 277

 Score =  356 bits (914), Expect = 1e-95
 Identities = 183/310 (59%), Positives = 216/310 (69%), Gaps = 1/310 (0%)
 Frame = +1

Query: 199  NVHSTVEALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFN 378
            N++  +E LE M +N   A+   + ELMQ TVD +S+  GDRIYEYVMRFSS+Y VS+FN
Sbjct: 15   NINLALETLEAMGRNETPAEPIRVSELMQFTVDSKSLPAGDRIYEYVMRFSSSYDVSVFN 74

Query: 379  EMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDEL 558
            E+IDMY KLG YRR GR+FEQM+C+NIDSWN MI+ L ENG+  EAIQ+F +LVK     
Sbjct: 75   ELIDMYFKLGDYRRAGRVFEQMVCKNIDSWNTMIKGLSENGQENEAIQLFAKLVK----- 129

Query: 559  KPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSKKEADNA 738
                                           +DYGITPS DH+ SY NL           
Sbjct: 130  -------------------------------EDYGITPSLDHYTSYVNL----------- 147

Query: 739  SRMIQKKLVSEKNRAP-SDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALL 915
             R   +++VSEK+RA  SD+ +AY+KL  LS +AK+AGYV DTRYVLHD+D EAKERAL+
Sbjct: 148  QRKTNRRVVSEKDRAKNSDKSLAYEKLRCLSDEAKKAGYVADTRYVLHDIDEEAKERALM 207

Query: 916  YHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFK 1095
            YHSERLAIAYGLI T PGT LR+IKNLRICGDCHNFIKILS FE+REFIVRD KRFHHFK
Sbjct: 208  YHSERLAIAYGLIRTTPGTTLRIIKNLRICGDCHNFIKILSKFEEREFIVRDNKRFHHFK 267

Query: 1096 DGECSCRDFW 1125
            DG CSCRDFW
Sbjct: 268  DGICSCRDFW 277


>emb|CBI19618.3| unnamed protein product [Vitis vinifera]
          Length = 576

 Score =  331 bits (848), Expect = 5e-88
 Identities = 172/352 (48%), Positives = 224/352 (63%), Gaps = 43/352 (12%)
 Frame = +1

Query: 199  NVHSTVEALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFN 378
            NV + +  ++EM +N V   +  L EL+Q+ +D++ +E G R +E VMR SSN SV +FN
Sbjct: 227  NVEAALHVIDEMERNGVTVSALGLAELLQVCIDLKLLEVGKRAHELVMRLSSNPSVIVFN 286

Query: 379  EMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDEL 558
            ++++MY  LG  R   R+FE+M  R +DSWN MI  LV+NGE EEA+ +F++L K  D +
Sbjct: 287  KLLEMYFDLGDTRSACRVFEEMRGRTLDSWNRMILGLVKNGEGEEALAIFSKLKK--DGI 344

Query: 559  KPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSKKEAD-- 732
            +P+ +TF  +L ACE LG VE+G A+F SM  DYGITPS +H     +L    +K A+  
Sbjct: 345  EPDGSTFIGVLSACECLGAVEEGLAHFNSMSTDYGITPSMEHFAIIVDLFGRLQKIAEAK 404

Query: 733  -----------------------------------------NASRMIQKKLVSEKNRAPS 789
                                                     +  + ++   VS++  A  
Sbjct: 405  EFIASMPLEPSSMIWQTLQKYLKTERVDEPAPLTTGSGLKLSHKKRVKSNFVSKQKNASP 464

Query: 790  DRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPG 969
            ++  AY+KL SL K  KEAGYV DTRYVLHDLD EAKE++LLYHSERLAIAYGLISTPPG
Sbjct: 465  EKSKAYEKLRSLHKGVKEAGYVSDTRYVLHDLDQEAKEKSLLYHSERLAIAYGLISTPPG 524

Query: 970  TALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            T LR+IKNLRICGDCHNFIKILS  EKRE IVRD KRFHHF+DG+CSC D+W
Sbjct: 525  TTLRIIKNLRICGDCHNFIKILSNIEKREIIVRDNKRFHHFRDGKCSCGDYW 576


>ref|XP_006375040.1| hypothetical protein POPTR_0014s03840g [Populus trichocarpa]
            gi|550323354|gb|ERP52837.1| hypothetical protein
            POPTR_0014s03840g [Populus trichocarpa]
          Length = 429

 Score =  304 bits (779), Expect = 5e-80
 Identities = 167/353 (47%), Positives = 226/353 (64%), Gaps = 45/353 (12%)
 Frame = +1

Query: 202  VHSTVEALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNY--SVSIF 375
            V + +E ++E  +N   AD   +++L+Q+  D++ +E G ++ EYVMR SS +  SV + 
Sbjct: 79   VEAALEIMDEKERNGGYADLLDIVKLIQVCADLKLLEAGKKVDEYVMRSSSKFKSSVVVL 138

Query: 376  NEMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDE 555
            N +++MY KLG       IFEQM  RN+DSWN M+  L EN E E+A+++F+++ KG D 
Sbjct: 139  NNLVEMYCKLGDTNGAREIFEQMGVRNLDSWNKMLLGLAENKEGEKALEIFSQM-KG-DG 196

Query: 556  LKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSKKEA-- 729
            ++P+ ++F  +L AC  LG  ++G+ +F SM +DYGITP+ +H+  + +L+  + K A  
Sbjct: 197  IRPDGSSFVGVLMACVCLGAEKEGQKHFESMSRDYGITPTVEHYEVFVDLLGRTGKIAEA 256

Query: 730  ---------DNASRM---IQK-------------------KLVSEKN----------RAP 786
                     D  SR+   +QK                   KL   K           R  
Sbjct: 257  KELVSNMPIDPNSRIWETLQKYSKARTQGQLGYPVSPPGLKLGDMKRAKDNTNTNHRRVT 316

Query: 787  SDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPP 966
            SDR  AY+KL SLSK+ ++AGYVPDTR+VLHDLD EAKE+AL YHSERLAIAYGLI+T P
Sbjct: 317  SDRSKAYEKLRSLSKEVRDAGYVPDTRFVLHDLDQEAKEKALFYHSERLAIAYGLINTSP 376

Query: 967  GTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            GT LR++KNLRICGDCHNFIKILS  E REFIVRD KRFHHFK G CSCRD+W
Sbjct: 377  GTTLRIMKNLRICGDCHNFIKILSKIEDREFIVRDNKRFHHFKAGNCSCRDYW 429


>ref|NP_001239891.1| uncharacterized protein LOC100783921 [Glycine max]
            gi|255636013|gb|ACU18351.1| unknown [Glycine max]
          Length = 449

 Score =  253 bits (647), Expect = 1e-64
 Identities = 162/403 (40%), Positives = 211/403 (52%), Gaps = 71/403 (17%)
 Frame = +1

Query: 130  LTPSHKKLRP--AKLGASDLNSGRRNV-------HSTVEALEEMAKNRVVADSNHLLELM 282
            +TP  K+  P   KL     N    NV          ++ + E+     VAD    L L+
Sbjct: 50   ITPLRKEKHPNEQKLKLDHQNQNPLNVDLVALCEEGNLDQVLELMGQGAVADYRVYLALL 109

Query: 283  QLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMYLKLGHYRRGGRIFEQMLCRNID 462
             L     S+E G R++E + R +    V + N +I MY K G  +   R+F+QML RN+ 
Sbjct: 110  NLCEHTRSLESGKRVHEILRRSAFRGDVELSNRLIGMYCKCGSVKNARRVFDQMLDRNMA 169

Query: 463  SWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFA 642
            +W++MI     NG   + + VF ++ + E  L P+  TF  +L AC     VE+G  +F 
Sbjct: 170  TWHLMIGGYTSNGLGCDGLLVFQQMKQAE--LPPDGETFELVLAACSQAEAVEEGFLHFE 227

Query: 643  SMRKDYGITPSSDHHLSYNNLVKNSK--KEAD---------------------------- 732
            SM K+YGI PS +H+L   N++ N+   KEA+                            
Sbjct: 228  SM-KEYGIVPSMEHYLEVINIMGNAGQLKEAEEFIENVPIELGVEAWESLRKFARIHGDL 286

Query: 733  -------------NASRMIQKKL-------------VSEKNRAPSDR-GMAYK-----KL 816
                         + S+ I  KL             + EKNRA   R  + YK     KL
Sbjct: 287  DLEDCAEELLTRFDPSKAIADKLPTPPRKKQSDVNMLEEKNRATEYRYSIPYKEEDNEKL 346

Query: 817  MSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNL 996
              LS + +EAGYVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNL
Sbjct: 347  GGLSGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNL 406

Query: 997  RICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            RICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 407  RICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 449


>ref|XP_003620888.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355495903|gb|AES77106.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 446

 Score =  249 bits (635), Expect = 2e-63
 Identities = 164/447 (36%), Positives = 228/447 (51%), Gaps = 82/447 (18%)
 Frame = +1

Query: 31   LRKQMAQAIASMKTFCTQPMLKGFITLPLENAL----LTPSHKKLRPAKLGASDLNSG-- 192
            L+  MA + +S   FCT  +       P +N       TP  +K +P ++G   +     
Sbjct: 5    LQASMASSSSSSTPFCTYAIHHATNLHPRQNGTNNSRFTP--RKTQPLRMGNPSIQPKLN 62

Query: 193  -------RRNVH-------STVEALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIY 330
                    +NV+         V  + E+      AD +  L L++L  D++S+E G R++
Sbjct: 63   HHQAPHQHKNVNFAHFLQEGNVNQVLELMGQGAFADYSDFLSLLKLCEDLKSLELGKRVH 122

Query: 331  EYVMRFSSNYSVSIFNEMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAE 510
            E++ R     +V + N +I +Y+K G  +   ++F++M  RN+ S N+MI     NG   
Sbjct: 123  EFLRRSKFGGNVELCNRLIGLYVKCGSVKDARKVFDKMPDRNVGSLNLMIGGYNVNGLGI 182

Query: 511  EAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHL 690
            + + VF ++   +  + P+E TFA +L  C L+  VE+G   F SM K+YGI P  +H+L
Sbjct: 183  DGLLVFKQM--RQQGVVPDEETFALVLAVCALVDGVEEGLMQFESM-KEYGIVPGMEHYL 239

Query: 691  SYNNL-------------VKN--------------------------------------S 717
               N+             ++N                                      S
Sbjct: 240  GVVNIFGCAGRLDEAHEFIENMPIEAGVELWETLRNFARIHGDLEREDCADELLTVLDPS 299

Query: 718  KKEADNASRMIQKK-----LVSEKNRAPSDR-GMAYK-----KLMSLSKKAKEAGYVPDT 864
            K  AD      +KK     ++ EKNR    R  M YK     KL  L+ + +EAGYVPDT
Sbjct: 300  KAAADKVPLPQRKKQSAINMLEEKNRVSEYRCNMPYKEEGDVKLRGLTGQMREAGYVPDT 359

Query: 865  RYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTF 1044
            RYVLHD+D E KE+AL YHSERLAIAYGLISTPP T LR+IKNLRICGDCHN IKI+S  
Sbjct: 360  RYVLHDIDEEEKEKALQYHSERLAIAYGLISTPPRTTLRIIKNLRICGDCHNAIKIMSKI 419

Query: 1045 EKREFIVRDTKRFHHFKDGECSCRDFW 1125
              RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 420  VGRELIVRDNKRFHHFKDGKCSCGDYW 446


>ref|XP_003620912.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355495927|gb|AES77130.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 415

 Score =  248 bits (634), Expect = 3e-63
 Identities = 157/416 (37%), Positives = 226/416 (54%), Gaps = 51/416 (12%)
 Frame = +1

Query: 31   LRKQMAQAIASMKTFCTQPMLKGFITLPLENAL----LTPSHKKLRPAKLGASDLNSGRR 198
            L+  MA + +S   FCT  +       P +N       TP  +K +P ++G   +   + 
Sbjct: 5    LQASMASSSSSSTPFCTYAIHHATNLHPRQNGTNNSRFTP--RKTQPLRMGNPSIQP-KL 61

Query: 199  NVHST---------VEALEEMAKNRVV--------ADSNHLLELMQLTVDMESMEGGDRI 327
            N H T            L+E   N+V+        AD +  L L++L  D++S+E G R+
Sbjct: 62   NHHQTPHQHKNVNFAHFLQEGNVNQVLELMGQGAFADYSDFLSLLKLCEDLKSLELGKRV 121

Query: 328  YEYVMRFSSNYSVSIFNEMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEA 507
            +E++ R     +V + N +I +Y+K G  +   ++F++M  RN+ SWN+MI     NG  
Sbjct: 122  HEFLRRSKFGGNVELCNRLIGLYVKCGSVKDARKVFDKMPDRNVGSWNLMIGGYNVNGLG 181

Query: 508  EEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPS-SDH 684
             + + VF ++   +  + P+E TFA +L  C L+  VE+G  ++  +   +G     ++ 
Sbjct: 182  IDGLLVFKQM--RQQGIVPDEETFALVLAVCALVDGVEEGMEHYLGVVNIFGCAGRLNEA 239

Query: 685  HLSYNNLVKN-----------------SKKEADNASRMIQKK------LVSEKNRAPSDR 795
            H    N++                   SK  AD+   + Q+K      ++ EKNR    R
Sbjct: 240  HEFIENIIHGDLEREDCADELLTVIDPSKAAADDKVPLPQRKKQSAINMMEEKNRVSEYR 299

Query: 796  -GMAYK-----KLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLIS 957
              M Y+     KL  L+ + +EAGYVPDTRYVLHD+D E KE+AL YHSE LAIAYGLIS
Sbjct: 300  CNMPYEEEDDEKLRGLTGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSECLAIAYGLIS 359

Query: 958  TPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            TPP T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 360  TPPRTTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 415


>ref|XP_006345330.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Solanum tuberosum]
          Length = 466

 Score =  245 bits (625), Expect = 3e-62
 Identities = 149/449 (33%), Positives = 224/449 (49%), Gaps = 82/449 (18%)
 Frame = +1

Query: 25   VHLRKQMAQAIASMKTFCTQPMLKGFITLPLENA-----------------LLTPSHKKL 153
            +HL+ ++  +    +  C  P  +   T P  N                  + TP  K+ 
Sbjct: 19   IHLKFRIPSSTLLFRPICAAPHDRSSTTYPNNNRYSPQSRPRRNNQLDYRRISTPIQKEA 78

Query: 154  RPAKLG---ASDLNSGRRNVHSTVEALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDR 324
             P  L    A+D+N         ++ + E     V AD      L+     + S++ G++
Sbjct: 79   DPVTLSPISAADVNLMSLCNEGKIDQVIEYISQGVDADFRIFETLLSYCTKLASLDVGEK 138

Query: 325  IYEYVMRFSSNYSVSIFNEMIDMYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGE 504
            ++E ++R   + ++ + +++++MY+K G  R   ++F++M  R ++ W++MI    E G+
Sbjct: 139  VHELLLRSPWSNNIELNSKLVEMYVKNGRMRNARKVFDKMRERKLELWHLMISGYAETGD 198

Query: 505  AEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASMRKDYGITPSSDH 684
             E  + +F ++ K    L+PN  TF+ +L  C   G V+ G  YF  M+ +YGI P  +H
Sbjct: 199  GESGLLLFEQMRKLR-LLEPNGDTFSVVLSGCASKGSVKDGFTYFELMKNEYGIVPGVEH 257

Query: 685  HLSY----------NNLVK----------------------------------------- 711
            +L            N L++                                         
Sbjct: 258  YLGIIDVLGKSGHLNELLEFIEDMPIEPTKVVWEAVINFARIHGDIELEDRTEELLIRLD 317

Query: 712  NSKKEADNASRMIQKK-----LVSEKNRAPS------DRGMAYKKLMSLSKKAKEAGYVP 858
            +S+  AD      QK+     ++  K+RA         R  AY+KL  LS + ++AGYVP
Sbjct: 318  SSRNMADKPLAPFQKRHSEFSMLEGKDRANEFKSTIPHRADAYEKLKGLSGQMRDAGYVP 377

Query: 859  DTRYVLHDLDHEAKERALLYHSERLAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILS 1038
            DTRYVLHD+D  AKE+AL+YHSERLAIAYGLISTP  T LR+IKNLRICGDCHN IKI+S
Sbjct: 378  DTRYVLHDIDEAAKEQALMYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMS 437

Query: 1039 TFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
                RE IVRD KRFHHF+DG+CSC D+W
Sbjct: 438  KIVGRELIVRDNKRFHHFRDGKCSCNDYW 466


>ref|XP_003517821.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Glycine max]
          Length = 452

 Score =  236 bits (603), Expect = 1e-59
 Identities = 156/417 (37%), Positives = 205/417 (49%), Gaps = 62/417 (14%)
 Frame = +1

Query: 61   SMKTFCTQPMLKGFITLPLENALLTPSHKKLRPAKLGASDLNSGRRNVHSTVEALEEMAK 240
            S  T    P+ KG   LP E  L      +  P  L    ++         ++ + E+  
Sbjct: 44   SRSTHKIPPLCKG--NLPNEQKLQLDHQNQNAPLPLNVDLVSLCEEG---NLDQVLELMG 98

Query: 241  NRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMYLKLGHYRR 420
               VAD    L L+ L     S+E G R++E++ R +    V + N +I MY K G  + 
Sbjct: 99   QGAVADYRVYLALLNLCEHTRSLESGKRVHEFLRRSTFRRDVELSNRLIGMYCKCGSVKD 158

Query: 421  GGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETTFASILKAC 600
              R+F+Q+  RNI SW++MI     NG   + + VF ++ +    + P+  TF  +L AC
Sbjct: 159  ARRVFDQIPERNISSWHLMIGGYAANGLGCDGLLVFQQMKQAG--VPPDGETFELVLAAC 216

Query: 601  ELLGEVEKGRAYFASMRKDYGITPS----------------------------------- 675
                 VE+G  +F SM K++GI PS                                   
Sbjct: 217  AQAEAVEEGFLHFESM-KEHGIVPSMEHYLEVINILGNTGQLNEAEEFIEKIPIELGVEA 275

Query: 676  ----------------SDHHLSYNNLVKNSKKEADNASRMIQKK-----LVSEKNRAPSD 792
                             DH       +  SK  AD      +KK     ++ EKNR    
Sbjct: 276  WESLRNFAQKHGDLDLEDHAEEVLTCLDPSKAVADKLPPPPRKKQSDMNMLEEKNRVTEY 335

Query: 793  RGM------AYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLI 954
            R        A++KL  LS + +EAGYVPDTRYVLHD+D E KE+AL YHSERLAIAYGLI
Sbjct: 336  RYSIPYKEEAHEKLGGLSGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLI 395

Query: 955  STPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            STPP T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 396  STPPRTTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 452


>ref|XP_007152924.1| hypothetical protein PHAVU_004G171700g [Phaseolus vulgaris]
            gi|561026233|gb|ESW24918.1| hypothetical protein
            PHAVU_004G171700g [Phaseolus vulgaris]
          Length = 460

 Score =  232 bits (591), Expect = 3e-58
 Identities = 155/414 (37%), Positives = 201/414 (48%), Gaps = 62/414 (14%)
 Frame = +1

Query: 70   TFCTQPMLKGFITLPLENALLTPSHKKLRPAKLGASDLNSGRRNVHSTVEALEEMAKNRV 249
            T  T P+ KG    P E AL         P       +       H  V    E+    V
Sbjct: 54   THKTPPLRKGK-NHPNETALKLDHQNHKAPLPFNVDLVALCEEGKHDQVV---ELMGQGV 109

Query: 250  VADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMYLKLGHYRRGGR 429
             AD    L L+    +  S+E G R++E++ R S    V + N +I +Y K G  +   R
Sbjct: 110  AADYRVYLALLNFCENTRSLELGKRVHEFLRRSSFRGDVELSNRVIGVYSKCGSVKDARR 169

Query: 430  IFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELL 609
            +F+QM  RN  SW++MI     NG A + +  F ++ +       +  TF  +L AC   
Sbjct: 170  VFDQMQERNTVSWHLMIGGYTANGLARDGLLAFQKMKQAGVPF--DGETFKLVLAACAQA 227

Query: 610  GEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKN------------------------- 714
              VE+G  +   M K+ GI PS +H+L   N++ N                         
Sbjct: 228  EAVEEGLLHLEYM-KENGIVPSMEHYLEVVNILGNAGRLNEAEEFIEKIPIEVGAEGWES 286

Query: 715  --------------------------SKKEADNASRMIQKK-----LVSEKNRAPSDRGM 801
                                      SK  AD  +   +KK     ++ EKNR    R  
Sbjct: 287  LRNLARIHGNLDLEDRAKELLMYLDPSKSIADELAMPPRKKQHDINMLEEKNRVSEYRYS 346

Query: 802  ------AYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAYGLISTP 963
                  A++KL  LS + +EAGYVPDTRYVLHD+D E KE+AL YHSERLAIAYGLISTP
Sbjct: 347  IPYKEEAHEKLGGLSGQMREAGYVPDTRYVLHDIDEEEKEKALQYHSERLAIAYGLISTP 406

Query: 964  PGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            P T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 407  PRTTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 460


>ref|XP_004505212.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Cicer arietinum]
          Length = 567

 Score =  227 bits (579), Expect = 7e-57
 Identities = 144/363 (39%), Positives = 190/363 (52%), Gaps = 60/363 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EALE M K  V AD+N    L  L    +S+E   ++++Y ++ +      + N++I+MY
Sbjct: 209  EALELMEKG-VKADANCFDLLFDLCGKSKSVEDAKKVHDYFLQSTFRSDFKLHNKVIEMY 267

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETT 576
                      R+F+ M  RN+DSW+MMI     +   +EA+Q+F  +   E  L+    T
Sbjct: 268  GNCKSMTDARRVFDHMPNRNMDSWHMMIRGYANSTMGDEALQLFEHM--NELGLEITSET 325

Query: 577  FASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSK--KEADN----- 735
              ++L AC     VE    +F SM+  YGI P  +H++    ++  S   KEA+      
Sbjct: 326  LLAVLSACGSAEAVEDAYLHFESMKNKYGIEPGVEHYMGLLEVLGQSGYLKEAEEFIEKL 385

Query: 736  -------------------------------ASRMIQKKLVSEKNRAPS----------- 789
                                             R+   K V+ K   P            
Sbjct: 386  PFGPTVTVLETLKSYARIHGDIDLEDHVEELIVRLDPSKAVANKIPTPPPKKYSAISMLE 445

Query: 790  --DRGMAYK---------KLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLA 936
              +R + YK         KL +LS   KEAGYVPDTRYVLHD+D EAKE+ALLYHSERLA
Sbjct: 446  GRNRMIEYKNPTLYKDDEKLKALSGM-KEAGYVPDTRYVLHDIDQEAKEQALLYHSERLA 504

Query: 937  IAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCR 1116
            IAYGLISTPP T LR+IKNLR+CGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC 
Sbjct: 505  IAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCG 564

Query: 1117 DFW 1125
            D+W
Sbjct: 565  DYW 567


>ref|XP_004505209.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Cicer arietinum]
          Length = 567

 Score =  225 bits (574), Expect = 3e-56
 Identities = 143/363 (39%), Positives = 190/363 (52%), Gaps = 60/363 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EALE M K  V AD++    L  L    +S+E   ++++Y ++ +      + N++I+MY
Sbjct: 209  EALELMEKG-VKADASCFDLLFDLCGKSKSVEDAKKVHDYFLQSTFRSDFKLHNKVIEMY 267

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETT 576
                      R+F+ M  RN+DSW+MMI     +   +EA+Q+F  +   E  L+    T
Sbjct: 268  GNCKSMTDARRVFDHMPNRNMDSWHMMIRGYANSTMGDEALQLFEHM--NELGLEITSET 325

Query: 577  FASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSK--KEADN----- 735
              ++L AC     VE    +F SM+  YGI P  +H++    ++  S   KEA+      
Sbjct: 326  LLAVLSACGSAEAVEDAYLHFESMKNKYGIEPGVEHYMGLLEVLGQSGYLKEAEEFIEKL 385

Query: 736  -------------------------------ASRMIQKKLVSEKNRAPS----------- 789
                                             R+   K V+ K   P            
Sbjct: 386  PFGPTVTVLETLKSYARIHGDIDLEDHVEELIVRLDPSKAVANKIPTPPPKKYSAISMLE 445

Query: 790  --DRGMAYK---------KLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLA 936
              +R + YK         KL +LS   KEAGYVPDTRYVLHD+D EAKE+ALLYHSERLA
Sbjct: 446  GRNRMIEYKNPTLYKDDEKLKALSGM-KEAGYVPDTRYVLHDIDQEAKEQALLYHSERLA 504

Query: 937  IAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCR 1116
            IAYGLISTPP T LR+IKNLR+CGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC 
Sbjct: 505  IAYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCG 564

Query: 1117 DFW 1125
            D+W
Sbjct: 565  DYW 567


>gb|AFK42502.1| unknown [Medicago truncatula]
          Length = 565

 Score =  224 bits (571), Expect = 6e-56
 Identities = 138/362 (38%), Positives = 191/362 (52%), Gaps = 59/362 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EALE M K  + AD+N    L  L    +S+E   ++++Y ++ +      + N++I+MY
Sbjct: 207  EALELMEKG-IKADANCFEILFDLCGKSKSVEDAKKVHDYFLQSTFRSDFKMHNKVIEMY 265

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETT 576
                      R+F+ M  RN+DSW+MMI     +   +E +Q+F ++   E  L+    T
Sbjct: 266  GNCKSMTDARRVFDHMPNRNMDSWHMMIRGYANSTMGDEGLQLFEQM--NELGLEITSET 323

Query: 577  FASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSK--KEAD------ 732
              ++L AC     VE    Y  SM+  YGI P  +H++   +++  S   KEA+      
Sbjct: 324  MLAVLSACGSAEAVEDAYIYLESMKSKYGIEPGVEHYMGLLDVLGQSGYLKEAEEFIEQL 383

Query: 733  -------------NASRM---------IQKKLVS------EKNRAPSDRGMAYKKLMSLS 828
                         N +R+         +++ +VS        N+ P+     Y  +  L 
Sbjct: 384  PFEPTVTVFETLKNYARIHGDVDLEDHVEELIVSLDPSKAVANKIPTPPPKKYTAISMLD 443

Query: 829  KK-----------------------AKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAI 939
             K                        K+AGYVPDTRYVLHD+D EAKE+ALLYHSERLAI
Sbjct: 444  GKNRIIEYKNPTLYKDDEKLIAMNSMKDAGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 503

Query: 940  AYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRD 1119
            AYGLISTPP T LR+IKNLR+CGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D
Sbjct: 504  AYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 563

Query: 1120 FW 1125
            +W
Sbjct: 564  YW 565


>ref|XP_003607988.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355509043|gb|AES90185.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 565

 Score =  224 bits (571), Expect = 6e-56
 Identities = 138/362 (38%), Positives = 191/362 (52%), Gaps = 59/362 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EALE M K  + AD+N    L  L    +S+E   ++++Y ++ +      + N++I+MY
Sbjct: 207  EALELMEKG-IKADANCFEILFDLCGKSKSVEDAKKVHDYFLQSTFRSDFKMHNKVIEMY 265

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETT 576
                      R+F+ M  RN+DSW+MMI     +   +E +Q+F ++   E  L+    T
Sbjct: 266  GNCKSMTDARRVFDHMPNRNMDSWHMMIRGYANSTMGDEGLQLFEQM--NELGLEITSET 323

Query: 577  FASILKACELLGEVEKGRAYFASMRKDYGITPSSDHHLSYNNLVKNSK--KEAD------ 732
              ++L AC     VE    Y  SM+  YGI P  +H++   +++  S   KEA+      
Sbjct: 324  MLAVLSACGSAEAVEDAYIYLESMKSKYGIEPGVEHYMGLLDVLGQSGYLKEAEEFIEQL 383

Query: 733  -------------NASRM---------IQKKLVS------EKNRAPSDRGMAYKKLMSLS 828
                         N +R+         +++ +VS        N+ P+     Y  +  L 
Sbjct: 384  PFEPTVTVFETLKNYARIHGDVDLEDHVEELIVSLDPSKAVANKIPTPPPKKYTAISMLD 443

Query: 829  KK-----------------------AKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAI 939
             K                        K+AGYVPDTRYVLHD+D EAKE+ALLYHSERLAI
Sbjct: 444  GKNRIIEYKNPTLYKDDEKLIAMNSMKDAGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 503

Query: 940  AYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRD 1119
            AYGLISTPP T LR+IKNLR+CGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D
Sbjct: 504  AYGLISTPPRTPLRIIKNLRVCGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 563

Query: 1120 FW 1125
            +W
Sbjct: 564  YW 565


>ref|XP_006451427.1| hypothetical protein CICLE_v10008172mg [Citrus clementina]
            gi|568842990|ref|XP_006475408.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g15690-like [Citrus sinensis]
            gi|557554653|gb|ESR64667.1| hypothetical protein
            CICLE_v10008172mg [Citrus clementina]
          Length = 475

 Score =  224 bits (570), Expect = 8e-56
 Identities = 134/365 (36%), Positives = 193/365 (52%), Gaps = 62/365 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLE-LMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDM 393
            EA+E M ++   +    +   L+    +++S+E G +++E +   +    V + N++I+M
Sbjct: 113  EAIEYMGQDASASAGYDVFSSLLDSCGNLKSIEMGKKVHELLRTSAFVKDVELNNKLIEM 172

Query: 394  YLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNET 573
            Y K  + R   ++F+Q+  RN+ SW++MI     NG+  + + +F ++ K      P++ 
Sbjct: 173  YGKCCNTRLARKVFDQLRKRNLSSWHLMISGYAANGQGADGLMLFEQMRKTGPH--PDKE 230

Query: 574  TFASILKACELLGEVEKGRAYFASMRKDYGITP-------------SSDHHLSYNNLVKN 714
            TF  +  AC     V++G  YF  M+ DYGI P             S+ H +     V+ 
Sbjct: 231  TFLVVFAACASAEAVKEGFLYFEIMKNDYGIVPGIEHYIAIIKVLGSAGHLIEAEEFVER 290

Query: 715  SKKEA---------------------DNASRMI-----------------QKK-----LV 765
               E                      D A  ++                 +KK     ++
Sbjct: 291  MPFEPTVEVWEALRNFAQIHGDVELEDRAEELLGDLDPSKAIVDKIPLPPRKKQSATNML 350

Query: 766  SEKNRAPSDRGM-----AYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSER 930
             EKNR    R        Y+K+  L+ + +EAGYVPDTRYVLHD+D EAKE+AL YHSER
Sbjct: 351  EEKNRVSDYRSTDLYRGEYEKMKGLNGQMREAGYVPDTRYVLHDIDEEAKEKALQYHSER 410

Query: 931  LAIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECS 1110
            LAIAYGLISTPP   LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHF+DG+CS
Sbjct: 411  LAIAYGLISTPPRMPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFRDGKCS 470

Query: 1111 CRDFW 1125
            C D+W
Sbjct: 471  CGDYW 475


>gb|EPS72112.1| hypothetical protein M569_02645, partial [Genlisea aurea]
          Length = 419

 Score =  223 bits (567), Expect = 2e-55
 Identities = 135/360 (37%), Positives = 187/360 (51%), Gaps = 57/360 (15%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            E +E M +  + ADS     L QL  + +  E   +++++ +R +    + + N++++MY
Sbjct: 63   EVIEHMDQG-IRADSECFALLFQLCGNSKKFEDAKKVHDFFLRSTFRSDLQLNNKVLEMY 121

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNETT 576
             K G      R+F+ M  RN+DSW++MI +   NG  ++ + +F  +      L PNE T
Sbjct: 122  SKCGSMTDARRVFDHMPDRNMDSWHIMIYSYATNGLGDDGLALFEHM--RTLGLTPNEQT 179

Query: 577  FASILKACELLGEVEKGRAYFASMRKDY----------------GITPSSDHHLSY---- 696
            F ++ +AC     +E+   +F SMR DY                G T   D  LSY    
Sbjct: 180  FLAVFEACATADAIEEAFLHFESMRTDYSIHPSIEHYLGVLGVLGKTGHLDEALSYIETL 239

Query: 697  ---------NNLVKNSKKEAD--------------NASRMIQKKL-------------VS 768
                       L+  ++   D              + ++ +  K+             + 
Sbjct: 240  PFEPTPVIWEALMNYARIHGDIDLEDHAEELMVSLDPTKAVANKIPTPPPKKQTAINMLQ 299

Query: 769  EKNRAPSDRGMA-YKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAY 945
             +NR P  R    YK    +    KE  YVPDTRYVLHD+D EAKE+ALLYHSERLAIAY
Sbjct: 300  GRNRIPEFRNPTIYKDEEKIRAAKKEQAYVPDTRYVLHDIDQEAKEQALLYHSERLAIAY 359

Query: 946  GLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            GLISTP  T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 360  GLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCNDYW 419


>ref|XP_002279824.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like
            [Vitis vinifera]
          Length = 476

 Score =  212 bits (540), Expect = 2e-52
 Identities = 132/360 (36%), Positives = 181/360 (50%), Gaps = 57/360 (15%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EA+E M +  V A+      ++      +S+E G R+++ + R      V + N++I+MY
Sbjct: 118  EAVEYMGQG-VCAEYGVFCAMLSSCGKTKSLEVGRRVHDLLARSKFGGDVELNNKLIEMY 176

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTR-------------- 534
             + G  R   R+F++M  RN+ SW++MI     NG+  + + +F R              
Sbjct: 177  GRCGSMRDARRVFDRMRERNMSSWHLMINGYAANGQGRDGLLLFERMRKVGLRPVGETFV 236

Query: 535  -----------------LVKGEDELKPNETTFASILKACELLGEVEKGRAYFASM----- 648
                             L+K E  + P    +  ++      G + +   +   M     
Sbjct: 237  AVLSACGSVEEGLMYFELMKKECGIIPGIEHYLGVIDVLGKFGHINEAEEFVDKMPIEPT 296

Query: 649  ----------RKDYGITPSSDHHLSYNNLVKNSKKEADNASRMIQKK-----LVSEKNRA 783
                       + +G     D        +  SK   D      QKK     ++  KNR 
Sbjct: 297  AEVWEALRNFARIHGAIELEDRAEEMLAALDPSKAITDKIPTPPQKKQLAVNMLEGKNRV 356

Query: 784  PSDR------GMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAY 945
               R      G AY+KL  L+ + +EAGYVPDTRYVLHD+D EAKE+ALLYHSERLAIAY
Sbjct: 357  SEYRSTNPYKGDAYEKLKGLNGQMREAGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAY 416

Query: 946  GLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDFW 1125
            GLISTP  T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+W
Sbjct: 417  GLISTPARTPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW 476


>ref|XP_007202366.1| hypothetical protein PRUPE_ppa009223mg [Prunus persica]
            gi|462397897|gb|EMJ03565.1| hypothetical protein
            PRUPE_ppa009223mg [Prunus persica]
          Length = 301

 Score =  208 bits (530), Expect = 4e-51
 Identities = 123/303 (40%), Positives = 162/303 (53%), Gaps = 58/303 (19%)
 Frame = +1

Query: 391  MYLKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTRLVKGEDELKPNE 570
            MY + G  R   ++F++M  R++  W+ MI     NG+ +E + +F ++      LKPN+
Sbjct: 1    MYGRCGSMRDARKVFDRMPKRSMSLWHSMIHGYAVNGQGDEGLLLFEQM--RNLGLKPNK 58

Query: 571  TTFASILKACELLGEVEKGRAYFASMRKDYGIT--------------------------- 669
             TF  +L AC     VE+G  YF SM+ +Y I                            
Sbjct: 59   ETFVVVLVACASAEAVEEGLTYFESMKNEYEIVPEIEHYLGLIDVLGKSGHLNEAEEFIE 118

Query: 670  -----PSSDHHLSYNNLVK-------------------NSKKEADNASRMIQKK-----L 762
                 P+++   +  N  +                    SK  A+     ++K+     +
Sbjct: 119  KMPFEPTAEVWEALRNFARIHGDIELEDRAEDLLVSLDPSKANAEKIPLPLRKQHSEINM 178

Query: 763  VSEKNRAPSDR--GMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLA 936
            + EKNR    R    AY+KL  L  + +EAGYVPDTRYVLHD+D EAKE+AL YHSERLA
Sbjct: 179  LGEKNRVSEYRITNEAYEKLKGLKGQMREAGYVPDTRYVLHDIDQEAKEQALQYHSERLA 238

Query: 937  IAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCR 1116
            IAYGLISTP    LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC 
Sbjct: 239  IAYGLISTPARQTLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCG 298

Query: 1117 DFW 1125
            D+W
Sbjct: 299  DYW 301


>emb|CAN64107.1| hypothetical protein VITISV_013147 [Vitis vinifera]
          Length = 497

 Score =  208 bits (529), Expect = 5e-51
 Identities = 131/359 (36%), Positives = 180/359 (50%), Gaps = 57/359 (15%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EA+E M +  V A+      ++      +S+E G R+++ + R      V + N++I+MY
Sbjct: 120  EAVEYMGQG-VCAEYGVFCAMLSSCGKTKSLEVGRRVHDLLARSKFGGDVELNNKLIEMY 178

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFTR-------------- 534
             + G  R   R+F++M  RN+ SW++MI     NG+  + + +F R              
Sbjct: 179  GRCGSMRDARRVFDRMRERNMSSWHLMINGYAANGQGRDGLLLFERMRKVGLRPVGETFV 238

Query: 535  -----------------LVKGEDELKPNETTFASILKACELLGEVEKGRAYFASM----- 648
                             L+K E  + P    +  ++      G + +   +   M     
Sbjct: 239  AVLSACGSVEEGLMYFELMKKECGIIPGIEHYLGVIDVLGKFGHINEAEEFVDKMPIEPT 298

Query: 649  ----------RKDYGITPSSDHHLSYNNLVKNSKKEADNASRMIQKK-----LVSEKNRA 783
                       + +G     D        +  SK   D      QKK     ++  KNR 
Sbjct: 299  AEVWEALRNFARIHGAIELEDRAEEMLAALDPSKAITDKIPTPPQKKQLAVNMLEGKNRV 358

Query: 784  PSDR------GMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLAIAY 945
               R      G AY+KL  L+ + +EAGYVPDTRYVLHD+D EAKE+ALLYHSERLAIAY
Sbjct: 359  SEYRSTNPYKGDAYEKLKGLNGQMREAGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAY 418

Query: 946  GLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCRDF 1122
            GLISTP  T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC D+
Sbjct: 419  GLISTPARTPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDY 477


>gb|EXB38821.1| Serine acetyltransferase 5 [Morus notabilis]
          Length = 819

 Score =  204 bits (520), Expect = 5e-50
 Identities = 127/364 (34%), Positives = 182/364 (50%), Gaps = 61/364 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EALE + +  V AD      L+    D  S+E G R++E++ R      V + N++++MY
Sbjct: 458  EALEFLGQG-VSADYEVFRSLLDSCRDKRSVEVGKRVHEFLRRSPFRGDVELSNKVMEMY 516

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMI-------------------------------- 480
             K G  +   ++F++M  R++ SW++MI                                
Sbjct: 517  GKCGSVKSARKVFDRMTERSLSSWHLMIKVYALNGQGDDGLLLFEQMKNVGLSPDKETFS 576

Query: 481  ---EALVENGEAEEAIQVFTRLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASM- 648
               EA    G  EE + +F  + + E  + P    +  +++     G + +   +   M 
Sbjct: 577  VVLEACASAGAVEEGLIIFESM-ESEYGIVPGIEHYLRVVEVLGASGYLNEAEEFIKKMP 635

Query: 649  --------------RKDYGITPSSDH-----------HLSYNNLVKNSKKEADNASRMIQ 753
                           + +G     D             ++ N +    +K    +S + +
Sbjct: 636  FEPPAEAWEALRNFTRTHGDLELEDRIEELLVTVDPSKVNPNKIPLAQRKRHSESSMIEE 695

Query: 754  KKLVSEKNRAPSDRGMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERL 933
            K  VSE       +   Y+KL  L+ + +EAGYVPDTRYVLHD+D EAKE+AL YHSERL
Sbjct: 696  KSRVSEYRCPNPYKEEVYQKLKGLNGQLREAGYVPDTRYVLHDIDEEAKEQALQYHSERL 755

Query: 934  AIAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSC 1113
            AIAYGLISTPP T LR++KNLRICGDCHN IKI+S    RE IVRD KRFHHFKDG+CSC
Sbjct: 756  AIAYGLISTPPRTTLRIMKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFKDGKCSC 815

Query: 1114 RDFW 1125
             D+W
Sbjct: 816  GDYW 819


>ref|XP_007012889.1| Tetratricopeptide repeat-like superfamily protein, putative
            [Theobroma cacao] gi|508783252|gb|EOY30508.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative [Theobroma cacao]
          Length = 485

 Score =  204 bits (520), Expect = 5e-50
 Identities = 128/363 (35%), Positives = 184/363 (50%), Gaps = 60/363 (16%)
 Frame = +1

Query: 217  EALEEMAKNRVVADSNHLLELMQLTVDMESMEGGDRIYEYVMRFSSNYSVSIFNEMIDMY 396
            EAL+ M +  V+AD N    L+    +M S+E   R+ E+  R   +  + + N++I +Y
Sbjct: 124  EALDHMGQG-VLADFNVFGALLDACGNMNSLELARRVNEFFRRSKFSGDIELNNKLIGIY 182

Query: 397  LKLGHYRRGGRIFEQMLCRNIDSWNMMIEALVENGEAEEAIQVFT--------------- 531
             K    R   R+F++M  RN+ SWN+MI     NG+ ++ + +F                
Sbjct: 183  GKCASIRDARRVFDKMRERNMASWNLMINDYAVNGKGDDGLSLFEDMRKDGFQPDSETFL 242

Query: 532  -------------------RLVKGEDELKPNETTFASILKACELLGEVEKGRAYFASM-- 648
                                L+K E  + P    +  ++      G + +   +  +M  
Sbjct: 243  AVLSACASVAAVEEGIMYFELMKNEYRIAPGVEHYLGVIDVFGRAGYLNEAVEFIENMPI 302

Query: 649  -------------RKDYGITPSSDHH----LSYNNLVKNSKK-EADNASRMIQKKLVSEK 774
                          + +G     DH     L ++  +++  + +A    +     ++ EK
Sbjct: 303  EPTVEIWEAVRGFARIHGDIDLEDHFEELLLGFDPPMRSENEHQAPPRKKHSVINMIEEK 362

Query: 775  NRAPSDR------GMAYKKLMSLSKKAKEAGYVPDTRYVLHDLDHEAKERALLYHSERLA 936
            NR    R      G   +KL  L+ + +EAGYVPDTRYVLHD+D EAKE+AL YHSERLA
Sbjct: 363  NRVIEYRCMNPFKGEVNEKLKGLNGQMREAGYVPDTRYVLHDIDQEAKEQALQYHSERLA 422

Query: 937  IAYGLISTPPGTALRVIKNLRICGDCHNFIKILSTFEKREFIVRDTKRFHHFKDGECSCR 1116
            IAYGLISTP  T LR+IKNLRICGDCHN IKI+S    RE IVRD KRFHHF+DG+CSC 
Sbjct: 423  IAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFRDGKCSCG 482

Query: 1117 DFW 1125
            D+W
Sbjct: 483  DYW 485


Top