BLASTX nr result

ID: Ephedra26_contig00008186 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00008186
         (2477 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   478   e-132
ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [A...   474   e-131
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   464   e-128
gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily p...   464   e-128
ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containi...   464   e-128
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   462   e-127
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   462   e-127
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   450   e-123
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     449   e-123
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-122
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   446   e-122
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   445   e-122
gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus pe...   441   e-121
gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   439   e-120
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   438   e-120
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   432   e-118
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   431   e-117
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                430   e-117
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   430   e-117

>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  478 bits (1231), Expect = e-132
 Identities = 300/825 (36%), Positives = 446/825 (54%), Gaps = 30/825 (3%)
 Frame = +2

Query: 8    SARTTKLSTSTNNESSIKATTSPPKVPLASEKRLWVAKNLMKQGNLMDCLSLMREVSEKG 187
            S R +  +  ++ E  I   +S  ++P+  +    VA  L + G L D   ++  V   G
Sbjct: 49   STRHSPPALLSSVELDIAGASSGGRIPI--QHYAGVASKLAEGGKLEDFAMVVESVVVAG 106

Query: 188  -RVCVLG-LLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLE 361
                  G +L +E +   I   +  G V  VV  LR + E+G     L DE A+  L  +
Sbjct: 107  VEPSQFGAMLAVELVAKGISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRD 166

Query: 362  LSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRN 541
              +M K   L  +E VE ++VL    F  +E+++P  V K C+  ++  +A+RY S++ +
Sbjct: 167  CRRMAKSGEL--EELVELMEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPH 224

Query: 542  PHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIF 721
              + +   I EFG+K DL SA I + + ++     NM +YR IID CG CGD   +  I+
Sbjct: 225  ADILFCTTINEFGKKRDLKSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIY 284

Query: 722  KEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARN 901
            +++++ NV+P+ FV+NS MN  A DLN T Q + +MQ+ GV  D+A+YNILLKAC +A  
Sbjct: 285  QDLVNQNVIPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGR 344

Query: 902  ADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMI 1081
             DLA DIY  +K     G +K+DV TYST+++V  D+K    AL +K DM+ AG+ P+M+
Sbjct: 345  VDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMV 404

Query: 1082 TWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNW 1261
            TW+ L+++CAN GLVE A+ LF+EMV +G +PN+QCCN LL A V+  Q+ RAF  F++W
Sbjct: 405  TWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNTLLHACVEGRQFDRAFRLFRSW 464

Query: 1262 KEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK------------------FKPTVVTYNTL 1387
            KEK  + G  ++    N  D  S    C  K                  FKPT+ TYN L
Sbjct: 465  KEKELWDGIERKSSTDNNLDADSTSQLCNTKMPNAPSHVHQISFVGNFAFKPTITTYNIL 524

Query: 1388 MKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKP 1567
            MKAC +  Y A+ +M+EMK  G+ P++++WS+L+D  G + D++  +Q    M  AGV P
Sbjct: 525  MKACGTDYYHAKALMEEMKSVGLTPNHISWSILVDICGRSHDVESAVQILTTMRMAGVDP 584

Query: 1568 DVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALL 1747
            DVV YTT IKVCV+ KN+  A   FE MK+  +QPN +TY+T+LR    +G   +V   L
Sbjct: 585  DVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCL 644

Query: 1748 SLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSDI 1927
            ++Y++MRKSGF  ND +LK LI EW EGVIQK            N  Q  +     K DI
Sbjct: 645  AIYQDMRKSGFKSNDHYLKELIAEWCEGVIQK------------NNQQPVEITPCNKIDI 692

Query: 1928 -----IFFEKIAG-LACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDL 2089
                 +  EK+A  L  S +   T+DL+ L++ E             E +  G  +  D+
Sbjct: 693  GKPRCLILEKVADHLQKSFAESLTIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDI 752

Query: 2090 IIIIGYS---GNLQKQRLE-SSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCIN 2257
             II+  +    +L  Q  E    IT++L  ELGL VL       P +A      +    N
Sbjct: 753  FIILEVNKVETDLVPQNFEVRDAITRLLQDELGLEVLPT----GPTIA------LDKVPN 802

Query: 2258 NRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
            +  +K  + T    +    +   ++   VQRL V KKSL  W+++
Sbjct: 803  SESSKISHTTKLKGTMGRNKYFTRKPADVQRLKVTKKSLQDWLQR 847


>ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [Amborella trichopoda]
            gi|548832949|gb|ERM95718.1| hypothetical protein
            AMTR_s00023p00232870 [Amborella trichopoda]
          Length = 855

 Score =  474 bits (1220), Expect = e-131
 Identities = 271/762 (35%), Positives = 429/762 (56%), Gaps = 34/762 (4%)
 Frame = +2

Query: 209  LDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELSKMVKKDS 388
            L ++ +     + ++NG+   V+  +   +++G     + D  A  +L     +++  D+
Sbjct: 108  LSIKHVSAGFALCLKNGEFDTVLGVMEKFDKLGICPSLIFDGSARRLLLSACRRVLDGDN 167

Query: 389  LGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLI 568
            +G  E V  +++    +F  +++V+P  + + CI   D  +A RY S++ +  VW+NFLI
Sbjct: 168  IG--EFVRLVEIFAGYRFSVKDVVKPTFILQACIDRHDPFMAGRYASILPHADVWFNFLI 225

Query: 569  REFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVL 748
             EFG+K DL SAL+ F   +  + +PNM +YR IIDACG CGDS  + +IF+++L   + 
Sbjct: 226  CEFGKKKDLQSALVAFEVSKGKSVSPNMYIYRSIIDACGYCGDSLKSRSIFEDLLVQKIT 285

Query: 749  PSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYE 928
            P+TFV+NS MN  A D +  L  +  M+  GV  D+A+YN+LLK C +A   DLA +IYE
Sbjct: 286  PNTFVFNSLMNVNAHDSHYALHIYKQMKKLGVAADMASYNVLLKVCCLAGRVDLAQEIYE 345

Query: 929  GLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTAC 1108
             + ++++ GG+K+DVITYST+I+V  D+K    A +IK DM  AG+ P+++TW+ L++AC
Sbjct: 346  EILQRALFGGLKLDVITYSTIIKVFADAKMWEMAFKIKDDMISAGVSPNIVTWSSLISAC 405

Query: 1109 ANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGS 1288
            AN GLVE+ + + +EM++ G +PN+QCCN LL+A V+  Q+ RAF  F  WK+ GF  GS
Sbjct: 406  ANAGLVERVIQVLEEMLVVGCEPNTQCCNILLNACVESCQFDRAFRIFHFWKQNGFSMGS 465

Query: 1289 LKR---------------YKKGNLPDNFSAPP--------SCIPKFKPTVVTYNTLMKAC 1399
              +               +  GN   + ++          S +  FKPTV TYN LMKAC
Sbjct: 466  NAKECGSKTVTDIKQNEYFSSGNHEFHITSDALDPHDLNFSEVIPFKPTVATYNILMKAC 525

Query: 1400 RSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVT 1579
             +  Y A+ +MDEMK  G+ P++++WS+LID  G + +++G +QAF +M  AG+ PDVV 
Sbjct: 526  GTDYYRAQALMDEMKAGGLSPNHISWSILIDICGRSYNMKGAIQAFKSMYNAGIIPDVVA 585

Query: 1580 YTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYE 1759
            YTT IK CV NK F  A   FE MK+  +QPN +TYNT+L     +G   +V   L++Y+
Sbjct: 586  YTTAIKACVGNKYFKMAFSLFEEMKRHRLQPNLVTYNTLLTARSRYGSLDEVLQCLAIYQ 645

Query: 1760 EMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSDIIF-- 1933
            +MRK+G+  ND FLK L+EEW EGVI            KG ++   + +   K   ++  
Sbjct: 646  DMRKAGYNSNDRFLKELLEEWCEGVISD----------KGKRWSELNIDKCDKGSEVYGP 695

Query: 1934 ----FEKIAG-LACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIII 2098
                 EK+A  L  + + + T+DLRGL++ E             E +  G P+  D+III
Sbjct: 696  QSLLLEKVAAYLQENFAENLTIDLRGLTKVEARIIVLAKLRMLKENYILGKPVRDDMIII 755

Query: 2099 IGYS-GNLQKQRLE---SSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCINNRR 2266
               +  N+     E      + ++L  ELGL VL+  +  +     T  + ++S ++   
Sbjct: 756  TANTRSNMDAAETELRVRDAVIRVLQGELGLSVLEGPELGE---LSTRHAHVISSLSPET 812

Query: 2267 TKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
                 +    +     R P+     VQRL +P++SL  W++K
Sbjct: 813  LTMSKRP--QLREYTTRRPV----DVQRLKIPRRSLNLWLQK 848


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  468 bits (1205), Expect = e-129
 Identities = 281/834 (33%), Positives = 452/834 (54%), Gaps = 49/834 (5%)
 Frame = +2

Query: 38   TNNESSIKATTSPPKVPLASEKRLW-----------------VAKNLMKQGNLMDCLSLM 166
            T++ S   ++ + P+ PL S  R W                 +A  L + G   D L + 
Sbjct: 42   THSPSHFTSSITTPQSPLLSSLR-WDSASASGSCNGLKYYAELASKLAQDGRFDDSLMIA 100

Query: 167  REVSEKGRVC--VLGLLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERA 340
              V   G        LL+++ + G I   +E   VG VV+ L    ++G     LLDE +
Sbjct: 101  ESVVVSGVNAEEFTALLNVKLVSGGIVRLLEERKVGSVVELLNGAQQLGIDPSKLLDEDS 160

Query: 341  LAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALR 520
            +  L  E  + ++     ++E V  ++ L  C    ++LV+P  + + C+  +  + A+R
Sbjct: 161  INALSRECRRTMQCSE--IEEVVSLMETLRGCGMPIKDLVKPSEILRLCVSQRKPNAAVR 218

Query: 521  YVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDS 700
            Y  +  +  + +  +I EFG+KGDL+SAL VF   +    TPN+ +YR  ID CG CGD 
Sbjct: 219  YAHIFPHVDIMFCTIILEFGKKGDLASALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDY 278

Query: 701  TMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLK 880
              + +I++ +++    P+ +V+NS MN  A DL+ TL  +  MQ  GV  D+ +YNILLK
Sbjct: 279  LKSRSIYEGLIASKFTPNIYVFNSLMNVNACDLSYTLDIYKQMQKLGVPADLTSYNILLK 338

Query: 881  ACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVA 1060
            +C +A   DLA +IY  LK   + G +K+DV TYSTLI+V  D+K    ALEIK+DM  A
Sbjct: 339  SCCLATRVDLAKEIYGELKHLEMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSA 398

Query: 1061 GILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARA 1240
            G+ P+++TW+ L++ACAN G+V++A+ LF+EM+ +G +PNSQC N LL A V+  QY RA
Sbjct: 399  GVTPNIVTWSSLISACANAGVVDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRA 458

Query: 1241 FTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK--------------------FK 1360
            F  F++WKE        + Y  G   +N    P+ +                      F 
Sbjct: 459  FRLFRSWKENALQKDKCEDY-GGKTDNNIDLSPTLVVSASIPTRTSASSHRHISTRVPFI 517

Query: 1361 PTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFN 1540
            PT  TYN LMKAC S  Y A+ +M+EMK+ G+ P+++TW++LID  G + +++G +Q   
Sbjct: 518  PTTSTYNILMKACGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILR 577

Query: 1541 NMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHG 1720
             M EAG++PDVVTYTT+IKVCV+NK+F  A   F AMK+  ++PN +TYNT+LR    +G
Sbjct: 578  VMREAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYG 637

Query: 1721 DHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRK- 1897
               +V   L++Y++MRK+G+ PND +LK LIE+W EGVIQ             N  QRK 
Sbjct: 638  SLQEVQQCLAIYQDMRKAGYKPNDYYLKQLIEQWCEGVIQ-------------NANQRKY 684

Query: 1898 DENTRQKSDI----IFFEKIA-GLACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHG 2062
            + +TR ++D+    +  EK+A  L    ++  +++LRGL++ E             EK+ 
Sbjct: 685  NFSTRNRTDLGPQSMILEKVAEHLQKDSANSISINLRGLTKVEARIVVLAVLRMIREKYT 744

Query: 2063 PGNPIDSDLIIIIGYS----GNLQKQRLESSHITKILNQELGLLVLQNRDYDQPVVAGTS 2230
             G+ I  D+ I +G        ++++ +    I ++L  +LGL V+          A ++
Sbjct: 745  AGDSIKDDVQIFLGVKEVGIRAVKQESVVKEAIIQLLQHDLGLEVIS---------AAST 795

Query: 2231 SSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
              + ++  +N+ +         +   +   P ++   +Q++ + K+SL  W+ +
Sbjct: 796  IGNGINHPDNKHSNMEENAERVILRPSVYSPTRKPVVLQKMRITKESLQSWLTR 849


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  464 bits (1195), Expect = e-128
 Identities = 288/841 (34%), Positives = 449/841 (53%), Gaps = 56/841 (6%)
 Frame = +2

Query: 38   TNNESSIKATTSPPKVPLASEKRLW---------------VAKNLMKQGNLMDCLSLMRE 172
            T++ S   ++ + P+ PL S  R W               +A  L + G   D L +   
Sbjct: 42   THSPSHFTSSITTPQSPLLSTLR-WDSASGSCNGLKYYAELASKLAQDGRFDDSLMIAES 100

Query: 173  VSEKGRVCV--LGLLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALA 346
            V   G        LL+++ + G I   +E   VG VV+ L    ++G     LLD  AL 
Sbjct: 101  VVVSGVNAAEFAALLNVKLVSGGIVRLLEERKVGSVVELLNGAQQLGIDPLKLLDGDALN 160

Query: 347  MLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYV 526
             L  E  + +      ++E V  ++ L+ C    ++LV+P  + + C+  +  + A+RY 
Sbjct: 161  ALSRECRRTMGCGE--IEEVVSLMETLKGCGMPIKDLVKPSEILRLCVSQRKPNAAVRYA 218

Query: 527  SLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTM 706
             +  +  + +  +I EFG+KGDL SAL VF   +    TPN+ +YR  ID CG CGD   
Sbjct: 219  HIFPHVDIMFCTIILEFGKKGDLVSALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYLK 278

Query: 707  AETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKAC 886
            + +I++ +++    P+ +V+NS MN  A DL+ TL  +  MQ  GV  D+ +YNILLK+C
Sbjct: 279  SRSIYEGLIASKFTPNIYVFNSLMNVNACDLSYTLDIYKQMQKLGVPADLTSYNILLKSC 338

Query: 887  RVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGI 1066
             +A   DLA +IY  LK   + G +K+DV TYSTLI+V  D+K    ALEIK+DM  AG+
Sbjct: 339  CLATRVDLAKEIYGELKHLEMAGALKLDVFTYSTLIKVFADAKMWQMALEIKKDMLSAGV 398

Query: 1067 LPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFT 1246
             P+++TW+ L++ACAN GLV++A+ LF+EM+ +G +PNSQC N LL A V+  QY RAF 
Sbjct: 399  TPNIVTWSSLISACANAGLVDQAIQLFEEMLQAGCEPNSQCYNILLHACVEACQYDRAFR 458

Query: 1247 FFKNWKEKGFYAGSLKRY---------------KKGNLPDNFSAPP----SCIPKFKPTV 1369
             F++WKE      + + +                  ++P   SA      S    F+PT 
Sbjct: 459  LFRSWKENALQKDNCEDFGGKTDNTIDLSPTLVVSASIPTRTSASSHGHFSTRVPFRPTT 518

Query: 1370 VTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMC 1549
             TYN L+KAC S  Y A+ +M+EMK+ G+ P+++TW++LID  G + +++G +Q    M 
Sbjct: 519  STYNILIKACGSDYYRAKALMEEMKEVGLSPNHITWTILIDICGGSGNVEGALQILRAMR 578

Query: 1550 EAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHM 1729
            EAG++PDVVTYTT+IKVCV+NK+F  A   F AMK+  ++PN +TYNT+LR    +G   
Sbjct: 579  EAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIKPNMVTYNTLLRARSRYGSLQ 638

Query: 1730 QVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENT 1909
            +V   L++Y+ MRK+G+ PND +LK LIE+W EGVIQ            GNQ ++ + +T
Sbjct: 639  EVQQCLAIYQHMRKAGYKPNDYYLKQLIEQWCEGVIQ-----------NGNQ-RKYNFST 686

Query: 1910 RQKSDI----IFFEKIA-GLACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNP 2074
            R ++D+    +  +K+A  L    ++  +++LRGLS+ E             EK+  G+ 
Sbjct: 687  RNRTDLGPESMILDKVAEHLQKDSANSISINLRGLSKVEARIVVLAVLRMIREKYTAGDS 746

Query: 2075 IDSDLIIIIGYS----GNLQKQRLESSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDI 2242
            I  D+ I +G        + ++ +    I K+L  +LGL V+                  
Sbjct: 747  IKEDVQIFLGVQEVGIRAVGQESVVKEAIVKLLQHDLGLEVI----------------SA 790

Query: 2243 LSCINNRRTKAGNKTGFSVSNLNE-----------RMPLKRLPTVQRLIVPKKSLYQWVE 2389
             S I N R + G       SN+ E             P ++   +Q++ + K+SL  W+ 
Sbjct: 791  ASRIGNDRNQDGINHPDKHSNMEENAERVILRANVHSPTRKPVVLQKMRITKESLQSWLT 850

Query: 2390 K 2392
            +
Sbjct: 851  R 851


>gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao]
          Length = 858

 Score =  464 bits (1195), Expect = e-128
 Identities = 287/782 (36%), Positives = 434/782 (55%), Gaps = 24/782 (3%)
 Frame = +2

Query: 113  VAKNLMKQGNLMDCLSLMREVSEKGRVC--VLGLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            +A  L + G L D   ++  +   G     ++ +L ++ +   +  +++ G V  VV+ L
Sbjct: 89   LASKLAEDGRLEDFAMIVEMLVASGVNAPRIVSMLSVQFVSKGVASNVQEGKVKSVVEVL 148

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+D   L  +  E  ++V      V++ V+ L+ L   QF  +ELV+P
Sbjct: 149  KKVEKLGIAPSKLVDGFGLVSMKREFQRIVGSGE--VEQAVDLLEALRGFQFTIKELVDP 206

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              + K C+  ++ +LA+RY  L+ +  + +  +I EFG+K DL+SAL  +   +   + P
Sbjct: 207  SYIIKVCVDKRNPNLAVRYACLLPHAKILFCSIISEFGKKRDLASALTAYEASKKNLSGP 266

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM LYR IIDACG CGD   +  I++++++  V P+ +V+NS MN  A DL  TL  + D
Sbjct: 267  NMYLYRAIIDACGLCGDYLKSRNIYEDLVNQRVTPNIYVFNSLMNVNAHDLGYTLDVYKD 326

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ+ G+  D+A+YNILLKAC +A+  DLA DIY  +K     G +K+DV TY T+I+V  
Sbjct: 327  MQNLGITADMASYNILLKACCLAQRVDLAQDIYNEVKHLESTGVLKLDVFTYCTIIKVFA 386

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D++    AL+IK DM  AG+ P+ +TW+ L++ACAN GLVE+A  LF+EM+L+G +PNSQ
Sbjct: 387  DARLWQMALKIKEDMLSAGVTPNTVTWSSLISACANAGLVEQAFQLFEEMILTGCEPNSQ 446

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK--EKGFYAGSL----------KRYKKGNLPDNFS 1330
            CCN LL A V+  QY RAF  F  W   ++GF AG++           R     L ++  
Sbjct: 447  CCNILLHACVEASQYDRAFRLFHCWTGGQEGF-AGNIDSVLGTKQLNNRTTSTALTNSHH 505

Query: 1331 APPSCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTA 1510
               +    F PT  TYN LMKAC +  Y A+ +MDEMK  G+ P++V+WS+LID    + 
Sbjct: 506  LSFAKKFSFTPTTATYNILMKACCTDYYRAKALMDEMKSVGLSPNHVSWSILIDICRGSG 565

Query: 1511 DLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYN 1690
            +++G +Q    M   G+KPDVV YTT IKVCV +KN   A   FE MK+  VQPN +TYN
Sbjct: 566  NVEGAIQILKTMHVTGIKPDVVAYTTAIKVCVGSKNLKLAFSLFEEMKRYRVQPNLVTYN 625

Query: 1691 TILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMK 1870
            T+LR    +G   +V   L++Y++MRK+G+  ND +LK LIEEW EGVI           
Sbjct: 626  TLLRARSRYGSLHEVQQCLAIYQDMRKAGYKSNDIYLKELIEEWCEGVI----------- 674

Query: 1871 LKGNQYQRKDENTRQKSDI-----IFFEKIA-GLACSKSSDFTVDLRGLSETETXXXXXX 2032
             K N ++R+  ++ +++D+     +  EKIA  L  S +    +DLRGL++ E       
Sbjct: 675  -KENNHKREGLSSCKRTDLERPHSLLLEKIAVHLQMSTAESPAIDLRGLTKVEARIVVLA 733

Query: 2033 XXXXXXEKHGPGNPIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQELGLLVLQNRD 2200
                  E H  G+ +  D++II+G S    N  KQ+ E    + K+L  ELGL VL    
Sbjct: 734  VLRMIKENHILGHSVKDDMLIILGVSERHANAAKQKSEVKDAVMKLLQDELGLEVLL--- 790

Query: 2201 YDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQ 2380
             +  V  G          +    +   K   S   L+     +R   +QRL V +KSL  
Sbjct: 791  VEPQVKNGLVDLQTPIDADPVLLETVGKNSLSSKPLSS---TRRPVILQRLKVTRKSLNH 847

Query: 2381 WV 2386
            W+
Sbjct: 848  WL 849


>ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cicer arietinum]
          Length = 799

 Score =  464 bits (1195), Expect = e-128
 Identities = 280/753 (37%), Positives = 425/753 (56%), Gaps = 25/753 (3%)
 Frame = +2

Query: 209  LDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSL---LDERALAMLGLELSKMVK 379
            L+ E L   + + I+  +  +V+D L  L  +G +G SL    DE A++++  E S MV 
Sbjct: 70   LNAELLANAVLMGIKGRNFRIVIDSLNKLQGIG-NGISLSTQFDESAMSVIAKECSFMVT 128

Query: 380  KDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWN 559
                 ++E VE ++VL   Q    ELV+P  + K+C+  +  +LA+RY SL+   H+ + 
Sbjct: 129  SGH--IQESVELMEVLSRYQLSIGELVQPSDIIKRCVLNRKPNLAVRYASLLPQAHILFC 186

Query: 560  FLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSH 739
             +I  FG+  DL SAL  +   +     PNM +YR IID CG CGD   +  I++++L+ 
Sbjct: 187  SIISGFGKSRDLVSALKAYDAMKKNLKRPNMYIYRAIIDVCGLCGDFMKSRYIYEDLLNQ 246

Query: 740  NVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAID 919
             + P+ +V+NS MNA A D++ TL  + +MQ  G+KPD+ +YNILLKAC VA   DLA D
Sbjct: 247  KITPNIYVFNSLMNANAHDISYTLNLYQNMQKVGLKPDMTSYNILLKACCVAGRVDLAQD 306

Query: 920  IYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLL 1099
            +Y+ LK     G +K+DV TYST+I+V  D+K    AL+IK DM +AG+  + + W+ L+
Sbjct: 307  MYKELKHLESIGQLKLDVFTYSTIIKVFADAKLWQMALKIKHDMLLAGVSLNTVAWSSLI 366

Query: 1100 TACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFY 1279
             ACA+ GLVE+A+ LF+EM+LSG +PN+QC N +L A V+  QY RAF FF +WK     
Sbjct: 367  NACAHAGLVEQAIQLFEEMLLSGCEPNTQCFNIILHACVEGCQYDRAFRFFYSWKGNKTL 426

Query: 1280 AGSLKRYKKGNLPDNFSAPPSCIPK---------------FKPTVVTYNTLMKACRSTPY 1414
                + +          +  + +PK               FKPT  TYNTL+KAC +  Y
Sbjct: 427  VSFGESHNSNAEEGGMDSVTTTVPKGISSSHIMSFTERFPFKPTTSTYNTLLKACGTNYY 486

Query: 1415 LARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLI 1594
             A+ +++EMK  G+ P+ ++WS+LI+  G + +++G ++    M +AGVKPDVV YTT I
Sbjct: 487  HAKALINEMKTVGLSPNQISWSILINICGGSENVEGAIEILRTMIDAGVKPDVVAYTTAI 546

Query: 1595 KVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKS 1774
            KVCV++KNFTKA+  +E MK    QPN +TYNT+LR   ++G   +V   L++Y++MRK+
Sbjct: 547  KVCVESKNFTKALTLYEEMKSYETQPNLVTYNTLLRARSKYGSLREVQQCLAIYQDMRKA 606

Query: 1775 GFPPNDEFLKGLIEEWAEGVIQ--KSYSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIA 1948
            G+ PND +L+ LIEEW EGVIQ  + Y    S         +K E  R +S  +  EKIA
Sbjct: 607  GYKPNDYYLEELIEEWCEGVIQDNEEYEVEFSS-------SKKPEIERPES--LLLEKIA 657

Query: 1949 GLACSKSSD-FTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS---GN 2116
                 + +D   +D++GLS+ E             E +  G+ ++ D++IIIG +    +
Sbjct: 658  AHLLKRVADILAIDVQGLSKVEARLVILAVLRMIKENYAFGHSVNDDILIIIGATKADES 717

Query: 2117 LQKQRLE-SSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGF 2293
              K+ LE    + K+L  ELGL  L       P     + SD     N +         F
Sbjct: 718  PAKEILEVQEAVIKLLRNELGLEAL-------PAKTRFAPSDSPKLQNTKENALPTTMVF 770

Query: 2294 SVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
                       +R   +QRL V K+SL++W+++
Sbjct: 771  HT---------RRPAVLQRLKVTKQSLHRWLQR 794


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  462 bits (1190), Expect = e-127
 Identities = 298/831 (35%), Positives = 441/831 (53%), Gaps = 36/831 (4%)
 Frame = +2

Query: 8    SARTTKLSTSTNNESSIKATTSPPKVPLASEKRLWVAKNLMKQGNLMDCLSLMREVSEKG 187
            S R +  +  ++ E  I   +S  ++P+  +    VA  L + G L D   ++  V   G
Sbjct: 49   STRHSPPALLSSVELDIAGASSGGRIPI--QHYAGVASKLAEGGKLEDFAMVVESVVVAG 106

Query: 188  -RVCVLG-LLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLE 361
                  G +L +E +   I   +  G V  VV  LR + E+G     L DE A+  L  +
Sbjct: 107  VEPSQFGAMLAVELVAKGISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRD 166

Query: 362  LSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRN 541
              +M K   L  +E VE ++VL    F  +E+++P  V K C+  ++  +A+RY S++ +
Sbjct: 167  CRRMAKSGEL--EELVELMEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPH 224

Query: 542  PHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIF 721
              + +   I EFG+K DL SA I + + ++     NM +YR IID CG CGD   +  I+
Sbjct: 225  ADILFCTTINEFGKKRDLKSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIY 284

Query: 722  KEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARN 901
            +++++ NV P+ FV+NS MN  A DLN T Q + +MQ+ GV  D+A+YNILLKAC +A  
Sbjct: 285  QDLVNQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGR 344

Query: 902  ADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMI 1081
             DLA DIY  +K     G +K+DV TYST+++V  D+K    AL +K DM+ AG+ P+M+
Sbjct: 345  VDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMV 404

Query: 1082 TWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNW 1261
            TW+ L+++CAN GLVE A+ LF+EMV +G +PN+QCCN LL A V+  Q+ RAF  F++W
Sbjct: 405  TWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNTLLHACVEGRQFDRAFRLFRSW 464

Query: 1262 KEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK------------------FKPTVVTYNTL 1387
            KEK  + G  ++    N  D  S    C  K                  FKPT+ TYN L
Sbjct: 465  KEKELWDGIERKSSTDNNLDADSTSQLCTTKMPNAPSHVHQISFVGNLAFKPTITTYNIL 524

Query: 1388 MKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKP 1567
            MKAC +  Y A+ +M+EMK  G+ P++++WS+L+D  G + D++  +Q    M  AGV P
Sbjct: 525  MKACGTDYYHAKALMEEMKSVGLTPNHISWSILVDICGRSHDVESAVQILTTMRMAGVDP 584

Query: 1568 DVVTYTTLIK------VCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHM 1729
            DVV YTT IK      V V   N+  A   FE MK   +QPN +TY+T+LR    +G   
Sbjct: 585  DVVAYTTAIKVSIPLAVLVLKXNWKLAFSLFEEMKGFEIQPNLVTYSTLLRARSTYGSLH 644

Query: 1730 QVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENT 1909
            +V   L++Y++MRKSGF  ND +LK LI EW EGVIQK            N  Q  +   
Sbjct: 645  EVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQK------------NNQQPVEITP 692

Query: 1910 RQKSDI-----IFFEKIAG-LACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGN 2071
              K DI     +  EK+A  L  S +   T+DL+ L++ E             E +  G 
Sbjct: 693  CNKIDIGKPRCLILEKVADHLQKSFAESLTIDLQELTKVEARIVVLAVLRMIKENYALGE 752

Query: 2072 PIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQELGLLVLQNRDYDQPVVAGTSSSD 2239
             +  D+ II+  +    +L  Q  E    IT++L  ELGL VL       P +A      
Sbjct: 753  SVKDDIFIILEVNKVETDLVPQNFEVRDAITRLLQDELGLEVLPT----GPTIA------ 802

Query: 2240 ILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
            +    N+  +K  + T    +    +   ++   VQRL V KKSL  W+++
Sbjct: 803  LDKVPNSESSKISHTTKLKGTMGRNKYFTRKPADVQRLKVTKKSLQDWLQR 853


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  462 bits (1188), Expect = e-127
 Identities = 288/787 (36%), Positives = 430/787 (54%), Gaps = 27/787 (3%)
 Frame = +2

Query: 113  VAKNLMKQGNLMDCLSLMREVSEKG--RVCVLGLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            +A  L + G L D   L+  V   G         L L+ +   I   +++G VG +V+ L
Sbjct: 81   LASKLARDGKLHDFSMLLESVVLSGVKPSQFTAALQLDMVSRGISGILKDGKVGGLVEVL 140

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
              + E+G     L D  A+ +LG    +++K     V+E VE ++VL    F  +ELV+P
Sbjct: 141  VKVAELGVRPVELFDGYAMELLGAHCLRLLKFKQ--VQELVELMEVLYGLHFPIRELVDP 198

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              V K C++ +   LA+RY  +  + H+ +  ++ EFG+K  L+SAL  +   +   +  
Sbjct: 199  SEVIKACVEKRRPKLAIRYACIFPHSHMLFCNIMYEFGKKRALASALTAYEASKEKLSGS 258

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM +YR IID CG C D   +  I++++L   V+P+ +V+NS MN  + DL+ T   +  
Sbjct: 259  NMYIYRTIIDVCGVCKDYMKSRYIYEDLLKQKVIPNIYVFNSLMNVNSHDLSYTFHVYKS 318

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ+ GV  D+A YNILLKAC +A   DLA DIY+ ++     G +K+DV TYST+++V  
Sbjct: 319  MQNLGVTADLACYNILLKACSLAGRVDLAQDIYKEVQHLESTGVLKLDVFTYSTVVKVFS 378

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL +K DM+ AG++P+ +TW+  ++ACAN GLV+KA+ LF+EM+L+  +PNSQ
Sbjct: 379  DAKMWHMALNVKEDMQSAGVIPNTVTWSSFISACANAGLVDKAIQLFEEMLLASCEPNSQ 438

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIP----- 1351
            C N LL A V+  QY RAF  F ++K         K YK      + + P   +P     
Sbjct: 439  CFNILLHACVEACQYDRAFRLFHSFKSNKLQETFGKNYKGSAGSSSTTIPLIILPSNFAE 498

Query: 1352 --KFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGC 1525
               FKPT  TYNTLMKAC S  Y A+ +MDEMK  G++P+ +TWS+L D  G++ ++QG 
Sbjct: 499  GLSFKPTTTTYNTLMKACGSDYYHAKALMDEMKTVGLLPNQITWSILADICGSSGNVQGA 558

Query: 1526 MQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRG 1705
            +Q   +M  AG++PDVV YTT IK+CV+++N   A++ F  MKK  + PN +TYNT+LR 
Sbjct: 559  LQILKSMRVAGIQPDVVAYTTAIKICVESENLDLALLLFAEMKKYQIHPNLVTYNTLLRA 618

Query: 1706 WREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQ 1885
               +G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ S         K  +
Sbjct: 619  RSRYGSVSEVQQCLAIYQDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCP-------KQGE 671

Query: 1886 YQRKDENTRQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXXXXXXXXXEKHG 2062
            +   D+    +   +  EK+A       +D   VDL+GL++ E             E + 
Sbjct: 672  FSYGDKADIGRPGSLLLEKVAEHLQQHIADTLAVDLQGLTKVEARIVVLAVLRMIKENYI 731

Query: 2063 PGNPIDSDLIIIIG----YSGNLQKQRLE-SSHITKILNQELGLLVLQNRDYDQPVVAGT 2227
             G+ +  D++I++G      G      LE    ITK+L  ELGL VL       P VA  
Sbjct: 732  LGDSVKDDMLIMVGVHDEVDGGSTAHNLEVKDAITKLLQDELGLKVLST----VPKVALD 787

Query: 2228 SSSDILSCINNRRTKAGNKTGFSVSNLNERMPLK-----------RLPTV-QRLIVPKKS 2371
            +            T     T  S  NL+E+ PL+           R P V +RL V +KS
Sbjct: 788  T------------TIVSQNTIDSDQNLDEK-PLRKELQPELIYSTRRPVVLERLKVSRKS 834

Query: 2372 LYQWVEK 2392
            L QW+ K
Sbjct: 835  LQQWLRK 841


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 811

 Score =  450 bits (1157), Expect = e-123
 Identities = 255/687 (37%), Positives = 403/687 (58%), Gaps = 21/687 (3%)
 Frame = +2

Query: 395  VKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIRE 574
            V+E VE ++VL   Q   +ELV+P  + K+C+  ++  LA+RY  L+ + H+ +  +I E
Sbjct: 142  VEEAVELMEVLARFQISIRELVQPSDIIKRCVLSRNPILAVRYACLLPHAHILFCNIISE 201

Query: 575  FGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPS 754
            FG++ DL SAL  +   +    TPNM +YR  ID CG C D   +  I++++L+  + P+
Sbjct: 202  FGKRRDLVSALKAYEASKKHLNTPNMYIYRATIDTCGLCRDYMKSRYIYEDLLNQKITPN 261

Query: 755  TFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGL 934
             +V+NS MN  + DL+ TL  + +MQ+ G+KPD+ +YNILLKAC VA   DLA DIY  L
Sbjct: 262  IYVFNSLMNVNSHDLSYTLNLYQNMQNLGLKPDMTSYNILLKACCVAGRVDLAQDIYREL 321

Query: 935  KEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACAN 1114
            K     G +K+DV TYST+I+V  D K    AL+IK+DM  AG+  +++ W+ L+ ACA+
Sbjct: 322  KHLESVGQLKLDVFTYSTIIKVFADVKLWQMALKIKQDMLSAGVSLNIVAWSSLINACAH 381

Query: 1115 VGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLK 1294
             GLVE+A+ LF+EM+L+G +PN+QC N +L+A V+ +QY RAF FF +WK K     S +
Sbjct: 382  AGLVEQAIQLFEEMLLAGCEPNTQCFNIILNACVEAYQYDRAFRFFHSWKGKKMLGSSGE 441

Query: 1295 RYK----KGNLPDNFSAPPSCIPK----------FKPTVVTYNTLMKACRSTPYLARTMM 1432
             Y     +G++ D  S P                F PT  TYN L+KAC +  Y A+ ++
Sbjct: 442  GYNSNIGQGHMHDVTSIPNGISNSHILNFAERFPFTPTTTTYNILLKACGTDYYHAKALI 501

Query: 1433 DEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKN 1612
             EM+  G+ P+ ++WS+LID  G +++++G ++    M +AG+KPDV+ YTT IKVCV++
Sbjct: 502  KEMETVGLSPNQISWSILIDICGASSNVEGAIEILKTMGDAGIKPDVIAYTTAIKVCVES 561

Query: 1613 KNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPND 1792
            KNF +A+  +E MK   ++PN +TYNT+L+   ++G   +V   L++Y++MRK+G+ PND
Sbjct: 562  KNFMQALTLYEEMKCYQIRPNWVTYNTLLKARSKYGFLHEVQQCLAIYQDMRKAGYKPND 621

Query: 1793 EFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIAGLACSKSS 1972
             +L+ LIEEW EGVIQ +       + K  ++   +++  ++   +  EKIA     + +
Sbjct: 622  YYLEELIEEWCEGVIQNN-------REKQGEFSSSNKSESERPQSLLLEKIAAHLLKRVA 674

Query: 1973 D-FTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE- 2137
            D   +D++GL++ E             E +G G+ ++ D++IIIG      N  K  LE 
Sbjct: 675  DILAIDVQGLTKVEARLVVLAVLRMIKENYGLGHSVNDDILIIIGATKVDENPSKHILEV 734

Query: 2138 SSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDI--LSCINNRRTKAGNKTGFSVSNLN 2311
               I K+L  ELGL V   +   +  ++ T++ +    S ++       N  GF      
Sbjct: 735  QEAIIKLLRNELGLEVFPAK--TRLALSDTANLEYPNFSNLSIEAQPGENALGFQT---- 788

Query: 2312 ERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
                 +R   + RL V KKSLY+W+ +
Sbjct: 789  -----RRPGVLVRLKVTKKSLYRWLHR 810


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  449 bits (1155), Expect = e-123
 Identities = 273/743 (36%), Positives = 417/743 (56%), Gaps = 24/743 (3%)
 Frame = +2

Query: 236  IGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEG 415
            I   + +G V      L  L+E+GF    + D  AL ++  E  ++++ +   V+E VE 
Sbjct: 100  ISAVLRDGKVRSFARLLGKLDELGFPPVEIFDGWALELIRRECRRILRCEQ--VEELVEL 157

Query: 416  LQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDL 595
             +VL    F  +ELV+P  V K C++ ++  +A+RY   + + H+ +   + EFG+KGDL
Sbjct: 158  FEVLSGYGFSIKELVKPSDVIKICVEKRNPKMAIRYACTLPHAHIIFCDAVYEFGKKGDL 217

Query: 596  SSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSF 775
             SALI     +  +T+ NM LYR IID CG C D   +  I++++L+  V P+ +V+NS 
Sbjct: 218  VSALIAHEASKKNSTSTNMYLYRTIIDVCGRCHDYQKSRYIYEDLLNEKVTPNVYVFNSL 277

Query: 776  MNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDG 955
            MN  A D + TL  + DMQ+ GV+ D+A+YNILLKAC +A   DLA DIY+ ++     G
Sbjct: 278  MNVNAHDFSYTLNVYKDMQNLGVQADMASYNILLKACCLAGRVDLAQDIYKEVQHLESTG 337

Query: 956  GIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKA 1135
             +K+DV TYST+++V+ D+K    AL++K DM  AG+ P+ +TW+ L++ACAN G+V+KA
Sbjct: 338  LLKLDVFTYSTIVKVLADAKLWQMALKVKEDMLSAGVNPNTVTWSSLISACANAGIVDKA 397

Query: 1136 LVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNL 1315
            + LF+EM+L+G +PN+QCCN LL A V+  QY RAF  F+  K       S +   +G+ 
Sbjct: 398  VQLFEEMLLAGCKPNTQCCNILLHACVEACQYDRAFRLFEFLKRNRVQETS-EEDGRGDR 456

Query: 1316 PDNFSAPPSCIPK--------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNG 1453
              N SA  + I +              F PT  TYN LMKAC S  Y A+ +++EM+  G
Sbjct: 457  DSNQSAGVTSISQSSTLCGLNFARELPFTPTTTTYNILMKACGSDYYHAKALIEEMEAVG 516

Query: 1454 IVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAI 1633
            + P+ +TWS+LID  G+  +++G +Q    M   G++PDVV YTT+IKVCV++K+  +A 
Sbjct: 517  LSPNQITWSILIDICGDLGNVEGALQILKTMRATGIEPDVVAYTTVIKVCVESKDLKQAF 576

Query: 1634 MTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLI 1813
              F  MK+  +QPN +TYNT+LR    +G   +V   L++Y++MR++G+  ND +LK LI
Sbjct: 577  ELFAEMKRYQIQPNLVTYNTLLRARNRYGSLQEVKQCLAVYQDMRRAGYNSNDYYLKQLI 636

Query: 1814 EEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSD-----IIFFEKIA-GLACSKSSD 1975
            EEW EGVIQ            GN   R++ ++  K+D      +  EK+A  L    +  
Sbjct: 637  EEWCEGVIQ------------GNNQNREESSSFNKTDKKRPQSLLLEKVAEHLEKHIAET 684

Query: 1976 FTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SS 2143
             TVD++GL + E             E +  G  +  D++IIIG         +Q LE   
Sbjct: 685  LTVDVQGLKKVEARIVVLAVLRMVKENYTMGYLVKDDMLIIIGACKVDAVPDEQELEVKD 744

Query: 2144 HITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMP 2323
             ITK+L  ELGL VL      +P               NR+  + +  G S  +   +  
Sbjct: 745  AITKLLKDELGLEVLSTGLKIEP---------------NRQVDS-DSLGSSDFSGEMKYS 788

Query: 2324 LKRLPTVQRLIVPKKSLYQWVEK 2392
             +R   +QRL V K+SL  W+++
Sbjct: 789  TRRPVVIQRLKVTKESLQHWLQR 811


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  447 bits (1149), Expect = e-122
 Identities = 293/829 (35%), Positives = 444/829 (53%), Gaps = 40/829 (4%)
 Frame = +2

Query: 26   LSTSTNNESSIKATTSPPKVPLASEKRLWVAKNL------MKQGNLMDCLSLMREVSEKG 187
            L TST+   S   ++   + PL S+ R W   N       + Q    D  S M E     
Sbjct: 36   LLTSTSARLSPPISSLRSRHPLLSDVR-WDLNNYSDLATKLVQDGRFDDFSTMAETLILS 94

Query: 188  RVCVLGLLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELS 367
             V +  L++L    G  G+ +  G V  VV+ LR ++++G     L D   L +L  E  
Sbjct: 95   GVELSQLVELVSA-GISGL-LREGRVYCVVEVLRKVDKLGICPLELFDGSTLELLSKECR 152

Query: 368  KMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPH 547
            +++      V+E VE +++L+   F  ++L+EP+   K C+  ++ +LA+RY  ++ +  
Sbjct: 153  RILNCGQ--VEEVVELIEILDGFHFPVKKLLEPLDFIKICVNKRNPNLAVRYACILPHAQ 210

Query: 548  VWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKE 727
            + +  +I EFG+K DL SAL  F   +     PNM  YR +ID CG C     +  I++E
Sbjct: 211  ILFCTIIHEFGKKRDLGSALTAFEASKQKLIGPNMYCYRTMIDVCGLCSHYQKSRYIYEE 270

Query: 728  MLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNAD 907
            +L+  + P+ +V+NS MN    DL+ T   + +MQ+ GV  D+A+YNILLKAC VA   D
Sbjct: 271  LLAQKITPNIYVFNSLMNVNVHDLSYTFNVYKNMQNLGVTADMASYNILLKACCVAGRVD 330

Query: 908  LAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITW 1087
            LA +IY  ++    +G +K+DV TYST+I+V  D+K    AL+IK DM  AG++P+ +TW
Sbjct: 331  LAQEIYREVQNLESNGMLKLDVFTYSTIIKVFADAKLWQMALKIKEDMLSAGVIPNTVTW 390

Query: 1088 NMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKE 1267
            + L+++CAN G+ E+A+ LF EM+L+G +PNSQC N LL A V+  QY RAF  F++WK+
Sbjct: 391  SALISSCANAGITEQAIQLFKEMLLAGCEPNSQCYNILLHACVEACQYDRAFRLFQSWKD 450

Query: 1268 KGFYAGSLKRYKKGNL-------PDNFSAPPSCIPK-----------FKPTVVTYNTLMK 1393
              F   S      GN         +  ++ P+C+             F PT  TYN LMK
Sbjct: 451  SRFQEIS-GGTGNGNTVGVELKHQNCITSMPNCLSNSHHLSFSKSFPFTPTTTTYNILMK 509

Query: 1394 ACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDV 1573
            AC +  Y A+ +MDEMK  G+ P++++WS+LID  G T ++ G ++    M EAG+KPDV
Sbjct: 510  ACGTDYYRAKALMDEMKTAGLSPNHISWSILIDICGGTGNIVGAVRILKTMREAGIKPDV 569

Query: 1574 VTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSL 1753
            V YTT IK CV++KN   A   F  MK+  +QPN +TYNT+LR    +G   +V   L++
Sbjct: 570  VAYTTAIKYCVESKNLKIAFSLFAEMKRYQIQPNLVTYNTLLRARSRYGSLHEVQQCLAI 629

Query: 1754 YEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSDIIF 1933
            Y+ MRK+G+  ND +LK LIEEW EGVIQ + + +QS   K +   R D    Q    + 
Sbjct: 630  YQHMRKAGYKSNDYYLKELIEEWCEGVIQDN-NLNQS---KFSSVNRADWGRPQS---LL 682

Query: 1934 FEKIAG-LACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGY- 2107
             EK+A  L  S +    +DL+GL++ E             E +  G+PI  D++II+G  
Sbjct: 683  LEKVAAHLQKSVAESLAIDLQGLTQVEARIVVLAVLRMIKENYILGHPIKDDILIILGIK 742

Query: 2108 ---SGNLQKQRLESSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAG 2278
               +  ++ +      I K+L  ELGL V     +  P +A            ++R   G
Sbjct: 743  KVDANLVEHESPVKGAIIKLLQDELGLEVA----FAGPKIA-----------LDKRINLG 787

Query: 2279 NKTGFSVSNLNERMPLKRLPT-----------VQRLIVPKKSLYQWVEK 2392
               G S  +  E +   RLPT           +QR  V +KSL  W+++
Sbjct: 788  GPPG-SDPDWQEALGRNRLPTELESSTRRPAVLQRFKVTRKSLDHWLQR 835


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  446 bits (1147), Expect = e-122
 Identities = 270/791 (34%), Positives = 427/791 (53%), Gaps = 32/791 (4%)
 Frame = +2

Query: 116  AKNLMKQGNLMDCLSLMREVSEKGRVCVL---GLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            A  L + G + D   +   ++ +    V     ++D + L   I  ++  G +  VV  L
Sbjct: 88   ASKLAEDGRIEDVALIAETLAAESGANVARFASMVDFDLLSKGISSNLRQGKIESVVYTL 147

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+DE ++ ++  +   M   +S+ V++ ++ +++L   +F  +ELV+P
Sbjct: 148  KRIEKVGIAPLDLVDESSVKLMRKQFRAMA--NSVQVEKAIDLMEILAGLRFKIKELVDP 205

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              + K C+ + +  LA+RY  L+ +  +    +I  FG+KGD+ S +  +   +    TP
Sbjct: 206  FDIVKSCVDISNPELAIRYACLLPHTEILLCRIILGFGKKGDMVSVMTAYEACKQILDTP 265

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM + R +ID CG CGD   +  I++++L  NV P+ +V NS MN  + DL  TL+ + +
Sbjct: 266  NMYICRTMIDVCGLCGDYVKSRYIYEDLLKENVKPNIYVMNSLMNVNSHDLGYTLKVYKN 325

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+V  
Sbjct: 326  MQKLDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIKVFA 385

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +PNSQ
Sbjct: 386  DAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEPNSQ 445

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK--------------EKG--FYAGSLKRYKKGNLP 1318
            C N LL A V+  QY RAF  F++WK               KG  F    LK    G+L 
Sbjct: 446  CFNILLHACVEACQYDRAFRLFQSWKGSSVKEALYADKIVSKGRTFSPNKLKTNDPGSLV 505

Query: 1319 DNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSML 1486
            +N S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TWS L
Sbjct: 506  NNNSTSPYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLTPNQITWSTL 565

Query: 1487 IDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNV 1666
            ID  G + D++G ++    M  AG +PDVV YTT IK+C +NK+   A   FE M++  +
Sbjct: 566  IDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQI 625

Query: 1667 QPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKS 1846
            +PN +TYNT+L+   ++G  ++V   L++Y++MRK+G+ PND FLK LIEEW EGVIQ++
Sbjct: 626  KPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQEN 685

Query: 1847 YSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIA-GLACSKSSDFTVDLRGLSETETXXX 2023
              +   +       Q  D   R  S  +  EK+A  L    + +  +DL+GL++ E    
Sbjct: 686  GQSQNKI-----SDQEGDHAGRPVS--LLIEKVATHLQERTAGNLAIDLQGLTKVEARLV 738

Query: 2024 XXXXXXXXXEKHGPGNPIDSDLIIIIGYS-GNLQKQRLE---SSHITKILNQELGLLVL- 2188
                     E +  G+ +  D++II+G S  N    + +      + K+L +EL L+VL 
Sbjct: 739  VLAVLRMIKEDYMRGDVVIDDVLIILGTSEANTDSGKQDIAVKEALVKLLQEELSLVVLP 798

Query: 2189 ---QNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIV 2359
               +N   D   V   +        +   T    K+  S+S+       +R   ++RL+V
Sbjct: 799  AGQRNIKQDAHCVDDANQ-------DTEHTLENTKSFISISS------TRRPAILERLMV 845

Query: 2360 PKKSLYQWVEK 2392
             K SLYQW+++
Sbjct: 846  TKASLYQWLQR 856


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  445 bits (1145), Expect = e-122
 Identities = 287/826 (34%), Positives = 442/826 (53%), Gaps = 32/826 (3%)
 Frame = +2

Query: 11   ARTTKLSTSTNNESSIKATTSPPKVPLASEKRLW--VAKNLMKQGNLMDCLSLMREV--S 178
            A ++ LS+     SS  A  S  +  L+S    +  +A  L K G L +   ++  V  S
Sbjct: 41   ASSSSLSSIPTVHSSQTALLSTVRRDLSSRNDYYADMASKLAKDGRLEEFAMIVESVVVS 100

Query: 179  EKGRVCVLGLLDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGL 358
            E        +L LE +   I   I  G +  VV  L+ LNE+G     L       +L  
Sbjct: 101  EGNVSKFASMLSLEMVASGIVKSIGEGRIDCVVGVLKKLNELGVAPLELFHGSGFKLLKN 160

Query: 359  ELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIR 538
            E  +++  DS  V+  V  ++VLE  +   +EL E   + + C+   DV+LA+RY  ++ 
Sbjct: 161  ECQRLL--DSGEVEMFVGLMEVLEEFRLPVKELDEEFRIVQLCVNKPDVNLAIRYACIVP 218

Query: 539  NPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETI 718
               + +   +REFG+K DL SAL  +   +   ++PNM + R IID CG CGD   +  I
Sbjct: 219  RADILFCNFVREFGKKRDLVSALRAYEASKKHLSSPNMYICRTIIDVCGLCGDYMKSRAI 278

Query: 719  FKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVAR 898
            ++++ S NV  + +V+NS MN  A DL  TL+ + +MQ  GV  D+A+YNILLKAC +A 
Sbjct: 279  YEDLRSQNVTLNIYVFNSLMNVNAHDLKFTLEVYKNMQKLGVMADMASYNILLKACCLAG 338

Query: 899  NADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDM 1078
            N  LA +IY  +K     G +K+DV TYST+++V  D+K    AL++K DM  AG+ P+ 
Sbjct: 339  NTVLAQEIYGEVKHLEAKGVLKLDVFTYSTIVKVFADAKWWQMALKVKEDMLSAGVTPNT 398

Query: 1079 ITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKN 1258
            ITW+ L+ ACAN GLVE+A+ LF+EM  +G +PNSQCCN LL A V+  Q+ RAF  F++
Sbjct: 399  ITWSSLINACANAGLVEQAMHLFEEMRQAGCEPNSQCCNILLQACVEACQFDRAFRLFRS 458

Query: 1259 WKEKGF-------YAGSLKRYKKGNLPD--NFSAPPSCIP-----------KFKPTVVTY 1378
            W            Y G+  R       D  + +  P+ +P            FKPT  TY
Sbjct: 459  WTLSKTQVALGEDYDGNTDRISNMEHKDKQSITNTPNFVPNSHYSSFDKRFSFKPTTTTY 518

Query: 1379 NTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAG 1558
            N LMKAC +  Y  + +MDEM+  G+ P++++W++LIDA G + +++G +Q    M E G
Sbjct: 519  NILMKACCTDYYRVKALMDEMRTVGLSPNHISWTILIDACGGSGNVEGALQILKIMREDG 578

Query: 1559 VKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVY 1738
            + PDVV YTT IKVCV++K    A   FE MK   +QPN +TY T+LR    +G   +V 
Sbjct: 579  MSPDVVAYTTAIKVCVRSKRLKLAFSLFEEMKHYQIQPNLVTYITLLRARSRYGSLHEVQ 638

Query: 1739 ALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQK 1918
              L++Y++M K+G+  ND +LK +IEEW EGVIQ        + L      R+  + R +
Sbjct: 639  QCLAVYQDMWKAGYKANDTYLKEVIEEWCEGVIQDKNQNQGEVTL-----CRRTNSQRPQ 693

Query: 1919 SDIIFFEKIA-GLACSKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLII 2095
            S  +  EK+A  L  S + +  +DL+GL++ E             E +  G P+  DL+I
Sbjct: 694  S--LLLEKVAVHLQKSAAENLAIDLQGLTKVEARIVVLAVLQMMKENYSLGVPVKDDLMI 751

Query: 2096 IIGYSGNLQKQRLESSH-------ITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCI 2254
            ++G +   +  ++++ H       ITK+L  +LGL V      D P +    ++ +   +
Sbjct: 752  VLGPN---KVNKIQAKHDLEVKDAITKLLQDDLGLKVF----LDGPSIQ-HKNAHMQKLL 803

Query: 2255 NNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
            ++    A         ++  +   +R   +QRL VPKKSL+ W+++
Sbjct: 804  DSESNMA------KTLHIELKSSTRRPKILQRLKVPKKSLHHWLQR 843


>gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  441 bits (1134), Expect = e-121
 Identities = 257/657 (39%), Positives = 374/657 (56%), Gaps = 3/657 (0%)
 Frame = +2

Query: 227  KGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKEC 406
            KG  G+ ++ G V  VV+ L  +NE+G     L D  A+ +LG + S+++K     V+E 
Sbjct: 121  KGISGL-LKEGKVRSVVEVLGKVNELGVPPLKLFDGYAMELLGRQCSRLLKCKQ--VQEL 177

Query: 407  VEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRK 586
            VE ++ L   +F  +EL+EP  V K C+      LA+RY  +  + H+ +  +I EFG++
Sbjct: 178  VELMEALAGYRFPIKELLEPSEVIKLCVDKCCPKLAIRYACIFPHAHILFCNIIYEFGKR 237

Query: 587  GDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVY 766
              L  AL  +   +      NM +YR IID CG C D   +  I++++L   V P+ +V+
Sbjct: 238  KALEPALAAYEASKENLNGSNMYVYRTIIDVCGLCKDYMKSRYIYEDLLKQKVTPNIYVF 297

Query: 767  NSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKS 946
            NS MN  A DLN T   +  MQ+ GV+ D+A YNILLKAC +A   DLA DIY  ++   
Sbjct: 298  NSLMNVNAHDLNYTFHVYKSMQNLGVRADMACYNILLKACCLAGRVDLAQDIYSEVQHLE 357

Query: 947  IDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLV 1126
              G +K+DV TYST+++V  D+K    AL +K DM  AG+ P+ +TW+ L++ACAN G+V
Sbjct: 358  STGVLKLDVFTYSTIVKVFADAKLWHMALNVKEDMLSAGVTPNTVTWSSLISACANAGIV 417

Query: 1127 EKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKK 1306
            EKA+ LF+EM+L+G +PNSQC N LL A V+  QY RAF  F+          SLKR   
Sbjct: 418  EKAIQLFEEMLLAGSEPNSQCFNILLHACVEANQYDRAFRLFQ----------SLKRL-- 465

Query: 1307 GNLPDNFSAPPSCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSML 1486
                            FKPT  TYNTLMKAC +  Y A+ ++DEM+  G+ P+ ++WS+L
Sbjct: 466  ---------------SFKPTTTTYNTLMKACGTDYYHAKALLDEMRAVGLYPNQISWSIL 510

Query: 1487 IDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNV 1666
             D  G + +++G +Q   NM  AG+KPDVV YTT IKVCV+N+N   A+  F  MKK  +
Sbjct: 511  ADICGGSGNVEGALQILKNMRAAGMKPDVVAYTTAIKVCVENENLELALSLFGEMKKYQI 570

Query: 1667 QPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKS 1846
             PN +TYNT+LR    +G   +V   L++Y++MRK+G+  ND +L+ LIEEW EGVIQ S
Sbjct: 571  HPNLVTYNTLLRARSRYGSVSEVQQCLAIYQDMRKAGYKSNDYYLEQLIEEWCEGVIQDS 630

Query: 1847 YSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIA-GLACSKSSDFTVDLRGLSETETXXX 2023
                     K  ++   ++    +   +  EK+A  L    +    VDL+GL++ E    
Sbjct: 631  -------NAKQEEFSSCNKTDIGRPGSLLLEKVAEHLQTHIAETLAVDLQGLTKVEARIV 683

Query: 2024 XXXXXXXXXEKHGPGNPIDSDLIIIIG-YSGNLQKQRLE-SSHITKILNQELGLLVL 2188
                     E +  G+ +  D++I++G   G    Q LE    ITK+L  ELGL VL
Sbjct: 684  VLAVLRMIKENYTLGHSVKDDMLIVVGEVDGGSTTQNLEVKDAITKLLQDELGLKVL 740


>gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  439 bits (1128), Expect = e-120
 Identities = 261/751 (34%), Positives = 419/751 (55%), Gaps = 23/751 (3%)
 Frame = +2

Query: 209  LDLEDLKGCIGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELSKMVKKDS 388
            +D E L   + + I+   V  VV  L  + +      S L+  ++  +  E  ++V    
Sbjct: 76   VDAEVLAKMVLLGIQGNSVRSVVHTLNRVQDHSVSLASHLNGSSIDAIAKECCRLVMCGH 135

Query: 389  LGVKECVEGLQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLI 568
              ++E VE ++VL   +   +  V+P  V K+C+  ++  LA+RY  L+ +  + +  +I
Sbjct: 136  --IEEAVELMEVLTRFKISIRGFVQPSDVIKRCVLSRNPILAVRYACLLPHAQILFCSII 193

Query: 569  REFGRKGDLSSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVL 748
             EFG++ DL SA   +   +     PNM +YR IIDACG C D   +  I++++L+  + 
Sbjct: 194  SEFGKRRDLISAFKAYELSKKHMNIPNMYMYRAIIDACGLCRDYMKSRYIYEDLLNQKIT 253

Query: 749  PSTFVYNSFMNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYE 928
            P+ +V+NS MN  A DL+ TL  + +MQ+ G+KPD+ +YNILLK C VA   DLA DIY 
Sbjct: 254  PNIYVFNSLMNVNAHDLSYTLNLYQNMQNLGLKPDMTSYNILLKGCCVAGRVDLAQDIYR 313

Query: 929  GLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTAC 1108
             LK     G +K+DV TYST+I+V  D++    AL IK+DM  AG+  +++ W+ L+ AC
Sbjct: 314  ELKHLESVGQLKLDVFTYSTIIKVFADARLWQMALTIKQDMLSAGVSLNIVAWSSLINAC 373

Query: 1109 ANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEK---GFY 1279
            A+ GLVE+A+ LF+EM+L+G +PN+QC N +L+A V+  QY RAF FF +WK K   G +
Sbjct: 374  AHAGLVEQAIQLFEEMLLAGREPNTQCFNIILNACVEACQYDRAFRFFHSWKGKKMLGSF 433

Query: 1280 AGSLKRYKKGNLPDNFSAPPSCIPK-----------FKPTVVTYNTLMKACRSTPYLART 1426
                    +  L  N +  P+ I             F PT  TYN L+KAC +  Y A+ 
Sbjct: 434  GEGCNNNTRQELVHNVTTVPNGISNSHILSFAERFPFTPTTTTYNILLKACGTDYYHAKA 493

Query: 1427 MMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCV 1606
            ++ EM+  G+ P+ ++WS LID  G +A+++G ++   NM +AG+KPDV+ YTT IKVCV
Sbjct: 494  LIKEMETVGLSPNQISWSTLIDICGASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCV 553

Query: 1607 KNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPP 1786
            ++KNF +A+  ++ MK  +++PN ITYNT+L+   ++G   +V   L++Y++MRK+G+ P
Sbjct: 554  ESKNFMQALALYKEMKSYHIRPNLITYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKP 613

Query: 1787 NDEFLKGLIEEWAEGVIQKSYSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIAGLACSK 1966
            ND +L+ LIEEW EGVIQ       + +++G ++   +++  +KS  +  EKIA     +
Sbjct: 614  NDCYLEELIEEWCEGVIQ------DNREIQG-EFSSSNKSELEKSQSLLLEKIAAHLLKR 666

Query: 1967 SSD-FTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRL 2134
             +D   +D++GL++ E             E +  G+ I+ D++I+IG      N  K+ L
Sbjct: 667  VADILAIDVQGLTKVEARLVVLAVLRMIKENYSLGHSINDDILIVIGATKVDENPAKRIL 726

Query: 2135 E-SSHITKILNQELGLLVLQNRD----YDQPVVAGTSSSDILSCINNRRTKAGNKTGFSV 2299
            E    I K+L  ELGL     R      D P +   + +++               GF  
Sbjct: 727  EVQEAILKLLRNELGLEAFPARTRLALSDTPKLKNPTLANLKIEAVPAEDALPTSMGFQT 786

Query: 2300 SNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
                     +R   + RL + +KSLY W+ +
Sbjct: 787  ---------RRPGILVRLKITRKSLYSWLHR 808


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  438 bits (1127), Expect = e-120
 Identities = 266/792 (33%), Positives = 427/792 (53%), Gaps = 33/792 (4%)
 Frame = +2

Query: 116  AKNLMKQGNLMDCLSLMREVSEKGRVCVL---GLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            A  L + G + D   +   ++ +    V     ++D + L   I +++  G +  VV  L
Sbjct: 86   ASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSDLLSKGISLNLRQGKIESVVYTL 145

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+DE ++ ++      M   +S+ V++ ++ +++L   +F  +ELV+P
Sbjct: 146  QRIEKVGIAPLDLVDESSVKLMRKHFRAMA--NSVQVEKAIDLMEILAGFRFKIKELVDP 203

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              V K C+ + +  LA+RY  L+ +  +    +I  FG+KGD+ S L  +   +     P
Sbjct: 204  FDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVLTAYEACKQILDNP 263

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM +YR +ID CG CGD   +  I++++L  N+ P+ +V NS MN  + DL  TL+ + +
Sbjct: 264  NMYIYRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVMNSLMNVNSHDLGYTLKVYKN 323

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+V  
Sbjct: 324  MQKLDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIKVFA 383

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +PNSQ
Sbjct: 384  DAKMWKMALKVKEDMQSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEPNSQ 443

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK----EKGFYA------------GSLKRYKKGNLP 1318
            C N LL A V+  Q+ RAF  F++WK    ++  YA              LK +  G+L 
Sbjct: 444  CFNILLHACVEACQFDRAFRLFQSWKGSSDKEALYADDITGKGSIFSPNKLKNHGNGSLV 503

Query: 1319 DNFSAP---PSCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLI 1489
            +  S+P    S    FKPT  TYN L+KAC +  Y  + +MDEM+  G+ P+ +TWS LI
Sbjct: 504  NTNSSPYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLI 563

Query: 1490 DAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNVQ 1669
            D  G + D++G +     M  AG +PDVV YTT IK+C +NK+   A   FE M++  ++
Sbjct: 564  DICGGSGDVEGAVGILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQIK 623

Query: 1670 PNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSY 1849
            PN +TYNT+L+   ++G  ++V   L++Y++MRK+G+ PND FLK LIEEW EGVIQ++ 
Sbjct: 624  PNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQEN- 682

Query: 1850 STSQSMKLKGNQYQRKDENTRQKSDI-IFFEKIA-GLACSKSSDFTVDLRGLSETETXXX 2023
              SQS     +Q     E T     + +  EK+A  L    + +  +DL+GL++ E    
Sbjct: 683  --SQSQIKTSDQ-----EGTNLGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLV 735

Query: 2024 XXXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLVLQ 2191
                     E +  G+ +  DL+II+G    N+   + E      + ++L  EL L+VL 
Sbjct: 736  VLAVLRMIKEDYIRGDVVTDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVVLP 795

Query: 2192 NRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNER-----MPLKRLPTVQRLI 2356
                            +L    + R       G  +++ N +        +R   ++RL+
Sbjct: 796  -----------AGHRHVLDITLDARCVDDADQGIELTSENTKSIVGISSTRRPAILERLM 844

Query: 2357 VPKKSLYQWVEK 2392
            V K SL+QW+++
Sbjct: 845  VTKASLHQWLQR 856


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  432 bits (1111), Expect = e-118
 Identities = 266/797 (33%), Positives = 427/797 (53%), Gaps = 38/797 (4%)
 Frame = +2

Query: 116  AKNLMKQGNLMDCLSLMREVSEKGRVCVL---GLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            A  L + G + D   +   ++ +    V     ++D + L   I +++  G +  VV  L
Sbjct: 86   ASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSDLLSKGISLNLRQGKIESVVYTL 145

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+DE ++ ++      M   +S+ V++ ++ +++L   +F  +ELV+P
Sbjct: 146  QRIEKVGIAPLDLVDESSVKLMRKHFRAMA--NSVQVEKAIDLMEILAGFRFKIKELVDP 203

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              V K C+ + +  LA+RY  L+ +  +    +I  FG+KGD+ S L  +   +     P
Sbjct: 204  FDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVLTAYEACKQILDNP 263

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM +YR +ID CG CGD   +  I++++L  N+ P+ +V NS MN  + DL  TL+ + +
Sbjct: 264  NMYIYRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVMNSLMNVNSHDLGYTLKVYKN 323

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+V  
Sbjct: 324  MQKLDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIKVFA 383

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +PNSQ
Sbjct: 384  DAKMWKMALKVKEDMQSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEPNSQ 443

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK----EKGFYA------------GSLKRYKKGNLP 1318
            C N LL A V+  Q+ RAF  F++WK    ++  YA              LK +  G+L 
Sbjct: 444  CFNILLHACVEACQFDRAFRLFQSWKGSSDKEALYADDITGKGSIFSPNKLKNHGNGSLV 503

Query: 1319 DNFSAP---PSCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLI 1489
            +  S+P    S    FKPT  TYN L+KAC +  Y  + +MDEM+  G+ P+ +TWS LI
Sbjct: 504  NTNSSPYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLI 563

Query: 1490 DAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIK-----VCVKNKNFTKAIMTFEAMK 1654
            D  G + D++G +     M  AG +PDVV YTT IK     +C +NK+   A   FE M+
Sbjct: 564  DICGGSGDVEGAVGILRTMHSAGTRPDVVAYTTAIKHAIFQICAENKSLKLAFSLFEEMR 623

Query: 1655 KKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGV 1834
            +  ++PN +TYNT+L+   ++G  ++V   L++Y++MRK+G+ PND FLK LIEEW EGV
Sbjct: 624  RYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGV 683

Query: 1835 IQKSYSTSQSMKLKGNQYQRKDENTRQKSDI-IFFEKIA-GLACSKSSDFTVDLRGLSET 2008
            IQ++   SQS     +Q     E T     + +  EK+A  L    + +  +DL+GL++ 
Sbjct: 684  IQEN---SQSQIKTSDQ-----EGTNLGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKV 735

Query: 2009 ETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELG 2176
            E             E +  G+ +  DL+II+G    N+   + E      + ++L  EL 
Sbjct: 736  EARLVVLAVLRMIKEDYIRGDVVTDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELS 795

Query: 2177 LLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNER-----MPLKRLPT 2341
            L+VL                 +L    + R       G  +++ N +        +R   
Sbjct: 796  LVVLP-----------AGHRHVLDITLDARCVDDADQGIELTSENTKSIVGISSTRRPAI 844

Query: 2342 VQRLIVPKKSLYQWVEK 2392
            ++RL+V K SL+QW+++
Sbjct: 845  LERLMVTKASLHQWLQR 861


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  431 bits (1107), Expect = e-117
 Identities = 259/786 (32%), Positives = 421/786 (53%), Gaps = 27/786 (3%)
 Frame = +2

Query: 116  AKNLMKQGNLMDCLSLMREVSEKGRVCVL---GLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            A  L + G + D   +   ++ +    V     ++D + L   I  ++  G +  VV  L
Sbjct: 88   ASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYDLLSKGISSNLRQGKIESVVYTL 147

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+D+ ++ ++  +   M   +S+ V++ ++ +++L    F  +ELV+P
Sbjct: 148  KRIEKVGIAPLDLVDDSSVKLMRKQFRAMA--NSVQVEKAIDLMEILAGLGFKIKELVDP 205

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              V K C+++ +  LA+RY  L+ +  +    +I  FG+KGD+ S +  +   +    TP
Sbjct: 206  FDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVMTAYEACKQILDTP 265

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM + R +ID CG CGD   +  I++++L  N+ P+ +V NS MN  + DL  TL+ + +
Sbjct: 266  NMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMNVNSHDLGYTLKVYKN 325

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+V  
Sbjct: 326  MQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIKVFA 385

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +PNSQ
Sbjct: 386  DAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEPNSQ 445

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK----EKGFYAGS------------LKRYKKGNLP 1318
            C N LL A V+  QY RAF  F++WK     +  YA              LK    G+L 
Sbjct: 446  CFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDIVSKGRTSSPNILKNNGPGSLV 505

Query: 1319 DNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSML 1486
            +  S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TWS L
Sbjct: 506  NRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTL 565

Query: 1487 IDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNV 1666
            ID  G + D++G ++    M  AG +PDVV YTT IK+C +NK    A   FE M++  +
Sbjct: 566  IDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQI 625

Query: 1667 QPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKS 1846
            +PN +TYNT+L+   ++G  ++V   L++Y++MR +G+ PND FLK LIEEW EGVIQ++
Sbjct: 626  KPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQEN 685

Query: 1847 YSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETXXXX 2026
              +   +        ++ +N  +   ++  +    +    + +  +DL+GL++ E     
Sbjct: 686  GQSQDKIS------DQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEARLVV 739

Query: 2027 XXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLVLQN 2194
                    E +  G+ +  D++IIIG    N    + E      + K+L  EL L+VL  
Sbjct: 740  LAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVVL-- 797

Query: 2195 RDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSL 2374
                 P        D   C+++   +   K+  S+S+       +R   ++RL+V K SL
Sbjct: 798  -----PAGQRNIIQD-AHCVDD-ADQENTKSFVSISS------TRRPAILERLMVTKASL 844

Query: 2375 YQWVEK 2392
            YQW+++
Sbjct: 845  YQWLQR 850


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  430 bits (1106), Expect = e-117
 Identities = 259/786 (32%), Positives = 421/786 (53%), Gaps = 27/786 (3%)
 Frame = +2

Query: 116  AKNLMKQGNLMDCLSLMREVSEKGRVCVL---GLLDLEDLKGCIGVHIENGDVGLVVDGL 286
            A  L + G + D   +   ++ +    V     ++D + L   I  ++  G +  VV  L
Sbjct: 88   ASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYDLLSKGISSNLRQGKIESVVYTL 147

Query: 287  RILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEGLQVLEYCQFHTQELVEP 466
            + + ++G     L+D+ ++ ++  +   M   +S+ V++ ++ +++L    F  +ELV+P
Sbjct: 148  KRIEKVGIAPLDLVDDSSVKLMRKQFRAMA--NSVQVEKAIDLMEILAGLGFKIKELVDP 205

Query: 467  MLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDLSSALIVFRKYQSCNTTP 646
              V K C+++ +  LA+RY  L+ +  +    +I  FG+KGD+ S +  +   +    TP
Sbjct: 206  FDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVMTAYEACKQILDTP 265

Query: 647  NMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSFMNAVAADLNRTLQSFHD 826
            NM + R +ID CG CGD   +  I++++L  N+ P+ +V NS MN  + DL  TL+ + +
Sbjct: 266  NMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMNVNSHDLGYTLKVYKN 325

Query: 827  MQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVG 1006
            MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+V  
Sbjct: 326  MQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIKVFA 385

Query: 1007 DSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQ 1186
            D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +PNSQ
Sbjct: 386  DAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEPNSQ 445

Query: 1187 CCNALLSAFVKDFQYARAFTFFKNWK----EKGFYAGS------------LKRYKKGNLP 1318
            C N LL A V+  QY RAF  F++WK     +  YA              LK    G+L 
Sbjct: 446  CFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDIVSKGRTSSPNILKNNGPGSLV 505

Query: 1319 DNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSML 1486
            +  S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TWS L
Sbjct: 506  NRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTL 565

Query: 1487 IDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKNFTKAIMTFEAMKKKNV 1666
            ID  G + D++G ++    M  AG +PDVV YTT IK+C +NK    A   FE M++  +
Sbjct: 566  IDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQI 625

Query: 1667 QPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKS 1846
            +PN +TYNT+L+   ++G  ++V   L++Y++MR +G+ PND FLK LIEEW EGVIQ++
Sbjct: 626  KPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQEN 685

Query: 1847 YSTSQSMKLKGNQYQRKDENTRQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETXXXX 2026
              +   +        ++ +N  +   ++  +    +    + +  +DL+GL++ E     
Sbjct: 686  GRSQDKIS------DQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEARLVV 739

Query: 2027 XXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLVLQN 2194
                    E +  G+ +  D++IIIG    N    + E      + K+L  EL L+VL  
Sbjct: 740  LAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVVL-- 797

Query: 2195 RDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSL 2374
                 P        D   C+++   +   K+  S+S+       +R   ++RL+V K SL
Sbjct: 798  -----PAGQRNIIQD-AHCVDD-ADQENTKSFVSISS------TRRPAILERLMVTKASL 844

Query: 2375 YQWVEK 2392
            YQW+++
Sbjct: 845  YQWLQR 850


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  430 bits (1105), Expect = e-117
 Identities = 267/751 (35%), Positives = 409/751 (54%), Gaps = 32/751 (4%)
 Frame = +2

Query: 236  IGVHIENGDVGLVVDGLRILNEMGFHGPSLLDERALAMLGLELSKMVKKDSLGVKECVEG 415
            I  ++   +V  VVD L   +++G     L D  ++ +L  E  ++V    L  ++ +  
Sbjct: 53   ISKNLRERNVDSVVDALNTADQLGLPPSQLFDAASMDLLKTECLRIVNFGRL--EDIILL 110

Query: 416  LQVLEYCQFHTQELVEPMLVFKQCIKLKDVSLALRYVSLIRNPHVWWNFLIREFGRKGDL 595
            ++ L    F  +ELVEP  V K C+  ++  LA+RY  L  +  +    ++++FG+KGDL
Sbjct: 111  METLAGYSFSIKELVEPSRVIKLCVHQRNPHLAVRYARLFPHEGILMCSIVKQFGKKGDL 170

Query: 596  SSALIVFRKYQSCNTTPNMCLYRDIIDACGSCGDSTMAETIFKEMLSHNVLPSTFVYNSF 775
             SAL  +  Y   +T P+M LYR +ID CG CGD   +  IF++++S  V+P+ FV+NS 
Sbjct: 171  DSALAAYEAYMQHSTVPDMYLYRALIDVCGLCGDYMQSRYIFEDIVSQKVIPNIFVFNSL 230

Query: 776  MNAVAADLNRTLQSFHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDG 955
            MN  A DL  TL  +  MQ+ GV  D+ +YNILLK+C +A   DLA DIY   K+  + G
Sbjct: 231  MNVNAHDLGYTLHVYKKMQNLGVTADMTSYNILLKSCSLAGKVDLAQDIYREAKQLELAG 290

Query: 956  GIKMDVITYSTLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKA 1135
             +K+D  TY T+I++  D+K    AL+IK DM  +G+ P+  TW+ L++A AN GLV++A
Sbjct: 291  LLKLDDFTYCTIIKIFADAKLWQLALKIKEDMLSSGVTPNTFTWSSLISASANAGLVDQA 350

Query: 1136 LVLFDEMVLSGYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNL 1315
            + LF+EM+L+G  PNS CCN LL A V+  QY RAF  F  WK           Y   N 
Sbjct: 351  IKLFEEMLLAGCVPNSHCCNILLHACVEACQYDRAFRLFNAWKGSEIQNTFTTDY---NC 407

Query: 1316 P--DNFSAPPSC------IPK---------------FKPTVVTYNTLMKACRSTPYLART 1426
            P  D  SA  +C      +P                F P+  TYNTLMKAC S    A+ 
Sbjct: 408  PVDDISSAMHACEDYIITVPNLASNSLHLSFLKKFPFTPSSATYNTLMKACGSDYNRAKA 467

Query: 1427 MMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCV 1606
            +MDEM+  G+ P++++WS+LID  G++ +++G +Q   NM  AG++PDV+ YTT IKV V
Sbjct: 468  LMDEMQAVGLSPNHISWSILIDICGSSGNMEGAIQILKNMRMAGIEPDVIAYTTAIKVSV 527

Query: 1607 KNKNFTKAIMTFEAMKKKNVQPNAITYNTILRGWREHGDHMQVYALLSLYEEMRKSGFPP 1786
            ++KN   A   F  MK+  ++PN +TY+T+LR    +G   +V   L++Y++MRK+G+  
Sbjct: 528  ESKNLKMAFSLFAEMKRYQLKPNLVTYDTLLRARTRYGSLKEVQQCLAIYQDMRKAGYKS 587

Query: 1787 NDEFLKGLIEEWAEGVIQKSYSTSQSMK-LKGNQYQRKDENTRQKSDIIFFEKIAG-LAC 1960
            ND +LK LIEEW EGVIQ +       K  K  ++ R           +  EK+A  L  
Sbjct: 588  NDNYLKQLIEEWCEGVIQDNDQCQDDFKPCKRAEFGRPHS--------LLLEKVAAHLHH 639

Query: 1961 SKSSDFTVDLRGLSETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS-----GNLQK 2125
            + +   +VDL+GL++ E             E +  G+ +  D+ I +G          QK
Sbjct: 640  NVAESLSVDLQGLTKVEARIVVLAVLRMVKENYIQGHLVKDDMSITLGIDKVDVLPATQK 699

Query: 2126 QRLESSHITKILNQELGLLVLQNRDYDQPVVAGTSSSDILSCINNRR--TKAGNKTGFSV 2299
              ++ + I K+L+ ELGL VL       P       +D+   +N+ +  +K+  +    V
Sbjct: 700  AEVKDA-IFKLLHNELGLEVL----IVVPRYTADLETDLEIPLNSYQNWSKSSGRENIRV 754

Query: 2300 SNLNERMPLKRLPTVQRLIVPKKSLYQWVEK 2392
            S  + R PL     +QRL V + SL+ W+++
Sbjct: 755  S--SARRPL----VLQRLKVTRNSLHSWLQR 779

