BLASTX nr result

ID: Ephedra28_contig00011654 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00011654
         (1600 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   355   3e-95
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   354   7e-95
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   352   2e-94
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   352   3e-94
ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [A...   350   7e-94
ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containi...   350   7e-94
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   350   1e-93
gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily p...   346   2e-92
gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   343   2e-91
gb|ESW34706.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   343   2e-91
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     340   8e-91
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   340   8e-91
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   340   1e-90
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   339   2e-90
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   336   1e-89
gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus pe...   336   2e-89
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   329   2e-87
ref|XP_006577707.1| PREDICTED: pentatricopeptide repeat-containi...   328   3e-87
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   325   3e-86
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                325   3e-86

>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  355 bits (911), Expect = 3e-95
 Identities = 210/548 (38%), Positives = 308/548 (56%), Gaps = 23/548 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ GV  D+A+YNILLKAC +A   DLA DIY  +K     G +K+DV TYST+++
Sbjct: 317  YKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVK 376

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL +K DM+ AG+ P+M+TW+ L+++CAN GLVE A+ LF+EMV +G +P
Sbjct: 377  VFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP 436

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK- 542
            N+QCCN LL A V+  Q+ RAF  F++WKEK  + G  ++    N  D  S    C  K 
Sbjct: 437  NTQCCNTLLHACVEGRQFDRAFRLFRSWKEKELWDGIERKSSTDNNLDADSTSQLCNTKM 496

Query: 543  -----------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSM 671
                             FKPT+ TYN LMKAC +  Y A+ +M+EMK  G+ P++++WS+
Sbjct: 497  PNAPSHVHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSI 556

Query: 672  LIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKN 851
            L+D  G + D++  +Q    M  AGV PDVV YTT IKVCV+ K +  A  +FE MK+  
Sbjct: 557  LVDICGRSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFE 616

Query: 852  VQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQK 1031
            +QPN +TY+T+LR    +G   +V   L++Y++MRKSGF  ND +LK LI EW EGVIQK
Sbjct: 617  IQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQK 676

Query: 1032 SYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAG-LACSKSSDFTVDLRGLSETETXX 1208
              +  QP+++        ++    K   +  EK+A  L  S +   T+DL+ L++ E   
Sbjct: 677  --NNQQPVEI-----TPCNKIDIGKPRCLILEKVADHLQKSFAESLTIDLQELTKVEARI 729

Query: 1209 XXXXXXXXXXEKHGPGNPIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQELGLLVL 1376
                      E +  G  +  D+ II+  +    +L  Q  E    IT++L  ELGL VL
Sbjct: 730  VVLAVLRMIKENYALGESVKDDIFIILEVNKVETDLVPQNFEVRDAITRLLQDELGLEVL 789

Query: 1377 QNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKK 1556
                   P +A      +    N+  +K  + T    +    +   ++   VQRL V KK
Sbjct: 790  PT----GPTIA------LDKVPNSESSKISHTTKLKGTMGRNKYFTRKPADVQRLKVTKK 839

Query: 1557 SLYQWVEK 1580
            SL  W+++
Sbjct: 840  SLQDWLQR 847


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  354 bits (908), Expect = 7e-95
 Identities = 200/555 (36%), Positives = 317/555 (57%), Gaps = 30/555 (5%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            +  MQ  GV  D+ +YNILLK+C +A   DLA +IY  LK   + G +K+DV TYSTLI+
Sbjct: 318  YKQMQKLGVPADLTSYNILLKSCCLATRVDLAKEIYGELKHLEMAGALKLDVFTYSTLIK 377

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    ALEIK+DM  AG+ P+++TW+ L++ACAN G+V++A+ LF+EM+ +G +P
Sbjct: 378  VFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANAGVVDQAIQLFEEMLQAGCEP 437

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK- 542
            NSQC N LL A V+  QY RAF  F++WKE        + Y  G   +N    P+ +   
Sbjct: 438  NSQCYNILLHACVEACQYDRAFRLFRSWKENALQKDKCEDYG-GKTDNNIDLSPTLVVSA 496

Query: 543  -------------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTW 665
                               F PT  TYN LMKAC S  Y A+ +M+EMK+ G+ P+++TW
Sbjct: 497  SIPTRTSASSHRHISTRVPFIPTTSTYNILMKACGSDYYRAKALMEEMKEVGLSPNHITW 556

Query: 666  SMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKK 845
            ++LID  G + +++G +Q    M EAG++PDVVTYTT+IKVCV+NK F  A  +F AMK+
Sbjct: 557  TILIDICGGSGNVEGALQILRVMREAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKR 616

Query: 846  KNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVI 1025
              ++PN +TYNT+LR    +G   +V   L++Y++MRK+G+ PND +LK LIE+W EGVI
Sbjct: 617  YQIKPNMVTYNTLLRARSRYGSLQEVQQCLAIYQDMRKAGYKPNDYYLKQLIEQWCEGVI 676

Query: 1026 QKSYSTSQPMKLKGNQHQRK-DENTQQKSDI----IFFEKIA-GLACSKSSDFTVDLRGL 1187
            Q             N +QRK + +T+ ++D+    +  EK+A  L    ++  +++LRGL
Sbjct: 677  Q-------------NANQRKYNFSTRNRTDLGPQSMILEKVAEHLQKDSANSISINLRGL 723

Query: 1188 SETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS----GNLQKQRLESSHITKILNQ 1355
            ++ E             EK+  G+ I  D+ I +G        ++++ +    I ++L  
Sbjct: 724  TKVEARIVVLAVLRMIREKYTAGDSIKDDVQIFLGVKEVGIRAVKQESVVKEAIIQLLQH 783

Query: 1356 ELGLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQ 1535
            +LGL V+          A ++  + ++  +N+ +         +   +   P ++   +Q
Sbjct: 784  DLGLEVIS---------AASTIGNGINHPDNKHSNMEENAERVILRPSVYSPTRKPVVLQ 834

Query: 1536 RLIVPKKSLYQWVEK 1580
            ++ + K+SL  W+ +
Sbjct: 835  KMRITKESLQSWLTR 849


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  352 bits (904), Expect = 2e-94
 Identities = 216/554 (38%), Positives = 308/554 (55%), Gaps = 28/554 (5%)
 Frame = +3

Query: 3    TFH---DMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYS 173
            TFH    MQ+ GV  D+A YNILLKAC +A   DLA DIY+ ++     G +K+DV TYS
Sbjct: 312  TFHVYKSMQNLGVTADLACYNILLKACSLAGRVDLAQDIYKEVQHLESTGVLKLDVFTYS 371

Query: 174  TLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLS 353
            T+++V  D+K    AL +K DM+ AG++P+ +TW+  ++ACAN GLV+KA+ LF+EM+L+
Sbjct: 372  TVVKVFSDAKMWHMALNVKEDMQSAGVIPNTVTWSSFISACANAGLVDKAIQLFEEMLLA 431

Query: 354  GYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSC 533
              +PNSQC N LL A V+  QY RAF  F ++K         K YK      + + P   
Sbjct: 432  SCEPNSQCFNILLHACVEACQYDRAFRLFHSFKSNKLQETFGKNYKGSAGSSSTTIPLII 491

Query: 534  IPK-------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGN 692
            +P        FKPT  TYNTLMKAC S  Y A+ +MDEMK  G++P+ +TWS+L D  G+
Sbjct: 492  LPSNFAEGLSFKPTTTTYNTLMKACGSDYYHAKALMDEMKTVGLLPNQITWSILADICGS 551

Query: 693  TADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAIT 872
            + ++QG +Q   +M  AG++PDVV YTT IK+CV+++    A+++F  MKK  + PN +T
Sbjct: 552  SGNVQGALQILKSMRVAGIQPDVVAYTTAIKICVESENLDLALLLFAEMKKYQIHPNLVT 611

Query: 873  YNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQP 1052
            YNT+LR    +G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ S      
Sbjct: 612  YNTLLRARSRYGSVSEVQQCLAIYQDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCP---- 667

Query: 1053 MKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXXXXXX 1229
               K  +    D+    +   +  EK+A       +D   VDL+GL++ E          
Sbjct: 668  ---KQGEFSYGDKADIGRPGSLLLEKVAEHLQQHIADTLAVDLQGLTKVEARIVVLAVLR 724

Query: 1230 XXXEKHGPGNPIDSDLIIIIG----YSGNLQKQRLE-SSHITKILNQELGLLVLQNRDYD 1394
               E +  G+ +  D++I++G      G      LE    ITK+L  ELGL VL      
Sbjct: 725  MIKENYILGDSVKDDMLIMVGVHDEVDGGSTAHNLEVKDAITKLLQDELGLKVLST---- 780

Query: 1395 QPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLK-----------RLPTV-QR 1538
             P VA  +            T     T  S  NL+E+ PL+           R P V +R
Sbjct: 781  VPKVALDT------------TIVSQNTIDSDQNLDEK-PLRKELQPELIYSTRRPVVLER 827

Query: 1539 LIVPKKSLYQWVEK 1580
            L V +KSL QW+ K
Sbjct: 828  LKVSRKSLQQWLRK 841


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  352 bits (903), Expect = 3e-94
 Identities = 206/564 (36%), Positives = 315/564 (55%), Gaps = 39/564 (6%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            +  MQ  GV  D+ +YNILLK+C +A   DLA +IY  LK   + G +K+DV TYSTLI+
Sbjct: 316  YKQMQKLGVPADLTSYNILLKSCCLATRVDLAKEIYGELKHLEMAGALKLDVFTYSTLIK 375

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    ALEIK+DM  AG+ P+++TW+ L++ACAN GLV++A+ LF+EM+ +G +P
Sbjct: 376  VFADAKMWQMALEIKKDMLSAGVTPNIVTWSSLISACANAGLVDQAIQLFEEMLQAGCEP 435

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYK---------------KGN 500
            NSQC N LL A V+  QY RAF  F++WKE      + + +                  +
Sbjct: 436  NSQCYNILLHACVEACQYDRAFRLFRSWKENALQKDNCEDFGGKTDNTIDLSPTLVVSAS 495

Query: 501  LPDNFSAPP----SCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWS 668
            +P   SA      S    F+PT  TYN L+KAC S  Y A+ +M+EMK+ G+ P+++TW+
Sbjct: 496  IPTRTSASSHGHFSTRVPFRPTTSTYNILIKACGSDYYRAKALMEEMKEVGLSPNHITWT 555

Query: 669  MLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKK 848
            +LID  G + +++G +Q    M EAG++PDVVTYTT+IKVCV+NK F  A  +F AMK+ 
Sbjct: 556  ILIDICGGSGNVEGALQILRAMREAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRY 615

Query: 849  NVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQ 1028
             ++PN +TYNT+LR    +G   +V   L++Y+ MRK+G+ PND +LK LIE+W EGVIQ
Sbjct: 616  QIKPNMVTYNTLLRARSRYGSLQEVQQCLAIYQHMRKAGYKPNDYYLKQLIEQWCEGVIQ 675

Query: 1029 KSYSTSQPMKLKGNQHQRKDENTQQKSDI----IFFEKIA-GLACSKSSDFTVDLRGLSE 1193
                        GNQ ++ + +T+ ++D+    +  +K+A  L    ++  +++LRGLS+
Sbjct: 676  -----------NGNQ-RKYNFSTRNRTDLGPESMILDKVAEHLQKDSANSISINLRGLSK 723

Query: 1194 TETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS----GNLQKQRLESSHITKILNQEL 1361
             E             EK+  G+ I  D+ I +G        + ++ +    I K+L  +L
Sbjct: 724  VEARIVVLAVLRMIREKYTAGDSIKEDVQIFLGVQEVGIRAVGQESVVKEAIVKLLQHDL 783

Query: 1362 GLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNE-----------RM 1508
            GL V+                   S I N R + G       SN+ E             
Sbjct: 784  GLEVI----------------SAASRIGNDRNQDGINHPDKHSNMEENAERVILRANVHS 827

Query: 1509 PLKRLPTVQRLIVPKKSLYQWVEK 1580
            P ++   +Q++ + K+SL  W+ +
Sbjct: 828  PTRKPVVLQKMRITKESLQSWLTR 851


>ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [Amborella trichopoda]
            gi|548832949|gb|ERM95718.1| hypothetical protein
            AMTR_s00023p00232870 [Amborella trichopoda]
          Length = 855

 Score =  350 bits (899), Expect = 7e-94
 Identities = 205/553 (37%), Positives = 316/553 (57%), Gaps = 28/553 (5%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            +  M+  GV  D+A+YN+LLK C +A   DLA +IYE + ++++ GG+K+DVITYST+I+
Sbjct: 309  YKQMKKLGVAADMASYNVLLKVCCLAGRVDLAQEIYEEILQRALFGGLKLDVITYSTIIK 368

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    A +IK DM  AG+ P+++TW+ L++ACAN GLVE+ + + +EM++ G +P
Sbjct: 369  VFADAKMWEMAFKIKDDMISAGVSPNIVTWSSLISACANAGLVERVIQVLEEMLVVGCEP 428

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKR---------------YKKGN 500
            N+QCCN LL+A V+  Q+ RAF  F  WK+ GF  GS  +               +  GN
Sbjct: 429  NTQCCNILLNACVESCQFDRAFRIFHFWKQNGFSMGSNAKECGSKTVTDIKQNEYFSSGN 488

Query: 501  LPDNFSAPP--------SCIPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDN 656
               + ++          S +  FKPTV TYN LMKAC +  Y A+ +MDEMK  G+ P++
Sbjct: 489  HEFHITSDALDPHDLNFSEVIPFKPTVATYNILMKACGTDYYRAQALMDEMKAGGLSPNH 548

Query: 657  VTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEA 836
            ++WS+LID  G + +++G +QAF +M  AG+ PDVV YTT IK CV NK F  A  +FE 
Sbjct: 549  ISWSILIDICGRSYNMKGAIQAFKSMYNAGIIPDVVAYTTAIKACVGNKYFKMAFSLFEE 608

Query: 837  MKKKNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAE 1016
            MK+  +QPN +TYNT+L     +G   +V   L++Y++MRK+G+  ND FLK L+EEW E
Sbjct: 609  MKRHRLQPNLVTYNTLLTARSRYGSLDEVLQCLAIYQDMRKAGYNSNDRFLKELLEEWCE 668

Query: 1017 GVIQKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAG-LACSKSSDFTVDLRGLSE 1193
            GVI  S    +  +L  ++  +  E    +S  +  EK+A  L  + + + T+DLRGL++
Sbjct: 669  GVI--SDKGKRWSELNIDKCDKGSEVYGPQS--LLLEKVAAYLQENFAENLTIDLRGLTK 724

Query: 1194 TETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS-GNLQKQRLE---SSHITKILNQEL 1361
             E             E +  G P+  D+III   +  N+     E      + ++L  EL
Sbjct: 725  VEARIIVLAKLRMLKENYILGKPVRDDMIIITANTRSNMDAAETELRVRDAVIRVLQGEL 784

Query: 1362 GLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRL 1541
            GL VL+  +  +     T  + ++S ++        +    +     R P+     VQRL
Sbjct: 785  GLSVLEGPELGE---LSTRHAHVISSLSPETLTMSKRP--QLREYTTRRPV----DVQRL 835

Query: 1542 IVPKKSLYQWVEK 1580
             +P++SL  W++K
Sbjct: 836  KIPRRSLNLWLQK 848


>ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cicer arietinum]
          Length = 799

 Score =  350 bits (899), Expect = 7e-94
 Identities = 204/550 (37%), Positives = 309/550 (56%), Gaps = 25/550 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ  G+KPD+ +YNILLKAC VA   DLA D+Y+ LK     G +K+DV TYST+I+
Sbjct: 273  YQNMQKVGLKPDMTSYNILLKACCVAGRVDLAQDMYKELKHLESIGQLKLDVFTYSTIIK 332

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL+IK DM +AG+  + + W+ L+ ACA+ GLVE+A+ LF+EM+LSG +P
Sbjct: 333  VFADAKLWQMALKIKHDMLLAGVSLNTVAWSSLINACAHAGLVEQAIQLFEEMLLSGCEP 392

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK- 542
            N+QC N +L A V+  QY RAF FF +WK         + +          +  + +PK 
Sbjct: 393  NTQCFNIILHACVEGCQYDRAFRFFYSWKGNKTLVSFGESHNSNAEEGGMDSVTTTVPKG 452

Query: 543  --------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLID 680
                          FKPT  TYNTL+KAC +  Y A+ +++EMK  G+ P+ ++WS+LI+
Sbjct: 453  ISSSHIMSFTERFPFKPTTSTYNTLLKACGTNYYHAKALINEMKTVGLSPNQISWSILIN 512

Query: 681  AYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQP 860
              G + +++G ++    M +AGVKPDVV YTT IKVCV++K FTKA+ ++E MK    QP
Sbjct: 513  ICGGSENVEGAIEILRTMIDAGVKPDVVAYTTAIKVCVESKNFTKALTLYEEMKSYETQP 572

Query: 861  NAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYS 1040
            N +TYNT+LR   ++G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ    
Sbjct: 573  NLVTYNTLLRARSKYGSLREVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQ---- 628

Query: 1041 TSQPMKLKGNQHQRKDENTQQKSDI-----IFFEKIAGLACSKSSD-FTVDLRGLSETET 1202
                     N+    + ++ +K +I     +  EKIA     + +D   +D++GLS+ E 
Sbjct: 629  --------DNEEYEVEFSSSKKPEIERPESLLLEKIAAHLLKRVADILAIDVQGLSKVEA 680

Query: 1203 XXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQELGLL 1370
                        E +  G+ ++ D++IIIG +    +  K+ LE    + K+L  ELGL 
Sbjct: 681  RLVILAVLRMIKENYAFGHSVNDDILIIIGATKADESPAKEILEVQEAVIKLLRNELGLE 740

Query: 1371 VLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVP 1550
             L       P     + SD     N +         F           +R   +QRL V 
Sbjct: 741  AL-------PAKTRFAPSDSPKLQNTKENALPTTMVFHT---------RRPAVLQRLKVT 784

Query: 1551 KKSLYQWVEK 1580
            K+SL++W+++
Sbjct: 785  KQSLHRWLQR 794



 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 64/282 (22%), Positives = 117/282 (41%), Gaps = 6/282 (2%)
 Frame = +3

Query: 60  LLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDM 239
           ++K C + R  +LA+     L +  I          + ++I   G S++L  AL+    M
Sbjct: 158 IIKRCVLNRKPNLAVRYASLLPQAHI---------LFCSIISGFGKSRDLVSALKAYDAM 208

Query: 240 EVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQY 419
           +     P+M  +  ++  C   G   K+  ++++++     PN    N+L++A   D  Y
Sbjct: 209 KKNLKRPNMYIYRAIIDVCGLCGDFMKSRYIYEDLLNQKITPNIYVFNSLMNANAHDISY 268

Query: 420 ARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPKFKPTVVTYNTLMKAC--RS 593
                 ++N ++ G                            KP + +YN L+KAC    
Sbjct: 269 --TLNLYQNMQKVG---------------------------LKPDMTSYNILLKACCVAG 299

Query: 594 TPYLARTMMDEMKD----NGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDV 761
              LA+ M  E+K       +  D  T+S +I  + +    Q  ++  ++M  AGV  + 
Sbjct: 300 RVDLAQDMYKELKHLESIGQLKLDVFTYSTIIKVFADAKLWQMALKIKHDMLLAGVSLNT 359

Query: 762 VTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAITYNTIL 887
           V +++LI  C       +AI +FE M     +PN   +N IL
Sbjct: 360 VAWSSLINACAHAGLVEQAIQLFEEMLLSGCEPNTQCFNIIL 401


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 811

 Score =  350 bits (897), Expect = 1e-93
 Identities = 202/546 (36%), Positives = 317/546 (58%), Gaps = 21/546 (3%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ G+KPD+ +YNILLKAC VA   DLA DIY  LK     G +K+DV TYST+I+
Sbjct: 283  YQNMQNLGLKPDMTSYNILLKACCVAGRVDLAQDIYRELKHLESVGQLKLDVFTYSTIIK 342

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D K    AL+IK+DM  AG+  +++ W+ L+ ACA+ GLVE+A+ LF+EM+L+G +P
Sbjct: 343  VFADVKLWQMALKIKQDMLSAGVSLNIVAWSSLINACAHAGLVEQAIQLFEEMLLAGCEP 402

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYK----KGNLPDNFSAPPSC 533
            N+QC N +L+A V+ +QY RAF FF +WK K     S + Y     +G++ D  S P   
Sbjct: 403  NTQCFNIILNACVEAYQYDRAFRFFHSWKGKKMLGSSGEGYNSNIGQGHMHDVTSIPNGI 462

Query: 534  IPK----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDA 683
                         F PT  TYN L+KAC +  Y A+ ++ EM+  G+ P+ ++WS+LID 
Sbjct: 463  SNSHILNFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSILIDI 522

Query: 684  YGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPN 863
             G +++++G ++    M +AG+KPDV+ YTT IKVCV++K F +A+ ++E MK   ++PN
Sbjct: 523  CGASSNVEGAIEILKTMGDAGIKPDVIAYTTAIKVCVESKNFMQALTLYEEMKCYQIRPN 582

Query: 864  AITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYST 1043
             +TYNT+L+   ++G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ +   
Sbjct: 583  WVTYNTLLKARSKYGFLHEVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQNN--- 639

Query: 1044 SQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXXX 1220
                + K  +    +++  ++   +  EKIA     + +D   +D++GL++ E       
Sbjct: 640  ----REKQGEFSSSNKSESERPQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLA 695

Query: 1221 XXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SSHITKILNQELGLLVLQNRD 1388
                  E +G G+ ++ D++IIIG      N  K  LE    I K+L  ELGL V   + 
Sbjct: 696  VLRMIKENYGLGHSVNDDILIIIGATKVDENPSKHILEVQEAIIKLLRNELGLEVFPAK- 754

Query: 1389 YDQPVVAGTSSSDI--LSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSL 1562
              +  ++ T++ +    S ++       N  GF           +R   + RL V KKSL
Sbjct: 755  -TRLALSDTANLEYPNFSNLSIEAQPGENALGFQT---------RRPGVLVRLKVTKKSL 804

Query: 1563 YQWVEK 1580
            Y+W+ +
Sbjct: 805  YRWLHR 810



 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 60/282 (21%), Positives = 110/282 (39%), Gaps = 6/282 (2%)
 Frame = +3

Query: 60  LLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDM 239
           ++K C ++RN  LA+     L    I          +  +I   G  ++L  AL+     
Sbjct: 168 IIKRCVLSRNPILAVRYACLLPHAHI---------LFCNIISEFGKRRDLVSALKAYEAS 218

Query: 240 EVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQY 419
           +     P+M  +   +  C       K+  ++++++     PN    N+L++    D  Y
Sbjct: 219 KKHLNTPNMYIYRATIDTCGLCRDYMKSRYIYEDLLNQKITPNIYVFNSLMNVNSHDLSY 278

Query: 420 ARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPKFKPTVVTYNTLMKAC--RS 593
                 ++N +  G                            KP + +YN L+KAC    
Sbjct: 279 --TLNLYQNMQNLG---------------------------LKPDMTSYNILLKACCVAG 309

Query: 594 TPYLARTMMDEMKD----NGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDV 761
              LA+ +  E+K       +  D  T+S +I  + +    Q  ++   +M  AGV  ++
Sbjct: 310 RVDLAQDIYRELKHLESVGQLKLDVFTYSTIIKVFADVKLWQMALKIKQDMLSAGVSLNI 369

Query: 762 VTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAITYNTIL 887
           V +++LI  C       +AI +FE M     +PN   +N IL
Sbjct: 370 VAWSSLINACAHAGLVEQAIQLFEEMLLAGCEPNTQCFNIIL 411


>gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao]
          Length = 858

 Score =  346 bits (887), Expect = 2e-92
 Identities = 213/545 (39%), Positives = 304/545 (55%), Gaps = 22/545 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + DMQ+ G+  D+A+YNILLKAC +A+  DLA DIY  +K     G +K+DV TY T+I+
Sbjct: 324  YKDMQNLGITADMASYNILLKACCLAQRVDLAQDIYNEVKHLESTGVLKLDVFTYCTIIK 383

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D++    AL+IK DM  AG+ P+ +TW+ L++ACAN GLVE+A  LF+EM+L+G +P
Sbjct: 384  VFADARLWQMALKIKEDMLSAGVTPNTVTWSSLISACANAGLVEQAFQLFEEMILTGCEP 443

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWK--EKGFYAGSLKRYKKGNLPDNFSAPPSCIP 539
            NSQCCN LL A V+  QY RAF  F  W   ++GF AG++         +N +   +   
Sbjct: 444  NSQCCNILLHACVEASQYDRAFRLFHCWTGGQEGF-AGNIDSVLGTKQLNNRTTSTALTN 502

Query: 540  K----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYG 689
                       F PT  TYN LMKAC +  Y A+ +MDEMK  G+ P++V+WS+LID   
Sbjct: 503  SHHLSFAKKFSFTPTTATYNILMKACCTDYYRAKALMDEMKSVGLSPNHVSWSILIDICR 562

Query: 690  NTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAI 869
             + +++G +Q    M   G+KPDVV YTT IKVCV +K    A  +FE MK+  VQPN +
Sbjct: 563  GSGNVEGAIQILKTMHVTGIKPDVVAYTTAIKVCVGSKNLKLAFSLFEEMKRYRVQPNLV 622

Query: 870  TYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQ 1049
            TYNT+LR    +G   +V   L++Y++MRK+G+  ND +LK LIEEW EGVI        
Sbjct: 623  TYNTLLRARSRYGSLHEVQQCLAIYQDMRKAGYKSNDIYLKELIEEWCEGVI-------- 674

Query: 1050 PMKLKGNQHQRKDENTQQKSDI-----IFFEKIA-GLACSKSSDFTVDLRGLSETETXXX 1211
                K N H+R+  ++ +++D+     +  EKIA  L  S +    +DLRGL++ E    
Sbjct: 675  ----KENNHKREGLSSCKRTDLERPHSLLLEKIAVHLQMSTAESPAIDLRGLTKVEARIV 730

Query: 1212 XXXXXXXXXEKHGPGNPIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQELGLLVLQ 1379
                     E H  G+ +  D++II+G S    N  KQ+ E    + K+L  ELGL VL 
Sbjct: 731  VLAVLRMIKENHILGHSVKDDMLIILGVSERHANAAKQKSEVKDAVMKLLQDELGLEVLL 790

Query: 1380 NRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKS 1559
                +  V  G          +    +   K   S   L+     +R   +QRL V +KS
Sbjct: 791  ---VEPQVKNGLVDLQTPIDADPVLLETVGKNSLSSKPLSS---TRRPVILQRLKVTRKS 844

Query: 1560 LYQWV 1574
            L  W+
Sbjct: 845  LNHWL 849


>gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  343 bits (879), Expect = 2e-91
 Identities = 199/548 (36%), Positives = 312/548 (56%), Gaps = 23/548 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ G+KPD+ +YNILLK C VA   DLA DIY  LK     G +K+DV TYST+I+
Sbjct: 277  YQNMQNLGLKPDMTSYNILLKGCCVAGRVDLAQDIYRELKHLESVGQLKLDVFTYSTIIK 336

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D++    AL IK+DM  AG+  +++ W+ L+ ACA+ GLVE+A+ LF+EM+L+G +P
Sbjct: 337  VFADARLWQMALTIKQDMLSAGVSLNIVAWSSLINACAHAGLVEQAIQLFEEMLLAGREP 396

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEK---GFYAGSLKRYKKGNLPDNFSAPPSCI 536
            N+QC N +L+A V+  QY RAF FF +WK K   G +        +  L  N +  P+ I
Sbjct: 397  NTQCFNIILNACVEACQYDRAFRFFHSWKGKKMLGSFGEGCNNNTRQELVHNVTTVPNGI 456

Query: 537  PK-----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDA 683
                         F PT  TYN L+KAC +  Y A+ ++ EM+  G+ P+ ++WS LID 
Sbjct: 457  SNSHILSFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDI 516

Query: 684  YGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPN 863
             G +A+++G ++   NM +AG+KPDV+ YTT IKVCV++K F +A+ +++ MK  +++PN
Sbjct: 517  CGASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPN 576

Query: 864  AITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYST 1043
             ITYNT+L+   ++G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ +   
Sbjct: 577  LITYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDN--- 633

Query: 1044 SQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXXX 1220
                +++G +    +++  +KS  +  EKIA     + +D   +D++GL++ E       
Sbjct: 634  ---REIQG-EFSSSNKSELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLA 689

Query: 1221 XXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SSHITKILNQELGLLVLQNRD 1388
                  E +  G+ I+ D++I+IG      N  K+ LE    I K+L  ELGL     R 
Sbjct: 690  VLRMIKENYSLGHSINDDILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPART 749

Query: 1389 ----YDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKK 1556
                 D P +   + +++               GF           +R   + RL + +K
Sbjct: 750  RLALSDTPKLKNPTLANLKIEAVPAEDALPTSMGFQT---------RRPGILVRLKITRK 800

Query: 1557 SLYQWVEK 1580
            SLY W+ +
Sbjct: 801  SLYSWLHR 808



 Score = 59.7 bits (143), Expect = 3e-06
 Identities = 59/282 (20%), Positives = 111/282 (39%), Gaps = 6/282 (2%)
 Frame = +3

Query: 60  LLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQVVGDSKNLAKALEIKRDM 239
           ++K C ++RN  LA+     L    I          + ++I   G  ++L  A +     
Sbjct: 162 VIKRCVLSRNPILAVRYACLLPHAQI---------LFCSIISEFGKRRDLISAFKAYELS 212

Query: 240 EVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQPNSQCCNALLSAFVKDFQY 419
           +    +P+M  +  ++ AC       K+  ++++++     PN    N+L++    D  Y
Sbjct: 213 KKHMNIPNMYMYRAIIDACGLCRDYMKSRYIYEDLLNQKITPNIYVFNSLMNVNAHDLSY 272

Query: 420 ARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPKFKPTVVTYNTLMKAC--RS 593
                 ++N +  G                            KP + +YN L+K C    
Sbjct: 273 --TLNLYQNMQNLG---------------------------LKPDMTSYNILLKGCCVAG 303

Query: 594 TPYLARTMMDEMKD----NGIVPDNVTWSMLIDAYGNTADLQGCMQAFNNMCEAGVKPDV 761
              LA+ +  E+K       +  D  T+S +I  + +    Q  +    +M  AGV  ++
Sbjct: 304 RVDLAQDIYRELKHLESVGQLKLDVFTYSTIIKVFADARLWQMALTIKQDMLSAGVSLNI 363

Query: 762 VTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAITYNTIL 887
           V +++LI  C       +AI +FE M     +PN   +N IL
Sbjct: 364 VAWSSLINACAHAGLVEQAIQLFEEMLLAGREPNTQCFNIIL 405


>gb|ESW34706.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 594

 Score =  343 bits (879), Expect = 2e-91
 Identities = 199/548 (36%), Positives = 312/548 (56%), Gaps = 23/548 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ G+KPD+ +YNILLK C VA   DLA DIY  LK     G +K+DV TYST+I+
Sbjct: 62   YQNMQNLGLKPDMTSYNILLKGCCVAGRVDLAQDIYRELKHLESVGQLKLDVFTYSTIIK 121

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D++    AL IK+DM  AG+  +++ W+ L+ ACA+ GLVE+A+ LF+EM+L+G +P
Sbjct: 122  VFADARLWQMALTIKQDMLSAGVSLNIVAWSSLINACAHAGLVEQAIQLFEEMLLAGREP 181

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEK---GFYAGSLKRYKKGNLPDNFSAPPSCI 536
            N+QC N +L+A V+  QY RAF FF +WK K   G +        +  L  N +  P+ I
Sbjct: 182  NTQCFNIILNACVEACQYDRAFRFFHSWKGKKMLGSFGEGCNNNTRQELVHNVTTVPNGI 241

Query: 537  PK-----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDA 683
                         F PT  TYN L+KAC +  Y A+ ++ EM+  G+ P+ ++WS LID 
Sbjct: 242  SNSHILSFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDI 301

Query: 684  YGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPN 863
             G +A+++G ++   NM +AG+KPDV+ YTT IKVCV++K F +A+ +++ MK  +++PN
Sbjct: 302  CGASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPN 361

Query: 864  AITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYST 1043
             ITYNT+L+   ++G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ +   
Sbjct: 362  LITYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDN--- 418

Query: 1044 SQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXXX 1220
                +++G +    +++  +KS  +  EKIA     + +D   +D++GL++ E       
Sbjct: 419  ---REIQG-EFSSSNKSELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLA 474

Query: 1221 XXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SSHITKILNQELGLLVLQNRD 1388
                  E +  G+ I+ D++I+IG      N  K+ LE    I K+L  ELGL     R 
Sbjct: 475  VLRMIKENYSLGHSINDDILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPART 534

Query: 1389 ----YDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKK 1556
                 D P +   + +++               GF           +R   + RL + +K
Sbjct: 535  RLALSDTPKLKNPTLANLKIEAVPAEDALPTSMGFQT---------RRPGILVRLKITRK 585

Query: 1557 SLYQWVEK 1580
            SLY W+ +
Sbjct: 586  SLYSWLHR 593


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  340 bits (873), Expect = 8e-91
 Identities = 206/549 (37%), Positives = 309/549 (56%), Gaps = 24/549 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + DMQ+ GV+ D+A+YNILLKAC +A   DLA DIY+ ++     G +K+DV TYST+++
Sbjct: 292  YKDMQNLGVQADMASYNILLKACCLAGRVDLAQDIYKEVQHLESTGLLKLDVFTYSTIVK 351

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V+ D+K    AL++K DM  AG+ P+ +TW+ L++ACAN G+V+KA+ LF+EM+L+G +P
Sbjct: 352  VLADAKLWQMALKVKEDMLSAGVNPNTVTWSSLISACANAGIVDKAVQLFEEMLLAGCKP 411

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK- 542
            N+QCCN LL A V+  QY RAF  F+  K       S +   +G+   N SA  + I + 
Sbjct: 412  NTQCCNILLHACVEACQYDRAFRLFEFLKRNRVQETS-EEDGRGDRDSNQSAGVTSISQS 470

Query: 543  -------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDA 683
                         F PT  TYN LMKAC S  Y A+ +++EM+  G+ P+ +TWS+LID 
Sbjct: 471  STLCGLNFARELPFTPTTTTYNILMKACGSDYYHAKALIEEMEAVGLSPNQITWSILIDI 530

Query: 684  YGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPN 863
             G+  +++G +Q    M   G++PDVV YTT+IKVCV++K   +A  +F  MK+  +QPN
Sbjct: 531  CGDLGNVEGALQILKTMRATGIEPDVVAYTTVIKVCVESKDLKQAFELFAEMKRYQIQPN 590

Query: 864  AITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYST 1043
             +TYNT+LR    +G   +V   L++Y++MR++G+  ND +LK LIEEW EGVIQ     
Sbjct: 591  LVTYNTLLRARNRYGSLQEVKQCLAVYQDMRRAGYNSNDYYLKQLIEEWCEGVIQ----- 645

Query: 1044 SQPMKLKGNQHQRKDENTQQKSD-----IIFFEKIA-GLACSKSSDFTVDLRGLSETETX 1205
                   GN   R++ ++  K+D      +  EK+A  L    +   TVD++GL + E  
Sbjct: 646  -------GNNQNREESSSFNKTDKKRPQSLLLEKVAEHLEKHIAETLTVDVQGLKKVEAR 698

Query: 1206 XXXXXXXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SSHITKILNQELGLLV 1373
                       E +  G  +  D++IIIG         +Q LE    ITK+L  ELGL V
Sbjct: 699  IVVLAVLRMVKENYTMGYLVKDDMLIIIGACKVDAVPDEQELEVKDAITKLLKDELGLEV 758

Query: 1374 LQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPK 1553
            L      +P               NR+  + +  G S  +   +   +R   +QRL V K
Sbjct: 759  LSTGLKIEP---------------NRQVDS-DSLGSSDFSGEMKYSTRRPVVIQRLKVTK 802

Query: 1554 KSLYQWVEK 1580
            +SL  W+++
Sbjct: 803  ESLQHWLQR 811


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  340 bits (873), Expect = 8e-91
 Identities = 208/554 (37%), Positives = 304/554 (54%), Gaps = 29/554 (5%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ GV  D+A+YNILLKAC +A   DLA DIY  +K     G +K+DV TYST+++
Sbjct: 317  YKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVK 376

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL +K DM+ AG+ P+M+TW+ L+++CAN GLVE A+ LF+EMV +G +P
Sbjct: 377  VFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP 436

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSCIPK- 542
            N+QCCN LL A V+  Q+ RAF  F++WKEK  + G  ++    N  D  S    C  K 
Sbjct: 437  NTQCCNTLLHACVEGRQFDRAFRLFRSWKEKELWDGIERKSSTDNNLDADSTSQLCTTKM 496

Query: 543  -----------------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSM 671
                             FKPT+ TYN LMKAC +  Y A+ +M+EMK  G+ P++++WS+
Sbjct: 497  PNAPSHVHQISFVGNLAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSI 556

Query: 672  LIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIK------VCVKNKKFTKAIMMFE 833
            L+D  G + D++  +Q    M  AGV PDVV YTT IK      V V    +  A  +FE
Sbjct: 557  LVDICGRSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVSIPLAVLVLKXNWKLAFSLFE 616

Query: 834  AMKKKNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWA 1013
             MK   +QPN +TY+T+LR    +G   +V   L++Y++MRKSGF  ND +LK LI EW 
Sbjct: 617  EMKGFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWC 676

Query: 1014 EGVIQKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAG-LACSKSSDFTVDLRGLS 1190
            EGVIQK  +  QP+++        ++    K   +  EK+A  L  S +   T+DL+ L+
Sbjct: 677  EGVIQK--NNQQPVEI-----TPCNKIDIGKPRCLILEKVADHLQKSFAESLTIDLQELT 729

Query: 1191 ETETXXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS---GNLQKQRLE-SSHITKILNQE 1358
            + E             E +  G  +  D+ II+  +    +L  Q  E    IT++L  E
Sbjct: 730  KVEARIVVLAVLRMIKENYALGESVKDDIFIILEVNKVETDLVPQNFEVRDAITRLLQDE 789

Query: 1359 LGLLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQR 1538
            LGL VL       P +A      +    N+  +K  + T    +    +   ++   VQR
Sbjct: 790  LGLEVLPT----GPTIA------LDKVPNSESSKISHTTKLKGTMGRNKYFTRKPADVQR 839

Query: 1539 LIVPKKSLYQWVEK 1580
            L V KKSL  W+++
Sbjct: 840  LKVTKKSLQDWLQR 853


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  340 bits (871), Expect = 1e-90
 Identities = 202/554 (36%), Positives = 303/554 (54%), Gaps = 29/554 (5%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+
Sbjct: 323  YKNMQKLDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIK 382

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +P
Sbjct: 383  VFADAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 442

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWK--------------EKG--FYAGSLKRYKKG 497
            NSQC N LL A V+  QY RAF  F++WK               KG  F    LK    G
Sbjct: 443  NSQCFNILLHACVEACQYDRAFRLFQSWKGSSVKEALYADKIVSKGRTFSPNKLKTNDPG 502

Query: 498  NLPDNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTW 665
            +L +N S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TW
Sbjct: 503  SLVNNNSTSPYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLTPNQITW 562

Query: 666  SMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKK 845
            S LID  G + D++G ++    M  AG +PDVV YTT IK+C +NK    A  +FE M++
Sbjct: 563  STLIDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRR 622

Query: 846  KNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVI 1025
              ++PN +TYNT+L+   ++G  ++V   L++Y++MRK+G+ PND FLK LIEEW EGVI
Sbjct: 623  YQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVI 682

Query: 1026 QKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIA-GLACSKSSDFTVDLRGLSETET 1202
            Q++  +   +  +   H  +  +       +  EK+A  L    + +  +DL+GL++ E 
Sbjct: 683  QENGQSQNKISDQEGDHAGRPVS-------LLIEKVATHLQERTAGNLAIDLQGLTKVEA 735

Query: 1203 XXXXXXXXXXXXEKHGPGNPIDSDLIIIIGYS-GNLQKQRLE---SSHITKILNQELGLL 1370
                        E +  G+ +  D++II+G S  N    + +      + K+L +EL L+
Sbjct: 736  RLVVLAVLRMIKEDYMRGDVVIDDVLIILGTSEANTDSGKQDIAVKEALVKLLQEELSLV 795

Query: 1371 VL----QNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQR 1538
            VL    +N   D   V   +        +   T    K+  S+S+       +R   ++R
Sbjct: 796  VLPAGQRNIKQDAHCVDDANQ-------DTEHTLENTKSFISISS------TRRPAILER 842

Query: 1539 LIVPKKSLYQWVEK 1580
            L+V K SLYQW+++
Sbjct: 843  LMVTKASLYQWLQR 856


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  339 bits (869), Expect = 2e-90
 Identities = 209/559 (37%), Positives = 306/559 (54%), Gaps = 34/559 (6%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ GV  D+A+YNILLKAC VA   DLA +IY  ++    +G +K+DV TYST+I+
Sbjct: 301  YKNMQNLGVTADMASYNILLKACCVAGRVDLAQEIYREVQNLESNGMLKLDVFTYSTIIK 360

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL+IK DM  AG++P+ +TW+ L+++CAN G+ E+A+ LF EM+L+G +P
Sbjct: 361  VFADAKLWQMALKIKEDMLSAGVIPNTVTWSALISSCANAGITEQAIQLFKEMLLAGCEP 420

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNL-------PDNFSAP 524
            NSQC N LL A V+  QY RAF  F++WK+  F   S      GN         +  ++ 
Sbjct: 421  NSQCYNILLHACVEACQYDRAFRLFQSWKDSRFQEIS-GGTGNGNTVGVELKHQNCITSM 479

Query: 525  PSCIPK-----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSM 671
            P+C+             F PT  TYN LMKAC +  Y A+ +MDEMK  G+ P++++WS+
Sbjct: 480  PNCLSNSHHLSFSKSFPFTPTTTTYNILMKACGTDYYRAKALMDEMKTAGLSPNHISWSI 539

Query: 672  LIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKN 851
            LID  G T ++ G ++    M EAG+KPDVV YTT IK CV++K    A  +F  MK+  
Sbjct: 540  LIDICGGTGNIVGAVRILKTMREAGIKPDVVAYTTAIKYCVESKNLKIAFSLFAEMKRYQ 599

Query: 852  VQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQK 1031
            +QPN +TYNT+LR    +G   +V   L++Y+ MRK+G+  ND +LK LIEEW EGVIQ 
Sbjct: 600  IQPNLVTYNTLLRARSRYGSLHEVQQCLAIYQHMRKAGYKSNDYYLKELIEEWCEGVIQD 659

Query: 1032 SYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAG-LACSKSSDFTVDLRGLSETETXX 1208
            +         K +   R D    Q    +  EK+A  L  S +    +DL+GL++ E   
Sbjct: 660  NNLNQS----KFSSVNRADWGRPQS---LLLEKVAAHLQKSVAESLAIDLQGLTQVEARI 712

Query: 1209 XXXXXXXXXXEKHGPGNPIDSDLIIIIGY----SGNLQKQRLESSHITKILNQELGLLVL 1376
                      E +  G+PI  D++II+G     +  ++ +      I K+L  ELGL V 
Sbjct: 713  VVLAVLRMIKENYILGHPIKDDILIILGIKKVDANLVEHESPVKGAIIKLLQDELGLEVA 772

Query: 1377 QNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPT--------- 1529
                +  P +A            ++R   G   G S  +  E +   RLPT         
Sbjct: 773  ----FAGPKIA-----------LDKRINLGGPPG-SDPDWQEALGRNRLPTELESSTRRP 816

Query: 1530 --VQRLIVPKKSLYQWVEK 1580
              +QR  V +KSL  W+++
Sbjct: 817  AVLQRFKVTRKSLDHWLQR 835


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  336 bits (862), Expect = 1e-89
 Identities = 198/552 (35%), Positives = 305/552 (55%), Gaps = 27/552 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ  GV  D+A+YNILLKAC +A N  LA +IY  +K     G +K+DV TYST+++
Sbjct: 312  YKNMQKLGVMADMASYNILLKACCLAGNTVLAQEIYGEVKHLEAKGVLKLDVFTYSTIVK 371

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL++K DM  AG+ P+ ITW+ L+ ACAN GLVE+A+ LF+EM  +G +P
Sbjct: 372  VFADAKWWQMALKVKEDMLSAGVTPNTITWSSLINACANAGLVEQAMHLFEEMRQAGCEP 431

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGF-------YAGSLKRYKKGNLPDNFSAP 524
            NSQCCN LL A V+  Q+ RAF  F++W            Y G+  R       D  S  
Sbjct: 432  NSQCCNILLQACVEACQFDRAFRLFRSWTLSKTQVALGEDYDGNTDRISNMEHKDKQSIT 491

Query: 525  --PSCIPK-----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTW 665
              P+ +P            FKPT  TYN LMKAC +  Y  + +MDEM+  G+ P++++W
Sbjct: 492  NTPNFVPNSHYSSFDKRFSFKPTTTTYNILMKACCTDYYRVKALMDEMRTVGLSPNHISW 551

Query: 666  SMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKK 845
            ++LIDA G + +++G +Q    M E G+ PDVV YTT IKVCV++K+   A  +FE MK 
Sbjct: 552  TILIDACGGSGNVEGALQILKIMREDGMSPDVVAYTTAIKVCVRSKRLKLAFSLFEEMKH 611

Query: 846  KNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVI 1025
              +QPN +TY T+LR    +G   +V   L++Y++M K+G+  ND +LK +IEEW EGVI
Sbjct: 612  YQIQPNLVTYITLLRARSRYGSLHEVQQCLAVYQDMWKAGYKANDTYLKEVIEEWCEGVI 671

Query: 1026 QKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETX 1205
            Q           +G     +  N+Q+   ++  +    L  S + +  +DL+GL++ E  
Sbjct: 672  QDKNQN------QGEVTLCRRTNSQRPQSLLLEKVAVHLQKSAAENLAIDLQGLTKVEAR 725

Query: 1206 XXXXXXXXXXXEKHGPGNPIDSDLIIIIGYSGNLQKQRLESSH-------ITKILNQELG 1364
                       E +  G P+  DL+I++G +   +  ++++ H       ITK+L  +LG
Sbjct: 726  IVVLAVLQMMKENYSLGVPVKDDLMIVLGPN---KVNKIQAKHDLEVKDAITKLLQDDLG 782

Query: 1365 LLVLQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLI 1544
            L V      D P +    ++ +   +++    A         ++  +   +R   +QRL 
Sbjct: 783  LKVF----LDGPSIQ-HKNAHMQKLLDSESNMA------KTLHIELKSSTRRPKILQRLK 831

Query: 1545 VPKKSLYQWVEK 1580
            VPKKSL+ W+++
Sbjct: 832  VPKKSLHHWLQR 843


>gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  336 bits (861), Expect = 2e-89
 Identities = 193/469 (41%), Positives = 275/469 (58%), Gaps = 11/469 (2%)
 Frame = +3

Query: 3    TFH---DMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYS 173
            TFH    MQ+ GV+ D+A YNILLKAC +A   DLA DIY  ++     G +K+DV TYS
Sbjct: 311  TFHVYKSMQNLGVRADMACYNILLKACCLAGRVDLAQDIYSEVQHLESTGVLKLDVFTYS 370

Query: 174  TLIQVVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLS 353
            T+++V  D+K    AL +K DM  AG+ P+ +TW+ L++ACAN G+VEKA+ LF+EM+L+
Sbjct: 371  TIVKVFADAKLWHMALNVKEDMLSAGVTPNTVTWSSLISACANAGIVEKAIQLFEEMLLA 430

Query: 354  GYQPNSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYKKGNLPDNFSAPPSC 533
            G +PNSQC N LL A V+  QY RAF  F+          SLKR                
Sbjct: 431  GSEPNSQCFNILLHACVEANQYDRAFRLFQ----------SLKRLS-------------- 466

Query: 534  IPKFKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLIDAYGNTADLQGC 713
               FKPT  TYNTLMKAC +  Y A+ ++DEM+  G+ P+ ++WS+L D  G + +++G 
Sbjct: 467  ---FKPTTTTYNTLMKACGTDYYHAKALLDEMRAVGLYPNQISWSILADICGGSGNVEGA 523

Query: 714  MQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQPNAITYNTILRG 893
            +Q   NM  AG+KPDVV YTT IKVCV+N+    A+ +F  MKK  + PN +TYNT+LR 
Sbjct: 524  LQILKNMRAAGMKPDVVAYTTAIKVCVENENLELALSLFGEMKKYQIHPNLVTYNTLLRA 583

Query: 894  WREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYSTSQPMKLKGNQ 1073
               +G   +V   L++Y++MRK+G+  ND +L+ LIEEW EGVIQ S +           
Sbjct: 584  RSRYGSVSEVQQCLAIYQDMRKAGYKSNDYYLEQLIEEWCEGVIQDSNA----------- 632

Query: 1074 HQRKDENTQQKSDI-----IFFEKIA-GLACSKSSDFTVDLRGLSETETXXXXXXXXXXX 1235
             ++++ ++  K+DI     +  EK+A  L    +    VDL+GL++ E            
Sbjct: 633  -KQEEFSSCNKTDIGRPGSLLLEKVAEHLQTHIAETLAVDLQGLTKVEARIVVLAVLRMI 691

Query: 1236 XEKHGPGNPIDSDLIIIIG-YSGNLQKQRLE-SSHITKILNQELGLLVL 1376
             E +  G+ +  D++I++G   G    Q LE    ITK+L  ELGL VL
Sbjct: 692  KENYTLGHSVKDDMLIVVGEVDGGSTTQNLEVKDAITKLLQDELGLKVL 740


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  329 bits (843), Expect = 2e-87
 Identities = 197/554 (35%), Positives = 298/554 (53%), Gaps = 29/554 (5%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+
Sbjct: 321  YKNMQKLDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIK 380

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +P
Sbjct: 381  VFADAKMWKMALKVKEDMQSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 440

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWK----EKGFYA------------GSLKRYKKG 497
            NSQC N LL A V+  Q+ RAF  F++WK    ++  YA              LK +  G
Sbjct: 441  NSQCFNILLHACVEACQFDRAFRLFQSWKGSSDKEALYADDITGKGSIFSPNKLKNHGNG 500

Query: 498  NLPDNFSAPPSCIPK---FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWS 668
            +L +  S+P         FKPT  TYN L+KAC +  Y  + +MDEM+  G+ P+ +TWS
Sbjct: 501  SLVNTNSSPYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWS 560

Query: 669  MLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKK 848
             LID  G + D++G +     M  AG +PDVV YTT IK+C +NK    A  +FE M++ 
Sbjct: 561  TLIDICGGSGDVEGAVGILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRY 620

Query: 849  NVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQ 1028
             ++PN +TYNT+L+   ++G  ++V   L++Y++MRK+G+ PND FLK LIEEW EGVIQ
Sbjct: 621  QIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQ 680

Query: 1029 K-SYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETX 1205
            + S S  +    +G    R      +K      E+ AG       +  +DL+GL++ E  
Sbjct: 681  ENSQSQIKTSDQEGTNLGRPVSLLIEKVATHLQERTAG-------NLAIDLQGLTKVEAR 733

Query: 1206 XXXXXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLV 1373
                       E +  G+ +  DL+II+G    N+   + E      + ++L  EL L+V
Sbjct: 734  LVVLAVLRMIKEDYIRGDVVTDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVV 793

Query: 1374 LQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNER-----MPLKRLPTVQR 1538
            L                 +L    + R       G  +++ N +        +R   ++R
Sbjct: 794  LP-----------AGHRHVLDITLDARCVDDADQGIELTSENTKSIVGISSTRRPAILER 842

Query: 1539 LIVPKKSLYQWVEK 1580
            L+V K SL+QW+++
Sbjct: 843  LMVTKASLHQWLQR 856


>ref|XP_006577707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 597

 Score =  328 bits (842), Expect = 3e-87
 Identities = 194/545 (35%), Positives = 304/545 (55%), Gaps = 20/545 (3%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ+ G+KPD+ +YNILLKAC VA   DL   IY  LK     G +K+DV+TYST+I+
Sbjct: 69   YQNMQNLGLKPDMTSYNILLKACCVAGRVDLTQGIYRELKHLESVGQLKLDVLTYSTIIK 128

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D K    AL+IK+DM  AG+  +++ W+ L  ACA+ GLVE+A+ LF+EM+L+G +P
Sbjct: 129  VFADVKLWQMALKIKQDMLSAGVSLNIVAWSSLSNACAHAGLVEQAIQLFEEMLLAGCEP 188

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWKEKGFYAGSLKRYK----KGNLPDNFSAPPSC 533
            N+QC N +L+A V+ +QY R F FF +WK K     S + Y     +G++  N ++ P+ 
Sbjct: 189  NTQCFNIILNACVEAYQYDRGFRFFHSWKGKKMLGSSGEGYNSNLGQGHM-HNVTSMPNG 247

Query: 534  IPK-----------FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTWSMLID 680
            I             F PT  TY  L+K C +  Y A+ ++ EM+  G+ P+ ++WS+LID
Sbjct: 248  ISNSHILSFSERFPFTPTTTTYYILLKPCGTDYYHAKALIKEMETVGLSPNQISWSILID 307

Query: 681  AYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKKKNVQP 860
              G +A+++G ++    M +AG+KP V+ YTT +KVCV++K F +A+ ++E MK   ++P
Sbjct: 308  ICGASANVEGAIEILKTMGDAGIKPGVIAYTTAMKVCVESKNFMQALTLYEEMKCYEIRP 367

Query: 861  NAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVIQKSYS 1040
            + +TYNT+L+   ++G   +V   L++Y++MRK+G+ PND +L+ LIEEW EGVIQ +  
Sbjct: 368  SWVTYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQDN-- 425

Query: 1041 TSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSD-FTVDLRGLSETETXXXXX 1217
                 + K  +    +++  ++   +  EKIA     + +D   +D++GL++ E      
Sbjct: 426  -----REKQGEFSSSNKSESERPHSLLLEKIAAHLLKRVADILAIDVQGLTKVEAHLVVL 480

Query: 1218 XXXXXXXEKHGPGNPIDSDLIIIIG---YSGNLQKQRLE-SSHITKILNQELGLLVLQNR 1385
                   E +  G+ ++ D++IIIG      N  K  LE    I K+L  EL L V   R
Sbjct: 481  AVLRMIKENYSLGHSVNDDILIIIGATKVDENPSKHILEVQEAIIKLLRNELELEVFPAR 540

Query: 1386 DYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPKKSLY 1565
                            S +N       N  GF           +R   + RL V KKSLY
Sbjct: 541  TRLALCDTAKLEYPNRSNLNIEALPGENALGFQT---------RRPGVLVRLKVTKKSLY 591

Query: 1566 QWVEK 1580
             W+ +
Sbjct: 592  SWLHR 596


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  325 bits (834), Expect = 3e-86
 Identities = 195/549 (35%), Positives = 298/549 (54%), Gaps = 24/549 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+
Sbjct: 323  YKNMQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIK 382

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +P
Sbjct: 383  VFADAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 442

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWK----EKGFYAGS------------LKRYKKG 497
            NSQC N LL A V+  QY RAF  F++WK     +  YA              LK    G
Sbjct: 443  NSQCFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDIVSKGRTSSPNILKNNGPG 502

Query: 498  NLPDNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTW 665
            +L +  S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TW
Sbjct: 503  SLVNRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITW 562

Query: 666  SMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKK 845
            S LID  G + D++G ++    M  AG +PDVV YTT IK+C +NK    A  +FE M++
Sbjct: 563  STLIDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRR 622

Query: 846  KNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVI 1025
              ++PN +TYNT+L+   ++G  ++V   L++Y++MR +G+ PND FLK LIEEW EGVI
Sbjct: 623  YQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVI 682

Query: 1026 QKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETX 1205
            Q++  +   +        ++ +N  +   ++  +    +    + +  +DL+GL++ E  
Sbjct: 683  QENGQSQDKIS------DQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEAR 736

Query: 1206 XXXXXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLV 1373
                       E +  G+ +  D++IIIG    N    + E      + K+L  EL L+V
Sbjct: 737  LVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVV 796

Query: 1374 LQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPK 1553
            L       P        D   C+++   +   K+  S+S+       +R   ++RL+V K
Sbjct: 797  L-------PAGQRNIIQD-AHCVDD-ADQENTKSFVSISS------TRRPAILERLMVTK 841

Query: 1554 KSLYQWVEK 1580
             SLYQW+++
Sbjct: 842  ASLYQWLQR 850


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  325 bits (833), Expect = 3e-86
 Identities = 195/549 (35%), Positives = 298/549 (54%), Gaps = 24/549 (4%)
 Frame = +3

Query: 6    FHDMQSSGVKPDIATYNILLKACRVARNADLAIDIYEGLKEKSIDGGIKMDVITYSTLIQ 185
            + +MQ   V  D+ +YNILLK C +A   DLA DIY+  K     G +K+D  TY T+I+
Sbjct: 323  YKNMQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLLKLDAFTYCTIIK 382

Query: 186  VVGDSKNLAKALEIKRDMEVAGILPDMITWNMLLTACANVGLVEKALVLFDEMVLSGYQP 365
            V  D+K    AL++K DM+  G+ P+  TW+ L++ACAN GLVE+A  LF+EM+ SG +P
Sbjct: 383  VFADAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 442

Query: 366  NSQCCNALLSAFVKDFQYARAFTFFKNWK----EKGFYAGS------------LKRYKKG 497
            NSQC N LL A V+  QY RAF  F++WK     +  YA              LK    G
Sbjct: 443  NSQCFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDIVSKGRTSSPNILKNNGPG 502

Query: 498  NLPDNFSAPPSCIPK----FKPTVVTYNTLMKACRSTPYLARTMMDEMKDNGIVPDNVTW 665
            +L +  S  P         FKPT  TYN L+KAC +  Y  + +MDEMK  G+ P+ +TW
Sbjct: 503  SLVNRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITW 562

Query: 666  SMLIDAYGNTADLQGCMQAFNNMCEAGVKPDVVTYTTLIKVCVKNKKFTKAIMMFEAMKK 845
            S LID  G + D++G ++    M  AG +PDVV YTT IK+C +NK    A  +FE M++
Sbjct: 563  STLIDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRR 622

Query: 846  KNVQPNAITYNTILRGWREHGDHMQVHALLSLYEEMRKSGFPPNDEFLKGLIEEWAEGVI 1025
              ++PN +TYNT+L+   ++G  ++V   L++Y++MR +G+ PND FLK LIEEW EGVI
Sbjct: 623  YQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVI 682

Query: 1026 QKSYSTSQPMKLKGNQHQRKDENTQQKSDIIFFEKIAGLACSKSSDFTVDLRGLSETETX 1205
            Q++  +   +        ++ +N  +   ++  +    +    + +  +DL+GL++ E  
Sbjct: 683  QENGRSQDKIS------DQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEAR 736

Query: 1206 XXXXXXXXXXXEKHGPGNPIDSDLIIIIGY-SGNLQKQRLE---SSHITKILNQELGLLV 1373
                       E +  G+ +  D++IIIG    N    + E      + K+L  EL L+V
Sbjct: 737  LVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVV 796

Query: 1374 LQNRDYDQPVVAGTSSSDILSCINNRRTKAGNKTGFSVSNLNERMPLKRLPTVQRLIVPK 1553
            L       P        D   C+++   +   K+  S+S+       +R   ++RL+V K
Sbjct: 797  L-------PAGQRNIIQD-AHCVDD-ADQENTKSFVSISS------TRRPAILERLMVTK 841

Query: 1554 KSLYQWVEK 1580
             SLYQW+++
Sbjct: 842  ASLYQWLQR 850

