BLASTX nr result

ID: Rehmannia28_contig00001387 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00001387
         (4782 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011079552.1| PREDICTED: probable glutamyl endopeptidase, ...   821   0.0  
gb|EYU46808.1| hypothetical protein MIMGU_mgv1a000875mg [Erythra...   814   0.0  
ref|XP_012834176.1| PREDICTED: probable glutamyl endopeptidase, ...   814   0.0  
emb|CBI36950.3| unnamed protein product [Vitis vinifera]              789   0.0  
ref|XP_010652242.1| PREDICTED: probable glutamyl endopeptidase, ...   789   0.0  
ref|XP_010652241.1| PREDICTED: probable glutamyl endopeptidase, ...   789   0.0  
ref|XP_010665808.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-l...   780   0.0  
gb|KJB49819.1| hypothetical protein B456_008G139300 [Gossypium r...   770   0.0  
gb|KCW58331.1| hypothetical protein EUGRSUZ_H01015 [Eucalyptus g...   782   0.0  
ref|XP_010069839.1| PREDICTED: probable glutamyl endopeptidase, ...   782   0.0  
gb|KCW58332.1| hypothetical protein EUGRSUZ_H01015 [Eucalyptus g...   782   0.0  
ref|XP_010681849.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-l...   775   0.0  
gb|EEF47267.1| dipeptidyl-peptidase, putative [Ricinus communis]      778   0.0  
ref|XP_012082895.1| PREDICTED: probable glutamyl endopeptidase, ...   779   0.0  
ref|XP_007051106.1| Prolyl oligopeptidase family protein [Theobr...   779   0.0  
ref|XP_015572385.1| PREDICTED: LOW QUALITY PROTEIN: probable glu...   778   0.0  
ref|XP_010259303.1| PREDICTED: probable glutamyl endopeptidase, ...   776   0.0  
ref|XP_010259302.1| PREDICTED: probable glutamyl endopeptidase, ...   776   0.0  
gb|KJB49818.1| hypothetical protein B456_008G139300 [Gossypium r...   770   0.0  
ref|XP_009774674.1| PREDICTED: probable glutamyl endopeptidase, ...   768   0.0  

>ref|XP_011079552.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic [Sesamum
            indicum]
          Length = 955

 Score =  821 bits (2121), Expect = 0.0
 Identities = 406/475 (85%), Positives = 426/475 (89%), Gaps = 4/475 (0%)
 Frame = -3

Query: 4780 IRTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLL 4601
            +RTWIISPG ESA+PRILFDRSSEDVYSDPGSPMLR+T  GTYVIAKIKKE +EGTYLLL
Sbjct: 483  LRTWIISPGCESASPRILFDRSSEDVYSDPGSPMLRKTSIGTYVIAKIKKESDEGTYLLL 542

Query: 4600 NGSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILT 4421
            NGSGATPQGNIPFLDLF+INTGNKERIWESDKEKY+ETVVALM DQDEG+I++NQL++LT
Sbjct: 543  NGSGATPQGNIPFLDLFNINTGNKERIWESDKEKYYETVVALMCDQDEGDIYVNQLRVLT 602

Query: 4420 SKESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPP 4241
            SKESKTENTQY+ILSWP+KKA Q+TNFPHPYPQLSSLKKEMIRY+R DGVQLTATLYLPP
Sbjct: 603  SKESKTENTQYFILSWPEKKATQVTNFPHPYPQLSSLKKEMIRYQRSDGVQLTATLYLPP 662

Query: 4240 DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 4061
            DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIG TSP+LWLARRFAILSGPT
Sbjct: 663  DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSPMLWLARRFAILSGPT 722

Query: 4060 IPIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 3881
            IPIIGEGNEEANDRY              VIRRGVAHP+KIAVGGHSYGAFMTANLLAHA
Sbjct: 723  IPIIGEGNEEANDRYVEQLVASAEAAVEEVIRRGVAHPDKIAVGGHSYGAFMTANLLAHA 782

Query: 3880 PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEE 3701
            PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAV TYVEMSPFISANKIKKPILLIHGEE
Sbjct: 783  PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVDTYVEMSPFISANKIKKPILLIHGEE 842

Query: 3700 DNNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCV- 3524
            DNN GTLTMQSDRF+NALKGHGALCRLV+LPFESHGYAARESVMHVLWETDRWLQ YCV 
Sbjct: 843  DNNPGTLTMQSDRFYNALKGHGALCRLVILPFESHGYAARESVMHVLWETDRWLQNYCVT 902

Query: 3523 ANSSDTSEDPN---ESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            ANSSD  EDPN   E AS  ISD ESK         AE  DHEIDK+H   RSSL
Sbjct: 903  ANSSDAGEDPNEHEEDASCSISDGESK--VGAAGGVAERPDHEIDKIHTMHRSSL 955


>gb|EYU46808.1| hypothetical protein MIMGU_mgv1a000875mg [Erythranthe guttata]
          Length = 953

 Score =  814 bits (2102), Expect = 0.0
 Identities = 402/472 (85%), Positives = 423/472 (89%), Gaps = 1/472 (0%)
 Frame = -3

Query: 4780 IRTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLL 4601
            IRTWIISP SES +PRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKI+KEG+EGTYLLL
Sbjct: 483  IRTWIISPQSESVSPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIRKEGDEGTYLLL 542

Query: 4600 NGSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILT 4421
            NGSGATPQGN+PFLDLFDINTGNKERIWESDKEKY+ETVVALMSDQDE E++L+QLK+LT
Sbjct: 543  NGSGATPQGNVPFLDLFDINTGNKERIWESDKEKYYETVVALMSDQDEREMYLHQLKVLT 602

Query: 4420 SKESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPP 4241
            SKESKTENTQYY+ SWP+KKACQ+TNFPHPYPQLSSLKKEMIRYER DGVQLTATLYLPP
Sbjct: 603  SKESKTENTQYYLFSWPEKKACQVTNFPHPYPQLSSLKKEMIRYERSDGVQLTATLYLPP 662

Query: 4240 DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 4061
             YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT
Sbjct: 663  GYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 722

Query: 4060 IPIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 3881
            IPIIGEGNEEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHA
Sbjct: 723  IPIIGEGNEEANDRYVEQLVASAEAAVKEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 782

Query: 3880 PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEE 3701
            P+LF CGIARSGAYNRTLTPFGFQ+EDRTLWEAV TYVEMSPFISANKIKKPILLIHGEE
Sbjct: 783  PNLFSCGIARSGAYNRTLTPFGFQSEDRTLWEAVNTYVEMSPFISANKIKKPILLIHGEE 842

Query: 3700 DNNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVA 3521
            DNN GTLTMQSDRF+NALKGHGALCRLV+LPFESHGYAARESVMHVLWETDRWLQK+CV 
Sbjct: 843  DNNPGTLTMQSDRFYNALKGHGALCRLVILPFESHGYAARESVMHVLWETDRWLQKHCVD 902

Query: 3520 NSSDTSEDPNESASRGISDAESKXXXXXXXXXAEN-SDHEIDKVHITCRSSL 3368
            NSSD    P E+A+ GI+DAE+K              D EID V I  RSSL
Sbjct: 903  NSSDPIA-PEENANTGITDAENKAVGVAGGVAENQIPDDEIDNVQIMRRSSL 953


>ref|XP_012834176.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic
            [Erythranthe guttata]
          Length = 957

 Score =  814 bits (2102), Expect = 0.0
 Identities = 402/472 (85%), Positives = 423/472 (89%), Gaps = 1/472 (0%)
 Frame = -3

Query: 4780 IRTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLL 4601
            IRTWIISP SES +PRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKI+KEG+EGTYLLL
Sbjct: 483  IRTWIISPQSESVSPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIRKEGDEGTYLLL 542

Query: 4600 NGSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILT 4421
            NGSGATPQGN+PFLDLFDINTGNKERIWESDKEKY+ETVVALMSDQDE E++L+QLK+LT
Sbjct: 543  NGSGATPQGNVPFLDLFDINTGNKERIWESDKEKYYETVVALMSDQDEREMYLHQLKVLT 602

Query: 4420 SKESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPP 4241
            SKESKTENTQYY+ SWP+KKACQ+TNFPHPYPQLSSLKKEMIRYER DGVQLTATLYLPP
Sbjct: 603  SKESKTENTQYYLFSWPEKKACQVTNFPHPYPQLSSLKKEMIRYERSDGVQLTATLYLPP 662

Query: 4240 DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 4061
             YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT
Sbjct: 663  GYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 722

Query: 4060 IPIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 3881
            IPIIGEGNEEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHA
Sbjct: 723  IPIIGEGNEEANDRYVEQLVASAEAAVKEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 782

Query: 3880 PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEE 3701
            P+LF CGIARSGAYNRTLTPFGFQ+EDRTLWEAV TYVEMSPFISANKIKKPILLIHGEE
Sbjct: 783  PNLFSCGIARSGAYNRTLTPFGFQSEDRTLWEAVNTYVEMSPFISANKIKKPILLIHGEE 842

Query: 3700 DNNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVA 3521
            DNN GTLTMQSDRF+NALKGHGALCRLV+LPFESHGYAARESVMHVLWETDRWLQK+CV 
Sbjct: 843  DNNPGTLTMQSDRFYNALKGHGALCRLVILPFESHGYAARESVMHVLWETDRWLQKHCVD 902

Query: 3520 NSSDTSEDPNESASRGISDAESKXXXXXXXXXAEN-SDHEIDKVHITCRSSL 3368
            NSSD    P E+A+ GI+DAE+K              D EID V I  RSSL
Sbjct: 903  NSSDPIA-PEENANTGITDAENKAVGVAGGVAENQIPDDEIDNVQIMRRSSL 953


>emb|CBI36950.3| unnamed protein product [Vitis vinifera]
          Length = 913

 Score =  789 bits (2038), Expect = 0.0
 Identities = 383/473 (80%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGSE  +PRILFDRSSEDVYSDPGSPMLRRT  GTYVIAKIKKE +EGTY+LLN
Sbjct: 432  RTWVISPGSEDVSPRILFDRSSEDVYSDPGSPMLRRTTAGTYVIAKIKKENDEGTYILLN 491

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLFDINTG+KERIWESDKEKY+ETVVALMSDQ EG+++LNQLKILTS
Sbjct: 492  GSGATPEGNIPFLDLFDINTGSKERIWESDKEKYYETVVALMSDQSEGDLYLNQLKILTS 551

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQY+I SW DKKACQITNFPHPYPQL+SL+KEMIRYERKDGVQLTATLYLPP 
Sbjct: 552  KESKTENTQYFIQSWLDKKACQITNFPHPYPQLASLQKEMIRYERKDGVQLTATLYLPPG 611

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 612  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 671

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEGNEEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 672  PIIGEGNEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 731

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIK+P+LLIHGEED
Sbjct: 732  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATDTYVEMSPFMSANKIKRPVLLIHGEED 791

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQK+CV+N
Sbjct: 792  NNPGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKHCVSN 851

Query: 3517 SSDTSED---PNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +++ +E+    N+ A   I+D ESK          E ++ E +  H   R+SL
Sbjct: 852  TTNVNENLDTCNDEAKEEITDPESKTVPASGGGNPELAESEHEGFHPRARASL 904


>ref|XP_010652242.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X2
            [Vitis vinifera]
          Length = 962

 Score =  789 bits (2038), Expect = 0.0
 Identities = 383/473 (80%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGSE  +PRILFDRSSEDVYSDPGSPMLRRT  GTYVIAKIKKE +EGTY+LLN
Sbjct: 490  RTWVISPGSEDVSPRILFDRSSEDVYSDPGSPMLRRTTAGTYVIAKIKKENDEGTYILLN 549

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLFDINTG+KERIWESDKEKY+ETVVALMSDQ EG+++LNQLKILTS
Sbjct: 550  GSGATPEGNIPFLDLFDINTGSKERIWESDKEKYYETVVALMSDQSEGDLYLNQLKILTS 609

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQY+I SW DKKACQITNFPHPYPQL+SL+KEMIRYERKDGVQLTATLYLPP 
Sbjct: 610  KESKTENTQYFIQSWLDKKACQITNFPHPYPQLASLQKEMIRYERKDGVQLTATLYLPPG 669

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 670  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 729

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEGNEEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 730  PIIGEGNEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 789

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIK+P+LLIHGEED
Sbjct: 790  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATDTYVEMSPFMSANKIKRPVLLIHGEED 849

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQK+CV+N
Sbjct: 850  NNPGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKHCVSN 909

Query: 3517 SSDTSED---PNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +++ +E+    N+ A   I+D ESK          E ++ E +  H   R+SL
Sbjct: 910  TTNVNENLDTCNDEAKEEITDPESKTVPASGGGNPELAESEHEGFHPRARASL 962


>ref|XP_010652241.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X1
            [Vitis vinifera]
          Length = 963

 Score =  789 bits (2038), Expect = 0.0
 Identities = 383/473 (80%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGSE  +PRILFDRSSEDVYSDPGSPMLRRT  GTYVIAKIKKE +EGTY+LLN
Sbjct: 490  RTWVISPGSEDVSPRILFDRSSEDVYSDPGSPMLRRTTAGTYVIAKIKKENDEGTYILLN 549

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLFDINTG+KERIWESDKEKY+ETVVALMSDQ EG+++LNQLKILTS
Sbjct: 550  GSGATPEGNIPFLDLFDINTGSKERIWESDKEKYYETVVALMSDQSEGDLYLNQLKILTS 609

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQY+I SW DKKACQITNFPHPYPQL+SL+KEMIRYERKDGVQLTATLYLPP 
Sbjct: 610  KESKTENTQYFIQSWLDKKACQITNFPHPYPQLASLQKEMIRYERKDGVQLTATLYLPPG 669

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 670  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 729

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEGNEEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 730  PIIGEGNEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 789

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIK+P+LLIHGEED
Sbjct: 790  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATDTYVEMSPFMSANKIKRPVLLIHGEED 849

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQK+CV+N
Sbjct: 850  NNPGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKHCVSN 909

Query: 3517 SSDTSED---PNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +++ +E+    N+ A   I+D ESK          E ++ E +  H   R+SL
Sbjct: 910  TTNVNENLDTCNDEAKEEITDPESKTVPASGGGNPELAESEHEGFHPRARASL 962


>ref|XP_010665808.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-like [Beta vulgaris subsp.
            vulgaris]
          Length = 789

 Score =  780 bits (2015), Expect = 0.0
 Identities = 379/732 (51%), Positives = 496/732 (67%), Gaps = 29/732 (3%)
 Frame = -1

Query: 2151 AVVDNGDEDEFDVGGCLLHITRKTIKDMHQLYCRHARAVGFSVRKSTTRKSTAETGVVV- 1975
            ++V+   EDE D+ G L+ + +KTI+++++LY  H+ A+GFS+RK T+R    E  V + 
Sbjct: 59   SIVEECTEDELDISGSLIGVQKKTIEELYELYRLHSGALGFSIRKYTSRIGRVEGNVYLT 118

Query: 1974 -EKYYVCSSAGFKQSAAE-------------------KKRANVTRTGCKASIRVKLNGEG 1855
             EKY+VCS  G     ++                   ++R  VT+TGC A +RVKLN  G
Sbjct: 119  KEKYFVCSCHGKPDGQSDFVSVPDVVVDEDGNERKRKQRRVIVTKTGCNAMMRVKLNDSG 178

Query: 1854 LYEVVQHIVEHNHQLTRTEWSHLHRSERVITNEKGKAIEDMISCGMKATQSYRYMVHDAG 1675
            +YEV+ H++ HNH+LTRTEW H HRSER I + K K I  M    M+    YRY  H  G
Sbjct: 179  MYEVIGHVLVHNHELTRTEWQHYHRSERAIGDGKAKEITVMTEASMRPAVQYRYECHQYG 238

Query: 1674 GEQSLGHTLKDHLNFVNRLKMKAIEGGDAQTLIDILYQQDVEEEDFFFRVKLDEFGRLSN 1495
            G ++LGHT +DH NFVNRLKMKAIEGGDAQ +ID L Q+  +E+DFF+RVKLD  GRL N
Sbjct: 239  GAEALGHTSRDHYNFVNRLKMKAIEGGDAQVVIDKLSQRAADEDDFFYRVKLDSMGRLCN 298

Query: 1494 VFWRDSMMKEDYKIYGDVMIFDTTYRTNKYNLICAPFVGVNNHWKNVMFGCAFLSDEKSE 1315
            VFWRDSMMK+DY++YGDV +FDTTYRTN+YNLIC  FVG+NNHWKN+MFGC FLSDE  +
Sbjct: 299  VFWRDSMMKDDYELYGDVTVFDTTYRTNRYNLICGAFVGINNHWKNIMFGCCFLSDETID 358

Query: 1314 SFEWLFQVFTKSMGGKCPVTLFTDQDLAISNAIEKVFPQTRHRLCLWHLHQNAISRFGRL 1135
            +F WLF+VF K MGG CPV++FTDQD A+SNA+ +VFP TR RLC WHL+QNA+S FG L
Sbjct: 359  TFVWLFEVFKKCMGGVCPVSIFTDQDQAMSNALSQVFPNTRARLCQWHLYQNAVSHFGTL 418

Query: 1134 KKNNSFKEAFNKCLTGCVDERDFEICWNSMILEYELHDDSWFRRMYDLKEKWCTALSKDI 955
            K +++F +AF KCL GC D  +FE  W+ M+ +Y L  D WF R+Y LK+KW TALSKD 
Sbjct: 419  KSDDTFSDAFKKCLRGCYDSVEFEASWDYMMKKYGLEGDKWFARLYQLKDKWSTALSKDF 478

Query: 954  FSAGILSSQRSESTNNAIGFNAKKTTTLTEFYGIFQGTLKRWRNNEKENEFQCSRSIPTS 775
            FSAGILSSQRSESTNNAIGF A KTT+L +F+G+FQ T+ RWR  E  +EF  S+S+P S
Sbjct: 479  FSAGILSSQRSESTNNAIGFQASKTTSLYDFFGMFQNTISRWRQTETNDEFVDSKSVPKS 538

Query: 774  VLPLTGMLRHAAEVYTLTLFKEFESEFIKSISTECIIVRVEENIMVYDISMPSDGGCCHR 595
              P+TGMLRHA+++YT T+F++FE EF  S+ T C ++ +++ I+VY +         H+
Sbjct: 539  YFPMTGMLRHASQIYTATMFRDFEQEFGYSLGTICELLTMDDTILVYKVWPEKHPQRTHQ 598

Query: 594  VVYDCLNMLINCSCKKFEECGLLCCHSLRVFHMHSISKIPECYIVKRWTKFVKTELWGKF 415
            V +DC+N  ++CSC+ +EE G+LC H LRV HMHS+S+IP  YI++RWTKF KTE+W + 
Sbjct: 599  VTFDCVNKFVSCSCRNYEEVGMLCYHCLRVLHMHSVSEIPSSYILRRWTKFAKTEVWDRL 658

Query: 414  NKM-AESSHRPKDCVPWRHQMARNYYNLILKCQDNEEARRIIEDGYNRDSLAVDALMNTL 238
             +     S   KDC+PWR QM+R Y NLI++  DNEEAR+I+E GY RD   V  +   +
Sbjct: 659  KQQRVNMSSLKKDCIPWRFQMSRVYNNLIIRSHDNEEARKIMEKGYMRDLADVLTVFANI 718

Query: 237  N------STGQHEVSNSSN-SILDPTRSVTKGRRQRIKGHFQQXXXXXXXXXXXXXXXXK 79
            N        G+   SNSS  ++ DP    TKGR  R K   ++                K
Sbjct: 719  NLGDSESGVGESTASNSSTITVFDPPWVKTKGRSVRPKSGIEK-SKRGRGRGKCVSNVAK 777

Query: 78   EFGSKTPNPRLF 43
            EFGS TP  RLF
Sbjct: 778  EFGSYTPPARLF 789


>gb|KJB49819.1| hypothetical protein B456_008G139300 [Gossypium raimondii]
          Length = 585

 Score =  770 bits (1989), Expect = 0.0
 Identities = 370/459 (80%), Positives = 409/459 (89%), Gaps = 3/459 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGS+  +PRILFDRSSEDVYSDPGSPMLRRT TG YVIAK++KE ++ TYLLLN
Sbjct: 112  RTWVISPGSKDVSPRILFDRSSEDVYSDPGSPMLRRTSTGNYVIAKLRKENDDATYLLLN 171

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            G+GATP+G+IPFLDLFDINTG+KERIWESDKEKY+E+VVAL+SDQ EG+IH+N LKILTS
Sbjct: 172  GNGATPEGDIPFLDLFDINTGSKERIWESDKEKYYESVVALLSDQKEGDIHINDLKILTS 231

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKK CQIT+FPHPYPQL+SL+K+MIRYERKDGVQLTATLYLPP 
Sbjct: 232  KESKTENTQYYIQSWPDKKLCQITDFPHPYPQLASLQKDMIRYERKDGVQLTATLYLPPG 291

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 292  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 351

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 352  PIIGEGDEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 411

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SAN+IKKPILLIHGEED
Sbjct: 412  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSANRIKKPILLIHGEED 471

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN+GTLTMQSDRFFNALKGHGALCRLV+LPFESHGY+ARES+MHVLWETDRWLQK+CV+N
Sbjct: 472  NNAGTLTMQSDRFFNALKGHGALCRLVILPFESHGYSARESIMHVLWETDRWLQKHCVSN 531

Query: 3517 SSDTSEDPNES---ASRGISDAESKXXXXXXXXXAENSD 3410
            +S+ S D  +S     + ++D E+K         AE SD
Sbjct: 532  TSEVSADIGKSKDGEGKEVTDIENKAVAASGGGGAELSD 570


>gb|KCW58331.1| hypothetical protein EUGRSUZ_H01015 [Eucalyptus grandis]
          Length = 967

 Score =  782 bits (2019), Expect = 0.0
 Identities = 377/473 (79%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            R+W+ISPGS+  NPRILFDRSSED YSDPGSPMLRRTP GTYVIAK+KK  +EGTY+LLN
Sbjct: 495  RSWVISPGSKDTNPRILFDRSSEDAYSDPGSPMLRRTPAGTYVIAKVKKGNDEGTYVLLN 554

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLF+INTG+KERIW+SDKEKYFETVVALMSDQ++G+I L+QLKILTS
Sbjct: 555  GSGATPEGNIPFLDLFEINTGSKERIWQSDKEKYFETVVALMSDQNDGDISLDQLKILTS 614

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI+SWPD+KACQIT+FPHPYPQL+SL KEMIRY+RKDGVQLTATLYLPP+
Sbjct: 615  KESKTENTQYYIMSWPDRKACQITDFPHPYPQLASLNKEMIRYQRKDGVQLTATLYLPPN 674

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP ++GPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TSPLLWLARRFAILSGPTI
Sbjct: 675  YDPLKEGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSPLLWLARRFAILSGPTI 734

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG EEANDRY              VIRRGVAHPNKIA+GGHSYGAFMTANLLAHAP
Sbjct: 735  PIIGEGEEEANDRYVEQLVGSAEAAVEEVIRRGVAHPNKIAIGGHSYGAFMTANLLAHAP 794

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCG+ARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SA+KIKKPILLIHGEED
Sbjct: 795  HLFCCGVARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSAHKIKKPILLIHGEED 854

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARES+MHVLWETDRWLQKY V+ 
Sbjct: 855  NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESIMHVLWETDRWLQKYSVSA 914

Query: 3517 SSDTSEDPN---ESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD + D N   ++ S+G+  +E K          E +D + D++    RSSL
Sbjct: 915  ASDVTTDVNSCKDAESKGVVGSEVKVVGASGGGNPELADFDQDQLKCIPRSSL 967


>ref|XP_010069839.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic [Eucalyptus
            grandis]
          Length = 968

 Score =  782 bits (2019), Expect = 0.0
 Identities = 377/473 (79%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            R+W+ISPGS+  NPRILFDRSSED YSDPGSPMLRRTP GTYVIAK+KK  +EGTY+LLN
Sbjct: 496  RSWVISPGSKDTNPRILFDRSSEDAYSDPGSPMLRRTPAGTYVIAKVKKGNDEGTYVLLN 555

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLF+INTG+KERIW+SDKEKYFETVVALMSDQ++G+I L+QLKILTS
Sbjct: 556  GSGATPEGNIPFLDLFEINTGSKERIWQSDKEKYFETVVALMSDQNDGDISLDQLKILTS 615

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI+SWPD+KACQIT+FPHPYPQL+SL KEMIRY+RKDGVQLTATLYLPP+
Sbjct: 616  KESKTENTQYYIMSWPDRKACQITDFPHPYPQLASLNKEMIRYQRKDGVQLTATLYLPPN 675

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP ++GPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TSPLLWLARRFAILSGPTI
Sbjct: 676  YDPLKEGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSPLLWLARRFAILSGPTI 735

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG EEANDRY              VIRRGVAHPNKIA+GGHSYGAFMTANLLAHAP
Sbjct: 736  PIIGEGEEEANDRYVEQLVGSAEAAVEEVIRRGVAHPNKIAIGGHSYGAFMTANLLAHAP 795

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCG+ARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SA+KIKKPILLIHGEED
Sbjct: 796  HLFCCGVARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSAHKIKKPILLIHGEED 855

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARES+MHVLWETDRWLQKY V+ 
Sbjct: 856  NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESIMHVLWETDRWLQKYSVSA 915

Query: 3517 SSDTSEDPN---ESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD + D N   ++ S+G+  +E K          E +D + D++    RSSL
Sbjct: 916  ASDVTTDVNSCKDAESKGVVGSEVKVVGASGGGNPELADFDQDQLKCIPRSSL 968


>gb|KCW58332.1| hypothetical protein EUGRSUZ_H01015 [Eucalyptus grandis]
          Length = 971

 Score =  782 bits (2019), Expect = 0.0
 Identities = 377/473 (79%), Positives = 417/473 (88%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            R+W+ISPGS+  NPRILFDRSSED YSDPGSPMLRRTP GTYVIAK+KK  +EGTY+LLN
Sbjct: 495  RSWVISPGSKDTNPRILFDRSSEDAYSDPGSPMLRRTPAGTYVIAKVKKGNDEGTYVLLN 554

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLF+INTG+KERIW+SDKEKYFETVVALMSDQ++G+I L+QLKILTS
Sbjct: 555  GSGATPEGNIPFLDLFEINTGSKERIWQSDKEKYFETVVALMSDQNDGDISLDQLKILTS 614

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI+SWPD+KACQIT+FPHPYPQL+SL KEMIRY+RKDGVQLTATLYLPP+
Sbjct: 615  KESKTENTQYYIMSWPDRKACQITDFPHPYPQLASLNKEMIRYQRKDGVQLTATLYLPPN 674

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP ++GPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TSPLLWLARRFAILSGPTI
Sbjct: 675  YDPLKEGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSPLLWLARRFAILSGPTI 734

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG EEANDRY              VIRRGVAHPNKIA+GGHSYGAFMTANLLAHAP
Sbjct: 735  PIIGEGEEEANDRYVEQLVGSAEAAVEEVIRRGVAHPNKIAIGGHSYGAFMTANLLAHAP 794

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCG+ARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SA+KIKKPILLIHGEED
Sbjct: 795  HLFCCGVARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSAHKIKKPILLIHGEED 854

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARES+MHVLWETDRWLQKY V+ 
Sbjct: 855  NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESIMHVLWETDRWLQKYSVSA 914

Query: 3517 SSDTSEDPN---ESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD + D N   ++ S+G+  +E K          E +D + D++    RSSL
Sbjct: 915  ASDVTTDVNSCKDAESKGVVGSEVKVVGASGGGNPELADFDQDQLKCIPRSSL 967


>ref|XP_010681849.1| PREDICTED: protein FAR1-RELATED SEQUENCE 5-like [Beta vulgaris subsp.
            vulgaris]
          Length = 789

 Score =  775 bits (2001), Expect = 0.0
 Identities = 378/732 (51%), Positives = 495/732 (67%), Gaps = 29/732 (3%)
 Frame = -1

Query: 2151 AVVDNGDEDEFDVGGCLLHITRKTIKDMHQLYCRHARAVGFSVRKSTTRKSTAETGVVV- 1975
            ++V+   EDE D+ G L+ + +KTI+++++LY  H+ A+GFS+RK T+R    E  V + 
Sbjct: 59   SIVEECTEDELDISGSLIGVQKKTIEELYELYRLHSGALGFSIRKYTSRIGRVEGNVYLT 118

Query: 1974 -EKYYVCSSAGFKQSAAE-------------------KKRANVTRTGCKASIRVKLNGEG 1855
             EKY+VCS  G     ++                   ++R  VT+TGC A +RVKLN  G
Sbjct: 119  KEKYFVCSCHGKPDGQSDFVSVPDVVVDEDGNERKRKQRRVIVTKTGCNAMMRVKLNDSG 178

Query: 1854 LYEVVQHIVEHNHQLTRTEWSHLHRSERVITNEKGKAIEDMISCGMKATQSYRYMVHDAG 1675
            +YEV+ H++ HNH+LTRTEW H HRSER I + K K I  M    M+    YRY  H  G
Sbjct: 179  MYEVIGHVLVHNHELTRTEWQHYHRSERAIGDGKAKEITVMTEASMRPAVQYRYECHQYG 238

Query: 1674 GEQSLGHTLKDHLNFVNRLKMKAIEGGDAQTLIDILYQQDVEEEDFFFRVKLDEFGRLSN 1495
            G ++LGHT +DH NFVNRLKMKAIEGGDAQ +ID L Q+  +E+DFF+RVKLD  GRL N
Sbjct: 239  GAEALGHTSRDHYNFVNRLKMKAIEGGDAQVVIDKLSQRAADEDDFFYRVKLDSMGRLCN 298

Query: 1494 VFWRDSMMKEDYKIYGDVMIFDTTYRTNKYNLICAPFVGVNNHWKNVMFGCAFLSDEKSE 1315
            VFWRDSMMK+DY++YGDV +FDTTYRTN+YNLIC  FVG+NNHWKN+MFGC FLSDE  +
Sbjct: 299  VFWRDSMMKDDYELYGDVTVFDTTYRTNRYNLICGAFVGINNHWKNIMFGCCFLSDETID 358

Query: 1314 SFEWLFQVFTKSMGGKCPVTLFTDQDLAISNAIEKVFPQTRHRLCLWHLHQNAISRFGRL 1135
            +F WLF+VF K MGG CPV++FTDQD A+SNA+ +VFP TR RLC WHL+QNA+S FG L
Sbjct: 359  TFVWLFEVFKKCMGGVCPVSIFTDQDQAMSNALSQVFPNTRARLCQWHLYQNAVSHFGTL 418

Query: 1134 KKNNSFKEAFNKCLTGCVDERDFEICWNSMILEYELHDDSWFRRMYDLKEKWCTALSKDI 955
            K +++F +AF KCL GC D  +FE  W+ M+ +Y L  D WF R+Y LK+KW TALSKD 
Sbjct: 419  KSDDTFSDAFKKCLRGCYDSVEFEASWDYMMKKYGLEGDKWFARLYQLKDKWSTALSKDF 478

Query: 954  FSAGILSSQRSESTNNAIGFNAKKTTTLTEFYGIFQGTLKRWRNNEKENEFQCSRSIPTS 775
            FSAGILSSQRSESTNNAIGF A KTT+L +F+G+FQ T+ RWR  E  +EF  S+S+P S
Sbjct: 479  FSAGILSSQRSESTNNAIGFQASKTTSLYDFFGMFQNTISRWRQTETNDEFVDSKSVPKS 538

Query: 774  VLPLTGMLRHAAEVYTLTLFKEFESEFIKSISTECIIVRVEENIMVYDISMPSDGGCCHR 595
              P+TGMLRHA+++YT T+F++FE EF  S+ T C ++ +++ I+VY +         H+
Sbjct: 539  YFPMTGMLRHASQIYTATMFRDFEQEFGYSLGTICELLTMDDTILVYKVWPEKHPQRTHQ 598

Query: 594  VVYDCLNMLINCSCKKFEECGLLCCHSLRVFHMHSISKIPECYIVKRWTKFVKTELWGKF 415
            V +DC+N  ++CSC+ +EE G+LC H LRV HMHS+S+IP  YI++RWTKF KTE+  + 
Sbjct: 599  VTFDCVNKFVSCSCRNYEEVGMLCYHCLRVLHMHSVSEIPSSYILRRWTKFAKTEVCDRL 658

Query: 414  NKM-AESSHRPKDCVPWRHQMARNYYNLILKCQDNEEARRIIEDGYNRDSLAVDALMNTL 238
             +     S   KDC+PWR QM+R Y NLI++  DNEEAR+I+E GY RD   V  +   +
Sbjct: 659  KQQRMNMSSLKKDCIPWRFQMSRLYNNLIIRSHDNEEARKIMEKGYMRDLADVLTVFANI 718

Query: 237  N------STGQHEVSNSSN-SILDPTRSVTKGRRQRIKGHFQQXXXXXXXXXXXXXXXXK 79
            N        G+   SNSS  ++ DP    TKGR  R K   ++                K
Sbjct: 719  NLGDSESGVGESTASNSSTVTVFDPPWVKTKGRSVRPKSGIEK-SKRGRGRGKCVSNVAK 777

Query: 78   EFGSKTPNPRLF 43
            EFGS TP  RLF
Sbjct: 778  EFGSYTPPARLF 789


>gb|EEF47267.1| dipeptidyl-peptidase, putative [Ricinus communis]
          Length = 926

 Score =  778 bits (2009), Expect = 0.0
 Identities = 375/473 (79%), Positives = 415/473 (87%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPG+E  +PRILFDRSSEDVYSDPGSPM+RRTP+G YVIAKIKKE +EGTY+LLN
Sbjct: 454  RTWVISPGAEDVSPRILFDRSSEDVYSDPGSPMMRRTPSGNYVIAKIKKENDEGTYVLLN 513

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+G+IPFLDLFDINTG+KERIW+SDKEK++E+VVALMSD  EG+++L+QLK+LTS
Sbjct: 514  GSGATPEGDIPFLDLFDINTGSKERIWQSDKEKHYESVVALMSDIKEGDLYLDQLKVLTS 573

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKKACQITNFPHPYPQL+SL+KEMIRY+RKDGVQLTATLYLPP 
Sbjct: 574  KESKTENTQYYIQSWPDKKACQITNFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLPPG 633

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAIL+GPTI
Sbjct: 634  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSVLLWLARRFAILAGPTI 693

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG++EANDRY              VIRRGVAHP KIAVGGHSYGAFMTANLLAHAP
Sbjct: 694  PIIGEGDDEANDRYVEQLVASAEAAVEEVIRRGVAHPGKIAVGGHSYGAFMTANLLAHAP 753

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIKKPILLIHGEED
Sbjct: 754  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATTTYVEMSPFMSANKIKKPILLIHGEED 813

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NNSGTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQKYCV N
Sbjct: 814  NNSGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKYCVPN 873

Query: 3517 SSDTSED---PNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD + +     + AS+G+ D E+           E +D E +  H   RS L
Sbjct: 874  TSDVNAELGARKDEASKGVIDTENNAVAATGGGGPELADSEHEGFHCIPRSLL 926


>ref|XP_012082895.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X1
            [Jatropha curcas] gi|802690232|ref|XP_012082896.1|
            PREDICTED: probable glutamyl endopeptidase, chloroplastic
            isoform X2 [Jatropha curcas] gi|643716632|gb|KDP28258.1|
            hypothetical protein JCGZ_14029 [Jatropha curcas]
          Length = 961

 Score =  779 bits (2012), Expect = 0.0
 Identities = 376/473 (79%), Positives = 413/473 (87%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTWIISPGS   +PRILFDRSSEDVYSDPGSPM+RRTP+GTYVIAKIKKE ++GTY+LLN
Sbjct: 489  RTWIISPGSTDVSPRILFDRSSEDVYSDPGSPMMRRTPSGTYVIAKIKKENDDGTYVLLN 548

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            G+GATP+GNIPFLDLFDINTGNKERIWESDKEKY+ETVVALMSD  EG+++L+QLKILTS
Sbjct: 549  GNGATPEGNIPFLDLFDINTGNKERIWESDKEKYYETVVALMSDHKEGDLYLDQLKILTS 608

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI  WPDKK  QITNFPHPYPQL+SL+KEMIRY+RKDGVQLTATLYLPPD
Sbjct: 609  KESKTENTQYYIQRWPDKKMFQITNFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLPPD 668

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 669  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 728

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              V+RRGVAHP KIAVGGHSYGAFMTANLLAHAP
Sbjct: 729  PIIGEGDEEANDRYVEQLVASAEAAVEEVVRRGVAHPRKIAVGGHSYGAFMTANLLAHAP 788

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLF CGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SAN+IKKPILLIHGEED
Sbjct: 789  HLFSCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSANRIKKPILLIHGEED 848

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLV+LPFESHGYA+RES+MHVLWETDRWLQKYCV+N
Sbjct: 849  NNPGTLTMQSDRFFNALKGHGALCRLVILPFESHGYASRESIMHVLWETDRWLQKYCVSN 908

Query: 3517 SSDTS---EDPNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD +   +D  +  S+G++D E K          E +D E ++     RS L
Sbjct: 909  TSDVNAELDDSKDDVSKGVTDPEGKAVAASGGGGLELADFEHEEFQYMPRSLL 961


>ref|XP_007051106.1| Prolyl oligopeptidase family protein [Theobroma cacao]
            gi|508703367|gb|EOX95263.1| Prolyl oligopeptidase family
            protein [Theobroma cacao]
          Length = 974

 Score =  779 bits (2012), Expect = 0.0
 Identities = 376/471 (79%), Positives = 418/471 (88%), Gaps = 3/471 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGS+  +PRILFDRSSEDVYSDPGSPMLRRTP GTYVIAKI+KE +EGTY+LLN
Sbjct: 502  RTWVISPGSKDVSPRILFDRSSEDVYSDPGSPMLRRTPAGTYVIAKIRKENDEGTYVLLN 561

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            G+GATP+GNIPFLDLFDINTG+KERIWES+KEKY+E+VVALMSDQ EG+IHL++LKILTS
Sbjct: 562  GNGATPEGNIPFLDLFDINTGSKERIWESNKEKYYESVVALMSDQKEGDIHLHELKILTS 621

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPD+K CQIT+FPHPYPQL+SL+KEMIRY+RKDGVQLTATLYLPP 
Sbjct: 622  KESKTENTQYYIQSWPDRKVCQITDFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLPPG 681

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP+++GPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 682  YDPSKEGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 741

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 742  PIIGEGDEEANDRYVEQLVSSAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 801

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIKKPILL+HGEED
Sbjct: 802  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATTTYVEMSPFMSANKIKKPILLVHGEED 861

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQKYCV+N
Sbjct: 862  NNPGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKYCVSN 921

Query: 3517 SSDTS---EDPNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRS 3374
            +SD S   +   ++AS  ++++E+K         AE +D E ++     RS
Sbjct: 922  TSDISAGLDTSKDAASDEVTESENKVVAASGGSGAELADSENEEFQSKPRS 972


>ref|XP_015572385.1| PREDICTED: LOW QUALITY PROTEIN: probable glutamyl endopeptidase,
            chloroplastic [Ricinus communis]
          Length = 951

 Score =  778 bits (2009), Expect = 0.0
 Identities = 375/473 (79%), Positives = 415/473 (87%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPG+E  +PRILFDRSSEDVYSDPGSPM+RRTP+G YVIAKIKKE +EGTY+LLN
Sbjct: 479  RTWVISPGAEDVSPRILFDRSSEDVYSDPGSPMMRRTPSGNYVIAKIKKENDEGTYVLLN 538

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+G+IPFLDLFDINTG+KERIW+SDKEK++E+VVALMSD  EG+++L+QLK+LTS
Sbjct: 539  GSGATPEGDIPFLDLFDINTGSKERIWQSDKEKHYESVVALMSDIKEGDLYLDQLKVLTS 598

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKKACQITNFPHPYPQL+SL+KEMIRY+RKDGVQLTATLYLPP 
Sbjct: 599  KESKTENTQYYIQSWPDKKACQITNFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLPPG 658

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAIL+GPTI
Sbjct: 659  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSVLLWLARRFAILAGPTI 718

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG++EANDRY              VIRRGVAHP KIAVGGHSYGAFMTANLLAHAP
Sbjct: 719  PIIGEGDDEANDRYVEQLVASAEAAVEEVIRRGVAHPGKIAVGGHSYGAFMTANLLAHAP 778

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIKKPILLIHGEED
Sbjct: 779  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATTTYVEMSPFMSANKIKKPILLIHGEED 838

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NNSGTLTMQSDRFFNALKGHGALCRLV+LPFESHGYAARES+MHVLWETDRWLQKYCV N
Sbjct: 839  NNSGTLTMQSDRFFNALKGHGALCRLVILPFESHGYAARESIMHVLWETDRWLQKYCVPN 898

Query: 3517 SSDTSED---PNESASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            +SD + +     + AS+G+ D E+           E +D E +  H   RS L
Sbjct: 899  TSDVNAELGARKDEASKGVIDTENNAVAATGGGGPELADSEHEGFHCIPRSLL 951


>ref|XP_010259303.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X2
            [Nelumbo nucifera] gi|720010618|ref|XP_010259304.1|
            PREDICTED: probable glutamyl endopeptidase, chloroplastic
            isoform X3 [Nelumbo nucifera]
          Length = 963

 Score =  776 bits (2005), Expect = 0.0
 Identities = 377/473 (79%), Positives = 411/473 (86%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGSE A+PRILFDRSSEDVYSDPGSPM+RRT  GTYVIAK+KKEG+ GTY+LLN
Sbjct: 491  RTWVISPGSEDASPRILFDRSSEDVYSDPGSPMMRRTHAGTYVIAKVKKEGDGGTYILLN 550

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLF INTG+K+RIWESDKEKY+ETVVALMSDQ+EG++ ++QLKILTS
Sbjct: 551  GSGATPEGNIPFLDLFGINTGSKQRIWESDKEKYYETVVALMSDQNEGDLCIDQLKILTS 610

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKK  QITNFPHPYPQL+SL+KEM+RY+RKDGVQLTATLYLPP 
Sbjct: 611  KESKTENTQYYIQSWPDKKVYQITNFPHPYPQLASLQKEMVRYQRKDGVQLTATLYLPPG 670

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 671  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 730

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 731  PIIGEGDEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 790

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIKKPILLIHGEED
Sbjct: 791  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATSTYVEMSPFMSANKIKKPILLIHGEED 850

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARES+MHVLWETDRWLQKYC++N
Sbjct: 851  NNPGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESIMHVLWETDRWLQKYCISN 910

Query: 3517 SSDTSEDPNE---SASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            SSD   D ++     ++   D   K          E  D + D+  +T RS L
Sbjct: 911  SSDIVADRDDCKVDGNKAKDDLGGKAVSVGGEGGQEQDDVDQDEFLVTLRSLL 963


>ref|XP_010259302.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X1
            [Nelumbo nucifera]
          Length = 964

 Score =  776 bits (2005), Expect = 0.0
 Identities = 377/473 (79%), Positives = 411/473 (86%), Gaps = 3/473 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGSE A+PRILFDRSSEDVYSDPGSPM+RRT  GTYVIAK+KKEG+ GTY+LLN
Sbjct: 491  RTWVISPGSEDASPRILFDRSSEDVYSDPGSPMMRRTHAGTYVIAKVKKEGDGGTYILLN 550

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            GSGATP+GNIPFLDLF INTG+K+RIWESDKEKY+ETVVALMSDQ+EG++ ++QLKILTS
Sbjct: 551  GSGATPEGNIPFLDLFGINTGSKQRIWESDKEKYYETVVALMSDQNEGDLCIDQLKILTS 610

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKK  QITNFPHPYPQL+SL+KEM+RY+RKDGVQLTATLYLPP 
Sbjct: 611  KESKTENTQYYIQSWPDKKVYQITNFPHPYPQLASLQKEMVRYQRKDGVQLTATLYLPPG 670

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 671  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 730

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 731  PIIGEGDEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 790

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SANKIKKPILLIHGEED
Sbjct: 791  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATSTYVEMSPFMSANKIKKPILLIHGEED 850

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN GTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARES+MHVLWETDRWLQKYC++N
Sbjct: 851  NNPGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESIMHVLWETDRWLQKYCISN 910

Query: 3517 SSDTSEDPNE---SASRGISDAESKXXXXXXXXXAENSDHEIDKVHITCRSSL 3368
            SSD   D ++     ++   D   K          E  D + D+  +T RS L
Sbjct: 911  SSDIVADRDDCKVDGNKAKDDLGGKAVSVGGEGGQEQDDVDQDEFLVTLRSLL 963


>gb|KJB49818.1| hypothetical protein B456_008G139300 [Gossypium raimondii]
          Length = 830

 Score =  770 bits (1989), Expect = 0.0
 Identities = 370/459 (80%), Positives = 409/459 (89%), Gaps = 3/459 (0%)
 Frame = -3

Query: 4777 RTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLLN 4598
            RTW+ISPGS+  +PRILFDRSSEDVYSDPGSPMLRRT TG YVIAK++KE ++ TYLLLN
Sbjct: 357  RTWVISPGSKDVSPRILFDRSSEDVYSDPGSPMLRRTSTGNYVIAKLRKENDDATYLLLN 416

Query: 4597 GSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILTS 4418
            G+GATP+G+IPFLDLFDINTG+KERIWESDKEKY+E+VVAL+SDQ EG+IH+N LKILTS
Sbjct: 417  GNGATPEGDIPFLDLFDINTGSKERIWESDKEKYYESVVALLSDQKEGDIHINDLKILTS 476

Query: 4417 KESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPPD 4238
            KESKTENTQYYI SWPDKK CQIT+FPHPYPQL+SL+K+MIRYERKDGVQLTATLYLPP 
Sbjct: 477  KESKTENTQYYIQSWPDKKLCQITDFPHPYPQLASLQKDMIRYERKDGVQLTATLYLPPG 536

Query: 4237 YDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPTI 4058
            YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNEFAGIG TS LLWLARRFAILSGPTI
Sbjct: 537  YDPSKDGPLPCLVWSYPGEFKSKDAAGQVRGSPNEFAGIGPTSALLWLARRFAILSGPTI 596

Query: 4057 PIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 3878
            PIIGEG+EEANDRY              VIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP
Sbjct: 597  PIIGEGDEEANDRYVEQLVASAEAAVEEVIRRGVAHPNKIAVGGHSYGAFMTANLLAHAP 656

Query: 3877 HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEED 3698
            HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEA  TYVEMSPF+SAN+IKKPILLIHGEED
Sbjct: 657  HLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFMSANRIKKPILLIHGEED 716

Query: 3697 NNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVAN 3518
            NN+GTLTMQSDRFFNALKGHGALCRLV+LPFESHGY+ARES+MHVLWETDRWLQK+CV+N
Sbjct: 717  NNAGTLTMQSDRFFNALKGHGALCRLVILPFESHGYSARESIMHVLWETDRWLQKHCVSN 776

Query: 3517 SSDTSEDPNES---ASRGISDAESKXXXXXXXXXAENSD 3410
            +S+ S D  +S     + ++D E+K         AE SD
Sbjct: 777  TSEVSADIGKSKDGEGKEVTDIENKAVAASGGGGAELSD 815


>ref|XP_009774674.1| PREDICTED: probable glutamyl endopeptidase, chloroplastic isoform X3
            [Nicotiana sylvestris]
          Length = 809

 Score =  768 bits (1983), Expect = 0.0
 Identities = 370/445 (83%), Positives = 401/445 (90%), Gaps = 2/445 (0%)
 Frame = -3

Query: 4780 IRTWIISPGSESANPRILFDRSSEDVYSDPGSPMLRRTPTGTYVIAKIKKEGEEGTYLLL 4601
            +RTW+ISPGSE+ NPRILFDRSSEDVYSDPGSPM RRTP GTYVIAK+KKE +  TYLLL
Sbjct: 328  VRTWVISPGSENVNPRILFDRSSEDVYSDPGSPMSRRTPAGTYVIAKVKKEDDGDTYLLL 387

Query: 4600 NGSGATPQGNIPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQDEGEIHLNQLKILT 4421
            NGSGATP+GNIPFLDLFDINTG+KERIW+S++EKYFETVVALMSDQ EGE+ +N+LKILT
Sbjct: 388  NGSGATPEGNIPFLDLFDINTGSKERIWQSNREKYFETVVALMSDQKEGELSINKLKILT 447

Query: 4420 SKESKTENTQYYILSWPDKKACQITNFPHPYPQLSSLKKEMIRYERKDGVQLTATLYLPP 4241
            SKESKTENTQYY+LSWP+KKACQITNFPHPYPQL SL+KEMIRY+RKDGVQLTATLYLPP
Sbjct: 448  SKESKTENTQYYLLSWPEKKACQITNFPHPYPQLESLQKEMIRYQRKDGVQLTATLYLPP 507

Query: 4240 DYDPARDGPLPCLMWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSPLLWLARRFAILSGPT 4061
             YDP+RDGPLPCL+WSYPGEFKSKDAA QVRGSPNEFAGIG TSPLLWLARRFA+LSGPT
Sbjct: 508  GYDPSRDGPLPCLVWSYPGEFKSKDAASQVRGSPNEFAGIGPTSPLLWLARRFAVLSGPT 567

Query: 4060 IPIIGEGNEEANDRYXXXXXXXXXXXXXXVIRRGVAHPNKIAVGGHSYGAFMTANLLAHA 3881
            IPIIGEG+EEANDRY              VIRRGVA PNKIAVGGHSYGAFMTANLLAHA
Sbjct: 568  IPIIGEGDEEANDRYIEQLVASAEAAVEEVIRRGVADPNKIAVGGHSYGAFMTANLLAHA 627

Query: 3880 PHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAVGTYVEMSPFISANKIKKPILLIHGEE 3701
            PHLFCCGIARSGAYNRTLTPFGFQNE+RTLWEA  TYVEMSPF+SANKIKKPILLIHGEE
Sbjct: 628  PHLFCCGIARSGAYNRTLTPFGFQNEERTLWEATNTYVEMSPFMSANKIKKPILLIHGEE 687

Query: 3700 DNNSGTLTMQSDRFFNALKGHGALCRLVVLPFESHGYAARESVMHVLWETDRWLQKYCVA 3521
            DNN GTLTMQSDRFFNALKGHGALCRLV+LP+ESHGY ARES+MHVLWETDRWLQK+C A
Sbjct: 688  DNNPGTLTMQSDRFFNALKGHGALCRLVILPYESHGYGARESIMHVLWETDRWLQKHC-A 746

Query: 3520 NSSDTSEDPN--ESASRGISDAESK 3452
             SSD   D N  +  + G  D++SK
Sbjct: 747  YSSDVKADLNACKDNAEGTVDSQSK 771


Top