BLASTX nr result

ID: Achyranthes22_contig00014854 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00014854
         (2227 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270247.1| PREDICTED: protease Do-like 2, chloroplastic...   877   0.0  
gb|EOX94933.1| DEGP protease 2 isoform 1 [Theobroma cacao]            875   0.0  
gb|EMJ01505.1| hypothetical protein PRUPE_ppa002853mg [Prunus pe...   872   0.0  
gb|EOX94934.1| DEGP protease 2 isoform 2 [Theobroma cacao]            871   0.0  
emb|CBI32271.3| unnamed protein product [Vitis vinifera]              871   0.0  
ref|XP_006444216.1| hypothetical protein CICLE_v10019366mg [Citr...   865   0.0  
ref|XP_006479864.1| PREDICTED: protease Do-like 2, chloroplastic...   863   0.0  
ref|XP_006397989.1| hypothetical protein EUTSA_v10001363mg [Eutr...   858   0.0  
ref|XP_002520690.1| serine endopeptidase degp2, putative [Ricinu...   853   0.0  
ref|XP_004290719.1| PREDICTED: protease Do-like 2, chloroplastic...   846   0.0  
ref|XP_006295823.1| hypothetical protein CARUB_v10024950mg [Caps...   846   0.0  
ref|NP_566115.1| DegP2 protease [Arabidopsis thaliana] gi|752202...   845   0.0  
ref|NP_001118544.1| DegP2 protease [Arabidopsis thaliana] gi|330...   845   0.0  
pdb|4FLN|A Chain A, Crystal Structure Of Plant Protease Deg2 gi|...   843   0.0  
ref|XP_004148888.1| PREDICTED: protease Do-like 2, chloroplastic...   841   0.0  
ref|XP_006366368.1| PREDICTED: protease Do-like 2, chloroplastic...   838   0.0  
ref|XP_002882138.1| hypothetical protein ARALYDRAFT_483986 [Arab...   837   0.0  
ref|XP_004247469.1| PREDICTED: protease Do-like 2, chloroplastic...   832   0.0  
ref|XP_006855396.1| hypothetical protein AMTR_s00057p00143260 [A...   822   0.0  
ref|XP_003520225.1| PREDICTED: protease Do-like 2, chloroplastic...   819   0.0  

>ref|XP_002270247.1| PREDICTED: protease Do-like 2, chloroplastic-like [Vitis vinifera]
          Length = 606

 Score =  877 bits (2267), Expect = 0.0
 Identities = 450/576 (78%), Positives = 489/576 (84%), Gaps = 5/576 (0%)
 Frame = -3

Query: 2012 NAFTCREADTVSHKLFFGEPTDERHT--SFGHHDGRKKDVGRSQSSGFQSLVIQ--RKDK 1845
            + F+CR A     +   G  +        FG   G + +  R+QSS F+S   Q  RKDK
Sbjct: 32   STFSCRSAPKAISRSNKGASSSPNKPPKQFGGGSG-EDEKRRTQSSPFKSFGAQSQRKDK 90

Query: 1844 KAFAYELKEQQ-AEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 1668
            K  + +LKEQQ  E GNLQD AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAF+IG+
Sbjct: 91   KGVSSDLKEQQQVETGNLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFIIGD 150

Query: 1667 GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 1488
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARGI+CDIALLSVE+EEFWKG EPL FG
Sbjct: 151  GKLLTNAHCVEHATQVKVKRRGDDTKYVAKVLARGIECDIALLSVESEEFWKGTEPLNFG 210

Query: 1487 LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 1308
             LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 211  RLPRLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 270

Query: 1307 AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1128
            AFNDQGECIGVAFQV RSED ENIGYVIPTTVVSHFL DYERNGKYTGFPCLGVLLQKLE
Sbjct: 271  AFNDQGECIGVAFQVFRSEDVENIGYVIPTTVVSHFLDDYERNGKYTGFPCLGVLLQKLE 330

Query: 1127 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 948
            NPALRSCLKV SNEGVLVRRVEPTSDA NVLK+GDVIV FDGVHVGCEGTVPFRSTERIA
Sbjct: 331  NPALRSCLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDGVHVGCEGTVPFRSTERIA 390

Query: 947  FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 768
            FRYLISQKF+GD VE+GIIR GA +KVQ VL+PRVHLVPYHI+GGQPSYLI++GLVFTPL
Sbjct: 391  FRYLISQKFTGDVVEVGIIRAGAFMKVQVVLDPRVHLVPYHIEGGQPSYLIISGLVFTPL 450

Query: 767  SEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 588
            SEPLI     + +GLKLLTKARYSLA F GEQIVILSQVLANEVNIGYE+MSNQQV+K N
Sbjct: 451  SEPLIEEECEDTIGLKLLTKARYSLARFKGEQIVILSQVLANEVNIGYENMSNQQVLKFN 510

Query: 587  GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADL 408
            GT I+NIHHL HL+DSC +KYLVFEFEDNY           ASPCILKDYGIPSERS+DL
Sbjct: 511  GTWIKNIHHLAHLIDSCKDKYLVFEFEDNYLAVLEREAAAAASPCILKDYGIPSERSSDL 570

Query: 407  KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
             +PY+ + GDN +I+Q  GD PVSN+E+G DGLLWA
Sbjct: 571  LKPYMDSLGDNRSINQDFGDIPVSNLEIGSDGLLWA 606


>gb|EOX94933.1| DEGP protease 2 isoform 1 [Theobroma cacao]
          Length = 633

 Score =  875 bits (2262), Expect = 0.0
 Identities = 441/577 (76%), Positives = 486/577 (84%)
 Frame = -3

Query: 2030 CNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRK 1851
            C+++S   F  ++ D VS K   G   DE+ + +      + D+GR QS+GF+S   QRK
Sbjct: 58   CSSTSPRKFNVKK-DPVSQKKLPGRSKDEKSSLYADGISGRGDMGRPQSTGFKSFGTQRK 116

Query: 1850 DKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG 1671
            D++ F  +L+EQQ E GNLQD  FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG
Sbjct: 117  DREEFQLDLREQQVEPGNLQDATFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIG 176

Query: 1670 NGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRF 1491
            +G L+TNAHCVEHDTQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE++EFW+G EPLR 
Sbjct: 177  DGKLLTNAHCVEHDTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESKEFWRGAEPLRL 236

Query: 1490 GLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 1311
            G LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG
Sbjct: 237  GHLPGLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 296

Query: 1310 PAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKL 1131
            PAFN+QGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKL
Sbjct: 297  PAFNEQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKL 356

Query: 1130 ENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERI 951
            ENPALR+CL V SNEGVLVRRVEPTSDA NVLK+GDVIV FD VHVG EGTVPFRS ERI
Sbjct: 357  ENPALRACLHVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDDVHVGSEGTVPFRSNERI 416

Query: 950  AFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTP 771
            AFRYLISQKF+GD  ELGI+R G  +KVQ VLN RVHLVPYHIDGGQPSYLI+AGLVFTP
Sbjct: 417  AFRYLISQKFAGDVAELGIVRAGRFMKVQVVLNRRVHLVPYHIDGGQPSYLIIAGLVFTP 476

Query: 770  LSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKL 591
            LSEPLI     + +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDM NQQV+K 
Sbjct: 477  LSEPLIEEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMGNQQVLKF 536

Query: 590  NGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSAD 411
            NG RI+NIHHL HLV  C +KYLVFEFEDNY           AS  ILKDYGIPSE+S D
Sbjct: 537  NGIRIKNIHHLAHLVACCKDKYLVFEFEDNYLAVLEREAAMAASSRILKDYGIPSEKSDD 596

Query: 410  LKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            L EPY+ + GDN AI+Q  GDSPVSN+E+GF+GLLWA
Sbjct: 597  LLEPYVDSLGDNQAIEQDYGDSPVSNLEIGFEGLLWA 633


>gb|EMJ01505.1| hypothetical protein PRUPE_ppa002853mg [Prunus persica]
          Length = 628

 Score =  872 bits (2252), Expect = 0.0
 Identities = 444/576 (77%), Positives = 492/576 (85%)
 Frame = -3

Query: 2027 NNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKD 1848
            ++SSS+A +  E + V +KL       +R +  G   G+K   G+SQ + ++S   QRK+
Sbjct: 61   SSSSSSAKSQPEKEAVPNKL---SGNGDRWSVTGR--GKK---GQSQPTAYRSFGTQRKE 112

Query: 1847 KKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 1668
            KK FA + KEQQ E  +LQD  FLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+
Sbjct: 113  KKEFAVDQKEQQVEPRSLQDADFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGD 172

Query: 1667 GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 1488
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE+EEFWKG EPL+ G
Sbjct: 173  GKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESEEFWKGAEPLQLG 232

Query: 1487 LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 1308
             LPHLQ++VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 233  SLPHLQEAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 292

Query: 1307 AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1128
            AFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL DYERNG+YTGFPCLGVLLQKLE
Sbjct: 293  AFNDQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLDDYERNGRYTGFPCLGVLLQKLE 352

Query: 1127 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 948
            NPALR+CLKV S EGVLVRRVEPTSDA+NVLK+GDVIV FD VHVGCEGTVPFRS ERIA
Sbjct: 353  NPALRACLKVESIEGVLVRRVEPTSDAHNVLKEGDVIVSFDDVHVGCEGTVPFRSNERIA 412

Query: 947  FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 768
            FRYLISQKF+GD  +LGIIR G   KV+ VLNPRVHLVP+HIDGGQPSYLI+AGLVFTPL
Sbjct: 413  FRYLISQKFAGDVSDLGIIRAGEFKKVKAVLNPRVHLVPFHIDGGQPSYLIIAGLVFTPL 472

Query: 767  SEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 588
            SEPLI     + +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDMSNQQV+KLN
Sbjct: 473  SEPLIDEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMSNQQVLKLN 532

Query: 587  GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADL 408
            GT+IRNIHHL +LVDSC +KYLVFEFEDNY           AS CILKDYGIPSERS+DL
Sbjct: 533  GTQIRNIHHLAYLVDSCKDKYLVFEFEDNYITVLEREAATAASSCILKDYGIPSERSSDL 592

Query: 407  KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
             EPY+ + GDN A++Q  GDSPVSN+E+GFDG++WA
Sbjct: 593  LEPYVDSLGDNQAVNQDIGDSPVSNLEIGFDGIIWA 628


>gb|EOX94934.1| DEGP protease 2 isoform 2 [Theobroma cacao]
          Length = 634

 Score =  871 bits (2250), Expect = 0.0
 Identities = 441/578 (76%), Positives = 486/578 (84%), Gaps = 1/578 (0%)
 Frame = -3

Query: 2030 CNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRK 1851
            C+++S   F  ++ D VS K   G   DE+ + +      + D+GR QS+GF+S   QRK
Sbjct: 58   CSSTSPRKFNVKK-DPVSQKKLPGRSKDEKSSLYADGISGRGDMGRPQSTGFKSFGTQRK 116

Query: 1850 DKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG 1671
            D++ F  +L+EQQ E GNLQD  FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG
Sbjct: 117  DREEFQLDLREQQVEPGNLQDATFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIG 176

Query: 1670 NGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRF 1491
            +G L+TNAHCVEHDTQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE++EFW+G EPLR 
Sbjct: 177  DGKLLTNAHCVEHDTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESKEFWRGAEPLRL 236

Query: 1490 GLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 1311
            G LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG
Sbjct: 237  GHLPGLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 296

Query: 1310 PAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKL 1131
            PAFN+QGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKL
Sbjct: 297  PAFNEQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKL 356

Query: 1130 ENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERI 951
            ENPALR+CL V SNEGVLVRRVEPTSDA NVLK+GDVIV FD VHVG EGTVPFRS ERI
Sbjct: 357  ENPALRACLHVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDDVHVGSEGTVPFRSNERI 416

Query: 950  AFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTP 771
            AFRYLISQKF+GD  ELGI+R G  +KVQ VLN RVHLVPYHIDGGQPSYLI+AGLVFTP
Sbjct: 417  AFRYLISQKFAGDVAELGIVRAGRFMKVQVVLNRRVHLVPYHIDGGQPSYLIIAGLVFTP 476

Query: 770  LSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSN-QQVMK 594
            LSEPLI     + +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDM N QQV+K
Sbjct: 477  LSEPLIEEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMGNQQQVLK 536

Query: 593  LNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSA 414
             NG RI+NIHHL HLV  C +KYLVFEFEDNY           AS  ILKDYGIPSE+S 
Sbjct: 537  FNGIRIKNIHHLAHLVACCKDKYLVFEFEDNYLAVLEREAAMAASSRILKDYGIPSEKSD 596

Query: 413  DLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            DL EPY+ + GDN AI+Q  GDSPVSN+E+GF+GLLWA
Sbjct: 597  DLLEPYVDSLGDNQAIEQDYGDSPVSNLEIGFEGLLWA 634


>emb|CBI32271.3| unnamed protein product [Vitis vinifera]
          Length = 612

 Score =  871 bits (2250), Expect = 0.0
 Identities = 450/582 (77%), Positives = 489/582 (84%), Gaps = 11/582 (1%)
 Frame = -3

Query: 2012 NAFTCREADTVSHKLFFGEPTDERHT--SFGHHDGRKKDVGRSQSSGFQSLVIQ--RKDK 1845
            + F+CR A     +   G  +        FG   G + +  R+QSS F+S   Q  RKDK
Sbjct: 32   STFSCRSAPKAISRSNKGASSSPNKPPKQFGGGSG-EDEKRRTQSSPFKSFGAQSQRKDK 90

Query: 1844 KAFAYELKEQQ-AEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 1668
            K  + +LKEQQ  E GNLQD AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAF+IG+
Sbjct: 91   KGVSSDLKEQQQVETGNLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFIIGD 150

Query: 1667 GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 1488
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARGI+CDIALLSVE+EEFWKG EPL FG
Sbjct: 151  GKLLTNAHCVEHATQVKVKRRGDDTKYVAKVLARGIECDIALLSVESEEFWKGTEPLNFG 210

Query: 1487 LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 1308
             LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 211  RLPRLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 270

Query: 1307 AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1128
            AFNDQGECIGVAFQV RSED ENIGYVIPTTVVSHFL DYERNGKYTGFPCLGVLLQKLE
Sbjct: 271  AFNDQGECIGVAFQVFRSEDVENIGYVIPTTVVSHFLDDYERNGKYTGFPCLGVLLQKLE 330

Query: 1127 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 948
            NPALRSCLKV SNEGVLVRRVEPTSDA NVLK+GDVIV FDGVHVGCEGTVPFRSTERIA
Sbjct: 331  NPALRSCLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDGVHVGCEGTVPFRSTERIA 390

Query: 947  FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 768
            FRYLISQKF+GD VE+GIIR GA +KVQ VL+PRVHLVPYHI+GGQPSYLI++GLVFTPL
Sbjct: 391  FRYLISQKFTGDVVEVGIIRAGAFMKVQVVLDPRVHLVPYHIEGGQPSYLIISGLVFTPL 450

Query: 767  SEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQ----- 603
            SEPLI     + +GLKLLTKARYSLA F GEQIVILSQVLANEVNIGYE+MSNQQ     
Sbjct: 451  SEPLIEEECEDTIGLKLLTKARYSLARFKGEQIVILSQVLANEVNIGYENMSNQQASNNL 510

Query: 602  -VMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPS 426
             V+K NGT I+NIHHL HL+DSC +KYLVFEFEDNY           ASPCILKDYGIPS
Sbjct: 511  NVLKFNGTWIKNIHHLAHLIDSCKDKYLVFEFEDNYLAVLEREAAAAASPCILKDYGIPS 570

Query: 425  ERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            ERS+DL +PY+ + GDN +I+Q  GD PVSN+E+G DGLLWA
Sbjct: 571  ERSSDLLKPYMDSLGDNRSINQDFGDIPVSNLEIGSDGLLWA 612


>ref|XP_006444216.1| hypothetical protein CICLE_v10019366mg [Citrus clementina]
            gi|557546478|gb|ESR57456.1| hypothetical protein
            CICLE_v10019366mg [Citrus clementina]
          Length = 606

 Score =  865 bits (2235), Expect = 0.0
 Identities = 438/555 (78%), Positives = 475/555 (85%), Gaps = 4/555 (0%)
 Frame = -3

Query: 1952 TDERHTSFGHHDGRKKD----VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDT 1785
            T +  T+     GR KD      RSQS+ F+S   QRKDKK F ++ KEQ +E+GNLQD 
Sbjct: 52   TSKSSTTDRKFPGRSKDGKGETERSQSTAFKSFGAQRKDKKEFQFDSKEQLSESGNLQDA 111

Query: 1784 AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRR 1605
            AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRR
Sbjct: 112  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRR 171

Query: 1604 GDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTI 1425
            GDDTKYVAKVLARG+DCDIALLSVE+EEFWK  EPL  G LP LQD+VTVVGYPLGGDTI
Sbjct: 172  GDDTKYVAKVLARGVDCDIALLSVESEEFWKDAEPLCLGHLPRLQDAVTVVGYPLGGDTI 231

Query: 1424 SVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDT 1245
            SVTKGVVSRIEVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND+GECIGVAFQV RSE+ 
Sbjct: 232  SVTKGVVSRIEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDKGECIGVAFQVYRSEEV 291

Query: 1244 ENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRV 1065
            ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRRV
Sbjct: 292  ENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKLENPALRTCLKVPSNEGVLVRRV 351

Query: 1064 EPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRN 885
            EPTSDA N+LK+GDVIV FD V VG EGTVPFRS ERIAFRYLISQKF+GD  ELGIIR 
Sbjct: 352  EPTSDANNILKEGDVIVSFDDVCVGSEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRA 411

Query: 884  GAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKA 705
            G  +KV+ VLNPRVHLVPYHIDGGQPSYLI+AGLVFTPLSEPLI     + +GLKLL KA
Sbjct: 412  GTFMKVKVVLNPRVHLVPYHIDGGQPSYLIIAGLVFTPLSEPLIEEECDDSIGLKLLAKA 471

Query: 704  RYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKY 525
            RYSLA F GEQ+VILSQVLANEV+IGYEDMSNQQV+K NGTRI+NIHHL HLVDSC +KY
Sbjct: 472  RYSLARFEGEQMVILSQVLANEVSIGYEDMSNQQVLKFNGTRIKNIHHLAHLVDSCKDKY 531

Query: 524  LVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDS 345
            LVFEFEDNY           AS CILKDYGIPSERS+DL EPY+   G N AI+Q  GDS
Sbjct: 532  LVFEFEDNYLAVLEREAAVAASSCILKDYGIPSERSSDLLEPYVDPLGGNQAINQDSGDS 591

Query: 344  PVSNMEMGFDGLLWA 300
            PVS++E+GFDGL WA
Sbjct: 592  PVSDLEIGFDGLKWA 606


>ref|XP_006479864.1| PREDICTED: protease Do-like 2, chloroplastic-like [Citrus sinensis]
          Length = 606

 Score =  863 bits (2231), Expect = 0.0
 Identities = 437/555 (78%), Positives = 475/555 (85%), Gaps = 4/555 (0%)
 Frame = -3

Query: 1952 TDERHTSFGHHDGRKKD----VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDT 1785
            T +  T+     GR KD      RSQS+ F+S   QRKDKK F ++ KEQ +E+GNLQD 
Sbjct: 52   TSKSSTTDRKFPGRSKDGKGETERSQSTAFKSFGAQRKDKKEFQFDSKEQLSESGNLQDA 111

Query: 1784 AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRR 1605
            AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRR
Sbjct: 112  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRR 171

Query: 1604 GDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTI 1425
            GDDTKYVAKVLARG+DCDIALLSVE+EEFWK  EPL  G LP LQD+VTVVGYPLGGDTI
Sbjct: 172  GDDTKYVAKVLARGVDCDIALLSVESEEFWKDAEPLCLGHLPRLQDAVTVVGYPLGGDTI 231

Query: 1424 SVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDT 1245
            SVTKGVVSRIEVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND+GECIGVAFQV RSE+ 
Sbjct: 232  SVTKGVVSRIEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDKGECIGVAFQVYRSEEV 291

Query: 1244 ENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRV 1065
            ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRRV
Sbjct: 292  ENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKLENPALRTCLKVPSNEGVLVRRV 351

Query: 1064 EPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRN 885
            EPTSDA N+LK+GDVIV FD V VG EGTVPFRS ERIAFRYLISQKF+GD  ELGIIR 
Sbjct: 352  EPTSDANNILKEGDVIVSFDDVCVGSEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRA 411

Query: 884  GAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKA 705
            G  +KV+ VLNPRVHLVPYHIDGGQPSYLI+AGLVFTPLSEPLI     + +GLKLL KA
Sbjct: 412  GTFMKVKVVLNPRVHLVPYHIDGGQPSYLIIAGLVFTPLSEPLIEEECDDSIGLKLLAKA 471

Query: 704  RYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKY 525
            RYSLA F GEQ+VILSQVLANEV+IGYEDMSNQQV+K NGTRI+NIHHL HLVDSC +KY
Sbjct: 472  RYSLARFEGEQMVILSQVLANEVSIGYEDMSNQQVLKFNGTRIKNIHHLAHLVDSCKDKY 531

Query: 524  LVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDS 345
            LVFEFEDNY           AS CILKDYGIPSERS+DL EP++   G N AI+Q  GDS
Sbjct: 532  LVFEFEDNYLAVLEREAAVAASSCILKDYGIPSERSSDLLEPFVDPLGGNQAINQDSGDS 591

Query: 344  PVSNMEMGFDGLLWA 300
            PVS++E+GFDGL WA
Sbjct: 592  PVSDLEIGFDGLKWA 606


>ref|XP_006397989.1| hypothetical protein EUTSA_v10001363mg [Eutrema salsugineum]
            gi|557099062|gb|ESQ39442.1| hypothetical protein
            EUTSA_v10001363mg [Eutrema salsugineum]
          Length = 612

 Score =  858 bits (2217), Expect = 0.0
 Identities = 428/550 (77%), Positives = 466/550 (84%)
 Frame = -3

Query: 1949 DERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNA 1770
            DE   S G  DG        Q+  F++    +KDKK    + ++QQ + G + D +FLNA
Sbjct: 68   DESCNSHGKGDG-----AGPQTMAFKAFGSPKKDKKEAQSDFRDQQTDPGKIHDASFLNA 122

Query: 1769 VVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTK 1590
            VVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD K
Sbjct: 123  VVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRK 182

Query: 1589 YVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKG 1410
            YVAKVL RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKG
Sbjct: 183  YVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKG 242

Query: 1409 VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGY 1230
            VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGY
Sbjct: 243  VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGY 302

Query: 1229 VIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSD 1050
            VIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSD
Sbjct: 303  VIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSD 362

Query: 1049 AYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIK 870
            A  VLK+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKFSGD  ELGIIR G H K
Sbjct: 363  ASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFSGDIAELGIIRAGEHKK 422

Query: 869  VQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLA 690
            VQ VL PRVHLVP+HIDGGQPSY+I+AGLVFTPLSEPLI     + +GLKLLTKARYS+A
Sbjct: 423  VQVVLRPRVHLVPFHIDGGQPSYIIIAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVA 482

Query: 689  HFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEF 510
             F GEQIVILSQVLANEVNIGYEDM+NQQV+K NGT IRNIHHL HL+D C +KYLVFEF
Sbjct: 483  RFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGTPIRNIHHLAHLIDMCKDKYLVFEF 542

Query: 509  EDNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNM 330
            EDNY           AS CILKDYGIPSERSADL+EPY+    D  A+DQ  GDSPVSN+
Sbjct: 543  EDNYVAVLEREASDSASLCILKDYGIPSERSADLREPYIDPIDDTRALDQGFGDSPVSNL 602

Query: 329  EMGFDGLLWA 300
            E+GFDGL+WA
Sbjct: 603  EIGFDGLVWA 612


>ref|XP_002520690.1| serine endopeptidase degp2, putative [Ricinus communis]
            gi|223540075|gb|EEF41652.1| serine endopeptidase degp2,
            putative [Ricinus communis]
          Length = 621

 Score =  853 bits (2204), Expect = 0.0
 Identities = 427/550 (77%), Positives = 472/550 (85%), Gaps = 1/550 (0%)
 Frame = -3

Query: 1946 ERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAV 1767
            +R   +   +G K + G++QS  ++S   +RKDKK F ++  E Q E+G LQD AFLNAV
Sbjct: 72   KRSNLYSDENGGKAERGKAQSVAYKSFGTERKDKKEFQFDSNELQIESGKLQDMAFLNAV 131

Query: 1766 VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKY 1587
            VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKY
Sbjct: 132  VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKY 191

Query: 1586 VAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGV 1407
            VAKVLARG+DCDIALLSV+++EFW+G EPL+ G LP LQD+VTVVGYPLGGDTISVTKGV
Sbjct: 192  VAKVLARGVDCDIALLSVKDKEFWEGAEPLQLGHLPRLQDAVTVVGYPLGGDTISVTKGV 251

Query: 1406 VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYV 1227
            VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFN+QGECIGVAFQV RSE+ ENIGYV
Sbjct: 252  VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNEQGECIGVAFQVYRSEEAENIGYV 311

Query: 1226 IPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDA 1047
            IPTTVVSHFL DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRR+EPTSDA
Sbjct: 312  IPTTVVSHFLNDYERNGKYTGFPCLGVLLQKLENPALRACLKVESNEGVLVRRIEPTSDA 371

Query: 1046 YNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKV 867
             NVLK+GDVIV FD V+VGCEGTVPFRS ERIAFRYLISQKF+GD  ELGIIR G+ +KV
Sbjct: 372  NNVLKEGDVIVSFDDVNVGCEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRAGSFMKV 431

Query: 866  QTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAH 687
            + VLNPRVHLVPYH+DGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KARYSLA 
Sbjct: 432  KVVLNPRVHLVPYHVDGGQPSYLIIAGLVFTPLSEPLIDEECEGSIGLKLLAKARYSLAR 491

Query: 686  FIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFE 507
            F GEQIVILSQVLANEVNIGYEDMSNQQV+K NGTRI+NIHHL +LVDSC +KYLVFEFE
Sbjct: 492  FKGEQIVILSQVLANEVNIGYEDMSNQQVLKFNGTRIKNIHHLAYLVDSCKDKYLVFEFE 551

Query: 506  DNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQ-APGDSPVSNM 330
            DNY           AS CIL DYGIPSERS DL +PY+ +  DN   +Q A GDSPVSN+
Sbjct: 552  DNYLAVLERQPATAASSCILTDYGIPSERSPDLLKPYVDSQVDNQLAEQDALGDSPVSNL 611

Query: 329  EMGFDGLLWA 300
            E+G DG+LWA
Sbjct: 612  EIGNDGILWA 621


>ref|XP_004290719.1| PREDICTED: protease Do-like 2, chloroplastic-like [Fragaria vesca
            subsp. vesca]
          Length = 622

 Score =  846 bits (2186), Expect = 0.0
 Identities = 427/540 (79%), Positives = 464/540 (85%), Gaps = 1/540 (0%)
 Frame = -3

Query: 1916 GRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQ-QAEAGNLQDTAFLNAVVKVYCTHTA 1740
            G KK  GRSQ + ++    QRK+KK    + KE+ QAE  NLQD  FLNAVVKVYCTHTA
Sbjct: 85   GGKK--GRSQQAAYKPFGTQRKEKKESVADQKEKKQAEVRNLQDADFLNAVVKVYCTHTA 142

Query: 1739 PDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGI 1560
            PDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLA+G+
Sbjct: 143  PDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLAKGV 202

Query: 1559 DCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSY 1380
            DCDIALL+VE+EEFWKG EPL FG LPHLQ++VTVVGYPLGGDTISVTKGVVSRIEVTSY
Sbjct: 203  DCDIALLTVESEEFWKGAEPLHFGSLPHLQEAVTVVGYPLGGDTISVTKGVVSRIEVTSY 262

Query: 1379 AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHF 1200
            AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHF
Sbjct: 263  AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEEAENIGYVIPTTVVSHF 322

Query: 1199 LTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDV 1020
            L DYERNGKYTGFPCLGV+LQKLENPALR+CLKV S EGVLVRRVEPT DA+NVLK+GDV
Sbjct: 323  LNDYERNGKYTGFPCLGVMLQKLENPALRACLKVESVEGVLVRRVEPTCDAHNVLKEGDV 382

Query: 1019 IVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVH 840
            IV FD VHVGCEGTVPFRS ERIAFRYLISQKF+GD  ELGIIR G  +KV+  LNPRVH
Sbjct: 383  IVSFDDVHVGCEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRAGEFMKVKAELNPRVH 442

Query: 839  LVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVIL 660
            LVPYHIDGGQPSYLI+AGLVFTPLSEPLI     + +GLKLL KARYSLA F GEQIVIL
Sbjct: 443  LVPYHIDGGQPSYLIIAGLVFTPLSEPLIDEECDDSIGLKLLAKARYSLARFKGEQIVIL 502

Query: 659  SQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXX 480
            SQVLANEVNIGYEDMSNQQV+KLNGT I+NIHHL HLVDSC  KYLVFEFEDNY      
Sbjct: 503  SQVLANEVNIGYEDMSNQQVLKLNGTPIKNIHHLAHLVDSCKHKYLVFEFEDNYITVLER 562

Query: 479  XXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
                 +S  ILKDYGIP+ERS+DL EPY+ +  D  A  +  GDSPVSN+E+GFDGL+WA
Sbjct: 563  EGALASSTSILKDYGIPAERSSDLLEPYVDSVVDGQADQEDLGDSPVSNLEIGFDGLIWA 622


>ref|XP_006295823.1| hypothetical protein CARUB_v10024950mg [Capsella rubella]
            gi|482564531|gb|EOA28721.1| hypothetical protein
            CARUB_v10024950mg [Capsella rubella]
          Length = 604

 Score =  846 bits (2185), Expect = 0.0
 Identities = 421/530 (79%), Positives = 456/530 (86%)
 Frame = -3

Query: 1889 QSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQ 1710
            Q+  F++    +KDKK      ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQKQ
Sbjct: 76   QTMAFKAFGSPKKDKKDAPLS-RDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQ 134

Query: 1709 RQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVE 1530
            RQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSVE
Sbjct: 135  RQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVE 194

Query: 1529 NEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI 1350
            +E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI
Sbjct: 195  SEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI 254

Query: 1349 QIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKY 1170
            QIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGKY
Sbjct: 255  QIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKY 314

Query: 1169 TGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVG 990
            TG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HVG
Sbjct: 315  TGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHVG 374

Query: 989  CEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQ 810
            CEGTVPFRS+ERIAFRYLISQKF+GD  ELGIIR G H KVQ  L PRVHLVPYHIDGGQ
Sbjct: 375  CEGTVPFRSSERIAFRYLISQKFAGDIAELGIIRAGEHKKVQVALRPRVHLVPYHIDGGQ 434

Query: 809  PSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNI 630
            PSY+IVAGLVFTPLSEPLI     + +GLKLLTKARYS+A F GEQIVILSQVLANEVNI
Sbjct: 435  PSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNI 494

Query: 629  GYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCI 450
            GYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY           AS CI
Sbjct: 495  GYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCI 554

Query: 449  LKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            LKDYGIPSERSADL EPY+    DN A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 555  LKDYGIPSERSADLLEPYVDPIDDNQALDQGIGDSPVSNLEIGFDGLVWA 604


>ref|NP_566115.1| DegP2 protease [Arabidopsis thaliana]
            gi|75220233|sp|O82261.2|DEGP2_ARATH RecName:
            Full=Protease Do-like 2, chloroplastic; Flags: Precursor
            gi|11908036|gb|AAG41447.1|AF326865_1 putative DegP2
            protease [Arabidopsis thaliana]
            gi|13172275|gb|AAK14061.1|AF245171_1 DegP2 protease
            [Arabidopsis thaliana]
            gi|13194802|gb|AAK15563.1|AF349516_1 putative DegP2
            protease [Arabidopsis thaliana]
            gi|18700190|gb|AAL77706.1| At2g47940/F17A22.33
            [Arabidopsis thaliana] gi|20197307|gb|AAC63648.2| DegP2
            protease [Arabidopsis thaliana]
            gi|20197550|gb|AAM15122.1| DegP2 protease [Arabidopsis
            thaliana] gi|20857214|gb|AAM26706.1| At2g47940/F17A22.33
            [Arabidopsis thaliana] gi|330255820|gb|AEC10914.1| DegP2
            protease [Arabidopsis thaliana]
          Length = 607

 Score =  845 bits (2184), Expect = 0.0
 Identities = 426/576 (73%), Positives = 473/576 (82%), Gaps = 3/576 (0%)
 Frame = -3

Query: 2018 SSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRS--QSSGFQSLVIQRKDK 1845
            S+++ T R +  +  K    +          ++ GR +D   +  Q   F++    +K+K
Sbjct: 32   SASSLTPRASSNIKRKSSRSDSPSPILNPEKNYPGRVRDESSNPPQKMAFKAFGSPKKEK 91

Query: 1844 KAFAYEL-KEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 1668
            K    +  ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG+
Sbjct: 92   KESLSDFSRDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGD 151

Query: 1667 GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 1488
            G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSVE+E+FWKG EPLR G
Sbjct: 152  GKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLG 211

Query: 1487 LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 1308
             LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 212  HLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 271

Query: 1307 AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1128
            AFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLE
Sbjct: 272  AFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLE 331

Query: 1127 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 948
            NPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HVGCEGTVPFRS+ERIA
Sbjct: 332  NPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIA 391

Query: 947  FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 768
            FRYLISQKF+GD  E+GIIR G H KVQ VL PRVHLVPYHIDGGQPSY+IVAGLVFTPL
Sbjct: 392  FRYLISQKFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPL 451

Query: 767  SEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 588
            SEPLI     + +GLKLLTKARYS+A F GEQIVILSQVLANEVNIGYEDM+NQQV+K N
Sbjct: 452  SEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFN 511

Query: 587  GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADL 408
            G  IRNIHHL HL+D C +KYLVFEFEDNY           AS CILKDYGIPSERSADL
Sbjct: 512  GIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCILKDYGIPSERSADL 571

Query: 407  KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
             EPY+    D  A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 572  LEPYVDPIDDTQALDQGIGDSPVSNLEIGFDGLVWA 607


>ref|NP_001118544.1| DegP2 protease [Arabidopsis thaliana] gi|330255821|gb|AEC10915.1|
            DegP2 protease [Arabidopsis thaliana]
          Length = 606

 Score =  845 bits (2182), Expect = 0.0
 Identities = 422/545 (77%), Positives = 463/545 (84%), Gaps = 3/545 (0%)
 Frame = -3

Query: 1925 HHDGRKKDVGRS--QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVY 1755
            ++ GR +D   +  Q   F++    +K+KK    +  ++QQ +   + D +FLNAVVKVY
Sbjct: 62   NYPGRVRDESSNPPQKMAFKAFGSPKKEKKESLSDFSRDQQTDPAKIHDASFLNAVVKVY 121

Query: 1754 CTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKV 1575
            CTHTAPDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKV
Sbjct: 122  CTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKV 181

Query: 1574 LARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRI 1395
            L RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRI
Sbjct: 182  LVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRI 241

Query: 1394 EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTT 1215
            EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTT
Sbjct: 242  EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTT 301

Query: 1214 VVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVL 1035
            VVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VL
Sbjct: 302  VVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVL 361

Query: 1034 KQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVL 855
            K+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKF+GD  E+GIIR G H KVQ VL
Sbjct: 362  KEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFAGDIAEIGIIRAGEHKKVQVVL 421

Query: 854  NPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGE 675
             PRVHLVPYHIDGGQPSY+IVAGLVFTPLSEPLI     + +GLKLLTKARYS+A F GE
Sbjct: 422  RPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGE 481

Query: 674  QIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYX 495
            QIVILSQVLANEVNIGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY 
Sbjct: 482  QIVILSQVLANEVNIGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYV 541

Query: 494  XXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFD 315
                      AS CILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVSN+E+GFD
Sbjct: 542  AVLEREASNSASLCILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVSNLEIGFD 601

Query: 314  GLLWA 300
            GL+WA
Sbjct: 602  GLVWA 606


>pdb|4FLN|A Chain A, Crystal Structure Of Plant Protease Deg2
            gi|405944959|pdb|4FLN|B Chain B, Crystal Structure Of
            Plant Protease Deg2 gi|405944960|pdb|4FLN|C Chain C,
            Crystal Structure Of Plant Protease Deg2
          Length = 539

 Score =  843 bits (2178), Expect = 0.0
 Identities = 419/531 (78%), Positives = 456/531 (85%), Gaps = 1/531 (0%)
 Frame = -3

Query: 1889 QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQK 1713
            Q   F++    +K+KK    +  ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQK
Sbjct: 9    QKMAFKAFGSPKKEKKESLSDFSRDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQK 68

Query: 1712 QRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSV 1533
            QRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSV
Sbjct: 69   QRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSV 128

Query: 1532 ENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG 1353
            E+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG
Sbjct: 129  ESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG 188

Query: 1352 IQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGK 1173
            IQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGK
Sbjct: 189  IQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGK 248

Query: 1172 YTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHV 993
            YTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HV
Sbjct: 249  YTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHV 308

Query: 992  GCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGG 813
            GCEGTVPFRS+ERIAFRYLISQKF+GD  E+GIIR G H KVQ VL PRVHLVPYHIDGG
Sbjct: 309  GCEGTVPFRSSERIAFRYLISQKFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGG 368

Query: 812  QPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVN 633
            QPSY+IVAGLVFTPLSEPLI     + +GLKLLTKARYS+A F GEQIVILSQVLANEVN
Sbjct: 369  QPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVN 428

Query: 632  IGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPC 453
            IGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY           AS C
Sbjct: 429  IGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLC 488

Query: 452  ILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            ILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 489  ILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVSNLEIGFDGLVWA 539


>ref|XP_004148888.1| PREDICTED: protease Do-like 2, chloroplastic-like [Cucumis sativus]
            gi|449491511|ref|XP_004158921.1| PREDICTED: protease
            Do-like 2, chloroplastic-like [Cucumis sativus]
          Length = 623

 Score =  841 bits (2172), Expect = 0.0
 Identities = 423/538 (78%), Positives = 464/538 (86%), Gaps = 1/538 (0%)
 Frame = -3

Query: 1910 KKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDY 1731
            +++ GR Q+  ++S  +QRKDKK     + E Q E+GNLQ  AFLNAVVKVYCTHTAPDY
Sbjct: 87   QRNSGRVQTEAYKSFGMQRKDKKELVNAI-EDQVESGNLQGAAFLNAVVKVYCTHTAPDY 145

Query: 1730 SLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCD 1551
            SLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVK+RGDDTKYVAKVLARG+DCD
Sbjct: 146  SLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKKRGDDTKYVAKVLARGVDCD 205

Query: 1550 IALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHG 1371
            IALLSVENEEFWKG EPL+FG LP LQD+VTVVGYPLGGDTISVT+GVVSRIEVTSYAHG
Sbjct: 206  IALLSVENEEFWKGAEPLKFGNLPCLQDAVTVVGYPLGGDTISVTRGVVSRIEVTSYAHG 265

Query: 1370 SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTD 1191
            SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL D
Sbjct: 266  SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEEVENIGYVIPTTVVSHFLND 325

Query: 1190 YERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVD 1011
            YERN KYTGFP LGVLLQKLENPALR+CL+V SNEGVLVRRVEPTSDA  VLK+GDVIV 
Sbjct: 326  YERNRKYTGFPSLGVLLQKLENPALRACLRVKSNEGVLVRRVEPTSDANKVLKEGDVIVS 385

Query: 1010 FDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVP 831
            FD + VGCEGTVPFR+ ERIAFRYLISQKF+GD  ELGIIR+G  IK + +LNPRVHLVP
Sbjct: 386  FDDIKVGCEGTVPFRTNERIAFRYLISQKFAGDVAELGIIRSGELIKAKVILNPRVHLVP 445

Query: 830  YHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQV 651
            +HIDGGQPSYLI+AGLVFTPLSEPLI     + +GLKLL KARYSLA F GEQIVILSQV
Sbjct: 446  FHIDGGQPSYLIIAGLVFTPLSEPLIDEECEDSIGLKLLAKARYSLASFKGEQIVILSQV 505

Query: 650  LANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXX 471
            LANEVNIGYEDM NQQV+KLNGTRIRNIHHLTHLVD+C +KYLVFEFE+NY         
Sbjct: 506  LANEVNIGYEDMGNQQVLKLNGTRIRNIHHLTHLVDTCKDKYLVFEFEENYIAVLEREAA 565

Query: 470  XXASPCILKDYGIPSERSADLKEPYLYTFGDNNA-IDQAPGDSPVSNMEMGFDGLLWA 300
              AS CIL+DYGIPSERS+DL EPY+    D    + Q  GDSPVSN E+GF+GLLWA
Sbjct: 566  IAASSCILRDYGIPSERSSDLLEPYVDISEDEKGMVVQNYGDSPVSNAEIGFEGLLWA 623


>ref|XP_006366368.1| PREDICTED: protease Do-like 2, chloroplastic-like [Solanum tuberosum]
          Length = 621

 Score =  838 bits (2165), Expect = 0.0
 Identities = 427/585 (72%), Positives = 481/585 (82%), Gaps = 4/585 (0%)
 Frame = -3

Query: 2042 RALHCNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGR--KKDVGRSQSSGFQS 1869
            +A+H   SSS+         V  + F     DERH +  + DGR  K +  RS+S+ F+ 
Sbjct: 40   KAVH-QKSSSSPHHPPSQKAVGKQNFIWRSKDERHLA--NKDGRSSKNETERSKSTAFKF 96

Query: 1868 LVIQRKDK-KAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTST 1692
              +QRK   K   +E KE Q E G ++D  FLNAVVKV+CTHTAPDYSLPWQKQRQ+ ST
Sbjct: 97   SGLQRKGSGKGVPFESKEPQVETGIIEDATFLNAVVKVFCTHTAPDYSLPWQKQRQFAST 156

Query: 1691 GSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWK 1512
            GSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG++CDIALLSVE+++FWK
Sbjct: 157  GSAFMIGDGKLLTNAHCVEHGTQVKVKRRGDDTKYVAKVLARGVECDIALLSVESKDFWK 216

Query: 1511 GVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAI 1332
            G EPLRFG LPHLQD+VTVVGYPLGGDTISVTKGVVSR+EVTSYAHGSS+LLGIQIDAAI
Sbjct: 217  GAEPLRFGHLPHLQDAVTVVGYPLGGDTISVTKGVVSRVEVTSYAHGSSELLGIQIDAAI 276

Query: 1331 NPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCL 1152
            NPGNSGGPAFND GECIGVAFQV RS+D ENIGYVIPTTVVSHFL DYERNGKY+GFPCL
Sbjct: 277  NPGNSGGPAFNDDGECIGVAFQVYRSDDVENIGYVIPTTVVSHFLEDYERNGKYSGFPCL 336

Query: 1151 GVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVP 972
            GV+LQKLENPALR+CL+V SNEG+LVR++EPTSD  NV+K+GDVIV FDGV VGCEGTVP
Sbjct: 337  GVMLQKLENPALRACLRVPSNEGILVRKIEPTSDVSNVVKEGDVIVSFDGVRVGCEGTVP 396

Query: 971  FRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIV 792
            FRS+ERIAFRYLISQKF+GD  ELGIIR G  +KVQ VL PRVHLVPYHI+GGQPSYLIV
Sbjct: 397  FRSSERIAFRYLISQKFTGDVAELGIIRAGELLKVQAVLKPRVHLVPYHIEGGQPSYLIV 456

Query: 791  AGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMS 612
            AGLVFTPLSEPLI     + +GLKLL KARYS A F GEQIVILSQVLANEVNIGYED+S
Sbjct: 457  AGLVFTPLSEPLIEEECEDTIGLKLLIKARYSFAKFEGEQIVILSQVLANEVNIGYEDLS 516

Query: 611  NQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGI 432
            N+QV+KLNGTRI+NIHHL HLVDSC +KYLVFEFEDN+           AS  IL DYGI
Sbjct: 517  NEQVLKLNGTRIKNIHHLAHLVDSCKDKYLVFEFEDNFLVVLEREAASSASSSILIDYGI 576

Query: 431  PSERSADLKEPYLYTFGDNNAIDQAP-GDSPVSNMEMGFDGLLWA 300
            P+ERS+DL EPY+ + G + A DQ   GDSPVSN E G+DGLLWA
Sbjct: 577  PAERSSDLLEPYVDSIGPDEATDQHEFGDSPVSNSEFGYDGLLWA 621


>ref|XP_002882138.1| hypothetical protein ARALYDRAFT_483986 [Arabidopsis lyrata subsp.
            lyrata] gi|297327977|gb|EFH58397.1| hypothetical protein
            ARALYDRAFT_483986 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  837 bits (2163), Expect = 0.0
 Identities = 423/552 (76%), Positives = 463/552 (83%), Gaps = 10/552 (1%)
 Frame = -3

Query: 1925 HHDGRKKDVGRS--QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVY 1755
            ++ GR +D   +  Q   F++    +K+KK    +  ++QQ + G + D +FLNAVVKVY
Sbjct: 62   NYPGRVRDDSPNPPQKMAFKAFGSPKKEKKEPLSDFSRDQQTDPGKIHDASFLNAVVKVY 121

Query: 1754 CTHTAPDYSLPWQKQRQYTSTGS-------AFMIGNGMLITNAHCVEHDTQVKVKRRGDD 1596
            CTHTAPDYSLPWQKQRQ+TSTG        AFMIG+G L+TNAHCVEHDTQVKVKRRGDD
Sbjct: 122  CTHTAPDYSLPWQKQRQFTSTGRHVFFIHIAFMIGDGKLLTNAHCVEHDTQVKVKRRGDD 181

Query: 1595 TKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVT 1416
             KYVAKVL RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVT
Sbjct: 182  RKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVT 241

Query: 1415 KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENI 1236
            KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENI
Sbjct: 242  KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENI 301

Query: 1235 GYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPT 1056
            GYVIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPT
Sbjct: 302  GYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPT 361

Query: 1055 SDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAH 876
            SDA  VLK+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKF+GD  ELGIIR G H
Sbjct: 362  SDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFAGDIAELGIIRAGEH 421

Query: 875  IKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYS 696
             KVQ VL PRVHLVPYHIDGGQPSY+IVAGLVFTPLSEPLI     + +GLKLLTKARYS
Sbjct: 422  KKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYS 481

Query: 695  LAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVF 516
            +A F GEQIVILSQVLANEVNIGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVF
Sbjct: 482  VARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVF 541

Query: 515  EFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVS 336
            EFEDNY           AS CILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVS
Sbjct: 542  EFEDNYVAVLEREASNSASLCILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVS 601

Query: 335  NMEMGFDGLLWA 300
            N+E+GFDGL+WA
Sbjct: 602  NLEIGFDGLVWA 613


>ref|XP_004247469.1| PREDICTED: protease Do-like 2, chloroplastic-like [Solanum
            lycopersicum]
          Length = 621

 Score =  832 bits (2150), Expect = 0.0
 Identities = 421/560 (75%), Positives = 469/560 (83%), Gaps = 4/560 (0%)
 Frame = -3

Query: 1967 FFGEPTDERHTSFGHHDGR--KKDVGRSQSSGFQSLVIQRKDK-KAFAYELKEQQAEAGN 1797
            F     DERH +  ++DGR  K + GRS+S+ F+   +QRK   K   +E KE Q E G 
Sbjct: 64   FIWRSKDERHLA--NNDGRSSKNETGRSKSTAFKFSGLQRKGSGKGAPFESKEPQVETGF 121

Query: 1796 LQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVK 1617
            ++D  FLNAVVKV+CTHTAPDYSLPWQKQRQ+ STGSAFMIG+G L+TNAHCVEH TQVK
Sbjct: 122  IEDAPFLNAVVKVFCTHTAPDYSLPWQKQRQFASTGSAFMIGDGKLLTNAHCVEHGTQVK 181

Query: 1616 VKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLG 1437
            VKRRGDDTKYVAKVLARG++CDIALLSVE+++FWKG EPL FG LPHLQD+VTVVGYPLG
Sbjct: 182  VKRRGDDTKYVAKVLARGVECDIALLSVESKDFWKGAEPLCFGHLPHLQDAVTVVGYPLG 241

Query: 1436 GDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLR 1257
            GDTISVTKGVVSR+EVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND GECIGVAFQV R
Sbjct: 242  GDTISVTKGVVSRVEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDDGECIGVAFQVYR 301

Query: 1256 SEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVL 1077
            S+D ENIGYVIP  VVSHFL DYERNGKY+GFPCLGVLLQKLENPALR+CL+V SNEGVL
Sbjct: 302  SDDVENIGYVIPAMVVSHFLEDYERNGKYSGFPCLGVLLQKLENPALRACLRVPSNEGVL 361

Query: 1076 VRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELG 897
            VR++EPTSD  NV+K+GDVIV FDGV VGCEGTVPFRS+ERIAFRYLISQKF+GD  ELG
Sbjct: 362  VRKIEPTSDVSNVVKEGDVIVSFDGVRVGCEGTVPFRSSERIAFRYLISQKFTGDVAELG 421

Query: 896  IIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKL 717
            IIR G  +KVQ VL PRVHLVPYHI+GGQPSYLIVAGLVFTPLSEPLI     + +GLKL
Sbjct: 422  IIRAGEFLKVQAVLKPRVHLVPYHIEGGQPSYLIVAGLVFTPLSEPLIEEECEDTIGLKL 481

Query: 716  LTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSC 537
            L KARYS A F GEQIVILSQVLANEVNIGYED+SN+QV+KLNGTRI+NIHHL HLVDSC
Sbjct: 482  LIKARYSFAKFEGEQIVILSQVLANEVNIGYEDLSNEQVLKLNGTRIKNIHHLAHLVDSC 541

Query: 536  TEKYLVFEFEDNYXXXXXXXXXXXASPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQA 357
             +KYLVFEFEDN+           AS  IL DYGIP+ERS+DL EPY+ + G   A DQ 
Sbjct: 542  KDKYLVFEFEDNFLVALEREAASSASSSILIDYGIPAERSSDLLEPYVDSIGPYEATDQH 601

Query: 356  P-GDSPVSNMEMGFDGLLWA 300
              GDSPVSN E G+DGLLWA
Sbjct: 602  EFGDSPVSNSEFGYDGLLWA 621


>ref|XP_006855396.1| hypothetical protein AMTR_s00057p00143260 [Amborella trichopoda]
            gi|548859162|gb|ERN16863.1| hypothetical protein
            AMTR_s00057p00143260 [Amborella trichopoda]
          Length = 528

 Score =  822 bits (2123), Expect = 0.0
 Identities = 413/527 (78%), Positives = 459/527 (87%), Gaps = 1/527 (0%)
 Frame = -3

Query: 1877 FQSLVIQRKDKKAFAYELKEQQA-EAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQY 1701
            F+SL +QRK+K A  ++LKEQQ  EA  LQD AFLNAVVKVYCTHTAPDYSLPWQKQRQ+
Sbjct: 3    FKSLGMQRKEK-AIVHDLKEQQINEASTLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQF 61

Query: 1700 TSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEE 1521
            TSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG++CDIALL VE+EE
Sbjct: 62   TSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLARGVECDIALLYVESEE 121

Query: 1520 FWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQID 1341
            FWKG +PL+FG LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHG+SDLLGIQID
Sbjct: 122  FWKGADPLKFGRLPCLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGASDLLGIQID 181

Query: 1340 AAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGF 1161
            AAINPGNSGGPAFNDQGECIGVAFQV RS++ ENIGYVIPTTVVSHFLTDYERNGKYTGF
Sbjct: 182  AAINPGNSGGPAFNDQGECIGVAFQVFRSDEAENIGYVIPTTVVSHFLTDYERNGKYTGF 241

Query: 1160 PCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEG 981
            P LGVLLQKLENPALR+CLKV SNEGVLVRR+EPT+ A++ LK+GDVIV FDG+ VGCEG
Sbjct: 242  PSLGVLLQKLENPALRACLKVNSNEGVLVRRIEPTAAAHDALKEGDVIVSFDGIPVGCEG 301

Query: 980  TVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSY 801
            TVPFRSTERIAFRYLISQKF+GD  ELGIIR GAH+KV+T+L PRVHLVPYHI+GGQPSY
Sbjct: 302  TVPFRSTERIAFRYLISQKFAGDTAELGIIRGGAHMKVKTLLYPRVHLVPYHIEGGQPSY 361

Query: 800  LIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYE 621
            LI+AGLVFTPLSEPLI     + MGLKLL KARYSLA F GEQIV+LSQVLANE NIGYE
Sbjct: 362  LIIAGLVFTPLSEPLIDEECEDSMGLKLLAKARYSLAKFKGEQIVLLSQVLANEANIGYE 421

Query: 620  DMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXASPCILKD 441
            DM NQQV+K NGT+I+NI HL HLVD+C ++YL+FEFEDN+           ASP ILKD
Sbjct: 422  DMGNQQVLKFNGTKIKNIRHLAHLVDTCKDEYLIFEFEDNFLAVLDREAASIASPRILKD 481

Query: 440  YGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            YGIP ERS++L E YL +  D+ A+     D P SN+E+GFDGLLWA
Sbjct: 482  YGIPFERSSNLAELYLDSSEDDLALSGDLDDIPASNLEIGFDGLLWA 528


>ref|XP_003520225.1| PREDICTED: protease Do-like 2, chloroplastic-like [Glycine max]
          Length = 612

 Score =  819 bits (2116), Expect = 0.0
 Identities = 430/597 (72%), Positives = 476/597 (79%), Gaps = 13/597 (2%)
 Frame = -3

Query: 2051 PSFRALHCNN----------SSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKD 1902
            P   + HCNN          SSS++ + R+ +   HK    +  DER          + +
Sbjct: 29   PIVASFHCNNHPLRVSSSSSSSSSSKSNRKKEGAGHKK---QSKDERPA--------RGN 77

Query: 1901 VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLP 1722
            V  SQ +  +   IQRK+K    ++ K+QQ E   LQD+AFLNAVVKVYCTHTAPDYSLP
Sbjct: 78   VLESQPTSSKPFGIQRKNKDLI-FDSKDQQVEQSILQDSAFLNAVVKVYCTHTAPDYSLP 136

Query: 1721 WQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIAL 1542
            WQKQRQYTSTGSAFMIG+  L+TNAHCVEHDTQVKVK+RGDD+KYVAKVLARG+DCDIAL
Sbjct: 137  WQKQRQYTSTGSAFMIGDRKLLTNAHCVEHDTQVKVKKRGDDSKYVAKVLARGVDCDIAL 196

Query: 1541 LSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD 1362
            LSVE+EEFW+ VEPLR G LPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD
Sbjct: 197  LSVESEEFWRDVEPLRLGRLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD 256

Query: 1361 LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYER 1182
            LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSE+ ENIGYVIPTTVVSHFLTDYER
Sbjct: 257  LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEEAENIGYVIPTTVVSHFLTDYER 316

Query: 1181 NGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDG 1002
            NG+YTGFPCLGVL+QKLENPALR+ LKV SNEGVLVRRVEPTSDA NVLK+GDVIV FD 
Sbjct: 317  NGRYTGFPCLGVLIQKLENPALRAWLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDD 376

Query: 1001 VHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHI 822
            V VG EGTVPFRS ERIAF +LISQKF+GD  ELGIIR G  +K + VLN RVHLVPYHI
Sbjct: 377  VRVGSEGTVPFRSNERIAFHFLISQKFAGDTAELGIIRAGTLMKTKVVLNSRVHLVPYHI 436

Query: 821  DGGQPSYLIVAGLVFTPLSEPLIXXXXXEVMGLKLLTKARYSLAHFIGEQIVILSQVLAN 642
            D G PSYLI+AGLVFTPLSEPLI     + +GLKLL +ARYSLA F GEQIVILSQVLAN
Sbjct: 437  DEGLPSYLIIAGLVFTPLSEPLIEEECEDSIGLKLLARARYSLAKFKGEQIVILSQVLAN 496

Query: 641  EVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXA 462
            EVNIGYEDM NQQV+K NG RI+NIHHL HL+DSC ++YL FEFED+Y           A
Sbjct: 497  EVNIGYEDMGNQQVVKFNGARIKNIHHLAHLIDSCEDRYLRFEFEDSYVAVLEKEAVAAA 556

Query: 461  SPCILKDYGIPSERSADLKEPYLYTF---GDNNAIDQAPGDSPVSNMEMGFDGLLWA 300
            SP +L DYGIPSERS+DL +PY+ T    GD  A DQ  GDSPVSN E G DGLLWA
Sbjct: 557  SPSVLSDYGIPSERSSDLSKPYVDTLEVEGDQPA-DQEFGDSPVSNYEFGPDGLLWA 612


Top