BLASTX nr result

ID: Achyranthes23_contig00008334 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00008334
         (2222 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270247.1| PREDICTED: protease Do-like 2, chloroplastic...   877   0.0  
gb|EOX94933.1| DEGP protease 2 isoform 1 [Theobroma cacao]            875   0.0  
gb|EMJ01505.1| hypothetical protein PRUPE_ppa002853mg [Prunus pe...   872   0.0  
gb|EOX94934.1| DEGP protease 2 isoform 2 [Theobroma cacao]            871   0.0  
emb|CBI32271.3| unnamed protein product [Vitis vinifera]              871   0.0  
ref|XP_006444216.1| hypothetical protein CICLE_v10019366mg [Citr...   865   0.0  
ref|XP_006479864.1| PREDICTED: protease Do-like 2, chloroplastic...   863   0.0  
ref|XP_006397989.1| hypothetical protein EUTSA_v10001363mg [Eutr...   858   0.0  
ref|XP_002520690.1| serine endopeptidase degp2, putative [Ricinu...   853   0.0  
ref|XP_004290719.1| PREDICTED: protease Do-like 2, chloroplastic...   846   0.0  
ref|XP_006295823.1| hypothetical protein CARUB_v10024950mg [Caps...   846   0.0  
ref|NP_566115.1| DegP2 protease [Arabidopsis thaliana] gi|752202...   845   0.0  
ref|NP_001118544.1| DegP2 protease [Arabidopsis thaliana] gi|330...   845   0.0  
pdb|4FLN|A Chain A, Crystal Structure Of Plant Protease Deg2 gi|...   843   0.0  
ref|XP_004148888.1| PREDICTED: protease Do-like 2, chloroplastic...   841   0.0  
ref|XP_006366368.1| PREDICTED: protease Do-like 2, chloroplastic...   838   0.0  
ref|XP_002882138.1| hypothetical protein ARALYDRAFT_483986 [Arab...   837   0.0  
ref|XP_004247469.1| PREDICTED: protease Do-like 2, chloroplastic...   832   0.0  
ref|XP_006855396.1| hypothetical protein AMTR_s00057p00143260 [A...   822   0.0  
ref|XP_003520225.1| PREDICTED: protease Do-like 2, chloroplastic...   819   0.0  

>ref|XP_002270247.1| PREDICTED: protease Do-like 2, chloroplastic-like [Vitis vinifera]
          Length = 606

 Score =  877 bits (2267), Expect = 0.0
 Identities = 449/576 (77%), Positives = 487/576 (84%), Gaps = 5/576 (0%)
 Frame = +1

Query: 211  NAFTCREADTVSHKLFFGEPTDERHT--SFGHHDGRKKDVGRSQSSGFQSLVIQ--RKDK 378
            + F+CR A     +   G  +        FG   G + +  R+QSS F+S   Q  RKDK
Sbjct: 32   STFSCRSAPKAISRSNKGASSSPNKPPKQFGGGSG-EDEKRRTQSSPFKSFGAQSQRKDK 90

Query: 379  KAFAYELKEQQ-AEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 555
            K  + +LKEQQ  E GNLQD AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAF+IG+
Sbjct: 91   KGVSSDLKEQQQVETGNLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFIIGD 150

Query: 556  GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 735
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARGI+CDIALLSVE+EEFWKG EPL FG
Sbjct: 151  GKLLTNAHCVEHATQVKVKRRGDDTKYVAKVLARGIECDIALLSVESEEFWKGTEPLNFG 210

Query: 736  LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 915
             LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 211  RLPRLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 270

Query: 916  AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1095
            AFNDQGECIGVAFQV RSED ENIGYVIPTTVVSHFL DYERNGKYTGFPCLGVLLQKLE
Sbjct: 271  AFNDQGECIGVAFQVFRSEDVENIGYVIPTTVVSHFLDDYERNGKYTGFPCLGVLLQKLE 330

Query: 1096 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 1275
            NPALRSCLKV SNEGVLVRRVEPTSDA NVLK+GDVIV FDGVHVGCEGTVPFRSTERIA
Sbjct: 331  NPALRSCLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDGVHVGCEGTVPFRSTERIA 390

Query: 1276 FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 1455
            FRYLISQKF+GD VE+GIIR GA +KVQ VL+PRVHLVPYHI+GGQPSYLI++GLVFTPL
Sbjct: 391  FRYLISQKFTGDVVEVGIIRAGAFMKVQVVLDPRVHLVPYHIEGGQPSYLIISGLVFTPL 450

Query: 1456 SEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 1635
            SEPLI       +GLKLLTKARYSLA F GEQIVILSQVLANEVNIGYE+MSNQQV+K N
Sbjct: 451  SEPLIEEECEDTIGLKLLTKARYSLARFKGEQIVILSQVLANEVNIGYENMSNQQVLKFN 510

Query: 1636 GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADL 1815
            GT I+NIHHL HL+DSC +KYLVFEFEDNY            SPCILKDYGIPSERS+DL
Sbjct: 511  GTWIKNIHHLAHLIDSCKDKYLVFEFEDNYLAVLEREAAAAASPCILKDYGIPSERSSDL 570

Query: 1816 KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
             +PY+ + GDN +I+Q  GD PVSN+E+G DGLLWA
Sbjct: 571  LKPYMDSLGDNRSINQDFGDIPVSNLEIGSDGLLWA 606


>gb|EOX94933.1| DEGP protease 2 isoform 1 [Theobroma cacao]
          Length = 633

 Score =  875 bits (2262), Expect = 0.0
 Identities = 440/577 (76%), Positives = 484/577 (83%)
 Frame = +1

Query: 193  CNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRK 372
            C+++S   F  ++ D VS K   G   DE+ + +      + D+GR QS+GF+S   QRK
Sbjct: 58   CSSTSPRKFNVKK-DPVSQKKLPGRSKDEKSSLYADGISGRGDMGRPQSTGFKSFGTQRK 116

Query: 373  DKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG 552
            D++ F  +L+EQQ E GNLQD  FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG
Sbjct: 117  DREEFQLDLREQQVEPGNLQDATFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIG 176

Query: 553  NGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRF 732
            +G L+TNAHCVEHDTQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE++EFW+G EPLR 
Sbjct: 177  DGKLLTNAHCVEHDTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESKEFWRGAEPLRL 236

Query: 733  GLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 912
            G LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG
Sbjct: 237  GHLPGLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 296

Query: 913  PAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKL 1092
            PAFN+QGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKL
Sbjct: 297  PAFNEQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKL 356

Query: 1093 ENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERI 1272
            ENPALR+CL V SNEGVLVRRVEPTSDA NVLK+GDVIV FD VHVG EGTVPFRS ERI
Sbjct: 357  ENPALRACLHVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDDVHVGSEGTVPFRSNERI 416

Query: 1273 AFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTP 1452
            AFRYLISQKF+GD  ELGI+R G  +KVQ VLN RVHLVPYHIDGGQPSYLI+AGLVFTP
Sbjct: 417  AFRYLISQKFAGDVAELGIVRAGRFMKVQVVLNRRVHLVPYHIDGGQPSYLIIAGLVFTP 476

Query: 1453 LSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKL 1632
            LSEPLI       +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDM NQQV+K 
Sbjct: 477  LSEPLIEEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMGNQQVLKF 536

Query: 1633 NGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSAD 1812
            NG RI+NIHHL HLV  C +KYLVFEFEDNY            S  ILKDYGIPSE+S D
Sbjct: 537  NGIRIKNIHHLAHLVACCKDKYLVFEFEDNYLAVLEREAAMAASSRILKDYGIPSEKSDD 596

Query: 1813 LKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            L EPY+ + GDN AI+Q  GDSPVSN+E+GF+GLLWA
Sbjct: 597  LLEPYVDSLGDNQAIEQDYGDSPVSNLEIGFEGLLWA 633


>gb|EMJ01505.1| hypothetical protein PRUPE_ppa002853mg [Prunus persica]
          Length = 628

 Score =  872 bits (2252), Expect = 0.0
 Identities = 443/576 (76%), Positives = 490/576 (85%)
 Frame = +1

Query: 196  NNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKD 375
            ++SSS+A +  E + V +KL       +R +  G   G+K   G+SQ + ++S   QRK+
Sbjct: 61   SSSSSSAKSQPEKEAVPNKL---SGNGDRWSVTGR--GKK---GQSQPTAYRSFGTQRKE 112

Query: 376  KKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 555
            KK FA + KEQQ E  +LQD  FLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+
Sbjct: 113  KKEFAVDQKEQQVEPRSLQDADFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGD 172

Query: 556  GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 735
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE+EEFWKG EPL+ G
Sbjct: 173  GKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESEEFWKGAEPLQLG 232

Query: 736  LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 915
             LPHLQ++VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 233  SLPHLQEAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 292

Query: 916  AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1095
            AFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL DYERNG+YTGFPCLGVLLQKLE
Sbjct: 293  AFNDQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLDDYERNGRYTGFPCLGVLLQKLE 352

Query: 1096 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 1275
            NPALR+CLKV S EGVLVRRVEPTSDA+NVLK+GDVIV FD VHVGCEGTVPFRS ERIA
Sbjct: 353  NPALRACLKVESIEGVLVRRVEPTSDAHNVLKEGDVIVSFDDVHVGCEGTVPFRSNERIA 412

Query: 1276 FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 1455
            FRYLISQKF+GD  +LGIIR G   KV+ VLNPRVHLVP+HIDGGQPSYLI+AGLVFTPL
Sbjct: 413  FRYLISQKFAGDVSDLGIIRAGEFKKVKAVLNPRVHLVPFHIDGGQPSYLIIAGLVFTPL 472

Query: 1456 SEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 1635
            SEPLI       +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDMSNQQV+KLN
Sbjct: 473  SEPLIDEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMSNQQVLKLN 532

Query: 1636 GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADL 1815
            GT+IRNIHHL +LVDSC +KYLVFEFEDNY            S CILKDYGIPSERS+DL
Sbjct: 533  GTQIRNIHHLAYLVDSCKDKYLVFEFEDNYITVLEREAATAASSCILKDYGIPSERSSDL 592

Query: 1816 KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
             EPY+ + GDN A++Q  GDSPVSN+E+GFDG++WA
Sbjct: 593  LEPYVDSLGDNQAVNQDIGDSPVSNLEIGFDGIIWA 628


>gb|EOX94934.1| DEGP protease 2 isoform 2 [Theobroma cacao]
          Length = 634

 Score =  871 bits (2250), Expect = 0.0
 Identities = 440/578 (76%), Positives = 484/578 (83%), Gaps = 1/578 (0%)
 Frame = +1

Query: 193  CNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRK 372
            C+++S   F  ++ D VS K   G   DE+ + +      + D+GR QS+GF+S   QRK
Sbjct: 58   CSSTSPRKFNVKK-DPVSQKKLPGRSKDEKSSLYADGISGRGDMGRPQSTGFKSFGTQRK 116

Query: 373  DKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG 552
            D++ F  +L+EQQ E GNLQD  FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG
Sbjct: 117  DREEFQLDLREQQVEPGNLQDATFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIG 176

Query: 553  NGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRF 732
            +G L+TNAHCVEHDTQVKVKRRGDDTKYVAKVLARG+DCDIALLSVE++EFW+G EPLR 
Sbjct: 177  DGKLLTNAHCVEHDTQVKVKRRGDDTKYVAKVLARGVDCDIALLSVESKEFWRGAEPLRL 236

Query: 733  GLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 912
            G LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG
Sbjct: 237  GHLPGLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGG 296

Query: 913  PAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKL 1092
            PAFN+QGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKL
Sbjct: 297  PAFNEQGECIGVAFQVYRSEEAENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKL 356

Query: 1093 ENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERI 1272
            ENPALR+CL V SNEGVLVRRVEPTSDA NVLK+GDVIV FD VHVG EGTVPFRS ERI
Sbjct: 357  ENPALRACLHVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDDVHVGSEGTVPFRSNERI 416

Query: 1273 AFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTP 1452
            AFRYLISQKF+GD  ELGI+R G  +KVQ VLN RVHLVPYHIDGGQPSYLI+AGLVFTP
Sbjct: 417  AFRYLISQKFAGDVAELGIVRAGRFMKVQVVLNRRVHLVPYHIDGGQPSYLIIAGLVFTP 476

Query: 1453 LSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSN-QQVMK 1629
            LSEPLI       +GLKLL KARYSLA F GEQIVILSQVLANEVNIGYEDM N QQV+K
Sbjct: 477  LSEPLIEEECEDSIGLKLLAKARYSLARFKGEQIVILSQVLANEVNIGYEDMGNQQQVLK 536

Query: 1630 LNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSA 1809
             NG RI+NIHHL HLV  C +KYLVFEFEDNY            S  ILKDYGIPSE+S 
Sbjct: 537  FNGIRIKNIHHLAHLVACCKDKYLVFEFEDNYLAVLEREAAMAASSRILKDYGIPSEKSD 596

Query: 1810 DLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            DL EPY+ + GDN AI+Q  GDSPVSN+E+GF+GLLWA
Sbjct: 597  DLLEPYVDSLGDNQAIEQDYGDSPVSNLEIGFEGLLWA 634


>emb|CBI32271.3| unnamed protein product [Vitis vinifera]
          Length = 612

 Score =  871 bits (2250), Expect = 0.0
 Identities = 449/582 (77%), Positives = 487/582 (83%), Gaps = 11/582 (1%)
 Frame = +1

Query: 211  NAFTCREADTVSHKLFFGEPTDERHT--SFGHHDGRKKDVGRSQSSGFQSLVIQ--RKDK 378
            + F+CR A     +   G  +        FG   G + +  R+QSS F+S   Q  RKDK
Sbjct: 32   STFSCRSAPKAISRSNKGASSSPNKPPKQFGGGSG-EDEKRRTQSSPFKSFGAQSQRKDK 90

Query: 379  KAFAYELKEQQ-AEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 555
            K  + +LKEQQ  E GNLQD AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAF+IG+
Sbjct: 91   KGVSSDLKEQQQVETGNLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFIIGD 150

Query: 556  GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 735
            G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARGI+CDIALLSVE+EEFWKG EPL FG
Sbjct: 151  GKLLTNAHCVEHATQVKVKRRGDDTKYVAKVLARGIECDIALLSVESEEFWKGTEPLNFG 210

Query: 736  LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 915
             LP LQD+VTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 211  RLPRLQDAVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 270

Query: 916  AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1095
            AFNDQGECIGVAFQV RSED ENIGYVIPTTVVSHFL DYERNGKYTGFPCLGVLLQKLE
Sbjct: 271  AFNDQGECIGVAFQVFRSEDVENIGYVIPTTVVSHFLDDYERNGKYTGFPCLGVLLQKLE 330

Query: 1096 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 1275
            NPALRSCLKV SNEGVLVRRVEPTSDA NVLK+GDVIV FDGVHVGCEGTVPFRSTERIA
Sbjct: 331  NPALRSCLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDGVHVGCEGTVPFRSTERIA 390

Query: 1276 FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 1455
            FRYLISQKF+GD VE+GIIR GA +KVQ VL+PRVHLVPYHI+GGQPSYLI++GLVFTPL
Sbjct: 391  FRYLISQKFTGDVVEVGIIRAGAFMKVQVVLDPRVHLVPYHIEGGQPSYLIISGLVFTPL 450

Query: 1456 SEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQ----- 1620
            SEPLI       +GLKLLTKARYSLA F GEQIVILSQVLANEVNIGYE+MSNQQ     
Sbjct: 451  SEPLIEEECEDTIGLKLLTKARYSLARFKGEQIVILSQVLANEVNIGYENMSNQQASNNL 510

Query: 1621 -VMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPS 1797
             V+K NGT I+NIHHL HL+DSC +KYLVFEFEDNY            SPCILKDYGIPS
Sbjct: 511  NVLKFNGTWIKNIHHLAHLIDSCKDKYLVFEFEDNYLAVLEREAAAAASPCILKDYGIPS 570

Query: 1798 ERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            ERS+DL +PY+ + GDN +I+Q  GD PVSN+E+G DGLLWA
Sbjct: 571  ERSSDLLKPYMDSLGDNRSINQDFGDIPVSNLEIGSDGLLWA 612


>ref|XP_006444216.1| hypothetical protein CICLE_v10019366mg [Citrus clementina]
            gi|557546478|gb|ESR57456.1| hypothetical protein
            CICLE_v10019366mg [Citrus clementina]
          Length = 606

 Score =  865 bits (2235), Expect = 0.0
 Identities = 437/555 (78%), Positives = 473/555 (85%), Gaps = 4/555 (0%)
 Frame = +1

Query: 271  TDERHTSFGHHDGRKKD----VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDT 438
            T +  T+     GR KD      RSQS+ F+S   QRKDKK F ++ KEQ +E+GNLQD 
Sbjct: 52   TSKSSTTDRKFPGRSKDGKGETERSQSTAFKSFGAQRKDKKEFQFDSKEQLSESGNLQDA 111

Query: 439  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRR 618
            AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRR
Sbjct: 112  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRR 171

Query: 619  GDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTI 798
            GDDTKYVAKVLARG+DCDIALLSVE+EEFWK  EPL  G LP LQD+VTVVGYPLGGDTI
Sbjct: 172  GDDTKYVAKVLARGVDCDIALLSVESEEFWKDAEPLCLGHLPRLQDAVTVVGYPLGGDTI 231

Query: 799  SVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDT 978
            SVTKGVVSRIEVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND+GECIGVAFQV RSE+ 
Sbjct: 232  SVTKGVVSRIEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDKGECIGVAFQVYRSEEV 291

Query: 979  ENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRV 1158
            ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRRV
Sbjct: 292  ENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKLENPALRTCLKVPSNEGVLVRRV 351

Query: 1159 EPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRN 1338
            EPTSDA N+LK+GDVIV FD V VG EGTVPFRS ERIAFRYLISQKF+GD  ELGIIR 
Sbjct: 352  EPTSDANNILKEGDVIVSFDDVCVGSEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRA 411

Query: 1339 GAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKA 1518
            G  +KV+ VLNPRVHLVPYHIDGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KA
Sbjct: 412  GTFMKVKVVLNPRVHLVPYHIDGGQPSYLIIAGLVFTPLSEPLIEEECDDSIGLKLLAKA 471

Query: 1519 RYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKY 1698
            RYSLA F GEQ+VILSQVLANEV+IGYEDMSNQQV+K NGTRI+NIHHL HLVDSC +KY
Sbjct: 472  RYSLARFEGEQMVILSQVLANEVSIGYEDMSNQQVLKFNGTRIKNIHHLAHLVDSCKDKY 531

Query: 1699 LVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDS 1878
            LVFEFEDNY            S CILKDYGIPSERS+DL EPY+   G N AI+Q  GDS
Sbjct: 532  LVFEFEDNYLAVLEREAAVAASSCILKDYGIPSERSSDLLEPYVDPLGGNQAINQDSGDS 591

Query: 1879 PVSNMEMGFDGLLWA 1923
            PVS++E+GFDGL WA
Sbjct: 592  PVSDLEIGFDGLKWA 606


>ref|XP_006479864.1| PREDICTED: protease Do-like 2, chloroplastic-like [Citrus sinensis]
          Length = 606

 Score =  863 bits (2231), Expect = 0.0
 Identities = 436/555 (78%), Positives = 473/555 (85%), Gaps = 4/555 (0%)
 Frame = +1

Query: 271  TDERHTSFGHHDGRKKD----VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDT 438
            T +  T+     GR KD      RSQS+ F+S   QRKDKK F ++ KEQ +E+GNLQD 
Sbjct: 52   TSKSSTTDRKFPGRSKDGKGETERSQSTAFKSFGAQRKDKKEFQFDSKEQLSESGNLQDA 111

Query: 439  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRR 618
            AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRR
Sbjct: 112  AFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRR 171

Query: 619  GDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTI 798
            GDDTKYVAKVLARG+DCDIALLSVE+EEFWK  EPL  G LP LQD+VTVVGYPLGGDTI
Sbjct: 172  GDDTKYVAKVLARGVDCDIALLSVESEEFWKDAEPLCLGHLPRLQDAVTVVGYPLGGDTI 231

Query: 799  SVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDT 978
            SVTKGVVSRIEVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND+GECIGVAFQV RSE+ 
Sbjct: 232  SVTKGVVSRIEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDKGECIGVAFQVYRSEEV 291

Query: 979  ENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRV 1158
            ENIGYVIPTTVVSHFL+DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRRV
Sbjct: 292  ENIGYVIPTTVVSHFLSDYERNGKYTGFPCLGVLLQKLENPALRTCLKVPSNEGVLVRRV 351

Query: 1159 EPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRN 1338
            EPTSDA N+LK+GDVIV FD V VG EGTVPFRS ERIAFRYLISQKF+GD  ELGIIR 
Sbjct: 352  EPTSDANNILKEGDVIVSFDDVCVGSEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRA 411

Query: 1339 GAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKA 1518
            G  +KV+ VLNPRVHLVPYHIDGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KA
Sbjct: 412  GTFMKVKVVLNPRVHLVPYHIDGGQPSYLIIAGLVFTPLSEPLIEEECDDSIGLKLLAKA 471

Query: 1519 RYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKY 1698
            RYSLA F GEQ+VILSQVLANEV+IGYEDMSNQQV+K NGTRI+NIHHL HLVDSC +KY
Sbjct: 472  RYSLARFEGEQMVILSQVLANEVSIGYEDMSNQQVLKFNGTRIKNIHHLAHLVDSCKDKY 531

Query: 1699 LVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDS 1878
            LVFEFEDNY            S CILKDYGIPSERS+DL EP++   G N AI+Q  GDS
Sbjct: 532  LVFEFEDNYLAVLEREAAVAASSCILKDYGIPSERSSDLLEPFVDPLGGNQAINQDSGDS 591

Query: 1879 PVSNMEMGFDGLLWA 1923
            PVS++E+GFDGL WA
Sbjct: 592  PVSDLEIGFDGLKWA 606


>ref|XP_006397989.1| hypothetical protein EUTSA_v10001363mg [Eutrema salsugineum]
            gi|557099062|gb|ESQ39442.1| hypothetical protein
            EUTSA_v10001363mg [Eutrema salsugineum]
          Length = 612

 Score =  858 bits (2217), Expect = 0.0
 Identities = 427/550 (77%), Positives = 464/550 (84%)
 Frame = +1

Query: 274  DERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNA 453
            DE   S G  DG        Q+  F++    +KDKK    + ++QQ + G + D +FLNA
Sbjct: 68   DESCNSHGKGDG-----AGPQTMAFKAFGSPKKDKKEAQSDFRDQQTDPGKIHDASFLNA 122

Query: 454  VVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTK 633
            VVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD K
Sbjct: 123  VVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRK 182

Query: 634  YVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKG 813
            YVAKVL RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKG
Sbjct: 183  YVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKG 242

Query: 814  VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGY 993
            VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGY
Sbjct: 243  VVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGY 302

Query: 994  VIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSD 1173
            VIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSD
Sbjct: 303  VIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSD 362

Query: 1174 AYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIK 1353
            A  VLK+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKFSGD  ELGIIR G H K
Sbjct: 363  ASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFSGDIAELGIIRAGEHKK 422

Query: 1354 VQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLA 1533
            VQ VL PRVHLVP+HIDGGQPSY+I+AGLVFTPLSEPLI       +GLKLLTKARYS+A
Sbjct: 423  VQVVLRPRVHLVPFHIDGGQPSYIIIAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVA 482

Query: 1534 HFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEF 1713
             F GEQIVILSQVLANEVNIGYEDM+NQQV+K NGT IRNIHHL HL+D C +KYLVFEF
Sbjct: 483  RFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGTPIRNIHHLAHLIDMCKDKYLVFEF 542

Query: 1714 EDNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNM 1893
            EDNY            S CILKDYGIPSERSADL+EPY+    D  A+DQ  GDSPVSN+
Sbjct: 543  EDNYVAVLEREASDSASLCILKDYGIPSERSADLREPYIDPIDDTRALDQGFGDSPVSNL 602

Query: 1894 EMGFDGLLWA 1923
            E+GFDGL+WA
Sbjct: 603  EIGFDGLVWA 612


>ref|XP_002520690.1| serine endopeptidase degp2, putative [Ricinus communis]
            gi|223540075|gb|EEF41652.1| serine endopeptidase degp2,
            putative [Ricinus communis]
          Length = 621

 Score =  853 bits (2204), Expect = 0.0
 Identities = 426/550 (77%), Positives = 471/550 (85%), Gaps = 1/550 (0%)
 Frame = +1

Query: 277  ERHTSFGHHDGRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAV 456
            +R   +   +G K + G++QS  ++S   +RKDKK F ++  E Q E+G LQD AFLNAV
Sbjct: 72   KRSNLYSDENGGKAERGKAQSVAYKSFGTERKDKKEFQFDSNELQIESGKLQDMAFLNAV 131

Query: 457  VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKY 636
            VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKY
Sbjct: 132  VKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKY 191

Query: 637  VAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGV 816
            VAKVLARG+DCDIALLSV+++EFW+G EPL+ G LP LQD+VTVVGYPLGGDTISVTKGV
Sbjct: 192  VAKVLARGVDCDIALLSVKDKEFWEGAEPLQLGHLPRLQDAVTVVGYPLGGDTISVTKGV 251

Query: 817  VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYV 996
            VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFN+QGECIGVAFQV RSE+ ENIGYV
Sbjct: 252  VSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNEQGECIGVAFQVYRSEEAENIGYV 311

Query: 997  IPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDA 1176
            IPTTVVSHFL DYERNGKYTGFPCLGVLLQKLENPALR+CLKV SNEGVLVRR+EPTSDA
Sbjct: 312  IPTTVVSHFLNDYERNGKYTGFPCLGVLLQKLENPALRACLKVESNEGVLVRRIEPTSDA 371

Query: 1177 YNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKV 1356
             NVLK+GDVIV FD V+VGCEGTVPFRS ERIAFRYLISQKF+GD  ELGIIR G+ +KV
Sbjct: 372  NNVLKEGDVIVSFDDVNVGCEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRAGSFMKV 431

Query: 1357 QTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAH 1536
            + VLNPRVHLVPYH+DGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KARYSLA 
Sbjct: 432  KVVLNPRVHLVPYHVDGGQPSYLIIAGLVFTPLSEPLIDEECEGSIGLKLLAKARYSLAR 491

Query: 1537 FIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFE 1716
            F GEQIVILSQVLANEVNIGYEDMSNQQV+K NGTRI+NIHHL +LVDSC +KYLVFEFE
Sbjct: 492  FKGEQIVILSQVLANEVNIGYEDMSNQQVLKFNGTRIKNIHHLAYLVDSCKDKYLVFEFE 551

Query: 1717 DNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQ-APGDSPVSNM 1893
            DNY            S CIL DYGIPSERS DL +PY+ +  DN   +Q A GDSPVSN+
Sbjct: 552  DNYLAVLERQPATAASSCILTDYGIPSERSPDLLKPYVDSQVDNQLAEQDALGDSPVSNL 611

Query: 1894 EMGFDGLLWA 1923
            E+G DG+LWA
Sbjct: 612  EIGNDGILWA 621


>ref|XP_004290719.1| PREDICTED: protease Do-like 2, chloroplastic-like [Fragaria vesca
            subsp. vesca]
          Length = 622

 Score =  846 bits (2186), Expect = 0.0
 Identities = 427/540 (79%), Positives = 462/540 (85%), Gaps = 1/540 (0%)
 Frame = +1

Query: 307  GRKKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQ-QAEAGNLQDTAFLNAVVKVYCTHTA 483
            G KK  GRSQ + ++    QRK+KK    + KE+ QAE  NLQD  FLNAVVKVYCTHTA
Sbjct: 85   GGKK--GRSQQAAYKPFGTQRKEKKESVADQKEKKQAEVRNLQDADFLNAVVKVYCTHTA 142

Query: 484  PDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGI 663
            PDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLA+G+
Sbjct: 143  PDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLAKGV 202

Query: 664  DCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSY 843
            DCDIALL+VE+EEFWKG EPL FG LPHLQ++VTVVGYPLGGDTISVTKGVVSRIEVTSY
Sbjct: 203  DCDIALLTVESEEFWKGAEPLHFGSLPHLQEAVTVVGYPLGGDTISVTKGVVSRIEVTSY 262

Query: 844  AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHF 1023
            AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHF
Sbjct: 263  AHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEEAENIGYVIPTTVVSHF 322

Query: 1024 LTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDV 1203
            L DYERNGKYTGFPCLGV+LQKLENPALR+CLKV S EGVLVRRVEPT DA+NVLK+GDV
Sbjct: 323  LNDYERNGKYTGFPCLGVMLQKLENPALRACLKVESVEGVLVRRVEPTCDAHNVLKEGDV 382

Query: 1204 IVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVH 1383
            IV FD VHVGCEGTVPFRS ERIAFRYLISQKF+GD  ELGIIR G  +KV+  LNPRVH
Sbjct: 383  IVSFDDVHVGCEGTVPFRSNERIAFRYLISQKFAGDVAELGIIRAGEFMKVKAELNPRVH 442

Query: 1384 LVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVIL 1563
            LVPYHIDGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KARYSLA F GEQIVIL
Sbjct: 443  LVPYHIDGGQPSYLIIAGLVFTPLSEPLIDEECDDSIGLKLLAKARYSLARFKGEQIVIL 502

Query: 1564 SQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXX 1743
            SQVLANEVNIGYEDMSNQQV+KLNGT I+NIHHL HLVDSC  KYLVFEFEDNY      
Sbjct: 503  SQVLANEVNIGYEDMSNQQVLKLNGTPIKNIHHLAHLVDSCKHKYLVFEFEDNYITVLER 562

Query: 1744 XXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
                  S  ILKDYGIP+ERS+DL EPY+ +  D  A  +  GDSPVSN+E+GFDGL+WA
Sbjct: 563  EGALASSTSILKDYGIPAERSSDLLEPYVDSVVDGQADQEDLGDSPVSNLEIGFDGLIWA 622


>ref|XP_006295823.1| hypothetical protein CARUB_v10024950mg [Capsella rubella]
            gi|482564531|gb|EOA28721.1| hypothetical protein
            CARUB_v10024950mg [Capsella rubella]
          Length = 604

 Score =  846 bits (2185), Expect = 0.0
 Identities = 420/530 (79%), Positives = 454/530 (85%)
 Frame = +1

Query: 334  QSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQ 513
            Q+  F++    +KDKK      ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQKQ
Sbjct: 76   QTMAFKAFGSPKKDKKDAPLS-RDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQ 134

Query: 514  RQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVE 693
            RQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSVE
Sbjct: 135  RQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVE 194

Query: 694  NEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI 873
            +E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI
Sbjct: 195  SEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGI 254

Query: 874  QIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKY 1053
            QIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGKY
Sbjct: 255  QIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKY 314

Query: 1054 TGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVG 1233
            TG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HVG
Sbjct: 315  TGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHVG 374

Query: 1234 CEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQ 1413
            CEGTVPFRS+ERIAFRYLISQKF+GD  ELGIIR G H KVQ  L PRVHLVPYHIDGGQ
Sbjct: 375  CEGTVPFRSSERIAFRYLISQKFAGDIAELGIIRAGEHKKVQVALRPRVHLVPYHIDGGQ 434

Query: 1414 PSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNI 1593
            PSY+IVAGLVFTPLSEPLI       +GLKLLTKARYS+A F GEQIVILSQVLANEVNI
Sbjct: 435  PSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNI 494

Query: 1594 GYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCI 1773
            GYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY            S CI
Sbjct: 495  GYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCI 554

Query: 1774 LKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            LKDYGIPSERSADL EPY+    DN A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 555  LKDYGIPSERSADLLEPYVDPIDDNQALDQGIGDSPVSNLEIGFDGLVWA 604


>ref|NP_566115.1| DegP2 protease [Arabidopsis thaliana]
            gi|75220233|sp|O82261.2|DEGP2_ARATH RecName:
            Full=Protease Do-like 2, chloroplastic; Flags: Precursor
            gi|11908036|gb|AAG41447.1|AF326865_1 putative DegP2
            protease [Arabidopsis thaliana]
            gi|13172275|gb|AAK14061.1|AF245171_1 DegP2 protease
            [Arabidopsis thaliana]
            gi|13194802|gb|AAK15563.1|AF349516_1 putative DegP2
            protease [Arabidopsis thaliana]
            gi|18700190|gb|AAL77706.1| At2g47940/F17A22.33
            [Arabidopsis thaliana] gi|20197307|gb|AAC63648.2| DegP2
            protease [Arabidopsis thaliana]
            gi|20197550|gb|AAM15122.1| DegP2 protease [Arabidopsis
            thaliana] gi|20857214|gb|AAM26706.1| At2g47940/F17A22.33
            [Arabidopsis thaliana] gi|330255820|gb|AEC10914.1| DegP2
            protease [Arabidopsis thaliana]
          Length = 607

 Score =  845 bits (2184), Expect = 0.0
 Identities = 425/576 (73%), Positives = 471/576 (81%), Gaps = 3/576 (0%)
 Frame = +1

Query: 205  SSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKDVGRS--QSSGFQSLVIQRKDK 378
            S+++ T R +  +  K    +          ++ GR +D   +  Q   F++    +K+K
Sbjct: 32   SASSLTPRASSNIKRKSSRSDSPSPILNPEKNYPGRVRDESSNPPQKMAFKAFGSPKKEK 91

Query: 379  KAFAYEL-KEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGN 555
            K    +  ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQKQRQ+TSTGSAFMIG+
Sbjct: 92   KESLSDFSRDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQKQRQFTSTGSAFMIGD 151

Query: 556  GMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFG 735
            G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSVE+E+FWKG EPLR G
Sbjct: 152  GKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLG 211

Query: 736  LLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 915
             LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP
Sbjct: 212  HLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGP 271

Query: 916  AFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLE 1095
            AFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLE
Sbjct: 272  AFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLE 331

Query: 1096 NPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIA 1275
            NPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HVGCEGTVPFRS+ERIA
Sbjct: 332  NPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIA 391

Query: 1276 FRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPL 1455
            FRYLISQKF+GD  E+GIIR G H KVQ VL PRVHLVPYHIDGGQPSY+IVAGLVFTPL
Sbjct: 392  FRYLISQKFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPL 451

Query: 1456 SEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLN 1635
            SEPLI       +GLKLLTKARYS+A F GEQIVILSQVLANEVNIGYEDM+NQQV+K N
Sbjct: 452  SEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFN 511

Query: 1636 GTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADL 1815
            G  IRNIHHL HL+D C +KYLVFEFEDNY            S CILKDYGIPSERSADL
Sbjct: 512  GIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLCILKDYGIPSERSADL 571

Query: 1816 KEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
             EPY+    D  A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 572  LEPYVDPIDDTQALDQGIGDSPVSNLEIGFDGLVWA 607


>ref|NP_001118544.1| DegP2 protease [Arabidopsis thaliana] gi|330255821|gb|AEC10915.1|
            DegP2 protease [Arabidopsis thaliana]
          Length = 606

 Score =  845 bits (2182), Expect = 0.0
 Identities = 421/545 (77%), Positives = 461/545 (84%), Gaps = 3/545 (0%)
 Frame = +1

Query: 298  HHDGRKKDVGRS--QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVY 468
            ++ GR +D   +  Q   F++    +K+KK    +  ++QQ +   + D +FLNAVVKVY
Sbjct: 62   NYPGRVRDESSNPPQKMAFKAFGSPKKEKKESLSDFSRDQQTDPAKIHDASFLNAVVKVY 121

Query: 469  CTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKV 648
            CTHTAPDYSLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKV
Sbjct: 122  CTHTAPDYSLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKV 181

Query: 649  LARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRI 828
            L RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRI
Sbjct: 182  LVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRI 241

Query: 829  EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTT 1008
            EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTT
Sbjct: 242  EVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTT 301

Query: 1009 VVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVL 1188
            VVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VL
Sbjct: 302  VVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVL 361

Query: 1189 KQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVL 1368
            K+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKF+GD  E+GIIR G H KVQ VL
Sbjct: 362  KEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFAGDIAEIGIIRAGEHKKVQVVL 421

Query: 1369 NPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGE 1548
             PRVHLVPYHIDGGQPSY+IVAGLVFTPLSEPLI       +GLKLLTKARYS+A F GE
Sbjct: 422  RPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGE 481

Query: 1549 QIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYX 1728
            QIVILSQVLANEVNIGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY 
Sbjct: 482  QIVILSQVLANEVNIGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYV 541

Query: 1729 XXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFD 1908
                       S CILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVSN+E+GFD
Sbjct: 542  AVLEREASNSASLCILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVSNLEIGFD 601

Query: 1909 GLLWA 1923
            GL+WA
Sbjct: 602  GLVWA 606


>pdb|4FLN|A Chain A, Crystal Structure Of Plant Protease Deg2
            gi|405944959|pdb|4FLN|B Chain B, Crystal Structure Of
            Plant Protease Deg2 gi|405944960|pdb|4FLN|C Chain C,
            Crystal Structure Of Plant Protease Deg2
          Length = 539

 Score =  843 bits (2178), Expect = 0.0
 Identities = 418/531 (78%), Positives = 454/531 (85%), Gaps = 1/531 (0%)
 Frame = +1

Query: 334  QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQK 510
            Q   F++    +K+KK    +  ++QQ +   + D +FLNAVVKVYCTHTAPDYSLPWQK
Sbjct: 9    QKMAFKAFGSPKKEKKESLSDFSRDQQTDPAKIHDASFLNAVVKVYCTHTAPDYSLPWQK 68

Query: 511  QRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSV 690
            QRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVKRRGDD KYVAKVL RG+DCDIALLSV
Sbjct: 69   QRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKRRGDDRKYVAKVLVRGVDCDIALLSV 128

Query: 691  ENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG 870
            E+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG
Sbjct: 129  ESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLG 188

Query: 871  IQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGK 1050
            IQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENIGYVIPTTVVSHFLTDYERNGK
Sbjct: 189  IQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENIGYVIPTTVVSHFLTDYERNGK 248

Query: 1051 YTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHV 1230
            YTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPTSDA  VLK+GDVIV FD +HV
Sbjct: 249  YTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPTSDASKVLKEGDVIVSFDDLHV 308

Query: 1231 GCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGG 1410
            GCEGTVPFRS+ERIAFRYLISQKF+GD  E+GIIR G H KVQ VL PRVHLVPYHIDGG
Sbjct: 309  GCEGTVPFRSSERIAFRYLISQKFAGDIAEIGIIRAGEHKKVQVVLRPRVHLVPYHIDGG 368

Query: 1411 QPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVN 1590
            QPSY+IVAGLVFTPLSEPLI       +GLKLLTKARYS+A F GEQIVILSQVLANEVN
Sbjct: 369  QPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYSVARFRGEQIVILSQVLANEVN 428

Query: 1591 IGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPC 1770
            IGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVFEFEDNY            S C
Sbjct: 429  IGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVFEFEDNYVAVLEREASNSASLC 488

Query: 1771 ILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            ILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVSN+E+GFDGL+WA
Sbjct: 489  ILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVSNLEIGFDGLVWA 539


>ref|XP_004148888.1| PREDICTED: protease Do-like 2, chloroplastic-like [Cucumis sativus]
            gi|449491511|ref|XP_004158921.1| PREDICTED: protease
            Do-like 2, chloroplastic-like [Cucumis sativus]
          Length = 623

 Score =  841 bits (2172), Expect = 0.0
 Identities = 422/538 (78%), Positives = 462/538 (85%), Gaps = 1/538 (0%)
 Frame = +1

Query: 313  KKDVGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDY 492
            +++ GR Q+  ++S  +QRKDKK     + E Q E+GNLQ  AFLNAVVKVYCTHTAPDY
Sbjct: 87   QRNSGRVQTEAYKSFGMQRKDKKELVNAI-EDQVESGNLQGAAFLNAVVKVYCTHTAPDY 145

Query: 493  SLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCD 672
            SLPWQKQRQ+TSTGSAFMIG+G L+TNAHCVEHDTQVKVK+RGDDTKYVAKVLARG+DCD
Sbjct: 146  SLPWQKQRQFTSTGSAFMIGDGKLLTNAHCVEHDTQVKVKKRGDDTKYVAKVLARGVDCD 205

Query: 673  IALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHG 852
            IALLSVENEEFWKG EPL+FG LP LQD+VTVVGYPLGGDTISVT+GVVSRIEVTSYAHG
Sbjct: 206  IALLSVENEEFWKGAEPLKFGNLPCLQDAVTVVGYPLGGDTISVTRGVVSRIEVTSYAHG 265

Query: 853  SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTD 1032
            SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+ ENIGYVIPTTVVSHFL D
Sbjct: 266  SSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEEVENIGYVIPTTVVSHFLND 325

Query: 1033 YERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVD 1212
            YERN KYTGFP LGVLLQKLENPALR+CL+V SNEGVLVRRVEPTSDA  VLK+GDVIV 
Sbjct: 326  YERNRKYTGFPSLGVLLQKLENPALRACLRVKSNEGVLVRRVEPTSDANKVLKEGDVIVS 385

Query: 1213 FDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVP 1392
            FD + VGCEGTVPFR+ ERIAFRYLISQKF+GD  ELGIIR+G  IK + +LNPRVHLVP
Sbjct: 386  FDDIKVGCEGTVPFRTNERIAFRYLISQKFAGDVAELGIIRSGELIKAKVILNPRVHLVP 445

Query: 1393 YHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQV 1572
            +HIDGGQPSYLI+AGLVFTPLSEPLI       +GLKLL KARYSLA F GEQIVILSQV
Sbjct: 446  FHIDGGQPSYLIIAGLVFTPLSEPLIDEECEDSIGLKLLAKARYSLASFKGEQIVILSQV 505

Query: 1573 LANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXX 1752
            LANEVNIGYEDM NQQV+KLNGTRIRNIHHLTHLVD+C +KYLVFEFE+NY         
Sbjct: 506  LANEVNIGYEDMGNQQVLKLNGTRIRNIHHLTHLVDTCKDKYLVFEFEENYIAVLEREAA 565

Query: 1753 XXXSPCILKDYGIPSERSADLKEPYLYTFGDNNA-IDQAPGDSPVSNMEMGFDGLLWA 1923
               S CIL+DYGIPSERS+DL EPY+    D    + Q  GDSPVSN E+GF+GLLWA
Sbjct: 566  IAASSCILRDYGIPSERSSDLLEPYVDISEDEKGMVVQNYGDSPVSNAEIGFEGLLWA 623


>ref|XP_006366368.1| PREDICTED: protease Do-like 2, chloroplastic-like [Solanum tuberosum]
          Length = 621

 Score =  838 bits (2165), Expect = 0.0
 Identities = 426/585 (72%), Positives = 479/585 (81%), Gaps = 4/585 (0%)
 Frame = +1

Query: 181  RALHCNNSSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGR--KKDVGRSQSSGFQS 354
            +A+H   SSS+         V  + F     DERH +  + DGR  K +  RS+S+ F+ 
Sbjct: 40   KAVH-QKSSSSPHHPPSQKAVGKQNFIWRSKDERHLA--NKDGRSSKNETERSKSTAFKF 96

Query: 355  LVIQRKDK-KAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTST 531
              +QRK   K   +E KE Q E G ++D  FLNAVVKV+CTHTAPDYSLPWQKQRQ+ ST
Sbjct: 97   SGLQRKGSGKGVPFESKEPQVETGIIEDATFLNAVVKVFCTHTAPDYSLPWQKQRQFAST 156

Query: 532  GSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWK 711
            GSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG++CDIALLSVE+++FWK
Sbjct: 157  GSAFMIGDGKLLTNAHCVEHGTQVKVKRRGDDTKYVAKVLARGVECDIALLSVESKDFWK 216

Query: 712  GVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAI 891
            G EPLRFG LPHLQD+VTVVGYPLGGDTISVTKGVVSR+EVTSYAHGSS+LLGIQIDAAI
Sbjct: 217  GAEPLRFGHLPHLQDAVTVVGYPLGGDTISVTKGVVSRVEVTSYAHGSSELLGIQIDAAI 276

Query: 892  NPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCL 1071
            NPGNSGGPAFND GECIGVAFQV RS+D ENIGYVIPTTVVSHFL DYERNGKY+GFPCL
Sbjct: 277  NPGNSGGPAFNDDGECIGVAFQVYRSDDVENIGYVIPTTVVSHFLEDYERNGKYSGFPCL 336

Query: 1072 GVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVP 1251
            GV+LQKLENPALR+CL+V SNEG+LVR++EPTSD  NV+K+GDVIV FDGV VGCEGTVP
Sbjct: 337  GVMLQKLENPALRACLRVPSNEGILVRKIEPTSDVSNVVKEGDVIVSFDGVRVGCEGTVP 396

Query: 1252 FRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIV 1431
            FRS+ERIAFRYLISQKF+GD  ELGIIR G  +KVQ VL PRVHLVPYHI+GGQPSYLIV
Sbjct: 397  FRSSERIAFRYLISQKFTGDVAELGIIRAGELLKVQAVLKPRVHLVPYHIEGGQPSYLIV 456

Query: 1432 AGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMS 1611
            AGLVFTPLSEPLI       +GLKLL KARYS A F GEQIVILSQVLANEVNIGYED+S
Sbjct: 457  AGLVFTPLSEPLIEEECEDTIGLKLLIKARYSFAKFEGEQIVILSQVLANEVNIGYEDLS 516

Query: 1612 NQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGI 1791
            N+QV+KLNGTRI+NIHHL HLVDSC +KYLVFEFEDN+            S  IL DYGI
Sbjct: 517  NEQVLKLNGTRIKNIHHLAHLVDSCKDKYLVFEFEDNFLVVLEREAASSASSSILIDYGI 576

Query: 1792 PSERSADLKEPYLYTFGDNNAIDQAP-GDSPVSNMEMGFDGLLWA 1923
            P+ERS+DL EPY+ + G + A DQ   GDSPVSN E G+DGLLWA
Sbjct: 577  PAERSSDLLEPYVDSIGPDEATDQHEFGDSPVSNSEFGYDGLLWA 621


>ref|XP_002882138.1| hypothetical protein ARALYDRAFT_483986 [Arabidopsis lyrata subsp.
            lyrata] gi|297327977|gb|EFH58397.1| hypothetical protein
            ARALYDRAFT_483986 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  837 bits (2163), Expect = 0.0
 Identities = 422/552 (76%), Positives = 461/552 (83%), Gaps = 10/552 (1%)
 Frame = +1

Query: 298  HHDGRKKDVGRS--QSSGFQSLVIQRKDKKAFAYEL-KEQQAEAGNLQDTAFLNAVVKVY 468
            ++ GR +D   +  Q   F++    +K+KK    +  ++QQ + G + D +FLNAVVKVY
Sbjct: 62   NYPGRVRDDSPNPPQKMAFKAFGSPKKEKKEPLSDFSRDQQTDPGKIHDASFLNAVVKVY 121

Query: 469  CTHTAPDYSLPWQKQRQYTSTGS-------AFMIGNGMLITNAHCVEHDTQVKVKRRGDD 627
            CTHTAPDYSLPWQKQRQ+TSTG        AFMIG+G L+TNAHCVEHDTQVKVKRRGDD
Sbjct: 122  CTHTAPDYSLPWQKQRQFTSTGRHVFFIHIAFMIGDGKLLTNAHCVEHDTQVKVKRRGDD 181

Query: 628  TKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVT 807
             KYVAKVL RG+DCDIALLSVE+E+FWKG EPLR G LP LQDSVTVVGYPLGGDTISVT
Sbjct: 182  RKYVAKVLVRGVDCDIALLSVESEDFWKGAEPLRLGHLPRLQDSVTVVGYPLGGDTISVT 241

Query: 808  KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENI 987
            KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQV RSE+TENI
Sbjct: 242  KGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVYRSEETENI 301

Query: 988  GYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPT 1167
            GYVIPTTVVSHFLTDYERNGKYTG+PCLGVLLQKLENPALR CLKV +NEGVLVRRVEPT
Sbjct: 302  GYVIPTTVVSHFLTDYERNGKYTGYPCLGVLLQKLENPALRECLKVPTNEGVLVRRVEPT 361

Query: 1168 SDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAH 1347
            SDA  VLK+GDVIV FD +HVGCEGTVPFRS+ERIAFRYLISQKF+GD  ELGIIR G H
Sbjct: 362  SDASKVLKEGDVIVSFDDLHVGCEGTVPFRSSERIAFRYLISQKFAGDIAELGIIRAGEH 421

Query: 1348 IKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYS 1527
             KVQ VL PRVHLVPYHIDGGQPSY+IVAGLVFTPLSEPLI       +GLKLLTKARYS
Sbjct: 422  KKVQVVLRPRVHLVPYHIDGGQPSYIIVAGLVFTPLSEPLIEEECEDTIGLKLLTKARYS 481

Query: 1528 LAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVF 1707
            +A F GEQIVILSQVLANEVNIGYEDM+NQQV+K NG  IRNIHHL HL+D C +KYLVF
Sbjct: 482  VARFRGEQIVILSQVLANEVNIGYEDMNNQQVLKFNGIPIRNIHHLAHLIDMCKDKYLVF 541

Query: 1708 EFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVS 1887
            EFEDNY            S CILKDYGIPSERSADL EPY+    D  A+DQ  GDSPVS
Sbjct: 542  EFEDNYVAVLEREASNSASLCILKDYGIPSERSADLLEPYVDPIDDTQALDQGIGDSPVS 601

Query: 1888 NMEMGFDGLLWA 1923
            N+E+GFDGL+WA
Sbjct: 602  NLEIGFDGLVWA 613


>ref|XP_004247469.1| PREDICTED: protease Do-like 2, chloroplastic-like [Solanum
            lycopersicum]
          Length = 621

 Score =  832 bits (2150), Expect = 0.0
 Identities = 420/560 (75%), Positives = 467/560 (83%), Gaps = 4/560 (0%)
 Frame = +1

Query: 256  FFGEPTDERHTSFGHHDGR--KKDVGRSQSSGFQSLVIQRKDK-KAFAYELKEQQAEAGN 426
            F     DERH +  ++DGR  K + GRS+S+ F+   +QRK   K   +E KE Q E G 
Sbjct: 64   FIWRSKDERHLA--NNDGRSSKNETGRSKSTAFKFSGLQRKGSGKGAPFESKEPQVETGF 121

Query: 427  LQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVK 606
            ++D  FLNAVVKV+CTHTAPDYSLPWQKQRQ+ STGSAFMIG+G L+TNAHCVEH TQVK
Sbjct: 122  IEDAPFLNAVVKVFCTHTAPDYSLPWQKQRQFASTGSAFMIGDGKLLTNAHCVEHGTQVK 181

Query: 607  VKRRGDDTKYVAKVLARGIDCDIALLSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLG 786
            VKRRGDDTKYVAKVLARG++CDIALLSVE+++FWKG EPL FG LPHLQD+VTVVGYPLG
Sbjct: 182  VKRRGDDTKYVAKVLARGVECDIALLSVESKDFWKGAEPLCFGHLPHLQDAVTVVGYPLG 241

Query: 787  GDTISVTKGVVSRIEVTSYAHGSSDLLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLR 966
            GDTISVTKGVVSR+EVTSYAHGSS+LLGIQIDAAINPGNSGGPAFND GECIGVAFQV R
Sbjct: 242  GDTISVTKGVVSRVEVTSYAHGSSELLGIQIDAAINPGNSGGPAFNDDGECIGVAFQVYR 301

Query: 967  SEDTENIGYVIPTTVVSHFLTDYERNGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVL 1146
            S+D ENIGYVIP  VVSHFL DYERNGKY+GFPCLGVLLQKLENPALR+CL+V SNEGVL
Sbjct: 302  SDDVENIGYVIPAMVVSHFLEDYERNGKYSGFPCLGVLLQKLENPALRACLRVPSNEGVL 361

Query: 1147 VRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELG 1326
            VR++EPTSD  NV+K+GDVIV FDGV VGCEGTVPFRS+ERIAFRYLISQKF+GD  ELG
Sbjct: 362  VRKIEPTSDVSNVVKEGDVIVSFDGVRVGCEGTVPFRSSERIAFRYLISQKFTGDVAELG 421

Query: 1327 IIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKL 1506
            IIR G  +KVQ VL PRVHLVPYHI+GGQPSYLIVAGLVFTPLSEPLI       +GLKL
Sbjct: 422  IIRAGEFLKVQAVLKPRVHLVPYHIEGGQPSYLIVAGLVFTPLSEPLIEEECEDTIGLKL 481

Query: 1507 LTKARYSLAHFIGEQIVILSQVLANEVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSC 1686
            L KARYS A F GEQIVILSQVLANEVNIGYED+SN+QV+KLNGTRI+NIHHL HLVDSC
Sbjct: 482  LIKARYSFAKFEGEQIVILSQVLANEVNIGYEDLSNEQVLKLNGTRIKNIHHLAHLVDSC 541

Query: 1687 TEKYLVFEFEDNYXXXXXXXXXXXXSPCILKDYGIPSERSADLKEPYLYTFGDNNAIDQA 1866
             +KYLVFEFEDN+            S  IL DYGIP+ERS+DL EPY+ + G   A DQ 
Sbjct: 542  KDKYLVFEFEDNFLVALEREAASSASSSILIDYGIPAERSSDLLEPYVDSIGPYEATDQH 601

Query: 1867 P-GDSPVSNMEMGFDGLLWA 1923
              GDSPVSN E G+DGLLWA
Sbjct: 602  EFGDSPVSNSEFGYDGLLWA 621


>ref|XP_006855396.1| hypothetical protein AMTR_s00057p00143260 [Amborella trichopoda]
            gi|548859162|gb|ERN16863.1| hypothetical protein
            AMTR_s00057p00143260 [Amborella trichopoda]
          Length = 528

 Score =  822 bits (2123), Expect = 0.0
 Identities = 412/527 (78%), Positives = 457/527 (86%), Gaps = 1/527 (0%)
 Frame = +1

Query: 346  FQSLVIQRKDKKAFAYELKEQQA-EAGNLQDTAFLNAVVKVYCTHTAPDYSLPWQKQRQY 522
            F+SL +QRK+K A  ++LKEQQ  EA  LQD AFLNAVVKVYCTHTAPDYSLPWQKQRQ+
Sbjct: 3    FKSLGMQRKEK-AIVHDLKEQQINEASTLQDGAFLNAVVKVYCTHTAPDYSLPWQKQRQF 61

Query: 523  TSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIALLSVENEE 702
            TSTGSAFMIG+G L+TNAHCVEH TQVKVKRRGDDTKYVAKVLARG++CDIALL VE+EE
Sbjct: 62   TSTGSAFMIGDGKLLTNAHCVEHYTQVKVKRRGDDTKYVAKVLARGVECDIALLYVESEE 121

Query: 703  FWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSDLLGIQID 882
            FWKG +PL+FG LP LQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHG+SDLLGIQID
Sbjct: 122  FWKGADPLKFGRLPCLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGASDLLGIQID 181

Query: 883  AAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYERNGKYTGF 1062
            AAINPGNSGGPAFNDQGECIGVAFQV RS++ ENIGYVIPTTVVSHFLTDYERNGKYTGF
Sbjct: 182  AAINPGNSGGPAFNDQGECIGVAFQVFRSDEAENIGYVIPTTVVSHFLTDYERNGKYTGF 241

Query: 1063 PCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDGVHVGCEG 1242
            P LGVLLQKLENPALR+CLKV SNEGVLVRR+EPT+ A++ LK+GDVIV FDG+ VGCEG
Sbjct: 242  PSLGVLLQKLENPALRACLKVNSNEGVLVRRIEPTAAAHDALKEGDVIVSFDGIPVGCEG 301

Query: 1243 TVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHIDGGQPSY 1422
            TVPFRSTERIAFRYLISQKF+GD  ELGIIR GAH+KV+T+L PRVHLVPYHI+GGQPSY
Sbjct: 302  TVPFRSTERIAFRYLISQKFAGDTAELGIIRGGAHMKVKTLLYPRVHLVPYHIEGGQPSY 361

Query: 1423 LIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLANEVNIGYE 1602
            LI+AGLVFTPLSEPLI       MGLKLL KARYSLA F GEQIV+LSQVLANE NIGYE
Sbjct: 362  LIIAGLVFTPLSEPLIDEECEDSMGLKLLAKARYSLAKFKGEQIVLLSQVLANEANIGYE 421

Query: 1603 DMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXXSPCILKD 1782
            DM NQQV+K NGT+I+NI HL HLVD+C ++YL+FEFEDN+            SP ILKD
Sbjct: 422  DMGNQQVLKFNGTKIKNIRHLAHLVDTCKDEYLIFEFEDNFLAVLDREAASIASPRILKD 481

Query: 1783 YGIPSERSADLKEPYLYTFGDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            YGIP ERS++L E YL +  D+ A+     D P SN+E+GFDGLLWA
Sbjct: 482  YGIPFERSSNLAELYLDSSEDDLALSGDLDDIPASNLEIGFDGLLWA 528


>ref|XP_003520225.1| PREDICTED: protease Do-like 2, chloroplastic-like [Glycine max]
          Length = 612

 Score =  819 bits (2116), Expect = 0.0
 Identities = 429/597 (71%), Positives = 474/597 (79%), Gaps = 13/597 (2%)
 Frame = +1

Query: 172  PSFRALHCNN----------SSSNAFTCREADTVSHKLFFGEPTDERHTSFGHHDGRKKD 321
            P   + HCNN          SSS++ + R+ +   HK    +  DER          + +
Sbjct: 29   PIVASFHCNNHPLRVSSSSSSSSSSKSNRKKEGAGHKK---QSKDERPA--------RGN 77

Query: 322  VGRSQSSGFQSLVIQRKDKKAFAYELKEQQAEAGNLQDTAFLNAVVKVYCTHTAPDYSLP 501
            V  SQ +  +   IQRK+K    ++ K+QQ E   LQD+AFLNAVVKVYCTHTAPDYSLP
Sbjct: 78   VLESQPTSSKPFGIQRKNKDLI-FDSKDQQVEQSILQDSAFLNAVVKVYCTHTAPDYSLP 136

Query: 502  WQKQRQYTSTGSAFMIGNGMLITNAHCVEHDTQVKVKRRGDDTKYVAKVLARGIDCDIAL 681
            WQKQRQYTSTGSAFMIG+  L+TNAHCVEHDTQVKVK+RGDD+KYVAKVLARG+DCDIAL
Sbjct: 137  WQKQRQYTSTGSAFMIGDRKLLTNAHCVEHDTQVKVKKRGDDSKYVAKVLARGVDCDIAL 196

Query: 682  LSVENEEFWKGVEPLRFGLLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD 861
            LSVE+EEFW+ VEPLR G LPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD
Sbjct: 197  LSVESEEFWRDVEPLRLGRLPHLQDSVTVVGYPLGGDTISVTKGVVSRIEVTSYAHGSSD 256

Query: 862  LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEDTENIGYVIPTTVVSHFLTDYER 1041
            LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSE+ ENIGYVIPTTVVSHFLTDYER
Sbjct: 257  LLGIQIDAAINPGNSGGPAFNDQGECIGVAFQVLRSEEAENIGYVIPTTVVSHFLTDYER 316

Query: 1042 NGKYTGFPCLGVLLQKLENPALRSCLKVASNEGVLVRRVEPTSDAYNVLKQGDVIVDFDG 1221
            NG+YTGFPCLGVL+QKLENPALR+ LKV SNEGVLVRRVEPTSDA NVLK+GDVIV FD 
Sbjct: 317  NGRYTGFPCLGVLIQKLENPALRAWLKVQSNEGVLVRRVEPTSDANNVLKEGDVIVSFDD 376

Query: 1222 VHVGCEGTVPFRSTERIAFRYLISQKFSGDNVELGIIRNGAHIKVQTVLNPRVHLVPYHI 1401
            V VG EGTVPFRS ERIAF +LISQKF+GD  ELGIIR G  +K + VLN RVHLVPYHI
Sbjct: 377  VRVGSEGTVPFRSNERIAFHFLISQKFAGDTAELGIIRAGTLMKTKVVLNSRVHLVPYHI 436

Query: 1402 DGGQPSYLIVAGLVFTPLSEPLIXXXXXXVMGLKLLTKARYSLAHFIGEQIVILSQVLAN 1581
            D G PSYLI+AGLVFTPLSEPLI       +GLKLL +ARYSLA F GEQIVILSQVLAN
Sbjct: 437  DEGLPSYLIIAGLVFTPLSEPLIEEECEDSIGLKLLARARYSLAKFKGEQIVILSQVLAN 496

Query: 1582 EVNIGYEDMSNQQVMKLNGTRIRNIHHLTHLVDSCTEKYLVFEFEDNYXXXXXXXXXXXX 1761
            EVNIGYEDM NQQV+K NG RI+NIHHL HL+DSC ++YL FEFED+Y            
Sbjct: 497  EVNIGYEDMGNQQVVKFNGARIKNIHHLAHLIDSCEDRYLRFEFEDSYVAVLEKEAVAAA 556

Query: 1762 SPCILKDYGIPSERSADLKEPYLYTF---GDNNAIDQAPGDSPVSNMEMGFDGLLWA 1923
            SP +L DYGIPSERS+DL +PY+ T    GD  A DQ  GDSPVSN E G DGLLWA
Sbjct: 557  SPSVLSDYGIPSERSSDLSKPYVDTLEVEGDQPA-DQEFGDSPVSNYEFGPDGLLWA 612

