BLASTX nr result

ID: Achyranthes23_contig00012338 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00012338
         (1725 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004307176.1| PREDICTED: aspartic proteinase CDR1-like [Fr...   296   1e-77
ref|XP_006279430.1| hypothetical protein CARUB_v10007923mg [Caps...   280   2e-72
ref|NP_176663.1| aspartyl protease family protein [Arabidopsis t...   279   3e-72
ref|XP_003539632.1| PREDICTED: aspartic proteinase CDR1-like [Gl...   279   3e-72
gb|EXB70640.1| putative aspartic protease [Morus notabilis]           278   5e-72
ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor,...   277   9e-72
ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Ci...   276   2e-71
gb|EMJ11898.1| hypothetical protein PRUPE_ppa025167mg [Prunus pe...   275   3e-71
gb|ESW21352.1| hypothetical protein PHAVU_005G063600g [Phaseolus...   275   6e-71
ref|XP_006403054.1| hypothetical protein EUTSA_v10003479mg [Eutr...   275   6e-71
ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citr...   274   1e-70
ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago ...   274   1e-70
ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, part...   273   2e-70
ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Caps...   273   2e-70
ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago ...   273   2e-70
ref|XP_004244686.1| PREDICTED: aspartic proteinase CDR1-like [So...   272   4e-70
ref|XP_004229589.1| PREDICTED: aspartic proteinase CDR1-like [So...   270   1e-69
ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35...   270   2e-69
gb|EOY14113.1| Eukaryotic aspartyl protease family protein, puta...   269   2e-69
ref|XP_002320947.1| aspartyl protease family protein [Populus tr...   269   2e-69

>ref|XP_004307176.1| PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp.
            vesca]
          Length = 447

 Score =  296 bits (759), Expect = 1e-77
 Identities = 183/444 (41%), Positives = 236/444 (53%), Gaps = 28/444 (6%)
 Frame = +3

Query: 138  IKASNLISSNGTFTIDLIHRNSPHSPYYNP---QHHSTRQPNLVSITHGHQIHHSSSYLV 308
            + A + +S+ G F+IDLIHR+SP SP+YNP   Q    R     S  H ++ +   S  V
Sbjct: 16   VSAFSSVSAKGGFSIDLIHRDSPRSPFYNPLETQSQRLRNAFRRSFRHSNRFNPKVSSTV 75

Query: 309  P-----------NGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFK 455
            P           N G+YL+++++GTP +   A+ADTGSDLIW QC PC  C  Q  PLF 
Sbjct: 76   PQTDEAEATITNNRGEYLMELSIGTPPVPIKAIADTGSDLIWTQCKPCVACYTQTDPLFD 135

Query: 456  PKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTI 635
            P  SSTY TIPCTS  C+   +   CLS    S + SPC YEA YGD + ++G LA DT+
Sbjct: 136  PSKSSTYKTIPCTSNQCS--ALKGNCLSK--GSGSGSPCQYEANYGDRSHTIGDLAVDTL 191

Query: 636  SLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYC 815
            +L S     ++S P TI GCG    G   +KG GIVGLG G  SL+ QL   I+  FSYC
Sbjct: 192  TLAS-TTGRNVSFPKTIIGCGHDNAGTFDKKGSGIVGLGGGDESLISQLSTSIDGKFSYC 250

Query: 816  LVPLSS--GLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDP---TFYRLILNRISVGK 980
            LVP +S   LTSKL FG                S+PI+   DP   TF+ L L  ISVGK
Sbjct: 251  LVPFASEGDLTSKLNFG-----DNALVSGSGVLSTPIIEDEDPTKKTFFFLTLEAISVGK 305

Query: 981  ---------NASFNXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDL 1133
                     ++S +        SGTTLT L    Y  +++A+ + V    VSDP     L
Sbjct: 306  TKIPFTSSSSSSASGGGNIIIDSGTTLTLLEQSFYSELEEAVDQVVGGERVSDPQSPIPL 365

Query: 1134 CYDTRSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVA 1313
            CY        K+ N   +  HF G DV L   NTF  + D   C + +  E   ++GN+A
Sbjct: 366  CYKVSLSEDLKVPN---ITVHFSGADVKLDAKNTFIRVSDEAVCFSFAPSESLSIYGNLA 422

Query: 1314 QVNFEVGFDLKAKKISFAPKDCTK 1385
            Q  F VG+DLK K +SF P DCTK
Sbjct: 423  QAGFLVGYDLKEKTVSFKPTDCTK 446


>ref|XP_006279430.1| hypothetical protein CARUB_v10007923mg [Capsella rubella]
            gi|482548130|gb|EOA12328.1| hypothetical protein
            CARUB_v10007923mg [Capsella rubella]
          Length = 432

 Score =  280 bits (715), Expect = 2e-72
 Identities = 164/443 (37%), Positives = 236/443 (53%), Gaps = 20/443 (4%)
 Frame = +3

Query: 117  SSAFSFSIKASNLISSNGTFTIDLIHRNSPHSPYYNPQHHSTRQ----------PNLVSI 266
            ++ FS  + ++    +   FT+DLIHR+SP SP+YNP   S+++            L   
Sbjct: 7    TTLFSLLLLSNTNAHTKHGFTVDLIHRDSPKSPFYNPAETSSQRMINAIRRSARSTLQLA 66

Query: 267  THGHQIHHSSSYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSP 446
             +    + + S +  N G+YL+ I++GTP    +A+ADTGSDLIW QC PC  C  Q +P
Sbjct: 67   KYEASPNSAQSVITSNHGEYLMNISIGTPPFPTLAIADTGSDLIWTQCKPCIKCYRQTAP 126

Query: 447  LFKPKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLAR 626
            +F PK S+TY  + C+S  C         L + + SK    C +   YGD + + G +A 
Sbjct: 127  IFNPKESTTYKNVSCSSSKCR-------ALEDSSCSKAEDTCSFIIAYGDFSYTKGVVAV 179

Query: 627  DTISLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSF 806
            DT++L S  +  S+SL N   GCG    G+    G GI+GLG GS SLV QLG  IN  F
Sbjct: 180  DTVTLGSTDRR-SMSLRNVTIGCGHKNSGKFDPAGSGIIGLGRGSTSLVSQLGKSINGKF 238

Query: 807  SYCLVPLSS--GLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRD-PTFYRLILNRISVG 977
            SYCLVPL+S  GLTSK+ FG                S+P++   D P+FY L +  +SVG
Sbjct: 239  SYCLVPLTSQTGLTSKINFG-----RNGVVSGKGVVSTPLVQKADSPSFYYLTIEAVSVG 293

Query: 978  KNA-------SFNXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLC 1136
            +         +          SGTTLT  PP+    ++  +A T++   + DP    +LC
Sbjct: 294  RKKLHFSGTNNGTGEGNIIIDSGTTLTLFPPEFQQKLEPVVASTIKAHPMQDPSGLLNLC 353

Query: 1137 YDTRSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQ 1316
            Y   S      F   ++  HFKGGDV L+ +NTF  + +GLSC A S  +   ++GN+AQ
Sbjct: 354  YKNIS-----SFKLPEITIHFKGGDVKLENLNTFVEVAEGLSCFAFSGNDRMTIYGNLAQ 408

Query: 1317 VNFEVGFDLKAKKISFAPKDCTK 1385
            +NF VG+D  +K +SF   +C K
Sbjct: 409  MNFLVGYDTVSKTMSFKKTNCAK 431


>ref|NP_176663.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein
            [Arabidopsis thaliana] gi|332196174|gb|AEE34295.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 431

 Score =  279 bits (713), Expect = 3e-72
 Identities = 169/424 (39%), Positives = 228/424 (53%), Gaps = 20/424 (4%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHST-RQPNLVSITHGHQIHHSS---------SYLVPNGGD 323
            FTIDLIHR+SP SP+YN    S+ R  N +  +    +  S+         S++  N G+
Sbjct: 26   FTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRGE 85

Query: 324  YLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPI 503
            YL+ I++GTP +  +A+ADTGSDLIW QC+PCE+C  Q SPLF PK SSTY  + C+S  
Sbjct: 86   YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145

Query: 504  CNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPNT 683
            C         L + + S + + C Y   YGD + + G +A DT+++ S  +   +SL N 
Sbjct: 146  CR-------ALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRR-PVSLRNM 197

Query: 684  IFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSS--GLTSKLTF 857
            I GCG    G     G GI+GLG GS SLV QL   IN  FSYCLVP +S  GLTSK+ F
Sbjct: 198  IIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINF 257

Query: 858  GPLRXXXXXXXXXXXXXSSPILTGRDP-TFYRLILNRISVG-KNASF------NXXXXXX 1013
            G                 S  +  +DP T+Y L L  ISVG K   F             
Sbjct: 258  G------TNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIV 311

Query: 1014 XXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDVVF 1193
              SGTTLT LP + Y  ++  +A T++   V DP     LCY   S      F   D+  
Sbjct: 312  IDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS-----SFKVPDITV 366

Query: 1194 HFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFAPK 1373
            HFKGGDV L  +NTF  + + +SC A ++ E   +FGN+AQ+NF VG+D  +  +SF   
Sbjct: 367  HFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKT 426

Query: 1374 DCTK 1385
            DC++
Sbjct: 427  DCSQ 430


>ref|XP_003539632.1| PREDICTED: aspartic proteinase CDR1-like [Glycine max]
          Length = 444

 Score =  279 bits (713), Expect = 3e-72
 Identities = 164/434 (37%), Positives = 224/434 (51%), Gaps = 28/434 (6%)
 Frame = +3

Query: 168  GTFTIDLIHRNSPHSPYYNPQH-----------------HSTRQPNLVSITHGHQIHHSS 296
            G F++++IHR+S  SPYY P                   +   +PNLV+ T+      + 
Sbjct: 30   GGFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHFNKPNLVASTNT-----AE 84

Query: 297  SYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTY 476
            S ++ + G+YL+  ++GTP  + + + DTGSD+IW+QC PCE+C  Q +P+F P  S TY
Sbjct: 85   STVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTY 144

Query: 477  HTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMK 656
             T+PC+S IC        C      S N+  C Y   YGD + S G L+ +T++L S   
Sbjct: 145  KTLPCSSNICQSVQSAASC------SSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDG 198

Query: 657  SISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPL--S 830
            S S+  P T+ GCG   +G   R+G GIVGLG G +SL+ QL   I   FSYCL PL   
Sbjct: 199  S-SVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQ 257

Query: 831  SGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGKN--------- 983
            S  +SKL FG                S+PI+      FY L L   SVG N         
Sbjct: 258  SNSSSKLNFG-----DEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSF 312

Query: 984  ASFNXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSK 1163
             S          SGTTLT LP D Y  ++ A+A  + L  V DP +F  LCY T    S 
Sbjct: 313  ESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRT---TSS 369

Query: 1164 KMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDL 1343
               N   +  HFKG DV L  I+TF  +++G+ C A  S +  P+FGN+AQ N  VG+DL
Sbjct: 370  DELNVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLLVGYDL 429

Query: 1344 KAKKISFAPKDCTK 1385
              + +SF P DCT+
Sbjct: 430  VKQTVSFKPTDCTQ 443


>gb|EXB70640.1| putative aspartic protease [Morus notabilis]
          Length = 446

 Score =  278 bits (711), Expect = 5e-72
 Identities = 178/443 (40%), Positives = 231/443 (52%), Gaps = 20/443 (4%)
 Frame = +3

Query: 117  SSAFSFSIKASNLISSNGTFTIDLIHRNSPHSPYYNPQHH-STRQPNLVSITHGHQIHHS 293
            SSA S      N+I+S   F++DLIHR+SP SP++NP    S R  N  S      IH +
Sbjct: 25   SSALSHDHDHKNIINS---FSVDLIHRDSPKSPFFNPSETPSERLTNAFS----RSIHRN 77

Query: 294  SSYLVP-----------NGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQK 440
               L P           N G+YL++I++GTP    +A+ADTGSDL W QC PC +C  Q 
Sbjct: 78   KKLLTPSPNDVQATVLSNRGEYLMEISIGTPPFRILAIADTGSDLTWTQCKPCTHCYNQT 137

Query: 441  SPLFKPKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKL 620
            SPLF P +S TY  IPC +  C    IT+G     T   N   C Y+  YGD + + G L
Sbjct: 138  SPLFDPASSKTYKNIPCKTSTC--MSITRG-----TCVTNPKLCPYDVEYGDRSHTEGYL 190

Query: 621  ARDTISLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINY 800
            A DT++L S      +S P  I GCG    G   ++G GIVGLG G  SLV QLG  I  
Sbjct: 191  ANDTLTLAS-TTGRPVSFPKFILGCGKDNAGTFDKRGSGIVGLGKGQESLVSQLGSSIGG 249

Query: 801  SFSYCLVPLSSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTG--RDPTFYRLILNRISV 974
             FSYCLVPLSS  +SK+ FG +              S+P+L    R   FY + L  ISV
Sbjct: 250  KFSYCLVPLSSEKSSKMNFGSI-----AQVSGPGTVSTPLLPDDFRPENFYFITLEGISV 304

Query: 975  GK-NASFN---XXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYD 1142
            G     F            SGTTLT LP D Y  ++ A+A+   L  V DP  F  LCY 
Sbjct: 305  GSTRVKFRYGVEQGNIIIDSGTTLTILPSDFYSDLESAVAKETDLERVEDPTGFLSLCY- 363

Query: 1143 TRSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIE--GTPVFGNVAQ 1316
             +S  S   F   ++  HF G DV L  +NTF  + D + C A    +   T ++GN+AQ
Sbjct: 364  -KSPESGAEFLAPNITVHFSGADVNLSTLNTFVRVSDDVVCFAFYGDDSGSTSIYGNLAQ 422

Query: 1317 VNFEVGFDLKAKKISFAPKDCTK 1385
            +NF VG+D + + +SF P DC+K
Sbjct: 423  MNFLVGYDRQKRTLSFKPTDCSK 445


>ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223536720|gb|EEF38361.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 439

 Score =  277 bits (709), Expect = 9e-72
 Identities = 166/426 (38%), Positives = 224/426 (52%), Gaps = 22/426 (5%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHSTRQPNLVSITHGHQIHHSS-------------SYLVPN 314
            FT++LI+R+SP SP+YNP+   T++          ++HH S             S ++ N
Sbjct: 29   FTVELINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEMISN 88

Query: 315  GGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCT 494
             G+YL+K +LGTP  + +A+ADTGSDLIW QC PC+ C  Q +PLF PK+SSTY  I C+
Sbjct: 89   QGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCS 148

Query: 495  SPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISL 674
            +  C+       C     + + +  C Y   YGD + + G +A DTI+L S      + L
Sbjct: 149  TKQCDLLKEGASC-----SGEGNKTCHYSYSYGDRSFTSGNVAADTITLGS-TSGRPVLL 202

Query: 675  PNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSSGLT--SK 848
            P  I GCG    G    KG GIVGLG G +SL+ QLG  I+  FSYCLVPLSS  T  SK
Sbjct: 203  PKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSK 262

Query: 849  LTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGK------NASF-NXXXX 1007
            L FG                S+P+++    TFY L L  +SVG        +SF      
Sbjct: 263  LNFG-----SNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGN 317

Query: 1008 XXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDV 1187
                SGTTLT  P D +  +  A+   V    V DP     LCY     +   +  PS +
Sbjct: 318  IIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS----IDADLKFPS-I 372

Query: 1188 VFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFA 1367
              HF G DV L  +NTF  + D + C A + I    +FGN+AQ+NF VG+DL+ K +SF 
Sbjct: 373  TAHFDGADVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGYDLEGKTVSFK 432

Query: 1368 PKDCTK 1385
            P DCT+
Sbjct: 433  PTDCTQ 438


>ref|XP_006484733.1| PREDICTED: aspartic proteinase CDR1-like [Citrus sinensis]
          Length = 426

 Score =  276 bits (706), Expect = 2e-71
 Identities = 167/440 (37%), Positives = 226/440 (51%), Gaps = 17/440 (3%)
 Frame = +3

Query: 117  SSAFSFSIKASNLIS---SNGTFTIDLIHRNSPHSPYYNPQH-HSTRQPNLVSITHGHQI 284
            +SA SF I   + +S   + G F++DLI R++P SP+Y+P   +  R    +  +     
Sbjct: 6    ASAISFLILCLSSLSITEAKGGFSLDLIRRDAPKSPFYSPDETYHQRVTKALKRSVNRVS 65

Query: 285  HHSSSYLVPNG---------GDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQ 437
            H   + + PN          G+Y++ I++GTP +E +A+ADTGSDLIW QC PC  C  Q
Sbjct: 66   HFDPAIITPNTAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWTQCKPCTECYKQ 125

Query: 438  KSPLFKPKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGK 617
             +P F P+ SSTY  + C S  C            +T+      C Y A YGD + S G 
Sbjct: 126  AAPFFDPEQSSTYKDLSCDSRQCT--------AYERTSCSTEETCEYSATYGDRSFSNGN 177

Query: 618  LARDTISLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHIN 797
            LA +T++L S      ++L N IFGCG    G       GIVGLG GS+SLV Q+G  I 
Sbjct: 178  LAVETVTLGS-TNGRPVALRNIIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGSSIG 236

Query: 798  YSFSYCLVP-LSSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISV 974
              FSYCLVP LSS  +SK+ FG                ++P++     TFY L L  ISV
Sbjct: 237  GKFSYCLVPFLSSESSSKINFG-----SNGVVSGTGVVTTPLVAKDPDTFYFLTLESISV 291

Query: 975  GK---NASFNXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDT 1145
            GK   +            SGTTLT+LPPD+   +  A++  ++   +SDP    DLCY  
Sbjct: 292  GKKKIHFDDASEGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCYPY 351

Query: 1146 RSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNF 1325
             S      F    +  HF G DVVL   NTF    D   C     +EG  ++GN+AQ NF
Sbjct: 352  SS-----DFKAPQITVHFSGADVVLSPENTFIRTSDTSVCFTFKGMEGQSIYGNLAQANF 406

Query: 1326 EVGFDLKAKKISFAPKDCTK 1385
             VG+D KAK +SF P DC+K
Sbjct: 407  LVGYDTKAKTVSFKPTDCSK 426


>gb|EMJ11898.1| hypothetical protein PRUPE_ppa025167mg [Prunus persica]
          Length = 457

 Score =  275 bits (704), Expect = 3e-71
 Identities = 165/443 (37%), Positives = 227/443 (51%), Gaps = 33/443 (7%)
 Frame = +3

Query: 159  SSNGTFTIDLIHRNSPHSPYYNPQ-------HHSTRQ----------PNLVSITHGHQIH 287
            SS+G FT DLIHR+SP SP YN         H++ R+          P + S++      
Sbjct: 29   SSHG-FTADLIHRDSPLSPLYNSSMSHLDRLHNAFRRSVTRVHHFIKPTMTSLSSSLAAP 87

Query: 288  HSSSYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNS 467
            +  S ++P+ G+YL+ +++GTP +E + +ADTGSDLIW QC PC+ C  Q  PLF PK S
Sbjct: 88   NIQSIIIPSAGEYLMNVSIGTPPVEVLGIADTGSDLIWTQCKPCKQCFNQNPPLFDPKKS 147

Query: 468  STYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPS 647
            STYH+IPC S  C Y  + +       N  + + C Y   YGD + + G LA +T++  S
Sbjct: 148  STYHSIPCQSSSCTY--LEEAACGTLINGDHDT-CEYSYRYGDRSFTRGTLALETLTFGS 204

Query: 648  QMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHIN-YSFSYCLVP 824
                   SLP  +FGCG    G     G G++GLG G LSL+ QL    N   FSYCL+P
Sbjct: 205  -TSGRPTSLPKVVFGCGHENGGTFDESGSGLIGLGGGPLSLISQLTKLTNGGKFSYCLLP 263

Query: 825  LSSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG--------- 977
             ++   SK++FG                S+P++     TFY L L  ISVG         
Sbjct: 264  TANTAASKISFG-----SAGIVSGSGAVSTPLVAKNPDTFYYLTLEAISVGEKRLAYKTK 318

Query: 978  -----KNASFNXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYD 1142
                 K A           SGTTLT LPP  +D +  AL   +    VSDP     LC+ 
Sbjct: 319  SPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAERVSDPRGILSLCFK 378

Query: 1143 TRSLVSKKMFNPSDVVFHFKGG-DVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQV 1319
            ++S           +  HF GG DV L+ +NTF  ++D + C  M       +FGN+AQ+
Sbjct: 379  SKS----DDIGVPVITVHFSGGADVKLQALNTFARMDDDMICFTMIPSSDVAIFGNLAQM 434

Query: 1320 NFEVGFDLKAKKISFAPKDCTKH 1388
            NF VG+DL+ + +SF P DCTKH
Sbjct: 435  NFLVGYDLEERSVSFKPTDCTKH 457


>gb|ESW21352.1| hypothetical protein PHAVU_005G063600g [Phaseolus vulgaris]
          Length = 444

 Score =  275 bits (702), Expect = 6e-71
 Identities = 165/426 (38%), Positives = 224/426 (52%), Gaps = 20/426 (4%)
 Frame = +3

Query: 168  GTFTIDLIHRNSPHSPYYNPQHHSTRQPN------LVSITHGHQIHHSS-----SYLVPN 314
            G F+++LIHR+SP SP+ NP     ++ N      L  + H +    ++     S +  N
Sbjct: 33   GGFSVELIHRDSPKSPFNNPTKTLFQKLNNSFHRSLERVKHFYPTTKATENTPQSVITSN 92

Query: 315  GGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCT 494
             G+YL+K ++GTP  E + +ADTGSDLIW QC PCE C  Q SPLF P  S TY  + C 
Sbjct: 93   QGEYLVKYSIGTPAFEVMGIADTGSDLIWSQCKPCEQCYNQSSPLFDPSKSKTYKPVSCY 152

Query: 495  SPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISL 674
            S +C     T  C S+         C Y   YGDG+ S G LA DT++L S   S S++ 
Sbjct: 153  SRVCQSLGQTY-CYSD-----TDPNCQYTVSYGDGSHSQGNLAFDTLTLGSSADS-SVAF 205

Query: 675  PNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPL-SSGLTSKL 851
            P    GCG    G    +G GIVGLG G +SLV Q+GP I++ FSYCLVPL  S  +SKL
Sbjct: 206  PRIPIGCGVNNAGTFDTEGSGIVGLGGGHVSLVSQIGPSIDFKFSYCLVPLFDSKSSSKL 265

Query: 852  TFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGK--------NASFNXXXX 1007
             FG                S+PI++G   TFY L L  +SVG         + S +    
Sbjct: 266  NFG-----ANAVVDGPGTVSTPIISGSVDTFYYLKLEGMSVGSKRIDFVGDSTSDDEKGN 320

Query: 1008 XXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDV 1187
                SGTTLT LP   Y  ++  +A  + L  V+   +   LCY++             +
Sbjct: 321  IIIDSGTTLTILPETFYAKLESEVAANINLERVNITDQILSLCYNSG---GHSAVEAPPI 377

Query: 1188 VFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFA 1367
            V HF G DVVL  +NTF ++ DG+ C A + +    +FGN+AQ+N  VG+DL  + +SF 
Sbjct: 378  VAHFSGADVVLNSLNTFVSVSDGVLCFAFAPVATGSIFGNLAQMNHLVGYDLLRRTVSFK 437

Query: 1368 PKDCTK 1385
            P DCTK
Sbjct: 438  PTDCTK 443


>ref|XP_006403054.1| hypothetical protein EUTSA_v10003479mg [Eutrema salsugineum]
            gi|557104161|gb|ESQ44507.1| hypothetical protein
            EUTSA_v10003479mg [Eutrema salsugineum]
          Length = 439

 Score =  275 bits (702), Expect = 6e-71
 Identities = 165/426 (38%), Positives = 223/426 (52%), Gaps = 22/426 (5%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHST-RQPNLVSITHGHQIHHSS---------SYLVPNGGD 323
            FT DLIHR+SP SP+Y P   S+ R  N +  +  H +H SS         + +  N G+
Sbjct: 31   FTTDLIHRDSPKSPFYKPTETSSQRLRNAIRRSVNHVVHFSSKDASVDSPQTEITSNRGE 90

Query: 324  YLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPI 503
            YL+ I+LGTP    +A+ADTGSDL+W QC PC++C  Q  PLF PK SSTY    C+S  
Sbjct: 91   YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQNDPLFDPKASSTYKDFSCSSSQ 150

Query: 504  CNYPHITKGCLSNKTN-SKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPN 680
            C+        L N+ + S   + C Y   YGD + + G +A DT++L S   +  + L N
Sbjct: 151  CS-------ALGNQASCSTEDNTCSYSMSYGDHSYTNGNVAADTLTLGS-TNNRPVQLKN 202

Query: 681  TIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSS--GLTSKLT 854
             I GCG    G   ++G GIVGLG G +SL+ QLG  I+  FSYCL+PLSS  G TS + 
Sbjct: 203  VIIGCGHNNNGTFNKEGSGIVGLGGGPVSLISQLGESIDGKFSYCLIPLSSENGKTSNIN 262

Query: 855  FGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG-KNASF--------NXXXX 1007
            FG                S+P++T    TFY L L  ISVG KN  F             
Sbjct: 263  FG-----TSAVVSGTGAVSTPLITKSRETFYYLTLASISVGSKNIKFPVSDPGSGEGEGN 317

Query: 1008 XXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDV 1187
                SGTTLT LP   Y  ++DA+A ++     +DP     LCY   + +   +     +
Sbjct: 318  IIIDSGTTLTMLPTTFYSELEDAVASSIDAERQNDPESPLSLCYSATANLKVPV-----I 372

Query: 1188 VFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFA 1367
              HF G DV L   N+F  L + L C A    E   ++GN++Q+NF VG+D  +K +SF 
Sbjct: 373  TMHFDGADVKLDSSNSFVQLSEELVCFAFRGSEDLAIYGNLSQMNFLVGYDTVSKTVSFK 432

Query: 1368 PKDCTK 1385
            P DC K
Sbjct: 433  PADCAK 438


>ref|XP_006437358.1| hypothetical protein CICLE_v10033646mg [Citrus clementina]
            gi|557539554|gb|ESR50598.1| hypothetical protein
            CICLE_v10033646mg [Citrus clementina]
          Length = 426

 Score =  274 bits (700), Expect = 1e-70
 Identities = 161/430 (37%), Positives = 221/430 (51%), Gaps = 14/430 (3%)
 Frame = +3

Query: 138  IKASNLISSNGTFTIDLIHRNSPHSPYYNPQH-HSTRQPNLVSITHGHQIHHSSSYLVPN 314
            + + ++  + G F++DLI R++P SP+Y+P   +  R    +  +     H   + + PN
Sbjct: 16   LSSLSITEAKGGFSLDLIRRDAPKSPFYSPDETYHQRVTKALKRSVNRVSHFDPAIITPN 75

Query: 315  G---------GDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNS 467
                      G+Y++ I++GTP +E +A+ADTGSDLIW QC PC  C  Q +P F P+ S
Sbjct: 76   TAQADIISALGEYVMNISIGTPPVEILAIADTGSDLIWTQCKPCTECYKQAAPFFDPEQS 135

Query: 468  STYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPS 647
            STY  + C S  C            +T+      C Y A YGD + S G LA +T++L S
Sbjct: 136  STYKDLSCDSRQCT--------AYERTSCSTEEICEYSATYGDRSFSNGNLAVETVTLGS 187

Query: 648  QMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVP- 824
                  ++L N IFGCG    G       GIVGLG GS+SLV Q+G  I   FSYCLVP 
Sbjct: 188  -TNGRPVALRNLIFGCGHNDDGTFNENATGIVGLGGGSVSLVTQMGSSIGGKFSYCLVPF 246

Query: 825  LSSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGK---NASFN 995
            LSS  +SK+ FG                ++P++     TFY L L  ISVGK   +    
Sbjct: 247  LSSESSSKINFG-----SNGVVSGTGVVTTPLVAKDPDTFYFLTLESISVGKKKIHFDDA 301

Query: 996  XXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFN 1175
                    SGTTLT+LPPD+   +  A++  ++   +SDP    DLCY   S      F 
Sbjct: 302  SEGNIIIDSGTTLTFLPPDIVSKLTSAVSDLIKADPISDPEGVLDLCYPYSS-----DFK 356

Query: 1176 PSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKK 1355
               +  HF G DVVL   NTF    D   C     +EG  ++GN+AQ NF VG+D KAK 
Sbjct: 357  APQITVHFSGADVVLSPENTFIRTSDTTVCFTFKGMEGQSIYGNLAQANFLVGYDTKAKT 416

Query: 1356 ISFAPKDCTK 1385
            +SF P DC+K
Sbjct: 417  VSFKPTDCSK 426


>ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
            gi|355517824|gb|AES99447.1| Aspartic proteinase
            nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  274 bits (700), Expect = 1e-70
 Identities = 167/443 (37%), Positives = 231/443 (52%), Gaps = 21/443 (4%)
 Frame = +3

Query: 120  SAFSFSIKASNLISSNGTFTIDLIHRNSPHSPYYNPQHHSTRQ---PNLVSITHGHQIHH 290
            S FS    AS   + +  F+++LIHR+SP SPYY P  +  +        SI   +    
Sbjct: 10   SLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69

Query: 291  SS------SYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLF 452
             S      S ++P+ G YL+  ++GTP  +   +ADTGSD++W+QC PCE C  Q +P+F
Sbjct: 70   DSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIF 129

Query: 453  KPKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDT 632
             P  SS+Y  IPC S +C+            T+  + + C Y+  YGD + S G L+ DT
Sbjct: 130  NPSKSSSYKNIPCLSKLCH--------SVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDT 181

Query: 633  ISLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSY 812
            +SL S   S  +S P T+ GCG    G  G    GIVGLG G +SL+ QLG  I   FSY
Sbjct: 182  LSLESTSGS-PVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSY 240

Query: 813  CLVPL---SSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG-K 980
            CLVPL    S  +S L+FG                S+P++  +DP FY L L   SVG K
Sbjct: 241  CLVPLLNKESNASSILSFG-----DAAVVSGDGVVSTPLIK-KDPVFYFLTLQAFSVGNK 294

Query: 981  NASF-------NXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCY 1139
               F       +        SGTTLT +P DVY  ++ A+   V+L  V DP + + LCY
Sbjct: 295  RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354

Query: 1140 DTRSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMS-SIEGTPVFGNVAQ 1316
              +S      ++   +  HFKG D+ L  I+TF  + DG+ C A   S +   +FGN+AQ
Sbjct: 355  SLKS----NEYDFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQ 410

Query: 1317 VNFEVGFDLKAKKISFAPKDCTK 1385
             N  VG+DL+ K +SF P DCTK
Sbjct: 411  QNLLVGYDLQQKTVSFKPTDCTK 433


>ref|XP_006393532.1| hypothetical protein EUTSA_v10012077mg, partial [Eutrema salsugineum]
            gi|557090110|gb|ESQ30818.1| hypothetical protein
            EUTSA_v10012077mg, partial [Eutrema salsugineum]
          Length = 452

 Score =  273 bits (698), Expect = 2e-70
 Identities = 166/426 (38%), Positives = 222/426 (52%), Gaps = 22/426 (5%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHST-RQPNLVSITHGHQIHHSS---------SYLVPNGGD 323
            FT DLIHR+SP SP+Y P   S+ R  N +  +    +H SS         + +  N G+
Sbjct: 44   FTTDLIHRDSPKSPFYKPTETSSQRLRNAIRRSVNRVVHFSSKDASVDSPQTEITSNRGE 103

Query: 324  YLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPI 503
            YL+ I+LGTP    +A+ADTGSDLIW QC PC++C  Q  PLF PK SSTY    C+S  
Sbjct: 104  YLMNISLGTPPFPIMAIADTGSDLIWTQCKPCDDCYTQNDPLFDPKASSTYKYFSCSSSQ 163

Query: 504  CNYPHITKGCLSNKTN-SKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPN 680
            C+        L N+ + S   + C Y   YGD + + G +A DT++L S  K   + L N
Sbjct: 164  CS-------ALGNQASCSTEDNTCPYSISYGDHSYTNGNVAADTLTLGSTNKR-PVQLKN 215

Query: 681  TIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSS--GLTSKLT 854
             I GCG    G   ++G GIVGLG G +SL+ QLG  I+  FSYCL+PLSS    TSK+ 
Sbjct: 216  VIIGCGHNNNGTFNKEGSGIVGLGGGPVSLISQLGESIDGKFSYCLIPLSSENDKTSKIN 275

Query: 855  FGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG-KNASF--------NXXXX 1007
            FG                S+P++T    TFY L L  ISVG KN  F             
Sbjct: 276  FG-----TSAVVSGTGAVSTPLITKSRETFYYLTLESISVGSKNIKFPVSDPGSGEGEGN 330

Query: 1008 XXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDV 1187
                SGTTLT LP   Y  ++DA+A ++     +DP     LCY   + +   +     +
Sbjct: 331  IIIDSGTTLTMLPTTFYSELEDAVASSIDAERQNDPESPLSLCYSATANLKVPV-----I 385

Query: 1188 VFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFA 1367
              HF G DV L   N+F  L + L C A    E   ++GN++Q+NF VG+D  +K +SF 
Sbjct: 386  TMHFDGADVKLDSSNSFVQLSEELVCFAFRGSEDLAIYGNLSQMNFLVGYDTVSKTVSFK 445

Query: 1368 PKDCTK 1385
            P DC K
Sbjct: 446  PADCAK 451


>ref|XP_006285435.1| hypothetical protein CARUB_v10006851mg [Capsella rubella]
            gi|482554140|gb|EOA18333.1| hypothetical protein
            CARUB_v10006851mg [Capsella rubella]
          Length = 436

 Score =  273 bits (698), Expect = 2e-70
 Identities = 158/422 (37%), Positives = 218/422 (51%), Gaps = 18/422 (4%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNP-QHHSTRQPNLVSITHGHQIHHSSSY--------LVPNGGDY 326
            FT DLIHR+SP SP++NP +  S R  N ++ +     H +           +  NGG+Y
Sbjct: 31   FTADLIHRDSPKSPFFNPTETPSQRLRNSINRSVNRAFHFTEDTSANSPQVEITSNGGEY 90

Query: 327  LIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPIC 506
            L+ ++LGTP    +A+ADTGSDL+W QC PC++C  Q  PLF PK SSTY  + C+S  C
Sbjct: 91   LMNVSLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQDDPLFDPKASSTYKDVSCSSSQC 150

Query: 507  NYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPNTI 686
            N       C      S + + C Y   YGD + + G +A DT++L S   +  + + N +
Sbjct: 151  NALEDHASC------SVDDTTCSYSMSYGDHSYTRGNIAADTLTLGS-TNNRPVQIKNVL 203

Query: 687  FGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSS--GLTSKLTFG 860
             GCG    G    KG GI+GLG G+ SL+ QLG  I+  FSYCLVPL+S    TSKL FG
Sbjct: 204  IGCGHNNSGTFNEKGSGIIGLGGGAASLITQLGDSIDGKFSYCLVPLTSETDRTSKLNFG 263

Query: 861  PLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGK-------NASFNXXXXXXXX 1019
                            S+P+++    TFY L L  ISVG        + S          
Sbjct: 264  -----TNAEVSGTGVVSTPLISKSPETFYYLTLESISVGSKKIPFPVSESGTTEGNIIID 318

Query: 1020 SGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDVVFHF 1199
            SGTTLT LP + Y  ++DA+A  +      DP +   LCY     +   +     +  HF
Sbjct: 319  SGTTLTLLPAEFYSELEDAVASAITAERKEDPKKVLSLCYSATEDLKVPI-----ITMHF 373

Query: 1200 KGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFDLKAKKISFAPKDC 1379
             G DV L   N+F  +   L C A S      ++GN++Q+NF VG+D  +KK+SF P DC
Sbjct: 374  DGADVKLDSSNSFVQISQELVCFAFSGSPSLAIYGNLSQMNFLVGYDTVSKKVSFKPTDC 433

Query: 1380 TK 1385
             K
Sbjct: 434  AK 435


>ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
            gi|355517818|gb|AES99441.1| Aspartic proteinase
            nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  273 bits (698), Expect = 2e-70
 Identities = 167/443 (37%), Positives = 231/443 (52%), Gaps = 21/443 (4%)
 Frame = +3

Query: 120  SAFSFSIKASNLISSNGTFTIDLIHRNSPHSPYYNPQHHSTRQ---PNLVSITHGHQIHH 290
            S FS    AS   + +  F+++LIHR+SP SPYY P  +  +        SI   +    
Sbjct: 10   SLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69

Query: 291  SS------SYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLF 452
             S      S ++P+ G YL+  ++GTP  +   +ADTGSD++W+QC PCE C  Q +P+F
Sbjct: 70   DSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIF 129

Query: 453  KPKNSSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDT 632
             P  SS+Y  IPC+S +C+            T+  + + C Y+  YGD + S G L+ DT
Sbjct: 130  NPSKSSSYKNIPCSSKLCH--------SVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDT 181

Query: 633  ISLPSQMKSISISLPNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSY 812
            +SL S   S  +S P  + GCG    G  G    GIVGLG G +SL+ QLG  I   FSY
Sbjct: 182  LSLESTSGS-PVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSY 240

Query: 813  CLVPL---SSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG-K 980
            CLVPL    S  +S L+FG                S+P++  +DP FY L L   SVG K
Sbjct: 241  CLVPLLNKESNASSILSFG-----DAAVVSGDGVVSTPLIK-KDPVFYFLTLQAFSVGNK 294

Query: 981  NASF-------NXXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCY 1139
               F       +        SGTTLT +P DVY  ++ A+   V+L  V DP + + LCY
Sbjct: 295  RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354

Query: 1140 DTRSLVSKKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMS-SIEGTPVFGNVAQ 1316
              +S      ++   +  HFKG DV L  I+TF  + DG+ C A   S +   +FGN+AQ
Sbjct: 355  SLKS----NEYDFPIITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQ 410

Query: 1317 VNFEVGFDLKAKKISFAPKDCTK 1385
             N  VG+DL+ K +SF P DCTK
Sbjct: 411  QNLLVGYDLQQKTVSFKPTDCTK 433


>ref|XP_004244686.1| PREDICTED: aspartic proteinase CDR1-like [Solanum lycopersicum]
          Length = 442

 Score =  272 bits (695), Expect = 4e-70
 Identities = 155/426 (36%), Positives = 224/426 (52%), Gaps = 20/426 (4%)
 Frame = +3

Query: 171  TFTIDLIHRNSPHSPYYNP--------QHHSTRQPNLVSITHGHQIHHSSSYLVPNGGDY 326
            +FT+DLIHR+SP SP++NP        QH   R  +  S      ++   S L+P+GG+Y
Sbjct: 33   SFTLDLIHRDSPLSPFHNPSNTPYERLQHALYRSFSRASFLKKKYVNPIESTLIPSGGEY 92

Query: 327  LIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPIC 506
            L+KI++GTP ++ + +ADTGSDL W QC PC NC  Q +P+F PK SS+Y TI C + +C
Sbjct: 93   LMKISIGTPPIDTLVIADTGSDLTWTQCKPCVNCFKQLTPIFNPKKSSSYKTIGCNNKLC 152

Query: 507  NYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPNTI 686
                  +G L       N+S C YE  YGD + ++G L+ +T +  S   S ++S+PN +
Sbjct: 153  ------QGSLC------NNSRCNYEVSYGDQSHTMGDLSIETFTF-SSTSSQNVSIPNIV 199

Query: 687  FGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSSGLTSKLTFGPL 866
            FGCG    G       GI+GLG G++S+V Q+   I   FSYCL+PL S L +      +
Sbjct: 200  FGCGHDNGGTFPNVTSGIIGLGGGNVSIVNQMHQQIKGKFSYCLIPLESLLDNSNATSHI 259

Query: 867  RXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVG-KNASFN----------XXXXXX 1013
                          S+P++     TFY L L RIS+G +   FN                
Sbjct: 260  NFGNCATVSGPNVVSTPLIKKEPSTFYYLNLERISIGNRTVEFNSFPVVVGGDDDPGNII 319

Query: 1014 XXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDVVF 1193
              SGTTLTY+P   Y  ++  L  ++      DP   + LCY++         +   +V 
Sbjct: 320  IDSGTTLTYVPDAFYLNLESMLILSINATKKDDPSSSFRLCYESN---KNGTIDVPKIVA 376

Query: 1194 HFKGGDVVLKGINTFRTLEDGLSCLA-MSSIEGTPVFGNVAQVNFEVGFDLKAKKISFAP 1370
            HF   D+ L   N F  + +G+ CL  +       +FGN+AQ NF +G+DLKA K+SF P
Sbjct: 377  HFTNADLELSTSNIFTKVVEGIVCLTIVPGGNQISIFGNLAQANFLIGYDLKANKVSFKP 436

Query: 1371 KDCTKH 1388
             DCTK+
Sbjct: 437  TDCTKY 442


>ref|XP_004229589.1| PREDICTED: aspartic proteinase CDR1-like [Solanum lycopersicum]
          Length = 440

 Score =  270 bits (691), Expect = 1e-69
 Identities = 156/435 (35%), Positives = 226/435 (51%), Gaps = 17/435 (3%)
 Frame = +3

Query: 132  FSIKASNLI----SSNGTFTIDLIHRNSPHSPYYNPQHHSTRQPNLVSITHGHQ-----I 284
            F+I  S+L     + N  F+IDLIHR+SP+SP+Y+P    T++ N       H+     +
Sbjct: 14   FAILVSSLFPFTKALNNVFSIDLIHRDSPNSPFYDPSLSFTQRMNNSFHRSFHRSISLCL 73

Query: 285  HHSSSYLVPNGGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKN 464
              SS  +  N G+YL++ ++GTP +  + VADTGSDL W QC PC+NC  Q+S +F PK 
Sbjct: 74   PSSSVVVASNNGEYLMRYSIGTPPVRTLGVADTGSDLTWTQCLPCKNCFKQQSRIFNPKK 133

Query: 465  SSTYHTIPCTSPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLP 644
            SSTY  + C+S +C+  +    C  N+   K    C Y   YGD + S+G LA +TI   
Sbjct: 134  SSTYKPLHCSSKMCHAANFPTSC--NRVMMKKKKTCRYHVRYGDNSYSIGDLATETIRFG 191

Query: 645  SQMKSISISLPNTIFGCGFYQRGQM-GRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLV 821
            S +    + L  T+ GCG    G   G K  GIVGLG G  SL+ Q+G  I   FSYCL 
Sbjct: 192  SSIHK-QVKLKKTVIGCGHNNAGTFSGDKESGIVGLGGGKFSLISQMGSSIGGKFSYCLA 250

Query: 822  PLSSGLTSKLTFGPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGK------N 983
            P      + +    +              ++P+      T+Y L L  +SVGK      N
Sbjct: 251  PFFYQQKTYIPKSKIHFGTDGFVPGDDVVTTPLTRKFPATYYYLTLEGVSVGKQRLDFRN 310

Query: 984  ASFN-XXXXXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVS 1160
             SF+         SGT LT   P++Y  ++  +   ++LPT++DP    +LCY + S+  
Sbjct: 311  VSFSYGEGNIIIDSGTVLTLFSPEIYVKLESMVKTAIKLPTIADPSGSLNLCYKSISIDK 370

Query: 1161 KKMFNPSDVVFHFKGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVFGNVAQVNFEVGFD 1340
              +     +  HF G D+ L   NTF    D   C A ++  G  +FGN+AQ+NF VG+D
Sbjct: 371  IPI-----ITMHFIGADLKLGPWNTFVETSDSSMCFAFAASYGGQIFGNIAQMNFLVGYD 425

Query: 1341 LKAKKISFAPKDCTK 1385
            L  K++SF P DC+K
Sbjct: 426  LNNKRVSFKPTDCSK 440


>ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
            sativus] gi|449470632|ref|XP_004153020.1| PREDICTED:
            probable aspartic protease At2g35615-like [Cucumis
            sativus] gi|449499016|ref|XP_004160697.1| PREDICTED:
            probable aspartic protease At2g35615-like [Cucumis
            sativus]
          Length = 434

 Score =  270 bits (689), Expect = 2e-69
 Identities = 162/422 (38%), Positives = 215/422 (50%), Gaps = 20/422 (4%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQH-HSTRQPNLV--SITHGHQIHHSSSYLVP---NGGDYLIK 335
            FT++LIHR+SP SP YN    H  R  N +  S      +  S +   P   NGG+YL++
Sbjct: 27   FTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYLVE 86

Query: 336  ITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCTSPICNYP 515
            I++GTP    +AVADTGSD+IW QC PC NC  Q +P+F P  S+TY  + C+SP+C+Y 
Sbjct: 87   ISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPVCSYS 146

Query: 516  HITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLPNTIFGC 695
                 C        + S CLY   YGD + S G LA DT+++ S      ++ P T+ GC
Sbjct: 147  GDGSSC-------SDDSECLYSIAYGDDSHSQGNLAVDTVTMQS-TSGRPVAFPRTVIGC 198

Query: 696  GFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSSGLT---SKLTFGPL 866
            G    G       GIVGLG G  SLV QLGP     FSYCL+P+ +G T   +KL FG  
Sbjct: 199  GHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFG-- 256

Query: 867  RXXXXXXXXXXXXXSSPILTGRD-PTFYRLILNRISVGKN--------ASFNXXXXXXXX 1019
                          S+PI +     TFY L L  +SVG          +           
Sbjct: 257  ---SNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIID 313

Query: 1020 SGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDVVFHF 1199
            SGTTLTYLP  + +    A+++++ LP   DP EF D C+ T    +   +    V  HF
Sbjct: 314  SGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFAT----TTDDYEMPPVTMHF 369

Query: 1200 KGGDVVLKGINTFRTLEDGLSCLAMSSIEGTPVF--GNVAQVNFEVGFDLKAKKISFAPK 1373
            +G DV L+  N F  L D   CLA  S     +F  GN+AQ NF VG+D+K   +SF P 
Sbjct: 370  EGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPA 429

Query: 1374 DC 1379
             C
Sbjct: 430  HC 431


>gb|EOY14113.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao]
          Length = 429

 Score =  269 bits (688), Expect = 2e-69
 Identities = 163/420 (38%), Positives = 229/420 (54%), Gaps = 16/420 (3%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHST---RQPNLVSITHGHQIHH------SSSYLVPNGGDY 326
            F+++LIHR+SP SP++N    S+   R+  L S+     I        + S ++PNGG Y
Sbjct: 32   FSVELIHRDSPVSPFFNDSITSSELLRKNALHSMDRIKNIQFYIDQKATQSVVIPNGGTY 91

Query: 327  LIKITLGTPKLEFIAVADTGSDLIWIQCSPC--ENCIPQKSPLFKPKNSSTYHTIPCTSP 500
            L+K++ GTP +E++A+ADTGSDL WIQC+PC    C  Q S  F P  SSTY  + C S 
Sbjct: 92   LMKLSFGTPPVEYVAIADTGSDLTWIQCAPCPQSQCYSQGSSPFDPAASSTYRKLSCVSE 151

Query: 501  ICN-YPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISLP 677
             C   P   K CL       N++ C Y   YGD + ++G L+ DT+S  S   S   S P
Sbjct: 152  ACQALPR--KSCL-------NTNECEYFYSYGDKSYTIGILSSDTLSFDSS-SSPKTSFP 201

Query: 678  NTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSSGLTSKLTF 857
             +IFGCG   +G   R G G+VGLG G LSL+ Q+G  I++ FSYCLVP S+  + KL F
Sbjct: 202  TSIFGCGHNNQGNFRRPGAGLVGLGGGPLSLISQIGTQIDHRFSYCLVPRSATSSGKLVF 261

Query: 858  GPLRXXXXXXXXXXXXXSSPILTGRDPTFYRLILNRISVGKNA--SFNXXXXXXXXSGTT 1031
            G                S+P++T    TFY L L  IS+G     + +        SGTT
Sbjct: 262  G-----QEAIISRPGAVSTPLITKTPATFYYLNLEGISIGDKTAQAASSQGNIIIDSGTT 316

Query: 1032 LTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSDVVFHFKGGD 1211
            LT L  + Y+ V+  +   +      DP   + LCY   + +        D+VFHF G D
Sbjct: 317  LTILESNFYNSVETMVKGAIGAEPEQDPSGTFTLCYRAETKI-------PDMVFHFTGAD 369

Query: 1212 VVLKGINTFRTLEDGLSCLAM--SSIEGTPVFGNVAQVNFEVGFDLKAKKISFAPKDCTK 1385
            + L+ +NTF  + DGL C+ +  S+     +FGN AQ+NF+V +DL+ + +SFAP DCTK
Sbjct: 370  LRLQPVNTF-GVNDGLLCMLIVPSNTNSNSIFGNYAQINFQVEYDLQKRTVSFAPTDCTK 428


>ref|XP_002320947.1| aspartyl protease family protein [Populus trichocarpa]
            gi|222861720|gb|EEE99262.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 440

 Score =  269 bits (688), Expect = 2e-69
 Identities = 165/428 (38%), Positives = 222/428 (51%), Gaps = 24/428 (5%)
 Frame = +3

Query: 174  FTIDLIHRNSPHSPYYNPQHHSTRQPNLVSITHGHQIHH-------------SSSYLVPN 314
            FT+DLIHR+SP SP+YN +    ++ N        ++HH             + S +  N
Sbjct: 32   FTVDLIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSN 91

Query: 315  GGDYLIKITLGTPKLEFIAVADTGSDLIWIQCSPCENCIPQKSPLFKPKNSSTYHTIPCT 494
             G+YL+ ++LGTP  + + +ADTGSDLIW QC PCE C  Q  PLF PK+S TY    C 
Sbjct: 92   RGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCD 151

Query: 495  SPICNYPHITKGCLSNKTNSKNSSPCLYEALYGDGTQSVGKLARDTISLPSQMKSISISL 674
            +  C+        L   T S N   C Y+  YGD + ++G +A DTI+L S   S  +S 
Sbjct: 152  ARQCS-------LLDQSTCSGNI--CQYQYSYGDRSYTMGNVASDTITLDSTTGS-PVSF 201

Query: 675  PNTIFGCGFYQRGQMGRKGEGIVGLGAGSLSLVQQLGPHINYSFSYCLVPLSS--GLTSK 848
            P T+ GCG    G    KG GIVGLGAG LSL+ Q+G  +   FSYCLVPLSS  G +SK
Sbjct: 202  PKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSK 261

Query: 849  LTFGPLRXXXXXXXXXXXXXSSPILTGRD-PTFYRLILNRISVGK-------NASFNXXX 1004
            L FG                S+P+L+     +FY L L  +SVG        ++      
Sbjct: 262  LNFG-----SNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEG 316

Query: 1005 XXXXXSGTTLTYLPPDVYDGVKDALARTVRLPTVSDPMEFYDLCYDTRSLVSKKMFNPSD 1184
                 SGTTLT +P D +  +  A+   V      DP  F  +CY   S +         
Sbjct: 317  NIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKVPA----- 371

Query: 1185 VVFHFKGGDVVLKGINTFRTLEDGLSCLAM-SSIEGTPVFGNVAQVNFEVGFDLKAKKIS 1361
            +  HF G DV LK INTF  + D + CLA  S+  G  ++GNVAQ+NF V ++++ K +S
Sbjct: 372  ITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLS 431

Query: 1362 FAPKDCTK 1385
            F P DCTK
Sbjct: 432  FKPTDCTK 439


Top