BLASTX nr result

ID: Mentha26_contig00030304 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00030304
         (1451 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus...   573   e-161
ref|XP_002309394.1| aspartyl protease family protein [Populus tr...   472   e-130
ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223...   464   e-128
ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2...   462   e-127
emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]   460   e-127
ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2...   455   e-125
ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2...   452   e-124
ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2...   446   e-123
ref|XP_007027933.1| Eukaryotic aspartyl protease family protein,...   444   e-122
ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prun...   434   e-119
gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]    430   e-118
ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citr...   429   e-117
ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2...   429   e-117
ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phas...   429   e-117
ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   429   e-117
ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1...   426   e-116
ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor,...   422   e-115
ref|XP_002877867.1| aspartyl protease family protein [Arabidopsi...   421   e-115
ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Popu...   420   e-115
ref|XP_006403798.1| hypothetical protein EUTSA_v10010339mg [Eutr...   419   e-114

>gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus guttatus]
          Length = 462

 Score =  573 bits (1478), Expect = e-161
 Identities = 284/436 (65%), Positives = 326/436 (74%), Gaps = 9/436 (2%)
 Frame = +3

Query: 51   PTFAAPP--LANPWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYS 224
            PT A+PP  LANPWQ               +KH  T+ S       TK PLFPRGYGGYS
Sbjct: 31   PTTASPPPPLANPWQRLNHLSAASSTRAHLLKHPNTSTS---AAAATKAPLFPRGYGGYS 87

Query: 225  ISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIV 398
            ISL FGTPPQT  FVMDTGSSLVWFPCT RY C+SCNF +   +N S+F+PK SSS+ I+
Sbjct: 88   ISLSFGTPPQTLPFVMDTGSSLVWFPCTQRYACNSCNFVNVNPSNISIFLPKSSSSSMII 147

Query: 399  GCRNPKCKWLFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDKSVDN 578
            GC+NPKC+W+F +VQC+ CD NST C + CP YI+QY               FP+KSV+N
Sbjct: 148  GCKNPKCRWIFPDVQCKNCDQNSTTCKEFCPPYIIQYGSGSTTGLLLSETLFFPEKSVEN 207

Query: 579  FVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL----D 746
            F VGCS  S RQPAGIAGFGRGPESLPAQMGLK+FSYCLVSHRFD +PVSSDL+      
Sbjct: 208  FFVGCSIFSSRQPAGIAGFGRGPESLPAQMGLKRFSYCLVSHRFDDEPVSSDLVFVGGGG 267

Query: 747  XXXXXXXTKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTI 926
                    +YTPFRKNP S+NPAF++YYYVTLRKITVGGV VKAPY+FLVAD+ G+GGTI
Sbjct: 268  AAGAAAGVEYTPFRKNPKSANPAFQDYYYVTLRKITVGGVHVKAPYEFLVADAAGDGGTI 327

Query: 927  VDSGTTFTFMEGKVFELVAEEFEKQVG-EHYRRAAAVEEESGLRPCYNISGEKTVELPQL 1103
            VDSGTTFTFME +VFE VAEEFEKQVG  +Y RA  VE+ SGLRPC+N+SGE +V LP+L
Sbjct: 328  VDSGTTFTFMESRVFEPVAEEFEKQVGRRNYSRAREVEDRSGLRPCFNVSGEGSVSLPEL 387

Query: 1104 TFHFKGGAKMALPLADYFSFLDEAVICMTXXXXXXXXXXXXXPGPAIILGNYQQQNFYME 1283
            +FHFKGGA+M LPLADYFSFLD++VICMT             PGPAIILGNYQQQNFYME
Sbjct: 388  SFHFKGGAEMVLPLADYFSFLDDSVICMT-VVTNNSTREGIGPGPAIILGNYQQQNFYME 446

Query: 1284 YDLENERLGFRSQVCK 1331
            YDLENERLGF+ Q+CK
Sbjct: 447  YDLENERLGFKRQLCK 462


>ref|XP_002309394.1| aspartyl protease family protein [Populus trichocarpa]
            gi|222855370|gb|EEE92917.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 469

 Score =  472 bits (1215), Expect = e-130
 Identities = 243/428 (56%), Positives = 284/428 (66%), Gaps = 11/428 (2%)
 Frame = +3

Query: 78   NPWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQT 257
            NPW                +K  +T  S        K PLFPR YGGYSISL FGTPPQT
Sbjct: 51   NPWGALNHLASLSLSRAHHIKSPKTKFSLL------KTPLFPRSYGGYSISLNFGTPPQT 104

Query: 258  TSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLF 431
            T FVMDTGSSLVWFPCT RY CS C+F   +      FIPK SSS+ ++GC+N KC WLF
Sbjct: 105  TKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLF 164

Query: 432  E---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDK-SVDNFVVGCSF 599
                  +C+ECD  +  C Q CP Y++QY               FP K ++  F+VGCS 
Sbjct: 165  GPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSL 224

Query: 600  ASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXTK-- 773
             SIRQP GIAGFGR PESLP+Q+GLKKFSYCLVSH FD  P SSDL+LD       TK  
Sbjct: 225  FSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTP 284

Query: 774  ---YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTT 944
               YTPF+KNP +   AFR+YYYV LR I +G   VK PYKFLV  SDGNGGTIVDSGTT
Sbjct: 285  GLSYTPFQKNPTA---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTT 341

Query: 945  FTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGG 1124
            FTFME  V+ELVA+EFEKQV  HY  A  V+ ++GLRPC+NISGEK+V +P+  FHFKGG
Sbjct: 342  FTFMEKPVYELVAKEFEKQVA-HYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGG 400

Query: 1125 AKMALPLADYFSFLDEAVICMTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENER 1304
            AKMALPLA+YFSF+D  VIC+T              GPAIILGNYQQ+NF++E+DL+NER
Sbjct: 401  AKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGG-GPAIILGNYQQRNFHVEFDLKNER 459

Query: 1305 LGFRSQVC 1328
             GF+ Q C
Sbjct: 460  FGFKQQNC 467


>ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223525662|gb|EEF28148.1|
            pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  464 bits (1193), Expect = e-128
 Identities = 239/441 (54%), Positives = 291/441 (65%), Gaps = 15/441 (3%)
 Frame = +3

Query: 51   PTFAAPPLANPWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYSIS 230
            PT    P ++PW+               +K  +TN S        K PLF R YGGYS+S
Sbjct: 34   PTITKRPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLI------KTPLFSRSYGGYSMS 87

Query: 231  LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGC 404
            L  GTP QT   +MDTGSSLVWFPCT RY C+SCNF +T       F+P+ SSS+K++GC
Sbjct: 88   LSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGC 147

Query: 405  RNPKCKWLF-ENVQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDKSVD 575
            +NPKC W+F  +VQ  C  C+  +  C Q CP YI+QY               FP+K++ 
Sbjct: 148  KNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPNKTIS 207

Query: 576  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 755
            +F+ GCS  S RQP GIAGFGR  ESLP Q+GLKKFSYCLVS RFD  PVSSDLILD   
Sbjct: 208  DFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGP 267

Query: 756  XXXXTK-----YTPFRKNPAS-SNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNG 917
                +K     YTPF+KN AS SNPAF+EYYYV LRKI VG   VK PY FLV  SDGNG
Sbjct: 268  STSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNG 327

Query: 918  GTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELP 1097
            GTIVDSG+TFTF+EG VFEL+A+EFEKQ+  +Y  A  V++ +GLRPC++ISGEK+V +P
Sbjct: 328  GTIVDSGSTFTFVEGHVFELLAKEFEKQMA-NYTVATNVQKLTGLRPCFDISGEKSVVIP 386

Query: 1098 QLTFHFKGGAKMALPLADYFSFLDEAVICMT----XXXXXXXXXXXXXPGPAIILGNYQQ 1265
             LTF FKGGAKM LPL++YF+F+D  V+C+T                  GPAIILGN+QQ
Sbjct: 387  DLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQ 446

Query: 1266 QNFYMEYDLENERLGFRSQVC 1328
            QNFY+EYDLEN+R GF+ Q C
Sbjct: 447  QNFYIEYDLENDRFGFKEQSC 467


>ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  462 bits (1188), Expect = e-127
 Identities = 243/438 (55%), Positives = 283/438 (64%), Gaps = 11/438 (2%)
 Frame = +3

Query: 51   PTFAAPPLANPWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYSIS 230
            P F   P ++PWQ               +KHR+   S          PLF   YGGYS+S
Sbjct: 41   PLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVS 93

Query: 231  LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 404
            L FGTP QT SFVMDTGSSLVWFPCT RY C+ C+F   D A    FIPK SSSAKIVGC
Sbjct: 94   LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGC 153

Query: 405  RNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDKSVD 575
             NPKC ++ ++    +C  CD NS  C + CPTY +QY              VF +++  
Sbjct: 154  LNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEP 213

Query: 576  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 755
            +FVVGCS  S RQP+GIAGFGRGP SLP QMGLKKFSYCL+SHRFD  P SS + L    
Sbjct: 214  DFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273

Query: 756  XXXXTK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 920
                 K     YTPFRKNP SSN AF+EYYYVTLR I VG  +VK PY F+VA SDGNGG
Sbjct: 274  DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 333

Query: 921  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 1100
            TIVDSG+TFTFME  VFE VA EF++Q+  +Y RAA VE  SGL+PC+N+SG  +V LP 
Sbjct: 334  TIVDSGSTFTFMEKPVFEAVATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPS 392

Query: 1101 LTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXXPGPAIILGNYQQQNFY 1277
            L F FKGGAKM LP+A+YFS + D +V+C+T              GP+IILGNYQ QNFY
Sbjct: 393  LVFQFKGGAKMELPVANYFSLVGDLSVLCLT-IVSNEAVGSTLSSGPSIILGNYQSQNFY 451

Query: 1278 MEYDLENERLGFRSQVCK 1331
             EYDLENER GFR Q CK
Sbjct: 452  TEYDLENERFGFRRQRCK 469


>emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  460 bits (1184), Expect = e-127
 Identities = 243/440 (55%), Positives = 283/440 (64%), Gaps = 11/440 (2%)
 Frame = +3

Query: 51   PTFAAPPLANPWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYSIS 230
            P F   P ++PWQ               +KHR+   S          PLF   YGGYS+S
Sbjct: 41   PLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVS 93

Query: 231  LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 404
            L FGTP QT SFVMDTGSSLVWFPCT RY C+ C+F   D A    FIPK SSSAKIVGC
Sbjct: 94   LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGC 153

Query: 405  RNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDKSVD 575
             NPKC ++ ++    +C  CD NS  C + CPTY +QY              VF +++  
Sbjct: 154  LNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEP 213

Query: 576  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 755
            +FVVGCS  S RQP+GIAGFGRGP SLP QMGLKKFSYCL+SHRFD  P SS + L    
Sbjct: 214  DFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273

Query: 756  XXXXTK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 920
                 K     YTPFRKNP SSN AF+EYYYVTLR I VG  +VK PY F+VA SDGNGG
Sbjct: 274  DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGG 333

Query: 921  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 1100
            TIVDSG+TFTFME  VFE VA EF++Q+  +Y RAA VE  SGL+PC+N+SG  +V LP 
Sbjct: 334  TIVDSGSTFTFMEKPVFEAVATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPS 392

Query: 1101 LTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXXPGPAIILGNYQQQNFY 1277
            L F FKGGAKM LP+A+YFS + D +V+C+T              GP+IILGNYQ QNFY
Sbjct: 393  LVFQFKGGAKMELPVANYFSLVGDLSVLCLT-IVSNEAVGSTLSSGPSIILGNYQSQNFY 451

Query: 1278 MEYDLENERLGFRSQVCK*C 1337
             EYDLENER GFR Q C  C
Sbjct: 452  TEYDLENERFGFRRQRCFQC 471


>ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  455 bits (1171), Expect = e-125
 Identities = 231/409 (56%), Positives = 275/409 (67%), Gaps = 10/409 (2%)
 Frame = +3

Query: 135  MKHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDR 314
            +KHR  N            P +P+ YGGYSI L  GTPPQT+ FV+DTGSSLVWFPCT R
Sbjct: 69   LKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSR 123

Query: 315  YTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFEN---VQCRECDGNSTACN 479
            Y CS CNF   DT     FIPK SS+AK++GCRNPKC ++F +    +C +C   S  C+
Sbjct: 124  YLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCS 183

Query: 480  QLCPTYILQYXXXXXXXXXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLP 659
              CP YI+QY               FP K+V  F+VGCS  SIRQP+GIAGFGRG ESLP
Sbjct: 184  LTCPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLP 243

Query: 660  AQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXT----KYTPFRKNPASSNPAFREY 827
            +QM LK+FSYCLVSHRFD  P SSDL+L              YTPFR NP+++NPAF+EY
Sbjct: 244  SQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEY 303

Query: 828  YYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVG 1007
            YY+TLRK+ VGG  VK PY FL   SDGNGGTIVDSG+TFTFME  V+ LVA+EF KQ+ 
Sbjct: 304  YYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLE 363

Query: 1008 EHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-VIC 1184
            ++Y RA   E +SGL PC+NISG KTV  P+LTF FKGGAKM  PL +YFS + +A V+C
Sbjct: 364  KNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVC 423

Query: 1185 MTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
            +T              GPAIILGNYQQQNFY+EYDLENER GF  + C+
Sbjct: 424  LT-VVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSCR 471


>ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Fragaria vesca
            subsp. vesca]
          Length = 458

 Score =  452 bits (1162), Expect = e-124
 Identities = 225/394 (57%), Positives = 274/394 (69%), Gaps = 11/394 (2%)
 Frame = +3

Query: 183  TKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANF 356
            TKVPL+PR YGGYSISL FGTPPQ ++FVMDTGSSLVWFPCT RY CS C+F   D +  
Sbjct: 70   TKVPLYPRSYGGYSISLSFGTPPQISTFVMDTGSSLVWFPCTSRYLCSRCSFPNIDPSTI 129

Query: 357  SVFIPKFSSSAKIVGCRNPKCKWLFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXX 536
              FIPK SSSA+++GC+NPKC W+F      +C  +S    Q CP+Y++QY         
Sbjct: 130  PAFIPKLSSSARLLGCKNPKCAWIFGPEVNTKCPNSS----QACPSYVIQYGSGTTAGVL 185

Query: 537  XXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDG 716
                  FPDK+V +F+VGCSF SIRQPAG+AGFGRGP+SLP QMGL KFSYCLVSHRFD 
Sbjct: 186  LSESLDFPDKTVPDFLVGCSFLSIRQPAGMAGFGRGPQSLPVQMGLSKFSYCLVSHRFDD 245

Query: 717  KPVSSDLIL--------DXXXXXXXTKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKV 872
             PVSSDL+L        D         YTPF+KNP ++N A+REYYY+ LRK+ VG   V
Sbjct: 246  TPVSSDLVLYSGSTSDGDEIDDNHDISYTPFQKNPGAANTAYREYYYLALRKVIVGKKHV 305

Query: 873  KAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGL 1052
            K PYK+LV   D NGGTIVDSG+TFTFME  VFE VAE F  Q+ E Y RA  +E  +GL
Sbjct: 306  KIPYKYLVPGEDDNGGTIVDSGSTFTFMERPVFEAVAEAFATQM-EKYTRAGDIENRTGL 364

Query: 1053 RPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXX 1229
            +PC++IS E+ V+ P+L F FKGGAKMA+PL +YF+ +  + V+C+T             
Sbjct: 365  KPCFDISKEEKVDFPELVFQFKGGAKMAMPLNNYFALVTSDGVVCLT-IVTDGVAGPGVA 423

Query: 1230 PGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
             GPA+ILGN+QQQNFY+EYDLE ER GF+ Q CK
Sbjct: 424  AGPAVILGNFQQQNFYVEYDLERERFGFKKQSCK 457


>ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  446 bits (1148), Expect = e-123
 Identities = 230/409 (56%), Positives = 269/409 (65%), Gaps = 10/409 (2%)
 Frame = +3

Query: 135  MKHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDR 314
            +KHR  N            P +P+ YGGYSI L  GTPPQT+ FV+DTGSSLVWFPCT  
Sbjct: 65   LKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSH 119

Query: 315  YTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECDG-NSTAC 476
            Y CS CNF   D      FIPK SS+AK++GCRNPKC +LF      +C +C    S  C
Sbjct: 120  YLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNC 179

Query: 477  NQLCPTYILQYXXXXXXXXXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESL 656
            +  CP+YI+QY               FP K+V  F+VGCS  SIRQP+GIAGFGRG ESL
Sbjct: 180  SLTCPSYIIQYGLGATAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESL 239

Query: 657  PAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXT----KYTPFRKNPASSNPAFRE 824
            P+QM LK+FSYCLVSHRFD  P SSDL+L              YTPFR NP S+N  FRE
Sbjct: 240  PSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNP-SNNSVFRE 298

Query: 825  YYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQV 1004
            YYYVTLRK+ VGGV VK PYKFL   SDGNGGTIVDSG+TFTFME  V+ LVA+EF +Q+
Sbjct: 299  YYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL 358

Query: 1005 GEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEAVIC 1184
            G+ Y R   VE +SGL PC+NISG KT+  P+ TF FKGGAKM+ PL +YFSF+ +A + 
Sbjct: 359  GKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVL 418

Query: 1185 MTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
                            GPAIILGNYQQQNFY+EYDLENER GF  + CK
Sbjct: 419  CFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNCK 467


>ref|XP_007027933.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|590632770|ref|XP_007027934.1|
            Eukaryotic aspartyl protease family protein, putative
            isoform 1 [Theobroma cacao]
            gi|590632774|ref|XP_007027935.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716538|gb|EOY08435.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716539|gb|EOY08436.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716540|gb|EOY08437.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao]
          Length = 472

 Score =  444 bits (1141), Expect = e-122
 Identities = 224/391 (57%), Positives = 266/391 (68%), Gaps = 10/391 (2%)
 Frame = +3

Query: 186  KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNF--ADTANFS 359
            K PLFP  YGGY+ISLG GTPPQT +F+MDTGSSL WFPCT RY CS C F   D     
Sbjct: 83   KTPLFPHSYGGYTISLGIGTPPQTLTFIMDTGSSLSWFPCTSRYICSQCAFPNVDPKKIP 142

Query: 360  VFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECDGNSTACNQLCPTYILQYXXXXXXX 530
             F PK SSS  +VGC+NPKC+WLF      +C++C+  S  C Q CP YI+QY       
Sbjct: 143  TFSPKLSSSKALVGCKNPKCRWLFGPDVESRCQDCEPASKNCTQNCPPYIIQYGLGSTGG 202

Query: 531  XXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRF 710
                   VF  K+  +F+VGCS  S RQPAGI GFGR PESLP+Q+G+KKFSYCLVS RF
Sbjct: 203  LLLVENLVFSQKTFQDFLVGCSIFSNRQPAGIVGFGRRPESLPSQLGVKKFSYCLVSRRF 262

Query: 711  DGKPVSSDLILDXXXXXXXTK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVK 875
            D   VSS+++L+        K     YTPF KN  +S+P F+E+YYVT+RKI VG   VK
Sbjct: 263  DDTGVSSNMLLETGSGSGDAKTKGLSYTPFYKNQFASHPIFQEFYYVTIRKILVGDKHVK 322

Query: 876  APYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLR 1055
             PYK+LV   DGNGGTIVDSG+TFTFME  VFELV++EFEKQ+G +Y RA  VE +SGL 
Sbjct: 323  VPYKYLVPGPDGNGGTIVDSGSTFTFMERAVFELVSKEFEKQMG-NYSRAHEVENKSGLA 381

Query: 1056 PCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEAVICMTXXXXXXXXXXXXXPG 1235
            PC NISG K++  P+L F FKGGAKMALPLA+YFSFLD  V+C+               G
Sbjct: 382  PCVNISGHKSISFPELIFQFKGGAKMALPLANYFSFLDVNVVCLMVVTDNIIGQGVSG-G 440

Query: 1236 PAIILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            PAIILGN+QQQN+Y+EYDL NE  GF  Q C
Sbjct: 441  PAIILGNFQQQNYYIEYDLANESFGFAKQSC 471


>ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica]
            gi|462397558|gb|EMJ03226.1| hypothetical protein
            PRUPE_ppa005104mg [Prunus persica]
          Length = 477

 Score =  434 bits (1116), Expect = e-119
 Identities = 227/411 (55%), Positives = 271/411 (65%), Gaps = 28/411 (6%)
 Frame = +3

Query: 183  TKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANF 356
            T+VPLFP  YG YS+SL FGTPPQT+SF+MDTGSSLVWFPCT RY CS C F +   A  
Sbjct: 69   TQVPLFPHSYGDYSVSLNFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPNINPAKI 128

Query: 357  SVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECDGNSTA-CNQLCPTYILQYXXXXX 524
              F PK SSS+KIVGC+NPKC W+F      +C  C+  S   C+Q CPTYI+QY     
Sbjct: 129  PTFKPKLSSSSKIVGCQNPKCGWIFGPEVKSKCPNCNNPSHQNCSQACPTYIIQYGSGTT 188

Query: 525  XXXXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSH 704
                      FP K V +F+VGCSF SIRQPAGIAGFGRGP+SLPAQMGL KFSYCLVSH
Sbjct: 189  AGILLSETLDFPKKIVPDFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFSYCLVSH 248

Query: 705  RFDGKPVSSDLILDXXXXXXXT---------------------KYTPFRKNPASSNPAFR 821
            RFD  P SSDL+L        +                       TPF+KNP   N AFR
Sbjct: 249  RFDDTPQSSDLVLYSSSSGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGPPNSAFR 308

Query: 822  EYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQ 1001
            EYYY+ LRK+ VG   VK PYKFLV  +D +GGTIVDSG+TFTFME  VFE VA+EFE Q
Sbjct: 309  EYYYIMLRKVIVGNKNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVAKEFEAQ 368

Query: 1002 VGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-V 1178
            +  +Y RA  +E ++GLRPC++IS EK V+ P+L F FKGGAKM LP  +YFS +  + V
Sbjct: 369  MA-NYTRAKDLENKTGLRPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMVSSSGV 427

Query: 1179 ICMTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
            +C+T              GPAIILGNYQQQ+F++EYDL++ + GFR Q CK
Sbjct: 428  VCLTIVTDGVVGPGGNG-GPAIILGNYQQQDFHVEYDLQHGKFGFRKQSCK 477


>gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]
          Length = 473

 Score =  430 bits (1105), Expect = e-118
 Identities = 216/391 (55%), Positives = 264/391 (67%), Gaps = 8/391 (2%)
 Frame = +3

Query: 183  TKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSV 362
            TK PL+PR YGGYS+SL FGTPPQ   FVMDTGSSLVWFPCT RY CS C+F ++ N   
Sbjct: 84   TKTPLYPRSYGGYSVSLRFGTPPQILQFVMDTGSSLVWFPCTSRYLCSKCSFPNSQNPPK 143

Query: 363  FIPKFSSSAKIVGCRNPKCKW-LFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXX 539
            FIPK SSS+K++GC+NPKC+  L    +C +        N+ CP YI+QY          
Sbjct: 144  FIPKKSSSSKLIGCQNPKCQLVLGATAKCDDATAGENPKNKACPAYIIQYGSGSTIGQLL 203

Query: 540  XXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGK 719
                 FP K V +F+VGCS  SIRQP+GIAGFGRG ESLP+Q+ L KFSYCLVSHRFD  
Sbjct: 204  SETLNFPGKMVPDFIVGCSVLSIRQPSGIAGFGRGKESLPSQLRLAKFSYCLVSHRFDDT 263

Query: 720  PVSSDLIL-----DXXXXXXXTKYTPFRKNPA-SSNPAFREYYYVTLRKITVGGVKVKAP 881
              SSDL+L     D         YTPF+KNP+ SS PA +EYYY+ +RK+ VG   VK P
Sbjct: 264  SFSSDLVLYSSSSDDKQPEGSISYTPFQKNPSLSSIPALKEYYYILIRKVIVGKTHVKIP 323

Query: 882  YKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPC 1061
            Y++LV  SDG+GGTIVDSGTTFT+ME  VF+ V+ EF KQ+  +Y RA  +E  +GL PC
Sbjct: 324  YRYLVPGSDGHGGTIVDSGTTFTYMEKPVFDAVSSEFAKQMA-NYTRAKGIENRTGLGPC 382

Query: 1062 YNISGEKTVELPQLTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXXPGP 1238
            ++IS EK+V  P+L   FKGGAKM LPL +YFS +     +C+T              GP
Sbjct: 383  FDISKEKSVNFPELVLQFKGGAKMNLPLTNYFSIVGSPGSVCLTVVTNDDVGGPESVGGP 442

Query: 1239 AIILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
            AIILGNYQQQNF++EYDL+NER GFR Q+CK
Sbjct: 443  AIILGNYQQQNFHIEYDLKNERFGFRRQICK 473


>ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citrus clementina]
            gi|557532142|gb|ESR43325.1| hypothetical protein
            CICLE_v10011613mg [Citrus clementina]
          Length = 483

 Score =  429 bits (1103), Expect = e-117
 Identities = 220/408 (53%), Positives = 274/408 (67%), Gaps = 11/408 (2%)
 Frame = +3

Query: 138  KHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQ-TTSFVMDTGSSLVWFPCTDR 314
            K +++NI         K PL    YGGYSISL FGTPPQ +T F+ DTGSSLVWFPCT R
Sbjct: 77   KTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR 136

Query: 315  YTCSSCNF--ADTANFSVFIPKFSSSAKIVGCRNPKCKWLF-ENVQ--CRECDGNSTACN 479
            Y C+ CNF   D +    FIPK SSS++++GC+NPKC W+F  NV+  C+ C+  +  C 
Sbjct: 137  YRCADCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCNPRNKTCP 196

Query: 480  QLCPTYILQYXXXXXXXXXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLP 659
              CP Y++QY               FP K+V NF+VGCS  S RQPAGIAGFGR  ESLP
Sbjct: 197  LACPPYLIQYGLGFTAGLLLSETLGFPSKTVPNFLVGCSILSNRQPAGIAGFGRSSESLP 256

Query: 660  AQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXTK-----YTPFRKNPASSNPAFRE 824
            +Q+GLKKFSYCL+S +FD  PVSS+L+LD       +K     YTPF KNP  S+ AF E
Sbjct: 257  SQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGSGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316

Query: 825  YYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQV 1004
            YYYV LR+I VG   VK PY +LV  SDGNGG IVDSG+T TFMEG +FE VA+EF +Q+
Sbjct: 317  YYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTLTFMEGPLFEAVAKEFIRQM 376

Query: 1005 GEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEAVIC 1184
            G +Y RAA VE++SGLRPC++ISG+K+V LP+L   FKGGAKMALPL +YF+ +   V+C
Sbjct: 377  G-NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPLENYFALVGNEVLC 435

Query: 1185 MTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            +               GPAIILG++Q QNFY+E+DL N+R GF  Q C
Sbjct: 436  LILFTDNAAGPAPGG-GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482


>ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  429 bits (1103), Expect = e-117
 Identities = 213/388 (54%), Positives = 259/388 (66%), Gaps = 7/388 (1%)
 Frame = +3

Query: 186  KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFS 359
            K PL P  YG YS  L FGTP QT   + DTGSSLVWFPCT RY CS C+F   D     
Sbjct: 70   KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIP 129

Query: 360  VFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECDGNSTACNQLCPTYILQYXXXXXXX 530
             F+PK SSS+K+VGC+NPKC W+F      QCR C+  +  C Q CP Y++QY       
Sbjct: 130  RFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAG 189

Query: 531  XXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRF 710
                    FPDK + NFVVGCSF SI QP+GIAGFGRG ESLP+QMGLKKF+YCL S +F
Sbjct: 190  LLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKF 249

Query: 711  DGKPVSSDLILDXXXXXXX-TKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYK 887
            D  P S  LILD          YTPFR+NP+ SN A++EYYY+ +RKI VG   VK PYK
Sbjct: 250  DDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYK 309

Query: 888  FLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYN 1067
            FLV   DGNGG+I+DSG+TFTFM+  V E+VA EFEKQ+  ++ RA  VE  +GLRPC++
Sbjct: 310  FLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLA-NWTRATDVETLTGLRPCFD 368

Query: 1068 ISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-VICMTXXXXXXXXXXXXXPGPAI 1244
            IS EK+V+ P+L F FKGGAK ALPL +YF+ +  + V C+T              GP++
Sbjct: 369  ISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSV 428

Query: 1245 ILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            ILG +QQQNFY+EYDL N+RLGFR Q C
Sbjct: 429  ILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris]
            gi|561036422|gb|ESW34952.1| hypothetical protein
            PHAVU_001G194500g [Phaseolus vulgaris]
          Length = 466

 Score =  429 bits (1102), Expect = e-117
 Identities = 216/389 (55%), Positives = 265/389 (68%), Gaps = 10/389 (2%)
 Frame = +3

Query: 195  LFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFI 368
            ++P+ YGGYSI L FGTPPQT+ FV+DTGSSLVWFPCT RY CS C F   D      FI
Sbjct: 77   VYPKSYGGYSIDLNFGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCLFPNIDPTKIPTFI 136

Query: 369  PKFSSSAKIVGCRNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXX 539
            PK SS+++++GC+NPKC +LF +    +C +C  +S  C+  CP YI+QY          
Sbjct: 137  PKNSSTSRLLGCKNPKCGYLFGSDLQSRCPQCKPDSQNCSLTCPPYIIQYGLGSTAGFLL 196

Query: 540  XXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGK 719
                 FP+K V  F+VGCS  SIRQP+GIAGFGRG ESLPAQM LK+FSYCL+SH FD  
Sbjct: 197  LDNLNFPEKIVPQFLVGCSILSIRQPSGIAGFGRGQESLPAQMALKRFSYCLLSHNFDDS 256

Query: 720  PVSSDLILDXXXXXXXT----KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYK 887
              +SDL+L              YTPF  NP+++NPAF EYYY++LRK+ VGG  VK P  
Sbjct: 257  TENSDLVLQISSTGDTKTNGLSYTPFHPNPSANNPAFLEYYYLSLRKVIVGGKNVKIPLS 316

Query: 888  FLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYN 1067
            FL   SDGNGGTIVDSG+TFTFME   ++LV +EF KQ+G +Y RA  VE +SGL PC+N
Sbjct: 317  FLEPGSDGNGGTIVDSGSTFTFMERPAYDLVVKEFVKQLG-NYSRAEDVEAQSGLGPCFN 375

Query: 1068 ISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-VICMTXXXXXXXXXXXXXPGPAI 1244
            ISG KTV  P+ T  FKGGAKM LP+ +YFS +D++ V+C+T              GPAI
Sbjct: 376  ISGAKTVNFPKFTLQFKGGAKMTLPVENYFSLIDDSEVVCLT-IVSDGGAGPATTSGPAI 434

Query: 1245 ILGNYQQQNFYMEYDLENERLGFRSQVCK 1331
            ILGNYQQQNF++EYDLENER GF  Q CK
Sbjct: 435  ILGNYQQQNFHIEYDLENERFGFGPQSCK 463


>ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
            nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  429 bits (1102), Expect = e-117
 Identities = 213/388 (54%), Positives = 259/388 (66%), Gaps = 7/388 (1%)
 Frame = +3

Query: 186  KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFS 359
            K PL P  YG YS  L FGTP QT   + DTGSSLVWFPCT RY CS C+F   D     
Sbjct: 70   KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIP 129

Query: 360  VFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECDGNSTACNQLCPTYILQYXXXXXXX 530
             F+PK SSS+K+VGC+NPKC W+F      QCR C+  +  C Q CP Y++QY       
Sbjct: 130  RFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAG 189

Query: 531  XXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRF 710
                    FPDK + NFVVGCSF SI QP+GIAGFGRG ESLP+QMGLKKF+YCL S +F
Sbjct: 190  LLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKF 249

Query: 711  DGKPVSSDLILDXXXXXXX-TKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYK 887
            D  P S  LILD          YTPFR+NP+ SN A++EYYY+ +RKI VG   VK PYK
Sbjct: 250  DDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYK 309

Query: 888  FLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYN 1067
            FLV   DGNGG+I+DSG+TFTFM+  V E+VA EFEKQ+  ++ RA  VE  +GLRPC++
Sbjct: 310  FLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLA-NWTRATDVETLTGLRPCFD 368

Query: 1068 ISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-VICMTXXXXXXXXXXXXXPGPAI 1244
            IS EK+V+ P+L F FKGGAK ALPL +YF+ +  + V C+T              GP++
Sbjct: 369  ISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSV 428

Query: 1245 ILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            ILG +QQQNFY+EYDL N+RLGFR Q C
Sbjct: 429  ILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis]
          Length = 483

 Score =  426 bits (1094), Expect = e-116
 Identities = 219/408 (53%), Positives = 272/408 (66%), Gaps = 11/408 (2%)
 Frame = +3

Query: 138  KHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQ-TTSFVMDTGSSLVWFPCTDR 314
            K +++NI         K PL    YGGYSISL FGTPPQ +T F+ DTGSSLVWFPCT R
Sbjct: 77   KTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR 136

Query: 315  YTCSSCNF--ADTANFSVFIPKFSSSAKIVGCRNPKCKWLF-ENVQ--CRECDGNSTACN 479
            Y C  CNF   D +    FIPK SSS++++GC+NPKC W+F  NV+  C+ C   +  C 
Sbjct: 137  YRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP 196

Query: 480  QLCPTYILQYXXXXXXXXXXXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLP 659
              CP+Y+LQY               FP K+V NF+ GCS  S RQPAGIAGFGR  ESLP
Sbjct: 197  LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLP 256

Query: 660  AQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXTK-----YTPFRKNPASSNPAFRE 824
            +Q+GLKKFSYCL+S +FD  PVSS+L+LD       +K     YTPF KNP  S+ AF E
Sbjct: 257  SQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316

Query: 825  YYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQV 1004
            +YYV LR+I VG   VK PY +LV  SDGNGG IVDSG+TFTFMEG +FE VA+EF +Q+
Sbjct: 317  FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376

Query: 1005 GEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEAVIC 1184
            G +Y RAA VE++SGLRPC++ISG+K+V LP+L   FKGGAKMALP  +YF+ +   V+C
Sbjct: 377  G-NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435

Query: 1185 MTXXXXXXXXXXXXXPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            +               GPAIILG++Q QNFY+E+DL N+R GF  Q C
Sbjct: 436  LILFTDNAAGPALGR-GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482


>ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223536957|gb|EEF38595.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 447

 Score =  422 bits (1085), Expect = e-115
 Identities = 211/387 (54%), Positives = 262/387 (67%), Gaps = 7/387 (1%)
 Frame = +3

Query: 192  PLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSVFIP 371
            P+F   YGGYSISL FGTPPQT SFVMDTGSS VWFPCT RY C++C+F  T+  S F+P
Sbjct: 68   PVFSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSF--TSRISPFLP 125

Query: 372  KFSSSAKIVGCRNPKCKWLFE-NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXX 548
            K SSS+KI+GC+NPKC W+ + +++C +CD NS  C+Q+CP Y++ Y             
Sbjct: 126  KHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSET 185

Query: 549  XVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVS 728
                   V NF+VGCS  S RQPAGIAGFGRGP SLP+Q+GL KFSYCL+SH+FD    S
Sbjct: 186  LHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQES 245

Query: 729  SDLILDXXXXXXXTK----YTPFRKNP-ASSNPAFREYYYVTLRKITVGGVKVKAPYKFL 893
            S L+LD             YTP  KNP     PAF  YYYV+LR+I++GG  VK PYK+L
Sbjct: 246  SSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYL 305

Query: 894  VADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNIS 1073
              D DGNGGTI+DSGTTFT+M  + FE+++ EF  QV ++Y RA  VE  SGL+PC+N+S
Sbjct: 306  SPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQV-KNYERALMVEALSGLKPCFNVS 364

Query: 1074 GEKTVELPQLTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXXPGPAIIL 1250
            G K +ELPQL  HFKGGA + LPL +YF+FL    V C T              GP +IL
Sbjct: 365  GAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFT----VVTDGAEKASGPGMIL 420

Query: 1251 GNYQQQNFYMEYDLENERLGFRSQVCK 1331
            GN+Q QNFY+EYDL+NERLGF+ + CK
Sbjct: 421  GNFQMQNFYVEYDLQNERLGFKKESCK 447


>ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297323705|gb|EFH54126.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  421 bits (1082), Expect = e-115
 Identities = 220/391 (56%), Positives = 263/391 (67%), Gaps = 10/391 (2%)
 Frame = +3

Query: 186  KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFS 359
            K  L P+ YGGYS+SL FGTP QT  FV DTGSSLVWFPCT RY CS CNF+  D     
Sbjct: 79   KSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIP 138

Query: 360  VFIPKFSSSAKIVGCRNPKCKWLF-ENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXX 536
             FIPK SSS++++GC+NPKC++LF  NVQCR CD N+  C   CP YILQY         
Sbjct: 139  RFIPKNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGIL 198

Query: 537  XXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDG 716
                  FPD +V +FVVGCS  S R PAGIAGFGRGPESLP+QM LK FS+CLVS RFD 
Sbjct: 199  ISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSHCLVSRRFDD 258

Query: 717  KPVSSDLILDXXXXXXX------TKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKA 878
              V++DL LD               YTPFRKNP  SN AF EYYY+ LR+I VG   VK 
Sbjct: 259  TNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKI 318

Query: 879  PYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRP 1058
            PYKFL   ++GNGG+IVDSG+TFTFME  VFELVAEEF  Q+  +Y R   +E+ SG+ P
Sbjct: 319  PYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQM-SNYTREKDLEKVSGIAP 377

Query: 1059 CYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDEA-VICMTXXXXXXXXXXXXXPG 1235
            C+NISG+  V +P+L F FKGGAKM LPL++YFSF+  A  +C+T              G
Sbjct: 378  CFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLT-VVSDNTVNPGGGTG 436

Query: 1236 PAIILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            PAIILG++QQQN+ +EYDLEN+R GF  + C
Sbjct: 437  PAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Populus trichocarpa]
            gi|550321034|gb|EEF05154.2| hypothetical protein
            POPTR_0016s07260g [Populus trichocarpa]
          Length = 454

 Score =  420 bits (1080), Expect = e-115
 Identities = 212/381 (55%), Positives = 255/381 (66%), Gaps = 11/381 (2%)
 Frame = +3

Query: 81   PWQXXXXXXXXXXXXXXXMKHRETNISFXXXXXXTKVPLFPRGYGGYSISLGFGTPPQTT 260
            PW                +K  +TN S        K PLFPR YGGYSISL FGTPPQTT
Sbjct: 43   PWGSLNHLASLSLSRAHHIKSPKTNFSLI------KTPLFPRSYGGYSISLNFGTPPQTT 96

Query: 261  SFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSV--FIPKFSSSAKIVGCRNPKCKWLFE 434
             FVMDTGSSLVWFPCT RY CS CNF +     +  F+PK SSS+K++GC+NP+C  +F 
Sbjct: 97   KFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFG 156

Query: 435  ---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXXVFPDK-SVDNFVVGCSFA 602
                 +C+ECD  +  C Q CP Y++QY               FP+K ++ +F+VGCS  
Sbjct: 157  PEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKTIPDFLVGCSIF 216

Query: 603  SIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXXTK--- 773
            SI+QP GIAGFGR PESLP+Q+GLKKFSYCLVSH FD  P SSDL+LD       TK   
Sbjct: 217  SIKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAG 276

Query: 774  --YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTF 947
              +TPF KNP +   AFR+YYYV LR I +G   VK PYKFLV  +DGNGGTIVDSGTTF
Sbjct: 277  LSHTPFLKNPTT---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTF 333

Query: 948  TFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGA 1127
            TFME  V+ELVA+EFEKQ+  HY  A  ++  +GLRPCYNISGEK++ +P L F FKGGA
Sbjct: 334  TFMENPVYELVAKEFEKQMA-HYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGA 392

Query: 1128 KMALPLADYFSFLDEAVICMT 1190
            KMALPL++YFS +D  VIC+T
Sbjct: 393  KMALPLSNYFSIVDSGVICLT 413


>ref|XP_006403798.1| hypothetical protein EUTSA_v10010339mg [Eutrema salsugineum]
            gi|557104917|gb|ESQ45251.1| hypothetical protein
            EUTSA_v10010339mg [Eutrema salsugineum]
          Length = 471

 Score =  419 bits (1078), Expect = e-114
 Identities = 214/391 (54%), Positives = 264/391 (67%), Gaps = 10/391 (2%)
 Frame = +3

Query: 186  KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFS 359
            K PL PR YGGYS+SL FGTP QT  FV DTGSSLVWFPCT RY CS CNF+  D     
Sbjct: 85   KSPLSPRSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSGCNFSGLDPNRIP 144

Query: 360  VFIPKFSSSAKIVGCRNPKCKWLF-ENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXX 536
             F+PK SSS++IVGC+NPKC  LF  N++CR CD N+  C   CP Y++QY         
Sbjct: 145  RFLPKNSSSSRIVGCQNPKCSLLFGPNLKCRGCDPNTRNCTLGCPPYVIQYGSGSTAGIL 204

Query: 537  XXXXXVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDG 716
                 VFPD +V +F+VGCS  S RQPAGIAGFGRGPESLP+QM LK+FS+CLVS RFD 
Sbjct: 205  ISDKLVFPDLTVPDFLVGCSILSTRQPAGIAGFGRGPESLPSQMNLKRFSHCLVSRRFDD 264

Query: 717  KPVSSDLILD------XXXXXXXTKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKA 878
              V++DL LD               YTPFR NP  SN AF EYYY+ LR+I VG  +VK 
Sbjct: 265  TNVTTDLDLDTGSGHKTGLKTPGLSYTPFRNNPNVSNAAFLEYYYLNLRRIFVGSKRVKI 324

Query: 879  PYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRP 1058
            PYK+L   +DGNGGTIVDSGTT TFME  +F+LVAEEF  Q+  +Y R   +E+ +G+ P
Sbjct: 325  PYKYLAPGTDGNGGTIVDSGTTLTFMEQPIFDLVAEEFATQM-SNYSREKDLEKTTGIGP 383

Query: 1059 CYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFL-DEAVICMTXXXXXXXXXXXXXPG 1235
            C+NISG+ ++ +P LTF FKGGAKM LP ++YF+F+     +C+T              G
Sbjct: 384  CFNISGKGSLTVPDLTFEFKGGAKMKLPTSNYFAFVKSNDNVCLT-----VVSADAGGSG 438

Query: 1236 PAIILGNYQQQNFYMEYDLENERLGFRSQVC 1328
            PAIILG++QQQN+++EYDLEN+R GF  + C
Sbjct: 439  PAIILGSFQQQNYHVEYDLENDRFGFAQKKC 469

