BLASTX nr result

ID: Mentha28_contig00013718 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00013718
         (1389 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus...   571   e-160
ref|XP_002309394.1| aspartyl protease family protein [Populus tr...   472   e-130
ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2...   468   e-129
emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]   466   e-128
ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223...   462   e-127
ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2...   454   e-125
ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2...   451   e-124
ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2...   443   e-122
ref|XP_007027933.1| Eukaryotic aspartyl protease family protein,...   440   e-121
ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Popu...   437   e-120
ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prun...   433   e-119
ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phas...   429   e-117
emb|CBI30372.3| unnamed protein product [Vitis vinifera]              427   e-117
ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2...   424   e-116
ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   423   e-116
gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]    422   e-115
ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor,...   421   e-115
ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citr...   417   e-114
ref|XP_007145803.1| hypothetical protein PHAVU_007G269300g [Phas...   414   e-113
ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1...   414   e-113

>gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus guttatus]
          Length = 462

 Score =  571 bits (1471), Expect = e-160
 Identities = 286/435 (65%), Positives = 329/435 (75%), Gaps = 19/435 (4%)
 Frame = -1

Query: 1338 PTFAAPP--LANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYS 1165
            PT A+PP  LANPWQRL HL++ASSTRAH LKH  T+ S      ATK PLFPRGYGGYS
Sbjct: 31   PTTASPPPPLANPWQRLNHLSAASSTRAHLLKHPNTSTS---AAAATKAPLFPRGYGGYS 87

Query: 1164 ISLGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIV 991
            ISL FGTPPQT  FVMDTGSSLVWFPCT RY C+SCNF +   +N S+F+PK SSS+ I+
Sbjct: 88   ISLSFGTPPQTLPFVMDTGSSLVWFPCTQRYACNSCNFVNVNPSNISIFLPKSSSSSMII 147

Query: 990  GCRNPKCKWLFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDN 811
            GC+NPKC+W+F +VQC+ CD NST C + CP YI+QY             L FP+KSV+N
Sbjct: 148  GCKNPKCRWIFPDVQCKNCDQNSTTCKEFCPPYIIQYGSGSTTGLLLSETLFFPEKSVEN 207

Query: 810  FVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL----D 643
            F VGCS  S RQPAGIAGFGRGPESLPAQMGLK+FSYCLVSHRFD +PVSSDL+      
Sbjct: 208  FFVGCSIFSSRQPAGIAGFGRGPESLPAQMGLKRFSYCLVSHRFDDEPVSSDLVFVGGGG 267

Query: 642  XXXXXXATKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTI 463
                    +YTPFRKNP S+NPAF++YYYVTLRKITVGGV VKAPY+FLVAD+ G+GGTI
Sbjct: 268  AAGAAAGVEYTPFRKNPKSANPAFQDYYYVTLRKITVGGVHVKAPYEFLVADAAGDGGTI 327

Query: 462  VDSGTTFTFMEGKVFELVAEEFEKQVG-EHYRRAAAVEEESGLRPCYNISGEKTVELPQL 286
            VDSGTTFTFME +VFE VAEEFEKQVG  +Y RA  VE+ SGLRPC+N+SGE +V LP+L
Sbjct: 328  VDSGTTFTFMESRVFEPVAEEFEKQVGRRNYSRAREVEDRSGLRPCFNVSGEGSVSLPEL 387

Query: 285  TFHFKGGAKMALPLADYFSFLDEAV----------XXXXXXXXXXXXXXNYQQQNFYMEY 136
            +FHFKGGA+M LPLADYFSFLD++V                        NYQQQNFYMEY
Sbjct: 388  SFHFKGGAEMVLPLADYFSFLDDSVICMTVVTNNSTREGIGPGPAIILGNYQQQNFYMEY 447

Query: 135  DLENERLGFRSQVCK 91
            DLENERLGF+ Q+CK
Sbjct: 448  DLENERLGFKRQLCK 462


>ref|XP_002309394.1| aspartyl protease family protein [Populus trichocarpa]
            gi|222855370|gb|EEE92917.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 469

 Score =  472 bits (1214), Expect = e-130
 Identities = 244/427 (57%), Positives = 285/427 (66%), Gaps = 21/427 (4%)
 Frame = -1

Query: 1311 NPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQT 1132
            NPW  L HLAS S +RAHH+K  +T  S        K PLFPR YGGYSISL FGTPPQT
Sbjct: 51   NPWGALNHLASLSLSRAHHIKSPKTKFSLL------KTPLFPRSYGGYSISLNFGTPPQT 104

Query: 1131 TSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLF 958
            T FVMDTGSSLVWFPCT RY CS C+F   +      FIPK SSS+ ++GC+N KC WLF
Sbjct: 105  TKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLF 164

Query: 957  E---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDK-SVDNFVVGCSF 790
                  +C+ECD  +  C Q CP Y++QY             L FP K ++  F+VGCS 
Sbjct: 165  GPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSL 224

Query: 789  ASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-- 616
             SIRQP GIAGFGR PESLP+Q+GLKKFSYCLVSH FD  P SSDL+LD       TK  
Sbjct: 225  FSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTP 284

Query: 615  ---YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTT 445
               YTPF+KNP +   AFR+YYYV LR I +G   VK PYKFLV  SDGNGGTIVDSGTT
Sbjct: 285  GLSYTPFQKNPTA---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTT 341

Query: 444  FTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGG 265
            FTFME  V+ELVA+EFEKQV  HY  A  V+ ++GLRPC+NISGEK+V +P+  FHFKGG
Sbjct: 342  FTFMEKPVYELVAKEFEKQVA-HYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGG 400

Query: 264  AKMALPLADYFSFLDEAV----------XXXXXXXXXXXXXXNYQQQNFYMEYDLENERL 115
            AKMALPLA+YFSF+D  V                        NYQQ+NF++E+DL+NER 
Sbjct: 401  AKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERF 460

Query: 114  GFRSQVC 94
            GF+ Q C
Sbjct: 461  GFKQQNC 467


>ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  468 bits (1205), Expect = e-129
 Identities = 249/438 (56%), Positives = 286/438 (65%), Gaps = 22/438 (5%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            P F   P ++PWQ L+HL SAS TRAHHLKHR+   S          PLF   YGGYS+S
Sbjct: 41   PLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVS 93

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 985
            L FGTP QT SFVMDTGSSLVWFPCT RY C+ C+F   D A    FIPK SSSAKIVGC
Sbjct: 94   LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGC 153

Query: 984  RNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
             NPKC ++ ++    +C  CD NS  C + CPTY +QY             LVF +++  
Sbjct: 154  LNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEP 213

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
            +FVVGCS  S RQP+GIAGFGRGP SLP QMGLKKFSYCL+SHRFD  P SS + L    
Sbjct: 214  DFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273

Query: 633  XXXATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 469
                 K     YTPFRKNP SSN AF+EYYYVTLR I VG  +VK PY F+VA SDGNGG
Sbjct: 274  DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 333

Query: 468  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 289
            TIVDSG+TFTFME  VFE VA EF++Q+  +Y RAA VE  SGL+PC+N+SG  +V LP 
Sbjct: 334  TIVDSGSTFTFMEKPVFEAVATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPS 392

Query: 288  LTFHFKGGAKMALPLADYFSFL------------DEAVXXXXXXXXXXXXXXNYQQQNFY 145
            L F FKGGAKM LP+A+YFS +            +EAV              NYQ QNFY
Sbjct: 393  LVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAV-GSTLSSGPSIILGNYQSQNFY 451

Query: 144  MEYDLENERLGFRSQVCK 91
             EYDLENER GFR Q CK
Sbjct: 452  TEYDLENERFGFRRQRCK 469


>emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  466 bits (1199), Expect = e-128
 Identities = 248/437 (56%), Positives = 285/437 (65%), Gaps = 22/437 (5%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            P F   P ++PWQ L+HL SAS TRAHHLKHR+   S          PLF   YGGYS+S
Sbjct: 41   PLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVS 93

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 985
            L FGTP QT SFVMDTGSSLVWFPCT RY C+ C+F   D A    FIPK SSSAKIVGC
Sbjct: 94   LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGC 153

Query: 984  RNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
             NPKC ++ ++    +C  CD NS  C + CPTY +QY             LVF +++  
Sbjct: 154  LNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEP 213

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
            +FVVGCS  S RQP+GIAGFGRGP SLP QMGLKKFSYCL+SHRFD  P SS + L    
Sbjct: 214  DFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273

Query: 633  XXXATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 469
                 K     YTPFRKNP SSN AF+EYYYVTLR I VG  +VK PY F+VA SDGNGG
Sbjct: 274  DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGG 333

Query: 468  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 289
            TIVDSG+TFTFME  VFE VA EF++Q+  +Y RAA VE  SGL+PC+N+SG  +V LP 
Sbjct: 334  TIVDSGSTFTFMEKPVFEAVATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPS 392

Query: 288  LTFHFKGGAKMALPLADYFSFL------------DEAVXXXXXXXXXXXXXXNYQQQNFY 145
            L F FKGGAKM LP+A+YFS +            +EAV              NYQ QNFY
Sbjct: 393  LVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAV-GSTLSSGPSIILGNYQSQNFY 451

Query: 144  MEYDLENERLGFRSQVC 94
             EYDLENER GFR Q C
Sbjct: 452  TEYDLENERFGFRRQRC 468


>ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223525662|gb|EEF28148.1|
            pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  462 bits (1190), Expect = e-127
 Identities = 240/441 (54%), Positives = 292/441 (66%), Gaps = 26/441 (5%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            PT    P ++PW+ L HLA+ S +RAHHLK  +TN S        K PLF R YGGYS+S
Sbjct: 34   PTITKRPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLI------KTPLFSRSYGGYSMS 87

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGC 985
            L  GTP QT   +MDTGSSLVWFPCT RY C+SCNF +T       F+P+ SSS+K++GC
Sbjct: 88   LSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGC 147

Query: 984  RNPKCKWLF-ENVQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
            +NPKC W+F  +VQ  C  C+  +  C Q CP YI+QY             + FP+K++ 
Sbjct: 148  KNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPNKTIS 207

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
            +F+ GCS  S RQP GIAGFGR  ESLP Q+GLKKFSYCLVS RFD  PVSSDLILD   
Sbjct: 208  DFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGP 267

Query: 633  XXXATK-----YTPFRKNPAS-SNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNG 472
                +K     YTPF+KN AS SNPAF+EYYYV LRKI VG   VK PY FLV  SDGNG
Sbjct: 268  STSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNG 327

Query: 471  GTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELP 292
            GTIVDSG+TFTF+EG VFEL+A+EFEKQ+  +Y  A  V++ +GLRPC++ISGEK+V +P
Sbjct: 328  GTIVDSGSTFTFVEGHVFELLAKEFEKQMA-NYTVATNVQKLTGLRPCFDISGEKSVVIP 386

Query: 291  QLTFHFKGGAKMALPLADYFSFLDEAV---------------XXXXXXXXXXXXXXNYQQ 157
             LTF FKGGAKM LPL++YF+F+D  V                             N+QQ
Sbjct: 387  DLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQ 446

Query: 156  QNFYMEYDLENERLGFRSQVC 94
            QNFY+EYDLEN+R GF+ Q C
Sbjct: 447  QNFYIEYDLENDRFGFKEQSC 467


>ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  454 bits (1168), Expect = e-125
 Identities = 234/428 (54%), Positives = 278/428 (64%), Gaps = 20/428 (4%)
 Frame = -1

Query: 1314 ANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQ 1135
            ++P+  L   ASAS TRAHHLKHR  N            P +P+ YGGYSI L  GTPPQ
Sbjct: 49   SDPFHSLKFAASASLTRAHHLKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQ 103

Query: 1134 TTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWL 961
            T+ FV+DTGSSLVWFPCT RY CS CNF   DT     FIPK SS+AK++GCRNPKC ++
Sbjct: 104  TSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYI 163

Query: 960  FEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSF 790
            F +    +C +C   S  C+  CP YI+QY             L FP K+V  F+VGCS 
Sbjct: 164  FGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSI 223

Query: 789  ASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT--- 619
             SIRQP+GIAGFGRG ESLP+QM LK+FSYCLVSHRFD  P SSDL+L            
Sbjct: 224  LSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNG 283

Query: 618  -KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTF 442
              YTPFR NP+++NPAF+EYYY+TLRK+ VGG  VK PY FL   SDGNGGTIVDSG+TF
Sbjct: 284  LSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTF 343

Query: 441  TFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGA 262
            TFME  V+ LVA+EF KQ+ ++Y RA   E +SGL PC+NISG KTV  P+LTF FKGGA
Sbjct: 344  TFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGA 403

Query: 261  KMALPLADYFSFLDEAV-----------XXXXXXXXXXXXXXNYQQQNFYMEYDLENERL 115
            KM  PL +YFS + +A                          NYQQQNFY+EYDLENER 
Sbjct: 404  KMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERF 463

Query: 114  GFRSQVCK 91
            GF  + C+
Sbjct: 464  GFGPRSCR 471


>ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Fragaria vesca
            subsp. vesca]
          Length = 458

 Score =  451 bits (1160), Expect = e-124
 Identities = 236/437 (54%), Positives = 287/437 (65%), Gaps = 21/437 (4%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            P    P  ++P Q L  L+SAS +RAHHLK  + N S      ATKVPL+PR YGGYSIS
Sbjct: 32   PLAKHPSSSDPIQTLNLLSSASLSRAHHLKRPKHNSS------ATKVPLYPRSYGGYSIS 85

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 985
            L FGTPPQ ++FVMDTGSSLVWFPCT RY CS C+F   D +    FIPK SSSA+++GC
Sbjct: 86   LSFGTPPQISTFVMDTGSSLVWFPCTSRYLCSRCSFPNIDPSTIPAFIPKLSSSARLLGC 145

Query: 984  RNPKCKWLFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFV 805
            +NPKC W+F      +C  +S    Q CP+Y++QY             L FPDK+V +F+
Sbjct: 146  KNPKCAWIFGPEVNTKCPNSS----QACPSYVIQYGSGTTAGVLLSESLDFPDKTVPDFL 201

Query: 804  VGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL------- 646
            VGCSF SIRQPAG+AGFGRGP+SLP QMGL KFSYCLVSHRFD  PVSSDL+L       
Sbjct: 202  VGCSFLSIRQPAGMAGFGRGPQSLPVQMGLSKFSYCLVSHRFDDTPVSSDLVLYSGSTSD 261

Query: 645  -DXXXXXXATKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 469
             D         YTPF+KNP ++N A+REYYY+ LRK+ VG   VK PYK+LV   D NGG
Sbjct: 262  GDEIDDNHDISYTPFQKNPGAANTAYREYYYLALRKVIVGKKHVKIPYKYLVPGEDDNGG 321

Query: 468  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 289
            TIVDSG+TFTFME  VFE VAE F  Q+ E Y RA  +E  +GL+PC++IS E+ V+ P+
Sbjct: 322  TIVDSGSTFTFMERPVFEAVAEAFATQM-EKYTRAGDIENRTGLKPCFDISKEEKVDFPE 380

Query: 288  LTFHFKGGAKMALPLADYF-----------SFLDEAVXXXXXXXXXXXXXXNYQQQNFYM 142
            L F FKGGAKMA+PL +YF           + + + V              N+QQQNFY+
Sbjct: 381  LVFQFKGGAKMAMPLNNYFALVTSDGVVCLTIVTDGVAGPGVAAGPAVILGNFQQQNFYV 440

Query: 141  EYDLENERLGFRSQVCK 91
            EYDLE ER GF+ Q CK
Sbjct: 441  EYDLERERFGFKKQSCK 457


>ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  443 bits (1140), Expect = e-122
 Identities = 234/429 (54%), Positives = 276/429 (64%), Gaps = 21/429 (4%)
 Frame = -1

Query: 1314 ANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQ 1135
            ++P+  +   AS+S TRAHHLKHR  N            P +P+ YGGYSI L  GTPPQ
Sbjct: 45   SDPFHSVKLAASSSLTRAHHLKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQ 99

Query: 1134 TTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWL 961
            T+ FV+DTGSSLVWFPCT  Y CS CNF   D      FIPK SS+AK++GCRNPKC +L
Sbjct: 100  TSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYL 159

Query: 960  FE---NVQCRECDG-NSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCS 793
            F      +C +C    S  C+  CP+YI+QY             L FP K+V  F+VGCS
Sbjct: 160  FGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFPGKTVPQFLVGCS 219

Query: 792  FASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT-- 619
              SIRQP+GIAGFGRG ESLP+QM LK+FSYCLVSHRFD  P SSDL+L           
Sbjct: 220  ILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTN 279

Query: 618  --KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTT 445
               YTPFR NP S+N  FREYYYVTLRK+ VGGV VK PYKFL   SDGNGGTIVDSG+T
Sbjct: 280  GLSYTPFRSNP-SNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGST 338

Query: 444  FTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGG 265
            FTFME  V+ LVA+EF +Q+G+ Y R   VE +SGL PC+NISG KT+  P+ TF FKGG
Sbjct: 339  FTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGG 398

Query: 264  AKMALPLADYFSFLDEAV-----------XXXXXXXXXXXXXXNYQQQNFYMEYDLENER 118
            AKM+ PL +YFSF+ +A                          NYQQQNFY+EYDLENER
Sbjct: 399  AKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENER 458

Query: 117  LGFRSQVCK 91
             GF  + CK
Sbjct: 459  FGFGPRNCK 467


>ref|XP_007027933.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|590632770|ref|XP_007027934.1|
            Eukaryotic aspartyl protease family protein, putative
            isoform 1 [Theobroma cacao]
            gi|590632774|ref|XP_007027935.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716538|gb|EOY08435.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716539|gb|EOY08436.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao] gi|508716540|gb|EOY08437.1| Eukaryotic aspartyl
            protease family protein, putative isoform 1 [Theobroma
            cacao]
          Length = 472

 Score =  440 bits (1132), Expect = e-121
 Identities = 231/435 (53%), Positives = 281/435 (64%), Gaps = 22/435 (5%)
 Frame = -1

Query: 1332 FAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXAT--KVPLFPRGYGGYSIS 1159
            F  PP  + +Q L +LA++S +RAHHLK     I       ++  K PLFP  YGGY+IS
Sbjct: 38   FPHPPSFDAYQILNNLATSSVSRAHHLKQPTHKIKAKANTTSSLLKTPLFPHSYGGYTIS 97

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGC 985
            LG GTPPQT +F+MDTGSSL WFPCT RY CS C F   D      F PK SSS  +VGC
Sbjct: 98   LGIGTPPQTLTFIMDTGSSLSWFPCTSRYICSQCAFPNVDPKKIPTFSPKLSSSKALVGC 157

Query: 984  RNPKCKWLFE---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
            +NPKC+WLF      +C++C+  S  C Q CP YI+QY             LVF  K+  
Sbjct: 158  KNPKCRWLFGPDVESRCQDCEPASKNCTQNCPPYIIQYGLGSTGGLLLVENLVFSQKTFQ 217

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
            +F+VGCS  S RQPAGI GFGR PESLP+Q+G+KKFSYCLVS RFD   VSS+++L+   
Sbjct: 218  DFLVGCSIFSNRQPAGIVGFGRRPESLPSQLGVKKFSYCLVSRRFDDTGVSSNMLLETGS 277

Query: 633  XXXATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGG 469
                 K     YTPF KN  +S+P F+E+YYVT+RKI VG   VK PYK+LV   DGNGG
Sbjct: 278  GSGDAKTKGLSYTPFYKNQFASHPIFQEFYYVTIRKILVGDKHVKVPYKYLVPGPDGNGG 337

Query: 468  TIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQ 289
            TIVDSG+TFTFME  VFELV++EFEKQ+G +Y RA  VE +SGL PC NISG K++  P+
Sbjct: 338  TIVDSGSTFTFMERAVFELVSKEFEKQMG-NYSRAHEVENKSGLAPCVNISGHKSISFPE 396

Query: 288  LTFHFKGGAKMALPLADYFSFLD----------EAVXXXXXXXXXXXXXXNYQQQNFYME 139
            L F FKGGAKMALPLA+YFSFLD          + +              N+QQQN+Y+E
Sbjct: 397  LIFQFKGGAKMALPLANYFSFLDVNVVCLMVVTDNIIGQGVSGGPAIILGNFQQQNYYIE 456

Query: 138  YDLENERLGFRSQVC 94
            YDL NE  GF  Q C
Sbjct: 457  YDLANESFGFAKQSC 471


>ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Populus trichocarpa]
            gi|550321034|gb|EEF05154.2| hypothetical protein
            POPTR_0016s07260g [Populus trichocarpa]
          Length = 454

 Score =  437 bits (1124), Expect = e-120
 Identities = 220/377 (58%), Positives = 263/377 (69%), Gaps = 11/377 (2%)
 Frame = -1

Query: 1308 PWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTT 1129
            PW  L HLAS S +RAHH+K  +TN S        K PLFPR YGGYSISL FGTPPQTT
Sbjct: 43   PWGSLNHLASLSLSRAHHIKSPKTNFSLI------KTPLFPRSYGGYSISLNFGTPPQTT 96

Query: 1128 SFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSV--FIPKFSSSAKIVGCRNPKCKWLFE 955
             FVMDTGSSLVWFPCT RY CS CNF +     +  F+PK SSS+K++GC+NP+C  +F 
Sbjct: 97   KFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFG 156

Query: 954  ---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDK-SVDNFVVGCSFA 787
                 +C+ECD  +  C Q CP Y++QY             L FP+K ++ +F+VGCS  
Sbjct: 157  PEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKTIPDFLVGCSIF 216

Query: 786  SIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK--- 616
            SI+QP GIAGFGR PESLP+Q+GLKKFSYCLVSH FD  P SSDL+LD       TK   
Sbjct: 217  SIKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAG 276

Query: 615  --YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTF 442
              +TPF KNP +   AFR+YYYV LR I +G   VK PYKFLV  +DGNGGTIVDSGTTF
Sbjct: 277  LSHTPFLKNPTT---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTF 333

Query: 441  TFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGA 262
            TFME  V+ELVA+EFEKQ+  HY  A  ++  +GLRPCYNISGEK++ +P L F FKGGA
Sbjct: 334  TFMENPVYELVAKEFEKQMA-HYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGA 392

Query: 261  KMALPLADYFSFLDEAV 211
            KMALPL++YFS +D  V
Sbjct: 393  KMALPLSNYFSIVDSGV 409


>ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica]
            gi|462397558|gb|EMJ03226.1| hypothetical protein
            PRUPE_ppa005104mg [Prunus persica]
          Length = 477

 Score =  433 bits (1114), Expect = e-119
 Identities = 236/453 (52%), Positives = 286/453 (63%), Gaps = 39/453 (8%)
 Frame = -1

Query: 1332 FAAPPLANPWQRLAHLASASSTRAHHLKH-RETNISFXXXXXATKVPLFPRGYGGYSISL 1156
            F   P ++P Q L+  ASAS +RAHH+K+ R+ N S       T+VPLFP  YG YS+SL
Sbjct: 32   FPNHPSSDPLQALSFHASASISRAHHIKNSRKPNSSL------TQVPLFPHSYGDYSVSL 85

Query: 1155 GFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGCR 982
             FGTPPQT+SF+MDTGSSLVWFPCT RY CS C F +   A    F PK SSS+KIVGC+
Sbjct: 86   NFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPNINPAKIPTFKPKLSSSSKIVGCQ 145

Query: 981  NPKCKWLFE---NVQCRECDGNSTA-CNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
            NPKC W+F      +C  C+  S   C+Q CPTYI+QY             L FP K V 
Sbjct: 146  NPKCGWIFGPEVKSKCPNCNNPSHQNCSQACPTYIIQYGSGTTAGILLSETLDFPKKIVP 205

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
            +F+VGCSF SIRQPAGIAGFGRGP+SLPAQMGL KFSYCLVSHRFD  P SSDL+L    
Sbjct: 206  DFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFSYCLVSHRFDDTPQSSDLVLYSSS 265

Query: 633  XXXAT---------------------KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKV 517
               ++                       TPF+KNP   N AFREYYY+ LRK+ VG   V
Sbjct: 266  SGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGPPNSAFREYYYIMLRKVIVGNKNV 325

Query: 516  KAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGL 337
            K PYKFLV  +D +GGTIVDSG+TFTFME  VFE VA+EFE Q+  +Y RA  +E ++GL
Sbjct: 326  KIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVAKEFEAQMA-NYTRAKDLENKTGL 384

Query: 336  RPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLD-----------EAVXXXXXXX 190
            RPC++IS EK V+ P+L F FKGGAKM LP  +YFS +            + V       
Sbjct: 385  RPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMVSSSGVVCLTIVTDGVVGPGGNG 444

Query: 189  XXXXXXXNYQQQNFYMEYDLENERLGFRSQVCK 91
                   NYQQQ+F++EYDL++ + GFR Q CK
Sbjct: 445  GPAIILGNYQQQDFHVEYDLQHGKFGFRKQSCK 477


>ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris]
            gi|561036422|gb|ESW34952.1| hypothetical protein
            PHAVU_001G194500g [Phaseolus vulgaris]
          Length = 466

 Score =  429 bits (1102), Expect = e-117
 Identities = 226/436 (51%), Positives = 276/436 (63%), Gaps = 20/436 (4%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            P    P  ++P+  L   ASAS TRAHHLKHR    S      A    ++P+ YGGYSI 
Sbjct: 35   PLLTKPQSSDPFHSLKLAASASLTRAHHLKHRLNAPS------AATTQVYPKSYGGYSID 88

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 985
            L FGTPPQT+ FV+DTGSSLVWFPCT RY CS C F   D      FIPK SS+++++GC
Sbjct: 89   LNFGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCLFPNIDPTKIPTFIPKNSSTSRLLGC 148

Query: 984  RNPKCKWLFEN---VQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVD 814
            +NPKC +LF +    +C +C  +S  C+  CP YI+QY             L FP+K V 
Sbjct: 149  KNPKCGYLFGSDLQSRCPQCKPDSQNCSLTCPPYIIQYGLGSTAGFLLLDNLNFPEKIVP 208

Query: 813  NFVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXX 634
             F+VGCS  SIRQP+GIAGFGRG ESLPAQM LK+FSYCL+SH FD    +SDL+L    
Sbjct: 209  QFLVGCSILSIRQPSGIAGFGRGQESLPAQMALKRFSYCLLSHNFDDSTENSDLVLQISS 268

Query: 633  XXXAT----KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGT 466
                      YTPF  NP+++NPAF EYYY++LRK+ VGG  VK P  FL   SDGNGGT
Sbjct: 269  TGDTKTNGLSYTPFHPNPSANNPAFLEYYYLSLRKVIVGGKNVKIPLSFLEPGSDGNGGT 328

Query: 465  IVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQL 286
            IVDSG+TFTFME   ++LV +EF KQ+G +Y RA  VE +SGL PC+NISG KTV  P+ 
Sbjct: 329  IVDSGSTFTFMERPAYDLVVKEFVKQLG-NYSRAEDVEAQSGLGPCFNISGAKTVNFPKF 387

Query: 285  TFHFKGGAKMALPLADYFSFLDEAV-----------XXXXXXXXXXXXXXNYQQQNFYME 139
            T  FKGGAKM LP+ +YFS +D++                          NYQQQNF++E
Sbjct: 388  TLQFKGGAKMTLPVENYFSLIDDSEVVCLTIVSDGGAGPATTSGPAIILGNYQQQNFHIE 447

Query: 138  YDLENERLGFRSQVCK 91
            YDLENER GF  Q CK
Sbjct: 448  YDLENERFGFGPQSCK 463


>emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score =  427 bits (1099), Expect = e-117
 Identities = 222/381 (58%), Positives = 258/381 (67%), Gaps = 7/381 (1%)
 Frame = -1

Query: 1338 PTFAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSIS 1159
            P F   P ++PWQ L+HL SAS TRAHHLKHR+   S          PLF   YGGYS+S
Sbjct: 57   PLFTKNPSSDPWQLLSHLTSASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVS 109

Query: 1158 LGFGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGC 985
            L FGTP QT SFVMDTGSSLVWFPCT RY C+ C+F   D A    FIPK SSSAKIVGC
Sbjct: 110  LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGC 169

Query: 984  RNPKCKWLFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFV 805
             NPKC ++ ++        NS  C + CPTY +QY             LVF +++  +FV
Sbjct: 170  LNPKCGFVMDSE-------NSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFV 222

Query: 804  VGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXX 625
            VGCS  S RQP+GIAGFGRGP SLP QMGLKKFSYCL+SHRFD  P SS + L       
Sbjct: 223  VGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSK 282

Query: 624  ATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIV 460
              K     YTPFRKNP SSN AF+EYYYVTLR I VG  +VK PY F+VA SDGNGGTIV
Sbjct: 283  DDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIV 342

Query: 459  DSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTF 280
            DSG+TFTFME  VFE VA EF++Q+  +Y RAA VE  SGL+PC+N+SG  +V LP L F
Sbjct: 343  DSGSTFTFMEKPVFEAVATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPSLVF 401

Query: 279  HFKGGAKMALPLADYFSFLDE 217
             FKGGAKM LP+A+YFS + +
Sbjct: 402  QFKGGAKMELPVANYFSLVGD 422


>ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  424 bits (1089), Expect = e-116
 Identities = 220/424 (51%), Positives = 269/424 (63%), Gaps = 18/424 (4%)
 Frame = -1

Query: 1311 NPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQT 1132
            +P Q L  LAS+S TRAH +K  ++N  F       K PL P  YG YS  L FGTP QT
Sbjct: 41   DPLQALTFLASSSQTRAHQIKTPKSNSVF-------KSPLSPHSYGAYSTPLSFGTPQQT 93

Query: 1131 TSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLF 958
               + DTGSSLVWFPCT RY CS C+F   D      F+PK SSS+K+VGC+NPKC W+F
Sbjct: 94   LHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIF 153

Query: 957  E---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFA 787
                  QCR C+  +  C Q CP Y++QY             L FPDK + NFVVGCSF 
Sbjct: 154  GPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFL 213

Query: 786  SIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXA-TKYT 610
            SI QP+GIAGFGRG ESLP+QMGLKKF+YCL S +FD  P S  LILD      +   YT
Sbjct: 214  SIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYT 273

Query: 609  PFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFME 430
            PFR+NP+ SN A++EYYY+ +RKI VG   VK PYKFLV   DGNGG+I+DSG+TFTFM+
Sbjct: 274  PFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMD 333

Query: 429  GKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMAL 250
              V E+VA EFEKQ+  ++ RA  VE  +GLRPC++IS EK+V+ P+L F FKGGAK AL
Sbjct: 334  KPVLEVVAREFEKQLA-NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWAL 392

Query: 249  PLADYFSFLDEAVXXXXXXXXXXXXXXN------------YQQQNFYMEYDLENERLGFR 106
            PL +YF+ +  +                            +QQQNFY+EYDL N+RLGFR
Sbjct: 393  PLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFR 452

Query: 105  SQVC 94
             Q C
Sbjct: 453  QQTC 456


>ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
            nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  423 bits (1088), Expect = e-116
 Identities = 220/424 (51%), Positives = 269/424 (63%), Gaps = 18/424 (4%)
 Frame = -1

Query: 1311 NPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQT 1132
            +P Q L  LAS+S TRAH +K  ++N  F       K PL P  YG YS  L FGTP QT
Sbjct: 41   DPLQALTFLASSSQTRAHQIKTPKSNSVF-------KSPLSPHSYGAYSTPLSFGTPQQT 93

Query: 1131 TSFVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLF 958
               + DTGSSLVWFPCT RY CS C+F   D      F+PK SSS+K+VGC+NPKC W+F
Sbjct: 94   LHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIF 153

Query: 957  E---NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFA 787
                  QCR C+  +  C Q CP Y++QY             L FPDK + NFVVGCSF 
Sbjct: 154  GPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFL 213

Query: 786  SIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXA-TKYT 610
            SI QP+GIAGFGRG ESLP+QMGLKKF+YCL S +FD  P S  LILD      +   YT
Sbjct: 214  SIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYT 273

Query: 609  PFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFME 430
            PFR+NP+ SN A++EYYY+ +RKI VG   VK PYKFLV   DGNGG+I+DSG+TFTFM+
Sbjct: 274  PFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMD 333

Query: 429  GKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMAL 250
              V E+VA EFEKQ+  ++ RA  VE  +GLRPC++IS EK+V+ P+L F FKGGAK AL
Sbjct: 334  KPVLEVVAREFEKQLA-NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWAL 392

Query: 249  PLADYFSFLDEAVXXXXXXXXXXXXXXN------------YQQQNFYMEYDLENERLGFR 106
            PL +YF+ +  +                            +QQQNFY+EYDL N+RLGFR
Sbjct: 393  PLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFR 452

Query: 105  SQVC 94
             Q C
Sbjct: 453  QQTC 456


>gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]
          Length = 473

 Score =  422 bits (1084), Expect = e-115
 Identities = 222/432 (51%), Positives = 278/432 (64%), Gaps = 24/432 (5%)
 Frame = -1

Query: 1314 ANPWQRLAHLASASSTRAHHLK-----HRETNISFXXXXXATKVPLFPRGYGGYSISLGF 1150
            ++P Q +  LASAS +RAH LK     +  ++ S       TK PL+PR YGGYS+SL F
Sbjct: 43   SDPLQTITSLASASLSRAHALKRPKSVNSSSSSSSTDSKYQTKTPLYPRSYGGYSVSLRF 102

Query: 1149 GTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPKC 970
            GTPPQ   FVMDTGSSLVWFPCT RY CS C+F ++ N   FIPK SSS+K++GC+NPKC
Sbjct: 103  GTPPQILQFVMDTGSSLVWFPCTSRYLCSKCSFPNSQNPPKFIPKKSSSSKLIGCQNPKC 162

Query: 969  KW-LFENVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCS 793
            +  L    +C +        N+ CP YI+QY             L FP K V +F+VGCS
Sbjct: 163  QLVLGATAKCDDATAGENPKNKACPAYIIQYGSGSTIGQLLSETLNFPGKMVPDFIVGCS 222

Query: 792  FASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL-----DXXXXX 628
              SIRQP+GIAGFGRG ESLP+Q+ L KFSYCLVSHRFD    SSDL+L     D     
Sbjct: 223  VLSIRQPSGIAGFGRGKESLPSQLRLAKFSYCLVSHRFDDTSFSSDLVLYSSSSDDKQPE 282

Query: 627  XATKYTPFRKNPA-SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSG 451
             +  YTPF+KNP+ SS PA +EYYY+ +RK+ VG   VK PY++LV  SDG+GGTIVDSG
Sbjct: 283  GSISYTPFQKNPSLSSIPALKEYYYILIRKVIVGKTHVKIPYRYLVPGSDGHGGTIVDSG 342

Query: 450  TTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFK 271
            TTFT+ME  VF+ V+ EF KQ+  +Y RA  +E  +GL PC++IS EK+V  P+L   FK
Sbjct: 343  TTFTYMEKPVFDAVSSEFAKQMA-NYTRAKGIENRTGLGPCFDISKEKSVNFPELVLQFK 401

Query: 270  GGAKMALPLADYFSFL------------DEAVXXXXXXXXXXXXXXNYQQQNFYMEYDLE 127
            GGAKM LPL +YFS +            ++ V              NYQQQNF++EYDL+
Sbjct: 402  GGAKMNLPLTNYFSIVGSPGSVCLTVVTNDDVGGPESVGGPAIILGNYQQQNFHIEYDLK 461

Query: 126  NERLGFRSQVCK 91
            NER GFR Q+CK
Sbjct: 462  NERFGFRRQICK 473


>ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223536957|gb|EEF38595.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 447

 Score =  421 bits (1083), Expect = e-115
 Identities = 217/424 (51%), Positives = 272/424 (64%), Gaps = 14/424 (3%)
 Frame = -1

Query: 1320 PLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLGFGTP 1141
            P  +  Q+L +L S S  RAHHLK+ +T             P+F   YGGYSISL FGTP
Sbjct: 39   PSQDHLQKLNYLVSTSLARAHHLKNPQTT------------PVFSHSYGGYSISLSFGTP 86

Query: 1140 PQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPKCKWL 961
            PQT SFVMDTGSS VWFPCT RY C++C+F  T+  S F+PK SSS+KI+GC+NPKC W+
Sbjct: 87   PQTLSFVMDTGSSFVWFPCTLRYLCNNCSF--TSRISPFLPKHSSSSKIIGCKNPKCSWI 144

Query: 960  FE-NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFAS 784
             + +++C +CD NS  C+Q+CP Y++ Y             L      V NF+VGCS  S
Sbjct: 145  HQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSETLHLHGLIVPNFLVGCSVFS 204

Query: 783  IRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK---- 616
             RQPAGIAGFGRGP SLP+Q+GL KFSYCL+SH+FD    SS L+LD             
Sbjct: 205  SRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALM 264

Query: 615  YTPFRKNP-ASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFT 439
            YTP  KNP     PAF  YYYV+LR+I++GG  VK PYK+L  D DGNGGTI+DSGTTFT
Sbjct: 265  YTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFT 324

Query: 438  FMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAK 259
            +M  + FE+++ EF  QV ++Y RA  VE  SGL+PC+N+SG K +ELPQL  HFKGGA 
Sbjct: 325  YMSTEAFEILSNEFISQV-KNYERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGAD 383

Query: 258  MALPLADYFSFLDE--------AVXXXXXXXXXXXXXXNYQQQNFYMEYDLENERLGFRS 103
            + LPL +YF+FL                          N+Q QNFY+EYDL+NERLGF+ 
Sbjct: 384  VELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKK 443

Query: 102  QVCK 91
            + CK
Sbjct: 444  ESCK 447


>ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citrus clementina]
            gi|557532142|gb|ESR43325.1| hypothetical protein
            CICLE_v10011613mg [Citrus clementina]
          Length = 483

 Score =  417 bits (1072), Expect = e-114
 Identities = 223/434 (51%), Positives = 280/434 (64%), Gaps = 27/434 (6%)
 Frame = -1

Query: 1314 ANPWQRLAHLASASSTRAHHLKHR------ETNISFXXXXXATKVPLFPRGYGGYSISLG 1153
            ++P + L  LAS+S +RA HLK +      ++NI         K PL    YGGYSISL 
Sbjct: 50   SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLS 109

Query: 1152 FGTPPQ-TTSFVMDTGSSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGCR 982
            FGTPPQ +T F+ DTGSSLVWFPCT RY C+ CNF   D +    FIPK SSS++++GC+
Sbjct: 110  FGTPPQASTPFIFDTGSSLVWFPCTSRYRCADCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169

Query: 981  NPKCKWLF-ENVQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDN 811
            NPKC W+F  NV+  C+ C+  +  C   CP Y++QY             L FP K+V N
Sbjct: 170  NPKCSWIFGPNVESRCKGCNPRNKTCPLACPPYLIQYGLGFTAGLLLSETLGFPSKTVPN 229

Query: 810  FVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXX 631
            F+VGCS  S RQPAGIAGFGR  ESLP+Q+GLKKFSYCL+S +FD  PVSS+L+LD    
Sbjct: 230  FLVGCSILSNRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGSG 289

Query: 630  XXATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGT 466
               +K     YTPF KNP  S+ AF EYYYV LR+I VG   VK PY +LV  SDGNGG 
Sbjct: 290  SGDSKTPGLSYTPFYKNPVGSSSAFGEYYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349

Query: 465  IVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQL 286
            IVDSG+T TFMEG +FE VA+EF +Q+G +Y RAA VE++SGLRPC++ISG+K+V LP+L
Sbjct: 350  IVDSGSTLTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408

Query: 285  TFHFKGGAKMALPLADYFSFLDEAV----------XXXXXXXXXXXXXXNYQQQNFYMEY 136
               FKGGAKMALPL +YF+ +   V                        ++Q QNFY+E+
Sbjct: 409  ILKFKGGAKMALPLENYFALVGNEVLCLILFTDNAAGPAPGGGPAIILGDFQLQNFYLEF 468

Query: 135  DLENERLGFRSQVC 94
            DL N+R GF  Q C
Sbjct: 469  DLANDRFGFAKQKC 482


>ref|XP_007145803.1| hypothetical protein PHAVU_007G269300g [Phaseolus vulgaris]
            gi|561018993|gb|ESW17797.1| hypothetical protein
            PHAVU_007G269300g [Phaseolus vulgaris]
          Length = 458

 Score =  414 bits (1065), Expect = e-113
 Identities = 220/432 (50%), Positives = 274/432 (63%), Gaps = 18/432 (4%)
 Frame = -1

Query: 1332 FAAPPLANPWQRLAHLASASSTRAHHLKHRETNISFXXXXXATKVPLFPRGYGGYSISLG 1153
            F   P ++P+  L    S S TRAHHLK+ + N          K  + P+ YGGYSI L 
Sbjct: 37   FTTHPSSHPFHTLKLAVSTSLTRAHHLKNHQPN--------PPKTQIHPKSYGGYSIDLN 88

Query: 1152 FGTPPQTTSFVMDTGSSLVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPK 973
            FGTPPQT SF++DTGS+LVW PC+  Y CS+CN    +  S FIPK SSS+K VGC NPK
Sbjct: 89   FGTPPQTFSFILDTGSTLVWLPCSSHYLCSNCNNFHNSPKS-FIPKNSSSSKFVGCTNPK 147

Query: 972  CKWLF-ENVQCRECDGNSTA--CNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVV 802
            CKW+F  +V+ R C  NS    C+Q CP Y +QY             L FP K + +F+V
Sbjct: 148  CKWVFGTSVESRCCKQNSATANCSQTCPAYTVQYGLGSTAGFLLSENLNFPGKLLPDFLV 207

Query: 801  GCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL----DXXX 634
            GCS  S+ QPAGIAGFGRGPESLP+QM L  FSYCL+SH+FD  P +SDL+L        
Sbjct: 208  GCSIVSVYQPAGIAGFGRGPESLPSQMNLTGFSYCLLSHQFDDSPETSDLVLHTSSSDNK 267

Query: 633  XXXATKYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDS 454
                  YTPFRKNP+S NPAF  YYY+TLR+I VG  +V+ P + L  D +GNGG+IVDS
Sbjct: 268  RTNGVSYTPFRKNPSSKNPAFGAYYYLTLRRIVVGEKRVRVPKRLLEPDVNGNGGSIVDS 327

Query: 453  GTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHF 274
            G+TFTFME  +F+LVAEEF +QV  +Y RA  +E++SGL PC+ +SG  T   P+L F F
Sbjct: 328  GSTFTFMERPIFDLVAEEFARQV--NYTRAREIEKKSGLSPCFVVSG--TATFPELRFEF 383

Query: 273  KGGAKMALPLADYFSFLDEA-----------VXXXXXXXXXXXXXXNYQQQNFYMEYDLE 127
            +GGAKM+LPL +YFS + ++           V              NYQQQNFY+EYDL 
Sbjct: 384  RGGAKMSLPLTNYFSLVGKSDVACLTIVSDDVAGPGVAAGPAVILGNYQQQNFYVEYDLG 443

Query: 126  NERLGFRSQVCK 91
            NER GFRSQ CK
Sbjct: 444  NERFGFRSQSCK 455


>ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis]
          Length = 483

 Score =  414 bits (1063), Expect = e-113
 Identities = 222/434 (51%), Positives = 278/434 (64%), Gaps = 27/434 (6%)
 Frame = -1

Query: 1314 ANPWQRLAHLASASSTRAHHLKHR------ETNISFXXXXXATKVPLFPRGYGGYSISLG 1153
            ++P + L  LAS+S +RA HLK +      ++NI         K PL    YGGYSISL 
Sbjct: 50   SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLS 109

Query: 1152 FGTPPQ-TTSFVMDTGSSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGCR 982
            FGTPPQ +T F+ DTGSSLVWFPCT RY C  CNF   D +    FIPK SSS++++GC+
Sbjct: 110  FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169

Query: 981  NPKCKWLF-ENVQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDN 811
            NPKC W+F  NV+  C+ C   +  C   CP+Y+LQY             L FP K+V N
Sbjct: 170  NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN 229

Query: 810  FVVGCSFASIRQPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXX 631
            F+ GCS  S RQPAGIAGFGR  ESLP+Q+GLKKFSYCL+S +FD  PVSS+L+LD    
Sbjct: 230  FLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289

Query: 630  XXATK-----YTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGT 466
               +K     YTPF KNP  S+ AF E+YYV LR+I VG   VK PY +LV  SDGNGG 
Sbjct: 290  SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349

Query: 465  IVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQL 286
            IVDSG+TFTFMEG +FE VA+EF +Q+G +Y RAA VE++SGLRPC++ISG+K+V LP+L
Sbjct: 350  IVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408

Query: 285  TFHFKGGAKMALPLADYFSFLDEAV----------XXXXXXXXXXXXXXNYQQQNFYMEY 136
               FKGGAKMALP  +YF+ +   V                        ++Q QNFY+E+
Sbjct: 409  ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468

Query: 135  DLENERLGFRSQVC 94
            DL N+R GF  Q C
Sbjct: 469  DLANDRFGFAKQKC 482

