BLASTX nr result
ID: Mentha29_contig00024753
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00024753 (1388 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus...   570   e-160
ref|XP_002309394.1| aspartyl protease family protein [Populus tr...   476   e-132
ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223...   467   e-129
ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2...   466   e-128
ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2...   462   e-127
emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]   459   e-126
ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2...   457   e-126
ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2...   457   e-126
ref|XP_007027933.1| Eukaryotic aspartyl protease family protein,...   449   e-123
ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2...   439   e-120
ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   438   e-120
ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prun...   438   e-120
ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phas...   437   e-120
gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis]    433   e-119
ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citr...   428   e-117
ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor,...   427   e-117
ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Popu...   427   e-117
ref|XP_002877867.1| aspartyl protease family protein [Arabidopsi...   427   e-117
ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1...
425 e-116 ref|XP_007145803.1| hypothetical protein PHAVU_007G269300g [Phas... 423 e-116 >gb|EYU18131.1| hypothetical protein MIMGU_mgv1a025649mg [Mimulus guttatus] Length = 462 Score = 570 bits (1470), Expect = e-160 Identities = 283/413 (68%), Positives = 323/413 (78%), Gaps = 7/413 (1%) Frame = -3 Query: 1353 SSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLV 1174 SSTRAH LKH T+ S ATK PLFPRGYGGYSISL FGTPPQT FVMDTGSSLV Sbjct: 54 SSTRAHLLKHPNTSTS---AAAATKAPLFPRGYGGYSISLSFGTPPQTLPFVMDTGSSLV 110 Query: 1173 WFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGCRNPKCKWLFENVQCRECDGNS 1000 WFPCT RY C+SCNF + +N S+F+PK SSS+ I+GC+NPKC+W+F +VQC+ CD NS Sbjct: 111 WFPCTQRYACNSCNFVNVNPSNISIFLPKSSSSSMIIGCKNPKCRWIFPDVQCKNCDQNS 170 Query: 999 TACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAGFGRGP 820 T C + CP YI+QY L FP+KSV+NF VGCS S RQPAGIAGFGRGP Sbjct: 171 TTCKEFCPPYIIQYGSGSTTGLLLSETLFFPEKSVENFFVGCSIFSSRQPAGIAGFGRGP 230 Query: 819 ESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL----DXXXXXXATKYTPFRKNPASSNPA 652 ESLPAQMGLK+FSYCLVSHRFD +PVSSDL+ +YTPFRKNP S+NPA Sbjct: 231 ESLPAQMGLKRFSYCLVSHRFDDEPVSSDLVFVGGGGAAGAAAGVEYTPFRKNPKSANPA 290 Query: 651 FREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEEFE 472 F++YYYVTLRKITVGGV VKAPY+FLVAD+ G+GGTIVDSGTTFTFME +VFE VAEEFE Sbjct: 291 FQDYYYVTLRKITVGGVHVKAPYEFLVADAAGDGGTIVDSGTTFTFMESRVFEPVAEEFE 350 Query: 471 KQVG-EHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLDE 295 KQVG +Y RA VE+ SGLRPC+N+SGE +V LP+L+FHFKGGA+M LPLADYFSFLD+ Sbjct: 351 KQVGRRNYSRAREVEDRSGLRPCFNVSGEGSVSLPELSFHFKGGAEMVLPLADYFSFLDD 410 Query: 294 AVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 +VICMT GPGPAIILGNYQQQNFYMEYDLENERLGF+ Q+CK Sbjct: 411 SVICMT-VVTNNSTREGIGPGPAIILGNYQQQNFYMEYDLENERLGFKRQLCK 462 >ref|XP_002309394.1| aspartyl protease family protein [Populus trichocarpa] gi|222855370|gb|EEE92917.1| aspartyl protease family protein [Populus trichocarpa] Length = 469 Score = 476 bits (1226), Expect = e-132 Identities = 246/416 (59%), Positives = 287/416 (68%), Gaps = 
11/416 (2%) Frame = -3 Query: 1353 SSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLV 1174 S +RAH +K T S K PLFPR YGGYSISL FGTPPQTT FVMDTGSSLV Sbjct: 63 SLSRAHHIKSPKTKFSLL------KTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 116 Query: 1173 WFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECD 1009 WFPCT RY CS C+F + FIPK SSS+ ++GC+N KC WLF +C+ECD Sbjct: 117 WFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECD 176 Query: 1008 GNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDK-SVDNFVVGCSFASIRQPAGIAGF 832 + C Q CP Y++QY L FP K ++ F+VGCS SIRQP GIAGF Sbjct: 177 PTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSLFSIRQPEGIAGF 236 Query: 831 GRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRKNPA 667 GR PESLP+Q+GLKKFSYCLVSH FD P SSDL+LD TK YTPF+KNP Sbjct: 237 GRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPT 296 Query: 666 SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELV 487 + AFR+YYYV LR I +G VK PYKFLV SDGNGGTIVDSGTTFTFME V+ELV Sbjct: 297 A---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELV 353 Query: 486 AEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFS 307 A+EFEKQV HY A V+ ++GLRPC+NISGEK+V +P+ FHFKGGAKMALPLA+YFS Sbjct: 354 AKEFEKQVA-HYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFS 412 Query: 306 FLDEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 F+D VIC+T G GPAIILGNYQQ+NF++E+DL+NER GF+ Q C Sbjct: 413 FVDSGVICLTIVSDNMSGSGIGG-GPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467 >ref|XP_002534234.1| pepsin A, putative [Ricinus communis] gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis] Length = 468 Score = 467 bits (1201), Expect = e-129 Identities = 241/422 (57%), Positives = 290/422 (68%), Gaps = 15/422 (3%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 TTS +RAH LK TN S K PLF R YGGYS+SL GTP QT +MDTGSS Sbjct: 53 TTSISRAHHLKSPKTNFSLI------KTPLFSRSYGGYSMSLSLGTPSQTVKLIMDTGSS 106 Query: 1179 LVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGCRNPKCKWLF-ENVQ--CRE 1015 LVWFPCT RY C+SCNF +T 
F+P+ SSS+K++GC+NPKC W+F +VQ C Sbjct: 107 LVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHN 166 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 C+ + C Q CP YI+QY + FP+K++ +F+ GCS S RQP GIAG Sbjct: 167 CNPQAQNCTQACPPYIIQYGLGSTAGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAG 226 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRKNP 670 FGR ESLP Q+GLKKFSYCLVS RFD PVSSDLILD +K YTPF+KN Sbjct: 227 FGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNL 286 Query: 669 AS-SNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFE 493 AS SNPAF+EYYYV LRKI VG VK PY FLV SDGNGGTIVDSG+TFTF+EG VFE Sbjct: 287 ASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFE 346 Query: 492 LVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADY 313 L+A+EFEKQ+ +Y A V++ +GLRPC++ISGEK+V +P LTF FKGGAKM LPL++Y Sbjct: 347 LLAKEFEKQMA-NYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNY 405 Query: 312 FSFLDEAVICMT----XXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQ 145 F+F+D V+C+T GPAIILGN+QQQNFY+EYDLEN+R GF+ Q Sbjct: 406 FAFVDMGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQ 465 Query: 144 VC 139 C Sbjct: 466 SC 467 >ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max] Length = 474 Score = 466 bits (1198), Expect = e-128 Identities = 238/418 (56%), Positives = 282/418 (67%), Gaps = 10/418 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 + S TRAH LKHR N P +P+ YGGYSI L GTPPQT+ FV+DTGSS Sbjct: 60 SASLTRAHHLKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSS 114 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFEN---VQCRE 1015 LVWFPCT RY CS CNF DT FIPK SS+AK++GCRNPKC ++F + +C + Sbjct: 115 LVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQ 174 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 C S C+ CP YI+QY L FP K+V F+VGCS SIRQP+GIAG Sbjct: 175 CKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAG 234 Query: 834 
FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT----KYTPFRKNPA 667 FGRG ESLP+QM LK+FSYCLVSHRFD P SSDL+L YTPFR NP+ Sbjct: 235 FGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPS 294 Query: 666 SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELV 487 ++NPAF+EYYY+TLRK+ VGG VK PY FL SDGNGGTIVDSG+TFTFME V+ LV Sbjct: 295 TNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLV 354 Query: 486 AEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFS 307 A+EF KQ+ ++Y RA E +SGL PC+NISG KTV P+LTF FKGGAKM PL +YFS Sbjct: 355 AQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFS 414 Query: 306 FLDEA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 + +A V+C+T GPAIILGNYQQQNFY+EYDLENER GF + C+ Sbjct: 415 LVGDAEVVCLT-VVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSCR 471 >ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 469 Score = 462 bits (1188), Expect = e-127 Identities = 244/419 (58%), Positives = 281/419 (67%), Gaps = 11/419 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 + S TRAH LKHR S PLF YGGYS+SL FGTP QT SFVMDTGSS Sbjct: 60 SASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVSLSFGTPSQTLSFVMDTGSS 112 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFEN---VQCRE 1015 LVWFPCT RY C+ C+F D A FIPK SSSAKIVGC NPKC ++ ++ +C Sbjct: 113 LVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPG 172 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 CD NS C + CPTY +QY LVF +++ +FVVGCS S RQP+GIAG Sbjct: 173 CDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAG 232 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRKNP 670 FGRGP SLP QMGLKKFSYCL+SHRFD P SS + L K YTPFRKNP Sbjct: 233 FGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNP 292 Query: 669 ASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFEL 490 SSN AF+EYYYVTLR I VG +VK PY F+VA SDGNGGTIVDSG+TFTFME VFE Sbjct: 293 
VSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEA 352 Query: 489 VAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYF 310 VA EF++Q+ +Y RAA VE SGL+PC+N+SG +V LP L F FKGGAKM LP+A+YF Sbjct: 353 VATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYF 411 Query: 309 SFL-DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 S + D +V+C+T GP+IILGNYQ QNFY EYDLENER GFR Q CK Sbjct: 412 SLVGDLSVLCLT-IVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRCK 469 >emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera] Length = 609 Score = 459 bits (1182), Expect = e-126 Identities = 243/418 (58%), Positives = 280/418 (66%), Gaps = 11/418 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 + S TRAH LKHR S PLF YGGYS+SL FGTP QT SFVMDTGSS Sbjct: 60 SASLTRAHHLKHRKNTSS-------VNTPLFAHSYGGYSVSLSFGTPSQTLSFVMDTGSS 112 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFEN---VQCRE 1015 LVWFPCT RY C+ C+F D A FIPK SSSAKIVGC NPKC ++ ++ +C Sbjct: 113 LVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPG 172 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 CD NS C + CPTY +QY LVF +++ +FVVGCS S RQP+GIAG Sbjct: 173 CDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAG 232 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRKNP 670 FGRGP SLP QMGLKKFSYCL+SHRFD P SS + L K YTPFRKNP Sbjct: 233 FGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNP 292 Query: 669 ASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFEL 490 SSN AF+EYYYVTLR I VG +VK PY F+VA SDGNGGTIVDSG+TFTFME VFE Sbjct: 293 VSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEA 352 Query: 489 VAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYF 310 VA EF++Q+ +Y RAA VE SGL+PC+N+SG +V LP L F FKGGAKM LP+A+YF Sbjct: 353 VATEFDRQMA-NYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYF 411 Query: 309 SFL-DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 S + D +V+C+T GP+IILGNYQ 
QNFY EYDLENER GFR Q C Sbjct: 412 SLVGDLSVLCLT-IVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468 >ref|XP_004303503.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Fragaria vesca subsp. vesca] Length = 458 Score = 457 bits (1176), Expect = e-126 Identities = 235/419 (56%), Positives = 286/419 (68%), Gaps = 11/419 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 + S +RAH LK N S ATKVPL+PR YGGYSISL FGTPPQ ++FVMDTGSS Sbjct: 51 SASLSRAHHLKRPKHNSS------ATKVPLYPRSYGGYSISLSFGTPPQISTFVMDTGSS 104 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFENVQCRECDG 1006 LVWFPCT RY CS C+F D + FIPK SSSA+++GC+NPKC W+F +C Sbjct: 105 LVWFPCTSRYLCSRCSFPNIDPSTIPAFIPKLSSSARLLGCKNPKCAWIFGPEVNTKCPN 164 Query: 1005 NSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAGFGR 826 +S Q CP+Y++QY L FPDK+V +F+VGCSF SIRQPAG+AGFGR Sbjct: 165 SS----QACPSYVIQYGSGTTAGVLLSESLDFPDKTVPDFLVGCSFLSIRQPAGMAGFGR 220 Query: 825 GPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL--------DXXXXXXATKYTPFRKNP 670 GP+SLP QMGL KFSYCLVSHRFD PVSSDL+L D YTPF+KNP Sbjct: 221 GPQSLPVQMGLSKFSYCLVSHRFDDTPVSSDLVLYSGSTSDGDEIDDNHDISYTPFQKNP 280 Query: 669 ASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFEL 490 ++N A+REYYY+ LRK+ VG VK PYK+LV D NGGTIVDSG+TFTFME VFE Sbjct: 281 GAANTAYREYYYLALRKVIVGKKHVKIPYKYLVPGEDDNGGTIVDSGSTFTFMERPVFEA 340 Query: 489 VAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYF 310 VAE F Q+ E Y RA +E +GL+PC++IS E+ V+ P+L F FKGGAKMA+PL +YF Sbjct: 341 VAEAFATQM-EKYTRAGDIENRTGLKPCFDISKEEKVDFPELVFQFKGGAKMAMPLNNYF 399 Query: 309 SFL-DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 + + + V+C+T GPA+ILGN+QQQNFY+EYDLE ER GF+ Q CK Sbjct: 400 ALVTSDGVVCLT-IVTDGVAGPGVAAGPAVILGNFQQQNFYVEYDLERERFGFKKQSCK 457 >ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max] Length = 470 Score = 457 bits (1176), Expect = e-126 Identities = 237/418 (56%), Positives = 277/418 (66%), Gaps = 10/418 (2%) Frame = -3 Query: 1359 
TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 ++S TRAH LKHR N P +P+ YGGYSI L GTPPQT+ FV+DTGSS Sbjct: 56 SSSLTRAHHLKHRNNN-----SPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSS 110 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRE 1015 LVWFPCT Y CS CNF D FIPK SS+AK++GCRNPKC +LF +C + Sbjct: 111 LVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQ 170 Query: 1014 CDG-NSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIA 838 C S C+ CP+YI+QY L FP K+V F+VGCS SIRQP+GIA Sbjct: 171 CKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIA 230 Query: 837 GFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT----KYTPFRKNP 670 GFGRG ESLP+QM LK+FSYCLVSHRFD P SSDL+L YTPFR NP Sbjct: 231 GFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNP 290 Query: 669 ASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFEL 490 S+N FREYYYVTLRK+ VGGV VK PYKFL SDGNGGTIVDSG+TFTFME V+ L Sbjct: 291 -SNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNL 349 Query: 489 VAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYF 310 VA+EF +Q+G+ Y R VE +SGL PC+NISG KT+ P+ TF FKGGAKM+ PL +YF Sbjct: 350 VAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYF 409 Query: 309 SFLDEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 SF+ +A + GPAIILGNYQQQNFY+EYDLENER GF + CK Sbjct: 410 SFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNCK 467 >ref|XP_007027933.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|590632770|ref|XP_007027934.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|590632774|ref|XP_007027935.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716538|gb|EOY08435.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716539|gb|EOY08436.1| Eukaryotic aspartyl protease family protein, putative isoform 1 [Theobroma cacao] gi|508716540|gb|EOY08437.1| Eukaryotic aspartyl 
protease family protein, putative isoform 1 [Theobroma cacao] Length = 472 Score = 449 bits (1154), Expect = e-123 Identities = 234/419 (55%), Positives = 280/419 (66%), Gaps = 12/419 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXAT--KVPLFPRGYGGYSISLGFGTPPQTTSFVMDTG 1186 T+S +RAH LK I ++ K PLFP YGGY+ISLG GTPPQT +F+MDTG Sbjct: 55 TSSVSRAHHLKQPTHKIKAKANTTSSLLKTPLFPHSYGGYTISLGIGTPPQTLTFIMDTG 114 Query: 1185 SSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQC 1021 SSL WFPCT RY CS C F D F PK SSS +VGC+NPKC+WLF +C Sbjct: 115 SSLSWFPCTSRYICSQCAFPNVDPKKIPTFSPKLSSSKALVGCKNPKCRWLFGPDVESRC 174 Query: 1020 RECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGI 841 ++C+ S C Q CP YI+QY LVF K+ +F+VGCS S RQPAGI Sbjct: 175 QDCEPASKNCTQNCPPYIIQYGLGSTGGLLLVENLVFSQKTFQDFLVGCSIFSNRQPAGI 234 Query: 840 AGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRK 676 GFGR PESLP+Q+G+KKFSYCLVS RFD VSS+++L+ K YTPF K Sbjct: 235 VGFGRRPESLPSQLGVKKFSYCLVSRRFDDTGVSSNMLLETGSGSGDAKTKGLSYTPFYK 294 Query: 675 NPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVF 496 N +S+P F+E+YYVT+RKI VG VK PYK+LV DGNGGTIVDSG+TFTFME VF Sbjct: 295 NQFASHPIFQEFYYVTIRKILVGDKHVKVPYKYLVPGPDGNGGTIVDSGSTFTFMERAVF 354 Query: 495 ELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLAD 316 ELV++EFEKQ+G +Y RA VE +SGL PC NISG K++ P+L F FKGGAKMALPLA+ Sbjct: 355 ELVSKEFEKQMG-NYSRAHEVENKSGLAPCVNISGHKSISFPELIFQFKGGAKMALPLAN 413 Query: 315 YFSFLDEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 YFSFLD V+C+ G GPAIILGN+QQQN+Y+EYDL NE GF Q C Sbjct: 414 YFSFLDVNVVCLMVVTDNIIGQGVSG-GPAIILGNFQQQNYYIEYDLANESFGFAKQSC 471 >ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus] Length = 457 Score = 439 bits (1128), Expect = e-120 Identities = 224/414 (54%), Positives = 275/414 (66%), Gaps = 7/414 (1%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 ++S TRAHQ+K +N F K PL P YG YS L FGTP QT + DTGSS Sbjct: 51 
SSSQTRAHQIKTPKSNSVF-------KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSS 103 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRE 1015 LVWFPCT RY CS C+F D F+PK SSS+K+VGC+NPKC W+F QCR Sbjct: 104 LVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRS 163 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 C+ + C Q CP Y++QY L FPDK + NFVVGCSF SI QP+GIAG Sbjct: 164 CNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAG 223 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXA-TKYTPFRKNPASSN 658 FGRG ESLP+QMGLKKF+YCL S +FD P S LILD + YTPFR+NP+ SN Sbjct: 224 FGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSN 283 Query: 657 PAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEE 478 A++EYYY+ +RKI VG VK PYKFLV DGNGG+I+DSG+TFTFM+ V E+VA E Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343 Query: 477 FEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLD 298 FEKQ+ ++ RA VE +GLRPC++IS EK+V+ P+L F FKGGAK ALPL +YF+ + Sbjct: 344 FEKQLA-NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVS 402 Query: 297 EA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 + V C+T G GP++ILG +QQQNFY+EYDL N+RLGFR Q C Sbjct: 403 SSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456 >ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase nepenthesin-2-like [Cucumis sativus] Length = 457 Score = 438 bits (1127), Expect = e-120 Identities = 224/414 (54%), Positives = 275/414 (66%), Gaps = 7/414 (1%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 ++S TRAHQ+K +N F K PL P YG YS L FGTP QT + DTGSS Sbjct: 51 SSSQTRAHQIKTPKSNSVF-------KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSS 103 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCRE 1015 LVWFPCT RY CS C+F D F+PK SSS+K+VGC+NPKC W+F QCR Sbjct: 104 LVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRS 163 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 C+ 
+ C Q CP Y++QY L FPDK + NFVVGCSF SI QP+GIAG Sbjct: 164 CNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAG 223 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXA-TKYTPFRKNPASSN 658 FGRG ESLP+QMGLKKF+YCL S +FD P S LILD + YTPFR+NP+ SN Sbjct: 224 FGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSN 283 Query: 657 PAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEE 478 A++EYYY+ +RKI VG VK PYKFLV DGNGG+I+DSG+TFTFM+ V E+VA E Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343 Query: 477 FEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFLD 298 FEKQ+ ++ RA VE +GLRPC++IS EK+V+ P+L F FKGGAK ALPL +YF+ + Sbjct: 344 FEKQLA-NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVS 402 Query: 297 EA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 + V C+T G GP++ILG +QQQNFY+EYDL N+RLGFR Q C Sbjct: 403 SSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456 >ref|XP_007202027.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica] gi|462397558|gb|EMJ03226.1| hypothetical protein PRUPE_ppa005104mg [Prunus persica] Length = 477 Score = 438 bits (1126), Expect = e-120 Identities = 237/437 (54%), Positives = 286/437 (65%), Gaps = 29/437 (6%) Frame = -3 Query: 1359 TTSSTRAHQLKH-RGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGS 1183 + S +RAH +K+ R N S T+VPLFP YG YS+SL FGTPPQT+SF+MDTGS Sbjct: 49 SASISRAHHIKNSRKPNSSL------TQVPLFPHSYGDYSVSLNFGTPPQTSSFIMDTGS 102 Query: 1182 SLVWFPCTDRYTCSSCNFADT--ANFSVFIPKFSSSAKIVGCRNPKCKWLFE---NVQCR 1018 SLVWFPCT RY CS C F + A F PK SSS+KIVGC+NPKC W+F +C Sbjct: 103 SLVWFPCTKRYICSRCQFPNINPAKIPTFKPKLSSSSKIVGCQNPKCGWIFGPEVKSKCP 162 Query: 1017 ECDGNSTA-CNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGI 841 C+ S C+Q CPTYI+QY L FP K V +F+VGCSF SIRQPAGI Sbjct: 163 NCNNPSHQNCSQACPTYIIQYGSGTTAGILLSETLDFPKKIVPDFLVGCSFVSIRQPAGI 222 Query: 840 AGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT------------ 697 AGFGRGP+SLPAQMGL KFSYCLVSHRFD P SSDL+L ++ Sbjct: 223 
AGFGRGPQSLPAQMGLTKFSYCLVSHRFDDTPQSSDLVLYSSSSGSSSSSEEEPTIAESQ 282 Query: 696 ---------KYTPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGT 544 TPF+KNP N AFREYYY+ LRK+ VG VK PYKFLV +D +GGT Sbjct: 283 RNKTKLQSLSSTPFQKNPGPPNSAFREYYYIMLRKVIVGNKNVKIPYKFLVPGADSSGGT 342 Query: 543 IVDSGTTFTFMEGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQL 364 IVDSG+TFTFME VFE VA+EFE Q+ +Y RA +E ++GLRPC++IS EK V+ P+L Sbjct: 343 IVDSGSTFTFMEKPVFEPVAKEFEAQMA-NYTRAKDLENKTGLRPCFDISKEKKVDFPEL 401 Query: 363 TFHFKGGAKMALPLADYFSFLDEA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYM 187 F FKGGAKM LP +YFS + + V+C+T G GPAIILGNYQQQ+F++ Sbjct: 402 VFQFKGGAKMELPSKNYFSMVSSSGVVCLTIVTDGVVGPGGNG-GPAIILGNYQQQDFHV 460 Query: 186 EYDLENERLGFRSQVCK 136 EYDL++ + GFR Q CK Sbjct: 461 EYDLQHGKFGFRKQSCK 477 >ref|XP_007162958.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris] gi|561036422|gb|ESW34952.1| hypothetical protein PHAVU_001G194500g [Phaseolus vulgaris] Length = 466 Score = 437 bits (1124), Expect = e-120 Identities = 228/418 (54%), Positives = 278/418 (66%), Gaps = 10/418 (2%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 + S TRAH LKHR S A ++P+ YGGYSI L FGTPPQT+ FV+DTGSS Sbjct: 54 SASLTRAHHLKHRLNAPS------AATTQVYPKSYGGYSIDLNFGTPPQTSPFVLDTGSS 107 Query: 1179 LVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLFEN---VQCRE 1015 LVWFPCT RY CS C F D FIPK SS+++++GC+NPKC +LF + +C + Sbjct: 108 LVWFPCTSRYLCSHCLFPNIDPTKIPTFIPKNSSTSRLLGCKNPKCGYLFGSDLQSRCPQ 167 Query: 1014 CDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAG 835 C +S C+ CP YI+QY L FP+K V F+VGCS SIRQP+GIAG Sbjct: 168 CKPDSQNCSLTCPPYIIQYGLGSTAGFLLLDNLNFPEKIVPQFLVGCSILSIRQPSGIAG 227 Query: 834 FGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXAT----KYTPFRKNPA 667 FGRG ESLPAQM LK+FSYCL+SH FD +SDL+L YTPF NP+ Sbjct: 228 FGRGQESLPAQMALKRFSYCLLSHNFDDSTENSDLVLQISSTGDTKTNGLSYTPFHPNPS 287 Query: 666 SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELV 487 ++NPAF EYYY++LRK+ VGG VK P FL SDGNGGTIVDSG+TFTFME ++LV 
Sbjct: 288 ANNPAFLEYYYLSLRKVIVGGKNVKIPLSFLEPGSDGNGGTIVDSGSTFTFMERPAYDLV 347 Query: 486 AEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFS 307 +EF KQ+G +Y RA VE +SGL PC+NISG KTV P+ T FKGGAKM LP+ +YFS Sbjct: 348 VKEFVKQLG-NYSRAEDVEAQSGLGPCFNISGAKTVNFPKFTLQFKGGAKMTLPVENYFS 406 Query: 306 FLDEA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 +D++ V+C+T GPAIILGNYQQQNF++EYDLENER GF Q CK Sbjct: 407 LIDDSEVVCLT-IVSDGGAGPATTSGPAIILGNYQQQNFHIEYDLENERFGFGPQSCK 463 >gb|EXC01923.1| Aspartic proteinase nepenthesin-2 [Morus notabilis] Length = 473 Score = 433 bits (1114), Expect = e-119 Identities = 224/421 (53%), Positives = 278/421 (66%), Gaps = 13/421 (3%) Frame = -3 Query: 1359 TTSSTRAHQLK-----HRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVM 1195 + S +RAH LK + ++ S TK PL+PR YGGYS+SL FGTPPQ FVM Sbjct: 54 SASLSRAHALKRPKSVNSSSSSSSTDSKYQTKTPLYPRSYGGYSVSLRFGTPPQILQFVM 113 Query: 1194 DTGSSLVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPKCKW-LFENVQCR 1018 DTGSSLVWFPCT RY CS C+F ++ N FIPK SSS+K++GC+NPKC+ L +C Sbjct: 114 DTGSSLVWFPCTSRYLCSKCSFPNSQNPPKFIPKKSSSSKLIGCQNPKCQLVLGATAKCD 173 Query: 1017 ECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIA 838 + N+ CP YI+QY L FP K V +F+VGCS SIRQP+GIA Sbjct: 174 DATAGENPKNKACPAYIIQYGSGSTIGQLLSETLNFPGKMVPDFIVGCSVLSIRQPSGIA 233 Query: 837 GFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL-----DXXXXXXATKYTPFRKN 673 GFGRG ESLP+Q+ L KFSYCLVSHRFD SSDL+L D + YTPF+KN Sbjct: 234 GFGRGKESLPSQLRLAKFSYCLVSHRFDDTSFSSDLVLYSSSSDDKQPEGSISYTPFQKN 293 Query: 672 PA-SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVF 496 P+ SS PA +EYYY+ +RK+ VG VK PY++LV SDG+GGTIVDSGTTFT+ME VF Sbjct: 294 PSLSSIPALKEYYYILIRKVIVGKTHVKIPYRYLVPGSDGHGGTIVDSGTTFTYMEKPVF 353 Query: 495 ELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLAD 316 + V+ EF KQ+ +Y RA +E +GL PC++IS EK+V P+L FKGGAKM LPL + Sbjct: 354 DAVSSEFAKQMA-NYTRAKGIENRTGLGPCFDISKEKSVNFPELVLQFKGGAKMNLPLTN 412 Query: 315 YFSFL-DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVC 139 YFS + +C+T 
GPAIILGNYQQQNF++EYDL+NER GFR Q+C Sbjct: 413 YFSIVGSPGSVCLTVVTNDDVGGPESVGGPAIILGNYQQQNFHIEYDLKNERFGFRRQIC 472 Query: 138 K 136 K Sbjct: 473 K 473 >ref|XP_006430085.1| hypothetical protein CICLE_v10011613mg [Citrus clementina] gi|557532142|gb|ESR43325.1| hypothetical protein CICLE_v10011613mg [Citrus clementina] Length = 483 Score = 428 bits (1101), Expect = e-117 Identities = 226/424 (53%), Positives = 282/424 (66%), Gaps = 17/424 (4%) Frame = -3 Query: 1359 TTSSTRAHQLKHR------GTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQ-TTSF 1201 ++S +RA LK + +NI K PL YGGYSISL FGTPPQ +T F Sbjct: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120 Query: 1200 VMDTGSSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGCRNPKCKWLF-EN 1030 + DTGSSLVWFPCT RY C+ CNF D + FIPK SSS++++GC+NPKC W+F N Sbjct: 121 IFDTGSSLVWFPCTSRYRCADCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180 Query: 1029 VQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIR 856 V+ C+ C+ + C CP Y++QY L FP K+V NF+VGCS S R Sbjct: 181 VESRCKGCNPRNKTCPLACPPYLIQYGLGFTAGLLLSETLGFPSKTVPNFLVGCSILSNR 240 Query: 855 QPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----Y 691 QPAGIAGFGR ESLP+Q+GLKKFSYCL+S +FD PVSS+L+LD +K Y Sbjct: 241 QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGSGSGDSKTPGLSY 300 Query: 690 TPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFM 511 TPF KNP S+ AF EYYYV LR+I VG VK PY +LV SDGNGG IVDSG+T TFM Sbjct: 301 TPFYKNPVGSSSAFGEYYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTLTFM 360 Query: 510 EGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMA 331 EG +FE VA+EF +Q+G +Y RAA VE++SGLRPC++ISG+K+V LP+L FKGGAKMA Sbjct: 361 EGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMA 419 Query: 330 LPLADYFSFLDEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFR 151 LPL +YF+ + V+C+ G GPAIILG++Q QNFY+E+DL N+R GF Sbjct: 420 LPLENYFALVGNEVLCLILFTDNAAGPAPGG-GPAIILGDFQLQNFYLEFDLANDRFGFA 478 Query: 150 SQVC 139 Q C Sbjct: 479 KQKC 482 >ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus 
communis] gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis] Length = 447 Score = 427 bits (1098), Expect = e-117 Identities = 220/415 (53%), Positives = 273/415 (65%), Gaps = 7/415 (1%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 +TS RAH LK+ T P+F YGGYSISL FGTPPQT SFVMDTGSS Sbjct: 52 STSLARAHHLKNPQTT------------PVFSHSYGGYSISLSFGTPPQTLSFVMDTGSS 99 Query: 1179 LVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPKCKWLFE-NVQCRECDGN 1003 VWFPCT RY C++C+F T+ S F+PK SSS+KI+GC+NPKC W+ + +++C +CD N Sbjct: 100 FVWFPCTLRYLCNNCSF--TSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNN 157 Query: 1002 STACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAGFGRG 823 S C+Q+CP Y++ Y L V NF+VGCS S RQPAGIAGFGRG Sbjct: 158 SRNCSQICPPYLILYGSGTTGGVALSETLHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRG 217 Query: 822 PESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK----YTPFRKNP-ASSN 658 P SLP+Q+GL KFSYCL+SH+FD SS L+LD YTP KNP Sbjct: 218 PSSLPSQLGLTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDK 277 Query: 657 PAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAEE 478 PAF YYYV+LR+I++GG VK PYK+L D DGNGGTI+DSGTTFT+M + FE+++ E Sbjct: 278 PAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNE 337 Query: 477 FEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFL- 301 F QV ++Y RA VE SGL+PC+N+SG K +ELPQL HFKGGA + LPL +YF+FL Sbjct: 338 FISQV-KNYERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLG 396 Query: 300 DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 V C T GP +ILGN+Q QNFY+EYDL+NERLGF+ + CK Sbjct: 397 SREVACFT----VVTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESCK 447 >ref|XP_002323393.2| hypothetical protein POPTR_0016s07260g [Populus trichocarpa] gi|550321034|gb|EEF05154.2| hypothetical protein POPTR_0016s07260g [Populus trichocarpa] Length = 454 Score = 427 bits (1097), Expect = e-117 Identities = 215/370 (58%), Positives = 258/370 (69%), Gaps = 11/370 (2%) Frame = -3 Query: 1353 
SSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSSLV 1174 S +RAH +K TN S K PLFPR YGGYSISL FGTPPQTT FVMDTGSSLV Sbjct: 54 SLSRAHHIKSPKTNFSLI------KTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 107 Query: 1173 WFPCTDRYTCSSCNFADTANFSV--FIPKFSSSAKIVGCRNPKCKWLFE---NVQCRECD 1009 WFPCT RY CS CNF + + F+PK SSS+K++GC+NP+C +F +C+ECD Sbjct: 108 WFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECD 167 Query: 1008 GNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDK-SVDNFVVGCSFASIRQPAGIAGF 832 + C Q CP Y++QY L FP+K ++ +F+VGCS SI+QP GIAGF Sbjct: 168 STAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKTIPDFLVGCSIFSIKQPEGIAGF 227 Query: 831 GRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----YTPFRKNPA 667 GR PESLP+Q+GLKKFSYCLVSH FD P SSDL+LD TK +TPF KNP Sbjct: 228 GRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPT 287 Query: 666 SSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELV 487 + AFR+YYYV LR I +G VK PYKFLV +DGNGGTIVDSGTTFTFME V+ELV Sbjct: 288 T---AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELV 344 Query: 486 AEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFS 307 A+EFEKQ+ HY A ++ +GLRPCYNISGEK++ +P L F FKGGAKMALPL++YFS Sbjct: 345 AKEFEKQMA-HYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFS 403 Query: 306 FLDEAVICMT 277 +D VIC+T Sbjct: 404 IVDSGVICLT 413 >ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata] gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 469 Score = 427 bits (1097), Expect = e-117 Identities = 232/425 (54%), Positives = 280/425 (65%), Gaps = 19/425 (4%) Frame = -3 Query: 1356 TSSTRAHQLKHRGTNI---------SFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTS 1204 +S RAH+LKH GT+I + K L P+ YGGYS+SL FGTP QT Sbjct: 46 SSIARAHKLKH-GTSIKPDEEALSSTATASATVVKSHLSPKSYGGYSVSLSFGTPSQTIP 104 Query: 1203 FVMDTGSSLVWFPCTDRYTCSSCNFA--DTANFSVFIPKFSSSAKIVGCRNPKCKWLF-E 1033 FV DTGSSLVWFPCT RY CS CNF+ D FIPK SSS++++GC+NPKC++LF Sbjct: 105 FVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFGA 164 Query: 1032 NVQCRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQ 853 NVQCR CD N+ C CP YILQY L FPD +V +FVVGCS S R Sbjct: 165 NVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGILISEKLDFPDLTVPDFVVGCSVISTRT 224 Query: 852 PAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXA------TKY 691 PAGIAGFGRGPESLP+QM LK FS+CLVS RFD V++DL LD + Y Sbjct: 225 PAGIAGFGRGPESLPSQMKLKSFSHCLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSY 284 Query: 690 TPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFM 511 TPFRKNP SN AF EYYY+ LR+I VG VK PYKFL ++GNGG+IVDSG+TFTFM Sbjct: 285 TPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFM 344 Query: 510 EGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMA 331 E VFELVAEEF Q+ +Y R +E+ SG+ PC+NISG+ V +P+L F FKGGAKM Sbjct: 345 ERPVFELVAEEFATQM-SNYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKME 403 Query: 330 LPLADYFSFLDEA-VICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGF 154 LPL++YFSF+ A +C+T G GPAIILG++QQQN+ +EYDLEN+R GF Sbjct: 404 LPLSNYFSFVGNADTVCLT-VVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGF 462 Query: 153 RSQVC 139 + C Sbjct: 463 AKKKC 467 >ref|XP_006481575.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] Length = 483 Score = 425 bits (1092), Expect = e-116 Identities = 224/424 (52%), Positives = 279/424 (65%), Gaps = 17/424 (4%) Frame = -3 Query: 1359 TTSSTRAHQLKHR------GTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQ-TTSF 1201 ++S +RA LK + +NI K PL YGGYSISL FGTPPQ +T F Sbjct: 61 
SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120 Query: 1200 VMDTGSSLVWFPCTDRYTCSSCNF--ADTANFSVFIPKFSSSAKIVGCRNPKCKWLF-EN 1030 + DTGSSLVWFPCT RY C CNF D + FIPK SSS++++GC+NPKC W+F N Sbjct: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180 Query: 1029 VQ--CRECDGNSTACNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIR 856 V+ C+ C + C CP+Y+LQY L FP K+V NF+ GCS S R Sbjct: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR 240 Query: 855 QPAGIAGFGRGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLILDXXXXXXATK-----Y 691 QPAGIAGFGR ESLP+Q+GLKKFSYCL+S +FD PVSS+L+LD +K Y Sbjct: 241 QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY 300 Query: 690 TPFRKNPASSNPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFM 511 TPF KNP S+ AF E+YYV LR+I VG VK PY +LV SDGNGG IVDSG+TFTFM Sbjct: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFM 360 Query: 510 EGKVFELVAEEFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMA 331 EG +FE VA+EF +Q+G +Y RAA VE++SGLRPC++ISG+K+V LP+L FKGGAKMA Sbjct: 361 EGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMA 419 Query: 330 LPLADYFSFLDEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFR 151 LP +YF+ + V+C+ GPAIILG++Q QNFY+E+DL N+R GF Sbjct: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGR-GPAIILGDFQLQNFYLEFDLANDRFGFA 478 Query: 150 SQVC 139 Q C Sbjct: 479 KQKC 482 >ref|XP_007145803.1| hypothetical protein PHAVU_007G269300g [Phaseolus vulgaris] gi|561018993|gb|ESW17797.1| hypothetical protein PHAVU_007G269300g [Phaseolus vulgaris] Length = 458 Score = 423 bits (1088), Expect = e-116 Identities = 220/415 (53%), Positives = 273/415 (65%), Gaps = 7/415 (1%) Frame = -3 Query: 1359 TTSSTRAHQLKHRGTNISFXXXXXATKVPLFPRGYGGYSISLGFGTPPQTTSFVMDTGSS 1180 +TS TRAH LK+ N K + P+ YGGYSI L FGTPPQT SF++DTGS+ Sbjct: 54 STSLTRAHHLKNHQPN--------PPKTQIHPKSYGGYSIDLNFGTPPQTFSFILDTGST 105 Query: 1179 LVWFPCTDRYTCSSCNFADTANFSVFIPKFSSSAKIVGCRNPKCKWLF-ENVQCRECDGN 1003 LVW PC+ Y CS+CN + S FIPK SSS+K VGC NPKCKW+F +V+ R C N Sbjct: 106 
LVWLPCSSHYLCSNCNNFHNSPKS-FIPKNSSSSKFVGCTNPKCKWVFGTSVESRCCKQN 164 Query: 1002 STA--CNQLCPTYILQYXXXXXXXXXXXXXLVFPDKSVDNFVVGCSFASIRQPAGIAGFG 829 S C+Q CP Y +QY L FP K + +F+VGCS S+ QPAGIAGFG Sbjct: 165 SATANCSQTCPAYTVQYGLGSTAGFLLSENLNFPGKLLPDFLVGCSIVSVYQPAGIAGFG 224 Query: 828 RGPESLPAQMGLKKFSYCLVSHRFDGKPVSSDLIL----DXXXXXXATKYTPFRKNPASS 661 RGPESLP+QM L FSYCL+SH+FD P +SDL+L YTPFRKNP+S Sbjct: 225 RGPESLPSQMNLTGFSYCLLSHQFDDSPETSDLVLHTSSSDNKRTNGVSYTPFRKNPSSK 284 Query: 660 NPAFREYYYVTLRKITVGGVKVKAPYKFLVADSDGNGGTIVDSGTTFTFMEGKVFELVAE 481 NPAF YYY+TLR+I VG +V+ P + L D +GNGG+IVDSG+TFTFME +F+LVAE Sbjct: 285 NPAFGAYYYLTLRRIVVGEKRVRVPKRLLEPDVNGNGGSIVDSGSTFTFMERPIFDLVAE 344 Query: 480 EFEKQVGEHYRRAAAVEEESGLRPCYNISGEKTVELPQLTFHFKGGAKMALPLADYFSFL 301 EF +QV +Y RA +E++SGL PC+ +SG T P+L F F+GGAKM+LPL +YFS + Sbjct: 345 EFARQV--NYTRAREIEKKSGLSPCFVVSG--TATFPELRFEFRGGAKMSLPLTNYFSLV 400 Query: 300 DEAVICMTXXXXXXXXXXXXGPGPAIILGNYQQQNFYMEYDLENERLGFRSQVCK 136 ++ + GPA+ILGNYQQQNFY+EYDL NER GFRSQ CK Sbjct: 401 GKSDVACLTIVSDDVAGPGVAAGPAVILGNYQQQNFYVEYDLGNERFGFRSQSCK 455