BLASTX nr result

ID: Ziziphus21_contig00011100

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00011100
         (2014 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007215315.1| hypothetical protein PRUPE_ppa003547mg [Prun...   741   0.0  
ref|XP_008229704.1| PREDICTED: uncharacterized protein LOC103329...   736   0.0  
ref|XP_010087862.1| Aspartic proteinase nepenthesin-1 [Morus not...   731   0.0  
ref|XP_006437136.1| hypothetical protein CICLE_v10031104mg [Citr...   721   0.0  
ref|XP_003553140.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   719   0.0  
ref|XP_008380197.1| PREDICTED: aspartic proteinase nepenthesin-2...   715   0.0  
ref|XP_008353635.1| PREDICTED: aspartic proteinase nepenthesin-2...   714   0.0  
ref|XP_006484924.1| PREDICTED: uncharacterized protein LOC102625...   712   0.0  
ref|XP_009356605.1| PREDICTED: aspartic proteinase nepenthesin-2...   710   0.0  
ref|XP_009377195.1| PREDICTED: aspartic proteinase nepenthesin-2...   710   0.0  
ref|XP_007048981.1| Eukaryotic aspartyl protease family protein ...   707   0.0  
gb|KRH46018.1| hypothetical protein GLYMA_08G307600 [Glycine max]     706   0.0  
ref|XP_011044763.1| PREDICTED: aspartic proteinase nepenthesin-2...   704   0.0  
ref|XP_002307559.2| hypothetical protein POPTR_0005s22630g [Popu...   703   0.0  
ref|XP_011467594.1| PREDICTED: aspartic proteinase nepenthesin-2...   698   0.0  
ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit,...   698   0.0  
ref|XP_007146226.1| hypothetical protein PHAVU_006G022800g [Phas...   697   0.0  
ref|XP_006595492.1| PREDICTED: uncharacterized protein LOC100305...   693   0.0  
gb|KHN11976.1| Aspartic proteinase nepenthesin-1 [Glycine soja]       692   0.0  
gb|KRH14503.1| hypothetical protein GLYMA_14G030000 [Glycine max]     692   0.0  

>ref|XP_007215315.1| hypothetical protein PRUPE_ppa003547mg [Prunus persica]
            gi|462411465|gb|EMJ16514.1| hypothetical protein
            PRUPE_ppa003547mg [Prunus persica]
          Length = 566

 Score =  741 bits (1912), Expect = 0.0
 Identities = 377/575 (65%), Positives = 445/575 (77%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFISK-AIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGC 1619
            MVVK S +LVLLV+F  +  AIA  HNH+N  PNGST+ G+E P+HMSFNAVS+S  TGC
Sbjct: 1    MVVKASLILVLLVIFSCTLVAIAGIHNHNNQTPNGSTLAGMEVPEHMSFNAVSSSSHTGC 60

Query: 1618 SLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIA--SFLSLKSHKQSVKFHLKRQSSS 1445
            SL++S+KT+QS+S ME   K            +++A  S   +++HKQSVK HL+ +S +
Sbjct: 61   SLSSSRKTKQSDSTME---KAVSDNEESDDDEDEVADDSVTKMRTHKQSVKLHLRHRSQN 117

Query: 1444 SETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXX 1265
             E+E K SV + T+RDL RIQ L+ R+VEKKNQNTISRLQ+ K K+  +           
Sbjct: 118  RESERKSSVIESTVRDLVRIQTLHTRIVEKKNQNTISRLQKDK-KVHEFKPVVAPAASP- 175

Query: 1264 XPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY 1085
              ES  +  SGQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNW+QC PCY
Sbjct: 176  --ESYTSELSGQLQATLKSGVSLGSGEYFMDVFIGTPPKHFSLILDTGSDLNWVQCAPCY 233

Query: 1084 DCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTT 905
             CFEQ+GP YDPKDSTSFR+ISC+DPRC+LVSSPDPPQPCK+E+QTCPY+YWYGDSSNTT
Sbjct: 234  ACFEQDGPHYDPKDSTSFRDISCQDPRCRLVSSPDPPQPCKAENQTCPYFYWYGDSSNTT 293

Query: 904  GDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQ 725
            GDF+LETFTVNLT+ +GK +FK+VENVMFGCGHWN                     SQLQ
Sbjct: 294  GDFSLETFTVNLTSHTGKTDFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 353

Query: 724  SLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIK 551
            SLYGHSFSYCLV+RNS  N SSKL+FGEDK+LL +P+L++TSLV GKENP DTFYYVQIK
Sbjct: 354  SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKELLSHPKLSYTSLVGGKENPADTFYYVQIK 413

Query: 550  SIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVK 371
            SIMVG E ++IPEETWN +             TLSYFA+PA ++IKEAF K+VKGY  VK
Sbjct: 414  SIMVGGEVVDIPEETWNLTPEGAGGTIIDSGTTLSYFADPAYQIIKEAFSKKVKGYPVVK 473

Query: 370  LEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
              DFP LD CYNVSGVEK+ LP+F I FADGAVW+FPV+NYFIQIDPQEVVCLA++GTP+
Sbjct: 474  --DFPFLDPCYNVSGVEKIVLPEFAILFADGAVWDFPVENYFIQIDPQEVVCLAVLGTPK 531

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            SGLSIIGNYQQQNFHI+YDTKKSRLGY PMKCADV
Sbjct: 532  SGLSIIGNYQQQNFHILYDTKKSRLGYVPMKCADV 566


>ref|XP_008229704.1| PREDICTED: uncharacterized protein LOC103329046 [Prunus mume]
          Length = 776

 Score =  736 bits (1901), Expect = 0.0
 Identities = 375/574 (65%), Positives = 442/574 (77%), Gaps = 6/574 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLF-FISKAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGC 1619
            MVVK S +LVLLV+F +   AI   H+H+N  PNGST+ G+E P+HMSFNAVS+S  TGC
Sbjct: 1    MVVKASLILVLLVIFSYTLVAIVGIHDHNNQTPNGSTLAGMELPEHMSFNAVSSSSHTGC 60

Query: 1618 SLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIA--SFLSLKSHKQSVKFHLKRQSSS 1445
            SL++SKKT+QS+S ME   K            +++   S   ++ HKQSVK HL+ +S +
Sbjct: 61   SLSSSKKTKQSDSTME---KAVSDNEESDDEDDEVVDDSMTKIRPHKQSVKLHLRHRSLN 117

Query: 1444 SETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXX 1265
             E+E K SV + T+RDL RIQ L+ R+VEKKNQNTISRLQ+ K K+  +           
Sbjct: 118  RESERKSSVIESTVRDLVRIQTLHTRIVEKKNQNTISRLQKDK-KVHEFKPVVAPAASP- 175

Query: 1264 XPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY 1085
              ES  +  SGQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNW+QC PCY
Sbjct: 176  --ESYTSELSGQLQATLKSGVSHGSGEYFMDVFIGTPPKHFSLILDTGSDLNWVQCAPCY 233

Query: 1084 DCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTT 905
             CFEQ+GP YDPKDSTSFR+ISC+DPRC+LVSSPDPPQPCK+E+QTCPY+YWYGDSSNTT
Sbjct: 234  ACFEQDGPHYDPKDSTSFRDISCQDPRCRLVSSPDPPQPCKAENQTCPYFYWYGDSSNTT 293

Query: 904  GDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQ 725
            GDF+LETFTVNLT+ +GK +FK+VENVMFGCGHWN                     SQLQ
Sbjct: 294  GDFSLETFTVNLTSDTGKADFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 353

Query: 724  SLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIK 551
            SLYGHSFSYCLV+RNS  N SSKL+FGEDK+LL +P+L++TSLV GKENP DTFYYVQIK
Sbjct: 354  SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKELLSHPKLSYTSLVGGKENPADTFYYVQIK 413

Query: 550  SIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVK 371
            SIMVG E ++IPEETWN +             TLSYFA+PA ++IKEAF K+VKGY  VK
Sbjct: 414  SIMVGGEVVDIPEETWNLTPEGAGGTIIDSGTTLSYFADPAYQIIKEAFSKKVKGYPVVK 473

Query: 370  LEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
              DFP LD CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQIDPQEVVCLA++GTP+
Sbjct: 474  --DFPFLDPCYNVSGVEKIELPEFTILFADGAVWDFPVENYFIQIDPQEVVCLAVLGTPK 531

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCAD 92
            SGLSIIGNYQQQNFHI+YDTKKSRLGY PMKCAD
Sbjct: 532  SGLSIIGNYQQQNFHILYDTKKSRLGYVPMKCAD 565


>ref|XP_010087862.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
            gi|587839644|gb|EXB30298.1| Aspartic proteinase
            nepenthesin-1 [Morus notabilis]
          Length = 564

 Score =  731 bits (1886), Expect = 0.0
 Identities = 371/572 (64%), Positives = 431/572 (75%), Gaps = 6/572 (1%)
 Frame = -1

Query: 1786 KVSPMLVLLVLFFISK--AIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGCSL 1613
            KVS +++LLV+F ++   AIA NHN      NGS++ GIEFPDHMSFNAVS+S ++ CSL
Sbjct: 5    KVSSVIILLVIFSVAAIGAIAGNHNK-----NGSSLPGIEFPDHMSFNAVSSSVTSDCSL 59

Query: 1612 TNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSETE 1433
              S   +QSES  +                  I+S LSL+ +KQ+VK HL R+    E+E
Sbjct: 60   ATSNDDKQSESNSKFDDNEGYDDNDDGNEGINISSVLSLEQNKQAVKLHLNRRG---ESE 116

Query: 1432 PKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKK-AKIQRYDXXXXXXXXXXXPE 1256
            PKKSV   TIRD+ RIQ L++R+ EKKNQ+ +SRL++KK  K+Q+               
Sbjct: 117  PKKSVISSTIRDIARIQTLHKRMTEKKNQSNVSRLKKKKNKKVQK----KANQPISPVVA 172

Query: 1255 SAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDCF 1076
              P  ++G+LMATLESGVGFGSGEYFMDVF+GTPPKHFSLILDTGSDLNWIQCVPC+DCF
Sbjct: 173  QLPESYAGRLMATLESGVGFGSGEYFMDVFIGTPPKHFSLILDTGSDLNWIQCVPCHDCF 232

Query: 1075 EQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGDF 896
            EQNGP+YDPK+STSFRNISC+DPRC+LVSSPDPP+PCKSESQ+CPYYYWYGDSSNTTGDF
Sbjct: 233  EQNGPYYDPKESTSFRNISCRDPRCQLVSSPDPPKPCKSESQSCPYYYWYGDSSNTTGDF 292

Query: 895  ALETFTVNLTA-LSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSL 719
            A+ETFTVNLT   +GK EF++VENVMFGCGHWN                     SQLQSL
Sbjct: 293  AVETFTVNLTTGATGKAEFRRVENVMFGCGHWNRGLFKGAAGLLGLGRGPLSFSSQLQSL 352

Query: 718  YGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSI 545
            YGHSFSYCLV+RNS  N SSKL+FGEDKDLL  PELNFT+LV GKENPVDTFYYV+IK I
Sbjct: 353  YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSRPELNFTALVAGKENPVDTFYYVEIKYI 412

Query: 544  MVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLE 365
            +VG E LNIPEETW  S             TLSYF +PA ++IKEAF K+VKGY+ VK+E
Sbjct: 413  LVGGEVLNIPEETWKLSPEGYGGTIIDSGTTLSYFQDPAYQVIKEAFLKKVKGYQLVKIE 472

Query: 364  DFPLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSGL 185
            DFPLDLCYNVSGVE +ELPDFGI F+DG VWNFPV+NYFIQ+DPQEVVCLA   T  S L
Sbjct: 473  DFPLDLCYNVSGVENIELPDFGILFSDGGVWNFPVENYFIQVDPQEVVCLAFKNTSASAL 532

Query: 184  SIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            SIIGNYQQQNFHI+YDT KSRLGYAP  CADV
Sbjct: 533  SIIGNYQQQNFHILYDTNKSRLGYAPRNCADV 564


>ref|XP_006437136.1| hypothetical protein CICLE_v10031104mg [Citrus clementina]
            gi|557539332|gb|ESR50376.1| hypothetical protein
            CICLE_v10031104mg [Citrus clementina]
          Length = 567

 Score =  721 bits (1861), Expect = 0.0
 Identities = 370/576 (64%), Positives = 436/576 (75%), Gaps = 7/576 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNH---DNLNPNGSTVTGIEFPDHMSFNAVSASGS 1628
            MV KVS +LVLL +   S  A+AR H+H   ++ N N S++ GI+ PDHMSFNAVS+S +
Sbjct: 1    MVFKVSLVLVLLSISAGSFDAVARAHDHRRTNSFNSNTSSLAGIKLPDHMSFNAVSSSTN 60

Query: 1627 TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSS 1448
            +GCS + S K    E  ++   K                  L+LK  KQ VK HLK +S 
Sbjct: 61   SGCSFSKSNKPTHPER-IDTQEKDGDVALDDDDGD----DLLTLKLSKQKVKLHLKHRSK 115

Query: 1447 SSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXX 1268
            + ETEPKKSV++ TIRDLTRIQ L+RR++EKKNQNT+SRL+++  K ++           
Sbjct: 116  NRETEPKKSVSESTIRDLTRIQALHRRIIEKKNQNTVSRLKKESQKSKKQ--IKPVVTPA 173

Query: 1267 XXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 1088
              PES  +G SGQL+ATLESGV  G+GEYFMDVFVGTPPKH+  ILDTGSDLNWIQCVPC
Sbjct: 174  ASPESYASGVSGQLVATLESGVSLGAGEYFMDVFVGTPPKHYYFILDTGSDLNWIQCVPC 233

Query: 1087 YDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNT 908
            YDCFEQNGP YDPKDS+SF+NISC DPRC LVSSPDPP+PC++E+QTCPY+YWYGDSSNT
Sbjct: 234  YDCFEQNGPHYDPKDSSSFKNISCHDPRCHLVSSPDPPRPCQAENQTCPYFYWYGDSSNT 293

Query: 907  TGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQL 728
            TGDFALETFTVNL+  +GK EF+QVENVMFGCGHWN                     SQL
Sbjct: 294  TGDFALETFTVNLSTPTGKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 353

Query: 727  QSLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQI 554
            QSLYGHSFSYCLV+RNS  N SSKL+FGEDKDLL +P LNFTSLV+GKENPVDTFYY+QI
Sbjct: 354  QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPNLNFTSLVSGKENPVDTFYYLQI 413

Query: 553  KSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEV 374
            KSI+VG E L+IP+ETW  S             TLSYFAEPA ++IK+AF K+VKGY  V
Sbjct: 414  KSIIVGGEVLSIPDETWRLSPEGAGGTIIDSGTTLSYFAEPAYQIIKQAFMKKVKGYPLV 473

Query: 373  KLEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTP 197
            K  DFP LD CYNVSG+EKMELP+FGIQFADG VWNFPV+NYFI++DP++VVCLAI+GTP
Sbjct: 474  K--DFPILDPCYNVSGIEKMELPEFGIQFADGGVWNFPVENYFIRLDPEDVVCLAILGTP 531

Query: 196  RSGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            RS LSIIGNYQQQNFHI+YDTK SRLGYAPM+CAD+
Sbjct: 532  RSALSIIGNYQQQNFHILYDTKNSRLGYAPMRCADI 567


>ref|XP_003553140.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Glycine
            max] gi|947049434|gb|KRG98962.1| hypothetical protein
            GLYMA_18G110200 [Glycine max]
          Length = 560

 Score =  719 bits (1857), Expect = 0.0
 Identities = 366/574 (63%), Positives = 432/574 (75%), Gaps = 5/574 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS--KAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTG 1622
            MV++VS +LVLLVL      ++I  +HNH++LN NGS++  ++FPDH  FNAVS+S  TG
Sbjct: 1    MVLRVSLILVLLVLHCSCTVQSIFGHHNHNDLNKNGSSLAAVKFPDHAHFNAVSSSTETG 60

Query: 1621 CSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSS 1442
            CS + S+K + S + M + G                 +F++ K HKQSVK +L+  S S 
Sbjct: 61   CSFSKSEKFEPSVATMTSNGDTDGEEGE---------AFVAAKQHKQSVKLNLRHHSVSK 111

Query: 1441 ETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXX 1262
            ++EPK+SV D T+RDL RIQ L+RRV+EKKNQNTISRL++   + ++             
Sbjct: 112  DSEPKRSVADSTVRDLKRIQTLHRRVIEKKNQNTISRLEKAPEQSKK---SYKLAAAAAA 168

Query: 1261 PESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYD 1082
            P + P  FSGQL+ATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY 
Sbjct: 169  PAAPPEYFSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYA 228

Query: 1081 CFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTG 902
            CFEQNGP+YDPKDS+SF+NI+C DPRC+LVSSPDPPQPCK E+Q+CPY+YWYGDSSNTTG
Sbjct: 229  CFEQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTG 288

Query: 901  DFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQS 722
            DFALETFTVNLT   GKPE K VENVMFGCGHWN                     +QLQS
Sbjct: 289  DFALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQS 348

Query: 721  LYGHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKS 548
            LYGHSFSYCLV+RNSN+  SSKL+FGEDK+LL +P LNFTS V GKENPVDTFYYV IKS
Sbjct: 349  LYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKS 408

Query: 547  IMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKL 368
            IMVG E L IPEETW+ SA            TL+YFAEPA  +IKEAF +++KG+  V  
Sbjct: 409  IMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLV-- 466

Query: 367  EDF-PLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRS 191
            E F PL  CYNVSGVEKMELP+F I FADGA+W+FPV+NYFIQI+P++VVCLAI+GTPRS
Sbjct: 467  ETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRS 526

Query: 190  GLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
             LSIIGNYQQQNFHI+YD KKSRLGYAPMKCADV
Sbjct: 527  ALSIIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560


>ref|XP_008380197.1| PREDICTED: aspartic proteinase nepenthesin-2 [Malus domestica]
          Length = 557

 Score =  715 bits (1846), Expect = 0.0
 Identities = 368/573 (64%), Positives = 425/573 (74%), Gaps = 4/573 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGC 1619
            M VK S +LVLLV+F  +  AIAR HNH N N N ST  GI+ P HMSFNAVS S  TGC
Sbjct: 1    MAVKASILLVLLVIFSSTLAAIARVHNHRNHNSNASTFAGIQLPKHMSFNAVSXSSHTGC 60

Query: 1618 SLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSE 1439
            SL++SK   Q +SAME                        L  HKQ++K HL+ +S + +
Sbjct: 61   SLSSSKIPNQPDSAMEEAEDSDTDEAAP-----------ELNPHKQTMKLHLRHRSQNKQ 109

Query: 1438 TEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXXP 1259
            +E K SV + T+RDL RIQ L+ R+VEKKNQNT SRLQ  K K  ++            P
Sbjct: 110  SERKSSVIESTVRDLVRIQTLHTRIVEKKNQNTFSRLQ--KDKKPKHHQSNPVVAPAASP 167

Query: 1258 ESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDC 1079
            ES  N  SGQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNW+QC PC+DC
Sbjct: 168  ESYTNELSGQLQATLKSGVSLGSGEYFMDVFIGTPPKHFSLILDTGSDLNWVQCAPCHDC 227

Query: 1078 FEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGD 899
            FEQ+GP YDPKDSTSF +ISCKDPRC+LVSSPDPPQPCKSE+QTCPY+YWYGDSSNTTGD
Sbjct: 228  FEQHGPHYDPKDSTSFLDISCKDPRCRLVSSPDPPQPCKSENQTCPYFYWYGDSSNTTGD 287

Query: 898  FALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSL 719
            FALETFT+NLT+ S K EFK+VENVMFGCGHWN                     SQLQSL
Sbjct: 288  FALETFTINLTS-SNKAEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSL 346

Query: 718  YGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSI 545
            YGHSFSYCLV+RNS  N SSKL+FGEDK+LL +P+L +TSLV GKENP DTFYYV+IKSI
Sbjct: 347  YGHSFSYCLVDRNSDANVSSKLIFGEDKNLLSHPKLTYTSLVGGKENPADTFYYVEIKSI 406

Query: 544  MVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLE 365
            MVG E ++IP ETW  S             TLSYFA+PA ++IK+AF K+VKGY  V   
Sbjct: 407  MVGGEAVDIPAETWKLSPEGAGGTIVDSGTTLSYFADPAYQIIKDAFEKKVKGYPVV--N 464

Query: 364  DFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSG 188
            DFP L+ CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQI+PQE+VCLA++GTP+SG
Sbjct: 465  DFPFLEPCYNVSGVEKIELPEFAIVFADGAVWDFPVENYFIQIEPQEIVCLAVLGTPKSG 524

Query: 187  LSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            LSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 525  LSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557


>ref|XP_008353635.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Malus domestica]
          Length = 558

 Score =  714 bits (1843), Expect = 0.0
 Identities = 369/574 (64%), Positives = 432/574 (75%), Gaps = 5/574 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGS-TG 1622
            M V+ S ++VLLV+F  +  AIA  H+HDN NPN ST+ GI+ P+HMSFNAVS+S S TG
Sbjct: 1    MAVRGSILVVLLVIFSSTLAAIAGLHSHDNHNPNASTLAGIQLPEHMSFNAVSSSSSHTG 60

Query: 1621 CSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSS 1442
            CSL++SK+T QS+SAME                        L  HKQ++K HL+ +S + 
Sbjct: 61   CSLSSSKRTNQSDSAMEDADDSDDDEAAP-----------ELNPHKQTMKLHLRHRSQNK 109

Query: 1441 ETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXX 1262
            ++E K SV + T+RDL RIQ L+ R+VEKKNQNTISRLQ+ K K + Y            
Sbjct: 110  QSERKNSVIESTVRDLIRIQTLHTRIVEKKNQNTISRLQKDK-KPKPYQSNPVVAPAASP 168

Query: 1261 PESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYD 1082
             ES  N  SGQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNW+QC PCYD
Sbjct: 169  -ESYTNELSGQLQATLKSGVSLGSGEYFMDVFIGTPPKHFSLILDTGSDLNWVQCAPCYD 227

Query: 1081 CFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTG 902
            CFEQ GP YDPKDSTSF +ISCKDPRC+L+SSPDPPQPCKSE+QTCPY+YWYGDSSNTTG
Sbjct: 228  CFEQQGPHYDPKDSTSFLDISCKDPRCQLISSPDPPQPCKSENQTCPYFYWYGDSSNTTG 287

Query: 901  DFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQS 722
            DF+LETFTVNLT+   K EFK+ ENVMFGCGHWN                     SQLQS
Sbjct: 288  DFSLETFTVNLTS-PNKAEFKRAENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQS 346

Query: 721  LYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKS 548
            LYGHSFSYCLV+RNS  N SSKL+FGEDK+LL +P+L++TSLV GKENP DTFYYV IKS
Sbjct: 347  LYGHSFSYCLVDRNSDANVSSKLIFGEDKNLLSHPKLSYTSLVGGKENPADTFYYVXIKS 406

Query: 547  IMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKL 368
            +MVG E ++IP ETW  S             TLSYFA+PA ++IKEAF K+VK Y  VK 
Sbjct: 407  VMVGGEAVDIPAETWKLSPXGAGGTIIDSGTTLSYFADPAYQIIKEAFEKKVKXYPVVK- 465

Query: 367  EDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRS 191
             DFP L+ CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQI+PQE+VCLA++GTP+S
Sbjct: 466  -DFPILEPCYNVSGVEKIELPEFAIVFADGAVWDFPVENYFIQIEPQEMVCLAVLGTPKS 524

Query: 190  GLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            GLSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 525  GLSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 558


>ref|XP_006484924.1| PREDICTED: uncharacterized protein LOC102625748 [Citrus sinensis]
          Length = 820

 Score =  712 bits (1839), Expect = 0.0
 Identities = 367/572 (64%), Positives = 432/572 (75%), Gaps = 7/572 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNH---DNLNPNGSTVTGIEFPDHMSFNAVSASGS 1628
            MV KVS +LVLL +   S  A+AR H+H   ++ N N S++ GI+ PDHMSFNAVS+S +
Sbjct: 1    MVFKVSLVLVLLSISAGSFDAVARAHDHRRTNSFNSNTSSLAGIKLPDHMSFNAVSSSTN 60

Query: 1627 TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSS 1448
            +GCS + S K    E  ++   K                  L+LK  KQ VK HLK +S 
Sbjct: 61   SGCSFSKSNKPTHPER-IDTQEKDGDVALDDDDGD----DLLTLKLSKQKVKLHLKHRSK 115

Query: 1447 SSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXX 1268
            + ETEPKKSV++ TIRDLTRIQ L+RR++EKKNQNT+SRL+++  K ++           
Sbjct: 116  NRETEPKKSVSESTIRDLTRIQALHRRIIEKKNQNTVSRLKKESQKSKKQ--IKPVLTPA 173

Query: 1267 XXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 1088
              PES  +G SGQL+ATLESGV  G+GEYFMDVFVGTPPKH+  ILDTGSDLNWIQCVPC
Sbjct: 174  ASPESYASGVSGQLVATLESGVSLGAGEYFMDVFVGTPPKHYYFILDTGSDLNWIQCVPC 233

Query: 1087 YDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNT 908
            YDCFEQNGP YDPKDS+SF+NISC DPRC LVSSPDPP+PC++E+QTCPY+YWYGDSSNT
Sbjct: 234  YDCFEQNGPHYDPKDSSSFKNISCHDPRCHLVSSPDPPRPCQAENQTCPYFYWYGDSSNT 293

Query: 907  TGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQL 728
            TGDFALETFTVNL+  +GK EF+QVENVMFGCGHWN                     SQL
Sbjct: 294  TGDFALETFTVNLSTPTGKSEFRQVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 353

Query: 727  QSLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQI 554
            QSLYGHSFSYCLV+RNS  N SSKL+FGEDKDLL +P LNFTSLV+GKENPVDTFYY+QI
Sbjct: 354  QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPNLNFTSLVSGKENPVDTFYYLQI 413

Query: 553  KSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEV 374
            KSI+VG E L+IP+ETW  S             TLSYFAEPA ++IK+AF K+VKGY  V
Sbjct: 414  KSIIVGGEVLSIPDETWRLSPEGAGGTIIDSGTTLSYFAEPAYQIIKQAFMKKVKGYPLV 473

Query: 373  KLEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTP 197
            K  DFP LD CYNVSG+EKMELP+FGIQFADG VWNFPV+NYFI++DP++VVCLAI+GTP
Sbjct: 474  K--DFPILDPCYNVSGIEKMELPEFGIQFADGGVWNFPVENYFIRLDPEDVVCLAILGTP 531

Query: 196  RSGLSIIGNYQQQNFHIVYDTKKSRLGYAPMK 101
            RS LSIIGNYQQQNFHI+YDTK SRLGYAPM+
Sbjct: 532  RSALSIIGNYQQQNFHILYDTKNSRLGYAPMR 563


>ref|XP_009356605.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Pyrus x
            bretschneideri]
          Length = 557

 Score =  710 bits (1832), Expect = 0.0
 Identities = 366/573 (63%), Positives = 424/573 (73%), Gaps = 4/573 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGC 1619
            M VK   +LVLLV+F  +  AIAR HNH N N N ST  GI+ P+HMSFNAVS+S  TGC
Sbjct: 1    MAVKAPILLVLLVIFSSTLAAIARVHNHRNHNSNASTFAGIQLPEHMSFNAVSSSSHTGC 60

Query: 1618 SLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSE 1439
            SL+ SK   Q++S ME                        L  HKQ++K HL+ +S + +
Sbjct: 61   SLSCSKIPNQADSTMEEAEDSDTDEAAP-----------ELNPHKQTMKLHLRHRSQNKQ 109

Query: 1438 TEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXXP 1259
            +E K SV + T+RDL RIQ L+ R+VEKKNQNT SRLQ  K K  ++            P
Sbjct: 110  SERKSSVIESTVRDLVRIQTLHTRIVEKKNQNTFSRLQ--KDKKPKHHQSNQVVAPAASP 167

Query: 1258 ESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDC 1079
            ES  N  SGQL ATL+SGV  GSGEYFMDVF+GTPPK FSLILDTGSDLNW+QCVPC+DC
Sbjct: 168  ESYTNELSGQLQATLKSGVSLGSGEYFMDVFIGTPPKQFSLILDTGSDLNWVQCVPCHDC 227

Query: 1078 FEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGD 899
            FEQ+GP YDPKDSTSFR+ISCKD RC+LVSSPDPPQPCKSE+QTCPY+YWYGDSSNTTGD
Sbjct: 228  FEQHGPHYDPKDSTSFRDISCKDSRCRLVSSPDPPQPCKSENQTCPYFYWYGDSSNTTGD 287

Query: 898  FALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSL 719
            FA ETFTVNLT+ S K EFK+V+NVMFGCGHWN                     SQLQSL
Sbjct: 288  FARETFTVNLTS-SNKAEFKRVDNVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSL 346

Query: 718  YGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSI 545
            YGHSFSYCLV+RNS  N SSKL+FGEDK LL +P+L +TSLV GKENP DTFYYV+IKS+
Sbjct: 347  YGHSFSYCLVDRNSDANVSSKLIFGEDKTLLSHPKLTYTSLVGGKENPADTFYYVEIKSV 406

Query: 544  MVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLE 365
            MVG E ++IP ETW  S             TLSYFA+PA ++IKEAF K+VKGY  VK  
Sbjct: 407  MVGGEAVDIPAETWKLSPEGAGGTIIDSGTTLSYFADPAYQIIKEAFEKKVKGYPVVK-- 464

Query: 364  DFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSG 188
            DFP L+ CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQI+PQE+VCLA++GTP+SG
Sbjct: 465  DFPFLEPCYNVSGVEKIELPEFAIVFADGAVWDFPVENYFIQIEPQEIVCLAVLGTPKSG 524

Query: 187  LSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            LSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 525  LSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557


>ref|XP_009377195.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Pyrus x
            bretschneideri]
          Length = 559

 Score =  710 bits (1832), Expect = 0.0
 Identities = 365/575 (63%), Positives = 433/575 (75%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFIS-KAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGS--T 1625
            M V+ S ++VLLV+F  +  AIA  H+HDN NPN ST+ GI+ P+HMSFNA+S+S S  T
Sbjct: 1    MAVRGSILVVLLVIFSSTLAAIAGLHSHDNHNPNASTLAGIQLPEHMSFNAISSSSSSHT 60

Query: 1624 GCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSS 1445
            GCSL++SK+T QS+SAME +G                     L  HKQ++K HL+ +S +
Sbjct: 61   GCSLSSSKRTNQSDSAMEDVGDSDDDEAAP-----------ELNPHKQTMKLHLRHRSQN 109

Query: 1444 SETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXX 1265
             ++E K SV + T+RDL RIQ L+ R+VEKKNQNTISRLQ+ K K + Y           
Sbjct: 110  KQSERKSSVIESTVRDLIRIQTLHTRIVEKKNQNTISRLQKDK-KPKPYKSNPVVAPAAS 168

Query: 1264 XPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY 1085
              ES  N  SGQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNW+QC PC+
Sbjct: 169  P-ESYTNELSGQLQATLKSGVSLGSGEYFMDVFIGTPPKHFSLILDTGSDLNWVQCAPCH 227

Query: 1084 DCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTT 905
            DCFEQ GP YDPKDSTSF +ISCKDPRC+LVSSPDP QPCKSE+QTCPY+YWYGDSSNTT
Sbjct: 228  DCFEQQGPHYDPKDSTSFLDISCKDPRCRLVSSPDPAQPCKSENQTCPYFYWYGDSSNTT 287

Query: 904  GDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQ 725
            GDF+LETFTVNLT+   K  FK+VENVMFGCGHWN                     SQLQ
Sbjct: 288  GDFSLETFTVNLTS-PNKAGFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 346

Query: 724  SLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIK 551
            SLYGHSFSYCLV+RNS  N SSKL+FGEDK+LL +P+L++TSLV GKENP DTFYYV+IK
Sbjct: 347  SLYGHSFSYCLVDRNSDANVSSKLIFGEDKNLLSHPKLSYTSLVGGKENPADTFYYVEIK 406

Query: 550  SIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVK 371
            S+MVG E ++IP ETW  +             TLSYFA+PA ++IKEAF K+VKGY  VK
Sbjct: 407  SVMVGGEAVDIPAETWKLAPEGAGGTIIDSGTTLSYFADPAYQMIKEAFEKKVKGYPVVK 466

Query: 370  LEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
              +FP L+ CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQI+PQE+VCLA++GTP+
Sbjct: 467  --EFPILEPCYNVSGVEKIELPEFAIVFADGAVWDFPVENYFIQIEPQEMVCLAVLGTPK 524

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            S LSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 525  SSLSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 559


>ref|XP_007048981.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao] gi|508701242|gb|EOX93138.1| Eukaryotic aspartyl
            protease family protein isoform 1 [Theobroma cacao]
          Length = 597

 Score =  707 bits (1825), Expect = 0.0
 Identities = 374/598 (62%), Positives = 439/598 (73%), Gaps = 17/598 (2%)
 Frame = -1

Query: 1831 ILWVTKQQEFVTMVVKVSPMLVLLVLFFIS---KAIARNHNHDNL---NPNGSTVTGIEF 1670
            I +VTKQ++   M+VKVS  L+LL+L   S   +AIAR H HD++   N N ST+TGIE 
Sbjct: 22   IFFVTKQEKISAMLVKVSLSLLLLLLSISSGAFEAIARVH-HDHIKNGNSNISTLTGIEL 80

Query: 1669 PDHMSFNAVSASGS-TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLK 1493
            PDHMSFNAVS+S S +GCSL+  KK + S+                     +++S+L  +
Sbjct: 81   PDHMSFNAVSSSTSNSGCSLSKQKKAKPSQKIASQ----------------EVSSYLDDE 124

Query: 1492 SH-------KQSVKFHLKRQSSSSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTIS 1334
                     K+SVK HLK +    + EPK SV + T+RDLTRI+  + RV+EKKNQN IS
Sbjct: 125  DEEDEQQKPKKSVKLHLKHRQIDGKAEPKNSVLESTMRDLTRIRTFHTRVIEKKNQNVIS 184

Query: 1333 RLQQKKAKIQRYDXXXXXXXXXXXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTP 1154
            RL   + + +++            PE   +G  GQL+ATLESGV  GSGEYF+DVFVGTP
Sbjct: 185  RLNNDRKQSKQH--LKPVVEKAAAPEPYTSGVPGQLVATLESGVSLGSGEYFIDVFVGTP 242

Query: 1153 PKHFSLILDTGSDLNWIQCVPCYDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPP 974
            PKHFSLILDTGSDLNWIQCVPCYDCFEQNGP YDP++S+SFRNISC DPRC+LVSSPDPP
Sbjct: 243  PKHFSLILDTGSDLNWIQCVPCYDCFEQNGPHYDPRESSSFRNISCHDPRCQLVSSPDPP 302

Query: 973  QPCKSESQTCPYYYWYGDSSNTTGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXX 794
            QPCK+E+QTCPYYYWYGDSSNTTGDFA+ETFTVNLT+ SGK EF+QVENVMFGCGHWN  
Sbjct: 303  QPCKAENQTCPYYYWYGDSSNTTGDFAVETFTVNLTSPSGKSEFRQVENVMFGCGHWNRG 362

Query: 793  XXXXXXXXXXXXXXXXXXXSQLQSLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPE 620
                               SQLQSLYGHSFSYCLV+RNS  N SSKL+FGEDKDLL +P 
Sbjct: 363  LFHGAAGLLGLGRGPLSFASQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPN 422

Query: 619  LNFTSLVTGKENPVDTFYYVQIKSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYF 440
            LNFT+LV GKEN VDTFYYVQIKS++VG E LNIPEETW  SA            TLSYF
Sbjct: 423  LNFTALVAGKENSVDTFYYVQIKSVIVGGEVLNIPEETWQLSADGAGGTIIDSGTTLSYF 482

Query: 439  AEPADRLIKEAFRKQVKGYKEVKLEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFP 263
            A+P  ++IK+AF K+ KGY  +K  DFP LD CYNVSGVE +ELPDFGIQF DGAVWNFP
Sbjct: 483  ADPTYQIIKDAFVKKTKGYPVLK--DFPVLDPCYNVSGVENVELPDFGIQFVDGAVWNFP 540

Query: 262  VDNYFIQIDPQEVVCLAIMGTPRSGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            V+NYFI ++ ++VVCLAI+GTPRS LSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 541  VENYFIWLE-EDVVCLAILGTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 597


>gb|KRH46018.1| hypothetical protein GLYMA_08G307600 [Glycine max]
          Length = 557

 Score =  706 bits (1822), Expect = 0.0
 Identities = 361/573 (63%), Positives = 426/573 (74%), Gaps = 4/573 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFISKAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGSTGCS 1616
            MV++VS +LVLLVL       +   +H++LN NGS++  ++FPDH  FNAVS+S  TGCS
Sbjct: 1    MVLRVSLILVLLVLHCSCTVESIWGHHNDLNKNGSSLAAVKFPDHAHFNAVSSSTETGCS 60

Query: 1615 LTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSET 1436
             + S+K + S + M +               ++  +F++ K HKQSVK +L+  S S ++
Sbjct: 61   FSKSEKFEPSVATMAS---------NEDTDGKEGEAFVAAKQHKQSVKLNLRHHSVSKDS 111

Query: 1435 EPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXXPE 1256
            EPKKSV D T RDL RIQ L+RRV+EKKNQNTISRL++ +   ++               
Sbjct: 112  EPKKSVVDSTGRDLKRIQTLHRRVIEKKNQNTISRLEKAQEHSKK-----SYKPPTVAAA 166

Query: 1255 SAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDCF 1076
            + P   SGQLMATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY CF
Sbjct: 167  APPEYLSGQLMATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACF 226

Query: 1075 EQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGDF 896
            EQNGP+YDPKDS+SF+NI+C+DPRC+LVSSPDPPQPCK E+Q+CPY+YWYGDSSNTTGDF
Sbjct: 227  EQNGPYYDPKDSSSFKNITCRDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDF 286

Query: 895  ALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSLY 716
            ALETFTVNLT   GKPE K VENVMFGCGHWN                     +QLQSLY
Sbjct: 287  ALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLY 346

Query: 715  GHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSIM 542
            GHSFSYCLV+RNSN+  SSKL+FGEDK+LL +P LNFTS V GKENPVDTFYYVQIKSIM
Sbjct: 347  GHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVQIKSIM 406

Query: 541  VGAEKLNIPEETWNFSA-XXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLE 365
            VG E L IPEETW+ SA             TL+YFAEPA  +IKEAF +++KG+  V  E
Sbjct: 407  VGGEVLKIPEETWHLSAQGGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLV--E 464

Query: 364  DF-PLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSG 188
             F PL  CYNVSGVEKMELP+F I FADGAVWNFPV+NYFIQI+P++VVCLA++GTP S 
Sbjct: 465  TFPPLKPCYNVSGVEKMELPEFAILFADGAVWNFPVENYFIQIEPEDVVCLAVLGTPMSA 524

Query: 187  LSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            LSIIGNYQQQNFHI+YD KKSR+GYAPM CADV
Sbjct: 525  LSIIGNYQQQNFHILYDVKKSRIGYAPMNCADV 557


>ref|XP_011044763.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Populus
            euphratica]
          Length = 568

 Score =  704 bits (1817), Expect = 0.0
 Identities = 365/568 (64%), Positives = 427/568 (75%), Gaps = 6/568 (1%)
 Frame = -1

Query: 1774 MLVLLVLFFIS-KAIARNHNH-DNLNPNGSTVTGIEFPDHMSFNAVSASGS-TGCSLTNS 1604
            +LVL +LF  + +AIA  H+H  N+  N ST+ GIE PDHMSFNAVS+S + TGC+L  S
Sbjct: 10   VLVLSLLFSGAFEAIAGIHDHRKNVKSNISTLAGIELPDHMSFNAVSSSTTNTGCNLDTS 69

Query: 1603 KKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSETEPKK 1424
            KK +QS++ +                  +       K  KQ+VK HLK +S   ++E K+
Sbjct: 70   KKVKQSQTIVSQEDFDLLEDDDDDDEGGEEG-----KEAKQTVKLHLKHRSKDRKSEGKE 124

Query: 1423 SVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXXPESAPN 1244
            S  + T RDL RIQ L+ R++EKKNQN ISRL++ K + ++             PES   
Sbjct: 125  SFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKKRPEKQ--IKTVVATAASPESYGT 182

Query: 1243 GFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDCFEQNG 1064
            G SGQLMATLESGV  GSGEYFMDVF+GTPPKH+SLILDTGSDLNWIQCVPC+DCFEQNG
Sbjct: 183  GLSGQLMATLESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNG 242

Query: 1063 PFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGDFALET 884
            P+YDPK+S+SFRNI C DPRC LVSSPDPP PCK+E+QTCPY+YWYGDSSNTTGDFALET
Sbjct: 243  PYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFALET 302

Query: 883  FTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSLYGHSF 704
            FTVNLT+ +GK EFK+VENVMFGCGHWN                     SQLQSLYGHSF
Sbjct: 303  FTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSF 362

Query: 703  SYCLVERN--SNASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSIMVGAE 530
            SYCLV+RN  SN SSKL+FGEDKDLL +PELNFT+LV GKENPVDTFYYVQIKSIMVG E
Sbjct: 363  SYCLVDRNSDSNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGE 422

Query: 529  KLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLEDFP-L 353
             LNIPE TWN ++            TLSYFAEPA ++IK+AF K+VKGY  V  +DFP L
Sbjct: 423  VLNIPEGTWNLTSDGVGGTIVDSGTTLSYFAEPAYQIIKDAFVKKVKGYPTV--QDFPIL 480

Query: 352  DLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSGLSIIG 173
            D CYNVSGVEK+ELPDFG+ FADGAVWNFPV+NYFI++DP+EVVCLAI+GTPRS LSIIG
Sbjct: 481  DPCYNVSGVEKIELPDFGLLFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIG 540

Query: 172  NYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            NYQQQNFH++YDTKK+RLGYAPM CADV
Sbjct: 541  NYQQQNFHVLYDTKKARLGYAPMNCADV 568


>ref|XP_002307559.2| hypothetical protein POPTR_0005s22630g [Populus trichocarpa]
            gi|550339548|gb|EEE94555.2| hypothetical protein
            POPTR_0005s22630g [Populus trichocarpa]
          Length = 566

 Score =  703 bits (1814), Expect = 0.0
 Identities = 364/574 (63%), Positives = 426/574 (74%), Gaps = 9/574 (1%)
 Frame = -1

Query: 1783 VSPMLVLLVLFFIS----KAIARNHNHD-NLNPNGSTVTGIEFPDHMSFNAVSASGS-TG 1622
            +S   ++LVLF +     +AIA  H+H  N+  N ST+ GIE PDHMSFNAVS+S + TG
Sbjct: 2    LSRSFIVLVLFLLFSGAFEAIAGIHDHGKNVKSNISTLAGIELPDHMSFNAVSSSTTNTG 61

Query: 1621 CSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSS 1442
            C+L  SKK +QS++ +                  +       K  KQ+VK HLK +S   
Sbjct: 62   CNLDTSKKVKQSQTIVSKEDFDLLEDDDDDDEGGE-----EEKEAKQTVKLHLKHRSKDR 116

Query: 1441 ETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXX 1262
            ++E K+S  + T RDL RIQ L+ R++EKKNQN ISRL++ K + ++             
Sbjct: 117  KSEGKESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERPEKQ--IKTVVATAAS 174

Query: 1261 PESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYD 1082
            PES   G SGQLMATLESGV  GSGEYFMDVF+GTPPKH+SLILDTGSDLNWIQCVPC+D
Sbjct: 175  PESYGTGLSGQLMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHD 234

Query: 1081 CFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTG 902
            CFEQNGP+YDPK+S+SFRNI C DPRC LVSSPDPP PCK+E+QTCPY+YWYGDSSNTTG
Sbjct: 235  CFEQNGPYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTG 294

Query: 901  DFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQS 722
            DFA ETFTVNLT+ +GK EFK+VENVMFGCGHWN                     SQLQS
Sbjct: 295  DFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQS 354

Query: 721  LYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKS 548
            LYGHSFSYCLV+RNS  N SSKL+FGEDKDLL +PELNFT+LV GKENPVDTFYYVQIKS
Sbjct: 355  LYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKS 414

Query: 547  IMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKL 368
            IMVG E LNIPE TWN ++            TLSYF EPA ++IK+AF K+VKGY  V  
Sbjct: 415  IMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFAKKVKGYPIV-- 472

Query: 367  EDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRS 191
            +DFP LD CYNVSGVEK++LPDFGI FADGAVWNFPV+NYFI++DP+EVVCLAI+GTPRS
Sbjct: 473  QDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRS 532

Query: 190  GLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
             LSIIGNYQQQNFH++YDTKKSRLGYAPM CADV
Sbjct: 533  ALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 566


>ref|XP_011467594.1| PREDICTED: aspartic proteinase nepenthesin-2 [Fragaria vesca subsp.
            vesca]
          Length = 551

 Score =  698 bits (1802), Expect = 0.0
 Identities = 359/575 (62%), Positives = 417/575 (72%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVL---FFISKAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGST 1625
            M V  S  L L+++    F+S  IAR H+H N +PNGST+ GIEFP+HMSFNAVS+S  T
Sbjct: 1    MFVNASLALALVLISASVFVS--IARVHSHSNHSPNGSTLAGIEFPEHMSFNAVSSSSHT 58

Query: 1624 GCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSS 1445
             CS   S+ T    S  E                       ++ SHKQSVK HL+ +   
Sbjct: 59   ACSFVKSESTTTESSPNEESDDEEDEDEE------------AVGSHKQSVKLHLRHE--- 103

Query: 1444 SETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXX 1265
                 KKSV + T++DLTRIQ L+ R+VE+KNQNTISRL + K    + +          
Sbjct: 104  -----KKSVFESTVKDLTRIQTLHTRIVERKNQNTISRLHKDKKVPTKEEVVVVAPAASP 158

Query: 1264 XPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY 1085
                + +   GQL ATL+SGV  GSGEYFMDVF+GTPPKHFSLILDTGSDLNWIQC PCY
Sbjct: 159  ESYVSASELPGQLEATLKSGVSLGSGEYFMDVFIGTPPKHFSLILDTGSDLNWIQCAPCY 218

Query: 1084 DCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTT 905
            DCFEQNGP YDPKDSTSF NI C DPRC+LVSSPDPPQPCK+E+QTCPY+YWYGDSSNTT
Sbjct: 219  DCFEQNGPHYDPKDSTSFSNIGCHDPRCQLVSSPDPPQPCKAENQTCPYFYWYGDSSNTT 278

Query: 904  GDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQ 725
            GDFALETFTVNLT+ +GK E+++VENVMFGCGHWN                     SQLQ
Sbjct: 279  GDFALETFTVNLTSPAGKAEYRKVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 338

Query: 724  SLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIK 551
            SLYGHSFSYCLV+RNS  N SSKL+FGEDK LL +P+LN+TSLV GK+NPVDTFYYVQIK
Sbjct: 339  SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKALLSHPQLNYTSLVAGKDNPVDTFYYVQIK 398

Query: 550  SIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVK 371
            SI+VG E +NIPEETW  S+            TLSYFA+PA  +IKEAF K++KGY  VK
Sbjct: 399  SILVGGEVVNIPEETWKLSSEGAGGTIIDSGTTLSYFADPAYDIIKEAFLKKIKGYPVVK 458

Query: 370  LEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
              DFP LD CYNVSGVEK+ELP+F I FADGAVW+FPV+NYFIQI+PQEVVCLA++GTP+
Sbjct: 459  --DFPVLDPCYNVSGVEKLELPEFAIVFADGAVWDFPVENYFIQIEPQEVVCLAMLGTPK 516

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            S LSIIGNYQQQNFHI+YDTKKSRLGYAPM CADV
Sbjct: 517  SALSIIGNYQQQNFHILYDTKKSRLGYAPMNCADV 551


>ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223540418|gb|EEF41987.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 557

 Score =  698 bits (1801), Expect = 0.0
 Identities = 361/578 (62%), Positives = 423/578 (73%), Gaps = 9/578 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFF--ISKAIARNHNHDNLNPNGSTVTGIEFPDHMSFNAVSASGS-- 1628
            M+ + SP+LVL+++F          NH+  N+N N ST+ GIE P HMSFNAVS+S +  
Sbjct: 1    MLSEFSPILVLVLIFSGAFEATAGINHHKKNVNSNFSTLAGIELPGHMSFNAVSSSSTVK 60

Query: 1627 -TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQS 1451
             T CSL+ +KK Q S+S      +                     +  KQ++K HLK + 
Sbjct: 61   TTDCSLSKAKKDQHSQSIASQEEEEDWDLDDDD------------QESKQTLKLHLKHRW 108

Query: 1450 SSSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXX 1271
             + ++  K+S    T RDLTRIQ L++R++EKKNQN +SRL +++ K             
Sbjct: 109  INRDSTHKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKEEPK-------QPVVAP 161

Query: 1270 XXXPESAP-NGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV 1094
               PES P NG SGQLMATLESGV  GSGEYFMDVF+GTPP+HFSLILDTGSDLNWIQCV
Sbjct: 162  AASPESYPANGLSGQLMATLESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCV 221

Query: 1093 PCYDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSS 914
            PCYDCF QNGP+YDPK+S+SF+NI C DPRC LVSSPDPPQPCK+E+QTCPY+YWYGDSS
Sbjct: 222  PCYDCFVQNGPYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSS 281

Query: 913  NTTGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXS 734
            NTTGDFALETFTVNLT+ +GK EFK+VENVMFGCGHWN                     S
Sbjct: 282  NTTGDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSS 341

Query: 733  QLQSLYGHSFSYCLVERNS--NASSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYV 560
            QLQSLYGHSFSYCLV+RNS  N SSKL+FGEDKDLL +PE+NFTSLV GKENPVDTFYYV
Sbjct: 342  QLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYV 401

Query: 559  QIKSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYK 380
            QIKSIMVG E L IPEETW+ S             TLSYFAEP+  +IK+AF K+VKGY 
Sbjct: 402  QIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYP 461

Query: 379  EVKLEDFP-LDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMG 203
             +K  DFP LD CYNVSGVEKMELP+F I F DGAVWNFPV+NYFI+++P+E+VCLAI+G
Sbjct: 462  VIK--DFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILG 519

Query: 202  TPRSGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            TPRS LSIIGNYQQQNFHI+YDTKKSRLGYAPMKCADV
Sbjct: 520  TPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557


>ref|XP_007146226.1| hypothetical protein PHAVU_006G022800g [Phaseolus vulgaris]
            gi|561019449|gb|ESW18220.1| hypothetical protein
            PHAVU_006G022800g [Phaseolus vulgaris]
          Length = 553

 Score =  697 bits (1798), Expect = 0.0
 Identities = 361/573 (63%), Positives = 421/573 (73%), Gaps = 4/573 (0%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFFISKAIARNHNHDNLNP-NGSTVTGIEFPDHMSFNAVSASGSTGC 1619
            MV++VS +LVLLVL   S  +  N  H N N  NGS++  ++FPDH  FNA S+S  TGC
Sbjct: 1    MVLRVSLILVLLVLHN-SCTVESNFEHLNQNDLNGSSLAAVKFPDHAHFNAGSSSTETGC 59

Query: 1618 SLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSSSSE 1439
            S + S+K + S + M +               E   +F     HKQSVK +L+  S S +
Sbjct: 60   SFSKSEKFESSVATMTS------------NEGENGEAFEEAGQHKQSVKLNLRHHSESKD 107

Query: 1438 TEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXXXXP 1259
            TEPKKSV D T+RDL RIQ L RRV+EKKNQNTISRL++ + + ++             P
Sbjct: 108  TEPKKSVVDSTVRDLKRIQTLYRRVIEKKNQNTISRLERAQEQSKK-----SFKPEAAAP 162

Query: 1258 ESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYDC 1079
             + P  FSGQL+ATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCY C
Sbjct: 163  AAPPEYFSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYAC 222

Query: 1078 FEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNTTGD 899
            FEQNGP+YDPKDS+SF+NI+C DPRC+LVSSPDPPQPCK+E+Q+CPY+YWYGDSSNTTGD
Sbjct: 223  FEQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKAETQSCPYFYWYGDSSNTTGD 282

Query: 898  FALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQLQSL 719
             A+ETFTVNLT   GKPE K VENVMFGCGHWN                     +QLQSL
Sbjct: 283  LAIETFTVNLTTPKGKPELKLVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSL 342

Query: 718  YGHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQIKSI 545
            YGHSFSYCLV+RNSN+  SSKL+FGEDK+LL +P LNFTS V GKENPVDTFYYVQIKSI
Sbjct: 343  YGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPHLNFTSFVGGKENPVDTFYYVQIKSI 402

Query: 544  MVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEVKLE 365
            MVG E L IPE+TW+ SA            TL+YFAEPA   IKEAF +++KGY  V  E
Sbjct: 403  MVGGEVLKIPEDTWHLSAQGGGGTIIDSGTTLTYFAEPAYDTIKEAFMRKIKGYPLV--E 460

Query: 364  DF-PLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPRSG 188
             F PL  CYNVSGV+K+ELP+F I FADGAVW+FPV+NYFIQI+P++VVCLAI+GTPRS 
Sbjct: 461  TFPPLKPCYNVSGVDKIELPEFSILFADGAVWDFPVENYFIQIEPEDVVCLAILGTPRSA 520

Query: 187  LSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            LSIIGNYQQQNFHI+YD K SRLGYAPM CADV
Sbjct: 521  LSIIGNYQQQNFHIMYDVKNSRLGYAPMNCADV 553


>ref|XP_006595492.1| PREDICTED: uncharacterized protein LOC100305604 isoform X1 [Glycine
            max]
          Length = 753

 Score =  693 bits (1789), Expect = 0.0
 Identities = 360/575 (62%), Positives = 422/575 (73%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFF-ISKAIARNHN---HDNLNPNGSTVTGIEFPDHMSFNAVSASGS 1628
            MV+KVS ++VLLV+   + +AI+RNHN   H+N+N NGS++  I+FPDH SF+ VS+SG 
Sbjct: 1    MVLKVSLIVVLLVICSCVVEAISRNHNNHNHNNINKNGSSLAAIKFPDHPSFSDVSSSGD 60

Query: 1627 TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSS 1448
              CS +NS++   S   M +               E+  +F + K HK SVK HLK +S 
Sbjct: 61   NDCSFSNSEQLGHSVPTMTS----------GEETDEESEAFPAPKPHKNSVKLHLKHRSG 110

Query: 1447 SSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXX 1268
            S   EPK SV D T+RDLTRIQNL+RRV+E +NQNTISRLQ    ++Q+           
Sbjct: 111  SKGAEPKNSVIDSTVRDLTRIQNLHRRVIENRNQNTISRLQ----RLQKEQPKQSFKPVF 166

Query: 1267 XXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 1088
                S+ +  SGQL+ATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC
Sbjct: 167  APAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 226

Query: 1087 YDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNT 908
              CFEQ+GP+YDPKDS+SFRNISC DPRC+LVSSPDPP PCK+E+Q+CPY+YWYGD SNT
Sbjct: 227  IACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNT 286

Query: 907  TGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQL 728
            TGDFALETFTVNLT  +GK E K VENVMFGCGHWN                     SQ+
Sbjct: 287  TGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQM 346

Query: 727  QSLYGHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQI 554
            QSLYG SFSYCLV+RNSNA  SSKL+FGEDK+LL +P LNFTS   GK+  VDTFYYVQI
Sbjct: 347  QSLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQI 406

Query: 553  KSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEV 374
             S+MV  E L IPEETW+ S+            TL+YFAEPA  +IKEAF +++KGY+ V
Sbjct: 407  NSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELV 466

Query: 373  KLEDFPLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
            +    PL  CYNVSG+EKMELPDFGI FADGAVWNFPV+NYFIQIDP +VVCLAI+G PR
Sbjct: 467  EGLP-PLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDP-DVVCLAILGNPR 524

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            S LSIIGNYQQQNFHI+YD KKSRLGYAPMKCADV
Sbjct: 525  SALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559


>gb|KHN11976.1| Aspartic proteinase nepenthesin-1 [Glycine soja]
          Length = 559

 Score =  692 bits (1787), Expect = 0.0
 Identities = 360/575 (62%), Positives = 421/575 (73%), Gaps = 6/575 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFF-ISKAIARNHN---HDNLNPNGSTVTGIEFPDHMSFNAVSASGS 1628
            MV+KVS ++VLLV+   + +AI+RNHN   H+N+N NGS++  I+FPDH SF+ VS+SG 
Sbjct: 1    MVLKVSLIVVLLVICSCVVEAISRNHNNHNHNNINKNGSSLAAIKFPDHPSFSDVSSSGD 60

Query: 1627 TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSS 1448
              CS +NS++   S   M +               E+  +F + K HK SVK HLK +S 
Sbjct: 61   NDCSFSNSEQLGHSVPTMTS----------GEETDEESEAFPAPKPHKNSVKLHLKHRSG 110

Query: 1447 SSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXX 1268
            S   EPK SV D T+RDLTRIQNL+RRV+E +NQNTISRLQ    ++Q+           
Sbjct: 111  SKGAEPKNSVIDSTVRDLTRIQNLHRRVIENRNQNTISRLQ----RLQKEQPKQSFKPVF 166

Query: 1267 XXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 1088
                S  +  SGQL+ATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC
Sbjct: 167  APAASPTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 226

Query: 1087 YDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNT 908
              CFEQ+GP+YDPKDS+SFRNISC DPRC+LVSSPDPP PCK+E+Q+CPY+YWYGD SNT
Sbjct: 227  IACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNT 286

Query: 907  TGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQL 728
            TGDFALETFTVNLT  +GK E K VENVMFGCGHWN                     SQ+
Sbjct: 287  TGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQM 346

Query: 727  QSLYGHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQI 554
            QSLYG SFSYCLV+RNSNA  SSKL+FGEDK+LL +P LNFTS   GK+  VDTFYYVQI
Sbjct: 347  QSLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQI 406

Query: 553  KSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEV 374
             S+MV  E L IPEETW+ S+            TL+YFAEPA  +IKEAF +++KGY+ V
Sbjct: 407  NSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLV 466

Query: 373  KLEDFPLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
            +    PL  CYNVSG+EKMELPDFGI FADGAVWNFPV+NYFIQIDP +VVCLAI+G PR
Sbjct: 467  EGLP-PLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDP-DVVCLAILGNPR 524

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCADV 89
            S LSIIGNYQQQNFHI+YD KKSRLGYAPMKCADV
Sbjct: 525  SALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559


>gb|KRH14503.1| hypothetical protein GLYMA_14G030000 [Glycine max]
          Length = 731

 Score =  692 bits (1785), Expect = 0.0
 Identities = 359/574 (62%), Positives = 421/574 (73%), Gaps = 6/574 (1%)
 Frame = -1

Query: 1795 MVVKVSPMLVLLVLFF-ISKAIARNHN---HDNLNPNGSTVTGIEFPDHMSFNAVSASGS 1628
            MV+KVS ++VLLV+   + +AI+RNHN   H+N+N NGS++  I+FPDH SF+ VS+SG 
Sbjct: 1    MVLKVSLIVVLLVICSCVVEAISRNHNNHNHNNINKNGSSLAAIKFPDHPSFSDVSSSGD 60

Query: 1627 TGCSLTNSKKTQQSESAMEAMGKXXXXXXXXXXXXEKIASFLSLKSHKQSVKFHLKRQSS 1448
              CS +NS++   S   M +               E+  +F + K HK SVK HLK +S 
Sbjct: 61   NDCSFSNSEQLGHSVPTMTS----------GEETDEESEAFPAPKPHKNSVKLHLKHRSG 110

Query: 1447 SSETEPKKSVTDYTIRDLTRIQNLNRRVVEKKNQNTISRLQQKKAKIQRYDXXXXXXXXX 1268
            S   EPK SV D T+RDLTRIQNL+RRV+E +NQNTISRLQ    ++Q+           
Sbjct: 111  SKGAEPKNSVIDSTVRDLTRIQNLHRRVIENRNQNTISRLQ----RLQKEQPKQSFKPVF 166

Query: 1267 XXPESAPNGFSGQLMATLESGVGFGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 1088
                S+ +  SGQL+ATLESGV  GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC
Sbjct: 167  APAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC 226

Query: 1087 YDCFEQNGPFYDPKDSTSFRNISCKDPRCKLVSSPDPPQPCKSESQTCPYYYWYGDSSNT 908
              CFEQ+GP+YDPKDS+SFRNISC DPRC+LVSSPDPP PCK+E+Q+CPY+YWYGD SNT
Sbjct: 227  IACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNT 286

Query: 907  TGDFALETFTVNLTALSGKPEFKQVENVMFGCGHWNXXXXXXXXXXXXXXXXXXXXXSQL 728
            TGDFALETFTVNLT  +GK E K VENVMFGCGHWN                     SQ+
Sbjct: 287  TGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQM 346

Query: 727  QSLYGHSFSYCLVERNSNA--SSKLLFGEDKDLLKNPELNFTSLVTGKENPVDTFYYVQI 554
            QSLYG SFSYCLV+RNSNA  SSKL+FGEDK+LL +P LNFTS   GK+  VDTFYYVQI
Sbjct: 347  QSLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQI 406

Query: 553  KSIMVGAEKLNIPEETWNFSAXXXXXXXXXXXXTLSYFAEPADRLIKEAFRKQVKGYKEV 374
             S+MV  E L IPEETW+ S+            TL+YFAEPA  +IKEAF +++KGY+ V
Sbjct: 407  NSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELV 466

Query: 373  KLEDFPLDLCYNVSGVEKMELPDFGIQFADGAVWNFPVDNYFIQIDPQEVVCLAIMGTPR 194
            +    PL  CYNVSG+EKMELPDFGI FADGAVWNFPV+NYFIQIDP +VVCLAI+G PR
Sbjct: 467  EGLP-PLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDP-DVVCLAILGNPR 524

Query: 193  SGLSIIGNYQQQNFHIVYDTKKSRLGYAPMKCAD 92
            S LSIIGNYQQQNFHI+YD KKSRLGYAPMKCAD
Sbjct: 525  SALSIIGNYQQQNFHILYDMKKSRLGYAPMKCAD 558

