BLASTX nr result

ID: Sinomenium21_contig00002509 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00002509
         (2057 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007018614.1| Insulinase (Peptidase family M16) family pro...   840   0.0  
ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266...   840   0.0  
emb|CBI40802.3| unnamed protein product [Vitis vinifera]              840   0.0  
ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobro...   836   0.0  
ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613...   821   0.0  
ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citr...   821   0.0  
ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Popu...   817   0.0  
ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224...   815   0.0  
ref|XP_002320445.2| pitrilysin family protein [Populus trichocar...   814   0.0  
ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202...   814   0.0  
ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|2...   813   0.0  
gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis]          812   0.0  
ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [A...   806   0.0  
ref|XP_007018616.1| Insulinase (Peptidase family M16) family pro...   806   0.0  
ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308...   806   0.0  
ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phas...   804   0.0  
ref|XP_007018617.1| Insulinase (Peptidase family M16) family pro...   804   0.0  
ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794...   797   0.0  
ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809...   793   0.0  
gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus...   790   0.0  

>ref|XP_007018614.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma
            cacao] gi|590597455|ref|XP_007018615.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao] gi|508723942|gb|EOY15839.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao] gi|508723943|gb|EOY15840.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao]
          Length = 1285

 Score =  840 bits (2171), Expect = 0.0
 Identities = 431/562 (76%), Positives = 481/562 (85%), Gaps = 2/562 (0%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI++GL             KELIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 726  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 785

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 786  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 845

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 846  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 905

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 906  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 965

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            GDNMEVSIVGDF+E +IESC+LDYLGTV A+        H  +PI+FRPSPSDLQFQQVF
Sbjct: 966  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSE--RAHGFSPILFRPSPSDLQFQQVF 1023

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD-ELQSLELKDVKMDLQRKL 1078
            LKD+DERACAYIAGPAPNRWGLTV+G+DL ES+ D+ S  D +  S E KD++ DLQ+KL
Sbjct: 1024 LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSDEGKDIQKDLQKKL 1083

Query: 1079 RGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPA 1258
            RGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+
Sbjct: 1084 RGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPS 1143

Query: 1259 KVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIP 1438
            KVY+AVDACKNVLRGL++++IA REL+RAKRTLLM+HEAE KSNAYWLGL+AH+QASS+P
Sbjct: 1144 KVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP 1203

Query: 1439 RKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELT 1618
            RKD+SC+K+L  LYEAA+IEDIYLAY+ LKVDEDSL+SCIGIAG  AGE  +AS E E +
Sbjct: 1204 RKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASEEEEES 1263

Query: 1619 T-GLQGVFPVGRGLSTMTRPTT 1681
              G QGV PVGRGLSTMTRPTT
Sbjct: 1264 DGGFQGVIPVGRGLSTMTRPTT 1285


>ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266746 [Vitis vinifera]
          Length = 1269

 Score =  840 bits (2171), Expect = 0.0
 Identities = 434/563 (77%), Positives = 479/563 (85%), Gaps = 4/563 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI+AGL             KELISSSQL  LR++  P+FIPLS EV+ TKVYD ETGITQ
Sbjct: 710  AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 769

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIPVNYKI++NEA+ GVMRLIV              V+VGVRTLSEGGRVGNFSR
Sbjct: 770  LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 829

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GMRAAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 830  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 889

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 890  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 949

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEVS+VGDF+E DIESCILDY+GTV A+         +S+ I+FR  PSDLQFQQVFL
Sbjct: 950  DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSE--IEQQSSSIMFRSYPSDLQFQQVFL 1007

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDE---LQSL-ELKDVKMDLQR 1072
            KD+DERACAYIAGPAPNRWG T+EG+DLFESI ++  + DE    +SL E+KD + DLQR
Sbjct: 1008 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLSEMKDCRKDLQR 1067

Query: 1073 KLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTST 1252
            KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVTST
Sbjct: 1068 KLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVTST 1127

Query: 1253 PAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASS 1432
            P KVYKAVDACKNVLRGL+SS+IAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH+QAS+
Sbjct: 1128 PGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQAST 1187

Query: 1433 IPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE 1612
            +PRKD+SCIKDL  LYEAATIEDIYLAYE LKVDE+SL+SCIGIAG+QA E  S   E E
Sbjct: 1188 VPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-EEE 1246

Query: 1613 LTTGLQGVFPVGRGLSTMTRPTT 1681
               GLQGV P GRGLSTMTRPTT
Sbjct: 1247 SDEGLQGVIPAGRGLSTMTRPTT 1269


>emb|CBI40802.3| unnamed protein product [Vitis vinifera]
          Length = 1276

 Score =  840 bits (2171), Expect = 0.0
 Identities = 434/563 (77%), Positives = 479/563 (85%), Gaps = 4/563 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI+AGL             KELISSSQL  LR++  P+FIPLS EV+ TKVYD ETGITQ
Sbjct: 717  AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 776

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIPVNYKI++NEA+ GVMRLIV              V+VGVRTLSEGGRVGNFSR
Sbjct: 777  LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 836

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GMRAAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 837  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 896

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 897  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 956

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEVS+VGDF+E DIESCILDY+GTV A+         +S+ I+FR  PSDLQFQQVFL
Sbjct: 957  DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSE--IEQQSSSIMFRSYPSDLQFQQVFL 1014

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDE---LQSL-ELKDVKMDLQR 1072
            KD+DERACAYIAGPAPNRWG T+EG+DLFESI ++  + DE    +SL E+KD + DLQR
Sbjct: 1015 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLSEMKDCRKDLQR 1074

Query: 1073 KLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTST 1252
            KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVTST
Sbjct: 1075 KLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVTST 1134

Query: 1253 PAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASS 1432
            P KVYKAVDACKNVLRGL+SS+IAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH+QAS+
Sbjct: 1135 PGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQAST 1194

Query: 1433 IPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE 1612
            +PRKD+SCIKDL  LYEAATIEDIYLAYE LKVDE+SL+SCIGIAG+QA E  S   E E
Sbjct: 1195 VPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-EEE 1253

Query: 1613 LTTGLQGVFPVGRGLSTMTRPTT 1681
               GLQGV P GRGLSTMTRPTT
Sbjct: 1254 SDEGLQGVIPAGRGLSTMTRPTT 1276


>ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobroma cacao]
            gi|508723941|gb|EOY15838.1| Insulinase family protein
            isoform 1 [Theobroma cacao]
          Length = 1302

 Score =  836 bits (2159), Expect = 0.0
 Identities = 426/542 (78%), Positives = 474/542 (87%), Gaps = 2/542 (0%)
 Frame = +2

Query: 62   KELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQCRLSNGIPVNYKITKNEAK 241
            KELIS  QL +LR+Q  P+FIPLS E++ TKV DKETGITQ RLSNGIPVNYKI+KNEA+
Sbjct: 763  KELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGITQLRLSNGIPVNYKISKNEAR 822

Query: 242  SGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 421
             GVMRLIV              V+VGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST
Sbjct: 823  GGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 882

Query: 422  EEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRARQLYLSYYRSIPKSLERST 601
            EEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRARQLYLSYYRSIPKSLERST
Sbjct: 883  EEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDRARQLYLSYYRSIPKSLERST 942

Query: 602  AHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVGDNMEVSIVGDFTEGDIESC 781
            AHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFVGDNMEVSIVGDF+E +IESC
Sbjct: 943  AHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFVGDNMEVSIVGDFSEEEIESC 1002

Query: 782  ILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFLKDSDERACAYIAGPAPNRW 961
            +LDYLGTV A+        H  +PI+FRPSPSDLQFQQVFLKD+DERACAYIAGPAPNRW
Sbjct: 1003 VLDYLGTVRASRDSE--RAHGFSPILFRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW 1060

Query: 962  GLTVEGRDLFESIRDVQSNSD-ELQSLELKDVKMDLQRKLRGHPLFFGITLGLLAEVINS 1138
            GLTV+G+DL ES+ D+ S  D +  S E KD++ DLQ+KLRGHPLFFGIT+GLLAEVINS
Sbjct: 1061 GLTVDGQDLLESVADIPSADDAQPHSDEGKDIQKDLQKKLRGHPLFFGITMGLLAEVINS 1120

Query: 1139 RLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKVYKAVDACKNVLRGLNSSR 1318
            RLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KVY+AVDACKNVLRGL++++
Sbjct: 1121 RLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKVYRAVDACKNVLRGLHTNK 1180

Query: 1319 IAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPRKDLSCIKDLQLLYEAATIE 1498
            IA REL+RAKRTLLM+HEAE KSNAYWLGL+AH+QASS+PRKD+SC+K+L  LYEAA+IE
Sbjct: 1181 IAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDISCVKELTSLYEAASIE 1240

Query: 1499 DIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELTT-GLQGVFPVGRGLSTMTRP 1675
            DIYLAY+ LKVDEDSL+SCIGIAG  AGE  +AS E E +  G QGV PVGRGLSTMTRP
Sbjct: 1241 DIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASEEEEESDGGFQGVIPVGRGLSTMTRP 1300

Query: 1676 TT 1681
            TT
Sbjct: 1301 TT 1302


>ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613059 [Citrus sinensis]
          Length = 1259

 Score =  821 bits (2120), Expect = 0.0
 Identities = 420/564 (74%), Positives = 476/564 (84%), Gaps = 5/564 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI++G+             KELIS+S+L +L+L+ RP+FIP   E++ TKV+DKE+GITQ
Sbjct: 698  AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 757

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIP+NYKI+K+EA+ GVMRLIV              VIVGVRTLSEGGRVG FSR
Sbjct: 758  LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 817

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GMRAAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 818  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 877

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG
Sbjct: 878  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 937

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            +NMEVSIVGDF+E +IESCILDYLGTV AT        H  +PI+FRPSPSDL FQQVFL
Sbjct: 938  NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSK--REHEYSPILFRPSPSDLHFQQVFL 995

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD----ELQSLELKDVKMDLQR 1072
            KD+DERACAYIAGPAPNRWG TV+G DLF+SI +   + D      +S+ LKD++ D QR
Sbjct: 996  KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEESMMLKDIEKDQQR 1055

Query: 1073 KLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTST 1252
            KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTS 
Sbjct: 1056 KLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSP 1115

Query: 1253 PAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASS 1432
            P KV+KAVDACKNVLRGL+S+RI QRELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS
Sbjct: 1116 PGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASS 1175

Query: 1433 IPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE 1612
            +PRKD+SCIKDL  LYEAA++EDIYLAYE L+VDEDSL+SCIGIAG+QAG+ ++AS E E
Sbjct: 1176 VPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSEEE 1235

Query: 1613 LTTGLQ-GVFPVGRGLSTMTRPTT 1681
               G   GV PVGRGLSTMTRPTT
Sbjct: 1236 SDEGYPGGVIPVGRGLSTMTRPTT 1259


>ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citrus clementina]
            gi|567885887|ref|XP_006435502.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|567885889|ref|XP_006435503.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|567885891|ref|XP_006435504.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537623|gb|ESR48741.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537624|gb|ESR48742.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537625|gb|ESR48743.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537626|gb|ESR48744.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
          Length = 1260

 Score =  821 bits (2120), Expect = 0.0
 Identities = 420/564 (74%), Positives = 476/564 (84%), Gaps = 5/564 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI++G+             KELIS+S+L +L+L+ RP+FIP   E++ TKV+DKE+GITQ
Sbjct: 699  AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 758

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIP+NYKI+K+EA+ GVMRLIV              VIVGVRTLSEGGRVG FSR
Sbjct: 759  LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 818

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GMRAAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 819  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 878

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG
Sbjct: 879  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 938

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            +NMEVSIVGDF+E +IESCILDYLGTV AT        H  +PI+FRPSPSDL FQQVFL
Sbjct: 939  NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSK--REHEYSPILFRPSPSDLHFQQVFL 996

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD----ELQSLELKDVKMDLQR 1072
            KD+DERACAYIAGPAPNRWG TV+G DLF+SI +   + D      +S+ LKD++ D QR
Sbjct: 997  KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEESMMLKDIEKDQQR 1056

Query: 1073 KLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTST 1252
            KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTS 
Sbjct: 1057 KLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSP 1116

Query: 1253 PAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASS 1432
            P KV+KAVDACKNVLRGL+S+RI QRELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS
Sbjct: 1117 PGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASS 1176

Query: 1433 IPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE 1612
            +PRKD+SCIKDL  LYEAA++EDIYLAYE L+VDEDSL+SCIGIAG+QAG+ ++AS E E
Sbjct: 1177 VPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSEEE 1236

Query: 1613 LTTGLQ-GVFPVGRGLSTMTRPTT 1681
               G   GV PVGRGLSTMTRPTT
Sbjct: 1237 SDEGYPGGVIPVGRGLSTMTRPTT 1260


>ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Populus trichocarpa]
            gi|550345688|gb|EEE81021.2| hypothetical protein
            POPTR_0002s23680g [Populus trichocarpa]
          Length = 1268

 Score =  817 bits (2110), Expect = 0.0
 Identities = 415/566 (73%), Positives = 477/566 (84%), Gaps = 6/566 (1%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI++GL             KELISS+QL +LRL+ RP+F+PL  +   TK++D+ETGIT
Sbjct: 705  AAIKSGLEEAIEAEPELEVPKELISSTQLEELRLERRPSFVPLLPDAGYTKLHDQETGIT 764

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            QCRLSNGI VNYKI+K+E++ GVMRLIV              V+VGVRTLSEGGRVG+FS
Sbjct: 765  QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGSFS 824

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM+AAF+LLHMV+E+SVWL+DAFDR
Sbjct: 825  REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMQAAFELLHMVLENSVWLDDAFDR 884

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV
Sbjct: 885  ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 944

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            G NMEVSIVGDF+E +++SCI+DYLGTV AT           NP++FRPSPSDLQFQQVF
Sbjct: 945  GGNMEVSIVGDFSEEEVQSCIIDYLGTVRATR--DSDQEQEFNPVMFRPSPSDLQFQQVF 1002

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD-----ELQSLELKDVKMDL 1066
            LKD+DERACAYIAGPAPNRWG TV+G DLF+S+     ++D     E Q ++  DV+ D+
Sbjct: 1003 LKDTDERACAYIAGPAPNRWGFTVDGTDLFKSMSGFSVSADAQPISETQQIDGMDVQKDM 1062

Query: 1067 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1246
            Q KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SVT
Sbjct: 1063 QGKLRCHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSVT 1122

Query: 1247 STPAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQA 1426
            STP KV+KAVDACK+VLRGL+S+++AQRELDRA+RTLLM+HEAE KSNAYWLGL+AH+QA
Sbjct: 1123 STPGKVHKAVDACKSVLRGLHSNKVAQRELDRARRTLLMRHEAEIKSNAYWLGLLAHLQA 1182

Query: 1427 SSIPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLE 1606
            SS+PRKD+SCIKDL  LYEAATIEDIYLAYE LKVDEDSL+SCIG+AG+QAGE  +A LE
Sbjct: 1183 SSVPRKDVSCIKDLTSLYEAATIEDIYLAYEQLKVDEDSLYSCIGVAGTQAGEEINAPLE 1242

Query: 1607 AELT-TGLQGVFPVGRGLSTMTRPTT 1681
             E T  GLQG  PVGRGLSTMTRPTT
Sbjct: 1243 VEETDDGLQGGIPVGRGLSTMTRPTT 1268


>ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224074 [Cucumis sativus]
          Length = 1267

 Score =  815 bits (2104), Expect = 0.0
 Identities = 419/564 (74%), Positives = 470/564 (83%), Gaps = 5/564 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI AGL             KELISSSQ+ +LR+QH+P+FI L+ E + TK +DKETGITQ
Sbjct: 706  AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
            CRLSNGIPVNYKI+K+E K+GVMRLIV              V+VGVRTLSEGGRVG FSR
Sbjct: 766  CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GMRAAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 826  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 886  KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            +NMEVS+VGDF+E +IESCILDYLGTVTAT          S PI+FRPS S+LQFQQVFL
Sbjct: 946  NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEA--ALASVPIVFRPSASELQFQQVFL 1003

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLEL----KDVKMDLQR 1072
            KD+DERACAYI+GPAPNRWG+T EG +L ESI  +     E    E+     D++  LQR
Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESISQISRTGGEFLCEEVDESDNDIEKGLQR 1063

Query: 1073 KLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTST 1252
            KLR HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVTST
Sbjct: 1064 KLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTST 1123

Query: 1253 PAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASS 1432
            PAKVYKAVDACK+VLRGL+S++IAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS
Sbjct: 1124 PAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASS 1183

Query: 1433 IPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE 1612
            +PRKDLSCIKDL  LYEAATI+D+Y+AY+ LKVD DSL++CIGIAG+QAGE    S E E
Sbjct: 1184 VPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEE 1243

Query: 1613 -LTTGLQGVFPVGRGLSTMTRPTT 1681
                  QGV P GRGLSTMTRPTT
Sbjct: 1244 GSDQDFQGVIPSGRGLSTMTRPTT 1267


>ref|XP_002320445.2| pitrilysin family protein [Populus trichocarpa]
            gi|550324212|gb|EEE98760.2| pitrilysin family protein
            [Populus trichocarpa]
          Length = 1267

 Score =  814 bits (2103), Expect = 0.0
 Identities = 414/560 (73%), Positives = 467/560 (83%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI++GL             KELI+S+QL +LRLQ  P+FIPL  +   TK++D ETGIT
Sbjct: 717  AAIKSGLEEAIEAEPELEVPKELITSTQLEELRLQLTPSFIPLVPDADYTKLHDPETGIT 776

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            QCRLSNGI VNYKI+K+E++ GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 777  QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGNFS 836

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GMRAAF+LLHMV+EHSVWL+DA DR
Sbjct: 837  REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDALDR 896

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV
Sbjct: 897  ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 956

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            G NMEVSIVGDF+E +IESCI+DYLGTV AT           NP++FRPSPSDLQFQQVF
Sbjct: 957  GGNMEVSIVGDFSEEEIESCIIDYLGTVRATR--DSDREQEFNPVMFRPSPSDLQFQQVF 1014

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLELKDVKMDLQRKLR 1081
            LKD+DERACAYIAGPAPNRWG TV+G+DLFES       +  +  ++ KDV+ D Q KLR
Sbjct: 1015 LKDTDERACAYIAGPAPNRWGFTVDGKDLFES-------TSGISQIDRKDVQKDKQGKLR 1067

Query: 1082 GHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAK 1261
             HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SVTSTP K
Sbjct: 1068 SHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSVTSTPGK 1127

Query: 1262 VYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPR 1441
            V+KAVDACK+VLRGL+S+++AQRELDRAKRTLLM+HE E KSNAYWLGL+AH+QASS+PR
Sbjct: 1128 VHKAVDACKSVLRGLHSNKVAQRELDRAKRTLLMRHETEIKSNAYWLGLLAHLQASSVPR 1187

Query: 1442 KDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELTT 1621
            KD+SCIKDL  LYEAATIEDIY+AYE LKVDEDSL+SCIG+AG+QAGE  +A  E E   
Sbjct: 1188 KDVSCIKDLTSLYEAATIEDIYVAYEQLKVDEDSLYSCIGVAGAQAGEEINALEEEETDD 1247

Query: 1622 GLQGVFPVGRGLSTMTRPTT 1681
              QGV PVGRGLSTMTRPTT
Sbjct: 1248 DFQGVIPVGRGLSTMTRPTT 1267


>ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202810 [Cucumis sativus]
          Length = 1261

 Score =  814 bits (2102), Expect = 0.0
 Identities = 418/560 (74%), Positives = 470/560 (83%), Gaps = 1/560 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI AGL             KELISSSQ+ +LR+QH+P+FI L+ E + TK +DKETGITQ
Sbjct: 706  AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
            CRLSNGIPVNYKI+K+E K+GVMRLIV              V+VGVRTLSEGGRVG FSR
Sbjct: 766  CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GMRAAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 826  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 886  KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            +NMEVS+VGDF+E +IESCILDYLGTVTAT          S PI+FRPS S+LQFQQVFL
Sbjct: 946  NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEA--ALASVPIVFRPSASELQFQQVFL 1003

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLELKDVKMDLQRKLRG 1084
            KD+DERACAYI+GPAPNRWG+T EG +L ESI  +    +  +S    D++  LQRKLR 
Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESISQISRTGESDES--DNDIEKGLQRKLRS 1061

Query: 1085 HPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKV 1264
            HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVTSTPAKV
Sbjct: 1062 HPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKV 1121

Query: 1265 YKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPRK 1444
            YKAVDACK+VLRGL+S++IAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS+PRK
Sbjct: 1122 YKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRK 1181

Query: 1445 DLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAE-LTT 1621
            DLSCIKDL  LYEAATI+D+Y+AY+ LKVD DSL++CIGIAG+QAGE    S E E    
Sbjct: 1182 DLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQ 1241

Query: 1622 GLQGVFPVGRGLSTMTRPTT 1681
              QGV P GRGLSTMTRPTT
Sbjct: 1242 DFQGVIPSGRGLSTMTRPTT 1261


>ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|223544476|gb|EEF45995.1|
            pitrilysin, putative [Ricinus communis]
          Length = 1268

 Score =  813 bits (2101), Expect = 0.0
 Identities = 424/568 (74%), Positives = 476/568 (83%), Gaps = 9/568 (1%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI++GL             KELIS+SQL +LRLQ RP+F+PL  EV+  K +D+ETGITQ
Sbjct: 707  AIKSGLEEPIEAEPELEVPKELISTSQLEELRLQRRPSFVPLLPEVNILKSHDQETGITQ 766

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
            CRLSNGI VNYKI+++E++ GVMRLIV              VIVGVRTLSEGGRVGNFSR
Sbjct: 767  CRLSNGIAVNYKISRSESRGGVMRLIVGGGRAAETTESKGAVIVGVRTLSEGGRVGNFSR 826

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GMRAAF+LLHMV+EHSVWL+DAFDRA
Sbjct: 827  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDAFDRA 886

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLER+TAHKLM AMLNGDERFVEPTPQ L+ LTL+ VKDAVMNQFVG
Sbjct: 887  RQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFVEPTPQSLENLTLKSVKDAVMNQFVG 946

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSN--PIIFRPSPSDLQFQQV 898
            DNMEVSIVGDF+E +IESCI+DYLGTV  T     G V  +   PI+FRPS SDLQ QQV
Sbjct: 947  DNMEVSIVGDFSEEEIESCIIDYLGTVRETR----GSVGAAKFVPILFRPS-SDLQSQQV 1001

Query: 899  FLKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDV------QSNSDELQSLELKDVKM 1060
            FLKD+DERACAYIAGPAPNRWG TV+G+DLFESI D+      QS S++   +  KDV+ 
Sbjct: 1002 FLKDTDERACAYIAGPAPNRWGFTVDGKDLFESISDIAVVPDAQSKSEQ-PLMGRKDVQE 1060

Query: 1061 DLQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVIS 1240
            D QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRL LGWYVIS
Sbjct: 1061 DWQRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLNLGWYVIS 1120

Query: 1241 VTSTPAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHV 1420
            VTSTP+KVYKAVDACK+VLRGL S++IA RELDRAKRTLLM+HEAE KSNAYWLGL+AH+
Sbjct: 1121 VTSTPSKVYKAVDACKSVLRGLYSNKIAPRELDRAKRTLLMRHEAEVKSNAYWLGLLAHL 1180

Query: 1421 QASSIPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSAS 1600
            QASS+PRKD+SCIKDL  LYEAATI+DIYLAYE LK+D+DSL+SCIG+AGSQAG+  +  
Sbjct: 1181 QASSVPRKDISCIKDLTSLYEAATIDDIYLAYEQLKIDDDSLYSCIGVAGSQAGDEITVP 1240

Query: 1601 LEAELT-TGLQGVFPVGRGLSTMTRPTT 1681
            LE E T  G QGV PVGRGLSTMTRPTT
Sbjct: 1241 LEEEETENGFQGVIPVGRGLSTMTRPTT 1268


>gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis]
          Length = 1263

 Score =  812 bits (2097), Expect = 0.0
 Identities = 425/567 (74%), Positives = 471/567 (83%), Gaps = 7/567 (1%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI AGL              ELIS+SQL +L ++ RP+F+ LS E + TK++DKETGIT
Sbjct: 701  AAIEAGLKEPIAAEPELEVPTELISASQLQELWMERRPSFVSLSPETNVTKLHDKETGIT 760

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            QC LSNGIPVNYKI+K EA  GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 761  QCCLSNGIPVNYKISKTEACGGVMRLIVGGGRAVECPDSRGAVVVGVRTLSEGGRVGNFS 820

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GMRAAFQLLHMV+E SVWL+DAFDR
Sbjct: 821  REQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLERSVWLDDAFDR 880

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP+ LQ LTLQ VKDAVM+QFV
Sbjct: 881  ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPKSLQNLTLQTVKDAVMDQFV 940

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            G+NMEVSIVGDF+E DIESCILDYLGTV AT+        +  P++FRPSPSDLQ QQVF
Sbjct: 941  GNNMEVSIVGDFSEEDIESCILDYLGTVRATKNSK--RERQYAPVVFRPSPSDLQSQQVF 998

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIR------DVQSNSDELQSLELKDVKMD 1063
            LKD+DERACAYIAGPAPNRWG TV+G+DLFESIR      D QS S E  S E ++ + D
Sbjct: 999  LKDTDERACAYIAGPAPNRWGFTVDGKDLFESIRSISITEDAQSRSGE--SAEGENTEKD 1056

Query: 1064 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1243
             QRKLR HPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISV
Sbjct: 1057 YQRKLRHHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISV 1116

Query: 1244 TSTPAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQ 1423
            TSTPAKV+KAVDACKNVLRGL+S++I  RELDRAKRTLLM+HEAE KSNAYWLGL+AH+Q
Sbjct: 1117 TSTPAKVHKAVDACKNVLRGLHSNKITPRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQ 1176

Query: 1424 ASSIPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASL 1603
            ASS+PRKD+SCIKDL LLYEAA IED YLAY+ LKVDEDSL+SCIGIAG+Q  E  SAS+
Sbjct: 1177 ASSVPRKDISCIKDLTLLYEAAGIEDAYLAYDQLKVDEDSLYSCIGIAGAQDDEEISASI 1236

Query: 1604 EAE-LTTGLQGVFPVGRGLSTMTRPTT 1681
            E +    G  G+ P+GRGLSTMTRPTT
Sbjct: 1237 EEDGSDEGFPGIAPMGRGLSTMTRPTT 1263


>ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [Amborella trichopoda]
            gi|548853469|gb|ERN11452.1| hypothetical protein
            AMTR_s00022p00070510 [Amborella trichopoda]
          Length = 1274

 Score =  806 bits (2083), Expect = 0.0
 Identities = 411/563 (73%), Positives = 473/563 (84%), Gaps = 4/563 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AIR GL             KELISSS L +L+   +P F+PL+ +V+ T+++D+ETGITQ
Sbjct: 714  AIREGLNEPIEAEPELEVPKELISSSHLSELKSLCKPAFVPLNPDVNATRIFDEETGITQ 773

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
            CRLSNGIPVNYKIT+NEAK GVMRLIV              V+VGVRTLSEGGRVGNFSR
Sbjct: 774  CRLSNGIPVNYKITQNEAKGGVMRLIVGGGRANETSESRGSVVVGVRTLSEGGRVGNFSR 833

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEF+ MEFRFTLRDGGMRAAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 834  EQVELFCVNHLINCSLESTEEFVCMEFRFTLRDGGMRAAFQLLHMVLEHSVWLEDAFDRA 893

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYL YYR+IPKSLER+TAHKLM+AMLNGDERF EPTP+ LQ+LTL  VK+AVMNQF G
Sbjct: 894  RQLYLQYYRAIPKSLERATAHKLMIAMLNGDERFFEPTPESLQQLTLPIVKNAVMNQFRG 953

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEVSIVGDFTE +IESCILDYLGTVTAT     G+ +   PI FRPSPSDLQ QQVFL
Sbjct: 954  DNMEVSIVGDFTEDEIESCILDYLGTVTATGSTEKGNEY--EPIFFRPSPSDLQSQQVFL 1011

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIR--DVQSNSDELQSLELKDVKMDLQRKL 1078
            KD+DERACAYIAGPAPNRWGLT+EG+DLFE ++   + S+ ++ + +E KD + +L  K+
Sbjct: 1012 KDTDERACAYIAGPAPNRWGLTIEGQDLFELVKKGSLVSDDEQRKPVESKDGEANLSGKI 1071

Query: 1079 RGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPA 1258
            +  PLFF IT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLK GWYVISVTSTP+
Sbjct: 1072 QQLPLFFAITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKFGWYVISVTSTPS 1131

Query: 1259 KVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIP 1438
            KVYKAVDACK+VLRGL++S+I QRELDRA+RTLLM+HEAE KSN YWLGL+AH+QASSIP
Sbjct: 1132 KVYKAVDACKDVLRGLHNSKITQRELDRARRTLLMRHEAEMKSNVYWLGLLAHLQASSIP 1191

Query: 1439 RKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EIDSASLEAEL 1615
            RKD+SCIKDL  LYEAATIED+Y+AY HLKV EDSL+SCIG+AGSQA  E DSAS+ +E 
Sbjct: 1192 RKDISCIKDLTSLYEAATIEDVYVAYNHLKVGEDSLYSCIGVAGSQARVEADSASVVSEE 1251

Query: 1616 TTG-LQGVFPVGRGLSTMTRPTT 1681
            + G   G+ P+GRGL+TMTRPTT
Sbjct: 1252 SDGSAAGLIPIGRGLATMTRPTT 1274


>ref|XP_007018616.1| Insulinase (Peptidase family M16) family protein isoform 4 [Theobroma
            cacao] gi|508723944|gb|EOY15841.1| Insulinase (Peptidase
            family M16) family protein isoform 4 [Theobroma cacao]
          Length = 1018

 Score =  806 bits (2082), Expect = 0.0
 Identities = 409/529 (77%), Positives = 457/529 (86%), Gaps = 1/529 (0%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI++GL             KELIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 484  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 543

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 544  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 603

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 604  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 663

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 664  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 723

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            GDNMEVSIVGDF+E +IESC+LDYLGTV A+        H  +PI+FRPSPSDLQFQQVF
Sbjct: 724  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSE--RAHGFSPILFRPSPSDLQFQQVF 781

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD-ELQSLELKDVKMDLQRKL 1078
            LKD+DERACAYIAGPAPNRWGLTV+G+DL ES+ D+ S  D +  S E KD++ DLQ+KL
Sbjct: 782  LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSDEGKDIQKDLQKKL 841

Query: 1079 RGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPA 1258
            RGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+
Sbjct: 842  RGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPS 901

Query: 1259 KVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIP 1438
            KVY+AVDACKNVLRGL++++IA REL+RAKRTLLM+HEAE KSNAYWLGL+AH+QASS+P
Sbjct: 902  KVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP 961

Query: 1439 RKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGE 1585
            RKD+SC+K+L  LYEAA+IEDIYLAY+ LKVDEDSL+SCIGIAG  AGE
Sbjct: 962  RKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGE 1010


>ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308217 [Fragaria vesca
            subsp. vesca]
          Length = 1263

 Score =  806 bits (2082), Expect = 0.0
 Identities = 420/565 (74%), Positives = 469/565 (83%), Gaps = 5/565 (0%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AA RAGL             KELISSSQL +LR +  P+FI  S E S TK+YDKETGIT
Sbjct: 705  AATRAGLEDPIEPEPELEVPKELISSSQLQELRQERMPSFITCSPETSMTKIYDKETGIT 764

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            + RLSNGI VNYKI+K+EA+ GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 765  RARLSNGISVNYKISKSEARGGVMRLIVGGGRATESSESKGSVVVGVRTLSEGGRVGNFS 824

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GMRAAFQLLHMV+EHSVWL+DAFDR
Sbjct: 825  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDR 884

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP  LQ LTLQ VKDAVMNQFV
Sbjct: 885  ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPTSLQNLTLQSVKDAVMNQFV 944

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            G+NMEVSIVGDF+E +IESCILDYLGTV + +        + NP++FR S SDLQ QQVF
Sbjct: 945  GNNMEVSIVGDFSEEEIESCILDYLGTVQSAKHSEV--EQKYNPVVFRAS-SDLQSQQVF 1001

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD-ELQSLEL----KDVKMDL 1066
            LKD+DERACAYIAGPAPNRWG TV+G+DLF SI D+ S  D +L+S EL    KD + D+
Sbjct: 1002 LKDTDERACAYIAGPAPNRWGFTVDGKDLF-SITDISSCDDAQLKSEELVAEGKDTQKDM 1060

Query: 1067 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1246
            QR LRGHPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISVT
Sbjct: 1061 QRTLRGHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISVT 1120

Query: 1247 STPAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQA 1426
            STP KV+KAVDACKNVLRGL+S++I+QRELDRAKRTLLM+HEAE KSN YWLGL+AH+QA
Sbjct: 1121 STPGKVHKAVDACKNVLRGLHSNKISQRELDRAKRTLLMRHEAEIKSNGYWLGLLAHLQA 1180

Query: 1427 SSIPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLE 1606
            SS+PRKD+SCIKDL  LYE A IED+YLAY+ L++D+DSL+SC+GIAG+QAG  D  +  
Sbjct: 1181 SSVPRKDISCIKDLTTLYEIAAIEDVYLAYDQLRIDDDSLYSCVGIAGAQAG--DEITEV 1238

Query: 1607 AELTTGLQGVFPVGRGLSTMTRPTT 1681
             E   G  GVFPVGRGLSTMTRPTT
Sbjct: 1239 EEPEGGFPGVFPVGRGLSTMTRPTT 1263


>ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phaseolus vulgaris]
            gi|561030490|gb|ESW29069.1| hypothetical protein
            PHAVU_002G040800g [Phaseolus vulgaris]
          Length = 1247

 Score =  804 bits (2077), Expect = 0.0
 Identities = 411/560 (73%), Positives = 467/560 (83%), Gaps = 1/560 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI+AGL             KELI SS+L +L+   +P FIP++ E  +TK+ D+ETGITQ
Sbjct: 691  AIKAGLDEPIQPEPELEVPKELIQSSKLEELKKLRKPAFIPVNPEADSTKLLDEETGITQ 750

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIPVNYKI+K E +SGVMRLIV              VIVGVRTLSEGGRVGNFSR
Sbjct: 751  RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESSDSRGSVIVGVRTLSEGGRVGNFSR 810

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GMRAAFQLLHMV+EHSVW++DAFDRA
Sbjct: 811  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 870

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 871  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 930

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEV IVGDFTE DIESCILDYLGT  AT   + G     NP IFRPSPS+LQFQ+VFL
Sbjct: 931  DNMEVCIVGDFTEEDIESCILDYLGTAQATR--NHGREQEFNPPIFRPSPSELQFQEVFL 988

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLELKDVKMDLQRKLRG 1084
            KD+DERACAYIAGPAPNRWG TV+G+ L ESI +  + +D+  + + +  +  LQ+ LRG
Sbjct: 989  KDTDERACAYIAGPAPNRWGFTVDGKYLLESINNASTTNDDQSNSDAQQTQ-GLQKSLRG 1047

Query: 1085 HPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKV 1264
            HPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KV
Sbjct: 1048 HPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKV 1107

Query: 1265 YKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPRK 1444
            +KAVDACKNVLRGL+S++I +RELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS+PRK
Sbjct: 1108 HKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRK 1167

Query: 1445 DLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELTTG 1624
            DLSCIKDL  LYE ATIEDIYLAYE LKVDE+SL+SCIGIAG+Q  +  +A +E E+   
Sbjct: 1168 DLSCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQDAQDIAAPIEEEVAGD 1227

Query: 1625 L-QGVFPVGRGLSTMTRPTT 1681
            +  GV PVGRGLSTMTRPTT
Sbjct: 1228 VYPGVIPVGRGLSTMTRPTT 1247


>ref|XP_007018617.1| Insulinase (Peptidase family M16) family protein isoform 5, partial
            [Theobroma cacao] gi|508723945|gb|EOY15842.1| Insulinase
            (Peptidase family M16) family protein isoform 5, partial
            [Theobroma cacao]
          Length = 1022

 Score =  804 bits (2077), Expect = 0.0
 Identities = 408/528 (77%), Positives = 456/528 (86%), Gaps = 1/528 (0%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            AAI++GL             KELIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 497  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 556

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV              V+VGVRTLSEGGRVGNFS
Sbjct: 557  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 616

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 617  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 676

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 677  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 736

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
            GDNMEVSIVGDF+E +IESC+LDYLGTV A+        H  +PI+FRPSPSDLQFQQVF
Sbjct: 737  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSE--RAHGFSPILFRPSPSDLQFQQVF 794

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD-ELQSLELKDVKMDLQRKL 1078
            LKD+DERACAYIAGPAPNRWGLTV+G+DL ES+ D+ S  D +  S E KD++ DLQ+KL
Sbjct: 795  LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSDEGKDIQKDLQKKL 854

Query: 1079 RGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPA 1258
            RGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+
Sbjct: 855  RGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPS 914

Query: 1259 KVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIP 1438
            KVY+AVDACKNVLRGL++++IA REL+RAKRTLLM+HEAE KSNAYWLGL+AH+QASS+P
Sbjct: 915  KVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP 974

Query: 1439 RKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG 1582
            RKD+SC+K+L  LYEAA+IEDIYLAY+ LKVDEDSL+SCIGIAG  AG
Sbjct: 975  RKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAG 1022


>ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794716 [Glycine max]
          Length = 1254

 Score =  797 bits (2058), Expect = 0.0
 Identities = 409/560 (73%), Positives = 462/560 (82%), Gaps = 1/560 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI+AGL             KELI S++L +L+   +P FIP++ E   TK++D+ETGIT+
Sbjct: 698  AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGITR 757

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RL+NGIPVNYKI+K E +SGVMRLIV              VIVGVRTLSEGGRVGNFSR
Sbjct: 758  RRLANGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 817

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GMRAAFQLLHMV+EHSVW++DAFDRA
Sbjct: 818  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 877

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 878  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 937

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEV IVGDFTE DIESCILDYLGT  AT         + NP +FRPSPSDLQFQ+VFL
Sbjct: 938  DNMEVCIVGDFTEEDIESCILDYLGTAQATRNHE--REQKFNPPLFRPSPSDLQFQEVFL 995

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLELKDVKMDLQRKLRG 1084
            KD+DERACAYIAGPAPNRWG TV+G DL ESI +    +D+ QS         LQ+ L G
Sbjct: 996  KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASIINDD-QSKSDAQQTQGLQKSLCG 1054

Query: 1085 HPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKV 1264
            HPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KV
Sbjct: 1055 HPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKV 1114

Query: 1265 YKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPRK 1444
            +KAVDACKNVLRGL+S++I +RELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS+PRK
Sbjct: 1115 HKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRK 1174

Query: 1445 DLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELTTG 1624
            D+SCIKDL  LYE ATIEDIYLAYE LKVDE+SL+SCIGIAG+Q  +  +A LE E+   
Sbjct: 1175 DISCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQTAQDIAAPLEEEVADD 1234

Query: 1625 L-QGVFPVGRGLSTMTRPTT 1681
            +  GV PVGRGLSTMTRPTT
Sbjct: 1235 VYPGVIPVGRGLSTMTRPTT 1254


>ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809828 [Glycine max]
          Length = 1257

 Score =  793 bits (2047), Expect = 0.0
 Identities = 407/560 (72%), Positives = 461/560 (82%), Gaps = 1/560 (0%)
 Frame = +2

Query: 5    AIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 184
            AI+AGL             KELI S++L +L+   +P FIP++ E   TK++D+ETGI++
Sbjct: 701  AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGISR 760

Query: 185  CRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFSR 364
             RLSNGIPVNYKI+K E +SGVMRLIV              VIVGVRTLSEGGRVGNFSR
Sbjct: 761  RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 820

Query: 365  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDRA 544
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GMRAAFQLLHMV+EHSVW++DAFDRA
Sbjct: 821  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 880

Query: 545  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 724
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 881  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 940

Query: 725  DNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVFL 904
            DNMEV IVGDFTE DIESCILDYLGT  A            NP +FRPSPSDLQFQ+VFL
Sbjct: 941  DNMEVCIVGDFTEEDIESCILDYLGTAQAARNHE--REKEFNPPLFRPSPSDLQFQEVFL 998

Query: 905  KDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSDELQSLELKDVKMDLQRKLRG 1084
            KD+DERACAYIAGPAPNRWG TV+G DL ESI +  + +D+ QS         LQ+ L G
Sbjct: 999  KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASTINDD-QSKSNAQQTQGLQKSLCG 1057

Query: 1085 HPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKV 1264
            HPLFFGIT+GLL+E+INSRLFT+VRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KV
Sbjct: 1058 HPLFFGITMGLLSEIINSRLFTSVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKV 1117

Query: 1265 YKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQASSIPRK 1444
            +KAVDACKNVLRGL+S++I +RELDRAKRTLLM+HEAE KSNAYWLGL+AH+QASS+PRK
Sbjct: 1118 HKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRK 1177

Query: 1445 DLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSASLEAELTTG 1624
            D+SCIKDL  LYE ATIEDIY AYE LKVDE+SL+SCIGIAG+QA +  +A LE E+   
Sbjct: 1178 DISCIKDLTFLYEVATIEDIYRAYEQLKVDENSLYSCIGIAGAQAAQEIAAPLEEEVADD 1237

Query: 1625 L-QGVFPVGRGLSTMTRPTT 1681
            +  GV PVGRGLSTMTRPTT
Sbjct: 1238 VYPGVIPVGRGLSTMTRPTT 1257


>gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus guttatus]
          Length = 1057

 Score =  790 bits (2041), Expect = 0.0
 Identities = 412/566 (72%), Positives = 462/566 (81%), Gaps = 6/566 (1%)
 Frame = +2

Query: 2    AAIRAGLXXXXXXXXXXXXXKELISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 181
            A+I AGL             KELISS QL +L LQ  P+FIP+ +E   TKVYD+ETGI 
Sbjct: 494  ASIEAGLKEPIEAEPELEIPKELISSEQLQELSLQQPPSFIPVDQEKKMTKVYDEETGII 553

Query: 182  QCRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXXDVIVGVRTLSEGGRVGNFS 361
            Q RLSNGIPVNYKI+K+EA SGVMRLIV              VIVGVRTLSEGGRVGNF+
Sbjct: 554  QRRLSNGIPVNYKISKSEANSGVMRLIVGGGRAAESAESKGAVIVGVRTLSEGGRVGNFT 613

Query: 362  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMRAAFQLLHMVIEHSVWLEDAFDR 541
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GMRAAFQLLHMV+EHSVWLEDAFDR
Sbjct: 614  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDR 673

Query: 542  ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 721
            A+QLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP  LQ+LTL+ VK+AVMNQFV
Sbjct: 674  AKQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPNSLQQLTLEQVKEAVMNQFV 733

Query: 722  GDNMEVSIVGDFTEGDIESCILDYLGTVTATEKPSPGDVHRSNPIIFRPSPSDLQFQQVF 901
             DNMEVSIVGDF+E DIESCIL+YLGTV   E+       + +PI+FRP  +DLQ QQVF
Sbjct: 734  CDNMEVSIVGDFSEEDIESCILEYLGTV--RERKGSERAQKYSPILFRPYTADLQHQQVF 791

Query: 902  LKDSDERACAYIAGPAPNRWGLTVEGRDLFESIRDVQSNSD----ELQSLELKDVKMDLQ 1069
            LKD+DERACAY+AGPAPNRWG T EG++L ES     +  +    E Q  EL++    +Q
Sbjct: 792  LKDTDERACAYVAGPAPNRWGFTFEGKNLLESDSTASTFGEHVKFEEQPQELENSDKVMQ 851

Query: 1070 RKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTS 1249
             KLR HPLFF IT+GLL E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTS
Sbjct: 852  GKLRTHPLFFAITMGLLQEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTS 911

Query: 1250 TPAKVYKAVDACKNVLRGLNSSRIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHVQAS 1429
            TP KV+KAVDACKNVL+GL SSRIA RELDRA+RTLLM+HEAE KSNAYWLGL+AH+QA+
Sbjct: 912  TPGKVHKAVDACKNVLKGLLSSRIAPRELDRARRTLLMRHEAEIKSNAYWLGLMAHLQAT 971

Query: 1430 SIPRKDLSCIKDLQLLYEAATIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEIDSAS--L 1603
            S+PRKD+SCIKDL  LYEAATIED+Y+AYE LKVD++SLFSCIG+AGSQAGE+ + S  L
Sbjct: 972  SVPRKDISCIKDLISLYEAATIEDVYIAYEQLKVDDNSLFSCIGVAGSQAGEVATGSVVL 1031

Query: 1604 EAELTTGLQGVFPVGRGLSTMTRPTT 1681
            E E   GLQ +  VGRG STMTRPTT
Sbjct: 1032 EEESVEGLQNIIQVGRGSSTMTRPTT 1057

