BLASTX nr result

ID: Sinomenium21_contig00007457 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00007457
         (2548 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266...   843   0.0  
emb|CBI40802.3| unnamed protein product [Vitis vinifera]              843   0.0  
ref|XP_007018614.1| Insulinase (Peptidase family M16) family pro...   842   0.0  
ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobro...   839   0.0  
ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613...   818   0.0  
ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citr...   818   0.0  
gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis]          814   0.0  
ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Popu...   812   0.0  
ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phas...   811   0.0  
ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308...   810   0.0  
ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|2...   810   0.0  
ref|XP_002320445.2| pitrilysin family protein [Populus trichocar...   808   0.0  
ref|XP_007018616.1| Insulinase (Peptidase family M16) family pro...   808   0.0  
ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794...   806   0.0  
ref|XP_007018617.1| Insulinase (Peptidase family M16) family pro...   806   0.0  
ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202...   805   0.0  
ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224...   802   0.0  
ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809...   800   0.0  
ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [A...   790   0.0  
gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus...   788   0.0  

>ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266746 [Vitis vinifera]
          Length = 1269

 Score =  843 bits (2179), Expect = 0.0
 Identities = 435/565 (76%), Positives = 482/565 (85%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI+AG             PK+LISSSQL  LR++  P+FIPLS EV+ TKVYD ETGITQ
Sbjct: 710  AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 769

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIPVNYKI++NEA+ GVMRLIV            G V+VGVRTLSEGGRVGNFSR
Sbjct: 770  LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 829

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 830  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 889

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 890  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 949

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEVS+VGDF+E DIESCILDY+GTV A++        +S+ I+FR  PSDLQFQQVFL
Sbjct: 950  DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIE--QQSSSIMFRSYPSDLQFQQVFL 1007

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFT+EG+DLFE+I +    DDE+  S+ L   E+KD + DL
Sbjct: 1008 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLS--EMKDCRKDL 1065

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVT
Sbjct: 1066 QRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1125

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP KVYKAVDACKNVLRGL+SSKIAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH QA
Sbjct: 1126 STPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQA 1185

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            S+VPRKD+SCIKDL  LYEAA+IEDIYLAYE LKVDE+SL+SCIGIAG+QA E +S   E
Sbjct: 1186 STVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-E 1244

Query: 924  DELTTGVQGVFPVGRGLSTMTRPTT 850
            +E   G+QGV P GRGLSTMTRPTT
Sbjct: 1245 EESDEGLQGVIPAGRGLSTMTRPTT 1269


>emb|CBI40802.3| unnamed protein product [Vitis vinifera]
          Length = 1276

 Score =  843 bits (2179), Expect = 0.0
 Identities = 435/565 (76%), Positives = 482/565 (85%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI+AG             PK+LISSSQL  LR++  P+FIPLS EV+ TKVYD ETGITQ
Sbjct: 717  AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 776

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIPVNYKI++NEA+ GVMRLIV            G V+VGVRTLSEGGRVGNFSR
Sbjct: 777  LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 836

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 837  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 896

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 897  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 956

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEVS+VGDF+E DIESCILDY+GTV A++        +S+ I+FR  PSDLQFQQVFL
Sbjct: 957  DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIE--QQSSSIMFRSYPSDLQFQQVFL 1014

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFT+EG+DLFE+I +    DDE+  S+ L   E+KD + DL
Sbjct: 1015 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLS--EMKDCRKDL 1072

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVT
Sbjct: 1073 QRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1132

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP KVYKAVDACKNVLRGL+SSKIAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH QA
Sbjct: 1133 STPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQA 1192

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            S+VPRKD+SCIKDL  LYEAA+IEDIYLAYE LKVDE+SL+SCIGIAG+QA E +S   E
Sbjct: 1193 STVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-E 1251

Query: 924  DELTTGVQGVFPVGRGLSTMTRPTT 850
            +E   G+QGV P GRGLSTMTRPTT
Sbjct: 1252 EESDEGLQGVIPAGRGLSTMTRPTT 1276


>ref|XP_007018614.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma
            cacao] gi|590597455|ref|XP_007018615.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao] gi|508723942|gb|EOY15839.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao] gi|508723943|gb|EOY15840.1| Insulinase
            (Peptidase family M16) family protein isoform 2
            [Theobroma cacao]
          Length = 1285

 Score =  842 bits (2174), Expect = 0.0
 Identities = 436/567 (76%), Positives = 483/567 (85%), Gaps = 1/567 (0%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI++G             PK+LIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 726  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 785

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 786  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 845

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 846  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 905

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 906  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 965

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            GDNMEVSIVGDF+E +IESC+LDYLGTV A++    A  H  +PI+FRPSPSDLQFQQVF
Sbjct: 966  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 1023

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWG TV+GQDL E++    S DD Q +SD     E KD++ D
Sbjct: 1024 LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 1078

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV
Sbjct: 1079 LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1138

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 1139 TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 1198

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928
            ASSVPRKD+SC+K+L  LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG  AGE  +A  
Sbjct: 1199 ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASE 1258

Query: 927  EDELTT-GVQGVFPVGRGLSTMTRPTT 850
            E+E +  G QGV PVGRGLSTMTRPTT
Sbjct: 1259 EEEESDGGFQGVIPVGRGLSTMTRPTT 1285


>ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobroma cacao]
            gi|508723941|gb|EOY15838.1| Insulinase family protein
            isoform 1 [Theobroma cacao]
          Length = 1302

 Score =  839 bits (2167), Expect = 0.0
 Identities = 431/547 (78%), Positives = 476/547 (87%), Gaps = 1/547 (0%)
 Frame = -2

Query: 2487 KQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQRRLSNGIPVNYKITKNEAK 2308
            K+LIS  QL +LR+Q  P+FIPLS E++ TKV DKETGITQ RLSNGIPVNYKI+KNEA+
Sbjct: 763  KELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGITQLRLSNGIPVNYKISKNEAR 822

Query: 2307 SGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 2128
             GVMRLIV            G V+VGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST
Sbjct: 823  GGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 882

Query: 2127 EEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRARQLYLSYYRSIPKSLERST 1948
            EEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRARQLYLSYYRSIPKSLERST
Sbjct: 883  EEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDRARQLYLSYYRSIPKSLERST 942

Query: 1947 AHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVGDNMEVSIVGDFTEGDIESC 1768
            AHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFVGDNMEVSIVGDF+E +IESC
Sbjct: 943  AHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFVGDNMEVSIVGDFSEEEIESC 1002

Query: 1767 ILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW 1588
            +LDYLGTV A++    A  H  +PI+FRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW
Sbjct: 1003 VLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW 1060

Query: 1587 GFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDLQRKLRGHPLFFGITLGLLA 1408
            G TV+GQDL E++    S DD Q +SD     E KD++ DLQ+KLRGHPLFFGIT+GLLA
Sbjct: 1061 GLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKDLQKKLRGHPLFFGITMGLLA 1115

Query: 1407 EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKVYKAVDACKNVLRG 1228
            EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KVY+AVDACKNVLRG
Sbjct: 1116 EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKVYRAVDACKNVLRG 1175

Query: 1227 LNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQASSVPRKDLSCIKDLQLLYE 1048
            L+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH QASSVPRKD+SC+K+L  LYE
Sbjct: 1176 LHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDISCVKELTSLYE 1235

Query: 1047 AASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLEDELTT-GVQGVFPVGRGLS 871
            AASIEDIYLAY+ LKVDEDSL+SCIGIAG  AGE  +A  E+E +  G QGV PVGRGLS
Sbjct: 1236 AASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASEEEEESDGGFQGVIPVGRGLS 1295

Query: 870  TMTRPTT 850
            TMTRPTT
Sbjct: 1296 TMTRPTT 1302


>ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613059 [Citrus sinensis]
          Length = 1259

 Score =  818 bits (2114), Expect = 0.0
 Identities = 423/566 (74%), Positives = 476/566 (84%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI++G             PK+LIS+S+L +L+L+ RP+FIP   E++ TKV+DKE+GITQ
Sbjct: 698  AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 757

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIP+NYKI+K+EA+ GVMRLIV            G VIVGVRTLSEGGRVG FSR
Sbjct: 758  LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 817

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 818  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 877

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG
Sbjct: 878  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 937

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            +NMEVSIVGDF+E +IESCILDYLGTV AT        H  +PI+FRPSPSDL FQQVFL
Sbjct: 938  NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSKRE--HEYSPILFRPSPSDLHFQQVFL 995

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G DLF++I +     D    S+E  S+ LKD++ D 
Sbjct: 996  KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEE--SMMLKDIEKDQ 1053

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT
Sbjct: 1054 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1113

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            S P KV+KAVDACKNVLRGL+S++I QRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1114 SPPGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1173

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKD+SCIKDL  LYEAAS+EDIYLAYE L+VDEDSL+SCIGIAG+QAG+  +A  E
Sbjct: 1174 SSVPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSE 1233

Query: 924  DELTTGVQ-GVFPVGRGLSTMTRPTT 850
            +E   G   GV PVGRGLSTMTRPTT
Sbjct: 1234 EESDEGYPGGVIPVGRGLSTMTRPTT 1259


>ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citrus clementina]
            gi|567885887|ref|XP_006435502.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|567885889|ref|XP_006435503.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|567885891|ref|XP_006435504.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537623|gb|ESR48741.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537624|gb|ESR48742.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537625|gb|ESR48743.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
            gi|557537626|gb|ESR48744.1| hypothetical protein
            CICLE_v10000050mg [Citrus clementina]
          Length = 1260

 Score =  818 bits (2114), Expect = 0.0
 Identities = 423/566 (74%), Positives = 476/566 (84%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI++G             PK+LIS+S+L +L+L+ RP+FIP   E++ TKV+DKE+GITQ
Sbjct: 699  AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 758

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIP+NYKI+K+EA+ GVMRLIV            G VIVGVRTLSEGGRVG FSR
Sbjct: 759  LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 818

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA
Sbjct: 819  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 878

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG
Sbjct: 879  RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 938

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            +NMEVSIVGDF+E +IESCILDYLGTV AT        H  +PI+FRPSPSDL FQQVFL
Sbjct: 939  NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSKRE--HEYSPILFRPSPSDLHFQQVFL 996

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G DLF++I +     D    S+E  S+ LKD++ D 
Sbjct: 997  KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEE--SMMLKDIEKDQ 1054

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT
Sbjct: 1055 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1114

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            S P KV+KAVDACKNVLRGL+S++I QRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1115 SPPGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1174

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKD+SCIKDL  LYEAAS+EDIYLAYE L+VDEDSL+SCIGIAG+QAG+  +A  E
Sbjct: 1175 SSVPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSE 1234

Query: 924  DELTTGVQ-GVFPVGRGLSTMTRPTT 850
            +E   G   GV PVGRGLSTMTRPTT
Sbjct: 1235 EESDEGYPGGVIPVGRGLSTMTRPTT 1260


>gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis]
          Length = 1263

 Score =  814 bits (2102), Expect = 0.0
 Identities = 428/567 (75%), Positives = 472/567 (83%), Gaps = 1/567 (0%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI AG             P +LIS+SQL +L ++ RP+F+ LS E + TK++DKETGIT
Sbjct: 701  AAIEAGLKEPIAAEPELEVPTELISASQLQELWMERRPSFVSLSPETNVTKLHDKETGIT 760

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q  LSNGIPVNYKI+K EA  GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 761  QCCLSNGIPVNYKISKTEACGGVMRLIVGGGRAVECPDSRGAVVVGVRTLSEGGRVGNFS 820

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+E SVWL+DAFDR
Sbjct: 821  REQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLERSVWLDDAFDR 880

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP+ LQ LTLQ VKDAVM+QFV
Sbjct: 881  ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPKSLQNLTLQTVKDAVMDQFV 940

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            G+NMEVSIVGDF+E DIESCILDYLGTV ATK        +  P++FRPSPSDLQ QQVF
Sbjct: 941  GNNMEVSIVGDFSEEDIESCILDYLGTVRATKNSKRE--RQYAPVVFRPSPSDLQSQQVF 998

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWGFTV+G+DLFE+IRS    +D QS S E  S E ++ + D
Sbjct: 999  LKDTDERACAYIAGPAPNRWGFTVDGKDLFESIRSISITEDAQSRSGE--SAEGENTEKD 1056

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
             QRKLR HPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISV
Sbjct: 1057 YQRKLRHHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISV 1116

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTPAKV+KAVDACKNVLRGL+S+KI  RELDRAKRTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 1117 TSTPAKVHKAVDACKNVLRGLHSNKITPRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQ 1176

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928
            ASSVPRKD+SCIKDL LLYEAA IED YLAY+ LKVDEDSL+SCIGIAG+Q  E +SA +
Sbjct: 1177 ASSVPRKDISCIKDLTLLYEAAGIEDAYLAYDQLKVDEDSLYSCIGIAGAQDDEEISASI 1236

Query: 927  -EDELTTGVQGVFPVGRGLSTMTRPTT 850
             ED    G  G+ P+GRGLSTMTRPTT
Sbjct: 1237 EEDGSDEGFPGIAPMGRGLSTMTRPTT 1263


>ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Populus trichocarpa]
            gi|550345688|gb|EEE81021.2| hypothetical protein
            POPTR_0002s23680g [Populus trichocarpa]
          Length = 1268

 Score =  812 bits (2097), Expect = 0.0
 Identities = 415/567 (73%), Positives = 478/567 (84%), Gaps = 1/567 (0%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI++G             PK+LISS+QL +LRL+ RP+F+PL  +   TK++D+ETGIT
Sbjct: 705  AAIKSGLEEAIEAEPELEVPKELISSTQLEELRLERRPSFVPLLPDAGYTKLHDQETGIT 764

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q RLSNGI VNYKI+K+E++ GVMRLIV            G V+VGVRTLSEGGRVG+FS
Sbjct: 765  QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGSFS 824

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+E+SVWL+DAFDR
Sbjct: 825  REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMQAAFELLHMVLENSVWLDDAFDR 884

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV
Sbjct: 885  ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 944

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            G NMEVSIVGDF+E +++SCI+DYLGTV AT+          NP++FRPSPSDLQFQQVF
Sbjct: 945  GGNMEVSIVGDFSEEEVQSCIIDYLGTVRATRDSD--QEQEFNPVMFRPSPSDLQFQQVF 1002

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWGFTV+G DLF+++ S  S   +     E Q ++  DV+ D
Sbjct: 1003 LKDTDERACAYIAGPAPNRWGFTVDGTDLFKSM-SGFSVSADAQPISETQQIDGMDVQKD 1061

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            +Q KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SV
Sbjct: 1062 MQGKLRCHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSV 1121

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP KV+KAVDACK+VLRGL+S+K+AQRELDRA+RTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 1122 TSTPGKVHKAVDACKSVLRGLHSNKVAQRELDRARRTLLMRHEAEIKSNAYWLGLLAHLQ 1181

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928
            ASSVPRKD+SCIKDL  LYEAA+IEDIYLAYE LKVDEDSL+SCIG+AG+QAGE ++A L
Sbjct: 1182 ASSVPRKDVSCIKDLTSLYEAATIEDIYLAYEQLKVDEDSLYSCIGVAGTQAGEEINAPL 1241

Query: 927  E-DELTTGVQGVFPVGRGLSTMTRPTT 850
            E +E   G+QG  PVGRGLSTMTRPTT
Sbjct: 1242 EVEETDDGLQGGIPVGRGLSTMTRPTT 1268


>ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phaseolus vulgaris]
            gi|561030490|gb|ESW29069.1| hypothetical protein
            PHAVU_002G040800g [Phaseolus vulgaris]
          Length = 1247

 Score =  811 bits (2096), Expect = 0.0
 Identities = 419/566 (74%), Positives = 473/566 (83%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI+AG             PK+LI SS+L +L+   +P FIP++ E  +TK+ D+ETGITQ
Sbjct: 691  AIKAGLDEPIQPEPELEVPKELIQSSKLEELKKLRKPAFIPVNPEADSTKLLDEETGITQ 750

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
            RRLSNGIPVNYKI+K E +SGVMRLIV            G VIVGVRTLSEGGRVGNFSR
Sbjct: 751  RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESSDSRGSVIVGVRTLSEGGRVGNFSR 810

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA
Sbjct: 811  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 870

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 871  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 930

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEV IVGDFTE DIESCILDYLGT  AT+  +       NP IFRPSPS+LQFQ+VFL
Sbjct: 931  DNMEVCIVGDFTEEDIESCILDYLGTAQATR--NHGREQEFNPPIFRPSPSELQFQEVFL 988

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G+ L E+I +  + +D+QSNSD  Q+         L
Sbjct: 989  KDTDERACAYIAGPAPNRWGFTVDGKYLLESINNASTTNDDQSNSDAQQT-------QGL 1041

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            Q+ LRGHPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT
Sbjct: 1042 QKSLRGHPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1101

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1102 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1161

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKDLSCIKDL  LYE A+IEDIYLAYE LKVDE+SL+SCIGIAG+Q  + ++A +E
Sbjct: 1162 SSVPRKDLSCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQDAQDIAAPIE 1221

Query: 924  DELTTGV-QGVFPVGRGLSTMTRPTT 850
            +E+   V  GV PVGRGLSTMTRPTT
Sbjct: 1222 EEVAGDVYPGVIPVGRGLSTMTRPTT 1247


>ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308217 [Fragaria vesca
            subsp. vesca]
          Length = 1263

 Score =  810 bits (2093), Expect = 0.0
 Identities = 424/566 (74%), Positives = 472/566 (83%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AA RAG             PK+LISSSQL +LR +  P+FI  S E S TK+YDKETGIT
Sbjct: 705  AATRAGLEDPIEPEPELEVPKELISSSQLQELRQERMPSFITCSPETSMTKIYDKETGIT 764

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            + RLSNGI VNYKI+K+EA+ GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 765  RARLSNGISVNYKISKSEARGGVMRLIVGGGRATESSESKGSVVVGVRTLSEGGRVGNFS 824

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 825  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDR 884

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP  LQ LTLQ VKDAVMNQFV
Sbjct: 885  ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPTSLQNLTLQSVKDAVMNQFV 944

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            G+NMEVSIVGDF+E +IESCILDYLGTV + K        + NP++FR S SDLQ QQVF
Sbjct: 945  GNNMEVSIVGDFSEEEIESCILDYLGTVQSAKHSEVE--QKYNPVVFRAS-SDLQSQQVF 1001

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWGFTV+G+DLF +I    S DD Q  S+EL + E KD + D
Sbjct: 1002 LKDTDERACAYIAGPAPNRWGFTVDGKDLF-SITDISSCDDAQLKSEELVA-EGKDTQKD 1059

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            +QR LRGHPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISV
Sbjct: 1060 MQRTLRGHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISV 1119

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP KV+KAVDACKNVLRGL+S+KI+QRELDRAKRTLLM+HEAE KSN YWLGL+AH Q
Sbjct: 1120 TSTPGKVHKAVDACKNVLRGLHSNKISQRELDRAKRTLLMRHEAEIKSNGYWLGLLAHLQ 1179

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928
            ASSVPRKD+SCIKDL  LYE A+IED+YLAY+ L++D+DSL+SC+GIAG+QAG+ ++   
Sbjct: 1180 ASSVPRKDISCIKDLTTLYEIAAIEDVYLAYDQLRIDDDSLYSCVGIAGAQAGDEITEVE 1239

Query: 927  EDELTTGVQGVFPVGRGLSTMTRPTT 850
            E E   G  GVFPVGRGLSTMTRPTT
Sbjct: 1240 EPE--GGFPGVFPVGRGLSTMTRPTT 1263


>ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|223544476|gb|EEF45995.1|
            pitrilysin, putative [Ricinus communis]
          Length = 1268

 Score =  810 bits (2092), Expect = 0.0
 Identities = 422/566 (74%), Positives = 475/566 (83%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI++G             PK+LIS+SQL +LRLQ RP+F+PL  EV+  K +D+ETGITQ
Sbjct: 707  AIKSGLEEPIEAEPELEVPKELISTSQLEELRLQRRPSFVPLLPEVNILKSHDQETGITQ 766

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGI VNYKI+++E++ GVMRLIV            G VIVGVRTLSEGGRVGNFSR
Sbjct: 767  CRLSNGIAVNYKISRSESRGGVMRLIVGGGRAAETTESKGAVIVGVRTLSEGGRVGNFSR 826

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+EHSVWL+DAFDRA
Sbjct: 827  EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDAFDRA 886

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLER+TAHKLM AMLNGDERFVEPTPQ L+ LTL+ VKDAVMNQFVG
Sbjct: 887  RQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFVEPTPQSLENLTLKSVKDAVMNQFVG 946

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEVSIVGDF+E +IESCI+DYLGTV  T+        +  PI+FRPS SDLQ QQVFL
Sbjct: 947  DNMEVSIVGDFSEEEIESCIIDYLGTVRETR--GSVGAAKFVPILFRPS-SDLQSQQVFL 1003

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G+DLFE+I       D QS S++   +  KDV+ D 
Sbjct: 1004 KDTDERACAYIAGPAPNRWGFTVDGKDLFESISDIAVVPDAQSKSEQ-PLMGRKDVQEDW 1062

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRL LGWYVISVT
Sbjct: 1063 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLNLGWYVISVT 1122

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP+KVYKAVDACK+VLRGL S+KIA RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1123 STPSKVYKAVDACKSVLRGLYSNKIAPRELDRAKRTLLMRHEAEVKSNAYWLGLLAHLQA 1182

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKD+SCIKDL  LYEAA+I+DIYLAYE LK+D+DSL+SCIG+AGSQAG+ ++  LE
Sbjct: 1183 SSVPRKDISCIKDLTSLYEAATIDDIYLAYEQLKIDDDSLYSCIGVAGSQAGDEITVPLE 1242

Query: 924  DELT-TGVQGVFPVGRGLSTMTRPTT 850
            +E T  G QGV PVGRGLSTMTRPTT
Sbjct: 1243 EEETENGFQGVIPVGRGLSTMTRPTT 1268


>ref|XP_002320445.2| pitrilysin family protein [Populus trichocarpa]
            gi|550324212|gb|EEE98760.2| pitrilysin family protein
            [Populus trichocarpa]
          Length = 1267

 Score =  808 bits (2087), Expect = 0.0
 Identities = 414/566 (73%), Positives = 469/566 (82%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI++G             PK+LI+S+QL +LRLQ  P+FIPL  +   TK++D ETGIT
Sbjct: 717  AAIKSGLEEAIEAEPELEVPKELITSTQLEELRLQLTPSFIPLVPDADYTKLHDPETGIT 776

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q RLSNGI VNYKI+K+E++ GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 777  QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGNFS 836

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+EHSVWL+DA DR
Sbjct: 837  REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDALDR 896

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV
Sbjct: 897  ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 956

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            G NMEVSIVGDF+E +IESCI+DYLGTV AT+          NP++FRPSPSDLQFQQVF
Sbjct: 957  GGNMEVSIVGDFSEEEIESCIIDYLGTVRATRDSDRE--QEFNPVMFRPSPSDLQFQQVF 1014

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWGFTV+G+DLFE             ++  +  ++ KDV+ D
Sbjct: 1015 LKDTDERACAYIAGPAPNRWGFTVDGKDLFE-------------STSGISQIDRKDVQKD 1061

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
             Q KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SV
Sbjct: 1062 KQGKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSV 1121

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP KV+KAVDACK+VLRGL+S+K+AQRELDRAKRTLLM+HE E KSNAYWLGL+AH Q
Sbjct: 1122 TSTPGKVHKAVDACKSVLRGLHSNKVAQRELDRAKRTLLMRHETEIKSNAYWLGLLAHLQ 1181

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928
            ASSVPRKD+SCIKDL  LYEAA+IEDIY+AYE LKVDEDSL+SCIG+AG+QAGE ++A  
Sbjct: 1182 ASSVPRKDVSCIKDLTSLYEAATIEDIYVAYEQLKVDEDSLYSCIGVAGAQAGEEINALE 1241

Query: 927  EDELTTGVQGVFPVGRGLSTMTRPTT 850
            E+E     QGV PVGRGLSTMTRPTT
Sbjct: 1242 EEETDDDFQGVIPVGRGLSTMTRPTT 1267


>ref|XP_007018616.1| Insulinase (Peptidase family M16) family protein isoform 4 [Theobroma
            cacao] gi|508723944|gb|EOY15841.1| Insulinase (Peptidase
            family M16) family protein isoform 4 [Theobroma cacao]
          Length = 1018

 Score =  808 bits (2087), Expect = 0.0
 Identities = 415/534 (77%), Positives = 459/534 (85%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI++G             PK+LIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 484  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 543

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 544  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 603

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 604  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 663

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 664  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 723

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            GDNMEVSIVGDF+E +IESC+LDYLGTV A++    A  H  +PI+FRPSPSDLQFQQVF
Sbjct: 724  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 781

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWG TV+GQDL E++    S DD Q +SD     E KD++ D
Sbjct: 782  LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 836

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV
Sbjct: 837  LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 896

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 897  TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 956

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGE 946
            ASSVPRKD+SC+K+L  LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG  AGE
Sbjct: 957  ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGE 1010


>ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794716 [Glycine max]
          Length = 1254

 Score =  806 bits (2082), Expect = 0.0
 Identities = 415/566 (73%), Positives = 470/566 (83%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI+AG             PK+LI S++L +L+   +P FIP++ E   TK++D+ETGIT+
Sbjct: 698  AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGITR 757

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
            RRL+NGIPVNYKI+K E +SGVMRLIV            G VIVGVRTLSEGGRVGNFSR
Sbjct: 758  RRLANGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 817

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA
Sbjct: 818  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 877

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 878  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 937

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEV IVGDFTE DIESCILDYLGT  AT+        + NP +FRPSPSDLQFQ+VFL
Sbjct: 938  DNMEVCIVGDFTEEDIESCILDYLGTAQATRNHERE--QKFNPPLFRPSPSDLQFQEVFL 995

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G DL E+I +    +D+QS SD  Q+         L
Sbjct: 996  KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASIINDDQSKSDAQQT-------QGL 1048

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            Q+ L GHPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT
Sbjct: 1049 QKSLCGHPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1108

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1109 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1168

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKD+SCIKDL  LYE A+IEDIYLAYE LKVDE+SL+SCIGIAG+Q  + ++A LE
Sbjct: 1169 SSVPRKDISCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQTAQDIAAPLE 1228

Query: 924  DELTTGV-QGVFPVGRGLSTMTRPTT 850
            +E+   V  GV PVGRGLSTMTRPTT
Sbjct: 1229 EEVADDVYPGVIPVGRGLSTMTRPTT 1254


>ref|XP_007018617.1| Insulinase (Peptidase family M16) family protein isoform 5, partial
            [Theobroma cacao] gi|508723945|gb|EOY15842.1| Insulinase
            (Peptidase family M16) family protein isoform 5, partial
            [Theobroma cacao]
          Length = 1022

 Score =  806 bits (2082), Expect = 0.0
 Identities = 414/533 (77%), Positives = 458/533 (85%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            AAI++G             PK+LIS  QL +LR+Q  P+FIPLS E++ TKV DKETGIT
Sbjct: 497  AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 556

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            Q RLSNGIPVNYKI+KNEA+ GVMRLIV            G V+VGVRTLSEGGRVGNFS
Sbjct: 557  QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 616

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR
Sbjct: 617  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 676

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV
Sbjct: 677  ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 736

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
            GDNMEVSIVGDF+E +IESC+LDYLGTV A++    A  H  +PI+FRPSPSDLQFQQVF
Sbjct: 737  GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 794

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAYIAGPAPNRWG TV+GQDL E++    S DD Q +SD     E KD++ D
Sbjct: 795  LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 849

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV
Sbjct: 850  LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 909

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 910  TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 969

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG 949
            ASSVPRKD+SC+K+L  LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG  AG
Sbjct: 970  ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAG 1022


>ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202810 [Cucumis sativus]
          Length = 1261

 Score =  805 bits (2079), Expect = 0.0
 Identities = 420/566 (74%), Positives = 473/566 (83%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI AG             PK+LISSSQ+ +LR+QH+P+FI L+ E + TK +DKETGITQ
Sbjct: 706  AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIPVNYKI+K+E K+GVMRLIV            G V+VGVRTLSEGGRVG FSR
Sbjct: 766  CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 826  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 886  KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            +NMEVS+VGDF+E +IESCILDYLGTVTAT     A    S PI+FRPS S+LQFQQVFL
Sbjct: 946  NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAA--LASVPIVFRPSASELQFQQVFL 1003

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYI+GPAPNRWG T EG +L E+I S +S+  E   SD        D++  L
Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESI-SQISRTGESDESD-------NDIEKGL 1055

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVT
Sbjct: 1056 QRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1115

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STPAKVYKAVDACK+VLRGL+S+KIAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1116 STPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1175

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAYL 928
            SSVPRKDLSCIKDL  LYEAA+I+D+Y+AY+ LKVD DSL++CIGIAG+QAG E + ++ 
Sbjct: 1176 SSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFE 1235

Query: 927  EDELTTGVQGVFPVGRGLSTMTRPTT 850
            E+      QGV P GRGLSTMTRPTT
Sbjct: 1236 EEGSDQDFQGVIPSGRGLSTMTRPTT 1261


>ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224074 [Cucumis sativus]
          Length = 1267

 Score =  802 bits (2072), Expect = 0.0
 Identities = 418/566 (73%), Positives = 475/566 (83%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI AG             PK+LISSSQ+ +LR+QH+P+FI L+ E + TK +DKETGITQ
Sbjct: 706  AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIPVNYKI+K+E K+GVMRLIV            G V+VGVRTLSEGGRVG FSR
Sbjct: 766  CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 826  EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG
Sbjct: 886  KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            +NMEVS+VGDF+E +IESCILDYLGTVTAT     A    S PI+FRPS S+LQFQQVFL
Sbjct: 946  NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAA--LASVPIVFRPSASELQFQQVFL 1003

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYI+GPAPNRWG T EG +L E+I S +S+   +   +E+   +  D++  L
Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESI-SQISRTGGEFLCEEVDESD-NDIEKGL 1061

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            QRKLR HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVT
Sbjct: 1062 QRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1121

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STPAKVYKAVDACK+VLRGL+S+KIAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1122 STPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1181

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAYL 928
            SSVPRKDLSCIKDL  LYEAA+I+D+Y+AY+ LKVD DSL++CIGIAG+QAG E + ++ 
Sbjct: 1182 SSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFE 1241

Query: 927  EDELTTGVQGVFPVGRGLSTMTRPTT 850
            E+      QGV P GRGLSTMTRPTT
Sbjct: 1242 EEGSDQDFQGVIPSGRGLSTMTRPTT 1267


>ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809828 [Glycine max]
          Length = 1257

 Score =  800 bits (2066), Expect = 0.0
 Identities = 412/566 (72%), Positives = 469/566 (82%), Gaps = 1/566 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AI+AG             PK+LI S++L +L+   +P FIP++ E   TK++D+ETGI++
Sbjct: 701  AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGISR 760

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
            RRLSNGIPVNYKI+K E +SGVMRLIV            G VIVGVRTLSEGGRVGNFSR
Sbjct: 761  RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 820

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA
Sbjct: 821  EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 880

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G
Sbjct: 881  RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 940

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEV IVGDFTE DIESCILDYLGT  A +          NP +FRPSPSDLQFQ+VFL
Sbjct: 941  DNMEVCIVGDFTEEDIESCILDYLGTAQAARNHERE--KEFNPPLFRPSPSDLQFQEVFL 998

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465
            KDTDERACAYIAGPAPNRWGFTV+G DL E+I +  + +D+QS S+  Q+         L
Sbjct: 999  KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASTINDDQSKSNAQQT-------QGL 1051

Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285
            Q+ L GHPLFFGIT+GLL+E+INSRLFT+VRDSLGLTYDVSFELNLFDRLKLGWYVISVT
Sbjct: 1052 QKSLCGHPLFFGITMGLLSEIINSRLFTSVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1111

Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105
            STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA
Sbjct: 1112 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1171

Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925
            SSVPRKD+SCIKDL  LYE A+IEDIY AYE LKVDE+SL+SCIGIAG+QA + ++A LE
Sbjct: 1172 SSVPRKDISCIKDLTFLYEVATIEDIYRAYEQLKVDENSLYSCIGIAGAQAAQEIAAPLE 1231

Query: 924  DELTTGV-QGVFPVGRGLSTMTRPTT 850
            +E+   V  GV PVGRGLSTMTRPTT
Sbjct: 1232 EEVADDVYPGVIPVGRGLSTMTRPTT 1257


>ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [Amborella trichopoda]
            gi|548853469|gb|ERN11452.1| hypothetical protein
            AMTR_s00022p00070510 [Amborella trichopoda]
          Length = 1274

 Score =  790 bits (2041), Expect = 0.0
 Identities = 410/568 (72%), Positives = 467/568 (82%), Gaps = 3/568 (0%)
 Frame = -2

Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365
            AIR G             PK+LISSS L +L+   +P F+PL+ +V+ T+++D+ETGITQ
Sbjct: 714  AIREGLNEPIEAEPELEVPKELISSSHLSELKSLCKPAFVPLNPDVNATRIFDEETGITQ 773

Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185
             RLSNGIPVNYKIT+NEAK GVMRLIV            G V+VGVRTLSEGGRVGNFSR
Sbjct: 774  CRLSNGIPVNYKITQNEAKGGVMRLIVGGGRANETSESRGSVVVGVRTLSEGGRVGNFSR 833

Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005
            EQVELFCVNHLINCSLESTEEF+ MEFRFTLRDGGM AAFQLLHMV+EHSVWLEDAFDRA
Sbjct: 834  EQVELFCVNHLINCSLESTEEFVCMEFRFTLRDGGMRAAFQLLHMVLEHSVWLEDAFDRA 893

Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825
            RQLYL YYR+IPKSLER+TAHKLM+AMLNGDERF EPTP+ LQ+LTL  VK+AVMNQF G
Sbjct: 894  RQLYLQYYRAIPKSLERATAHKLMIAMLNGDERFFEPTPESLQQLTLPIVKNAVMNQFRG 953

Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645
            DNMEVSIVGDFTE +IESCILDYLGTVTAT      + +   PI FRPSPSDLQ QQVFL
Sbjct: 954  DNMEVSIVGDFTEDEIESCILDYLGTVTATGSTEKGNEY--EPIFFRPSPSDLQSQQVFL 1011

Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETI-RSNLSKDDEQSNSDELQSLELKDVKTD 1468
            KDTDERACAYIAGPAPNRWG T+EGQDLFE + + +L  DDEQ      + +E KD + +
Sbjct: 1012 KDTDERACAYIAGPAPNRWGLTIEGQDLFELVKKGSLVSDDEQR-----KPVESKDGEAN 1066

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            L  K++  PLFF IT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLK GWYVISV
Sbjct: 1067 LSGKIQQLPLFFAITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKFGWYVISV 1126

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP+KVYKAVDACK+VLRGL++SKI QRELDRA+RTLLM+HEAE KSN YWLGL+AH Q
Sbjct: 1127 TSTPSKVYKAVDACKDVLRGLHNSKITQRELDRARRTLLMRHEAEMKSNVYWLGLLAHLQ 1186

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAY 931
            ASS+PRKD+SCIKDL  LYEAA+IED+Y+AY HLKV EDSL+SCIG+AGSQA  E  SA 
Sbjct: 1187 ASSIPRKDISCIKDLTSLYEAATIEDVYVAYNHLKVGEDSLYSCIGVAGSQARVEADSAS 1246

Query: 930  LEDELTTG-VQGVFPVGRGLSTMTRPTT 850
            +  E + G   G+ P+GRGL+TMTRPTT
Sbjct: 1247 VVSEESDGSAAGLIPIGRGLATMTRPTT 1274


>gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus guttatus]
          Length = 1057

 Score =  788 bits (2036), Expect = 0.0
 Identities = 416/568 (73%), Positives = 465/568 (81%), Gaps = 2/568 (0%)
 Frame = -2

Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368
            A+I AG             PK+LISS QL +L LQ  P+FIP+ +E   TKVYD+ETGI 
Sbjct: 494  ASIEAGLKEPIEAEPELEIPKELISSEQLQELSLQQPPSFIPVDQEKKMTKVYDEETGII 553

Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188
            QRRLSNGIPVNYKI+K+EA SGVMRLIV            G VIVGVRTLSEGGRVGNF+
Sbjct: 554  QRRLSNGIPVNYKISKSEANSGVMRLIVGGGRAAESAESKGAVIVGVRTLSEGGRVGNFT 613

Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008
            REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDR
Sbjct: 614  REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDR 673

Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828
            A+QLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP  LQ+LTL+ VK+AVMNQFV
Sbjct: 674  AKQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPNSLQQLTLEQVKEAVMNQFV 733

Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648
             DNMEVSIVGDF+E DIESCIL+YLGTV   K    A   + +PI+FRP  +DLQ QQVF
Sbjct: 734  CDNMEVSIVGDFSEEDIESCILEYLGTVRERKGSERA--QKYSPILFRPYTADLQHQQVF 791

Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468
            LKDTDERACAY+AGPAPNRWGFT EG++L E+  S  S   E    +E Q  EL++    
Sbjct: 792  LKDTDERACAYVAGPAPNRWGFTFEGKNLLES-DSTASTFGEHVKFEE-QPQELENSDKV 849

Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288
            +Q KLR HPLFF IT+GLL E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV
Sbjct: 850  MQGKLRTHPLFFAITMGLLQEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 909

Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108
            TSTP KV+KAVDACKNVL+GL SS+IA RELDRA+RTLLM+HEAE KSNAYWLGL+AH Q
Sbjct: 910  TSTPGKVHKAVDACKNVLKGLLSSRIAPRELDRARRTLLMRHEAEIKSNAYWLGLMAHLQ 969

Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVV--SA 934
            A+SVPRKD+SCIKDL  LYEAA+IED+Y+AYE LKVD++SLFSCIG+AGSQAGEV   S 
Sbjct: 970  ATSVPRKDISCIKDLISLYEAATIEDVYIAYEQLKVDDNSLFSCIGVAGSQAGEVATGSV 1029

Query: 933  YLEDELTTGVQGVFPVGRGLSTMTRPTT 850
             LE+E   G+Q +  VGRG STMTRPTT
Sbjct: 1030 VLEEESVEGLQNIIQVGRGSSTMTRPTT 1057

