BLASTX nr result
ID: Sinomenium21_contig00007457
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00007457 (2548 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266...   843   0.0
emb|CBI40802.3| unnamed protein product [Vitis vinifera]              843   0.0
ref|XP_007018614.1| Insulinase (Peptidase family M16) family pro...   842   0.0
ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobro...   839   0.0
ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613...   818   0.0
ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citr...   818   0.0
gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis]          814   0.0
ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Popu...   812   0.0
ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phas...   811   0.0
ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308...   810   0.0
ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|2...   810   0.0
ref|XP_002320445.2| pitrilysin family protein [Populus trichocar...   808   0.0
ref|XP_007018616.1| Insulinase (Peptidase family M16) family pro...   808   0.0
ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794...   806   0.0
ref|XP_007018617.1| Insulinase (Peptidase family M16) family pro...   806   0.0
ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202...   805   0.0
ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224...   802   0.0
ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809...   800   0.0
ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [A...
790 0.0 gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus... 788 0.0 >ref|XP_002277544.2| PREDICTED: uncharacterized protein LOC100266746 [Vitis vinifera] Length = 1269 Score = 843 bits (2179), Expect = 0.0 Identities = 435/565 (76%), Positives = 482/565 (85%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI+AG PK+LISSSQL LR++ P+FIPLS EV+ TKVYD ETGITQ Sbjct: 710 AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 769 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIPVNYKI++NEA+ GVMRLIV G V+VGVRTLSEGGRVGNFSR Sbjct: 770 LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 829 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA Sbjct: 830 EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 889 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG Sbjct: 890 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 949 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEVS+VGDF+E DIESCILDY+GTV A++ +S+ I+FR PSDLQFQQVFL Sbjct: 950 DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIE--QQSSSIMFRSYPSDLQFQQVFL 1007 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFT+EG+DLFE+I + DDE+ S+ L E+KD + DL Sbjct: 1008 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLS--EMKDCRKDL 1065 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVT Sbjct: 1066 QRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1125 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STP KVYKAVDACKNVLRGL+SSKIAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH QA Sbjct: 1126 STPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQA 1185 Query: 1104 
SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 S+VPRKD+SCIKDL LYEAA+IEDIYLAYE LKVDE+SL+SCIGIAG+QA E +S E Sbjct: 1186 STVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-E 1244 Query: 924 DELTTGVQGVFPVGRGLSTMTRPTT 850 +E G+QGV P GRGLSTMTRPTT Sbjct: 1245 EESDEGLQGVIPAGRGLSTMTRPTT 1269 >emb|CBI40802.3| unnamed protein product [Vitis vinifera] Length = 1276 Score = 843 bits (2179), Expect = 0.0 Identities = 435/565 (76%), Positives = 482/565 (85%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI+AG PK+LISSSQL LR++ P+FIPLS EV+ TKVYD ETGITQ Sbjct: 717 AIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKVYDNETGITQ 776 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIPVNYKI++NEA+ GVMRLIV G V+VGVRTLSEGGRVGNFSR Sbjct: 777 LRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSEGGRVGNFSR 836 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA Sbjct: 837 EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 896 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG Sbjct: 897 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVKDAVMNQFVG 956 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEVS+VGDF+E DIESCILDY+GTV A++ +S+ I+FR PSDLQFQQVFL Sbjct: 957 DNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIE--QQSSSIMFRSYPSDLQFQQVFL 1014 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFT+EG+DLFE+I + DDE+ S+ L E+KD + DL Sbjct: 1015 KDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLS--EMKDCRKDL 1072 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYVISVT Sbjct: 1073 QRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1132 Query: 1284 
STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STP KVYKAVDACKNVLRGL+SSKIAQRELDRAKRTLLM+HEAE+K+NAYWLGL+AH QA Sbjct: 1133 STPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLLAHLQA 1192 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 S+VPRKD+SCIKDL LYEAA+IEDIYLAYE LKVDE+SL+SCIGIAG+QA E +S E Sbjct: 1193 STVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEEISVE-E 1251 Query: 924 DELTTGVQGVFPVGRGLSTMTRPTT 850 +E G+QGV P GRGLSTMTRPTT Sbjct: 1252 EESDEGLQGVIPAGRGLSTMTRPTT 1276 >ref|XP_007018614.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma cacao] gi|590597455|ref|XP_007018615.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma cacao] gi|508723942|gb|EOY15839.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma cacao] gi|508723943|gb|EOY15840.1| Insulinase (Peptidase family M16) family protein isoform 2 [Theobroma cacao] Length = 1285 Score = 842 bits (2174), Expect = 0.0 Identities = 436/567 (76%), Positives = 483/567 (85%), Gaps = 1/567 (0%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI++G PK+LIS QL +LR+Q P+FIPLS E++ TKV DKETGIT Sbjct: 726 AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 785 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q RLSNGIPVNYKI+KNEA+ GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 786 QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 845 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR Sbjct: 846 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 905 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV Sbjct: 906 ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 965 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 
GDNMEVSIVGDF+E +IESC+LDYLGTV A++ A H +PI+FRPSPSDLQFQQVF Sbjct: 966 GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 1023 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWG TV+GQDL E++ S DD Q +SD E KD++ D Sbjct: 1024 LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 1078 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV Sbjct: 1079 LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1138 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 1139 TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 1198 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928 ASSVPRKD+SC+K+L LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG AGE +A Sbjct: 1199 ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASE 1258 Query: 927 EDELTT-GVQGVFPVGRGLSTMTRPTT 850 E+E + G QGV PVGRGLSTMTRPTT Sbjct: 1259 EEEESDGGFQGVIPVGRGLSTMTRPTT 1285 >ref|XP_007018613.1| Insulinase family protein isoform 1 [Theobroma cacao] gi|508723941|gb|EOY15838.1| Insulinase family protein isoform 1 [Theobroma cacao] Length = 1302 Score = 839 bits (2167), Expect = 0.0 Identities = 431/547 (78%), Positives = 476/547 (87%), Gaps = 1/547 (0%) Frame = -2 Query: 2487 KQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQRRLSNGIPVNYKITKNEAK 2308 K+LIS QL +LR+Q P+FIPLS E++ TKV DKETGITQ RLSNGIPVNYKI+KNEA+ Sbjct: 763 KELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGITQLRLSNGIPVNYKISKNEAR 822 Query: 2307 SGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 2128 GVMRLIV G V+VGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST Sbjct: 823 GGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFSREQVELFCVNHLINCSLEST 882 Query: 2127 EEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRARQLYLSYYRSIPKSLERST 1948 EEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRARQLYLSYYRSIPKSLERST Sbjct: 883 
EEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDRARQLYLSYYRSIPKSLERST 942 Query: 1947 AHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVGDNMEVSIVGDFTEGDIESC 1768 AHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFVGDNMEVSIVGDF+E +IESC Sbjct: 943 AHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFVGDNMEVSIVGDFSEEEIESC 1002 Query: 1767 ILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW 1588 +LDYLGTV A++ A H +PI+FRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW Sbjct: 1003 VLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVFLKDTDERACAYIAGPAPNRW 1060 Query: 1587 GFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDLQRKLRGHPLFFGITLGLLA 1408 G TV+GQDL E++ S DD Q +SD E KD++ DLQ+KLRGHPLFFGIT+GLLA Sbjct: 1061 GLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKDLQKKLRGHPLFFGITMGLLA 1115 Query: 1407 EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPAKVYKAVDACKNVLRG 1228 EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTP+KVY+AVDACKNVLRG Sbjct: 1116 EVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVTSTPSKVYRAVDACKNVLRG 1175 Query: 1227 LNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQASSVPRKDLSCIKDLQLLYE 1048 L+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH QASSVPRKD+SC+K+L LYE Sbjct: 1176 LHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDISCVKELTSLYE 1235 Query: 1047 AASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLEDELTT-GVQGVFPVGRGLS 871 AASIEDIYLAY+ LKVDEDSL+SCIGIAG AGE +A E+E + G QGV PVGRGLS Sbjct: 1236 AASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTASEEEEESDGGFQGVIPVGRGLS 1295 Query: 870 TMTRPTT 850 TMTRPTT Sbjct: 1296 TMTRPTT 1302 >ref|XP_006494428.1| PREDICTED: uncharacterized protein LOC102613059 [Citrus sinensis] Length = 1259 Score = 818 bits (2114), Expect = 0.0 Identities = 423/566 (74%), Positives = 476/566 (84%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI++G PK+LIS+S+L +L+L+ RP+FIP E++ TKV+DKE+GITQ Sbjct: 698 AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 757 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIP+NYKI+K+EA+ GVMRLIV G VIVGVRTLSEGGRVG FSR Sbjct: 758 
LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 817 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA Sbjct: 818 EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 877 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG Sbjct: 878 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 937 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 +NMEVSIVGDF+E +IESCILDYLGTV AT H +PI+FRPSPSDL FQQVFL Sbjct: 938 NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSKRE--HEYSPILFRPSPSDLHFQQVFL 995 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G DLF++I + D S+E S+ LKD++ D Sbjct: 996 KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEE--SMMLKDIEKDQ 1053 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT Sbjct: 1054 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1113 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 S P KV+KAVDACKNVLRGL+S++I QRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1114 SPPGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1173 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKD+SCIKDL LYEAAS+EDIYLAYE L+VDEDSL+SCIGIAG+QAG+ +A E Sbjct: 1174 SSVPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSE 1233 Query: 924 DELTTGVQ-GVFPVGRGLSTMTRPTT 850 +E G GV PVGRGLSTMTRPTT Sbjct: 1234 EESDEGYPGGVIPVGRGLSTMTRPTT 1259 >ref|XP_006435501.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|567885887|ref|XP_006435502.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|567885889|ref|XP_006435503.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|567885891|ref|XP_006435504.1| hypothetical protein 
CICLE_v10000050mg [Citrus clementina] gi|557537623|gb|ESR48741.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|557537624|gb|ESR48742.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|557537625|gb|ESR48743.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] gi|557537626|gb|ESR48744.1| hypothetical protein CICLE_v10000050mg [Citrus clementina] Length = 1260 Score = 818 bits (2114), Expect = 0.0 Identities = 423/566 (74%), Positives = 476/566 (84%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI++G PK+LIS+S+L +L+L+ RP+FIP E++ TKV+DKE+GITQ Sbjct: 699 AIKSGMEEPIEAEPELEVPKELISASELEELKLRCRPSFIPPRPELNVTKVHDKESGITQ 758 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIP+NYKI+K+EA+ GVMRLIV G VIVGVRTLSEGGRVG FSR Sbjct: 759 LRLSNGIPINYKISKSEAQGGVMRLIVGGGRAAESSESRGAVIVGVRTLSEGGRVGKFSR 818 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDRA Sbjct: 819 EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDRA 878 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTP+ L+ L L+ VK+AVMNQFVG Sbjct: 879 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPKSLENLNLKSVKEAVMNQFVG 938 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 +NMEVSIVGDF+E +IESCILDYLGTV AT H +PI+FRPSPSDL FQQVFL Sbjct: 939 NNMEVSIVGDFSEEEIESCILDYLGTVRATNDSKRE--HEYSPILFRPSPSDLHFQQVFL 996 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G DLF++I + D S+E S+ LKD++ D Sbjct: 997 KDTDERACAYIAGPAPNRWGFTVDGMDLFKSIDNTSCSFDMPPKSEE--SMMLKDIEKDQ 1054 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT Sbjct: 1055 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1114 Query: 1284 
STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 S P KV+KAVDACKNVLRGL+S++I QRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1115 SPPGKVHKAVDACKNVLRGLHSNRIVQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1174 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKD+SCIKDL LYEAAS+EDIYLAYE L+VDEDSL+SCIGIAG+QAG+ +A E Sbjct: 1175 SSVPRKDISCIKDLMSLYEAASVEDIYLAYEQLRVDEDSLYSCIGIAGAQAGDEETASSE 1234 Query: 924 DELTTGVQ-GVFPVGRGLSTMTRPTT 850 +E G GV PVGRGLSTMTRPTT Sbjct: 1235 EESDEGYPGGVIPVGRGLSTMTRPTT 1260 >gb|EXB56235.1| putative zinc protease pqqL [Morus notabilis] Length = 1263 Score = 814 bits (2102), Expect = 0.0 Identities = 428/567 (75%), Positives = 472/567 (83%), Gaps = 1/567 (0%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI AG P +LIS+SQL +L ++ RP+F+ LS E + TK++DKETGIT Sbjct: 701 AAIEAGLKEPIAAEPELEVPTELISASQLQELWMERRPSFVSLSPETNVTKLHDKETGIT 760 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q LSNGIPVNYKI+K EA GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 761 QCCLSNGIPVNYKISKTEACGGVMRLIVGGGRAVECPDSRGAVVVGVRTLSEGGRVGNFS 820 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+E SVWL+DAFDR Sbjct: 821 REQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLERSVWLDDAFDR 880 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP+ LQ LTLQ VKDAVM+QFV Sbjct: 881 ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPKSLQNLTLQTVKDAVMDQFV 940 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 G+NMEVSIVGDF+E DIESCILDYLGTV ATK + P++FRPSPSDLQ QQVF Sbjct: 941 GNNMEVSIVGDFSEEDIESCILDYLGTVRATKNSKRE--RQYAPVVFRPSPSDLQSQQVF 998 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWGFTV+G+DLFE+IRS +D QS S E S E ++ + D Sbjct: 999 LKDTDERACAYIAGPAPNRWGFTVDGKDLFESIRSISITEDAQSRSGE--SAEGENTEKD 1056 Query: 1467 
LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 QRKLR HPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISV Sbjct: 1057 YQRKLRHHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISV 1116 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTPAKV+KAVDACKNVLRGL+S+KI RELDRAKRTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 1117 TSTPAKVHKAVDACKNVLRGLHSNKITPRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQ 1176 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928 ASSVPRKD+SCIKDL LLYEAA IED YLAY+ LKVDEDSL+SCIGIAG+Q E +SA + Sbjct: 1177 ASSVPRKDISCIKDLTLLYEAAGIEDAYLAYDQLKVDEDSLYSCIGIAGAQDDEEISASI 1236 Query: 927 -EDELTTGVQGVFPVGRGLSTMTRPTT 850 ED G G+ P+GRGLSTMTRPTT Sbjct: 1237 EEDGSDEGFPGIAPMGRGLSTMTRPTT 1263 >ref|XP_002301748.2| hypothetical protein POPTR_0002s23680g [Populus trichocarpa] gi|550345688|gb|EEE81021.2| hypothetical protein POPTR_0002s23680g [Populus trichocarpa] Length = 1268 Score = 812 bits (2097), Expect = 0.0 Identities = 415/567 (73%), Positives = 478/567 (84%), Gaps = 1/567 (0%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI++G PK+LISS+QL +LRL+ RP+F+PL + TK++D+ETGIT Sbjct: 705 AAIKSGLEEAIEAEPELEVPKELISSTQLEELRLERRPSFVPLLPDAGYTKLHDQETGIT 764 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q RLSNGI VNYKI+K+E++ GVMRLIV G V+VGVRTLSEGGRVG+FS Sbjct: 765 QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGSFS 824 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+E+SVWL+DAFDR Sbjct: 825 REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMQAAFELLHMVLENSVWLDDAFDR 884 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV Sbjct: 885 ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 944 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 G NMEVSIVGDF+E +++SCI+DYLGTV AT+ 
NP++FRPSPSDLQFQQVF Sbjct: 945 GGNMEVSIVGDFSEEEVQSCIIDYLGTVRATRDSD--QEQEFNPVMFRPSPSDLQFQQVF 1002 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWGFTV+G DLF+++ S S + E Q ++ DV+ D Sbjct: 1003 LKDTDERACAYIAGPAPNRWGFTVDGTDLFKSM-SGFSVSADAQPISETQQIDGMDVQKD 1061 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 +Q KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SV Sbjct: 1062 MQGKLRCHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSV 1121 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP KV+KAVDACK+VLRGL+S+K+AQRELDRA+RTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 1122 TSTPGKVHKAVDACKSVLRGLHSNKVAQRELDRARRTLLMRHEAEIKSNAYWLGLLAHLQ 1181 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928 ASSVPRKD+SCIKDL LYEAA+IEDIYLAYE LKVDEDSL+SCIG+AG+QAGE ++A L Sbjct: 1182 ASSVPRKDVSCIKDLTSLYEAATIEDIYLAYEQLKVDEDSLYSCIGVAGTQAGEEINAPL 1241 Query: 927 E-DELTTGVQGVFPVGRGLSTMTRPTT 850 E +E G+QG PVGRGLSTMTRPTT Sbjct: 1242 EVEETDDGLQGGIPVGRGLSTMTRPTT 1268 >ref|XP_007157075.1| hypothetical protein PHAVU_002G040800g [Phaseolus vulgaris] gi|561030490|gb|ESW29069.1| hypothetical protein PHAVU_002G040800g [Phaseolus vulgaris] Length = 1247 Score = 811 bits (2096), Expect = 0.0 Identities = 419/566 (74%), Positives = 473/566 (83%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI+AG PK+LI SS+L +L+ +P FIP++ E +TK+ D+ETGITQ Sbjct: 691 AIKAGLDEPIQPEPELEVPKELIQSSKLEELKKLRKPAFIPVNPEADSTKLLDEETGITQ 750 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RRLSNGIPVNYKI+K E +SGVMRLIV G VIVGVRTLSEGGRVGNFSR Sbjct: 751 RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESSDSRGSVIVGVRTLSEGGRVGNFSR 810 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA Sbjct: 811 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 870 Query: 
2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G Sbjct: 871 RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 930 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEV IVGDFTE DIESCILDYLGT AT+ + NP IFRPSPS+LQFQ+VFL Sbjct: 931 DNMEVCIVGDFTEEDIESCILDYLGTAQATR--NHGREQEFNPPIFRPSPSELQFQEVFL 988 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G+ L E+I + + +D+QSNSD Q+ L Sbjct: 989 KDTDERACAYIAGPAPNRWGFTVDGKYLLESINNASTTNDDQSNSDAQQT-------QGL 1041 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 Q+ LRGHPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT Sbjct: 1042 QKSLRGHPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1101 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1102 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1161 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKDLSCIKDL LYE A+IEDIYLAYE LKVDE+SL+SCIGIAG+Q + ++A +E Sbjct: 1162 SSVPRKDLSCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQDAQDIAAPIE 1221 Query: 924 DELTTGV-QGVFPVGRGLSTMTRPTT 850 +E+ V GV PVGRGLSTMTRPTT Sbjct: 1222 EEVAGDVYPGVIPVGRGLSTMTRPTT 1247 >ref|XP_004307194.1| PREDICTED: uncharacterized protein LOC101308217 [Fragaria vesca subsp. 
vesca] Length = 1263 Score = 810 bits (2093), Expect = 0.0 Identities = 424/566 (74%), Positives = 472/566 (83%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AA RAG PK+LISSSQL +LR + P+FI S E S TK+YDKETGIT Sbjct: 705 AATRAGLEDPIEPEPELEVPKELISSSQLQELRQERMPSFITCSPETSMTKIYDKETGIT 764 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 + RLSNGI VNYKI+K+EA+ GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 765 RARLSNGISVNYKISKSEARGGVMRLIVGGGRATESSESKGSVVVGVRTLSEGGRVGNFS 824 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR Sbjct: 825 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLDDAFDR 884 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP LQ LTLQ VKDAVMNQFV Sbjct: 885 ARQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPTSLQNLTLQSVKDAVMNQFV 944 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 G+NMEVSIVGDF+E +IESCILDYLGTV + K + NP++FR S SDLQ QQVF Sbjct: 945 GNNMEVSIVGDFSEEEIESCILDYLGTVQSAKHSEVE--QKYNPVVFRAS-SDLQSQQVF 1001 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWGFTV+G+DLF +I S DD Q S+EL + E KD + D Sbjct: 1002 LKDTDERACAYIAGPAPNRWGFTVDGKDLF-SITDISSCDDAQLKSEELVA-EGKDTQKD 1059 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 +QR LRGHPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFELNLFDRL LGWYVISV Sbjct: 1060 MQRTLRGHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWYVISV 1119 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP KV+KAVDACKNVLRGL+S+KI+QRELDRAKRTLLM+HEAE KSN YWLGL+AH Q Sbjct: 1120 TSTPGKVHKAVDACKNVLRGLHSNKISQRELDRAKRTLLMRHEAEIKSNGYWLGLLAHLQ 1179 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928 ASSVPRKD+SCIKDL LYE A+IED+YLAY+ L++D+DSL+SC+GIAG+QAG+ ++ Sbjct: 1180 
ASSVPRKDISCIKDLTTLYEIAAIEDVYLAYDQLRIDDDSLYSCVGIAGAQAGDEITEVE 1239 Query: 927 EDELTTGVQGVFPVGRGLSTMTRPTT 850 E E G GVFPVGRGLSTMTRPTT Sbjct: 1240 EPE--GGFPGVFPVGRGLSTMTRPTT 1263 >ref|XP_002516378.1| pitrilysin, putative [Ricinus communis] gi|223544476|gb|EEF45995.1| pitrilysin, putative [Ricinus communis] Length = 1268 Score = 810 bits (2092), Expect = 0.0 Identities = 422/566 (74%), Positives = 475/566 (83%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI++G PK+LIS+SQL +LRLQ RP+F+PL EV+ K +D+ETGITQ Sbjct: 707 AIKSGLEEPIEAEPELEVPKELISTSQLEELRLQRRPSFVPLLPEVNILKSHDQETGITQ 766 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGI VNYKI+++E++ GVMRLIV G VIVGVRTLSEGGRVGNFSR Sbjct: 767 CRLSNGIAVNYKISRSESRGGVMRLIVGGGRAAETTESKGAVIVGVRTLSEGGRVGNFSR 826 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+EHSVWL+DAFDRA Sbjct: 827 EQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDAFDRA 886 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLER+TAHKLM AMLNGDERFVEPTPQ L+ LTL+ VKDAVMNQFVG Sbjct: 887 RQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFVEPTPQSLENLTLKSVKDAVMNQFVG 946 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEVSIVGDF+E +IESCI+DYLGTV T+ + PI+FRPS SDLQ QQVFL Sbjct: 947 DNMEVSIVGDFSEEEIESCIIDYLGTVRETR--GSVGAAKFVPILFRPS-SDLQSQQVFL 1003 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G+DLFE+I D QS S++ + KDV+ D Sbjct: 1004 KDTDERACAYIAGPAPNRWGFTVDGKDLFESISDIAVVPDAQSKSEQ-PLMGRKDVQEDW 1062 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRL LGWYVISVT Sbjct: 1063 QRKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLNLGWYVISVT 1122 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 
STP+KVYKAVDACK+VLRGL S+KIA RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1123 STPSKVYKAVDACKSVLRGLYSNKIAPRELDRAKRTLLMRHEAEVKSNAYWLGLLAHLQA 1182 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKD+SCIKDL LYEAA+I+DIYLAYE LK+D+DSL+SCIG+AGSQAG+ ++ LE Sbjct: 1183 SSVPRKDISCIKDLTSLYEAATIDDIYLAYEQLKIDDDSLYSCIGVAGSQAGDEITVPLE 1242 Query: 924 DELT-TGVQGVFPVGRGLSTMTRPTT 850 +E T G QGV PVGRGLSTMTRPTT Sbjct: 1243 EEETENGFQGVIPVGRGLSTMTRPTT 1268 >ref|XP_002320445.2| pitrilysin family protein [Populus trichocarpa] gi|550324212|gb|EEE98760.2| pitrilysin family protein [Populus trichocarpa] Length = 1267 Score = 808 bits (2087), Expect = 0.0 Identities = 414/566 (73%), Positives = 469/566 (82%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI++G PK+LI+S+QL +LRLQ P+FIPL + TK++D ETGIT Sbjct: 717 AAIKSGLEEAIEAEPELEVPKELITSTQLEELRLQLTPSFIPLVPDADYTKLHDPETGIT 776 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q RLSNGI VNYKI+K+E++ GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 777 QCRLSNGIAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGNFS 836 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFI MEFRFTLRD GM AAF+LLHMV+EHSVWL+DA DR Sbjct: 837 REQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDALDR 896 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLER+TAHKLM AMLNGDERF+EPTPQ LQ LTL+ VKDAVMNQFV Sbjct: 897 ARQLYLSYYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFV 956 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 G NMEVSIVGDF+E +IESCI+DYLGTV AT+ NP++FRPSPSDLQFQQVF Sbjct: 957 GGNMEVSIVGDFSEEEIESCIIDYLGTVRATRDSDRE--QEFNPVMFRPSPSDLQFQQVF 1014 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWGFTV+G+DLFE ++ + ++ KDV+ D Sbjct: 1015 LKDTDERACAYIAGPAPNRWGFTVDGKDLFE-------------STSGISQIDRKDVQKD 1061 Query: 1467 
LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 Q KLR HPLFFGIT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLKLGWYV+SV Sbjct: 1062 KQGKLRSHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSV 1121 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP KV+KAVDACK+VLRGL+S+K+AQRELDRAKRTLLM+HE E KSNAYWLGL+AH Q Sbjct: 1122 TSTPGKVHKAVDACKSVLRGLHSNKVAQRELDRAKRTLLMRHETEIKSNAYWLGLLAHLQ 1181 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYL 928 ASSVPRKD+SCIKDL LYEAA+IEDIY+AYE LKVDEDSL+SCIG+AG+QAGE ++A Sbjct: 1182 ASSVPRKDVSCIKDLTSLYEAATIEDIYVAYEQLKVDEDSLYSCIGVAGAQAGEEINALE 1241 Query: 927 EDELTTGVQGVFPVGRGLSTMTRPTT 850 E+E QGV PVGRGLSTMTRPTT Sbjct: 1242 EEETDDDFQGVIPVGRGLSTMTRPTT 1267 >ref|XP_007018616.1| Insulinase (Peptidase family M16) family protein isoform 4 [Theobroma cacao] gi|508723944|gb|EOY15841.1| Insulinase (Peptidase family M16) family protein isoform 4 [Theobroma cacao] Length = 1018 Score = 808 bits (2087), Expect = 0.0 Identities = 415/534 (77%), Positives = 459/534 (85%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI++G PK+LIS QL +LR+Q P+FIPLS E++ TKV DKETGIT Sbjct: 484 AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 543 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q RLSNGIPVNYKI+KNEA+ GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 544 QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 603 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR Sbjct: 604 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 663 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV Sbjct: 664 ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 723 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 GDNMEVSIVGDF+E 
+IESC+LDYLGTV A++ A H +PI+FRPSPSDLQFQQVF Sbjct: 724 GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 781 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWG TV+GQDL E++ S DD Q +SD E KD++ D Sbjct: 782 LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 836 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV Sbjct: 837 LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 896 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 897 TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 956 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGE 946 ASSVPRKD+SC+K+L LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG AGE Sbjct: 957 ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGE 1010 >ref|XP_006573851.1| PREDICTED: uncharacterized protein LOC100794716 [Glycine max] Length = 1254 Score = 806 bits (2082), Expect = 0.0 Identities = 415/566 (73%), Positives = 470/566 (83%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI+AG PK+LI S++L +L+ +P FIP++ E TK++D+ETGIT+ Sbjct: 698 AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGITR 757 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RRL+NGIPVNYKI+K E +SGVMRLIV G VIVGVRTLSEGGRVGNFSR Sbjct: 758 RRLANGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 817 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA Sbjct: 818 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 877 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G Sbjct: 878 RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 
937 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEV IVGDFTE DIESCILDYLGT AT+ + NP +FRPSPSDLQFQ+VFL Sbjct: 938 DNMEVCIVGDFTEEDIESCILDYLGTAQATRNHERE--QKFNPPLFRPSPSDLQFQEVFL 995 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G DL E+I + +D+QS SD Q+ L Sbjct: 996 KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASIINDDQSKSDAQQT-------QGL 1048 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 Q+ L GHPLFFGIT+GLL+E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT Sbjct: 1049 QKSLCGHPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1108 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1109 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1168 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKD+SCIKDL LYE A+IEDIYLAYE LKVDE+SL+SCIGIAG+Q + ++A LE Sbjct: 1169 SSVPRKDISCIKDLTFLYEVATIEDIYLAYEQLKVDENSLYSCIGIAGAQTAQDIAAPLE 1228 Query: 924 DELTTGV-QGVFPVGRGLSTMTRPTT 850 +E+ V GV PVGRGLSTMTRPTT Sbjct: 1229 EEVADDVYPGVIPVGRGLSTMTRPTT 1254 >ref|XP_007018617.1| Insulinase (Peptidase family M16) family protein isoform 5, partial [Theobroma cacao] gi|508723945|gb|EOY15842.1| Insulinase (Peptidase family M16) family protein isoform 5, partial [Theobroma cacao] Length = 1022 Score = 806 bits (2082), Expect = 0.0 Identities = 414/533 (77%), Positives = 458/533 (85%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 AAI++G PK+LIS QL +LR+Q P+FIPLS E++ TKV DKETGIT Sbjct: 497 AAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKVQDKETGIT 556 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 Q RLSNGIPVNYKI+KNEA+ GVMRLIV G V+VGVRTLSEGGRVGNFS Sbjct: 557 QLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSEGGRVGNFS 616 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 
REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWL+DAFDR Sbjct: 617 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSVWLDDAFDR 676 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 ARQLYLSYYRSIPKSLERSTAHKLMLAM+NGDERFVEPTP+ LQ LTL+ VKDAVMNQFV Sbjct: 677 ARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVKDAVMNQFV 736 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 GDNMEVSIVGDF+E +IESC+LDYLGTV A++ A H +PI+FRPSPSDLQFQQVF Sbjct: 737 GDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERA--HGFSPILFRPSPSDLQFQQVF 794 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAYIAGPAPNRWG TV+GQDL E++ S DD Q +SD E KD++ D Sbjct: 795 LKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSD-----EGKDIQKD 849 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 LQ+KLRGHPLFFGIT+GLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV Sbjct: 850 LQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 909 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP+KVY+AVDACKNVLRGL+++KIA REL+RAKRTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 910 TSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHLQ 969 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG 949 ASSVPRKD+SC+K+L LYEAASIEDIYLAY+ LKVDEDSL+SCIGIAG AG Sbjct: 970 ASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAG 1022 >ref|XP_004152885.1| PREDICTED: uncharacterized protein LOC101202810 [Cucumis sativus] Length = 1261 Score = 805 bits (2079), Expect = 0.0 Identities = 420/566 (74%), Positives = 473/566 (83%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI AG PK+LISSSQ+ +LR+QH+P+FI L+ E + TK +DKETGITQ Sbjct: 706 AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIPVNYKI+K+E K+GVMRLIV G V+VGVRTLSEGGRVG FSR Sbjct: 766 
CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDRA Sbjct: 826 EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG Sbjct: 886 KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 +NMEVS+VGDF+E +IESCILDYLGTVTAT A S PI+FRPS S+LQFQQVFL Sbjct: 946 NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAA--LASVPIVFRPSASELQFQQVFL 1003 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYI+GPAPNRWG T EG +L E+I S +S+ E SD D++ L Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESI-SQISRTGESDESD-------NDIEKGL 1055 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVT Sbjct: 1056 QRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1115 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STPAKVYKAVDACK+VLRGL+S+KIAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1116 STPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1175 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAYL 928 SSVPRKDLSCIKDL LYEAA+I+D+Y+AY+ LKVD DSL++CIGIAG+QAG E + ++ Sbjct: 1176 SSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFE 1235 Query: 927 EDELTTGVQGVFPVGRGLSTMTRPTT 850 E+ QGV P GRGLSTMTRPTT Sbjct: 1236 EEGSDQDFQGVIPSGRGLSTMTRPTT 1261 >ref|XP_004155123.1| PREDICTED: uncharacterized protein LOC101224074 [Cucumis sativus] Length = 1267 Score = 802 bits (2072), Expect = 0.0 Identities = 418/566 (73%), Positives = 475/566 (83%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI AG PK+LISSSQ+ +LR+QH+P+FI L+ 
E + TK +DKETGITQ Sbjct: 706 AIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQ 765 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIPVNYKI+K+E K+GVMRLIV G V+VGVRTLSEGGRVG FSR Sbjct: 766 CRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSR 825 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFI+MEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDRA Sbjct: 826 EQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRA 885 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 +QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEP+P+ LQ LTLQ VKDAVMNQFVG Sbjct: 886 KQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVG 945 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 +NMEVS+VGDF+E +IESCILDYLGTVTAT A S PI+FRPS S+LQFQQVFL Sbjct: 946 NNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAA--LASVPIVFRPSASELQFQQVFL 1003 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYI+GPAPNRWG T EG +L E+I S +S+ + +E+ + D++ L Sbjct: 1004 KDTDERACAYISGPAPNRWGVTFEGLELLESI-SQISRTGGEFLCEEVDESD-NDIEKGL 1061 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 QRKLR HPLFFGIT+GLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVISVT Sbjct: 1062 QRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVT 1121 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STPAKVYKAVDACK+VLRGL+S+KIAQRELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1122 STPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1181 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAYL 928 SSVPRKDLSCIKDL LYEAA+I+D+Y+AY+ LKVD DSL++CIGIAG+QAG E + ++ Sbjct: 1182 SSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFE 1241 Query: 927 EDELTTGVQGVFPVGRGLSTMTRPTT 850 E+ QGV P GRGLSTMTRPTT Sbjct: 1242 EEGSDQDFQGVIPSGRGLSTMTRPTT 1267 >ref|XP_003537738.1| PREDICTED: uncharacterized protein LOC100809828 [Glycine max] Length = 1257 
Score = 800 bits (2066), Expect = 0.0 Identities = 412/566 (72%), Positives = 469/566 (82%), Gaps = 1/566 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AI+AG PK+LI S++L +L+ +P FIP++ E TK++D+ETGI++ Sbjct: 701 AIKAGLDEPIQPEPELEVPKELIQSTKLEELKKLRKPAFIPVNPETDATKLHDEETGISR 760 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RRLSNGIPVNYKI+K E +SGVMRLIV G VIVGVRTLSEGGRVGNFSR Sbjct: 761 RRLSNGIPVNYKISKTETQSGVMRLIVGGGRAAESPESRGSVIVGVRTLSEGGRVGNFSR 820 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVW++DAFDRA Sbjct: 821 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWVDDAFDRA 880 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYLSYYRSIPKSLERSTAHKLM+AML+GDERF+EPTP+ L+ LTLQ VKDAVMNQF G Sbjct: 881 RQLYLSYYRSIPKSLERSTAHKLMVAMLDGDERFIEPTPKSLENLTLQSVKDAVMNQFFG 940 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEV IVGDFTE DIESCILDYLGT A + NP +FRPSPSDLQFQ+VFL Sbjct: 941 DNMEVCIVGDFTEEDIESCILDYLGTAQAARNHERE--KEFNPPLFRPSPSDLQFQEVFL 998 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTDL 1465 KDTDERACAYIAGPAPNRWGFTV+G DL E+I + + +D+QS S+ Q+ L Sbjct: 999 KDTDERACAYIAGPAPNRWGFTVDGVDLLESINNASTINDDQSKSNAQQT-------QGL 1051 Query: 1464 QRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1285 Q+ L GHPLFFGIT+GLL+E+INSRLFT+VRDSLGLTYDVSFELNLFDRLKLGWYVISVT Sbjct: 1052 QKSLCGHPLFFGITMGLLSEIINSRLFTSVRDSLGLTYDVSFELNLFDRLKLGWYVISVT 1111 Query: 1284 STPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQA 1105 STP+KV+KAVDACKNVLRGL+S+KI +RELDRAKRTLLM+HEAE KSNAYWLGL+AH QA Sbjct: 1112 STPSKVHKAVDACKNVLRGLHSNKITERELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQA 1171 Query: 1104 SSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVVSAYLE 925 SSVPRKD+SCIKDL LYE A+IEDIY AYE LKVDE+SL+SCIGIAG+QA + ++A LE Sbjct: 1172 
SSVPRKDISCIKDLTFLYEVATIEDIYRAYEQLKVDENSLYSCIGIAGAQAAQEIAAPLE 1231 Query: 924 DELTTGV-QGVFPVGRGLSTMTRPTT 850 +E+ V GV PVGRGLSTMTRPTT Sbjct: 1232 EEVADDVYPGVIPVGRGLSTMTRPTT 1257 >ref|XP_006849871.1| hypothetical protein AMTR_s00022p00070510 [Amborella trichopoda] gi|548853469|gb|ERN11452.1| hypothetical protein AMTR_s00022p00070510 [Amborella trichopoda] Length = 1274 Score = 790 bits (2041), Expect = 0.0 Identities = 410/568 (72%), Positives = 467/568 (82%), Gaps = 3/568 (0%) Frame = -2 Query: 2544 AIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGITQ 2365 AIR G PK+LISSS L +L+ +P F+PL+ +V+ T+++D+ETGITQ Sbjct: 714 AIREGLNEPIEAEPELEVPKELISSSHLSELKSLCKPAFVPLNPDVNATRIFDEETGITQ 773 Query: 2364 RRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFSR 2185 RLSNGIPVNYKIT+NEAK GVMRLIV G V+VGVRTLSEGGRVGNFSR Sbjct: 774 CRLSNGIPVNYKITQNEAKGGVMRLIVGGGRANETSESRGSVVVGVRTLSEGGRVGNFSR 833 Query: 2184 EQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDRA 2005 EQVELFCVNHLINCSLESTEEF+ MEFRFTLRDGGM AAFQLLHMV+EHSVWLEDAFDRA Sbjct: 834 EQVELFCVNHLINCSLESTEEFVCMEFRFTLRDGGMRAAFQLLHMVLEHSVWLEDAFDRA 893 Query: 2004 RQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFVG 1825 RQLYL YYR+IPKSLER+TAHKLM+AMLNGDERF EPTP+ LQ+LTL VK+AVMNQF G Sbjct: 894 RQLYLQYYRAIPKSLERATAHKLMIAMLNGDERFFEPTPESLQQLTLPIVKNAVMNQFRG 953 Query: 1824 DNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVFL 1645 DNMEVSIVGDFTE +IESCILDYLGTVTAT + + PI FRPSPSDLQ QQVFL Sbjct: 954 DNMEVSIVGDFTEDEIESCILDYLGTVTATGSTEKGNEY--EPIFFRPSPSDLQSQQVFL 1011 Query: 1644 KDTDERACAYIAGPAPNRWGFTVEGQDLFETI-RSNLSKDDEQSNSDELQSLELKDVKTD 1468 KDTDERACAYIAGPAPNRWG T+EGQDLFE + + +L DDEQ + +E KD + + Sbjct: 1012 KDTDERACAYIAGPAPNRWGLTIEGQDLFELVKKGSLVSDDEQR-----KPVESKDGEAN 1066 Query: 1467 LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 L K++ PLFF IT+GLLAE+INSRLFTTVRDSLGLTYDVSFEL+LFDRLK GWYVISV Sbjct: 1067 LSGKIQQLPLFFAITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKFGWYVISV 1126 Query: 1287 
TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP+KVYKAVDACK+VLRGL++SKI QRELDRA+RTLLM+HEAE KSN YWLGL+AH Q Sbjct: 1127 TSTPSKVYKAVDACKDVLRGLHNSKITQRELDRARRTLLMRHEAEMKSNVYWLGLLAHLQ 1186 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAG-EVVSAY 931 ASS+PRKD+SCIKDL LYEAA+IED+Y+AY HLKV EDSL+SCIG+AGSQA E SA Sbjct: 1187 ASSIPRKDISCIKDLTSLYEAATIEDVYVAYNHLKVGEDSLYSCIGVAGSQARVEADSAS 1246 Query: 930 LEDELTTG-VQGVFPVGRGLSTMTRPTT 850 + E + G G+ P+GRGL+TMTRPTT Sbjct: 1247 VVSEESDGSAAGLIPIGRGLATMTRPTT 1274 >gb|EYU24512.1| hypothetical protein MIMGU_mgv1a000585mg [Mimulus guttatus] Length = 1057 Score = 788 bits (2036), Expect = 0.0 Identities = 416/568 (73%), Positives = 465/568 (81%), Gaps = 2/568 (0%) Frame = -2 Query: 2547 AAIRAGXXXXXXXXXXXXXPKQLISSSQLHDLRLQHRPTFIPLSEEVSTTKVYDKETGIT 2368 A+I AG PK+LISS QL +L LQ P+FIP+ +E TKVYD+ETGI Sbjct: 494 ASIEAGLKEPIEAEPELEIPKELISSEQLQELSLQQPPSFIPVDQEKKMTKVYDEETGII 553 Query: 2367 QRRLSNGIPVNYKITKNEAKSGVMRLIVXXXXXXXXXXXXGDVIVGVRTLSEGGRVGNFS 2188 QRRLSNGIPVNYKI+K+EA SGVMRLIV G VIVGVRTLSEGGRVGNF+ Sbjct: 554 QRRLSNGIPVNYKISKSEANSGVMRLIVGGGRAAESAESKGAVIVGVRTLSEGGRVGNFT 613 Query: 2187 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDGGMPAAFQLLHMVIEHSVWLEDAFDR 2008 REQVELFCVNHLINCSLESTEEFISMEFRFTLRD GM AAFQLLHMV+EHSVWLEDAFDR Sbjct: 614 REQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDR 673 Query: 2007 ARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPTPQMLQKLTLQDVKDAVMNQFV 1828 A+QLYLSYYRSIPKSLERSTAHKLMLAML+GDERFVEPTP LQ+LTL+ VK+AVMNQFV Sbjct: 674 AKQLYLSYYRSIPKSLERSTAHKLMLAMLDGDERFVEPTPNSLQQLTLEQVKEAVMNQFV 733 Query: 1827 GDNMEVSIVGDFTEGDIESCILDYLGTVTATKKPSPADVHRSNPIIFRPSPSDLQFQQVF 1648 DNMEVSIVGDF+E DIESCIL+YLGTV K A + +PI+FRP +DLQ QQVF Sbjct: 734 CDNMEVSIVGDFSEEDIESCILEYLGTVRERKGSERA--QKYSPILFRPYTADLQHQQVF 791 Query: 1647 LKDTDERACAYIAGPAPNRWGFTVEGQDLFETIRSNLSKDDEQSNSDELQSLELKDVKTD 1468 LKDTDERACAY+AGPAPNRWGFT EG++L E+ S S E +E Q EL++ Sbjct: 792 LKDTDERACAYVAGPAPNRWGFTFEGKNLLES-DSTASTFGEHVKFEE-QPQELENSDKV 849 Query: 1467 
LQRKLRGHPLFFGITLGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 1288 +Q KLR HPLFF IT+GLL E+INSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV Sbjct: 850 MQGKLRTHPLFFAITMGLLQEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVISV 909 Query: 1287 TSTPAKVYKAVDACKNVLRGLNSSKIAQRELDRAKRTLLMKHEAESKSNAYWLGLIAHAQ 1108 TSTP KV+KAVDACKNVL+GL SS+IA RELDRA+RTLLM+HEAE KSNAYWLGL+AH Q Sbjct: 910 TSTPGKVHKAVDACKNVLKGLLSSRIAPRELDRARRTLLMRHEAEIKSNAYWLGLMAHLQ 969 Query: 1107 ASSVPRKDLSCIKDLQLLYEAASIEDIYLAYEHLKVDEDSLFSCIGIAGSQAGEVV--SA 934 A+SVPRKD+SCIKDL LYEAA+IED+Y+AYE LKVD++SLFSCIG+AGSQAGEV S Sbjct: 970 ATSVPRKDISCIKDLISLYEAATIEDVYIAYEQLKVDDNSLFSCIGVAGSQAGEVATGSV 1029 Query: 933 YLEDELTTGVQGVFPVGRGLSTMTRPTT 850 LE+E G+Q + VGRG STMTRPTT Sbjct: 1030 VLEEESVEGLQNIIQVGRGSSTMTRPTT 1057