BLASTX nr result
ID: Akebia25_contig00022338
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00022338 (2408 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003634700.1| PREDICTED: uncharacterized protein LOC100241...  1073    0.0
ref|XP_007021073.1| Uncharacterized protein isoform 5, partial [...  1053    0.0
ref|XP_007021072.1| Uncharacterized protein isoform 4 [Theobroma...  1053    0.0
ref|XP_007021071.1| Uncharacterized protein isoform 3, partial [...  1053    0.0
ref|XP_007021070.1| Uncharacterized protein isoform 2 [Theobroma...  1053    0.0
ref|XP_007021069.1| Uncharacterized protein isoform 1 [Theobroma...  1053    0.0
ref|XP_007214561.1| hypothetical protein PRUPE_ppa000393mg [Prun...  1044    0.0
ref|XP_002282514.2| PREDICTED: uncharacterized protein LOC100241...  1030    0.0
ref|XP_002316974.2| hypothetical protein POPTR_0011s13620g [Popu...  1018    0.0
ref|XP_004294340.1| PREDICTED: uncharacterized protein LOC101295...  1018    0.0
ref|XP_002522835.1| conserved hypothetical protein [Ricinus comm...  1012    0.0
emb|CBI20510.3| unnamed protein product [Vitis vinifera]             1008    0.0
ref|XP_007149696.1| hypothetical protein PHAVU_005G091400g [Phas...   999    0.0
ref|XP_006475219.1| PREDICTED: uncharacterized protein LOC102606...   999    0.0
ref|XP_006592884.1| PREDICTED: uncharacterized protein LOC100811...   998    0.0
ref|XP_006592883.1| PREDICTED: uncharacterized protein LOC100811...   998    0.0
ref|XP_006370616.1| hypothetical protein POPTR_0001s44280g [Popu...   995    0.0
ref|XP_004152911.1| PREDICTED: uncharacterized protein LOC101210...   994    0.0
ref|XP_003543291.1| PREDICTED: uncharacterized protein LOC100803... 
994 0.0 ref|XP_006830454.1| hypothetical protein AMTR_s00115p00072410 [A... 986 0.0 >ref|XP_003634700.1| PREDICTED: uncharacterized protein LOC100241773 isoform 2 [Vitis vinifera] Length = 1215 Score = 1073 bits (2776), Expect = 0.0 Identities = 539/699 (77%), Positives = 607/699 (86%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESI+A ALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+H+S+GLPPALNVTTAK Sbjct: 1 MESIVALALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHSSLGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EP+ VQIDRLDLVLEEN D +ACRSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEILLPYVSNVQIEPVVVQIDRLDLVLEENSDVDACRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTLEV VNL++ET GG++ QGGATWASPLASITI NLLLYTTNENW VVNLKE Sbjct: 121 DKIADGMTLEVRTVNLLLETRGGARCQGGATWASPLASITIRNLLLYTTNENWHVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFSN+KK IYVFKKLEWE LSIDLLPHPDMF DA++ N+RD+DGAKR+FFGG Sbjct: 181 ARDFSNDKKFIYVFKKLEWEFLSIDLLPHPDMFMDANIAHPEEEVNRRDEDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEAYITVQRTE NSPLGLEVQLH+TEAVCPALSEPGLRALLRF+TGLYVCLN Sbjct: 241 ERFIEGISGEAYITVQRTELNSPLGLEVQLHITEAVCPALSEPGLRALLRFLTGLYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVDP+AQQR TE+AGRS VSI+VDHIFL IKDAEF+L+LLMQSLFFSRASVSDG+ TK Sbjct: 301 RGDVDPKAQQRTTESAGRSLVSIIVDHIFLCIKDAEFRLELLMQSLFFSRASVSDGEKTK 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+R+ +GGLFLRDTFSHPP TLVQPSMQAVT+D LH+P+FG+NFCP IYPL Q WQL+ Sbjct: 361 NLNRVMIGGLFLRDTFSHPPCTLVQPSMQAVTKDVLHIPEFGQNFCPAIYPLGEQQWQLH 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG+PLI LHSLQ+KPSPAPP F SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGAV Sbjct: 421 
EGIPLICLHSLQVKPSPAPPCFASQTVIDCQPLMIHLQEESCLRISSFLADGIVVNPGAV 480 Query: 1760 LPDFSVYSLVFSLKELELTVPL---EADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV SLVF+LKEL++T+P+ E++ + N+ QSSFAGA+LHI++LFFSES + Sbjct: 481 LPDFSVDSLVFTLKELDITIPMDTGESNISAGDSNSTHQSSFAGARLHIENLFFSESPKL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLRLLNL+KDPACFSLW QP+DASQKKW T S LILSLE CS L +RS+G W Sbjct: 541 KLRLLNLEKDPACFSLWAGQPIDASQKKWTTGASQLILSLETCSDLTGLQIPLERSSGSW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + CIE AMATADG PLI+IPPPGGVVR+GV+ QQ+LSNTSVEQLFFVLDLY Y Sbjct: 601 RCVELKDACIEVAMATADGRPLISIPPPGGVVRVGVAFQQYLSNTSVEQLFFVLDLYTYF 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K++ K + GSLMEK PSDTAVS Sbjct: 661 GRVSEKIAIVGKNNRPKTSENEALAGSLMEKVPSDTAVS 699 >ref|XP_007021073.1| Uncharacterized protein isoform 5, partial [Theobroma cacao] gi|508720701|gb|EOY12598.1| Uncharacterized protein isoform 5, partial [Theobroma cacao] Length = 1005 Score = 1053 bits (2722), Expect = 0.0 Identities = 534/699 (76%), Positives = 599/699 (85%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVTTAK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EPI VQIDRLDLVLEEN D+++ RSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTL+V VNL++ET GG++ +GGA WASP+ASIT+ N+LLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+NKK IYVFKKLEWESLSIDLLPHPDMF+DA+L S GA RDDDGAKR+FFGG Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240 Query: 
1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEAYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF+TG YVCLN Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQ EAAGRS VS+VVDHIFL IKD EFQL+LLMQSL FSRASVSDG+N Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDPEFQLELLMQSLLFSRASVSDGENAH 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM+AV++ LH+PDFGKNFCPPIYPL Q WQL Sbjct: 361 NLSKVMIGGLFLRDTFSRPPCTLVQPSMEAVSDSCLHIPDFGKNFCPPIYPLGEQQWQLT 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 GVPLI LHSLQ+KPSP PPSF SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGA+ Sbjct: 421 LGVPLICLHSLQVKPSPFPPSFASQTVIGCQPLMIHLQEESCLRISSFLADGIVVNPGAI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLEA---DNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPD SV SLVF++KEL+++VPL+ DN N+ Q SFAGA+LHI+ LFF ES S+ Sbjct: 481 LPDSSVNSLVFTIKELDISVPLDTSKLDNPGGGENHIIQKSFAGARLHIEKLFFYESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KL+LLNL+KDPACFSLWE QP+DASQKKW S L LSLE S+L S S+GLW Sbjct: 541 KLKLLNLEKDPACFSLWEGQPIDASQKKWTAGASQLSLSLETASSLLGLQSSLGCSSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AMA+ADG+PL +PPPGG+VRIGV+CQQF+SNTSVEQLFFVLDLYAYI Sbjct: 601 RCVELKDASIEVAMASADGNPLTVVPPPGGIVRIGVACQQFMSNTSVEQLFFVLDLYAYI 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K+ KR R ++GG LMEK PSDTAVS Sbjct: 661 GRVSEKIAVVGKNKRPKRNRDESLGGRLMEKVPSDTAVS 699 >ref|XP_007021072.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508720700|gb|EOY12597.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 1058 Score = 1053 bits (2722), Expect = 0.0 Identities = 534/699 (76%), Positives = 599/699 (85%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 
MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVTTAK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EPI VQIDRLDLVLEEN D+++ RSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTL+V VNL++ET GG++ +GGA WASP+ASIT+ N+LLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+NKK IYVFKKLEWESLSIDLLPHPDMF+DA+L S GA RDDDGAKR+FFGG Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEAYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF+TG YVCLN Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQ EAAGRS VS+VVDHIFL IKD EFQL+LLMQSL FSRASVSDG+N Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDPEFQLELLMQSLLFSRASVSDGENAH 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM+AV++ LH+PDFGKNFCPPIYPL Q WQL Sbjct: 361 NLSKVMIGGLFLRDTFSRPPCTLVQPSMEAVSDSCLHIPDFGKNFCPPIYPLGEQQWQLT 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 GVPLI LHSLQ+KPSP PPSF SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGA+ Sbjct: 421 LGVPLICLHSLQVKPSPFPPSFASQTVIGCQPLMIHLQEESCLRISSFLADGIVVNPGAI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLEA---DNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPD SV SLVF++KEL+++VPL+ DN N+ Q SFAGA+LHI+ LFF ES S+ Sbjct: 481 LPDSSVNSLVFTIKELDISVPLDTSKLDNPGGGENHIIQKSFAGARLHIEKLFFYESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KL+LLNL+KDPACFSLWE QP+DASQKKW S L LSLE S+L S S+GLW Sbjct: 541 
KLKLLNLEKDPACFSLWEGQPIDASQKKWTAGASQLSLSLETASSLLGLQSSLGCSSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AMA+ADG+PL +PPPGG+VRIGV+CQQF+SNTSVEQLFFVLDLYAYI Sbjct: 601 RCVELKDASIEVAMASADGNPLTVVPPPGGIVRIGVACQQFMSNTSVEQLFFVLDLYAYI 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K+ KR R ++GG LMEK PSDTAVS Sbjct: 661 GRVSEKIAVVGKNKRPKRNRDESLGGRLMEKVPSDTAVS 699 >ref|XP_007021071.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] gi|508720699|gb|EOY12596.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 1018 Score = 1053 bits (2722), Expect = 0.0 Identities = 534/699 (76%), Positives = 599/699 (85%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVTTAK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EPI VQIDRLDLVLEEN D+++ RSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTL+V VNL++ET GG++ +GGA WASP+ASIT+ N+LLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+NKK IYVFKKLEWESLSIDLLPHPDMF+DA+L S GA RDDDGAKR+FFGG Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEAYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF+TG YVCLN Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQ EAAGRS VS+VVDHIFL IKD EFQL+LLMQSL FSRASVSDG+N Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDPEFQLELLMQSLLFSRASVSDGENAH 
360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM+AV++ LH+PDFGKNFCPPIYPL Q WQL Sbjct: 361 NLSKVMIGGLFLRDTFSRPPCTLVQPSMEAVSDSCLHIPDFGKNFCPPIYPLGEQQWQLT 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 GVPLI LHSLQ+KPSP PPSF SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGA+ Sbjct: 421 LGVPLICLHSLQVKPSPFPPSFASQTVIGCQPLMIHLQEESCLRISSFLADGIVVNPGAI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLEA---DNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPD SV SLVF++KEL+++VPL+ DN N+ Q SFAGA+LHI+ LFF ES S+ Sbjct: 481 LPDSSVNSLVFTIKELDISVPLDTSKLDNPGGGENHIIQKSFAGARLHIEKLFFYESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KL+LLNL+KDPACFSLWE QP+DASQKKW S L LSLE S+L S S+GLW Sbjct: 541 KLKLLNLEKDPACFSLWEGQPIDASQKKWTAGASQLSLSLETASSLLGLQSSLGCSSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AMA+ADG+PL +PPPGG+VRIGV+CQQF+SNTSVEQLFFVLDLYAYI Sbjct: 601 RCVELKDASIEVAMASADGNPLTVVPPPGGIVRIGVACQQFMSNTSVEQLFFVLDLYAYI 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K+ KR R ++GG LMEK PSDTAVS Sbjct: 661 GRVSEKIAVVGKNKRPKRNRDESLGGRLMEKVPSDTAVS 699 >ref|XP_007021070.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508720698|gb|EOY12595.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1200 Score = 1053 bits (2722), Expect = 0.0 Identities = 534/699 (76%), Positives = 599/699 (85%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVTTAK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EPI VQIDRLDLVLEEN D+++ RSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 
DKIADGMTL+V VNL++ET GG++ +GGA WASP+ASIT+ N+LLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+NKK IYVFKKLEWESLSIDLLPHPDMF+DA+L S GA RDDDGAKR+FFGG Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEAYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF+TG YVCLN Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQ EAAGRS VS+VVDHIFL IKD EFQL+LLMQSL FSRASVSDG+N Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDPEFQLELLMQSLLFSRASVSDGENAH 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM+AV++ LH+PDFGKNFCPPIYPL Q WQL Sbjct: 361 NLSKVMIGGLFLRDTFSRPPCTLVQPSMEAVSDSCLHIPDFGKNFCPPIYPLGEQQWQLT 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 GVPLI LHSLQ+KPSP PPSF SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGA+ Sbjct: 421 LGVPLICLHSLQVKPSPFPPSFASQTVIGCQPLMIHLQEESCLRISSFLADGIVVNPGAI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLEA---DNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPD SV SLVF++KEL+++VPL+ DN N+ Q SFAGA+LHI+ LFF ES S+ Sbjct: 481 LPDSSVNSLVFTIKELDISVPLDTSKLDNPGGGENHIIQKSFAGARLHIEKLFFYESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KL+LLNL+KDPACFSLWE QP+DASQKKW S L LSLE S+L S S+GLW Sbjct: 541 KLKLLNLEKDPACFSLWEGQPIDASQKKWTAGASQLSLSLETASSLLGLQSSLGCSSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AMA+ADG+PL +PPPGG+VRIGV+CQQF+SNTSVEQLFFVLDLYAYI Sbjct: 601 RCVELKDASIEVAMASADGNPLTVVPPPGGIVRIGVACQQFMSNTSVEQLFFVLDLYAYI 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K+ KR R ++GG LMEK PSDTAVS Sbjct: 661 GRVSEKIAVVGKNKRPKRNRDESLGGRLMEKVPSDTAVS 699 >ref|XP_007021069.1| 
Uncharacterized protein isoform 1 [Theobroma cacao] gi|508720697|gb|EOY12594.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1211 Score = 1053 bits (2722), Expect = 0.0 Identities = 534/699 (76%), Positives = 599/699 (85%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVTTAK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EPI VQIDRLDLVLEEN D+++ RSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTL+V VNL++ET GG++ +GGA WASP+ASIT+ N+LLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+NKK IYVFKKLEWESLSIDLLPHPDMF+DA+L S GA RDDDGAKR+FFGG Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEAYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF+TG YVCLN Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQ EAAGRS VS+VVDHIFL IKD EFQL+LLMQSL FSRASVSDG+N Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDPEFQLELLMQSLLFSRASVSDGENAH 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM+AV++ LH+PDFGKNFCPPIYPL Q WQL Sbjct: 361 NLSKVMIGGLFLRDTFSRPPCTLVQPSMEAVSDSCLHIPDFGKNFCPPIYPLGEQQWQLT 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 GVPLI LHSLQ+KPSP PPSF SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGA+ Sbjct: 421 LGVPLICLHSLQVKPSPFPPSFASQTVIGCQPLMIHLQEESCLRISSFLADGIVVNPGAI 480 Query: 1760 
LPDFSVYSLVFSLKELELTVPLEA---DNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPD SV SLVF++KEL+++VPL+ DN N+ Q SFAGA+LHI+ LFF ES S+ Sbjct: 481 LPDSSVNSLVFTIKELDISVPLDTSKLDNPGGGENHIIQKSFAGARLHIEKLFFYESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KL+LLNL+KDPACFSLWE QP+DASQKKW S L LSLE S+L S S+GLW Sbjct: 541 KLKLLNLEKDPACFSLWEGQPIDASQKKWTAGASQLSLSLETASSLLGLQSSLGCSSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AMA+ADG+PL +PPPGG+VRIGV+CQQF+SNTSVEQLFFVLDLYAYI Sbjct: 601 RCVELKDASIEVAMASADGNPLTVVPPPGGIVRIGVACQQFMSNTSVEQLFFVLDLYAYI 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K+ KR R ++GG LMEK PSDTAVS Sbjct: 661 GRVSEKIAVVGKNKRPKRNRDESLGGRLMEKVPSDTAVS 699 >ref|XP_007214561.1| hypothetical protein PRUPE_ppa000393mg [Prunus persica] gi|462410426|gb|EMJ15760.1| hypothetical protein PRUPE_ppa000393mg [Prunus persica] Length = 1213 Score = 1044 bits (2700), Expect = 0.0 Identities = 525/697 (75%), Positives = 597/697 (85%), Gaps = 2/697 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILA ALEYTLKYWLKSFSRDQFKLQGRT + SNLDINGDAVH+S+GLPPALNV TAK Sbjct: 1 MESILALALEYTLKYWLKSFSRDQFKLQGRTAQLSNLDINGDAVHSSMGLPPALNVATAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPSVSNVQ+EPI VQIDRLDLVLEE D +A RS S +SSS KGSGYGFA Sbjct: 61 VGKLEIVLPSVSNVQIEPIVVQIDRLDLVLEEKSDLDA-RSPRSSPSSSSSAKGSGYGFA 119 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+E+ VNL++ET GG + QGGA+WASPLASITI NLLLYTTNENWQVVNLKE Sbjct: 120 DKIADGMTVEILTVNLLLETRGGGRCQGGASWASPLASITIRNLLLYTTNENWQVVNLKE 179 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FSN+KK IY+FKKLEWESLSIDLLPHPDMF DA++ + +G N+RDDDGAKR+FFGG Sbjct: 180 AREFSNDKKFIYLFKKLEWESLSIDLLPHPDMFMDANIARTEDGGNQRDDDGAKRVFFGG 239 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEAYITVQRTE 
NSPLGLEVQ+H+TEA+CPA+SEPGLRALLRFMTGLYVCLN Sbjct: 240 ERFIEGISGEAYITVQRTELNSPLGLEVQIHITEAICPAISEPGLRALLRFMTGLYVCLN 299 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD QQR TEAAGRS VSIVVDHIFL IKD EFQL+LLMQSLFFSRASVSDG+ Sbjct: 300 RGDVDSNTQQRSTEAAGRSIVSIVVDHIFLCIKDTEFQLELLMQSLFFSRASVSDGEIDN 359 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLSR+ +GGLFLRDT+S PP TLVQPSM+AV+E+ LHVPDFGKNF PPIYPL +Q WQLN Sbjct: 360 NLSRVMIGGLFLRDTYSRPPCTLVQPSMRAVSEEPLHVPDFGKNFSPPIYPLGDQEWQLN 419 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 +GVP + LHSLQIKPSP PPSF SQTVI CQPLMI+LQE SCLRI SFL+DGIVVNPGAV Sbjct: 420 KGVPFLCLHSLQIKPSPVPPSFASQTVINCQPLMIDLQEGSCLRICSFLADGIVVNPGAV 479 Query: 1760 LPDFSVYSLVFSLKELELTVPLEADNFPANGNNAF-QSSFAGAKLHIKDLFFSESASVKL 1936 L DFSV SL+F+LKEL++ VPL+ D+ PAN + QS+F+GA+LHI++LFFSES S+KL Sbjct: 480 LADFSVNSLIFNLKELDVAVPLDIDSNPANKRGSINQSAFSGARLHIENLFFSESPSLKL 539 Query: 1937 RLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLWEC 2116 RLLNL+KDPACF LWE QPVDASQKKW T SHL LSLE C+ S D+++GLW C Sbjct: 540 RLLNLEKDPACFCLWEGQPVDASQKKWTTGASHLSLSLETCTKSAGHQSSLDQNSGLWRC 599 Query: 2117 VELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYIGR 2296 VEL + C+E M TADGSPL +PPPGG+VR+GV+CQ +LSNTSVEQLFFVLDLYAY GR Sbjct: 600 VELKDACVEVVMVTADGSPLTNVPPPGGIVRVGVACQNYLSNTSVEQLFFVLDLYAYFGR 659 Query: 2297 VSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 VSEKI + K+ GQK+ R + G+L++K P+DTAVS Sbjct: 660 VSEKIVLVGKNTGQKKNRDHSSDGNLIDKVPNDTAVS 696 >ref|XP_002282514.2| PREDICTED: uncharacterized protein LOC100241773 isoform 1 [Vitis vinifera] Length = 1136 Score = 1030 bits (2664), Expect = 0.0 Identities = 524/699 (74%), Positives = 590/699 (84%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESI+A ALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+H+S+GLPPALNVTTAK Sbjct: 1 MESIVALALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHSSLGLPPALNVTTAK 60 Query: 503 
VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EP+ VQIDRLDLVLEEN D +ACRSS+S Q ++SSGKGSGYGFA Sbjct: 61 VGKLEILLPYVSNVQIEPVVVQIDRLDLVLEENSDVDACRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTLEV VNL++ET GG++ QGGATWASPLASITI NLLLYTTNENW VVNLKE Sbjct: 121 DKIADGMTLEVRTVNLLLETRGGARCQGGATWASPLASITIRNLLLYTTNENWHVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFSN+KK IYVFKKLEWE LSIDLLPHPDMF DA++ N+RD+DGAKR Sbjct: 181 ARDFSNDKKFIYVFKKLEWEFLSIDLLPHPDMFMDANIAHPEEEVNRRDEDGAKR----- 235 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ITVQRTE NSPLGLEVQLH+TEAVCPALSEPGLRALLRF+TGLYVCLN Sbjct: 236 ------------ITVQRTELNSPLGLEVQLHITEAVCPALSEPGLRALLRFLTGLYVCLN 283 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVDP+AQQR TE+AGRS VSI+VDHIFL IKDAEF+L+LLMQSLFFSRASVSDG+ TK Sbjct: 284 RGDVDPKAQQRTTESAGRSLVSIIVDHIFLCIKDAEFRLELLMQSLFFSRASVSDGEKTK 343 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+R+ +GGLFLRDTFSHPP TLVQPSMQAVT+D LH+P+FG+NFCP IYPL Q WQL+ Sbjct: 344 NLNRVMIGGLFLRDTFSHPPCTLVQPSMQAVTKDVLHIPEFGQNFCPAIYPLGEQQWQLH 403 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG+PLI LHSLQ+KPSPAPP F SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGAV Sbjct: 404 EGIPLICLHSLQVKPSPAPPCFASQTVIDCQPLMIHLQEESCLRISSFLADGIVVNPGAV 463 Query: 1760 LPDFSVYSLVFSLKELELTVPL---EADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV SLVF+LKEL++T+P+ E++ + N+ QSSFAGA+LHI++LFFSES + Sbjct: 464 LPDFSVDSLVFTLKELDITIPMDTGESNISAGDSNSTHQSSFAGARLHIENLFFSESPKL 523 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLRLLNL+KDPACFSLW QP+DASQKKW T S LILSLE CS L +RS+G W Sbjct: 524 KLRLLNLEKDPACFSLWAGQPIDASQKKWTTGASQLILSLETCSDLTGLQIPLERSSGSW 583 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + CIE AMATADG PLI+IPPPGGVVR+GV+ 
QQ+LSNTSVEQLFFVLDLY Y Sbjct: 584 RCVELKDACIEVAMATADGRPLISIPPPGGVVRVGVAFQQYLSNTSVEQLFFVLDLYTYF 643 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + K++ K + GSLMEK PSDTAVS Sbjct: 644 GRVSEKIAIVGKNNRPKTSENEALAGSLMEKVPSDTAVS 682 >ref|XP_002316974.2| hypothetical protein POPTR_0011s13620g [Populus trichocarpa] gi|550328324|gb|EEE97586.2| hypothetical protein POPTR_0011s13620g [Populus trichocarpa] Length = 1212 Score = 1018 bits (2633), Expect = 0.0 Identities = 511/700 (73%), Positives = 584/700 (83%), Gaps = 5/700 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKL GRTV+ SNL++NGDA+HAS+GLPPALNVT AK Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLHGRTVQLSNLELNGDALHASMGLPPALNVTKAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGK EI LP VSNVQVEPI +QID+LDLVLEEN +S+A NS +SSS KGSGYGFA Sbjct: 61 VGKFEIILPYVSNVQVEPIVIQIDKLDLVLEENSESDASSGPNSAHSSSSSSKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT++V VNL++ET GG+Q GGATWASPLASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQVSTVNLLLETRGGAQHGGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFSNNKK IYVFKKLEWESLSIDLLPHPDMF DA L + GA++RDDDGAKR+FFGG Sbjct: 181 ARDFSNNKKFIYVFKKLEWESLSIDLLPHPDMFADASLACAQEGASRRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEP-GLRALLRFMTGLYVCL 1219 ERFLEGISGEAYIT+QRTEQNSPLGLEVQLH+ EA+CPALSEP GLRALLRFMTGLYVCL Sbjct: 241 ERFLEGISGEAYITMQRTEQNSPLGLEVQLHIPEAICPALSEPAGLRALLRFMTGLYVCL 300 Query: 1220 NR-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNT 1396 NR DVD Q+QQR TEAAGRS VSIVVDHIFL IKDAEFQL+LLMQSL FSRA+VSDG+ Sbjct: 301 NRGDVDLQSQQRSTEAAGRSLVSIVVDHIFLCIKDAEFQLELLMQSLLFSRATVSDGKIA 360 Query: 1397 KNLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQL 1576 NL+++ +GG+FLRDTFS PP TLVQPSMQA+TE+ +PDF KNFCPPIYPL + WQ Sbjct: 361 
SNLTKVMLGGMFLRDTFSRPPCTLVQPSMQAITENDGQIPDFAKNFCPPIYPLGDHQWQT 420 Query: 1577 NEGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGA 1756 N G+PLI LHSLQ+KPSP PP F SQTVI CQPLMI+LQEESCLRI+SFL+DGI VNPG Sbjct: 421 NVGIPLICLHSLQLKPSPVPPCFASQTVIACQPLMIHLQEESCLRITSFLADGIAVNPGD 480 Query: 1757 VLPDFSVYSLVFSLKELELTVPL---EADNFPANGNNAFQSSFAGAKLHIKDLFFSESAS 1927 +LPDFSV S+VF LKEL++ VPL ++ N NGN ++FAGA+LHI++LFFSES Sbjct: 481 ILPDFSVNSVVFVLKELDVIVPLDVSQSHNPADNGNYTVHNAFAGARLHIENLFFSESPK 540 Query: 1928 VKLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGL 2107 +KLRLLNL+KDPACF LW+ QP+DASQKKW T SHL LSLE S+L ++ ++G+ Sbjct: 541 LKLRLLNLEKDPACFCLWDGQPIDASQKKWTTGASHLTLSLETSSSLNGTLNLNGMNSGI 600 Query: 2108 WECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAY 2287 W CVEL + +E AM +ADG PL +PPPGG VR+GV+CQQ+ SNTSVEQLFFVLDLYAY Sbjct: 601 WRCVELQDASVEVAMISADGGPLTNVPPPGGTVRVGVACQQYFSNTSVEQLFFVLDLYAY 660 Query: 2288 IGRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 +GRVSE I + K+ QK R + G LM+K P DTAVS Sbjct: 661 LGRVSETIASVGKNRRQKINRNESSGVRLMDKVPCDTAVS 700 >ref|XP_004294340.1| PREDICTED: uncharacterized protein LOC101295784 [Fragaria vesca subsp. 
vesca] Length = 1206 Score = 1018 bits (2631), Expect = 0.0 Identities = 509/699 (72%), Positives = 594/699 (84%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLD++GDA+H+S+GLPPAL+VTTA+ Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDMDGDALHSSMGLPPALHVTTAR 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKL I LPSVSNVQVEPI VQID+LDLVLEEN + +A S +S +++SGKGSGYGFA Sbjct: 61 VGKLVIVLPSVSNVQVEPIVVQIDKLDLVLEENAELDASSSPSSSPSSATSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+E+ VN+++ET GG RQGGA WASPLASITI NLLLY+TNENW+VVNLKE Sbjct: 121 DKIADGMTIEIRTVNILLETRGGG-RQGGAAWASPLASITIRNLLLYSTNENWEVVNLKE 179 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FS NK+ IYVFKKLEW+SLSIDLLPHPDMFTDA++ + G N+RDDDGAKR FFGG Sbjct: 180 AREFSTNKRFIYVFKKLEWQSLSIDLLPHPDMFTDANIACTQMGGNQRDDDGAKRAFFGG 239 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEAYITVQRTE NSPLGLEVQLH+TEA+CPA+SEPGLRALLRFMTGLYVCL+ Sbjct: 240 ERFIEGISGEAYITVQRTELNSPLGLEVQLHITEAICPAISEPGLRALLRFMTGLYVCLS 299 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R D+D QQR T+AAGRS VSIVVDHIFL IKD EF+L+LLMQSLFFSRASVSDG Sbjct: 300 RGDIDSNTQQRSTQAAGRSIVSIVVDHIFLCIKDTEFKLELLMQSLFFSRASVSDGGIDN 359 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NLS++ +GGLFLRDTFS PP TLVQPSM A++E+ +HVPDFGK+FCPPIYPL Q WQL Sbjct: 360 NLSKVMIGGLFLRDTFSRPPCTLVQPSMHAISEEPVHVPDFGKDFCPPIYPLGAQQWQLI 419 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EGVPL+ LHSL KPSP PP+F +QTVI CQPLMI+LQE SCLRISSFL+DGI+ +PGAV Sbjct: 420 EGVPLLCLHSLLTKPSPEPPAFATQTVINCQPLMIHLQEGSCLRISSFLADGILASPGAV 479 Query: 1760 LPDFSVYSLVFSLKELELTVPLEADNFPANGNNAF---QSSFAGAKLHIKDLFFSESASV 1930 LPDFSV SL+F LKEL++TVPL+ DN + GNN QSSF+GA+LHI++LFFSES S+ Sbjct: 480 
LPDFSVNSLIFILKELDVTVPLDVDNLRSRGNNRSSINQSSFSGARLHIENLFFSESPSL 539 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLRLLNLDKDPACF LW+ QPVDASQKKW T SH+ LSLE C+ S D ++GLW Sbjct: 540 KLRLLNLDKDPACFCLWKGQPVDASQKKWTTRSSHISLSLETCTASAGLQSSLDGTSGLW 599 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 C+EL + CIE AM TADGSPL +PPPGG+VRIGV+C+++LSNTSVEQL+FVLDLYAY Sbjct: 600 RCIELKDACIEVAMVTADGSPLTNVPPPGGIVRIGVACEKYLSNTSVEQLYFVLDLYAYF 659 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI + KS + +I+ + G L++K P+DTAVS Sbjct: 660 GRVSEKIVLVGKST-RPKIKDDSFKGRLIDKVPNDTAVS 697 >ref|XP_002522835.1| conserved hypothetical protein [Ricinus communis] gi|223537919|gb|EEF39533.1| conserved hypothetical protein [Ricinus communis] Length = 1210 Score = 1012 bits (2617), Expect = 0.0 Identities = 512/699 (73%), Positives = 586/699 (83%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 ME+ILARALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+HAS+GLPPALNVT AK Sbjct: 1 MEAILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTKAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGK EI LP VSNVQVEPI VQID+LDLVLEEN D +AC S++S Q ++ S K SGYGFA Sbjct: 61 VGKFEIILPYVSNVQVEPIVVQIDKLDLVLEENNDLDACSSTHSTQSSTGSTKASGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT++V VNL++ET GG++R+GGA WASPLA+ITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQVSTVNLLLETRGGARREGGAAWASPLAAITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFSNNK IYVFKKLEWESLSIDLLPHPDMF DA L S G+ +RDDDGAKR+FFGG Sbjct: 181 ARDFSNNKGFIYVFKKLEWESLSIDLLPHPDMFADASLARSQEGSTQRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFLEGISGEA+IT+QRTEQN+PLGLEVQLH+TEAVCPALSEPGLRALLRF+TGLYVCLN Sbjct: 241 ERFLEGISGEAHITMQRTEQNNPLGLEVQLHITEAVCPALSEPGLRALLRFLTGLYVCLN 300 Query: 1223 
R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +AQQR TEAAGRS VS++VDHIF IKDA+FQL+LLMQSL FSRA+VSDG+ Sbjct: 301 RGDVDLKAQQRSTEAAGRSLVSLLVDHIFFCIKDADFQLELLMQSLLFSRATVSDGEIVN 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+ + VGGLFLRDTFS PP TLVQPS++ VTE+ L +P F KNFCPPI+PL +Q +QL+ Sbjct: 361 NLTTVMVGGLFLRDTFSRPPCTLVQPSIENVTENCLEIPAFAKNFCPPIHPLGDQQFQLS 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 G+PLI LHSLQ+KPSP PPSF S+TVI CQPLMI+LQEESCLRISSFL+DGIVVNPG V Sbjct: 421 AGIPLICLHSLQVKPSPLPPSFASETVIACQPLMIHLQEESCLRISSFLADGIVVNPGDV 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV SL+F LKEL++TVPL+ +DN N NN QSSF GA+LHI++LFFSES S+ Sbjct: 481 LPDFSVNSLMFILKELDVTVPLDMSNSDNQAYNKNNTVQSSFTGARLHIENLFFSESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLRLL L+KDPACF +WE QPVDASQKKW T SHL LSLE + +S ++GLW Sbjct: 541 KLRLLKLEKDPACFCMWEGQPVDASQKKWTTGASHLSLSLETSISSAGQLSSHGLTSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL + IE AM TADG PL +PPPGGVVR+GV+CQQ+LSNTSV+QLFFVLDLYAY Sbjct: 601 RCVELKDASIEVAMVTADGGPLTIVPPPGGVVRVGVACQQYLSNTSVDQLFFVLDLYAYF 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRV EKI + K+ + + G LM+K P DTAVS Sbjct: 661 GRVGEKIASVGKNKRTESRNESSDDGRLMDKVPCDTAVS 699 >emb|CBI20510.3| unnamed protein product [Vitis vinifera] Length = 1146 Score = 1008 bits (2607), Expect = 0.0 Identities = 514/696 (73%), Positives = 572/696 (82%), Gaps = 1/696 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESI+A ALEYTLKYWLKSFSRDQFKLQGRTV+ SNLDINGDA+H+S+GLPPALNVTTAK Sbjct: 1 MESIVALALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHSSLGLPPALNVTTAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LP VSNVQ+EP+ VQIDRLDLVLEEN D +ACRSS+S Q ++SSGKGSGYGFA Sbjct: 61 
VGKLEILLPYVSNVQIEPVVVQIDRLDLVLEENSDVDACRSSSSTQSSTSSGKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMTLEV VNL++ET GG++ QGGATWASPLASITI NLLLYTTNENW VVNLKE Sbjct: 121 DKIADGMTLEVRTVNLLLETRGGARCQGGATWASPLASITIRNLLLYTTNENWHVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFSN+KK IYVFKKLEWE LSIDLLPHPDMF DA++ N+RD+DGAKR+FFGG Sbjct: 181 ARDFSNDKKFIYVFKKLEWEFLSIDLLPHPDMFMDANIAHPEEEVNRRDEDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEAYITVQRTE NSPLGLEVQLH+TEAVCPALSEPGLRALLRF+TGLYVCLN Sbjct: 241 ERFIEGISGEAYITVQRTELNSPLGLEVQLHITEAVCPALSEPGLRALLRFLTGLYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVDP+AQQR TE+AGRS VSI+VDHIFL IKDAEF+L+LLMQSLFFSRASVSDG+ TK Sbjct: 301 RGDVDPKAQQRTTESAGRSLVSIIVDHIFLCIKDAEFRLELLMQSLFFSRASVSDGEKTK 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+R+ +GGLFLRDTFSHPP TLVQPSMQAVT+D LH+P+FG+NFCP IYPL Q WQL+ Sbjct: 361 NLNRVMIGGLFLRDTFSHPPCTLVQPSMQAVTKDVLHIPEFGQNFCPAIYPLGEQQWQLH 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG+PLI LHSLQ+KPSPAPP F SQTVI CQPLMI+LQEESCLRISSFL+DGIVVNPGAV Sbjct: 421 EGIPLICLHSLQVKPSPAPPCFASQTVIDCQPLMIHLQEESCLRISSFLADGIVVNPGAV 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLEADNFPANGNNAFQSSFAGAKLHIKDLFFSESASVKLR 1939 LHI++LFFSES +KLR Sbjct: 481 -------------------------------------------LHIENLFFSESPKLKLR 497 Query: 1940 LLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLWECV 2119 LLNL+KDPACFSLW QP+DASQKKW T S LILSLE CS L +RS+G W CV Sbjct: 498 LLNLEKDPACFSLWAGQPIDASQKKWTTGASQLILSLETCSDLTGLQIPLERSSGSWRCV 557 Query: 2120 ELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYIGRV 2299 EL + CIE AMATADG PLI+IPPPGGVVR+GV+ QQ+LSNTSVEQLFFVLDLY Y GRV Sbjct: 558 ELKDACIEVAMATADGRPLISIPPPGGVVRVGVAFQQYLSNTSVEQLFFVLDLYTYFGRV 617 Query: 2300 SEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 SEKI + 
K++ K + GSLMEK PSDTAVS Sbjct: 618 SEKIAIVGKNNRPKTSENEALAGSLMEKVPSDTAVS 653 >ref|XP_007149696.1| hypothetical protein PHAVU_005G091400g [Phaseolus vulgaris] gi|561022960|gb|ESW21690.1| hypothetical protein PHAVU_005G091400g [Phaseolus vulgaris] Length = 1212 Score = 999 bits (2584), Expect = 0.0 Identities = 502/699 (71%), Positives = 579/699 (82%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESIL RALEYTLKYWLKSFSR+QFKLQGRTV SNLDI+GDA+H+S+GLPPALNV +AK Sbjct: 1 MESILGRALEYTLKYWLKSFSREQFKLQGRTVHLSNLDIDGDALHSSIGLPPALNVASAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPSVSNVQ EPI VQIDRLDLVLEEN D +A SSN +++S KGSGYGFA Sbjct: 61 VGKLEITLPSVSNVQTEPIVVQIDRLDLVLEENSDFDASLSSNCSTPSAASAKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+++ VNL++ET GGS+RQGGATWA P+ASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQIQTVNLLLETCGGSRRQGGATWAPPMASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FS+NKK IYVFKKLEW+SLSIDLLPHPDMFT+A L S G+N RDDDGAKR+FFGG Sbjct: 181 AREFSSNKKYIYVFKKLEWQSLSIDLLPHPDMFTEATLDHSEEGSNFRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEAYIT+QRTE NSPLGLEVQLH+ EAVCPALSEPGLRALLRFMTG+YVCLN Sbjct: 241 ERFIEGISGEAYITIQRTELNSPLGLEVQLHINEAVCPALSEPGLRALLRFMTGVYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD +R TEAAGRS VSIVVDHIFL IKD EFQL+LLMQSLFFSRAS+S+G N Sbjct: 301 RGDVD---SKRSTEAAGRSLVSIVVDHIFLCIKDTEFQLELLMQSLFFSRASLSEGDNDN 357 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+RIT+GGLFLRDTF PP LVQPSMQA T D+ VP+F ++FCPPIYPL+ Q WQL Sbjct: 358 NLTRITIGGLFLRDTFCSPPCILVQPSMQAGTRDAFRVPEFARSFCPPIYPLQEQQWQLI 417 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG PLI LH+L+I PSP PPSF S+TVI CQPL+I+LQEESCLRISSFL+DGIVVNPG + 
Sbjct: 418 EGTPLICLHALKIMPSPLPPSFASETVIDCQPLVIHLQEESCLRISSFLADGIVVNPGDI 477 Query: 1760 LPDFSVYSLVFSLKELELTVPLEADNFPANGN---NAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV S +F+LK L+LTVP + ++ N NA Q+SF+GA+LHI+ LFF S S+ Sbjct: 478 LPDFSVKSFIFNLKGLDLTVPFDKTKLDSSKNDMDNAVQTSFSGARLHIESLFFLNSPSL 537 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLR+LNL+KDPACFSLWE QP+DASQ+KW S L L LE C + ++AGLW Sbjct: 538 KLRMLNLEKDPACFSLWEGQPIDASQEKWTARASQLTLFLEASIDGPGCQNSLGQTAGLW 597 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CV+L + CIE AMATADGSPL+ +PPPGG+VR+GV+C+Q+LSNTS+EQLFFVLDLY Y Sbjct: 598 RCVDLKDACIEVAMATADGSPLLQVPPPGGIVRVGVACEQYLSNTSIEQLFFVLDLYGYF 657 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 G VSEKI K + IR ++ GG LM+K PSD AVS Sbjct: 658 GSVSEKIAMAGKRKQLEDIRDKSFGGKLMDKVPSDAAVS 696 >ref|XP_006475219.1| PREDICTED: uncharacterized protein LOC102606947 [Citrus sinensis] Length = 1206 Score = 999 bits (2583), Expect = 0.0 Identities = 504/700 (72%), Positives = 582/700 (83%), Gaps = 5/700 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESI+ARALEYT KYWLKSFSRDQFKLQGRT + SNLDINGDA+HAS+GLPPAL+VTTAK Sbjct: 1 MESIIARALEYTFKYWLKSFSRDQFKLQGRTAQLSNLDINGDALHASMGLPPALHVTTAK 60 Query: 503 VGKLEIKLPS-VSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGF 679 +GKLEI LPS VSNVQ+EPI +Q+DRLDLVLEEN D +AC ++S + S KGSGYGF Sbjct: 61 LGKLEIILPSSVSNVQIEPIVLQVDRLDLVLEENPDKDACNYASSTPTPTGSSKGSGYGF 120 Query: 680 ADKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLK 859 ADKIADGMTL+V VNL++ T GG+QR GGA+W P+ASITI NL+L TTNENWQVVNLK Sbjct: 121 ADKIADGMTLQVNTVNLLLVTRGGAQRDGGASWTPPMASITIRNLVLCTTNENWQVVNLK 180 Query: 860 EARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFG 1039 EARDFS NKK IYVFKKLEWE+LS+DLLPHPDMF D + SN GA+ RD+DGAKR FFG Sbjct: 181 EARDFSLNKKFIYVFKKLEWETLSVDLLPHPDMFADGSIARSNEGASHRDEDGAKRAFFG 240 Query: 1040 GERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCL 1219 GERF+EGIS 
+AYITVQRTE NSPLGLEVQLHVTEAVCPALSEPGLRALLRF++GLYVCL Sbjct: 241 GERFIEGISAQAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLSGLYVCL 300 Query: 1220 NR-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNT 1396 NR DVD QQ TEAAGRS VSIVVDHIFL IKDAEFQL+LLMQSLFFSRA+VSDG+ Sbjct: 301 NRDDVDLTTQQLSTEAAGRSLVSIVVDHIFLCIKDAEFQLELLMQSLFFSRATVSDGETA 360 Query: 1397 KNLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQL 1576 NL++ITV GLFLRDTFS PPSTLVQPSMQAV+ED + +PDF K+FCP I PL +Q WQ+ Sbjct: 361 SNLTKITVAGLFLRDTFSRPPSTLVQPSMQAVSEDLVLIPDFAKDFCPVICPLGDQQWQI 420 Query: 1577 NEGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGA 1756 N+GVPLI LH+LQ+KPSPAPPSF S+TVI CQPLMI+LQEESCLRISSFL+DGI+VN GA Sbjct: 421 NKGVPLICLHTLQVKPSPAPPSFASRTVISCQPLMIHLQEESCLRISSFLADGILVNHGA 480 Query: 1757 VLPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESAS 1927 VLPD SV SL F L++L++TVPL+ DN N SSFAGA+LHIK LFFSES S Sbjct: 481 VLPDSSVNSLAFYLEDLDITVPLDMNKLDNHARQRNLTAHSSFAGARLHIKKLFFSESPS 540 Query: 1928 VKLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGL 2107 +KLRLL+L+KDPACF LWEDQP+DASQ+KW SHL LSLE C+++ + ++GL Sbjct: 541 LKLRLLHLEKDPACFCLWEDQPIDASQRKWTAGASHLSLSLETCTSI---TGSQNSNSGL 597 Query: 2108 WECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAY 2287 W+CVEL + CIE AM +ADG PL +PPPGGVVRIGV+CQQ+LSNTSVEQLFFVLD+Y Y Sbjct: 598 WKCVELKDACIEVAMVSADGKPLTVVPPPGGVVRIGVACQQYLSNTSVEQLFFVLDIYTY 657 Query: 2288 IGRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI ++ K+ + ++G LME P+DTAVS Sbjct: 658 FGRVSEKIVRVGKNKSAMKSGNESLGVKLMENAPNDTAVS 697 >ref|XP_006592884.1| PREDICTED: uncharacterized protein LOC100811661 isoform X2 [Glycine max] Length = 1012 Score = 998 bits (2581), Expect = 0.0 Identities = 498/699 (71%), Positives = 578/699 (82%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESIL RALEYTLKYWLKSFSR+QFKLQGRTV SNLDI+GDA+H+SVGLPPALNV TAK Sbjct: 1 MESILGRALEYTLKYWLKSFSREQFKLQGRTVHLSNLDIDGDALHSSVGLPPALNVATAK 60 Query: 503 
VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPSVSNVQ EPI V IDRLDLVLEEN DS+ SSN +++S KGSGYGFA Sbjct: 61 VGKLEITLPSVSNVQTEPIVVHIDRLDLVLEENSDSDESLSSNCSTPSAASAKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+++ VNL++ET GGS+RQ GATWA P+ASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQIQTVNLLLETRGGSRRQAGATWAPPMASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FS++KK IYVFKKLEW+SLSIDLLPHPDMFT+A S +N RDDDGAKR+FFGG Sbjct: 181 AREFSSHKKYIYVFKKLEWQSLSIDLLPHPDMFTEAAFGHSQGESNFRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EG+SGEAYIT+QRTE NSPLGLEVQLH+ EAVCPA+SEPGLRALLRFMTG+YVCLN Sbjct: 241 ERFIEGVSGEAYITIQRTELNSPLGLEVQLHINEAVCPAVSEPGLRALLRFMTGVYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R D+D + QR TEAAGRS VSIVVDHIFL IKD EFQL+LLMQSL FSRAS+S+G N Sbjct: 301 RGDLDSKIHQRSTEAAGRSLVSIVVDHIFLCIKDTEFQLELLMQSLCFSRASLSEGDNDN 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+RIT+GGLFLRDTF PP LVQPSMQ VT D+ HVP+F ++FCPPIYPL+ Q WQL Sbjct: 361 NLTRITIGGLFLRDTFCSPPCILVQPSMQVVTRDAFHVPEFARSFCPPIYPLQEQEWQLI 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG PLI LH+L+I PSP PPSF S+TVI CQPL+I+LQEESCLRISS L+DGIVVNPG + Sbjct: 421 EGTPLICLHALKIMPSPLPPSFASETVIDCQPLVIHLQEESCLRISSLLADGIVVNPGDI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV S +F+LK L+LTVP + D ++ +N Q+SFAGA+LHI+ L F S S+ Sbjct: 481 LPDFSVKSFIFNLKGLDLTVPFDKTKLDISKSDMDNTVQTSFAGARLHIESLCFLNSPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLR+LNL+KDPACFSLWE QP+DASQ+KW S L LSLE C+ C + +++GLW Sbjct: 541 KLRILNLEKDPACFSLWEGQPIDASQEKWTARASQLTLSLEACTDRTGCQNSLKQTSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CV+L + CIE AMATADGSPL+ 
+PPPGG+VR+GV+C+Q+LSNTSVEQLFFVLDLY Y Sbjct: 601 RCVDLKDACIEVAMATADGSPLLQVPPPGGIVRVGVACEQYLSNTSVEQLFFVLDLYGYF 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI K K + IR ++ G LM+K PSD AVS Sbjct: 661 GRVSEKIAKAVKRKQLEDIRDKSFSGKLMDKVPSDAAVS 699 >ref|XP_006592883.1| PREDICTED: uncharacterized protein LOC100811661 isoform X1 [Glycine max] Length = 1216 Score = 998 bits (2581), Expect = 0.0 Identities = 498/699 (71%), Positives = 578/699 (82%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESIL RALEYTLKYWLKSFSR+QFKLQGRTV SNLDI+GDA+H+SVGLPPALNV TAK Sbjct: 1 MESILGRALEYTLKYWLKSFSREQFKLQGRTVHLSNLDIDGDALHSSVGLPPALNVATAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPSVSNVQ EPI V IDRLDLVLEEN DS+ SSN +++S KGSGYGFA Sbjct: 61 VGKLEITLPSVSNVQTEPIVVHIDRLDLVLEENSDSDESLSSNCSTPSAASAKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+++ VNL++ET GGS+RQ GATWA P+ASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQIQTVNLLLETRGGSRRQAGATWAPPMASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FS++KK IYVFKKLEW+SLSIDLLPHPDMFT+A S +N RDDDGAKR+FFGG Sbjct: 181 AREFSSHKKYIYVFKKLEWQSLSIDLLPHPDMFTEAAFGHSQGESNFRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EG+SGEAYIT+QRTE NSPLGLEVQLH+ EAVCPA+SEPGLRALLRFMTG+YVCLN Sbjct: 241 ERFIEGVSGEAYITIQRTELNSPLGLEVQLHINEAVCPAVSEPGLRALLRFMTGVYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R D+D + QR TEAAGRS VSIVVDHIFL IKD EFQL+LLMQSL FSRAS+S+G N Sbjct: 301 RGDLDSKIHQRSTEAAGRSLVSIVVDHIFLCIKDTEFQLELLMQSLCFSRASLSEGDNDN 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+RIT+GGLFLRDTF PP LVQPSMQ VT D+ HVP+F ++FCPPIYPL+ Q WQL Sbjct: 361 NLTRITIGGLFLRDTFCSPPCILVQPSMQVVTRDAFHVPEFARSFCPPIYPLQEQEWQLI 420 Query: 1580 
EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG PLI LH+L+I PSP PPSF S+TVI CQPL+I+LQEESCLRISS L+DGIVVNPG + Sbjct: 421 EGTPLICLHALKIMPSPLPPSFASETVIDCQPLVIHLQEESCLRISSLLADGIVVNPGDI 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV S +F+LK L+LTVP + D ++ +N Q+SFAGA+LHI+ L F S S+ Sbjct: 481 LPDFSVKSFIFNLKGLDLTVPFDKTKLDISKSDMDNTVQTSFAGARLHIESLCFLNSPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLR+LNL+KDPACFSLWE QP+DASQ+KW S L LSLE C+ C + +++GLW Sbjct: 541 KLRILNLEKDPACFSLWEGQPIDASQEKWTARASQLTLSLEACTDRTGCQNSLKQTSGLW 600 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CV+L + CIE AMATADGSPL+ +PPPGG+VR+GV+C+Q+LSNTSVEQLFFVLDLY Y Sbjct: 601 RCVDLKDACIEVAMATADGSPLLQVPPPGGIVRVGVACEQYLSNTSVEQLFFVLDLYGYF 660 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI K K + IR ++ G LM+K PSD AVS Sbjct: 661 GRVSEKIAKAVKRKQLEDIRDKSFSGKLMDKVPSDAAVS 699 >ref|XP_006370616.1| hypothetical protein POPTR_0001s44280g [Populus trichocarpa] gi|550349822|gb|ERP67185.1| hypothetical protein POPTR_0001s44280g [Populus trichocarpa] Length = 1212 Score = 995 bits (2572), Expect = 0.0 Identities = 505/700 (72%), Positives = 574/700 (82%), Gaps = 5/700 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 ME+ILA ALEYTLKYWLKSFSRDQFKLQGRTV+ SNL+INGDA+HAS+GLPPALNVT AK Sbjct: 1 MEAILACALEYTLKYWLKSFSRDQFKLQGRTVQLSNLEINGDALHASMGLPPALNVTKAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGK EI LP VS VQVEPI +QID+LDLVLEEN D + S NS QL+ S K SGYGFA Sbjct: 61 VGKFEIILPYVSYVQVEPIVIQIDKLDLVLEENSDLDGSSSPNSSQLSGDSSKSSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+++ VNL++ET GG QR GGA WASPLASITIHNLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQITTVNLLLETRGGVQRGGGAAWASPLASITIHNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS 
NKK IY FKKLEWESLS+DLLPHPDMFTDA L + GA++RDDDGAKR+FFGG Sbjct: 181 ARDFSTNKKFIYAFKKLEWESLSVDLLPHPDMFTDASLARAEEGASQRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEP-GLRALLRFMTGLYVCL 1219 ERFLEGISGEAYIT+QRTE NSPLGLEVQLH+ EAVCPALSEP GLRALLRFMTGLYVCL Sbjct: 241 ERFLEGISGEAYITIQRTELNSPLGLEVQLHIPEAVCPALSEPAGLRALLRFMTGLYVCL 300 Query: 1220 NR-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNT 1396 NR DV QAQQR TEAAG S VSIVVDHIFL IKDAEFQL+LLMQSL FSRA+VSDG+ Sbjct: 301 NRGDVGLQAQQRSTEAAGCSLVSIVVDHIFLRIKDAEFQLELLMQSLLFSRATVSDGKIA 360 Query: 1397 KNLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQL 1576 NL+++ +GG+FLRDTFS PP TL+QPS+QA+T+ +PDF K+FCPPIYPL + WQ Sbjct: 361 NNLTKVMLGGMFLRDTFSRPPCTLLQPSLQAITKHVARIPDFAKDFCPPIYPLGDHQWQK 420 Query: 1577 NEGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGA 1756 + G+PLI LHSLQ KPSP PP F SQTVI CQPLMI+LQEESCLRISSFL+DGIV+NPG Sbjct: 421 SVGIPLICLHSLQAKPSPVPPCFASQTVITCQPLMIHLQEESCLRISSFLADGIVINPGD 480 Query: 1757 VLPDFSVYSLVFSLKELELTVPL---EADNFPANGNNAFQSSFAGAKLHIKDLFFSESAS 1927 VLPDFSV SLVF LKEL++ VPL +++N NGN+ F + FAGA+L I++LFFSES + Sbjct: 481 VLPDFSVNSLVFVLKELDVIVPLDVSQSNNPTENGNSTFHNVFAGARLRIENLFFSESPT 540 Query: 1928 VKLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGL 2107 +KLRLL L+KDPACF LWE QP+DASQKKW T SHL LSLE + L S S+G Sbjct: 541 LKLRLLKLEKDPACFYLWEGQPIDASQKKWTTGASHLTLSLETSTNLNGTPSSNGMSSGS 600 Query: 2108 WECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAY 2287 W C+EL + +E AM +ADGSPL +PPPGG+VR+GV+CQQ+LSNTSVEQLFFVLDLYAY Sbjct: 601 WRCIELQDASVEVAMISADGSPLTNVPPPGGIVRVGVACQQYLSNTSVEQLFFVLDLYAY 660 Query: 2288 IGRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRV EKI + K K R + G LM+K P DTAVS Sbjct: 661 FGRVCEKIVSVGKDKRPKITRNGSSGVRLMDKVPCDTAVS 700 >ref|XP_004152911.1| PREDICTED: uncharacterized protein LOC101210396 [Cucumis sativus] Length = 1203 Score = 994 bits (2571), Expect = 0.0 Identities = 506/699 (72%), Positives = 583/699 (83%), Gaps = 4/699 (0%) Frame = +2 Query: 323 
MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESILARALEYTLKYWLKSFSRDQFKLQGRT + SNLDINGDA+H+S+GLPPALNVTTA+ Sbjct: 1 MESILARALEYTLKYWLKSFSRDQFKLQGRTAQLSNLDINGDALHSSLGLPPALNVTTAR 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPS+SNVQVEP+ VQID+LDLVLEEN D++ RS++S Q +SS+ KG GYGFA Sbjct: 61 VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADMGRSTSSSQTSSSTVKGGGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+EV VNL++ET GGS+ QGGATWASPLASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS NKK IYVFKKLEWESLSIDLLPHPDMF DA+L + G RDDDGAKR+FFGG Sbjct: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADANLARAQEGPIGRDDDGAKRVFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EGISGEA IT+QRTE NSPLGLEV L++TEAVCPALSEPGLRA LRF+TGLYVCLN Sbjct: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLYITEAVCPALSEPGLRAFLRFLTGLYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD ++QQR TEAAGRS VSI+VDHIFL +KD EFQL+ LMQSL FSRASVSDGQN Sbjct: 301 RGDVDLKSQQRSTEAAGRSLVSIIVDHIFLCVKDPEFQLEFLMQSLLFSRASVSDGQNDN 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+R+ +GGLFLRDTFS PP TLVQP+MQAVT+D LHVP+F +NFCPPIYP +++ W L+ Sbjct: 361 NLTRVMIGGLFLRDTFSRPPCTLVQPAMQAVTDDFLHVPEFARNFCPPIYPFKDKQWGLS 420 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 VPL+ LHS+Q+KPSP PPSF SQTVI+CQPL I+LQE+SCLRISSFL+DGIVVNPG+V Sbjct: 421 GNVPLLCLHSVQVKPSPVPPSFASQTVIHCQPLTIHLQEKSCLRISSFLADGIVVNPGSV 480 Query: 1760 LPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 LPDFSV S+V SLKEL+++VPL+ + ++ + + SSF GA+LHIK++ FSES S+ Sbjct: 481 LPDFSVSSIVLSLKELDVSVPLDVAKSSDYHGSWDGISHSSFDGARLHIKNMQFSESPSL 540 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 LRLLNLDKDPACF LWE 
QPVDASQKKW T VS + LSLE + + D L Sbjct: 541 NLRLLNLDKDPACFLLWEGQPVDASQKKWATSVSQISLSLETYNKVSGS-KRSDAILALL 599 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CVEL +V IE AMATADG L IPPPGGVVR+GVSCQQ+LSNTSV+QLFFVLDLYAY Sbjct: 600 RCVELTDVSIEVAMATADGKTLTAIPPPGGVVRVGVSCQQYLSNTSVDQLFFVLDLYAYF 659 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRV+EKI + K + K + G L++K PSDTAVS Sbjct: 660 GRVTEKIALVGKKNRPKESGSNMLVGKLVDKVPSDTAVS 698 >ref|XP_003543291.1| PREDICTED: uncharacterized protein LOC100803142 [Glycine max] Length = 1216 Score = 994 bits (2571), Expect = 0.0 Identities = 499/699 (71%), Positives = 580/699 (82%), Gaps = 4/699 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESIL RALEYTLKYWLKSFSR+QFKLQGRTV SNLDI+GDA+H+SVGLPPALNV TAK Sbjct: 1 MESILGRALEYTLKYWLKSFSREQFKLQGRTVHLSNLDIDGDALHSSVGLPPALNVATAK 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI LPSVSNVQ EPI V IDRLDLVLEE+ DS+ SSN +++S KGSGYGFA Sbjct: 61 VGKLEITLPSVSNVQTEPIVVHIDRLDLVLEESSDSDESLSSNCSTPSAASVKGSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 DKIADGMT+++ VNL++ET GGS+RQ GATWA P+ASITI NLLLYTTNENWQVVNLKE Sbjct: 121 DKIADGMTIQIQTVNLLLETRGGSRRQVGATWAPPMASITIRNLLLYTTNENWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 AR+FS+NK IYVFKKLEW+SLSIDLLPHPDMFT+A L S G+N RDDDGAKR+FFGG Sbjct: 181 AREFSSNKY-IYVFKKLEWQSLSIDLLPHPDMFTEAALGHSQEGSNFRDDDGAKRVFFGG 239 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERF+EG+SGEAYIT+QRTE NSPLGLEVQLH+ EAVCPALSEPGLRALLRFMTG+YVCLN Sbjct: 240 ERFIEGVSGEAYITIQRTELNSPLGLEVQLHINEAVCPALSEPGLRALLRFMTGVYVCLN 299 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVD + QQR TEAAGRS VSIV+DHIFL IKD EFQL+LLMQSL FSRAS+S+G N Sbjct: 300 RGDVDSKIQQRSTEAAGRSLVSIVIDHIFLCIKDTEFQLELLMQSLCFSRASLSEGDNDN 359 Query: 1400 
NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDFGKNFCPPIYPLRNQLWQLN 1579 NL+RIT+GGLFLRDTF PP LVQPSMQAVT+D+ HVP+F ++FCPPIYPL+ Q WQL Sbjct: 360 NLTRITIGGLFLRDTFCSPPCILVQPSMQAVTKDAFHVPEFARSFCPPIYPLQEQEWQLI 419 Query: 1580 EGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGAV 1759 EG PLI LH+L+I PSP PPSF S+TVI CQPL+I+LQEESCLRISS L+DGIVVNPG + Sbjct: 420 EGTPLICLHALKIMPSPLPPSFASETVIDCQPLVIHLQEESCLRISSLLADGIVVNPGDI 479 Query: 1760 LPDFSVYSLVFSLKELELTVPLE---ADNFPANGNNAFQSSFAGAKLHIKDLFFSESASV 1930 L DFSV S +F+LK L+LTVP + D ++ +N Q+SFAGA+LHI+ L F S S+ Sbjct: 480 LSDFSVKSFIFNLKGLDLTVPFDKTKLDISKSDMDNTVQTSFAGARLHIESLCFLNSPSL 539 Query: 1931 KLRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLW 2110 KLR+LNL+KDPACFSLWE QP+DASQ+KW S L LSLE C+ C + ++++GLW Sbjct: 540 KLRILNLEKDPACFSLWEGQPIDASQEKWTARASQLTLSLEACTDRTGCQNSLEQTSGLW 599 Query: 2111 ECVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYI 2290 CV+L + CIE AM TADGSPL+ +PPPGG+VR+GV+C+Q+LSNTSVEQLFFVLDLY Y Sbjct: 600 RCVDLKDACIEVAMVTADGSPLLQVPPPGGIVRVGVACEQYLSNTSVEQLFFVLDLYGYF 659 Query: 2291 GRVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 GRVSEKI K K + IR + G LM+K PSD +VS Sbjct: 660 GRVSEKIAKAGKRKQLEDIRDTSFSGKLMDKVPSDASVS 698 >ref|XP_006830454.1| hypothetical protein AMTR_s00115p00072410 [Amborella trichopoda] gi|548836827|gb|ERM97870.1| hypothetical protein AMTR_s00115p00072410 [Amborella trichopoda] Length = 1129 Score = 986 bits (2550), Expect = 0.0 Identities = 500/698 (71%), Positives = 577/698 (82%), Gaps = 3/698 (0%) Frame = +2 Query: 323 MESILARALEYTLKYWLKSFSRDQFKLQGRTVRFSNLDINGDAVHASVGLPPALNVTTAK 502 MESI+ +ALEYTLKYWLKSFSR+QFKLQGRT + NLDINGDA+HAS GLPPALNVT A+ Sbjct: 1 MESIIGKALEYTLKYWLKSFSREQFKLQGRTAQLYNLDINGDALHASAGLPPALNVTHAR 60 Query: 503 VGKLEIKLPSVSNVQVEPICVQIDRLDLVLEENLDSNACRSSNSVQLTSSSGKGSGYGFA 682 VGKLEI+LPS SNVQ EPI VQID+LDLVLEEN S+ ++S S Q +SSS K SGYGFA Sbjct: 61 VGKLEIQLPSFSNVQTEPIVVQIDKLDLVLEENTGSDLGKTSCSNQSSSSSAKSSGYGFA 120 Query: 683 DKIADGMTLEVGIVNLMIETHGGSQRQGGATWASPLASITIHNLLLYTTNENWQVVNLKE 862 
DKIADGMT+EVGIVNLM+ET GG R+GGATW PLASITI NLLLYTTNE WQVVNLKE Sbjct: 121 DKIADGMTVEVGIVNLMLETRGGPGRKGGATWTPPLASITIRNLLLYTTNEKWQVVNLKE 180 Query: 863 ARDFSNNKKCIYVFKKLEWESLSIDLLPHPDMFTDAHLTFSNNGANKRDDDGAKRMFFGG 1042 ARDFS+N+K IYVFKK+EWESLSIDLLPHPDMF D LT SN+ RDDDGAKR+FFGG Sbjct: 181 ARDFSDNEKFIYVFKKMEWESLSIDLLPHPDMFADERLTSSNSRDTSRDDDGAKRLFFGG 240 Query: 1043 ERFLEGISGEAYITVQRTEQNSPLGLEVQLHVTEAVCPALSEPGLRALLRFMTGLYVCLN 1222 ERFL+ ISG+AYITVQRTEQN+PLGLEVQLH+ EAVCP+LSEPGLRALLRFMTGLYVCLN Sbjct: 241 ERFLDSISGQAYITVQRTEQNNPLGLEVQLHIPEAVCPSLSEPGLRALLRFMTGLYVCLN 300 Query: 1223 R-DVDPQAQQRCTEAAGRSFVSIVVDHIFLSIKDAEFQLDLLMQSLFFSRASVSDGQNTK 1399 R DVDP+AQQRCTEAAGRS VSI+VDH+FL +KDAEFQL+LLMQSL++SRASVSDG+NTK Sbjct: 301 RGDVDPKAQQRCTEAAGRSLVSIIVDHVFLCVKDAEFQLELLMQSLYYSRASVSDGENTK 360 Query: 1400 NLSRITVGGLFLRDTFSHPPSTLVQPSMQAVTEDSLHVPDF-GKNFCPPIYPLRNQLWQL 1576 N+SR+ VGGLFLRDTFSHPP TLVQPSMQ ++DS PDF G+ P IYPL Q WQL Sbjct: 361 NISRVIVGGLFLRDTFSHPPCTLVQPSMQIDSKDSPDTPDFAGEGLWPKIYPLGEQPWQL 420 Query: 1577 NEGVPLISLHSLQIKPSPAPPSFTSQTVIYCQPLMINLQEESCLRISSFLSDGIVVNPGA 1756 + +PL+ L+S Q+ PSPAPPSF SQTVI C+PL+INLQE+SCLRISSFL+DGIVVN GA Sbjct: 421 HASIPLVFLYSFQLNPSPAPPSFASQTVINCEPLIINLQEKSCLRISSFLADGIVVNSGA 480 Query: 1757 VLPDFSVYSLVFSLKELELTVPLEADNFPANGN-NAFQSSFAGAKLHIKDLFFSESASVK 1933 VLPDFSV S+VF+LKE LTVPL++ A N QSSF GA+LH ++L F +S +++ Sbjct: 481 VLPDFSVNSMVFTLKEFNLTVPLDSGLPDAKLNMMPSQSSFEGARLHAENLIFHQSPALR 540 Query: 1934 LRLLNLDKDPACFSLWEDQPVDASQKKWRTEVSHLILSLEPCSTLKECVSFPDRSAGLWE 2113 L+LLNL+KDPACF LWE QP+D+SQ+KW SHL LSLE K+ + S GLW Sbjct: 541 LKLLNLEKDPACFCLWESQPIDSSQRKWTMRASHLNLSLETSIGEKKSPDLSEWSTGLWR 600 Query: 2114 CVELHEVCIEAAMATADGSPLITIPPPGGVVRIGVSCQQFLSNTSVEQLFFVLDLYAYIG 2293 CVEL + C EAAM TADGSPLIT+PPPGG+VRIGV+C+Q+LSNTSVEQL FVLDLYAY G Sbjct: 601 CVELQDACFEAAMVTADGSPLITVPPPGGLVRIGVACEQYLSNTSVEQLLFVLDLYAYFG 660 Query: 2294 RVSEKITKLRKSDGQKRIRRRTIGGSLMEKFPSDTAVS 2407 RVSE+I K+ K Q R + + G +M+ PSDT VS Sbjct: 661 RVSEEIAKVGKIKRQGR-KAGLLKGGMMDYAPSDTGVS 697