BLASTX nr result
ID: Akebia25_contig00009303
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00009303 (1951 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

emb|CBI27163.3| unnamed protein product [Vitis vinifera]  947  0.0
ref|XP_002277727.1| PREDICTED: uncharacterized protein LOC100250...  947  0.0
ref|XP_006448716.1| hypothetical protein CICLE_v10014649mg [Citr...  929  0.0
ref|XP_006468463.1| PREDICTED: uncharacterized protein LOC102626...  923  0.0
ref|XP_002524131.1| conserved hypothetical protein [Ricinus comm...  913  0.0
ref|XP_002322374.2| hypothetical protein POPTR_0015s15390g [Popu...  912  0.0
ref|XP_006377029.1| hypothetical protein POPTR_0012s12240g [Popu...  908  0.0
ref|XP_006441047.1| hypothetical protein CICLE_v10019372mg [Citr...  904  0.0
ref|XP_004294199.1| PREDICTED: uncharacterized protein LOC101301...  902  0.0
gb|EXC24765.1| hypothetical protein L484_018479 [Morus notabilis]  900  0.0
ref|XP_007211386.1| hypothetical protein PRUPE_ppa003117mg [Prun...  897  0.0
ref|XP_007024931.1| Trypsin family protein isoform 1 [Theobroma ...  893  0.0
ref|XP_003531502.1| PREDICTED: uncharacterized protein LOC100806...  892  0.0
ref|XP_003546786.1| PREDICTED: uncharacterized protein LOC100783...  890  0.0
ref|XP_007149278.1| hypothetical protein PHAVU_005G056800g [Phas...  885  0.0
ref|XP_002513414.1| conserved hypothetical protein [Ricinus comm...  882  0.0
ref|XP_003556316.1| PREDICTED: uncharacterized protein LOC100816...  876  0.0
ref|XP_007022258.1| Trypsin family protein [Theobroma cacao] gi|...  872  0.0
ref|XP_003529430.1| PREDICTED: uncharacterized protein LOC100796...
872 0.0 ref|XP_004488832.1| PREDICTED: uncharacterized protein LOC101500... 871 0.0 >emb|CBI27163.3| unnamed protein product [Vitis vinifera] Length = 684 Score = 947 bits (2449), Expect = 0.0 Identities = 481/597 (80%), Positives = 522/597 (87%), Gaps = 2/597 (0%) Frame = -2 Query: 1941 RMERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAA 1765 +M+R + LDLR HSGS QSEESALDLERNYC HPNLPS SPP LQAFAS GQ SESNAA Sbjct: 88 KMDR-TRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAA 146 Query: 1764 YFSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILR 1585 YFSWP SSRLN AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILR Sbjct: 147 YFSWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILR 206 Query: 1584 RYSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVE 1405 R+SLGTAIGFRIRRG LT+IPAILVFVARKVHRQWL+HIQCLP+ALEGPGGVWCDVDVVE Sbjct: 207 RFSLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVE 266 Query: 1404 FSYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGF 1225 FSY+GAPA TPKEQLYTELVDGLRGSD CIGSGSQVASQETYGTLGAIVKSRTGN+QVGF Sbjct: 267 FSYYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGF 326 Query: 1224 LTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVR 1045 LTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVR Sbjct: 327 LTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVR 386 Query: 1044 ADGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGT 865 ADGAFIPFADDF+ SNVTT+VKGVGEIGDV IIDLQSPINSLIG+QV KVGRSSGLTTGT Sbjct: 387 ADGAFIPFADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGT 446 Query: 864 IMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGT 685 IMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG N +KPRP+GIIWGGT Sbjct: 447 IMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGT 506 Query: 684 ANRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTV 505 ANRGRLKLKVGQPPENWTSGV L+T++EGLQ AV EQ +ASA DSTV Sbjct: 507 ANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTV 566 Query: 504 
GESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPP 325 GESSP E L K+K ++ FE L IN++Q+PIE S+ + P F H +FHIE+GV+ P Sbjct: 567 GESSPPEPVLLKNKTEENFEPLGINLQQVPIE--GESQQAVLPSFIHTEFHIEEGVEAAP 624 Query: 324 NV-ELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 NV E QFIPSC +SP+H+ + Q NPE KNL ALRN S+E++ SLQLG EPKRR+ Sbjct: 625 NVEEHQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRK 681 >ref|XP_002277727.1| PREDICTED: uncharacterized protein LOC100250825 [Vitis vinifera] Length = 596 Score = 947 bits (2447), Expect = 0.0 Identities = 479/590 (81%), Positives = 517/590 (87%), Gaps = 2/590 (0%) Frame = -2 Query: 1920 LDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAYFSWPNS 1744 LDLR HSGS QSEESALDLERNYC HPNLPS SPP LQAFAS GQ SESNAAYFSWP S Sbjct: 6 LDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYFSWPTS 65 Query: 1743 SRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTA 1564 SRLN AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR+SLGTA Sbjct: 66 SRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTA 125 Query: 1563 IGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAP 1384 IGFRIRRG LT+IPAILVFVARKVHRQWL+HIQCLP+ALEGPGGVWCDVDVVEFSY+GAP Sbjct: 126 IGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFSYYGAP 185 Query: 1383 AATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVA 1204 A TPKEQLYTELVDGLRGSD CIGSGSQVASQETYGTLGAIVKSRTGN+QVGFLTNRHVA Sbjct: 186 APTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGFLTNRHVA 245 Query: 1203 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIP 1024 VDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRADGAFIP Sbjct: 246 VDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIP 305 Query: 1023 FADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALE 844 FADDF+ SNVTT+VKGVGEIGDV IIDLQSPINSLIG+QV KVGRSSGLTTGTIMAYALE Sbjct: 306 FADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYALE 365 Query: 843 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLK 664 
YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG N +KPRP+GIIWGGTANRGRLK Sbjct: 366 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLK 425 Query: 663 LKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAE 484 LKVGQPPENWTSGV L+T++EGLQ AV EQ +ASA DSTVGESSP E Sbjct: 426 LKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTVGESSPPE 485 Query: 483 GALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNV-ELQF 307 L K+K ++ FE L IN++Q+PIE S+ + P F H +FHIE+GV+ PNV E QF Sbjct: 486 PVLLKNKTEENFEPLGINLQQVPIE--GESQQAVLPSFIHTEFHIEEGVEAAPNVEEHQF 543 Query: 306 IPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 IPSC +SP+H+ + Q NPE KNL ALRN S+E++ SLQLG EPKRR+ Sbjct: 544 IPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRK 593 >ref|XP_006448716.1| hypothetical protein CICLE_v10014649mg [Citrus clementina] gi|567912807|ref|XP_006448717.1| hypothetical protein CICLE_v10014649mg [Citrus clementina] gi|568828259|ref|XP_006468461.1| PREDICTED: uncharacterized protein LOC102626549 isoform X1 [Citrus sinensis] gi|568828261|ref|XP_006468462.1| PREDICTED: uncharacterized protein LOC102626549 isoform X2 [Citrus sinensis] gi|557551327|gb|ESR61956.1| hypothetical protein CICLE_v10014649mg [Citrus clementina] gi|557551328|gb|ESR61957.1| hypothetical protein CICLE_v10014649mg [Citrus clementina] Length = 604 Score = 929 bits (2402), Expect = 0.0 Identities = 468/599 (78%), Positives = 517/599 (86%), Gaps = 2/599 (0%) Frame = -2 Query: 1917 DLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAYFSWPNSS 1741 DLR ++SGS+QSEESALDLERNYC HPNLPSSSP PLQ FAS GQHSESNAAYFSWP S Sbjct: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66 Query: 1740 RLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTAI 1561 RLN AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR+SLGTAI Sbjct: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126 Query: 1560 GFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAPA 1381 GFRIRRG LTDIPAILVFVARKVHRQWLSH+QCLP+ALEGPGGVWCDVDVVEFSY+GAPA Sbjct: 127 
GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186 Query: 1380 ATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAV 1201 TPKE+LYTELVDGLRGSD CIGSGSQVASQETYGTLGAIV+SRTGN+QVGFLTNRHVAV Sbjct: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246 Query: 1200 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIPF 1021 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRADGAFIPF Sbjct: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306 Query: 1020 ADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALEY 841 A+DF+ +NVTTSVKGVGEIGDV IIDLQSPINSLIG+QV KVGRSSGLTTGT+MAYALEY Sbjct: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366 Query: 840 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLKL 661 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG N +KPRP+GIIWGGTANRGRLKL Sbjct: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426 Query: 660 KVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAEG 481 KVGQPP NWTSGV L+ + EG Q AVQ+QR+ASA A +STVGES PAE Sbjct: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486 Query: 480 ALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQFIP 301 K+K + E ++NI+Q ++ SE PPF H +FH+EDG++ NV QFIP Sbjct: 487 EQSKEKTAERLEPFNLNIQQDLVD--GESEQGPTPPFIHTEFHVEDGIESSSNVGHQFIP 544 Query: 300 SCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR-ADSTIVIDDS 127 S RSP+H+ ++Q N K+LSALRNG DED SLQLG+ EPKRR+ +D+++ + +S Sbjct: 545 SFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNVQES 603 >ref|XP_006468463.1| PREDICTED: uncharacterized protein LOC102626549 isoform X3 [Citrus sinensis] Length = 602 Score = 923 bits (2385), Expect = 0.0 Identities = 467/599 (77%), Positives = 516/599 (86%), Gaps = 2/599 (0%) Frame = -2 Query: 1917 DLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAYFSWPNSS 1741 DLR ++SGS+QSEESALDLERNYC HPNLPSSSP PLQ FAS GQHSESNAAYFSWP S Sbjct: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66 Query: 1740 
RLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTAI 1561 RLN AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR+SLGTAI Sbjct: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126 Query: 1560 GFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAPA 1381 GFRIRRG LTDIPAILVFVARKVHRQWLSH+QCLP+ALEGPGGVWCDVDVVEFSY+GAPA Sbjct: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186 Query: 1380 ATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAV 1201 TPKE+LYTELVDGLRGSD CIGSGSQVASQETYGTLGAIV+SRTGN+QVGFLTNRHVAV Sbjct: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246 Query: 1200 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIPF 1021 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRADGAFIPF Sbjct: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306 Query: 1020 ADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALEY 841 A+DF+ +NVTTSVKGVGEIGDV IIDLQSPINSLIG+QV KVGRSSGLTTGT+MAYALEY Sbjct: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366 Query: 840 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLKL 661 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG N +KPRP+GIIWGGTANRGRLKL Sbjct: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426 Query: 660 KVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAEG 481 KVGQPP NWTSGV L+ + EG Q VQ+QR+ASA A +STVGES PAE Sbjct: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQ--VQDQRNASAAAIESTVGESPPAER 484 Query: 480 ALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQFIP 301 K+K + E ++NI+Q ++ SE PPF H +FH+EDG++ NV QFIP Sbjct: 485 EQSKEKTAERLEPFNLNIQQDLVD--GESEQGPTPPFIHTEFHVEDGIESSSNVGHQFIP 542 Query: 300 SCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR-ADSTIVIDDS 127 S RSP+H+ ++Q N K+LSALRNG DED SLQLG+ EPKRR+ +D+++ + +S Sbjct: 543 SFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNVQES 601 >ref|XP_002524131.1| conserved hypothetical protein [Ricinus communis] gi|223536598|gb|EEF38242.1| conserved hypothetical 
protein [Ricinus communis] Length = 593 Score = 913 bits (2360), Expect = 0.0 Identities = 466/591 (78%), Positives = 506/591 (85%), Gaps = 1/591 (0%) Frame = -2 Query: 1926 SSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAYFSWP 1750 + LDLR HSGSTQSEESALDLERN C HPN SSP LQ FAS+GQH ESNAAYFSWP Sbjct: 4 NKLDLRLHHSGSTQSEESALDLERNCCNHPNPHWSSPTSLQPFASSGQHYESNAAYFSWP 63 Query: 1749 NSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLG 1570 SRLN AEDRANYFGNLQKGVLPE L +LP+G+QATTLLELMTIRAFHS+ILRR+SLG Sbjct: 64 TLSRLNDTAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRFSLG 123 Query: 1569 TAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFG 1390 TAIGFRIRRG LTDIPAILVFVARKVHRQWLSH+QCLP+ALEGPGGVWCDVDVVEFSY+G Sbjct: 124 TAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYG 183 Query: 1389 APAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRH 1210 APA+TPKEQLYTELVDGLRGS CIGSGSQVA+QETYGTLGAIVKSRTGNRQVGFLTNRH Sbjct: 184 APASTPKEQLYTELVDGLRGSYPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLTNRH 243 Query: 1209 VAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAF 1030 VAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYG FAGTNPETFVRADGAF Sbjct: 244 VAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRADGAF 303 Query: 1029 IPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYA 850 IPFA+DF+ +NVTTSVKGVGEIGDV IDLQSPINSLIG+QV KVGRSSGLTTGTIMAYA Sbjct: 304 IPFAEDFNMNNVTTSVKGVGEIGDVHSIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYA 363 Query: 849 LEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGR 670 LEYNDEKGICFFTDFLVVGENQQ FDLEGDSGSLILLTG N DKPRP+GIIWGGTANRGR Sbjct: 364 LEYNDEKGICFFTDFLVVGENQQPFDLEGDSGSLILLTGQNGDKPRPVGIIWGGTANRGR 423 Query: 669 LKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSP 490 LKLKVGQPPENWTSGV LVTS EGLQ VQ+Q++ SA DSTVGESSP Sbjct: 424 LKLKVGQPPENWTSGVDLGRLLDLLELDLVTSNEGLQ--VQDQKNVSAAGLDSTVGESSP 481 Query: 489 AEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQ 310 + L KD+ +D E L++NI+Q+ +E S+ L PFT +FHIEDGV+ PNVE Q Sbjct: 482 
PDRVLSKDRIEDNIEPLNLNIQQVLLE--EESQHGLTAPFTRTEFHIEDGVETAPNVEHQ 539 Query: 309 FIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 FIPS +H + Q N E +NLSALR+GSDE++ SL+LG+ EPKRRR Sbjct: 540 FIPSFTGGPMVHDKNKQENVELENLSALRHGSDEEIHVSLRLGEPEPKRRR 590 >ref|XP_002322374.2| hypothetical protein POPTR_0015s15390g [Populus trichocarpa] gi|550322788|gb|EEF06501.2| hypothetical protein POPTR_0015s15390g [Populus trichocarpa] Length = 594 Score = 912 bits (2356), Expect = 0.0 Identities = 461/590 (78%), Positives = 506/590 (85%), Gaps = 2/590 (0%) Frame = -2 Query: 1920 LDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAYFSWPNS 1744 L LR HSGS+QSEESALDLERNYC HPNL SSP PLQ FAS GQHSESNAAYFSWP Sbjct: 6 LGLRIHHSGSSQSEESALDLERNYCSHPNLLWSSPSPLQPFASGGQHSESNAAYFSWPTL 65 Query: 1743 SRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTA 1564 SRLN AAE RANYFGNLQKGVLPE L +LP+G++ATTLLELMTIRAFHS+ILRR+SLGTA Sbjct: 66 SRLNDAAEVRANYFGNLQKGVLPETLGRLPSGQRATTLLELMTIRAFHSKILRRFSLGTA 125 Query: 1563 IGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAP 1384 IGFRIRRG LTDIPAILVFVARKVHRQWLSH+QCLP+ALEGPGGVWCDVDVVEFSY+G P Sbjct: 126 IGFRIRRGDLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGVP 185 Query: 1383 AATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVA 1204 AATPKEQLYTELVDGLRGSD CIGSGSQVA+QETYGTLGAIVKSRTGNRQVGFLTNRHVA Sbjct: 186 AATPKEQLYTELVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLTNRHVA 245 Query: 1203 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIP 1024 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYG FAGTNPETFVRADGAFIP Sbjct: 246 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRADGAFIP 305 Query: 1023 FADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALE 844 FA+DF+ +NV +VKGVGE+GDV +IDLQ+PINSLIG+QV KVGRSSGLTTGTIMAYALE Sbjct: 306 FAEDFNMNNVNITVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTIMAYALE 365 Query: 843 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLK 664 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG + +KPRP+GIIWGGTANRGRLK Sbjct: 366 
YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGRDCEKPRPVGIIWGGTANRGRLK 425 Query: 663 LKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAE 484 LKVGQPPENWTSGV ++T+ EGLQ A+Q+QR+ A DSTVGESSP + Sbjct: 426 LKVGQPPENWTSGVDLGRLLDLLELDIITTNEGLQAAIQDQRNPLAQGIDSTVGESSPLD 485 Query: 483 GALPKDKKKDIFESLDINIEQIPIEVGSG-SEVNLNPPFTHVDFHIEDGVDVPPNVELQF 307 K+K ++ FE L++NI+Q+ G G S+ P F +FHIED V+ PNVE QF Sbjct: 486 RVPSKEKIEENFEPLNLNIQQV---TGEGESQHGQTPLFIGPEFHIEDAVEASPNVEHQF 542 Query: 306 IPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 IPS RSP+H Q NPE KNLSALR+ SDE +CFSL LG+ EPKRR+ Sbjct: 543 IPSFSGRSPMHDNTPQENPELKNLSALRSDSDE-MCFSLHLGEPEPKRRK 591 >ref|XP_006377029.1| hypothetical protein POPTR_0012s12240g [Populus trichocarpa] gi|550326967|gb|ERP54826.1| hypothetical protein POPTR_0012s12240g [Populus trichocarpa] Length = 593 Score = 908 bits (2347), Expect = 0.0 Identities = 462/595 (77%), Positives = 510/595 (85%), Gaps = 1/595 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLP-SSSPPLQAFASAGQHSESNAAY 1762 MER + L LR HSGS+QSEESALDLERNYC H LP SS PLQ F S GQHSESNAAY Sbjct: 1 MER-NRLGLRIHHSGSSQSEESALDLERNYCNH--LPWSSLSPLQPFTSGGQHSESNAAY 57 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SRLN AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR Sbjct: 58 FSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRR 117 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 +SLGTAIGFRIRRG LTDIPAILVFVARKVHRQWLSH+QCLP+ALEGPGGVWCDVDVVEF Sbjct: 118 FSLGTAIGFRIRRGILTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEF 177 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SY+GAPAATPKEQLYT+LVDGLRGSD CIGSGSQVA+QETYGTLGAIVKSRTGNRQVGFL Sbjct: 178 SYYGAPAATPKEQLYTDLVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFL 237 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRA Sbjct: 238 
TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRA 297 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPFA DF+ +NVTT+VKGVGE+GDV +IDLQ+PINSLIG+QV KVGRSSGLTTGTI Sbjct: 298 DGAFIPFAGDFNMNNVTTTVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTI 357 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILL G + +KP+P+GIIWGGTA Sbjct: 358 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLKGQDCEKPQPVGIIWGGTA 417 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVG PPENWTSGV L+T+ +GLQ AVQ+QR+ASA A DSTVG Sbjct: 418 NRGRLKLKVGLPPENWTSGVDLGRLLDLLELDLITTNDGLQAAVQDQRNASAPAIDSTVG 477 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 ESSP + K+K ++ FE +++N++Q ++ S+ +P F +FHIEDG + PN Sbjct: 478 ESSPLDRVPSKEKIEENFEPINLNMQQGVVK--GESQQGQSPLFIGPEFHIEDGAEAAPN 535 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 VE QFIPS +S +H Q PE KNLSALR+ SDE++CFSLQLG EPKRR+ Sbjct: 536 VEHQFIPSFSGQSLMHDNKPQETPELKNLSALRSDSDEEMCFSLQLGKPEPKRRK 590 >ref|XP_006441047.1| hypothetical protein CICLE_v10019372mg [Citrus clementina] gi|568848484|ref|XP_006478038.1| PREDICTED: uncharacterized protein LOC102612774 [Citrus sinensis] gi|557543309|gb|ESR54287.1| hypothetical protein CICLE_v10019372mg [Citrus clementina] Length = 604 Score = 904 bits (2337), Expect = 0.0 Identities = 457/599 (76%), Positives = 504/599 (84%), Gaps = 1/599 (0%) Frame = -2 Query: 1920 LDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAYFSWPNS 1744 L++R R SGST SEESALD ERN C HPNLPS SPP LQ FASAGQH ESNAAYFSWP S Sbjct: 6 LNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTS 65 Query: 1743 SRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTA 1564 SRL+ AAE+RANYF NLQKGVLPE L QLP G+QATTLLELMTIRAFHS+ILR YSLGTA Sbjct: 66 SRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTA 125 Query: 1563 IGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAP 1384 IGFRI+RG 
LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEFSYFGAP Sbjct: 126 IGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAP 185 Query: 1383 AATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVA 1204 TPKEQLYT++VD LRG D IGSGSQVASQETYGTLGAIVKS+TG+RQVGFLTNRHVA Sbjct: 186 EPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVA 245 Query: 1203 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIP 1024 VDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDDLWYG FAG N ETFVRADGAFIP Sbjct: 246 VDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGINAETFVRADGAFIP 305 Query: 1023 FADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALE 844 FADDFD S VTTSVKG+GEIGDVKI+DLQSPI+SLIGKQV KVGRSSGLTTGT++AYALE Sbjct: 306 FADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALE 365 Query: 843 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLK 664 YNDEKGICF TDFLVVGENQQTFDLEGDSGSLIL+ G N +KPRPIGIIWGGTANRGRLK Sbjct: 366 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLK 425 Query: 663 LKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAE 484 LK+GQPPENWTSGV L+T+ EGL+VAVQEQR+ASA A STVG+SSP + Sbjct: 426 LKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPD 485 Query: 483 GALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQFI 304 G KDK +D FE L + I+ IP+EV S NP +FH+EDGV P+VELQFI Sbjct: 486 GMHLKDKAEDKFEPLGLQIQHIPVEVEHHSP-ETNPSLMETEFHLEDGVKAGPSVELQFI 544 Query: 303 PSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADSTIVIDDS 127 PS SPLH+ + +NL++L NG DED+CFSLQLGD E KRRR+D++ ++S Sbjct: 545 PSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEES 603 >ref|XP_004294199.1| PREDICTED: uncharacterized protein LOC101301759 [Fragaria vesca subsp. 
vesca] Length = 604 Score = 902 bits (2331), Expect = 0.0 Identities = 458/599 (76%), Positives = 506/599 (84%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 M+RA ++R R SGST SEESALDLER+ C H NLPS SPP LQ FASAGQH E++AAY Sbjct: 1 MDRAR-FNMRMRCSGSTPSEESALDLERSCCSHSNLPSLSPPTLQPFASAGQHCETSAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SSRLN AAE+RANYF NLQKGVLPE L QLP G+QATTLLELMTIRAFHS+ILR Sbjct: 60 FSWPTSSRLNDAAEERANYFTNLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRC 119 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRIRRG LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 120 YSLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 179 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP PKEQLYTE+VD LRG D IGSGSQVASQETYGTLGAIVKS+TG+RQVGFL Sbjct: 180 SYFGAPEPAPKEQLYTEIVDDLRGGDPRIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL 239 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYG FAG NPETFVRA Sbjct: 240 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRA 299 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPF+DDFD S VTTSVKGVG IGDVKIIDLQSPI++LIGK V KVGRSSGLT GT+ Sbjct: 300 DGAFIPFSDDFDMSTVTTSVKGVGGIGDVKIIDLQSPISTLIGKHVMKVGRSSGLTAGTV 359 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI L G N +KPRPIGIIWGGTA Sbjct: 360 LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLITLKGENGEKPRPIGIIWGGTA 419 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLK+GQPPENWTSGV L+T+ EGL+VAVQEQRS SA A STVG Sbjct: 420 NRGRLKLKIGQPPENWTSGVDLGRLLHLLELDLITTDEGLKVAVQEQRSVSATAIGSTVG 479 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP +G LPK+ +D FESL + I++IP++V GS + ++P +FH+EDG P+ Sbjct: 480 
DSSPLDGLLPKEGPEDKFESLGLQIQRIPLDVEPGS-LPMSPSLVETEFHLEDGTKAVPS 538 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 VE QFIPS I SPLHK + G +NLS+LRNG DED+CFSLQLGD E KRRR+D++ Sbjct: 539 VEHQFIPSFISGSPLHKMNQMGRTVSENLSSLRNGCDEDICFSLQLGDNEAKRRRSDTS 597 >gb|EXC24765.1| hypothetical protein L484_018479 [Morus notabilis] Length = 607 Score = 900 bits (2326), Expect = 0.0 Identities = 460/590 (77%), Positives = 499/590 (84%), Gaps = 2/590 (0%) Frame = -2 Query: 1920 LDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAYFSWPNS 1744 LDLR SGSTQS+ES LDL+RNYCGHPNLPSSSP PLQ FAS QHSESNAAYFSWP S Sbjct: 6 LDLRFHPSGSTQSDESVLDLDRNYCGHPNLPSSSPSPLQPFASGAQHSESNAAYFSWPTS 65 Query: 1743 SRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTA 1564 SRLN AAEDRANYFGNLQKGVLPE L LPTG+QATTLLELMTIRAFHS+ILRR+SLGTA Sbjct: 66 SRLNDAAEDRANYFGNLQKGVLPETLGCLPTGQQATTLLELMTIRAFHSKILRRFSLGTA 125 Query: 1563 IGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAP 1384 IGFRIRRG LT+IPAILVFVARKVHRQWL+H+QCLP+ALEGPGGVWCDVDVVEFSY+GAP Sbjct: 126 IGFRIRRGVLTNIPAILVFVARKVHRQWLNHVQCLPAALEGPGGVWCDVDVVEFSYYGAP 185 Query: 1383 AATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVA 1204 A TPKEQLYTELVDGLRGSD CIGSGSQVASQETYGTLGAIV+SRTGNRQVGFLTNRHVA Sbjct: 186 APTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNRQVGFLTNRHVA 245 Query: 1203 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIP 1024 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTN ETFVRADGAFIP Sbjct: 246 VDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNAETFVRADGAFIP 305 Query: 1023 FADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALE 844 FA+DF+ +NV +V+GVG+IGDV IIDLQSPINSLIG+QV KVGRSSGLTTGTIMAYALE Sbjct: 306 FAEDFNMNNVDVTVRGVGQIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYALE 365 Query: 843 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLK 664 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLT + +PRP+GIIWGGTANRGRLK Sbjct: 366 YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTSPDGVRPRPVGIIWGGTANRGRLK 425 Query: 663 
LKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAE 484 LKVGQPPENWTSGV L+T+ EGLQ A+QEQR+ASA STVGESSP + Sbjct: 426 LKVGQPPENWTSGVDLGRLLDLLELDLITTNEGLQAALQEQRNASAAGMGSTVGESSPTD 485 Query: 483 GALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQFI 304 K+K + E +NI+Q PI S P +FHIE+G+ PN E QFI Sbjct: 486 RVPFKEKLEGNSEPFGLNIQQAPIV--RESFQGPAPTIRQSEFHIENGIKTAPNFEHQFI 543 Query: 303 PSCIDRSPLHKYDSQ-GNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRR 157 PS S +HK Q N E KNLSALRNGSDE++CFSL+LG+ EPKRR+ Sbjct: 544 PSFTGGSTVHKSSYQEENLESKNLSALRNGSDEEICFSLRLGEPEPKRRK 593 >ref|XP_007211386.1| hypothetical protein PRUPE_ppa003117mg [Prunus persica] gi|462407251|gb|EMJ12585.1| hypothetical protein PRUPE_ppa003117mg [Prunus persica] Length = 601 Score = 897 bits (2319), Expect = 0.0 Identities = 454/599 (75%), Positives = 505/599 (84%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 MER + ++R R SGST SEES LDLERN H NLPS SPP LQ +ASAGQH E++AAY Sbjct: 1 MER-TRFNMRMRCSGSTPSEESVLDLERNCYSHSNLPSLSPPTLQPYASAGQHCETSAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SSRLN AAE+RANYF NLQKGVLPE L QLP G+QATTLLELMTIRAFHS+ILR Sbjct: 60 FSWPTSSRLNDAAEERANYFTNLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRC 119 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRIRRG LTDIPAILVFVARKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 120 YSLGTAIGFRIRRGVLTDIPAILVFVARKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 179 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP PKEQLYTE+VD LRG D CIGSGSQVASQETYGTLGAIV+S+TGNRQVGFL Sbjct: 180 SYFGAPEPAPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGNRQVGFL 239 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYG FAG NPETFVRA Sbjct: 240 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRA 299 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 
DGAFIPFADDFD V TSVKGVGEIG+VKIIDLQSPI++LIGKQV KVGRSSGLTTGT+ Sbjct: 300 DGAFIPFADDFDMCTVITSVKGVGEIGNVKIIDLQSPISTLIGKQVMKVGRSSGLTTGTV 359 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI+L G N +KPRPIGIIWGGTA Sbjct: 360 LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENGEKPRPIGIIWGGTA 419 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLK+GQPPENWTSGV L+T+ EG++VAVQEQR+ASA A STVG Sbjct: 420 NRGRLKLKIGQPPENWTSGVDLGRLLKLLELDLITTDEGVKVAVQEQRTASATAIGSTVG 479 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP +G LPK++ ++ FESL + I+ IP+E S ++L +FH+EDG+ P+ Sbjct: 480 DSSPPDGMLPKERPEEKFESLGLQIQHIPLEAEPSSSLSL----VETEFHLEDGIKAVPS 535 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 VE QFIPS + SPLHK + G +NLS+LRNG DED+CFSLQLGD E KRRR+ ++ Sbjct: 536 VEHQFIPSFLGGSPLHKKNQMGRTVSENLSSLRNGCDEDICFSLQLGDNEAKRRRSGAS 594 >ref|XP_007024931.1| Trypsin family protein isoform 1 [Theobroma cacao] gi|590622019|ref|XP_007024932.1| Trypsin family protein isoform 1 [Theobroma cacao] gi|590622023|ref|XP_007024933.1| Trypsin family protein isoform 1 [Theobroma cacao] gi|508780297|gb|EOY27553.1| Trypsin family protein isoform 1 [Theobroma cacao] gi|508780298|gb|EOY27554.1| Trypsin family protein isoform 1 [Theobroma cacao] gi|508780299|gb|EOY27555.1| Trypsin family protein isoform 1 [Theobroma cacao] Length = 607 Score = 893 bits (2308), Expect = 0.0 Identities = 461/600 (76%), Positives = 503/600 (83%), Gaps = 1/600 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAY 1762 MER + LDLR HSGS +SEESALDLERN C H NLPSSSP PLQ FAS QHSESNAAY Sbjct: 1 MER-NRLDLRFHHSGSIESEESALDLERNCCNHFNLPSSSPSPLQPFASGAQHSESNAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SSRL AAEDRANYFGNLQKGVLPE L +LP+G+QATTLLELMTIRAFHS+ LRR Sbjct: 60 FSWPTSSRLIDAAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKKLRR 119 Query: 
1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 +SLGTAIGFRIRRG LT IPAILVFVARKVHRQWLS QCLP+ALEGPGGVWCDVDVVEF Sbjct: 120 FSLGTAIGFRIRRGVLTKIPAILVFVARKVHRQWLSQFQCLPAALEGPGGVWCDVDVVEF 179 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SY+GAPAATPKEQLYTELVDGLRGSD IGSGSQVASQETYGTLGAIVKSRTGNRQVGFL Sbjct: 180 SYYGAPAATPKEQLYTELVDGLRGSDPIIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 239 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD LWYG FAG NPETFVRA Sbjct: 240 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDVLWYGIFAGINPETFVRA 299 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPFA+DF+ +NVTT+VKGVGEIGDV IIDLQSPI+SLIG+QV KVGRSSGLTTGTI Sbjct: 300 DGAFIPFAEDFNMNNVTTTVKGVGEIGDVHIIDLQSPISSLIGRQVVKVGRSSGLTTGTI 359 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSL+LLTG N +KPRP+GIIWGGTA Sbjct: 360 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLVLLTGRNREKPRPVGIIWGGTA 419 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVGQPPENWTSGV L+T+ GLQ AVQ+QR+ SA DSTV Sbjct: 420 NRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNVGLQAAVQDQRNVSAAGIDSTVV 479 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 ESSP L KDK ++ F +++NI+Q+ E S V L P H ++ ED V PN Sbjct: 480 ESSPLVQTLSKDKIEENFGPINLNIQQVLAEGESQQGVTL--PIMHNEYRAEDRVKAAPN 537 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADSTI 142 +E QFIPS S +H + + NPE +NLSALRNGSDE++ SLQLG+ EPKRR+ ++ Sbjct: 538 LEHQFIPSFNGTSRVHDNNKRENPESRNLSALRNGSDEEIYVSLQLGEPEPKRRKHSDSL 597 >ref|XP_003531502.1| PREDICTED: uncharacterized protein LOC100806376 [Glycine max] Length = 602 Score = 892 bits (2306), Expect = 0.0 Identities = 456/599 (76%), Positives = 503/599 (83%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 MERA L++RG SGST SEESALDLERN C H NLPS 
SPP LQ FASAGQH ES+AAY Sbjct: 1 MERAR-LNMRGHCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SRLN AAE+RANYF NLQKGVLPE L +LP G QATTLLELMTIRAFHS+ILR Sbjct: 60 FSWP--SRLNDAAEERANYFLNLQKGVLPETLGRLPKGHQATTLLELMTIRAFHSKILRC 117 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRIRRG LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 118 YSLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 177 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP PKEQLYTE+VD LRG D CIGSGSQVASQETYGTLGAIVKS+TG+RQVGFL Sbjct: 178 SYFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL 237 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYG FAG NPETFVRA Sbjct: 238 TNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRA 297 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPFADDFD S VTTSV+GVG+IGDVKIIDLQ+PI+SLIGKQV KVGRSSGLTTG + Sbjct: 298 DGAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVV 357 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TD LVVGENQQTFDLEGDSGSLI+L G +KPRPIGIIWGGTA Sbjct: 358 LAYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDIGEKPRPIGIIWGGTA 417 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVGQPPENWTSGV L+T+ EGLQVAVQEQR+ SA STVG Sbjct: 418 NRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVG 477 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP +G LPKDK +D +E L + I+ IP+ V S+ ++ P +F +EDG++V P+ Sbjct: 478 DSSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQ-DMKPSIMETEFKLEDGINVGPS 536 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 +E QFIPS I RSPLHK Q +NLS+LRN DEDLC SLQLGD E KRRR++++ Sbjct: 537 IEHQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEAS 595 
>ref|XP_003546786.1| PREDICTED: uncharacterized protein LOC100783035 [Glycine max] Length = 602 Score = 890 bits (2299), Expect = 0.0 Identities = 455/599 (75%), Positives = 502/599 (83%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 MER + L++RGR SGST SEESALDLERN C H NLPS SPP LQ FASAGQH ES+AAY Sbjct: 1 MER-TRLNMRGRCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SRLN AAE+RANYF NLQK VLPE L +LP G QATTLLELMTIRAFHS+ILR Sbjct: 60 FSWP--SRLNDAAEERANYFLNLQKEVLPETLGRLPKGHQATTLLELMTIRAFHSKILRC 117 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRIRRG LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 118 YSLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 177 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP KEQLYTE+VD LRG D CIGSGSQVASQETYGTLGAIVKS+TG+RQVGFL Sbjct: 178 SYFGAPEPVSKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL 237 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYG FAG NPETFVRA Sbjct: 238 TNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRA 297 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPFADDFD S VTTSV+GVG+IGDVKIIDLQ+PI+SLIGKQV KVGRSSGLTTG + Sbjct: 298 DGAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVV 357 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TD LVVGENQQTFDLEGDSGSLI+L G N +KPRPIGIIWGGTA Sbjct: 358 LAYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDNGEKPRPIGIIWGGTA 417 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVGQPPENWTSGV L+T+ EGLQVAVQEQR+ SA STVG Sbjct: 418 NRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVG 477 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP +G LPKDK 
+D +E L + I+ IP+ V S+ ++ P +F +EDG+ V P+ Sbjct: 478 DSSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQ-DMKPSIMETEFKLEDGIKVGPS 536 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 +E QFIPS I RSPLHK Q +NLS+LRN DEDLC SLQLGD E KRRR++++ Sbjct: 537 IEHQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEAS 595 >ref|XP_007149278.1| hypothetical protein PHAVU_005G056800g [Phaseolus vulgaris] gi|561022542|gb|ESW21272.1| hypothetical protein PHAVU_005G056800g [Phaseolus vulgaris] Length = 602 Score = 885 bits (2287), Expect = 0.0 Identities = 453/599 (75%), Positives = 500/599 (83%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 MER S L++RGR SGST SEESALDLERN C H NLPS SPP LQ FASAGQH ES+AAY Sbjct: 1 MER-SRLNMRGRCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAY 59 Query: 1761 FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 FSWP SRLN AAE+RANYF NLQKGVLPE +LP G QATTLLELMTIRAFHS+ILR Sbjct: 60 FSWP--SRLNDAAEERANYFLNLQKGVLPETPGRLPKGHQATTLLELMTIRAFHSKILRC 117 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRIR+G LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 118 YSLGTAIGFRIRQGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 177 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP PKEQLYTE+VD LRG D CIGSGSQVA+QETYGTLGAIVKS+TG+RQVGFL Sbjct: 178 SYFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVANQETYGTLGAIVKSQTGSRQVGFL 237 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYG FAG NPETFVRA Sbjct: 238 TNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRA 297 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPFADDFD S VTTSV+GVG+IGDVKIIDLQ+PI+SLIGKQV KVGRSSGLTTG + Sbjct: 298 DGAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVV 357 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TD 
LVVGENQQTFDLEGDSGSLI+L G N +KPRPIGIIWGGTA Sbjct: 358 LAYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDNGEKPRPIGIIWGGTA 417 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVGQPPENWTSGV L+T+ EGLQVAVQEQR+ SA STVG Sbjct: 418 NRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAMSATVIGSTVG 477 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP EG L K+K +D +E L + I+ IPI V S+ ++ P +F +EDG+ V P+ Sbjct: 478 DSSPPEGILTKEKAEDKYEPLGLQIQSIPIGVAPSSQ-DMKPSIMETEFKLEDGIKVGPS 536 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 +E QFIPS I RSPLHK +NLS+LR DEDLC SLQLGD E KRRR++++ Sbjct: 537 IEHQFIPSFIGRSPLHKNSIHDRTAAENLSSLRTNCDEDLCVSLQLGDNEAKRRRSEAS 595 >ref|XP_002513414.1| conserved hypothetical protein [Ricinus communis] gi|223547322|gb|EEF48817.1| conserved hypothetical protein [Ricinus communis] Length = 600 Score = 882 bits (2280), Expect = 0.0 Identities = 448/600 (74%), Positives = 499/600 (83%), Gaps = 1/600 (0%) Frame = -2 Query: 1926 SSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSP-PLQAFASAGQHSESNAAYFSWP 1750 S L++R R SGST SEESALD ERN C HPNLPS SP LQ F SAGQH ES+AAYFSWP Sbjct: 4 SRLNMRARCSGSTPSEESALDAERNCCSHPNLPSLSPRTLQPFVSAGQHCESSAAYFSWP 63 Query: 1749 NSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLG 1570 S RLN A E+RANYF NLQKGVLPE L++LP G++ATTLLELMTIRAFHS+ILR YSLG Sbjct: 64 -SWRLNDAVEERANYFSNLQKGVLPETLNRLPRGQRATTLLELMTIRAFHSKILRCYSLG 122 Query: 1569 TAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFG 1390 TAIGFRI+RG LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEFSYFG Sbjct: 123 TAIGFRIQRGVLTDIPAILVFVSRKVHKQWLSPIQCLPNALEGPGGVWCDVDVVEFSYFG 182 Query: 1389 APAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRH 1210 AP TPKEQLYTE+VD LRG DLCIGSG QVASQETYGTLGAIVKS+TG RQVGFLTNRH Sbjct: 183 APEPTPKEQLYTEIVDDLRGGDLCIGSGFQVASQETYGTLGAIVKSQTGTRQVGFLTNRH 242 Query: 1209 VAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAF 1030 VAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDDLWYG FAG NPETFVRADGAF 
Sbjct: 243 VAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGMNPETFVRADGAF 302 Query: 1029 IPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYA 850 IPFADDFD S VTTSVKGVG+IGDVKIIDLQ PI SLIGKQV KVGRSSGLTTGTI+AY Sbjct: 303 IPFADDFDMSTVTTSVKGVGQIGDVKIIDLQCPIGSLIGKQVMKVGRSSGLTTGTILAYG 362 Query: 849 LEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGR 670 LEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI++ G N +KPRPIGIIWGGTANRGR Sbjct: 363 LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIMKGENGEKPRPIGIIWGGTANRGR 422 Query: 669 LKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSP 490 LKLKVGQPPENWTSGV L+T+ EGL+VA+QEQR ASA ST+G+SSP Sbjct: 423 LKLKVGQPPENWTSGVDLGRLLNLLELGLITTDEGLKVAIQEQRIASATTIGSTIGDSSP 482 Query: 489 AEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQ 310 +G LP DK + ESL + IE IP+EV G+ +NP +FH+EDG+ V P+VE Q Sbjct: 483 LDGMLPSDK---VEESLGLQIEHIPLEVELGNS-EINPRLVETNFHLEDGIMVAPSVEHQ 538 Query: 309 FIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADSTIVIDD 130 FIPS +SPLHK + +NL++LRNG +ED+C SL LGD E K+R ++++ I++ Sbjct: 539 FIPSFTRQSPLHKSNLSDKVVLENLASLRNGCNEDVCVSLHLGDNEAKKRSSNASTSIEE 598 >ref|XP_003556316.1| PREDICTED: uncharacterized protein LOC100816119 isoformX1 [Glycine max] gi|571563655|ref|XP_006605510.1| PREDICTED: uncharacterized protein LOC100816119 isoform X2 [Glycine max] Length = 598 Score = 876 bits (2263), Expect = 0.0 Identities = 447/594 (75%), Positives = 490/594 (82%) Frame = -2 Query: 1926 SSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPPLQAFASAGQHSESNAAYFSWPN 1747 + LDLR HSGSTQSEESALDLER+Y GHPN PSS PLQ FA QHSESNAAYFSWP Sbjct: 4 NQLDLRAHHSGSTQSEESALDLERSYYGHPN-PSSPSPLQPFAGGAQHSESNAAYFSWPT 62 Query: 1746 SSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGT 1567 SR N AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR+SLGT Sbjct: 63 LSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGT 122 Query: 1566 AIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGA 1387 AIGFRIR G LTDIPAILVFVARKVHRQWL+HIQCLP+ALEGPGGVWCDVDVVEFSY+GA Sbjct: 123 
AIGFRIRGGVLTDIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFSYYGA 182 Query: 1386 PAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHV 1207 PA TPKEQLYTEL DGLRGSD C+GSGSQVASQETYGTLGAIV+SR+GNR+VGFLTNRHV Sbjct: 183 PAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRSGNREVGFLTNRHV 242 Query: 1206 AVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFI 1027 AVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRADGAFI Sbjct: 243 AVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFI 302 Query: 1026 PFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYAL 847 PFA+DF+ +NV T+VKGVGEIGDV IIDLQSPINSLIG+QV KVGRSSGLTTGTIMAYAL Sbjct: 303 PFAEDFNMNNVITTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYAL 362 Query: 846 EYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRL 667 EYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTG N +KP P+GIIWGGTANRGRL Sbjct: 363 EYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPCPVGIIWGGTANRGRL 422 Query: 666 KLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPA 487 KLKVGQPPENWTSGV L+T+ E LQ AV EQR+ SA DSTVGESSP Sbjct: 423 KLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGESSPT 482 Query: 486 EGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQF 307 K+K ++ FE +NI +E V NP +FHI+ +++ PNVE QF Sbjct: 483 VPI--KEKLEESFEPFCLNIPLAQVEDEPSQRV--NPSIRPCEFHIKSEIEIAPNVEHQF 538 Query: 306 IPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 IPS +SP + + + E K+L+ LRNG DED SL LG+ E KRR+ ++ Sbjct: 539 IPSYAGKSPARQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKLSNS 592 >ref|XP_007022258.1| Trypsin family protein [Theobroma cacao] gi|508721886|gb|EOY13783.1| Trypsin family protein [Theobroma cacao] Length = 603 Score = 872 bits (2254), Expect = 0.0 Identities = 447/599 (74%), Positives = 497/599 (82%), Gaps = 1/599 (0%) Frame = -2 Query: 1938 MERASSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAY 1762 MER S L++RGR SGST SEESALD ERN C HP+LPS SP LQ FASAG H+ESNA Y Sbjct: 1 MER-SRLNMRGRCSGSTPSEESALDFERNCCCHPHLPSFSPSTLQPFASAGMHNESNAPY 59 Query: 1761 
FSWPNSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRR 1582 F WP SSRLN AAE+RANYF NLQKGVLPE L +LP G+QATTLLELMTIRAFHS+ILR Sbjct: 60 FLWPPSSRLNDAAEERANYFANLQKGVLPETLGRLPEGQQATTLLELMTIRAFHSKILRC 119 Query: 1581 YSLGTAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEF 1402 YSLGTAIGFRI++G LT+IPAILVFV+RKV +QWLS IQCLP+ALEGPGGVWCDVDVVEF Sbjct: 120 YSLGTAIGFRIKKGVLTEIPAILVFVSRKVDKQWLSPIQCLPTALEGPGGVWCDVDVVEF 179 Query: 1401 SYFGAPAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFL 1222 SYFGAP TPKEQLYTE+VD LRG D IGSGSQVA+QETYGTLGAIVKS+TG+RQVGFL Sbjct: 180 SYFGAPEPTPKEQLYTEIVDDLRGGDPHIGSGSQVANQETYGTLGAIVKSQTGSRQVGFL 239 Query: 1221 TNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRA 1042 TNRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRA Sbjct: 240 TNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRA 299 Query: 1041 DGAFIPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTI 862 DGAFIPF DDFD S VTTSVKGVGEI DVK+IDLQS I S+IGKQV KVGRSSGLT+GT+ Sbjct: 300 DGAFIPFTDDFDMSTVTTSVKGVGEISDVKVIDLQSSIGSIIGKQVMKVGRSSGLTSGTV 359 Query: 861 MAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTA 682 +AYALEYNDEKGICF TDFLVVGENQQ+FDLEGDSGSLI++ G N +K RPIGIIWGGTA Sbjct: 360 LAYALEYNDEKGICFLTDFLVVGENQQSFDLEGDSGSLIIMKGENGEKSRPIGIIWGGTA 419 Query: 681 NRGRLKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVG 502 NRGRLKLKVGQPPENWTSGV ++ + EGL+VAVQEQR+ASA STVG Sbjct: 420 NRGRLKLKVGQPPENWTSGVDLGRLLNLLQLDIIITEEGLKVAVQEQRAASAATFASTVG 479 Query: 501 ESSPAEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPN 322 +SSP +G L KDK ++ FE L I+ IP+EV S NP +FH+EDGV+ P+ Sbjct: 480 DSSPPDGVLLKDKSENKFEPLGFQIQNIPLEVDCNSP-EANPSTIKSEFHLEDGVNAGPS 538 Query: 321 VELQFIPSCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 VE QFIPS I RSPLHK S +NL++LRNG DED C SL LGD E KRRR++++ Sbjct: 539 VEHQFIPSFIGRSPLHKNFSD-KAVSENLASLRNGCDEDFCVSLHLGDNEAKRRRSEAS 596 >ref|XP_003529430.1| PREDICTED: uncharacterized protein LOC100796081 isoform X1 [Glycine max] 
gi|571467385|ref|XP_006583925.1| PREDICTED: uncharacterized protein LOC100796081 isoform X2 [Glycine max] Length = 600 Score = 872 bits (2252), Expect = 0.0 Identities = 446/592 (75%), Positives = 486/592 (82%) Frame = -2 Query: 1920 LDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPPLQAFASAGQHSESNAAYFSWPNSS 1741 LDLR HSGSTQSEESALDLER+Y GHPN PS PLQ FA QHSESNAAYFSWP S Sbjct: 6 LDLRAHHSGSTQSEESALDLERSYYGHPN-PSCPSPLQPFAGGAQHSESNAAYFSWPTLS 64 Query: 1740 RLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLGTAI 1561 R N AAEDRANYFGNLQKGVLPE L +LPTG+QATTLLELMTIRAFHS+ILRR+SLGTAI Sbjct: 65 RWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 124 Query: 1560 GFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFGAPA 1381 GFRIR G LTDIPAILVFVARKV RQWL+H+QCLP+ALEGPGGVWCDVDVVEFSY+GAPA Sbjct: 125 GFRIRGGVLTDIPAILVFVARKVRRQWLNHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 184 Query: 1380 ATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAV 1201 TPKEQLYTEL DGLRGSD C+GSGSQVASQETYGTLGAIV+SRTGNR+VGFLTNRHVAV Sbjct: 185 QTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLTNRHVAV 244 Query: 1200 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAFIPF 1021 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG FAGTNPETFVRADGAFIPF Sbjct: 245 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 304 Query: 1020 ADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYALEY 841 A+DF+ +NV T+VKGVGEI DV IIDLQSPINSLIG+QV KVGRSSGLTTGTIMAYALEY Sbjct: 305 AEDFNMNNVITTVKGVGEISDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYALEY 364 Query: 840 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGRLKL 661 NDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTG N +KPRP+GIIWGGTANRGRLKL Sbjct: 365 NDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 424 Query: 660 KVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSPAEG 481 KVGQPPENWTSGV L+T+ E LQ AV EQR+ SA DSTVGESSP Sbjct: 425 KVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGESSPTVP 484 Query: 480 ALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDVPPNVELQFIP 301 K+K ++ FE +NI 
+E V NP DFHI+ ++ PNVE QFIP Sbjct: 485 I--KEKLEESFEPFCLNIPLAQVEDEPSQRV--NPSIRPCDFHIKSEIETAPNVEHQFIP 540 Query: 300 SCIDRSPLHKYDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 S +SP + + + E K+L+ LRNG DED SL LG+ E KRR+ ++ Sbjct: 541 SYAGKSPACQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKISNS 592 >ref|XP_004488832.1| PREDICTED: uncharacterized protein LOC101500387 isoform X1 [Cicer arietinum] gi|502089160|ref|XP_004488833.1| PREDICTED: uncharacterized protein LOC101500387 isoform X2 [Cicer arietinum] Length = 603 Score = 871 bits (2251), Expect = 0.0 Identities = 451/599 (75%), Positives = 496/599 (82%), Gaps = 5/599 (0%) Frame = -2 Query: 1926 SSLDLRGRHSGSTQSEESALDLERNYCGHPNLPSSSPP-LQAFASAGQHSESNAAYFSWP 1750 S L+ R R SGST SEESALDLERN CGH NLPS SPP LQ FAS+GQH ESNAAYFSWP Sbjct: 4 SRLNSRVRCSGSTPSEESALDLERNCCGHSNLPSLSPPSLQPFASSGQHCESNAAYFSWP 63 Query: 1749 NSSRLNGAAEDRANYFGNLQKGVLPEILDQLPTGKQATTLLELMTIRAFHSQILRRYSLG 1570 SRL AAE+RANYF NLQKGVLPE L +LP G+QATTLLELMTIRAFHS+ILR YSLG Sbjct: 64 --SRLPDAAEERANYFLNLQKGVLPETLGRLPKGQQATTLLELMTIRAFHSKILRCYSLG 121 Query: 1569 TAIGFRIRRGTLTDIPAILVFVARKVHRQWLSHIQCLPSALEGPGGVWCDVDVVEFSYFG 1390 TAIGFRIRRG LTDIPAILVFV+RKVH+QWLS IQCLP+ALEGPGGVWCDVDVVEFSYFG Sbjct: 122 TAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFG 181 Query: 1389 APAATPKEQLYTELVDGLRGSDLCIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRH 1210 AP PKEQ YTE+VD LRG D CIGSGSQVASQETYGTLGAIV+S+TG+RQVGFLTNRH Sbjct: 182 APEPVPKEQHYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRH 241 Query: 1209 VAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGTFAGTNPETFVRADGAF 1030 VAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYG FAG NPETFVRADGAF Sbjct: 242 VAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAF 301 Query: 1029 IPFADDFDTSNVTTSVKGVGEIGDVKIIDLQSPINSLIGKQVKKVGRSSGLTTGTIMAYA 850 IPF DDFD VTTSV+GVG+IGDVKIIDLQSPI+SLIGKQV KVGRSSGLTTG ++AYA Sbjct: 302 IPFVDDFDMCTVTTSVRGVGDIGDVKIIDLQSPISSLIGKQVVKVGRSSGLTTGIVLAYA 361 Query: 849 LEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGHNADKPRPIGIIWGGTANRGR 670 LEYNDEKGICF 
TDFLVVGENQQTFDLEGDSGSLI+ G N +KPRPIGIIWGGTANRGR Sbjct: 362 LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKGDNGEKPRPIGIIWGGTANRGR 421 Query: 669 LKLKVGQPPENWTSGVXXXXXXXXXXXXLVTSTEGLQVAVQEQRSASAVAGDSTVGESSP 490 LKLKVG PPENWTSGV L+TS EGL+VAVQEQR+ASA STVG+SS Sbjct: 422 LKLKVGLPPENWTSGVDLGRLLNLLELDLITSDEGLRVAVQEQRAASATVMGSTVGDSST 481 Query: 489 AEGALPKDKKKDIFESLDINIEQIPIEVGSGSEVNLNPPFTHVDFHIEDGVDV-PPNVEL 313 +G LPKD+ +D FE L + I+ IP+ V S+ P +F +EDG+ V P++E Sbjct: 482 PDGMLPKDRAEDKFEPLGLQIQSIPLGVEPSSQ-ETKPSIMETEFKLEDGIKVGGPSIEH 540 Query: 312 QFIPSCIDRSPLHK---YDSQGNPEFKNLSALRNGSDEDLCFSLQLGDREPKRRRADST 145 QFIPS I RSP+HK +D E NLS+LRNG DEDLC SLQLGD E KRRR++++ Sbjct: 541 QFIPSFIGRSPMHKNTVHDRDAAAE--NLSSLRNGCDEDLCVSLQLGDNEAKRRRSEAS 597