BLASTX nr result
ID: Catharanthus22_contig00005092
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00005092
         (1863 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   515   e-143
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   511   e-142
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...   472   e-130
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     464   e-128
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   463   e-127
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   463   e-127
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   458   e-126
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   454   e-125
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   453   e-124
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   447   e-123
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   442   e-121
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...   441   e-121
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   402   e-109
ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutr...   393   e-106
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   392   e-106
ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798...   390   e-106
gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus...   388   e-105
gb|AFK46430.1| unknown [Medicago truncatula]                          388   e-105
ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806...
382 e-103 ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818... 381 e-103 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 515 bits (1326), Expect = e-143 Identities = 283/455 (62%), Positives = 321/455 (70%), Gaps = 4/455 (0%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 MSSV N+ ESRVQPST QKRRW SCWSLYWCFGS+K+SKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE APG PV +N NH L SDPPSATQSPA GLLSL S S+ Sbjct: 61 PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPA-GLLSLKSLSI 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGGTASIFAIGPY HETQLV+PPVFS FTTEPSTA FTPPPE V +TTP SPEVPF Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPF 179 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQSPGSPGSHLISPASAISNSGTSSPFLDK 1225 AQLLTSSLARNRR+SG N +FPLSQY+F PYQ PGSPGS+LISP S +SNSGTSSPF K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 Query: 1226 LPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 1402 PI+EFR GE PKFLGYEHF T KWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1403 TQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVS 1579 T +TPNGGEP ++SYLLE QISEVASLA+S+ SE E ++D RVS Sbjct: 300 T--------------VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVS 345 Query: 1580 FELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNE 1753 FELTGE V + E V +H P+ ++ N +S S +E E G + Sbjct: 346 FELTGEDVPSCREKEPVMSHSQQTLPMDV-SNLLANEMKSGSSMAE----EKTYG---SP 397 Query: 1754 EKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 KA E C +KH +++ GSSKDF+FD++K E+ Sbjct: 398 RKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEV 432 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 511 bits 
(1315), Expect = e-142 Identities = 278/455 (61%), Positives = 318/455 (69%), Gaps = 4/455 (0%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 MSSV N+ ESRVQPST QKRRW SCWSLYWCFGS+K+SKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE APG PV +N NH L SDPPSATQSPA GLLSL + S+ Sbjct: 61 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPA-GLLSLKALSI 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGGTASIFAIGPY HETQLV+PPVFS FTTEPSTA FTPPPE V +TTP SPEVPF Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQSPGSPGSHLISPASAISNSGTSSPFLDK 1225 AQLLTSSLARNRR+SG N +FPLSQY+F PYQ PGSPGS+LISP S +SNSGTSSPF K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 Query: 1226 LPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 1402 PI+EFR GE PKFLGYEHF T KWGSRVGSGS+TPSGWGSRLGSGTLTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1403 TQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVS 1579 T +TPNGGEP ++SYLLE+QISEVASLA+S+ SE E ++D RVS Sbjct: 300 T--------------VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVS 345 Query: 1580 FELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNE 1753 FELT E V + E V +H P+ + N S + + E G + Sbjct: 346 FELTEEDVPSCREKEPVMSHSQPTLPMDVS-----NLLASEMRSGSSMAEEKTYG---SP 397 Query: 1754 EKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 KA E C +KH +++ GSSKDF+FD++K E+ Sbjct: 398 RKASESGEDECHRKHRNITFGSSKDFDFDNVKIEV 432 >gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 472 bits (1215), Expect = e-130 Identities = 269/464 (57%), Positives = 317/464 (68%), Gaps = 12/464 (2%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 M SV++S E+R QP+T KRRW SCWSLYWCFG +KN KRIGHAVLV 
Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE PGA DN L SDPPSATQSPA G LSL S S Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPA-GFLSLKSLSA 118 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGG ASIF+IGPY +ETQLV+PPVFS F TEPSTA FTPPPESVQLTTPSSPEVPF Sbjct: 119 NAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPF 178 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLD 1222 AQLLTSSL RNRR+SG N +F LS Y+FQPYQ PGSPG +LISP SA+SNSGTSSPF D Sbjct: 179 AQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPD 238 Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGGI--S 1387 + P++EFRMGEAPK G++HF T KWGSR+GSGSLTP G GSRLGSG+LTP+G S Sbjct: 239 RHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGS 298 Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKS-ENEEE 1558 RLGSG TPNG GI SRLGSG LTP+G P ++S+LLE+QISEVASLA+SE + E Sbjct: 299 RLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVET 358 Query: 1559 LLDPRVSFELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYC 1732 + D RVSFELTGE V ++AV ++ T A + P+ ++S S N C Sbjct: 359 VFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSV 418 Query: 1733 NGQIVNEEKALEGEGK-HCIKKHHSVSLGSSKDFNFDSMKQELP 1861 + + GEG+ +KH S++LGS+KDFNFD+ K E+P Sbjct: 419 EESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVP 462 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 464 bits (1193), Expect = e-128 Identities = 269/485 (55%), Positives = 324/485 (66%), Gaps = 34/485 (7%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 M +V+NS E+R QP+ KRRW SCWSLYWCFGS+KNSKRIGHAVLV Sbjct: 1 MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE PGA AP +N LQSDPPSATQSPA GLLSLTS S+ Sbjct: 61 
PEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSI 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGG SIFAIGPY +ETQLV+PPVFS FTTEPSTA FTPPPESVQLTTPSSPEVPF Sbjct: 120 NAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179 Query: 1046 AQLLTSSLARNRRH-SGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFL 1219 AQLLTSSL R RR+ SG N +F LS +FQPYQ PGSPG +LISP S +SNSGTSSPF Sbjct: 180 AQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFP 239 Query: 1220 DKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS------------------GWG 1342 DK PI+ FRMGEAP+ LG+EHF T+KWGSR+GSGSLTP G G Sbjct: 240 DKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLG 299 Query: 1343 SRLGSGTLTPNG-GI-SRLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQIS 1510 SRLGSG+LTP+G G+ SRLGSG TPNG G+ SRLGSG+LTP+G V +S+LLE+QIS Sbjct: 300 SRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQIS 359 Query: 1511 EVASLAHSEKS-ENEEELLDPRVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAP 1678 EVASLA+S+ +N+ ++D RVSFELTGE V L A + T E + + + P Sbjct: 360 EVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECP 419 Query: 1679 N-----NCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDS 1843 + ++ ++ C E + + + EGE H +KH S++LGS K+FNFD+ Sbjct: 420 TKKDGISANNVDSPNDQSCVEETSNK-TPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 478 Query: 1844 MKQEL 1858 K ++ Sbjct: 479 TKADV 483 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 463 bits (1192), Expect = e-127 Identities = 264/441 (59%), Positives = 313/441 (70%), Gaps = 13/441 (2%) Frame = +2 Query: 569 ESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLVPESTAPGATAPVADNVNHXXX 748 ESRVQP+T QKRRW CWSLYWCFGS+K +KRIGHAVL PE GA A+N + Sbjct: 36 ESRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTA 94 Query: 749 XXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYDHET 928 LQSDPPSATQSPA GLLSLTS S+N++SPGG ASIFAIGPY HET Sbjct: 95 
ITVPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSVNAYSPGGPASIFAIGPYAHET 153 Query: 929 QLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNLRF 1108 QLVTPP FSAFTTEPSTA FTPPPESVQLTTPSSPEVPFAQLLTSSL R RR+SG N +F Sbjct: 154 QLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKF 213 Query: 1109 PLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYEHF 1285 LS Y+FQ Y PGSPG LISP S ISNSGTSSPF D+ PI+EFRMGEAPK LG+EHF Sbjct: 214 ALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHF 273 Query: 1286 -TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPN--GGISRLGSGTQTPNG-GI-SRLGSG 1444 T KWGSR+GSG++TP G GSRLGSGT+TP+ G SRLGSGT TP+G G+ S LGSG Sbjct: 274 TTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSG 333 Query: 1445 SLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFDEA 1621 SLTP+ P ++ + LE+QISEVASLA+SE S+ +E ++D RVSFEL+GE V + E+ Sbjct: 334 SLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLES 393 Query: 1622 VTAHD----TVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKHCI 1789 + + P + DQ + M EN +G+ E+ + E E +HC Sbjct: 394 KSLASCRAFSECPPDSMAEDQIKSG--KMLMTDENLPTGETSGE-TPEKPSGEMEEEHCY 450 Query: 1790 KKHHSVSLGSSKDFNFDSMKQ 1852 +KH S++LGS K+FNFD+ K+ Sbjct: 451 RKHRSITLGSIKEFNFDNSKE 471 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 463 bits (1192), Expect = e-127 Identities = 267/448 (59%), Positives = 312/448 (69%), Gaps = 18/448 (4%) Frame = +2 Query: 569 ESRVQPSTN--QKRRWASCWSLYWCFGSY---KNSKRIGHAVLVPESTAPGATAPVADNV 733 ESRVQPS++ QKRRW CWSLYWCFGS+ KNSKRIGHAVLVPE PGA + +N Sbjct: 23 ESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQ 82 Query: 734 NHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGP 913 LQSDPPS+TQSPA GLLSLTS S N++SP G ASIFAIGP Sbjct: 83 TQSTPILLPFIAPPSSPASFLQSDPPSSTQSPA-GLLSLTSLSANAYSPRGPASIFAIGP 141 Query: 914 YDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSG 1093 Y HETQLVTPPVFSAFTTEPSTA 
FTPPPESVQLTTPSSPEVPFAQLLTSSL R RR+SG Sbjct: 142 YAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSG 201 Query: 1094 PNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFL 1270 PN +F LS Y+FQ Y PGSPG +ISP SAISNSGTSSPF D+ P++EFRMGEAPK L Sbjct: 202 PNQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLL 261 Query: 1271 GYEHF-TYKWGSRVGSGSL----TPSGWG-SRLGSGTLTPNG-GISRLGSGTQTPNGG-- 1423 G+EHF T KWGSR+GSGSL TP G G SRLGSGT+TP+G G+SRL SGT TP+G Sbjct: 262 GFEHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGL 321 Query: 1424 ISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEH 1600 SRLGSG+LTP+ P Q +LLE+QISEVASL +SE S+ EE ++ RVSFEL+GE Sbjct: 322 RSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEE 381 Query: 1601 VLKFDE--AVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGE 1774 V + E +V + T E + P ++ E C + E+ + E E Sbjct: 382 VARCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQNGEASSEMPEKNSEETE 441 Query: 1775 GKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 H +KH S++LGS K+FNFD+ K E+ Sbjct: 442 EDHVYRKHRSITLGSIKEFNFDNSKGEV 469 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 458 bits (1178), Expect = e-126 Identities = 258/465 (55%), Positives = 321/465 (69%), Gaps = 14/465 (3%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 MSSVH+S ESR++P+ QKRRW SCWSLYWCFGS+K SKRI HAVLV Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE GA AP A+ H LQSDPPSATQSPA GLLSL S S+ Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPA-GLLSLNSLSV 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGG AS+FAIGPY HETQLVTPPVFSAFTTEPSTA TPPPESVQLTTPSSPEVPF Sbjct: 120 NAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPF 179 Query: 1046 
AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222 AQLLTSSL R RR+SG N + LS Y +QPYQ PGSPG LISP S +S SGTSSPF D Sbjct: 180 AQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPD 239 Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNG-GI-S 1387 + PI++F APK LG+EHF T KWGSR+GSGS+TP G GSR+GSG+LTP+G G+ S Sbjct: 240 RHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGS 299 Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEE 1558 RLGSGT TP+G G+ SRLGSGSLTP+G P ++ ++ E+QISEVASLA+S+ ++++E Sbjct: 300 RLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEH 359 Query: 1559 LLDPRVSFELTGEHVLKFDEAVTAHDTVLEP-----VTTNADQAPNNCQSMSKKSENCCC 1723 ++D RVSFEL+GE V + +A + P + + + + S+ C Sbjct: 360 IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419 Query: 1724 EYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 E + + + E+ +GE ++C +KH S++LGS K+FNFD+ + E+ Sbjct: 420 EESSNR-MPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEV 463 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 454 bits (1167), Expect = e-125 Identities = 256/465 (55%), Positives = 320/465 (68%), Gaps = 14/465 (3%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 MSSVH+S ESR++P+ QKRRW SCWSLYWCFGS+K SKRI HAVL+ Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE GA AP A+ H LQSDP SATQSPA GLLSL S S+ Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPA-GLLSLNSLSV 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SPGG AS+FAIGPY HETQLVTPPVFSAFTTEPSTA TPPPESVQLTTPSSPEVPF Sbjct: 120 NAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPF 179 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222 AQLLTSSL R RR+SG N + LS Y +QPYQ PGSPG LISP S +S SGTSSPF D Sbjct: 180 
AQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPD 239 Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNG-GI-S 1387 + PI++F APK LG+EHF T KWGSR+GSGS+TP G GSR+GSG+LTP+G G+ S Sbjct: 240 RHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGS 299 Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEE 1558 RLGSGT TP+G G+ SRLGSGSLTP+G P ++ ++ E+QISEVASLA+S+ ++++E Sbjct: 300 RLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEH 359 Query: 1559 LLDPRVSFELTGEHVLKFDEAVTAHDTVLEP-----VTTNADQAPNNCQSMSKKSENCCC 1723 ++D RVSFEL+GE V + +A + P + + + + S+ C Sbjct: 360 IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419 Query: 1724 EYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 E + + + E+ +GE ++C +KH S++LGS K+FNFD+ + E+ Sbjct: 420 EESSNR-MPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEV 463 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 453 bits (1166), Expect = e-124 Identities = 266/461 (57%), Positives = 314/461 (68%), Gaps = 11/461 (2%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 M SV++S +SRVQP+T QK+RW SCW LYWCFGS KNSKRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE PGA+ A+NV++ LQSDPPSATQSPA GLLSLTS S+ Sbjct: 61 PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSV 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SP G ASIFAIGPY HETQLVTPPVFSA TTEPSTA FTPPPESVQLTTPSSPEVPF Sbjct: 120 NAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPF 179 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222 AQLLTSSL R RR+SG N +F LS Y+FQ YQ PGSPG +LISP SAISNSGTSSPF D Sbjct: 180 AQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPD 239 Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGGISRL 1393 + PI+EFRMGEAPK LG+E+F T 
KWGSR+GSGSLTP G GSRLGSG++TP+G + Sbjct: 240 RRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG----M 295 Query: 1394 GSGTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAH-SEKSENEEELLDP 1570 G G SRLGSGSLTP+G P ++ +L+ SQISEVA LA+ + +N+E ++D Sbjct: 296 GLG--------SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDH 347 Query: 1571 RVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQ 1741 RVSFEL+GE V L+ + + P A+ + KK CE + Sbjct: 348 RVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKE--RDGIKKDLESSCELFIRE 405 Query: 1742 IVNE--EKAL-EGEGKHCIKKHHSVSLGSSKDFNFDSMKQE 1855 NE EKA E E +H +KH SV+LGS K+FNFD+ K E Sbjct: 406 TSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 446 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 447 bits (1151), Expect = e-123 Identities = 266/465 (57%), Positives = 314/465 (67%), Gaps = 15/465 (3%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQ----KRRWASCWSLYWCFGSYKNSKRIGH 673 M SV++S +SRVQP+T Q K+RW SCW LYWCFGS KNSKRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 674 AVLVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLT 853 AVLVPE PGA+ A+NV++ LQSDPPSATQSPA GLLSLT Sbjct: 61 AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPA-GLLSLT 119 Query: 854 SFSMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSP 1033 S S+N++SP G ASIFAIGPY HETQLVTPPVFSA TTEPSTA FTPPPESVQLTTPSSP Sbjct: 120 SLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSP 179 Query: 1034 EVPFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSS 1210 EVPFAQLLTSSL R RR+SG N +F LS Y+FQ YQ PGSPG +LISP SAISNSGTSS Sbjct: 180 EVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSS 239 Query: 1211 PFLDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGG 1381 PF D+ PI+EFRMGEAPK LG+E+F T KWGSR+GSGSLTP G GSRLGSG++TP+G Sbjct: 240 PFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG- 298 Query: 1382 ISRLGSGTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAH-SEKSENEEE 1558 +G G 
SRLGSGSLTP+G P ++ +L+ SQISEVA LA+ + +N+E Sbjct: 299 ---MGLG--------SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDET 347 Query: 1559 LLDPRVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729 ++D RVSFEL+GE V L+ + + P A+ + KK CE Sbjct: 348 IVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKE--RDGIKKDLESSCEL 405 Query: 1730 CNGQIVNE--EKAL-EGEGKHCIKKHHSVSLGSSKDFNFDSMKQE 1855 + NE EKA E E +H +KH SV+LGS K+FNFD+ K E Sbjct: 406 FIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 450 >ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus] Length = 497 Score = 442 bits (1137), Expect = e-121 Identities = 257/463 (55%), Positives = 316/463 (68%), Gaps = 12/463 (2%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFG--SYKNSKRIGHAV 679 M+S++NS E+RVQP+T KRRW SCWSLYWCFG S K++KRIGHAV Sbjct: 1 MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60 Query: 680 LVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859 LVPE PGA AP ++ LQS+P S TQSPA GLLSLT+ Sbjct: 61 LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPA-GLLSLTAL 119 Query: 860 SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEV 1039 S+N++SP G ASIFAIGPY ++TQLV+PPVFSAFTTEPSTA TPPPESVQLTTPSSPEV Sbjct: 120 SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 179 Query: 1040 PFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPF 1216 PFA+LLTSSL+ + G N +F LS DFQPYQ PGSPG+HLISP S ISNSGTSSPF Sbjct: 180 PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 239 Query: 1217 LDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWG--SRLGSGTLTPNG-GI 1384 DK PI+EFRM +APK LG EHF T KW SR+GSGSLTP G G SRLGSGTLTP+G G+ Sbjct: 240 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 299 Query: 1385 -SRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE 1555 SRLGSG+ TPNG SRLGSG+LTP+G Q+S LL++QISEVASLA+SE + + Sbjct: 300 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSE-TGCQN 358 Query: 1556 ELLDPRVSFELTGEHVLK--FDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729 ++ 
+ RVSFELTGE V + ++++T+ T E + N + S+++E CE+ Sbjct: 359 DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAET--CEF 416 Query: 1730 CNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 + + + GE C + +V+LGS K+FNFD K E+ Sbjct: 417 FDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEI 459 >ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus] Length = 497 Score = 441 bits (1133), Expect = e-121 Identities = 256/463 (55%), Positives = 315/463 (68%), Gaps = 12/463 (2%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFG--SYKNSKRIGHAV 679 M+S++NS E+RVQP+T KRRW SCWSLYWCFG S K++KRIGHAV Sbjct: 1 MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60 Query: 680 LVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859 LVPE PGA AP ++ LQS+P S TQSPA GLLS T+ Sbjct: 61 LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPA-GLLSFTAL 119 Query: 860 SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEV 1039 S+N++SP G ASIFAIGPY ++TQLV+PPVFSAFTTEPSTA TPPPESVQLTTPSSPEV Sbjct: 120 SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 179 Query: 1040 PFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPF 1216 PFA+LLTSSL+ + G N +F LS DFQPYQ PGSPG+HLISP S ISNSGTSSPF Sbjct: 180 PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 239 Query: 1217 LDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWG--SRLGSGTLTPNG-GI 1384 DK PI+EFRM +APK LG EHF T KW SR+GSGSLTP G G SRLGSGTLTP+G G+ Sbjct: 240 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 299 Query: 1385 -SRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE 1555 SRLGSG+ TPNG SRLGSG+LTP+G Q+S LL++QISEVASLA+SE + + Sbjct: 300 GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSE-TGCQN 358 Query: 1556 ELLDPRVSFELTGEHVLK--FDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729 ++ + RVSFELTGE V + ++++T+ T E + N + S+++E CE+ Sbjct: 359 DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAET--CEF 416 Query: 1730 CNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 + + + GE C + 
+V+LGS K+FNFD K E+ Sbjct: 417 FDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEI 459 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 402 bits (1034), Expect = e-109 Identities = 240/463 (51%), Positives = 289/463 (62%), Gaps = 12/463 (2%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 M SV+NS ESRVQP+T QKRRW SC SLYWCFGS+++SKRIGHAVLV Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 686 PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865 PE PGA AP ++N+N LQSDPPS+TQSPA G LSLT+ S+ Sbjct: 61 PEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPA-GFLSLTALSV 119 Query: 866 NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045 N++SP G AS+FAIGPY HETQLV+PPVFS F TEPSTA FTPPPESVQLTTPSSPEVPF Sbjct: 120 NAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPF 179 Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222 AQLLTSSL R+RR+SG N + LS Y+FQPYQ P SP HLISP ISNSGTSSPF D Sbjct: 180 AQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPD 236 Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGS 1399 + PIV EAPK LG+EHF T +WGSR+GSGSLTP G G Sbjct: 237 RRPIV-----EAPKLLGFEHFSTRRWGSRLGSGSLTPDGAG------------------- 272 Query: 1400 GTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRV 1576 P ++S+LLE+QISEVASLA+SE S+N E ++D RV Sbjct: 273 -----------------------PASRDSFLLENQISEVASLANSESGSQNGETVIDHRV 309 Query: 1577 SFELTGEHVLKFDE------AVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNG 1738 SFEL GE V E A T +T+ + V + +S+ +EN CCE+C G Sbjct: 310 SFELAGEDVAVCVEKKPVASAETVQNTLQDIV--EEGEIERERDGISESTEN-CCEFCVG 366 Query: 1739 QIV---NEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 + + +E+ + EGE + C KKH + GS K+FNFD+ K E+ Sbjct: 367 EALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEV 409 >ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum] gi|557114459|gb|ESQ54742.1| hypothetical protein 
EUTSA_v10025027mg [Eutrema salsugineum] Length = 489 Score = 393 bits (1010), Expect = e-106 Identities = 241/470 (51%), Positives = 299/470 (63%), Gaps = 19/470 (4%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685 M +V+NS ESRVQPS+ QK+RW SCWSLYWCFGS KN+KRIGHAVLV Sbjct: 1 MRNVNNSVDTVNAAASAIVSAESRVQPSSVQKKRWGSCWSLYWCFGSQKNNKRIGHAVLV 60 Query: 686 PESTAPGA--TAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859 PE + G+ APV ++ + LQS PPS + +P AGLLSLT Sbjct: 61 PEPVSSGSVPVAPVQNSSTNSTSIFLPFIAPPSSPASFLQSGPPSVSHTPPAGLLSLT-- 118 Query: 860 SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQL--TTPSSP 1033 +N++S AS FAIGPY HETQ VTPPV SAFTT PSTA FTPPPES Q+ TTPSSP Sbjct: 119 -VNTYSRNEPASAFAIGPYAHETQPVTPPVDSAFTTRPSTAPFTPPPESAQMASTTPSSP 177 Query: 1034 EVPFAQLLTSSLARNRRHS-GPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTS 1207 EVPFAQLLTSSL R RR+S G N +F + Y+F +Q PGSPG +LISP S ISNSGTS Sbjct: 178 EVPFAQLLTSSLERARRNSGGMNQKFSAAHYEFHSHQVFPGSPGGNLISPGSVISNSGTS 237 Query: 1208 SPFLDKLPIVEFRMGEAPKFLGYEHFT-YKWGSRVGSGSLTPSGWGSRLGSGTLTPNGG- 1381 SP+ K I+EFR+GE PKFLG+EHFT KWGSR GSGS+TP+G GSRLGSG LTP+GG Sbjct: 238 SPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGG 297 Query: 1382 -ISRLGSGTQTPNGG--ISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENE 1552 S+L SG TPNG +SR GSG++TP ES LL+ QISEVASLA+S+ + Sbjct: 298 LGSKLASGAVTPNGAEMVSRKGSGNVTP-------LESSLLDCQISEVASLANSDHGSSR 350 Query: 1553 EE----LLDPRVSFELTGEHVLKFDEA----VTAHDTVLEPVTTNADQAPNNCQSMSKKS 1708 + ++ RVSFELTGE V + + D + E N D N +++S + Sbjct: 351 HDEAVAVVSHRVSFELTGEDVARCFASKLNRAGLDDCLHE--KANGDHTDTN-EAVSPTN 407 Query: 1709 ENCCCEYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 +G + + + E E + +K S+SLGSSK+F FD+ K+E+ Sbjct: 408 R------WSGSVPGSKTSGETESEQSLKL-RSISLGSSKEFKFDNTKEEM 450 >ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum] Length = 492 Score = 392 bits (1006), Expect = e-106 Identities = 237/445 (53%), Positives = 304/445 (68%), Gaps = 15/445 (3%) Frame = +2 Query: 569 
ESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLVPESTAPGATAPVADNV-NHXX 745 ESRVQPST+ K+RW SC+SL CFGS+K+SKRIGHAVLVPE AP PVA + N Sbjct: 23 ESRVQPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAP--IVPVAHSAPNPST 80 Query: 746 XXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNS-FSPGGTASIFAIGPYDH 922 LQSDPPS+T SPAAGLLS S+N+ +S G+ASIF IGPY + Sbjct: 81 VIVMPFIAPPSSPASFLQSDPPSSTHSPAAGLLSP---SVNAAYSSSGSASIFTIGPYAY 137 Query: 923 ETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNL 1102 ETQLV+PPVFS FTTEPSTA FTPPPESVQ+TTPSSPEVPFAQLL SSL R R+++G + Sbjct: 138 ETQLVSPPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSH- 196 Query: 1103 RFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYE 1279 +F L Y+FQPYQ PGSPG+ L+SP S IS SGTS+PF D+ +E GE PK LG+E Sbjct: 197 KFALYNYEFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFE 256 Query: 1280 HF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG--GISRLGSGTQTPN--GGISRLG 1438 HF T +W SR+GSGSLTP +G GSRLGSG+LTP+G SRLGSG TP+ G SRLG Sbjct: 257 HFSTRRWNSRIGSGSLTPDGAGQGSRLGSGSLTPDGFAHASRLGSGCTTPDGLGQDSRLG 316 Query: 1439 SGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFD 1615 SGSLTP+G P +ES +++QISE S+A+SE S++ L+D RVSFELTGE V + Sbjct: 317 SGSLTPDGAGPTTRES--MQNQISEDVSVANSEHGSQSNATLVDHRVSFELTGEDVARC- 373 Query: 1616 EAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKH 1783 +L +++++ + P + + + K++ N CC+ C+ + ++ G+ Sbjct: 374 -LANKAGALLRNMSSSSQGILAKDPIDRERILKET-NGCCDVCSRKTNDKSDNSCAGGEQ 431 Query: 1784 CIKKHHSVSLGSSKDFNFDSMKQEL 1858 C +K +SVS SSK+FNFD+ K ++ Sbjct: 432 CCQKRNSVS--SSKEFNFDNRKGDV 454 >ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798631 isoform X1 [Glycine max] Length = 504 Score = 390 bits (1003), Expect = e-106 Identities = 238/470 (50%), Positives = 302/470 (64%), Gaps = 22/470 (4%) Frame = +2 Query: 506 MSSVHNSXXXXXXXXXXXXXXESRVQPSTN-QKRRWASCWSLYWCFGSYKNSKRIGHAVL 682 M +V+N+ ESR+QP+T K+RW SCWSL WCFG +KNSKR+G+AVL Sbjct: 1 MGTVNNTVDTVNAAASAIVYAESRIQPTTTVPKKRWGSCWSLCWCFGPHKNSKRVGNAVL 60 Query: 683 
VPESTAPGATA---PVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLT 853 VPE P P N LQSDPPSATQSP GL SL+ Sbjct: 61 VPEPVEPIGPVGFHPATAAPNPSTAIVMPFIVPPSSPASFLQSDPPSATQSPV-GLFSLS 119 Query: 854 SFSMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSP 1033 S ++N+ GG ASIFAIGPY +ETQLV+PPVFS FTTEPSTA FTPPPESVQLTTPSSP Sbjct: 120 SLTVNA--SGGPASIFAIGPYTYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSP 177 Query: 1034 EVPFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSS 1210 EVPFAQLL SSL RN + +G N RF LS Y+FQPYQ PGSPG+ L+SP S IS SG+S+ Sbjct: 178 EVPFAQLLASSLDRNCKSNGTNQRFALSNYEFQPYQQYPGSPGTQLVSPRSIISTSGSST 237 Query: 1211 PFLDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNGG 1381 PF D+ P++EF GEAPK LG+E+F T+KW SR+GSGSLTP +G GSRLGSG+ TP+ Sbjct: 238 PFPDRHPVLEFHKGEAPKLLGFENFLTHKWNSRLGSGSLTPDSAGQGSRLGSGSFTPDAV 297 Query: 1382 --ISRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSE-KSE 1546 S+LGSG TP+G SR GSGSLTP+ P + + QISEV S+ +SE + + Sbjct: 298 KLASQLGSGCLTPDGLCQDSRFGSGSLTPDAVAPTARNDIDIGKQISEVTSIVNSENECQ 357 Query: 1547 NEEELLDPRVSFELTGEHVLKFDEAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSEN 1714 + L+D RVSFELTG V + A + ++L ++ ++ + P + + + K S N Sbjct: 358 PKAALVDHRVSFELTGVDVPRC-LANKSGSSLLGNMSGSSQGTLVEDPVDIEKIQKNS-N 415 Query: 1715 CCCEYCNGQIVN--EEKALE--GEG-KHCIKKHHSVSLGSSKDFNFDSMK 1849 C +C+ + N +K+ GEG + C +KHH S SSK+FNFD+ K Sbjct: 416 SSCAFCSRKTSNASNDKSCNSPGEGAEQCCRKHH--SFNSSKEFNFDNRK 463 >gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus vulgaris] Length = 500 Score = 388 bits (997), Expect = e-105 Identities = 234/444 (52%), Positives = 289/444 (65%), Gaps = 17/444 (3%) Frame = +2 Query: 569 ESRVQPSTNQKRRWASCWSLYWCFGSYKN---SKRIGHAVLVPESTAP-GATAPVADNVN 736 ESRVQP+T K+RW CWS YWCFGSYK+ SKRIGHAVLVPE AP G A A N Sbjct: 23 ESRVQPTTVPKKRWGGCWSQYWCFGSYKSTKSSKRIGHAVLVPEPVAPTGPAAAAAAPPN 82 Query: 737 HXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPY 916 +QSDPPSA QSP GLLSL+S + +++S GG AS+F IGPY Sbjct: 83 PSTAIVMPFIAPPSSPASLIQSDPPSAIQSPP-GLLSLSSLAASAYSSGGPASMFTIGPY 141 Query: 
917 DHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGP 1096 +ETQLV+PPVFS FTTEPSTA FTPPPESV TTPSSP+VPFAQLL SSL R R+ +G Sbjct: 142 AYETQLVSPPVFSNFTTEPSTAPFTPPPESVHQTTPSSPDVPFAQLLASSLDRARKSNG- 200 Query: 1097 NLRFPLSQYDFQPY-QSPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLG 1273 N +F L YDFQPY Q PGSPG LISP SA S SGTS+PF D+ P +EFR GE PK LG Sbjct: 201 NQKFALYNYDFQPYHQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFRKGETPKILG 260 Query: 1274 YEHF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG-GI-SRLGSGTQTPN--GGISR 1432 EHF T +W SR+GSGSLTP +G GSRLGSG++TP+G G+ SRLGSG TP+ G SR Sbjct: 261 VEHFSTQRWSSRLGSGSLTPDGAGQGSRLGSGSVTPDGVGLASRLGSGCATPDGLGQESR 320 Query: 1433 LGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSE-NEEELLDPRVSFELTGEHVLK 1609 LGSG LTP+G + + + +++QIS+ A+LA+S+ + L+D RVSFELTGE V + Sbjct: 321 LGSGCLTPDGVGQINENNLPVQNQISKEATLANSDNGHPSNATLIDHRVSFELTGEDVAR 380 Query: 1610 FDEAVTAHDTVLEPVTTNADQAPNNCQSMSK----KSENCCCEYCNGQIVNEEKALEGEG 1777 + VL + + Q + + + + C C + ++ GEG Sbjct: 381 ---CLANKTGVLLRNMSGSSQGILAKDPVDRERVLRDTDASCNVCTEKTDDKPYNPIGEG 437 Query: 1778 KHCIKKHHSVSLGSSKDFNFDSMK 1849 + C K +SV+ SSK+FNFDS K Sbjct: 438 EQCFHKQNSVN--SSKEFNFDSSK 459 >gb|AFK46430.1| unknown [Medicago truncatula] Length = 487 Score = 388 bits (997), Expect = e-105 Identities = 235/445 (52%), Positives = 299/445 (67%), Gaps = 15/445 (3%) Frame = +2 Query: 569 ESRVQPSTNQKRRWASCWSLYWCFGSY-KNSKRIGHAVLVPESTAPGATAPVADNV-NHX 742 ESRVQP+++ K+RW SC+SL CFGS+ K S+RIGHAVLVPE AP T PVA+ N Sbjct: 23 ESRVQPTSSPKKRWGSCFSLPSCFGSHNKTSERIGHAVLVPEPVAP--TVPVANAAPNPS 80 Query: 743 XXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYDH 922 LQSDPPS+T SPAAGLLSL+S S N++S G AS+F IGPY + Sbjct: 81 TAIVIPFIAPPSSPASFLQSDPPSSTHSPAAGLLSLSSLSANAYSTSGPASMFTIGPYAY 140 Query: 923 ETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNL 1102 ETQLV+PPVFS FT EPSTA FTPPPESV +TTPSSPEVPFAQLL SSL R R+ N Sbjct: 141 ETQLVSPPVFSNFTAEPSTANFTPPPESVLMTTPSSPEVPFAQLLASSLDRARK---SNH 197 Query: 1103 RFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYE 1279 +F L 
Y++QPYQ PGSPG+ L+SP S IS SGTS+PF D+ +E R GEAPK LG+E Sbjct: 198 KFALYNYEYQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELRKGEAPKILGFE 257 Query: 1280 HF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG--GISRLGSGTQTPN--GGISRLG 1438 HF T KW SR+GSGSLTP +G GSRLGSG+LTP+G SRLGSG TP+ G SRLG Sbjct: 258 HFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLGSGCATPDGLGQDSRLG 317 Query: 1439 SGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFD 1615 SGSLTP+G P + S +++QI S+A+S+ S+ L+D RVSFELTGE V + Sbjct: 318 SGSLTPDGVGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVDHRVSFELTGEDVARCL 377 Query: 1616 EAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKH 1783 T +L +++++ + P + + + K++ N CC+ C+G+ + G+H Sbjct: 378 ANKTG--ALLRNMSSSSQGILAKDPIDREKILKET-NSCCDVCSGKAIG--------GEH 426 Query: 1784 CIKKHHSVSLGSSKDFNFDSMKQEL 1858 C K +SVS SSK+FNFD+ K ++ Sbjct: 427 CCPKRNSVS--SSKEFNFDNRKGDV 449 >ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806399 [Glycine max] Length = 515 Score = 382 bits (982), Expect = e-103 Identities = 232/448 (51%), Positives = 289/448 (64%), Gaps = 18/448 (4%) Frame = +2 Query: 569 ESRVQPSTNQKRRWASCWSLYWCFGSYKNSK---RIGHAVLVPESTAPG--ATAPVADNV 733 ESRVQP+ K+RW CWS YWCFGS K+SK RIGHAVLVPE AP A A A Sbjct: 37 ESRVQPTDAPKKRWGGCWSQYWCFGSRKSSKSSKRIGHAVLVPEPAAPTGPAAAATAAAP 96 Query: 734 NHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGP 913 N LQSDPPS QSP GLLSL++ + N++S GG A++F IGP Sbjct: 97 NPSTAIVMPFIAPPSSPASFLQSDPPSGIQSPP-GLLSLSALAANAYSSGGPATMFTIGP 155 Query: 914 YDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSG 1093 Y +ETQLV+PPVFSAFTTEPSTA +TPPPESVQ TTPSSP+VPFAQLL SSL R R+ +G Sbjct: 156 YAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLLASSLDRARKCNG 215 Query: 1094 PNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFL 1270 + +FPL Y+F PYQ PGSPG LISP SA S SGTS+PF D+ P +EF GE PK L Sbjct: 216 -HQKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFPKGETPKIL 274 Query: 1271 GYEHF-TYKWGSRVGSGSLTP-SGW-GSRLGSGTLTPNG-GI-SRLGSGTQTPN--GGIS 1429 G EHF T +WGSR+GSGSLTP S W GSRLGSG+LTP+G G+ 
SRLGSG TP+ G S Sbjct: 275 GVEHFSTRRWGSRLGSGSLTPDSAWQGSRLGSGSLTPDGVGLASRLGSGCVTPDGLGQES 334 Query: 1430 RLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSE-NEEELLDPRVSFELTGEHVL 1606 RLGSG LTP+ P Q + +++QIS+ A+LA S+ + L+D RVSFELTGE V Sbjct: 335 RLGSGCLTPDSAGPTNQNNISVQNQISKEATLADSDNGHPSNATLVDHRVSFELTGEDVA 394 Query: 1607 KFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKK----SENCCCEYCNGQIVNEEKALEGE 1774 + + VL + + Q + ++ N C C + ++ G+ Sbjct: 395 R---CLANKTGVLLRNMSGSSQGILTKDPVDRERVQIDTNSSCNACTEKTDDKPDNPVGK 451 Query: 1775 GKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 G+ C+ K +SV+ SSK+FNFD+ K ++ Sbjct: 452 GEQCLHKQNSVN--SSKEFNFDNRKGDV 477 >ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818313 isoform X1 [Glycine max] Length = 509 Score = 381 bits (979), Expect = e-103 Identities = 231/449 (51%), Positives = 293/449 (65%), Gaps = 19/449 (4%) Frame = +2 Query: 569 ESRVQPSTNQKRRWASCWSLYWCFGSYKNSK---RIGHAVLVPESTAPGATAPVADNVNH 739 ESRVQP+ K+RW CWS YWCFGS K+SK RIGHAVLVPE AP A A N Sbjct: 36 ESRVQPTDAPKKRWGGCWSQYWCFGSCKSSKSSKRIGHAVLVPEPAAPTGPAAAAAAPNP 95 Query: 740 XXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYD 919 LQSDPPS QSP GLLSL++ + N++S GG AS+F IGPY Sbjct: 96 SAAIVMPFIAPPSSPASFLQSDPPSGIQSPP-GLLSLSALAANAYSSGGPASMFTIGPYA 154 Query: 920 HETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPN 1099 +ETQLV+PPVFSAFTTEPSTA +TPPPESVQ TTPSSP+VPFAQLL SSL R R+ +G N Sbjct: 155 YETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLLASSLDRARKSNG-N 213 Query: 1100 LRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRM--GEAPKFL 1270 +FPL Y+F PYQ PGSPG LISP SA S SGTS+PF D+ P +EF GE P+ L Sbjct: 214 HKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFPFPKGETPRIL 273 Query: 1271 GYEHF-TYKWGSRVGSGSLTPSG-W-GSRLGSGTLTPNG-GI-SRLGSGTQTPNG-GI-S 1429 G+EHF T +WGSR+GSGSLTP G W GSRLGSG+LTP+G G+ SRLGSG TP+G G+ S Sbjct: 274 GFEHFSTRRWGSRLGSGSLTPDGAWQGSRLGSGSLTPDGIGLASRLGSGCVTPDGLGLES 333 Query: 1430 RLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE-ELLDPRVSFELTGEHVL 1606 RLGSG LTP+ P+ Q + +++QIS+ A+LA ++ + L+D RVSFELTGE V Sbjct: 334 
RLGSGCLTPDSAGPINQNNISVQNQISKEATLADTDNGHSSNATLIDHRVSFELTGEDVA 393 Query: 1607 KFDEAVTAHDTVLEPVTTNADQA-----PNNCQSMSKKSENCCCEYCNGQIVNEEKALEG 1771 + + VL + + Q P + + + K ++ C + +++ Sbjct: 394 R---CLANKTGVLLRNMSGSSQGILSKDPVDRERVQKDTDTCT------EKTDDKPDNSV 444 Query: 1772 EGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858 G+ C+ K +SV+ SSK+FNFD+ K ++ Sbjct: 445 GGEQCLHKQNSVN--SSKEFNFDNRKGDV 471