BLASTX nr result
ID: Rheum21_contig00017500
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00017500
         (1780 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...  363   2e-97
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...  362   2e-97
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...  353   1e-94
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...  345   3e-92
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...  345   5e-92
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...  344   8e-92
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...  341   5e-91
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]    340   2e-90
emb|CBI34651.3| unnamed protein product [Vitis vinifera]             320   2e-84
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...  319   2e-84
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...  318   5e-84
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...  315   4e-83
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...  311   4e-82
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...  311   4e-82
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...  303   1e-79
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...  302   4e-79
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...  301   5e-79
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...  296   2e-77
ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791...
286 2e-74 ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791... 284 1e-73 >ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis] Length = 460 Score = 363 bits (931), Expect = 2e-97 Identities = 202/421 (47%), Positives = 258/421 (61%), Gaps = 6/421 (1%) Frame = -3 Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXX 1548 SQK+RW GC S WCF QKH+KRIG A+LVPEP+++ A+ A N + I F Sbjct: 38 SQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAP 97 Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368 AGLVS SI+ N++SPGGP S+FAIGPYAHETQLVSPP FS Sbjct: 98 PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157 Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXY 1191 T+TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD S G +F + Sbjct: 158 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217 Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011 PGSP G LISP S ISGSGTSSP D E AG +PD G PP++LNL L +WGS Sbjct: 218 PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277 Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDN----QESVSQRVSFELISEDVVR 843 + SG+LTPD +GF N Q + +L +N + V RVSFEL +EDVVR Sbjct: 278 RQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVR 337 Query: 842 CVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKP-QAPADAEEGQ 666 CVEK+PT L +A+ + +L++ E+ ++ + + E A ++P + P D EE Sbjct: 338 CVEKKPTTLAEAV-SESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAP 396 Query: 665 RCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQ 486 R Q+Q+S +LGS KEFNFD+ + G+ +P + +WW N+KV GK++ KNW+FFPV Q Sbjct: 397 RHQKQQSITLGSTKEFNFDSAD-GDSHEPTIASDWWANEKVVGKDS-GAIKNWAFFPVIQ 454 Query: 485 P 483 P Sbjct: 455 P 455 >ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] gi|557541785|gb|ESR52763.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] Length = 460 Score = 362 bits (930), Expect = 2e-97 Identities = 201/421 (47%), 
Positives = 258/421 (61%), Gaps = 6/421 (1%) Frame = -3 Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXX 1548 SQK+RW GC + WCF QKH+KRIG A+LVPEP+++ A+ A N + I F Sbjct: 38 SQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAP 97 Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368 AGLVS SI+ N++SPGGP S+FAIGPYAHETQLVSPP FS Sbjct: 98 PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157 Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXY 1191 T+TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD S G +F + Sbjct: 158 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217 Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011 PGSP G LISP S ISGSGTSSP D E AG +PD G PP++LNL L +WGS Sbjct: 218 PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277 Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDN----QESVSQRVSFELISEDVVR 843 + SG+LTPD +GF N Q + +L +N + V RVSFEL +EDVVR Sbjct: 278 RQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVR 337 Query: 842 CVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKP-QAPADAEEGQ 666 CVEK+PT L +A+ + +L++ E+ ++ + + E A ++P + P D EE Sbjct: 338 CVEKKPTTLAEAV-SESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAP 396 Query: 665 RCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQ 486 R Q+Q+S +LGS KEFNFD+ + G+ +P + +WW N+KV GK++ KNW+FFPV Q Sbjct: 397 RHQKQQSITLGSTKEFNFDSAD-GDSHEPTIASDWWANEKVVGKDS-GAIKNWAFFPVIQ 454 Query: 485 P 483 P Sbjct: 455 P 455 >gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 353 bits (907), Expect = 1e-94 Identities = 208/427 (48%), Positives = 254/427 (59%), Gaps = 10/427 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW S YWCF Q+HKKRIG A+LVPE + A A N PI + Sbjct: 38 QKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAEN---PIQTPSIVLPFV 94 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 F S+ A+++SP GP 
S+FAIGPYAHETQLVSPP FST Sbjct: 95 APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD +G RF YP Sbjct: 155 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP GQLISP S ISGSGTSSP D E A G ++ + R G PP++LNL +L DWGS Sbjct: 215 GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE-----SVSQRVSFELISEDVVR 843 SGS+TPDG S DGF + Q L +N+ S++ RVSFEL SE+V+R Sbjct: 275 LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334 Query: 842 CVEKEPTVLPKALQTVALED----KDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675 CVEK+P L +A+ T +LED + +++ S V S G+TS DA EK A AD E Sbjct: 335 CVEKKPVALAEAVST-SLEDTEKAQSKEDPSKVVSSSICPVGETS-NDAAEK--AVADGE 390 Query: 674 EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495 E Q +QRS +LGS+KEFNFDN + G+ +G +WW N+KV KE P KNWSFFP Sbjct: 391 EAQLHPKQRSITLGSVKEFNFDNPDGGDSGN-SIGSDWWANEKVDAKE-NGPTKNWSFFP 448 Query: 494 VAQPGAT 474 + QPG + Sbjct: 449 MMQPGVS 455 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 345 bits (886), Expect = 3e-92 Identities = 208/446 (46%), Positives = 252/446 (56%), Gaps = 29/446 (6%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW C YWCFRS K K RIG A+L PE + G A NL+ I F Sbjct: 38 QKRRWGSCWGEYWCFRSPKDK-RIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPP 96 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 +GL+S TSI ANI+SPGGP S+FAIGPYAHETQLVSPP FST Sbjct: 97 SSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFST 156 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA D + +G HRF+ YP Sbjct: 157 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYP 216 Query: 1187 
GSPAGQLISPGSVISGSGTSSPLLDHELFAAG-LYYPDARFGVPPRILNLKLLPACDWGS 1011 GSP G LISP S ISGSGTSSP D + +G + + R G PP++L L L +WGS Sbjct: 217 GSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGS 276 Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQ--------NTTDSLTDEH---------------DN 900 SGS+TPD P S DG ++ Q + DS+ D +N Sbjct: 277 RIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNN 336 Query: 899 QESVSQRVSFELISEDVVRCVEKEPTVLPKA----LQTVALEDKDEDERSSVDDKSTTMT 732 + V RVSFEL +EDVVRCVEK+ L KA LQ A + DE+ R V D S Sbjct: 337 EIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVD-SEGRV 395 Query: 731 GKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTN 552 G+T+ + EK A+ EEGQ +QRS +LGS KEFNFDN + G KP + +WW N Sbjct: 396 GETA-NNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWAN 454 Query: 551 DKVSGKEAEQPAKNWSFFPVAQPGAT 474 +KV GKE +KNWS F + QP + Sbjct: 455 EKVVGKEV-GASKNWSIFHMMQPSVS 479 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 345 bits (884), Expect = 5e-92 Identities = 196/427 (45%), Positives = 251/427 (58%), Gaps = 10/427 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW C S Y CF QKHKK+IG A+L PEPS+ GA + N + + F Sbjct: 38 QKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPP 97 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AGLVS TSI+A+++SP GP S+FAIGPYAHETQLVSPP FST Sbjct: 98 SSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSHRFMXXXXXXXXXXXYPG 1185 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA FLD S +G + + +PG Sbjct: 158 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTG--LRFPFDFQSYQFHPG 215 Query: 1184 SPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSGR 1005 SP GQLISP S ISGSGTSSP D E G ++P+ R G PP++LNL L C+WGS + Sbjct: 216 SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQ 
275 Query: 1004 ESGSLTPDGYLPKSHDGFPVNHQ----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCV 837 SG+LTP+ + + F ++ Q + + H N + V+ RVSFEL +ED RCV Sbjct: 276 GSGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 334 Query: 836 EKEPTVLPKALQ------TVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675 E++P K + T A E+K+ E + +T S E A D E Sbjct: 335 EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPE------MASTDGE 388 Query: 674 EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495 + ++Q+S +LGS+KEFNFDN ++G+ +KP NWW N V GKE E KNWSFFP Sbjct: 389 AAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGE-TTKNWSFFP 446 Query: 494 VAQPGAT 474 + Q G + Sbjct: 447 MVQSGVS 453 >gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 458 Score = 344 bits (882), Expect = 8e-92 Identities = 203/427 (47%), Positives = 254/427 (59%), Gaps = 12/427 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW GC S YWCF S K KKRIG A+L E S + A N + I F Sbjct: 38 QKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPP 97 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AGLVS TSI+A+++SPG P S+FAIGPYAHETQLVSPP FST Sbjct: 98 SSPASFLPSEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFST 156 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFS-HRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA L + G RF +P Sbjct: 157 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHP 216 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP GQLISP S ISGSGTSSP D E FAA L++P+ R G PP++LNL +C+WGS Sbjct: 217 GSPVGQLISPSSGISGSGTSSPFRDGE-FAASLHFPEFRMGDPPKLLNLDKHSSCEWGSH 275 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEH-------DNQESVSQRVSFELISEDV 849 SG+LTPD +GF ++HQ ++ + H ++Q + + RVSFEL +E+V Sbjct: 276 HGSGTLTPDATRSTPRNGFLLDHQ-ISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEV 334 Query: 848 VRCVEKEPTVLPKA----LQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPAD 681 VR +E E +A LQ A + +E + VDD + G+TS E +A AD Sbjct: 335 
VRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRV-GETSNE---RPEKALAD 390 Query: 680 AEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501 E + + +S +LGS KEFNFDNV+ G+ KP L +WW NDKV+GK P +NWSF Sbjct: 391 REGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVP-RNWSF 449 Query: 500 FPVAQPG 480 FP+ QPG Sbjct: 450 FPMMQPG 456 >ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346902|gb|ERP65330.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 452 Score = 341 bits (875), Expect = 5e-91 Identities = 194/426 (45%), Positives = 250/426 (58%), Gaps = 10/426 (2%) Frame = -3 Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXXX 1542 ++RW C S Y CF QKHKK+IG A+L PEPS+ GA + N + + F Sbjct: 38 QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97 Query: 1541 XXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFSTY 1362 AGLVS TSI+A+++SP GP S+FAIGPYAHETQLVSPP FST+ Sbjct: 98 SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157 Query: 1361 TTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSHRFMXXXXXXXXXXXYPGS 1182 TTEPSTAPFTPPPESVH+TTPSSPEVPFA FLD S +G + + +PGS Sbjct: 158 TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTG--LRFPFDFQSYQFHPGS 215 Query: 1181 PAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSGRE 1002 P GQLISP S ISGSGTSSP D E G ++P+ R G PP++LNL L C+WGS + Sbjct: 216 PVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQG 275 Query: 1001 SGSLTPDGYLPKSHDGFPVNHQ----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCVE 834 SG+LTP+ + + F ++ Q + + H N + V+ RVSFEL +ED RCVE Sbjct: 276 SGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVE 334 Query: 833 KEPTVLPKALQ------TVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEE 672 ++P K + T A E+K+ E + +T S E A D E Sbjct: 335 EKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPE------MASTDGEA 388 Query: 671 GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPV 492 + ++Q+S +LGS+KEFNFDN ++G+ +KP NWW N V GKE E KNWSFFP+ Sbjct: 389 
APQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGE-TTKNWSFFPM 446 Query: 491 AQPGAT 474 Q G + Sbjct: 447 VQSGVS 452 >gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 340 bits (871), Expect = 2e-90 Identities = 197/425 (46%), Positives = 250/425 (58%), Gaps = 8/425 (1%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 +K+RW GCLS YWCF + K++ RIG +LVPE + A A N + + F Sbjct: 40 RKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFIAPP 99 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AGL+S TS++A+++SPGGP S+FAIGPYAHETQLVSPP FST Sbjct: 100 SSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFST 159 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGF-SHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD + H+G RF P Sbjct: 160 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQP 219 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP GQLISP S ISGSGTSSP D E A G ++ + R G PP++LNL L DWGS Sbjct: 220 GSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSR 279 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKE 828 + SGSLTPD P S F V + +N +RVSF++ +EDV+R VEK+ Sbjct: 280 QGSGSLTPDSVKPIS--TFEVAPHLKPNGRCRNAEN--VADRRVSFDVSTEDVIRYVEKK 335 Query: 827 PTVLPKALQTVALEDKDEDERSSVDDKS-------TTMTGKTSQEDATEKPQAPADAEEG 669 L +A+ T +L+D +R D + G+TS E E +AP EE Sbjct: 336 TVPLAEAMLT-SLKDTTMGQREENSDSNKVEEIGCENRVGETSNE---EPDKAPTSGEEV 391 Query: 668 QRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVA 489 + Q+ RS +LGS KEFNFDN + G+ K +WW N KV+GKE P++NWSFFP+ Sbjct: 392 LQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEG-APSQNWSFFPMI 450 Query: 488 QPGAT 474 QPG + Sbjct: 451 QPGVS 455 >emb|CBI34651.3| unnamed protein product [Vitis vinifera] Length = 412 Score = 320 bits (819), Expect = 2e-84 Identities = 195/422 (46%), Positives = 231/422 (54%), Gaps = 5/422 (1%) Frame = -3 Query: 1724 
QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW C YWCFRS K K RIG A+L PE + G A NL+ I F Sbjct: 38 QKRRWGSCWGEYWCFRSPKDK-RIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPP 96 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 +GL+S TSI ANI+SPGGP S+FAIGPYAHETQLVSPP FST Sbjct: 97 SSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFST 156 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA D + +G HRF+ YP Sbjct: 157 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYP 216 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP G LISP S ISGSGTSSP +PD Sbjct: 217 GSPVGHLISPSSGISGSGTSSP------------FPD----------------------- 241 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKE 828 SGS+TPD P S DG ++H +N+ V RVSFEL +EDVVRCVEK+ Sbjct: 242 -RSGSITPDALGPPSRDGSVLDHSGCP-------NNEIMVDHRVSFELTAEDVVRCVEKD 293 Query: 827 PTVLPKA----LQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRC 660 L KA LQ A + DE+ R V D S G+T+ + EK A+ EEGQ Sbjct: 294 SAALVKAVSASLQNPATVEIDENSREVVVD-SEGRVGETA-NNPPEKAPEDANGEEGQPH 351 Query: 659 QRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQPG 480 +QRS +LGS KEFNFDN + G KP + +WW N+KV GKE +KNWS F + QP Sbjct: 352 HKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEV-GASKNWSIFHMMQPS 410 Query: 479 AT 474 + Sbjct: 411 VS 412 >ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum] Length = 443 Score = 319 bits (818), Expect = 2e-84 Identities = 191/429 (44%), Positives = 242/429 (56%), Gaps = 12/429 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGAT-PARNLSLPIPIGAHFXXX 1548 QK+RW GC S YWCF SQK KRIG A+ +PE +TA GA P+ N S P + Sbjct: 38 QKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPE--TTASGADRPSSNTSSQAP--SIVLPF 93 Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368 V ++ + +SP GP S+FAIGPYAHETQLVSPP FS Sbjct: 94 
IAPPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFS 153 Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-PHSGFSHRFMXXXXXXXXXXXY 1191 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD + + HR+ Sbjct: 154 AFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQ 213 Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011 PGSP LISPGS IS SGTSSP LD E Y P P+ LNL+ + +WGS Sbjct: 214 PGSPVSNLISPGSAISVSGTSSPFLDRE------YTPGR-----PQFLNLEKIAPHEWGS 262 Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNT-----TDSLTDEHDNQESVSQRVSFELISEDVV 846 + SG+LTP+ PK HD F +N+QN+ ++ V RVSFE+ +EDVV Sbjct: 263 RQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVV 322 Query: 845 RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQ-----APAD 681 RCVEK+PT++ + +V+L+D ERS+ ++ E + + D Sbjct: 323 RCVEKKPTMMMRT-GSVSLQD---TERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTD 378 Query: 680 AEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501 E+GQR Q+ RS +LGS KEFNFDNV+ G P K +G +WW N+KV GKE P NW Sbjct: 379 GEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE---PCNNW-I 434 Query: 500 FPVAQPGAT 474 FP+ QPG + Sbjct: 435 FPMMQPGVS 443 >ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum lycopersicum] Length = 443 Score = 318 bits (815), Expect = 5e-84 Identities = 188/431 (43%), Positives = 241/431 (55%), Gaps = 14/431 (3%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW C S YWCF SQK KRIG A+ +PE +++A P+ N S P + Sbjct: 38 QKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADR-PSSNTSSQAP--SIVLPFI 94 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 V ++ + +SP GP S+FAIGPYAHETQLVSPP FS Sbjct: 95 APPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSA 154 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-PHSGFSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPPPESVH+TTPSSPEVPFA LD + + HR+ P Sbjct: 155 FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQP 214 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 
1008 GSP LISPGS IS SGTSSP L+ E Y P P+ LNL+ + +WGS Sbjct: 215 GSPVSNLISPGSAISVSGTSSPFLERE------YTPGR-----PQFLNLEKIAPHEWGSR 263 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNT-----TDSLTDEHDNQESVSQRVSFELISEDVVR 843 + SG+LTP+ PK HD F +N+QNT ++ V RVSFE+ +EDVVR Sbjct: 264 QGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVR 323 Query: 842 CVEKEPTVLPKALQTVALEDKDED--------ERSSVDDKSTTMTGKTSQEDATEKPQAP 687 CVEK+PT++ + +V+L+D + E S+ D S + E ++ Sbjct: 324 CVEKKPTMMMRT-GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSS------ 376 Query: 686 ADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNW 507 D E+GQR Q+ RS +LGS KEFNFDNV+ G P K +G +WW N+KV GKE P NW Sbjct: 377 TDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE---PCNNW 433 Query: 506 SFFPVAQPGAT 474 FP+ QPG + Sbjct: 434 -IFPMMQPGVS 443 >ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis] gi|223549721|gb|EEF51209.1| conserved hypothetical protein [Ricinus communis] Length = 459 Score = 315 bits (807), Expect = 4e-83 Identities = 190/424 (44%), Positives = 247/424 (58%), Gaps = 10/424 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIP-IGAHFXXX 1548 QK+RW C S YWCF +H+KRIG A+LVPE S+ ++ A N + P I F Sbjct: 41 QKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAP 100 Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368 AG++S TS++A+++SP GP S+FAIGPYAHETQLVSPP FS Sbjct: 101 PSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFS 160 Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSH-RFMXXXXXXXXXXXY 1191 T+TTEPSTAPFTPPPESV +TTPSSPEVPFA L+ S +G + RF Y Sbjct: 161 TFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFY 220 Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011 PGSP GQLISP S ISGSGTSSP D E AAG + + + VPP++LNL L + GS Sbjct: 221 PGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGS 280 Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQ----RVSFELISEDVVR 843 + SG+LTPD S FP++ Q + + DN+ Q RVSF+L +ED +R 
Sbjct: 281 RQGSGTLTPDAVRATS-CSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALR 339 Query: 842 CVEKEPT----VLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675 E +P ++P++++ +K + + S + G+TS QA E Sbjct: 340 YAEPKPASPVKIMPESMKNEIAAEKVQ-KSSEIRHNFECRVGETSNGIL---EQASTGGE 395 Query: 674 EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495 + R Q+ R+ +LG+ KEFNFDN + G P KP GP+WW N GKE + AKNWSFFP Sbjct: 396 KTPRHQKHRTLTLGTFKEFNFDNAD-GVP-KPSAGPDWWDNGSDVGKE-DFTAKNWSFFP 452 Query: 494 VAQP 483 V QP Sbjct: 453 VMQP 456 >ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria vesca subsp. vesca] Length = 422 Score = 311 bits (798), Expect = 4e-82 Identities = 188/427 (44%), Positives = 250/427 (58%), Gaps = 12/427 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW+ YWCF Q+H+KRIG A+++PE +S A NL+ I F Sbjct: 2 QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 61 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 +F S++A+++SPG P S+FAIGPYAHETQLVSPP FST Sbjct: 62 SSPASFLQSEPPSAMQSPG--FNF-SLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFST 117 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPP ESVH+T PSSPEVPFA LD + G R+ YP Sbjct: 118 FTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYP 177 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP GQLISP S ISGSGTSSP LD E + G ++ + R G P++LNL +L DWGS Sbjct: 178 GSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSR 237 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE------SVSQRVSFELISEDVV 846 SGS+TPD S +GF + T + + + N S+ RVSFEL +E+VV Sbjct: 238 LCSGSVTPDAAKSTSSEGFTLK-PYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296 Query: 845 RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGK----TSQEDATEKPQAPADA 678 RCVEK+P L +A+ T +L+ ++ ER ++ + + + + D++EK DA Sbjct: 297 RCVEKKPVALAEAVST-SLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEK-AVGGDA 354 Query: 677 
EE-GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501 EE R Q++RS +LGS KEFNFDN + G+ + +WW N+KV KE + +KNWSF Sbjct: 355 EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGE-SKNWSF 413 Query: 500 FPVAQPG 480 FP+ QPG Sbjct: 414 FPMIQPG 420 >ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria vesca subsp. vesca] Length = 459 Score = 311 bits (798), Expect = 4e-82 Identities = 188/427 (44%), Positives = 250/427 (58%), Gaps = 12/427 (2%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW+ YWCF Q+H+KRIG A+++PE +S A NL+ I F Sbjct: 39 QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 98 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 +F S++A+++SPG P S+FAIGPYAHETQLVSPP FST Sbjct: 99 SSPASFLQSEPPSAMQSPG--FNF-SLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFST 154 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188 +TTEPSTAPFTPP ESVH+T PSSPEVPFA LD + G R+ YP Sbjct: 155 FTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYP 214 Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008 GSP GQLISP S ISGSGTSSP LD E + G ++ + R G P++LNL +L DWGS Sbjct: 215 GSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSR 274 Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE------SVSQRVSFELISEDVV 846 SGS+TPD S +GF + T + + + N S+ RVSFEL +E+VV Sbjct: 275 LCSGSVTPDAAKSTSSEGFTLK-PYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 333 Query: 845 RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGK----TSQEDATEKPQAPADA 678 RCVEK+P L +A+ T +L+ ++ ER ++ + + + + D++EK DA Sbjct: 334 RCVEKKPVALAEAVST-SLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEK-AVGGDA 391 Query: 677 EE-GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501 EE R Q++RS +LGS KEFNFDN + G+ + +WW N+KV KE + +KNWSF Sbjct: 392 EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGE-SKNWSF 450 Query: 500 FPVAQPG 480 FP+ QPG Sbjct: 451 FPMIQPG 457 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family 
protein isoform 1 [Theobroma cacao] Length = 485 Score = 303 bits (777), Expect = 1e-79 Identities = 191/461 (41%), Positives = 233/461 (50%), Gaps = 47/461 (10%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QKKRW C YWCF SQK+ KRIG A+LVPEP + A N+S P I F Sbjct: 31 QKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPP 90 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AGL+S TS++ N +SP GP S+FAIGPYAHETQLV+PP FS Sbjct: 91 SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSA 150 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200 TTEPSTAPFTPPPESV +TTPSSPEVPFA L S +SG + +F Sbjct: 151 LTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSY 210 Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACD 1020 YPGSP G LISPGS IS SGTSSP D + R G P++L + Sbjct: 211 QIYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTTRK 264 Query: 1019 WGSGRESGSLTPDGYL--------------------------------PKSHDGFPVNHQ 936 WGS SGSLTPDG P S DGF V Q Sbjct: 265 WGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQ 324 Query: 935 NTTDSLTDEHDN-----QESVSQRVSFELISEDVVRCVEKEPTVLPKALQT-----VALE 786 + +L N + V RVSFEL EDV C+E + + +A+ VA Sbjct: 325 ISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEG 384 Query: 785 DKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDN 606 K+ D + S + + + + EK A +AEE Q+ RS +LGSIKEFNFDN Sbjct: 385 RKERDGIKKDLESSCELFIRETSNETVEK--ASGEAEEEHSYQKHRSVTLGSIKEFNFDN 442 Query: 605 VNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483 KP + WW N+KV+GKEA +P +W+FFP+ QP Sbjct: 443 TKGEASDKPTIRSEWWANEKVAGKEA-RPGNSWTFFPMLQP 482 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 302 bits (773), Expect = 4e-79 Identities = 194/475 (40%), Positives = 237/475 (49%), Gaps = 61/475 (12%) Frame = -3 Query: 1724 
QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW GC S YWCF S K KRIG A+L PEP T A N S I F Sbjct: 45 QKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVPFIAPP 103 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AGL+S TS++ N +SPGGP S+FAIGPYAHETQLV+PP FS Sbjct: 104 SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSA 163 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200 +TTEPSTAPFTPPPESV +TTPSSPEVPFA L S +SG + +F Sbjct: 164 FTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSY 223 Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYP--DARFGVPPRILNLKLLPA 1026 YPGSP GQLISPGSVIS SGTSSP D YP + R G P++L + Sbjct: 224 PLYPGSPGGQLISPGSVISNSGTSSPFPDR--------YPILEFRMGEAPKLLGFEHFTT 275 Query: 1025 CDWGSGRES------------------------------------------------GSL 990 WGS S GSL Sbjct: 276 RKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSL 335 Query: 989 TPDGYLPKSHDGFPVNHQ-----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKEP 825 TPD P S DGF + +Q + +S ++ V RVSFEL E+V RC+E + Sbjct: 336 TPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKS 395 Query: 824 TVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQ-R 648 +A + ED+ S T T E + E P+ P+ E + C R+ R Sbjct: 396 LASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTG-ETSGETPEKPSGEMEEEHCYRKHR 454 Query: 647 SASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483 S +LGSIKEFNFDN +K P KP + WW N+ ++GKEA +PA NW+FFP+ QP Sbjct: 455 SITLGSIKEFNFDN-SKEVPDKPSINSEWWANETIAGKEA-RPANNWTFFPLLQP 507 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 301 bits (772), Expect = 5e-79 Identities = 190/460 (41%), Positives = 232/460 (50%), Gaps = 47/460 (10%) Frame = -3 Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXXX 1542 KKRW C YWCF SQK+ KRIG A+LVPEP + A N+S P I F Sbjct: 36 KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95 Query: 1541 
XXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFSTY 1362 AGL+S TS++ N +SP GP S+FAIGPYAHETQLV+PP FS Sbjct: 96 SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155 Query: 1361 TTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXXX 1197 TTEPSTAPFTPPPESV +TTPSSPEVPFA L S +SG + +F Sbjct: 156 TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215 Query: 1196 XYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDW 1017 YPGSP G LISPGS IS SGTSSP D + R G P++L + W Sbjct: 216 IYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTTRKW 269 Query: 1016 GSGRESGSLTPDGYL--------------------------------PKSHDGFPVNHQN 933 GS SGSLTPDG P S DGF V Q Sbjct: 270 GSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQI 329 Query: 932 TTDSLTDEHDN-----QESVSQRVSFELISEDVVRCVEKEPTVLPKALQT-----VALED 783 + +L N + V RVSFEL EDV C+E + + +A+ VA Sbjct: 330 SEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGR 389 Query: 782 KDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDNV 603 K+ D + S + + + + EK A +AEE Q+ RS +LGSIKEFNFDN Sbjct: 390 KERDGIKKDLESSCELFIRETSNETVEK--ASGEAEEEHSYQKHRSVTLGSIKEFNFDNT 447 Query: 602 NKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483 KP + WW N+KV+GKEA +P +W+FFP+ QP Sbjct: 448 KGEASDKPTIRSEWWANEKVAGKEA-RPGNSWTFFPMLQP 486 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 296 bits (758), Expect = 2e-77 Identities = 186/434 (42%), Positives = 230/434 (52%), Gaps = 19/434 (4%) Frame = -3 Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545 QK+RW CLS YWCF S +H KRIG A+LVPEP A + NL+L I F Sbjct: 31 QKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFIAPP 90 Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365 AG +S T+++ N +SP GP SMFAIGPYAHETQLVSPP FST Sbjct: 91 SSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFST 150 Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200 + TEPSTAPFTPPPESV 
+TTPSSPEVPFA L S +SG + + Sbjct: 151 FPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPY 210 Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGV-PPRILNLKLLPAC 1023 YP SP G LISP IS SGTSSP +PD R V P++L + Sbjct: 211 QLYPESPVGHLISP---ISNSGTSSP------------FPDRRPIVEAPKLLGFEHFSTR 255 Query: 1022 DWGSGRESGSLTPDGYLPKSHDGFPVNHQ-----NTTDSLTDEHDNQESVSQRVSFELIS 858 WGS SGSLTPDG P S D F + +Q + +S + + + + RVSFEL Sbjct: 256 RWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAG 315 Query: 857 EDVVRCVEKEPT----VLPKALQTVALEDKDEDERSSVDDKSTT---MTGKTSQEDATEK 699 EDV CVEK+P + LQ + E + E ER + + + + + A+EK Sbjct: 316 EDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEK 375 Query: 698 PQAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKP-CLGPNWWTNDKVSGKEAEQ 522 A A+ EE Q ++ GSIKEFNFDN KP +G WW N+KV GK Sbjct: 376 --ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGK-GTG 432 Query: 521 PAKNWSFFPVAQPG 480 P NW+FFP+ QPG Sbjct: 433 PQTNWTFFPLLQPG 446 >ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine max] Length = 461 Score = 286 bits (732), Expect = 2e-74 Identities = 186/434 (42%), Positives = 235/434 (54%), Gaps = 16/434 (3%) Frame = -3 Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLS-LPIP-IGAHFX 1554 +QKKRW L CF +K +KRIG A+LVPEP++ GA PA S + P I F Sbjct: 39 TQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTN--GADPAAAASSIQAPSITLPFV 96 Query: 1553 XXXXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPT 1374 G VS T ++A+I+SPGGP S+FAIGPYAHETQLVSPP Sbjct: 97 APPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPV 156 Query: 1373 FSTYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLD-LSPHSGFSHRFMXXXXXXXXXX 1197 FS STAPFTPPPESVHMTTPSSPEVPFA LD + +S RF Sbjct: 157 FSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQ 212 Query: 1196 XYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNL--KLLPAC 1023 +PGSP GQLISP S IS SGTSSPL D E A + D + PP++LNL KL Sbjct: 213 FHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCE 272 Query: 1022 
DWGSGRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE----SVSQRVSFELISE 855
+ S SGSLTPD + GF NH + ++ N S++ RVSFEL ++
Sbjct: 273 NQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRLNEISINHRVSFELSAQ 332

Query: 854 DVVRCVEKEP------TVLPKALQTVALEDKDE-DERSSVDDKSTTMTGKTSQEDATEKP 696
V++ +E +P VLPK DK+E E S++DDK Q T
Sbjct: 333 KVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLG 392

Query: 695 QAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPA 516
A ++ +S +L S KEFNFDN + G+ P + +WW N+KV+GKE E +
Sbjct: 393 GDKATTVH----EKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKERE-AS 447

Query: 515 KNWSFFPVAQPGAT 474
K+WSFFP+ QPG +
Sbjct: 448 KDWSFFPMIQPGVS 461

>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine max]
Length = 441

Score = 284 bits (726), Expect = 1e-73
Identities = 185/432 (42%), Positives = 233/432 (53%), Gaps = 16/432 (3%)
Frame = -3

Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLS-LPIP-IGAHFXXX 1548
KKRW L CF +K +KRIG A+LVPEP++ GA PA S + P I F
Sbjct: 21 KKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTN--GADPAAAASSIQAPSITLPFVAP 78

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
G VS T ++A+I+SPGGP S+FAIGPYAHETQLVSPP FS
Sbjct: 79 PSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFS 138

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLD-LSPHSGFSHRFMXXXXXXXXXXXY 1191
STAPFTPPPESVHMTTPSSPEVPFA LD + +S RF +
Sbjct: 139 A----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFH 194

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNL--KLLPACDW 1017
PGSP GQLISP S IS SGTSSPL D E A + D + PP++LNL KL +
Sbjct: 195 PGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQ 254

Query: 1016 GSGRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE----SVSQRVSFELISEDV 849
S SGSLTPD + GF NH + ++ N S++ RVSFEL ++ V
Sbjct: 255 KSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRLNEISINHRVSFELSAQKV 314

Query: 848 VRCVEKEP------TVLPKALQTVALEDKDE-DERSSVDDKSTTMTGKTSQEDATEKPQA 690
++ +E +P VLPK DK+E E S++DDK Q T
Sbjct: 315
LKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLGGD 374

Query: 689 PADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKN 510
A ++ +S +L S KEFNFDN + G+ P + +WW N+KV+GKE E +K+
Sbjct: 375 KATTVH----EKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKERE-ASKD 429

Query: 509 WSFFPVAQPGAT 474
WSFFP+ QPG +
Sbjct: 430 WSFFPMIQPGVS 441