BLASTX nr result
ID: Rehmannia25_contig00012072
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia25_contig00012072 (1895 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241... 441 e-121 gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe... 433 e-118 ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu... 418 e-114 gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] 415 e-113 ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626... 414 e-113 ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu... 411 e-112 ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr... 409 e-111 gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [... 405 e-110 ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660... 402 e-109 ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254... 401 e-109 emb|CBI34651.3| unnamed protein product [Vitis vinifera] 379 e-102 ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm... 379 e-102 ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309... 370 e-100 ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309... 363 1e-97 ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264... 359 2e-96 ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu... 353 2e-94 gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i... 345 5e-92 gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i... 342 3e-91 ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791... 325 3e-86 ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791... 322 4e-85 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 441 bits (1133), Expect = e-121 Identities = 241/452 (53%), Positives = 295/452 (65%), Gaps = 19/452 (4%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVI 1513 R P +VQKRRWGSC Y CF S K KRIGHA + PE+ P +E+ +Q P+++ Sbjct: 31 RVPQPTVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAP-GSGVPAAENLTQAPTIV 89 Query: 1512 XXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLV 1333 SATQSP+GLLS+TS++AN+YSPGGP SIFAIGPYAHETQLV Sbjct: 90 LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149 Query: 1332 SPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEF 1153 SPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L +PN RNGEAG R+ L+QYEF Sbjct: 150 SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209 Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDF-AAGYPFFLEFRTGNPPKLLDLDKI 976 QSYQL PGSPV HL PFPDRDF +G FLEFR G PPKLL LDK+ Sbjct: 210 QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKL 269 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLP------------NTGSYRLA 832 EW S GSG++TPDA+GP SRD +L+RQ SDV P + S+ L+ Sbjct: 270 SNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLS 329 Query: 831 -----NDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTA 670 N+E ++DHRVSFE+TAE+VVRCVEK K+ S++N ++ +E + Sbjct: 330 DSGCPNNEIMVDHRVSFELTAEDVVRCVEKDS-AALVKAVSASLQNPATVEIDENSREVV 388 Query: 669 NGVDHPSGETSNITSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIA 490 + GET+N EK +G+ + HHK R+ITLGS KEFNFD+ DGG+ D+P+I Sbjct: 389 VDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNI- 447 Query: 489 SSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394 SSDWW NEKVV ++ S WS F +MQ VS Sbjct: 448 SSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479 >gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 433 bits (1113), Expect = e-118 Identities = 237/435 (54%), Positives = 289/435 (66%), Gaps = 2/435 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P +VQKRRWGS S+Y CFG + KRIGHA ++PETT R +AP +E+P Q PS+ Sbjct: 31 RVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTD-RGGDAPRAENPIQTPSI 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 + SATQSP G S+T A+MYSP GP SIFAIGPYAHETQL Sbjct: 90 VLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYSPSGPTSIFAIGPYAHETQL 146 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+ RNGE GQR+PL+ YE Sbjct: 147 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYE 206 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQL PGSPV L PFPD +FAA FLEFRTG+PPKLL+LD + Sbjct: 207 FQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDIL 266 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 R+W S GSG+VTPD S D LL Q +V P + + R N++ ++HRVSF Sbjct: 267 STRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNN-RGRNNDISINHRVSF 325 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTANGVDHPSGETSNITSEK 619 E+++EEV+RCVEKKP V ++ S+E+ E + +E P K + P GETSN +EK Sbjct: 326 ELSSEEVIRCVEKKP-VALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEK 384 Query: 618 DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439 DG+ + H K R+ITLGS KEFNFD+ DGG D + SDWW NEKV A++ P Sbjct: 385 --AVADGEEAQLHPKQRSITLGSVKEFNFDNPDGG--DSGNSIGSDWWANEKVDAKENGP 440 Query: 438 SNQWSFFPLMQTGVS 394 + WSFFP+MQ GVS Sbjct: 441 TKNWSFFPMMQPGVS 455 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 418 bits (1074), Expect = e-114 Identities = 231/435 (53%), Positives = 287/435 (65%), Gaps = 2/435 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P +VQKRRWGSC S+Y CFG K K+ IGHA + PE + + AP SE+P+Q P+V Sbjct: 31 RVPQATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAP-GNGAPASENPTQAPAV 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 S TQSP GL+S+TS+SA+MYSP GP SIFAIGPYAHETQL Sbjct: 90 TLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQL 149 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+ L+P+LRNG+ G R+P ++ Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FD 206 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQ PGSPV L PFPD +FA G F EFR G PPKLL+LDK+ Sbjct: 207 FQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKL 266 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 EW S QGSGA+TP++V R + LL+RQ SDV P +G+ + V++HRVSF Sbjct: 267 STCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGH--KNGQVVNHRVSF 323 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619 E+TAE+ RCVE+KP S K+ PE VEN KEEK ++ + G TSN + E Sbjct: 324 ELTAEDASRCVEEKPAF-SIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEM 382 Query: 618 DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439 TDG+ QH K ++ITLGS KEFNFD+ D G+ +PS SS+WW N V+ ++G Sbjct: 383 --ASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS--SSNWWANGSVIGKEGET 438 Query: 438 SNQWSFFPLMQTGVS 394 + WSFFP++Q+GVS Sbjct: 439 TKNWSFFPMVQSGVS 453 >gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 415 bits (1067), Expect = e-113 Identities = 230/439 (52%), Positives = 285/439 (64%), Gaps = 6/439 (1%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P +V+KRRWG CLS+Y CFG+ K + RIGH ++PET ++AP +E+ +Q +V Sbjct: 33 RVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQP-GNSAPRAENSTQTHAV 91 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 I SATQSP GLLS+TSVSA+MYSPGGP SIFAIGPYAHETQL Sbjct: 92 ILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQL 151 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN+ NGE GQR+P+ E Sbjct: 152 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNE 211 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSY QPGSP+ L PFPD +FAA P FLEFRTG+PPKLL+LDK+ Sbjct: 212 FQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKL 271 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAP--LPNTGSYRLANDETVLDHRV 802 + +W S QGSG++TPD+V P S +VAP PN R N E V D RV Sbjct: 272 SKFDWGSRQGSGSLTPDSVKPIS---------TFEVAPHLKPNG---RCRNAENVADRRV 319 Query: 801 SFEITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKT---ANGVDHPSGETSNI 631 SF+++ E+V+R VEKK V + + +EE G ++ GETSN Sbjct: 320 SFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN- 378 Query: 630 TSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAE 451 E D T G+ QH K R+ITLGS+KEFNFD+ D G+ + S + SDWW N+KV + Sbjct: 379 -EEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHK-SDSVSDWWANQKVAGK 436 Query: 450 DGSPSNQWSFFPLMQTGVS 394 +G+PS WSFFP++Q GVS Sbjct: 437 EGAPSQNWSFFPMIQPGVS 455 >ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis] Length = 460 Score = 414 bits (1063), Expect = e-113 Identities = 226/424 (53%), Positives = 285/424 (67%), Gaps = 2/424 (0%) Frame = -3 Query: 1671 QKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXX 1495 QKRRWG C S+ CFG K KRIGHA ++PE T +R+ NA + + +Q ++ Sbjct: 39 QKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRS-NASEAVNSTQAAAISLPFVAP 97 Query: 1494 XXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1315 SATQSP GL+S+ S+S NMYSPGGP+SIFAIGPYAHETQLVSPPVFS Sbjct: 98 PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157 Query: 1314 TFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQ 1135 TFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+LR GE GQ++P + YEFQSY L Sbjct: 158 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217 Query: 1134 PGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWES 955 PGSPV +L PFPD +FA P F +F G+PPKLL+LDK+ REW S Sbjct: 218 PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277 Query: 954 CQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEV 775 QGSG +TPDAVG R+ NRQ S+VA P++ + L D+ ++DHRVSFE+T E+V Sbjct: 278 RQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSEN-GLRKDQ-IVDHRVSFELTTEDV 335 Query: 774 VRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHP-SGETSNITSEKDHIHTDG 598 VRCVEKKP ++ ES++N +++E+ A V H +GE +N K + D Sbjct: 336 VRCVEKKPTT-LAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPV--DV 392 Query: 597 DNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFF 418 + +H K ++ITLGSTKEFNFDS D G+ EP+IA SDWW NEKVV +D W+FF Sbjct: 393 EEAPRHQKQQSITLGSTKEFNFDSAD-GDSHEPTIA-SDWWANEKVVGKDSGAIKNWAFF 450 Query: 417 PLMQ 406 P++Q Sbjct: 451 PVIQ 454 >ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346902|gb|ERP65330.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 452 Score = 411 bits (1057), Expect = e-112 Identities = 230/435 (52%), Positives = 286/435 (65%), Gaps = 2/435 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P +VQ RRWGSC S+Y CFG K K+ IGHA + PE + + AP SE+P+Q P+V Sbjct: 31 RVPQATVQ-RRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAP-GNGAPASENPTQAPAV 88 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 S TQSP GL+S+TS+SA+MYSP GP SIFAIGPYAHETQL Sbjct: 89 TLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQL 148 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+ L+P+LRNG+ G R+P ++ Sbjct: 149 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FD 205 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQ PGSPV L PFPD +FA G F EFR G PPKLL+LDK+ Sbjct: 206 FQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKL 265 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 EW S QGSGA+TP++V R + LL+RQ SDV P +G+ + V++HRVSF Sbjct: 266 STCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGH--KNGQVVNHRVSF 322 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619 E+TAE+ RCVE+KP S K+ PE VEN KEEK ++ + G TSN + E Sbjct: 323 ELTAEDASRCVEEKPAF-SIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEM 381 Query: 618 DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439 TDG+ QH K ++ITLGS KEFNFD+ D G+ +PS SS+WW N V+ ++G Sbjct: 382 --ASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS--SSNWWANGSVIGKEGET 437 Query: 438 SNQWSFFPLMQTGVS 394 + WSFFP++Q+GVS Sbjct: 438 TKNWSFFPMVQSGVS 452 >ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] gi|557541785|gb|ESR52763.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] Length = 460 Score = 409 bits (1052), Expect = e-111 Identities = 224/424 (52%), Positives = 284/424 (66%), Gaps = 2/424 (0%) Frame = -3 Query: 1671 QKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXX 1495 QKRRWG C ++ CFG K KRIGHA ++PE T +R+ NA + + +Q ++ Sbjct: 39 QKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRS-NASEAVNSTQATAISLPFVAP 97 Query: 1494 XXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1315 SATQSP GL+S+ S+S NMYSPGGP+SIFAIGPYAHETQLVSPPVFS Sbjct: 98 PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157 Query: 1314 TFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQ 1135 TFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+LR GE GQ++P + YEFQSY L Sbjct: 158 TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217 Query: 1134 PGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWES 955 PGSPV +L PFPD +FA P F +F G+PPKLL+LDK+ REW S Sbjct: 218 PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277 Query: 954 CQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEV 775 QGSG +TPDAV R+ NRQ S+VA P++ + L D+ ++DHRVSFE+T E+V Sbjct: 278 RQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSEN-GLRKDQ-IVDHRVSFELTTEDV 335 Query: 774 VRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHP-SGETSNITSEKDHIHTDG 598 VRCVEKKP ++ ES++N +++E+ A V H +GE +N K + D Sbjct: 336 VRCVEKKPTT-LAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPV--DV 392 Query: 597 DNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFF 418 + +H K ++ITLGSTKEFNFDS D G+ EP+IA SDWW NEKVV +D W+FF Sbjct: 393 EEAPRHQKQQSITLGSTKEFNFDSAD-GDSHEPTIA-SDWWANEKVVGKDSGAIKNWAFF 450 Query: 417 PLMQ 406 P++Q Sbjct: 451 PVIQ 454 >gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 458 Score = 405 bits (1042), Expect = e-110 Identities = 227/435 (52%), Positives = 279/435 (64%), Gaps = 3/435 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P +VQKRRWG C S+Y CFGS K K RIG A + ET+ + A N P +E+P+Q P++ Sbjct: 31 RVPQATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGA-NVPAAENPTQAPAI 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 SATQSP GL+S+TS+SA+MYSPG P SIFAIGPYAHETQL Sbjct: 90 ALPFVAPPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQL 148 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL PNL+ GE QR+P++ YE Sbjct: 149 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYE 208 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQL PGSPV L PF D +FAA F EFR G+PPKLL+LDK Sbjct: 209 FQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASL-HFPEFRMGDPPKLLNLDKH 267 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 EW S GSG +TPDA R+ LL+ Q S++ P+ + + ND+ +HRVSF Sbjct: 268 SSCEWGSHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSF 327 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK--EEKPIKTANGVDHPSGETSNITSE 622 E+T EEVVR +E + +P A +E + EE K + + GETSN E Sbjct: 328 ELTTEEVVRSLEME--TATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNERPE 385 Query: 621 KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442 K D + + QHHK ++ITLGS KEFNFD+VDGG+ +P I +SDWW N+KV + G Sbjct: 386 K--ALADREGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKP-ILTSDWWANDKVAGKGGG 442 Query: 441 PSNQWSFFPLMQTGV 397 WSFFP+MQ GV Sbjct: 443 VPRNWSFFPMMQPGV 457 >ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum] Length = 443 Score = 402 bits (1034), Expect = e-109 Identities = 232/436 (53%), Positives = 280/436 (64%), Gaps = 3/436 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P S+QKRRWG C S+Y CFGS K TKRIGHA IPETT + AD P+S SQ PS+ Sbjct: 31 RVPQASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADR-PSSNTSSQAPSI 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 + SAT SP G +S + YSP GP SIFAIGPYAHETQL Sbjct: 90 VLPFIAPPSSPASFLPSEPPSATHSPVG---SKCLSMSTYSPSGPASIFAIGPYAHETQL 146 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFS FTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN +N AG RYP QYE Sbjct: 147 VSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYE 206 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQLQPGSPVS+L PF DR++ G P F L+L+KI Sbjct: 207 FQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGRPQF-----------LNLEKI 255 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 EW S QGSG +TP+AV P+ D+ LLN Q+S V LP + ND TV+DHRVSF Sbjct: 256 APHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFN-GWKNDLTVVDHRVSF 314 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHI--KEEKPIKTANGVDHPSGETSNITSE 622 EITAE+VVRCVEKKP + ++ S+++ E ++E + +NG DH E S E Sbjct: 315 EITAEDVVRCVEKKPTM-MMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHE 373 Query: 621 KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442 TDG++ ++ K R+ITLGS+KEFNFD+VDGG D+ +I SDWW NEKV+ ++ Sbjct: 374 GS--STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATI-GSDWWANEKVLGKE-- 428 Query: 441 PSNQWSFFPLMQTGVS 394 P N W FP+MQ GVS Sbjct: 429 PCNNW-IFPMMQPGVS 443 >ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum lycopersicum] Length = 443 Score = 401 bits (1030), Expect = e-109 Identities = 231/436 (52%), Positives = 280/436 (64%), Gaps = 3/436 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P S+QKRRWGSC S+Y CFGS K TKRIGHA IPETT + AD P+S SQ PS+ Sbjct: 31 RVPQASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADR-PSSNTSSQAPSI 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 + SAT SP G +S + YSP GP SIFAIGPYAHETQL Sbjct: 90 VLPFIAPPSSPASFLPSEPPSATHSPVG---SKCLSMSTYSPSGPASIFAIGPYAHETQL 146 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFS FTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN +N AG RYP QYE Sbjct: 147 VSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYE 206 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQLQPGSPVS+L PF +R++ G P F L+L+KI Sbjct: 207 FQSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGRPQF-----------LNLEKI 255 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 EW S QGSG +TP+AV P+ DS LLN Q++ V LP + ND TV+DHRVSF Sbjct: 256 APHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFN-GWKNDLTVVDHRVSF 314 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHI--KEEKPIKTANGVDHPSGETSNITSE 622 EITAE+VVRCVEKKP + ++ S+++ E ++E + +N DH E S E Sbjct: 315 EITAEDVVRCVEKKPTM-MMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHE 373 Query: 621 KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442 TDG++ ++ K R+ITLGS+KEFNFD+VDGG D+ +I SDWW NEKV+ ++ Sbjct: 374 GS--STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATI-GSDWWANEKVLGKE-- 428 Query: 441 PSNQWSFFPLMQTGVS 394 P N W FP+MQ GVS Sbjct: 429 PCNNW-IFPMMQPGVS 443 >emb|CBI34651.3| unnamed protein product [Vitis vinifera] Length = 412 Score = 379 bits (973), Expect = e-102 Identities = 212/434 (48%), Positives = 261/434 (60%), Gaps = 1/434 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVI 1513 R P +VQKRRWGSC Y CF S K KRIGHA + PE+ P +E+ +Q P+++ Sbjct: 31 RVPQPTVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAP-GSGVPAAENLTQAPTIV 89 Query: 1512 XXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLV 1333 SATQSP+GLLS+TS++AN+YSPGGP SIFAIGPYAHETQLV Sbjct: 90 LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149 Query: 1332 SPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEF 1153 SPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L +PN RNGEAG R+ L+QYEF Sbjct: 150 SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209 Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973 QSYQL PGSPV HL PFPDR Sbjct: 210 QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR--------------------------- 242 Query: 972 RREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFE 793 SG++TPDA+GP SRD +L+ N+E ++DHRVSFE Sbjct: 243 ---------SGSITPDALGPPSRDGSVLDHSG-------------CPNNEIMVDHRVSFE 280 Query: 792 ITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTANGVDHPSGETSNITSEKD 616 +TAE+VVRCVEK K+ S++N ++ +E + + GET+N EK Sbjct: 281 LTAEDVVRCVEKDS-AALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKA 339 Query: 615 HIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPS 436 +G+ + HHK R+ITLGS KEFNFD+ DGG+ D+P+I SSDWW NEKVV ++ S Sbjct: 340 PEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNI-SSDWWANEKVVGKEVGAS 398 Query: 435 NQWSFFPLMQTGVS 394 WS F +MQ VS Sbjct: 399 KNWSIFHMMQPSVS 412 >ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis] gi|223549721|gb|EEF51209.1| conserved hypothetical protein [Ricinus communis] Length = 459 Score = 379 bits (973), Expect = e-102 Identities = 220/435 (50%), Positives = 274/435 (62%), Gaps = 3/435 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P ++QKRRWGSC S+Y CFG ++ KRIGHA ++PE + D++ +Q P++ Sbjct: 34 RVPQATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTI 93 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 SA+QSP G+LS+TSVSA+MYSP GP SIFAIGPYAHETQL Sbjct: 94 TLPFVAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQL 153 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPP FSTFTTEPSTAPFTPPPES+ LTTPSSPEVPFA+LLEP+ RNGEAG R+P + YE Sbjct: 154 VSPPAFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYE 213 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQ PGSPV L PFPD +FAA P FLEF+ PPKLL+LDK+ Sbjct: 214 FQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKL 273 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 E S QGSG +TPDAV S S L+RQ SD+A N S D+ V D RVSF Sbjct: 274 SVHECGSRQGSGTLTPDAVRATS-CSFPLDRQCSDIA--SNRHSDNENKDDQVADLRVSF 330 Query: 795 EITAEEVVRCVEKKPVVGSP-KSAPESVEN-VEHIKEEKPIKTANGVDHPSGETSNITSE 622 +++AE+ +R E KP SP K PES++N + K +K + + + GETSN E Sbjct: 331 DLSAEDALRYAEPKP--ASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSNGILE 388 Query: 621 KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442 + T G+ +H K RT+TLG+ KEFNFD+ DG +PS A DWW N V ++ Sbjct: 389 Q--ASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG--VPKPS-AGPDWWDNGSDVGKEDF 443 Query: 441 PSNQWSFFPLMQTGV 397 + WSFFP+MQ + Sbjct: 444 TAKNWSFFPVMQPSI 458 >ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria vesca subsp. vesca] Length = 459 Score = 370 bits (951), Expect = e-100 Identities = 228/471 (48%), Positives = 283/471 (60%), Gaps = 10/471 (2%) Frame = -3 Query: 1776 LTMRRGANGTDXXXXXXXXXXXXXXXXA---RGPHDSVQKRRWGSCLSLYSCFGSNK-TK 1609 + MRRG NG D A R P +VQKRRW +Y CFG + K Sbjct: 1 MMMRRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRK 60 Query: 1608 RIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGL 1429 RIGHA ++PETT + N P +E+ +Q S++ SA QSP Sbjct: 61 RIGHAVILPETT-SPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFN 119 Query: 1428 LSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTT 1249 S+ SA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPP ES+HLT Sbjct: 120 FSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTR 175 Query: 1248 PSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXX 1069 PSSPEVPFA+LL+ N R GE GQRYPL+ YEFQSYQ PGSPV L Sbjct: 176 PSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSS 235 Query: 1068 PFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVGPRSRDSRLL 889 PF D +FA+G FLEFRTG PK+L+LD + R+W S SG+VTPDA S + L Sbjct: 236 PFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTL 295 Query: 888 NRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPV--VGSPKSAPESV 715 + L + R ND + HRVSFE++AEEVVRCVEKKPV + ++ +S Sbjct: 296 KPYTPE-GVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSA 354 Query: 714 ENVEHIKEEKP-IKTANGVDHPSGETSNITSEKDHIHTDGDNEK---QHHKTRTITLGST 547 E E +EE P + ++ + P +TSN +SEK GD E+ ++ K R+ITLGS Sbjct: 355 EKAE--REEGPNQEVSSSHECPVVDTSNDSSEK---AVGGDAEELSYRYQKERSITLGSA 409 Query: 546 KEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394 KEFNFD+ DGG+ SI S+DWW NEKVV ++ S WSFFP++Q G+S Sbjct: 410 KEFNFDNADGGDSGTSSI-STDWWANEKVVLKENGESKNWSFFPMIQPGMS 459 >ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria vesca subsp. vesca] Length = 422 Score = 363 bits (933), Expect = 1e-97 Identities = 217/434 (50%), Positives = 271/434 (62%), Gaps = 7/434 (1%) Frame = -3 Query: 1674 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 1498 +QKRRW +Y CFG + KRIGHA ++PETT + N P +E+ +Q S++ Sbjct: 1 MQKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETT-SPGHNDPRAENLTQASSIVLPFAA 59 Query: 1497 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 1318 SA QSP S+ SA+MYSPG P+SIFAIGPYAHETQLVSPPVF Sbjct: 60 PPSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVF 115 Query: 1317 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQL 1138 STFTTEPSTAPFTPP ES+HLT PSSPEVPFA+LL+ N R GE GQRYPL+ YEFQSYQ Sbjct: 116 STFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQW 175 Query: 1137 QPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWE 958 PGSPV L PF D +FA+G FLEFRTG PK+L+LD + R+W Sbjct: 176 YPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWG 235 Query: 957 SCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEE 778 S SG+VTPDA S + L + L + R ND + HRVSFE++AEE Sbjct: 236 SRLCSGSVTPDAAKSTSSEGFTLKPYTPE-GVLNARSNSRRRNDGASIGHRVSFELSAEE 294 Query: 777 VVRCVEKKPV--VGSPKSAPESVENVEHIKEEKP-IKTANGVDHPSGETSNITSEKDHIH 607 VVRCVEKKPV + ++ +S E E +EE P + ++ + P +TSN +SEK Sbjct: 295 VVRCVEKKPVALAEAVSTSLQSAEKAE--REEGPNQEVSSSHECPVVDTSNDSSEK---A 349 Query: 606 TDGDNEK---QHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPS 436 GD E+ ++ K R+ITLGS KEFNFD+ DGG+ SI S+DWW NEKVV ++ S Sbjct: 350 VGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSI-STDWWANEKVVLKENGES 408 Query: 435 NQWSFFPLMQTGVS 394 WSFFP++Q G+S Sbjct: 409 KNWSFFPMIQPGMS 422 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 359 bits (922), Expect = 2e-96 Identities = 214/443 (48%), Positives = 262/443 (59%), Gaps = 15/443 (3%) Frame = -3 Query: 1677 SVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501 +VQKRRWGSCLSLY CFGS++ +KRIGHA ++PE A AP SE+ + S++ Sbjct: 29 TVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAV-APASENLNLSTSIVLPFI 87 Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321 S+TQSP G LS+T++S N YSP GP S+FAIGPYAHETQLVSPPV Sbjct: 88 APPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPV 147 Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEF 1153 FSTF TEPSTAPFTPPPES+ LTTPSSPEVPFA+LL +L RN Q+ L+ YEF Sbjct: 148 FSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEF 207 Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973 Q YQL P SPV HL P + PF PKLL + Sbjct: 208 QPYQLYPESPVGHLIS--------------PISNSGTSSPFPDRRPIVEAPKLLGFEHFS 253 Query: 972 RREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFE 793 R W S GSG++TPD GP SRDS LL Q S+VA L N+ S N ETV+DHRVSFE Sbjct: 254 TRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSES-GSQNGETVIDHRVSFE 312 Query: 792 ITAEEVVRCVEKKPVVGSPKSAPESVEN-VEHIKEEKPIK---------TANGVDHPSGE 643 + E+V CVEKKPV ++ E+V+N ++ I EE I+ T N + GE Sbjct: 313 LAGEDVAVCVEKKPV-----ASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGE 367 Query: 642 TSNITSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEK 463 SEK +G+ E+ H K I GS KEFNFD+ G +P+I S+WWVNEK Sbjct: 368 ALKAASEK--ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEK 425 Query: 462 VVAEDGSPSNQWSFFPLMQTGVS 394 VV + P W+FFPL+Q G+S Sbjct: 426 VVGKGTGPQTNWTFFPLLQPGIS 448 >ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa] gi|222841936|gb|EEE79483.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa] Length = 441 Score = 353 bits (905), Expect = 2e-94 Identities = 214/424 (50%), Positives = 262/424 (61%), Gaps = 2/424 (0%) Frame = -3 Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516 R P VQK+RW S S+Y CFG K+KR IGHA + PE++ AP +E+ +Q P V Sbjct: 31 RVPQAMVQKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAP-GSGAPAAENSAQAPEV 89 Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336 S TQSP GL+S TS+SA+MYSP GP SIFAIGPYAHETQL Sbjct: 90 TFPFVAPPSSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQL 149 Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L++P LRNG G R+P ++ Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFP---FD 206 Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976 FQSYQ PGS V L PFPD +FA G P EFR G PKLL+LDK+ Sbjct: 207 FQSYQFHPGSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKL 264 Query: 975 VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796 REW S Q SGA+TPD+V S + LL+RQ SDVA P + + +D+ V++HR SF Sbjct: 265 STREWGSYQDSGALTPDSVRHGS-PNFLLHRQFSDVASHPRSENGH--DDDQVVNHRFSF 321 Query: 795 EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619 E++ ++ RCVE+KP S K+ PE VEN KEE+ + + SG+TSN T E Sbjct: 322 ELSVKDASRCVEEKPAC-SIKTVPEYVENGTKAKEEENYGELIQSFERRSGDTSNDTPET 380 Query: 618 DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439 TDG+ QH K + ITLGS EFNFD+ D G+ PS SS+W V P Sbjct: 381 P--STDGE-APQHRKQQPITLGSVNEFNFDNADEGDSHNPS--SSNW-----VKQPRTGP 430 Query: 438 SNQW 427 S+ W Sbjct: 431 SSLW 434 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 345 bits (884), Expect = 5e-92 Identities = 215/471 (45%), Positives = 271/471 (57%), Gaps = 43/471 (9%) Frame = -3 Query: 1677 SVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501 +VQK+RWGSC LY CFGS K +KRIGHA ++PE A + T+E+ S P +I Sbjct: 29 TVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGA-SVSTAENVSNPTGIILPFI 87 Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321 SATQSP GLLS+TS+S N YSP GP SIFAIGPYAHETQLV+PPV Sbjct: 88 APPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPV 147 Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEF 1153 FS TTEPSTAPFTPPPES+ LTTPSSPEVPFA+LL +L RN Q++ L+ YEF Sbjct: 148 FSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEF 207 Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973 QSYQ+ PGSP +L PFPDR LEFR G PKLL + Sbjct: 208 QSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP------ILEFRMGEAPKLLGFENFT 261 Query: 972 RREWESCQGSGA----------------VT----------------PDAVGPRSRDSRLL 889 R+W S GSG+ VT PD +GP SRD L+ Sbjct: 262 TRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLV 321 Query: 888 NRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVEN 709 Q S+VA L N + NDET++DHRVSFE++ E+V C+E K ++ S ++ E ++ Sbjct: 322 GSQISEVALLANPAN-GPKNDETIVDHRVSFELSGEDVAPCLESKSLLPS-RAVSEYPKD 379 Query: 708 V--EHIKEEKPIK--TANGVDHPSGETSNITSEKDHIHTDGDNEKQH--HKTRTITLGST 547 + E KE IK + + ETSN T EK G+ E++H K R++TLGS Sbjct: 380 LVAEGRKERDGIKKDLESSCELFIRETSNETVEK----ASGEAEEEHSYQKHRSVTLGSI 435 Query: 546 KEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394 KEFNFD+ G D+P+I S+WW NEKV ++ P N W+FFP++Q VS Sbjct: 436 KEFNFDNTKGEASDKPTI-RSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 342 bits (877), Expect = 3e-91 Identities = 214/470 (45%), Positives = 269/470 (57%), Gaps = 43/470 (9%) Frame = -3 Query: 1674 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 1498 V K+RWGSC LY CFGS K +KRIGHA ++PE A + T+E+ S P +I Sbjct: 34 VYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGA-SVSTAENVSNPTGIILPFIA 92 Query: 1497 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 1318 SATQSP GLLS+TS+S N YSP GP SIFAIGPYAHETQLV+PPVF Sbjct: 93 PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVF 152 Query: 1317 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEFQ 1150 S TTEPSTAPFTPPPES+ LTTPSSPEVPFA+LL +L RN Q++ L+ YEFQ Sbjct: 153 SALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQ 212 Query: 1149 SYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR 970 SYQ+ PGSP +L PFPDR LEFR G PKLL + Sbjct: 213 SYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP------ILEFRMGEAPKLLGFENFTT 266 Query: 969 REWESCQGSGA----------------VT----------------PDAVGPRSRDSRLLN 886 R+W S GSG+ VT PD +GP SRD L+ Sbjct: 267 RKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVG 326 Query: 885 RQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVENV 706 Q S+VA L N + NDET++DHRVSFE++ E+V C+E K ++ S ++ E +++ Sbjct: 327 SQISEVALLANPAN-GPKNDETIVDHRVSFELSGEDVAPCLESKSLLPS-RAVSEYPKDL 384 Query: 705 --EHIKEEKPIK--TANGVDHPSGETSNITSEKDHIHTDGDNEKQH--HKTRTITLGSTK 544 E KE IK + + ETSN T EK G+ E++H K R++TLGS K Sbjct: 385 VAEGRKERDGIKKDLESSCELFIRETSNETVEK----ASGEAEEEHSYQKHRSVTLGSIK 440 Query: 543 EFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394 EFNFD+ G D+P+I S+WW NEKV ++ P N W+FFP++Q VS Sbjct: 441 EFNFDNTKGEASDKPTI-RSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489 >ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine max] Length = 461 Score = 325 bits (834), Expect = 3e-86 Identities = 185/432 (42%), Positives = 250/432 (57%), Gaps = 4/432 (0%) Frame = -3 Query: 1677 SVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501 S QK+RWGS L CFG KT KRIGHA ++PE T AD A + Q PS+ Sbjct: 38 STQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASS-IQAPSITLPFV 96 Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321 S QSP G +S T VSA++YSPGGP SIFAIGPYAHETQLVSPPV Sbjct: 97 APPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPV 156 Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQ 1141 FS STAPFTPPPES+H+TTPSSPEVPFA+LL+PN +N E QR+ ++ Y+FQSYQ Sbjct: 157 FSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQ 212 Query: 1140 LQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR--R 967 PGSPV L P PD +F A + L+F+ +PPKLL+LD + Sbjct: 213 FHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCE 272 Query: 966 EWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEIT 787 +S GSG++TPDA ++ L N S++ P+ + RL +E ++HRVSFE++ Sbjct: 273 NQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRL--NEISINHRVSFELS 330 Query: 786 AEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHPSGETSNITSEKDHIH 607 A++V++ +E KP + + ++N +++ + +D + + Sbjct: 331 AQKVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETT 390 Query: 606 TDGDNEKQ-HHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQ 430 GD H K +++TL S KEFNFD+ DGG+ P+I +DWW NEKV ++ S Sbjct: 391 LGGDKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIV-ADWWANEKVAGKEREASKD 449 Query: 429 WSFFPLMQTGVS 394 WSFFP++Q GVS Sbjct: 450 WSFFPMIQPGVS 461 >ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine max] Length = 441 Score = 322 bits (825), Expect = 4e-85 Identities = 183/429 (42%), Positives = 248/429 (57%), Gaps = 4/429 (0%) Frame = -3 Query: 1668 KRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXXX 1492 K+RWGS L CFG KT KRIGHA ++PE T AD A + Q PS+ Sbjct: 21 KKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASS-IQAPSITLPFVAPP 79 Query: 1491 XXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFST 1312 S QSP G +S T VSA++YSPGGP SIFAIGPYAHETQLVSPPVFS Sbjct: 80 SSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFSA 139 Query: 1311 FTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQP 1132 STAPFTPPPES+H+TTPSSPEVPFA+LL+PN +N E QR+ ++ Y+FQSYQ P Sbjct: 140 ----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFHP 195 Query: 1131 GSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR--REWE 958 GSPV L P PD +F A + L+F+ +PPKLL+LD + + Sbjct: 196 GSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQK 255 Query: 957 SCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEE 778 S GSG++TPDA ++ L N S++ P+ + RL +E ++HRVSFE++A++ Sbjct: 256 SNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRL--NEISINHRVSFELSAQK 313 Query: 777 VVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHPSGETSNITSEKDHIHTDG 598 V++ +E KP + + ++N +++ + +D + + G Sbjct: 314 VLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLGG 373 Query: 597 DNEKQ-HHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSF 421 D H K +++TL S KEFNFD+ DGG+ P+I +DWW NEKV ++ S WSF Sbjct: 374 DKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIV-ADWWANEKVAGKEREASKDWSF 432 Query: 420 FPLMQTGVS 394 FP++Q GVS Sbjct: 433 FPMIQPGVS 441