BLASTX nr result
ID: Akebia24_contig00011856
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011856 (1122 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   313   6e-83
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   312   2e-82
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   312   2e-82
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   308   4e-81
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   308   4e-81
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   301   4e-79
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   290   6e-76
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   283   1e-73
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   282   2e-73
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   279   2e-72
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   279   2e-72
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   273   1e-70
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   273   1e-70
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   272   2e-70
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     270   6e-70
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   267   7e-69
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   267   7e-69
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   262   2e-67
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...
258 2e-66 gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] 245 2e-62 >ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] gi|462404864|gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 313 bits (803), Expect = 6e-83 Identities = 194/371 (52%), Positives = 229/371 (61%), Gaps = 8/371 (2%) Frame = -2 Query: 1091 RGRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSANTYSP-GPN 915 RG P EN I+ P+I LQSEP S+TQSP G SL+A+ YSP GP Sbjct: 73 RGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPT 132 Query: 914 SIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSSLD 744 SIFAIGPYAHETQL HLTTPSSPEVPFA+LL D Sbjct: 133 SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL----D 188 Query: 743 RNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHP 564 + + Q+F SHYEFQSYQLYPGSPVG LISPSS IS SGTSSPFPD EF++ GH Sbjct: 189 PHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHH 248 Query: 563 FLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLAN 384 FLEFRTG+PPKL + D LSTR W GSGS+T PD A TS D FL++ Q EV Sbjct: 249 FLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVT-PDGAKSTSSDGFLLKPQTPEVVLNPR 307 Query: 383 SNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPERDG 204 SNN +NN+I I+HRVSFEL++EE CVEK+P +A EA S T L+ T A + D Sbjct: 308 SNNRGRNNDISINHRVSFELSSEEVIRCVEKKP-VALAEAVS-TSLEDTEKA--QSKEDP 363 Query: 203 LSSEAENTC-VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR 27 + + C VGETS++ + KA DG++ H +Q+ T+GSVKEF FDN DGG S Sbjct: 364 SKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRS--ITLGSVKEFNFDNPDGGDSGN 421 Query: 26 ---SDWWANEK 3 SDWWANEK Sbjct: 422 SIGSDWWANEK 432 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 312 bits (799), Expect = 2e-82 Identities = 193/410 (47%), Positives = 230/410 (56%), Gaps = 45/410 (10%) Frame = -2 Query: 1097 
VFRGRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYS 927 V G EN P I LQS+P S+TQSP GLLSL S N YS Sbjct: 68 VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYS 127 Query: 926 P-GPNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXEH---LTTPSSPEVPFARLL 759 P GP SIFAIGPYAHETQL LTTPSSPEVPFA+LL Sbjct: 128 PRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 187 Query: 758 TSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFS 579 TSSL+R + SG QKF SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR Sbjct: 188 TSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR--- 244 Query: 578 SGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP--------------------- 462 P LEFR GE PKL F+ +TRKW GSGSLTP Sbjct: 245 ---RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGL 301 Query: 461 ----------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEE 312 PD GP SRD FLV +QISEVA LAN NG +N+E ++DHRVSFEL+ E+ Sbjct: 302 GSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGED 361 Query: 311 TPSCVEKEPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKA 141 C+E + ++ S ++V+ K +A ERDG+ + E++C + ETS+ KA Sbjct: 362 VAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 418 Query: 140 FGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEK 3 G+ ++E H Q+ T+GS+KEF FDNT G SD RS+WWANEK Sbjct: 419 SGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEK 466 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 312 bits (799), Expect = 2e-82 Identities = 193/410 (47%), Positives = 230/410 (56%), Gaps = 45/410 (10%) Frame = -2 Query: 1097 VFRGRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYS 927 V G EN P I LQS+P S+TQSP GLLSL S N YS Sbjct: 64 VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYS 123 Query: 926 P-GPNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXEH---LTTPSSPEVPFARLL 759 P GP SIFAIGPYAHETQL LTTPSSPEVPFA+LL Sbjct: 124 
PRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 758 TSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFS 579 TSSL+R + SG QKF SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR Sbjct: 184 TSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR--- 240 Query: 578 SGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP--------------------- 462 P LEFR GE PKL F+ +TRKW GSGSLTP Sbjct: 241 ---RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGL 297 Query: 461 ----------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEE 312 PD GP SRD FLV +QISEVA LAN NG +N+E ++DHRVSFEL+ E+ Sbjct: 298 GSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGED 357 Query: 311 TPSCVEKEPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKA 141 C+E + ++ S ++V+ K +A ERDG+ + E++C + ETS+ KA Sbjct: 358 VAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 414 Query: 140 FGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEK 3 G+ ++E H Q+ T+GS+KEF FDNT G SD RS+WWANEK Sbjct: 415 SGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEK 462 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 308 bits (788), Expect = 4e-81 Identities = 196/377 (51%), Positives = 222/377 (58%), Gaps = 15/377 (3%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSA---NTYSP-G 921 G PA EN +I LQS+P SSTQSP G LSL+A N YSP G Sbjct: 67 GAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSG 126 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXEH---LTTPSSPEVPFARLLTSS 750 P S+FAIGPYAHETQL LTTPSSPEVPFA+LLTSS Sbjct: 127 PASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSS 186 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 LDR+ + SG QK + S+YEFQ YQLYP SPVGHLISP IS SGTSSPFPDR Sbjct: 187 LDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR----- 238 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 390 P +E PKL F+ STR+W GSGSLTP D AGP SRDSFL+ENQISEVASL Sbjct: 239 
-PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQISEVASL 291 Query: 389 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPER 210 ANS +GSQN E VIDHRVSFEL E+ CVEK+P +AS E T D + ER Sbjct: 292 ANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGEIERER 350 Query: 209 DGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 39 DG+S EN CVGE S KA +G++E H + P GS+KEF FDNT G Sbjct: 351 DGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFDNTKGE 408 Query: 38 TSDR-----SDWWANEK 3 S + S+WW NEK Sbjct: 409 VSAKPNIIGSEWWVNEK 425 >emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] Length = 385 Score = 308 bits (788), Expect = 4e-81 Identities = 196/377 (51%), Positives = 222/377 (58%), Gaps = 15/377 (3%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSA---NTYSP-G 921 G PA EN +I LQS+P SSTQSP G LSL+A N YSP G Sbjct: 4 GAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSG 63 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXEH---LTTPSSPEVPFARLLTSS 750 P S+FAIGPYAHETQL LTTPSSPEVPFA+LLTSS Sbjct: 64 PASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSS 123 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 LDR+ + SG QK + S+YEFQ YQLYP SPVGHLISP IS SGTSSPFPDR Sbjct: 124 LDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR----- 175 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 390 P +E PKL F+ STR+W GSGSLTP D AGP SRDSFL+ENQISEVASL Sbjct: 176 -PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQISEVASL 228 Query: 389 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPER 210 ANS +GSQN E VIDHRVSFEL E+ CVEK+P +AS E T D + ER Sbjct: 229 ANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGEIERER 287 Query: 209 DGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 39 DG+S EN CVGE S KA +G++E H + P GS+KEF FDNT G Sbjct: 288 DGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFDNTKGE 345 Query: 38 TSDR-----SDWWANEK 3 S + S+WW NEK Sbjct: 346 VSAKPNIIGSEWWVNEK 362 
>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 301 bits (770), Expect = 4e-79 Identities = 195/400 (48%), Positives = 232/400 (58%), Gaps = 38/400 (9%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLS---ANTYSPG- 921 G G PA EN + PTI LQSEP S+TQSP+GLLSL+ AN YSPG Sbjct: 73 GSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGG 132 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 P SIFAIGPYAHETQL HLTTPSSPEVPFA+L Sbjct: 133 PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLF--- 189 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREF-SSG 573 D N + +F S YEFQSYQLYPGSPVGHLISPSS IS SGTSSPFPDR+F SG Sbjct: 190 -DPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSG 248 Query: 572 GHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSR-------------- 435 FLEFR G PPKL + D LS +W GSGS+T PD GP SR Sbjct: 249 SSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSIT-PDALGPPSRDGSVLDRQVSDVIH 307 Query: 434 ----DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKE------P 285 D +++ QIS+VAS + S++G NNEI++DHRVSFELTAE+ CVEK+ Sbjct: 308 PPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKA 367 Query: 284 MMASLEAKSVTPLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGD--GDDEVPH 111 + ASL+ + +D+ + V + SE VGET++N KA D G++ PH Sbjct: 368 VSASLQNPATVEIDENSREVV------VDSEGR---VGETANNPPEKAPEDANGEEGQPH 418 Query: 110 HRQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWANEK 3 H+Q+ T+GS KEF FDN DGG SD+ SDWWANEK Sbjct: 419 HKQRS--ITLGSAKEFNFDNADGGHSDKPNISSDWWANEK 456 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 290 bits (743), Expect = 6e-76 Identities = 181/390 (46%), Positives = 219/390 (56%), Gaps = 61/390 (15%) Frame = -2 Query: 992 SEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXX 825 S+P S+TQSP G LSL SAN YSPG P SIF+IGPYA+ETQL Sbjct: 98 
SDPPSATQSPAGFLSLKSLSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTA 157 Query: 824 XXXXXEH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPV 654 LTTPSSPEVPFA+LLTSSLDRN + SG QKF SHYEFQ YQ YPGSP Sbjct: 158 PFTPPPESVQLTTPSSPEVPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPG 217 Query: 653 GHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSG 474 G+LISP S +S SGTSSPFPDR HP LEFR GE PKL+ FD +TRKW GSG Sbjct: 218 GNLISPGSAVSNSGTSSPFPDR------HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSG 271 Query: 473 SLTP-----------------------------------------------PDPAGPTSR 435 SLTP PD GP SR Sbjct: 272 SLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASR 331 Query: 434 DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSV 255 DSFL+ENQISEVASLANS +G Q E V DHRVSFELT E+ C+ + + ++ ++ Sbjct: 332 DSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACCLANKAVASN---RTA 388 Query: 254 TPLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGDGDDEVPHHRQQPSLTT 84 + K + ERD LSS++ N C V E+SS + G+G+D+ +R+ S+ T Sbjct: 389 SGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEGEDQ--GYRKHRSI-T 445 Query: 83 IGSVKEFKFDNTDGGTSDR----SDWWANE 6 +GS K+F FDNT ++ S+WWAN+ Sbjct: 446 LGSTKDFNFDNTKAEVPNKPNIGSEWWANK 475 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 283 bits (723), Expect = 1e-73 Identities = 181/387 (46%), Positives = 213/387 (55%), Gaps = 57/387 (14%) Frame = -2 Query: 995 QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 828 QS+P S+TQSP GLLSL S N YSPG P SIFAIGPYAHETQL Sbjct: 111 QSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSAFTTEPST 170 Query: 827 XXXXXXEH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 657 LTTPSSPEVPFA+LLTSSL+R + SG QKF SHYEFQSY LYPGSP Sbjct: 171 APFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSYPLYPGSP 230 Query: 656 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 477 G LISP SVIS SGTSSPFPDR +P LEFR GE PKL F+ +TRKW GS Sbjct: 231 
GGQLISPGSVISNSGTSSPFPDR------YPILEFRMGEAPKLLGFEHFTTRKWGSRLGS 284 Query: 476 GSLTP-----------------------------------------------PDPAGPTS 438 G++TP PD GP S Sbjct: 285 GTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDAVGPAS 344 Query: 437 RDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKS 258 RD F +ENQISEVASLANS NGS+ +E ++DHRVSFEL+ EE C+E + +AS A S Sbjct: 345 RDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESK-SLASCRAFS 403 Query: 257 VTPLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIG 78 P D ++A + + EN GETS K G+ ++E H ++ T+G Sbjct: 404 ECPPD--SMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEE--HCYRKHRSITLG 459 Query: 77 SVKEFKFDNT---DGGTSDRSDWWANE 6 S+KEF FDN+ S S+WWANE Sbjct: 460 SIKEFNFDNSKEVPDKPSINSEWWANE 486 >ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis] Length = 460 Score = 282 bits (721), Expect = 2e-73 Identities = 177/344 (51%), Positives = 217/344 (63%), Gaps = 13/344 (3%) Frame = -2 Query: 995 QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 828 QSEP S+TQSP GL+SL S N YSPG P+SIFAIGPYAHETQL Sbjct: 106 QSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPST 165 Query: 827 XXXXXXE---HLTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 657 HLTTPSSPEVPFA+LL SL + QKF S+YEFQSY L+PGSP Sbjct: 166 APFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYHLHPGSP 221 Query: 656 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 477 VG+LISPSS IS SGTSSPFPD EF++ G F +F G+PPKL + D LS R+W QGS Sbjct: 222 VGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGS 281 Query: 476 GSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCV 297 G+LT PD G T R+ F QISEVA +S NG + ++IV DHRVSFELT E+ CV Sbjct: 282 GTLT-PDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIV-DHRVSFELTTEDVVRCV 339 Query: 296 EKEPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAEN---TCVGETSSNVSGKAFGDGD 126 EK+P + EA S + + TT+ E++ S EAEN +C GE +++ K D Sbjct: 340 EKKPTTLA-EAVSESLQNGTTV-----EKEESSGEAENVHHSCAGEAANDEPLKTPVD-V 392 Query: 125 
DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD---RSDWWANEK 3 +E P H++Q S+ T+GS KEF FD+ DG + + SDWWANEK Sbjct: 393 EEAPRHQKQQSI-TLGSTKEFNFDSADGDSHEPTIASDWWANEK 435 >ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] gi|557541785|gb|ESR52763.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] Length = 460 Score = 279 bits (713), Expect = 2e-72 Identities = 176/344 (51%), Positives = 216/344 (62%), Gaps = 13/344 (3%) Frame = -2 Query: 995 QSEPHSSTQSPTGLLSL---SANTYSPG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 828 QSEP S+TQSP GL+SL S N YSPG P+SIFAIGPYAHETQL Sbjct: 106 QSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPST 165 Query: 827 XXXXXXE---HLTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 657 HLTTPSSPEVPFA+LL SL + QKF S+YEFQSY L+PGSP Sbjct: 166 APFTPPPESVHLTTPSSPEVPFAQLLDPSL----RFGEQGQKFPFSYYEFQSYHLHPGSP 221 Query: 656 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 477 VG+LISPSS IS SGTSSPFPD EF++ G F +F G+PPKL + D LS R+W QGS Sbjct: 222 VGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGS 281 Query: 476 GSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCV 297 G+LT PD T R+ F QISEVA +S NG + ++IV DHRVSFELT E+ CV Sbjct: 282 GTLT-PDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIV-DHRVSFELTTEDVVRCV 339 Query: 296 EKEPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAEN---TCVGETSSNVSGKAFGDGD 126 EK+P + EA S + + TT+ E++ S EAEN +C GE +++ K D Sbjct: 340 EKKPTTLA-EAVSESLQNGTTV-----EKEESSGEAENVHHSCAGEAANDEPLKTPVD-V 392 Query: 125 DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD---RSDWWANEK 3 +E P H++Q S+ T+GS KEF FD+ DG + + SDWWANEK Sbjct: 393 EEAPRHQKQQSI-TLGSTKEFNFDSADGDSHEPTIASDWWANEK 435 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 279 bits (713), Expect = 2e-72 Identities = 185/392 (47%), Positives = 213/392 (54%), Gaps = 62/392 (15%) Frame = -2 Query: 995 QSEPHSSTQSPTGLLSL---SANTYSP-GPNSIFAIGPYAHETQLXXXXXXXXXXXXXXX 828 QS+P SSTQSP 
GLLSL SAN YSP GP SIFAIGPYAHETQL Sbjct: 104 QSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQLVTPPVFSAFTTEPST 163 Query: 827 XXXXXXEH---LTTPSSPEVPFARLLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSP 657 LTTPSSPEVPFA+LLTSSL+R + SGP QKF+ SHYEFQSY LYPGSP Sbjct: 164 APFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSLSHYEFQSYHLYPGSP 223 Query: 656 VGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGS 477 G +ISP S IS SGTSSPFPDR HP LEFR GE PKL F+ STRKW GS Sbjct: 224 GGQIISPGSAISNSGTSSPFPDR------HPMLEFRMGEAPKLLGFEHFSTRKWGSRLGS 277 Query: 476 GSLTP---------------------------------PDPAG----------------P 444 GSLTP PD AG P Sbjct: 278 GSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTLTPDCFVP 337 Query: 443 TSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEA 264 S+ FL+ENQISEVASL NS NGS+ E V+ HRVSFEL+ EE C+E + +AS Sbjct: 338 ASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIK-SVASTRT 396 Query: 263 KSVTPLDKTTLATVTPERDGLSSEAENTCV--GETSSNVSGKAFGDGDDEVPHHRQQPSL 90 P D V +R ++ E C+ GE SS + K + E H ++ Sbjct: 397 FPEYPQDTMPEDPVRGDRLAMNGE---RCLQNGEASSEMPEK--NSEETEEDHVYRKHRS 451 Query: 89 TTIGSVKEFKFDNTDGGTSDR----SDWWANE 6 T+GS+KEF FDN+ G SD+ S+WWANE Sbjct: 452 ITLGSIKEFNFDNSKGEVSDKPAISSEWWANE 483 >ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria vesca subsp. 
vesca] Length = 422 Score = 273 bits (697), Expect = 1e-70 Identities = 176/369 (47%), Positives = 208/369 (56%), Gaps = 7/369 (1%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSANTYSPGPNSI 909 G P EN + +I LQSEP S+ QSP SLSA+ YSPGP+SI Sbjct: 38 GHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPGPSSI 97 Query: 908 FAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSSLDRN 738 FAIGPYAHETQL HLT PSSPEVPFA+LL D N Sbjct: 98 FAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSN 153 Query: 737 CKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFL 558 + Q++ SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGGH FL Sbjct: 154 FRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFL 213 Query: 557 EFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSN 378 EFRTGE PK+ + D L TR W SGS+T PD A TS + F ++ E A SN Sbjct: 214 EFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLNARSN 272 Query: 377 NGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPERDGLS 198 + +N+ I HRVSFEL+AEE CVEK+P +A EA S T L A + Sbjct: 273 SRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGPNQEV 330 Query: 197 SEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS-- 24 S + V +TS++ S KA G +E+ + Q+ T+GS KEF FDN DGG S S Sbjct: 331 SSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSI 390 Query: 23 --DWWANEK 3 DWWANEK Sbjct: 391 STDWWANEK 399 >ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria vesca subsp. 
vesca] Length = 459 Score = 273 bits (697), Expect = 1e-70 Identities = 176/369 (47%), Positives = 208/369 (56%), Gaps = 7/369 (1%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSANTYSPGPNSI 909 G P EN + +I LQSEP S+ QSP SLSA+ YSPGP+SI Sbjct: 75 GHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSLSASMYSPGPSSI 134 Query: 908 FAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSSLDRN 738 FAIGPYAHETQL HLT PSSPEVPFA+LL D N Sbjct: 135 FAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLL----DSN 190 Query: 737 CKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFL 558 + Q++ SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGGH FL Sbjct: 191 FRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFL 250 Query: 557 EFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSN 378 EFRTGE PK+ + D L TR W SGS+T PD A TS + F ++ E A SN Sbjct: 251 EFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLNARSN 309 Query: 377 NGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPERDGLS 198 + +N+ I HRVSFEL+AEE CVEK+P +A EA S T L A + Sbjct: 310 SRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGPNQEV 367 Query: 197 SEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS-- 24 S + V +TS++ S KA G +E+ + Q+ T+GS KEF FDN DGG S S Sbjct: 368 SSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSI 427 Query: 23 --DWWANEK 3 DWWANEK Sbjct: 428 STDWWANEK 436 >ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] gi|508777528|gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 458 Score = 272 bits (696), Expect = 2e-70 Identities = 173/375 (46%), Positives = 212/375 (56%), Gaps = 11/375 (2%) Frame = -2 Query: 1094 FRGRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYSP 924 F G PA EN + P I L SEP S+TQSP GL+SL SA+ YSP Sbjct: 72 FSGANVPAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSP 131 Query: 923 GPNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTS 753 GP SIFAIGPYAHETQL 
HLTTPSSPEVPFA+LL Sbjct: 132 GPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGP 191 Query: 752 SLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSG 573 +L + Q+F SHYEFQSYQL+PGSPVG LISPSS IS SGTSSPF D EF++ Sbjct: 192 NL----QYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAAS 247 Query: 572 GHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVAS 393 H F EFR G+PPKL + D S+ +W H GSG+LT PD T R+ FL+++QISE+ S Sbjct: 248 LH-FPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLT-PDATRSTPRNGFLLDHQISEITS 305 Query: 392 LAN-SNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTP 216 + N QN+++ +HRVSFELT EE +E E S ++ T + Sbjct: 306 HPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEAT---RESE 362 Query: 215 ERDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGT 36 E D + VGETS+ KA D + + HH+ Q T+GS KEF FDN DGG Sbjct: 363 EHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQS--ITLGSAKEFNFDNVDGGD 420 Query: 35 SDR----SDWWANEK 3 + + SDWWAN+K Sbjct: 421 AHKPILTSDWWANDK 435 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 270 bits (691), Expect = 6e-70 Identities = 189/445 (42%), Positives = 227/445 (51%), Gaps = 80/445 (17%) Frame = -2 Query: 1097 VFRGRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYS 927 V G PAPEN I LQS+P S+TQSP GLLSL S N YS Sbjct: 64 VLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSINAYS 123 Query: 926 PG-PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXEH---LTTPSSPEVPFARLL 759 PG P SIFAIGPYA+ETQL LTTPSSPEVPFA+LL Sbjct: 124 PGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 758 TSSLDRNCK-TSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREF 582 TSSLDR + +SG QKF+ SH EFQ YQLYPGSP G+LISP SV+S SGTSSPFPD+ Sbjct: 184 TSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDK-- 241 Query: 581 SSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP---------------PDPAG 447 HP L FR GE P+L F+ +T KW GSGSLTP PD G Sbjct: 242 ----HPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVG 297 Query: 446 
PTSR----------------------------------------DSFLV--------ENQ 411 SR D FLV ENQ Sbjct: 298 LGSRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQ 357 Query: 410 ISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTL 231 ISEVASLANS+NG QN+ V+DHRVSFELT E+ C+ + AS ++ + + + Sbjct: 358 ISEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASK--SASSNGRTTSESLEDSP 415 Query: 230 ATVTPERDGLS-----SEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKE 66 A ++DG+S S + +CV ETS+ +G+D+ H Q+ T+GS+KE Sbjct: 416 AECPTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDD--HFYQKHRSITLGSIKE 473 Query: 65 FKFDNTDGGTSDR----SDWWANEK 3 F FDNT S + S+WWANEK Sbjct: 474 FNFDNTKADVSVKPTIGSEWWANEK 498 >ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346902|gb|ERP65330.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 452 Score = 267 bits (682), Expect = 7e-69 Identities = 174/370 (47%), Positives = 212/370 (57%), Gaps = 10/370 (2%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYSP-G 921 G G PA EN + P + QSEP S TQSP GL+SL SA+ YSP G Sbjct: 73 GNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSG 132 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 P SIFAIGPYAHETQL HLTTPSSPEVPFA+ L S Sbjct: 133 PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPS 192 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 L RN T +F ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG Sbjct: 193 L-RNGDTG---LRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGG 245 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 390 F EFR GEPPKL + D LST +W +QGSG+LTP +FL+ Q S+V S Sbjct: 246 AHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSR 303 Query: 389 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPER 210 S NG +N + V++HRVSFELTAE+ CVE++P + K+V + + Sbjct: 304 PRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKN 359 Query: 209 
DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 30 G S ++ VG TS++ A DG + P HR+Q S+ T+GSVKEF FDN D G S Sbjct: 360 SGESIQSFECRVGVTSNDSPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSR 417 Query: 29 R---SDWWAN 9 + S+WWAN Sbjct: 418 KPSSSNWWAN 427 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 267 bits (682), Expect = 7e-69 Identities = 174/370 (47%), Positives = 212/370 (57%), Gaps = 10/370 (2%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYSP-G 921 G G PA EN + P + QSEP S TQSP GL+SL SA+ YSP G Sbjct: 74 GNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSG 133 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 P SIFAIGPYAHETQL HLTTPSSPEVPFA+ L S Sbjct: 134 PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPS 193 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 L RN T +F ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG Sbjct: 194 L-RNGDTG---LRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGG 246 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 390 F EFR GEPPKL + D LST +W +QGSG+LTP +FL+ Q S+V S Sbjct: 247 AHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSR 304 Query: 389 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPER 210 S NG +N + V++HRVSFELTAE+ CVE++P + K+V + + Sbjct: 305 PRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKN 360 Query: 209 DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 30 G S ++ VG TS++ A DG + P HR+Q S+ T+GSVKEF FDN D G S Sbjct: 361 SGESIQSFECRVGVTSNDSPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSR 418 Query: 29 R---SDWWAN 9 + S+WWAN Sbjct: 419 KPSSSNWWAN 428 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 262 bits (670), Expect = 2e-67 Identities = 179/400 (44%), Positives = 213/400 (53%), 
Gaps = 38/400 (9%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSLSA---NTYSPGP 918 G P EN TI L S+P S+TQSP GLLSL A N YSPG Sbjct: 67 GPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGG 126 Query: 917 N-SIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 SIFAIGPYAHETQL H+TTP SPEVPFA+LLTSS Sbjct: 127 TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSS 186 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 L RN + SG KF S YEF YQ PGSP +LISP SV+S SGTSSPFP G Sbjct: 187 LARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP------GK 239 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------------ 462 P +EFR GEPPK ++ STRKW GSGS+TP Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 461 ---PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEK 291 P+ P SRDS+L+ENQISEVASLANS+NGS+ E VIDHRVSFELT E+ PSC EK Sbjct: 300 TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 359 Query: 290 EPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPH 111 EP+M+ ++ P+D + L + E SS AE G KA G+DE Sbjct: 360 EPVMS--HSQPTLPMDVSNL--LASEMRSGSSMAEEKTYGSPR-----KASESGEDEC-- 408 Query: 110 HRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEK 3 HR+ ++ T GS K+F FDN ++ +WW ++K Sbjct: 409 HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDK 447 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 258 bits (660), Expect = 2e-66 Identities = 178/400 (44%), Positives = 212/400 (53%), Gaps = 38/400 (9%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYSPGP 918 G P EN TI L S+P S+TQSP GLLSL S N YSPG Sbjct: 67 GPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGG 126 Query: 917 N-SIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 SIFAIGPYAHETQL H+TTP SPEVPFA+LLTSS Sbjct: 127 TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSS 186 Query: 749 
LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 L RN + SG KF S YEF YQ PGSP +LISP SV+S SGTSSPFP G Sbjct: 187 LARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP------GK 239 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------------ 462 P +EFR GEPPK ++ STRKW GSGSLTP Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 461 ---PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEK 291 P+ P SRDS+L+E QISEVASLANS+NGS+ E VIDHRVSFELT E+ PSC EK Sbjct: 300 TVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359 Query: 290 EPMMASLEAKSVTPLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPH 111 EP+M+ ++ P+D + L + E SS AE G KA G+D+ Sbjct: 360 EPVMS--HSQQTLPMDVSNL--LANEMKSGSSMAEEKTYGSPR-----KASESGEDQC-- 408 Query: 110 HRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEK 3 HR+ ++ T GS K+F FDN ++ +WW ++K Sbjct: 409 HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDK 447 >gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 245 bits (626), Expect = 2e-62 Identities = 169/376 (44%), Positives = 199/376 (52%), Gaps = 14/376 (3%) Frame = -2 Query: 1088 GRGCPAPENAIRPPTIXXXXXXXXXXXXXXLQSEPHSSTQSPTGLLSL---SANTYSPG- 921 G P EN+ + + LQSEP S+TQSP GLLSL SA+ YSPG Sbjct: 76 GNSAPRAENSTQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGG 135 Query: 920 PNSIFAIGPYAHETQLXXXXXXXXXXXXXXXXXXXXXE---HLTTPSSPEVPFARLLTSS 750 P SIFAIGPYAHETQL HLTTPSSPEVPFA+LL Sbjct: 136 PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL--- 192 Query: 749 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 570 D N P Q+F H EFQSY PGSP+G LISPSS IS SGTSSPFPD EF++ G Sbjct: 193 -DPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARG 251 Query: 569 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 390 FLEFRTG+PPKL + D LS W QGSGSLT PD P S EVA Sbjct: 252 PHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLT-PDSVKPIS---------TFEVAPH 301 Query: 389 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTPLDKTTLATVTPER 210 N +N E V D RVSF+++ E+ VEK+ + L +T L 
TT+ Sbjct: 302 LKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTV--PLAEAMLTSLKDTTMGQREENS 359 Query: 209 DGLSSE---AENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 39 D E EN VGETS+ KA G++ + H + + T+GS KEF FDN D G Sbjct: 360 DSNKVEEIGCENR-VGETSNEEPDKAPTSGEEVLQHQKHRS--ITLGSSKEFNFDNADAG 416 Query: 38 ----TSDRSDWWANEK 3 + SDWWAN+K Sbjct: 417 DLHKSDSVSDWWANQK 432