BLASTX nr result
ID: Akebia27_contig00014989
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00014989
         (792 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   246   9e-63
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   246   9e-63
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   243   6e-62
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   243   6e-62
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   241   3e-61
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   231   2e-58
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   225   2e-56
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   217   4e-54
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   214   4e-53
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   211   2e-52
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   211   2e-52
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   196   6e-48
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   196   1e-47
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   195   2e-47
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   195   2e-47
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   190   4e-46
ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot...   181   3e-43
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     179   8e-43
ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas...
174 3e-41 ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777... 172 1e-40 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 246 bits (627), Expect = 9e-63 Identities = 145/301 (48%), Positives = 179/301 (59%), Gaps = 38/301 (12%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSL+R + SG QKF SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR Sbjct: 186 LLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR- 244 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 P LEFR GE PKL F+ +TRKW GSGSLTP Sbjct: 245 -----RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGM 299 Query: 305 ------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTA 448 PD GP SRD FLV +QISEVA LAN NG +N+E ++DHRVSFEL+ Sbjct: 300 GLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSG 359 Query: 449 EETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSG 619 E+ C+E + ++ S ++V+ K +A ERDG+ + E++C + ETS+ Sbjct: 360 EDVAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVE 416 Query: 620 KAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEKVVITKEAG 787 KA G+ ++E H Q+ T+GS+KEF FDNT G SD RS+WWANEKV KEA Sbjct: 417 KASGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVA-GKEAR 473 Query: 788 P 790 P Sbjct: 474 P 474 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 246 bits (627), Expect = 9e-63 Identities = 145/301 (48%), Positives = 179/301 (59%), Gaps = 38/301 (12%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSL+R + SG QKF SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR Sbjct: 182 LLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR- 240 
Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 P LEFR GE PKL F+ +TRKW GSGSLTP Sbjct: 241 -----RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGM 295 Query: 305 ------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTA 448 PD GP SRD FLV +QISEVA LAN NG +N+E ++DHRVSFEL+ Sbjct: 296 GLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSG 355 Query: 449 EETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSG 619 E+ C+E + ++ S ++V+ K +A ERDG+ + E++C + ETS+ Sbjct: 356 EDVAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVE 412 Query: 620 KAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEKVVITKEAG 787 KA G+ ++E H Q+ T+GS+KEF FDNT G SD RS+WWANEKV KEA Sbjct: 413 KASGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVA-GKEAR 469 Query: 788 P 790 P Sbjct: 470 P 470 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 243 bits (620), Expect = 6e-62 Identities = 149/271 (54%), Positives = 170/271 (62%), Gaps = 8/271 (2%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSLDR+ + SG QK + S+YEFQ YQLYP SPVGHLISP IS SGTSSPFPDR Sbjct: 182 LLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR 238 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQIS 361 P +E PKL F+ STR+W GSGSLTP D AGP SRDSFL+ENQIS Sbjct: 239 ------PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQIS 286 Query: 362 EVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLAT 541 EVASLANS +GSQN E VIDHRVSFEL E+ CVEK+P +AS E T D Sbjct: 287 EVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGE 345 Query: 542 VTPERDGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFD 712 + ERDG+S EN CVGE S KA +G++E H + P GS+KEF FD Sbjct: 346 IERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFD 403 Query: 713 NTDGGTSDR-----SDWWANEKVVITKEAGP 790 NT G S + S+WW NEKVV K GP Sbjct: 404 NTKGEVSAKPNIIGSEWWVNEKVV-GKGTGP 433 >emb|CAN63074.1| hypothetical protein 
VITISV_026979 [Vitis vinifera] Length = 385 Score = 243 bits (620), Expect = 6e-62 Identities = 149/271 (54%), Positives = 170/271 (62%), Gaps = 8/271 (2%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSLDR+ + SG QK + S+YEFQ YQLYP SPVGHLISP IS SGTSSPFPDR Sbjct: 119 LLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR 175 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQIS 361 P +E PKL F+ STR+W GSGSLTP D AGP SRDSFL+ENQIS Sbjct: 176 ------PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQIS 223 Query: 362 EVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLAT 541 EVASLANS +GSQN E VIDHRVSFEL E+ CVEK+P +AS E T D Sbjct: 224 EVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGE 282 Query: 542 VTPERDGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFD 712 + ERDG+S EN CVGE S KA +G++E H + P GS+KEF FD Sbjct: 283 IERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFD 340 Query: 713 NTDGGTSDR-----SDWWANEKVVITKEAGP 790 NT G S + S+WW NEKVV K GP Sbjct: 341 NTKGEVSAKPNIIGSEWWVNEKVV-GKGTGP 370 >ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] gi|462404864|gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 241 bits (614), Expect = 3e-61 Identities = 145/262 (55%), Positives = 174/262 (66%), Gaps = 4/262 (1%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD + + Q+F SHYEFQSYQLYPGSPVG LISPSS IS SGTSSPFPD EF++ G Sbjct: 187 LDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARG 246 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 H FLEFRTG+PPKL + D LSTR W GSGS+T PD A TS D FL++ Q EV Sbjct: 247 HHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVT-PDGAKSTSSDGFLLKPQTPEVVLN 305 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 SNN +NN+I I+HRVSFEL++EE CVEK+P +A EA S TSL+ T A + Sbjct: 306 PRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKP-VALAEAVS-TSLEDTEKA--QSKE 361 
Query: 557 DGLSSEAENTC-VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTS 733 D + + C VGETS++ + KA DG++ H +Q+ T+GSVKEF FDN DGG S Sbjct: 362 DPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRS--ITLGSVKEFNFDNPDGGDS 419 Query: 734 DR---SDWWANEKVVITKEAGP 790 SDWWANEK V KE GP Sbjct: 420 GNSIGSDWWANEK-VDAKENGP 440 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 231 bits (590), Expect = 2e-58 Identities = 143/317 (45%), Positives = 176/317 (55%), Gaps = 54/317 (17%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSLDRN + SG QKF SHYEFQ YQ YPGSP G+LISP S +S SGTSSPFPDR Sbjct: 181 LLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR- 239 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 HP LEFR GE PKL+ FD +TRKW GSGSLTP Sbjct: 240 -----HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGN 294 Query: 305 ----------------------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQ 400 PD GP SRDSFL+ENQISEVASLANS +G Q Sbjct: 295 ELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQ 354 Query: 401 NNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE 580 E V DHRVSFELT E+ C+ + + ++ ++ + K + ERD LSS++ Sbjct: 355 TVETVFDHRVSFELTGEDVACCLANKAVASN---RTASGSSKVIASEYPSERDALSSDSS 411 Query: 581 NTC---VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---- 739 N C V E+SS + G+G+D+ +R+ S+ T+GS K+F FDNT ++ Sbjct: 412 NHCEFSVEESSSRIPENVSGEGEDQ--GYRKHRSI-TLGSTKDFNFDNTKAEVPNKPNIG 468 Query: 740 SDWWANEKVVITKEAGP 790 S+WWAN K V KE+ P Sbjct: 469 SEWWAN-KNVAAKESKP 484 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 225 bits (573), Expect = 2e-56 Identities = 141/287 (49%), Positives = 173/287 (60%), Gaps = 31/287 (10%) Frame = +2 Query: 20 DRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREF-SSGG 196 D N + +F S YEFQSYQLYPGSPVGHLISPSS IS SGTSSPFPDR+F 
SG Sbjct: 190 DPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGS 249 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSR--------------- 331 FLEFR G PPKL + D LS +W GSGS+T PD GP SR Sbjct: 250 SQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSIT-PDALGPPSRDGSVLDRQVSDVIHP 308 Query: 332 ---DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKE------PM 484 D +++ QIS+VAS + S++G NNEI++DHRVSFELTAE+ CVEK+ + Sbjct: 309 PSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 368 Query: 485 MASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGD--GDDEVPHH 658 ASL+ + +D+ + V + SE VGET++N KA D G++ PHH Sbjct: 369 SASLQNPATVEIDENSREVV------VDSEGR---VGETANNPPEKAPEDANGEEGQPHH 419 Query: 659 RQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWANEKVVITKEAG 787 +Q+ T+GS KEF FDN DGG SD+ SDWWANEKVV KE G Sbjct: 420 KQRS--ITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVV-GKEVG 463 >ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis] Length = 460 Score = 217 bits (552), Expect = 4e-54 Identities = 130/257 (50%), Positives = 167/257 (64%), Gaps = 6/257 (2%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD + + QKF S+YEFQSY L+PGSPVG+LISPSS IS SGTSSPFPD EF++ G Sbjct: 191 LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 F +F G+PPKL + D LS R+W QGSG+LT PD G T R+ F QISEVA Sbjct: 251 PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLT-PDAVGSTPRNGFFQNRQISEVALR 309 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 +S NG + ++IV DHRVSFELT E+ CVEK+P + EA S + + TT+ E+ Sbjct: 310 PHSENGLRKDQIV-DHRVSFELTTEDVVRCVEKKPTTLA-EAVSESLQNGTTV-----EK 362 Query: 557 DGLSSEAEN---TCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727 + S EAEN +C GE +++ K D +E P H++Q S+ T+GS KEF FD+ DG Sbjct: 363 EESSGEAENVHHSCAGEAANDEPLKTPVD-VEEAPRHQKQQSI-TLGSTKEFNFDSADGD 420 Query: 728 TSD---RSDWWANEKVV 769 + + SDWWANEKVV Sbjct: 421 SHEPTIASDWWANEKVV 437 >ref|XP_006439523.1| hypothetical protein 
CICLE_v10020073mg [Citrus clementina] gi|557541785|gb|ESR52763.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] Length = 460 Score = 214 bits (544), Expect = 4e-53 Identities = 129/257 (50%), Positives = 166/257 (64%), Gaps = 6/257 (2%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD + + QKF S+YEFQSY L+PGSPVG+LISPSS IS SGTSSPFPD EF++ G Sbjct: 191 LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 F +F G+PPKL + D LS R+W QGSG+LT PD T R+ F QISEVA Sbjct: 251 PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLT-PDAVRSTPRNGFFQNRQISEVALR 309 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 +S NG + ++IV DHRVSFELT E+ CVEK+P + EA S + + TT+ E+ Sbjct: 310 PHSENGLRKDQIV-DHRVSFELTTEDVVRCVEKKPTTLA-EAVSESLQNGTTV-----EK 362 Query: 557 DGLSSEAEN---TCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727 + S EAEN +C GE +++ K D +E P H++Q S+ T+GS KEF FD+ DG Sbjct: 363 EESSGEAENVHHSCAGEAANDEPLKTPVD-VEEAPRHQKQQSI-TLGSTKEFNFDSADGD 420 Query: 728 TSD---RSDWWANEKVV 769 + + SDWWANEKVV Sbjct: 421 SHEPTIASDWWANEKVV 437 >ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria vesca subsp. 
vesca] Length = 422 Score = 211 bits (538), Expect = 2e-52 Identities = 132/261 (50%), Positives = 159/261 (60%), Gaps = 4/261 (1%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD N + Q++ SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGG Sbjct: 150 LDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGG 209 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 H FLEFRTGE PK+ + D L TR W SGS+T PD A TS + F ++ E Sbjct: 210 HHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLN 268 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 A SN+ +N+ I HRVSFEL+AEE CVEK+P +A EA S TSL A Sbjct: 269 ARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGP 326 Query: 557 DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 736 + S + V +TS++ S KA G +E+ + Q+ T+GS KEF FDN DGG S Sbjct: 327 NQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSG 386 Query: 737 RS----DWWANEKVVITKEAG 787 S DWWANEKVV+ KE G Sbjct: 387 TSSISTDWWANEKVVL-KENG 406 >ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria vesca subsp. 
vesca] Length = 459 Score = 211 bits (538), Expect = 2e-52 Identities = 132/261 (50%), Positives = 159/261 (60%), Gaps = 4/261 (1%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD N + Q++ SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGG Sbjct: 187 LDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGG 246 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 H FLEFRTGE PK+ + D L TR W SGS+T PD A TS + F ++ E Sbjct: 247 HHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLN 305 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 A SN+ +N+ I HRVSFEL+AEE CVEK+P +A EA S TSL A Sbjct: 306 ARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGP 363 Query: 557 DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 736 + S + V +TS++ S KA G +E+ + Q+ T+GS KEF FDN DGG S Sbjct: 364 NQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSG 423 Query: 737 RS----DWWANEKVVITKEAG 787 S DWWANEKVV+ KE G Sbjct: 424 TSSISTDWWANEKVVL-KENG 443 >ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] gi|508777528|gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 458 Score = 196 bits (499), Expect = 6e-48 Identities = 116/244 (47%), Positives = 148/244 (60%), Gaps = 5/244 (2%) Frame = +2 Query: 50 QKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEP 229 Q+F SHYEFQSYQL+PGSPVG LISPSS IS SGTSSPF D EF++ H F EFR G+P Sbjct: 200 QRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLH-FPEFRMGDP 258 Query: 230 PKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLAN-SNNGSQNN 406 PKL + D S+ +W H GSG+LT PD T R+ FL+++QISE+ S + N QN+ Sbjct: 259 PKLLNLDKHSSCEWGSHHGSGTLT-PDATRSTPRNGFLLDHQISEITSHPHLKNKEVQND 317 Query: 407 EIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENT 586 ++ +HRVSFELT EE +E E S ++ T + E D + Sbjct: 318 QVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEAT---RESEEHDTKVVDDYEC 374 Query: 587 
CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWA 754 VGETS+ KA D + + HH+ Q T+GS KEF FDN DGG + + SDWWA Sbjct: 375 RVGETSNERPEKALADREGKPQHHKHQS--ITLGSAKEFNFDNVDGGDAHKPILTSDWWA 432 Query: 755 NEKV 766 N+KV Sbjct: 433 NDKV 436 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 196 bits (497), Expect = 1e-47 Identities = 131/293 (44%), Positives = 163/293 (55%), Gaps = 31/293 (10%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSL RN + SG KF S YEF YQ PGSP +LISP SV+S SGTSSPFP Sbjct: 182 LLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP--- 237 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 G P +EFR GEPPK ++ STRKW GSGS+TP Sbjct: 238 ---GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGIS 294 Query: 305 --------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETP 460 P+ P SRDS+L+ENQISEVASLANS+NGS+ E VIDHRVSFELT E+ P Sbjct: 295 RLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVP 354 Query: 461 SCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGD 640 SC EKEP+M+ ++ +D + L + E SS AE G KA G+ Sbjct: 355 SCREKEPVMS--HSQPTLPMDVSNL--LASEMRSGSSMAEEKTYGSPR-----KASESGE 405 Query: 641 DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEKVVITKEAG 787 DE HR+ ++ T GS K+F FDN ++ +WW ++K + KE+G Sbjct: 406 DEC--HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAV-KESG 454 >ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346902|gb|ERP65330.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 452 Score = 195 bits (495), Expect = 2e-47 Identities = 114/236 (48%), Positives = 147/236 (62%), Gaps = 3/236 (1%) Frame = +2 Query: 71 YEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFD 250 ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG F EFR GEPPKL + D Sbjct: 204 FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLD 263 Query: 251 
GLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRV 430 LST +W +QGSG+LTP +FL+ Q S+V S S NG +N + V++HRV Sbjct: 264 KLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRV 320 Query: 431 SFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSN 610 SFELTAE+ CVE++P + K+V + + G S ++ VG TS++ Sbjct: 321 SFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSND 377 Query: 611 VSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---SDWWANEKVV 769 A DG + P HR+Q S+ T+GSVKEF FDN D G S + S+WWAN V+ Sbjct: 378 SPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSRKPSSSNWWANGSVI 431 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 195 bits (495), Expect = 2e-47 Identities = 114/236 (48%), Positives = 147/236 (62%), Gaps = 3/236 (1%) Frame = +2 Query: 71 YEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFD 250 ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG F EFR GEPPKL + D Sbjct: 205 FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLD 264 Query: 251 GLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRV 430 LST +W +QGSG+LTP +FL+ Q S+V S S NG +N + V++HRV Sbjct: 265 KLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRV 321 Query: 431 SFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSN 610 SFELTAE+ CVE++P + K+V + + G S ++ VG TS++ Sbjct: 322 SFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSND 378 Query: 611 VSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---SDWWANEKVV 769 A DG + P HR+Q S+ T+GSVKEF FDN D G S + S+WWAN V+ Sbjct: 379 SPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSRKPSSSNWWANGSVI 432 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 190 bits (483), Expect = 4e-46 Identities = 130/293 (44%), Positives = 161/293 (54%), Gaps = 31/293 (10%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSL RN + SG KF S YEF 
YQ PGSP +LISP SV+S SGTSSPFP Sbjct: 182 LLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP--- 237 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 G P +EFR GEPPK ++ STRKW GSGSLTP Sbjct: 238 ---GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGIS 294 Query: 305 --------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETP 460 P+ P SRDS+L+E QISEVASLANS+NGS+ E VIDHRVSFELT E+ P Sbjct: 295 RLGSGTVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVP 354 Query: 461 SCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGD 640 SC EKEP+M+ ++ +D + L + E SS AE G KA G+ Sbjct: 355 SCREKEPVMS--HSQQTLPMDVSNL--LANEMKSGSSMAEEKTYGSPR-----KASESGE 405 Query: 641 DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEKVVITKEAG 787 D+ HR+ ++ T GS K+F FDN ++ +WW ++K KE+G Sbjct: 406 DQC--HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAA-GKESG 454 >ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] gi|508776005|gb|EOY23261.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 540 Score = 181 bits (459), Expect = 3e-43 Identities = 114/275 (41%), Positives = 146/275 (53%), Gaps = 44/275 (16%) Frame = +2 Query: 98 PGSPVGHLISPS------SVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLS 259 P P L++ S IS SGTSSPFPDR P LEF GE PKL F+ L+ Sbjct: 263 PEVPFAQLLASSLESARRKAISNSGTSSPFPDRR------PILEFHMGEAPKLLGFENLT 316 Query: 260 TRKWVPHQGSGSLTP-------------------------------PDPAGPTSRDSFLV 346 TRKW GSGSLTP PD GP SRD FL+ Sbjct: 317 TRKWCSRLGSGSLTPDGLGRGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPPSRDGFLL 376 Query: 347 ENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDK 526 +QISEVA L N NG +N+E ++DHRVSFEL+ E+ C+E + ++ S ++V+ K Sbjct: 377 GSQISEVALLTNQANGPKNDETIVDHRVSFELSGEDVARCLESKSLLPS---RTVSEYPK 433 Query: 527 TTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVK 697 +A ERDG+ + E++C + ETS+ KA G ++E H Q+ T+GS+K Sbjct: 434 DLVAEGRIERDGIKKDLESSCELFIRETSNETVEKASGKAEEE--HSYQKHRSVTLGSIK 491 Query: 698 EFKFDNTDGGTSD----RSDWWANEKVVITKEAGP 
790 EF FDNT G SD RS+WWANEK KEA P Sbjct: 492 EFNFDNTKGEASDKPTIRSEWWANEKFA-RKEARP 525 >gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 179 bits (455), Expect = 8e-43 Identities = 120/265 (45%), Positives = 144/265 (54%), Gaps = 7/265 (2%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD N P Q+F H EFQSY PGSP+G LISPSS IS SGTSSPFPD EF++ G Sbjct: 192 LDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARG 251 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376 FLEFRTG+PPKL + D LS W QGSGSLT PD P S EVA Sbjct: 252 PHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLT-PDSVKPIS---------TFEVAPH 301 Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556 N +N E V D RVSF+++ E+ VEK+ + L +TSL TT+ Sbjct: 302 LKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTV--PLAEAMLTSLKDTTMGQREENS 359 Query: 557 DGLSSE---AENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727 D E EN VGETS+ KA G++ + H + + T+GS KEF FDN D G Sbjct: 360 DSNKVEEIGCENR-VGETSNEEPDKAPTSGEEVLQHQKHRS--ITLGSSKEFNFDNADAG 416 Query: 728 ----TSDRSDWWANEKVVITKEAGP 790 + SDWWAN+KV KE P Sbjct: 417 DLHKSDSVSDWWANQKVA-GKEGAP 440 >ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] gi|561016644|gb|ESW15448.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris] Length = 479 Score = 174 bits (442), Expect = 3e-41 Identities = 112/293 (38%), Positives = 152/293 (51%), Gaps = 37/293 (12%) Frame = +2 Query: 2 LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181 LLTSSLDR+CK G Q+F S+YEFQ YQ YPGSP LISP+S+IS SG+S+PFPD Sbjct: 180 LLTSSLDRDCKDKGTNQRFALSNYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDT- 238 Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304 HP LEF GE L F+ ST KW GSGSLTP Sbjct: 239 -----HPLLEFHKGEASNLLGFEHFSTHKWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAV 293 Query: 305 ----------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEE 454 P+ PT+R+ V Q SE+ LANS N Q N ++DHRVSFELT E+ Sbjct: 294 
KLVSSSGCLTPEGVAPTARNGIYVGKQTSELTPLANSENECQPNAALVDHRVSFELTGED 353 Query: 455 TPSCVEKE---PMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE-NTCVGETSSNVSGK 622 C+ + P++ ++ S +L V ER +S+++ + C +TS++ Sbjct: 354 VARCLANKSGSPLIGNISGSSQGAL---VGEPVDRERIHKNSDSDCDLCSRKTSNDKPEN 410 Query: 623 AFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWANEKVV 769 + G+G+++ S S K+F FD+ G SD S+WW N+K+V Sbjct: 411 SPGEGEEQCCLKHNSSS-----SSKDFNFDSRKGVVSDNPANASEWWTNKKIV 458 >ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777876 [Glycine max] Length = 431 Score = 172 bits (437), Expect = 1e-40 Identities = 109/262 (41%), Positives = 146/262 (55%), Gaps = 5/262 (1%) Frame = +2 Query: 17 LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196 LD N K S Q+F S Y+F SYQL+PGSPVG LISP S S SGTSSPFPD +F+S G Sbjct: 177 LDPNTKNSETYQRFQISQYDFHSYQLHPGSPVGQLISPRSAFSPSGTSSPFPDTDFNSRG 236 Query: 197 HPFLEFRTGEPPKLWSFDGLSTRK-WVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVAS 373 L+F+ G+P KL +FD ST + HQGSGSLT PD T++ FL + +S++ Sbjct: 237 SLLLDFQIGDPTKLLNFDKPSTNENHKSHQGSGSLT-PDSIRSTTQAGFLPSHWVSDII- 294 Query: 374 LANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPE 553 ++ + NEI ++HRVS E++A+E CVE + + S L T P Sbjct: 295 MSPRPRKNHPNEISVNHRVSIEVSAQEVLKCVENKAVALS------------KLKTDAPG 342 Query: 554 RDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTS 733 D + E V ET ++ + +GD E HH+ + + KEF FDN +GG S Sbjct: 343 EDKKDNSIE-VLVSETPNDAPQQTADNGDVERAHHKDE--CIIFSAAKEFNFDNAEGGDS 399 Query: 734 DR----SDWWANEKVVITKEAG 787 +DWWANEKV +KE G Sbjct: 400 PAPNIVADWWANEKVA-SKEGG 420