BLASTX nr result
ID: Atropa21_contig00017978
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00017978 (1097 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   428   e-117
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   419   e-114
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...   239   1e-60
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   234   5e-59
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   234   5e-59
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   233   1e-58
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   228   3e-57
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     226   2e-56
gb|EOY23261.1| Hydroxyproline-rich glycoprotein family protein [...   223   1e-55
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   214   5e-53
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   214   5e-53
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...   206   1e-50
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   206   1e-50
ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutr...   199   1e-48
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   188   4e-45
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   188   4e-45
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...   187   6e-45
emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|72694...
187 6e-45 gb|AFK46430.1| unknown [Medicago truncatula] 186 1e-44 ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806... 185 3e-44 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 428 bits (1101), Expect = e-117 Identities = 207/234 (88%), Positives = 217/234 (92%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGT 180 PIIEFRKGEPPKFLGYEHFSTRKWGSR+GSGSLTPSGWGSRL SGT TPNGGISRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 181 VTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSCREKE 360 VTPNGGEPPSRDSYLLE QISE+ASLANSDNGSEI EGVI+HRVSFELTGEDVPSCREKE Sbjct: 301 VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKE 360 Query: 361 PIMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDGLPRKASESGEDHCHRKHRNIT 540 P+MSHSQQ+LPMDV SNLLA EM+S S+ +EKT G PRKASESGED CHRKHRNIT Sbjct: 361 PVMSHSQQTLPMDV----SNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNIT 416 Query: 541 FGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQPGVS 702 FGSSKDFDFDNVKIEVLEK+ +DCEWWTSDKA GKESGIQNNWTFFPVLQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 419 bits (1076), Expect = e-114 Identities = 203/234 (86%), Positives = 213/234 (91%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGT 180 PIIEFRKGEPPKFLGYEHFSTRKWGSR+GSGS+TPSGWGSRL SGT TPNGGISRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 181 VTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSCREKE 360 VTPNGGEPPSRDSYLLENQISE+ASLANSDNGSEI E VI+HRVSFELT EDVPSCREKE Sbjct: 301 VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 360 Query: 361 PIMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDGLPRKASESGEDHCHRKHRNIT 540 P+MSHSQ +LPMDV SNLLA EM S S+ +EKT G PRKASESGED CHRKHRNIT Sbjct: 361 
PVMSHSQPTLPMDV----SNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNIT 416 Query: 541 FGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQPGVS 702 FGSSKDFDFDNVKIEVLEK+ +DCEWWTSDKA KESGIQNNWTFFPVLQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 >gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 239 bits (611), Expect = 1e-60 Identities = 131/258 (50%), Positives = 163/258 (63%), Gaps = 25/258 (9%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSG------------------WGSRL 126 P++EFR GE PK G++HF+TRKWGSRIGSGSLTP G GSRL Sbjct: 241 PVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRL 300 Query: 127 ASGTQTPNG-GI-SRLGSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVI 300 SG TPNG GI SRLGSG +TP+G P SRDS+LLENQISE+ASLANS++G + E V Sbjct: 301 GSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVF 360 Query: 301 NHRVSFELTGEDVPSCREKEPIMSHSQQSLPMDVPAAT----SNLLAKEMESSCSI-VKE 465 +HRVSFELTGEDV C + + S+ S V A+ + L+ + + C V+E Sbjct: 361 DHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEE 420 Query: 466 KTDGLPRKASESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGK 645 + +P S GED +RKHR+IT GS+KDF+FDN K EV K + EWW + K Sbjct: 421 SSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAK 480 Query: 646 ESGIQNNWTFFPVLQPGV 699 ES N+WTFFP+LQPGV Sbjct: 481 ESKPCNDWTFFPILQPGV 498 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 234 bits (597), Expect = 5e-59 Identities = 124/244 (50%), Positives = 174/244 (71%), Gaps = 10/244 (4%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWG--SRLASGTQTPNG-GI-SRL 168 PI+EFR GE PK LG+E+F+TRKWGSR+GSGSLTP G G SRL SG+ TP+G G+ SRL Sbjct: 246 PILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRL 305 Query: 169 GSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSC 348 GSG++TP+G P SRD +L+ +QISE+A LAN NG + +E +++HRVSFEL+GEDV C Sbjct: 306 GSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPC 365 
Query: 349 REKEPIM-SHSQQSLPMDVPAA---TSNLLAKEMESSCSI-VKEKTDGLPRKAS-ESGED 510 E + ++ S + P D+ A + + K++ESSC + ++E ++ KAS E+ E+ Sbjct: 366 LESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEE 425 Query: 511 HCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQ 690 H ++KHR++T GS K+F+FDN K E +K + EWW ++K GKE+ N+WTFFP+LQ Sbjct: 426 HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQ 485 Query: 691 PGVS 702 P VS Sbjct: 486 PEVS 489 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 234 bits (597), Expect = 5e-59 Identities = 124/244 (50%), Positives = 174/244 (71%), Gaps = 10/244 (4%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWG--SRLASGTQTPNG-GI-SRL 168 PI+EFR GE PK LG+E+F+TRKWGSR+GSGSLTP G G SRL SG+ TP+G G+ SRL Sbjct: 242 PILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRL 301 Query: 169 GSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSC 348 GSG++TP+G P SRD +L+ +QISE+A LAN NG + +E +++HRVSFEL+GEDV C Sbjct: 302 GSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPC 361 Query: 349 REKEPIM-SHSQQSLPMDVPAA---TSNLLAKEMESSCSI-VKEKTDGLPRKAS-ESGED 510 E + ++ S + P D+ A + + K++ESSC + ++E ++ KAS E+ E+ Sbjct: 362 LESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEE 421 Query: 511 HCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQ 690 H ++KHR++T GS K+F+FDN K E +K + EWW ++K GKE+ N+WTFFP+LQ Sbjct: 422 HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQ 481 Query: 691 PGVS 702 P VS Sbjct: 482 PEVS 485 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 233 bits (593), Expect = 1e-58 Identities = 130/257 (50%), Positives = 168/257 (65%), Gaps = 23/257 (8%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSG--WGSRLASGTQTPNG--GISRL 168 PI+EFR GE PK LG+EHF+TRKWGSR+GSG++TP G GSRL SGT TP+G SRL Sbjct: 255 
PILEFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRL 314 Query: 169 GSGTVTPNGGE----------------PPSRDSYLLENQISEIASLANSDNGSEIEEGVI 300 GSGTVTP+G P SRD + LENQISE+ASLANS+NGS+ +E ++ Sbjct: 315 GSGTVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIV 374 Query: 301 NHRVSFELTGEDVPSCREKEPIMS-HSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDG 477 +HRVSFEL+GE+V C E + + S + P D A K + + ++ +T G Sbjct: 375 DHRVSFELSGEEVARCLESKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSG 434 Query: 478 -LPRKAS-ESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKES 651 P K S E E+HC+RKHR+IT GS K+F+FDN K EV +K ++ EWW ++ GKE+ Sbjct: 435 ETPEKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEA 493 Query: 652 GIQNNWTFFPVLQPGVS 702 NNWTFFP+LQP VS Sbjct: 494 RPANNWTFFPLLQPEVS 510 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 228 bits (582), Expect = 3e-57 Identities = 132/263 (50%), Positives = 168/263 (63%), Gaps = 29/263 (11%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPS----GWG-SRLASGTQTPNG-GIS 162 P++EFR GE PK LG+EHFSTRKWGSR+GSGSLTP G G SRL SGT TP+G G+S Sbjct: 248 PMLEFRMGEAPKLLGFEHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLS 307 Query: 163 RLGSGTVTPNGGE----------------PPSRDSYLLENQISEIASLANSDNGSEIEEG 294 RL SGT TP+G P S+ +LLENQISE+ASL NS+NGS+ EE Sbjct: 308 RLCSGTATPDGAGLRSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEEN 367 Query: 295 VINHRVSFELTGEDVPSCREKEPIMS------HSQQSLPMDVPAATSNLLAKEMESSCSI 456 V++HRVSFEL+GE+V C E + + S + Q ++P D + LA E C Sbjct: 368 VVHHRVSFELSGEEVARCLEIKSVASTRTFPEYPQDTMPED--PVRGDRLAMNGER-CLQ 424 Query: 457 VKEKTDGLPRKASE-SGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDK 633 E + +P K SE + EDH +RKHR+IT GS K+F+FDN K EV +K + EWW ++ Sbjct: 425 NGEASSEMPEKNSEETEEDHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANET 484 Query: 634 ATGKESGIQNNWTFFPVLQPGVS 702 GKE+ N+WTFFP+LQP VS Sbjct: 485 IAGKEARPANSWTFFPLLQPEVS 507 >gb|EXB93840.1| hypothetical protein L484_004326 
[Morus notabilis] Length = 521 Score = 226 bits (575), Expect = 2e-56 Identities = 128/279 (45%), Positives = 166/279 (59%), Gaps = 45/279 (16%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPS------------------------ 108 PI+ FR GE P+ LG+EHF+T KWGSR+GSGSLTP Sbjct: 243 PILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLGSRL 302 Query: 109 ----------GWGSRLASGTQTPNG-GI-SRLGSGTVTPNGGEPPSRDSYLLENQISEIA 252 G GSRL SG TPNG G+ SRLGSGT+TP+G S DS+LLENQISE+A Sbjct: 303 GSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVA 362 Query: 253 SLANSDNGSEIEEGVINHRVSFELTGEDVPSCREKEPIMSH------SQQSLPMDVPAAT 414 SLANSDNG + + V++HRVSFELTGEDV C + S+ S + P + P Sbjct: 363 SLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECPTKK 422 Query: 415 SNLLAKEMES--SCSIVKEKTDGLPRKASESGE-DHCHRKHRNITFGSSKDFDFDNVKIE 585 + A ++S S V+E ++ P+ GE DH ++KHR+IT GS K+F+FDN K + Sbjct: 423 DGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDNTKAD 482 Query: 586 VLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQPGVS 702 V K + EWW ++K GKE+ N+W+FFP+LQPGVS Sbjct: 483 VSVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521 >gb|EOY23261.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 540 Score = 223 bits (567), Expect = 1e-55 Identities = 118/243 (48%), Positives = 169/243 (69%), Gaps = 10/243 (4%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWG--SRLASGTQTPNG-GI-SRL 168 PI+EF GE PK LG+E+ +TRKW SR+GSGSLTP G G SRL SG+ TP+G G+ SRL Sbjct: 297 PILEFHMGEAPKLLGFENLTTRKWCSRLGSGSLTPDGLGRGSRLGSGSVTPDGMGLGSRL 356 Query: 169 GSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSC 348 GSG++TP+G PPSRD +LL +QISE+A L N NG + +E +++HRVSFEL+GEDV C Sbjct: 357 GSGSLTPDGLGPPSRDGFLLGSQISEVALLTNQANGPKNDETIVDHRVSFELSGEDVARC 416 Query: 349 REKEPIM-SHSQQSLPMDVPA---ATSNLLAKEMESSCSI-VKEKTDGLPRKAS-ESGED 510 E + ++ S + P D+ A + + K++ESSC + ++E ++ KAS ++ E+ Sbjct: 417 LESKSLLPSRTVSEYPKDLVAEGRIERDGIKKDLESSCELFIRETSNETVEKASGKAEEE 476 Query: 511 HCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKESGIQNNWTFFPVLQ 690 
H ++KHR++T GS K+F+FDN K E +K + EWW ++K KE+ N+WTFFP+ + Sbjct: 477 HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKFARKEARPGNSWTFFPMFR 536 Query: 691 PGV 699 PGV Sbjct: 537 PGV 539 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 214 bits (545), Expect = 5e-53 Identities = 122/260 (46%), Positives = 160/260 (61%), Gaps = 26/260 (10%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSG------------------WGSRL 126 PI++F PK LG+EHF+TRKWGSR+GSGS+TP G GSRL Sbjct: 242 PILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRL 301 Query: 127 ASGTQTPNG-GI-SRLGSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVI 300 SGT TP+G G+ SRLGSG++TP+G P SRD ++ ENQISE+ASLANSDNG++ +E +I Sbjct: 302 GSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHII 361 Query: 301 NHRVSFELTGEDVPSC-REKEPIMSHSQQSLPMD-VPAATSNLLAKEMESSCSI---VKE 465 +HRVSFEL+GE+V C K P D VP K +S +E Sbjct: 362 DHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEE 421 Query: 466 KTDGLPRKASESG-EDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATG 642 ++ +P K G E++C+RKHR+IT GS K+F+FDN + EV K ++ EWW ++ G Sbjct: 422 SSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VG 480 Query: 643 KESGIQNNWTFFPVLQPGVS 702 KES NNWTFFP+LQ S Sbjct: 481 KESKPSNNWTFFPMLQSEAS 500 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 214 bits (545), Expect = 5e-53 Identities = 122/260 (46%), Positives = 160/260 (61%), Gaps = 26/260 (10%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSG------------------WGSRL 126 PI++F PK LG+EHF+TRKWGSR+GSGS+TP G GSRL Sbjct: 242 PILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRL 301 Query: 127 ASGTQTPNG-GI-SRLGSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVI 300 SGT TP+G G+ SRLGSG++TP+G P SRD ++ ENQISE+ASLANSDNG++ +E +I Sbjct: 302 GSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHII 361 Query: 301 
NHRVSFELTGEDVPSC-REKEPIMSHSQQSLPMD-VPAATSNLLAKEMESSCSI---VKE 465 +HRVSFEL+GE+V C K P D VP K +S +E Sbjct: 362 DHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEE 421 Query: 466 KTDGLPRKASESG-EDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATG 642 ++ +P K G E++C+RKHR+IT GS K+F+FDN + EV K ++ EWW ++ G Sbjct: 422 SSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANEN-VG 480 Query: 643 KESGIQNNWTFFPVLQPGVS 702 KES NNWTFFP+LQ S Sbjct: 481 KESKPSNNWTFFPMLQSEAS 500 >ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus] Length = 497 Score = 206 bits (524), Expect = 1e-50 Identities = 124/260 (47%), Positives = 154/260 (59%), Gaps = 26/260 (10%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWG--SRLASGTQTPNG-GI-SRL 168 PI+EFR + PK LG EHF+TRKW SR+GSGSLTP G G SRL SGT TP+G G+ SRL Sbjct: 244 PILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRL 303 Query: 169 GSGTVTPNGGEPPSR----------------DSYLLENQISEIASLANSDNGSEIEEGVI 300 GSG+VTPNG SR DS LL+NQISE+ASLANS+ G + + V Sbjct: 304 GSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND--VT 361 Query: 301 NHRVSFELTGEDVPSCREKEPIMSHSQQSLPMDVPAATSNLLAKEMESS------CSIVK 462 NHRVSFELTGEDV C + + S +S + P TS E + S C Sbjct: 362 NHRVSFELTGEDVARCLANKSLTSIRTES---ESPKQTSTSNQNENKESSREAETCEFFD 418 Query: 463 EKTDGLPRKASESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATG 642 KT P K + +D C++ R +T GS K+F+FD K E+ + EWW ++K Sbjct: 419 IKTSAAPEK-TPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGV 477 Query: 643 KESGIQNNWTFFPVLQPGVS 702 KE+ NNWTFFP+LQPGVS Sbjct: 478 KEASPGNNWTFFPLLQPGVS 497 >ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus] Length = 497 Score = 206 bits (524), Expect = 1e-50 Identities = 124/260 (47%), Positives = 154/260 (59%), Gaps = 26/260 (10%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWG--SRLASGTQTPNG-GI-SRL 168 PI+EFR + PK LG EHF+TRKW SR+GSGSLTP G G SRL SGT TP+G G+ SRL Sbjct: 244 
PILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRL 303 Query: 169 GSGTVTPNGGEPPSR----------------DSYLLENQISEIASLANSDNGSEIEEGVI 300 GSG+VTPNG SR DS LL+NQISE+ASLANS+ G + + V Sbjct: 304 GSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND--VT 361 Query: 301 NHRVSFELTGEDVPSCREKEPIMSHSQQSLPMDVPAATSNLLAKEMESS------CSIVK 462 NHRVSFELTGEDV C + + S +S + P TS E + S C Sbjct: 362 NHRVSFELTGEDVARCLANKSLTSIRTES---ESPKQTSTSNQNENKESSREAETCEFFD 418 Query: 463 EKTDGLPRKASESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATG 642 KT P K + +D C++ R +T GS K+F+FD K E+ + EWW ++K Sbjct: 419 IKTSAAPEK-TPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGV 477 Query: 643 KESGIQNNWTFFPVLQPGVS 702 KE+ NNWTFFP+LQPGVS Sbjct: 478 KEASPGNNWTFFPLLQPGVS 497 >ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum] gi|557114459|gb|ESQ54742.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum] Length = 489 Score = 199 bits (507), Expect = 1e-48 Identities = 120/249 (48%), Positives = 152/249 (61%), Gaps = 16/249 (6%) Frame = +1 Query: 4 IIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGG--ISRLGSG 177 IIEFR GEPPKFLG+EHF+ RKWGSR GSGS+TP+G GSRL SG TP+GG S+L SG Sbjct: 246 IIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGGLGSKLASG 305 Query: 178 TVTPNGGEPPSR---------DSYLLENQISEIASLANSDNGSEIEE---GVINHRVSFE 321 VTPNG E SR +S LL+ QISE+ASLANSD+GS + V++HRVSFE Sbjct: 306 AVTPNGAEMVSRKGSGNVTPLESSLLDCQISEVASLANSDHGSSRHDEAVAVVSHRVSFE 365 Query: 322 LTGEDVPSCREKEPIMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDGLP-RKASE 498 LTGEDV C S ++ D +N + + S + +P K S Sbjct: 366 LTGEDVARC-----FASKLNRAGLDDCLHEKANGDHTDTNEAVSPTNRWSGSVPGSKTSG 420 Query: 499 SGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGK-ESGIQNNWTF 675 E K R+I+ GSSK+F FDN K E++EK V EWW ++K GK ++ N+W+F Sbjct: 421 ETESEQSLKLRSISLGSSKEFKFDNTKEEMIEKTAVRSEWWANEKVAGKGDNSPGNSWSF 480 Query: 676 FPVLQPGVS 702 FPVL+ G S Sbjct: 481 FPVLRSGFS 489 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 
Score = 188 bits (477), Expect = 4e-45 Identities = 108/235 (45%), Positives = 139/235 (59%), Gaps = 9/235 (3%) Frame = +1 Query: 25 EPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGTVTPNGGEP 204 E PK LG+EHFSTR+WGSR LGSG++TP+G P Sbjct: 242 EAPKLLGFEHFSTRRWGSR----------------------------LGSGSLTPDGAGP 273 Query: 205 PSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSCREKEPIMS-HSQ 381 SRDS+LLENQISE+ASLANS++GS+ E VI+HRVSFEL GEDV C EK+P+ S + Sbjct: 274 ASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETV 333 Query: 382 QSLPMDVP-----AATSNLLAKEMESSCSI-VKEKTDGLPRKASESG-EDHCHRKHRNIT 540 Q+ D+ + +++ E+ C V E KAS G E+ CH+KH I Sbjct: 334 QNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIR 393 Query: 541 FGSSKDFDFDNVKIEVLEK-ECVDCEWWTSDKATGKESGIQNNWTFFPVLQPGVS 702 GS K+F+FDN K EV K + EWW ++K GK +G Q NWTFFP+LQPG+S Sbjct: 394 HGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448 >emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] Length = 385 Score = 188 bits (477), Expect = 4e-45 Identities = 108/235 (45%), Positives = 139/235 (59%), Gaps = 9/235 (3%) Frame = +1 Query: 25 EPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGTVTPNGGEP 204 E PK LG+EHFSTR+WGSR LGSG++TP+G P Sbjct: 179 EAPKLLGFEHFSTRRWGSR----------------------------LGSGSLTPDGAGP 210 Query: 205 PSRDSYLLENQISEIASLANSDNGSEIEEGVINHRVSFELTGEDVPSCREKEPIMS-HSQ 381 SRDS+LLENQISE+ASLANS++GS+ E VI+HRVSFEL GEDV C EK+P+ S + Sbjct: 211 ASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETV 270 Query: 382 QSLPMDVP-----AATSNLLAKEMESSCSI-VKEKTDGLPRKASESG-EDHCHRKHRNIT 540 Q+ D+ + +++ E+ C V E KAS G E+ CH+KH I Sbjct: 271 QNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIR 330 Query: 541 FGSSKDFDFDNVKIEVLEK-ECVDCEWWTSDKATGKESGIQNNWTFFPVLQPGVS 702 GS K+F+FDN K EV K + EWW ++K GK +G Q NWTFFP+LQPG+S Sbjct: 331 HGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 385 >ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|26449762|dbj|BAC42004.1| 
unknown protein [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1| At4g25620 [Arabidopsis thaliana] gi|332659684|gb|AEE85084.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 449 Score = 187 bits (475), Expect = 6e-45 Identities = 117/247 (47%), Positives = 146/247 (59%), Gaps = 16/247 (6%) Frame = +1 Query: 4 IIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGTV 183 IIEFR GEPPKFLG+EHF+ RKWGSR GSGS+TP+G GSRL SG TP+G S+L SG V Sbjct: 230 IIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDG--SKLTSGVV 287 Query: 184 TPNGGEPPSRDSY---------LLENQISEIASLANSDNGS---EIEEGVINHRVSFELT 327 TPNG E R SY LL++QISE+ASLANSD+GS E V+ HRVSFELT Sbjct: 288 TPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDEALVVPHRVSFELT 347 Query: 328 GEDVPSCREKEPIMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDGL-PRKASESG 504 GEDV C LA ++ S S K + L P SG Sbjct: 348 GEDVARC-------------------------LASKLNRSGSHEKASGEHLRPNCCKTSG 382 Query: 505 EDHCH--RKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGK-ESGIQNNWTF 675 E +K R+ + GS+K+F FD+ E++EK + EWW ++K GK + +N+WTF Sbjct: 383 ETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEK--IRSEWWANEKVAGKGDHSPRNSWTF 440 Query: 676 FPVLQPG 696 FPVL+ G Sbjct: 441 FPVLRSG 447 >emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|7269412|emb|CAB81372.1| putative protein [Arabidopsis thaliana] Length = 424 Score = 187 bits (475), Expect = 6e-45 Identities = 117/247 (47%), Positives = 146/247 (59%), Gaps = 16/247 (6%) Frame = +1 Query: 4 IIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTPSGWGSRLASGTQTPNGGISRLGSGTV 183 IIEFR GEPPKFLG+EHF+ RKWGSR GSGS+TP+G GSRL SG TP+G S+L SG V Sbjct: 205 IIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDG--SKLTSGVV 262 Query: 184 TPNGGEPPSRDSY---------LLENQISEIASLANSDNGS---EIEEGVINHRVSFELT 327 TPNG E R SY LL++QISE+ASLANSD+GS E V+ HRVSFELT Sbjct: 263 TPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSDHGSSRHNDEALVVPHRVSFELT 322 Query: 328 GEDVPSCREKEPIMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKTDGL-PRKASESG 504 GEDV C LA ++ S S K + L P SG Sbjct: 323 
GEDVARC-------------------------LASKLNRSGSHEKASGEHLRPNCCKTSG 357 Query: 505 EDHCH--RKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGK-ESGIQNNWTF 675 E +K R+ + GS+K+F FD+ E++EK + EWW ++K GK + +N+WTF Sbjct: 358 ETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEK--IRSEWWANEKVAGKGDHSPRNSWTF 415 Query: 676 FPVLQPG 696 FPVL+ G Sbjct: 416 FPVLRSG 422 >gb|AFK46430.1| unknown [Medicago truncatula] Length = 487 Score = 186 bits (472), Expect = 1e-44 Identities = 114/257 (44%), Positives = 147/257 (57%), Gaps = 25/257 (9%) Frame = +1 Query: 7 IEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTP--SGWGSRLASGTQTPNG--------- 153 +E RKGE PK LG+EHFSTRKW SRIGSGSLTP +G GSRL SG+ TP+G Sbjct: 243 LELRKGEAPKILGFEHFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLGS 302 Query: 154 ------GI---SRLGSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVINH 306 G+ SRLGSG++TP+G P +R S ++NQI S+ANSD+GS+ +++H Sbjct: 303 GCATPDGLGQDSRLGSGSLTPDGVGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVDH 362 Query: 307 RVSFELTGEDVPSCREKEP-----IMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKEKT 471 RVSFELTGEDV C + MS S Q + P +L KE S C + K Sbjct: 363 RVSFELTGEDVARCLANKTGALLRNMSSSSQGILAKDPIDREKIL-KETNSCCDVCSGKA 421 Query: 472 DGLPRKASESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGKES 651 G +HC K +++ SSK+F+FDN K +V WWT+ K GKES Sbjct: 422 ---------IGGEHCCPKRNSVS--SSKEFNFDNRKGDVSGTSANGSSWWTNKKVDGKES 470 Query: 652 GIQNNWTFFPVLQPGVS 702 N+W FFP+LQP +S Sbjct: 471 KSVNSWAFFPMLQPDIS 487 >ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806399 [Glycine max] Length = 515 Score = 185 bits (469), Expect = 3e-44 Identities = 112/255 (43%), Positives = 147/255 (57%), Gaps = 25/255 (9%) Frame = +1 Query: 1 PIIEFRKGEPPKFLGYEHFSTRKWGSRIGSGSLTP-SGW-GSRLASGTQTPNG------- 153 P +EF KGE PK LG EHFSTR+WGSR+GSGSLTP S W GSRL SG+ TP+G Sbjct: 261 PTLEFPKGETPKILGVEHFSTRRWGSRLGSGSLTPDSAWQGSRLGSGSLTPDGVGLASRL 320 Query: 154 --------GI---SRLGSGTVTPNGGEPPSRDSYLLENQISEIASLANSDNGSEIEEGVI 300 G+ SRLGSG +TP+ P ++++ ++NQIS+ A+LA+SDNG ++ Sbjct: 321 GSGCVTPDGLGQESRLGSGCLTPDSAGPTNQNNISVQNQISKEATLADSDNGHPSNATLV 380 Query: 301 
NHRVSFELTGEDVPSCREKEP-----IMSHSQQSLPMDVPAATSNLLAKEMESSCSIVKE 465 +HRVSFELTGEDV C + MS S Q + P + + SSC+ E Sbjct: 381 DHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILTKDPVDRERVQI-DTNSSCNACTE 439 Query: 466 KTDGLPRKASESGEDHCHRKHRNITFGSSKDFDFDNVKIEVLEKECVDCEWWTSDKATGK 645 KTD P GE H+++ + SSK+F+FDN K +V EWWT+ K GK Sbjct: 440 KTDDKPDNPVGKGEQCLHKQN---SVNSSKEFNFDNRKGDVSVTTGSGYEWWTNRKVAGK 496 Query: 646 ESGIQNNWTFFPVLQ 690 E N+W FFP+LQ Sbjct: 497 EGRSANSWAFFPMLQ 511