BLASTX nr result
ID: Mentha25_contig00013446
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00013446
         (1135 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...  322   1e-85
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...  322   2e-85
emb|CBI26785.3| unnamed protein product [Vitis vinifera]             317   8e-84
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]  310   7e-82
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]    310   9e-82
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...  306   1e-80
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...  303   9e-80
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...  303   1e-79
ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family prot...  301   3e-79
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...  301   3e-79
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...  299   1e-78
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...  299   2e-78
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...  296   8e-78
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...  296   1e-77
ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citr...  296   1e-77
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...  293   7e-77
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...  290   6e-76
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...
290 1e-75 gb|ABK95394.1| unknown [Populus trichocarpa] 290 1e-75 ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210... 286 1e-74 >ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera] Length = 698 Score = 322 bits (826), Expect = 1e-85 Identities = 184/397 (46%), Positives = 231/397 (58%), Gaps = 24/397 (6%) Frame = +2 Query: 14 KSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA-------EKSNLEVSPKSFVA 172 KS EN +R ++ T+A+ D +D GS ++ A EK N SPK+FV Sbjct: 221 KSSENSEGSRCGISE---TEAN-DMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVG 276 Query: 173 TEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMKG 352 TE +DGK+VNV +GLK+YE+LFDDS++ K +LV+DLRAAGKRGQLQGQTFV KRPMKG Sbjct: 277 TEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKG 336 Query: 353 HGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAII 532 HGREMIQLG+PIADAP EDE G S+D + E IP LQDVI L+ V+++KPD+ II Sbjct: 337 HGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACII 396 Query: 533 DIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVI 712 D +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD PG+Y ++ Sbjct: 397 DFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLL 456 Query: 713 SMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---XXXXXXXXXXXSRTP 883 MQG+SADFA+HAIPSL+KQRILVT KSQ +K + + R SR+P Sbjct: 457 VMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSP 516 Query: 884 GQIR-PAPAKHFXXXXXXXXXXXXXXRQ-------------XVATPVAPGIAYPAAVXXX 1021 +R P KH+ V T VAP + +PA V Sbjct: 517 NHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLP 576 Query: 1022 XXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 577 TGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSS 613 >ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] gi|462422058|gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] Length = 650 Score = 322 bits (824), Expect = 2e-85 Identities = 171/346 (49%), Positives = 213/346 (61%), Gaps = 16/346 (4%) Frame = +2 Query: 131 
EKSNLEVSPKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQL 310 +K NL + PK+F+ E DGK+VNV +GLK+YED D+++ KL +LV+DLRAAGKR QL Sbjct: 217 QKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQL 276 Query: 311 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 490 QGQT+V KRPMKGHGREMIQLGIPIADAPPEDE++AG S+D KIEPIP LQDVI+RL+ Sbjct: 277 QGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKDRKIEPIPSLLQDVIDRLV 336 Query: 491 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNY 670 +V+++KPDS IID++NEGDHSQPH WP WFGRPVC + LT C+M+FG+++ D PG+Y Sbjct: 337 GMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLTECDMTFGRLLLMDHPGDY 396 Query: 671 XXXXXXXXXXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---- 838 ++ MQG+SADFA+HAIPS++KQRILVTL KSQ +K + RF Sbjct: 397 RGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTKSQPKKSTTSDGQRFPAPA 456 Query: 839 XXXXXXXXXXXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPV 982 SR+P IR P KH+ R V PV Sbjct: 457 PAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPIRSQLPPQNGIQPLFVPAPV 516 Query: 983 APGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQG 1120 P I + AAV GTGVFLP G G Sbjct: 517 GPAIPFAAAV-PIPPGSAGWPAAPRHPPPRIPLPGTGVFLPPPGSG 561 >emb|CBI26785.3| unnamed protein product [Vitis vinifera] Length = 672 Score = 317 bits (811), Expect = 8e-84 Identities = 184/404 (45%), Positives = 231/404 (57%), Gaps = 31/404 (7%) Frame = +2 Query: 14 KSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGS-------------GLAEKSNLEVS 154 KS EN +R ++ T+A+ D +D G+ + GS EK N S Sbjct: 221 KSSENSEGSRCGISE---TEAN-DMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTS 276 Query: 155 PKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQ-GQTFVA 331 PK+FV TE +DGK+VNV +GLK+YE+LFDDS++ K +LV+DLRAAGKRGQLQ GQTFV Sbjct: 277 PKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVV 336 Query: 332 LKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSI 511 KRPMKGHGREMIQLG+PIADAP EDE G S+D + E IP LQDVI L+ V+++ Sbjct: 337 SKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTV 396 Query: 512 
KPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXX 691 KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD PG+Y Sbjct: 397 KPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLS 456 Query: 692 XXXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---XXXXXXXX 862 ++ MQG+SADFA+HAIPSL+KQRILVT KSQ +K + + R Sbjct: 457 LVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWV 516 Query: 863 XXXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-------------XVATPVAPGIAY 1000 SR+P +R P KH+ V T VAP + + Sbjct: 517 PPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPF 576 Query: 1001 PAAVXXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 PA V GTGVFLP G G +S Sbjct: 577 PAPVPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSS 620 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 310 bits (794), Expect = 7e-82 Identities = 170/355 (47%), Positives = 210/355 (59%), Gaps = 21/355 (5%) Frame = +2 Query: 131 EKSNLEVSPKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQL 310 EK N SPK+FV TE +DGK+VNV +GLK+YE+LFDDS++ K +LV+DLRAAGKRGQL Sbjct: 272 EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 331 Query: 311 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASR----DPKIEPIPVALQDVI 478 QGQTFV KRPMKGHGREMIQLG+PIADAP EDE G S+ + + E IP LQDVI Sbjct: 332 QGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVI 391 Query: 479 ERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADT 658 +L+ V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD Sbjct: 392 GQLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADH 451 Query: 659 PGNYXXXXXXXXXXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF 838 PG+Y ++ MQG+SADFA+HAIPSL+KQRILVT KSQ +K + R Sbjct: 452 PGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRL 511 Query: 839 ---XXXXXXXXXXXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-------------X 967 SR+P +R P KH+ Sbjct: 512 LPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLF 571 Query: 968 VATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 V T VAP + +PA GTGVFLP G 
G +S Sbjct: 572 VTTAVAPAMPFPAPXPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSS 626 >gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 310 bits (793), Expect = 9e-82 Identities = 177/398 (44%), Positives = 229/398 (57%), Gaps = 24/398 (6%) Frame = +2 Query: 14 KSEENGSATRQRSTQGDVT--QADVDAEDTG--SSSVDGSGLA-----EKSNLEVSPKSF 166 KS+E+G+ + +G V+ + +V A D G SSS + + E SNL PK+F Sbjct: 201 KSQEDGNVKSLGNFEGVVSGSEPEVHAVDDGCTSSSKENDSHSTPKQNENSNLANVPKTF 260 Query: 167 VATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPM 346 E +DGK VNV EGLK+YE+ D+++ KL LV+DLR+AG+RG Q QT+V KRPM Sbjct: 261 SGNEMFDGKPVNVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPM 320 Query: 347 KGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSA 526 KGHGRE IQLG+PIADAP EDE++AG +D + E IP LQDV ERL++ V ++KPDS Sbjct: 321 KGHGREKIQLGLPIADAPVEDEISAGTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSC 380 Query: 527 IIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXX 706 IID +NEGDHSQPH+WP WFGRPVCV+ LT C+M+FG+V A D PG+Y Sbjct: 381 IIDFYNEGDHSQPHLWPSWFGRPVCVLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGS 440 Query: 707 VISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXXXS 874 +++MQG+SADFA+HAIPSL++QRILVT KSQ +K + + R S Sbjct: 441 LLAMQGKSADFAKHAIPSLRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGPQPS 500 Query: 875 RTPGQIRPAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAAVXXX 1021 R+P IR KH+ R V PVAP + +PA V Sbjct: 501 RSPNHIRHPGPKHYAPVPTTGVLQASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIP 560 Query: 1022 XXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNST 1135 GTGVFLP G G GNS+ Sbjct: 561 PSSSGWSAAPPRHPPPRLPVPGTGVFLPPPGSG-GNSS 597 >ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|590697545|ref|XP_007045470.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein 
family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 306 bits (784), Expect = 1e-80 Identities = 180/399 (45%), Positives = 228/399 (57%), Gaps = 23/399 (5%) Frame = +2 Query: 5 FNEKSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSPKSF 166 F E ++ GS + GD D +SS + L EK NL PK+F Sbjct: 209 FTEDKKDTGS----KPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTF 264 Query: 167 VATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPM 346 V E +DGK VNV +GLK+YE+LFDD ++L L +LV+DLRAAGKRGQLQGQT+VA KRPM Sbjct: 265 VGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPM 324 Query: 347 KGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSA 526 KGHGREMIQLG+PIADAP +DE AAG S+D +IE IP LQD IERL+ V+++KPDS Sbjct: 325 KGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSC 384 Query: 527 IIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXXXXXX 703 IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y Sbjct: 385 IIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPG 444 Query: 704 XVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXXX 871 ++ MQG+SADFA+HA+PS++KQRILVT K K + R Sbjct: 445 SLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPP 504 Query: 872 SRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAAVX 1015 SR+P +IR A KH+ R V T VAP I++PA V Sbjct: 505 SRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV- 563 Query: 1016 XXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 564 PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSS 602 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. 
vesca] Length = 682 Score = 303 bits (776), Expect = 9e-80 Identities = 172/397 (43%), Positives = 226/397 (56%), Gaps = 22/397 (5%) Frame = +2 Query: 8 NEKSEENGSATRQRSTQGDVTQADVDAEDTGSSSV---DGSGLA---EKSNLEVSPKSFV 169 +E SA Q + G+ D + +SS+ + + + EK NL + PK+FV Sbjct: 203 HEYISSRSSANSQGTISGNSESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFV 262 Query: 170 ATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMK 349 E +DGK+VNV +GLK+YE+ D+++ KL +LV+DLR G+RGQLQGQT+V KRPMK Sbjct: 263 GNETFDGKTVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMK 322 Query: 350 GHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAI 529 GHGREMIQLGIPIAD P EDE++AG S+D ++E IP LQDVI+RL+ V++ KPDS I Sbjct: 323 GHGREMIQLGIPIADGPQEDEISAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCI 382 Query: 530 IDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXV 709 ID FNEGDHS PH+WP WFGRPV V+ LT C+++FGKV+ D PG+Y + Sbjct: 383 IDFFNEGDHSHPHMWPPWFGRPVSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSL 442 Query: 710 ISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRFXXXXXXXXXXXS----R 877 + +QG+SAD+A+HAIPS++KQRILVT KSQ RK + R S R Sbjct: 443 LLLQGKSADYAKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPGR 502 Query: 878 TPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAAVXXX 1021 +P IR PA KH+ R VA PV P + +PA V Sbjct: 503 SPNHIRHPAGPKHYAAVPTTGVLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPV-VI 561 Query: 1022 XXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G ++ Sbjct: 562 PPGSPGWVAAPRHPPPRMPLPGTGVFLPPPGSGSSSA 598 >ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis] gi|223533099|gb|EEF34858.1| conserved hypothetical protein [Ricinus communis] Length = 697 Score = 303 bits (775), Expect = 1e-79 Identities = 168/393 (42%), Positives = 225/393 (57%), Gaps = 18/393 (4%) Frame = +2 Query: 8 NEKSEEN--GSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLAEKSNLEVSPKSFVATEF 181 N KS N GS + T+ + ++ S + + K NL +PK+FV E Sbjct: 228 NLKSSGNSEGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIV--KLNLTTTPKTFVGAEM 285 Query: 182 
YDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGHGR 361 DGKSVNV +GLK+YE L DD ++ KL +LV+DLRAAG++GQ QGQ +V KRPMKGHGR Sbjct: 286 VDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPMKGHGR 345 Query: 362 EMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIF 541 EMIQLG+PIADAP E+E AAG S+D KIE IP LQ+VIER ++ ++++KPDS IIDI+ Sbjct: 346 EMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSCIIDIY 405 Query: 542 NEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVISMQ 721 NEGDHSQPH+WP WFG+P+ V+ LT C+++FG+VI AD PG+Y ++ MQ Sbjct: 406 NEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGSLLVMQ 465 Query: 722 GRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXXXSRTPGQ 889 G++ DFA+HAIP+++KQR+L+T KSQ +K V + R SR+P Sbjct: 466 GKATDFAKHAIPAIRKQRVLLTFTKSQPKKFVQSDGQRLTSPAASPSSHWGPPPSRSPNH 525 Query: 890 IRPAPAKHFXXXXXXXXXXXXXXRQXVA-----------TPVAPGIAYPAAV-XXXXXXX 1033 IR +KH+ R +A PVA + +PA V Sbjct: 526 IRHPVSKHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVPMPPVSTG 585 Query: 1034 XXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 586 WPAAPRHPPNRLPVPVPGTGVFLPPPGSGNASS 618 >ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5 [Theobroma cacao] gi|508709406|gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5 [Theobroma cacao] Length = 572 Score = 301 bits (772), Expect = 3e-79 Identities = 180/400 (45%), Positives = 228/400 (57%), Gaps = 24/400 (6%) Frame = +2 Query: 5 FNEKSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSPKSF 166 F E ++ GS + GD D +SS + L EK NL PK+F Sbjct: 100 FTEDKKDTGS----KPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTF 155 Query: 167 VATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQ-GQTFVALKRP 343 V E +DGK VNV +GLK+YE+LFDD ++L L +LV+DLRAAGKRGQLQ GQT+VA KRP Sbjct: 156 VGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRP 215 Query: 344 MKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDS 523 MKGHGREMIQLG+PIADAP +DE AAG S+D +IE IP LQD IERL+ V+++KPDS Sbjct: 216 
MKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDS 275 Query: 524 AIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXXXXX 700 IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y Sbjct: 276 CIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAP 335 Query: 701 XXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXX 868 ++ MQG+SADFA+HA+PS++KQRILVT K K + R Sbjct: 336 GSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPP 395 Query: 869 XSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAAV 1012 SR+P +IR A KH+ R V T VAP I++PA V Sbjct: 396 PSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV 455 Query: 1013 XXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 456 -PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSS 494 >ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|590697542|ref|XP_007045469.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 301 bits (772), Expect = 3e-79 Identities = 180/400 (45%), Positives = 228/400 (57%), Gaps = 24/400 (6%) Frame = +2 Query: 5 FNEKSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSPKSF 166 F E ++ GS + GD D +SS + L EK NL PK+F Sbjct: 209 FTEDKKDTGS----KPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTF 264 Query: 167 VATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQ-GQTFVALKRP 343 V E +DGK VNV +GLK+YE+LFDD ++L L +LV+DLRAAGKRGQLQ GQT+VA KRP Sbjct: 265 VGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRP 324 Query: 344 MKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDS 523 MKGHGREMIQLG+PIADAP +DE AAG S+D +IE IP LQD IERL+ V+++KPDS Sbjct: 325 MKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDS 384 Query: 524 
AIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXXXXX 700 IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y Sbjct: 385 CIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAP 444 Query: 701 XXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXX 868 ++ MQG+SADFA+HA+PS++KQRILVT K K + R Sbjct: 445 GSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPP 504 Query: 869 XSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAAV 1012 SR+P +IR A KH+ R V T VAP I++PA V Sbjct: 505 PSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPV 564 Query: 1013 XXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 565 -PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSS 603 >ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum lycopersicum] Length = 641 Score = 299 bits (766), Expect = 1e-78 Identities = 177/404 (43%), Positives = 225/404 (55%), Gaps = 26/404 (6%) Frame = +2 Query: 2 EFNEKSEENGSA-----TRQRSTQGDVTQADV--DAEDTGSSSVDGSGLA-----EKSNL 145 E K E N S T +QG+V + D D+ GSS+V+ + EK N Sbjct: 191 ELAAKPEANSSVKGSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVESESHSFQIPTEKQN- 249 Query: 146 EVSPKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTF 325 V PK+FVATE YDGK VNV +G+K+YE+L S++ KL LV+DLRAAG+RGQL Q F Sbjct: 250 -VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLVTLVNDLRAAGRRGQLPAQAF 308 Query: 326 VALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVV 505 + KRPMKGHGREM+QLG+PI DAPPE+E A +D K E IP LQDVI++L + Sbjct: 309 IVSKRPMKGHGREMVQLGLPIVDAPPEEESAISTYKDRKTEAIPGLLQDVIDQLSAMQAL 368 Query: 506 SIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXX 685 S+KPD+ +IDIFNEGDHSQPH+WP W+GRP+ + LT CEM+FGKVI D PG+Y Sbjct: 369 SVKPDACVIDIFNEGDHSQPHLWPYWYGRPISTLFLTDCEMTFGKVIGVDHPGDYRGSLK 428 Query: 686 XXXXXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---XXXXXX 856 V+ MQGRS +FA++AIPS++KQR+LVT K Q R+I G+ RF Sbjct: 429 LSLAPGSVLVMQGRSTEFAKYAIPSIRKQRMLVTFTKLQLRRIKSGDSQRFPSSAGGPVS 488 Query: 857 XXXXXSRTPGQI-RPAPAKHFXXXXXXXXXXXXXXR--------QXVATP--VAPGIAYP 
1003 SR+ I RP KH+ R Q + P VAP + +P Sbjct: 489 QWVPPSRSSNHIRRPFGPKHYGSMPATGVLPIPGVRPQFAPANMQPIFVPATVAPAMPFP 548 Query: 1004 AAVXXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNST 1135 A V GTGVFLP PG+ T Sbjct: 549 APVALPPASAGWAVPPIRHPPPRLPLPGTGVFLP-----PGSGT 587 >ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum] Length = 638 Score = 299 bits (765), Expect = 2e-78 Identities = 173/394 (43%), Positives = 222/394 (56%), Gaps = 21/394 (5%) Frame = +2 Query: 17 SEENGSATRQRSTQGDVTQADV--DAEDTGSSSVDGSGLA-----EKSNLEVSPKSFVAT 175 S ++ T +QG+V + D D+ GSS+V+ + EK N V PK+FVAT Sbjct: 198 SVKSSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVESESHSIQVPTEKQN--VVPKTFVAT 255 Query: 176 EFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGH 355 E YDGK VNV +G+K+YE+L S++ KL LV+DLRAAG+RGQL Q F+ KRPMKGH Sbjct: 256 EIYDGKPVNVVDGMKLYEELLSSSEVSKLLTLVNDLRAAGRRGQLPAQAFIVSKRPMKGH 315 Query: 356 GREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIID 535 GREM+QLG+PI DAPPE+E A +D K E IP QDVI++L +S+KPD+ +ID Sbjct: 316 GREMVQLGLPIVDAPPEEEAAISTYKDRKTEAIPGLFQDVIDQLSAMQALSVKPDACVID 375 Query: 536 IFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVIS 715 IFNEGDHSQPH+WP W+GRP+ ++ LT CEM+FGKVI D PG+Y V+ Sbjct: 376 IFNEGDHSQPHLWPYWYGRPISMLFLTDCEMTFGKVIGVDHPGDYRGSLKLSLAPGSVLV 435 Query: 716 MQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---XXXXXXXXXXXSRTPG 886 MQGRS +FA++AIPS +KQRILVT K Q R+I + RF SR+P Sbjct: 436 MQGRSTEFAKYAIPSTRKQRILVTFTKLQLRRIKSADSQRFPSSAGGPVSQWVPPSRSPN 495 Query: 887 QI-RPAPAKHFXXXXXXXXXXXXXXR--------QXVATP--VAPGIAYPAAVXXXXXXX 1033 I RP KH+ R Q + P VAP + +PA V Sbjct: 496 HIRRPFGPKHYGSMSTTGVLPIPGVRPQFAPANMQPIFVPATVAPAMPFPAPVALPPASA 555 Query: 1034 XXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNST 1135 GTGVFLP PG+ T Sbjct: 556 GWAVPPLRHPPPRLPLPGTGVFLP-----PGSGT 584 >ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis] Length = 627 Score = 296 bits (759), Expect = 8e-78 Identities = 173/405 (42%), Positives = 228/405 (56%), Gaps = 32/405 (7%) Frame = 
+2 Query: 14 KSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGS--GLAE-----------KSNLEVS 154 K+ ++GSA +++ +TQ DAE + DG GL E K N ++ Sbjct: 144 KAHDDGSAKSLGNSE--ITQVG-DAEPKAEALDDGCTPGLKENDSQSVQSQNEKQNQSMA 200 Query: 155 PKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVAL 334 KSFV TE DGK VNV +GLK+YE++ +S++ KL +LV+DLR AGKRGQ+QG +V Sbjct: 201 AKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVS 260 Query: 335 KRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIK 514 KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IEPIP LQDVI+RL+ ++++K Sbjct: 261 KRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQIMTVK 320 Query: 515 PDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXX 694 PDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+M+FG++I D PG+Y Sbjct: 321 PDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTLRLSV 380 Query: 695 XXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXX 862 ++ MQG+SAD A+HAI S++KQRILVT KSQ +K+ + R Sbjct: 381 APGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPGIAPSPHWG 440 Query: 863 XXXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPA 1006 R P IR P KHF R V+ PV P + +PA Sbjct: 441 LPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIPPTNGVPPIFVSPPVTPAMPFPA 500 Query: 1007 AV---XXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 V GTGVFLP G G +S Sbjct: 501 PVPIPPGSTGWTAAPPRHTPPPPPRLPVPGTGVFLPPPGSGGSSS 545 >ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] gi|557550702|gb|ESR61331.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] Length = 635 Score = 296 bits (758), Expect = 1e-77 Identities = 172/405 (42%), Positives = 224/405 (55%), Gaps = 32/405 (7%) Frame = +2 Query: 14 KSEENGSAT---RQRSTQGDVTQADVDAEDTG---------SSSVDGSGLAEKSNLEVSP 157 K+ ++GSA TQ + +A D G S SV EK N ++ Sbjct: 151 KAHDDGSAKSLGNSEITQVGDAEPKAEALDDGCTPSLKENDSQSVQSQN--EKQNQSMAA 208 Query: 158 KSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALK 337 KSFV TE DGK VNV +GLK+YE++ +S++ KL +LV+DLR AGKRGQ+QG +V K Sbjct: 209 
KSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSK 268 Query: 338 RPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKP 517 RP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IEPIP LQDVI+RL+ ++++KP Sbjct: 269 RPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQIMTVKP 328 Query: 518 DSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXX 697 DS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+M+FG++I D PG+Y Sbjct: 329 DSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTLRLSVA 388 Query: 698 XXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXX 865 ++ MQG+SAD A+HAI S++KQRILVT KSQ +K+ + R Sbjct: 389 PGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPGIAPSPHWGP 448 Query: 866 XXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAA 1009 R P IR P KHF R V+ PV P + +PA Sbjct: 449 PPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIPPTNGVPPIFVSPPVTPAMPFPAP 508 Query: 1010 V----XXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 V GTGVFLP G G +S Sbjct: 509 VPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFLPPPGSGGSSS 553 >ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] gi|557550701|gb|ESR61330.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] Length = 486 Score = 296 bits (758), Expect = 1e-77 Identities = 172/405 (42%), Positives = 224/405 (55%), Gaps = 32/405 (7%) Frame = +2 Query: 14 KSEENGSAT---RQRSTQGDVTQADVDAEDTG---------SSSVDGSGLAEKSNLEVSP 157 K+ ++GSA TQ + +A D G S SV EK N ++ Sbjct: 2 KAHDDGSAKSLGNSEITQVGDAEPKAEALDDGCTPSLKENDSQSVQSQN--EKQNQSMAA 59 Query: 158 KSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALK 337 KSFV TE DGK VNV +GLK+YE++ +S++ KL +LV+DLR AGKRGQ+QG +V K Sbjct: 60 KSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSK 119 Query: 338 RPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKP 517 RP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IEPIP LQDVI+RL+ ++++KP Sbjct: 120 RPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQIMTVKP 179 Query: 518 DSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXX 697 DS I+D+FNEGDHSQPHI P 
WFGRPVC++ LT C+M+FG++I D PG+Y Sbjct: 180 DSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTLRLSVA 239 Query: 698 XXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXX 865 ++ MQG+SAD A+HAI S++KQRILVT KSQ +K+ + R Sbjct: 240 PGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPGIAPSPHWGP 299 Query: 866 XXSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXRQ-----------XVATPVAPGIAYPAA 1009 R P IR P KHF R V+ PV P + +PA Sbjct: 300 PPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIPPTNGVPPIFVSPPVTPAMPFPAP 359 Query: 1010 V----XXXXXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 V GTGVFLP G G +S Sbjct: 360 VPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFLPPPGSGGSSS 404 >ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine max] Length = 681 Score = 293 bits (751), Expect = 7e-77 Identities = 160/393 (40%), Positives = 221/393 (56%), Gaps = 24/393 (6%) Frame = +2 Query: 14 KSEENGSATRQRSTQGDVTQADVDA--------EDTGSSSVDGSGLAEKSNLEVSPKSFV 169 K + +GS RST+G ++ + +A G S + +L K+F+ Sbjct: 212 KHQTDGSLKSTRSTEGSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFI 271 Query: 170 ATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQG-QTFVALKRPM 346 E +DGK VNV +GLK+YEDLFD ++I L +LV+DLR +GK+GQLQG Q ++ +RPM Sbjct: 272 GNEMFDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPM 331 Query: 347 KGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSA 526 KGHGREMIQLG+PIADAP E E GAS+D +EPIP QD+IER+++ V+++KPD Sbjct: 332 KGHGREMIQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCC 391 Query: 527 IIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXX 706 I+D +NEGDHSQPH WP W+GRPV ++ LT CEM+FG+VIA++ PG+Y Sbjct: 392 IVDFYNEGDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGS 451 Query: 707 VISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF--XXXXXXXXXXXSRT 880 ++ M+G+S+DFA+HA+PS++KQRILVT KSQ RK + + R SR+ Sbjct: 452 LLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSSHWGPLPSRS 511 Query: 881 PGQIR-PAPAKHFXXXXXXXXXXXXXXRQXVA-----------TPVAPGIAYPAAV-XXX 1021 P +R +KH+ R +A PV P + +PA V Sbjct: 512 
PNHVRHHVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPP 571 Query: 1022 XXXXXXXXXXXXXXXXXXXXXGTGVFLPSQGQG 1120 GTGVFLP G G Sbjct: 572 GSTGWTGAPPPRHPPPRVPAPGTGVFLPPPGSG 604 >ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine max] Length = 641 Score = 290 bits (743), Expect = 6e-76 Identities = 161/386 (41%), Positives = 219/386 (56%), Gaps = 16/386 (4%) Frame = +2 Query: 11 EKSEENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLAEKSNLEVSPKSFVATEFYDG 190 EKSEE+ S + GD A + + G S + +L K+F+ E +DG Sbjct: 181 EKSEEHKSGGKVEKV-GDKGLASAE-DKKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDG 238 Query: 191 KSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQG-QTFVALKRPMKGHGREM 367 K VNV +GLK+YEDLFD ++I L +LV+DLR +GK+GQLQG Q ++ +RPMKGHGREM Sbjct: 239 KMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREM 298 Query: 368 IQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNE 547 IQLG+PIADAP E E GAS+D +EPIP QD+IER+++ V+++KPD I+D +NE Sbjct: 299 IQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNE 358 Query: 548 GDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVISMQGR 727 GDHSQPH WP W+GRPV ++ LT CEM+FG+VIA++ PG+Y ++ M+G+ Sbjct: 359 GDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGK 418 Query: 728 SADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF--XXXXXXXXXXXSRTPGQIR-P 898 S+DFA+HA+PS++KQRILVT KSQ RK + + R SR+P +R Sbjct: 419 SSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSSHWGPLPSRSPNHVRHH 478 Query: 899 APAKHFXXXXXXXXXXXXXXRQXVA-----------TPVAPGIAYPAAV-XXXXXXXXXX 1042 +KH+ R +A PV P + +PA V Sbjct: 479 VGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTG 538 Query: 1043 XXXXXXXXXXXXXXGTGVFLPSQGQG 1120 GTGVFLP G G Sbjct: 539 APPPRHPPPRVPAPGTGVFLPPPGSG 564 >ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] gi|550333016|gb|ERP57586.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] Length = 693 Score = 290 bits (741), Expect = 1e-75 Identities = 163/391 (41%), Positives = 218/391 (55%), Gaps = 19/391 (4%) Frame = +2 
Query: 17 SEENGSATRQRSTQGDVTQADVDAEDTG--SSSVDGSGLAEKSNLEVSPKSFVATEFYDG 190 + +N S Q + G+ VD + S S + EK NL ++PK+FVA E DG Sbjct: 225 NHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDG 284 Query: 191 KSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMI 370 + VNV +GLK+YE+L D ++ KL +LV++LRA G+RGQ QGQT++ KRPMKGHGREMI Sbjct: 285 QMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMI 344 Query: 371 QLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEG 550 QLG+PIADAP EDE A G S++ ++E IP LQDVIE + V+++KPDS IIDI+NEG Sbjct: 345 QLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEG 404 Query: 551 DHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVISMQGRS 730 DHSQPH+WP WFG+PV V+ LT CE++FGKVI G+Y ++ MQG+S Sbjct: 405 DHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKS 464 Query: 731 ADFARHAIPSLQKQRILVTLVKSQSRKIVGGE----PHRFXXXXXXXXXXXSRTPGQIRP 898 +D A+HAIP ++KQR+LVT KSQ +K+ + P SR+P +R Sbjct: 465 SDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRH 524 Query: 899 APAKHFXXXXXXXXXXXXXXRQXV-----------ATPVAPGIAYPAAV--XXXXXXXXX 1039 KH+ R + TPVA + +PA V Sbjct: 525 PVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPT 584 Query: 1040 XXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 585 SSPRHPSARLPVPIPGTGVFLPPPGSGNASS 615 >gb|ABK95394.1| unknown [Populus trichocarpa] Length = 694 Score = 290 bits (741), Expect = 1e-75 Identities = 163/391 (41%), Positives = 218/391 (55%), Gaps = 19/391 (4%) Frame = +2 Query: 17 SEENGSATRQRSTQGDVTQADVDAEDTG--SSSVDGSGLAEKSNLEVSPKSFVATEFYDG 190 + +N S Q + G+ VD + S S + EK NL ++PK+FVA E DG Sbjct: 226 NHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDG 285 Query: 191 KSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMI 370 + VNV +GLK+YE+L D ++ KL +LV++LRA G+RGQ QGQT++ KRPMKGHGREMI Sbjct: 286 QMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMI 345 Query: 371 QLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEG 550 QLG+PIADAP EDE A G S++ ++E IP LQDVIE + 
V+++KPDS IIDI+NEG Sbjct: 346 QLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEG 405 Query: 551 DHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXXVISMQGRS 730 DHSQPH+WP WFG+PV V+ LT CE++FGKVI G+Y ++ MQG+S Sbjct: 406 DHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKS 465 Query: 731 ADFARHAIPSLQKQRILVTLVKSQSRKIVGGE----PHRFXXXXXXXXXXXSRTPGQIRP 898 +D A+HAIP ++KQR+LVT KSQ +K+ + P SR+P +R Sbjct: 466 SDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRH 525 Query: 899 APAKHFXXXXXXXXXXXXXXRQXV-----------ATPVAPGIAYPAAV--XXXXXXXXX 1039 KH+ R + TPVA + +PA V Sbjct: 526 PVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPT 585 Query: 1040 XXXXXXXXXXXXXXXGTGVFLPSQGQGPGNS 1132 GTGVFLP G G +S Sbjct: 586 SSPRHPSARLPVPIPGTGVFLPPPGSGNASS 616 >ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus] gi|449481289|ref|XP_004156139.1| PREDICTED: uncharacterized LOC101210274 [Cucumis sativus] Length = 684 Score = 286 bits (732), Expect = 1e-74 Identities = 134/234 (57%), Positives = 178/234 (76%) Frame = +2 Query: 134 KSNLEVSPKSFVATEFYDGKSVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLQ 313 K +P++FVA+E +DGK VNV +GLK++E+L DD+++ KL +LV+DLRA+GKRGQ Q Sbjct: 261 KQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQ 320 Query: 314 GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLT 493 GQT+V KRPMKGHGREMIQLG PIADAP ED+ + G S+D +IEPIP LQD+I+RL+ Sbjct: 321 GQTYVVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDRRIEPIPSLLQDLIDRLVG 380 Query: 494 KNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYX 673 V+++KPDS IID +NEGDHSQPH+WP WFGRPV V+ LT CE++FG+VI D GNY Sbjct: 381 DQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTECEITFGRVIGTDHSGNYR 440 Query: 674 XXXXXXXXXXXVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHR 835 ++ +QG+SADFA+HA+P+++KQRILVTL KSQ ++ + R Sbjct: 441 GAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKSQPKRAAPADGQR 494