BLASTX nr result
ID: Atropa21_contig00022001
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00022001 (717 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600... 342 9e-92
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261... 333 3e-89
gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus pe... 203 5e-50
emb|CBI26785.3| unnamed protein product [Vitis vinifera] 200 3e-49
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252... 200 4e-49
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309... 192 1e-46
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr... 188 2e-45
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618... 187 2e-45
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] 186 7e-45
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ... 184 3e-44
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] 182 1e-43
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ... 181 3e-43
gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, ... 174 2e-41
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu... 170 5e-40
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm... 170 5e-40
gb|ABK95394.1| unknown [Populus trichocarpa] 169 8e-40
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781... 166 7e-39
ref|XP_004513242.1| PREDICTED: uncharacterized protein LOC101506... 166 7e-39
ref|XP_004513244.1| PREDICTED: uncharacterized protein LOC101507...
165 1e-38 ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr... 165 2e-38 >ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum] Length = 638 Score = 342 bits (876), Expect = 9e-92 Identities = 185/253 (73%), Positives = 195/253 (77%), Gaps = 15/253 (5%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG--------------GKMXXXXXXXXXXX 140 VLHMQQY SVAEVIYSLHQVEW KQQKGFDGG G Sbjct: 98 VLHMQQYHSVAEVIYSLHQVEWMKQQKGFDGGVKKVEKRNGSRGGGGGWKSEGLKDGKES 157 Query: 141 XXXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVI-EVGDSQNE 317 K NGV KIDVV VKQGE KEL NPE N+S+KSSV E GDSQ E Sbjct: 158 QGQNFSLDAHSKTNGVEKIDVVE----VKQGEKKELAANPEANSSVKSSVCTEAGDSQGE 213 Query: 318 VDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE 497 VDKTDDKRDSNS+GSS VE+ESHS+QVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE Sbjct: 214 VDKTDDKRDSNSEGSSNVESESHSIQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE 273 Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677 ELLS+SEVSKL+TLVNDLR AGRRGQLPAQ FIVSKRPMKGHGREM+QLGLPIVDAPPE+ Sbjct: 274 ELLSSSEVSKLLTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEE 333 Query: 678 EAAIATYKDRKTE 716 EAAI+TYKDRKTE Sbjct: 334 EAAISTYKDRKTE 346 >ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum lycopersicum] Length = 641 Score = 333 bits (854), Expect = 3e-89 Identities = 180/254 (70%), Positives = 192/254 (75%), Gaps = 16/254 (6%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---------------GKMXXXXXXXXXX 137 VLHMQQY SVAEVIYSLHQVEW KQQKGFDGG G Sbjct: 100 VLHMQQYHSVAEVIYSLHQVEWMKQQKGFDGGVNKVGKRNGSKGGGGGGWKSEGLKDGKE 159 Query: 138 XXXXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVI-EVGDSQN 314 K NGV KIDVV + KQG+ KEL PE N+S+K SV E GDSQ Sbjct: 160 SQGQNFSLDAHSKTNGVEKIDVVEE----KQGDKKELAAKPEANSSVKGSVCTEAGDSQG 215 Query: 315 EVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY 494 EVDKTDDKRDSNS+GSS VE+ESHS Q+PTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY Sbjct: 216 
EVDKTDDKRDSNSEGSSNVESESHSFQIPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY 275 Query: 495 EELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPE 674 EELLS+SEVSKLVTLVNDLR AGRRGQLPAQ FIVSKRPMKGHGREM+QLGLPIVDAPPE Sbjct: 276 EELLSSSEVSKLVTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPE 335 Query: 675 DEAAIATYKDRKTE 716 +E+AI+TYKDRKTE Sbjct: 336 EESAISTYKDRKTE 349 >gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] Length = 650 Score = 203 bits (516), Expect = 5e-50 Identities = 124/253 (49%), Positives = 152/253 (60%), Gaps = 15/253 (5%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEVIY+L V WR+QQ+ +D G K Sbjct: 93 VLHMQQYFSVAEVIYALQHVAWRRQQRYYDPVKAGAK---------------------EF 131 Query: 174 KVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMK----------SSVIEVGDSQNEVD 323 K +GV + A K+G L + + NS S V E + EV Sbjct: 132 KRSGVGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFERGSEVGEEVEPGGEVG 191 Query: 324 KTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYE 497 K +DK + + G V NESHS+Q+ +KQN +VPKTF+ EI DGK VNVVDG+KLYE Sbjct: 192 KLNDKGLAPA-GEKKV-NESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYE 249 Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677 + L ++EVSKLV+LVNDLR AG+R QL QT++VSKRPMKGHGREMIQLG+PI DAPPED Sbjct: 250 DFLGDTEVSKLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPED 309 Query: 678 EAAIATYKDRKTE 716 E + T KDRK E Sbjct: 310 EISAGTSKDRKIE 322 >emb|CBI26785.3| unnamed protein product [Vitis vinifera] Length = 672 Score = 200 bits (509), Expect = 3e-49 Identities = 133/283 (46%), Positives = 154/283 (54%), Gaps = 45/283 (15%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEVIY+L QV WR+QQ+ D G GK Sbjct: 93 VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 152 Query: 174 K---------------------------VNGVVKIDVVNK------EAAVKQGETKELVG 254 V G K DVV K AA ++ + V Sbjct: 153 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVA 212 Query: 255 
NPEENNSMKSSVIEVGD----SQNEVDKTDDKRDSNSDGSSTV--ENESHSVQVPTEKQN 416 P N+ KSS G S+ E + DD N GS + EN +H VQ EK N Sbjct: 213 KPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPN 272 Query: 417 VV--PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-Q 587 PKTFV TEI+DGK VNVVDG+KLYEEL +SEVSK V+LVNDLR AG+RGQL A Q Sbjct: 273 PTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQ 332 Query: 588 TFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 TF+VSKRPMKGHGREMIQLG+PI DAP EDE+ + T KDR+TE Sbjct: 333 TFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTE 375 >ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera] Length = 698 Score = 200 bits (508), Expect = 4e-49 Identities = 130/280 (46%), Positives = 151/280 (53%), Gaps = 42/280 (15%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEVIY+L QV WR+QQ+ D G GK Sbjct: 93 VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 152 Query: 174 K---------------------------VNGVVKIDVVNK------EAAVKQGETKELVG 254 V G K DVV K AA ++ + V Sbjct: 153 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVA 212 Query: 255 NPEENNSMKSSVIEVGD----SQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVV 422 P N+ KSS G S+ E + DD N +EN +H VQ EK N Sbjct: 213 KPNANSCSKSSENSEGSRCGISETEANDMDDGGSCNM----IMENNAHPVQNQNEKPNPT 268 Query: 423 --PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFI 596 PKTFV TEI+DGK VNVVDG+KLYEEL +SEVSK V+LVNDLR AG+RGQL QTF+ Sbjct: 269 TSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFV 328 Query: 597 VSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 VSKRPMKGHGREMIQLG+PI DAP EDE+ + T KDR+TE Sbjct: 329 VSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTE 368 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. 
vesca] Length = 682 Score = 192 bits (487), Expect = 1e-46 Identities = 117/265 (44%), Positives = 150/265 (56%), Gaps = 27/265 (10%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDG---GGK--------MXXXXXXXXXXXXXX 149 VLHMQQYFSVAEVIY+L QV WR+QQ+ ++ G K + Sbjct: 92 VLHMQQYFSVAEVIYALQQVAWRRQQRYYEPVKMGNKDYKRSNSGVGFKPRNEPVKEWHT 151 Query: 150 XXXXXXXLKVNGVVKIDVVNKEAAVKQGE--------------TKELVGNPEENNSMKSS 287 +G+ K+ +E GE TK ++ P E S +SS Sbjct: 152 ASVEYRSYDGSGLEKVGSEMREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSS 211 Query: 288 VIEVGDSQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNV--VPKTFVATEIYDGK 461 G +++D + SS ENES+S+Q+ EKQN+ +PKTFV E +DGK Sbjct: 212 ANSQGTISGN-SESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFVGNETFDGK 270 Query: 462 PVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQ 641 VNVVDG+KLYEE L ++EVSKL +LVNDLR GRRGQL QT+++SKRPMKGHGREMIQ Sbjct: 271 TVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQ 330 Query: 642 LGLPIVDAPPEDEAAIATYKDRKTE 716 LG+PI D P EDE + KDR+ E Sbjct: 331 LGIPIADGPQEDEISAGISKDRRME 355 >ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] gi|557550702|gb|ESR61331.1| hypothetical protein CICLE_v10014588mg [Citrus clementina] Length = 635 Score = 188 bits (477), Expect = 2e-45 Identities = 114/256 (44%), Positives = 147/256 (57%), Gaps = 18/256 (7%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182 VLH+QQYFSV+EV+ +L QV WRKQQ+ FD K + Sbjct: 64 VLHLQQYFSVSEVMLALQQVAWRKQQRSFD---------HHHHHHHHHQQQHHLNRTKRS 114 Query: 183 GVVKIDVVNKEAAVKQG------------ETKELVGNPEENNSMKS----SVIEVGDSQN 314 VK D N + K++V ++ S KS + +VGD++ Sbjct: 115 AFVKKDFHNNNNNNNNNNHAFDSNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEP 174 Query: 315 EVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMK 488 + + DD S EN+S SVQ EKQN + K+FV TE+ DGK VNVVDG+K Sbjct: 175 KAEALDD-----GCTPSLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLK 229 Query: 489 LYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAP 668 LYEE+ NSEVSKLV+LVNDLR AG+RGQ+ 
++VSKRP++GHGRE+IQLGLPIVD P Sbjct: 230 LYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGP 289 Query: 669 PEDEAAIATYKDRKTE 716 PEDE A T +DR+ E Sbjct: 290 PEDEIAAGTSRDRRIE 305 >ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis] Length = 627 Score = 187 bits (476), Expect = 2e-45 Identities = 113/253 (44%), Positives = 146/253 (57%), Gaps = 15/253 (5%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182 VLH+QQYFSV+EV+ +L QV WRKQQ+ FD K + Sbjct: 64 VLHLQQYFSVSEVMLALQQVAWRKQQRSFD-------------HHHHHQQQHHLNRTKRS 110 Query: 183 GVVKIDVVNKEAAVKQG---------ETKELVGNPEENNSMKS----SVIEVGDSQNEVD 323 VK D N + K++V ++ S KS + +VGD++ + + Sbjct: 111 AFVKKDFHNNNNNNNHAFDSNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEPKAE 170 Query: 324 KTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYE 497 DD EN+S SVQ EKQN + K+FV TE+ DGK VNVVDG+KLYE Sbjct: 171 ALDD-----GCTPGLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYE 225 Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677 E+ NSEVSKLV+LVNDLR AG+RGQ+ ++VSKRP++GHGRE+IQLGLPIVD PPED Sbjct: 226 EVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPED 285 Query: 678 EAAIATYKDRKTE 716 E A T +DR+ E Sbjct: 286 EIAAGTSRDRRIE 298 >gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 186 bits (472), Expect = 7e-45 Identities = 115/268 (42%), Positives = 149/268 (55%), Gaps = 30/268 (11%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD-----------GGGKMXXXXXXXXXXXXXX 149 VLHMQQYFSVAEV+++L QV WR+QQ+ +D G Sbjct: 90 VLHMQQYFSVAEVMFALQQVAWRRQQRFYDPVKMGNKEFKRSGVGFKQWQRNDSFKDGRN 149 Query: 150 XXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSS-------------- 287 L N K + K G+ VGN ++ SM ++ Sbjct: 150 SAAESHCLDGNSSFGNAASEKGGSDKSGDE---VGNSDDRGSMPAAKEKNDSAAKSQEDG 206 Query: 288 -VIEVGDSQNEVDKTDDKRDSNSDG--SSTVENESHSVQVPTEKQNV--VPKTFVATEIY 452 V +G+ + V ++ + + DG SS+ EN+SHS E N+ VPKTF E++ Sbjct: 207 
NVKSLGNFEGVVSGSEPEVHAVDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMF 266 Query: 453 DGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGRE 632 DGKPVNVV+G+KLYEE +++EVSKLV LVNDLR AG RG +QT++VSKRPMKGHGRE Sbjct: 267 DGKPVNVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGRE 326 Query: 633 MIQLGLPIVDAPPEDEAAIATYKDRKTE 716 IQLGLPI DAP EDE + T KDR+TE Sbjct: 327 KIQLGLPIADAPVEDEISAGTLKDRRTE 354 >gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 184 bits (466), Expect = 3e-44 Identities = 113/252 (44%), Positives = 145/252 (57%), Gaps = 14/252 (5%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEV Y+L QV WR++Q+ ++ G GK Sbjct: 107 VLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNS 166 Query: 174 KVNG-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKT 329 V+ V + N+ + K+ E K VG E+ ++ + G + D Sbjct: 167 GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAE 226 Query: 330 DDKRDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEE 500 D N +S+ EN+ S+Q EKQN+ PKTFV E++DGK VNVVDG+KLYEE Sbjct: 227 SVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEE 286 Query: 501 LLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPEDE 680 L + EV LV+LVNDLR AG+RGQL QT++ +KRPMKGHGREMIQLGLPI DAP +DE Sbjct: 287 LFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDE 346 Query: 681 AAIATYKDRKTE 716 A T KDR+ E Sbjct: 347 NAAGTSKDRRIE 358 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 182 bits (461), Expect = 1e-43 Identities = 127/291 (43%), Positives = 153/291 (52%), Gaps = 53/291 (18%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEVIY+L QV WR+QQ+ D G GK Sbjct: 91 VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 150 Query: 174 
K---------------------------VNGVVKIDVVNK------EAAVKQGETKELV- 251 V G K DVV K AA ++ E V Sbjct: 151 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVI 210 Query: 252 -GNPEE---NNSMKSSVIEVGDSQNEVDKTDDKRDSNS------DGSSTVENESHSVQVP 401 G E+ N M+ +V V +Q + D + + + +EN +H VQ Sbjct: 211 FGQLEQMLLQNPMQIAVRRVQKTQKDPDVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQ 270 Query: 402 TEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQ 575 EK N PKTFV TEI+DGK VNVVDG+KLYEEL +SEVSK V+LVNDLR AG+RGQ Sbjct: 271 NEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQ 330 Query: 576 LPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYK----DRKTE 716 L QTF+VSKRPMKGHGREMIQLG+PI DAP EDE+ + T K +R+TE Sbjct: 331 LQGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTE 381 >gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 181 bits (458), Expect = 3e-43 Identities = 114/253 (45%), Positives = 146/253 (57%), Gaps = 15/253 (5%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXL 173 VLHMQQYFSVAEV Y+L QV WR++Q+ ++ G GK Sbjct: 107 VLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNS 166 Query: 174 KVNG-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKT 329 V+ V + N+ + K+ E K VG E+ ++ + G + D Sbjct: 167 GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAE 226 Query: 330 DDKRDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEE 500 D N +S+ EN+ S+Q EKQN+ PKTFV E++DGK VNVVDG+KLYEE Sbjct: 227 SVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEE 286 Query: 501 LLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677 L + EV LV+LVNDLR AG+RGQL A QT++ +KRPMKGHGREMIQLGLPI DAP +D Sbjct: 287 LFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADAPLDD 346 Query: 678 EAAIATYKDRKTE 716 E A T KDR+ E Sbjct: 347 ENAAGTSKDRRIE 359 >gb|EOY01303.1| Hydroxyproline-rich 
glycoprotein family protein, putative isoform 5 [Theobroma cacao] Length = 572 Score = 174 bits (442), Expect = 2e-41 Identities = 111/250 (44%), Positives = 143/250 (57%), Gaps = 15/250 (6%) Frame = +3 Query: 12 MQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXLKVN 182 MQQYFSVAEV Y+L QV WR++Q+ ++ G GK V+ Sbjct: 1 MQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSGVD 60 Query: 183 G-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKTDDK 338 V + N+ + K+ E K VG E+ ++ + G + D Sbjct: 61 SDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESVT 120 Query: 339 RDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEELLS 509 D N +S+ EN+ S+Q EKQN+ PKTFV E++DGK VNVVDG+KLYEEL Sbjct: 121 EDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFD 180 Query: 510 NSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAA 686 + EV LV+LVNDLR AG+RGQL A QT++ +KRPMKGHGREMIQLGLPI DAP +DE A Sbjct: 181 DKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDENA 240 Query: 687 IATYKDRKTE 716 T KDR+ E Sbjct: 241 AGTSKDRRIE 250 >ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] gi|550333016|gb|ERP57586.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] Length = 693 Score = 170 bits (430), Expect = 5e-40 Identities = 119/281 (42%), Positives = 142/281 (50%), Gaps = 43/281 (15%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKG-------------FDGGGKMXXXXXXXXXXXX 143 VLHMQQYFSV EVI +L QV R+QQ+ + GK+ Sbjct: 97 VLHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG 156 Query: 144 XXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVIEVGDSQNEVD 323 G D V KE E GN EN ++S E S + Sbjct: 157 FNRGHRGGGGGGGG----DAV-KEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGG 209 Query: 324 KTDDKRDSNS----------------------------DGSSTVENESHSVQVPTEKQN- 416 K+DDK+D+ + D SS E++SH EKQN Sbjct: 210 KSDDKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNL 269 Query: 417 -VVPKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTF 593 + PKTFVA E DG+ VNVVDG+KLYE LL 
EVSKLV+LVN+LR GRRGQ QT+ Sbjct: 270 AITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTY 329 Query: 594 IVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 I+SKRPMKGHGREMIQLGLPI DAP EDE A T K+R+ E Sbjct: 330 ILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVE 370 >ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis] gi|223533099|gb|EEF34858.1| conserved hypothetical protein [Ricinus communis] Length = 697 Score = 170 bits (430), Expect = 5e-40 Identities = 117/279 (41%), Positives = 148/279 (53%), Gaps = 41/279 (14%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQ---------------------------------- 80 VLHMQQYFSV EVI +L QV RKQQ Sbjct: 103 VLHMQQYFSVGEVILALQQVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFN 162 Query: 81 KGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVNGVVKID----VVNKEAAVKQGETKEL 248 KG GGG++ K N + + NK A + + K+ Sbjct: 163 KGHRGGGEVVKEVNYGAESHGLDGNTSGNE-KFNEIKSGGDSGRLENKSLATAE-DKKDA 220 Query: 249 VGNPEENNSMKSSVIEVGDSQNEVD-KTDDKRDSNSDGSSTVENESHSVQVPTEKQNVV- 422 P +N +KSS G+S+ + + + ++ + SS E++SH +Q K N+ Sbjct: 221 ASKPHVDN-LKSS----GNSEGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIVKLNLTT 275 Query: 423 -PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIV 599 PKTFV E+ DGK VNVVDG+KLYE+LL + EVSKLV+LVNDLR AGR+GQ Q ++V Sbjct: 276 TPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVV 335 Query: 600 SKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 SKRPMKGHGREMIQLGLPI DAP E+E A T KDRK E Sbjct: 336 SKRPMKGHGREMIQLGLPIADAPAEEENAAGTSKDRKIE 374 >gb|ABK95394.1| unknown [Populus trichocarpa] Length = 694 Score = 169 bits (428), Expect = 8e-40 Identities = 117/278 (42%), Positives = 144/278 (51%), Gaps = 40/278 (14%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182 VLHMQQYFSV EVI +L QV R+QQ+ + K + Sbjct: 97 VLHMQQYFSVGEVIVALQQVVLRRQQQQ-QQQQQQQQNHHHQQRFYYDHGKVGGRDFKRS 155 Query: 183 GVVKIDVVNKEA-----AVKQG-----ETKELVGNPEENNSMKSSVIEVGDSQNEVDKTD 332 + ++ AVK+G E GN EN ++S E S + K+D Sbjct: 156 SSAGFNRGHRGGGGGGDAVKEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGGKSD 
213 Query: 333 DKRDSNS----------------------------DGSSTVENESHSVQVPTEKQN--VV 422 DK+D+ + D SS E++SH EKQN + Sbjct: 214 DKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAIT 273 Query: 423 PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVS 602 PKTFVA E DG+ VNVVDG+KLYE LL EVSKLV+LVN+LR GRRGQ QT+I+S Sbjct: 274 PKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILS 333 Query: 603 KRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 KRPMKGHGREMIQLGLPI DAP EDE A T K+R+ E Sbjct: 334 KRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVE 371 >ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max] Length = 664 Score = 166 bits (420), Expect = 7e-39 Identities = 115/261 (44%), Positives = 145/261 (55%), Gaps = 23/261 (8%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD------------GGGKMXXXXXXXXXXXXX 146 VL MQQYFSV+EV+Y+L QV WR+QQ+ D G G Sbjct: 91 VLLMQQYFSVSEVVYALQQVSWRRQQRVVDPAKTGAKEFRKFGLGFKQGQHRFEAVKDGY 150 Query: 147 XXXXXXXXLKVNGVVKIDVVNKEAAV--KQGETKE--LVGNPEENNSMKSSVIEVGDSQN 314 N VV V K A V K GE K +VG + N + + + Sbjct: 151 NSSVESFGHGTNAVVVAGGVEKGACVTEKNGEIKSGGMVGTMDNKNLGSPEERKDAITNH 210 Query: 315 EVDKTDDKRDSNSDGS-STVENESHSVQ---VPTEKQN--VVPKTFVATEIYDGKPVNVV 476 + D K NS GS S+ E E+ V V K+N ++ K F+ E++DGK VNVV Sbjct: 211 QSDGIL-KGSRNSQGSLSSSECEAVGVNEECVSNSKENDSIMGKFFIGNEMFDGKMVNVV 269 Query: 477 DGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLP 653 DG+KLYE+LL ++EVSKLV+LVNDLRVAG+RGQ QTF+VSKRPMKGHGREMIQLG+P Sbjct: 270 DGLKLYEDLLDSTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVP 329 Query: 654 IVDAPPEDEAAIATYKDRKTE 716 I DAPP+ + KD+K E Sbjct: 330 IADAPPDVDNVTGISKDKKVE 350 >ref|XP_004513242.1| PREDICTED: uncharacterized protein LOC101506929 isoform X1 [Cicer arietinum] Length = 669 Score = 166 bits (420), Expect = 7e-39 Identities = 108/273 (39%), Positives = 147/273 (53%), Gaps = 35/273 (12%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQK----------------GFDGGGKMXXXXXXXXX 134 VL MQQY+SV+EV Y+L QV WR+QQ+ F+GGG + Sbjct: 94 
VLLMQQYYSVSEVAYALQQVAWRRQQRVVKPVAREFKKVRQWQRFEGGGNVKEGCNSGVE 153 Query: 135 XXXXXXXXXXXXLKVNGVVK-IDVVNKEAAVKQG---------------ETKELVGNPEE 266 + N VK VV+K +K G E K+ N + Sbjct: 154 FHRN---------EANSTVKGTRVVDKSEELKSGGKVGVKDDKSSDIAEEKKDTTTNHQS 204 Query: 267 NNSMKSSVIEVGDSQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVA 440 + +KS V G + K +D + + S EN+SHS+Q + +N KTF A Sbjct: 205 DGILKSPVNSQGSLSSAEYKAEDVNEEGASNSG--ENDSHSIQNQHQNENGSFTGKTFTA 262 Query: 441 TEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMK 617 E++DGK VN V+G+KLYE+L ++EVSKLV+LVNDLRVAGR+GQL QT++VSKRPM+ Sbjct: 263 NEMFDGKTVNAVEGLKLYEDLFDSTEVSKLVSLVNDLRVAGRKGQLQGNQTYVVSKRPMR 322 Query: 618 GHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716 G GREMIQLG+PI A P+ + A+ KD+ E Sbjct: 323 GRGREMIQLGVPIAYASPDVDNVTASTKDKNME 355 >ref|XP_004513244.1| PREDICTED: uncharacterized protein LOC101507475 [Cicer arietinum] Length = 657 Score = 165 bits (418), Expect = 1e-38 Identities = 105/265 (39%), Positives = 154/265 (58%), Gaps = 27/265 (10%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGK-MXXXXXXXXXXXXXXXXXXXXXLKV 179 VL MQQY+SV+EV Y+L QV WR+QQ+ K +++ Sbjct: 91 VLLMQQYYSVSEVSYALQQVAWRRQQRVVKPVVKEFRKVRQWQRFEGANVKEGCNSSVEL 150 Query: 180 NG------VVKIDVVNKEAAVKQG---------------ETKELVGNPEENNSMKSSVIE 296 NG V + V++K +K E K+ + N + N +K S Sbjct: 151 NGNKANLSVKETPVIDKIGELKSEGKVGTKDDKSSDIGEEKKDTITNHQSGNILKRS--- 207 Query: 297 VGDSQNEVDKTDDKRDSNSDG--SSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKP 464 G+SQ + ++ + ++G S++ EN+SHS+Q +K+N + K F+ EI DGK Sbjct: 208 -GNSQGSLSSSECEAVGVNEGITSNSRENDSHSMQNQNQKENNSTMGKAFIGNEIVDGKM 266 Query: 465 VNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQ 641 VNVVDG+KL+E+L ++EVSKLV+LVND+R+AG++GQ QT++VSKRPM+GHGREMIQ Sbjct: 267 VNVVDGLKLHEDLFDSTEVSKLVSLVNDMRIAGKKGQFQGNQTYVVSKRPMRGHGREMIQ 326 Query: 642 LGLPIVDAPPEDEAAIATYKDRKTE 716 LGLPIVDAP +++ A+ K +K E Sbjct: 327 LGLPIVDAPQDEDNMTASTKGKKIE 351 >ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550333015|gb|EEE88914.2| 
hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 675 Score = 165 bits (417), Expect = 2e-38 Identities = 115/259 (44%), Positives = 135/259 (52%), Gaps = 26/259 (10%) Frame = +3 Query: 3 VLHMQQYFSVAEVIYSLHQVEWRKQQKG-------------FDGGGKMXXXXXXXXXXXX 143 VLHMQQYFSV EVI +L QV R+QQ+ + GK+ Sbjct: 97 VLHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG 156 Query: 144 XXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVIEVGDSQNEVD 323 G D V KE E GN EN ++S E S + Sbjct: 157 FNRGHRGGGGGGGG----DAV-KEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGG 209 Query: 324 KTDDKR-----------DSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKP 464 K+DDK+ NS G++ +S V EKQN + PKTFVA E DG+ Sbjct: 210 KSDDKKADATAKSHTDNHKNSSGNAQGTFSGNSEAVANEKQNLAITPKTFVAEEKIDGQM 269 Query: 465 VNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQL 644 VNVVDG+KLYE LL EVSKLV+LVN+LR GRRGQ QT+I+SKRPMKGHGREMIQL Sbjct: 270 VNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQL 329 Query: 645 GLPIVDAPPEDEAAIATYK 701 GLPI DAP EDE A T K Sbjct: 330 GLPIADAPAEDENATGTSK 348