BLASTX nr result
ID: Akebia25_contig00006983
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00006983
         (1784 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]             327   1e-86
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...  326   2e-86
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]    318   4e-84
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...  316   2e-83
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]  310   1e-81
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...  305   4e-80
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...  300   1e-78
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...  300   1e-78
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...  296   2e-77
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...  292   4e-76
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...  286   2e-74
ref|XP_006849434.1| hypothetical protein AMTR_s00024p00040890 [A...  283   1e-73
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...  281   9e-73
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...  278   7e-72
ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phas...  277   1e-71
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...  277   1e-71
gb|ABK95394.1| unknown [Populus trichocarpa]                         275   4e-71
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...  270   1e-69
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...  270   2e-69
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...  268   5e-69

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score = 327 bits (838), Expect = 1e-86
 Identities = 194/372 (52%), Positives = 227/372 (61%), Gaps = 13/372 (3%)
 Frame = +2

Query: 707  MAMPSGNVAISDKMQFTSSGSVGGG----EIQN-RQWFMDERDRFISWLQGEFAAANAII 871
            MAMPSGNV ISDKMQF   G  GGG    EI + RQWF DERD FISWL+GEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 872  DSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTH 1051
            DSLC+HLR IGEP EYD V+ CIQQRR NW+ VLHMQQYFS+A+V +ALQQ  WR+QQ H
Sbjct: 61   DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120

Query: 1052 FDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXX 1231
             D  VK + K+ KR    GV  RQ  R ETAK            D  SS  L     +
Sbjct: 121  LD-PVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTL-----EKG 171

Query: 1232 XXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEG-----VVGTTNSHVDGGLRSSGIEC 1396
                          +GDVV + + K    AE+ +           NS       S G  C
Sbjct: 172  ERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRC 231

Query: 1397 --DNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAV 1570
                 ++  +D G + N K   N ++++    +QNQNE  N   SPK  VGTEI DGKAV
Sbjct: 232  GISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAV 291

Query: 1571 NVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQ-GRTFVSSKRPMKGRGREMIQL 1747
            NVV+GL++YE LFD   +SK V L N+LR AG+RGQ Q G+TFV SKRPMKG GREMIQL
Sbjct: 292  NVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQL 351

Query: 1748 GVPIADAPPEDE 1783
            GVPIADAP EDE
Sbjct: 352  GVPIADAPLEDE 363

>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score = 326 bits (835), Expect = 2e-86
 Identities = 193/371 (52%), Positives = 226/371 (60%), Gaps = 12/371 (3%)
 Frame = +2

Query: 707  MAMPSGNVAISDKMQFTSSGSVGGG----EIQN-RQWFMDERDRFISWLQGEFAAANAII 871
            MAMPSGNV ISDKMQF   G  GGG    EI + RQWF DERD FISWL+GEFAAANAII
Sbjct: 1    MAMPSGNVVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAII 60

Query: 872
DSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTH 1051 DSLC+HLR IGEP EYD V+ CIQQRR NW+ VLHMQQYFS+A+V +ALQQ WR+QQ H Sbjct: 61 DSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120 Query: 1052 FDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXX 1231 D VK + K+ KR GV RQ R ETAK D SS L + Sbjct: 121 LD-PVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTL-----EKG 171 Query: 1232 XXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEG-----VVGTTNSHVDGGLRSSGIEC 1396 +GDVV + + K AE+ + NS S G C Sbjct: 172 ERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRC 231 Query: 1397 --DNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAV 1570 ++ +D G S N+ ++++ +QNQNE N SPK VGTEI DGKAV Sbjct: 232 GISETEANDMDDGGSCNM------IMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAV 285 Query: 1571 NVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGREMIQLG 1750 NVV+GL++YE LFD +SK V L N+LR AG+RGQ QG+TFV SKRPMKG GREMIQLG Sbjct: 286 NVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLG 345 Query: 1751 VPIADAPPEDE 1783 VPIADAP EDE Sbjct: 346 VPIADAPLEDE 356 >gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 318 bits (816), Expect = 4e-84 Identities = 190/371 (51%), Positives = 229/371 (61%), Gaps = 12/371 (3%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGGEIQ---NRQWFMDERDRFISWLQGEFAAANAIIDS 877 MAMPSGNV SDKMQF S G+ G GEI NRQWF DERD FISWL+GEFAAANA+IDS Sbjct: 1 MAMPSGNVVSSDKMQFPS-GTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMIDS 59 Query: 878 LCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTHFD 1057 LCHHLR++GEP EYD V++CIQ RRCNWNPVLHMQQYFS+A+V FALQQ AWR+QQ +D Sbjct: 60 LCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFYD 119 Query: 1058 HRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISS---AQLVNMGSDX 1228 VK+ K+ KRS GVG +QW R ++ K D SS A GSD Sbjct: 120 -PVKMGNKEFKRS---GVGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNAASEKGGSD- 174 Query: 1229 XXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRSSGIECDNLK 1408 + GD V SD + ++ A E S DG ++S G + + Sbjct: 175 
--------------KSGDEVGNSDDRGSMPAA-KEKNDSAAKSQEDGNVKSLG-NFEGVV 218 Query: 1409 SG------AVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAV 1570 SG AVD GC+++ KE + + QNE N+ PK G E+ DGK V Sbjct: 219 SGSEPEVHAVDDGCTSSSKE-------NDSHSTPKQNENSNLANVPKTFSGNEMFDGKPV 271 Query: 1571 NVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGREMIQLG 1750 NVVEGL++YE +SKLV L N+LR+AG RG FQ +T+V SKRPMKG GRE IQLG Sbjct: 272 NVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLG 331 Query: 1751 VPIADAPPEDE 1783 +PIADAP EDE Sbjct: 332 LPIADAPVEDE 342 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. vesca] Length = 682 Score = 316 bits (810), Expect = 2e-83 Identities = 181/367 (49%), Positives = 230/367 (62%), Gaps = 8/367 (2%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTS--SGSVGGGEI--QNRQWFMDERDRFISWLQGEFAAANAIID 874 M MPSGNV +SDKMQ+ S +V GGEI Q RQWF DERD FISWL+GEFAAANAIID Sbjct: 1 MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAIID 60 Query: 875 SLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTHF 1054 SLCHHLR++GEPSEYD V+ C+QQRRCNW PVLHMQQYFS+A+V +ALQQ AWR+QQ ++ Sbjct: 61 SLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYY 120 Query: 1055 DHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXXX 1234 + VK+ KD KRS S GVG + R E K D + L +GS+ Sbjct: 121 E-PVKMGNKDYKRSNS-GVGFKP--RNEPVKEWHTASVEYRSYD---GSGLEKVGSE--- 170 Query: 1235 XXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVD--GGLRSSGIECDNLK 1408 + + G + D K + ++GV+ + ++ S G N + Sbjct: 171 -------MREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSE 223 Query: 1409 S--GAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVNVVE 1582 S V+ GC++++KE + ++IQ QNE +N+ PK VG E DGK VNVV+ Sbjct: 224 SEDAVVNEGCTSSIKE-------NESNSIQIQNEKQNLSLIPKTFVGNETFDGKTVNVVD 276 Query: 1583 GLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGREMIQLGVPIA 1762 GL++YE +SKL L N+LRT GRRGQ QG+T+V SKRPMKG GREMIQLG+PIA Sbjct: 277 GLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIA 336 Query: 1763 DAPPEDE 
1783 D P EDE Sbjct: 337 DGPQEDE 343 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 310 bits (794), Expect = 1e-81 Identities = 189/377 (50%), Positives = 223/377 (59%), Gaps = 20/377 (5%) Frame = +2 Query: 713 MPSGNVAISDKMQFTSSGSVGGG----EIQN-RQWFMDERDRFISWLQGEFAAANAIIDS 877 MPSGNV ISDKMQF G GGG EI + RQWF DERD FISWL+GEFAAANAIIDS Sbjct: 1 MPSGNVVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60 Query: 878 LCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTHFD 1057 LC+HLR IGEP EYD V+ CIQQRR NW+ VLHMQQYFS+A+V +ALQQ WR+QQ H D Sbjct: 61 LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120 Query: 1058 HRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXXXX 1237 VK + K+ KR GV RQ R ETAK D SS L + Sbjct: 121 -PVKGAGKEYKR---YGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTL-----EKGER 171 Query: 1238 XXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRS------SGIECD 1399 +GDVV + + K A + + V+ N + G L I Sbjct: 172 VSEIYDDVKGGDKGDVVGKLEDKDLSAAAEKKEVM---NFVIFGQLEQMLLQNPMQIAVR 228 Query: 1400 NLKSGAVDGGCSTNLKEP---------SNALLKSGGDAIQNQNEAENVIPSPKPLVGTEI 1552 ++ D + P N ++++ +QNQNE N SPK VGTEI Sbjct: 229 RVQKTQKDPDVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEI 288 Query: 1553 IDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGR 1732 DGKAVNVV+GL++YE LFD +SK V L N+LR AG+RGQ QG+TFV SKRPMKG GR Sbjct: 289 FDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGR 348 Query: 1733 EMIQLGVPIADAPPEDE 1783 EMIQLGVPIADAP EDE Sbjct: 349 EMIQLGVPIADAPLEDE 365 >ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|590697545|ref|XP_007045470.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 305 
bits (781), Expect = 4e-80 Identities = 180/379 (47%), Positives = 230/379 (60%), Gaps = 20/379 (5%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSS----------------GSVGGGEIQ---NRQWFMDERDRFI 829 MAMPSGNV +SDKMQF ++ G GGGEI +RQW DERD FI Sbjct: 1 MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60 Query: 830 SWLQGEFAAANAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVN 1009 WL+GEFAA+NAIIDSLCHHLR +GE EY+ V++CIQQRRCNWNPVLHMQQYFS+A+V+ Sbjct: 61 YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120 Query: 1010 FALQQAAWRKQQTHFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDT 1189 +ALQQ AWR++Q H++ KV K+ KRSG G R+E AK T Sbjct: 121 YALQQVAWRRRQRHYESG-KVGGKEFKRSGMGFKGQ----RMEVAKEGQNSGVDSDGNST 175 Query: 1190 ISSAQLVN-MGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVD 1366 +++ N GS+ + + V + + K + ED + T S Sbjct: 176 VTAVSERNERGSE----------KREEVKSCGEVGKVEDKCSTFTEDKK----DTGSKPH 221 Query: 1367 GGLRSSGIECDNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGT 1546 G S E V+GGC+++ KE N L +IQNQNE +N+ PK VG Sbjct: 222 AGDAESVTE-------DVNGGCTSSYKE--NDLC-----SIQNQNEKQNLAAGPKTFVGN 267 Query: 1547 EIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGR 1726 E+ DGK VNVV+GL++YE LFD + LV L N+LR AG+RGQ QG+T+V++KRPMKG Sbjct: 268 EMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGH 327 Query: 1727 GREMIQLGVPIADAPPEDE 1783 GREMIQLG+PIADAP +DE Sbjct: 328 GREMIQLGLPIADAPLDDE 346 >ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|590697542|ref|XP_007045469.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 300 bits (769), Expect = 1e-78 Identities = 180/380 (47%), Positives = 230/380 (60%), Gaps = 21/380 (5%) Frame = +2 Query: 707 
MAMPSGNVAISDKMQFTSS----------------GSVGGGEIQ---NRQWFMDERDRFI 829 MAMPSGNV +SDKMQF ++ G GGGEI +RQW DERD FI Sbjct: 1 MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60 Query: 830 SWLQGEFAAANAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVN 1009 WL+GEFAA+NAIIDSLCHHLR +GE EY+ V++CIQQRRCNWNPVLHMQQYFS+A+V+ Sbjct: 61 YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120 Query: 1010 FALQQAAWRKQQTHFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDT 1189 +ALQQ AWR++Q H++ KV K+ KRSG G R+E AK T Sbjct: 121 YALQQVAWRRRQRHYESG-KVGGKEFKRSGMGFKGQ----RMEVAKEGQNSGVDSDGNST 175 Query: 1190 ISSAQLVN-MGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVD 1366 +++ N GS+ + + V + + K + ED + T S Sbjct: 176 VTAVSERNERGSE----------KREEVKSCGEVGKVEDKCSTFTEDKK----DTGSKPH 221 Query: 1367 GGLRSSGIECDNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGT 1546 G S E V+GGC+++ KE N L +IQNQNE +N+ PK VG Sbjct: 222 AGDAESVTE-------DVNGGCTSSYKE--NDLC-----SIQNQNEKQNLAAGPKTFVGN 267 Query: 1547 EIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQ-GRTFVSSKRPMKG 1723 E+ DGK VNVV+GL++YE LFD + LV L N+LR AG+RGQ Q G+T+V++KRPMKG Sbjct: 268 EMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKG 327 Query: 1724 RGREMIQLGVPIADAPPEDE 1783 GREMIQLG+PIADAP +DE Sbjct: 328 HGREMIQLGLPIADAPLDDE 347 >ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] gi|462422058|gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] Length = 650 Score = 300 bits (768), Expect = 1e-78 Identities = 179/366 (48%), Positives = 214/366 (58%), Gaps = 7/366 (1%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSG---SVGGGEI--QNRQWFMDERDRFISWLQGEFAAANAII 871 M MPSGNV +SDKMQF S G +VGGGEI +RQWF DERD FISWL+GEFAAANAII Sbjct: 1 MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIAQHHRQWFPDERDGFISWLRGEFAAANAII 60 Query: 872 DSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTH 1051 DSLCHHLR++GEP EYD V+ CIQQRRCNWNPVLHMQQYFS+A+V +ALQ AWR+QQ + Sbjct: 61 DSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQRY 120 Query: 1052 
FDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXX 1231 +D VK K+ KRSG +Q R E K D SS Sbjct: 121 YD-PVKAGAKEFKRSGVGFNKGQQ--RAEAFKEGHNSTLESHSNDGNSS----------- 166 Query: 1232 XXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSE--GVVGTTNSHVDGGLRSSGIECDNL 1405 G V + + + E+ E G VG N D GL +G + N Sbjct: 167 ---------------GVVAPEKFERGSEVGEEVEPGGEVGKLN---DKGLAPAGEKKVN- 207 Query: 1406 KSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVNVVEG 1585 +IQ QN+ +N+ PK +G EI DGK VNVV+G Sbjct: 208 -----------------------ESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDG 244 Query: 1586 LRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGREMIQLGVPIAD 1765 L++YE +SKLV L N+LR AG+R Q QG+T+V SKRPMKG GREMIQLG+PIAD Sbjct: 245 LKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIAD 304 Query: 1766 APPEDE 1783 APPEDE Sbjct: 305 APPEDE 310 >ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis] gi|223533099|gb|EEF34858.1| conserved hypothetical protein [Ricinus communis] Length = 697 Score = 296 bits (759), Expect = 2e-77 Identities = 184/392 (46%), Positives = 223/392 (56%), Gaps = 33/392 (8%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGS-VGGG-------EIQNRQ------WF-MDERDRFISWLQ 841 MAMP GNV ISDK+QF + G VGGG EIQ +Q WF +DERD FISWL+ Sbjct: 1 MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60 Query: 842 GEFAAANAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQ 1021 GEFAAANAIIDSLCHHLR+ GEP EYD V+ CIQQRRCNWNPVLHMQQYFS+ +V ALQ Sbjct: 61 GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120 Query: 1022 QAAWRKQQTH-----------FDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXX 1168 Q A RKQQ H + + KV KD KR+ S G E K Sbjct: 121 QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVNYGAE 180 Query: 1169 XXXXXDTISSAQLVNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGT 1348 S + N + + G R + KS AED + Sbjct: 181 SHGLDGNTSGNEKFN-----------------EIKSGGDSGRLENKSLATAEDKKD--AA 221 Query: 1349 TNSHVDGGLRSSGIECDNLK-SGAVDGGCSTNLKEPSNALLKSGGDA------IQNQNEA 1507 + HV DNLK SG +G S NL+ + A+ + IQNQ Sbjct: 222 
SKPHV-----------DNLKSSGNSEGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIVK 270 Query: 1508 ENVIPSPKPLVGTEIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG 1687 N+ +PK VG E++DGK+VNVV+GL++YE L D + +SKLV L N+LR AGR+GQFQG Sbjct: 271 LNLTTTPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQG 330 Query: 1688 RTFVSSKRPMKGRGREMIQLGVPIADAPPEDE 1783 + +V SKRPMKG GREMIQLG+PIADAP E+E Sbjct: 331 QAYVVSKRPMKGHGREMIQLGLPIADAPAEEE 362 >ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus] gi|449481289|ref|XP_004156139.1| PREDICTED: uncharacterized LOC101210274 [Cucumis sativus] Length = 684 Score = 292 bits (747), Expect = 4e-76 Identities = 170/376 (45%), Positives = 222/376 (59%), Gaps = 17/376 (4%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSV----GGGEIQN---RQWFMDERDRFISWLQGEFAAANA 865 MAMPSGNV + DK+ F S G V GGGEI R WF DERD FISWL+GEFAA+NA Sbjct: 1 MAMPSGNVGVPDKVSFQSGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA 60 Query: 866 IIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQ 1045 IID+LCHHLR++GEP EYD V+ CIQQRRCNW PVLHMQQYFS+A+V +ALQQ R+QQ Sbjct: 61 IIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120 Query: 1046 THFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMG-S 1222 + D VKV K +R G G +Q R E +TI+ A+ N G S Sbjct: 121 RYMD-PVKVGPKLYRRPG-PGFKQQQGHRAEAT----------VKEETITCAESCNGGNS 168 Query: 1223 DXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAE-DSEGVVGTTNSHVDGGLRSSGIECD 1399 C ++ G+ L+E DS V ++H + Sbjct: 169 STFVSSRKVEQVSNTCDE----SKASGEDEKLSEKDSGSAVDNKDTHGKDQSNCKTKSAE 224 Query: 1400 NLKSGAV--------DGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEII 1555 NL+ A+ D GCS++ ++ ++Q+QN + +P+ V +E+ Sbjct: 225 NLEDNAINKDSQVEPDDGCSSSHRDKEL-------QSVQSQNGKQYAATTPRTFVASEMF 277 Query: 1556 DGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGRE 1735 DGK VNV++GL+++E L D +SKL+ L N+LR +G+RGQFQG+T+V SKRPMKG GRE Sbjct: 278 DGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGRE 337 Query: 1736 MIQLGVPIADAPPEDE 1783 MIQLG PIADAP ED+ Sbjct: 338 MIQLGFPIADAPHEDD 353 
>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine max] Length = 681 Score = 286 bits (733), Expect = 2e-74 Identities = 174/371 (46%), Positives = 222/371 (59%), Gaps = 12/371 (3%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG---EIQN----RQWFMDERDRFISWLQGEFAAANA 865 MAMPSGNV I DKMQF S G+ GG EI +QWF+DERD I WL+ EFAAANA Sbjct: 1 MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60 Query: 866 IIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQ 1045 IIDSLCHHLR +G+P EYD V+ IQQRRCNWN VL MQQYFS+ADV ALQQ AWR+QQ Sbjct: 61 IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120 Query: 1046 THFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSD 1225 D VKV K+ ++SGS G R R E K + + V G++ Sbjct: 121 RPLD-PVKVGAKEFRKSGS---GYRHGQRFEPVK--EGYNSSVESYNQYDANVTVTGGTE 174 Query: 1226 XXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGL---RSSGIEC 1396 + + G VE+ K AED + + T DG L RS+ Sbjct: 175 ---KGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKKDAI--TKHQTDGSLKSTRSTEGSL 229 Query: 1397 DNLKSGA-VDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVN 1573 NL+S A V+ C +N K + ++QNQ++++++ K +G E+ DGK VN Sbjct: 230 SNLESEAVVNDECISNSKGDDS-------HSVQNQHQSQSLSTKAKTFIGNEMFDGKMVN 282 Query: 1574 VVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGRGREMIQLG 1750 VV+GL++YE LFDS I+ LV L N+LR +G++GQ QG + ++ S+RPMKG GREMIQLG Sbjct: 283 VVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLG 342 Query: 1751 VPIADAPPEDE 1783 VPIADAP E E Sbjct: 343 VPIADAPAEGE 353 >ref|XP_006849434.1| hypothetical protein AMTR_s00024p00040890 [Amborella trichopoda] gi|548853009|gb|ERN11015.1| hypothetical protein AMTR_s00024p00040890 [Amborella trichopoda] Length = 655 Score = 283 bits (725), Expect = 1e-73 Identities = 167/377 (44%), Positives = 215/377 (57%), Gaps = 20/377 (5%) Frame = +2 Query: 713 MPSGN--------VAISDKMQFTSSGSVGGGEIQNRQ--WFMDERDRFISWLQGEFAAAN 862 MP+G+ + I D+MQF GGEI RQ WF DERD FISWL+ EFAAAN Sbjct: 1 
MPAGDASLNSNPCITIPDRMQFQ------GGEIHQRQQPWFPDERDGFISWLRSEFAAAN 54 Query: 863 AIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQ 1042 AIIDSLC+HL+++G P EY+ ++ IQQRRCNW PVLHMQQYF +A++ ++LQQ AWRKQ Sbjct: 55 AIIDSLCYHLKAVGSPGEYETTLAFIQQRRCNWTPVLHMQQYFPVAEIAYSLQQVAWRKQ 114 Query: 1043 QTHFD-----HRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQL 1207 Q H D ++ SEK+ K+SG Q G+R W V+ S + Sbjct: 115 QRHCDPTMPGFHMRYSEKEPKKSGQQSFGNRHWSMVQGHGIYGGSEKESQDSGASSKVVV 174 Query: 1208 VNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHV---DGGLR 1378 G+ G+ V++ +G ++ E+ EGV + + V G Sbjct: 175 GTSGNG--------------ADHGEEVKQVNG--SMSGEEREGVEVSKSQRVCSLSNGPN 218 Query: 1379 SSGIECDNLKSGAVD--GGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEI 1552 S G E N + ++ G C T K D +Q + E +P+PK V TE Sbjct: 219 SLGTEDGNSEPKILNNCGPCDT-------VTQKDEADGVQKEVEENESVPAPKTFVATEY 271 Query: 1553 IDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPMKGRGR 1732 +DGKAVNV+EGL +YE LFDS IS+LV ANELR AGRRG QG TFV SKRPM+G GR Sbjct: 272 LDGKAVNVLEGLELYEELFDSTEISRLVTFANELRAAGRRGDIQGPTFVVSKRPMRGHGR 331 Query: 1733 EMIQLGVPIADAPPEDE 1783 EMIQLG+PI D P E+E Sbjct: 332 EMIQLGIPIYDGPVEEE 348 >ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max] Length = 683 Score = 281 bits (718), Expect = 9e-73 Identities = 172/377 (45%), Positives = 224/377 (59%), Gaps = 18/377 (4%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG------EIQNR-----QWFMDERDRFISWLQGEFA 853 MAMPSGNV I DKMQF S GGG EI QWF+DERD I WL+ EFA Sbjct: 1 MAMPSGNVVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSEFA 60 Query: 854 AANAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAW 1033 AANAIIDSLCHHLR +G+P EYD V+ IQQRRCNWN VL MQQYFS+ADV +ALQQ AW Sbjct: 61 AANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQVAW 120 Query: 1034 RKQQTHFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVN 1213 R+QQ D +KV K++++SGS G R R E+ K D + V Sbjct: 121 RRQQRPLD-PMKVGAKEVRKSGS---GYRHGQRFESVKEGYNSSVESYSHD---ANVAVT 173 Query: 1214 
MGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGL---RSS 1384 G++ + + G VE+ K E+ + + TN +G L RS+ Sbjct: 174 GGTE---KGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAI--TNHQSEGSLKSARST 228 Query: 1385 GIECDNLKSGA-VDGGCSTNLKEPSNALLKSGGD--AIQNQNEAENVIPSPKPLVGTEII 1555 NL+S A V+ GC +N K G D ++QNQ++++++ K +G E+ Sbjct: 229 EGSLSNLESEAVVNDGCISNSK---------GNDLHSVQNQSQSQSLSNIAKTFIGNEMF 279 Query: 1556 DGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGRGR 1732 DGK VNVV+GL++Y+ LFDS ++ LV L N+LR +G++GQ QG + ++ S+RPMKG GR Sbjct: 280 DGKTVNVVDGLKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGR 339 Query: 1733 EMIQLGVPIADAPPEDE 1783 EMIQLGV IADAP E E Sbjct: 340 EMIQLGVRIADAPAEGE 356 >ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris] gi|561032200|gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris] Length = 671 Score = 278 bits (710), Expect = 7e-72 Identities = 167/369 (45%), Positives = 221/369 (59%), Gaps = 10/369 (2%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGG-GEIQN----RQWFMDERDRFISWLQGEFAAANAII 871 MAMPSGNV I DKMQF + G G GEIQ +QWF+DERD I WL+ EFAAANAII Sbjct: 1 MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAANAII 60 Query: 872 DSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTH 1051 DSLCHHLR +G+P EYD V+ IQQRRCNWN VL MQQYFS+ADV + LQQ AWRKQQ Sbjct: 61 DSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQQRP 120 Query: 1052 FDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXD-TISSAQLVNMGSDX 1228 D VKV K++++ G G R R E +K D + + + G+ Sbjct: 121 LD-PVKVGAKEVRKPGP---GYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGT-- 174 Query: 1229 XXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRSSGIE---CD 1399 + + G VE+ K E+ + + DG L+S+G Sbjct: 175 -----PTVDKSEEHKSGSKVEKVGDKGLASPEEKKDAI--IKHQTDGNLKSTGSSEGYLS 227 Query: 1400 NLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVNVV 1579 NL+S AV N + SN+ + D++++Q+++++ K +G E+IDGK VN+ Sbjct: 228 NLESEAV----VVNDEFISNS-KGNDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLA 282 Query: 1580 
EGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGRGREMIQLGVP 1756 +GL++YE +FDS +S LV L N+LR +G++GQ QG + +V S+RPMKG GREMIQLGVP Sbjct: 283 DGLKLYEDIFDSTEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVP 342 Query: 1757 IADAPPEDE 1783 IADAP E E Sbjct: 343 IADAPVEGE 351 >ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris] gi|561026542|gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris] Length = 691 Score = 277 bits (709), Expect = 1e-71 Identities = 169/377 (44%), Positives = 221/377 (58%), Gaps = 20/377 (5%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSV--GGGEIQNR--QWFMDERDRFISWLQGEFAAANAIID 874 MAMPSGN + +K+QF G GGGEIQ R QWF+DERD FI WL+ EFAAANAIID Sbjct: 1 MAMPSGNGGMPEKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAIID 60 Query: 875 SLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTHF 1054 SLC HLR +GEP YD V+ IQQRRCNW VL MQQYFS+++V +ALQQ AWR+QQ Sbjct: 61 SLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQRFV 120 Query: 1055 DHRVKVSEKDLKRSGS-------QGVGSRQWF---RVETAKXXXXXXXXXXXXDTISSAQ 1204 D K K+ ++ GS + S++ + R E AK + +A Sbjct: 121 D-PAKAGSKEFRKFGSGFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGRE--MNAV 177 Query: 1205 LVNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRSS 1384 +V G + + G V D S E+S+ + TN +DG L S Sbjct: 178 VVTGGVE---KGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTI--TNDQLDGILNGS 232 Query: 1385 GIECDNLKSGAVDG-----GCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTE 1549 G +L S + C++N K + ++QNQ++++N K +G E Sbjct: 233 GNFQGSLSSSECEAVGENEECTSNSK-------GNDSHSVQNQHQSQNASTIGKTFIGNE 285 Query: 1550 IIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGR 1726 + +GK VNVV+GL++YE L DS +SKLV L N++R AG+RGQFQG +TFV SKRP+KGR Sbjct: 286 MFEGKMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGR 345 Query: 1727 GREMIQLGVPIADAPPE 1777 GREMIQLGVPIADAPP+ Sbjct: 346 GREMIQLGVPIADAPPD 362 >ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] gi|550333016|gb|ERP57586.1| hypothetical protein POPTR_0008s13830g 
[Populus trichocarpa] Length = 693 Score = 277 bits (709), Expect = 1e-71 Identities = 170/384 (44%), Positives = 216/384 (56%), Gaps = 25/384 (6%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG--------EIQNRQWF-MDERDRFISWLQGEFAAA 859 MAMP GNV I DK+QF + + GGG ++Q QWF +DERD FISWL+GEFAAA Sbjct: 1 MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60 Query: 860 NAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRK 1039 NAIIDSLCHHLR++GE EYD V+ CIQQRR NWN VLHMQQYFS+ +V ALQQ R+ Sbjct: 61 NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120 Query: 1040 QQT--------------HFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXX 1177 QQ ++DH KV +D KRS S G + Sbjct: 121 QQQQQQQQQNHHHQQRFYYDHG-KVGGRDFKRSSSAGFNRGH-------RGGGGGGGGDA 172 Query: 1178 XXDTISSAQLVNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNS 1357 + ++S+ + N + + + G +SD K A+ T++ Sbjct: 173 VKEGVNSS-VENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSH------TDN 225 Query: 1358 HVDGGLRSSGIECDNLKSGAVDGGCSTNLKE--PSNALLKSGGDAIQNQNEAENVIPSPK 1531 H + + G N ++ AVD S + PSN NQNE +N+ +PK Sbjct: 226 HKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSN-----------NQNEKQNLAITPK 274 Query: 1532 PLVGTEIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKR 1711 V E IDG+ VNVV+GL++YE L D L +SKLV L NELR GRRGQ QG+T++ SKR Sbjct: 275 TFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKR 334 Query: 1712 PMKGRGREMIQLGVPIADAPPEDE 1783 PMKG GREMIQLG+PIADAP EDE Sbjct: 335 PMKGHGREMIQLGLPIADAPAEDE 358 >gb|ABK95394.1| unknown [Populus trichocarpa] Length = 694 Score = 275 bits (704), Expect = 4e-71 Identities = 169/385 (43%), Positives = 208/385 (54%), Gaps = 26/385 (6%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG--------EIQNRQWF-MDERDRFISWLQGEFAAA 859 MAMP GNV I DK+QF + + GGG ++Q QWF +DERD FISWL+GEFAAA Sbjct: 1 MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60 Query: 860 NAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRK 1039 NAIIDSLCHHLR++GE EYD V+ CIQQRR NWN VLHMQQYFS+ +V ALQQ R+ Sbjct: 61 
NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120 Query: 1040 QQT-----------------HFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXX 1168 QQ ++DH KV +D KRS S G Sbjct: 121 QQQQQQQQQQQQNHHHQQRFYYDHG-KVGGRDFKRSSSAGFNRGH-------------RG 166 Query: 1169 XXXXXDTISSAQLVNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGT 1348 D + VN + + +V DG + +D+ T Sbjct: 167 GGGGGDAVKEG--VNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDA-----T 219 Query: 1349 TNSHVDGGLRSSGIECDNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSP 1528 SH D SSG G G + ++ +S NQNE +N+ +P Sbjct: 220 AKSHTDNHKNSSGNA-----QGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITP 274 Query: 1529 KPLVGTEIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSK 1708 K V E IDG+ VNVV+GL++YE L D L +SKLV L NELR GRRGQ QG+T++ SK Sbjct: 275 KTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSK 334 Query: 1709 RPMKGRGREMIQLGVPIADAPPEDE 1783 RPMKG GREMIQLG+PIADAP EDE Sbjct: 335 RPMKGHGREMIQLGLPIADAPAEDE 359 >ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max] Length = 626 Score = 270 bits (691), Expect = 1e-69 Identities = 168/365 (46%), Positives = 210/365 (57%), Gaps = 8/365 (2%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGGEIQNRQ-WFMDERDRFISWLQGEFAAANAIIDSLC 883 MAMPSGN + +K+QF G GG EI RQ WF+DERD FI WL+ EFAAANAIIDSLC Sbjct: 1 MAMPSGNAVMPEKLQFPGGG--GGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSLC 58 Query: 884 HHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQTHFDHR 1063 HHLR +GEP EYD V+ IQQRRCNW VL MQQYFS+++V ALQQ +WR+QQ D Sbjct: 59 HHLRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVD-L 117 Query: 1064 VKVSEKDLKRSGSQGVGSRQ-WFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSDXXXXX 1240 K K+ ++ GS G RQ R+E AK T +A +V G + Sbjct: 118 AKTGAKEFRKFGS---GIRQGQHRLEAAKDGYNSSVESFCHGT--NAVVVAGGVE---KG 169 Query: 1241 XXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRSSG-----IECDNL 1405 + + G V D KS E+ + + TN DG L+ SG + Sbjct: 170 TPLTEKNGEIKSGGKVGTMDNKSLASPEERKDTI--TNHQSDGILKGSGNSQGSLSTSEC 227 Query: 1406 
KSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVNVVEG 1585 ++ V+ C +N KE + + K +G E+ DGK VNVV+G Sbjct: 228 EAVGVNEECVSNSKENDSTM--------------------GKTFIGNEMFDGKMVNVVDG 267 Query: 1586 LRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGRGREMIQLGVPIA 1762 L++YE L D +SKLV L N+LR AG+RGQFQG +TFV SKRPMKG GREMIQLGVPIA Sbjct: 268 LKLYEDLLDRTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIA 327 Query: 1763 DAPPE 1777 DAPP+ Sbjct: 328 DAPPD 332 >ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine max] Length = 641 Score = 270 bits (690), Expect = 2e-69 Identities = 167/367 (45%), Positives = 208/367 (56%), Gaps = 8/367 (2%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG---EIQN----RQWFMDERDRFISWLQGEFAAANA 865 MAMPSGNV I DKMQF S G+ GG EI +QWF+DERD I WL+ EFAAANA Sbjct: 1 MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAANA 60 Query: 866 IIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRKQQ 1045 IIDSLCHHLR +G+P EYD V+ IQQRRCNWN VL MQQYFS+ADV ALQQ AWR+QQ Sbjct: 61 IIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRRQQ 120 Query: 1046 THFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXXXXDTISSAQLVNMGSD 1225 D VKV K+ ++SGS G R R E K SS + N Sbjct: 121 RPLD-PVKVGAKEFRKSGS---GYRHGQRFEPVKEGYN-----------SSVESYN---- 161 Query: 1226 XXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNSHVDGGLRSSGIECDNL 1405 Q V K T + E SE H GG + Sbjct: 162 -------------QYDANVTVTGGTEKGTPVVEKSE-------EHKSGG---------KV 192 Query: 1406 KSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPLVGTEIIDGKAVNVVEG 1585 + G S K+ ++ ++QNQ++++++ K +G E+ DGK VNVV+G Sbjct: 193 EKVGDKGLASAEDKKGDDS------HSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDG 246 Query: 1586 LRVYEGLFDSLGISKLVQLANELRTAGRRGQFQG-RTFVSSKRPMKGRGREMIQLGVPIA 1762 L++YE LFDS I+ LV L N+LR +G++GQ QG + ++ S+RPMKG GREMIQLGVPIA Sbjct: 247 LKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIA 306 Query: 1763 DAPPEDE 1783 DAP E E Sbjct: 307 DAPAEGE 313 >ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] 
gi|550333015|gb|EEE88914.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 675 Score = 268 bits (686), Expect = 5e-69 Identities = 167/382 (43%), Positives = 203/382 (53%), Gaps = 23/382 (6%) Frame = +2 Query: 707 MAMPSGNVAISDKMQFTSSGSVGGG--------EIQNRQWF-MDERDRFISWLQGEFAAA 859 MAMP GNV I DK+QF + + GGG ++Q QWF +DERD FISWL+GEFAAA Sbjct: 1 MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFAAA 60 Query: 860 NAIIDSLCHHLRSIGEPSEYDFVMSCIQQRRCNWNPVLHMQQYFSIADVNFALQQAAWRK 1039 NAIIDSLCHHLR++GE EYD V+ CIQQRR NWN VLHMQQYFS+ +V ALQQ R+ Sbjct: 61 NAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVLRR 120 Query: 1040 QQT--------------HFDHRVKVSEKDLKRSGSQGVGSRQWFRVETAKXXXXXXXXXX 1177 QQ ++DH KV +D KRS S G Sbjct: 121 QQQQQQQQQNHHHQQRFYYDHG-KVGGRDFKRSSSAGF-----------NRGHRGGGGGG 168 Query: 1178 XXDTISSAQLVNMGSDXXXXXXXXXXXXXQCQRGDVVERSDGKSTVLAEDSEGVVGTTNS 1357 D + VN + + +V DG + D + T S Sbjct: 169 GGDAVKEG--VNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKS----DDKKADATAKS 222 Query: 1358 HVDGGLRSSGIECDNLKSGAVDGGCSTNLKEPSNALLKSGGDAIQNQNEAENVIPSPKPL 1537 H D SSG NA G++ NE +N+ +PK Sbjct: 223 HTDNHKNSSG-----------------------NAQGTFSGNSEAVANEKQNLAITPKTF 259 Query: 1538 VGTEIIDGKAVNVVEGLRVYEGLFDSLGISKLVQLANELRTAGRRGQFQGRTFVSSKRPM 1717 V E IDG+ VNVV+GL++YE L D L +SKLV L NELR GRRGQ QG+T++ SKRPM Sbjct: 260 VAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPM 319 Query: 1718 KGRGREMIQLGVPIADAPPEDE 1783 KG GREMIQLG+PIADAP EDE Sbjct: 320 KGHGREMIQLGLPIADAPAEDE 341