BLASTX nr result
ID: Zingiber24_contig00022431
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00022431
         (2339 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...  472  e-130
emb|CBI26785.3| unnamed protein product [Vitis vinifera]             467  e-128
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]  446  e-122
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]    442  e-121
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...  441  e-121
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...  433  e-118
gb|ABK95394.1| unknown [Populus trichocarpa]                         432  e-118
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...  427  e-117
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...  427  e-116
gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus...  426  e-116
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...  423  e-115
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...  422  e-115
gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus...  418  e-114
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...  418  e-114
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...  413  e-112
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...  410  e-111
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...  409  e-111
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...  407  e-111
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...
402 e-109 ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600... 400 e-108 >ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera] Length = 698 Score = 472 bits (1215), Expect = e-130 Identities = 290/651 (44%), Positives = 370/651 (56%), Gaps = 33/651 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANA 619 MA SGN VV + MQFP GGRG +E+ +QWF DERDG ISWLRGEFAAANA Sbjct: 1 MAMPSGN--VVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANA 58 Query: 620 IIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQL 799 IID L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ Sbjct: 59 IIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQ 118 Query: 800 KQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKL 964 + K+ ++ +R G R + ++S S + + SGT+E+ + + Sbjct: 119 RHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIY 178 Query: 965 DHSKNVNQNNV-----QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 1129 D K ++ +V A AG D + A SC ++ C I A Sbjct: 179 DDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETEA 238 Query: 1130 SGC---GSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFL 1300 + GS +N NQ+ +P FV E DG VNVV+GLKLYE Sbjct: 239 NDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELF 298 Query: 1301 DSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHR 1480 D SEV+KFVSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P + +EDE Sbjct: 299 DDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESV 358 Query: 1481 TLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPV 1660 K+R+ E+IPS L + L QVL VKPD C+IDF+NEGDHSQPH WP+WFGRPV Sbjct: 359 VGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPV 418 Query: 1661 CNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLI 1840 C LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSAD AK AIPSLRK I Sbjct: 419 CILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRI 478 Query: 1841 LLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILH 2020 L+TF 
K +PKKT+ +A + +P +R + R+P K YG +P G+L Sbjct: 479 LVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLP 538 Query: 2021 AQQAPQ--IMLSPNGVQPRF---AAAPMVASPA---LPSGPPATVGWAAASSMNAXXXXX 2176 A P + PNG+QP F A AP + PA LP+G P GW AA + Sbjct: 539 APAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSP---GWPAAPPRHPPPRLP 595 Query: 2177 XXXXXXXFLPPGSVHIYPIQHL--PGTPISVQTIYDNSR---SEKPTSNSN 2314 PPGS + QH+ T SV+T + S K +SNSN Sbjct: 596 VPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSN 646 >emb|CBI26785.3| unnamed protein product [Vitis vinifera] Length = 672 Score = 467 bits (1201), Expect = e-128 Identities = 289/661 (43%), Positives = 375/661 (56%), Gaps = 35/661 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANA 619 MA SGN VV + MQFP GGRG +E+ +QWF DERDG ISWLRGEFAAANA Sbjct: 1 MAMPSGN--VVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANA 58 Query: 620 IIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQL 799 IID L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ Sbjct: 59 IIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQ 118 Query: 800 KQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKL 964 + K+ ++ +R G R + ++S S + + SGT+E+ + + Sbjct: 119 RHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIY 178 Query: 965 DHSKNVNQNNV-----QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 1129 D K ++ +V A AG D + A SC ++ C I A Sbjct: 179 DDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETEA 238 Query: 1130 SGC---GSQASNSGDNTVMTSN----QDRTQKVIP--APNEFVAKETCDGMMVNVVEGLK 1282 + G+ N +M +N Q++ +K P +P FV E DG VNVV+GLK Sbjct: 239 NDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLK 298 Query: 1283 LYENFLDSSEVTKFVSLANEMRAAGHRGEL-SGQTLVTLKRPTKGHGREMIQLGIPTNEG 1459 LYE D SEV+KFVSL N++RAAG RG+L +GQT V KRP KGHGREMIQLG+P + Sbjct: 299 LYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIADA 358 Query: 1460 HIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWP 1639 +EDE K+R+ E+IPS L + L QVL VKPD C+IDF+NEGDHSQPH WP Sbjct: 359 
PLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWP 418 Query: 1640 SWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIP 1819 +WFGRPVC LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSAD AK AIP Sbjct: 419 TWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIP 478 Query: 1820 SLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIM 1999 SLRK IL+TF K +PKKT+ +A + +P +R + R+P K YG + Sbjct: 479 SLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAV 538 Query: 2000 PANGILHAQQAPQ--IMLSPNGVQPRF---AAAPMVASPA---LPSGPPATVGWAAASSM 2155 P G+L A P + PNG+QP F A AP + PA LP+G P GW AA Sbjct: 539 PTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSP---GWPAAPPR 595 Query: 2156 NAXXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTN 2335 + PPGS + QH IS + + + PT N S S+ Sbjct: 596 HPPPRLPVPGTGVFLPPPGSGNSSSPQH-----ISTEATSTSVETAAPTEKENGSGKSST 650 Query: 2336 I 2338 + Sbjct: 651 V 651 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 446 bits (1148), Expect = e-122 Identities = 283/663 (42%), Positives = 363/663 (54%), Gaps = 49/663 (7%) Frame = +2 Query: 473 SGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANAIIDV 631 SGN VV + MQFP GG G +E+ +QWF DERDG ISWLRGEFAAANAIID Sbjct: 3 SGN--VVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60 Query: 632 LMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQ 811 L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ + Sbjct: 61 LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120 Query: 812 RHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKLDHSK 976 K+ ++ +R G R + ++S S + + SGT+E+ + + D K Sbjct: 121 PVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIYDDVK 180 Query: 977 NVNQNNV-------------QMSRAMNCFPVAGKDG-----NSHSLAESCCMKDGSNPAE 1102 ++ +V + MN F + G+ N +A K +P Sbjct: 181 GGDKGDVVGKLEDKDLSAAAEKKEVMN-FVIFGQLEQMLLQNPMQIAVRRVQKTQKDPDV 239 Query: 1103 TC-----VIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNV 1267 + + A C N N NQ+ +P FV E DG VNV 
Sbjct: 240 AFQRLRPMTWMMEARSCNMIMEN---NAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNV 296 Query: 1268 VEGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIP 1447 V+GLKLYE D SEV+KFVSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P Sbjct: 297 VDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVP 356 Query: 1448 TNEGHIEDEHRTLNYK----ERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGD 1615 + +EDE K R+ E+IPS L + L QVL VKPD C+IDF+NEGD Sbjct: 357 IADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVIGQLVGSQVLTVKPDACIIDFYNEGD 416 Query: 1616 HSQPHTWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSA 1795 HSQPH WP+WFGRPVC LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSA Sbjct: 417 HSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSA 476 Query: 1796 DLAKRAIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPS 1975 D AK AIPSLRK IL+TF K +PKKT +A + +P +R + R+P Sbjct: 477 DFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPM 536 Query: 1976 ARKPYGIMPANGILHAQQAPQ--IMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWA 2140 K YG +P G+L A P + PNG+QP F A AP + PA P + GW Sbjct: 537 GPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPXPLPTGSPGWP 596 Query: 2141 AASSMNAXXXXXXXXXXXXFLPPGSVHIYPIQHL--PGTPISVQTIYDNSR---SEKPTS 2305 AA + PPGS + QH+ T SV+T + S K +S Sbjct: 597 AAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSS 656 Query: 2306 NSN 2314 NSN Sbjct: 657 NSN 659 >gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 442 bits (1136), Expect = e-121 Identities = 262/605 (43%), Positives = 339/605 (56%), Gaps = 20/605 (3%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGRGSEVP----QQWFVDERDGLISWLRGEFAAANAIID 628 MA SGN VVS + MQFP E+ +QWF DERDG ISWLRGEFAAANA+ID Sbjct: 1 MAMPSGN--VVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMID 58 Query: 629 VLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQR 808 L H+R GE GEYD V+ CI RR W LHMQQ+F V +V +ALQQV RQ + Sbjct: 59 SLCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFY 118 Query: 809 
QRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPAS----------GTAVSESGTVEQMVD 958 G K+ ++S G + R+D ++ R S A G A SE G ++ D Sbjct: 119 DPVKMGNKEFKRSGVGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNAASEKGGSDKSGD 178 Query: 959 KLDHSKNVNQNNVQMSRAMNCFPVAGK-DGNSHSLAESCCMKDGSNPAETCVIELEAASG 1135 ++ +S + + ++ ++ N + DGN SL + GS P V + G Sbjct: 179 EVGNSDD--RGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEPEVHAVDD-----G 231 Query: 1136 CGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEV 1315 C S + + ++ T Q+ + P F E DG VNVVEGLKLYE F +EV Sbjct: 232 CTSSSKENDSHS--TPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCADTEV 289 Query: 1316 TKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYK 1495 +K V+L N++R+AG RG QT V KRP KGHGRE IQLG+P + +EDE K Sbjct: 290 SKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISAGTLK 349 Query: 1496 ERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFL 1675 +R+ EAIP L + + L QV VKPD C+IDF+NEGDHSQPH WPSWFGRPVC LFL Sbjct: 350 DRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVCVLFL 409 Query: 1676 TDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFG 1855 T+CD+ FGR +H GDY G+LKL + GSLL MQGKSAD AK AIPSLR+ IL+TF Sbjct: 410 TECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRILVTFT 469 Query: 1856 KYRPKKTLLPSEGTFFSSSAVNPLSI--PTSARPSSFSRYPSARKPYGIMPANGILHAQQ 2029 K +PKK+ +PS+G S V P S P +R + R+P K Y +P G+L A Sbjct: 470 KSQPKKS-MPSDGQRMPSPGVAPSSHWGPQPSRSPNHIRHPGP-KHYAPVPTTGVLQASP 527 Query: 2030 APQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXF 2200 + PNG+QP F AP+ + P+ PP++ GW+AA + Sbjct: 528 VRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPVPGTGVFL 587 Query: 2201 LPPGS 2215 PPGS Sbjct: 588 PPPGS 592 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. 
vesca] Length = 682 Score = 441 bits (1133), Expect = e-121 Identities = 267/642 (41%), Positives = 354/642 (55%), Gaps = 22/642 (3%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFP------LNGGRGSEVPQQWFVDERDGLISWLRGEFAAANAI 622 M SGN VV + MQ+P ++GG + P+QWF DERDG ISWLRGEFAAANAI Sbjct: 1 MTMPSGN--VVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAI 58 Query: 623 IDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLK 802 ID L H+R GE EYD V+GC+ QRR WT LHMQQ+F V +V YALQQV RQ + Sbjct: 59 IDSLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQR 118 Query: 803 QRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTAVSESGTV-------EQMVDK 961 + G KD ++S G + R++ V+E + + SG E++ Sbjct: 119 YYEPVKMGNKDYKRSNSGVGFKPRNEPVKEWHTASVEYRSYDGSGLEKVGSEMREEVKPG 178 Query: 962 LDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCG 1141 + K ++ + + + +S S A S G++ +E V+ GC Sbjct: 179 GEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSESEDAVVN----EGCT 234 Query: 1142 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1321 S + N++ N+ + +IP FV ET DG VNVV+GLKLYE FL +EV+K Sbjct: 235 SSIKENESNSIQIQNEKQNLSLIP--KTFVGNETFDGKTVNVVDGLKLYEEFLGDTEVSK 292 Query: 1322 FVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKER 1501 SL N++R G RG+L GQT V KRP KGHGREMIQLGIP +G EDE K+R Sbjct: 293 LFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEISAGISKDR 352 Query: 1502 KIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTD 1681 ++EAIPS L + D L QVL KPD C+IDFFNEGDHS PH WP WFGRPV LFLT+ Sbjct: 353 RMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHSHPHMWPPWFGRPVSVLFLTE 412 Query: 1682 CDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKY 1861 CD+ FG+ +G +H GDY G+L+L + GSLL++QGKSAD AK AIPS+RK IL+TF K Sbjct: 413 CDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADYAKHAIPSIRKQRILVTFTKS 472 Query: 1862 RPKKTLLPSEGTFFSS--SAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILHAQQAP 2035 +P+K+ P++G S + +P P R + R+P+ K Y +P G+L A Sbjct: 473 QPRKS-FPTDGQRLPSPGPSQSPYWSPPPGRSPNHIRHPAGPKHYAAVPTTGVLPAPPNR 531 Query: 2036 
QIMLSPNGVQPRFAAAPMVASPALPSG-----PPATVGWAAASSMNAXXXXXXXXXXXXF 2200 + NG+QP F AAP+ PA+P PP + GW AA F Sbjct: 532 PQLPPANGIQPLFVAAPV--GPAMPFPAPVVIPPGSPGWVAAP--RHPPPRMPLPGTGVF 587 Query: 2201 LPP--GSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNAS 2320 LPP P Q P T + + + +EK + +S Sbjct: 588 LPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSS 629 >ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] gi|550333016|gb|ERP57586.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] Length = 693 Score = 433 bits (1114), Expect = e-118 Identities = 265/600 (44%), Positives = 340/600 (56%), Gaps = 37/600 (6%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 607 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 608 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 787 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 788 LRQLKQRQR-----------HSYGQKDGR----KSAFGHRYGHRSDGVRESRVSPASGTA 922 RQ +Q+Q+ + +G+ GR S+ G GHR G G A Sbjct: 119 RRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGG------GGGGGDA 172 Query: 923 VSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCM------KD 1084 V E V V+ +HS N N + S G G S ++ K+ Sbjct: 173 VKEG--VNSSVE--NHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDNHKN 228 Query: 1085 GSNPAETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVN 1264 S A+ A ++S ++ ++NQ+ Q + P FVA+E DG MVN Sbjct: 229 SSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDGQMVN 288 Query: 1265 VVEGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGI 1444 VV+GLKLYEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+ Sbjct: 289 VVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGL 348 Query: 1445 PTNEGHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQ 1624 P + EDE+ T KER++E+IP+ L + + QV+ +KPD C+ID +NEGDHSQ Sbjct: 349 
PIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQ 408 Query: 1625 PHTWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLA 1804 PH WP WFG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLA Sbjct: 409 PHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLA 468 Query: 1805 KRAIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--PTSARPSSFSRYPSA 1978 K AIP ++K +L+TF K +PKK L ++G S AV P S P +R + R+P Sbjct: 469 KHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-V 526 Query: 1979 RKPYGIMPANGILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAAS 2149 K Y +P G+L + PNGVQP F P+ A P+ PP + GW +S Sbjct: 527 PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSS 586 >gb|ABK95394.1| unknown [Populus trichocarpa] Length = 694 Score = 432 bits (1112), Expect = e-118 Identities = 261/593 (44%), Positives = 339/593 (57%), Gaps = 30/593 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 607 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 608 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 787 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 788 LRQLKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTAVSESG------TVEQ 949 RQ +Q+Q+ Q + F Y H G R+ + S ++G G V++ Sbjct: 119 RRQQQQQQQQQQQQNHHHQQRF--YYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGDAVKE 176 Query: 950 MVDKL--DHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCM------KDGSNPAET 1105 V+ +HS N N + S G G S ++ K+ S A+ Sbjct: 177 GVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGNAQG 236 Query: 1106 CVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKL 1285 A ++S ++ ++NQ+ Q + P FVA+E DG MVNVV+GLKL Sbjct: 237 TFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDGQMVNVVDGLKL 296 Query: 1286 YENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHI 1465 YEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+P + Sbjct: 297 
YENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGLPIADAPA 356 Query: 1466 EDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSW 1645 EDE+ T KER++E+IP+ L + + QV+ +KPD C+ID +NEGDHSQPH WP W Sbjct: 357 EDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQPHMWPPW 416 Query: 1646 FGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSL 1825 FG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLAK AIP + Sbjct: 417 FGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLAKHAIPMI 476 Query: 1826 RKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--PTSARPSSFSRYPSARKPYGIM 1999 +K +L+TF K +PKK L ++G S AV P S P +R + R+P K Y + Sbjct: 477 KKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-VPKHYAAI 534 Query: 2000 PANGILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAAS 2149 P G+L + PNGVQP F P+ A P+ PP + GW +S Sbjct: 535 PTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSS 587 >ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine max] Length = 681 Score = 427 bits (1099), Expect = e-117 Identities = 250/606 (41%), Positives = 337/606 (55%), Gaps = 23/606 (3%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNG----GRGSEVPQ-----QWFVDERDGLISWLRGEFAAA 613 MA SGN VV + MQFP G G G E+ Q QWFVDERDGLI WLR EFAAA Sbjct: 1 MAMPSGN--VVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAA 58 Query: 614 NAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLR 793 NAIID L H+RV G+ GEYD V+G I QRR W L MQQ+F V DV +ALQQV R Sbjct: 59 NAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRR 118 Query: 794 QLKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASG----------TAVSESGTV 943 Q + G K+ RKS G+R+G R + V+E S T +E GT Sbjct: 119 QQRPLDPVKVGAKEFRKSGSGYRHGQRFEPVKEGYNSSVESYNQYDANVTVTGGTEKGT- 177 Query: 944 EQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELE 1123 +V+K + K+ + + + KD + + S +E E Sbjct: 178 -PVVEKSEEHKSGGKVEKVGDKGL-ASAEDKKDAITKHQTDGSLKSTRSTEGSLSNLESE 235 Query: 1124 AASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLD 1303 A +++ GD++ NQ ++Q + F+ E DG MVNVV+GLKLYE+ D 
Sbjct: 236 AVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFD 295 Query: 1304 SSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHR 1480 S+E+ VSL N++R +G +G+L G Q + +RP KGHGREMIQLG+P + E E+ Sbjct: 296 STEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENM 355 Query: 1481 TLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPV 1660 T K+ +E IPS I + + QV+ VKPD C++DF+NEGDHSQPH+WPSW+GRPV Sbjct: 356 TGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPV 415 Query: 1661 CNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLI 1840 LFLT+C++ FGR + S H GDY G +KL ++ GSLLVM+GKS+D AK A+PS+RK I Sbjct: 416 YILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRI 475 Query: 1841 LLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILH 2020 L+TF K +P+K+ L S+ +S+A + P +R + R+ K Y +P G+L Sbjct: 476 LVTFTKSQPRKS-LSSDAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYATLPTTGVLP 534 Query: 2021 AQQAPQIMLSPNGVQPRFAAAPMVAS---PALPSGPPATVGWAAASSMNAXXXXXXXXXX 2191 + M +P G+QP F AP+V PA + PP + GW A Sbjct: 535 SPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPAPGT 594 Query: 2192 XXFLPP 2209 FLPP Sbjct: 595 GVFLPP 600 >ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max] Length = 683 Score = 427 bits (1098), Expect = e-116 Identities = 263/643 (40%), Positives = 352/643 (54%), Gaps = 29/643 (4%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN-------GGRGSEVPQ------QWFVDERDGLISWLRGE 601 MA SGN VV + MQFP GG G E+ Q QWFVDERDGLI WLR E Sbjct: 1 MAMPSGN--VVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSE 58 Query: 602 FAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQV 781 FAAANAIID L H+RV G+ GEYD VVG I QRR W L MQQ+F V DV YALQQV Sbjct: 59 FAAANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQV 118 Query: 782 MDLRQLKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASG---------TAVSES 934 RQ + G K+ RKS G+R+G R + V+E S T +E Sbjct: 119 AWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFESVKEGYNSSVESYSHDANVAVTGGTEK 178 Query: 935 
GTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVI 1114 GT +V+K + K+ + + + KD ++ +E S + Sbjct: 179 GT--PVVEKSEEHKSGGKVEKVGDKGLASVEEK-KDAITNHQSEGSLKSARSTEGSLSNL 235 Query: 1115 ELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYEN 1294 E EA G +++ G++ NQ ++Q + F+ E DG VNVV+GLKLY++ Sbjct: 236 ESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDGLKLYDD 295 Query: 1295 FLDSSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIED 1471 DS+EV VSL N++R +G +G+L G Q + +RP KGHGREMIQLG+ + E Sbjct: 296 LFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIADAPAEG 355 Query: 1472 EHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFG 1651 E+ T K+ +E+IPS I + + QV+ VKPD C++DF+NEGDHSQPH+WPSW+G Sbjct: 356 ENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYG 415 Query: 1652 RPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRK 1831 RPV LFLT+C++ FGR + S H GDY GS+KL ++ GSLLVMQGKS+D AK A+PS RK Sbjct: 416 RPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHALPSTRK 475 Query: 1832 TLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANG 2011 IL+TF K +P+K+ L S+ +S+ + P +R + R+ K Y +P G Sbjct: 476 QRILVTFTKSQPRKS-LSSDAQQLASAVASSHWGPPPSRSPNHVRHHVGPKHYATLPTTG 534 Query: 2012 ILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSGPPATV-----GWAAASSMNAXXXXX 2176 +L A M +P G+QP F AAP+V P +P P + GW AA Sbjct: 535 VLPAPPIRPQMAAPVGMQPLFVAAPVV--PPMPFSAPVPIPAGSTGWTAAPPPRHPPPRV 592 Query: 2177 XXXXXXXFLPP-GSVHIYPIQHLPGTPISVQTIYDNSRSEKPT 2302 FLPP GS + Q LP + ++ N +E PT Sbjct: 593 PAPGTGVFLPPSGSGN--SSQQLPASTLAEV----NPSTETPT 629 >gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris] Length = 671 Score = 426 bits (1095), Expect = e-116 Identities = 268/645 (41%), Positives = 350/645 (54%), Gaps = 24/645 (3%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGRGSEVP--------QQWFVDERDGLISWLRGEFAAAN 616 MA SGN VV + MQFP NGG G+ V QQWFVDERDGLI WLR EFAAAN Sbjct: 1 MAMPSGN--VVIQDKMQFP-NGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAAN 57 Query: 617 
AIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQ 796 AIID L H+RV G+ GEYD V+G I QRR W L MQQ+F V DV Y LQQV +Q Sbjct: 58 AIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQ 117 Query: 797 LKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRE---------SRVSPASGTAVSESGTVEQ 949 + G K+ RK G+RYGHR + +E S A+ T E GT Sbjct: 118 QRPLDPVKVGAKEVRKPGPGYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGT--P 175 Query: 950 MVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 1129 VDK + K+ ++ + + P KD + GS+ +E EA Sbjct: 176 TVDKSEEHKSGSKVEKVGDKGL-ASPEEKKDAIIKHQTDGNLKSTGSSEGYLSNLESEAV 234 Query: 1130 SGCGSQASNS-GDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDS 1306 SNS G+++ +Q ++Q F+ E DG MVN+ +GLKLYE+ DS Sbjct: 235 VVNDEFISNSKGNDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDS 294 Query: 1307 SEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRT 1483 +EV+ VSL N++R +G +G+L G Q V +RP KGHGREMIQLG+P + +E E+ T Sbjct: 295 TEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGENMT 354 Query: 1484 LNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVC 1663 K +E IPS I + + QV+ KPD C++DF+NEGDHSQPH+WPSWFGRPV Sbjct: 355 GASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVY 414 Query: 1664 NLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLIL 1843 LFLT+C++ FGR + S H GDY GSLKL ++ GSLL MQGKS D AK A+PS+RK IL Sbjct: 415 TLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQRIL 474 Query: 1844 LTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILHA 2023 +TF K +PKK+ +PS+ A + P +R + R+ K Y +P G+L A Sbjct: 475 VTFTKSQPKKS-VPSDAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTTGVLPA 533 Query: 2024 QQAPQIMLSPNGVQPRFAAAPMVAS---PALPSGPPATVGWAAASSMNAXXXXXXXXXXX 2194 + + G+QP F AAP+V PA S PP + GW A Sbjct: 534 PPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIPAPGTG 593 Query: 2195 XFL-PPGSVHIYPIQHLP-GTPISVQTIYDNSRSEKPTSNSNASD 2323 FL PPGS + Q LP GT V + + + N ++D Sbjct: 594 VFLPPPGSGNSQ--QQLPAGTLAEVNPSIETPTTMQEKENGKSND 636 >ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 
[Cucumis sativus] gi|449481289|ref|XP_004156139.1| PREDICTED: uncharacterized LOC101210274 [Cucumis sativus] Length = 684 Score = 423 bits (1087), Expect = e-115 Identities = 264/661 (39%), Positives = 358/661 (54%), Gaps = 39/661 (5%) Frame = +2 Query: 473 SGNSVVVSVEPMQFPLNGGRGSEV----PQQWFVDERDGLISWLRGEFAAANAIIDVLMD 640 SG V VS G G E+ P+ WF DERDG ISWLRGEFAA+NAIID L Sbjct: 18 SGGGVAVS----------GGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAIIDALCH 67 Query: 641 HIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQRHS 820 H+R GE GEYD V+GCI QRR WT LHMQQ+F V +V YALQQV RQ + Sbjct: 68 HLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVK 127 Query: 821 YGQKDGRKSA--FGHRYGHRSDG-VRESRVSPASGTAVSESGT------VEQMVDKLDHS 973 G K R+ F + GHR++ V+E ++ A S T VEQ+ + D S Sbjct: 128 VGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQVSNTCDES 187 Query: 974 KNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVI----ELEAASGCG 1141 K ++ ++S + V KD +H +S C + E I ++E GC Sbjct: 188 KASGEDE-KLSEKDSGSAVDNKD--THGKDQSNCKTKSAENLEDNAINKDSQVEPDDGCS 244 Query: 1142 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1321 S S+ +Q+ Q P FVA E DG MVNV++GLKL+E LD +EV+K Sbjct: 245 S--SHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSK 302 Query: 1322 FVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKER 1501 +SL N++RA+G RG+ GQT V KRP KGHGREMIQLG P + ED++ K+R Sbjct: 303 LLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDR 362 Query: 1502 KIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTD 1681 +IE IPS L + D L QV+ VKPD C+IDF+NEGDHSQPH WPSWFGRPV L LT+ Sbjct: 363 RIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTE 422 Query: 1682 CDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKY 1861 C++ FGR +G++H G+Y G++KL + G+LLV+QGKSAD AK A+P++RK IL+T K Sbjct: 423 CEITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKS 482 Query: 1862 RPKKTLLPSEG--TFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILHAQQAP 2035 +PK+ P++G T + + P++ P+ R +KPY +P+ G+L Sbjct: 483 
QPKRA-APADGQRTSLNVGTFSGWGPPSARSPN--PRLSPGQKPYPTVPSTGVLPVPPIR 539 Query: 2036 QIMLSPNGVQPRF---AAAPMVASPA-LPSGPPATVGWAAASSMNAXXXXXXXXXXXXFL 2203 M PNG+ P A+PM +P +P+GP A W A + + Sbjct: 540 PQMAPPNGIPPLIVPPVASPMPFTPVPIPTGPSA---WPTAHTRHPPPRLPVPGTGVFLP 596 Query: 2204 PPGSVHI---YPIQHLPGTPISVQTIYDNSR-------------SEKPTSNSNASDCSTN 2335 PPGS P Q LP + I ++ + EKP + + +C+ + Sbjct: 597 PPGSSSAPTPSPQQQLPISNIETGSLSEKENGLTKSDHSSGTFPGEKPDAKAQRQECNGS 656 Query: 2336 I 2338 I Sbjct: 657 I 657 >ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis] gi|223533099|gb|EEF34858.1| conserved hypothetical protein [Ricinus communis] Length = 697 Score = 422 bits (1085), Expect = e-115 Identities = 261/618 (42%), Positives = 340/618 (55%), Gaps = 37/618 (5%) Frame = +2 Query: 467 ALSGNSVVVSVEPMQFPL---------NGGRGSEVPQQ-------WF-VDERDGLISWLR 595 A+ +VV+S + +QFP NGG G+E+ QQ WF VDERDG ISWLR Sbjct: 2 AMPPGNVVIS-DKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60 Query: 596 GEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQ 775 GEFAAANAIID L H+R GE GEYD V+GCI QRR W LHMQQ+F V +V ALQ Sbjct: 61 GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120 Query: 776 QVMDLRQLK-----QRQRHSY-------GQKD-GRKSAFGHRYGHRSDG--VRESRVSPA 910 QV +Q + Q Q+H Y G KD R S+ G GHR G V+E Sbjct: 121 QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVNYGAE 180 Query: 911 SGTAVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGS 1090 S + E+ + + N ++ A + KD S ++ S Sbjct: 181 SHGLDGNTSGNEKFNEIKSGGDSGRLENKSLATAED-----KKDAASKPHVDNLKSSGNS 235 Query: 1091 NPAETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVV 1270 + + +E EA + Q+S ++ NQ + P FV E DG VNVV Sbjct: 236 EGSLSGNLETEAEA-VHEQSSPKEHDSHFIQNQIVKLNLTTTPKTFVGAEMVDGKSVNVV 294 Query: 1271 EGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPT 1450 +GLKLYE LD EV+K VSL N++RAAG +G+ GQ V KRP KGHGREMIQLG+P Sbjct: 295 DGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPMKGHGREMIQLGLPI 354 Query: 1451 
NEGHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPH 1630 + E+E+ K+RKIE+IP+ L + + Q++ +KPD C+ID +NEGDHSQPH Sbjct: 355 ADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSCIIDIYNEGDHSQPH 414 Query: 1631 TWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKR 1810 WP WFG+P+ LFLT+CD+ FGR + ++H GDY GSLKLP+ GSLLVMQGK+ D AK Sbjct: 415 MWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGSLLVMQGKATDFAKH 474 Query: 1811 AIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--PTSARPSSFSRYPSARK 1984 AIP++RK +LLTF K +PKK + S+G +S A +P S P +R + R+P K Sbjct: 475 AIPAIRKQRVLLTFTKSQPKK-FVQSDGQRLTSPAASPSSHWGPPPSRSPNHIRHP-VSK 532 Query: 1985 PYGIMPANGILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSM 2155 Y +P G+L A + PNGVQP F AP+ A P+ PP + GW AA Sbjct: 533 HYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVPMPPVSTGWPAAPRH 592 Query: 2156 NAXXXXXXXXXXXXFLPP 2209 FLPP Sbjct: 593 PPNRLPVPVPGTGVFLPP 610 >gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris] Length = 691 Score = 418 bits (1074), Expect = e-114 Identities = 271/649 (41%), Positives = 351/649 (54%), Gaps = 31/649 (4%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGR---GSEVP---QQWFVDERDGLISWLRGEFAAANAI 622 MA SGN + E +QFP+ GG G E+ QQWFVDERDG I WLR EFAAANAI Sbjct: 1 MAMPSGNGGMP--EKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAI 58 Query: 623 IDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLK 802 ID L H+RV GE G YD VVG I QRR WT L MQQ+F V++V YALQQV RQ + Sbjct: 59 IDSLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQR 118 Query: 803 QRQRHSYGQKDGRKSAFGHRYG-HRSDGVRE----SRVSPASG-------------TAVS 928 G K+ RK G R G HR++ +E SR A AV Sbjct: 119 FVDPAKAGSKEFRKFGSGFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGREMNAVV 178 Query: 929 ESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCF--PVAGKDGNSHSLAESCCMKDGSNPAE 1102 +G VE+ +D + +N + N P KD ++ + G+ Sbjct: 179 VTGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGILNGSGNFQGS 238 Query: 1103 TCVIELEAASGCGSQASNS-GDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1279 E EA SNS G+++ NQ ++Q F+ E +G MVNVV+GL Sbjct: 239 
LSSSECEAVGENEECTSNSKGNDSHSVQNQHQSQNASTIGKTFIGNEMFEGKMVNVVDGL 298 Query: 1280 KLYENFLDSSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNE 1456 KLYE+ +DS+EV+K VSL N+MR AG RG+ G QT V KRP KG GREMIQLG+P + Sbjct: 299 KLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGRGREMIQLGVPIAD 358 Query: 1457 GHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTW 1636 + ++ T K++K+E+IPS I + L QV+ VKPD C++DFFNEGDHSQP++ Sbjct: 359 APPDVDNVTGLSKDKKVESIPSLFEDIIERLAASQVMTVKPDACIVDFFNEGDHSQPNSC 418 Query: 1637 PSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAI 1816 P WFGRPV LFLT+CD+ FGR + S+H GDY G++KL ++ GSLLVMQGKS DLAK A+ Sbjct: 419 PPWFGRPVYMLFLTECDITFGRTIVSDHPGDYRGAVKLSLVPGSLLVMQGKSTDLAKHAL 478 Query: 1817 PSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGI 1996 PS+ K IL+TF K +P KT LP++ S AV P R + R+ K Y Sbjct: 479 PSIHKQRILVTFTKSQP-KTSLPNDSQRL-SPAVTSHWAPPQGRTPNHMRHQLGPKHYPT 536 Query: 1997 MPANGILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSGPP---ATVGWAAASSMNAXX 2167 +PA G+L AP I PNG+Q F P+ + S P + GWA+A + Sbjct: 537 IPATGVL---PAPSIRAPPNGMQTLFVPTPVAPPISFASPVPIPLGSTGWASAPQRH-PP 592 Query: 2168 XXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSN 2314 FLPP QHLPG V + + + K + SN Sbjct: 593 PRMPVPGTGVFLPPPGSGTTSSQHLPGVVSEVNLSGETTSTGKESLKSN 641 >ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550333015|gb|EEE88914.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 675 Score = 418 bits (1074), Expect = e-114 Identities = 261/594 (43%), Positives = 334/594 (56%), Gaps = 31/594 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 607 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 608 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 787 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 788 
LRQLKQRQR-----------HSYGQKDGR----KSAFGHRYGHRSDGVRESRVSPASGTA 922 RQ +Q+Q+ + +G+ GR S+ G GHR G G A Sbjct: 119 RRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGG------GGGGGDA 172 Query: 923 VSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAE 1102 V E V V+ +HS N N + S G G S K ++ + Sbjct: 173 VKEG--VNSSVE--NHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKADATAKSHTDNHK 228 Query: 1103 TCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLK 1282 S +Q + SG++ + + + Q + P FVA+E DG MVNVV+GLK Sbjct: 229 N--------SSGNAQGTFSGNSEAVANEK---QNLAITPKTFVAEEKIDGQMVNVVDGLK 277 Query: 1283 LYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGH 1462 LYEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+P + Sbjct: 278 LYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGLPIADAP 337 Query: 1463 IEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPS 1642 EDE+ T K +E+IP+ L + + QV+ +KPD C+ID +NEGDHSQPH WP Sbjct: 338 AEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQPHMWPP 396 Query: 1643 WFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPS 1822 WFG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLAK AIP Sbjct: 397 WFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLAKHAIPM 456 Query: 1823 LRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--PTSARPSSFSRYPSARKPYGI 1996 ++K +L+TF K +PKK L ++G S AV P S P +R + R+P K Y Sbjct: 457 IKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-VPKHYAA 514 Query: 1997 MPANGILHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAAS 2149 +P G+L + PNGVQP F P+ A P+ PP + GW +S Sbjct: 515 IPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSS 568 >gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 413 bits (1061), Expect = e-112 Identities = 258/595 (43%), Positives = 335/595 (56%), Gaps = 33/595 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN-----------------GGRGSEVPQ----QWFVDERDG 577 MA SGN 
VV + MQFP GG G E+ Q QW DERDG Sbjct: 1 MAMPSGN--VVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDG 58 Query: 578 LISWLRGEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTD 757 I WLRGEFAA+NAIID L H+R GE+GEY+ V+ CI QRR W LHMQQ+F V + Sbjct: 59 FIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAE 118 Query: 758 VGYALQQVMDLRQLKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASG------T 919 V YALQQV R+ + + G K+ ++S G + G R + +E + S T Sbjct: 119 VSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVAKEGQNSGVDSDGNSTVT 177 Query: 920 AVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPA 1099 AVSE E+ +K + K+ + ++ + F KD S K + A Sbjct: 178 AVSERN--ERGSEKREEVKSCGEVG-KVEDKCSTFTEDKKDTGS---------KPHAGDA 225 Query: 1100 ETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1279 E+ ++ GC S S ++ NQ+ Q + P FV E DG MVNVV+GL Sbjct: 226 ESVTEDVNG--GCTS--SYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 281 Query: 1280 KLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEG 1459 KLYE D EV VSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P + Sbjct: 282 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADA 341 Query: 1460 HIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWP 1639 ++DE+ K+R+IE IP L + L QV+ VKPD C+ID +NEGDHSQP WP Sbjct: 342 PLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMWP 401 Query: 1640 SWFGRPVCNLFLTDCDVIFGRAV-GSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAI 1816 WFG+PVC +FLT+CD+ FGR V ++H GDY GSLKL + GSLLVMQGKSAD AK A+ Sbjct: 402 PWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHAL 461 Query: 1817 PSLRKTLILLTFGKY-RPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYG 1993 PS+RK IL+TF KY +PKK+ ++ S + + P +R + R+ + K Y Sbjct: 462 PSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHYA 521 Query: 1994 IMPANGILHAQQ-APQIMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAA 2146 ++P G+L A PQI S +GVQP F A AP ++ PA PP + GW AA Sbjct: 522 VIPTTGVLPAPPIRPQIPPS-SGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAA 575 >ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum lycopersicum] 
Length = 641 Score = 410 bits (1054), Expect = e-111 Identities = 264/644 (40%), Positives = 362/644 (56%), Gaps = 25/644 (3%) Frame = +2 Query: 473 SGNSVV-VSVE-PMQFPLNGGRGSEVP--------QQWF----VDERDGLISWLRGEFAA 610 SGN+ V V+V P + NGG G V QQWF VDERDG ISWLRGEFAA Sbjct: 3 SGNAAVAVAVAVPEKKHSNGGGGEAVAVPRQHQHQQQWFHPQQVDERDGFISWLRGEFAA 62 Query: 611 ANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDL 790 +NAIID L H+R+ GE GEYD V+GC+ QRR W + LHMQQ+ V +V Y+L QV + Sbjct: 63 SNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLHQVEWM 122 Query: 791 RQLKQRQR--HSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASGTAVSESGTVEQMVDKL 964 +Q K + G+++G K G G +S+G+++ + S ++ ++ + V+K+ Sbjct: 123 KQQKGFDGGVNKVGKRNGSKGGGGG--GWKSEGLKDGKESQGQNFSL-DAHSKTNGVEKI 179 Query: 965 DHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCGS 1144 D + + ++ A K + S+ S C + G + E V + + S Sbjct: 180 DVVEEKQGDKKEL---------AAKPEANSSVKGSVCTEAGDSQGE--VDKTDDKRDSNS 228 Query: 1145 QASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKF 1324 + S++ ++ S Q T+K P FVA E DG VNVV+G+KLYE L SSEV+K Sbjct: 229 EGSSNVESE-SHSFQIPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKL 287 Query: 1325 VSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERK 1504 V+L N++RAAG RG+L Q + KRP KGHGREM+QLG+P + E+E YK+RK Sbjct: 288 VTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEESAISTYKDRK 347 Query: 1505 IEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDC 1684 EAIP L + D L Q L VKPD C+ID FNEGDHSQPH WP W+GRP+ LFLTDC Sbjct: 348 TEAIPGLLQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISTLFLTDC 407 Query: 1685 DVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYR 1864 ++ FG+ +G +H GDY GSLKL + GS+LVMQG+S + AK AIPS+RK +L+TF K + Sbjct: 408 EMTFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSIRKQRMLVTFTKLQ 467 Query: 1865 PKKTLLPSEGTFFSSSAVNPLS--IPTSARPSSFSRYPSARKPYGIMPANGILHAQQA-P 2035 ++ + + F SSA P+S +P S R S+ R P K YG MPA G+L P Sbjct: 468 LRR-IKSGDSQRFPSSAGGPVSQWVPPS-RSSNHIRRPFGPKHYGSMPATGVLPIPGVRP 525 Query: 2036 
QIMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAASSMNAXXXXXXXXXXXXFLP 2206 Q +P +QP F AP + PA + PPA+ GW A + FLP Sbjct: 526 Q--FAPANMQPIFVPATVAPAMPFPAPVALPPASAGW-AVPPIRHPPPRLPLPGTGVFLP 582 Query: 2207 PGSVHIYPIQHLPGT---PISVQTIYDNSRSEKPTSNSNASDCS 2329 PGS ++P P+S T+ S+ +S DC+ Sbjct: 583 PGS-GTSSTDNIPAENTGPLSDSTVSQKVNSD--SSEVQTQDCN 623 >gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 409 bits (1050), Expect = e-111 Identities = 258/596 (43%), Positives = 336/596 (56%), Gaps = 34/596 (5%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLN-----------------GGRGSEVPQ----QWFVDERDG 577 MA SGN VV + MQFP GG G E+ Q QW DERDG Sbjct: 1 MAMPSGN--VVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDG 58 Query: 578 LISWLRGEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTD 757 I WLRGEFAA+NAIID L H+R GE+GEY+ V+ CI QRR W LHMQQ+F V + Sbjct: 59 FIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAE 118 Query: 758 VGYALQQVMDLRQLKQRQRHSYGQKDGRKSAFGHRYGHRSDGVRESRVSPASG------T 919 V YALQQV R+ + + G K+ ++S G + G R + +E + S T Sbjct: 119 VSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVAKEGQNSGVDSDGNSTVT 177 Query: 920 AVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPA 1099 AVSE E+ +K + K+ + ++ + F KD S K + A Sbjct: 178 AVSERN--ERGSEKREEVKSCGEVG-KVEDKCSTFTEDKKDTGS---------KPHAGDA 225 Query: 1100 ETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1279 E+ ++ GC S S ++ NQ+ Q + P FV E DG MVNVV+GL Sbjct: 226 ESVTEDVNG--GCTS--SYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 281 Query: 1280 KLYENFLDSSEVTKFVSLANEMRAAGHRGEL-SGQTLVTLKRPTKGHGREMIQLGIPTNE 1456 KLYE D EV VSL N++RAAG RG+L +GQT V KRP KGHGREMIQLG+P + Sbjct: 282 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIAD 341 Query: 1457 GHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTW 1636 ++DE+ K+R+IE IP L + L QV+ VKPD C+ID +NEGDHSQP W Sbjct: 342 
APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 401 Query: 1637 PSWFGRPVCNLFLTDCDVIFGRAV-GSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRA 1813 P WFG+PVC +FLT+CD+ FGR V ++H GDY GSLKL + GSLLVMQGKSAD AK A Sbjct: 402 PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 461 Query: 1814 IPSLRKTLILLTFGKY-RPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPY 1990 +PS+RK IL+TF KY +PKK+ ++ S + + P +R + R+ + K Y Sbjct: 462 LPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHY 521 Query: 1991 GIMPANGILHAQQ-APQIMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAA 2146 ++P G+L A PQI S +GVQP F A AP ++ PA PP + GW AA Sbjct: 522 AVIPTTGVLPAPPIRPQIPPS-SGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAA 576 >ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max] Length = 664 Score = 407 bits (1047), Expect = e-111 Identities = 266/651 (40%), Positives = 354/651 (54%), Gaps = 25/651 (3%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGR---GSEVP--QQWFVDERDGLISWLRGEFAAANAII 625 MA SGN+V+ E +QFP GG GSE+ QQWFVDERDG I WLR EFAAANAII Sbjct: 1 MAMPSGNAVMP--EKLQFPGGGGAPGGGSEIHFRQQWFVDERDGFIGWLRSEFAAANAII 58 Query: 626 DVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQ 805 D L H+R GE GEY+ VVG I QRR WT L MQQ+F V++V YALQQV RQ + Sbjct: 59 DSLCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVSWRRQQRV 118 Query: 806 RQRHSYGQKDGRKSAFG-----HRYGHRSDGVRESRVSPASGT-AVSESGTVEQMVDKLD 967 G K+ RK G HR+ DG S S GT AV +G VE+ + Sbjct: 119 VDPAKTGAKEFRKFGLGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAGGVEKGACVTE 178 Query: 968 HSKNVNQNNV--QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCG 1141 + + + M P KD ++ ++ ++ E EA Sbjct: 179 KNGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGILKGSRNSQGSLSSSECEAVGVNE 238 Query: 1142 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1321 SNS +N + F+ E DG MVNVV+GLKLYE+ LDS+EV+K Sbjct: 239 ECVSNSKENDSIMGKF------------FIGNEMFDGKMVNVVDGLKLYEDLLDSTEVSK 286 Query: 1322 FVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKE 1498 VSL N++R AG RG+ G QT V KRP KGHGREMIQLG+P + + ++ T K+ Sbjct: 
287 LVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTGISKD 346 Query: 1499 RKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLT 1678 +K+E+IPS I + L QV+ VKPD C++DFFNEG+HS P+ WP WFGRPV LFLT Sbjct: 347 KKVESIPSLFQDIIERLAASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPVYTLFLT 406 Query: 1679 DCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGK 1858 +CD+ FGR + S+H G++ G+++L ++ GSLLVMQGKS D AK A+PS+ K I++TF K Sbjct: 407 ECDMTFGRIIVSDHPGEFRGAVRLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIITFTK 466 Query: 1859 YRPKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILHAQQAPQ 2038 +PK + LP++ + A + + P S P+ R+ K Y +PA +L AP Sbjct: 467 SQPKCS-LPNDSQRLAPPAASHWAPPQSRSPNHV-RHQLGPKHYPTVPATVVL---PAPS 521 Query: 2039 IMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXFLPP 2209 I PN +QP F AP+ + P+ PP + GW +A S + PP Sbjct: 522 IHAPPNSMQPLFVPAPVAPPMSFPTPVPIPPGSTGWTSAPSRHPPPRIPVPGTGVFLPPP 581 Query: 2210 GSVHIYPIQHLPGT----PISVQTI----YDNSRSEKPTSNSNASDCSTNI 2338 GS QHLP T SV+T+ +N +S T++S NI Sbjct: 582 GSG--TSSQHLPCTVPEVNPSVETLTVSGKENGKSNHNTNSSPKGKMDGNI 630 >ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max] Length = 626 Score = 402 bits (1034), Expect = e-109 Identities = 263/638 (41%), Positives = 354/638 (55%), Gaps = 17/638 (2%) Frame = +2 Query: 461 MAALSGNSVVVSVEPMQFPLNGGRGSEVP--QQWFVDERDGLISWLRGEFAAANAIIDVL 634 MA SGN+V+ E +QFP GG GSE+ QQWFVDERDG I WLR EFAAANAIID L Sbjct: 1 MAMPSGNAVMP--EKLQFP-GGGGGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSL 57 Query: 635 MDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQR 814 H+R GE GEYD VVG I QRR WT L MQQ+F V++V ALQQV RQ + Sbjct: 58 CHHLRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVDL 117 Query: 815 HSYGQKDGRKSAFGHRYG-HR----SDGVRESRVSPASGT-AVSESGTVEQ---MVDKLD 967 G K+ RK G R G HR DG S S GT AV +G VE+ + +K Sbjct: 118 AKTGAKEFRKFGSGIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVAGGVEKGTPLTEKNG 177 Query: 968 HSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCGSQ 1147 K+ + ++++ P KD ++ ++ G++ E EA Sbjct: 178 
EIKSGGKVGTMDNKSL-ASPEERKDTITNHQSDGILKGSGNSQGSLSTSECEAVGVNEEC 236 Query: 1148 ASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKFV 1327 SNS +N S +T F+ E DG MVNVV+GLKLYE+ LD +EV+K V Sbjct: 237 VSNSKEND---STMGKT---------FIGNEMFDGKMVNVVDGLKLYEDLLDRTEVSKLV 284 Query: 1328 SLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERK 1504 SL N++R AG RG+ G QT V KRP KGHGREMIQLG+P + + ++ T K++K Sbjct: 285 SLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTGISKDKK 344 Query: 1505 IEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDC 1684 +E+IPS I L QV+ VKPD C++DFFNEG+HS P+ WP WFGRP+ LFLT+C Sbjct: 345 VESIPSLFQDIIKRLVASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPLYILFLTEC 404 Query: 1685 DVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYR 1864 D+ FGR + S+H G++ G++ L ++ GSLLVMQGKS D AK A+PS+ K I++TF K + Sbjct: 405 DMTFGRIIVSDHPGEFRGAVTLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIVTFTKSQ 464 Query: 1865 PKKTLLPSEGTFFSSSAVNPLSIPTSARPSSFSRYPSARKPYGIMPANGILHAQQAPQIM 2044 P+ + LP++ + A P P +R + R+ K Y + A G+L A Sbjct: 465 PRSS-LPNDSERLAPPAA-PHWAPPPSRSPNHVRHQLGPKHYPTVQATGVLPA------- 515 Query: 2045 LSPNGVQPRFAAAPM-VASP-ALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXFLPP 2209 PNG+QP F P+ VASP + P+ PP ++GW +A + PP Sbjct: 516 --PNGMQPLFVPVPVPVASPMSFPTPVPIPPGSIGWTSAPPRHPPPRIPVPGTGVFLPPP 573 Query: 2210 GSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASD 2323 GS I+ + P + +N +S +NS A + Sbjct: 574 GSGTIHEVN--PSVETWTVSGKENGKSNHSKTNSEAEE 609 >ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum] Length = 638 Score = 400 bits (1027), Expect = e-108 Identities = 251/601 (41%), Positives = 338/601 (56%), Gaps = 20/601 (3%) Frame = +2 Query: 473 SGNSVVVSVEPMQFPLNGGRGSEVP--------QQWF----VDERDGLISWLRGEFAAAN 616 SGN+ V E M GG V QQWF VDERDG ISWLRGEFAA+N Sbjct: 3 SGNAAVAVPEKMNGNGVGGEAVAVALPRQHQHQQQWFHPQQVDERDGFISWLRGEFAASN 62 Query: 617 AIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQ 796 AIID L H+R+ GE GEYD V+GC+ QRR W + LHMQQ+ V +V Y+L QV Sbjct: 63 
AIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLHQV---EW 119 Query: 797 LKQRQRHSYGQK--DGRKSAFGHRYGHRSDGVRESRVSPASGTAVSESGTVEQMVDKLDH 970 +KQ++ G K + R + G G +S+G+++ + S ++ ++ + V+K+D Sbjct: 120 MKQQKGFDGGVKKVEKRNGSRGGGGGWKSEGLKDGKESQGQNFSL-DAHSKTNGVEKIDV 178 Query: 971 SKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCGSQA 1150 V++ + A + NS S+ S C + G + E V + + S+ Sbjct: 179 --------VEVKQGEKKELAANPEANS-SVKSSVCTEAGDSQGE--VDKTDDKRDSNSEG 227 Query: 1151 SNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKFVS 1330 S++ ++ S Q T+K P FVA E DG VNVV+G+KLYE L SSEV+K ++ Sbjct: 228 SSNVESE-SHSIQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLLT 286 Query: 1331 LANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERKIE 1510 L N++RAAG RG+L Q + KRP KGHGREM+QLG+P + E+E YK+RK E Sbjct: 287 LVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEEAAISTYKDRKTE 346 Query: 1511 AIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDCDV 1690 AIP + D L Q L VKPD C+ID FNEGDHSQPH WP W+GRP+ LFLTDC++ Sbjct: 347 AIPGLFQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISMLFLTDCEM 406 Query: 1691 IFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYRPK 1870 FG+ +G +H GDY GSLKL + GS+LVMQG+S + AK AIPS RK IL+TF K + + Sbjct: 407 TFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSTRKQRILVTFTKLQLR 466 Query: 1871 KTLLPSEGTFFSSSAVNPLS--IPTSARPSSFSRYPSARKPYGIMPANGILHAQQA-PQI 2041 + + ++ F SSA P+S +P S P+ R P K YG M G+L PQ Sbjct: 467 R-IKSADSQRFPSSAGGPVSQWVPPSRSPNHIRR-PFGPKHYGSMSTTGVLPIPGVRPQ- 523 Query: 2042 MLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAASSMNAXXXXXXXXXXXXFLPPG 2212 +P +QP F AP + PA + PPA+ GW A + FLPPG Sbjct: 524 -FAPANMQPIFVPATVAPAMPFPAPVALPPASAGW-AVPPLRHPPPRLPLPGTGVFLPPG 581 Query: 2213 S 2215 S Sbjct: 582 S 582