BLASTX nr result
ID: Zingiber23_contig00022223
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00022223 (2388 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   473   e-130
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              466   e-128
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   447   e-123
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   443   e-121
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     439   e-120
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   431   e-118
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   431   e-118
gb|ABK95394.1| unknown [Populus trichocarpa]                          431   e-118
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   430   e-117
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   430   e-117
gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus...   427   e-117
gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus...   417   e-113
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   417   e-113
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   416   e-113
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...   414   e-113
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   411   e-112
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...   410   e-111
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   409   e-111
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...
399 e-108 ref|NP_001145739.1| uncharacterized protein LOC100279246 [Zea ma... 397 e-107 >ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera] Length = 698 Score = 473 bits (1218), Expect = e-130 Identities = 297/688 (43%), Positives = 382/688 (55%), Gaps = 33/688 (4%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANA 366 MA SGN VV + MQFP GGRG +E+ +QWF DERDG ISWLRGEFAAANA Sbjct: 1 MAMPSGN--VVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANA 58 Query: 367 IIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQL 546 IID L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ Sbjct: 59 IIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQ 118 Query: 547 KQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKL 711 + K+ ++ G +R G R + ++S S + + SGT+E+ + + Sbjct: 119 RHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIY 178 Query: 712 DHSKNVNQNNV-----QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 876 D K ++ +V A AG D + A SC ++ C I A Sbjct: 179 DDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETEA 238 Query: 877 SGC---GSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFL 1047 + GS +N NQ+ +P FV E DG VNVV+GLKLYE Sbjct: 239 NDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELF 298 Query: 1048 DSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHR 1227 D SEV+KFVSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P + +EDE Sbjct: 299 DDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESV 358 Query: 1228 TLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPV 1407 K+R+ E+IPS L + L QVL VKPD C+IDF+NEGDHSQPH WP+WFGRPV Sbjct: 359 VGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPV 418 Query: 1408 CNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLI 1587 C LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSAD AK AIPSLRK I Sbjct: 419 CILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRI 478 Query: 1588 LLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQH 1767 L+TF 
K +PKKT+ +A + + +R + R+P K YG +P G+ Sbjct: 479 LVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLP 538 Query: 1768 AQQAPQ--IMLSPNGVQPRF---AAAPMVASPA---LPSGPPATVGWAAASSMNAXXXXX 1923 A P + PNG+QP F A AP + PA LP+G P GW AA + Sbjct: 539 APAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSP---GWPAAPPRHPPPRLP 595 Query: 1924 XXXXXXXFLPPGSVHIYPIQHL--PGTPISVQTIYDNSRSE-KPTSNSNASDCSTNITTD 2094 PPGS + QH+ T SV+T + S+SN++ S D Sbjct: 596 VPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSSNSNTVSPKGKLD 655 Query: 2095 LSEPVPECNG--DGWCLDTAAVAPNQEQ 2172 ECNG D +D AV ++Q Sbjct: 656 GKVHRQECNGSMDETGVDERAVTKEEQQ 683 >emb|CBI26785.3| unnamed protein product [Vitis vinifera] Length = 672 Score = 466 bits (1199), Expect = e-128 Identities = 289/664 (43%), Positives = 376/664 (56%), Gaps = 35/664 (5%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANA 366 MA SGN VV + MQFP GGRG +E+ +QWF DERDG ISWLRGEFAAANA Sbjct: 1 MAMPSGN--VVISDKMQFPGGGGRGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANA 58 Query: 367 IIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQL 546 IID L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ Sbjct: 59 IIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQ 118 Query: 547 KQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKL 711 + K+ ++ G +R G R + ++S S + + SGT+E+ + + Sbjct: 119 RHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIY 178 Query: 712 DHSKNVNQNNV-----QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 876 D K ++ +V A AG D + A SC ++ C I A Sbjct: 179 DDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCSKSSENSEGSRCGISETEA 238 Query: 877 SGC---GSQASNSGDNTVMTSN----QDRTQKVIP--APNEFVAKETCDGMMVNVVEGLK 1029 + G+ N +M +N Q++ +K P +P FV E DG VNVV+GLK Sbjct: 239 NDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLK 298 Query: 1030 LYENFLDSSEVTKFVSLANEMRAAGHRGEL-SGQTLVTLKRPTKGHGREMIQLGIPTNEG 1206 LYE D SEV+KFVSL N++RAAG RG+L +GQT V KRP KGHGREMIQLG+P + Sbjct: 299 LYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIADA 358 Query: 1207 
HIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWP 1386 +EDE K+R+ E+IPS L + L QVL VKPD C+IDF+NEGDHSQPH WP Sbjct: 359 PLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWP 418 Query: 1387 SWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIP 1566 +WFGRPVC LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSAD AK AIP Sbjct: 419 TWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIP 478 Query: 1567 SLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIM 1746 SLRK IL+TF K +PKKT+ +A + + +R + R+P K YG + Sbjct: 479 SLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAV 538 Query: 1747 PANGIQHAQQAPQ--IMLSPNGVQPRF---AAAPMVASPA---LPSGPPATVGWAAASSM 1902 P G+ A P + PNG+QP F A AP + PA LP+G P GW AA Sbjct: 539 PTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSP---GWPAAPPR 595 Query: 1903 NAXXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTN 2082 + PPGS + QH IS + + + PT N S S+ Sbjct: 596 HPPPRLPVPGTGVFLPPPGSGNSSSPQH-----ISTEATSTSVETAAPTEKENGSGKSST 650 Query: 2083 ITTD 2094 +T + Sbjct: 651 VTKE 654 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 447 bits (1151), Expect = e-123 Identities = 290/700 (41%), Positives = 375/700 (53%), Gaps = 49/700 (7%) Frame = +1 Query: 220 SGNSVVVSVEPMQFPLNGGRG-----SEVP--QQWFVDERDGLISWLRGEFAAANAIIDV 378 SGN VV + MQFP GG G +E+ +QWF DERDG ISWLRGEFAAANAIID Sbjct: 3 SGN--VVISDKMQFPGGGGGGGGGGAAEIHHHRQWFPDERDGFISWLRGEFAAANAIIDS 60 Query: 379 LMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQ 558 L +H+R+ GE GEYD V+GCI QRR+ W++ LHMQQ+F V +V YALQQV RQ + Sbjct: 61 LCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRHLD 120 Query: 559 RHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTA--VSESGTVEQ---MVDKLDHSK 723 K+ ++ G +R G R + ++S S + + SGT+E+ + + D K Sbjct: 121 PVKGAGKEYKRYGVAYRQGQRGETAKDSHNSNFENHSHDANSSGTLEKGERVSEIYDDVK 180 Query: 724 NVNQNNV-------------QMSRAMNCFPVAGKDG-----NSHSLAESCCMKDGSNPAE 849 ++ +V + MN F + G+ N +A K +P Sbjct: 181 
GGDKGDVVGKLEDKDLSAAAEKKEVMN-FVIFGQLEQMLLQNPMQIAVRRVQKTQKDPDV 239 Query: 850 TC-----VIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNV 1014 + + A C N N NQ+ +P FV E DG VNV Sbjct: 240 AFQRLRPMTWMMEARSCNMIMEN---NAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNV 296 Query: 1015 VEGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIP 1194 V+GLKLYE D SEV+KFVSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P Sbjct: 297 VDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQLGVP 356 Query: 1195 TNEGHIEDEHRTLNYK----ERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGD 1362 + +EDE K R+ E+IPS L + L QVL VKPD C+IDF+NEGD Sbjct: 357 IADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVIGQLVGSQVLTVKPDACIIDFYNEGD 416 Query: 1363 HSQPHTWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSA 1542 HSQPH WP+WFGRPVC LFLT+CD+ FGR +G++H GDY GSLKL ++ GSLLVMQGKSA Sbjct: 417 HSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSA 476 Query: 1543 DLAKRAIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPS 1722 D AK AIPSLRK IL+TF K +PKKT +A + + +R + R+P Sbjct: 477 DFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPM 536 Query: 1723 ARKPYGIMPANGIQHAQQAPQ--IMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWA 1887 K YG +P G+ A P + PNG+QP F A AP + PA P + GW Sbjct: 537 GPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPXPLPTGSPGWP 596 Query: 1888 AASSMNAXXXXXXXXXXXXFLPPGSVHIYPIQHL--PGTPISVQTIYDNSRSE-KPTSNS 2058 AA + PPGS + QH+ T SV+T + S+S Sbjct: 597 AAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSVETAAPTEKENGSGKSSS 656 Query: 2059 NASDCSTNITTDLSEPVPECNG--DGWCLDTAAVAPNQEQ 2172 N++ S D ECNG D +D AV ++Q Sbjct: 657 NSNTVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQ 696 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. 
vesca] Length = 682 Score = 443 bits (1139), Expect = e-121 Identities = 277/690 (40%), Positives = 371/690 (53%), Gaps = 26/690 (3%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFP------LNGGRGSEVPQQWFVDERDGLISWLRGEFAAANAI 369 M SGN VV + MQ+P ++GG + P+QWF DERDG ISWLRGEFAAANAI Sbjct: 1 MTMPSGN--VVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAI 58 Query: 370 IDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLK 549 ID L H+R GE EYD V+GC+ QRR WT LHMQQ+F V +V YALQQV RQ + Sbjct: 59 IDSLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQR 118 Query: 550 QRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTAVSESGTV-------EQMVDK 708 + G KD ++S G + R++ V+E + + SG E++ Sbjct: 119 YYEPVKMGNKDYKRSNSGVGFKPRNEPVKEWHTASVEYRSYDGSGLEKVGSEMREEVKPG 178 Query: 709 LDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCG 888 + K ++ + + + +S S A S G++ +E V+ GC Sbjct: 179 GEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSANSQGTISGNSESEDAVVN----EGCT 234 Query: 889 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1068 S + N++ N+ + +IP FV ET DG VNVV+GLKLYE FL +EV+K Sbjct: 235 SSIKENESNSIQIQNEKQNLSLIP--KTFVGNETFDGKTVNVVDGLKLYEEFLGDTEVSK 292 Query: 1069 FVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKER 1248 SL N++R G RG+L GQT V KRP KGHGREMIQLGIP +G EDE K+R Sbjct: 293 LFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEISAGISKDR 352 Query: 1249 KIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTD 1428 ++EAIPS L + D L QVL KPD C+IDFFNEGDHS PH WP WFGRPV LFLT+ Sbjct: 353 RMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHSHPHMWPPWFGRPVSVLFLTE 412 Query: 1429 CDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKY 1608 CD+ FG+ +G +H GDY G+L+L + GSLL++QGKSAD AK AIPS+RK IL+TF K Sbjct: 413 CDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADYAKHAIPSIRKQRILVTFTKS 472 Query: 1609 RPKKTLLPSEGTFFSS--SAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQHAQQAP 1782 +P+K+ P++G S + +P R + R+P+ K Y +P G+ A Sbjct: 473 QPRKS-FPTDGQRLPSPGPSQSPYWSPPPGRSPNHIRHPAGPKHYAAVPTTGVLPAPPNR 531 Query: 1783 
QIMLSPNGVQPRFAAAPMVASPALPSG-----PPATVGWAAASSMNAXXXXXXXXXXXXF 1947 + NG+QP F AAP+ PA+P PP + GW AA F Sbjct: 532 PQLPPANGIQPLFVAAPV--GPAMPFPAPVVIPPGSPGWVAAP--RHPPPRMPLPGTGVF 587 Query: 1948 LPP--GSVHIYPIQHLPGTPISVQTIYDNSRSEKP--TSNSNASDCSTNITTDLSEPVPE 2115 LPP P Q P T + + + +EK T+ S+ + S D+ + Sbjct: 588 LPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSSHAIASPKAKLDVKAQRQD 647 Query: 2116 CNG--DGWCLDTAAVAPNQEQTDSVEKADD 2199 CNG DG V Q+Q + A++ Sbjct: 648 CNGSVDGTGSGRGTVKQEQQQNSNNAAANN 677 >gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 439 bits (1130), Expect = e-120 Identities = 279/705 (39%), Positives = 367/705 (52%), Gaps = 20/705 (2%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGRGSEVP----QQWFVDERDGLISWLRGEFAAANAIID 375 MA SGN VVS + MQFP E+ +QWF DERDG ISWLRGEFAAANA+ID Sbjct: 1 MAMPSGN--VVSSDKMQFPSGTAGAGEISHHNNRQWFPDERDGFISWLRGEFAAANAMID 58 Query: 376 VLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQR 555 L H+R GE GEYD V+ CI RR W LHMQQ+F V +V +ALQQV RQ + Sbjct: 59 SLCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFALQQVAWRRQQRFY 118 Query: 556 QRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPAS----------GTAVSESGTVEQMVD 705 G K+ ++SG G + R+D ++ R S A G A SE G ++ D Sbjct: 119 DPVKMGNKEFKRSGVGFKQWQRNDSFKDGRNSAAESHCLDGNSSFGNAASEKGGSDKSGD 178 Query: 706 KLDHSKNVNQNNVQMSRAMNCFPVAGK-DGNSHSLAESCCMKDGSNPAETCVIELEAASG 882 ++ +S + + ++ ++ N + DGN SL + GS P V + G Sbjct: 179 EVGNSDD--RGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEPEVHAVDD-----G 231 Query: 883 CGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEV 1062 C S + + ++ T Q+ + P F E DG VNVVEGLKLYE F +EV Sbjct: 232 CTSSSKENDSHS--TPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCADTEV 289 Query: 1063 TKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYK 1242 +K V+L N++R+AG RG QT V KRP KGHGRE IQLG+P + +EDE K Sbjct: 290 SKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVEDEISAGTLK 349 Query: 1243 ERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFL 1422 +R+ EAIP L + + L QV VKPD C+IDF+NEGDHSQPH WPSWFGRPVC 
LFL Sbjct: 350 DRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVCVLFL 409 Query: 1423 TDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFG 1602 T+CD+ FGR +H GDY G+LKL + GSLL MQGKSAD AK AIPSLR+ IL+TF Sbjct: 410 TECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLRRQRILVTFT 469 Query: 1603 KYRPKKTLLPSEGTFFSSSAVNPLSI--STSARPSSFSRYPSARKPYGIMPANGIQHAQQ 1776 K +PKK+ +PS+G S V P S +R + R+P K Y +P G+ A Sbjct: 470 KSQPKKS-MPSDGQRMPSPGVAPSSHWGPQPSRSPNHIRHPGP-KHYAPVPTTGVLQASP 527 Query: 1777 APQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXF 1947 + PNG+QP F AP+ + P+ PP++ GW+AA + Sbjct: 528 VRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPVPGTGVFL 587 Query: 1948 LPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTNITTDLSEPVPECNGD 2127 PPGS G Q + N TN T + + P + NG Sbjct: 588 PPPGS---------GGNSSGSQQVLGN---------------DTNHTVETAAPPEKENGS 623 Query: 2128 GWCLDTAAVAPNQEQTDSVEKADDNLSESVAK*AEGEHFVLSLRK 2262 G +P + +K + N S +G V+S+ K Sbjct: 624 GKLNHGMTASPKGKVDSKTQKQECNGS------LDGSGSVISVTK 662 >ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] gi|550333016|gb|ERP57586.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa] Length = 693 Score = 431 bits (1109), Expect = e-118 Identities = 279/698 (39%), Positives = 368/698 (52%), Gaps = 39/698 (5%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 354 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 355 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 534 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 535 LRQLKQRQR-----------HSYGQKDGR----KSGFGHRYGHRSDGVRESRVSPASGTA 669 RQ +Q+Q+ + +G+ GR S G GHR G G A Sbjct: 119 RRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGG------GGGGGDA 172 Query: 670 VSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCM------KD 831 V E V V+ +HS N N + S 
G G S ++ K+ Sbjct: 173 VKEG--VNSSVE--NHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDNHKN 228 Query: 832 GSNPAETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVN 1011 S A+ A ++S ++ ++NQ+ Q + P FVA+E DG MVN Sbjct: 229 SSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDGQMVN 288 Query: 1012 VVEGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGI 1191 VV+GLKLYEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+ Sbjct: 289 VVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGL 348 Query: 1192 PTNEGHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQ 1371 P + EDE+ T KER++E+IP+ L + + QV+ +KPD C+ID +NEGDHSQ Sbjct: 349 PIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQ 408 Query: 1372 PHTWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLA 1551 PH WP WFG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLA Sbjct: 409 PHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLA 468 Query: 1552 KRAIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--STSARPSSFSRYPSA 1725 K AIP ++K +L+TF K +PKK L ++G S AV P S +R + R+P Sbjct: 469 KHAIPMIKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-V 526 Query: 1726 RKPYGIMPANGIQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAAS 1896 K Y +P G+ + PNGVQP F P+ A P+ PP + GW +S Sbjct: 527 PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSS 586 Query: 1897 SM--NAXXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASD 2070 +A PPGS + L T + + + ++ + D Sbjct: 587 PRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENGPGKSNHD 646 Query: 2071 CSTNITTDLSEPVPECNGDGWCLDTAAVAPNQEQTDSV 2184 S + +E + +G D +A +E+ SV Sbjct: 647 TSASPKEKSAEKTQRQDSNG---DVDGIAVKKEEQQSV 681 >ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine max] Length = 681 Score = 431 bits (1108), Expect = e-118 Identities = 276/692 (39%), Positives = 370/692 (53%), Gaps = 30/692 (4%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNG----GRGSEVPQ-----QWFVDERDGLISWLRGEFAAA 360 MA SGN VV + MQFP G G G E+ Q QWFVDERDGLI WLR EFAAA Sbjct: 1 
MAMPSGN--VVIQDKMQFPSGGAGAGGAGGEIHQPHYCQQWFVDERDGLIGWLRSEFAAA 58 Query: 361 NAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLR 540 NAIID L H+RV G+ GEYD V+G I QRR W L MQQ+F V DV +ALQQV R Sbjct: 59 NAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVADVAHALQQVAWRR 118 Query: 541 QLKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASG----------TAVSESGTV 690 Q + G K+ RKSG G+R+G R + V+E S T +E GT Sbjct: 119 QQRPLDPVKVGAKEFRKSGSGYRHGQRFEPVKEGYNSSVESYNQYDANVTVTGGTEKGT- 177 Query: 691 EQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELE 870 +V+K + K+ + + + KD + + S +E E Sbjct: 178 -PVVEKSEEHKSGGKVEKVGDKGL-ASAEDKKDAITKHQTDGSLKSTRSTEGSLSNLESE 235 Query: 871 AASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLD 1050 A +++ GD++ NQ ++Q + F+ E DG MVNVV+GLKLYE+ D Sbjct: 236 AVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFD 295 Query: 1051 SSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHR 1227 S+E+ VSL N++R +G +G+L G Q + +RP KGHGREMIQLG+P + E E+ Sbjct: 296 STEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIADAPAEGENM 355 Query: 1228 TLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPV 1407 T K+ +E IPS I + + QV+ VKPD C++DF+NEGDHSQPH+WPSW+GRPV Sbjct: 356 TGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPV 415 Query: 1408 CNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLI 1587 LFLT+C++ FGR + S H GDY G +KL ++ GSLLVM+GKS+D AK A+PS+RK I Sbjct: 416 YILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKSSDFAKHALPSVRKQRI 475 Query: 1588 LLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQH 1767 L+TF K +P+K+ L S+ +S+A + +R + R+ K Y +P G+ Sbjct: 476 LVTFTKSQPRKS-LSSDAQRLASTATSSHWGPLPSRSPNHVRHHVGSKHYATLPTTGVLP 534 Query: 1768 AQQAPQIMLSPNGVQPRFAAAPMVAS---PALPSGPPATVGWAAASSMNAXXXXXXXXXX 1938 + M +P G+QP F AP+V PA + PP + GW A Sbjct: 535 SPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTGWTGAPPPRHPPPRVPAPGT 594 Query: 1939 XXFL-PPGSVHIYPIQHLP-GTPISVQTIYDNSRSEKPT----SNSNASDCSTNITTDLS 2100 FL PPGS + Q LP GT V N +E PT N + ST+ + Sbjct: 595 
GVFLPPPGSGN--SSQQLPAGTLAEV-----NPSTETPTMLEKENGKTNHNSTSASPKGK 647 Query: 2101 EPVPECNGDGWCLDTAAVAPNQE-QTDSVEKA 2193 ECNG D V P E + DS +KA Sbjct: 648 VQKQECNGH--AADGTQVEPALETRQDSNDKA 677 >gb|ABK95394.1| unknown [Populus trichocarpa] Length = 694 Score = 431 bits (1107), Expect = e-118 Identities = 275/691 (39%), Positives = 368/691 (53%), Gaps = 32/691 (4%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 354 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 355 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 534 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 535 LRQLKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTAVSESG------TVEQ 696 RQ +Q+Q+ Q + F Y H G R+ + S ++G G V++ Sbjct: 119 RRQQQQQQQQQQQQNHHHQQRF--YYDHGKVGGRDFKRSSSAGFNRGHRGGGGGGDAVKE 176 Query: 697 MVDKL--DHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCM------KDGSNPAET 852 V+ +HS N N + S G G S ++ K+ S A+ Sbjct: 177 GVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDNHKNSSGNAQG 236 Query: 853 CVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKL 1032 A ++S ++ ++NQ+ Q + P FVA+E DG MVNVV+GLKL Sbjct: 237 TFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAEEKIDGQMVNVVDGLKL 296 Query: 1033 YENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHI 1212 YEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+P + Sbjct: 297 YENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGLPIADAPA 356 Query: 1213 EDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSW 1392 EDE+ T KER++E+IP+ L + + QV+ +KPD C+ID +NEGDHSQPH WP W Sbjct: 357 EDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQPHMWPPW 416 Query: 1393 FGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSL 1572 FG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLAK AIP + Sbjct: 417 FGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLAKHAIPMI 476 Query: 1573 
RKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--STSARPSSFSRYPSARKPYGIM 1746 +K +L+TF K +PKK L ++G S AV P S +R + R+P K Y + Sbjct: 477 KKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-VPKHYAAI 534 Query: 1747 PANGIQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSM--NAX 1911 P G+ + PNGVQP F P+ A P+ PP + GW +S +A Sbjct: 535 PTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSSPRHPSAR 594 Query: 1912 XXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTNITT 2091 PPGS + L T + + + ++ + D S + Sbjct: 595 LPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENGPGKSNHDTSASPKE 654 Query: 2092 DLSEPVPECNGDGWCLDTAAVAPNQEQTDSV 2184 +E + +G D +A +E+ SV Sbjct: 655 KSAEKTQRQDSNG---DVDGIAVKKEEQQSV 682 >ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max] Length = 683 Score = 430 bits (1106), Expect = e-117 Identities = 277/696 (39%), Positives = 371/696 (53%), Gaps = 34/696 (4%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN-------GGRGSEVPQ------QWFVDERDGLISWLRGE 348 MA SGN VV + MQFP GG G E+ Q QWFVDERDGLI WLR E Sbjct: 1 MAMPSGN--VVIQDKMQFPSGAGGGGGGGGAGGEIHQPHHYRPQWFVDERDGLIGWLRSE 58 Query: 349 FAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQV 528 FAAANAIID L H+RV G+ GEYD VVG I QRR W L MQQ+F V DV YALQQV Sbjct: 59 FAAANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAYALQQV 118 Query: 529 MDLRQLKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASG---------TAVSES 681 RQ + G K+ RKSG G+R+G R + V+E S T +E Sbjct: 119 AWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFESVKEGYNSSVESYSHDANVAVTGGTEK 178 Query: 682 GTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVI 861 GT +V+K + K+ + + + KD ++ +E S + Sbjct: 179 GT--PVVEKSEEHKSGGKVEKVGDKGLASVEEK-KDAITNHQSEGSLKSARSTEGSLSNL 235 Query: 862 ELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYEN 1041 E EA G +++ G++ NQ ++Q + F+ E DG VNVV+GLKLY++ Sbjct: 236 ESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDGLKLYDD 295 Query: 1042 FLDSSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIED 1218 DS+EV VSL N++R +G +G+L G Q + +RP KGHGREMIQLG+ + E 
Sbjct: 296 LFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIADAPAEG 355 Query: 1219 EHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFG 1398 E+ T K+ +E+IPS I + + QV+ VKPD C++DF+NEGDHSQPH+WPSW+G Sbjct: 356 ENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWPSWYG 415 Query: 1399 RPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRK 1578 RPV LFLT+C++ FGR + S H GDY GS+KL ++ GSLLVMQGKS+D AK A+PS RK Sbjct: 416 RPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHALPSTRK 475 Query: 1579 TLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANG 1758 IL+TF K +P+K+ L S+ +S+ + +R + R+ K Y +P G Sbjct: 476 QRILVTFTKSQPRKS-LSSDAQQLASAVASSHWGPPPSRSPNHVRHHVGPKHYATLPTTG 534 Query: 1759 IQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSGPPATV-----GWAAASSMNAXXXXX 1923 + A M +P G+QP F AAP+V P +P P + GW AA Sbjct: 535 VLPAPPIRPQMAAPVGMQPLFVAAPVV--PPMPFSAPVPIPAGSTGWTAAPPPRHPPPRV 592 Query: 1924 XXXXXXXFLPP-GSVHIYPIQHLPGTPISVQTIYDNSRSEKPT----SNSNASDCSTNIT 2088 FLPP GS + Q LP + ++ N +E PT N + ST+ + Sbjct: 593 PAPGTGVFLPPSGSGN--SSQQLPASTLAEV----NPSTETPTMPEKENGKINHNSTSAS 646 Query: 2089 TDLSEPVPECNGDGWCLDTAAVAPNQE-QTDSVEKA 2193 ECNG D V P E + DS +KA Sbjct: 647 PKGKVQKQECNGHA---DGTQVEPALETRLDSNDKA 679 >ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus] gi|449481289|ref|XP_004156139.1| PREDICTED: uncharacterized LOC101210274 [Cucumis sativus] Length = 684 Score = 430 bits (1105), Expect = e-117 Identities = 278/692 (40%), Positives = 371/692 (53%), Gaps = 32/692 (4%) Frame = +1 Query: 220 SGNSVVVSVEPMQFPLNGGRGSEV----PQQWFVDERDGLISWLRGEFAAANAIIDVLMD 387 SG V VS G G E+ P+ WF DERDG ISWLRGEFAA+NAIID L Sbjct: 18 SGGGVAVS----------GGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAIIDALCH 67 Query: 388 HIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQRHS 567 H+R GE GEYD V+GCI QRR WT LHMQQ+F V +V YALQQV RQ + Sbjct: 68 HLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVK 127 Query: 568 YGQKDGRKSG--FGHRYGHRSDG-VRESRVSPASGTAVSESGT------VEQMVDKLDHS 720 G K R+ 
G F + GHR++ V+E ++ A S T VEQ+ + D S Sbjct: 128 VGPKLYRRPGPGFKQQQGHRAEATVKEETITCAESCNGGNSSTFVSSRKVEQVSNTCDES 187 Query: 721 KNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVI----ELEAASGCG 888 K ++ ++S + V KD +H +S C + E I ++E GC Sbjct: 188 KASGEDE-KLSEKDSGSAVDNKD--THGKDQSNCKTKSAENLEDNAINKDSQVEPDDGCS 244 Query: 889 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1068 S S+ +Q+ Q P FVA E DG MVNV++GLKL+E LD +EV+K Sbjct: 245 S--SHRDKELQSVQSQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSK 302 Query: 1069 FVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKER 1248 +SL N++RA+G RG+ GQT V KRP KGHGREMIQLG P + ED++ K+R Sbjct: 303 LLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDR 362 Query: 1249 KIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTD 1428 +IE IPS L + D L QV+ VKPD C+IDF+NEGDHSQPH WPSWFGRPV L LT+ Sbjct: 363 RIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTE 422 Query: 1429 CDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKY 1608 C++ FGR +G++H G+Y G++KL + G+LLV+QGKSAD AK A+P++RK IL+T K Sbjct: 423 CEITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKS 482 Query: 1609 RPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFS---RYPSARKPYGIMPANGIQHAQQA 1779 +PK+ P++G +++N + S PS+ S R +KPY +P+ G+ Sbjct: 483 QPKRA-APADG---QRTSLNVGTFSGWGPPSARSPNPRLSPGQKPYPTVPSTGVLPVPPI 538 Query: 1780 PQIMLSPNGVQPRF---AAAPMVASPA-LPSGPPATVGWAAASSMNAXXXXXXXXXXXXF 1947 M PNG+ P A+PM +P +P+GP A W A + + Sbjct: 539 RPQMAPPNGIPPLIVPPVASPMPFTPVPIPTGPSA---WPTAHTRHPPPRLPVPGTGVFL 595 Query: 1948 LPPGSVHI---YPIQHLPGTPISVQTIYDNSRSEKP---TSNSNASDCSTNITTDLSEPV 2109 PPGS P Q LP + I S SEK T + ++S D Sbjct: 596 PPPGSSSAPTPSPQQQLP-----ISNIETGSLSEKENGLTKSDHSSGTFPGEKPDAKAQR 650 Query: 2110 PECNG--DGWCLDTAAVAPNQEQTDSVEKADD 2199 ECNG DG D Q+Q + + A + Sbjct: 651 QECNGSIDGSGNDKVKEEEQQQQQEEEQSAQN 682 >gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris] Length = 671 Score = 427 bits (1099), Expect = e-117 Identities = 272/664 (40%), Positives = 359/664 (54%), Gaps 
= 25/664 (3%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGRGSEVP--------QQWFVDERDGLISWLRGEFAAAN 363 MA SGN VV + MQFP NGG G+ V QQWFVDERDGLI WLR EFAAAN Sbjct: 1 MAMPSGN--VVIQDKMQFP-NGGGGAGVGEIQQHHYRQQWFVDERDGLIGWLRSEFAAAN 57 Query: 364 AIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQ 543 AIID L H+RV G+ GEYD V+G I QRR W L MQQ+F V DV Y LQQV +Q Sbjct: 58 AIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVTYTLQQVAWRKQ 117 Query: 544 LKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRE---------SRVSPASGTAVSESGTVEQ 696 + G K+ RK G G+RYGHR + +E S A+ T E GT Sbjct: 118 QRPLDPVKVGAKEVRKPGPGYRYGHRFEPSKEGYNSSVESYSHDGNATFTRGMEKGT--P 175 Query: 697 MVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAA 876 VDK + K+ ++ + + P KD + GS+ +E EA Sbjct: 176 TVDKSEEHKSGSKVEKVGDKGL-ASPEEKKDAIIKHQTDGNLKSTGSSEGYLSNLESEAV 234 Query: 877 SGCGSQASNS-GDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDS 1053 SNS G+++ +Q ++Q F+ E DG MVN+ +GLKLYE+ DS Sbjct: 235 VVNDEFISNSKGNDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDS 294 Query: 1054 SEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRT 1230 +EV+ VSL N++R +G +G+L G Q V +RP KGHGREMIQLG+P + +E E+ T Sbjct: 295 TEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIADAPVEGENMT 354 Query: 1231 LNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVC 1410 K +E IPS I + + QV+ KPD C++DF+NEGDHSQPH+WPSWFGRPV Sbjct: 355 GASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVY 414 Query: 1411 NLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLIL 1590 LFLT+C++ FGR + S H GDY GSLKL ++ GSLL MQGKS D AK A+PS+RK IL Sbjct: 415 TLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHALPSIRKQRIL 474 Query: 1591 LTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQHA 1770 +TF K +PKK+ +PS+ A + +R + R+ K Y +P G+ A Sbjct: 475 VTFTKSQPKKS-VPSDAQRLYLPAASSQWGPPPSRSPNHVRHSVGSKHYAALPTTGVLPA 533 Query: 1771 QQAPQIMLSPNGVQPRFAAAPMVAS---PALPSGPPATVGWAAASSMNAXXXXXXXXXXX 1941 + + G+QP F AAP+V PA S PP + GW A Sbjct: 534 
PPIRPQIPAQVGMQPLFVAAPVVPPMPYPAPVSIPPGSAGWTTAPPPRHPPPRIPAPGTG 593 Query: 1942 XFL-PPGSVHIYPIQHLP-GTPISVQTIYDNSRSEKPTSNSNASDCSTNITTDLSE-PVP 2112 FL PPGS + Q LP GT V + + + N ++D +++ T+ + Sbjct: 594 VFLPPPGSGNSQ--QQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNSSSTSPKGKVQKQ 651 Query: 2113 ECNG 2124 ECNG Sbjct: 652 ECNG 655 >gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris] Length = 691 Score = 417 bits (1072), Expect = e-113 Identities = 275/674 (40%), Positives = 361/674 (53%), Gaps = 34/674 (5%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGR---GSEVP---QQWFVDERDGLISWLRGEFAAANAI 369 MA SGN + E +QFP+ GG G E+ QQWFVDERDG I WLR EFAAANAI Sbjct: 1 MAMPSGNGGMP--EKLQFPVGGGAASGGGEIQYRHQQWFVDERDGFIGWLRSEFAAANAI 58 Query: 370 IDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLK 549 ID L H+RV GE G YD VVG I QRR WT L MQQ+F V++V YALQQV RQ + Sbjct: 59 IDSLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVAWRRQQR 118 Query: 550 QRQRHSYGQKDGRKSGFGHRYG-HRSDGVRE----SRVSPASG-------------TAVS 675 G K+ RK G G R G HR++ +E SR A AV Sbjct: 119 FVDPAKAGSKEFRKFGSGFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVESFGREMNAVV 178 Query: 676 ESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCF--PVAGKDGNSHSLAESCCMKDGSNPAE 849 +G VE+ +D + +N + N P KD ++ + G+ Sbjct: 179 VTGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGILNGSGNFQGS 238 Query: 850 TCVIELEAASGCGSQASNS-GDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1026 E EA SNS G+++ NQ ++Q F+ E +G MVNVV+GL Sbjct: 239 LSSSECEAVGENEECTSNSKGNDSHSVQNQHQSQNASTIGKTFIGNEMFEGKMVNVVDGL 298 Query: 1027 KLYENFLDSSEVTKFVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNE 1203 KLYE+ +DS+EV+K VSL N+MR AG RG+ G QT V KRP KG GREMIQLG+P + Sbjct: 299 KLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQGSQTFVVSKRPIKGRGREMIQLGVPIAD 358 Query: 1204 GHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTW 1383 + ++ T K++K+E+IPS I + L QV+ VKPD C++DFFNEGDHSQP++ Sbjct: 359 APPDVDNVTGLSKDKKVESIPSLFEDIIERLAASQVMTVKPDACIVDFFNEGDHSQPNSC 418 Query: 1384 PSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAI 1563 P WFGRPV LFLT+CD+ 
FGR + S+H GDY G++KL ++ GSLLVMQGKS DLAK A+ Sbjct: 419 PPWFGRPVYMLFLTECDITFGRTIVSDHPGDYRGAVKLSLVPGSLLVMQGKSTDLAKHAL 478 Query: 1564 PSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGI 1743 PS+ K IL+TF K +P KT LP++ S AV R + R+ K Y Sbjct: 479 PSIHKQRILVTFTKSQP-KTSLPNDSQRL-SPAVTSHWAPPQGRTPNHMRHQLGPKHYPT 536 Query: 1744 MPANGIQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSGPP---ATVGWAAASSMNAXX 1914 +PA G+ AP I PNG+Q F P+ + S P + GWA+A + Sbjct: 537 IPATGV---LPAPSIRAPPNGMQTLFVPTPVAPPISFASPVPIPLGSTGWASAPQRH-PP 592 Query: 1915 XXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTNITTD 2094 FLPP QHLPG V + + + K + SN + +++ Sbjct: 593 PRMPVPGTGVFLPPPGSGTTSSQHLPGVVSEVNLSGETTSTGKESLKSNHNTINSSPKGK 652 Query: 2095 LSEPV---PECNGD 2127 + V ECNG+ Sbjct: 653 VDGNVVGRQECNGN 666 >ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis] gi|223533099|gb|EEF34858.1| conserved hypothetical protein [Ricinus communis] Length = 697 Score = 417 bits (1071), Expect = e-113 Identities = 270/680 (39%), Positives = 353/680 (51%), Gaps = 43/680 (6%) Frame = +1 Query: 214 ALSGNSVVVSVEPMQFPL---------NGGRGSEVPQQ-------WF-VDERDGLISWLR 342 A+ +VV+S + +QFP NGG G+E+ QQ WF VDERDG ISWLR Sbjct: 2 AMPPGNVVIS-DKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60 Query: 343 GEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQ 522 GEFAAANAIID L H+R GE GEYD V+GCI QRR W LHMQQ+F V +V ALQ Sbjct: 61 GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120 Query: 523 QVMDLRQLK-----QRQRHSY-------GQKD-GRKSGFGHRYGHRSDG--VRESRVSPA 657 QV +Q + Q Q+H Y G KD R S G GHR G V+E Sbjct: 121 QVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVNYGAE 180 Query: 658 SGTAVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGS 837 S + E+ + + N ++ A + KD S ++ S Sbjct: 181 SHGLDGNTSGNEKFNEIKSGGDSGRLENKSLATAED-----KKDAASKPHVDNLKSSGNS 235 Query: 838 NPAETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVV 1017 + + +E EA + Q+S ++ NQ + P FV E DG VNVV Sbjct: 236 
EGSLSGNLETEAEA-VHEQSSPKEHDSHFIQNQIVKLNLTTTPKTFVGAEMVDGKSVNVV 294 Query: 1018 EGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPT 1197 +GLKLYE LD EV+K VSL N++RAAG +G+ GQ V KRP KGHGREMIQLG+P Sbjct: 295 DGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPMKGHGREMIQLGLPI 354 Query: 1198 NEGHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPH 1377 + E+E+ K+RKIE+IP+ L + + Q++ +KPD C+ID +NEGDHSQPH Sbjct: 355 ADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSCIIDIYNEGDHSQPH 414 Query: 1378 TWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKR 1557 WP WFG+P+ LFLT+CD+ FGR + ++H GDY GSLKLP+ GSLLVMQGK+ D AK Sbjct: 415 MWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGSLLVMQGKATDFAKH 474 Query: 1558 AIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--STSARPSSFSRYPSARK 1731 AIP++RK +LLTF K +PKK + S+G +S A +P S +R + R+P K Sbjct: 475 AIPAIRKQRVLLTFTKSQPKK-FVQSDGQRLTSPAASPSSHWGPPPSRSPNHIRHP-VSK 532 Query: 1732 PYGIMPANGIQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSM 1902 Y +P G+ A + PNGVQP F AP+ A P+ PP + GW AA Sbjct: 533 HYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAAPMPFPAPVPMPPVSTGWPAAPRH 592 Query: 1903 NAXXXXXXXXXXXXFLPPGSVHIYPIQHLPGT-----PISVQTIYDNSRSEKPTSNSNAS 2067 FLPP +P P ++ D E SN Sbjct: 593 PPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINFPAETASLQD---KENGLGKSNHG 649 Query: 2068 DC-STNITTDLSEPVPECNG 2124 C S + +CNG Sbjct: 650 TCASPKEKLEAKSQKQDCNG 669 >ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550333015|gb|EEE88914.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 675 Score = 416 bits (1069), Expect = e-113 Identities = 275/692 (39%), Positives = 362/692 (52%), Gaps = 33/692 (4%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN----GGRGSEVPQ------QWF-VDERDGLISWLRGEFA 354 MA GN VV + +QFP GG G+E+ Q QWF VDERDG ISWLRGEFA Sbjct: 1 MAMPPGN--VVIPDKVQFPAGAAGGGGGGNEIHQHQLQRHQWFPVDERDGFISWLRGEFA 58 Query: 355 AANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMD 534 AANAIID L H+R GE GEYD VVGCI QRR W + LHMQQ+F V +V ALQQV+ Sbjct: 59 
AANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGEVIVALQQVVL 118 Query: 535 LRQLKQRQR-----------HSYGQKDGR----KSGFGHRYGHRSDGVRESRVSPASGTA 669 RQ +Q+Q+ + +G+ GR S G GHR G G A Sbjct: 119 RRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGG------GGGGGDA 172 Query: 670 VSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAE 849 V E V V+ +HS N N + S G G S K ++ + Sbjct: 173 VKEG--VNSSVE--NHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKADATAKSHTDNHK 228 Query: 850 TCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLK 1029 S +Q + SG++ + + + Q + P FVA+E DG MVNVV+GLK Sbjct: 229 N--------SSGNAQGTFSGNSEAVANEK---QNLAITPKTFVAEEKIDGQMVNVVDGLK 277 Query: 1030 LYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGH 1209 LYEN LD EV+K VSL NE+RA G RG+ GQT + KRP KGHGREMIQLG+P + Sbjct: 278 LYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQLGLPIADAP 337 Query: 1210 IEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPS 1389 EDE+ T K +E+IP+ L + + QV+ +KPD C+ID +NEGDHSQPH WP Sbjct: 338 AEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQPHMWPP 396 Query: 1390 WFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPS 1569 WFG+PV LFLT+C++ FG+ + + H GDY GSLKL V GSLLVMQGKS+DLAK AIP Sbjct: 397 WFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLAKHAIPM 456 Query: 1570 LRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPLSI--STSARPSSFSRYPSARKPYGI 1743 ++K +L+TF K +PKK L ++G S AV P S +R + R+P K Y Sbjct: 457 IKKQRMLVTFTKSQPKK-LTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP-VPKHYAA 514 Query: 1744 MPANGIQHAQQAPQIMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSM--NA 1908 +P G+ + PNGVQP F P+ A P+ PP + GW +S +A Sbjct: 515 IPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAPVPIPPVSTGWPTSSPRHPSA 574 Query: 1909 XXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTNIT 2088 PPGS + L T + + + ++ + D S + Sbjct: 575 RLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENGPGKSNHDTSASPK 634 Query: 2089 TDLSEPVPECNGDGWCLDTAAVAPNQEQTDSV 2184 +E + +G D +A +E+ SV Sbjct: 635 EKSAEKTQRQDSNG---DVDGIAVKKEEQQSV 663 >gb|EOY01300.1| Hydroxyproline-rich glycoprotein 
family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 414 bits (1064), Expect = e-113 Identities = 278/699 (39%), Positives = 364/699 (52%), Gaps = 38/699 (5%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN-----------------GGRGSEVPQ----QWFVDERDG 324 MA SGN VV + MQFP GG G E+ Q QW DERDG Sbjct: 1 MAMPSGN--VVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDG 58 Query: 325 LISWLRGEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTD 504 I WLRGEFAA+NAIID L H+R GE+GEY+ V+ CI QRR W LHMQQ+F V + Sbjct: 59 FIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAE 118 Query: 505 VGYALQQVMDLRQLKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASG------T 666 V YALQQV R+ + + G K+ ++SG G + G R + +E + S T Sbjct: 119 VSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVAKEGQNSGVDSDGNSTVT 177 Query: 667 AVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPA 846 AVSE E+ +K + K+ + ++ + F KD S K + A Sbjct: 178 AVSERN--ERGSEKREEVKSCGEVG-KVEDKCSTFTEDKKDTGS---------KPHAGDA 225 Query: 847 ETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1026 E+ ++ GC S S ++ NQ+ Q + P FV E DG MVNVV+GL Sbjct: 226 ESVTEDVNG--GCTS--SYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 281 Query: 1027 KLYENFLDSSEVTKFVSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEG 1206 KLYE D EV VSL N++RAAG RG+L GQT V KRP KGHGREMIQLG+P + Sbjct: 282 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADA 341 Query: 1207 HIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWP 1386 ++DE+ K+R+IE IP L + L QV+ VKPD C+ID +NEGDHSQP WP Sbjct: 342 PLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMWP 401 Query: 1387 SWFGRPVCNLFLTDCDVIFGRAV-GSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAI 1563 WFG+PVC +FLT+CD+ FGR V ++H GDY GSLKL + GSLLVMQGKSAD AK A+ Sbjct: 402 PWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHAL 461 Query: 1564 PSLRKTLILLTFGKY-RPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYG 1740 PS+RK IL+TF KY +PKK+ 
++ S + + +R + R+ + K Y Sbjct: 462 PSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHYA 521 Query: 1741 IMPANGIQHAQQ-APQIMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAASSMNA 1908 ++P G+ A PQI S +GVQP F A AP ++ PA PP + GW AA Sbjct: 522 VIPTTGVLPAPPIRPQIPPS-SGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAP--RH 578 Query: 1909 XXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNS---NASDCST 2079 FLPP Q L T + + + + + + S N S Sbjct: 579 PPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSP 638 Query: 2080 NITTDLSEPVPECNG--DGWCLDTAAVAPNQEQTDSVEK 2190 D P +CNG DG A + Q D+ K Sbjct: 639 RGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSVK 677 >ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum lycopersicum] Length = 641 Score = 411 bits (1056), Expect = e-112 Identities = 272/681 (39%), Positives = 371/681 (54%), Gaps = 30/681 (4%) Frame = +1 Query: 220 SGNSVV-VSVE-PMQFPLNGGRGSEVP--------QQWF----VDERDGLISWLRGEFAA 357 SGN+ V V+V P + NGG G V QQWF VDERDG ISWLRGEFAA Sbjct: 3 SGNAAVAVAVAVPEKKHSNGGGGEAVAVPRQHQHQQQWFHPQQVDERDGFISWLRGEFAA 62 Query: 358 ANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDL 537 +NAIID L H+R+ GE GEYD V+GC+ QRR W + LHMQQ+ V +V Y+L QV + Sbjct: 63 SNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLHQVEWM 122 Query: 538 RQLKQRQR--HSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASGTAVSESGTVEQMVDKL 711 +Q K + G+++G K G G G +S+G+++ + S ++ ++ + V+K+ Sbjct: 123 KQQKGFDGGVNKVGKRNGSKGGGGG--GWKSEGLKDGKESQGQNFSL-DAHSKTNGVEKI 179 Query: 712 DHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCGS 891 D + + ++ A K + S+ S C + G + E V + + S Sbjct: 180 DVVEEKQGDKKEL---------AAKPEANSSVKGSVCTEAGDSQGE--VDKTDDKRDSNS 228 Query: 892 QASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKF 1071 + S++ ++ S Q T+K P FVA E DG VNVV+G+KLYE L SSEV+K Sbjct: 229 EGSSNVESE-SHSFQIPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKL 287 Query: 1072 VSLANEMRAAGHRGELSGQTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERK 1251 V+L N++RAAG RG+L Q + KRP KGHGREM+QLG+P + E+E YK+RK Sbjct: 288 
VTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEESAISTYKDRK 347 Query: 1252 IEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDC 1431 EAIP L + D L Q L VKPD C+ID FNEGDHSQPH WP W+GRP+ LFLTDC Sbjct: 348 TEAIPGLLQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISTLFLTDC 407 Query: 1432 DVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYR 1611 ++ FG+ +G +H GDY GSLKL + GS+LVMQG+S + AK AIPS+RK +L+TF K + Sbjct: 408 EMTFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSIRKQRMLVTFTKLQ 467 Query: 1612 PKKTLLPSEGTFFSSSAVNPLS-ISTSARPSSFSRYPSARKPYGIMPANGIQHAQQAPQI 1788 ++ + + F SSA P+S +R S+ R P K YG MPA G+ Sbjct: 468 LRR-IKSGDSQRFPSSAGGPVSQWVPPSRSSNHIRRPFGPKHYGSMPATGV--------- 517 Query: 1789 MLSPNGVQPRFAAA-------PMVASPALP-----SGPPATVGWAAASSMNAXXXXXXXX 1932 L GV+P+FA A P +PA+P + PPA+ GW A + Sbjct: 518 -LPIPGVRPQFAPANMQPIFVPATVAPAMPFPAPVALPPASAGW-AVPPIRHPPPRLPLP 575 Query: 1933 XXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASDCSTNITTDLSE-PV 2109 FLPPGS GT + DN +E T + S S + +D SE Sbjct: 576 GTGVFLPPGS----------GTSST-----DNIPAEN-TGPLSDSTVSQKVNSDSSEVQT 619 Query: 2110 PECNGDGWCLDTAAVAPNQEQ 2172 +CNG D +EQ Sbjct: 620 QDCNGKADVSDAEKAVACEEQ 640 >gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 410 bits (1053), Expect = e-111 Identities = 278/700 (39%), Positives = 365/700 (52%), Gaps = 39/700 (5%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLN-----------------GGRGSEVPQ----QWFVDERDG 324 MA SGN VV + MQFP GG G E+ Q QW DERDG Sbjct: 1 MAMPSGN--VVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDG 58 Query: 325 LISWLRGEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTD 504 I WLRGEFAA+NAIID L H+R GE+GEY+ V+ CI QRR W LHMQQ+F V + Sbjct: 59 FIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAE 118 Query: 505 VGYALQQVMDLRQLKQRQRHSYGQKDGRKSGFGHRYGHRSDGVRESRVSPASG------T 666 V YALQQV R+ + + G K+ ++SG G + G R + 
+E + S T Sbjct: 119 VSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVAKEGQNSGVDSDGNSTVT 177 Query: 667 AVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPA 846 AVSE E+ +K + K+ + ++ + F KD S K + A Sbjct: 178 AVSERN--ERGSEKREEVKSCGEVG-KVEDKCSTFTEDKKDTGS---------KPHAGDA 225 Query: 847 ETCVIELEAASGCGSQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGL 1026 E+ ++ GC S S ++ NQ+ Q + P FV E DG MVNVV+GL Sbjct: 226 ESVTEDVNG--GCTS--SYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGL 281 Query: 1027 KLYENFLDSSEVTKFVSLANEMRAAGHRGEL-SGQTLVTLKRPTKGHGREMIQLGIPTNE 1203 KLYE D EV VSL N++RAAG RG+L +GQT V KRP KGHGREMIQLG+P + Sbjct: 282 KLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIAD 341 Query: 1204 GHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTW 1383 ++DE+ K+R+IE IP L + L QV+ VKPD C+ID +NEGDHSQP W Sbjct: 342 APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 401 Query: 1384 PSWFGRPVCNLFLTDCDVIFGRAV-GSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRA 1560 P WFG+PVC +FLT+CD+ FGR V ++H GDY GSLKL + GSLLVMQGKSAD AK A Sbjct: 402 PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 461 Query: 1561 IPSLRKTLILLTFGKY-RPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPY 1737 +PS+RK IL+TF KY +PKK+ ++ S + + +R + R+ + K Y Sbjct: 462 LPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHY 521 Query: 1738 GIMPANGIQHAQQ-APQIMLSPNGVQPRF---AAAPMVASPALPSGPPATVGWAAASSMN 1905 ++P G+ A PQI S +GVQP F A AP ++ PA PP + GW AA Sbjct: 522 AVIPTTGVLPAPPIRPQIPPS-SGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAAP--R 578 Query: 1906 AXXXXXXXXXXXXFLPPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNS---NASDCS 2076 FLPP Q L T + + + + + + S N S Sbjct: 579 HPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTS 638 Query: 2077 TNITTDLSEPVPECNG--DGWCLDTAAVAPNQEQTDSVEK 2190 D P +CNG DG A + Q D+ K Sbjct: 639 PRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSVK 678 >ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max] Length = 664 Score = 409 bits (1052), Expect = e-111 Identities = 273/683 (39%), Positives = 363/683 
(53%), Gaps = 25/683 (3%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGR---GSEVP--QQWFVDERDGLISWLRGEFAAANAII 372 MA SGN+V+ E +QFP GG GSE+ QQWFVDERDG I WLR EFAAANAII Sbjct: 1 MAMPSGNAVMP--EKLQFPGGGGAPGGGSEIHFRQQWFVDERDGFIGWLRSEFAAANAII 58 Query: 373 DVLMDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQ 552 D L H+R GE GEY+ VVG I QRR WT L MQQ+F V++V YALQQV RQ + Sbjct: 59 DSLCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQQVSWRRQQRV 118 Query: 553 RQRHSYGQKDGRKSGFG-----HRYGHRSDGVRESRVSPASGT-AVSESGTVEQMVDKLD 714 G K+ RK G G HR+ DG S S GT AV +G VE+ + Sbjct: 119 VDPAKTGAKEFRKFGLGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAGGVEKGACVTE 178 Query: 715 HSKNVNQNNV--QMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCG 888 + + + M P KD ++ ++ ++ E EA Sbjct: 179 KNGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGILKGSRNSQGSLSSSECEAVGVNE 238 Query: 889 SQASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTK 1068 SNS +N + F+ E DG MVNVV+GLKLYE+ LDS+EV+K Sbjct: 239 ECVSNSKENDSIMGKF------------FIGNEMFDGKMVNVVDGLKLYEDLLDSTEVSK 286 Query: 1069 FVSLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKE 1245 VSL N++R AG RG+ G QT V KRP KGHGREMIQLG+P + + ++ T K+ Sbjct: 287 LVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTGISKD 346 Query: 1246 RKIEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLT 1425 +K+E+IPS I + L QV+ VKPD C++DFFNEG+HS P+ WP WFGRPV LFLT Sbjct: 347 KKVESIPSLFQDIIERLAASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPVYTLFLT 406 Query: 1426 DCDVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGK 1605 +CD+ FGR + S+H G++ G+++L ++ GSLLVMQGKS D AK A+PS+ K I++TF K Sbjct: 407 ECDMTFGRIIVSDHPGEFRGAVRLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIITFTK 466 Query: 1606 YRPKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQHAQQAPQ 1785 +PK + LP++ + A + + S P+ R+ K Y +PA + AP Sbjct: 467 SQPKCS-LPNDSQRLAPPAASHWAPPQSRSPNHV-RHQLGPKHYPTVPATVV---LPAPS 521 Query: 1786 IMLSPNGVQPRFAAAPMVASPALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXFLPP 1956 I PN +QP F AP+ + P+ PP + GW +A S + PP Sbjct: 522 
IHAPPNSMQPLFVPAPVAPPMSFPTPVPIPPGSTGWTSAPSRHPPPRIPVPGTGVFLPPP 581 Query: 1957 GSVHIYPIQHLPGT----PISVQTI----YDNSRSEKPTSNSNASDCSTNITTDLSEPVP 2112 GS QHLP T SV+T+ +N +S T++S NI Sbjct: 582 GSG--TSSQHLPCTVPEVNPSVETLTVSGKENGKSNHNTNSSPKGKMDGNIQGGQES--- 636 Query: 2113 ECNGDGWCLDTAAVAPNQEQTDS 2181 N DG + A V QE D+ Sbjct: 637 NGNADGTQAEQAVVEKEQESNDT 659 >ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max] Length = 626 Score = 399 bits (1026), Expect = e-108 Identities = 264/640 (41%), Positives = 356/640 (55%), Gaps = 19/640 (2%) Frame = +1 Query: 208 MAALSGNSVVVSVEPMQFPLNGGRGSEVP--QQWFVDERDGLISWLRGEFAAANAIIDVL 381 MA SGN+V+ E +QFP GG GSE+ QQWFVDERDG I WLR EFAAANAIID L Sbjct: 1 MAMPSGNAVMP--EKLQFP-GGGGGSEIHYRQQWFVDERDGFIGWLRSEFAAANAIIDSL 57 Query: 382 MDHIRVTGELGEYDHVVGCIHQRRFYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQR 561 H+R GE GEYD VVG I QRR WT L MQQ+F V++V ALQQV RQ + Sbjct: 58 CHHLRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCALQQVSWRRQQRVVDL 117 Query: 562 HSYGQKDGRKSGFGHRYG-HR----SDGVRESRVSPASGT-AVSESGTVEQ---MVDKLD 714 G K+ RK G G R G HR DG S S GT AV +G VE+ + +K Sbjct: 118 AKTGAKEFRKFGSGIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVAGGVEKGTPLTEKNG 177 Query: 715 HSKNVNQNNVQMSRAMNCFPVAGKDGNSHSLAESCCMKDGSNPAETCVIELEAASGCGSQ 894 K+ + ++++ P KD ++ ++ G++ E EA Sbjct: 178 EIKSGGKVGTMDNKSL-ASPEERKDTITNHQSDGILKGSGNSQGSLSTSECEAVGVNEEC 236 Query: 895 ASNSGDNTVMTSNQDRTQKVIPAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKFV 1074 SNS +N S +T F+ E DG MVNVV+GLKLYE+ LD +EV+K V Sbjct: 237 VSNSKEND---STMGKT---------FIGNEMFDGKMVNVVDGLKLYEDLLDRTEVSKLV 284 Query: 1075 SLANEMRAAGHRGELSG-QTLVTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERK 1251 SL N++R AG RG+ G QT V KRP KGHGREMIQLG+P + + ++ T K++K Sbjct: 285 SLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVPIADAPPDVDNVTGISKDKK 344 Query: 1252 IEAIPSPLHSIFDCLFEQQVLPVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDC 1431 +E+IPS I L QV+ VKPD C++DFFNEG+HS P+ WP WFGRP+ LFLT+C Sbjct: 345 VESIPSLFQDIIKRLVASQVMTVKPDACIVDFFNEGEHSHPNNWPPWFGRPLYILFLTEC 404 Query: 1432 
DVIFGRAVGSNHRGDYDGSLKLPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYR 1611 D+ FGR + S+H G++ G++ L ++ GSLLVMQGKS D AK A+PS+ K I++TF K + Sbjct: 405 DMTFGRIIVSDHPGEFRGAVTLSLVPGSLLVMQGKSTDFAKHALPSIHKQRIIVTFTKSQ 464 Query: 1612 PKKTLLPSEGTFFSSSAVNPLSISTSARPSSFSRYPSARKPYGIMPANGIQH--AQQAPQ 1785 P+ + LP++ S + P + A P S R P + G +H QA Sbjct: 465 PRSS-LPND-----SERLAPPAAPHWAPPPS-------RSPNHVRHQLGPKHYPTVQATG 511 Query: 1786 IMLSPNGVQPRFAAAPM-VASP-ALPSG---PPATVGWAAASSMNAXXXXXXXXXXXXFL 1950 ++ +PNG+QP F P+ VASP + P+ PP ++GW +A + Sbjct: 512 VLPAPNGMQPLFVPVPVPVASPMSFPTPVPIPPGSIGWTSAPPRHPPPRIPVPGTGVFLP 571 Query: 1951 PPGSVHIYPIQHLPGTPISVQTIYDNSRSEKPTSNSNASD 2070 PPGS I+ + P + +N +S +NS A + Sbjct: 572 PPGSGTIHEVN--PSVETWTVSGKENGKSNHSKTNSEAEE 609 >ref|NP_001145739.1| uncharacterized protein LOC100279246 [Zea mays] gi|219884237|gb|ACL52493.1| unknown [Zea mays] gi|414865753|tpg|DAA44310.1| TPA: hypothetical protein ZEAMMB73_610940 [Zea mays] Length = 653 Score = 397 bits (1020), Expect = e-107 Identities = 253/671 (37%), Positives = 356/671 (53%), Gaps = 32/671 (4%) Frame = +1 Query: 274 GRGSEVPQQWFVDERDGLISWLRGEFAAANAIIDVLMDHIRVTGELGEYDHVVGCIHQRR 453 G + P W VDERDG I+WLRGEFAAANAI+D L+ H+R E GEYDHV + QRR Sbjct: 14 GAAAAEPAAWLVDERDGFITWLRGEFAAANAIVDHLIFHLRSISEPGEYDHVFSLVQQRR 73 Query: 454 FYWTNFLHMQQFFPVTDVGYALQQVMDLRQLKQRQRHSYGQKDG-------RKSGFG--- 603 +W + +HMQQFFPV+D+ +ALQQ R Q G R++ F Sbjct: 74 HHWPHVIHMQQFFPVSDIAFALQQASWRRHAPPAQALGAGASPAALPPPPPRRASFSQSH 133 Query: 604 HRYGHRSDGVRESRVSPASGTAVSESGTVEQMVDKLDHSKNVNQNNVQMSRAMNCFPVAG 783 H + H G R PA G A + +G + Sbjct: 134 HSHQHHRHGA-HYRPDPARGAATAATG-------------------------------SE 161 Query: 784 KDGNS-HSLAESCCMKDGSNPAETCVIELEAASGCGSQASNSG-DNTVMTSNQDRTQKVI 957 KDG H+ E +K+ N +T + L+ S ++ G + + + + + + K++ Sbjct: 162 KDGREVHNNKEGRGLKEAGNVVDTKSLRLD------SPITDEGEEKSKLQAVSEESSKMV 215 Query: 958 PAPNEFVAKETCDGMMVNVVEGLKLYENFLDSSEVTKFVSLANEMRAAGHRGELSG-QTL 1134 P E+ E DG MVN VEGLK+YE L+ +E K +SL NE RA+ RG L Q + Sbjct: 216 
ATPVEYSTNEIIDGSMVNTVEGLKVYEGLLNVTEANKILSLVNETRASYRRGGLEARQKV 275 Query: 1135 VTLKRPTKGHGREMIQLGIPTNEGHIEDEHRTLNYKERKIEAIPSPLHSIFDCLFEQQVL 1314 + KRP KGHGRE++QLG+P +G +DE N +E ++EAIP L+ +FD L +Q+++ Sbjct: 276 IIAKRPMKGHGREIVQLGVPIIDGPPDDE----NLRETRVEAIPGLLNDLFDRLSQQEII 331 Query: 1315 PVKPDFCMIDFFNEGDHSQPHTWPSWFGRPVCNLFLTDCDVIFGRAVGSNHRGDYDGSLK 1494 P KPD+C+ID FNEGD+S PH PSW+GRP+C L LTDCD++FGR + S +GD+ G LK Sbjct: 332 PFKPDYCVIDIFNEGDYSHPHQSPSWYGRPLCTLCLTDCDMVFGRYI-SGEKGDHRGPLK 390 Query: 1495 LPVIAGSLLVMQGKSADLAKRAIPSLRKTLILLTFGKYRPKKTLLPSEGTFFSSSAVNPL 1674 L + GSLL+MQG+S D AKRAIP+ RK ++L FGK +K +P+E +S+ P+ Sbjct: 391 LSLATGSLLLMQGRSIDCAKRAIPATRKQRVILNFGKSVARKH-IPAESA-WSTPLTPPM 448 Query: 1675 SISTSARPSSFSRYPSARKPYGIMPANGIQHAQQAPQIMLSP-NGVQPRFAAAPMVASPA 1851 S+RP + SR+P + K YG P +G+ A + P +G+QP F A +++ A Sbjct: 449 PWGQSSRPVNGSRHPQSPKHYGYAPISGVLPAPPVGAHHVPPSDGMQPLFVAPAPISAAA 508 Query: 1852 LPSGPP-----ATVGWAAASSMNAXXXXXXXXXXXXFLPPGSVHIYPIQHLP-----GTP 2001 +P P + W + FLPPGS H P Q +P G P Sbjct: 509 IPFTPTVPLQNTSAAWIQEVTPRPAPPRFPGPGTGVFLPPGSGHPLPHQMMPASHGHGEP 568 Query: 2002 ISVQ--TIYDNSR-SEKPTSNSNAS-DCSTNITTDLSEPVPECNGD----GWCLDTAAVA 2157 S Q + Y +S+ + K TSN N S S T+ +E PECNG G D + Sbjct: 569 NSPQGSSAYLHSKVNGKETSNGNLSPKNSPRKTSCTAEEKPECNGSLNGGGGSADEKSTV 628 Query: 2158 PNQEQTDSVEK 2190 EQ + V+K Sbjct: 629 -GMEQQNGVQK 638