BLASTX nr result
ID: Rheum21_contig00005963
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00005963 (2349 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254...  874   0.0
gb|EOY18900.1| UDP-Glycosyltransferase superfamily protein isofo...  869   0.0
emb|CAN65363.1| hypothetical protein VITISV_036074 [Vitis vinifera]  865   0.0
gb|EMJ21765.1| hypothetical protein PRUPE_ppa001222mg [Prunus pe...  860   0.0
ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505...  852   0.0
gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]    846   0.0
gb|EOY18902.1| UDP-Glycosyltransferase superfamily protein isofo...  846   0.0
ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Popu...  840   0.0
ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citr...  835   0.0
ref|XP_006436560.1| hypothetical protein CICLE_v10030581mg [Citr...  835   0.0
ref|XP_006436559.1| hypothetical protein CICLE_v10030581mg [Citr...  835   0.0
ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...  828   0.0
ref|XP_006606299.1| PREDICTED: uncharacterized protein LOC100790...  827   0.0
ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790...  827   0.0
ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790...  827   0.0
ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arab...  824   0.0
ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabido...  824   0.0
ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Popu...  823   0.0
ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779... 
823 0.0 ref|XP_006606297.1| PREDICTED: uncharacterized protein LOC100790... 822 0.0 >ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254795 [Vitis vinifera] Length = 1028 Score = 874 bits (2258), Expect = 0.0 Identities = 420/678 (61%), Positives = 519/678 (76%), Gaps = 15/678 (2%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 K+ G FRF+FLCGNSTDGYND L+E+A HL+L PGS+R YGMN+DVNG++LM+D+V+Y Sbjct: 364 KNAGAMFRFVFLCGNSTDGYNDHLKEVASHLKLLPGSVRQYGMNSDVNGLILMADVVIYA 423 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQ EQ FP LLTRA+SFGIP+IAPDLP I+ YV DGVH +IF ++NPD L+RAFSLL+S Sbjct: 424 SSQVEQGFPPLLTRAMSFGIPVIAPDLPDIRKYVVDGVHVVIFPKNNPDALMRAFSLLIS 483 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 +G LS+ AKAV SG+LLAKN+ AS+C+ +A+L+ENVL+F SD LLP I+QS+ W Sbjct: 484 -NGKLSKFAKAVALSGRLLAKNMLASECVNSYAKLLENVLSFPSDVLLPGHISQSQHDAW 542 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEK---DHLDLGITDPMNMSEYNGEL 711 EWN ++ DM + + + +S ++ LE+ + LD G N+S N E Sbjct: 543 EWNSF------RTADMPLIENGSASMRKSSVVDVLEETLSNQLDSG-----NIS--NSET 589 Query: 712 EQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANE 891 E D+L DWD + EIES E++ERLEM++++ERMEK PG WDEIYRNARK E+VKFE NE Sbjct: 590 ENDVLTQLDWDVLREIESIEEMERLEMEELEERMEKNPGIWDEIYRNARKVERVKFETNE 649 Query: 892 RDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFL 1071 RDEG+LERTGQP+CIYEIY GAGAWPFLHHGS+YRGLSL T RRL SDDVDA RL L Sbjct: 650 RDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGLSLTTSARRLRSDDVDAVDRLPVL 709 Query: 1072 NESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQ 1251 N+++YR++ C++GGMFSIA++VD IH RPWIGFQ AE+ LEETIQ++ Sbjct: 710 NDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWHAVGSKVSLSSRAEKVLEETIQEE 769 Query: 1252 TKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDAL 1431 TKGDV+YFWA L +D G T+ ++ FWSMCDILNGG CR+ FE+AFR+MYA+P +AL Sbjct: 770 TKGDVLYFWAHLNVDDGPTQKNRIPTFWSMCDILNGGNCRTAFEDAFRQMYAMPSYIEAL 829 Query: 1432 
PPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAH----------- 1578 PPMP+DGGYWSALH WVMPTPSFLEFIMFSRMF DS+DALH N ++ + Sbjct: 830 PPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADSLDALHMNSRQSMNLSQSMNSSQPT 889 Query: 1579 -CLLGSSELERKHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAK 1755 CLLGSS+LE+KHCYCR+LELLVNVWAYHSAR+MVYI+P +G LEE HPV+QR+GFMWAK Sbjct: 890 VCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVYINPYSGQLEEQHPVEQRRGFMWAK 949 Query: 1756 YFNITLLKSMXXXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKV 1935 YFN TLLKSM RE WLWPLTGEVHW G+YE++REE+YR KMDKKRK Sbjct: 950 YFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYRSKMDKKRKA 1009 Query: 1936 KEKLIDRFQHGYKQKTLG 1989 KEKL++R +HGYKQK +G Sbjct: 1010 KEKLVERMKHGYKQKPIG 1027 >gb|EOY18900.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma cacao] Length = 1041 Score = 869 bits (2245), Expect = 0.0 Identities = 418/663 (63%), Positives = 512/663 (77%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D GGSF+FIFL GNSTDGY+DALQ++A L L+ GS+RHYG++ DVNGVLLM+DIVLYGT Sbjct: 392 DAGGSFKFIFLSGNSTDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGT 451 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 SQ+EQ FPSL+ RA++FGIP+I PD P++K YV DG HG+ F +H PD LLRAFSLL+S Sbjct: 452 SQEEQGFPSLIIRAMTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLIS- 510 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS A+ V SSG+LLAKN+ AS+CI G+A L+EN+LNF SD LLP+ ++Q G+WE Sbjct: 511 NGRLSRFAQTVASSGRLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWE 570 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN G +E + D+ + ++YALE++ I+ ++S+Y E++ QD Sbjct: 571 WNVFGMEIEHGTGDISRYFS---------VVYALEEEFTKHTISS--DISQYGAEIQDQD 619 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 + +DWD + EIE+FED ERLEMD+++ERME+ PG WD+IYRNAR+SEK+KFEANERDE Sbjct: 620 IPTEQDWDIVTEIENFEDYERLEMDEVEERMERNPGVWDDIYRNARRSEKLKFEANERDE 679 Query: 901 
GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQPVCIYEIY GAGAWPFLHHGSLYRGLSL + RRL SDDVDA RL LN++ Sbjct: 680 GELERTGQPVCIYEIYSGAGAWPFLHHGSLYRGLSLSRKARRLRSDDVDAVGRLPVLNDT 739 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 HYR+LLCE+GGMFSIA +VD+IH RPWIGFQ AE LEETIQ +K Sbjct: 740 HYRDLLCEVGGMFSIANRVDNIHKRPWIGFQSWRAAGRKVSLSTRAEEVLEETIQG-SKR 798 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFWARL++D G + L FWSMCD+LN G CR+ FE+AFR+MY LP +ALPPM Sbjct: 799 DVMYFWARLDIDGGGAGTNDALTFWSMCDLLNAGHCRTAFESAFRKMYILPSDTEALPPM 858 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P D G+WSALH WVMPT SFLEF+MFSRMFVDS+DALH+N E CLLGSSELE+KHCY Sbjct: 859 PKDDGHWSALHSWVMPTTSFLEFVMFSRMFVDSLDALHTNSGEVNLCLLGSSELEKKHCY 918 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 C++LELLVNVWAYHS RRMVYI+P +GLLEE HPV QRK FMWA+YFN TLLKSM Sbjct: 919 CQVLELLVNVWAYHSGRRMVYIEPHSGLLEEQHPVDQRKEFMWARYFNFTLLKSMDEDLA 978 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 R+ WLWPLTGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQ+ Sbjct: 979 EAADDEDHPRKMWLWPLTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKNGYKQR 1038 Query: 1981 TLG 1989 +LG Sbjct: 1039 SLG 1041 >emb|CAN65363.1| hypothetical protein VITISV_036074 [Vitis vinifera] Length = 1037 Score = 865 bits (2235), Expect = 0.0 Identities = 420/687 (61%), Positives = 519/687 (75%), Gaps = 24/687 (3%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQ---------ELAVHLRLSPGSIRHYGMNADVNGVL 153 K+ G RF+FLCGNSTDGYND L+ E+A HL+L PGS+R YGMN+DVNG++ Sbjct: 364 KNAGAMXRFVFLCGNSTDGYNDHLKVYGYNDHLKEVASHLKLLPGSVRQYGMNSDVNGLM 423 Query: 154 LMSDIVLYGTSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDL 333 LM+D+V+Y +SQ EQ FP LLTRA+SFGIP+IAPDLP I+ YV DGVH +IF ++NPD L Sbjct: 424 LMADVVIYASSQVEQGFPPLLTRAMSFGIPVIAPDLPDIRKYVVDGVHVVIFPKNNPDAL 483 Query: 334 LRAFSLLVSRDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSS 513 +RAFSLL+S +G LS+ 
AKAV SG+LLAKN+ AS+C+ +A+L+ENVL+F SD LLP Sbjct: 484 MRAFSLLIS-NGKLSKFAKAVALSGRLLAKNMLASECVNSYAKLLENVLSFPSDVLLPGH 542 Query: 514 ITQSEQGTWEWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEK---DHLDLGITDPM 684 I+QS+ WEWN ++ DM + + + +S ++ LE+ + LD G Sbjct: 543 ISQSQHDAWEWNSF------RTADMPLIENGSASMRKSSVVDVLEETLSNQLDSG----- 591 Query: 685 NMSEYNGELEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKS 864 N+S N E E D+L DWD + EIES E++ERLEM++++ERMEK PG WDEIYRNARK Sbjct: 592 NIS--NSETENDVLTQLDWDVLREIESIEEMERLEMEELEERMEKNPGIWDEIYRNARKV 649 Query: 865 EKVKFEANERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDV 1044 E+VKFEANERDEG+LERTGQP+CIYEIY GAGAWPFLHHGS+YRGLSL T RRL SDDV Sbjct: 650 ERVKFEANERDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGLSLTTSARRLRSDDV 709 Query: 1045 DAFTRLSFLNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAER 1224 DA RL LN+++YR++ C++GGMFSIA++VD IH RPWIGFQ AE+ Sbjct: 710 DAVDRLPVLNDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWHAVGSKVSLSSRAEK 769 Query: 1225 ALEETIQQQTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMY 1404 LEETIQ++TKGDV+YFWA L +D G T+ ++ FWSMCDILNGG CR+ FE+AFR+MY Sbjct: 770 VLEETIQEETKGDVLYFWAHLNVDDGPTQKNRIPTFWSMCDILNGGNCRTAFEDAFRQMY 829 Query: 1405 ALPPVKDALPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAH-- 1578 A+P +ALPPMP+DGGYWSALH WVMPTPSFLEFIMFSRMF DS+DALH N ++ + Sbjct: 830 AMPSYIEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADSLDALHMNSRQSMNLS 889 Query: 1579 ----------CLLGSSELERKHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQ 1728 CLLGSS+LE+KHCYCR+LELLVNVWAYHSAR+MVYI+P +G LEE HPV+ Sbjct: 890 QSMNSSQPTVCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVYINPYSGQLEEQHPVE 949 Query: 1729 QRKGFMWAKYFNITLLKSMXXXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYR 1908 QR+GFMWAKYFN TLLKSM RE WLWPLTGEVHW G+YE++REE+YR Sbjct: 950 QRRGFMWAKYFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYR 1009 Query: 1909 LKMDKKRKVKEKLIDRFQHGYKQKTLG 1989 KMDKKRK KEKL++R +HGYKQK +G Sbjct: 1010 SKMDKKRKAKEKLVERMKHGYKQKPIG 1036 >gb|EMJ21765.1| hypothetical protein PRUPE_ppa001222mg [Prunus persica] Length = 877 Score = 860 bits (2223), Expect = 0.0 
Identities = 409/666 (61%), Positives = 511/666 (76%), Gaps = 2/666 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 +D GGSF+F+FLCGNS+DGY+DA QE+A L L GS+RH+G+N DVN +LLM+DIVLYG Sbjct: 217 EDAGGSFKFVFLCGNSSDGYDDAFQEVASPLGLPRGSVRHFGLNGDVNSMLLMADIVLYG 276 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 + QD Q FP LL RA++FGIP+IAPD PV+K YV DGVH F HNPD L+++FSL++S Sbjct: 277 SFQDVQGFPPLLIRAMTFGIPVIAPDFPVLKKYVTDGVHINTFPNHNPDALMKSFSLMIS 336 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 +G LS+ A+ V SSG+LLA NL AS+CI G+AR++EN LNF SD LLP I++ ++GTW Sbjct: 337 -NGKLSKFARTVASSGRLLAMNLLASECITGYARVLENALNFPSDALLPGPISELQRGTW 395 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-- 714 EWN G ++ + DM DE+++ ++ + ++YALE++ G+ N+S+ NG E Sbjct: 396 EWNLFGNEIDYTTGDMQGIDEQSS-LESTSVVYALEEEFS--GLAYSTNISD-NGTWESA 451 Query: 715 QDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANER 894 QD+ DWD + EIE+ E+ ER+EM+++ ERME++PG WD+IYRNARK EK +FEANER Sbjct: 452 QDIPTQLDWDLLTEIENSEEYERVEMEELSERMERDPGLWDDIYRNARKVEKFRFEANER 511 Query: 895 DEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLN 1074 DEG+LERTGQ VCIYEIY G+G WPFLHHGSLYRGLSL R RR +SDDVDA RL LN Sbjct: 512 DEGELERTGQSVCIYEIYSGSGTWPFLHHGSLYRGLSLSIRARRSTSDDVDAVDRLPILN 571 Query: 1075 ESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQT 1254 E+HYRN+LCE+GGMF+IA KVD +H RPWIGFQ AE+ LEE IQ Sbjct: 572 ETHYRNILCEIGGMFAIANKVDSVHKRPWIGFQSWRAAGRKVSLSKKAEKVLEEAIQDNR 631 Query: 1255 KGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALP 1434 +GDV+YFW RL M+ G+T L FWS CDILNGG CR+ FE+AFR MYALP +ALP Sbjct: 632 EGDVIYFWGRLNMNGGMTGSKDALTFWSACDILNGGHCRNVFEHAFRWMYALPNNTEALP 691 Query: 1435 PMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKH 1614 PMP+DGG+WSALH WVMPT SFLEF+MFSRMFV+S+DALH+N + + CLLGSSELE+KH Sbjct: 692 PMPEDGGHWSALHSWVMPTHSFLEFVMFSRMFVNSLDALHTNNSGQSMCLLGSSELEQKH 751 Query: 1615 
CYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXX 1794 CYCR+LE+LVNVWAYHSAR++VYIDP +G +EE H + QR+ FMWAKYFN TLLKSM Sbjct: 752 CYCRVLEVLVNVWAYHSARKLVYIDPISGSMEEQHRIDQRQAFMWAKYFNATLLKSMDED 811 Query: 1795 XXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYK 1974 RENWLWPLTGEVHW G+YE++RE +YRLKMDKKRK KEKL++R ++GYK Sbjct: 812 LAEAADDGDHPRENWLWPLTGEVHWQGIYEREREVRYRLKMDKKRKTKEKLLERMKYGYK 871 Query: 1975 QKTLGG 1992 QKTLGG Sbjct: 872 QKTLGG 877 >ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum] Length = 1042 Score = 852 bits (2201), Expect = 0.0 Identities = 407/663 (61%), Positives = 510/663 (76%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D SF+F+FLCGNSTDGY+DALQE+A L L GSIRHYG++ DVN VLLM+DIVLYG+ Sbjct: 388 DAAESFKFVFLCGNSTDGYDDALQEVASRLGLPHGSIRHYGLDGDVNSVLLMADIVLYGS 447 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 +QD Q FP LL RA++F IP+IAPD PV++ Y+ DGVHG+ +S+HNP+ LL AFSLL+S Sbjct: 448 AQDVQGFPPLLIRAMTFEIPVIAPDFPVLRKYIVDGVHGVFYSKHNPEALLNAFSLLLS- 506 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 G LS+ A+A+GSSG+ AKN+ A +CI G+ARL+ENVL F SD+LLP ++Q +QG W Sbjct: 507 SGRLSKFAQAIGSSGRQFAKNVLALECITGYARLLENVLTFPSDSLLPGPVSQIQQGAWG 566 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 W+ L + +DM DE +K R +++A+E++ G+ N+ E E+ QD Sbjct: 567 WS-----LMQIDIDMKKIDEDFSK-GRVTVVHAVEQELA--GLNYSTNIFENGTEVPMQD 618 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 L DWD + EIE ++ E LEM++++ERMEK+ G WDEIYRNARKSEK+KFEANERDE Sbjct: 619 ELTKLDWDILREIEIADESEMLEMEEVEERMEKDVGVWDEIYRNARKSEKLKFEANERDE 678 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQPVCIYEIY G G WPFLHHGSLYRGLSL +++R SSDDVDA RL LN++ Sbjct: 679 GELERTGQPVCIYEIYSGTGVWPFLHHGSLYRGLSLSRKSQRQSSDDVDAVGRLPLLNDT 738 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 
+YR++LCE+GGMF+IA +VD IH RPW+GFQ AERALEET+ + +G Sbjct: 739 YYRDILCEIGGMFAIANRVDGIHRRPWVGFQSWRAAGRKVALSMEAERALEETMNESFRG 798 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFW RL++D + + L FWSMCDILNGG CR+ F+++FR+MYALPP +ALPPM Sbjct: 799 DVIYFWGRLDLDGSVIGSNNALTFWSMCDILNGGNCRNVFQDSFRQMYALPPHAEALPPM 858 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGGYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + ++ + CLLGSSE+E KHCY Sbjct: 859 PEDGGYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSSKHSVCLLGSSEIEEKHCY 918 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELL+NVWAYHSAR+MVYI+P TG +EE H V QRKGFMWA+YFN TLLKSM Sbjct: 919 CRVLELLINVWAYHSARKMVYINPDTGSMEEQHVVDQRKGFMWAQYFNFTLLKSMDEDLA 978 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RENWLWP+TGEVHW G+YE++REE+YR+KMDKKRK KEKL +R ++GYKQK Sbjct: 979 EAADDGDHPRENWLWPMTGEVHWQGIYEREREERYRIKMDKKRKTKEKLYERMKYGYKQK 1038 Query: 1981 TLG 1989 +LG Sbjct: 1039 SLG 1041 >gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis] Length = 1043 Score = 846 bits (2186), Expect = 0.0 Identities = 408/668 (61%), Positives = 501/668 (75%), Gaps = 4/668 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 KD GGSF+F+FLCGNSTDGYND L+E+A L L S+RHYG+N+DV +LLM+DI LY Sbjct: 384 KDSGGSFKFVFLCGNSTDGYNDVLKEVASRLGLQDDSLRHYGLNSDVKSLLLMADIFLYD 443 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQ Q FP LL +A++F IP+IAPD PV++ Y+ DGVHG+ F +HNPD LL+AFS L+S Sbjct: 444 SSQGVQGFPPLLIQAMTFEIPVIAPDFPVLQKYIVDGVHGIFFPKHNPDALLKAFSFLIS 503 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 G LS A+ V SSG+ LAKN+ A++CI G+ARL+E+VL F SD LP I+Q G W Sbjct: 504 -SGKLSRSAQTVASSGRRLAKNIMATECIMGYARLLESVLYFPSDAFLPGPISQLHLGAW 562 Query: 541 EWNFVGEILEEKSVDMMSFDERAT----KVDRSRIIYALEKDHLDLGITDPMNMSEYNGE 708 EWN L +K +D++ DE + K ++YALE++ L + G Sbjct: 563 
EWN-----LFQKEIDLIG-DEMSHIAEGKSAAKSVVYALEEE-LTYSANSQNFSEDGTGN 615 Query: 709 LEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEAN 888 LEQD+ +DWD + EIES E+ ERLEMD++DERMEK G WD+IYRNARKSEK+KFE N Sbjct: 616 LEQDIPKQQDWDVLGEIESSEEYERLEMDELDERMEKVSGVWDDIYRNARKSEKLKFEPN 675 Query: 889 ERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSF 1068 ERDEG+LERTGQPVCIYEIY GA AWPFLHHGSLYRGLSL R+L SDDV+A RL Sbjct: 676 ERDEGELERTGQPVCIYEIYSGAAAWPFLHHGSLYRGLSLSAGARKLRSDDVNAVGRLPI 735 Query: 1069 LNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQ 1248 LN+++YR++LCE+GGMF+IA KVD+IH RPWIGFQ AE+ LEETIQ+ Sbjct: 736 LNQTYYRDILCEIGGMFAIAKKVDNIHGRPWIGFQSWHAAGRKVSLSPKAEKVLEETIQE 795 Query: 1249 QTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDA 1428 TKGDV+YFWARL MD G+T L FWSMCDILNGG CR+ FE+AFRR+Y LP +A Sbjct: 796 NTKGDVIYFWARLNMDGGVTGSKNALTFWSMCDILNGGYCRTAFEDAFRRIYGLPSHIEA 855 Query: 1429 LPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELER 1608 LPPMP+DGG+WSALH WVMPTPSFLEF+MF+RMF DS+DALH+N ++ CLLGSS++E+ Sbjct: 856 LPPMPEDGGHWSALHSWVMPTPSFLEFVMFARMFADSLDALHANVSKENTCLLGSSDIEK 915 Query: 1609 KHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMX 1788 KHCYCR+LE+LVNVWAYHSAR+MVYIDP G LEE HPV+QRK FMWAKYFN TLLK + Sbjct: 916 KHCYCRMLEVLVNVWAYHSARKMVYIDPHAGSLEEQHPVEQRKEFMWAKYFNQTLLKRID 975 Query: 1789 XXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHG 1968 E WLWPLTGEVHW G+YE++RE++YRLKMDKKRK +EKL +R ++G Sbjct: 976 ENLAEAADDGDHPSEMWLWPLTGEVHWQGIYEREREQRYRLKMDKKRKTREKLFERMKYG 1035 Query: 1969 YKQKTLGG 1992 YKQK+LGG Sbjct: 1036 YKQKSLGG 1043 >gb|EOY18902.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma cacao] Length = 1034 Score = 846 bits (2186), Expect = 0.0 Identities = 413/663 (62%), Positives = 505/663 (76%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D GGSF+FIFL GNSTDGY+DALQ++A L L+ GS+RHYG++ DVNGVLLM+DIVLYGT Sbjct: 392 
DAGGSFKFIFLSGNSTDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGT 451 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 SQ+EQ FPSL+ RA++FGIP+I PD P++K YV DG HG+ F +H PD LLRAFSLL+S Sbjct: 452 SQEEQGFPSLIIRAMTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLIS- 510 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS A+ V SSG+LLAKN+ AS+CI G+A L+EN+LNF SD LLP+ ++Q G+WE Sbjct: 511 NGRLSRFAQTVASSGRLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWE 570 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN G +E + D+ + ++YALE++ I+ ++S+Y E++ QD Sbjct: 571 WNVFGMEIEHGTGDISRYFS---------VVYALEEEFTKHTISS--DISQYGAEIQDQD 619 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 + +DWD + EIE+FED ERLEMD+++ERME+ PG WD+IYRNAR+SEK+KFEANERDE Sbjct: 620 IPTEQDWDIVTEIENFEDYERLEMDEVEERMERNPGVWDDIYRNARRSEKLKFEANERDE 679 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQPVCIYEIY GAGAWPFLHHGSLYRGLSL + RRL SDDVDA RL LN++ Sbjct: 680 GELERTGQPVCIYEIYSGAGAWPFLHHGSLYRGLSLSRKARRLRSDDVDAVGRLPVLNDT 739 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 HYR+LLCE+GGMFSIA +VD+IH RPWIGFQ AE LEETI Q +K Sbjct: 740 HYRDLLCEVGGMFSIANRVDNIHKRPWIGFQSWRAAGRKVSLSTRAEEVLEETI-QGSKR 798 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFWARL++D G + L FWSMCD+LN G CR+ FE+AFR+MY LP +ALPPM Sbjct: 799 DVMYFWARLDIDGGGAGTNDALTFWSMCDLLNAGHCRTAFESAFRKMYILPSDTEALPPM 858 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P D G+WSALH WVMPT SFLEF+MFSRMFVDS+DALH+N E CLLGSSELE Sbjct: 859 PKDDGHWSALHSWVMPTTSFLEFVMFSRMFVDSLDALHTNSGEVNLCLLGSSELE----- 913 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 +LELLVNVWAYHS RRMVYI+P +GLLEE HPV QRK FMWA+YFN TLLKSM Sbjct: 914 --VLELLVNVWAYHSGRRMVYIEPHSGLLEEQHPVDQRKEFMWARYFNFTLLKSMDEDLA 971 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 
1980 R+ WLWPLTGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQ+ Sbjct: 972 EAADDEDHPRKMWLWPLTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKNGYKQR 1031 Query: 1981 TLG 1989 +LG Sbjct: 1032 SLG 1034 >ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Populus trichocarpa] gi|550330474|gb|ERP56591.1| hypothetical protein POPTR_0010s23830g [Populus trichocarpa] Length = 1053 Score = 840 bits (2169), Expect = 0.0 Identities = 404/665 (60%), Positives = 504/665 (75%), Gaps = 1/665 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 KD GSF+F+FLCGNSTD +DA QE+ + L P S+RHYG+N D N VLL +DIVLYG Sbjct: 395 KDAEGSFKFVFLCGNSTD--DDAFQEIVSRVGLHPSSVRHYGLNGDANSVLLAADIVLYG 452 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQDEQ FP +L RA++FGIP+IAPD+P +K YV D HG+ FS++NP+ L RAFSLL+S Sbjct: 453 SSQDEQGFPPVLIRAMTFGIPVIAPDIPTMKKYVSDEAHGIFFSKYNPEALTRAFSLLIS 512 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 +G LS+ A+ V SG+LLAKN+ AS+CI G+ARL+EN+L+F SDTLLP +++ EQ W Sbjct: 513 -NGKLSKFAETVAFSGRLLAKNMLASECITGYARLLENMLSFPSDTLLPGPVSKLEQREW 571 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGE-LEQ 717 EWN + LE+++ D+ E + I+Y+LEK+ +L + +SE E L Sbjct: 572 EWNLFNKELEQETDDLSGMYESLFSSRETSIVYSLEKEWSNL--VNSTIISENGTEILVP 629 Query: 718 DLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897 D DWD + EIESFE+ ER+ ++++ERM+K G WD+IYR+ARKSEK+KFE+NERD Sbjct: 630 DTPTESDWDVLMEIESFEEHERVVKEELEERMDKTRGLWDDIYRSARKSEKLKFESNERD 689 Query: 898 EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077 EG+LERTGQPVCIYEIY+GAGAWP LHHGSLYRGLSL T+ RR SDDVDA RL LNE Sbjct: 690 EGELERTGQPVCIYEIYDGAGAWPLLHHGSLYRGLSLSTKARRSRSDDVDAVARLPLLNE 749 Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257 S+Y+N+LCE+GGMFSIA +VD IH RPWIGFQ AE+ LEE Q++ K Sbjct: 750 SYYQNILCEIGGMFSIAIRVDAIHKRPWIGFQSWHAAGRKVSLSFKAEKVLEEKTQEENK 809 Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 
1437 DV+YFWARL MD G+T ++ L FWSMCD+LNGG+CR+ FE+AFR+MY LP +ALPP Sbjct: 810 -DVMYFWARLGMDGGVTGSNEELTFWSMCDVLNGGRCRTAFEDAFRQMYDLPSYLEALPP 868 Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617 MP+DGG+WSALH WVMPTPSFLEFIMFSRMFVDS+DAL SN ++ CLL S+ELE KHC Sbjct: 869 MPEDGGHWSALHSWVMPTPSFLEFIMFSRMFVDSLDALQSNSSQVNKCLLSSTELEEKHC 928 Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797 YCRI+E+LVNVWAYHSARRMVYIDP TG +EE HP++QRK W KYFN+T+LKSM Sbjct: 929 YCRIMEVLVNVWAYHSARRMVYIDPHTGSVEEQHPIKQRKEIAWKKYFNLTVLKSMDEDL 988 Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977 RE WLWPLTGEVHW G+YE++REE+YR+KMDKKRK +EKL++R + GYKQ Sbjct: 989 AEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYRIKMDKKRKTREKLVERLKAGYKQ 1048 Query: 1978 KTLGG 1992 K LGG Sbjct: 1049 KPLGG 1053 >ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] gi|568863734|ref|XP_006485286.1| PREDICTED: uncharacterized protein LOC102618162 isoform X1 [Citrus sinensis] gi|557538757|gb|ESR49801.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] Length = 1055 Score = 835 bits (2157), Expect = 0.0 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%) Frame = +1 Query: 7 VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186 V GSF+F+FLCGNSTDGYNDALQE+A L L S+RHYG N DVNGVLLM+DIVLYG+S Sbjct: 399 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 458 Query: 187 QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366 Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G + F + NP+ L RAFSL +S + Sbjct: 459 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 517 Query: 367 GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546 G LS+ A+ V S+G+L AKN+ A DC+ +AR++ENVLNF SD LLP I+Q +Q +WEW Sbjct: 518 GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 577 Query: 547 NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723 N + ++ + D+++ DE T R+ + L ++ IT+ N S +QD Sbjct: 578 
NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 632 Query: 724 LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903 ++ DWD + +IES E+ ERLEM+Q++ERM+ +WD+IYRNARKSE+ KFEANERDEG Sbjct: 633 ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 692 Query: 904 DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083 +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L + RRL SDDVDA +RL LN +H Sbjct: 693 ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 752 Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263 YR++LCE+GGMFSIA KVD+IH RPWIGFQ AE+ LEET+Q+ T+GD Sbjct: 753 YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 811 Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 V+YFWA L+MD G TR + L FWSMCDILNGG CR+ F +AFR+MY LP +ALPPM Sbjct: 812 VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 871 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++ CLL SSELE+KHCY Sbjct: 872 PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 931 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM Sbjct: 932 CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 991 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR GY+QK Sbjct: 992 EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 1051 Query: 1981 TLGG 1992 TLGG Sbjct: 1052 TLGG 1055 >ref|XP_006436560.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] gi|568863738|ref|XP_006485288.1| PREDICTED: uncharacterized protein LOC102618162 isoform X3 [Citrus sinensis] gi|568863740|ref|XP_006485289.1| PREDICTED: uncharacterized protein LOC102618162 isoform X4 [Citrus sinensis] gi|557538756|gb|ESR49800.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] Length = 875 Score = 835 bits 
(2157), Expect = 0.0 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%) Frame = +1 Query: 7 VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186 V GSF+F+FLCGNSTDGYNDALQE+A L L S+RHYG N DVNGVLLM+DIVLYG+S Sbjct: 219 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 278 Query: 187 QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366 Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G + F + NP+ L RAFSL +S + Sbjct: 279 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 337 Query: 367 GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546 G LS+ A+ V S+G+L AKN+ A DC+ +AR++ENVLNF SD LLP I+Q +Q +WEW Sbjct: 338 GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 397 Query: 547 NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723 N + ++ + D+++ DE T R+ + L ++ IT+ N S +QD Sbjct: 398 NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 452 Query: 724 LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903 ++ DWD + +IES E+ ERLEM+Q++ERM+ +WD+IYRNARKSE+ KFEANERDEG Sbjct: 453 ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 512 Query: 904 DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083 +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L + RRL SDDVDA +RL LN +H Sbjct: 513 ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 572 Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263 YR++LCE+GGMFSIA KVD+IH RPWIGFQ AE+ LEET+Q+ T+GD Sbjct: 573 YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 631 Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 V+YFWA L+MD G TR + L FWSMCDILNGG CR+ F +AFR+MY LP +ALPPM Sbjct: 632 VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 691 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++ CLL SSELE+KHCY Sbjct: 692 PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 751 Query: 1621 
CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM Sbjct: 752 CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 811 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR GY+QK Sbjct: 812 EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 871 Query: 1981 TLGG 1992 TLGG Sbjct: 872 TLGG 875 >ref|XP_006436559.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] gi|557538755|gb|ESR49799.1| hypothetical protein CICLE_v10030581mg [Citrus clementina] Length = 797 Score = 835 bits (2157), Expect = 0.0 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%) Frame = +1 Query: 7 VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186 V GSF+F+FLCGNSTDGYNDALQE+A L L S+RHYG N DVNGVLLM+DIVLYG+S Sbjct: 141 VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 200 Query: 187 QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366 Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G + F + NP+ L RAFSL +S + Sbjct: 201 QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 259 Query: 367 GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546 G LS+ A+ V S+G+L AKN+ A DC+ +AR++ENVLNF SD LLP I+Q +Q +WEW Sbjct: 260 GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 319 Query: 547 NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723 N + ++ + D+++ DE T R+ + L ++ IT+ N S +QD Sbjct: 320 NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 374 Query: 724 LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903 ++ DWD + +IES E+ ERLEM+Q++ERM+ +WD+IYRNARKSE+ KFEANERDEG Sbjct: 375 ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 434 Query: 904 DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083 +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L + RRL SDDVDA +RL LN +H Sbjct: 435 ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 494 
Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263 YR++LCE+GGMFSIA KVD+IH RPWIGFQ AE+ LEET+Q+ T+GD Sbjct: 495 YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 553 Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 V+YFWA L+MD G TR + L FWSMCDILNGG CR+ F +AFR+MY LP +ALPPM Sbjct: 554 VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 613 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++ CLL SSELE+KHCY Sbjct: 614 PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 673 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM Sbjct: 674 CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 733 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR GY+QK Sbjct: 734 EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 793 Query: 1981 TLGG 1992 TLGG Sbjct: 794 TLGG 797 >ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 isoform X1 [Glycine max] Length = 1044 Score = 828 bits (2139), Expect = 0.0 Identities = 399/659 (60%), Positives = 495/659 (75%), Gaps = 1/659 (0%) Frame = +1 Query: 16 SFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTSQDE 195 SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG++Q+ Sbjct: 395 SFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGSAQEV 454 Query: 196 QSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRDGIL 375 Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S +G L Sbjct: 455 QGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS-NGRL 513 Query: 376 SELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEWNFV 555 S+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP ++Q +QG+WEWN Sbjct: 514 SKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGPVSQIQQGSWEWNLF 573 Query: 556 
GEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QDLLNA 732 ++ +D F R I+YA+E + L + ++ E E+ +D L Sbjct: 574 RNEIDLSKIDG-DFSNRKVS-----IVYAVEHELASLNYST--SIFENGTEVPLRDELTQ 625 Query: 733 EDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEGDLE 912 DWD + EIE E+ E E+++ +ER EK G WD+IYRNARKSEK+KFE NERDEG+LE Sbjct: 626 LDWDILREIEISEENEMFEVEEAEERREKGVGVWDDIYRNARKSEKLKFEVNERDEGELE 685 Query: 913 RTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESHYRN 1092 RTGQPVCIYEIY GAG WPFLHHGSLYRGLSL R +R SSDDVDA RL LN+++YR+ Sbjct: 686 RTGQPVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQSSDDVDAVGRLPLLNDTYYRD 745 Query: 1093 LLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGDVVY 1272 +LCEMGGMF+IA +VD+IH RPWIGFQ AE+ LEET+Q+ +GDV+Y Sbjct: 746 ILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLEETMQENFRGDVIY 805 Query: 1273 FWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPMPDDG 1452 FW R +MD + FW MCDILNGG CR F+ FR+MYALPP +ALPPMP+D Sbjct: 806 FWGRFDMDQSVIGNHNANSFWYMCDILNGGNCRIVFQEGFRQMYALPPHAEALPPMPED- 864 Query: 1453 GYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCYCRIL 1632 GYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + T+ + CLLGSSE+E+KHCYCR+L Sbjct: 865 GYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLGSSEIEKKHCYCRVL 924 Query: 1633 ELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXXXXXX 1812 ELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMWAKYFNI+LLKSM Sbjct: 925 ELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNISLLKSMDEDLAEAAD 984 Query: 1813 XXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQKTLG 1989 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK+LG Sbjct: 985 DGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSLG 1043 >ref|XP_006606299.1| PREDICTED: uncharacterized protein LOC100790929 isoform X4 [Glycine max] Length = 869 Score = 827 bits (2136), Expect = 0.0 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG+ Sbjct: 
218 DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 277 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 +Q+ Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S Sbjct: 278 AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 336 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE Sbjct: 337 NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 396 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN L + +D+ D + I+YA+E + L + ++ E E+ QD Sbjct: 397 WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 445 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 L D D + EIE E+ E E+++ +ERMEK WD+IYRNARKSEK+KFE NERDE Sbjct: 446 ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 505 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL R +R +SDDVDA RL LN++ Sbjct: 506 GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 565 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 +YR++LCEMGGMF+IA +VD IH RPWIGFQ AE LEET+Q+ +G Sbjct: 566 YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 625 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFW RL+MD R + FW MCDILNGG CR F++ FR+MYALPP +ALPPM Sbjct: 626 DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 685 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY Sbjct: 686 PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 745 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM Sbjct: 746 CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 805 Query: 1801 
XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK Sbjct: 806 EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 865 Query: 1981 TLG 1989 +LG Sbjct: 866 SLG 868 >ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790929 isoform X3 [Glycine max] Length = 1015 Score = 827 bits (2136), Expect = 0.0 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG+ Sbjct: 364 DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 423 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 +Q+ Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S Sbjct: 424 AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 482 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE Sbjct: 483 NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 542 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN L + +D+ D + I+YA+E + L + ++ E E+ QD Sbjct: 543 WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 591 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 L D D + EIE E+ E E+++ +ERMEK WD+IYRNARKSEK+KFE NERDE Sbjct: 592 ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 651 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL R +R +SDDVDA RL LN++ Sbjct: 652 GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 711 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 +YR++LCEMGGMF+IA +VD IH RPWIGFQ AE LEET+Q+ +G Sbjct: 712 YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 771 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFW RL+MD R 
+ FW MCDILNGG CR F++ FR+MYALPP +ALPPM Sbjct: 772 DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 831 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY Sbjct: 832 PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 891 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM Sbjct: 892 CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 951 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK Sbjct: 952 EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1011 Query: 1981 TLG 1989 +LG Sbjct: 1012 SLG 1014 >ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790929 isoform X1 [Glycine max] Length = 1045 Score = 827 bits (2136), Expect = 0.0 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG+ Sbjct: 394 DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 453 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 +Q+ Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S Sbjct: 454 AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 512 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE Sbjct: 513 NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 572 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN L + +D+ D + I+YA+E + L + ++ E E+ QD Sbjct: 573 WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 621 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 L D D + EIE E+ E E+++ +ERMEK WD+IYRNARKSEK+KFE NERDE Sbjct: 622 
ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 681 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL R +R +SDDVDA RL LN++ Sbjct: 682 GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 741 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 +YR++LCEMGGMF+IA +VD IH RPWIGFQ AE LEET+Q+ +G Sbjct: 742 YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 801 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFW RL+MD R + FW MCDILNGG CR F++ FR+MYALPP +ALPPM Sbjct: 802 DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 861 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY Sbjct: 862 PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 921 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM Sbjct: 922 CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 981 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK Sbjct: 982 EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1041 Query: 1981 TLG 1989 +LG Sbjct: 1042 SLG 1044 >ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana] gi|332003368|gb|AED90751.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana] Length = 1035 Score = 824 bits (2129), Expect = 0.0 Identities = 392/665 (58%), Positives = 502/665 (75%), Gaps = 1/665 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 KD GSF+F+FL GNST G +DA+QE+A L L+ G++RH+G+N DVN VL M+DI++Y Sbjct: 376 KDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYA 435 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQ+EQ+FP L+ RA+SFGIPII PD P++K Y+ D VHG+ F 
R++PD LL+AFS L+S Sbjct: 436 SSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLIS 495 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 DG LS+ A+ + SSG+LL KNL A++CI G+ARL+EN+L+F SDT LP SI+Q + W Sbjct: 496 -DGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLPGSISQLQVAAW 554 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELEQD 720 EWNF LE+ ++ D + +S I++ +E+ + G+ + N + N D Sbjct: 555 EWNFFRSELEQPKSFIL--DSAYAFIGKSGIVFQVEEKFM--GVIESTNPVDNNTLFVSD 610 Query: 721 LLNAE-DWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897 L ++ DWD + EIE E+ E++E +++++RME++ +W+EIYRNARKSEK+KFE NERD Sbjct: 611 ELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERD 670 Query: 898 EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077 EG+LERTG+P+CIYEIY GAGAWPFLHHGSLYRGLSL ++ RRLSSDDVDA RL LN+ Sbjct: 671 EGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLND 730 Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257 ++YR++LCE+GGMFS+A KVD IHMRPWIGFQ AE +LE I+Q+TK Sbjct: 731 TYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETK 790 Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 1437 G+++YFW RL++D L FWSMCDILN G CR+TFE+AFR MY LP +ALPP Sbjct: 791 GEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPP 850 Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617 MP+DG +WS+LH WVMPTPSFLEF+MFSRMF +S+DALH+N ++ C L SS LERKHC Sbjct: 851 MPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHC 910 Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797 YCR+LELLVNVWAYHS R+MVYI+P+ G LEE HP+QQRKG MWAKYFN TLLKSM Sbjct: 911 YCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDL 970 Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977 RE WLWPLTGEVHW GVYE++REE+YRLKMDKKRK KEKL DR ++GYKQ Sbjct: 971 AEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQ 1030 Query: 1978 KTLGG 1992 K+LGG Sbjct: 1031 
KSLGG 1035 >ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana] gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80 [Arabidopsis thaliana] gi|24111433|gb|AAN46867.1| At5g04480/T32M21_80 [Arabidopsis thaliana] gi|332003367|gb|AED90750.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana] Length = 1050 Score = 824 bits (2129), Expect = 0.0 Identities = 392/665 (58%), Positives = 502/665 (75%), Gaps = 1/665 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 KD GSF+F+FL GNST G +DA+QE+A L L+ G++RH+G+N DVN VL M+DI++Y Sbjct: 391 KDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYA 450 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQ+EQ+FP L+ RA+SFGIPII PD P++K Y+ D VHG+ F R++PD LL+AFS L+S Sbjct: 451 SSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLIS 510 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 DG LS+ A+ + SSG+LL KNL A++CI G+ARL+EN+L+F SDT LP SI+Q + W Sbjct: 511 -DGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLPGSISQLQVAAW 569 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELEQD 720 EWNF LE+ ++ D + +S I++ +E+ + G+ + N + N D Sbjct: 570 EWNFFRSELEQPKSFIL--DSAYAFIGKSGIVFQVEEKFM--GVIESTNPVDNNTLFVSD 625 Query: 721 LLNAE-DWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897 L ++ DWD + EIE E+ E++E +++++RME++ +W+EIYRNARKSEK+KFE NERD Sbjct: 626 ELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERD 685 Query: 898 EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077 EG+LERTG+P+CIYEIY GAGAWPFLHHGSLYRGLSL ++ RRLSSDDVDA RL LN+ Sbjct: 686 EGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLND 745 Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257 ++YR++LCE+GGMFS+A KVD IHMRPWIGFQ AE +LE I+Q+TK Sbjct: 746 TYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETK 805 Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 1437 G+++YFW RL++D L 
FWSMCDILN G CR+TFE+AFR MY LP +ALPP Sbjct: 806 GEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPP 865 Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617 MP+DG +WS+LH WVMPTPSFLEF+MFSRMF +S+DALH+N ++ C L SS LERKHC Sbjct: 866 MPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHC 925 Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797 YCR+LELLVNVWAYHS R+MVYI+P+ G LEE HP+QQRKG MWAKYFN TLLKSM Sbjct: 926 YCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDL 985 Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977 RE WLWPLTGEVHW GVYE++REE+YRLKMDKKRK KEKL DR ++GYKQ Sbjct: 986 AEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQ 1045 Query: 1978 KTLGG 1992 K+LGG Sbjct: 1046 KSLGG 1050 >ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Populus trichocarpa] gi|550332296|gb|ERP57299.1| hypothetical protein POPTR_0008s02940g [Populus trichocarpa] Length = 1061 Score = 823 bits (2127), Expect = 0.0 Identities = 397/666 (59%), Positives = 501/666 (75%), Gaps = 4/666 (0%) Frame = +1 Query: 1 KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180 KD GSF+ IFL GNSTD ++ALQE+ L L GS+ HYG++ DVN VLLM+D+VLYG Sbjct: 397 KDAEGSFKLIFLGGNSTD--DNALQEVVSGLGLHHGSVWHYGLHGDVNSVLLMADVVLYG 454 Query: 181 TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360 +SQ+EQ FP LL RA++FG P+IAPD+P++K YV DG HG++FS+++P+ L RA SLL+S Sbjct: 455 SSQNEQGFPPLLIRAMTFGTPVIAPDIPILKKYVDDGAHGILFSKYSPEALTRALSLLIS 514 Query: 361 RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540 +G LS+ A+ + SG+LLAKN+ AS+CI G+ARL+EN+++F SDTLLP ++ ++ W Sbjct: 515 -NGKLSKFAQTLAFSGRLLAKNMLASECIIGYARLLENLISFPSDTLLPGPVSNLQRREW 573 Query: 541 EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGE---- 708 EWN + LE++ D++S E + +Y+LEK+ ++ +N + +G Sbjct: 574 EWNLFSKELEQEIDDLLSMAEGDFSFRETSAVYSLEKEW-----SNHVNSTSISGNGTEI 628 Query: 709 LEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEAN 888 L D+ 
DWD ++EIESFE+ ER+E +++ ERM+K G WDEIY +ARKSEK+KFEAN Sbjct: 629 LVPDIPTESDWDVLSEIESFEEYERVETEELQERMDKSHGPWDEIYHDARKSEKLKFEAN 688 Query: 889 ERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSF 1068 ERDEG+LERTGQPVCIYEIY+GAGAWPFL+HGSLYRGLSL T+ RR SDDVDA RL Sbjct: 689 ERDEGELERTGQPVCIYEIYDGAGAWPFLNHGSLYRGLSLSTKARRSRSDDVDAVARLPL 748 Query: 1069 LNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQ 1248 LN+S+Y+N+LC++GGMFSIA +VDDIH RPWIGFQ AE+ LEE +Q+ Sbjct: 749 LNDSYYQNILCDIGGMFSIANRVDDIHKRPWIGFQSWHAAGSKVSLTFKAEQVLEEKVQE 808 Query: 1249 QTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDA 1428 + K DV+Y+WARL+MD G+T + L FWSMCDILNGG CR FE+AFR MY LP + Sbjct: 809 ENK-DVMYYWARLDMDGGVTGSNDELTFWSMCDILNGGHCRIAFEDAFRHMYGLPSNLEV 867 Query: 1429 LPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELER 1608 LPPMP+DGG+WSALH WVMPTPSFLEFIMFSRMFVDS+DAL SN ++ CLL SSEL+ Sbjct: 868 LPPMPEDGGHWSALHSWVMPTPSFLEFIMFSRMFVDSLDALQSNSSQMTKCLLSSSELQE 927 Query: 1609 KHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMX 1788 KHCYCRILE+LVNVWAYHSARRMVYIDP TG +EE HPV+QRKG MW KYF + +LKSM Sbjct: 928 KHCYCRILEVLVNVWAYHSARRMVYIDPHTGSVEEQHPVEQRKGIMWEKYFKLMVLKSMD 987 Query: 1789 XXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHG 1968 RE WLWPLTGEVHW G+YE++REEKYR+KMDKKRK KEKL +R + G Sbjct: 988 EDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREEKYRVKMDKKRKTKEKLFERLKSG 1047 Query: 1969 YKQKTL 1986 YKQK L Sbjct: 1048 YKQKPL 1053 >ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779157 isoform X2 [Glycine max] Length = 1043 Score = 823 bits (2125), Expect = 0.0 Identities = 399/659 (60%), Positives = 494/659 (74%), Gaps = 1/659 (0%) Frame = +1 Query: 16 SFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTSQDE 195 SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG++Q+ Sbjct: 395 SFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGSAQEV 454 Query: 196 QSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRDGIL 375 Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ 
AFSLL+S +G L Sbjct: 455 QGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS-NGRL 513 Query: 376 SELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEWNFV 555 S+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP ++Q +QG+WEWN Sbjct: 514 SKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGPVSQIQQGSWEWNLF 573 Query: 556 GEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QDLLNA 732 ++ +D F R I+YA+E + L + ++ E E+ +D L Sbjct: 574 RNEIDLSKIDG-DFSNRKVS-----IVYAVEHELASLNYST--SIFENGTEVPLRDELTQ 625 Query: 733 EDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEGDLE 912 DWD + EIE E+ E E+++ +ER EK G WD+IYRNARKSEK+KFE NERDEG+LE Sbjct: 626 LDWDILREIEISEENEMFEVEEAEERREKGVGVWDDIYRNARKSEKLKFEVNERDEGELE 685 Query: 913 RTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESHYRN 1092 RTGQPVCIYEIY GAG WPFLHHGSLYRGLSL R +R SSDDVDA RL LN+++YR+ Sbjct: 686 RTGQPVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQSSDDVDAVGRLPLLNDTYYRD 745 Query: 1093 LLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGDVVY 1272 +LCEMGGMF+IA +VD+IH RPWIGFQ AE+ LEET+Q+ +GDV+Y Sbjct: 746 ILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLEETMQENFRGDVIY 805 Query: 1273 FWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPMPDDG 1452 FW R +MD + FW MCDILNGG CR F+ FR+MYALPP +ALPPMP+D Sbjct: 806 FWGRFDMDQSVIGNHNANSFWYMCDILNGGNCRIVFQEGFRQMYALPPHAEALPPMPED- 864 Query: 1453 GYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCYCRIL 1632 GYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + T+ + CLLGSSE+E KHCYCR+L Sbjct: 865 GYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLGSSEIE-KHCYCRVL 923 Query: 1633 ELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXXXXXX 1812 ELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMWAKYFNI+LLKSM Sbjct: 924 ELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNISLLKSMDEDLAEAAD 983 Query: 1813 XXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQKTLG 1989 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK+LG Sbjct: 984 DGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSLG 1042 >ref|XP_006606297.1| PREDICTED: uncharacterized protein 
LOC100790929 isoform X2 [Glycine max] Length = 1044 Score = 822 bits (2122), Expect = 0.0 Identities = 397/663 (59%), Positives = 494/663 (74%), Gaps = 1/663 (0%) Frame = +1 Query: 4 DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183 D SF+F+FLCGNSTDGY+DALQ +A + L GSIRHYG+N DVN VLLM+DI+LYG+ Sbjct: 394 DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 453 Query: 184 SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363 +Q+ Q FP LL RA++F IP++ PD V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S Sbjct: 454 AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 512 Query: 364 DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543 +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE Sbjct: 513 NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 572 Query: 544 WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720 WN L + +D+ D + I+YA+E + L + ++ E E+ QD Sbjct: 573 WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 621 Query: 721 LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900 L D D + EIE E+ E E+++ +ERMEK WD+IYRNARKSEK+KFE NERDE Sbjct: 622 ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 681 Query: 901 GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080 G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL R +R +SDDVDA RL LN++ Sbjct: 682 GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 741 Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260 +YR++LCEMGGMF+IA +VD IH RPWIGFQ AE LEET+Q+ +G Sbjct: 742 YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 801 Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440 DV+YFW RL+MD R + FW MCDILNGG CR F++ FR+MYALPP +ALPPM Sbjct: 802 DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 861 Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620 P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E KHCY Sbjct: 862 
PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIE-KHCY 920 Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800 CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM Sbjct: 921 CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 980 Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK Sbjct: 981 EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1040 Query: 1981 TLG 1989 +LG Sbjct: 1041 SLG 1043