BLASTX nr result

ID: Rheum21_contig00005963 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00005963
         (2349 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254...   874   0.0  
gb|EOY18900.1| UDP-Glycosyltransferase superfamily protein isofo...   869   0.0  
emb|CAN65363.1| hypothetical protein VITISV_036074 [Vitis vinifera]   865   0.0  
gb|EMJ21765.1| hypothetical protein PRUPE_ppa001222mg [Prunus pe...   860   0.0  
ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505...   852   0.0  
gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]     846   0.0  
gb|EOY18902.1| UDP-Glycosyltransferase superfamily protein isofo...   846   0.0  
ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Popu...   840   0.0  
ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citr...   835   0.0  
ref|XP_006436560.1| hypothetical protein CICLE_v10030581mg [Citr...   835   0.0  
ref|XP_006436559.1| hypothetical protein CICLE_v10030581mg [Citr...   835   0.0  
ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...   828   0.0  
ref|XP_006606299.1| PREDICTED: uncharacterized protein LOC100790...   827   0.0  
ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790...   827   0.0  
ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790...   827   0.0  
ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arab...   824   0.0  
ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabido...   824   0.0  
ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Popu...   823   0.0  
ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779...   823   0.0  
ref|XP_006606297.1| PREDICTED: uncharacterized protein LOC100790...   822   0.0  

>ref|XP_002270269.1| PREDICTED: uncharacterized protein LOC100254795 [Vitis vinifera]
          Length = 1028

 Score =  874 bits (2258), Expect = 0.0
 Identities = 420/678 (61%), Positives = 519/678 (76%), Gaps = 15/678 (2%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            K+ G  FRF+FLCGNSTDGYND L+E+A HL+L PGS+R YGMN+DVNG++LM+D+V+Y 
Sbjct: 364  KNAGAMFRFVFLCGNSTDGYNDHLKEVASHLKLLPGSVRQYGMNSDVNGLILMADVVIYA 423

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQ EQ FP LLTRA+SFGIP+IAPDLP I+ YV DGVH +IF ++NPD L+RAFSLL+S
Sbjct: 424  SSQVEQGFPPLLTRAMSFGIPVIAPDLPDIRKYVVDGVHVVIFPKNNPDALMRAFSLLIS 483

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             +G LS+ AKAV  SG+LLAKN+ AS+C+  +A+L+ENVL+F SD LLP  I+QS+   W
Sbjct: 484  -NGKLSKFAKAVALSGRLLAKNMLASECVNSYAKLLENVLSFPSDVLLPGHISQSQHDAW 542

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEK---DHLDLGITDPMNMSEYNGEL 711
            EWN        ++ DM   +  +  + +S ++  LE+   + LD G     N+S  N E 
Sbjct: 543  EWNSF------RTADMPLIENGSASMRKSSVVDVLEETLSNQLDSG-----NIS--NSET 589

Query: 712  EQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANE 891
            E D+L   DWD + EIES E++ERLEM++++ERMEK PG WDEIYRNARK E+VKFE NE
Sbjct: 590  ENDVLTQLDWDVLREIESIEEMERLEMEELEERMEKNPGIWDEIYRNARKVERVKFETNE 649

Query: 892  RDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFL 1071
            RDEG+LERTGQP+CIYEIY GAGAWPFLHHGS+YRGLSL T  RRL SDDVDA  RL  L
Sbjct: 650  RDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGLSLTTSARRLRSDDVDAVDRLPVL 709

Query: 1072 NESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQ 1251
            N+++YR++ C++GGMFSIA++VD IH RPWIGFQ              AE+ LEETIQ++
Sbjct: 710  NDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWHAVGSKVSLSSRAEKVLEETIQEE 769

Query: 1252 TKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDAL 1431
            TKGDV+YFWA L +D G T+ ++   FWSMCDILNGG CR+ FE+AFR+MYA+P   +AL
Sbjct: 770  TKGDVLYFWAHLNVDDGPTQKNRIPTFWSMCDILNGGNCRTAFEDAFRQMYAMPSYIEAL 829

Query: 1432 PPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAH----------- 1578
            PPMP+DGGYWSALH WVMPTPSFLEFIMFSRMF DS+DALH N  ++ +           
Sbjct: 830  PPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADSLDALHMNSRQSMNLSQSMNSSQPT 889

Query: 1579 -CLLGSSELERKHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAK 1755
             CLLGSS+LE+KHCYCR+LELLVNVWAYHSAR+MVYI+P +G LEE HPV+QR+GFMWAK
Sbjct: 890  VCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVYINPYSGQLEEQHPVEQRRGFMWAK 949

Query: 1756 YFNITLLKSMXXXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKV 1935
            YFN TLLKSM              RE WLWPLTGEVHW G+YE++REE+YR KMDKKRK 
Sbjct: 950  YFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYRSKMDKKRKA 1009

Query: 1936 KEKLIDRFQHGYKQKTLG 1989
            KEKL++R +HGYKQK +G
Sbjct: 1010 KEKLVERMKHGYKQKPIG 1027


>gb|EOY18900.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 1041

 Score =  869 bits (2245), Expect = 0.0
 Identities = 418/663 (63%), Positives = 512/663 (77%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D GGSF+FIFL GNSTDGY+DALQ++A  L L+ GS+RHYG++ DVNGVLLM+DIVLYGT
Sbjct: 392  DAGGSFKFIFLSGNSTDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGT 451

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            SQ+EQ FPSL+ RA++FGIP+I PD P++K YV DG HG+ F +H PD LLRAFSLL+S 
Sbjct: 452  SQEEQGFPSLIIRAMTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLIS- 510

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS  A+ V SSG+LLAKN+ AS+CI G+A L+EN+LNF SD LLP+ ++Q   G+WE
Sbjct: 511  NGRLSRFAQTVASSGRLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWE 570

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN  G  +E  + D+  +           ++YALE++     I+   ++S+Y  E++ QD
Sbjct: 571  WNVFGMEIEHGTGDISRYFS---------VVYALEEEFTKHTISS--DISQYGAEIQDQD 619

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
            +   +DWD + EIE+FED ERLEMD+++ERME+ PG WD+IYRNAR+SEK+KFEANERDE
Sbjct: 620  IPTEQDWDIVTEIENFEDYERLEMDEVEERMERNPGVWDDIYRNARRSEKLKFEANERDE 679

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQPVCIYEIY GAGAWPFLHHGSLYRGLSL  + RRL SDDVDA  RL  LN++
Sbjct: 680  GELERTGQPVCIYEIYSGAGAWPFLHHGSLYRGLSLSRKARRLRSDDVDAVGRLPVLNDT 739

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            HYR+LLCE+GGMFSIA +VD+IH RPWIGFQ              AE  LEETIQ  +K 
Sbjct: 740  HYRDLLCEVGGMFSIANRVDNIHKRPWIGFQSWRAAGRKVSLSTRAEEVLEETIQG-SKR 798

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFWARL++D G    +  L FWSMCD+LN G CR+ FE+AFR+MY LP   +ALPPM
Sbjct: 799  DVMYFWARLDIDGGGAGTNDALTFWSMCDLLNAGHCRTAFESAFRKMYILPSDTEALPPM 858

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P D G+WSALH WVMPT SFLEF+MFSRMFVDS+DALH+N  E   CLLGSSELE+KHCY
Sbjct: 859  PKDDGHWSALHSWVMPTTSFLEFVMFSRMFVDSLDALHTNSGEVNLCLLGSSELEKKHCY 918

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            C++LELLVNVWAYHS RRMVYI+P +GLLEE HPV QRK FMWA+YFN TLLKSM     
Sbjct: 919  CQVLELLVNVWAYHSGRRMVYIEPHSGLLEEQHPVDQRKEFMWARYFNFTLLKSMDEDLA 978

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     R+ WLWPLTGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQ+
Sbjct: 979  EAADDEDHPRKMWLWPLTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKNGYKQR 1038

Query: 1981 TLG 1989
            +LG
Sbjct: 1039 SLG 1041


>emb|CAN65363.1| hypothetical protein VITISV_036074 [Vitis vinifera]
          Length = 1037

 Score =  865 bits (2235), Expect = 0.0
 Identities = 420/687 (61%), Positives = 519/687 (75%), Gaps = 24/687 (3%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQ---------ELAVHLRLSPGSIRHYGMNADVNGVL 153
            K+ G   RF+FLCGNSTDGYND L+         E+A HL+L PGS+R YGMN+DVNG++
Sbjct: 364  KNAGAMXRFVFLCGNSTDGYNDHLKVYGYNDHLKEVASHLKLLPGSVRQYGMNSDVNGLM 423

Query: 154  LMSDIVLYGTSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDL 333
            LM+D+V+Y +SQ EQ FP LLTRA+SFGIP+IAPDLP I+ YV DGVH +IF ++NPD L
Sbjct: 424  LMADVVIYASSQVEQGFPPLLTRAMSFGIPVIAPDLPDIRKYVVDGVHVVIFPKNNPDAL 483

Query: 334  LRAFSLLVSRDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSS 513
            +RAFSLL+S +G LS+ AKAV  SG+LLAKN+ AS+C+  +A+L+ENVL+F SD LLP  
Sbjct: 484  MRAFSLLIS-NGKLSKFAKAVALSGRLLAKNMLASECVNSYAKLLENVLSFPSDVLLPGH 542

Query: 514  ITQSEQGTWEWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEK---DHLDLGITDPM 684
            I+QS+   WEWN        ++ DM   +  +  + +S ++  LE+   + LD G     
Sbjct: 543  ISQSQHDAWEWNSF------RTADMPLIENGSASMRKSSVVDVLEETLSNQLDSG----- 591

Query: 685  NMSEYNGELEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKS 864
            N+S  N E E D+L   DWD + EIES E++ERLEM++++ERMEK PG WDEIYRNARK 
Sbjct: 592  NIS--NSETENDVLTQLDWDVLREIESIEEMERLEMEELEERMEKNPGIWDEIYRNARKV 649

Query: 865  EKVKFEANERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDV 1044
            E+VKFEANERDEG+LERTGQP+CIYEIY GAGAWPFLHHGS+YRGLSL T  RRL SDDV
Sbjct: 650  ERVKFEANERDEGELERTGQPLCIYEIYNGAGAWPFLHHGSMYRGLSLTTSARRLRSDDV 709

Query: 1045 DAFTRLSFLNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAER 1224
            DA  RL  LN+++YR++ C++GGMFSIA++VD IH RPWIGFQ              AE+
Sbjct: 710  DAVDRLPVLNDTYYRDIFCDIGGMFSIAFRVDKIHKRPWIGFQSWHAVGSKVSLSSRAEK 769

Query: 1225 ALEETIQQQTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMY 1404
             LEETIQ++TKGDV+YFWA L +D G T+ ++   FWSMCDILNGG CR+ FE+AFR+MY
Sbjct: 770  VLEETIQEETKGDVLYFWAHLNVDDGPTQKNRIPTFWSMCDILNGGNCRTAFEDAFRQMY 829

Query: 1405 ALPPVKDALPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAH-- 1578
            A+P   +ALPPMP+DGGYWSALH WVMPTPSFLEFIMFSRMF DS+DALH N  ++ +  
Sbjct: 830  AMPSYIEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFADSLDALHMNSRQSMNLS 889

Query: 1579 ----------CLLGSSELERKHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQ 1728
                      CLLGSS+LE+KHCYCR+LELLVNVWAYHSAR+MVYI+P +G LEE HPV+
Sbjct: 890  QSMNSSQPTVCLLGSSKLEKKHCYCRVLELLVNVWAYHSARKMVYINPYSGQLEEQHPVE 949

Query: 1729 QRKGFMWAKYFNITLLKSMXXXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYR 1908
            QR+GFMWAKYFN TLLKSM              RE WLWPLTGEVHW G+YE++REE+YR
Sbjct: 950  QRRGFMWAKYFNSTLLKSMDEDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYR 1009

Query: 1909 LKMDKKRKVKEKLIDRFQHGYKQKTLG 1989
             KMDKKRK KEKL++R +HGYKQK +G
Sbjct: 1010 SKMDKKRKAKEKLVERMKHGYKQKPIG 1036


>gb|EMJ21765.1| hypothetical protein PRUPE_ppa001222mg [Prunus persica]
          Length = 877

 Score =  860 bits (2223), Expect = 0.0
 Identities = 409/666 (61%), Positives = 511/666 (76%), Gaps = 2/666 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            +D GGSF+F+FLCGNS+DGY+DA QE+A  L L  GS+RH+G+N DVN +LLM+DIVLYG
Sbjct: 217  EDAGGSFKFVFLCGNSSDGYDDAFQEVASPLGLPRGSVRHFGLNGDVNSMLLMADIVLYG 276

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            + QD Q FP LL RA++FGIP+IAPD PV+K YV DGVH   F  HNPD L+++FSL++S
Sbjct: 277  SFQDVQGFPPLLIRAMTFGIPVIAPDFPVLKKYVTDGVHINTFPNHNPDALMKSFSLMIS 336

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             +G LS+ A+ V SSG+LLA NL AS+CI G+AR++EN LNF SD LLP  I++ ++GTW
Sbjct: 337  -NGKLSKFARTVASSGRLLAMNLLASECITGYARVLENALNFPSDALLPGPISELQRGTW 395

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-- 714
            EWN  G  ++  + DM   DE+++ ++ + ++YALE++    G+    N+S+ NG  E  
Sbjct: 396  EWNLFGNEIDYTTGDMQGIDEQSS-LESTSVVYALEEEFS--GLAYSTNISD-NGTWESA 451

Query: 715  QDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANER 894
            QD+    DWD + EIE+ E+ ER+EM+++ ERME++PG WD+IYRNARK EK +FEANER
Sbjct: 452  QDIPTQLDWDLLTEIENSEEYERVEMEELSERMERDPGLWDDIYRNARKVEKFRFEANER 511

Query: 895  DEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLN 1074
            DEG+LERTGQ VCIYEIY G+G WPFLHHGSLYRGLSL  R RR +SDDVDA  RL  LN
Sbjct: 512  DEGELERTGQSVCIYEIYSGSGTWPFLHHGSLYRGLSLSIRARRSTSDDVDAVDRLPILN 571

Query: 1075 ESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQT 1254
            E+HYRN+LCE+GGMF+IA KVD +H RPWIGFQ              AE+ LEE IQ   
Sbjct: 572  ETHYRNILCEIGGMFAIANKVDSVHKRPWIGFQSWRAAGRKVSLSKKAEKVLEEAIQDNR 631

Query: 1255 KGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALP 1434
            +GDV+YFW RL M+ G+T     L FWS CDILNGG CR+ FE+AFR MYALP   +ALP
Sbjct: 632  EGDVIYFWGRLNMNGGMTGSKDALTFWSACDILNGGHCRNVFEHAFRWMYALPNNTEALP 691

Query: 1435 PMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKH 1614
            PMP+DGG+WSALH WVMPT SFLEF+MFSRMFV+S+DALH+N +  + CLLGSSELE+KH
Sbjct: 692  PMPEDGGHWSALHSWVMPTHSFLEFVMFSRMFVNSLDALHTNNSGQSMCLLGSSELEQKH 751

Query: 1615 CYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXX 1794
            CYCR+LE+LVNVWAYHSAR++VYIDP +G +EE H + QR+ FMWAKYFN TLLKSM   
Sbjct: 752  CYCRVLEVLVNVWAYHSARKLVYIDPISGSMEEQHRIDQRQAFMWAKYFNATLLKSMDED 811

Query: 1795 XXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYK 1974
                       RENWLWPLTGEVHW G+YE++RE +YRLKMDKKRK KEKL++R ++GYK
Sbjct: 812  LAEAADDGDHPRENWLWPLTGEVHWQGIYEREREVRYRLKMDKKRKTKEKLLERMKYGYK 871

Query: 1975 QKTLGG 1992
            QKTLGG
Sbjct: 872  QKTLGG 877


>ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum]
          Length = 1042

 Score =  852 bits (2201), Expect = 0.0
 Identities = 407/663 (61%), Positives = 510/663 (76%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D   SF+F+FLCGNSTDGY+DALQE+A  L L  GSIRHYG++ DVN VLLM+DIVLYG+
Sbjct: 388  DAAESFKFVFLCGNSTDGYDDALQEVASRLGLPHGSIRHYGLDGDVNSVLLMADIVLYGS 447

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            +QD Q FP LL RA++F IP+IAPD PV++ Y+ DGVHG+ +S+HNP+ LL AFSLL+S 
Sbjct: 448  AQDVQGFPPLLIRAMTFEIPVIAPDFPVLRKYIVDGVHGVFYSKHNPEALLNAFSLLLS- 506

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
             G LS+ A+A+GSSG+  AKN+ A +CI G+ARL+ENVL F SD+LLP  ++Q +QG W 
Sbjct: 507  SGRLSKFAQAIGSSGRQFAKNVLALECITGYARLLENVLTFPSDSLLPGPVSQIQQGAWG 566

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            W+     L +  +DM   DE  +K  R  +++A+E++    G+    N+ E   E+  QD
Sbjct: 567  WS-----LMQIDIDMKKIDEDFSK-GRVTVVHAVEQELA--GLNYSTNIFENGTEVPMQD 618

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
             L   DWD + EIE  ++ E LEM++++ERMEK+ G WDEIYRNARKSEK+KFEANERDE
Sbjct: 619  ELTKLDWDILREIEIADESEMLEMEEVEERMEKDVGVWDEIYRNARKSEKLKFEANERDE 678

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQPVCIYEIY G G WPFLHHGSLYRGLSL  +++R SSDDVDA  RL  LN++
Sbjct: 679  GELERTGQPVCIYEIYSGTGVWPFLHHGSLYRGLSLSRKSQRQSSDDVDAVGRLPLLNDT 738

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            +YR++LCE+GGMF+IA +VD IH RPW+GFQ              AERALEET+ +  +G
Sbjct: 739  YYRDILCEIGGMFAIANRVDGIHRRPWVGFQSWRAAGRKVALSMEAERALEETMNESFRG 798

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFW RL++D  +   +  L FWSMCDILNGG CR+ F+++FR+MYALPP  +ALPPM
Sbjct: 799  DVIYFWGRLDLDGSVIGSNNALTFWSMCDILNGGNCRNVFQDSFRQMYALPPHAEALPPM 858

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGGYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + ++ + CLLGSSE+E KHCY
Sbjct: 859  PEDGGYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSSKHSVCLLGSSEIEEKHCY 918

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELL+NVWAYHSAR+MVYI+P TG +EE H V QRKGFMWA+YFN TLLKSM     
Sbjct: 919  CRVLELLINVWAYHSARKMVYINPDTGSMEEQHVVDQRKGFMWAQYFNFTLLKSMDEDLA 978

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RENWLWP+TGEVHW G+YE++REE+YR+KMDKKRK KEKL +R ++GYKQK
Sbjct: 979  EAADDGDHPRENWLWPMTGEVHWQGIYEREREERYRIKMDKKRKTKEKLYERMKYGYKQK 1038

Query: 1981 TLG 1989
            +LG
Sbjct: 1039 SLG 1041


>gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]
          Length = 1043

 Score =  846 bits (2186), Expect = 0.0
 Identities = 408/668 (61%), Positives = 501/668 (75%), Gaps = 4/668 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            KD GGSF+F+FLCGNSTDGYND L+E+A  L L   S+RHYG+N+DV  +LLM+DI LY 
Sbjct: 384  KDSGGSFKFVFLCGNSTDGYNDVLKEVASRLGLQDDSLRHYGLNSDVKSLLLMADIFLYD 443

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQ  Q FP LL +A++F IP+IAPD PV++ Y+ DGVHG+ F +HNPD LL+AFS L+S
Sbjct: 444  SSQGVQGFPPLLIQAMTFEIPVIAPDFPVLQKYIVDGVHGIFFPKHNPDALLKAFSFLIS 503

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
              G LS  A+ V SSG+ LAKN+ A++CI G+ARL+E+VL F SD  LP  I+Q   G W
Sbjct: 504  -SGKLSRSAQTVASSGRRLAKNIMATECIMGYARLLESVLYFPSDAFLPGPISQLHLGAW 562

Query: 541  EWNFVGEILEEKSVDMMSFDERAT----KVDRSRIIYALEKDHLDLGITDPMNMSEYNGE 708
            EWN     L +K +D++  DE +     K     ++YALE++ L           +  G 
Sbjct: 563  EWN-----LFQKEIDLIG-DEMSHIAEGKSAAKSVVYALEEE-LTYSANSQNFSEDGTGN 615

Query: 709  LEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEAN 888
            LEQD+   +DWD + EIES E+ ERLEMD++DERMEK  G WD+IYRNARKSEK+KFE N
Sbjct: 616  LEQDIPKQQDWDVLGEIESSEEYERLEMDELDERMEKVSGVWDDIYRNARKSEKLKFEPN 675

Query: 889  ERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSF 1068
            ERDEG+LERTGQPVCIYEIY GA AWPFLHHGSLYRGLSL    R+L SDDV+A  RL  
Sbjct: 676  ERDEGELERTGQPVCIYEIYSGAAAWPFLHHGSLYRGLSLSAGARKLRSDDVNAVGRLPI 735

Query: 1069 LNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQ 1248
            LN+++YR++LCE+GGMF+IA KVD+IH RPWIGFQ              AE+ LEETIQ+
Sbjct: 736  LNQTYYRDILCEIGGMFAIAKKVDNIHGRPWIGFQSWHAAGRKVSLSPKAEKVLEETIQE 795

Query: 1249 QTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDA 1428
             TKGDV+YFWARL MD G+T     L FWSMCDILNGG CR+ FE+AFRR+Y LP   +A
Sbjct: 796  NTKGDVIYFWARLNMDGGVTGSKNALTFWSMCDILNGGYCRTAFEDAFRRIYGLPSHIEA 855

Query: 1429 LPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELER 1608
            LPPMP+DGG+WSALH WVMPTPSFLEF+MF+RMF DS+DALH+N ++   CLLGSS++E+
Sbjct: 856  LPPMPEDGGHWSALHSWVMPTPSFLEFVMFARMFADSLDALHANVSKENTCLLGSSDIEK 915

Query: 1609 KHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMX 1788
            KHCYCR+LE+LVNVWAYHSAR+MVYIDP  G LEE HPV+QRK FMWAKYFN TLLK + 
Sbjct: 916  KHCYCRMLEVLVNVWAYHSARKMVYIDPHAGSLEEQHPVEQRKEFMWAKYFNQTLLKRID 975

Query: 1789 XXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHG 1968
                          E WLWPLTGEVHW G+YE++RE++YRLKMDKKRK +EKL +R ++G
Sbjct: 976  ENLAEAADDGDHPSEMWLWPLTGEVHWQGIYEREREQRYRLKMDKKRKTREKLFERMKYG 1035

Query: 1969 YKQKTLGG 1992
            YKQK+LGG
Sbjct: 1036 YKQKSLGG 1043


>gb|EOY18902.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
            cacao]
          Length = 1034

 Score =  846 bits (2186), Expect = 0.0
 Identities = 413/663 (62%), Positives = 505/663 (76%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D GGSF+FIFL GNSTDGY+DALQ++A  L L+ GS+RHYG++ DVNGVLLM+DIVLYGT
Sbjct: 392  DAGGSFKFIFLSGNSTDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGT 451

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            SQ+EQ FPSL+ RA++FGIP+I PD P++K YV DG HG+ F +H PD LLRAFSLL+S 
Sbjct: 452  SQEEQGFPSLIIRAMTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLIS- 510

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS  A+ V SSG+LLAKN+ AS+CI G+A L+EN+LNF SD LLP+ ++Q   G+WE
Sbjct: 511  NGRLSRFAQTVASSGRLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWE 570

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN  G  +E  + D+  +           ++YALE++     I+   ++S+Y  E++ QD
Sbjct: 571  WNVFGMEIEHGTGDISRYFS---------VVYALEEEFTKHTISS--DISQYGAEIQDQD 619

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
            +   +DWD + EIE+FED ERLEMD+++ERME+ PG WD+IYRNAR+SEK+KFEANERDE
Sbjct: 620  IPTEQDWDIVTEIENFEDYERLEMDEVEERMERNPGVWDDIYRNARRSEKLKFEANERDE 679

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQPVCIYEIY GAGAWPFLHHGSLYRGLSL  + RRL SDDVDA  RL  LN++
Sbjct: 680  GELERTGQPVCIYEIYSGAGAWPFLHHGSLYRGLSLSRKARRLRSDDVDAVGRLPVLNDT 739

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            HYR+LLCE+GGMFSIA +VD+IH RPWIGFQ              AE  LEETI Q +K 
Sbjct: 740  HYRDLLCEVGGMFSIANRVDNIHKRPWIGFQSWRAAGRKVSLSTRAEEVLEETI-QGSKR 798

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFWARL++D G    +  L FWSMCD+LN G CR+ FE+AFR+MY LP   +ALPPM
Sbjct: 799  DVMYFWARLDIDGGGAGTNDALTFWSMCDLLNAGHCRTAFESAFRKMYILPSDTEALPPM 858

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P D G+WSALH WVMPT SFLEF+MFSRMFVDS+DALH+N  E   CLLGSSELE     
Sbjct: 859  PKDDGHWSALHSWVMPTTSFLEFVMFSRMFVDSLDALHTNSGEVNLCLLGSSELE----- 913

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
              +LELLVNVWAYHS RRMVYI+P +GLLEE HPV QRK FMWA+YFN TLLKSM     
Sbjct: 914  --VLELLVNVWAYHSGRRMVYIEPHSGLLEEQHPVDQRKEFMWARYFNFTLLKSMDEDLA 971

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     R+ WLWPLTGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQ+
Sbjct: 972  EAADDEDHPRKMWLWPLTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKNGYKQR 1031

Query: 1981 TLG 1989
            +LG
Sbjct: 1032 SLG 1034


>ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Populus trichocarpa]
            gi|550330474|gb|ERP56591.1| hypothetical protein
            POPTR_0010s23830g [Populus trichocarpa]
          Length = 1053

 Score =  840 bits (2169), Expect = 0.0
 Identities = 404/665 (60%), Positives = 504/665 (75%), Gaps = 1/665 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            KD  GSF+F+FLCGNSTD  +DA QE+   + L P S+RHYG+N D N VLL +DIVLYG
Sbjct: 395  KDAEGSFKFVFLCGNSTD--DDAFQEIVSRVGLHPSSVRHYGLNGDANSVLLAADIVLYG 452

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQDEQ FP +L RA++FGIP+IAPD+P +K YV D  HG+ FS++NP+ L RAFSLL+S
Sbjct: 453  SSQDEQGFPPVLIRAMTFGIPVIAPDIPTMKKYVSDEAHGIFFSKYNPEALTRAFSLLIS 512

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             +G LS+ A+ V  SG+LLAKN+ AS+CI G+ARL+EN+L+F SDTLLP  +++ EQ  W
Sbjct: 513  -NGKLSKFAETVAFSGRLLAKNMLASECITGYARLLENMLSFPSDTLLPGPVSKLEQREW 571

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGE-LEQ 717
            EWN   + LE+++ D+    E       + I+Y+LEK+  +L   +   +SE   E L  
Sbjct: 572  EWNLFNKELEQETDDLSGMYESLFSSRETSIVYSLEKEWSNL--VNSTIISENGTEILVP 629

Query: 718  DLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897
            D     DWD + EIESFE+ ER+  ++++ERM+K  G WD+IYR+ARKSEK+KFE+NERD
Sbjct: 630  DTPTESDWDVLMEIESFEEHERVVKEELEERMDKTRGLWDDIYRSARKSEKLKFESNERD 689

Query: 898  EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077
            EG+LERTGQPVCIYEIY+GAGAWP LHHGSLYRGLSL T+ RR  SDDVDA  RL  LNE
Sbjct: 690  EGELERTGQPVCIYEIYDGAGAWPLLHHGSLYRGLSLSTKARRSRSDDVDAVARLPLLNE 749

Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257
            S+Y+N+LCE+GGMFSIA +VD IH RPWIGFQ              AE+ LEE  Q++ K
Sbjct: 750  SYYQNILCEIGGMFSIAIRVDAIHKRPWIGFQSWHAAGRKVSLSFKAEKVLEEKTQEENK 809

Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 1437
             DV+YFWARL MD G+T  ++ L FWSMCD+LNGG+CR+ FE+AFR+MY LP   +ALPP
Sbjct: 810  -DVMYFWARLGMDGGVTGSNEELTFWSMCDVLNGGRCRTAFEDAFRQMYDLPSYLEALPP 868

Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617
            MP+DGG+WSALH WVMPTPSFLEFIMFSRMFVDS+DAL SN ++   CLL S+ELE KHC
Sbjct: 869  MPEDGGHWSALHSWVMPTPSFLEFIMFSRMFVDSLDALQSNSSQVNKCLLSSTELEEKHC 928

Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797
            YCRI+E+LVNVWAYHSARRMVYIDP TG +EE HP++QRK   W KYFN+T+LKSM    
Sbjct: 929  YCRIMEVLVNVWAYHSARRMVYIDPHTGSVEEQHPIKQRKEIAWKKYFNLTVLKSMDEDL 988

Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977
                      RE WLWPLTGEVHW G+YE++REE+YR+KMDKKRK +EKL++R + GYKQ
Sbjct: 989  AEAADDGDHPRERWLWPLTGEVHWQGIYEREREERYRIKMDKKRKTREKLVERLKAGYKQ 1048

Query: 1978 KTLGG 1992
            K LGG
Sbjct: 1049 KPLGG 1053


>ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citrus clementina]
            gi|568863734|ref|XP_006485286.1| PREDICTED:
            uncharacterized protein LOC102618162 isoform X1 [Citrus
            sinensis] gi|557538757|gb|ESR49801.1| hypothetical
            protein CICLE_v10030581mg [Citrus clementina]
          Length = 1055

 Score =  835 bits (2157), Expect = 0.0
 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%)
 Frame = +1

Query: 7    VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186
            V GSF+F+FLCGNSTDGYNDALQE+A  L L   S+RHYG N DVNGVLLM+DIVLYG+S
Sbjct: 399  VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 458

Query: 187  QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366
            Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G   + F + NP+ L RAFSL +S +
Sbjct: 459  QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 517

Query: 367  GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546
            G LS+ A+ V S+G+L AKN+ A DC+  +AR++ENVLNF SD LLP  I+Q +Q +WEW
Sbjct: 518  GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 577

Query: 547  NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723
            N   + ++  + D+++ DE  T    R+  +  L ++     IT+  N S      +QD 
Sbjct: 578  NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 632

Query: 724  LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903
            ++  DWD + +IES E+ ERLEM+Q++ERM+    +WD+IYRNARKSE+ KFEANERDEG
Sbjct: 633  ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 692

Query: 904  DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083
            +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L +  RRL SDDVDA +RL  LN +H
Sbjct: 693  ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 752

Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263
            YR++LCE+GGMFSIA KVD+IH RPWIGFQ              AE+ LEET+Q+ T+GD
Sbjct: 753  YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 811

Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            V+YFWA L+MD G TR +   L FWSMCDILNGG CR+ F +AFR+MY LP   +ALPPM
Sbjct: 812  VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 871

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++   CLL SSELE+KHCY
Sbjct: 872  PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 931

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM     
Sbjct: 932  CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 991

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR   GY+QK
Sbjct: 992  EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 1051

Query: 1981 TLGG 1992
            TLGG
Sbjct: 1052 TLGG 1055


>ref|XP_006436560.1| hypothetical protein CICLE_v10030581mg [Citrus clementina]
            gi|568863738|ref|XP_006485288.1| PREDICTED:
            uncharacterized protein LOC102618162 isoform X3 [Citrus
            sinensis] gi|568863740|ref|XP_006485289.1| PREDICTED:
            uncharacterized protein LOC102618162 isoform X4 [Citrus
            sinensis] gi|557538756|gb|ESR49800.1| hypothetical
            protein CICLE_v10030581mg [Citrus clementina]
          Length = 875

 Score =  835 bits (2157), Expect = 0.0
 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%)
 Frame = +1

Query: 7    VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186
            V GSF+F+FLCGNSTDGYNDALQE+A  L L   S+RHYG N DVNGVLLM+DIVLYG+S
Sbjct: 219  VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 278

Query: 187  QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366
            Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G   + F + NP+ L RAFSL +S +
Sbjct: 279  QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 337

Query: 367  GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546
            G LS+ A+ V S+G+L AKN+ A DC+  +AR++ENVLNF SD LLP  I+Q +Q +WEW
Sbjct: 338  GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 397

Query: 547  NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723
            N   + ++  + D+++ DE  T    R+  +  L ++     IT+  N S      +QD 
Sbjct: 398  NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 452

Query: 724  LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903
            ++  DWD + +IES E+ ERLEM+Q++ERM+    +WD+IYRNARKSE+ KFEANERDEG
Sbjct: 453  ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 512

Query: 904  DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083
            +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L +  RRL SDDVDA +RL  LN +H
Sbjct: 513  ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 572

Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263
            YR++LCE+GGMFSIA KVD+IH RPWIGFQ              AE+ LEET+Q+ T+GD
Sbjct: 573  YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 631

Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            V+YFWA L+MD G TR +   L FWSMCDILNGG CR+ F +AFR+MY LP   +ALPPM
Sbjct: 632  VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 691

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++   CLL SSELE+KHCY
Sbjct: 692  PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 751

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM     
Sbjct: 752  CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 811

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR   GY+QK
Sbjct: 812  EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 871

Query: 1981 TLGG 1992
            TLGG
Sbjct: 872  TLGG 875


>ref|XP_006436559.1| hypothetical protein CICLE_v10030581mg [Citrus clementina]
            gi|557538755|gb|ESR49799.1| hypothetical protein
            CICLE_v10030581mg [Citrus clementina]
          Length = 797

 Score =  835 bits (2157), Expect = 0.0
 Identities = 403/664 (60%), Positives = 501/664 (75%), Gaps = 2/664 (0%)
 Frame = +1

Query: 7    VGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTS 186
            V GSF+F+FLCGNSTDGYNDALQE+A  L L   S+RHYG N DVNGVLLM+DIVLYG+S
Sbjct: 141  VEGSFKFVFLCGNSTDGYNDALQEVASRLGLLEHSVRHYGFNGDVNGVLLMADIVLYGSS 200

Query: 187  QDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRD 366
            Q EQ FPSL+ RA++FGIP+I PD P+IK YV +G   + F + NP+ L RAFSL +S +
Sbjct: 201  QVEQGFPSLIVRAMTFGIPVITPDFPIIKEYVAEGAQVIFFQKDNPEGLSRAFSLFIS-N 259

Query: 367  GILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEW 546
            G LS+ A+ V S+G+L AKN+ A DC+  +AR++ENVLNF SD LLP  I+Q +Q +WEW
Sbjct: 260  GKLSKFARTVASAGRLHAKNMLALDCVTRYARILENVLNFPSDALLPGPISQLQQVSWEW 319

Query: 547  NFVGEILEEKSVDMMSFDERATKVD-RSRIIYALEKDHLDLGITDPMNMSEYNGELEQDL 723
            N   + ++  + D+++ DE  T    R+  +  L ++     IT+  N S      +QD 
Sbjct: 320  NLFRKEIDLGTGDILNMDEWGTSTSSRNSSVVDLLEEEFTKNITENENRSA-----DQDT 374

Query: 724  LNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEG 903
            ++  DWD + +IES E+ ERLEM+Q++ERM+    +WD+IYRNARKSE+ KFEANERDEG
Sbjct: 375  ISELDWDVLHDIESSEEYERLEMEQLEERMDGTFASWDDIYRNARKSERFKFEANERDEG 434

Query: 904  DLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESH 1083
            +LERTGQPVCIYEIY G+GAWPFLHHGSLYRGL+L +  RRL SDDVDA +RL  LN +H
Sbjct: 435  ELERTGQPVCIYEIYSGSGAWPFLHHGSLYRGLALSSAARRLRSDDVDAVSRLHLLNYTH 494

Query: 1084 YRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGD 1263
            YR++LCE+GGMFSIA KVD+IH RPWIGFQ              AE+ LEET+Q+ T+GD
Sbjct: 495  YRDILCEIGGMFSIANKVDNIHKRPWIGFQSWRAAGRKVSLSISAEKVLEETVQE-TEGD 553

Query: 1264 VVYFWARLEMDSGLTRGSKP-LPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            V+YFWA L+MD G TR +   L FWSMCDILNGG CR+ F +AFR+MY LP   +ALPPM
Sbjct: 554  VMYFWAHLDMDGGFTRNNNDVLTFWSMCDILNGGHCRTAFVDAFRQMYGLPSHVEALPPM 613

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGG WSALHGWVM TPSFLEFIMFSRMFVDS+DAL++N ++   CLL SSELE+KHCY
Sbjct: 614  PEDGGCWSALHGWVMQTPSFLEFIMFSRMFVDSLDALNANSSKVNSCLLSSSELEKKHCY 673

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELLVNVWAYHS R+MVY+DP +G L+E HP+++R+GFMW KYFN TLLKSM     
Sbjct: 674  CRVLELLVNVWAYHSGRKMVYLDPLSGSLQEQHPIERRRGFMWMKYFNFTLLKSMDEDLA 733

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP TGEVHW G+YE++REE+YR KMDKKRK+KEK+ DR   GY+QK
Sbjct: 734  EAADDGDYPREKWLWPWTGEVHWKGIYEREREERYRQKMDKKRKMKEKMFDRLTKGYRQK 793

Query: 1981 TLGG 1992
            TLGG
Sbjct: 794  TLGG 797


>ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 isoform X1 [Glycine
            max]
          Length = 1044

 Score =  828 bits (2139), Expect = 0.0
 Identities = 399/659 (60%), Positives = 495/659 (75%), Gaps = 1/659 (0%)
 Frame = +1

Query: 16   SFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTSQDE 195
            SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG++Q+ 
Sbjct: 395  SFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGSAQEV 454

Query: 196  QSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRDGIL 375
            Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S +G L
Sbjct: 455  QGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS-NGRL 513

Query: 376  SELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEWNFV 555
            S+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP  ++Q +QG+WEWN  
Sbjct: 514  SKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGPVSQIQQGSWEWNLF 573

Query: 556  GEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QDLLNA 732
               ++   +D   F  R        I+YA+E +   L  +   ++ E   E+  +D L  
Sbjct: 574  RNEIDLSKIDG-DFSNRKVS-----IVYAVEHELASLNYST--SIFENGTEVPLRDELTQ 625

Query: 733  EDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEGDLE 912
             DWD + EIE  E+ E  E+++ +ER EK  G WD+IYRNARKSEK+KFE NERDEG+LE
Sbjct: 626  LDWDILREIEISEENEMFEVEEAEERREKGVGVWDDIYRNARKSEKLKFEVNERDEGELE 685

Query: 913  RTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESHYRN 1092
            RTGQPVCIYEIY GAG WPFLHHGSLYRGLSL  R +R SSDDVDA  RL  LN+++YR+
Sbjct: 686  RTGQPVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQSSDDVDAVGRLPLLNDTYYRD 745

Query: 1093 LLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGDVVY 1272
            +LCEMGGMF+IA +VD+IH RPWIGFQ              AE+ LEET+Q+  +GDV+Y
Sbjct: 746  ILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLEETMQENFRGDVIY 805

Query: 1273 FWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPMPDDG 1452
            FW R +MD  +        FW MCDILNGG CR  F+  FR+MYALPP  +ALPPMP+D 
Sbjct: 806  FWGRFDMDQSVIGNHNANSFWYMCDILNGGNCRIVFQEGFRQMYALPPHAEALPPMPED- 864

Query: 1453 GYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCYCRIL 1632
            GYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + T+ + CLLGSSE+E+KHCYCR+L
Sbjct: 865  GYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLGSSEIEKKHCYCRVL 924

Query: 1633 ELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXXXXXX 1812
            ELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMWAKYFNI+LLKSM         
Sbjct: 925  ELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNISLLKSMDEDLAEAAD 984

Query: 1813 XXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQKTLG 1989
                 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK+LG
Sbjct: 985  DGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSLG 1043


>ref|XP_006606299.1| PREDICTED: uncharacterized protein LOC100790929 isoform X4 [Glycine
            max]
          Length = 869

 Score =  827 bits (2136), Expect = 0.0
 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D   SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG+
Sbjct: 218  DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 277

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            +Q+ Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S 
Sbjct: 278  AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 336

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE
Sbjct: 337  NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 396

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN     L +  +D+   D       +  I+YA+E +   L  +   ++ E   E+  QD
Sbjct: 397  WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 445

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
             L   D D + EIE  E+ E  E+++ +ERMEK    WD+IYRNARKSEK+KFE NERDE
Sbjct: 446  ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 505

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL  R +R +SDDVDA  RL  LN++
Sbjct: 506  GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 565

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            +YR++LCEMGGMF+IA +VD IH RPWIGFQ              AE  LEET+Q+  +G
Sbjct: 566  YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 625

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFW RL+MD    R    + FW MCDILNGG CR  F++ FR+MYALPP  +ALPPM
Sbjct: 626  DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 685

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY
Sbjct: 686  PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 745

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM     
Sbjct: 746  CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 805

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK
Sbjct: 806  EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 865

Query: 1981 TLG 1989
            +LG
Sbjct: 866  SLG 868


>ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790929 isoform X3 [Glycine
            max]
          Length = 1015

 Score =  827 bits (2136), Expect = 0.0
 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D   SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG+
Sbjct: 364  DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 423

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            +Q+ Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S 
Sbjct: 424  AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 482

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE
Sbjct: 483  NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 542

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN     L +  +D+   D       +  I+YA+E +   L  +   ++ E   E+  QD
Sbjct: 543  WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 591

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
             L   D D + EIE  E+ E  E+++ +ERMEK    WD+IYRNARKSEK+KFE NERDE
Sbjct: 592  ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 651

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL  R +R +SDDVDA  RL  LN++
Sbjct: 652  GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 711

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            +YR++LCEMGGMF+IA +VD IH RPWIGFQ              AE  LEET+Q+  +G
Sbjct: 712  YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 771

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFW RL+MD    R    + FW MCDILNGG CR  F++ FR+MYALPP  +ALPPM
Sbjct: 772  DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 831

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY
Sbjct: 832  PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 891

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM     
Sbjct: 892  CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 951

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK
Sbjct: 952  EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1011

Query: 1981 TLG 1989
            +LG
Sbjct: 1012 SLG 1014


>ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790929 isoform X1 [Glycine
            max]
          Length = 1045

 Score =  827 bits (2136), Expect = 0.0
 Identities = 397/663 (59%), Positives = 495/663 (74%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D   SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG+
Sbjct: 394  DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 453

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            +Q+ Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S 
Sbjct: 454  AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 512

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE
Sbjct: 513  NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 572

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN     L +  +D+   D       +  I+YA+E +   L  +   ++ E   E+  QD
Sbjct: 573  WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 621

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
             L   D D + EIE  E+ E  E+++ +ERMEK    WD+IYRNARKSEK+KFE NERDE
Sbjct: 622  ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 681

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL  R +R +SDDVDA  RL  LN++
Sbjct: 682  GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 741

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            +YR++LCEMGGMF+IA +VD IH RPWIGFQ              AE  LEET+Q+  +G
Sbjct: 742  YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 801

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFW RL+MD    R    + FW MCDILNGG CR  F++ FR+MYALPP  +ALPPM
Sbjct: 802  DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 861

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E+KHCY
Sbjct: 862  PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIEKKHCY 921

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM     
Sbjct: 922  CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 981

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK
Sbjct: 982  EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1041

Query: 1981 TLG 1989
            +LG
Sbjct: 1042 SLG 1044


>ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|332003368|gb|AED90751.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1035

 Score =  824 bits (2129), Expect = 0.0
 Identities = 392/665 (58%), Positives = 502/665 (75%), Gaps = 1/665 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            KD  GSF+F+FL GNST G +DA+QE+A  L L+ G++RH+G+N DVN VL M+DI++Y 
Sbjct: 376  KDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYA 435

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQ+EQ+FP L+ RA+SFGIPII PD P++K Y+ D VHG+ F R++PD LL+AFS L+S
Sbjct: 436  SSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLIS 495

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             DG LS+ A+ + SSG+LL KNL A++CI G+ARL+EN+L+F SDT LP SI+Q +   W
Sbjct: 496  -DGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLPGSISQLQVAAW 554

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELEQD 720
            EWNF    LE+    ++  D     + +S I++ +E+  +  G+ +  N  + N     D
Sbjct: 555  EWNFFRSELEQPKSFIL--DSAYAFIGKSGIVFQVEEKFM--GVIESTNPVDNNTLFVSD 610

Query: 721  LLNAE-DWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897
             L ++ DWD + EIE  E+ E++E +++++RME++  +W+EIYRNARKSEK+KFE NERD
Sbjct: 611  ELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERD 670

Query: 898  EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077
            EG+LERTG+P+CIYEIY GAGAWPFLHHGSLYRGLSL ++ RRLSSDDVDA  RL  LN+
Sbjct: 671  EGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLND 730

Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257
            ++YR++LCE+GGMFS+A KVD IHMRPWIGFQ              AE +LE  I+Q+TK
Sbjct: 731  TYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETK 790

Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 1437
            G+++YFW RL++D         L FWSMCDILN G CR+TFE+AFR MY LP   +ALPP
Sbjct: 791  GEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPP 850

Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617
            MP+DG +WS+LH WVMPTPSFLEF+MFSRMF +S+DALH+N  ++  C L SS LERKHC
Sbjct: 851  MPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHC 910

Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797
            YCR+LELLVNVWAYHS R+MVYI+P+ G LEE HP+QQRKG MWAKYFN TLLKSM    
Sbjct: 911  YCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDL 970

Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977
                      RE WLWPLTGEVHW GVYE++REE+YRLKMDKKRK KEKL DR ++GYKQ
Sbjct: 971  AEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQ 1030

Query: 1978 KTLGG 1992
            K+LGG
Sbjct: 1031 KSLGG 1035


>ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80
            [Arabidopsis thaliana] gi|24111433|gb|AAN46867.1|
            At5g04480/T32M21_80 [Arabidopsis thaliana]
            gi|332003367|gb|AED90750.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1050

 Score =  824 bits (2129), Expect = 0.0
 Identities = 392/665 (58%), Positives = 502/665 (75%), Gaps = 1/665 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            KD  GSF+F+FL GNST G +DA+QE+A  L L+ G++RH+G+N DVN VL M+DI++Y 
Sbjct: 391  KDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVNRVLRMADILVYA 450

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQ+EQ+FP L+ RA+SFGIPII PD P++K Y+ D VHG+ F R++PD LL+AFS L+S
Sbjct: 451  SSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDPDALLKAFSPLIS 510

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             DG LS+ A+ + SSG+LL KNL A++CI G+ARL+EN+L+F SDT LP SI+Q +   W
Sbjct: 511  -DGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLPGSISQLQVAAW 569

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELEQD 720
            EWNF    LE+    ++  D     + +S I++ +E+  +  G+ +  N  + N     D
Sbjct: 570  EWNFFRSELEQPKSFIL--DSAYAFIGKSGIVFQVEEKFM--GVIESTNPVDNNTLFVSD 625

Query: 721  LLNAE-DWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERD 897
             L ++ DWD + EIE  E+ E++E +++++RME++  +W+EIYRNARKSEK+KFE NERD
Sbjct: 626  ELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKLKFEVNERD 685

Query: 898  EGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNE 1077
            EG+LERTG+P+CIYEIY GAGAWPFLHHGSLYRGLSL ++ RRLSSDDVDA  RL  LN+
Sbjct: 686  EGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAADRLPLLND 745

Query: 1078 SHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTK 1257
            ++YR++LCE+GGMFS+A KVD IHMRPWIGFQ              AE +LE  I+Q+TK
Sbjct: 746  TYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLENIIKQETK 805

Query: 1258 GDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPP 1437
            G+++YFW RL++D         L FWSMCDILN G CR+TFE+AFR MY LP   +ALPP
Sbjct: 806  GEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLPEHIEALPP 865

Query: 1438 MPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHC 1617
            MP+DG +WS+LH WVMPTPSFLEF+MFSRMF +S+DALH+N  ++  C L SS LERKHC
Sbjct: 866  MPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKSCSLASSLLERKHC 925

Query: 1618 YCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXX 1797
            YCR+LELLVNVWAYHS R+MVYI+P+ G LEE HP+QQRKG MWAKYFN TLLKSM    
Sbjct: 926  YCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFTLLKSMDEDL 985

Query: 1798 XXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQ 1977
                      RE WLWPLTGEVHW GVYE++REE+YRLKMDKKRK KEKL DR ++GYKQ
Sbjct: 986  AEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLYDRIKNGYKQ 1045

Query: 1978 KTLGG 1992
            K+LGG
Sbjct: 1046 KSLGG 1050


>ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Populus trichocarpa]
            gi|550332296|gb|ERP57299.1| hypothetical protein
            POPTR_0008s02940g [Populus trichocarpa]
          Length = 1061

 Score =  823 bits (2127), Expect = 0.0
 Identities = 397/666 (59%), Positives = 501/666 (75%), Gaps = 4/666 (0%)
 Frame = +1

Query: 1    KDVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYG 180
            KD  GSF+ IFL GNSTD  ++ALQE+   L L  GS+ HYG++ DVN VLLM+D+VLYG
Sbjct: 397  KDAEGSFKLIFLGGNSTD--DNALQEVVSGLGLHHGSVWHYGLHGDVNSVLLMADVVLYG 454

Query: 181  TSQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVS 360
            +SQ+EQ FP LL RA++FG P+IAPD+P++K YV DG HG++FS+++P+ L RA SLL+S
Sbjct: 455  SSQNEQGFPPLLIRAMTFGTPVIAPDIPILKKYVDDGAHGILFSKYSPEALTRALSLLIS 514

Query: 361  RDGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTW 540
             +G LS+ A+ +  SG+LLAKN+ AS+CI G+ARL+EN+++F SDTLLP  ++  ++  W
Sbjct: 515  -NGKLSKFAQTLAFSGRLLAKNMLASECIIGYARLLENLISFPSDTLLPGPVSNLQRREW 573

Query: 541  EWNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGE---- 708
            EWN   + LE++  D++S  E       +  +Y+LEK+      ++ +N +  +G     
Sbjct: 574  EWNLFSKELEQEIDDLLSMAEGDFSFRETSAVYSLEKEW-----SNHVNSTSISGNGTEI 628

Query: 709  LEQDLLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEAN 888
            L  D+    DWD ++EIESFE+ ER+E +++ ERM+K  G WDEIY +ARKSEK+KFEAN
Sbjct: 629  LVPDIPTESDWDVLSEIESFEEYERVETEELQERMDKSHGPWDEIYHDARKSEKLKFEAN 688

Query: 889  ERDEGDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSF 1068
            ERDEG+LERTGQPVCIYEIY+GAGAWPFL+HGSLYRGLSL T+ RR  SDDVDA  RL  
Sbjct: 689  ERDEGELERTGQPVCIYEIYDGAGAWPFLNHGSLYRGLSLSTKARRSRSDDVDAVARLPL 748

Query: 1069 LNESHYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQ 1248
            LN+S+Y+N+LC++GGMFSIA +VDDIH RPWIGFQ              AE+ LEE +Q+
Sbjct: 749  LNDSYYQNILCDIGGMFSIANRVDDIHKRPWIGFQSWHAAGSKVSLTFKAEQVLEEKVQE 808

Query: 1249 QTKGDVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDA 1428
            + K DV+Y+WARL+MD G+T  +  L FWSMCDILNGG CR  FE+AFR MY LP   + 
Sbjct: 809  ENK-DVMYYWARLDMDGGVTGSNDELTFWSMCDILNGGHCRIAFEDAFRHMYGLPSNLEV 867

Query: 1429 LPPMPDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELER 1608
            LPPMP+DGG+WSALH WVMPTPSFLEFIMFSRMFVDS+DAL SN ++   CLL SSEL+ 
Sbjct: 868  LPPMPEDGGHWSALHSWVMPTPSFLEFIMFSRMFVDSLDALQSNSSQMTKCLLSSSELQE 927

Query: 1609 KHCYCRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMX 1788
            KHCYCRILE+LVNVWAYHSARRMVYIDP TG +EE HPV+QRKG MW KYF + +LKSM 
Sbjct: 928  KHCYCRILEVLVNVWAYHSARRMVYIDPHTGSVEEQHPVEQRKGIMWEKYFKLMVLKSMD 987

Query: 1789 XXXXXXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHG 1968
                         RE WLWPLTGEVHW G+YE++REEKYR+KMDKKRK KEKL +R + G
Sbjct: 988  EDLAEAADDGDHPRERWLWPLTGEVHWQGIYEREREEKYRVKMDKKRKTKEKLFERLKSG 1047

Query: 1969 YKQKTL 1986
            YKQK L
Sbjct: 1048 YKQKPL 1053


>ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779157 isoform X2 [Glycine
            max]
          Length = 1043

 Score =  823 bits (2125), Expect = 0.0
 Identities = 399/659 (60%), Positives = 494/659 (74%), Gaps = 1/659 (0%)
 Frame = +1

Query: 16   SFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGTSQDE 195
            SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG++Q+ 
Sbjct: 395  SFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGSAQEV 454

Query: 196  QSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSRDGIL 375
            Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S +G L
Sbjct: 455  QGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS-NGRL 513

Query: 376  SELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWEWNFV 555
            S+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP  ++Q +QG+WEWN  
Sbjct: 514  SKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGPVSQIQQGSWEWNLF 573

Query: 556  GEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QDLLNA 732
               ++   +D   F  R        I+YA+E +   L  +   ++ E   E+  +D L  
Sbjct: 574  RNEIDLSKIDG-DFSNRKVS-----IVYAVEHELASLNYST--SIFENGTEVPLRDELTQ 625

Query: 733  EDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDEGDLE 912
             DWD + EIE  E+ E  E+++ +ER EK  G WD+IYRNARKSEK+KFE NERDEG+LE
Sbjct: 626  LDWDILREIEISEENEMFEVEEAEERREKGVGVWDDIYRNARKSEKLKFEVNERDEGELE 685

Query: 913  RTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNESHYRN 1092
            RTGQPVCIYEIY GAG WPFLHHGSLYRGLSL  R +R SSDDVDA  RL  LN+++YR+
Sbjct: 686  RTGQPVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQSSDDVDAVGRLPLLNDTYYRD 745

Query: 1093 LLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKGDVVY 1272
            +LCEMGGMF+IA +VD+IH RPWIGFQ              AE+ LEET+Q+  +GDV+Y
Sbjct: 746  ILCEMGGMFAIANRVDNIHRRPWIGFQSWRAAGRKVALSAKAEKVLEETMQENFRGDVIY 805

Query: 1273 FWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPMPDDG 1452
            FW R +MD  +        FW MCDILNGG CR  F+  FR+MYALPP  +ALPPMP+D 
Sbjct: 806  FWGRFDMDQSVIGNHNANSFWYMCDILNGGNCRIVFQEGFRQMYALPPHAEALPPMPED- 864

Query: 1453 GYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCYCRIL 1632
            GYWSALH WVMPTPSFLEFIMFSRMFVDSIDALH + T+ + CLLGSSE+E KHCYCR+L
Sbjct: 865  GYWSALHSWVMPTPSFLEFIMFSRMFVDSIDALHRDSTKYSLCLLGSSEIE-KHCYCRVL 923

Query: 1633 ELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXXXXXX 1812
            ELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMWAKYFNI+LLKSM         
Sbjct: 924  ELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNISLLKSMDEDLAEAAD 983

Query: 1813 XXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQKTLG 1989
                 RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK+LG
Sbjct: 984  DGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQKSLG 1042


>ref|XP_006606297.1| PREDICTED: uncharacterized protein LOC100790929 isoform X2 [Glycine
            max]
          Length = 1044

 Score =  822 bits (2122), Expect = 0.0
 Identities = 397/663 (59%), Positives = 494/663 (74%), Gaps = 1/663 (0%)
 Frame = +1

Query: 4    DVGGSFRFIFLCGNSTDGYNDALQELAVHLRLSPGSIRHYGMNADVNGVLLMSDIVLYGT 183
            D   SF+F+FLCGNSTDGY+DALQ +A  + L  GSIRHYG+N DVN VLLM+DI+LYG+
Sbjct: 394  DATDSFKFVFLCGNSTDGYDDALQGVASRMGLRQGSIRHYGLNGDVNSVLLMADIILYGS 453

Query: 184  SQDEQSFPSLLTRALSFGIPIIAPDLPVIKNYVRDGVHGMIFSRHNPDDLLRAFSLLVSR 363
            +Q+ Q FP LL RA++F IP++ PD  V+K Y+ DGVHG+ FS+HNP+ L+ AFSLL+S 
Sbjct: 454  AQEVQGFPPLLIRAMTFEIPVVVPDFSVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLS- 512

Query: 364  DGILSELAKAVGSSGKLLAKNLQASDCIKGFARLMENVLNFQSDTLLPSSITQSEQGTWE 543
            +G LS+ A+A+ SSG+ LAKN+ A DCI G+ARL+ENVLNF SD LLP +++Q +QG+WE
Sbjct: 513  NGRLSKFAQAIASSGRQLAKNVLALDCITGYARLLENVLNFPSDALLPGAVSQIQQGSWE 572

Query: 544  WNFVGEILEEKSVDMMSFDERATKVDRSRIIYALEKDHLDLGITDPMNMSEYNGELE-QD 720
            WN     L +  +D+   D       +  I+YA+E +   L  +   ++ E   E+  QD
Sbjct: 573  WN-----LFQNEIDLSKIDSNR----KVSIVYAVEHELASLNYST--SIVENGTEVPLQD 621

Query: 721  LLNAEDWDGIAEIESFEDLERLEMDQIDERMEKEPGNWDEIYRNARKSEKVKFEANERDE 900
             L   D D + EIE  E+ E  E+++ +ERMEK    WD+IYRNARKSEK+KFE NERDE
Sbjct: 622  ELTQLDLDTLREIEISEENEMFEVEEAEERMEKGVSVWDDIYRNARKSEKLKFEVNERDE 681

Query: 901  GDLERTGQPVCIYEIYEGAGAWPFLHHGSLYRGLSLVTRTRRLSSDDVDAFTRLSFLNES 1080
            G+LERTGQ VCIYEIY GAG WPFLHHGSLYRGLSL  R +R +SDDVDA  RL  LN++
Sbjct: 682  GELERTGQSVCIYEIYNGAGVWPFLHHGSLYRGLSLSRRAQRQTSDDVDAVGRLPLLNDT 741

Query: 1081 HYRNLLCEMGGMFSIAYKVDDIHMRPWIGFQXXXXXXXXXXXXXXAERALEETIQQQTKG 1260
            +YR++LCEMGGMF+IA +VD IH RPWIGFQ              AE  LEET+Q+  +G
Sbjct: 742  YYRDILCEMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSAKAENVLEETMQENFRG 801

Query: 1261 DVVYFWARLEMDSGLTRGSKPLPFWSMCDILNGGQCRSTFENAFRRMYALPPVKDALPPM 1440
            DV+YFW RL+MD    R    + FW MCDILNGG CR  F++ FR+MYALPP  +ALPPM
Sbjct: 802  DVIYFWGRLDMDQSAIRNHNAISFWYMCDILNGGNCRIVFQDGFRQMYALPPHAEALPPM 861

Query: 1441 PDDGGYWSALHGWVMPTPSFLEFIMFSRMFVDSIDALHSNKTETAHCLLGSSELERKHCY 1620
            P+DGGYWSALH WVMPT SFLEFIMFSRMFVDSIDA H + T+ + CLLGSSE+E KHCY
Sbjct: 862  PEDGGYWSALHSWVMPTSSFLEFIMFSRMFVDSIDAKHRDSTKYSLCLLGSSEIE-KHCY 920

Query: 1621 CRILELLVNVWAYHSARRMVYIDPQTGLLEEHHPVQQRKGFMWAKYFNITLLKSMXXXXX 1800
            CR+LELL+NVWAYHSAR+MVYI+P TG +EE HP++QRKGFMW+KYFN +LLKSM     
Sbjct: 921  CRMLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWSKYFNFSLLKSMDEDLA 980

Query: 1801 XXXXXXXXXRENWLWPLTGEVHWPGVYEKQREEKYRLKMDKKRKVKEKLIDRFQHGYKQK 1980
                     RE WLWP+TGEVHW G+YE++REE+YRLKMDKKRK KEKL +R ++GYKQK
Sbjct: 981  EAADDGDHPREMWLWPMTGEVHWQGIYEREREERYRLKMDKKRKTKEKLFERMKYGYKQK 1040

Query: 1981 TLG 1989
            +LG
Sbjct: 1041 SLG 1043

