BLASTX nr result

ID: Panax24_contig00022992 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00022992
         (1738 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010660825.2 PREDICTED: glycosyltransferase family 92 protein ...   765   0.0  
CBI17696.3 unnamed protein product, partial [Vitis vinifera]          765   0.0  
XP_018847820.1 PREDICTED: glycosyltransferase family 92 protein ...   746   0.0  
XP_015879042.1 PREDICTED: UPF0392 protein RCOM_0530710 [Ziziphus...   755   0.0  
XP_011097331.1 PREDICTED: UPF0392 protein RCOM_0530710 [Sesamum ...   741   0.0  
XP_002300081.1 hypothetical protein POPTR_0001s36210g [Populus t...   753   0.0  
XP_011045246.1 PREDICTED: UPF0392 protein RCOM_0530710 [Populus ...   749   0.0  
OAY29580.1 hypothetical protein MANES_15G156000 [Manihot esculenta]   729   0.0  
KVI09280.1 protein of unknown function DUF23 [Cynara cardunculus...   728   0.0  
GAV85513.1 Glyco_transf_92 domain-containing protein/zf-C3HC4_2 ...   733   0.0  
B9SLR1.1 RecName: Full=Glycosyltransferase family 92 protein RCO...   716   0.0  
XP_010111642.1 hypothetical protein L484_017669 [Morus notabilis...   709   0.0  
XP_006338653.1 PREDICTED: UPF0392 protein RCOM_0530710-like [Sol...   707   0.0  
XP_010029049.1 PREDICTED: glycosyltransferase family 92 protein ...   720   0.0  
KDP46347.1 hypothetical protein JCGZ_10187 [Jatropha curcas]          705   0.0  
XP_015579533.1 PREDICTED: LOW QUALITY PROTEIN: UPF0392 protein R...   717   0.0  
OMO69085.1 hypothetical protein COLO4_29270 [Corchorus olitorius]     701   0.0  
XP_017975667.1 PREDICTED: glycosyltransferase family 92 protein ...   698   0.0  
EOY02628.1 UPF0392 protein RCOM_0530710 isoform 2 [Theobroma cacao]   697   0.0  
XP_016683836.1 PREDICTED: glycosyltransferase family 92 protein ...   696   0.0  

>XP_010660825.2 PREDICTED: glycosyltransferase family 92 protein RCOM_0530710 [Vitis
            vinifera]
          Length = 908

 Score =  765 bits (1976), Expect = 0.0
 Identities = 363/506 (71%), Positives = 425/506 (83%), Gaps = 4/506 (0%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYI 1546
            P LVSSWR  AMEAI+G+S   P +S+RETVIFPDQ ++FLKYPPS  LFTKDD+DC+Y 
Sbjct: 46   PVLVSSWRKQAMEAIAGESFIPPAISIRETVIFPDQELVFLKYPPSARLFTKDDLDCLYF 105

Query: 1545 SPNSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLAY 1366
            SPNSS + +KL P  +D E   HQIVRCP +PRG T+++ +KSN  L  GPTH WDSL Y
Sbjct: 106  SPNSSDSHIKLPPEDVDGETRDHQIVRCPRRPRGFTLSLVLKSNALLRPGPTHQWDSLVY 165

Query: 1365 EALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRCK 1186
            EALIDRDNTT+ FVKGLNLRP R S+ +RFECVYGWDF + +FLL S+VVSIAQE+VRC+
Sbjct: 166  EALIDRDNTTVAFVKGLNLRPDRASDPTRFECVYGWDFRKPRFLLRSEVVSIAQEVVRCR 225

Query: 1185 TPLSVLK--QRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTMLR 1012
            TPLS+L   QR+N+    IKVS+R+ G+GIL SIA P R+  PDPP +KQHEMCICTM+R
Sbjct: 226  TPLSILNNPQRLNST---IKVSVRMKGKGILNSIAEPKRRSPPDPPIRKQHEMCICTMVR 282

Query: 1011 NQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQEAG 832
            NQARF+REW+MYH++IGVQRWFIYDNNS D+I+KV+E L   N+NI+RH+WPWIKTQEAG
Sbjct: 283  NQARFLREWIMYHAQIGVQRWFIYDNNSVDNIEKVLESLETANLNISRHLWPWIKTQEAG 342

Query: 831  FAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRG--RQVAELRTSCYSFGPS 658
            FAHCALRARDSCEWVGF+DVDEFLHL  G +L +V+ N+SR     VAELR SCYSFGPS
Sbjct: 343  FAHCALRARDSCEWVGFIDVDEFLHLPSGASLQDVVWNQSRSANNNVAELRISCYSFGPS 402

Query: 657  GLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVM 478
            GL+ VP KGV VGYTCRL APERHKSIVRPEALNSTLIN+VHHF LR+ F++VN+DR  M
Sbjct: 403  GLTSVPPKGVAVGYTCRLSAPERHKSIVRPEALNSTLINVVHHFHLRNGFDFVNVDRGAM 462

Query: 477  VINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEV 298
            VINHYKYQVW+VFKEKFYRRVA YV DWQ E+NVGSKDRAPGLGTRAVEPPDWS+RFCEV
Sbjct: 463  VINHYKYQVWEVFKEKFYRRVAAYVADWQDEENVGSKDRAPGLGTRAVEPPDWSTRFCEV 522

Query: 297  TDTGLRDQVLENFTDPKTGLLPWQEN 220
            TDTGLRD+VL+ F DP+T  +PWQE+
Sbjct: 523  TDTGLRDRVLQIFKDPETHRMPWQEH 548


>CBI17696.3 unnamed protein product, partial [Vitis vinifera]
          Length = 1019

 Score =  765 bits (1976), Expect = 0.0
 Identities = 363/506 (71%), Positives = 425/506 (83%), Gaps = 4/506 (0%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYI 1546
            P LVSSWR  AMEAI+G+S   P +S+RETVIFPDQ ++FLKYPPS  LFTKDD+DC+Y 
Sbjct: 79   PVLVSSWRKQAMEAIAGESFIPPAISIRETVIFPDQELVFLKYPPSARLFTKDDLDCLYF 138

Query: 1545 SPNSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLAY 1366
            SPNSS + +KL P  +D E   HQIVRCP +PRG T+++ +KSN  L  GPTH WDSL Y
Sbjct: 139  SPNSSDSHIKLPPEDVDGETRDHQIVRCPRRPRGFTLSLVLKSNALLRPGPTHQWDSLVY 198

Query: 1365 EALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRCK 1186
            EALIDRDNTT+ FVKGLNLRP R S+ +RFECVYGWDF + +FLL S+VVSIAQE+VRC+
Sbjct: 199  EALIDRDNTTVAFVKGLNLRPDRASDPTRFECVYGWDFRKPRFLLRSEVVSIAQEVVRCR 258

Query: 1185 TPLSVLK--QRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTMLR 1012
            TPLS+L   QR+N+    IKVS+R+ G+GIL SIA P R+  PDPP +KQHEMCICTM+R
Sbjct: 259  TPLSILNNPQRLNST---IKVSVRMKGKGILNSIAEPKRRSPPDPPIRKQHEMCICTMVR 315

Query: 1011 NQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQEAG 832
            NQARF+REW+MYH++IGVQRWFIYDNNS D+I+KV+E L   N+NI+RH+WPWIKTQEAG
Sbjct: 316  NQARFLREWIMYHAQIGVQRWFIYDNNSVDNIEKVLESLETANLNISRHLWPWIKTQEAG 375

Query: 831  FAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRG--RQVAELRTSCYSFGPS 658
            FAHCALRARDSCEWVGF+DVDEFLHL  G +L +V+ N+SR     VAELR SCYSFGPS
Sbjct: 376  FAHCALRARDSCEWVGFIDVDEFLHLPSGASLQDVVWNQSRSANNNVAELRISCYSFGPS 435

Query: 657  GLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVM 478
            GL+ VP KGV VGYTCRL APERHKSIVRPEALNSTLIN+VHHF LR+ F++VN+DR  M
Sbjct: 436  GLTSVPPKGVAVGYTCRLSAPERHKSIVRPEALNSTLINVVHHFHLRNGFDFVNVDRGAM 495

Query: 477  VINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEV 298
            VINHYKYQVW+VFKEKFYRRVA YV DWQ E+NVGSKDRAPGLGTRAVEPPDWS+RFCEV
Sbjct: 496  VINHYKYQVWEVFKEKFYRRVAAYVADWQDEENVGSKDRAPGLGTRAVEPPDWSTRFCEV 555

Query: 297  TDTGLRDQVLENFTDPKTGLLPWQEN 220
            TDTGLRD+VL+ F DP+T  +PWQE+
Sbjct: 556  TDTGLRDRVLQIFKDPETHRMPWQEH 581


>XP_018847820.1 PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like
            [Juglans regia]
          Length = 582

 Score =  746 bits (1926), Expect = 0.0
 Identities = 365/534 (68%), Positives = 427/534 (79%), Gaps = 20/534 (3%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYI 1546
            P LV  WR P ME+ISGDSP T  +S+RETV+ PDQA++FL Y PS  LFTK+DIDC+Y 
Sbjct: 49   PALVFKWRAPVMESISGDSPVTRSISIRETVMLPDQALIFLNYLPSARLFTKEDIDCVYF 108

Query: 1545 SPNSSQAQLKLAPVSI----DREYLGHQIVRCPLQPRGSTVTITVKSNGNLPA-GPTHHW 1381
            S  SSQ QL+  P+ +    DR+   +QIVRCP QPRG TV++ +KS+G++P  GP+H W
Sbjct: 109  SAESSQRQLRKPPLDVEGGGDRD---NQIVRCPRQPRGFTVSLALKSSGHVPPPGPSHRW 165

Query: 1380 DSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQE 1201
            DSLAYEA+IDRDNTTIVFVKGLNLRP R+SN SR+ECVYGWDF R KFLL SDVVSIAQE
Sbjct: 166  DSLAYEAMIDRDNTTIVFVKGLNLRPERLSNVSRYECVYGWDFGRPKFLLRSDVVSIAQE 225

Query: 1200 IVRCKTPLSVLK--QRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDP--DPPAQKQHEM 1033
            IVRCKTPLSVL   +R++ ++N +KVS+R  GRGI +SIARP   + P  DP  +KQ+EM
Sbjct: 226  IVRCKTPLSVLNSPRRMSTSNNFVKVSVRPKGRGIFRSIARPGSGLRPGNDPYTRKQYEM 285

Query: 1032 CICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDI----DKVVELLM-------ED 886
            C+CTMLRNQARF++EWVMYH+RIGVQRWFIYDNNS+DDI    D VV++L        E 
Sbjct: 286  CVCTMLRNQARFLKEWVMYHARIGVQRWFIYDNNSDDDIEDVIDNVVDMLSKTKDMDKEV 345

Query: 885  NVNITRHVWPWIKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRG 706
            N NITRHVWPWIKTQEAGF+HCALRARD+CEWVGF+DVDEF +L  G+ LH+VL N+SR 
Sbjct: 346  NYNITRHVWPWIKTQEAGFSHCALRARDTCEWVGFIDVDEFFYLPSGLVLHDVLQNQSRY 405

Query: 705  RQVAELRTSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHF 526
              V E+R SCYSFGPSGL  VP +GV  GY CRL APERHKSIV+PEALNSTLIN+VHHF
Sbjct: 406  SYVGEVRVSCYSFGPSGLKHVPEQGVTAGYNCRLAAPERHKSIVKPEALNSTLINVVHHF 465

Query: 525  DLRDDFEYVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLG 346
             LR  FEYVN+D+ VMVINHYKYQVWQVFKEKFYRRVATYV DWQ ++N GSKDRAPGLG
Sbjct: 466  HLRKGFEYVNVDKGVMVINHYKYQVWQVFKEKFYRRVATYVADWQDDQNAGSKDRAPGLG 525

Query: 345  TRAVEPPDWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQENGFYVEHKKRKRR 184
            TRAV+PPDWSSRFCE TD GLR+QVL  F DP+T  LPWQE G    H+K + +
Sbjct: 526  TRAVQPPDWSSRFCEFTDNGLRNQVLRLFADPETQTLPWQEVGEQANHRKTRNK 579


>XP_015879042.1 PREDICTED: UPF0392 protein RCOM_0530710 [Ziziphus jujuba]
          Length = 926

 Score =  755 bits (1950), Expect = 0.0
 Identities = 361/518 (69%), Positives = 421/518 (81%), Gaps = 6/518 (1%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPAT-PVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            P LV  WR P MEAISGDSP   P +S+RE V+ PDQA++FL  PP   LFTKDD+ C+Y
Sbjct: 50   PVLVLPWRAPQMEAISGDSPVKKPYISIREIVLLPDQALVFLNCPPPARLFTKDDLHCVY 109

Query: 1548 ISPN---SSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWD 1378
               N   SSQ +LK  P  +D +   +QIVRCPLQPRG TV + +K  G LP+GP+H W+
Sbjct: 110  FPANDSSSSQRRLKKLPREVDGDDPENQIVRCPLQPRGFTVAVALKPKGELPSGPSHRWN 169

Query: 1377 SLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEI 1198
            SLAYEALIDRDNTTI+FVKGLNL+P RVSNASRFECVYGWDF + KFLL +DVVSIAQEI
Sbjct: 170  SLAYEALIDRDNTTIIFVKGLNLKPERVSNASRFECVYGWDFRKPKFLLRTDVVSIAQEI 229

Query: 1197 VRCKTPLSVLK--QRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCIC 1024
            VRCKTPLSVL   QR N++   +KVS+R+ G+GIL +IARP+ + +P+P  +K HEMC+C
Sbjct: 230  VRCKTPLSVLNAPQRANSS---VKVSVRIKGKGILPTIARPISRTEPNPQTRKSHEMCVC 286

Query: 1023 TMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKT 844
            TM+RNQ RF+REWV YH+ +GV RWFIYDNNS+DDID V+E L+  N N+TRHVWPWIKT
Sbjct: 287  TMVRNQGRFVREWVKYHAEMGVDRWFIYDNNSDDDIDFVIESLVSANYNVTRHVWPWIKT 346

Query: 843  QEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQVAELRTSCYSFG 664
            QEAGFAHCALRARDSC WVGF+DVDEF HL  G+ LH+VL+N+S+   +AELR SCYSFG
Sbjct: 347  QEAGFAHCALRARDSCNWVGFIDVDEFFHLPSGLFLHDVLHNQSKYANIAELRVSCYSFG 406

Query: 663  PSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRN 484
            PSGL  VP +GV VGYTCRL APERHKSIV+P+ALNS+LIN+VHHF LRD F YVNMD+ 
Sbjct: 407  PSGLKHVPPQGVTVGYTCRLAAPERHKSIVKPDALNSSLINVVHHFHLRDGFRYVNMDKG 466

Query: 483  VMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFC 304
             +VINHYKYQVW+VFKEKFYRRVATYV DWQ E+NVGSKDRAPGLGTRAVEPPDWSSRFC
Sbjct: 467  EIVINHYKYQVWEVFKEKFYRRVATYVADWQDEQNVGSKDRAPGLGTRAVEPPDWSSRFC 526

Query: 303  EVTDTGLRDQVLENFTDPKTGLLPWQENGFYVEHKKRK 190
            EVTDTGLRD+VL N  DP+T LLPWQE     +HKK K
Sbjct: 527  EVTDTGLRDRVLTNLADPQTHLLPWQE---VEKHKKSK 561


>XP_011097331.1 PREDICTED: UPF0392 protein RCOM_0530710 [Sesamum indicum]
          Length = 558

 Score =  741 bits (1912), Expect = 0.0
 Identities = 343/507 (67%), Positives = 428/507 (84%), Gaps = 4/507 (0%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            HP++++    PAMEAIS ++  TP +S+RETVIFPDQA++FLKYPPS  LFTKDDI+CIY
Sbjct: 43   HPRMITKGNIPAMEAISAENSVTPAISIRETVIFPDQALVFLKYPPSAALFTKDDINCIY 102

Query: 1548 ISPNSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLA 1369
            + PNSS+ QL+L+P S+D +Y  HQIVRCPLQPRG  +++ VK NGNL  GPT+ WD+LA
Sbjct: 103  LLPNSSRPQLELSPSSVDYDYGNHQIVRCPLQPRGLILSLAVKHNGNLTPGPTYRWDALA 162

Query: 1368 YEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRC 1189
            YEA+IDRDN+TIVFVKGLNLR GRV+N+SRF+C+YGWD ++ KF+LS++ VS+AQEIVRC
Sbjct: 163  YEAMIDRDNSTIVFVKGLNLRSGRVANSSRFKCLYGWDLTKPKFVLSANAVSVAQEIVRC 222

Query: 1188 KTPLSVLK--QRVNNNDNP-IKVSIRVLGRGILKSIARPVRKMDPDPPAQ-KQHEMCICT 1021
            +TPLS+L   QR  ++ N  IKVS+RV+G+  L SIAR   +      A+ KQHE+C+CT
Sbjct: 223  RTPLSILNRLQRFTSSFNDSIKVSVRVIGQRTLNSIARVKHQYGSKQLARGKQHELCVCT 282

Query: 1020 MLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQ 841
            MLRNQARF+REWVMYH+ +GVQRWFIYDNNS+DDI++VVE L++ +  +TRHVWPWIKTQ
Sbjct: 283  MLRNQARFLREWVMYHAHVGVQRWFIYDNNSDDDIERVVESLVDTSYKVTRHVWPWIKTQ 342

Query: 840  EAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQVAELRTSCYSFGP 661
            EAGFAHCALRA+DSCEWVGF+DVDEF HL  G++L +V+ N SR  +VAELR  C+SFGP
Sbjct: 343  EAGFAHCALRAQDSCEWVGFIDVDEFFHLPSGLSLKDVVANASRPSEVAELRVPCHSFGP 402

Query: 660  SGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNV 481
            SGL +VP+KGVM GYTCRL APERHKSIV+PEALN++LIN+VHHF L+  F ++NM+R+ 
Sbjct: 403  SGLKKVPMKGVMAGYTCRLAAPERHKSIVKPEALNTSLINVVHHFHLKSGFRHINMNRSK 462

Query: 480  MVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCE 301
            +VINHYKYQVW+VFKEKFYRRVATYV DWQQE+NVGSKDRAPGLGT+A+EP DWS RFCE
Sbjct: 463  LVINHYKYQVWEVFKEKFYRRVATYVSDWQQERNVGSKDRAPGLGTKAIEPLDWSMRFCE 522

Query: 300  VTDTGLRDQVLENFTDPKTGLLPWQEN 220
            V DTGLRD++ + F DPKT LLPW+++
Sbjct: 523  VKDTGLRDRIFKIFVDPKTSLLPWEDH 549


>XP_002300081.1 hypothetical protein POPTR_0001s36210g [Populus trichocarpa]
            EEE84886.1 hypothetical protein POPTR_0001s36210g
            [Populus trichocarpa]
          Length = 915

 Score =  753 bits (1944), Expect = 0.0
 Identities = 365/511 (71%), Positives = 425/511 (83%), Gaps = 7/511 (1%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            HP++VS+WRTPAMEA+SGDS A P  S+RETVI PDQ ++FLKYPPS+ LFTK+D+ C+Y
Sbjct: 48   HPEIVSTWRTPAMEALSGDSSAVPAPSIRETVILPDQVLVFLKYPPSSRLFTKEDLLCVY 107

Query: 1548 ISPN--SSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDS 1375
            +S N  SSQ+Q +L P  ID + +  QIVRCPL PRG TV++ +KS G +  GPTH WDS
Sbjct: 108  LSANKSSSQSQRRLPPNHIDGKDVDDQIVRCPLIPRGYTVSLALKSGGYIHPGPTHKWDS 167

Query: 1374 LAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIV 1195
            L YEALIDRDNTT+VFVKGLNLRP ++SNASRFECVYGWDF R KFLL S V+S+AQEIV
Sbjct: 168  LVYEALIDRDNTTVVFVKGLNLRPEKLSNASRFECVYGWDFRRPKFLLRSQVISMAQEIV 227

Query: 1194 RCKTPLSVL--KQRVNNNDNPIKVSIRVLGRGILKSIARPV--RKMDPDPPAQKQHEMCI 1027
            RCKTPLSVL   Q VN++   IK SIRV GRG L SIARP    K  P PP +K HEMCI
Sbjct: 228  RCKTPLSVLGAPQMVNSS---IKASIRVKGRGTLHSIARPGLRSKPQPGPPERKPHEMCI 284

Query: 1026 CTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIK 847
            CTMLRNQARF+REWVMYH+++GVQ W+IYDNNS+DDI+ V+E L++   NI+RHVWPWIK
Sbjct: 285  CTMLRNQARFLREWVMYHAQVGVQSWYIYDNNSDDDIEDVMESLVQAGFNISRHVWPWIK 344

Query: 846  TQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNES-RGRQVAELRTSCYS 670
            TQEAGFAHCALRAR+SCEWVGF+DVDEF +   G++LH+V++N+S  G  VAE+RTSCYS
Sbjct: 345  TQEAGFAHCALRARESCEWVGFIDVDEFFYSPLGLSLHDVISNQSGSGNNVAEIRTSCYS 404

Query: 669  FGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMD 490
            FGPSGL  +P +GVMVGYTCRLGAPERHKSIV+PEALNSTLIN+VHHF L + F YVN D
Sbjct: 405  FGPSGLKHLPPQGVMVGYTCRLGAPERHKSIVKPEALNSTLINVVHHFHLSEGFRYVNAD 464

Query: 489  RNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSR 310
            R V+ INHYKYQVW+VFKEKFYRRVATYV DWQ E+NVGSKDRAPGLGTRAVEPPDWSSR
Sbjct: 465  RGVLAINHYKYQVWEVFKEKFYRRVATYVADWQNEQNVGSKDRAPGLGTRAVEPPDWSSR 524

Query: 309  FCEVTDTGLRDQVLENFTDPKTGLLPWQENG 217
            FCEVTDTGLR+ VL+ F DP T  LPW+E G
Sbjct: 525  FCEVTDTGLRNLVLQKFMDPLTNHLPWEELG 555


>XP_011045246.1 PREDICTED: UPF0392 protein RCOM_0530710 [Populus euphratica]
          Length = 989

 Score =  749 bits (1933), Expect = 0.0
 Identities = 364/511 (71%), Positives = 425/511 (83%), Gaps = 7/511 (1%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            HP++VS+WRTPAMEA+SGDS A    S+RETV+ PDQ ++FLKYPPS+ LFTK+D+ C+Y
Sbjct: 47   HPEIVSTWRTPAMEALSGDSSAVLAPSIRETVMLPDQVLVFLKYPPSSRLFTKEDLLCVY 106

Query: 1548 ISPN--SSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDS 1375
            IS N  SSQ+Q  L P+ ID +    QIVRCPL PRG TV++++KS G +  GPTH W+S
Sbjct: 107  ISANKSSSQSQRWLPPIHIDGKDADDQIVRCPLIPRGYTVSLSLKSGGYIHPGPTHKWES 166

Query: 1374 LAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIV 1195
            L YEALIDRDNTT+VFVKGLNLRP ++SNASRFECVYGWDF R KFLL S V+S+AQEIV
Sbjct: 167  LVYEALIDRDNTTVVFVKGLNLRPEKLSNASRFECVYGWDFRRPKFLLRSQVISMAQEIV 226

Query: 1194 RCKTPLSVL--KQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPD--PPAQKQHEMCI 1027
            RCKTPLSVL   Q VN++   IKVSIRV GRG L SIARP  +  P   PP +K HEMCI
Sbjct: 227  RCKTPLSVLGAPQMVNSS---IKVSIRVKGRGTLHSIARPGLRSMPQRGPPERKPHEMCI 283

Query: 1026 CTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIK 847
            CTMLRNQARF+REWVMYH+++GVQ W+IYDNNS+DDI+ V+E L++   NI+RHVWPWIK
Sbjct: 284  CTMLRNQARFLREWVMYHAQVGVQSWYIYDNNSDDDIEDVMESLVQAGFNISRHVWPWIK 343

Query: 846  TQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESR-GRQVAELRTSCYS 670
            TQEAGFAHCALRAR+SCEWVGF+DVDEF +   G++LH+V++N+S  G  VAE+RTSCYS
Sbjct: 344  TQEAGFAHCALRARESCEWVGFIDVDEFFYSPLGLSLHDVISNQSESGNNVAEIRTSCYS 403

Query: 669  FGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMD 490
            FGPSGL  VP +GVMVGYTCRLGAPERHKSIV+PEALNSTLIN+VHHF L + F YVN D
Sbjct: 404  FGPSGLKHVPPQGVMVGYTCRLGAPERHKSIVKPEALNSTLINVVHHFHLSEGFRYVNAD 463

Query: 489  RNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSR 310
            R V+VINHYKYQ W+VFKEKFYRRVATYV DWQ E+NVGSKDRAPGLGTRAVEPPDWSSR
Sbjct: 464  RGVLVINHYKYQAWEVFKEKFYRRVATYVADWQNEQNVGSKDRAPGLGTRAVEPPDWSSR 523

Query: 309  FCEVTDTGLRDQVLENFTDPKTGLLPWQENG 217
            FCEVTDTGLR+ VL+ F DP T  LPW+E G
Sbjct: 524  FCEVTDTGLRNLVLQKFMDPLTNNLPWEEMG 554


>OAY29580.1 hypothetical protein MANES_15G156000 [Manihot esculenta]
          Length = 562

 Score =  729 bits (1883), Expect = 0.0
 Identities = 353/512 (68%), Positives = 414/512 (80%), Gaps = 9/512 (1%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAISGDSPA-TPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCI 1552
            HP++V +WR PA+ AI+G+SP  +  +S+RETV+ PD+AVLFLKYP S  LFTK+D+DC 
Sbjct: 46   HPEIVYAWRPPALRAITGESPGNSDTISIRETVLLPDEAVLFLKYPQSARLFTKEDLDCA 105

Query: 1551 YISPNSS---QAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHW 1381
            Y S +SS   Q Q K  P  ID +   HQIVRCPL PRG  V + +KS G + +GP H W
Sbjct: 106  YFSTDSSSSAQPQFKRPPNRIDSDDPDHQIVRCPLSPRGLVVALALKSGGYINSGPIHRW 165

Query: 1380 DSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQE 1201
            DSL YEAL+DRDNTT+VFVKGLNLRP ++ N SRFEC+YGWDF   KFLL S V+SIAQE
Sbjct: 166  DSLVYEALLDRDNTTVVFVKGLNLRPDKLYNTSRFECLYGWDFRTPKFLLRSSVMSIAQE 225

Query: 1200 IVRCKTPLSVLK--QRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPP--AQKQHEM 1033
            IVRC+TPLS+L   Q+VNN+   IKVSIRV GRG L SIARP  ++DP+P   A+K H M
Sbjct: 226  IVRCQTPLSILSNPQKVNNS---IKVSIRVKGRGTLHSIARPRLQLDPNPDLKARKPHHM 282

Query: 1032 CICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPW 853
            CICTMLRNQARF+ EWVMYH RIGVQRWFIYDNNS D+ID V+E LM+ N NI+++VWPW
Sbjct: 283  CICTMLRNQARFLGEWVMYHGRIGVQRWFIYDNNSEDNIDSVIESLMDSNYNISKYVWPW 342

Query: 852  IKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQ-VAELRTSC 676
            IKTQEAGFAHCALRAR SCEWVGF+DVDEF HL +G+ L +VL N+S     V ELR SC
Sbjct: 343  IKTQEAGFAHCALRARTSCEWVGFIDVDEFFHLPNGLNLLDVLRNQSESNSNVGELRVSC 402

Query: 675  YSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVN 496
            +SFGPSGL  VP +GV VGYTCR+  PERHKSIV+PEALN+TLIN+VHHF LRD F +VN
Sbjct: 403  HSFGPSGLQHVPTEGVTVGYTCRMILPERHKSIVKPEALNATLINVVHHFHLRDGFRHVN 462

Query: 495  MDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWS 316
             DR V+VINHYKYQVW+VFKEKFYRRVATYV+DWQ ++NVGSKDR PGLGTRAVEPPDW+
Sbjct: 463  ADRGVLVINHYKYQVWEVFKEKFYRRVATYVVDWQNQQNVGSKDRVPGLGTRAVEPPDWA 522

Query: 315  SRFCEVTDTGLRDQVLENFTDPKTGLLPWQEN 220
            SRFCEVTDTGLRD+VLE F+DP T LLPWQE+
Sbjct: 523  SRFCEVTDTGLRDRVLEMFSDPITRLLPWQED 554


>KVI09280.1 protein of unknown function DUF23 [Cynara cardunculus var. scolymus]
          Length = 599

 Score =  728 bits (1879), Expect = 0.0
 Identities = 341/503 (67%), Positives = 409/503 (81%)
 Frame = -2

Query: 1731 HHPQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCI 1552
            +HP L+SSWRT   EAI  DSPA+  LSV ETV+FPDQ ++ LKYPPS+ L TKD+I+C+
Sbjct: 95   YHPSLISSWRTSPAEAIYSDSPASRGLSVGETVLFPDQVLILLKYPPSSRLLTKDEIECV 154

Query: 1551 YISPNSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSL 1372
            Y SPN+S+  L+ +P+SI  +YL HQIVRC + PRG+ V++ +K+ G+LP GPTH W SL
Sbjct: 155  YYSPNTSRPHLRSSPLSIGGQYLDHQIVRCGIPPRGTIVSVALKAQGDLPPGPTHEWKSL 214

Query: 1371 AYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVR 1192
            AYEA+IDRDNTTIVFVKGLNLRPGR SN SRF+CVYG DF+ RK  L S+VVSIAQEIVR
Sbjct: 215  AYEAMIDRDNTTIVFVKGLNLRPGRASNHSRFQCVYGSDFTNRKSRLKSEVVSIAQEIVR 274

Query: 1191 CKTPLSVLKQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTMLR 1012
            C+TP S+L     ++   IKVS+ V+G+GIL SIARP     PDPP   QH +CICTMLR
Sbjct: 275  CRTPSSLLGSP--HSRGSIKVSVNVIGKGILSSIARPEFLASPDPPVFIQHRVCICTMLR 332

Query: 1011 NQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQEAG 832
            NQARF+REWVMYH+RIGV+RWFIYDNNS+D+I+ V+E L      ITRH+WPWIKTQEAG
Sbjct: 333  NQARFLREWVMYHARIGVERWFIYDNNSDDEIEDVIESLANQGFEITRHLWPWIKTQEAG 392

Query: 831  FAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQVAELRTSCYSFGPSGL 652
            FAHC LRARD CEWV F+DVDEFLHL  G+ L ++++N++    VAELR SC++FGPSGL
Sbjct: 393  FAHCGLRARDLCEWVAFIDVDEFLHLRTGVHLGSIISNQTSRPDVAELRISCHNFGPSGL 452

Query: 651  SRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVMVI 472
               P +GVMVGYTCRL  PERHKSIV+PE LN TLINMVHHFDLRD   YVN+DRN+MVI
Sbjct: 453  KESPREGVMVGYTCRLRPPERHKSIVQPEKLNPTLINMVHHFDLRDGLNYVNIDRNLMVI 512

Query: 471  NHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEVTD 292
            NHYK+QVW  FKEKFYRRVATYV DWQ+E+NVGS+DRAPGLGTRA+EP DW+SRFCE+ D
Sbjct: 513  NHYKFQVWDEFKEKFYRRVATYVSDWQKEENVGSRDRAPGLGTRAIEPADWASRFCEIND 572

Query: 291  TGLRDQVLENFTDPKTGLLPWQE 223
            TGLRD V++ F+DP TGLLPWQ+
Sbjct: 573  TGLRDWVVKTFSDPNTGLLPWQQ 595


>GAV85513.1 Glyco_transf_92 domain-containing protein/zf-C3HC4_2
            domain-containing protein, partial [Cephalotus
            follicularis]
          Length = 900

 Score =  733 bits (1892), Expect = 0.0
 Identities = 350/495 (70%), Positives = 405/495 (81%), Gaps = 3/495 (0%)
 Frame = -2

Query: 1695 AMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYISPNSSQAQLK 1516
            ++++IS DSP  P +S+RETV+FPDQ + FL YPPS  LFTK+D+DC+Y+S NSSQ QLK
Sbjct: 54   SIKSISDDSPHLPAISIRETVVFPDQTLFFLNYPPSARLFTKEDLDCVYLSANSSQPQLK 113

Query: 1515 LAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLAYEALIDRDNTT 1336
              P  ID   +  QIVRCP  P GSTV++ +KS  ++P GPTH WDSLAYEALIDRDNTT
Sbjct: 114  RPPARIDGHDVDDQIVRCPAGPPGSTVSLGLKSRVHVPVGPTHRWDSLAYEALIDRDNTT 173

Query: 1335 IVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRCKTPLSVL--KQ 1162
            IVF KGLNLRP  VS+ASRFECVYGWDF R KFLL S+V+SIAQEIVRCKTPLSVL   Q
Sbjct: 174  IVFAKGLNLRPESVSDASRFECVYGWDFERTKFLLRSEVISIAQEIVRCKTPLSVLGNPQ 233

Query: 1161 RVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTMLRNQARFIREWV 982
            R+NN   P+KVS+RV GRG L SIA P  +M  DPP +K HEMC+CTM+RNQA+F+REWV
Sbjct: 234  RLNN---PVKVSVRVKGRGTLHSIATPWLRMGHDPPTRKAHEMCLCTMVRNQAKFLREWV 290

Query: 981  MYHSRIGVQRWFIYDNNSNDDIDKVVELLME-DNVNITRHVWPWIKTQEAGFAHCALRAR 805
            MYH+RIGVQRWFIYDNNS DDI  V++ L+  +  NI+RHVWPWIKTQE GFAHC LRAR
Sbjct: 291  MYHARIGVQRWFIYDNNSEDDIYNVIKSLVHAERKNISRHVWPWIKTQEGGFAHCELRAR 350

Query: 804  DSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQVAELRTSCYSFGPSGLSRVPIKGVM 625
            + CEWVGF+DVDEF  L   ++LH+VL N S    V E+R SCYSFGPSGL  VP +GVM
Sbjct: 351  ELCEWVGFIDVDEFFRLPSELSLHDVLRNASESDNVGEIRISCYSFGPSGLKHVPQQGVM 410

Query: 624  VGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVMVINHYKYQVWQ 445
            VGYTCR  A ERHKSIVRPEALNSTLIN+VHHF L + FEY+N+DR V+VINHYKYQVW+
Sbjct: 411  VGYTCRTAATERHKSIVRPEALNSTLINVVHHFHLSNGFEYMNVDRGVLVINHYKYQVWE 470

Query: 444  VFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEVTDTGLRDQVLE 265
            VFKEKFYRRVATYV DWQ E+NVGSKDRAPGLGTRAVEPPDWS+RFCEV DTGLRDQVLE
Sbjct: 471  VFKEKFYRRVATYVADWQDEQNVGSKDRAPGLGTRAVEPPDWSTRFCEVNDTGLRDQVLE 530

Query: 264  NFTDPKTGLLPWQEN 220
             F +P+T LLPWQ++
Sbjct: 531  RFANPQTHLLPWQDD 545


>B9SLR1.1 RecName: Full=Glycosyltransferase family 92 protein RCOM_0530710
            EEF35417.1 ubiquitin-protein ligase, putative [Ricinus
            communis]
          Length = 552

 Score =  716 bits (1847), Expect = 0.0
 Identities = 343/513 (66%), Positives = 412/513 (80%), Gaps = 13/513 (2%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEA----ISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDID 1558
            P++VS+WR PAMEA    +S +SPA P +S+RETV+ PDQ ++F+ YP S+ LFTK+D  
Sbjct: 40   PEIVSAWRQPAMEATTTIMSTNSPAKPSISIRETVMLPDQVLIFVNYPQSSRLFTKEDFS 99

Query: 1557 CIYISPNS---SQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGN-LPAGPT 1390
            C+Y S NS   S+ QLK  P  ID   + +QIVRCPL PRG +V++ +KS G  +  GPT
Sbjct: 100  CVYFSRNSTSLSETQLKKPPNQIDGTDVNNQIVRCPLNPRGFSVSLELKSGGGYINPGPT 159

Query: 1389 HHWDSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSI 1210
            H WDSL YEA+IDRDNTT+VFVKG NLR  R+ NAS+FECVYGWDF + KF+L S+V+SI
Sbjct: 160  HRWDSLVYEAMIDRDNTTVVFVKGFNLRADRIYNASKFECVYGWDFRKTKFVLRSNVISI 219

Query: 1209 AQEIVRCKTPLSVLKQRVNNNDNPIKVSIRVLGRGILKSIARPVRKM--DPDPP--AQKQ 1042
            AQEIVRC+TPLS+L  ++  N N IKVSIR+ G+G L SIARP  ++  DP+P    +K 
Sbjct: 220  AQEIVRCQTPLSILNNQLKVN-NAIKVSIRLKGKGTLHSIARPGVQLLTDPEPGLRGEKP 278

Query: 1041 HEMCICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHV 862
            HEMCICTMLRNQ RF++EWVMYHS+IGV+RWFIYDNNS DDID V+E L++   NI+RHV
Sbjct: 279  HEMCICTMLRNQGRFLKEWVMYHSQIGVERWFIYDNNSEDDIDSVIESLIDAKFNISRHV 338

Query: 861  WPWIKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESR-GRQVAELR 685
            WPW+K QEAGFAHCALRAR  CEWVGF+DVDEF HL  G+ L + + N+S  G  VAELR
Sbjct: 339  WPWVKAQEAGFAHCALRARGLCEWVGFIDVDEFFHLPTGLNLQDAVKNQSNSGNNVAELR 398

Query: 684  TSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFE 505
             SC+SFGPSGL  VP +GV VGYTCR+  PERHKSIV+PEALNSTLIN+VHHF LRD F 
Sbjct: 399  VSCHSFGPSGLKHVPAQGVTVGYTCRMMLPERHKSIVKPEALNSTLINVVHHFHLRDGFR 458

Query: 504  YVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPP 325
            YVN D+ ++VINHYKYQVW+VFKEKFYRRVATYV+DWQ E+NVGSKDRAPGLGTRAVEPP
Sbjct: 459  YVNADKGILVINHYKYQVWEVFKEKFYRRVATYVVDWQNEQNVGSKDRAPGLGTRAVEPP 518

Query: 324  DWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQ 226
            DWSSRFCEV+DTGLRD++L+NF DP T LLPWQ
Sbjct: 519  DWSSRFCEVSDTGLRDRILQNFLDPLTDLLPWQ 551


>XP_010111642.1 hypothetical protein L484_017669 [Morus notabilis] EXC31387.1
            hypothetical protein L484_017669 [Morus notabilis]
          Length = 583

 Score =  709 bits (1830), Expect = 0.0
 Identities = 347/526 (65%), Positives = 416/526 (79%), Gaps = 12/526 (2%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPATPV-LSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            P +V   R P  +AISG++P T   +S+RE V+ PDQA++FL YP  + LFTK+D+ C++
Sbjct: 49   PVVVLKCRAPPAKAISGETPGTDSSISIREVVMLPDQALVFLNYPSPSRLFTKEDLACVF 108

Query: 1548 ISPNSSQAQ---LKLAPVSIDR-EYLGHQIVRCPLQPRGSTVTITVKSNG-NLPAGPTHH 1384
             + +SS  +    K +P+S+D  E  G QIVRCP  PRG TV++ +K  G N  AGP H 
Sbjct: 109  YTTSSSATKPRRFKRSPISVDASEDSGVQIVRCPFTPRGFTVSVGLKPKGRNFRAGPRHE 168

Query: 1383 WDSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFS--RRKFLLSSDVVSI 1210
            WDSLAYEALIDRDNTT+VFVKGLNL+  RVS AS+FECVYGWDF+    KF+L S+V+SI
Sbjct: 169  WDSLAYEALIDRDNTTVVFVKGLNLKRERVSVASKFECVYGWDFTSPNPKFVLRSEVLSI 228

Query: 1209 AQEIVRCKTPLSVLK--QRVN-NNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQH 1039
            AQEIVRCKTPLSVL    R N +N N +KVS+R+ GR  LK++ARP  +  PDP  +KQH
Sbjct: 229  AQEIVRCKTPLSVLSGPHRPNISNHNSVKVSVRMKGRKTLKTVARPKIRHIPDPMTRKQH 288

Query: 1038 EMCICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVW 859
            EMC+CTM+RNQARF++EW+MYHS IGVQRW+IYDNNS+DD+D V E L   N N+TRH W
Sbjct: 289  EMCVCTMVRNQARFLKEWIMYHSEIGVQRWYIYDNNSDDDLDSVAESLYNSNYNVTRHAW 348

Query: 858  PWIKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESR-GRQVAELRT 682
            PW+KTQEAGFAHCALRARD+CEWVGF+DVDEF+HL  G+ LH+V+ N+++    VAELRT
Sbjct: 349  PWVKTQEAGFAHCALRARDTCEWVGFIDVDEFIHLPTGLFLHDVVANQTQYNNNVAELRT 408

Query: 681  SCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEY 502
            SCYSFGPSGL RVP  GVMVGYTCR+   ERHKS VRPEALNS+LIN+VHHF LRD FEY
Sbjct: 409  SCYSFGPSGLRRVPAHGVMVGYTCRVAIAERHKSFVRPEALNSSLINVVHHFHLRDGFEY 468

Query: 501  VNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPD 322
            VNMDR VMVINHYKYQVW+VFKEKFYRRVATYV DWQ ++NVGSKDRAPGLGTRAVEPPD
Sbjct: 469  VNMDRGVMVINHYKYQVWEVFKEKFYRRVATYVADWQDDQNVGSKDRAPGLGTRAVEPPD 528

Query: 321  WSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQENGFYVEHKKRKRR 184
            W +RFCEVTDTGLRD+VL  F DPKT LLPW++   Y  +K +  R
Sbjct: 529  WPNRFCEVTDTGLRDRVLRTFADPKTHLLPWEKG--YTNNKNKTER 572


>XP_006338653.1 PREDICTED: UPF0392 protein RCOM_0530710-like [Solanum tuberosum]
          Length = 550

 Score =  707 bits (1825), Expect = 0.0
 Identities = 342/508 (67%), Positives = 410/508 (80%), Gaps = 12/508 (2%)
 Frame = -2

Query: 1707 WRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYISPNSSQ 1528
            WR  A  A+ G  P  PV+   ETV FPD+ V+FLKYPPSTPLFTK D+ C+Y++PNSSQ
Sbjct: 49   WRPAATTALPGVKPIVPVI---ETVQFPDETVVFLKYPPSTPLFTKYDLYCLYLNPNSSQ 105

Query: 1527 AQLKLAPVSIDR-EYLGHQIVRCPL-QPRGSTVTITVKSNG-NLPAGPTHHWDSLAYEAL 1357
                  P S++  E LG Q+VRCPL +PRG   +++VKS G NL  GP+  W+SLAYEA+
Sbjct: 106  P-----PESVENDELLGQQLVRCPLIKPRGVLTSLSVKSTGYNLSIGPSRRWNSLAYEAM 160

Query: 1356 IDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRCKTPL 1177
            IDRDNTTIVFVKG NLR G+ S+AS+F+CVYGWD    KF+L SDVVSIAQE+VRC TP+
Sbjct: 161  IDRDNTTIVFVKGFNLRGGKQSDASKFKCVYGWDVKNPKFVLQSDVVSIAQEVVRCNTPV 220

Query: 1176 SVLKQ------RVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTML 1015
            S+L          +N + PIKVS+R++G+  L+SIARP R++  + P QKQHEMC+CTML
Sbjct: 221  SILNNPQRFVGTTSNMNQPIKVSVRMVGKEPLESIARPKRRLQLNLPGQKQHEMCVCTML 280

Query: 1014 RNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELL-MEDNVNITRHVWPWIKTQE 838
            RNQA F++EW+MYH+RIGVQRWFIYDNNS DDID V++LL M+ N+N+TRHVWPWIKTQE
Sbjct: 281  RNQASFLKEWIMYHTRIGVQRWFIYDNNSLDDIDDVIKLLSMDGNINVTRHVWPWIKTQE 340

Query: 837  AGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGR--QVAELRTSCYSFG 664
            AGFAHCALRARD CEWVGFMDVDEF HL  GM+L ++L N+SR    +VAELR SC++FG
Sbjct: 341  AGFAHCALRARDVCEWVGFMDVDEFFHLPTGMSLLDILRNQSRSSNSKVAELRVSCHNFG 400

Query: 663  PSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRN 484
            PSGL  VP +GV +GY CR+ APERHKSIV+PEALNSTLIN+VHHF L+ +F Y NMDRN
Sbjct: 401  PSGLKHVPTQGVTMGYNCRMIAPERHKSIVKPEALNSTLINVVHHFHLKSEFRYANMDRN 460

Query: 483  VMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFC 304
            V+VINHYKYQVW VFKEKFYRRVATYV DWQQ +NVGS+DRAPGLGTRAVEPPDWSSRFC
Sbjct: 461  VLVINHYKYQVWDVFKEKFYRRVATYVSDWQQHRNVGSRDRAPGLGTRAVEPPDWSSRFC 520

Query: 303  EVTDTGLRDQVLENFTDPKTGLLPWQEN 220
            EVTDTGL+D+V + FTDP TG LPWQ++
Sbjct: 521  EVTDTGLKDRVAQMFTDPNTGKLPWQKS 548


>XP_010029049.1 PREDICTED: glycosyltransferase family 92 protein RCOM_0530710
            [Eucalyptus grandis] KCW55894.1 hypothetical protein
            EUGRSUZ_I01697 [Eucalyptus grandis]
          Length = 905

 Score =  720 bits (1859), Expect = 0.0
 Identities = 339/502 (67%), Positives = 403/502 (80%), Gaps = 1/502 (0%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYI 1546
            P +V SWR P M A+SGDS   P +S+RE VIFPDQAV+FL+YPPS+ LFTK+DI C+Y+
Sbjct: 65   PVVVPSWRNPVMRAVSGDSEPAPSVSIREMVIFPDQAVIFLEYPPSSRLFTKEDIHCVYV 124

Query: 1545 -SPNSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLA 1369
             + +S++ +L+  P  +D E  G QIVRCPL   GS V++  +S  ++P GP+H WDSL 
Sbjct: 125  EAADSTKLRLRQPPAGVDGEDSGEQIVRCPLAENGSLVSLAFESKEHVPPGPSHRWDSLV 184

Query: 1368 YEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRC 1189
            Y+A+IDRDNTT++FVKGLNLRP RVS+ASRFECV+GWDF + +FLL ++V SIAQEIVRC
Sbjct: 185  YDAMIDRDNTTVIFVKGLNLRPERVSDASRFECVFGWDFGKPRFLLRTEVESIAQEIVRC 244

Query: 1188 KTPLSVLKQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPDPPAQKQHEMCICTMLRN 1009
            KTPLSV       N   +KVS+RV G   ++SIARP  +    P A+K H MCICTM+RN
Sbjct: 245  KTPLSVFSSP-QRNGTAVKVSVRVKGGRTMRSIARPGCRAALKPTARKLHNMCICTMVRN 303

Query: 1008 QARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQEAGF 829
            Q RFIREWVMYH+RIGVQRWFIYDNNS DD D V+E L   + NI+RH+WPWIKTQEAGF
Sbjct: 304  QGRFIREWVMYHARIGVQRWFIYDNNSEDDTDDVIESLAGSDYNISRHLWPWIKTQEAGF 363

Query: 828  AHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQVAELRTSCYSFGPSGLS 649
            AHCA+RARD+CEWVGF+DVDEF HL  G+ L ++L NESR R VAELR SC+SFGPSGL 
Sbjct: 364  AHCAMRARDACEWVGFIDVDEFFHLPTGLFLDDILRNESRPRNVAELRVSCHSFGPSGLK 423

Query: 648  RVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVMVIN 469
              P +GVM GYTCRL APERHKSIVRPEALNSTLIN+VHHF LR  F++ N+DR  MVIN
Sbjct: 424  HAPRQGVMAGYTCRLAAPERHKSIVRPEALNSTLINVVHHFHLRRGFDFANLDRGAMVIN 483

Query: 468  HYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEVTDT 289
            HYKYQVW+VFKEKFYRRVATYV DW++E+NVGSKDRAPGLGT+AVEPPDW SRFCEV DT
Sbjct: 484  HYKYQVWEVFKEKFYRRVATYVADWKEEQNVGSKDRAPGLGTKAVEPPDWPSRFCEVEDT 543

Query: 288  GLRDQVLENFTDPKTGLLPWQE 223
            GLRD+VL  F DP+T LLPWQE
Sbjct: 544  GLRDRVLHVFADPRTLLLPWQE 565


>KDP46347.1 hypothetical protein JCGZ_10187 [Jatropha curcas]
          Length = 572

 Score =  705 bits (1819), Expect = 0.0
 Identities = 341/529 (64%), Positives = 407/529 (76%), Gaps = 8/529 (1%)
 Frame = -2

Query: 1731 HHPQLVSSWRTPAMEAISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCI 1552
            + P++V +WR P++  I  DSP     S+RETV+ PDQ ++F KYP    LFTK+D DC+
Sbjct: 45   YRPEIVPAWRPPSLAVILSDSPVNSAFSIRETVMLPDQVLIFPKYPRLGRLFTKEDFDCV 104

Query: 1551 YISPN--SSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGN-LPAGPTHHW 1381
            Y   N  SS+ Q+K  P  ID + L  QIVRCPL PRG TV++ +KS+G  +   PTH W
Sbjct: 105  YFLANASSSEPQVKQPPNHIDGDDLDDQIVRCPLNPRGLTVSLALKSDGGYIQPRPTHRW 164

Query: 1380 DSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQE 1201
            DSL YEA+ID DNTTIVFVKGLNLRP ++ NAS+FECVYGWDF R KFL+ SDV+SIAQE
Sbjct: 165  DSLVYEAIIDSDNTTIVFVKGLNLRPDKLYNASKFECVYGWDFRRPKFLIGSDVISIAQE 224

Query: 1200 IVRCKTPLSVLKQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDP--DPPAQKQHEMCI 1027
            IVRC+TPLS+L   +  N N IKVSIR+ G+G L S+AR   K+    DP AQK HEMCI
Sbjct: 225  IVRCQTPLSILSNPLKVN-NSIKVSIRLKGKGTLHSVARLRLKLSSGSDPKAQKPHEMCI 283

Query: 1026 CTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIK 847
            CTMLRNQARF +EWVMYHS+IGVQRWFIYDNNS+DDI+ V+E L+  N NI+RHVWPW+K
Sbjct: 284  CTMLRNQARFFKEWVMYHSKIGVQRWFIYDNNSDDDINSVIESLVGSNFNISRHVWPWVK 343

Query: 846  TQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESRGRQ---VAELRTSC 676
            TQEAGFAHCALRAR  CEWVGF+DVDE+ +L  G+ LH  + N+S+      VAELR SC
Sbjct: 344  TQEAGFAHCALRARGLCEWVGFIDVDEYFYLPTGLDLHEAIRNQSQSSNRSNVAELRVSC 403

Query: 675  YSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVN 496
            +SFGPSGL+ VP +GV VGYTCR+   ERHKSIV+PEALNSTL+N+VHHF LRD F YV+
Sbjct: 404  HSFGPSGLTHVPKQGVTVGYTCRMVIRERHKSIVKPEALNSTLMNVVHHFHLRDGFSYVD 463

Query: 495  MDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWS 316
             D  V+VINHYKYQVW+VFKEKFYRRVATYV DWQ E+NVGSKDR PGLGTRAVEPPDW 
Sbjct: 464  ADMGVLVINHYKYQVWEVFKEKFYRRVATYVADWQNEQNVGSKDRTPGLGTRAVEPPDWP 523

Query: 315  SRFCEVTDTGLRDQVLENFTDPKTGLLPWQENGFYVEHKKRKRRMSRVL 169
            +RFCEVTDTGL+DQVL+ F DP T +LPWQE    ++ K   RR  + L
Sbjct: 524  NRFCEVTDTGLKDQVLQKFMDPATNILPWQEEQGVLDRKAISRRQLKSL 572


>XP_015579533.1 PREDICTED: LOW QUALITY PROTEIN: UPF0392 protein RCOM_0530710 [Ricinus
            communis]
          Length = 978

 Score =  717 bits (1850), Expect = 0.0
 Identities = 345/523 (65%), Positives = 415/523 (79%), Gaps = 13/523 (2%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEA----ISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDID 1558
            P++VS+WR PAMEA    +S +SPA P +S+RETV+ PDQ ++F+ YP S+ LFTK+D  
Sbjct: 40   PEIVSAWRQPAMEATTTIMSTNSPAKPSISIRETVMLPDQVLIFVNYPQSSRLFTKEDFS 99

Query: 1557 CIYISPNS---SQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGN-LPAGPT 1390
            C+Y S NS   S+ QLK  P  ID   + +QIVRCPL PRG +V++ +KS G  +  GPT
Sbjct: 100  CVYFSRNSTSLSETQLKKPPNQIDGTDVNNQIVRCPLNPRGFSVSLELKSGGGYINPGPT 159

Query: 1389 HHWDSLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSI 1210
            H WDSL YEA+IDRDNTT+VFVKG NLR  R+ NAS+FECVYGWDF + KF+L S+V+SI
Sbjct: 160  HRWDSLVYEAMIDRDNTTVVFVKGFNLRADRIYNASKFECVYGWDFRKTKFVLRSNVISI 219

Query: 1209 AQEIVRCKTPLSVLKQRVNNNDNPIKVSIRVLGRGILKSIARPVRKM--DPDPP--AQKQ 1042
            AQEIVRC+TPLS+L  ++  N N IKVSIR+ G+G L SIARP  ++  DP+P    +K 
Sbjct: 220  AQEIVRCQTPLSILNNQLKVN-NAIKVSIRLKGKGTLHSIARPGVQLLTDPEPGLRGEKP 278

Query: 1041 HEMCICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHV 862
            HEMCICTMLRNQ RF++EWVMYHS+IGV+RWFIYDNNS DDID V+E L++   NI+RHV
Sbjct: 279  HEMCICTMLRNQGRFLKEWVMYHSQIGVERWFIYDNNSEDDIDSVIESLIDAKFNISRHV 338

Query: 861  WPWIKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNNESR-GRQVAELR 685
            WPW+K QEAGFAHCALRAR  CEWVGF+DVDEF HL  G+ L + + N+S  G  VAELR
Sbjct: 339  WPWVKAQEAGFAHCALRARGLCEWVGFIDVDEFFHLPTGLNLQDAVKNQSNSGNNVAELR 398

Query: 684  TSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFE 505
             SC+SFGPSGL  VP +GV VGYTCR+  PERHKSIV+PEALNSTLIN+VHHF LRD F 
Sbjct: 399  VSCHSFGPSGLKHVPAQGVTVGYTCRMMLPERHKSIVKPEALNSTLINVVHHFHLRDGFR 458

Query: 504  YVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPP 325
            YVN D+ ++VINHYKYQVW+VFKEKFYRRVATYV+DWQ E+NVGSKDRAPGLGTRAVEPP
Sbjct: 459  YVNADKGILVINHYKYQVWEVFKEKFYRRVATYVVDWQNEQNVGSKDRAPGLGTRAVEPP 518

Query: 324  DWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQENGFYVEHKK 196
            DWSSRFCEV+DTGLRD++L+NF DP T LLPWQ    Y   K+
Sbjct: 519  DWSSRFCEVSDTGLRDRILQNFLDPLTDLLPWQIXDKYTNIKR 561


>OMO69085.1 hypothetical protein COLO4_29270 [Corchorus olitorius]
          Length = 585

 Score =  701 bits (1808), Expect = 0.0
 Identities = 342/524 (65%), Positives = 410/524 (78%), Gaps = 24/524 (4%)
 Frame = -2

Query: 1683 ISGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIYISPN--SSQAQLKLA 1510
            +S +    PVLS+RETV+ PD+ +LFLKYPPS   FTK+++ C+Y+S +  SS+ +LK  
Sbjct: 62   VSNEVAVIPVLSIRETVLLPDEVLLFLKYPPSVRFFTKEELVCVYLSADKSSSEPRLKQP 121

Query: 1509 PVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDSLAYEALIDRDNTTIV 1330
             V +D E+LG QIVRCP  PRG TVT+  KS G +PAGPTH WD+LAYEALID+DNTT+V
Sbjct: 122  AVGVDNEHLGEQIVRCPSCPRGMTVTVDSKSYGIIPAGPTHRWDTLAYEALIDKDNTTVV 181

Query: 1329 FVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIVRCKTPLSVL--KQRV 1156
            FVKGLNLRP R+SNASRFEC+YGWDF +   LL S+V+SIAQEIVRCKTPLS+L  +QRV
Sbjct: 182  FVKGLNLRPERISNASRFECMYGWDFMKLSLLLRSEVLSIAQEIVRCKTPLSILNGQQRV 241

Query: 1155 NNNDNPIKVSIRVLGRGILKSIARPVRKMDPDP---PAQKQHEMCICTMLRNQARFIREW 985
            N     IKVS+R+ G G L SIAR     +P P   P +K HEMCICTM RNQARF++EW
Sbjct: 242  NGT---IKVSVRIKGSGALPSIARLGHLGNPGPDPTPTRKPHEMCICTMARNQARFLKEW 298

Query: 984  VMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWIKTQEAGFAHCALRAR 805
            VMYH+RIGVQRW+IYDNNS+DD D V+E L + N NI+RH+WPWIKTQEAGFAHCALRAR
Sbjct: 299  VMYHARIGVQRWYIYDNNSDDDTDSVIESLFDANYNISRHLWPWIKTQEAGFAHCALRAR 358

Query: 804  DSCEWVGFMDVDEFLHLGDGMTLHNV-----------LNNESRGRQVAELRTSCYSFGPS 658
            D C+WVGF+DVDEF HL  G+ LH+V           LN +SR   + ELR SC+SFGPS
Sbjct: 359  DFCDWVGFIDVDEFFHLPSGLMLHDVIRNLTSTTSLKLNTQSRYIPIGELRVSCHSFGPS 418

Query: 657  GLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDFEYVNMDRNVM 478
            GL  VP +GV VGYTCR+ APERHKSIV+PEALNSTLIN+VHHF LRD F +V+++R +M
Sbjct: 419  GLKHVPQQGVTVGYTCRMMAPERHKSIVKPEALNSTLINVVHHFHLRDGFRFVDVNRTMM 478

Query: 477  VINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEPPDWSSRFCEV 298
            V+NHYK+QVW+VFKEKFYRRVATYV DWQ E+NVGSKDRAPGLGT+AVEP DWS+RFCEV
Sbjct: 479  VVNHYKFQVWEVFKEKFYRRVATYVADWQDEQNVGSKDRAPGLGTKAVEPVDWSNRFCEV 538

Query: 297  TDTGLRDQVLENFTDPKTGLLPWQE-----NGF-YVEHKKRKRR 184
             DTGL+D+VL+NF  P T LLPWQ+      GF  VE K RK +
Sbjct: 539  VDTGLKDRVLQNFAHPGTSLLPWQDANDVIKGFSSVEKKSRKSK 582


>XP_017975667.1 PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like
            [Theobroma cacao]
          Length = 571

 Score =  698 bits (1801), Expect = 0.0
 Identities = 339/520 (65%), Positives = 409/520 (78%), Gaps = 18/520 (3%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAI-SGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCI 1552
            HP L S+ +T  + A+ S + P  PVLS+RET++FPDQ +LFL+ PP   LFTK+++ CI
Sbjct: 46   HPVLGSTRQTSTINAVVSNEFPVVPVLSIRETILFPDQVILFLECPPRARLFTKEELLCI 105

Query: 1551 YISP--NSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWD 1378
            Y+S   NSS+ +LK  P  +D   +G QIV CP  PRG  VT+  KSNG +PAGP+H W 
Sbjct: 106  YLSADNNSSETRLKKPPARVDSRQVGEQIVWCPGCPRGLIVTVASKSNGVIPAGPSHRWG 165

Query: 1377 SLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEI 1198
            +LAYEALID+DNTT++FVKGLNLRP RVSNASRFECVYGWDFSR   +L S+V+SIAQEI
Sbjct: 166  TLAYEALIDKDNTTLLFVKGLNLRPERVSNASRFECVYGWDFSRLNLVLRSEVLSIAQEI 225

Query: 1197 VRCKTPLSVL--KQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPD---PPAQKQHEM 1033
            VRCKTPLSVL  +Q+VN    P+KVSIR+ G G+L S+AR    + P     P +K HEM
Sbjct: 226  VRCKTPLSVLNGQQKVNG---PVKVSIRIKGNGLLPSVARLGHLVGPGLQPVPTRKPHEM 282

Query: 1032 CICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPW 853
            CICTM RNQARF++EWVMYH++IGVQRW+IYDNNS+DD D V+E L + N NI+RH+WPW
Sbjct: 283  CICTMARNQARFLKEWVMYHAQIGVQRWYIYDNNSDDDTDSVIESLSDANYNISRHIWPW 342

Query: 852  IKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNN----------ESRGR 703
            IKTQEAGFAHCALR+R SC+WVGF+DVDEF HL  G+TL  VL N          E+   
Sbjct: 343  IKTQEAGFAHCALRSRSSCDWVGFIDVDEFFHLPSGLTLQAVLRNLSSRAPSLDTEAAHI 402

Query: 702  QVAELRTSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFD 523
             + ELR SCYSFGPSGL  VP +GVMVGYTCR+ APERHKSIVRPEALNSTLIN+VHHF 
Sbjct: 403  PIGELRVSCYSFGPSGLRHVPRQGVMVGYTCRMTAPERHKSIVRPEALNSTLINVVHHFH 462

Query: 522  LRDDFEYVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGT 343
            L   F ++++D+ +MV+NHYKYQVW+VFKEKFYRRVATYV DW+ E+NVGSKDRAPGLGT
Sbjct: 463  LMHGFRFLDVDKTMMVVNHYKYQVWEVFKEKFYRRVATYVADWKDEQNVGSKDRAPGLGT 522

Query: 342  RAVEPPDWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQE 223
            +AVEP DWSSRFCEV DTGLRD+VL+NF +PKT LLPWQ+
Sbjct: 523  KAVEPVDWSSRFCEVLDTGLRDRVLQNFVNPKTFLLPWQD 562


>EOY02628.1 UPF0392 protein RCOM_0530710 isoform 2 [Theobroma cacao]
          Length = 571

 Score =  697 bits (1798), Expect = 0.0
 Identities = 338/520 (65%), Positives = 409/520 (78%), Gaps = 18/520 (3%)
 Frame = -2

Query: 1728 HPQLVSSWRTPAMEAI-SGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCI 1552
            HP L S+ +T  M A+ S + P  PVLS+RET++FPDQ +LFL+ PP   LFTK+++ C+
Sbjct: 46   HPVLGSTRQTSTMNAVVSNEFPVVPVLSIRETILFPDQVILFLECPPRARLFTKEELLCV 105

Query: 1551 YISP--NSSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWD 1378
            Y+S   NSS+ +LK  P  +D   +G QIV CP  PRG  VT+  KSNG +PAGP+H W 
Sbjct: 106  YLSADNNSSETRLKKPPARVDSRQVGEQIVWCPGCPRGLIVTVASKSNGVIPAGPSHRWG 165

Query: 1377 SLAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEI 1198
            +LAYEALID+DNTT++FVKGLNLRP RVSNASRFECVYGWDFSR   +L S+V+SIAQEI
Sbjct: 166  TLAYEALIDKDNTTLLFVKGLNLRPERVSNASRFECVYGWDFSRLNLVLRSEVLSIAQEI 225

Query: 1197 VRCKTPLSVL--KQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDPD---PPAQKQHEM 1033
            VRCKTPLSVL  +Q+VN +   +KVSIR+ G G+L S+AR    + P     P +K HEM
Sbjct: 226  VRCKTPLSVLNCQQKVNGS---VKVSIRIKGNGLLPSVARLRHLVGPGLQPVPTRKPHEM 282

Query: 1032 CICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPW 853
            CICTM RNQARF++EWVMYH++IGVQRW+IYDNNS+DD D V+E L + N NI+RH+WPW
Sbjct: 283  CICTMARNQARFLKEWVMYHAQIGVQRWYIYDNNSDDDTDSVIESLSDANYNISRHIWPW 342

Query: 852  IKTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNN----------ESRGR 703
            IKTQEAGFAHCALR+R SC+WVGF+DVDEF HL  G+TL  VL N          E+   
Sbjct: 343  IKTQEAGFAHCALRSRSSCDWVGFIDVDEFFHLPSGLTLQAVLRNLSSRAPSLDTEAAHI 402

Query: 702  QVAELRTSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFD 523
             + ELR SCYSFGPSGL  VP +GVMVGYTCR+ APERHKSIVRPEALNSTLIN+VHHF 
Sbjct: 403  PIGELRVSCYSFGPSGLRHVPRQGVMVGYTCRMTAPERHKSIVRPEALNSTLINVVHHFH 462

Query: 522  LRDDFEYVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGT 343
            L   F ++++D+ +MV+NHYKYQVW+VFKEKFYRRVATYV DW+ E+NVGSKDRAPGLGT
Sbjct: 463  LMHGFRFLDVDKTMMVVNHYKYQVWEVFKEKFYRRVATYVADWKDEQNVGSKDRAPGLGT 522

Query: 342  RAVEPPDWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQE 223
            +AVEP DWSSRFCEV DTGLRD+VL+NF +PKT LLPWQ+
Sbjct: 523  KAVEPVDWSSRFCEVLDTGLRDRVLQNFVNPKTYLLPWQD 562


>XP_016683836.1 PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like
            [Gossypium hirsutum]
          Length = 585

 Score =  696 bits (1795), Expect = 0.0
 Identities = 342/533 (64%), Positives = 414/533 (77%), Gaps = 20/533 (3%)
 Frame = -2

Query: 1725 PQLVSSWRTPAMEAI-SGDSPATPVLSVRETVIFPDQAVLFLKYPPSTPLFTKDDIDCIY 1549
            P L  + +TP   AI SG+ P T +LSVRETV+ PDQ +LFLKYP S  LFTK++++C+Y
Sbjct: 47   PVLKFARKTPTANAIVSGEHPVTQILSVRETVLLPDQVILFLKYPRSARLFTKEELECVY 106

Query: 1548 ISPN--SSQAQLKLAPVSIDREYLGHQIVRCPLQPRGSTVTITVKSNGNLPAGPTHHWDS 1375
            +S +  S    LK +P  +D E L  QIVRCP  PRG  +T+  KSNG  PAGPTHHWDS
Sbjct: 107  LSVHNPSPVPWLKQSPARVDTEQLSEQIVRCPSGPRGLLITVGSKSNGVFPAGPTHHWDS 166

Query: 1374 LAYEALIDRDNTTIVFVKGLNLRPGRVSNASRFECVYGWDFSRRKFLLSSDVVSIAQEIV 1195
            LAYEAL+D DNTT+VFVKGLNLRP RVSNASRFECVYG DFSR K +L S+V+SIAQEIV
Sbjct: 167  LAYEALVDNDNTTVVFVKGLNLRPERVSNASRFECVYGEDFSRLKLVLRSEVLSIAQEIV 226

Query: 1194 RCKTPLSVL--KQRVNNNDNPIKVSIRVLGRGILKSIARPVRKMDP--DP-PAQKQHEMC 1030
            RC+TPL VL  K++VN++   +KVS+R+ G G L SIAR     D   DP P +K HE C
Sbjct: 227  RCRTPLIVLNSKKKVNSS---VKVSVRIKGGGTLPSIARLGFLSDSGRDPLPTRKPHETC 283

Query: 1029 ICTMLRNQARFIREWVMYHSRIGVQRWFIYDNNSNDDIDKVVELLMEDNVNITRHVWPWI 850
            ICTM RNQARF++EWVMYH+ IGVQRW+IYDNNS+DD D+V+E L     NI+RH+WPW+
Sbjct: 284  ICTMARNQARFLKEWVMYHALIGVQRWYIYDNNSDDDTDQVIESLFNAGYNISRHIWPWV 343

Query: 849  KTQEAGFAHCALRARDSCEWVGFMDVDEFLHLGDGMTLHNVLNN-----ESRGR-QVAEL 688
            KTQE GF+HCALRA+ SCEW+GF+DVDEFLHL  G+ LH+V++N      S G   + E+
Sbjct: 344  KTQEGGFSHCALRAKGSCEWIGFIDVDEFLHLPSGLFLHDVISNLTSITTSFGNISIGEI 403

Query: 687  RTSCYSFGPSGLSRVPIKGVMVGYTCRLGAPERHKSIVRPEALNSTLINMVHHFDLRDDF 508
            R SCYSFGPSGL R+P +GV VGYTCRL  PERHKSIV+PEALNSTLIN+VHHF LRDDF
Sbjct: 404  RVSCYSFGPSGLKRIPKQGVTVGYTCRLVVPERHKSIVKPEALNSTLINIVHHFHLRDDF 463

Query: 507  EYVNMDRNVMVINHYKYQVWQVFKEKFYRRVATYVIDWQQEKNVGSKDRAPGLGTRAVEP 328
             ++++DR +MV+NHYKYQVW+VFK+KFYRRVATYV DW+ E+NVGSKDRAPGLGTRAVEP
Sbjct: 464  RFIDVDRRMMVVNHYKYQVWEVFKQKFYRRVATYVADWKDEQNVGSKDRAPGLGTRAVEP 523

Query: 327  PDWSSRFCEVTDTGLRDQVLENFTDPKTGLLPWQENG------FYVEHKKRKR 187
             DWSSRFCEVTDTGL+D+VL+NF +PK  LLPWQ         ++VE K   +
Sbjct: 524  VDWSSRFCEVTDTGLKDRVLQNFANPKNSLLPWQTTSNKKAGPWFVEEKNTSK 576

