BLASTX nr result

ID: Rehmannia31_contig00019032 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00019032
         (706 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011071049.1| glycosyltransferase family 92 protein RCOM_0...   224   4e-66
gb|PIN24926.1| hypothetical protein CDL12_02314 [Handroanthus im...   217   1e-63
ref|XP_012855282.1| PREDICTED: UPF0392 protein RCOM_0530710-like...   187   2e-52
gb|KZV24649.1| hypothetical protein F511_37363 [Dorcoceras hygro...   186   2e-51
ref|XP_022868348.1| glycosyltransferase family 92 protein RCOM_0...   176   7e-48
ref|XP_022869016.1| glycosyltransferase family 92 protein RCOM_0...   167   4e-47
ref|XP_022856133.1| glycosyltransferase family 92 protein RCOM_0...   172   2e-46
gb|PIN18035.1| hypothetical protein CDL12_09291 [Handroanthus im...   171   6e-46
ref|XP_011086443.1| glycosyltransferase family 92 protein RCOM_0...   167   1e-44
ref|XP_016501070.1| PREDICTED: glycosyltransferase family 92 pro...   154   1e-39
ref|XP_009779733.1| PREDICTED: UPF0392 protein RCOM_0530710 [Nic...   154   1e-39
ref|XP_016451148.1| PREDICTED: glycosyltransferase family 92 pro...   152   3e-39
emb|CDP00239.1| unnamed protein product [Coffea canephora]            152   5e-39
ref|XP_009602227.1| PREDICTED: glycosyltransferase family 92 pro...   151   1e-38
ref|XP_019254339.1| PREDICTED: glycosyltransferase family 92 pro...   148   1e-37
ref|XP_012847676.1| PREDICTED: UPF0392 protein RCOM_0530710-like...   147   3e-37
ref|XP_018807521.1| PREDICTED: glycosyltransferase family 92 pro...   138   8e-36
ref|XP_006357222.1| PREDICTED: UPF0392 protein RCOM_0530710 [Sol...   142   2e-35
ref|XP_004238737.1| PREDICTED: glycosyltransferase family 92 pro...   141   3e-35
ref|XP_015074620.1| PREDICTED: UPF0392 protein RCOM_0530710 [Sol...   140   1e-34

>ref|XP_011071049.1| glycosyltransferase family 92 protein RCOM_0530710 [Sesamum
           indicum]
          Length = 611

 Score =  224 bits (571), Expect = 4e-66
 Identities = 132/233 (56%), Positives = 144/233 (61%), Gaps = 5/233 (2%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRPXXXX 506
           MD SDQRRKRKRI                  +C             + F +SA+RP    
Sbjct: 1   MDSSDQRRKRKRILRQSYSPRHFLSVRSLV-MCFSFFTSLCLLSYTTPFRSSAYRPVQVV 59

Query: 505 XXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN----QELECVY 338
                     SR VK+VQ FDG   PLKIE+RV FPDHVLLLV GGK      +ELEC+Y
Sbjct: 60  SSLSLLSSSISRKVKSVQDFDGSLLPLKIESRVLFPDHVLLLVGGGKMEKSGAEELECIY 119

Query: 337 YGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GKNGILGEKN 161
           YGK      G  DF KK V SVDEYDG  SIVRCPLP VNYSSVV+LK  GKNGILGEK 
Sbjct: 120 YGKFDSIDSG--DFEKKNVFSVDEYDGFRSIVRCPLPPVNYSSVVNLKRRGKNGILGEKV 177

Query: 160 EFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2
              V +QTVN WENVAY A LDGD+ +VFVKGLNLR  RESDPS FSCHFGLG
Sbjct: 178 GLRVDSQTVNSWENVAYEATLDGDTAIVFVKGLNLRPDRESDPSQFSCHFGLG 230


>gb|PIN24926.1| hypothetical protein CDL12_02314 [Handroanthus impetiginosus]
          Length = 592

 Score =  217 bits (553), Expect = 1e-63
 Identities = 129/232 (55%), Positives = 145/232 (62%), Gaps = 4/232 (1%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRPXXXX 506
           MD SDQRRKRKRI                  +C             + F +SA RP    
Sbjct: 1   MDSSDQRRKRKRILRQSYFHLHLSSVRSLV-VCFSLLTFLYLLSYTNPFTSSATRPVLVE 59

Query: 505 XXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN----QELECVY 338
                     SRTVK+VQ FD L  PLKIENRV FPDHVLLLVSG KK+     +LECVY
Sbjct: 60  SSLSLVSSSSSRTVKSVQDFDDLLLPLKIENRVLFPDHVLLLVSGVKKDWLMKNKLECVY 119

Query: 337 YGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKLGKNGILGEKNE 158
           YGK G+F  GNF   +K  + VDEYD   SIVRCPLPSVNYSSVV+LKL KNGI  E N 
Sbjct: 120 YGKFGRF-SGNF--ARKNALFVDEYDVFRSIVRCPLPSVNYSSVVNLKL-KNGIFKENNG 175

Query: 157 FLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2
             +RNQT N WENVAY A +DG++ VVFVKGL+LRAGRESDP   +CHFGLG
Sbjct: 176 IWMRNQTANSWENVAYEATVDGETAVVFVKGLHLRAGRESDPGQLTCHFGLG 227


>ref|XP_012855282.1| PREDICTED: UPF0392 protein RCOM_0530710-like [Erythranthe guttata]
 gb|EYU22455.1| hypothetical protein MIMGU_mgv1a003356mg [Erythranthe guttata]
          Length = 590

 Score =  187 bits (476), Expect = 2e-52
 Identities = 123/238 (51%), Positives = 138/238 (57%), Gaps = 10/238 (4%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPA-SAFRPXXX 509
           MD  DQRRKRKR                  LIC             + F +   FRP   
Sbjct: 1   MDSLDQRRKRKR-SFRQYPNLQLLLSVRSLLICFSFLIFLYLFSYTNPFSSIPVFRPVLV 59

Query: 508 XXXXXXXXXXXSRTVKAVQVFDGL-SQPLKIENRVFFPDHVLLLVSGGKKNQ----ELEC 344
                      S TVK+VQ  D L S PLKIE+RV FPDHVLLLV GG KN+    E EC
Sbjct: 60  VSSLSLLSTSISTTVKSVQDSDSLKSPPLKIESRVLFPDHVLLLVGGGSKNRFLKNEFEC 119

Query: 343 VYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGE 167
           VY          N  FGKKKV+S DEYD   S++RCPLPS+NYS+ V L ++ KN +L E
Sbjct: 120 VYN------HVKNGGFGKKKVVSFDEYDEFRSVLRCPLPSLNYSNAVKLQRVDKNRVLAE 173

Query: 166 K-NEFLVRNQTVNYWENVAYAAALDGDS--VVVFVKGLNLRAGRESDPSPFSCHFGLG 2
           K N FL RNQTV  WEN+AY AALDGD+   VVFVKGLNLR GRESDPS FSCHFG G
Sbjct: 174 KSNGFLRRNQTVKSWENIAYEAALDGDTDTAVVFVKGLNLRPGRESDPSQFSCHFGFG 231


>gb|KZV24649.1| hypothetical protein F511_37363 [Dorcoceras hygrometricum]
          Length = 612

 Score =  186 bits (471), Expect = 2e-51
 Identities = 118/242 (48%), Positives = 139/242 (57%), Gaps = 15/242 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL----ICXXXXXXXXXXXXXSHFPASAFRP 518
           MD SDQRRKRKR                  +    +C               F +SAF P
Sbjct: 1   MDSSDQRRKRKRALRQPYPSFCCCPSYLLSVRFLGLCLGFLTFFYLLVSTVPFNSSAFHP 60

Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN------- 359
                         SRTVK+V  FDG + P +IE RV FPDHVLLLVSGGKK+       
Sbjct: 61  VLVVSSLSLLSSSSSRTVKSVLDFDGFAFPWRIEERVMFPDHVLLLVSGGKKDGLMNTIG 120

Query: 358 -QELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GK 185
              LECVY    G       D   K V+ +DE+D L S+VRCPLPSVNYS++V+L++ GK
Sbjct: 121 VTGLECVYVRDTGS------DLEMKGVILMDEFDDLRSVVRCPLPSVNYSALVTLRVSGK 174

Query: 184 NGILGEKNE-FLVR-NQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11
           NGIL E N  FLV  NQTV  WE VAY+A LDGD+VVVFVKGLNLR  +ESDPS ++CHF
Sbjct: 175 NGILREANNGFLVNYNQTVYSWEKVAYSATLDGDTVVVFVKGLNLRGNKESDPSLYTCHF 234

Query: 10  GL 5
           GL
Sbjct: 235 GL 236


>ref|XP_022868348.1| glycosyltransferase family 92 protein RCOM_0530710-like [Olea
           europaea var. sylvestris]
          Length = 617

 Score =  176 bits (446), Expect = 7e-48
 Identities = 100/171 (58%), Positives = 119/171 (69%), Gaps = 17/171 (9%)
 Frame = -1

Query: 463 KAVQVFDG-LSQPLKIENRVFFPDHVLLLVSGGKKN-------QELECVYYGKPGKFRKG 308
           K +Q FDG ++ P KIENRV FPDHVLLLVS  KK+       +ELECVYYGK       
Sbjct: 72  KRIQEFDGFITFPWKIENRVLFPDHVLLLVSSLKKDGLIKKSAKELECVYYGKTVLEGNS 131

Query: 307 NFDFG--------KKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGEKNEF 155
           N   G        K+ V+SVDEYD   SIVRCPLPS+NYSSVV+L + GK+G+L E    
Sbjct: 132 NGSSGERDPSILVKQNVLSVDEYDECRSIVRCPLPSMNYSSVVNLGRRGKSGVLEEDTGS 191

Query: 154 LVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2
           L+  QTV+ WENV YAA+LDG++ VVFVKGLNLR+ RES+P  FSCHFGLG
Sbjct: 192 LMNIQTVHSWENVVYAASLDGNTAVVFVKGLNLRSDRESNPRKFSCHFGLG 242


>ref|XP_022869016.1| glycosyltransferase family 92 protein RCOM_0530710-like, partial
           [Olea europaea var. sylvestris]
          Length = 324

 Score =  167 bits (424), Expect = 4e-47
 Identities = 99/190 (52%), Positives = 118/190 (62%), Gaps = 10/190 (5%)
 Frame = -1

Query: 541 FPASAFRPXXXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKK 362
           F +SAFRP                + K++Q F  L  P+KIENRV FPD  +LLV+ G+K
Sbjct: 51  FNSSAFRPVLVVSRLS-------NSAKSIQDFHKLLMPIKIENRVLFPDDFMLLVADGRK 103

Query: 361 N--------QELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSV 206
           N         ELECVYY K       N    K+ V+SVDEYD   SIVRCP+PSVNYS+V
Sbjct: 104 NGFIKKGIIAELECVYYRKNVL---DNAIVDKENVLSVDEYDEFRSIVRCPIPSVNYSTV 160

Query: 205 VSLKLG-KNGILGEKNEF-LVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDP 32
           V+L+   KN  L E+N   +  NQTV+ WENV Y A LDG++VVVF KGLNLR  RESDP
Sbjct: 161 VNLRWRYKNEKLREENGLRMTENQTVHSWENVTYGAVLDGETVVVFAKGLNLRPARESDP 220

Query: 31  SPFSCHFGLG 2
             FSCHFGLG
Sbjct: 221 GQFSCHFGLG 230


>ref|XP_022856133.1| glycosyltransferase family 92 protein RCOM_0530710-like [Olea
           europaea var. sylvestris]
          Length = 619

 Score =  172 bits (436), Expect = 2e-46
 Identities = 96/169 (56%), Positives = 115/169 (68%), Gaps = 16/169 (9%)
 Frame = -1

Query: 463 KAVQVFD-GLSQPLKIENRVFFPDHVLLLVSGGKKN-------QELECVYYGKPGKFRKG 308
           K +Q FD   + P KIENRV FPDH+LLLVS  KK+       +ELEC YYGK     +G
Sbjct: 72  KRIQEFDDSFTFPWKIENRVLFPDHILLLVSSLKKDGLIKKSAKELECFYYGKNVLESRG 131

Query: 307 NFD-------FGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGEKNEFL 152
                       K+ V+SVDEYD   SI+RCPLPSVNYS+VV+L + GKNG+L E     
Sbjct: 132 GSSGERDRDILVKQNVLSVDEYDEFRSIMRCPLPSVNYSTVVNLGRGGKNGVLEEDTGLW 191

Query: 151 VRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5
           + NQTV+ WEN+ YAA+LDGD+ VVFVKGLNLR+ RESDP  FSCHFGL
Sbjct: 192 MSNQTVHSWENLVYAASLDGDTAVVFVKGLNLRSDRESDPGQFSCHFGL 240


>gb|PIN18035.1| hypothetical protein CDL12_09291 [Handroanthus impetiginosus]
          Length = 604

 Score =  171 bits (432), Expect = 6e-46
 Identities = 113/239 (47%), Positives = 132/239 (55%), Gaps = 11/239 (4%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL---ICXXXXXXXXXXXXXSHFPASAFRPX 515
           M  SDQRRKRKR                      +C               F +SAFRP 
Sbjct: 1   MKSSDQRRKRKRFLRQSDSPLFSVLHLLSVRSLVLCFSFLIFLYLLSYTVPFSSSAFRPV 60

Query: 514 XXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLV---SGG--KKNQ-- 356
                          TVK+VQ F G   PLKIE+RV FPDHVLL+V    GG  KK+   
Sbjct: 61  LVVSSLSLLSSSSYSTVKSVQDFGGFLLPLKIESRVLFPDHVLLMVRKNDGGILKKSDFD 120

Query: 355 ELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKLG-KNG 179
           +LECVYYGK       + +F  ++++S DEYD   SIVRCPLPSVNYS+ V+L    KNG
Sbjct: 121 DLECVYYGKNAN---DSDNFVTQQLLSWDEYDEFRSIVRCPLPSVNYSAEVNLSSNDKNG 177

Query: 178 ILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2
           IL E    +V + +   WE VAYAA LD  +VVVFVKGLNLRA RESDPS FSCHFG G
Sbjct: 178 ILREDKNGMVNSCS---WEKVAYAATLDKGTVVVFVKGLNLRADRESDPSQFSCHFGFG 233


>ref|XP_011086443.1| glycosyltransferase family 92 protein RCOM_0530710-like [Sesamum
           indicum]
          Length = 620

 Score =  167 bits (424), Expect = 1e-44
 Identities = 116/241 (48%), Positives = 133/241 (55%), Gaps = 16/241 (6%)
 Frame = -1

Query: 676 SDQRRKRKRIXXXXXXXXXXXXXXXXXL---ICXXXXXXXXXXXXXSHFPASAFRPXXXX 506
           SDQRRKRKRI                     +C               F +SAF P    
Sbjct: 5   SDQRRKRKRILRQSDSSLLSLPHLLSVRSLVLCFSFLTFLYLLSRTLPFASSAFHPVLVV 64

Query: 505 XXXXXXXXXXSRTV-KAVQVFDGLSQPLKIENRVFFPDHVLLLVSGG---KKN--QELEC 344
                     S T  ++VQ F     PL+IENRV FPDHVLLLV      KK+   ELEC
Sbjct: 65  SSLSLLSSSSSSTTAQSVQDFGSFLYPLRIENRVLFPDHVLLLVKNDGLLKKSVVDELEC 124

Query: 343 VYYGKP--GKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GKNGIL 173
           VY G          + DF   KV+S DEYD   S+VRCPLPSVNYS+  +L+  G+NG+ 
Sbjct: 125 VYSGVTVLSTSNGSSGDFSVLKVLSFDEYDEFRSVVRCPLPSVNYSADANLRWSGRNGVF 184

Query: 172 --GEKNEFLVRNQTVNY--WENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5
             GEK  +L  N+ VN   WENVAYAAALDG +VVVFVKGLNLRA RESDPS FSCHFGL
Sbjct: 185 AGGEKGLWL-DNRRVNSCSWENVAYAAALDGGTVVVFVKGLNLRADRESDPSQFSCHFGL 243

Query: 4   G 2
           G
Sbjct: 244 G 244


>ref|XP_016501070.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like
           [Nicotiana tabacum]
          Length = 614

 Score =  154 bits (388), Expect = 1e-39
 Identities = 107/244 (43%), Positives = 125/244 (51%), Gaps = 16/244 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524
           MD S+QRRKRKRI                        +C               F +S F
Sbjct: 1   MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60

Query: 523 RPXXXXXXXXXXXXXXS--RTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359
           RP                  +  + Q F     PL+IE RV FPDHVLLL+       K+
Sbjct: 61  RPVLVVSSLSLLSSSSDLSSSYASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKSSLLGKS 120

Query: 358 QELECVYYGKPGKFRKGNFD-FGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLG 188
            ELECVY     + R    D F K+K  SVDEY D    +VRCPLP  NYS+VV+L K  
Sbjct: 121 TELECVY----ARNRSVEGDIFVKEKAFSVDEYGDENGMLVRCPLPPANYSAVVNLRKFR 176

Query: 187 KNGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14
            NG++    +N     NQTVN WENVAYAA LDG++ VVFVKGLNLR  RESDPS FSCH
Sbjct: 177 GNGVVNMEAENGIWKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236

Query: 13  FGLG 2
           FGLG
Sbjct: 237 FGLG 240


>ref|XP_009779733.1| PREDICTED: UPF0392 protein RCOM_0530710 [Nicotiana sylvestris]
          Length = 614

 Score =  154 bits (388), Expect = 1e-39
 Identities = 107/244 (43%), Positives = 125/244 (51%), Gaps = 16/244 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524
           MD S+QRRKRKRI                        +C               F +S F
Sbjct: 1   MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60

Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359
           RP                +  +   Q F     PL+IE RV FPDHVLLL+       K+
Sbjct: 61  RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKSSLLGKS 120

Query: 358 QELECVYYGKPGKFRKGNFD-FGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLG 188
            ELECVY     + R    D F K+K  SVDEY D    +VRCPLP  NYS+VV+L K  
Sbjct: 121 TELECVY----ARNRSVEGDIFVKEKAFSVDEYGDENGMLVRCPLPPANYSAVVNLRKFR 176

Query: 187 KNGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14
            NG++    +N     NQTVN WENVAYAA LDG++ VVFVKGLNLR  RESDPS FSCH
Sbjct: 177 GNGVVNMEAENGIWKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236

Query: 13  FGLG 2
           FGLG
Sbjct: 237 FGLG 240


>ref|XP_016451148.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like
           [Nicotiana tabacum]
          Length = 614

 Score =  152 bits (385), Expect = 3e-39
 Identities = 107/244 (43%), Positives = 126/244 (51%), Gaps = 16/244 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524
           MD S+QRRKRKRI                        +C               F +S F
Sbjct: 1   MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60

Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359
           RP                +  +   Q F     PL+IE RV FPDHVLLLV       K+
Sbjct: 61  RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLVKKNTLLGKS 120

Query: 358 QELECVYYGKPGKFRKGNFDFG-KKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSLKLGK 185
            ELECVY     + R    D   K+K  SVD+Y D    +VRCPLP  NYS+VV+LK  +
Sbjct: 121 TELECVY----ARNRSVEGDIVVKEKAFSVDDYGDENGMLVRCPLPPANYSAVVNLKKFR 176

Query: 184 -NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14
            NG++    +N  L  NQTVN WENVAYAA LDG++ VVFVKGLNLR  RESDPS FSCH
Sbjct: 177 GNGVVDMEAENGILKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236

Query: 13  FGLG 2
           FGLG
Sbjct: 237 FGLG 240


>emb|CDP00239.1| unnamed protein product [Coffea canephora]
          Length = 621

 Score =  152 bits (384), Expect = 5e-39
 Identities = 108/252 (42%), Positives = 131/252 (51%), Gaps = 24/252 (9%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLI--------CXXXXXXXXXXXXXSHFPAS 530
           MD SDQRRKRKR+                  +        C             +   +S
Sbjct: 1   MDSSDQRRKRKRLLRTSAASQTSYLLVLPSHLFSVRSLLLCFTFFTFLYLLSYTAKIHSS 60

Query: 529 AFRPXXXXXXXXXXXXXXSRTVKAVQVFDGL-SQPLKIENRVFFPDHVLLLVSGG---KK 362
            FRP                +  +VQ FD L S P KIE+RV  PDH+LLLV      K+
Sbjct: 61  VFRPVLVVSSLSLLSS----SSDSVQHFDKLISLPFKIEDRVLLPDHILLLVKNNGTVKR 116

Query: 361 NQELECVYYGKPGKFRKGNFD--------FGKKKVMSVDEYDGLSSIVRCPLPSVNYSSV 206
           NQEL+CVY+       + N D          K  V+SVDE D    IVRCPLP VNYS+V
Sbjct: 117 NQELDCVYWRSIVSEGR-NIDGLEARQSLVAKLNVLSVDENDEFRLIVRCPLPPVNYSAV 175

Query: 205 VSL-KLGKNGI--LGEKNEFL-VRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRES 38
           V+L +  +NGI  LG++N    + NQ+V+ WE VAY A LDGD+ VVFVKGLNLR  RES
Sbjct: 176 VNLQRRWRNGIENLGDENGLRGISNQSVHKWERVAYTATLDGDTAVVFVKGLNLRQQRES 235

Query: 37  DPSPFSCHFGLG 2
           DP  FSCHFGLG
Sbjct: 236 DPRQFSCHFGLG 247


>ref|XP_009602227.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710
           [Nicotiana tomentosiformis]
          Length = 614

 Score =  151 bits (381), Expect = 1e-38
 Identities = 106/244 (43%), Positives = 126/244 (51%), Gaps = 16/244 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524
           MD S++RRKRKRI                        +C               F +S F
Sbjct: 1   MDSSEKRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60

Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359
           RP                +  +   Q F     PL+IE RV FPDHVLLLV       K+
Sbjct: 61  RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLVKKNTLLGKS 120

Query: 358 QELECVYYGKPGKFRKGNFDFG-KKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSLKLGK 185
            ELECVY     + R    D   K+K  SVD+Y D    +VRCPLP  NYS+VV+LK  +
Sbjct: 121 TELECVY----ARNRSVEGDIVVKEKAFSVDDYGDENGMLVRCPLPPANYSAVVNLKKFR 176

Query: 184 -NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14
            NG++    +N  L  NQTVN WENVAYAA LDG++ VVFVKGLNLR  RESDPS FSCH
Sbjct: 177 GNGVVDMEAENGILKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236

Query: 13  FGLG 2
           FGLG
Sbjct: 237 FGLG 240


>ref|XP_019254339.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710
           [Nicotiana attenuata]
 gb|OIS97653.1| glycosyltransferase family 92 protein [Nicotiana attenuata]
          Length = 611

 Score =  148 bits (374), Expect = 1e-37
 Identities = 104/243 (42%), Positives = 122/243 (50%), Gaps = 15/243 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524
           MD S+QRRKRKRI                        +C               F +S F
Sbjct: 1   MDSSEQRRKRKRIFRPSSPTVFLPYSLKYFFSVRSLVLCFSFLIFIFLLSYQIPFGSSVF 60

Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359
           RP                +  +   Q F     PL+IE RV FPDHVLLL+       KN
Sbjct: 61  RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKNTLLGKN 120

Query: 358 QELECVYYGKPGKFRKGNFDFGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLGK 185
            ELECVY       R  +    K+K  S+D+Y D    IVRCPLP  NYS+VV+L K   
Sbjct: 121 TELECVYA------RNTSDIVVKEKAFSMDDYGDENGMIVRCPLPPANYSAVVNLMKFKG 174

Query: 184 NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11
           NG++    +N     NQT N WENVAYAA LDG++ VVFVKGLNLR  RESDPS FSCHF
Sbjct: 175 NGVVDMEAENGIWKSNQTDNSWENVAYAAMLDGNTAVVFVKGLNLRPQRESDPSQFSCHF 234

Query: 10  GLG 2
           GLG
Sbjct: 235 GLG 237


>ref|XP_012847676.1| PREDICTED: UPF0392 protein RCOM_0530710-like [Erythranthe guttata]
 gb|EYU28667.1| hypothetical protein MIMGU_mgv1a003136mg [Erythranthe guttata]
          Length = 605

 Score =  147 bits (371), Expect = 3e-37
 Identities = 83/150 (55%), Positives = 103/150 (68%), Gaps = 4/150 (2%)
 Frame = -1

Query: 442 GLSQPLKIENRVFFPDHVLLLVSG-GKKNQELECVYYGKPGKFRKGNFDFGKKKVMSVDE 266
           G   PL+IENRV FPDHVLLLV+       ELECVY GK       + +F  KKV+SVD+
Sbjct: 87  GFPFPLQIENRVLFPDHVLLLVAAENTATDELECVYSGKT---LFSSLNFSVKKVLSVDK 143

Query: 265 YDGLSSIVRCPLPSVNYSSVVSLKL-GKNGILGEKNEFLVRNQTVNY--WENVAYAAALD 95
           YD   S++RCPLPS NYS+  +L+  GKN I      F   N+T+N   W+N++YAAALD
Sbjct: 144 YDDFRSVIRCPLPSANYSADPNLRRRGKNRI------FFRSNRTLNSCSWDNLSYAAALD 197

Query: 94  GDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5
           G +V+VFVKGLNLRA +ESDPS F+CHF L
Sbjct: 198 GGTVIVFVKGLNLRADKESDPSGFTCHFWL 227


>ref|XP_018807521.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like,
           partial [Juglans regia]
          Length = 322

 Score =  138 bits (348), Expect = 8e-36
 Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 7/150 (4%)
 Frame = -1

Query: 430 PLKIENRVFFPDHVLLLVSGG-KKNQELECVYY----GKPGKFR-KGNFDFGKKKVMSVD 269
           PL++E+RV FPDH+LLLVS    +++ELECVY     G     R K + + G + V+S +
Sbjct: 83  PLRVEDRVLFPDHLLLLVSNKLHESEELECVYSTFLNGSGELLRDKRSQEVGIRPVLSTE 142

Query: 268 EYDGLSSIVRCPLPSVNYSSV-VSLKLGKNGILGEKNEFLVRNQTVNYWENVAYAAALDG 92
            YD L SIVRCPLP VNYS+  V L+  ++G  G     +  NQTV +WE +AY A LDG
Sbjct: 143 PYDALRSIVRCPLPPVNYSTAAVDLRRLRSGEAGHYEWSVGANQTVYWWEKLAYEAVLDG 202

Query: 91  DSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2
           D+V VFVKGLNLR  ++SDP+ F CHFG G
Sbjct: 203 DTVAVFVKGLNLRPHKKSDPTQFRCHFGFG 232


>ref|XP_006357222.1| PREDICTED: UPF0392 protein RCOM_0530710 [Solanum tuberosum]
          Length = 603

 Score =  142 bits (357), Expect = 2e-35
 Identities = 107/241 (44%), Positives = 125/241 (51%), Gaps = 13/241 (5%)
 Frame = -1

Query: 685 MDYSDQRRKRKRI--XXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRP-- 518
           MD S+QRRKRKRI                   ++C               F +S FRP  
Sbjct: 1   MDSSEQRRKRKRIFRPSSPTATLVYFFSVRSFVLCFSFLIFLFLLSYQIPFSSSVFRPVL 60

Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLS-QPLKIENRVFFPDHVLLLVSGG---KKNQEL 350
                           T ++ Q F   S   L+IE RV FPDHVLLLV+      KN   
Sbjct: 61  VVSRLSLLSSSSDFLSTSQSFQDFGSSSLLHLQIEGRVLFPDHVLLLVNKNDLFSKNTNF 120

Query: 349 ECVYYGKPGKFRKGNFDFG--KKKVMSVDEY--DGLSSIVRCPLPSVNYSSVVSL-KLGK 185
           ECVY    G+   G+ D G  K+K  SVD Y       +VRCPLP VNYS+VV+L K   
Sbjct: 121 ECVY----GRNSTGD-DVGVVKEKSFSVDVYGESEFGVLVRCPLPPVNYSAVVNLRKFRG 175

Query: 184 NGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5
           NG+  E       NQTVN WENVAYAA LDG++VVVFVKGLNLR  RESD S FSC+FGL
Sbjct: 176 NGLWKES------NQTVNSWENVAYAATLDGNTVVVFVKGLNLRPDRESDSSQFSCYFGL 229

Query: 4   G 2
           G
Sbjct: 230 G 230


>ref|XP_004238737.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710
           [Solanum lycopersicum]
          Length = 600

 Score =  141 bits (356), Expect = 3e-35
 Identities = 107/242 (44%), Positives = 124/242 (51%), Gaps = 14/242 (5%)
 Frame = -1

Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLI-CXXXXXXXXXXXXXSHFPASAFRP--X 515
           MD S+QRRKRKRI                  + C               F +S FRP   
Sbjct: 1   MDSSEQRRKRKRILRPSSISALVYFFSVRSFVLCFSFLIFVFLLSYHIPFSSSVFRPVLV 60

Query: 514 XXXXXXXXXXXXXSRTVKAVQVFDGLSQP---LKIENRVFFPDHVLLLVSGGK---KNQE 353
                          T  + Q F   S     L+IE RV FPDHVLL V+  +   KN E
Sbjct: 61  VSRLSLLSSSSDLLSTSLSFQDFGSSSSSLLHLQIEGRVLFPDHVLLFVNKNELFSKNTE 120

Query: 352 LECVYYGKPGKFRKGNFDFG--KKKVMSVDEYDG--LSSIVRCPLPSVNYSSVVSLK-LG 188
            ECVY    G+   G+ D G  K+K  SVD Y      + VRCPLP VNYS+VV+L+ L 
Sbjct: 121 FECVY----GRNSTGD-DVGIVKEKSYSVDVYGDFEFGAFVRCPLPRVNYSAVVNLRELR 175

Query: 187 KNGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFG 8
            NG           NQTVN WENVAYAAALDG++VVVFVKGLNLR  RESD S FSC+FG
Sbjct: 176 GNG----------SNQTVNSWENVAYAAALDGNTVVVFVKGLNLRPDRESDSSQFSCYFG 225

Query: 7   LG 2
           LG
Sbjct: 226 LG 227


>ref|XP_015074620.1| PREDICTED: UPF0392 protein RCOM_0530710 [Solanum pennellii]
          Length = 601

 Score =  140 bits (352), Expect = 1e-34
 Identities = 107/243 (44%), Positives = 125/243 (51%), Gaps = 15/243 (6%)
 Frame = -1

Query: 685 MDYSDQRRKRKRI--XXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRP-- 518
           MD S+QRRKRKRI                   ++C               F +S FRP  
Sbjct: 1   MDSSEQRRKRKRIFRPSSSISALVYFFSVRSFVLCFSFLIFVFLLSYHIPFSSSVFRPVL 60

Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLSQP---LKIENRVFFPDHVLLLVSGG---KKNQ 356
                           T  + Q F   S     L+IE RV FPDHVLLLV+      KN 
Sbjct: 61  VVSRLSLLSSSSDLLSTSLSFQDFGSSSSSLLHLQIEGRVLFPDHVLLLVNKNDLFSKNT 120

Query: 355 ELECVYYGKPGKFRKGNFDFG--KKKVMSVDEYDG--LSSIVRCPLPSVNYSSVVSLK-L 191
           E ECVY    G+   G+ D G  K+K  SVD Y      + VRCPLP VNYS+VV+L+ L
Sbjct: 121 EFECVY----GRNSTGD-DVGIVKEKSYSVDVYGDFEFGAFVRCPLPRVNYSAVVNLREL 175

Query: 190 GKNGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11
             NG           NQTVN WE+VAYAAALDG++VVVFVKGLNLR  RESD S FSC+F
Sbjct: 176 RGNG----------SNQTVNSWESVAYAAALDGNTVVVFVKGLNLRPDRESDSSQFSCYF 225

Query: 10  GLG 2
           GLG
Sbjct: 226 GLG 228