BLASTX nr result
ID: Rehmannia31_contig00019032
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia31_contig00019032 (706 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011071049.1| glycosyltransferase family 92 protein RCOM_0... 224 4e-66 gb|PIN24926.1| hypothetical protein CDL12_02314 [Handroanthus im... 217 1e-63 ref|XP_012855282.1| PREDICTED: UPF0392 protein RCOM_0530710-like... 187 2e-52 gb|KZV24649.1| hypothetical protein F511_37363 [Dorcoceras hygro... 186 2e-51 ref|XP_022868348.1| glycosyltransferase family 92 protein RCOM_0... 176 7e-48 ref|XP_022869016.1| glycosyltransferase family 92 protein RCOM_0... 167 4e-47 ref|XP_022856133.1| glycosyltransferase family 92 protein RCOM_0... 172 2e-46 gb|PIN18035.1| hypothetical protein CDL12_09291 [Handroanthus im... 171 6e-46 ref|XP_011086443.1| glycosyltransferase family 92 protein RCOM_0... 167 1e-44 ref|XP_016501070.1| PREDICTED: glycosyltransferase family 92 pro... 154 1e-39 ref|XP_009779733.1| PREDICTED: UPF0392 protein RCOM_0530710 [Nic... 154 1e-39 ref|XP_016451148.1| PREDICTED: glycosyltransferase family 92 pro... 152 3e-39 emb|CDP00239.1| unnamed protein product [Coffea canephora] 152 5e-39 ref|XP_009602227.1| PREDICTED: glycosyltransferase family 92 pro... 151 1e-38 ref|XP_019254339.1| PREDICTED: glycosyltransferase family 92 pro... 148 1e-37 ref|XP_012847676.1| PREDICTED: UPF0392 protein RCOM_0530710-like... 147 3e-37 ref|XP_018807521.1| PREDICTED: glycosyltransferase family 92 pro... 138 8e-36 ref|XP_006357222.1| PREDICTED: UPF0392 protein RCOM_0530710 [Sol... 142 2e-35 ref|XP_004238737.1| PREDICTED: glycosyltransferase family 92 pro... 141 3e-35 ref|XP_015074620.1| PREDICTED: UPF0392 protein RCOM_0530710 [Sol... 140 1e-34 >ref|XP_011071049.1| glycosyltransferase family 92 protein RCOM_0530710 [Sesamum indicum] Length = 611 Score = 224 bits (571), Expect = 4e-66 Identities = 132/233 (56%), Positives = 144/233 (61%), Gaps = 5/233 (2%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRPXXXX 506 MD SDQRRKRKRI +C + F +SA+RP Sbjct: 1 MDSSDQRRKRKRILRQSYSPRHFLSVRSLV-MCFSFFTSLCLLSYTTPFRSSAYRPVQVV 59 Query: 505 XXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN----QELECVY 338 SR VK+VQ FDG PLKIE+RV FPDHVLLLV GGK +ELEC+Y Sbjct: 60 SSLSLLSSSISRKVKSVQDFDGSLLPLKIESRVLFPDHVLLLVGGGKMEKSGAEELECIY 119 Query: 337 YGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GKNGILGEKN 161 YGK G DF KK V SVDEYDG SIVRCPLP VNYSSVV+LK GKNGILGEK Sbjct: 120 YGKFDSIDSG--DFEKKNVFSVDEYDGFRSIVRCPLPPVNYSSVVNLKRRGKNGILGEKV 177 Query: 160 EFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2 V +QTVN WENVAY A LDGD+ +VFVKGLNLR RESDPS FSCHFGLG Sbjct: 178 GLRVDSQTVNSWENVAYEATLDGDTAIVFVKGLNLRPDRESDPSQFSCHFGLG 230 >gb|PIN24926.1| hypothetical protein CDL12_02314 [Handroanthus impetiginosus] Length = 592 Score = 217 bits (553), Expect = 1e-63 Identities = 129/232 (55%), Positives = 145/232 (62%), Gaps = 4/232 (1%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRPXXXX 506 MD SDQRRKRKRI +C + F +SA RP Sbjct: 1 MDSSDQRRKRKRILRQSYFHLHLSSVRSLV-VCFSLLTFLYLLSYTNPFTSSATRPVLVE 59 Query: 505 XXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN----QELECVY 338 SRTVK+VQ FD L PLKIENRV FPDHVLLLVSG KK+ +LECVY Sbjct: 60 SSLSLVSSSSSRTVKSVQDFDDLLLPLKIENRVLFPDHVLLLVSGVKKDWLMKNKLECVY 119 Query: 337 YGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKLGKNGILGEKNE 158 YGK G+F GNF +K + VDEYD SIVRCPLPSVNYSSVV+LKL KNGI E N Sbjct: 120 YGKFGRF-SGNF--ARKNALFVDEYDVFRSIVRCPLPSVNYSSVVNLKL-KNGIFKENNG 175 Query: 157 FLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2 +RNQT N WENVAY A +DG++ VVFVKGL+LRAGRESDP +CHFGLG Sbjct: 176 IWMRNQTANSWENVAYEATVDGETAVVFVKGLHLRAGRESDPGQLTCHFGLG 227 >ref|XP_012855282.1| PREDICTED: UPF0392 protein RCOM_0530710-like [Erythranthe guttata] gb|EYU22455.1| hypothetical protein MIMGU_mgv1a003356mg [Erythranthe guttata] Length = 590 Score = 187 bits (476), Expect = 2e-52 Identities = 123/238 (51%), Positives = 138/238 (57%), Gaps = 10/238 (4%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPA-SAFRPXXX 509 MD DQRRKRKR LIC + F + FRP Sbjct: 1 MDSLDQRRKRKR-SFRQYPNLQLLLSVRSLLICFSFLIFLYLFSYTNPFSSIPVFRPVLV 59 Query: 508 XXXXXXXXXXXSRTVKAVQVFDGL-SQPLKIENRVFFPDHVLLLVSGGKKNQ----ELEC 344 S TVK+VQ D L S PLKIE+RV FPDHVLLLV GG KN+ E EC Sbjct: 60 VSSLSLLSTSISTTVKSVQDSDSLKSPPLKIESRVLFPDHVLLLVGGGSKNRFLKNEFEC 119 Query: 343 VYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGE 167 VY N FGKKKV+S DEYD S++RCPLPS+NYS+ V L ++ KN +L E Sbjct: 120 VYN------HVKNGGFGKKKVVSFDEYDEFRSVLRCPLPSLNYSNAVKLQRVDKNRVLAE 173 Query: 166 K-NEFLVRNQTVNYWENVAYAAALDGDS--VVVFVKGLNLRAGRESDPSPFSCHFGLG 2 K N FL RNQTV WEN+AY AALDGD+ VVFVKGLNLR GRESDPS FSCHFG G Sbjct: 174 KSNGFLRRNQTVKSWENIAYEAALDGDTDTAVVFVKGLNLRPGRESDPSQFSCHFGFG 231 >gb|KZV24649.1| hypothetical protein F511_37363 [Dorcoceras hygrometricum] Length = 612 Score = 186 bits (471), Expect = 2e-51 Identities = 118/242 (48%), Positives = 139/242 (57%), Gaps = 15/242 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL----ICXXXXXXXXXXXXXSHFPASAFRP 518 MD SDQRRKRKR + +C F +SAF P Sbjct: 1 MDSSDQRRKRKRALRQPYPSFCCCPSYLLSVRFLGLCLGFLTFFYLLVSTVPFNSSAFHP 60 Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKKN------- 359 SRTVK+V FDG + P +IE RV FPDHVLLLVSGGKK+ Sbjct: 61 VLVVSSLSLLSSSSSRTVKSVLDFDGFAFPWRIEERVMFPDHVLLLVSGGKKDGLMNTIG 120 Query: 358 -QELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GK 185 LECVY G D K V+ +DE+D L S+VRCPLPSVNYS++V+L++ GK Sbjct: 121 VTGLECVYVRDTGS------DLEMKGVILMDEFDDLRSVVRCPLPSVNYSALVTLRVSGK 174 Query: 184 NGILGEKNE-FLVR-NQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11 NGIL E N FLV NQTV WE VAY+A LDGD+VVVFVKGLNLR +ESDPS ++CHF Sbjct: 175 NGILREANNGFLVNYNQTVYSWEKVAYSATLDGDTVVVFVKGLNLRGNKESDPSLYTCHF 234 Query: 10 GL 5 GL Sbjct: 235 GL 236 >ref|XP_022868348.1| glycosyltransferase family 92 protein RCOM_0530710-like [Olea europaea var. sylvestris] Length = 617 Score = 176 bits (446), Expect = 7e-48 Identities = 100/171 (58%), Positives = 119/171 (69%), Gaps = 17/171 (9%) Frame = -1 Query: 463 KAVQVFDG-LSQPLKIENRVFFPDHVLLLVSGGKKN-------QELECVYYGKPGKFRKG 308 K +Q FDG ++ P KIENRV FPDHVLLLVS KK+ +ELECVYYGK Sbjct: 72 KRIQEFDGFITFPWKIENRVLFPDHVLLLVSSLKKDGLIKKSAKELECVYYGKTVLEGNS 131 Query: 307 NFDFG--------KKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGEKNEF 155 N G K+ V+SVDEYD SIVRCPLPS+NYSSVV+L + GK+G+L E Sbjct: 132 NGSSGERDPSILVKQNVLSVDEYDECRSIVRCPLPSMNYSSVVNLGRRGKSGVLEEDTGS 191 Query: 154 LVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2 L+ QTV+ WENV YAA+LDG++ VVFVKGLNLR+ RES+P FSCHFGLG Sbjct: 192 LMNIQTVHSWENVVYAASLDGNTAVVFVKGLNLRSDRESNPRKFSCHFGLG 242 >ref|XP_022869016.1| glycosyltransferase family 92 protein RCOM_0530710-like, partial [Olea europaea var. sylvestris] Length = 324 Score = 167 bits (424), Expect = 4e-47 Identities = 99/190 (52%), Positives = 118/190 (62%), Gaps = 10/190 (5%) Frame = -1 Query: 541 FPASAFRPXXXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGKK 362 F +SAFRP + K++Q F L P+KIENRV FPD +LLV+ G+K Sbjct: 51 FNSSAFRPVLVVSRLS-------NSAKSIQDFHKLLMPIKIENRVLFPDDFMLLVADGRK 103 Query: 361 N--------QELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSV 206 N ELECVYY K N K+ V+SVDEYD SIVRCP+PSVNYS+V Sbjct: 104 NGFIKKGIIAELECVYYRKNVL---DNAIVDKENVLSVDEYDEFRSIVRCPIPSVNYSTV 160 Query: 205 VSLKLG-KNGILGEKNEF-LVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDP 32 V+L+ KN L E+N + NQTV+ WENV Y A LDG++VVVF KGLNLR RESDP Sbjct: 161 VNLRWRYKNEKLREENGLRMTENQTVHSWENVTYGAVLDGETVVVFAKGLNLRPARESDP 220 Query: 31 SPFSCHFGLG 2 FSCHFGLG Sbjct: 221 GQFSCHFGLG 230 >ref|XP_022856133.1| glycosyltransferase family 92 protein RCOM_0530710-like [Olea europaea var. sylvestris] Length = 619 Score = 172 bits (436), Expect = 2e-46 Identities = 96/169 (56%), Positives = 115/169 (68%), Gaps = 16/169 (9%) Frame = -1 Query: 463 KAVQVFD-GLSQPLKIENRVFFPDHVLLLVSGGKKN-------QELECVYYGKPGKFRKG 308 K +Q FD + P KIENRV FPDH+LLLVS KK+ +ELEC YYGK +G Sbjct: 72 KRIQEFDDSFTFPWKIENRVLFPDHILLLVSSLKKDGLIKKSAKELECFYYGKNVLESRG 131 Query: 307 NFD-------FGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSL-KLGKNGILGEKNEFL 152 K+ V+SVDEYD SI+RCPLPSVNYS+VV+L + GKNG+L E Sbjct: 132 GSSGERDRDILVKQNVLSVDEYDEFRSIMRCPLPSVNYSTVVNLGRGGKNGVLEEDTGLW 191 Query: 151 VRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5 + NQTV+ WEN+ YAA+LDGD+ VVFVKGLNLR+ RESDP FSCHFGL Sbjct: 192 MSNQTVHSWENLVYAASLDGDTAVVFVKGLNLRSDRESDPGQFSCHFGL 240 >gb|PIN18035.1| hypothetical protein CDL12_09291 [Handroanthus impetiginosus] Length = 604 Score = 171 bits (432), Expect = 6e-46 Identities = 113/239 (47%), Positives = 132/239 (55%), Gaps = 11/239 (4%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL---ICXXXXXXXXXXXXXSHFPASAFRPX 515 M SDQRRKRKR +C F +SAFRP Sbjct: 1 MKSSDQRRKRKRFLRQSDSPLFSVLHLLSVRSLVLCFSFLIFLYLLSYTVPFSSSAFRPV 60 Query: 514 XXXXXXXXXXXXXSRTVKAVQVFDGLSQPLKIENRVFFPDHVLLLV---SGG--KKNQ-- 356 TVK+VQ F G PLKIE+RV FPDHVLL+V GG KK+ Sbjct: 61 LVVSSLSLLSSSSYSTVKSVQDFGGFLLPLKIESRVLFPDHVLLMVRKNDGGILKKSDFD 120 Query: 355 ELECVYYGKPGKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKLG-KNG 179 +LECVYYGK + +F ++++S DEYD SIVRCPLPSVNYS+ V+L KNG Sbjct: 121 DLECVYYGKNAN---DSDNFVTQQLLSWDEYDEFRSIVRCPLPSVNYSAEVNLSSNDKNG 177 Query: 178 ILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2 IL E +V + + WE VAYAA LD +VVVFVKGLNLRA RESDPS FSCHFG G Sbjct: 178 ILREDKNGMVNSCS---WEKVAYAATLDKGTVVVFVKGLNLRADRESDPSQFSCHFGFG 233 >ref|XP_011086443.1| glycosyltransferase family 92 protein RCOM_0530710-like [Sesamum indicum] Length = 620 Score = 167 bits (424), Expect = 1e-44 Identities = 116/241 (48%), Positives = 133/241 (55%), Gaps = 16/241 (6%) Frame = -1 Query: 676 SDQRRKRKRIXXXXXXXXXXXXXXXXXL---ICXXXXXXXXXXXXXSHFPASAFRPXXXX 506 SDQRRKRKRI +C F +SAF P Sbjct: 5 SDQRRKRKRILRQSDSSLLSLPHLLSVRSLVLCFSFLTFLYLLSRTLPFASSAFHPVLVV 64 Query: 505 XXXXXXXXXXSRTV-KAVQVFDGLSQPLKIENRVFFPDHVLLLVSGG---KKN--QELEC 344 S T ++VQ F PL+IENRV FPDHVLLLV KK+ ELEC Sbjct: 65 SSLSLLSSSSSSTTAQSVQDFGSFLYPLRIENRVLFPDHVLLLVKNDGLLKKSVVDELEC 124 Query: 343 VYYGKP--GKFRKGNFDFGKKKVMSVDEYDGLSSIVRCPLPSVNYSSVVSLKL-GKNGIL 173 VY G + DF KV+S DEYD S+VRCPLPSVNYS+ +L+ G+NG+ Sbjct: 125 VYSGVTVLSTSNGSSGDFSVLKVLSFDEYDEFRSVVRCPLPSVNYSADANLRWSGRNGVF 184 Query: 172 --GEKNEFLVRNQTVNY--WENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5 GEK +L N+ VN WENVAYAAALDG +VVVFVKGLNLRA RESDPS FSCHFGL Sbjct: 185 AGGEKGLWL-DNRRVNSCSWENVAYAAALDGGTVVVFVKGLNLRADRESDPSQFSCHFGL 243 Query: 4 G 2 G Sbjct: 244 G 244 >ref|XP_016501070.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like [Nicotiana tabacum] Length = 614 Score = 154 bits (388), Expect = 1e-39 Identities = 107/244 (43%), Positives = 125/244 (51%), Gaps = 16/244 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524 MD S+QRRKRKRI +C F +S F Sbjct: 1 MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60 Query: 523 RPXXXXXXXXXXXXXXS--RTVKAVQVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359 RP + + Q F PL+IE RV FPDHVLLL+ K+ Sbjct: 61 RPVLVVSSLSLLSSSSDLSSSYASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKSSLLGKS 120 Query: 358 QELECVYYGKPGKFRKGNFD-FGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLG 188 ELECVY + R D F K+K SVDEY D +VRCPLP NYS+VV+L K Sbjct: 121 TELECVY----ARNRSVEGDIFVKEKAFSVDEYGDENGMLVRCPLPPANYSAVVNLRKFR 176 Query: 187 KNGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14 NG++ +N NQTVN WENVAYAA LDG++ VVFVKGLNLR RESDPS FSCH Sbjct: 177 GNGVVNMEAENGIWKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236 Query: 13 FGLG 2 FGLG Sbjct: 237 FGLG 240 >ref|XP_009779733.1| PREDICTED: UPF0392 protein RCOM_0530710 [Nicotiana sylvestris] Length = 614 Score = 154 bits (388), Expect = 1e-39 Identities = 107/244 (43%), Positives = 125/244 (51%), Gaps = 16/244 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524 MD S+QRRKRKRI +C F +S F Sbjct: 1 MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60 Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359 RP + + Q F PL+IE RV FPDHVLLL+ K+ Sbjct: 61 RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKSSLLGKS 120 Query: 358 QELECVYYGKPGKFRKGNFD-FGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLG 188 ELECVY + R D F K+K SVDEY D +VRCPLP NYS+VV+L K Sbjct: 121 TELECVY----ARNRSVEGDIFVKEKAFSVDEYGDENGMLVRCPLPPANYSAVVNLRKFR 176 Query: 187 KNGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14 NG++ +N NQTVN WENVAYAA LDG++ VVFVKGLNLR RESDPS FSCH Sbjct: 177 GNGVVNMEAENGIWKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236 Query: 13 FGLG 2 FGLG Sbjct: 237 FGLG 240 >ref|XP_016451148.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like [Nicotiana tabacum] Length = 614 Score = 152 bits (385), Expect = 3e-39 Identities = 107/244 (43%), Positives = 126/244 (51%), Gaps = 16/244 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524 MD S+QRRKRKRI +C F +S F Sbjct: 1 MDSSEQRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60 Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359 RP + + Q F PL+IE RV FPDHVLLLV K+ Sbjct: 61 RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLVKKNTLLGKS 120 Query: 358 QELECVYYGKPGKFRKGNFDFG-KKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSLKLGK 185 ELECVY + R D K+K SVD+Y D +VRCPLP NYS+VV+LK + Sbjct: 121 TELECVY----ARNRSVEGDIVVKEKAFSVDDYGDENGMLVRCPLPPANYSAVVNLKKFR 176 Query: 184 -NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14 NG++ +N L NQTVN WENVAYAA LDG++ VVFVKGLNLR RESDPS FSCH Sbjct: 177 GNGVVDMEAENGILKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236 Query: 13 FGLG 2 FGLG Sbjct: 237 FGLG 240 >emb|CDP00239.1| unnamed protein product [Coffea canephora] Length = 621 Score = 152 bits (384), Expect = 5e-39 Identities = 108/252 (42%), Positives = 131/252 (51%), Gaps = 24/252 (9%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLI--------CXXXXXXXXXXXXXSHFPAS 530 MD SDQRRKRKR+ + C + +S Sbjct: 1 MDSSDQRRKRKRLLRTSAASQTSYLLVLPSHLFSVRSLLLCFTFFTFLYLLSYTAKIHSS 60 Query: 529 AFRPXXXXXXXXXXXXXXSRTVKAVQVFDGL-SQPLKIENRVFFPDHVLLLVSGG---KK 362 FRP + +VQ FD L S P KIE+RV PDH+LLLV K+ Sbjct: 61 VFRPVLVVSSLSLLSS----SSDSVQHFDKLISLPFKIEDRVLLPDHILLLVKNNGTVKR 116 Query: 361 NQELECVYYGKPGKFRKGNFD--------FGKKKVMSVDEYDGLSSIVRCPLPSVNYSSV 206 NQEL+CVY+ + N D K V+SVDE D IVRCPLP VNYS+V Sbjct: 117 NQELDCVYWRSIVSEGR-NIDGLEARQSLVAKLNVLSVDENDEFRLIVRCPLPPVNYSAV 175 Query: 205 VSL-KLGKNGI--LGEKNEFL-VRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRES 38 V+L + +NGI LG++N + NQ+V+ WE VAY A LDGD+ VVFVKGLNLR RES Sbjct: 176 VNLQRRWRNGIENLGDENGLRGISNQSVHKWERVAYTATLDGDTAVVFVKGLNLRQQRES 235 Query: 37 DPSPFSCHFGLG 2 DP FSCHFGLG Sbjct: 236 DPRQFSCHFGLG 247 >ref|XP_009602227.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710 [Nicotiana tomentosiformis] Length = 614 Score = 151 bits (381), Expect = 1e-38 Identities = 106/244 (43%), Positives = 126/244 (51%), Gaps = 16/244 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524 MD S++RRKRKRI +C F +S F Sbjct: 1 MDSSEKRRKRKRIFRPSSPTAFLPYSLKYFFSVRSLVLCFSFLIFLFLLSYQIPFGSSVF 60 Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359 RP + + Q F PL+IE RV FPDHVLLLV K+ Sbjct: 61 RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLVKKNTLLGKS 120 Query: 358 QELECVYYGKPGKFRKGNFDFG-KKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSLKLGK 185 ELECVY + R D K+K SVD+Y D +VRCPLP NYS+VV+LK + Sbjct: 121 TELECVY----ARNRSVEGDIVVKEKAFSVDDYGDENGMLVRCPLPPANYSAVVNLKKFR 176 Query: 184 -NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCH 14 NG++ +N L NQTVN WENVAYAA LDG++ VVFVKGLNLR RESDPS FSCH Sbjct: 177 GNGVVDMEAENGILKSNQTVNSWENVAYAATLDGNTAVVFVKGLNLRPQRESDPSQFSCH 236 Query: 13 FGLG 2 FGLG Sbjct: 237 FGLG 240 >ref|XP_019254339.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710 [Nicotiana attenuata] gb|OIS97653.1| glycosyltransferase family 92 protein [Nicotiana attenuata] Length = 611 Score = 148 bits (374), Expect = 1e-37 Identities = 104/243 (42%), Positives = 122/243 (50%), Gaps = 15/243 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXL------ICXXXXXXXXXXXXXSHFPASAF 524 MD S+QRRKRKRI +C F +S F Sbjct: 1 MDSSEQRRKRKRIFRPSSPTVFLPYSLKYFFSVRSLVLCFSFLIFIFLLSYQIPFGSSVF 60 Query: 523 RPXXXXXXXXXXXXXXSRTVKAV--QVFDGLSQPLKIENRVFFPDHVLLLVSGGK---KN 359 RP + + Q F PL+IE RV FPDHVLLL+ KN Sbjct: 61 RPVLVVSSLSLLSSSSDLSSSSASFQDFGSFLLPLQIEGRVLFPDHVLLLIKKNTLLGKN 120 Query: 358 QELECVYYGKPGKFRKGNFDFGKKKVMSVDEY-DGLSSIVRCPLPSVNYSSVVSL-KLGK 185 ELECVY R + K+K S+D+Y D IVRCPLP NYS+VV+L K Sbjct: 121 TELECVYA------RNTSDIVVKEKAFSMDDYGDENGMIVRCPLPPANYSAVVNLMKFKG 174 Query: 184 NGILG--EKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11 NG++ +N NQT N WENVAYAA LDG++ VVFVKGLNLR RESDPS FSCHF Sbjct: 175 NGVVDMEAENGIWKSNQTDNSWENVAYAAMLDGNTAVVFVKGLNLRPQRESDPSQFSCHF 234 Query: 10 GLG 2 GLG Sbjct: 235 GLG 237 >ref|XP_012847676.1| PREDICTED: UPF0392 protein RCOM_0530710-like [Erythranthe guttata] gb|EYU28667.1| hypothetical protein MIMGU_mgv1a003136mg [Erythranthe guttata] Length = 605 Score = 147 bits (371), Expect = 3e-37 Identities = 83/150 (55%), Positives = 103/150 (68%), Gaps = 4/150 (2%) Frame = -1 Query: 442 GLSQPLKIENRVFFPDHVLLLVSG-GKKNQELECVYYGKPGKFRKGNFDFGKKKVMSVDE 266 G PL+IENRV FPDHVLLLV+ ELECVY GK + +F KKV+SVD+ Sbjct: 87 GFPFPLQIENRVLFPDHVLLLVAAENTATDELECVYSGKT---LFSSLNFSVKKVLSVDK 143 Query: 265 YDGLSSIVRCPLPSVNYSSVVSLKL-GKNGILGEKNEFLVRNQTVNY--WENVAYAAALD 95 YD S++RCPLPS NYS+ +L+ GKN I F N+T+N W+N++YAAALD Sbjct: 144 YDDFRSVIRCPLPSANYSADPNLRRRGKNRI------FFRSNRTLNSCSWDNLSYAAALD 197 Query: 94 GDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5 G +V+VFVKGLNLRA +ESDPS F+CHF L Sbjct: 198 GGTVIVFVKGLNLRADKESDPSGFTCHFWL 227 >ref|XP_018807521.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710-like, partial [Juglans regia] Length = 322 Score = 138 bits (348), Expect = 8e-36 Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 7/150 (4%) Frame = -1 Query: 430 PLKIENRVFFPDHVLLLVSGG-KKNQELECVYY----GKPGKFR-KGNFDFGKKKVMSVD 269 PL++E+RV FPDH+LLLVS +++ELECVY G R K + + G + V+S + Sbjct: 83 PLRVEDRVLFPDHLLLLVSNKLHESEELECVYSTFLNGSGELLRDKRSQEVGIRPVLSTE 142 Query: 268 EYDGLSSIVRCPLPSVNYSSV-VSLKLGKNGILGEKNEFLVRNQTVNYWENVAYAAALDG 92 YD L SIVRCPLP VNYS+ V L+ ++G G + NQTV +WE +AY A LDG Sbjct: 143 PYDALRSIVRCPLPPVNYSTAAVDLRRLRSGEAGHYEWSVGANQTVYWWEKLAYEAVLDG 202 Query: 91 DSVVVFVKGLNLRAGRESDPSPFSCHFGLG 2 D+V VFVKGLNLR ++SDP+ F CHFG G Sbjct: 203 DTVAVFVKGLNLRPHKKSDPTQFRCHFGFG 232 >ref|XP_006357222.1| PREDICTED: UPF0392 protein RCOM_0530710 [Solanum tuberosum] Length = 603 Score = 142 bits (357), Expect = 2e-35 Identities = 107/241 (44%), Positives = 125/241 (51%), Gaps = 13/241 (5%) Frame = -1 Query: 685 MDYSDQRRKRKRI--XXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRP-- 518 MD S+QRRKRKRI ++C F +S FRP Sbjct: 1 MDSSEQRRKRKRIFRPSSPTATLVYFFSVRSFVLCFSFLIFLFLLSYQIPFSSSVFRPVL 60 Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLS-QPLKIENRVFFPDHVLLLVSGG---KKNQEL 350 T ++ Q F S L+IE RV FPDHVLLLV+ KN Sbjct: 61 VVSRLSLLSSSSDFLSTSQSFQDFGSSSLLHLQIEGRVLFPDHVLLLVNKNDLFSKNTNF 120 Query: 349 ECVYYGKPGKFRKGNFDFG--KKKVMSVDEY--DGLSSIVRCPLPSVNYSSVVSL-KLGK 185 ECVY G+ G+ D G K+K SVD Y +VRCPLP VNYS+VV+L K Sbjct: 121 ECVY----GRNSTGD-DVGVVKEKSFSVDVYGESEFGVLVRCPLPPVNYSAVVNLRKFRG 175 Query: 184 NGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFGL 5 NG+ E NQTVN WENVAYAA LDG++VVVFVKGLNLR RESD S FSC+FGL Sbjct: 176 NGLWKES------NQTVNSWENVAYAATLDGNTVVVFVKGLNLRPDRESDSSQFSCYFGL 229 Query: 4 G 2 G Sbjct: 230 G 230 >ref|XP_004238737.1| PREDICTED: glycosyltransferase family 92 protein RCOM_0530710 [Solanum lycopersicum] Length = 600 Score = 141 bits (356), Expect = 3e-35 Identities = 107/242 (44%), Positives = 124/242 (51%), Gaps = 14/242 (5%) Frame = -1 Query: 685 MDYSDQRRKRKRIXXXXXXXXXXXXXXXXXLI-CXXXXXXXXXXXXXSHFPASAFRP--X 515 MD S+QRRKRKRI + C F +S FRP Sbjct: 1 MDSSEQRRKRKRILRPSSISALVYFFSVRSFVLCFSFLIFVFLLSYHIPFSSSVFRPVLV 60 Query: 514 XXXXXXXXXXXXXSRTVKAVQVFDGLSQP---LKIENRVFFPDHVLLLVSGGK---KNQE 353 T + Q F S L+IE RV FPDHVLL V+ + KN E Sbjct: 61 VSRLSLLSSSSDLLSTSLSFQDFGSSSSSLLHLQIEGRVLFPDHVLLFVNKNELFSKNTE 120 Query: 352 LECVYYGKPGKFRKGNFDFG--KKKVMSVDEYDG--LSSIVRCPLPSVNYSSVVSLK-LG 188 ECVY G+ G+ D G K+K SVD Y + VRCPLP VNYS+VV+L+ L Sbjct: 121 FECVY----GRNSTGD-DVGIVKEKSYSVDVYGDFEFGAFVRCPLPRVNYSAVVNLRELR 175 Query: 187 KNGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHFG 8 NG NQTVN WENVAYAAALDG++VVVFVKGLNLR RESD S FSC+FG Sbjct: 176 GNG----------SNQTVNSWENVAYAAALDGNTVVVFVKGLNLRPDRESDSSQFSCYFG 225 Query: 7 LG 2 LG Sbjct: 226 LG 227 >ref|XP_015074620.1| PREDICTED: UPF0392 protein RCOM_0530710 [Solanum pennellii] Length = 601 Score = 140 bits (352), Expect = 1e-34 Identities = 107/243 (44%), Positives = 125/243 (51%), Gaps = 15/243 (6%) Frame = -1 Query: 685 MDYSDQRRKRKRI--XXXXXXXXXXXXXXXXXLICXXXXXXXXXXXXXSHFPASAFRP-- 518 MD S+QRRKRKRI ++C F +S FRP Sbjct: 1 MDSSEQRRKRKRIFRPSSSISALVYFFSVRSFVLCFSFLIFVFLLSYHIPFSSSVFRPVL 60 Query: 517 XXXXXXXXXXXXXXSRTVKAVQVFDGLSQP---LKIENRVFFPDHVLLLVSGG---KKNQ 356 T + Q F S L+IE RV FPDHVLLLV+ KN Sbjct: 61 VVSRLSLLSSSSDLLSTSLSFQDFGSSSSSLLHLQIEGRVLFPDHVLLLVNKNDLFSKNT 120 Query: 355 ELECVYYGKPGKFRKGNFDFG--KKKVMSVDEYDG--LSSIVRCPLPSVNYSSVVSLK-L 191 E ECVY G+ G+ D G K+K SVD Y + VRCPLP VNYS+VV+L+ L Sbjct: 121 EFECVY----GRNSTGD-DVGIVKEKSYSVDVYGDFEFGAFVRCPLPRVNYSAVVNLREL 175 Query: 190 GKNGILGEKNEFLVRNQTVNYWENVAYAAALDGDSVVVFVKGLNLRAGRESDPSPFSCHF 11 NG NQTVN WE+VAYAAALDG++VVVFVKGLNLR RESD S FSC+F Sbjct: 176 RGNG----------SNQTVNSWESVAYAAALDGNTVVVFVKGLNLRPDRESDSSQFSCYF 225 Query: 10 GLG 2 GLG Sbjct: 226 GLG 228