BLASTX nr result

ID: Mentha26_contig00048349 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00048349
         (611 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38051.1| hypothetical protein MIMGU_mgv1a000603mg [Mimulus...   249   4e-64
gb|EPS63775.1| hypothetical protein M569_11009, partial [Genlise...   202   5e-50
ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588...   190   2e-46
ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258...   186   4e-45
gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]     150   4e-34
ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505...   144   2e-32
ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779...   142   8e-32
ref|XP_007144256.1| hypothetical protein PHAVU_007G141200g [Phas...   142   8e-32
ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...   142   8e-32
ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein ...   139   6e-31
ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein ...   139   6e-31
ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein ...   139   6e-31
ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein ...   139   6e-31
ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein ...   139   6e-31
ref|XP_006606300.1| PREDICTED: uncharacterized protein LOC100790...   132   6e-29
ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790...   132   6e-29
ref|XP_006606297.1| PREDICTED: uncharacterized protein LOC100790...   132   6e-29
ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790...   132   6e-29
ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Popu...   132   8e-29
ref|XP_006485287.1| PREDICTED: uncharacterized protein LOC102618...   128   1e-27

>gb|EYU38051.1| hypothetical protein MIMGU_mgv1a000603mg [Mimulus guttatus]
          Length = 1048

 Score =  249 bits (636), Expect = 4e-64
 Identities = 131/207 (63%), Positives = 150/207 (72%), Gaps = 4/207 (1%)
 Frame = +2

Query: 2   SASDDAAAATFHSIRDRFPFKRYTNASSAAELPXXXXXXXXXXXXXXX----HHHKQRKL 169
           SASDDA A  F SIRDRFPFKR  ++S+ +                      HHH +RKL
Sbjct: 11  SASDDATAGPFRSIRDRFPFKRNNSSSNYSSTNTLTRSSSKTTLSSHKASRSHHHHKRKL 70

Query: 170 LLYLFKGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGS 349
            L  F+GKS  YLCIF V F F +ASM LQSSI SV RQGVG +  +WRWSVK+GL+ GS
Sbjct: 71  SLSPFRGKSCFYLCIFTVIFTFALASMVLQSSITSVLRQGVGGDRMRWRWSVKDGLKEGS 130

Query: 350 SLEFVPRRRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALG 529
           SLEFVPRRR ELN SR+D LRSQP IG+RPPRI +ILGN++KDPSALLLYS+MKNLK LG
Sbjct: 131 SLEFVPRRRFELNGSRVDWLRSQPRIGIRPPRIGLILGNMEKDPSALLLYSVMKNLKGLG 190

Query: 530 YLFKLYALKDGRAHSVWQELGGPVSIL 610
           YL KLYAL DGRA  +WQE+GG VSIL
Sbjct: 191 YLLKLYALGDGRARPIWQEIGGQVSIL 217


>gb|EPS63775.1| hypothetical protein M569_11009, partial [Genlisea aurea]
          Length = 849

 Score =  202 bits (515), Expect = 5e-50
 Identities = 110/198 (55%), Positives = 131/198 (66%)
 Frame = +2

Query: 17  AAAATFHSIRDRFPFKRYTNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKGKS 196
           A AA FHSIRDRFPFKRY ++SSA  LP               HHH ++KL    F GK 
Sbjct: 19  AGAAAFHSIRDRFPFKRYNSSSSAPALPRLSKTAHKASRS---HHHHKQKLPFVAFVGKP 75

Query: 197 RLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVPRRR 376
             Y  I  V F F +ASM LQ S M  F Q +G E  +WRWSVKEGL+ GSSL+FVP  R
Sbjct: 76  WFYFSILTVIFTFSLASMVLQKSFMPGFGQTIGAERTRWRWSVKEGLKPGSSLKFVPGWR 135

Query: 377 LELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFKLYALK 556
           LELN SRLD LR+QP IGVR PRI++IL NL+ D  AL+LYS+MK L+ LGY+ K+YAL+
Sbjct: 136 LELNESRLDWLRTQPRIGVRRPRISIILSNLENDVPALMLYSVMKILRGLGYVLKVYALE 195

Query: 557 DGRAHSVWQELGGPVSIL 610
           DG + S WQ +   VSIL
Sbjct: 196 DGESRSNWQAITEQVSIL 213


>ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588632 [Solanum tuberosum]
          Length = 1048

 Score =  190 bits (483), Expect = 2e-46
 Identities = 104/201 (51%), Positives = 136/201 (67%), Gaps = 8/201 (3%)
 Frame = +2

Query: 32  FHSIRDRFPFKRYTNA-SSAAELPXXXXXXXXXXXXXXXHHHKQ-------RKLLLYLFK 187
           FHSIRDRF FKR +   +    LP                HH         RKL+ + F+
Sbjct: 26  FHSIRDRFRFKRNSQRPTETVTLPSSSSSPDRQWKTLARSHHHHHHNRSFSRKLIFFCFR 85

Query: 188 GKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
           GK  LYLCIF V F+F +ASM LQSSIMSVFRQ    E  +WRWSV++ L+LGSSLEFV 
Sbjct: 86  GKW-LYLCIFMVIFVFALASMVLQSSIMSVFRQN---ERARWRWSVRDDLKLGSSLEFVQ 141

Query: 368 RRRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFKLY 547
            RR +L  + LD +R+QP IGVRPPRIA++LGN++KDP +L+L +++KNL+ LGY+ K+Y
Sbjct: 142 PRRFQLG-NGLDLVRNQPRIGVRPPRIALVLGNMRKDPLSLMLSTVVKNLRGLGYMIKIY 200

Query: 548 ALKDGRAHSVWQELGGPVSIL 610
            ++DG A S+W+E+GG VSIL
Sbjct: 201 TVEDGIARSIWEEIGGKVSIL 221


>ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258810 [Solanum
           lycopersicum]
          Length = 1050

 Score =  186 bits (473), Expect = 4e-45
 Identities = 105/201 (52%), Positives = 136/201 (67%), Gaps = 8/201 (3%)
 Frame = +2

Query: 32  FHSIRDRFPFKRYTNA-SSAAELPXXXXXXXXXXXXXXXHHHKQ-------RKLLLYLFK 187
           FH IRDRF FKR +   + A  LP                HH         RKL+ + F+
Sbjct: 26  FHLIRDRFRFKRNSQRPTEAVTLPSSSSPSDRQWKTPARSHHHHHHNRSFSRKLIFFCFR 85

Query: 188 GKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
           GK  LYLCIF V F+F +ASM LQSSIMSVFRQ    E  + RWSV++ L+LGSSLEFVP
Sbjct: 86  GKW-LYLCIFLVIFVFALASMVLQSSIMSVFRQN---ERARSRWSVRDDLKLGSSLEFVP 141

Query: 368 RRRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFKLY 547
             R +L  + LD +R+QP IGVRPPRIA++LGN++KDP +L+L +++KNL+ LGY+ K+Y
Sbjct: 142 PPRFQLG-NGLDLVRNQPRIGVRPPRIALVLGNMRKDPLSLMLSTVVKNLRGLGYMIKIY 200

Query: 548 ALKDGRAHSVWQELGGPVSIL 610
           A++DG A SVW+E+GG VSIL
Sbjct: 201 AVEDGIARSVWEEIGGKVSIL 221


>gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]
          Length = 1043

 Score =  150 bits (378), Expect = 4e-34
 Identities = 88/195 (45%), Positives = 123/195 (63%), Gaps = 2/195 (1%)
 Frame = +2

Query: 32  FHSIRDRFPFKRYTNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKGKSRLYLC 211
           FHSIRDR  FKR  N S   +                 +    RK  L+ FKGKS LYL 
Sbjct: 27  FHSIRDRLRFKRNPNPSHDRDRTKVFADRAPVRGRSHYNSRFNRKGFLW-FKGKSTLYLV 85

Query: 212 IFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVPRR--RLEL 385
           I    F+F +ASM LQSSIMSVF+QG  R GR  R    EGL+ G++L FVP R  R   
Sbjct: 86  IIFAVFLFGMASMVLQSSIMSVFKQGSER-GRLLR----EGLKFGTTLRFVPGRISRRLA 140

Query: 386 NVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFKLYALKDGR 565
           + + LD LR++P I VR PR+A++LGN+KK+  +L+L +++KN++ LGY  K++A+++G 
Sbjct: 141 DANGLDRLRNEPRIAVRKPRLALVLGNMKKNSESLMLITIVKNIQKLGYALKIFAVENGN 200

Query: 566 AHSVWQELGGPVSIL 610
           A ++W++LGG +SIL
Sbjct: 201 ARTMWEQLGGQISIL 215


>ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum]
          Length = 1042

 Score =  144 bits (363), Expect = 2e-32
 Identities = 93/215 (43%), Positives = 131/215 (60%), Gaps = 15/215 (6%)
 Frame = +2

Query: 11  DDAAAAT---FHSIRDRFPFKRYTN-------ASSAAELPXXXXXXXXXXXXXXXHHHKQ 160
           DDA   +   F SIR RFPFKR  N       +SS  +LP               H+   
Sbjct: 14  DDAGGGSDVGFSSIRGRFPFKRNPNLNRDRHRSSSDRQLPRSANSSRSHL-----HNRFT 68

Query: 161 RKLLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKE 331
           RK  L LF   KGKS LY  IF V F+F +ASM +Q+SI SVFRQ    EG ++   ++E
Sbjct: 69  RKGFLSLFPFFKGKSGLYALIFVVVFLFALASMVMQNSITSVFRQR--NEGSRY---LRE 123

Query: 332 GLELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSL 505
           GL+ GS+++FVP +  +  L+   LD LRSQP IGVR PRIA+ILG++  DP +L+L ++
Sbjct: 124 GLKFGSTIKFVPGKVSQKFLSGDGLDRLRSQPRIGVRSPRIALILGHMSVDPQSLMLVTV 183

Query: 506 MKNLKALGYLFKLYALKDGRAHSVWQELGGPVSIL 610
           ++NL+ LGY+FK++ +   +A S+W+ +GG +S L
Sbjct: 184 IQNLQKLGYVFKIFVVGHRKARSIWENVGGGLSSL 218


>ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779157 isoform X2 [Glycine
           max]
          Length = 1043

 Score =  142 bits (358), Expect = 8e-32
 Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLF---K 187
           F +IR  FPFKR  +      S   +LP               H HK++ LLL+LF   K
Sbjct: 24  FGAIRGGFPFKRNPSHHRHRGSFDRQLPRSNNNSNSNNNINRSHLHKRKGLLLWLFPFPK 83

Query: 188 GKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
            KS  Y  I AV F+F +AS+ +QSSI SVFRQ   R        ++ G+  GS+L FVP
Sbjct: 84  SKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAERASY-----IRGGIRFGSALRFVP 138

Query: 368 RRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
            +  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L ++++NL+ LGY+FK
Sbjct: 139 GKISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVTVIRNLQKLGYVFK 198

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           ++A+  G+A S+W+ +GG +S L
Sbjct: 199 IFAVGHGKARSIWENIGGGISPL 221


>ref|XP_007144256.1| hypothetical protein PHAVU_007G141200g [Phaseolus vulgaris]
           gi|561017446|gb|ESW16250.1| hypothetical protein
           PHAVU_007G141200g [Phaseolus vulgaris]
          Length = 1049

 Score =  142 bits (358), Expect = 8e-32
 Identities = 91/213 (42%), Positives = 128/213 (60%), Gaps = 13/213 (6%)
 Frame = +2

Query: 11  DDAAAAT-FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXXHHHK--QRK 166
           DDA     FH+IR  FPFKR  +      S   +LP                H +  ++ 
Sbjct: 14  DDAGGDIGFHAIRGGFPFKRNPSHYRHRGSFDRQLPRSSNSSSSNSSSRSHLHSRLTRKG 73

Query: 167 LLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGL 337
           LLL+LF   K KS  Y  I  V F+F  +SM +Q+SI SVFRQ   R GR  R    EGL
Sbjct: 74  LLLWLFPFSKCKSGFYALIIVVVFLFAFSSMVMQNSITSVFRQRTER-GRYHR----EGL 128

Query: 338 ELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMK 511
             G++L FVP R  +  L+   LD +RSQP +GVRPPRIA+ILG++  DP +L+L ++++
Sbjct: 129 RFGTALRFVPGRVSQGFLSGDGLDRVRSQPRLGVRPPRIALILGHMTIDPQSLMLVTVIR 188

Query: 512 NLKALGYLFKLYALKDGRAHSVWQELGGPVSIL 610
           NL+ LGY+FK++A+ +G+AHS+W+ +GG +S L
Sbjct: 189 NLQKLGYVFKIFAVGNGKAHSIWENIGGGISHL 221


>ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 isoform X1 [Glycine
           max]
          Length = 1044

 Score =  142 bits (358), Expect = 8e-32
 Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLF---K 187
           F +IR  FPFKR  +      S   +LP               H HK++ LLL+LF   K
Sbjct: 24  FGAIRGGFPFKRNPSHHRHRGSFDRQLPRSNNNSNSNNNINRSHLHKRKGLLLWLFPFPK 83

Query: 188 GKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
            KS  Y  I AV F+F +AS+ +QSSI SVFRQ   R        ++ G+  GS+L FVP
Sbjct: 84  SKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAERASY-----IRGGIRFGSALRFVP 138

Query: 368 RRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
            +  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L ++++NL+ LGY+FK
Sbjct: 139 GKISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVTVIRNLQKLGYVFK 198

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           ++A+  G+A S+W+ +GG +S L
Sbjct: 199 IFAVGHGKARSIWENIGGGISPL 221


>ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein isoform 5 [Theobroma
           cacao] gi|508727007|gb|EOY18904.1|
           UDP-Glycosyltransferase superfamily protein isoform 5
           [Theobroma cacao]
          Length = 782

 Score =  139 bits (350), Expect = 6e-31
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRY-------TNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKG 190
           F+SIRDR PFKR        T  SS  + P                   ++  LL+  +G
Sbjct: 33  FYSIRDRLPFKRNPIHTRDRTKQSSLLDRPLVRNRP----------RFNRKGFLLFPLRG 82

Query: 191 KSRLYLCIFAVFFMFLVASMALQSSIMSV-FRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
               Y  IF   F F +ASM +QSSI +V FRQG G  G  WR SV+EGL LGS+L+F+P
Sbjct: 83  IHLFYFLIFFSVFAFAMASMLMQSSIAAVVFRQG-GERG--WRKSVREGLRLGSTLKFMP 139

Query: 368 R--RRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
               R       LD +RS   IGVR PR+A+ILGN+KKDP +L++ +++K+L+ LGY+ K
Sbjct: 140 AGMSRWVAEGGGLDRMRSTARIGVRGPRLALILGNMKKDPQSLMMLTVVKSLQRLGYVIK 199

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           +YA+ +G+AH++W+ + G +S L
Sbjct: 200 IYAVANGKAHAMWEHISGQISFL 222


>ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein isoform 4 [Theobroma
           cacao] gi|508727006|gb|EOY18903.1|
           UDP-Glycosyltransferase superfamily protein isoform 4
           [Theobroma cacao]
          Length = 969

 Score =  139 bits (350), Expect = 6e-31
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRY-------TNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKG 190
           F+SIRDR PFKR        T  SS  + P                   ++  LL+  +G
Sbjct: 33  FYSIRDRLPFKRNPIHTRDRTKQSSLLDRPLVRNRP----------RFNRKGFLLFPLRG 82

Query: 191 KSRLYLCIFAVFFMFLVASMALQSSIMSV-FRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
               Y  IF   F F +ASM +QSSI +V FRQG G  G  WR SV+EGL LGS+L+F+P
Sbjct: 83  IHLFYFLIFFSVFAFAMASMLMQSSIAAVVFRQG-GERG--WRKSVREGLRLGSTLKFMP 139

Query: 368 R--RRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
               R       LD +RS   IGVR PR+A+ILGN+KKDP +L++ +++K+L+ LGY+ K
Sbjct: 140 AGMSRWVAEGGGLDRMRSTARIGVRGPRLALILGNMKKDPQSLMMLTVVKSLQRLGYVIK 199

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           +YA+ +G+AH++W+ + G +S L
Sbjct: 200 IYAVANGKAHAMWEHISGQISFL 222


>ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
           cacao] gi|508727005|gb|EOY18902.1|
           UDP-Glycosyltransferase superfamily protein isoform 3
           [Theobroma cacao]
          Length = 1034

 Score =  139 bits (350), Expect = 6e-31
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRY-------TNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKG 190
           F+SIRDR PFKR        T  SS  + P                   ++  LL+  +G
Sbjct: 33  FYSIRDRLPFKRNPIHTRDRTKQSSLLDRPLVRNRP----------RFNRKGFLLFPLRG 82

Query: 191 KSRLYLCIFAVFFMFLVASMALQSSIMSV-FRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
               Y  IF   F F +ASM +QSSI +V FRQG G  G  WR SV+EGL LGS+L+F+P
Sbjct: 83  IHLFYFLIFFSVFAFAMASMLMQSSIAAVVFRQG-GERG--WRKSVREGLRLGSTLKFMP 139

Query: 368 R--RRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
               R       LD +RS   IGVR PR+A+ILGN+KKDP +L++ +++K+L+ LGY+ K
Sbjct: 140 AGMSRWVAEGGGLDRMRSTARIGVRGPRLALILGNMKKDPQSLMMLTVVKSLQRLGYVIK 199

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           +YA+ +G+AH++W+ + G +S L
Sbjct: 200 IYAVANGKAHAMWEHISGQISFL 222


>ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
           cacao] gi|508727004|gb|EOY18901.1|
           UDP-Glycosyltransferase superfamily protein isoform 2
           [Theobroma cacao]
          Length = 735

 Score =  139 bits (350), Expect = 6e-31
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRY-------TNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKG 190
           F+SIRDR PFKR        T  SS  + P                   ++  LL+  +G
Sbjct: 33  FYSIRDRLPFKRNPIHTRDRTKQSSLLDRPLVRNRP----------RFNRKGFLLFPLRG 82

Query: 191 KSRLYLCIFAVFFMFLVASMALQSSIMSV-FRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
               Y  IF   F F +ASM +QSSI +V FRQG G  G  WR SV+EGL LGS+L+F+P
Sbjct: 83  IHLFYFLIFFSVFAFAMASMLMQSSIAAVVFRQG-GERG--WRKSVREGLRLGSTLKFMP 139

Query: 368 R--RRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
               R       LD +RS   IGVR PR+A+ILGN+KKDP +L++ +++K+L+ LGY+ K
Sbjct: 140 AGMSRWVAEGGGLDRMRSTARIGVRGPRLALILGNMKKDPQSLMMLTVVKSLQRLGYVIK 199

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           +YA+ +G+AH++W+ + G +S L
Sbjct: 200 IYAVANGKAHAMWEHISGQISFL 222


>ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
           cacao] gi|508727003|gb|EOY18900.1|
           UDP-Glycosyltransferase superfamily protein isoform 1
           [Theobroma cacao]
          Length = 1041

 Score =  139 bits (350), Expect = 6e-31
 Identities = 85/203 (41%), Positives = 120/203 (59%), Gaps = 10/203 (4%)
 Frame = +2

Query: 32  FHSIRDRFPFKRY-------TNASSAAELPXXXXXXXXXXXXXXXHHHKQRKLLLYLFKG 190
           F+SIRDR PFKR        T  SS  + P                   ++  LL+  +G
Sbjct: 33  FYSIRDRLPFKRNPIHTRDRTKQSSLLDRPLVRNRP----------RFNRKGFLLFPLRG 82

Query: 191 KSRLYLCIFAVFFMFLVASMALQSSIMSV-FRQGVGREGRQWRWSVKEGLELGSSLEFVP 367
               Y  IF   F F +ASM +QSSI +V FRQG G  G  WR SV+EGL LGS+L+F+P
Sbjct: 83  IHLFYFLIFFSVFAFAMASMLMQSSIAAVVFRQG-GERG--WRKSVREGLRLGSTLKFMP 139

Query: 368 R--RRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLKALGYLFK 541
               R       LD +RS   IGVR PR+A+ILGN+KKDP +L++ +++K+L+ LGY+ K
Sbjct: 140 AGMSRWVAEGGGLDRMRSTARIGVRGPRLALILGNMKKDPQSLMMLTVVKSLQRLGYVIK 199

Query: 542 LYALKDGRAHSVWQELGGPVSIL 610
           +YA+ +G+AH++W+ + G +S L
Sbjct: 200 IYAVANGKAHAMWEHISGQISFL 222


>ref|XP_006606300.1| PREDICTED: uncharacterized protein LOC100790929 isoform X5 [Glycine
           max]
          Length = 796

 Score =  132 bits (333), Expect = 6e-29
 Identities = 88/213 (41%), Positives = 119/213 (55%), Gaps = 16/213 (7%)
 Frame = +2

Query: 11  DDAAAAT-FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXX-----HHHK 157
           DDA     F +IR  FPFKR        AS   +LP                    H HK
Sbjct: 14  DDAGGDIGFGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSSNNNNISIRSHLHK 73

Query: 158 QRKLLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVK 328
           ++ LLL+LF   K KS  Y  I  V F+F +ASM LQSSI SVFRQ            + 
Sbjct: 74  RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFRQSADSARY-----IS 128

Query: 329 EGLELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYS 502
            G+  GS+L FVP R  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L +
Sbjct: 129 GGIRFGSALRFVPGRISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVT 188

Query: 503 LMKNLKALGYLFKLYALKDGRAHSVWQELGGPV 601
           ++ NL+ LGY+FK++A+  G+A S+W+ +GG +
Sbjct: 189 VIWNLQKLGYVFKIFAVGHGKARSIWENIGGRI 221


>ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790929 isoform X3 [Glycine
           max]
          Length = 1015

 Score =  132 bits (333), Expect = 6e-29
 Identities = 88/213 (41%), Positives = 119/213 (55%), Gaps = 16/213 (7%)
 Frame = +2

Query: 11  DDAAAAT-FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXX-----HHHK 157
           DDA     F +IR  FPFKR        AS   +LP                    H HK
Sbjct: 14  DDAGGDIGFGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSSNNNNISIRSHLHK 73

Query: 158 QRKLLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVK 328
           ++ LLL+LF   K KS  Y  I  V F+F +ASM LQSSI SVFRQ            + 
Sbjct: 74  RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFRQSADSARY-----IS 128

Query: 329 EGLELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYS 502
            G+  GS+L FVP R  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L +
Sbjct: 129 GGIRFGSALRFVPGRISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVT 188

Query: 503 LMKNLKALGYLFKLYALKDGRAHSVWQELGGPV 601
           ++ NL+ LGY+FK++A+  G+A S+W+ +GG +
Sbjct: 189 VIWNLQKLGYVFKIFAVGHGKARSIWENIGGRI 221


>ref|XP_006606297.1| PREDICTED: uncharacterized protein LOC100790929 isoform X2 [Glycine
           max]
          Length = 1044

 Score =  132 bits (333), Expect = 6e-29
 Identities = 88/213 (41%), Positives = 119/213 (55%), Gaps = 16/213 (7%)
 Frame = +2

Query: 11  DDAAAAT-FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXX-----HHHK 157
           DDA     F +IR  FPFKR        AS   +LP                    H HK
Sbjct: 14  DDAGGDIGFGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSSNNNNISIRSHLHK 73

Query: 158 QRKLLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVK 328
           ++ LLL+LF   K KS  Y  I  V F+F +ASM LQSSI SVFRQ            + 
Sbjct: 74  RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFRQSADSARY-----IS 128

Query: 329 EGLELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYS 502
            G+  GS+L FVP R  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L +
Sbjct: 129 GGIRFGSALRFVPGRISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVT 188

Query: 503 LMKNLKALGYLFKLYALKDGRAHSVWQELGGPV 601
           ++ NL+ LGY+FK++A+  G+A S+W+ +GG +
Sbjct: 189 VIWNLQKLGYVFKIFAVGHGKARSIWENIGGRI 221


>ref|XP_003555467.1| PREDICTED: uncharacterized protein LOC100790929 isoform X1 [Glycine
           max]
          Length = 1045

 Score =  132 bits (333), Expect = 6e-29
 Identities = 88/213 (41%), Positives = 119/213 (55%), Gaps = 16/213 (7%)
 Frame = +2

Query: 11  DDAAAAT-FHSIRDRFPFKRYTN-----ASSAAELPXXXXXXXXXXXXXXX-----HHHK 157
           DDA     F +IR  FPFKR        AS   +LP                    H HK
Sbjct: 14  DDAGGDIGFGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSSNNNNISIRSHLHK 73

Query: 158 QRKLLLYLF---KGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVK 328
           ++ LLL+LF   K KS  Y  I  V F+F +ASM LQSSI SVFRQ            + 
Sbjct: 74  RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFRQSADSARY-----IS 128

Query: 329 EGLELGSSLEFVPRRRLE--LNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYS 502
            G+  GS+L FVP R  +  L+   LD +RSQP IGVR PRIA+ILG++  DP +L+L +
Sbjct: 129 GGIRFGSALRFVPGRISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVT 188

Query: 503 LMKNLKALGYLFKLYALKDGRAHSVWQELGGPV 601
           ++ NL+ LGY+FK++A+  G+A S+W+ +GG +
Sbjct: 189 VIWNLQKLGYVFKIFAVGHGKARSIWENIGGRI 221


>ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Populus trichocarpa]
           gi|550330474|gb|ERP56591.1| hypothetical protein
           POPTR_0010s23830g [Populus trichocarpa]
          Length = 1053

 Score =  132 bits (332), Expect = 8e-29
 Identities = 86/218 (39%), Positives = 121/218 (55%), Gaps = 15/218 (6%)
 Frame = +2

Query: 2   SASDDAAAATFHSIRDRFPFKRYTNASSAAELPXXXXXXXXXXXXXXXHHHKQRK----- 166
           + S+  +   FHSI DRF FKR  N S+ +                  HH+  +      
Sbjct: 19  TGSEGVSDQNFHSISDRFLFKRNPNPSTNSP---HKSSKSPPDRLRRWHHYTNKSNNRKG 75

Query: 167 --LLLYLFKGKSRLYLCIFAVFFMFLVASMALQSSI--MSVFRQGVGREGRQW---RWSV 325
                  F+G    Y  IF   F F++AS+ LQSSI  M VF +G       W   R S+
Sbjct: 76  GWFSCIPFRGICLFYFVIFLAVFAFVLASILLQSSITGMVVFSKG-------WIDHRRSI 128

Query: 326 KEGLELGSSLEFVP--RRRLELNVSRLDSLRSQP-MIGVRPPRIAVILGNLKKDPSALLL 496
           +EGL+ G++L+FVP  R RL L    LD  R     +G+RPPR+AVILGN+KKDP +L+L
Sbjct: 129 REGLKSGTTLKFVPGLRSRLLLEGHGLDHARVLANRVGLRPPRLAVILGNMKKDPQSLML 188

Query: 497 YSLMKNLKALGYLFKLYALKDGRAHSVWQELGGPVSIL 610
            S+MKNL+ LGY  K+YAL +G   ++W+++GG +S+L
Sbjct: 189 LSVMKNLRKLGYALKIYALGNGETRTMWEDIGGQISVL 226


>ref|XP_006485287.1| PREDICTED: uncharacterized protein LOC102618162 isoform X2 [Citrus
           sinensis]
          Length = 962

 Score =  128 bits (322), Expect = 1e-27
 Identities = 83/210 (39%), Positives = 111/210 (52%), Gaps = 18/210 (8%)
 Frame = +2

Query: 35  HSIRDRFPFKRYTNASSAAELPXXXXXXXXXXXXXXXHHH------------------KQ 160
           HSIRDRF FKR  N +                     H H                  ++
Sbjct: 33  HSIRDRFRFKRSPNHTQ-----DKTQTKPSLHRYLLRHRHVNSTPSAANAATSGPRFNRK 87

Query: 161 RKLLLYLFKGKSRLYLCIFAVFFMFLVASMALQSSIMSVFRQGVGREGRQWRWSVKEGLE 340
               L+ F+G   LY  IF   F F +ASM LQ+SI SVF    GR        ++E L 
Sbjct: 88  GFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERGRP-------IREELR 140

Query: 341 LGSSLEFVPRRRLELNVSRLDSLRSQPMIGVRPPRIAVILGNLKKDPSALLLYSLMKNLK 520
            GS L+FVP +    N   LD LRS P  GVRPPRI +ILGN+ KD  +LLL +++KNL+
Sbjct: 141 FGSRLKFVPDQVGFGN--GLDGLRSTPRFGVRPPRIGLILGNMAKDSRSLLLITVVKNLQ 198

Query: 521 ALGYLFKLYALKDGRAHSVWQELGGPVSIL 610
            LGY+FK+YA++ G +HS+W+++ G +SIL
Sbjct: 199 KLGYVFKIYAVRSGNSHSLWEQIAGQISIL 228

