BLASTX nr result

ID: Forsythia22_contig00033553 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00033553
         (810 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011084221.1| PREDICTED: uncharacterized protein LOC105166...   311   3e-82
ref|XP_012836694.1| PREDICTED: uncharacterized protein LOC105957...   298   4e-78
ref|XP_009605302.1| PREDICTED: uncharacterized protein LOC104099...   291   3e-76
ref|XP_009796926.1| PREDICTED: uncharacterized protein LOC104243...   290   6e-76
emb|CDO99829.1| unnamed protein product [Coffea canephora]            283   9e-74
ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258...   278   2e-72
ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588...   277   5e-72
ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein ...   273   9e-71
ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein ...   273   9e-71
ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein ...   273   9e-71
ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein ...   273   9e-71
ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein ...   273   9e-71
ref|XP_012447608.1| PREDICTED: uncharacterized protein LOC105770...   270   1e-69
ref|XP_012447607.1| PREDICTED: uncharacterized protein LOC105770...   270   1e-69
ref|XP_012090324.1| PREDICTED: uncharacterized protein LOC105648...   268   3e-69
ref|XP_012090316.1| PREDICTED: uncharacterized protein LOC105648...   268   3e-69
ref|XP_011649860.1| PREDICTED: uncharacterized protein LOC101206...   268   3e-69
gb|KDP45060.1| hypothetical protein JCGZ_01560 [Jatropha curcas]      268   3e-69
ref|XP_004138684.1| PREDICTED: uncharacterized protein LOC101206...   268   3e-69
ref|XP_011469406.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   266   9e-69

>ref|XP_011084221.1| PREDICTED: uncharacterized protein LOC105166536 [Sesamum indicum]
          Length = 1040

 Score =  311 bits (797), Expect = 3e-82
 Identities = 161/270 (59%), Positives = 195/270 (72%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS  Y+DALQD+  RL L  GSL+HYGI++DVNGLILMADI+LYGSSQ+EQGFPPLLTRA
Sbjct: 396  SSKDYDDALQDVAARLRLNQGSLKHYGINSDVNGLILMADIVLYGSSQDEQGFPPLLTRA 455

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG PV++PD+PVI+KYVVDGVHGIIFPK+DA AL NAFS+LIS GKLS  A SVASSG
Sbjct: 456  MAFGNPVIAPDFPVIRKYVVDGVHGIIFPKNDAEALTNAFSLLISGGKLSRFAHSVASSG 515

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            +   KN+ AAECI GYA+LLE +F+FPSDVLLP   SEL+  +WE SLF+ E++Q  SN 
Sbjct: 516  RLHAKNMFAAECIVGYAELLEYVFDFPSDVLLPARPSELKNLTWEWSLFRRELDQIYSNT 575

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
            E   ++           DLEE++ + V  KNI+ D+ EDLEEDIPT+ DWD         
Sbjct: 576  E--LLEGYSWMNSSNVYDLEEDMKDYVRSKNITQDNSEDLEEDIPTLLDWDILSEIESSE 633

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           KDIGEWD+IYR+ARK
Sbjct: 634  EVEMLEREEIEERMEKDIGEWDDIYRNARK 663


>ref|XP_012836694.1| PREDICTED: uncharacterized protein LOC105957310 [Erythranthe
            guttatus] gi|604333715|gb|EYU38051.1| hypothetical
            protein MIMGU_mgv1a000603mg [Erythranthe guttata]
          Length = 1048

 Score =  298 bits (762), Expect = 4e-78
 Identities = 153/270 (56%), Positives = 188/270 (69%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS  Y+DALQD+ TRL L   S++HYGI++DVNG+ILMADI+LYGSSQ+EQGFPPLLTRA
Sbjct: 401  SSKDYSDALQDVATRLRLNEQSVKHYGINSDVNGIILMADIVLYGSSQDEQGFPPLLTRA 460

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+PV++PD PVI+KYVVDGVHG+IFPK+D  AL NAFS+LIS+GKLS  A SV SSG
Sbjct: 461  MSFGIPVIAPDKPVIRKYVVDGVHGVIFPKNDPEALKNAFSLLISEGKLSRFAHSVGSSG 520

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            +   KN+ A ECI GYAKLLE +F+FPSDVLLP   S+L  + WE SLF+ E++Q SS+ 
Sbjct: 521  RLRAKNMFAEECIIGYAKLLEYVFDFPSDVLLPSRPSQLNNSIWEWSLFRMELDQISSHT 580

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
            ENLY++           DLEE + N     N + D  E+  EDIPT+ DWD         
Sbjct: 581  ENLYLEGSSGPNSGIVYDLEEAMLNDPTSSNETQDHSENPGEDIPTILDWDILDEMESSE 640

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           K+IGEWD+IYR ARK
Sbjct: 641  EVDRLEREEIEERMEKNIGEWDDIYRIARK 670


>ref|XP_009605302.1| PREDICTED: uncharacterized protein LOC104099876 [Nicotiana
            tomentosiformis]
          Length = 1052

 Score =  291 bits (746), Expect = 3e-76
 Identities = 151/270 (55%), Positives = 190/270 (70%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS+GYN+ALQDI TRL L  GSL H+ +  DVNG+IL+ADI+LY SSQ EQ FPP+L RA
Sbjct: 408  SSDGYNEALQDIATRLGLREGSLSHHDMKGDVNGIILIADIVLYSSSQYEQEFPPILIRA 467

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P+V+PD+PVIKKYVVD VHGIIF KH + AL+  FS+LIS+GKL+  A+++ASSG
Sbjct: 468  MSFGIPIVAPDHPVIKKYVVDEVHGIIFSKHKSNALVQDFSVLISNGKLTRFARTIASSG 527

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA ECITGYAKLLEN+ NFPSDV LP   S+L+Q SWE   F+ ++E KS++I
Sbjct: 528  RLLSKNMLAVECITGYAKLLENVINFPSDVTLPGDTSQLKQGSWEWGYFQKDVE-KSNDI 586

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
            E+L +K           DLE ++T  VP+ N+SGD+ E L ED P+  DWD         
Sbjct: 587  EDLQVKDVDLINSSVVYDLEVDMTGFVPLMNVSGDNSEAL-EDFPSELDWDILNEMERSE 645

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           KDIGEWDEIYR+ARK
Sbjct: 646  EVNRLEMEEIEERMEKDIGEWDEIYRNARK 675


>ref|XP_009796926.1| PREDICTED: uncharacterized protein LOC104243440 [Nicotiana
            sylvestris]
          Length = 1052

 Score =  290 bits (743), Expect = 6e-76
 Identities = 150/270 (55%), Positives = 189/270 (70%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS+GYN+ALQDI TRL L  GSL H+ +  DVNG+IL+ADI+LY SSQ EQ FPP+L RA
Sbjct: 408  SSDGYNEALQDIATRLGLREGSLSHHDMKGDVNGIILIADIVLYSSSQYEQEFPPILIRA 467

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P+V+PD+PVIKKYVVD VHGIIF KH + AL+  FS+LIS+GKL+  A ++ASSG
Sbjct: 468  MSFGIPIVAPDHPVIKKYVVDEVHGIIFSKHKSNALVQDFSVLISNGKLTRFAHTIASSG 527

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA ECI GYAKLLEN+ NFPSDV+LP   S+L+Q SWE   F+ ++E KS++I
Sbjct: 528  RLLSKNMLAVECIAGYAKLLENVINFPSDVILPGDTSQLKQGSWEWGYFQKDVE-KSNDI 586

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
            E+L +K           DLE ++T  VP+ N+SGD+ E L ED P+  DWD         
Sbjct: 587  EDLQVKDMDPINSSVVYDLEVDMTGFVPLMNVSGDNSEAL-EDFPSELDWDILNEMERSE 645

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           KDIGEWDEIYR+ARK
Sbjct: 646  EVNRLEMEEIEERMEKDIGEWDEIYRNARK 675


>emb|CDO99829.1| unnamed protein product [Coffea canephora]
          Length = 1060

 Score =  283 bits (724), Expect = 9e-74
 Identities = 144/270 (53%), Positives = 189/270 (70%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS+ Y+DALQDI TRL L  GSLRH+G+  D NGLILMADI+LY S Q+EQGFPPLLTRA
Sbjct: 412  SSSQYDDALQDIATRLGLYEGSLRHFGVHGDPNGLILMADIVLYASPQDEQGFPPLLTRA 471

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P+V+ + PVIK++V D V G+I  KH+  AL+ AFS+LIS+ KL   A S+ASSG
Sbjct: 472  MSFGLPIVALENPVIKRHVADQVQGMIVAKHNPDALIKAFSLLISEAKLLKLAHSIASSG 531

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+EC+  YAKLLENI NFPSDVLLPV+ S+L+Q SWE S F+ EI++K+ ++
Sbjct: 532  RLLAKNMLASECVMSYAKLLENILNFPSDVLLPVNTSQLKQTSWEWSFFQEEIDKKAGDL 591

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
             N + +          N +EE++ NL+P+KN+SG+D+E L+ D PT  DWD         
Sbjct: 592  ANPHSRGYGLSLGVVYN-IEEDMANLLPLKNVSGNDLEALDGDFPTHLDWDILREMESSE 650

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           K IG+WDE+YR+ARK
Sbjct: 651  ELESLEMEEIEERMEKAIGDWDELYRNARK 680


>ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258810 [Solanum
            lycopersicum]
          Length = 1050

 Score =  278 bits (712), Expect = 2e-72
 Identities = 143/271 (52%), Positives = 182/271 (67%), Gaps = 1/271 (0%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS+GYNDALQDI  RL L  GSL H+ +  DVNG+ L+ADI+LY S Q EQ FPP+L RA
Sbjct: 404  SSDGYNDALQDIANRLGLHEGSLSHHDMKGDVNGITLIADIVLYFSPQYEQEFPPILIRA 463

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P+V+PDYPVIKKYV D VHGIIF +HD+  L+  FS+LISDGKL+  A ++ASSG
Sbjct: 464  MSFGIPIVAPDYPVIKKYVADEVHGIIFSQHDSNELVQDFSLLISDGKLTRFAHTIASSG 523

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+ A ECITGYAKLLEN+  FPSDV+LP   S+++Q SWE   F+ ++E    +I
Sbjct: 524  RLLSKNMFAVECITGYAKLLENVITFPSDVILPGDTSQIKQESWEWGYFQKDLED-PKDI 582

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIE-DLEEDIPTVADWDXXXXXXXX 717
            E+L +K           DLE E+T  VP+ N+SGDD+E  ++ED P+  DWD        
Sbjct: 583  EDLQMKDVDPINSSVVYDLELEMTGFVPLMNVSGDDLEAAIKEDFPSELDWDILNEMERS 642

Query: 718  XXXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                            KDIG WD+IYR+ARK
Sbjct: 643  EEVDRLESEEIEERMEKDIGRWDDIYRNARK 673


>ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588632 [Solanum tuberosum]
          Length = 1048

 Score =  277 bits (709), Expect = 5e-72
 Identities = 144/270 (53%), Positives = 184/270 (68%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SS+GYNDALQDI TRL L  GSL H+ +  DVNG+ L+ADI+LY S Q EQ FPP+L RA
Sbjct: 404  SSDGYNDALQDIATRLGLHEGSLSHHDMKGDVNGITLIADIVLYFSPQYEQEFPPILIRA 463

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P+V+PDYPVIKKYVVD VHGIIF +H++  L+  FS+LISDGKL+  A ++ASSG
Sbjct: 464  MSFGIPIVAPDYPVIKKYVVDEVHGIIFSQHNSNELVQDFSLLISDGKLTRFAHTIASSG 523

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+ A ECITGYAKLLEN+  FPSDV+LP   S+L+Q SWE   F+ ++E    +I
Sbjct: 524  RLLSKNMFAVECITGYAKLLENVITFPSDVILPGDTSQLKQDSWEWGYFQKDLED-PKDI 582

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
            E+L +K          +DLE E+T  VP+ N+S DD E ++ED P+  DWD         
Sbjct: 583  EDLQMKDVDPINSSVVDDLELEMTGFVPL-NVSRDDPEAIKEDFPSELDWDILNEMERSE 641

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           KDIG+WD+IYR+ARK
Sbjct: 642  EVDRLESEEIEERMEKDIGKWDDIYRNARK 671


>ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein isoform 5 [Theobroma
            cacao] gi|508727007|gb|EOY18904.1|
            UDP-Glycosyltransferase superfamily protein isoform 5
            [Theobroma cacao]
          Length = 782

 Score =  273 bits (698), Expect = 9e-71
 Identities = 138/270 (51%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GY+DALQ + +RL L  GS+RHYG+D DVNG++LMADI+LYG+SQEEQGFP L+ RA
Sbjct: 406  STDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGTSQEEQGFPSLIIRA 465

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG HG+ FPKH   ALL AFS+LIS+G+LS  AQ+VASSG
Sbjct: 466  MTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLISNGRLSRFAQTVASSG 525

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+ECITGYA LLEN+ NFPSDVLLP  +S+L   SWE ++F  EIE  + +I
Sbjct: 526  RLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWEWNVFGMEIEHGTGDI 585

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
               +              LEEE T      +IS    E  ++DIPT  DWD         
Sbjct: 586  SRYF---------SVVYALEEEFTKHTISSDISQYGAEIQDQDIPTEQDWDIVTEIENFE 636

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           ++ G WD+IYR+AR+
Sbjct: 637  DYERLEMDEVEERMERNPGVWDDIYRNARR 666


>ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein isoform 4 [Theobroma
            cacao] gi|508727006|gb|EOY18903.1|
            UDP-Glycosyltransferase superfamily protein isoform 4
            [Theobroma cacao]
          Length = 969

 Score =  273 bits (698), Expect = 9e-71
 Identities = 138/270 (51%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GY+DALQ + +RL L  GS+RHYG+D DVNG++LMADI+LYG+SQEEQGFP L+ RA
Sbjct: 406  STDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGTSQEEQGFPSLIIRA 465

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG HG+ FPKH   ALL AFS+LIS+G+LS  AQ+VASSG
Sbjct: 466  MTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLISNGRLSRFAQTVASSG 525

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+ECITGYA LLEN+ NFPSDVLLP  +S+L   SWE ++F  EIE  + +I
Sbjct: 526  RLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWEWNVFGMEIEHGTGDI 585

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
               +              LEEE T      +IS    E  ++DIPT  DWD         
Sbjct: 586  SRYF---------SVVYALEEEFTKHTISSDISQYGAEIQDQDIPTEQDWDIVTEIENFE 636

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           ++ G WD+IYR+AR+
Sbjct: 637  DYERLEMDEVEERMERNPGVWDDIYRNARR 666


>ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
            cacao] gi|508727005|gb|EOY18902.1|
            UDP-Glycosyltransferase superfamily protein isoform 3
            [Theobroma cacao]
          Length = 1034

 Score =  273 bits (698), Expect = 9e-71
 Identities = 138/270 (51%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GY+DALQ + +RL L  GS+RHYG+D DVNG++LMADI+LYG+SQEEQGFP L+ RA
Sbjct: 406  STDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGTSQEEQGFPSLIIRA 465

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG HG+ FPKH   ALL AFS+LIS+G+LS  AQ+VASSG
Sbjct: 466  MTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLISNGRLSRFAQTVASSG 525

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+ECITGYA LLEN+ NFPSDVLLP  +S+L   SWE ++F  EIE  + +I
Sbjct: 526  RLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWEWNVFGMEIEHGTGDI 585

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
               +              LEEE T      +IS    E  ++DIPT  DWD         
Sbjct: 586  SRYF---------SVVYALEEEFTKHTISSDISQYGAEIQDQDIPTEQDWDIVTEIENFE 636

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           ++ G WD+IYR+AR+
Sbjct: 637  DYERLEMDEVEERMERNPGVWDDIYRNARR 666


>ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
            cacao] gi|508727004|gb|EOY18901.1|
            UDP-Glycosyltransferase superfamily protein isoform 2
            [Theobroma cacao]
          Length = 735

 Score =  273 bits (698), Expect = 9e-71
 Identities = 138/270 (51%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GY+DALQ + +RL L  GS+RHYG+D DVNG++LMADI+LYG+SQEEQGFP L+ RA
Sbjct: 406  STDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGTSQEEQGFPSLIIRA 465

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG HG+ FPKH   ALL AFS+LIS+G+LS  AQ+VASSG
Sbjct: 466  MTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLISNGRLSRFAQTVASSG 525

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+ECITGYA LLEN+ NFPSDVLLP  +S+L   SWE ++F  EIE  + +I
Sbjct: 526  RLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWEWNVFGMEIEHGTGDI 585

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
               +              LEEE T      +IS    E  ++DIPT  DWD         
Sbjct: 586  SRYF---------SVVYALEEEFTKHTISSDISQYGAEIQDQDIPTEQDWDIVTEIENFE 636

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           ++ G WD+IYR+AR+
Sbjct: 637  DYERLEMDEVEERMERNPGVWDDIYRNARR 666


>ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
            cacao] gi|508727003|gb|EOY18900.1|
            UDP-Glycosyltransferase superfamily protein isoform 1
            [Theobroma cacao]
          Length = 1041

 Score =  273 bits (698), Expect = 9e-71
 Identities = 138/270 (51%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GY+DALQ + +RL L  GS+RHYG+D DVNG++LMADI+LYG+SQEEQGFP L+ RA
Sbjct: 406  STDGYHDALQQVASRLGLTQGSVRHYGLDGDVNGVLLMADIVLYGTSQEEQGFPSLIIRA 465

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG HG+ FPKH   ALL AFS+LIS+G+LS  AQ+VASSG
Sbjct: 466  MTFGIPVITPDFPIMKKYVVDGTHGVFFPKHQPDALLRAFSLLISNGRLSRFAQTVASSG 525

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + L KN+LA+ECITGYA LLEN+ NFPSDVLLP  +S+L   SWE ++F  EIE  + +I
Sbjct: 526  RLLAKNILASECITGYASLLENLLNFPSDVLLPAPVSQLRLGSWEWNVFGMEIEHGTGDI 585

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
               +              LEEE T      +IS    E  ++DIPT  DWD         
Sbjct: 586  SRYF---------SVVYALEEEFTKHTISSDISQYGAEIQDQDIPTEQDWDIVTEIENFE 636

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           ++ G WD+IYR+AR+
Sbjct: 637  DYERLEMDEVEERMERNPGVWDDIYRNARR 666


>ref|XP_012447608.1| PREDICTED: uncharacterized protein LOC105770810 isoform X2 [Gossypium
            raimondii]
          Length = 889

 Score =  270 bits (689), Expect = 1e-69
 Identities = 138/272 (50%), Positives = 185/272 (68%), Gaps = 2/272 (0%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GYNDALQ + +RL LP GS+RHYG+D D NG+ILMADI+LYGSSQEEQGFPPL+ RA
Sbjct: 411  STDGYNDALQQVASRLGLPQGSVRHYGLDGDTNGVILMADIVLYGSSQEEQGFPPLIIRA 470

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG H + FPKHD  ALL AFS+LIS+G+LS  A++VASSG
Sbjct: 471  MTFGIPVITPDFPIVKKYVVDGAHCVFFPKHDPDALLRAFSLLISNGRLSKFAETVASSG 530

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKS--S 534
            + L KN+LA+ECITGYA LL N+  FPSDVLLP  +SEL+QASWE +LF+ EIE  +  +
Sbjct: 531  RLLAKNILASECITGYASLLVNLLYFPSDVLLPGPVSELQQASWEWNLFRKEIEHSNFDT 590

Query: 535  NIENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXX 714
            ++++  +            D  +  T L     ++G D+ DL  +I    D++       
Sbjct: 591  SVDSSVVYTVEEELTKHIIDTSKNRTELQDQDALTGQDL-DLVTEIENFEDYE------- 642

Query: 715  XXXXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                             + +G WDEIYR+ARK
Sbjct: 643  -----RLEMEEINERTERHLGVWDEIYRNARK 669


>ref|XP_012447607.1| PREDICTED: uncharacterized protein LOC105770810 isoform X1 [Gossypium
            raimondii] gi|763793485|gb|KJB60481.1| hypothetical
            protein B456_009G307600 [Gossypium raimondii]
            gi|763793486|gb|KJB60482.1| hypothetical protein
            B456_009G307600 [Gossypium raimondii]
            gi|763793487|gb|KJB60483.1| hypothetical protein
            B456_009G307600 [Gossypium raimondii]
          Length = 1045

 Score =  270 bits (689), Expect = 1e-69
 Identities = 138/272 (50%), Positives = 185/272 (68%), Gaps = 2/272 (0%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++GYNDALQ + +RL LP GS+RHYG+D D NG+ILMADI+LYGSSQEEQGFPPL+ RA
Sbjct: 411  STDGYNDALQQVASRLGLPQGSVRHYGLDGDTNGVILMADIVLYGSSQEEQGFPPLIIRA 470

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PD+P++KKYVVDG H + FPKHD  ALL AFS+LIS+G+LS  A++VASSG
Sbjct: 471  MTFGIPVITPDFPIVKKYVVDGAHCVFFPKHDPDALLRAFSLLISNGRLSKFAETVASSG 530

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKS--S 534
            + L KN+LA+ECITGYA LL N+  FPSDVLLP  +SEL+QASWE +LF+ EIE  +  +
Sbjct: 531  RLLAKNILASECITGYASLLVNLLYFPSDVLLPGPVSELQQASWEWNLFRKEIEHSNFDT 590

Query: 535  NIENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXX 714
            ++++  +            D  +  T L     ++G D+ DL  +I    D++       
Sbjct: 591  SVDSSVVYTVEEELTKHIIDTSKNRTELQDQDALTGQDL-DLVTEIENFEDYE------- 642

Query: 715  XXXXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                             + +G WDEIYR+ARK
Sbjct: 643  -----RLEMEEINERTERHLGVWDEIYRNARK 669


>ref|XP_012090324.1| PREDICTED: uncharacterized protein LOC105648510 isoform X2 [Jatropha
            curcas]
          Length = 901

 Score =  268 bits (685), Expect = 3e-69
 Identities = 133/264 (50%), Positives = 180/264 (68%)
 Frame = +1

Query: 19   DALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRAMSFGVP 198
            DALQ + +RL L HGS+RHYG++ DVN ++LMADI++YGSSQ+EQGFPPL+ RAM+FGV 
Sbjct: 430  DALQGVASRLGLLHGSVRHYGLNGDVNSVLLMADIVIYGSSQDEQGFPPLIIRAMTFGVL 489

Query: 199  VVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSGKSLVKN 378
            VV+PD PV+KKY++DGV+G++F KH+  AL+ AFS+LISDGKLS  AQ+VASSG+ L +N
Sbjct: 490  VVAPDVPVMKKYLIDGVYGLLFQKHNPEALMRAFSLLISDGKLSGFAQTVASSGRLLARN 549

Query: 379  VLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNIENLYIK 558
            +  +ECITGYA+LLEN+ +FPSD LLP  +S+L+Q  WE +LF+ EI Q + N   +  +
Sbjct: 550  MFVSECITGYARLLENLLSFPSDALLPGPLSKLQQKEWEWNLFRKEIAQGTDNFLGMDGR 609

Query: 559  XXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXXXXXXXX 738
                        LEEE+TNL+   NIS +  E L  D+PT +DWD               
Sbjct: 610  DSSYGGSSVVYFLEEELTNLIDSTNISANGTEILVPDLPTESDWDVLREIDSFEEYESLE 669

Query: 739  XXXXXXXXXKDIGEWDEIYRSARK 810
                     K  G WD++YR+AR+
Sbjct: 670  MEELQERMDKSPGVWDDLYRNARR 693


>ref|XP_012090316.1| PREDICTED: uncharacterized protein LOC105648510 isoform X1 [Jatropha
            curcas]
          Length = 1070

 Score =  268 bits (685), Expect = 3e-69
 Identities = 133/264 (50%), Positives = 180/264 (68%)
 Frame = +1

Query: 19   DALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRAMSFGVP 198
            DALQ + +RL L HGS+RHYG++ DVN ++LMADI++YGSSQ+EQGFPPL+ RAM+FGV 
Sbjct: 430  DALQGVASRLGLLHGSVRHYGLNGDVNSVLLMADIVIYGSSQDEQGFPPLIIRAMTFGVL 489

Query: 199  VVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSGKSLVKN 378
            VV+PD PV+KKY++DGV+G++F KH+  AL+ AFS+LISDGKLS  AQ+VASSG+ L +N
Sbjct: 490  VVAPDVPVMKKYLIDGVYGLLFQKHNPEALMRAFSLLISDGKLSGFAQTVASSGRLLARN 549

Query: 379  VLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNIENLYIK 558
            +  +ECITGYA+LLEN+ +FPSD LLP  +S+L+Q  WE +LF+ EI Q + N   +  +
Sbjct: 550  MFVSECITGYARLLENLLSFPSDALLPGPLSKLQQKEWEWNLFRKEIAQGTDNFLGMDGR 609

Query: 559  XXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXXXXXXXX 738
                        LEEE+TNL+   NIS +  E L  D+PT +DWD               
Sbjct: 610  DSSYGGSSVVYFLEEELTNLIDSTNISANGTEILVPDLPTESDWDVLREIDSFEEYESLE 669

Query: 739  XXXXXXXXXKDIGEWDEIYRSARK 810
                     K  G WD++YR+AR+
Sbjct: 670  MEELQERMDKSPGVWDDLYRNARR 693


>ref|XP_011649860.1| PREDICTED: uncharacterized protein LOC101206364 isoform X2 [Cucumis
            sativus]
          Length = 907

 Score =  268 bits (685), Expect = 3e-69
 Identities = 136/274 (49%), Positives = 186/274 (67%), Gaps = 4/274 (1%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++G +DAL++I +RL LP GS+ HYG++ DVN +++MADI+LYGSSQE Q FPPLL RA
Sbjct: 392  STDGSHDALKEIASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRA 451

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P++ PD P +K Y+VDGVHG+IFPKH+  ALL++FS +ISDGKLS  AQS+ASSG
Sbjct: 452  MSFGIPIMVPDLPALKNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQSIASSG 511

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNE----IEQK 528
            + L KN+LA+EC+TGYA+LLEN+ NFPSDV LP  +S+L+  +WE +LF+ E    I++ 
Sbjct: 512  RLLAKNILASECVTGYAQLLENVLNFPSDVKLPGPVSQLQLGAWEWNLFRKEMVKTIDEN 571

Query: 529  SSNIENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXX 708
            + N E +               LE ++TN V +  +S ++   LE+DIPT  DWD     
Sbjct: 572  ADNEERI----ATISKASVIFALEAQLTNSVNLTILSENENGTLEQDIPTPQDWDILEKI 627

Query: 709  XXXXXXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                               +D+G WDEIYR+ARK
Sbjct: 628  ESAEEYETVEMEEFQERMERDLGAWDEIYRNARK 661


>gb|KDP45060.1| hypothetical protein JCGZ_01560 [Jatropha curcas]
          Length = 893

 Score =  268 bits (685), Expect = 3e-69
 Identities = 133/264 (50%), Positives = 180/264 (68%)
 Frame = +1

Query: 19   DALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRAMSFGVP 198
            DALQ + +RL L HGS+RHYG++ DVN ++LMADI++YGSSQ+EQGFPPL+ RAM+FGV 
Sbjct: 253  DALQGVASRLGLLHGSVRHYGLNGDVNSVLLMADIVIYGSSQDEQGFPPLIIRAMTFGVL 312

Query: 199  VVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSGKSLVKN 378
            VV+PD PV+KKY++DGV+G++F KH+  AL+ AFS+LISDGKLS  AQ+VASSG+ L +N
Sbjct: 313  VVAPDVPVMKKYLIDGVYGLLFQKHNPEALMRAFSLLISDGKLSGFAQTVASSGRLLARN 372

Query: 379  VLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNIENLYIK 558
            +  +ECITGYA+LLEN+ +FPSD LLP  +S+L+Q  WE +LF+ EI Q + N   +  +
Sbjct: 373  MFVSECITGYARLLENLLSFPSDALLPGPLSKLQQKEWEWNLFRKEIAQGTDNFLGMDGR 432

Query: 559  XXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXXXXXXXX 738
                        LEEE+TNL+   NIS +  E L  D+PT +DWD               
Sbjct: 433  DSSYGGSSVVYFLEEELTNLIDSTNISANGTEILVPDLPTESDWDVLREIDSFEEYESLE 492

Query: 739  XXXXXXXXXKDIGEWDEIYRSARK 810
                     K  G WD++YR+AR+
Sbjct: 493  MEELQERMDKSPGVWDDLYRNARR 516


>ref|XP_004138684.1| PREDICTED: uncharacterized protein LOC101206364 isoform X1 [Cucumis
            sativus] gi|700207911|gb|KGN63030.1| hypothetical protein
            Csa_2G384990 [Cucumis sativus]
          Length = 1034

 Score =  268 bits (685), Expect = 3e-69
 Identities = 136/274 (49%), Positives = 186/274 (67%), Gaps = 4/274 (1%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            S++G +DAL++I +RL LP GS+ HYG++ DVN +++MADI+LYGSSQE Q FPPLL RA
Sbjct: 392  STDGSHDALKEIASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRA 451

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            MSFG+P++ PD P +K Y+VDGVHG+IFPKH+  ALL++FS +ISDGKLS  AQS+ASSG
Sbjct: 452  MSFGIPIMVPDLPALKNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQSIASSG 511

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNE----IEQK 528
            + L KN+LA+EC+TGYA+LLEN+ NFPSDV LP  +S+L+  +WE +LF+ E    I++ 
Sbjct: 512  RLLAKNILASECVTGYAQLLENVLNFPSDVKLPGPVSQLQLGAWEWNLFRKEMVKTIDEN 571

Query: 529  SSNIENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXX 708
            + N E +               LE ++TN V +  +S ++   LE+DIPT  DWD     
Sbjct: 572  ADNEERI----ATISKASVIFALEAQLTNSVNLTILSENENGTLEQDIPTPQDWDILEKI 627

Query: 709  XXXXXXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                               +D+G WDEIYR+ARK
Sbjct: 628  ESAEEYETVEMEEFQERMERDLGAWDEIYRNARK 661


>ref|XP_011469406.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101310943
            [Fragaria vesca subsp. vesca]
          Length = 1036

 Score =  266 bits (681), Expect = 9e-69
 Identities = 134/270 (49%), Positives = 181/270 (67%)
 Frame = +1

Query: 1    SSNGYNDALQDIGTRLELPHGSLRHYGIDNDVNGLILMADIILYGSSQEEQGFPPLLTRA 180
            SSNGY+DA Q++ +RL L  GSLRHYG++ DVN ++ MADI+LYGS+Q+EQGFPPLL RA
Sbjct: 390  SSNGYDDAFQEVASRLGLHQGSLRHYGLNGDVNSVLSMADIVLYGSAQDEQGFPPLLIRA 449

Query: 181  MSFGVPVVSPDYPVIKKYVVDGVHGIIFPKHDARALLNAFSILISDGKLSDDAQSVASSG 360
            M+FG+PV++PDYPV+KKYVVDGVH I+F +HD  ALL AFS++IS+ KLS  AQ+VASSG
Sbjct: 450  MTFGIPVIAPDYPVLKKYVVDGVHMILFQRHDPDALLKAFSLMISNEKLSKFAQTVASSG 509

Query: 361  KSLVKNVLAAECITGYAKLLENIFNFPSDVLLPVHISELEQASWECSLFKNEIEQKSSNI 540
            + +  N+LA+E ITGYA+LLE++  FPSD LLP  +S+L+Q +WE +LF +EI+  + ++
Sbjct: 510  RLIAMNLLASESITGYARLLESVLKFPSDALLPGPLSQLQQGTWEWNLFGSEIDSGTGDM 569

Query: 541  ENLYIKXXXXXXXXXXNDLEEEITNLVPVKNISGDDIEDLEEDIPTVADWDXXXXXXXXX 720
             N+             + LEEE +       IS +  E    DIPT  DWD         
Sbjct: 570  LNINENQASLENSSVVHALEEEFSGFSYSTKISENGTEIFAHDIPTQLDWDILREIELSE 629

Query: 721  XXXXXXXXXXXXXXXKDIGEWDEIYRSARK 810
                           +D G+WD+IYR+ARK
Sbjct: 630  EYERVEMEELAERMERDPGQWDDIYRNARK 659


Top