BLASTX nr result

ID: Mentha25_contig00035828 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00035828
         (469 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230707.1| PREDICTED: uncharacterized protein LOC101268...   146   3e-33
ref|XP_006346322.1| PREDICTED: QWRF motif-containing protein 2-l...   144   1e-32
gb|EYU37581.1| hypothetical protein MIMGU_mgv1a002837mg [Mimulus...   142   4e-32
ref|XP_004171185.1| PREDICTED: uncharacterized LOC101215899 [Cuc...   124   1e-26
ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215...   124   1e-26
gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]     122   7e-26
ref|XP_007199730.1| hypothetical protein PRUPE_ppa002521mg [Prun...   122   7e-26
ref|XP_004496886.1| PREDICTED: flocculation protein FLO11-like i...   118   1e-24
ref|XP_004496885.1| PREDICTED: flocculation protein FLO11-like i...   118   1e-24
ref|XP_002263972.1| PREDICTED: uncharacterized protein LOC100242...   118   1e-24
emb|CAN69354.1| hypothetical protein VITISV_014039 [Vitis vinifera]   118   1e-24
gb|EPS70660.1| hypothetical protein M569_04100, partial [Genlise...   116   4e-24
ref|XP_007143104.1| hypothetical protein PHAVU_007G043900g [Phas...   114   1e-23
ref|XP_006589620.1| PREDICTED: QWRF motif-containing protein 2-l...   113   3e-23
ref|XP_003536586.1| PREDICTED: QWRF motif-containing protein 2-l...   113   3e-23
ref|XP_007042618.1| Family of Uncharacterized protein function, ...   112   5e-23
ref|XP_007042617.1| Family of Uncharacterized protein function, ...   112   5e-23
ref|XP_007042616.1| Family of Uncharacterized protein function, ...   112   5e-23
ref|XP_007042615.1| Family of Uncharacterized protein function (...   112   5e-23
ref|XP_002527498.1| conserved hypothetical protein [Ricinus comm...   112   7e-23

>ref|XP_004230707.1| PREDICTED: uncharacterized protein LOC101268323 [Solanum
           lycopersicum]
          Length = 641

 Score =  146 bits (368), Expect = 3e-33
 Identities = 91/168 (54%), Positives = 107/168 (63%), Gaps = 13/168 (7%)
 Frame = -3

Query: 467 PSRGKTDIVG--ENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSL 294
           P RGK D     ENSKP DQHRWP R R  NPL+RS+DCS   G+R  ++ GS N IR+L
Sbjct: 189 PLRGKADGADQLENSKPVDQHRWPGRSRQGNPLARSLDCSN--GDRHKVI-GSGNVIRTL 245

Query: 293 QNSMIDERRPSVDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXXXXXXXX 132
           Q SMIDERR S DGR+SLD G++E  K      D N+ N++  +P +L            
Sbjct: 246 QQSMIDERRASFDGRLSLDFGNAEPLKAVEQAQDVNSANNDSTLPSDLTASDTDSVSSGS 305

Query: 131 XXGIQE---SQRQNG--RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE   S R  G  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 306 TG-MQECGGSSRIRGVPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 352


>ref|XP_006346322.1| PREDICTED: QWRF motif-containing protein 2-like [Solanum tuberosum]
          Length = 641

 Score =  144 bits (364), Expect = 1e-32
 Identities = 91/168 (54%), Positives = 107/168 (63%), Gaps = 13/168 (7%)
 Frame = -3

Query: 467 PSRGKTDIVG--ENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSL 294
           P RGK D     ENSKP DQHRWP R R  N L+RS+DCS   G+R  ++ GS N IR+L
Sbjct: 189 PLRGKADGADQLENSKPVDQHRWPGRSRQGNLLARSLDCSN--GDRHKVI-GSGNVIRTL 245

Query: 293 QNSMIDERRPSVDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXXXXXXXX 132
           Q SMIDERR S DGR+SLDLG++E  K      D N+ N++  +P +L            
Sbjct: 246 QQSMIDERRASFDGRLSLDLGNAEPLKAVEQAQDVNSANNDSTLPSDLTASDTDSVSSGS 305

Query: 131 XXGIQE---SQRQNG--RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE   S R  G  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 306 TG-VQECGGSSRIRGVPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 352


>gb|EYU37581.1| hypothetical protein MIMGU_mgv1a002837mg [Mimulus guttatus]
          Length = 632

 Score =  142 bits (359), Expect = 4e-32
 Identities = 91/161 (56%), Positives = 106/161 (65%), Gaps = 6/161 (3%)
 Frame = -3

Query: 467 PSRGKTDIVGENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSLQN 288
           P RG  D   E+SK SDQHRWPAR+R  NPLS SMD S   GERS L+S S + +R+LQ 
Sbjct: 193 PLRGGGDQAAEHSKLSDQHRWPARNRSTNPLSMSMDFSTVNGERSKLIS-SMSGVRALQQ 251

Query: 287 S-MIDERRPSVDGRMSLDL--GHSELPKDGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQ 117
           S MIDERRPS+DGR SLDL  G+S+   DG++VN+E                     G+Q
Sbjct: 252 SIMIDERRPSLDGRSSLDLGCGNSQRASDGSSVNNEA--------SDSDSVSSGSTSGVQ 303

Query: 116 E-SQRQNG--RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
           E S   NG  RGI  SA+FWQETNSR+RRLQDPGSPLSTSP
Sbjct: 304 ECSGVSNGSRRGIVGSARFWQETNSRMRRLQDPGSPLSTSP 344


>ref|XP_004171185.1| PREDICTED: uncharacterized LOC101215899 [Cucumis sativus]
          Length = 514

 Score =  124 bits (311), Expect = 1e-26
 Identities = 85/174 (48%), Positives = 106/174 (60%), Gaps = 19/174 (10%)
 Frame = -3

Query: 467 PSRGKTDIVG---ENSKPSDQHRWPARHRLVN----PLSRSMDCSGNFGERSSLMSGSAN 309
           P R K+D  G   ENSK  DQHRWPAR+R  N    PLSRS DC G   + + +  GS  
Sbjct: 194 PLRDKSDGSGVQVENSKLLDQHRWPARNRHANLEGNPLSRSFDCGGEQKKVNGI--GSGM 251

Query: 308 AIRSLQNSMIDE-RRPSVDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXX 150
            +R+LQ ++ D+ RR S DGR+SLDL  SEL K      D ++VN E  +P +L      
Sbjct: 252 VVRALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVN-ESSVPSDLTTSDTD 310

Query: 149 XXXXXXXXGIQE----SQRQNG-RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
                   G+Q+    ++ +NG RGI VSA+FWQETNSRLRRL DPGSPLSTSP
Sbjct: 311 SVSSGSTSGVQDCGSVAKGRNGPRGIVVSARFWQETNSRLRRLHDPGSPLSTSP 364


>ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215899 [Cucumis sativus]
          Length = 667

 Score =  124 bits (311), Expect = 1e-26
 Identities = 85/174 (48%), Positives = 106/174 (60%), Gaps = 19/174 (10%)
 Frame = -3

Query: 467 PSRGKTDIVG---ENSKPSDQHRWPARHRLVN----PLSRSMDCSGNFGERSSLMSGSAN 309
           P R K+D  G   ENSK  DQHRWPAR+R  N    PLSRS DC G   + + +  GS  
Sbjct: 194 PLRDKSDGSGVQVENSKLLDQHRWPARNRHANLEGNPLSRSFDCGGEQKKVNGI--GSGM 251

Query: 308 AIRSLQNSMIDE-RRPSVDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXX 150
            +R+LQ ++ D+ RR S DGR+SLDL  SEL K      D ++VN E  +P +L      
Sbjct: 252 VVRALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVN-ESSVPSDLTTSDTD 310

Query: 149 XXXXXXXXGIQE----SQRQNG-RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
                   G+Q+    ++ +NG RGI VSA+FWQETNSRLRRL DPGSPLSTSP
Sbjct: 311 SVSSGSTSGVQDCGSVAKGRNGPRGIVVSARFWQETNSRLRRLHDPGSPLSTSP 364


>gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]
          Length = 670

 Score =  122 bits (305), Expect = 7e-26
 Identities = 83/170 (48%), Positives = 98/170 (57%), Gaps = 15/170 (8%)
 Frame = -3

Query: 467 PSRGKTDIVGENSKPSDQHRWPARHRLVNP--------LSRSMDCSGNFGERSSLMSGSA 312
           P RG      ENSKP DQHRWPAR R  N         LSRS+D       R      S 
Sbjct: 188 PLRGGERDQLENSKPGDQHRWPARTRQGNSNSSNSNPLLSRSVDFGAGGDGRKLNGFRSG 247

Query: 311 NAIRSLQNSMIDE-RRPSVDGRMSLDLGHSELPKDGNTVNSEVLMPCELXXXXXXXXXXX 135
             +R+LQ S++DE RR S DGR+SLDLG +EL K  N+ N+E   P +L           
Sbjct: 248 TVVRALQQSLLDETRRSSFDGRLSLDLGSAELLKV-NSSNNESSAPSDLTASDTDSVSSG 306

Query: 134 XXXGIQE----SQRQNG--RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              G+Q+    S+ + G  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 307 STSGMQDANGVSKARTGTPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 356


>ref|XP_007199730.1| hypothetical protein PRUPE_ppa002521mg [Prunus persica]
           gi|462395130|gb|EMJ00929.1| hypothetical protein
           PRUPE_ppa002521mg [Prunus persica]
          Length = 662

 Score =  122 bits (305), Expect = 7e-26
 Identities = 80/162 (49%), Positives = 98/162 (60%), Gaps = 17/162 (10%)
 Frame = -3

Query: 437 ENSKPSDQHRWPARHRLV-----NPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE 273
           ENSKPSDQ+RWPAR R +     N LSRS+DCS    + + +  GS  A R+LQ SMID+
Sbjct: 200 ENSKPSDQYRWPARTRQLSSGSNNSLSRSLDCSSETRKLNGI--GSGVAARALQQSMIDD 257

Query: 272 -RRPSVDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQE 114
            RR S D R+SLDLG++E  K      D N+ N   + P +L              G+ +
Sbjct: 258 SRRASFDRRLSLDLGNAEPLKAAEQNPDANSANDSSV-PSDLTASDTDSVSSGSTSGVHD 316

Query: 113 S-----QRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
           +      R   RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 317 AGGVAKSRTAPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 358


>ref|XP_004496886.1| PREDICTED: flocculation protein FLO11-like isoform X2 [Cicer
           arietinum]
          Length = 610

 Score =  118 bits (295), Expect = 1e-24
 Identities = 74/157 (47%), Positives = 93/157 (59%), Gaps = 11/157 (7%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRLVNPLSRSMDCSGNFG---ERSSLM---SGSANAIRSLQNSMI 279
           GENS+PSDQHRWPAR R VN LSRS+DC G  G   E+  ++   +G+   +R+LQ SM+
Sbjct: 159 GENSRPSDQHRWPARSRQVNQLSRSVDCGGGGGDGDEKKKVVGNGNGNGKVVRALQQSMV 218

Query: 278 DE---RRPSVDGR--MSLDLGHSELPKDGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQE 114
            E   RR S DG   +SLDLG + +  +   +NS  +   +                   
Sbjct: 219 MESGRRRVSFDGLRGLSLDLGKT-VELNEPCLNSVDVNASDTDSVSSGSTSGAHDSLGTN 277

Query: 113 SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 278 KVSKESRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 314


>ref|XP_004496885.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Cicer
           arietinum]
          Length = 611

 Score =  118 bits (295), Expect = 1e-24
 Identities = 74/157 (47%), Positives = 93/157 (59%), Gaps = 11/157 (7%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRLVNPLSRSMDCSGNFG---ERSSLM---SGSANAIRSLQNSMI 279
           GENS+PSDQHRWPAR R VN LSRS+DC G  G   E+  ++   +G+   +R+LQ SM+
Sbjct: 159 GENSRPSDQHRWPARSRQVNQLSRSVDCGGGGGDGDEKKKVVGNGNGNGKVVRALQQSMV 218

Query: 278 DE---RRPSVDGR--MSLDLGHSELPKDGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQE 114
            E   RR S DG   +SLDLG + +  +   +NS  +   +                   
Sbjct: 219 MESGRRRVSFDGLRGLSLDLGKT-VELNEPCLNSVDVNASDTDSVSSGSTSGAHDSLGTN 277

Query: 113 SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 278 KVSKESRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 314


>ref|XP_002263972.1| PREDICTED: uncharacterized protein LOC100242868 [Vitis vinifera]
          Length = 639

 Score =  118 bits (295), Expect = 1e-24
 Identities = 79/157 (50%), Positives = 94/157 (59%), Gaps = 12/157 (7%)
 Frame = -3

Query: 437 ENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE-RRPS 261
           ENS+P     WP R R VN L+RS DCS +   + S+  GS   + S Q SMIDE RR S
Sbjct: 186 ENSRP-----WPGRSRSVNVLARSFDCSVD--RKKSI--GSGIVVGSFQQSMIDESRRAS 236

Query: 260 VDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQE----- 114
            DGR+SLDLG++EL K      DGN+ N   + P +L              G+QE     
Sbjct: 237 FDGRLSLDLGNAELLKVTKQDPDGNSANDSSV-PTDLTASDTDSVSSGSTSGLQECAGVS 295

Query: 113 SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
            +R   RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 296 GRRSGPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 332


>emb|CAN69354.1| hypothetical protein VITISV_014039 [Vitis vinifera]
          Length = 601

 Score =  118 bits (295), Expect = 1e-24
 Identities = 79/157 (50%), Positives = 94/157 (59%), Gaps = 12/157 (7%)
 Frame = -3

Query: 437 ENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE-RRPS 261
           ENS+P     WP R R VN L+RS DCS +   + S+  GS   + S Q SMIDE RR S
Sbjct: 186 ENSRP-----WPGRSRSVNVLARSFDCSVD--RKKSI--GSGIVVGSFQQSMIDESRRAS 236

Query: 260 VDGRMSLDLGHSELPK------DGNTVNSEVLMPCELXXXXXXXXXXXXXXGIQE----- 114
            DGR+SLDLG++EL K      DGN+ N   + P +L              G+QE     
Sbjct: 237 FDGRLSLDLGNAELLKVTKQDPDGNSANDSSV-PTDLTASDTDSVSSGSTSGLQECAGVS 295

Query: 113 SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
            +R   RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 296 GRRSGPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 332


>gb|EPS70660.1| hypothetical protein M569_04100, partial [Genlisea aurea]
          Length = 637

 Score =  116 bits (290), Expect = 4e-24
 Identities = 76/163 (46%), Positives = 98/163 (60%), Gaps = 8/163 (4%)
 Frame = -3

Query: 467 PSRGKTDIVGENS---KPSDQHRWPARHRLVN-PLSRSMDCSGNFGERSSLMSGSANAIR 300
           PSR + +  G+ +   K +DQHRWP R+RLVN PLS+S++ SG   ++   + GS  +IR
Sbjct: 188 PSRVRAEGGGDPADVFKLADQHRWPGRNRLVNNPLSKSLNYSGAADDKRIELIGSGQSIR 247

Query: 299 SLQNSMI-DERRPSVDGRMSLDLGHSELPKD---GNTVNSEVLMPCELXXXXXXXXXXXX 132
           SLQ SMI DERR S DGR+ LDL  S+L K+   G    +    P +             
Sbjct: 248 SLQQSMIIDERRTSFDGRLCLDLDSSDLLKEFPRGGVDRNGDYNPSDSDSASSGSTTGVH 307

Query: 131 XXGIQESQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
             G   S   + RG+ VSA+FWQETNSRLRRLQDP SPLS+SP
Sbjct: 308 DSGGGSSLLNDARGMAVSARFWQETNSRLRRLQDPVSPLSSSP 350


>ref|XP_007143104.1| hypothetical protein PHAVU_007G043900g [Phaseolus vulgaris]
           gi|561016294|gb|ESW15098.1| hypothetical protein
           PHAVU_007G043900g [Phaseolus vulgaris]
          Length = 636

 Score =  114 bits (286), Expect = 1e-23
 Identities = 75/184 (40%), Positives = 100/184 (54%), Gaps = 31/184 (16%)
 Frame = -3

Query: 461 RGKTDIVGENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSM 282
           R  T + GENS+P+DQHRWPAR R  + LS+S+D S    ++  + +G    +R+LQ SM
Sbjct: 153 RRATPVKGENSRPADQHRWPARTRQADHLSKSVDISD---KKKVVGNGFGKVVRALQKSM 209

Query: 281 I---DERRPSVDGR--MSLDLGHSELPKDGNTVNSEVL---------------------M 180
           +   ++RR S+DG   +SLDLG +EL K  +  N+  +                     M
Sbjct: 210 VVEGEKRRASLDGLGGLSLDLGKAELLKGNSNANNNSITNANNHNNNDGGGGNLVSKSSM 269

Query: 179 PCELXXXXXXXXXXXXXXGIQESQR-----QNGRGIFVSAKFWQETNSRLRRLQDPGSPL 15
            C+L              G  +S       +  RGI VSA+FWQETNSRLRRLQDPGSPL
Sbjct: 270 SCDLTASDTDSVSSGSTSGAHDSSGSVKGPKEPRGIVVSARFWQETNSRLRRLQDPGSPL 329

Query: 14  STSP 3
           STSP
Sbjct: 330 STSP 333


>ref|XP_006589620.1| PREDICTED: QWRF motif-containing protein 2-like isoform X2 [Glycine
           max]
          Length = 614

 Score =  113 bits (282), Expect = 3e-23
 Identities = 75/169 (44%), Positives = 100/169 (59%), Gaps = 16/169 (9%)
 Frame = -3

Query: 461 RGKTDIVGENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANA----IRSL 294
           R  T + GENS+P+DQHRWPAR R V+ LS+S+D   N  ++  + +G+ N     +R+L
Sbjct: 145 RRATPVKGENSRPADQHRWPARTRHVDHLSKSVDIIDN--KKKVVGNGNGNGFGKVVRAL 202

Query: 293 QNSMI---DERRPSVDGR--MSLDLGHSELPKDGNTVNS--EVLMPCELXXXXXXXXXXX 135
           Q SM+   ++RR S DG   +SLDLG +EL K     N+  +  +  +L           
Sbjct: 203 QQSMVVEGEKRRASFDGLGGLSLDLGKAELLKGNINANNHNKSSLASDLTASDTDSVSSG 262

Query: 134 XXXGIQESQ-----RQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              G  +S       +  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 263 STSGAHDSSGAAKGTKEPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 311


>ref|XP_003536586.1| PREDICTED: QWRF motif-containing protein 2-like isoform X1 [Glycine
           max]
          Length = 613

 Score =  113 bits (282), Expect = 3e-23
 Identities = 75/169 (44%), Positives = 100/169 (59%), Gaps = 16/169 (9%)
 Frame = -3

Query: 461 RGKTDIVGENSKPSDQHRWPARHRLVNPLSRSMDCSGNFGERSSLMSGSANA----IRSL 294
           R  T + GENS+P+DQHRWPAR R V+ LS+S+D   N  ++  + +G+ N     +R+L
Sbjct: 145 RRATPVKGENSRPADQHRWPARTRHVDHLSKSVDIIDN--KKKVVGNGNGNGFGKVVRAL 202

Query: 293 QNSMI---DERRPSVDGR--MSLDLGHSELPKDGNTVNS--EVLMPCELXXXXXXXXXXX 135
           Q SM+   ++RR S DG   +SLDLG +EL K     N+  +  +  +L           
Sbjct: 203 QQSMVVEGEKRRASFDGLGGLSLDLGKAELLKGNINANNHNKSSLASDLTASDTDSVSSG 262

Query: 134 XXXGIQESQ-----RQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              G  +S       +  RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 263 STSGAHDSSGAAKGTKEPRGIVVSARFWQETNSRLRRLQDPGSPLSTSP 311


>ref|XP_007042618.1| Family of Uncharacterized protein function, putative isoform 4
           [Theobroma cacao] gi|508706553|gb|EOX98449.1| Family of
           Uncharacterized protein function, putative isoform 4
           [Theobroma cacao]
          Length = 517

 Score =  112 bits (280), Expect = 5e-23
 Identities = 82/170 (48%), Positives = 93/170 (54%), Gaps = 24/170 (14%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRL----VNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE 273
           GENSKP DQHRWP R R      NPLSRS+D S    ER    SG+  A    Q+ M+DE
Sbjct: 216 GENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS---ERKMFGSGAIVAKSLQQSMMLDE 272

Query: 272 --RRPSVDG--RMSLDLGHS-ELPK-------DGNTVNSEVLMPCELXXXXXXXXXXXXX 129
             RR S DG  R+SLDLG S EL K       D N++N    + C+L             
Sbjct: 273 SSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSGST 332

Query: 128 XG-IQE-------SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE         R   R I VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 333 NSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSP 382


>ref|XP_007042617.1| Family of Uncharacterized protein function, putative isoform 3,
           partial [Theobroma cacao] gi|508706552|gb|EOX98448.1|
           Family of Uncharacterized protein function, putative
           isoform 3, partial [Theobroma cacao]
          Length = 590

 Score =  112 bits (280), Expect = 5e-23
 Identities = 82/170 (48%), Positives = 93/170 (54%), Gaps = 24/170 (14%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRL----VNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE 273
           GENSKP DQHRWP R R      NPLSRS+D S    ER    SG+  A    Q+ M+DE
Sbjct: 216 GENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS---ERKMFGSGAIVAKSLQQSMMLDE 272

Query: 272 --RRPSVDG--RMSLDLGHS-ELPK-------DGNTVNSEVLMPCELXXXXXXXXXXXXX 129
             RR S DG  R+SLDLG S EL K       D N++N    + C+L             
Sbjct: 273 SSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSGST 332

Query: 128 XG-IQE-------SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE         R   R I VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 333 NSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSP 382


>ref|XP_007042616.1| Family of Uncharacterized protein function, putative isoform 2
           [Theobroma cacao] gi|508706551|gb|EOX98447.1| Family of
           Uncharacterized protein function, putative isoform 2
           [Theobroma cacao]
          Length = 571

 Score =  112 bits (280), Expect = 5e-23
 Identities = 82/170 (48%), Positives = 93/170 (54%), Gaps = 24/170 (14%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRL----VNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE 273
           GENSKP DQHRWP R R      NPLSRS+D S    ER    SG+  A    Q+ M+DE
Sbjct: 216 GENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS---ERKMFGSGAIVAKSLQQSMMLDE 272

Query: 272 --RRPSVDG--RMSLDLGHS-ELPK-------DGNTVNSEVLMPCELXXXXXXXXXXXXX 129
             RR S DG  R+SLDLG S EL K       D N++N    + C+L             
Sbjct: 273 SSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSGST 332

Query: 128 XG-IQE-------SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE         R   R I VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 333 NSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSP 382


>ref|XP_007042615.1| Family of Uncharacterized protein function (DUF566), putative
           isoform 1 [Theobroma cacao] gi|508706550|gb|EOX98446.1|
           Family of Uncharacterized protein function (DUF566),
           putative isoform 1 [Theobroma cacao]
          Length = 684

 Score =  112 bits (280), Expect = 5e-23
 Identities = 82/170 (48%), Positives = 93/170 (54%), Gaps = 24/170 (14%)
 Frame = -3

Query: 440 GENSKPSDQHRWPARHRL----VNPLSRSMDCSGNFGERSSLMSGSANAIRSLQNSMIDE 273
           GENSKP DQHRWP R R      NPLSRS+D S    ER    SG+  A    Q+ M+DE
Sbjct: 216 GENSKPVDQHRWPGRTRQGNSGTNPLSRSLDYSS---ERKMFGSGAIVAKSLQQSMMLDE 272

Query: 272 --RRPSVDG--RMSLDLGHS-ELPK-------DGNTVNSEVLMPCELXXXXXXXXXXXXX 129
             RR S DG  R+SLDLG S EL K       D N++N    + C+L             
Sbjct: 273 SSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSGST 332

Query: 128 XG-IQE-------SQRQNGRGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
              +QE         R   R I VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 333 NSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSP 382


>ref|XP_002527498.1| conserved hypothetical protein [Ricinus communis]
           gi|223533138|gb|EEF34896.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 634

 Score =  112 bits (279), Expect = 7e-23
 Identities = 77/172 (44%), Positives = 98/172 (56%), Gaps = 17/172 (9%)
 Frame = -3

Query: 467 PSRGKTDIV---GENSKPSDQHRWPARHRLVN--------PLSRSMDCSGNFGERSSLMS 321
           P R K+  V   GENS+P DQHRWP R R  N         LSRS DCS   G+   +M 
Sbjct: 170 PERRKSTPVRDQGENSRPLDQHRWPGRSRGGNLALNERNPSLSRSFDCSVG-GDEKRVMG 228

Query: 320 GSANAIRSLQNSMIDERRPSVDGRMSLDLGHSELPKDGNTVNSEVLMPCELXXXXXXXXX 141
               +++SLQ SMI + R     R+SLDLG+++   D N+  S+  +  +L         
Sbjct: 229 SGFMSVKSLQQSMIVDER-----RLSLDLGNAKRNPDVNSSVSDSFVTGDLTASDSDSVS 283

Query: 140 XXXXXGIQE-----SQRQNG-RGIFVSAKFWQETNSRLRRLQDPGSPLSTSP 3
                G+Q+     S+ + G RGI VSA+FWQETNSRLRRLQDPGSPLSTSP
Sbjct: 284 SGSTSGLQDFGSGISRAKTGPRGIAVSARFWQETNSRLRRLQDPGSPLSTSP 335

