BLASTX nr result

ID: Mentha24_contig00042940 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00042940
         (1019 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276395.2| PREDICTED: uncharacterized protein LOC100244...   131   4e-28
emb|CAN65086.1| hypothetical protein VITISV_035031 [Vitis vinifera]   131   4e-28
ref|XP_006350882.1| PREDICTED: uncharacterized protein LOC102603...   123   1e-25
ref|XP_004242481.1| PREDICTED: uncharacterized protein LOC101245...   119   2e-24
gb|EPS64490.1| hypothetical protein M569_10294 [Genlisea aurea]       110   7e-22
gb|EYU43630.1| hypothetical protein MIMGU_mgv1a024583mg [Mimulus...    96   3e-17
gb|EXC19537.1| hypothetical protein L484_010668 [Morus notabilis]      94   9e-17
ref|XP_007146880.1| hypothetical protein PHAVU_006G078200g [Phas...    88   6e-15
ref|XP_004500362.1| PREDICTED: uncharacterized protein LOC101503...    82   4e-13
ref|XP_007026325.1| Telomeric repeat binding-like protein isofor...    76   3e-11
ref|XP_007026323.1| Uncharacterized protein isoform 1 [Theobroma...    76   3e-11
ref|XP_002518779.1| hypothetical protein RCOM_0813700 [Ricinus c...    75   3e-11
ref|XP_002319963.2| hypothetical protein POPTR_0013s15060g [Popu...    74   1e-10
ref|XP_006376552.1| hypothetical protein POPTR_0013s15060g [Popu...    74   1e-10
ref|XP_006486662.1| PREDICTED: uncharacterized protein LOC102608...    73   2e-10
ref|XP_006422498.1| hypothetical protein CICLE_v10028601mg [Citr...    71   8e-10
gb|EXC19536.1| hypothetical protein L484_010667 [Morus notabilis]      69   2e-09
ref|XP_002519314.1| telomeric repeat binding protein, putative [...    64   1e-07
ref|XP_007026326.1| Uncharacterized protein isoform 4 [Theobroma...    61   6e-07

>ref|XP_002276395.2| PREDICTED: uncharacterized protein LOC100244907 [Vitis vinifera]
            gi|297745761|emb|CBI15817.3| unnamed protein product
            [Vitis vinifera]
          Length = 479

 Score =  131 bits (330), Expect = 4e-28
 Identities = 98/269 (36%), Positives = 135/269 (50%), Gaps = 39/269 (14%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI-- 844
            ++LLR++ES+ S GS+SE+ LELLE ++E+D + G V++  +MK AYCAVAV+C +K   
Sbjct: 42   TVLLRKIESEISDGSVSETILELLEIIEELDYKEG-VAVLDSMKNAYCAVAVECTVKFLV 100

Query: 843  -----------------------------NGLGSELLWVWKDDVEATVWDDGVHKSVMKK 751
                                          GL S+ L  W+DD+EA VWD  V + ++ K
Sbjct: 101  GSGGKEGKYFDAVKRIWRGKIHKMESSATAGLVSDQLRKWRDDIEAAVWDARVCEDILAK 160

Query: 750  FRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDG-----FQASPPSPRVD 586
                 A   VR YV E    +GP FLEL A  +K V G+ P  G      QA+  SP V 
Sbjct: 161  NTRNDALRLVRAYVAEAWAIMGPPFLELAARAIKLVEGL-PGAGNGSTCNQAAACSPNVA 219

Query: 585  P---AEMKKEEVMLGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPSKRSYNLPGIAEV 415
                   K +E +    L   +H    G  +  GVK+ D+ E   + S   Y+     EV
Sbjct: 220  TDLVVPDKDKETLKASMLPKRKHVGGHGRRSRGGVKITDTEEVRGQTSGSKYDCLPSPEV 279

Query: 414  DKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            D+   ALKSS +E +A V DPLPE LQ A
Sbjct: 280  DRVQAALKSSSLELQALVKDPLPEALQLA 308


>emb|CAN65086.1| hypothetical protein VITISV_035031 [Vitis vinifera]
          Length = 444

 Score =  131 bits (330), Expect = 4e-28
 Identities = 98/269 (36%), Positives = 135/269 (50%), Gaps = 39/269 (14%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI-- 844
            ++LLR++ES+ S GS+SE+ LELLE ++E+D + G V++  +MK AYCAVAV+C +K   
Sbjct: 42   TVLLRKIESEISDGSVSETILELLEIIEELDYKEG-VAVLDSMKNAYCAVAVECTVKFLV 100

Query: 843  -----------------------------NGLGSELLWVWKDDVEATVWDDGVHKSVMKK 751
                                          GL S+ L  W+DD+EA VWD  V + ++ K
Sbjct: 101  GSGGKEGKYFDAVKRIWRGKIHKMESSATAGLVSDQLRKWRDDIEAAVWDARVCEDILAK 160

Query: 750  FRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDG-----FQASPPSPRVD 586
                 A   VR YV E    +GP FLEL A  +K V G+ P  G      QA+  SP V 
Sbjct: 161  NTRNDALRLVRAYVAEAWAIMGPPFLELAARAIKLVEGL-PGAGNGSTCNQAAACSPNVA 219

Query: 585  P---AEMKKEEVMLGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPSKRSYNLPGIAEV 415
                   K +E +    L   +H    G  +  GVK+ D+ E   + S   Y+     EV
Sbjct: 220  TDLVVPDKDKETLKASMLPKRKHVGGHGRRSRGGVKITDTEEVRGQTSGSKYDCLPSPEV 279

Query: 414  DKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            D+   ALKSS +E +A V DPLPE LQ A
Sbjct: 280  DRVQAALKSSSLELQALVKDPLPEALQLA 308


>ref|XP_006350882.1| PREDICTED: uncharacterized protein LOC102603861 [Solanum tuberosum]
          Length = 468

 Score =  123 bits (308), Expect = 1e-25
 Identities = 91/271 (33%), Positives = 137/271 (50%), Gaps = 41/271 (15%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLK--- 847
            +LL+R++ES+ S GS++E  LE LE ++E++ + G +  S  MK AYCAVAV+C +K   
Sbjct: 48   ALLIRKIESEISNGSVNEKILEFLELIEELNHQDG-IEASEVMKAAYCAVAVECTVKFLN 106

Query: 846  ----------------------IN--------GLGSELLWVWKDDVEATVWDDGVHKSVM 757
                                  IN        G  SE LW W+D++EA +WDD    SV+
Sbjct: 107  SEGTGGNKGKYFDAVRRIWKRRINLMEKMENVGFVSEELWSWRDEIEAALWDDKCSYSVI 166

Query: 756  KKFRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAE 577
            +K +G  A E V+ +VRE  E +G  FL++VAE  +             S  + +    E
Sbjct: 167  RKSKGIVAVELVKFFVREAKERMGSPFLDVVAETYQ-------------SDETMKALFGE 213

Query: 576  MKKEEVMLGDELD-----GVQHSKTMGSANHR-GVKLEDSNETTAEPS--KRSYNLPGIA 421
            + KE    G++ +      ++  K +     R G+++ DS E+  E S       LP  A
Sbjct: 214  VNKEGACCGNDREVSKGSALRRKKHVAFKRTRGGIRISDSIESELEASGGGGQDGLPPSA 273

Query: 420  EVDKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            E+ KA +ALK S +E RA V DPLP+ L+ A
Sbjct: 274  EIQKAEKALKLSSLELRAMVKDPLPDALRLA 304


>ref|XP_004242481.1| PREDICTED: uncharacterized protein LOC101245372 [Solanum
            lycopersicum]
          Length = 471

 Score =  119 bits (299), Expect = 2e-24
 Identities = 90/265 (33%), Positives = 131/265 (49%), Gaps = 35/265 (13%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCML---- 850
            +LL+R++ES+ S GS++E  L+ LE ++E++ + G +  S  MK AYCAVAV+C +    
Sbjct: 48   ALLIRKIESEISNGSVNEKILDFLELIEELNHQDG-IEASEVMKAAYCAVAVECTVKFLN 106

Query: 849  ---------------------KIN--------GLGSELLWVWKDDVEATVWDDGVHKSVM 757
                                 KIN        G  SE LW W+D++EA +WDD    SV+
Sbjct: 107  SEGTGGDKGKYFDAVRRIWKRKINLTEKIENVGFVSEELWNWRDEIEAALWDDKCSYSVI 166

Query: 756  KKFRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAE 577
             K +   A E+V+ +VRE  E  G  FL++VAE  +    +  + G      + R +  E
Sbjct: 167  MKSKAVVAVESVKFFVREAKERTGSPFLDVVAEAYQSDETMKTLFGGLNKEGARRENNRE 226

Query: 576  MKKEEVMLGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPS--KRSYNLPGIAEVDKAG 403
            + K     G  L   +H          GV++ DS E   + S       LP  AE+ KA 
Sbjct: 227  VSK-----GTALRRKKH--VAFKRTREGVRINDSIELELKASGGGGQDGLPSSAEIQKAE 279

Query: 402  EALKSSIMEPRAAVTDPLPEELQQA 328
            +ALK S +E RA V DPLPE L+ A
Sbjct: 280  KALKLSSLELRAMVKDPLPEALRYA 304


>gb|EPS64490.1| hypothetical protein M569_10294 [Genlisea aurea]
          Length = 513

 Score =  110 bits (276), Expect = 7e-22
 Identities = 101/322 (31%), Positives = 143/322 (44%), Gaps = 88/322 (27%)
 Frame = -3

Query: 1014 LLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDC---MLKI 844
            ++L+ L SD  R S+SE +LE  EQL +I+  RG  +LS A+KRAYCAVAV C   +LKI
Sbjct: 44   VVLKMLRSDVRRKSVSECSLEYFEQLAKIELDRGVETLSDAVKRAYCAVAVHCTLAVLKI 103

Query: 843  N---------------------------------GLGSELLWVWKDDVEATVWDDGVHKS 763
                                              GLGSE LW WK ++EA++ DD V K 
Sbjct: 104  GEGEHSDPVYRFFDAVRRIWRGKIGCAEAMKGKCGLGSEELWAWKAEIEASLLDDNVRKR 163

Query: 762  VMKKFRGCQAHEAVRDYVREEIESIGPTFLELVA-------------------ERVKGVG 640
            ++K      A EA+  Y+ EE E++GP+FLEL A                   E+V   G
Sbjct: 164  ILKNAEAIDAAEAIEAYLVEEEETMGPSFLELPALQIEDMQKTVGRSSCLNRPEKVISHG 223

Query: 639  GIYPVDGFQASPPSPRVDP---------AEMKKEEVMLGDEL-------------DGVQH 526
                 D F       + +          AE +++E    +E+             + V++
Sbjct: 224  AHNESDVFDHMSGKGKENSTSDHCMDQVAEEEEDEKGNSNEIAAQKGEIADKPMENQVEN 283

Query: 525  SKTMGSANHR----------GVKLEDSNETTAE-PSKRSYNLPGIAEVDKAGEALKSSIM 379
            S    S   R          G KL + +E  A+ PS  +  LP  +EV++  + LKSS  
Sbjct: 284  SSPESSRQTRRRRRRKVHPRGAKLVNPDEKQAKPPSPGNDALPRSSEVEEVRKLLKSSTS 343

Query: 378  EPRAAVTDPLPEELQQAG*DNN 313
            +  A V DPLPE L+ A   NN
Sbjct: 344  DLHALVEDPLPEALRIAEEANN 365


>gb|EYU43630.1| hypothetical protein MIMGU_mgv1a024583mg [Mimulus guttatus]
          Length = 456

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 81/256 (31%), Positives = 120/256 (46%), Gaps = 29/256 (11%)
 Frame = -3

Query: 1014 LLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVD-------C 856
            +LL++LE  A+R  +S STL L EQL+E+DSR+G++ +S  MK AYCAVAV+       C
Sbjct: 47   ILLKQLEYKAARFPVSRSTLVLFEQLEELDSRQGNMKVSDGMKHAYCAVAVNWTMEGAIC 106

Query: 855  MLKI--------------------NG-LGSELLWVWKDDVEATVWDDGVHKSVMKKFRGC 739
             ++                     NG L SE L  W  D++A++W+  + KS+MKK  G 
Sbjct: 107  QIEFFHAVTGLMERIEMMEKHVERNGALCSEELSEWIKDIKASMWNAHLFKSIMKKVEGL 166

Query: 738  QAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEV 559
               +AVR +V+ E E +GP  LELV                            ++K +E 
Sbjct: 167  DVLDAVRVFVKGEREKMGPPLLELV---------------------------KKLKDDEY 199

Query: 558  MLGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPS-KRSYNLPGIAEVDKAGEALKSSI 382
            ++ + L G            + V  +  +E    PS + S   P   EV     AL  S+
Sbjct: 200  VM-EFLRG----------KAKIVDPDSGDENAMGPSYETSCKSPMSEEVRIVRRALHKSL 248

Query: 381  MEPRAAVTDPLPEELQ 334
            ++  A V DPLPE L+
Sbjct: 249  LDLEAVVKDPLPEALK 264


>gb|EXC19537.1| hypothetical protein L484_010668 [Morus notabilis]
          Length = 511

 Score = 94.0 bits (232), Expect = 9e-17
 Identities = 87/337 (25%), Positives = 134/337 (39%), Gaps = 107/337 (31%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSR--AMKRAYCAVAVDCMLKI 844
            ++LLR +E + S G ++E+ L  LE ++E+DSRRG  + +   +MK AYCAVA+DC +++
Sbjct: 45   TVLLRTIEYEVSEGLVTETALGNLELIEELDSRRGSAAAAAGDSMKAAYCAVALDCTVRV 104

Query: 843  ---NG---------------------------------LGSELLWVWKDDVEATVWDDGV 772
               NG                                 L S+L   W D+VEA +WD+GV
Sbjct: 105  LVGNGGKPGGKFLNAVKRIWRGRVGRMEKSAAARESRLLSSDLRRCW-DEVEAAIWDEGV 163

Query: 771  HKSVMKKFRGCQAHEAVRDYVREEIESIGPTFL--------------------------- 673
             + +M+      A   +  Y++E   S+GP+F+                           
Sbjct: 164  SRKLMRINTRIDALMLLGAYLKEAWASMGPSFVAWAARLSAKRRLREDGNGEESRGRGLS 223

Query: 672  ----------------------ELVAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEV 559
                                  + V E V G G + P  G       P      +     
Sbjct: 224  LNGLIELRAELRDEVGLRELANDRVRENVVGGGVVLPDSGSLIVIDEPETTAKAVDDGVT 283

Query: 558  MLGDELD------------GVQHSKTMGSANHRG--------VKLEDSNETTAEPSKRSY 439
            +LG   D             V   K +  + H G        V++ D  +   +PS   +
Sbjct: 284  LLGSHTDLAITNGPATTDKEVPREKVVLRSKHVGFQKRIRGPVRISDVEDLETDPSPHRF 343

Query: 438  NLPGIAEVDKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            N     EV+KA EALKSS +E +A VTDP PE +++A
Sbjct: 344  NNIPTPEVNKAHEALKSSSLELQAVVTDPFPEAVREA 380


>ref|XP_007146880.1| hypothetical protein PHAVU_006G078200g [Phaseolus vulgaris]
            gi|561020103|gb|ESW18874.1| hypothetical protein
            PHAVU_006G078200g [Phaseolus vulgaris]
          Length = 473

 Score = 87.8 bits (216), Expect = 6e-15
 Identities = 87/296 (29%), Positives = 130/296 (43%), Gaps = 68/296 (22%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI-- 844
            +LLLR L +   + S+SE+ L++LE L+++D      S S   +RAY AVAV+C +K   
Sbjct: 42   TLLLRILRTLLLKASLSETALQILELLEDLDG----ASASAVRRRAYLAVAVECTVKYLA 97

Query: 843  -----------------------------NGLGSELLWVWKDDVEATVWDDGVHKSVMKK 751
                                         +GL S  L  W+DD+EA + D    + +   
Sbjct: 98   AAPDDADGEFSGAVNRIWRGRVAALEARRSGLVSGELARWRDDLEAALGDSRACERLADL 157

Query: 750  FRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGV---GGIYPVDGFQASPPSPRVDPA 580
                +A   VR Y++E  ES+GP+FLE VA   KG+      + V G +        + A
Sbjct: 158  NSRREAMNEVRAYLKEAWESMGPSFLESVAAMSKGLTKEKDDFVVSGNKRDNDHENDNDA 217

Query: 579  EMKKEEVMLGDELDG---------------------------------VQHSKTMGSANH 499
             M  E+V + DE  G                                 V+H  +   A H
Sbjct: 218  CM--EDVAMHDENQGKQQLEEKIDANQEVGGRDLLLESDKVIQKQNLRVKHKHSALRACH 275

Query: 498  RGVKLEDSNET-TAEPSKRSYNLPGIAEVDKAGEALKSSIMEPRAAVTDPLPEELQ 334
            RGVK+  S E  +A+   +  ++P  +E+ K  E+LKSS  E RA V DPLP+ L+
Sbjct: 276  RGVKISGSEEVESAKSWSKHDSVP--SEIRKVRESLKSSTCELRALVNDPLPDALR 329


>ref|XP_004500362.1| PREDICTED: uncharacterized protein LOC101503526 [Cicer arietinum]
          Length = 504

 Score = 82.0 bits (201), Expect = 4e-13
 Identities = 89/319 (27%), Positives = 130/319 (40%), Gaps = 92/319 (28%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI-- 844
            +LLLR L++   + S+SE +L +LEQL+E+      V +  AM+ AYCAVAVD  +K   
Sbjct: 42   TLLLRILQTQHLKASLSEMSLHILEQLEELHCNDA-VPVPDAMRSAYCAVAVDSTVKYLI 100

Query: 843  -------------------------------NGLGSELLWVWKDDVEATVWDDGVHKSVM 757
                                           +G+ S+ L  W +D+EA +WD  V + ++
Sbjct: 101  SSPEDPSGEYFSAVRRIWRGRVVKLSALERRSGIFSDELIQWAEDIEAALWDVRVSERLV 160

Query: 756  KKFRGCQAHEAVRDYVREEIESIGPTFLELVA-------ERVKGV-----GG-------- 637
                   A   V+ Y++E    +GP+FL+ +A        R +GV     GG        
Sbjct: 161  GLNTRRDAMNEVKRYLKEAWGLMGPSFLDSMASFSKAKDSRPEGVCEIASGGEMLRKRDD 220

Query: 636  ------IYPVDGFQASPPSPRVDPAEMKKEEVM--LGDE------------------LDG 535
                   Y  DG   +     VD ++    EV   LGDE                   D 
Sbjct: 221  LGKEKMYYVGDGDDKNDNDNVVDDSDDDVAEVSENLGDEQLEERVGTSVDPNQEVGGCDS 280

Query: 534  VQHSKTMGSAN-------------HRGVKLEDSNETTAEPSKRSYNLPGIAEVDKAGEAL 394
            ++  K +   N             HRGVK+  + E         Y+    AEV K  E+L
Sbjct: 281  LKGDKEIRKGNLQPKRKYSSLRTCHRGVKISGAEEVRPTNLLCKYDSLPSAEVKKVRESL 340

Query: 393  KSSIMEPRAAVTDPLPEEL 337
            KSS ME +A V DPLP+ L
Sbjct: 341  KSSSMELKALVKDPLPDAL 359


>ref|XP_007026325.1| Telomeric repeat binding-like protein isoform 3 [Theobroma cacao]
            gi|508781691|gb|EOY28947.1| Telomeric repeat binding-like
            protein isoform 3 [Theobroma cacao]
          Length = 496

 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 71/256 (27%), Positives = 110/256 (42%), Gaps = 28/256 (10%)
 Frame = -3

Query: 1011 LLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI---- 844
            ++R ++   + GS+ E  L+ L  ++++D ++G + +  +MK A+CAVA+ C L      
Sbjct: 46   IVRSIQEGIANGSVPEEILDSLLLIEQVDRKQG-LPVFDSMKAAFCAVALHCTLASLTRS 104

Query: 843  --------------------NGLGSEL----LWVWKDDVEATVWDDGVHKSVMKKFRGCQ 736
                                +   SEL    L  W+ +VEA +WD    +++++      
Sbjct: 105  WPCYYDAVQRIWRSRIRILEDSASSELVGPDLDKWRKEVEAALWDSEASQTLLRINTRGD 164

Query: 735  AHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEVM 556
            A   +R YV E   S+ P FL L        G   P  G        RV+P    K    
Sbjct: 165  ALRCLRVYVDEARSSMKPAFLRLTLAASPASGR--PHCGTDKGKGILRVNPQRRCKR--- 219

Query: 555  LGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPSKRSYNLPGIAEVDKAGEALKSSIME 376
                  GV H    G      V++ DS E    PS   +     +EV+K  EALK+S  +
Sbjct: 220  ------GVPHRHYKGP-----VEIADSEEE--RPSYSKFGSLSTSEVNKVQEALKTSTAD 266

Query: 375  PRAAVTDPLPEELQQA 328
              A VTDPLP+ L+ A
Sbjct: 267  LLAVVTDPLPKVLEVA 282


>ref|XP_007026323.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590626982|ref|XP_007026324.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508781689|gb|EOY28945.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508781690|gb|EOY28946.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 507

 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 71/256 (27%), Positives = 110/256 (42%), Gaps = 28/256 (10%)
 Frame = -3

Query: 1011 LLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI---- 844
            ++R ++   + GS+ E  L+ L  ++++D ++G + +  +MK A+CAVA+ C L      
Sbjct: 46   IVRSIQEGIANGSVPEEILDSLLLIEQVDRKQG-LPVFDSMKAAFCAVALHCTLASLTRS 104

Query: 843  --------------------NGLGSEL----LWVWKDDVEATVWDDGVHKSVMKKFRGCQ 736
                                +   SEL    L  W+ +VEA +WD    +++++      
Sbjct: 105  WPCYYDAVQRIWRSRIRILEDSASSELVGPDLDKWRKEVEAALWDSEASQTLLRINTRGD 164

Query: 735  AHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEVM 556
            A   +R YV E   S+ P FL L        G   P  G        RV+P    K    
Sbjct: 165  ALRCLRVYVDEARSSMKPAFLRLTLAASPASGR--PHCGTDKGKGILRVNPQRRCKR--- 219

Query: 555  LGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPSKRSYNLPGIAEVDKAGEALKSSIME 376
                  GV H    G      V++ DS E    PS   +     +EV+K  EALK+S  +
Sbjct: 220  ------GVPHRHYKGP-----VEIADSEEE--RPSYSKFGSLSTSEVNKVQEALKTSTAD 266

Query: 375  PRAAVTDPLPEELQQA 328
              A VTDPLP+ L+ A
Sbjct: 267  LLAVVTDPLPKVLEVA 282


>ref|XP_002518779.1| hypothetical protein RCOM_0813700 [Ricinus communis]
            gi|223542160|gb|EEF43704.1| hypothetical protein
            RCOM_0813700 [Ricinus communis]
          Length = 478

 Score = 75.5 bits (184), Expect = 3e-11
 Identities = 71/271 (26%), Positives = 111/271 (40%), Gaps = 44/271 (16%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISEST-LELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI- 844
            +LLLR +  + + GS S  T L+ LE ++E+D+      ++ +MK AY AVAVDC LK  
Sbjct: 45   ALLLRSIHDEIANGSASSETILQSLETIEELDNTH----IADSMKLAYQAVAVDCTLKWV 100

Query: 843  -----------------------------NGLGSELLWVWKDDVEATVWDDGVHKSVMKK 751
                                         + L ++ L   K ++EA +WD    K ++ +
Sbjct: 101  IEKHDKGQFFKAVKRIWRGRIERLESLKKSELVTDELKEIKQEIEAALWDSNARKRLLDR 160

Query: 750  FRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGFQASPPSPRVDPAEMK 571
                ++   V+DY+ E ++  GP+FLEL A+                          E +
Sbjct: 161  NIKSESLRLVKDYLNEALDKRGPSFLELAAK-------------------------VETE 195

Query: 570  KEEVMLGDELDGVQHSKTMGSANHRGVK-------------LEDSNETTAEPSKRSYNLP 430
             +E   G    G++ +   G A    +K             ++D+           YN  
Sbjct: 196  MKEKQKGAVQVGLESAVVDGPAGRESLKNDMPDTFCQRPGGIKDTERAHMNILSFKYNPL 255

Query: 429  GIAEVDKAGEALKSSIMEPRAAVTDPLPEEL 337
               EV K  EALKSS +E +A V DPLP  L
Sbjct: 256  LTPEVTKVKEALKSSSLELKALVEDPLPNAL 286


>ref|XP_002319963.2| hypothetical protein POPTR_0013s15060g [Populus trichocarpa]
            gi|550325885|gb|EEE95886.2| hypothetical protein
            POPTR_0013s15060g [Populus trichocarpa]
          Length = 599

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 51/162 (31%), Positives = 83/162 (51%), Gaps = 34/162 (20%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLK--- 847
            +LLLR++++D   GS+SE TL+ +E +++ID   GD+ +  +MK AYCAVAV+C +K   
Sbjct: 57   TLLLRQIDADIEDGSVSEKTLDAIEMVEQIDRNEGDLIMD-SMKNAYCAVAVECTVKYML 115

Query: 846  ----------------------INGLGSE--------LLWVWKDDVEATVWDDGVHKSVM 757
                                  + GL  E         L  + +++E  + DD V K  +
Sbjct: 116  GNLQRARKGKFLEAVERVWKNRVAGLKREGKSELVTGKLMKYFEEMEVALKDDVVAKRWL 175

Query: 756  KKFRGCQAHEAVRDYVREEIESIGPTFLELVAE-RVKGVGGI 634
            +     +A E VR Y+ E +   GP F+E+VA   ++G GG+
Sbjct: 176  RMNTRNEAAEMVRIYLGEAVAVSGPVFVEMVARMEMRGDGGL 217


>ref|XP_006376552.1| hypothetical protein POPTR_0013s15060g [Populus trichocarpa]
            gi|550325884|gb|ERP54349.1| hypothetical protein
            POPTR_0013s15060g [Populus trichocarpa]
          Length = 687

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 51/162 (31%), Positives = 83/162 (51%), Gaps = 34/162 (20%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLK--- 847
            +LLLR++++D   GS+SE TL+ +E +++ID   GD+ +  +MK AYCAVAV+C +K   
Sbjct: 57   TLLLRQIDADIEDGSVSEKTLDAIEMVEQIDRNEGDLIMD-SMKNAYCAVAVECTVKYML 115

Query: 846  ----------------------INGLGSE--------LLWVWKDDVEATVWDDGVHKSVM 757
                                  + GL  E         L  + +++E  + DD V K  +
Sbjct: 116  GNLQRARKGKFLEAVERVWKNRVAGLKREGKSELVTGKLMKYFEEMEVALKDDVVAKRWL 175

Query: 756  KKFRGCQAHEAVRDYVREEIESIGPTFLELVAE-RVKGVGGI 634
            +     +A E VR Y+ E +   GP F+E+VA   ++G GG+
Sbjct: 176  RMNTRNEAAEMVRIYLGEAVAVSGPVFVEMVARMEMRGDGGL 217


>ref|XP_006486662.1| PREDICTED: uncharacterized protein LOC102608364 [Citrus sinensis]
          Length = 396

 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 64/233 (27%), Positives = 109/233 (46%), Gaps = 3/233 (1%)
 Frame = -3

Query: 1017 SLLLRELESDASR---GSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLK 847
            +LLLR ++S  S     S+S++ LE L+ ++++D + G ++++R+M+ A           
Sbjct: 43   TLLLRSIQSQLSSDGDASLSKTILENLKAVRDLDEKEG-IAITRSMEAAI---------- 91

Query: 846  INGLGSELLWVWKDDVEATVWDDGVHKSVMKKFRGCQAHEAVRDYVREEIESIGPTFLEL 667
                        +D  E T  DD +H             + V+ Y+ E   S+GPTFLEL
Sbjct: 92   ------------RDAAENTQNDDALH-------------QVVKTYLEEAWASMGPTFLEL 126

Query: 666  VAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEVMLGDELDGVQHSKTMGSANHRGVK 487
             A   +G   +   D  +      R +  + K++ V          H +  G      V+
Sbjct: 127  AAAGGRGRAHVETED--KEKGKGIRKENVQPKRKHV--------ASHRRARGP-----VR 171

Query: 486  LEDSNETTAEPSKRSYNLPGIAEVDKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            + D+ + +++     Y+     EV+K  EALKSS +E +A VTDPLP+ L+QA
Sbjct: 172  IIDNEDLSSDEPCSQYDTLPTPEVNKVQEALKSSSLELQAIVTDPLPDALRQA 224


>ref|XP_006422498.1| hypothetical protein CICLE_v10028601mg [Citrus clementina]
            gi|557524432|gb|ESR35738.1| hypothetical protein
            CICLE_v10028601mg [Citrus clementina]
          Length = 396

 Score = 70.9 bits (172), Expect = 8e-10
 Identities = 64/233 (27%), Positives = 108/233 (46%), Gaps = 3/233 (1%)
 Frame = -3

Query: 1017 SLLLRELESDASR---GSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLK 847
            +LLLR ++S  S     S+S++ LE L+ ++++D + G ++++R+M+ A           
Sbjct: 43   TLLLRSIQSQLSSDGDASLSKTILENLKAVRDLDEKEG-IAITRSMEAAI---------- 91

Query: 846  INGLGSELLWVWKDDVEATVWDDGVHKSVMKKFRGCQAHEAVRDYVREEIESIGPTFLEL 667
                        +D  E T  DD +              + V+ Y+ E   S+GPTFLEL
Sbjct: 92   ------------RDAAENTQNDDALR-------------QVVKTYLEEAWASMGPTFLEL 126

Query: 666  VAERVKGVGGIYPVDGFQASPPSPRVDPAEMKKEEVMLGDELDGVQHSKTMGSANHRGVK 487
             A   +G   +   D  +      R +  + K++ V          H +  G      V+
Sbjct: 127  AAAGGRGRAHVETED--KEKGKGIRKENVQPKRKHV--------ASHRRARGP-----VR 171

Query: 486  LEDSNETTAEPSKRSYNLPGIAEVDKAGEALKSSIMEPRAAVTDPLPEELQQA 328
            + DS + +++     Y+     EV+K  EALKSS +E +A VTDPLP+ L+QA
Sbjct: 172  IIDSEDLSSDEPCSQYDTLPTPEVNKVQEALKSSSLELQAIVTDPLPDALRQA 224


>gb|EXC19536.1| hypothetical protein L484_010667 [Morus notabilis]
          Length = 587

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 48/157 (30%), Positives = 78/157 (49%), Gaps = 38/157 (24%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSR--AMKRAYCAVAVDCMLKI 844
            ++LLR +E + S G ++ + LE LE ++E+DSRRG  + +   +MK AYCAVA+DC +++
Sbjct: 45   TVLLRTIEYEVSEGLVTSTALENLELIEELDSRRGSAAAAAGDSMKAAYCAVALDCTVRV 104

Query: 843  ---NG---------------------------------LGSELLWVWKDDVEATVWDDGV 772
               NG                                 L S+L   W D+VEA +WD+GV
Sbjct: 105  LVGNGGKPGGKFLNAVKRIWRGRVGRMEKSAAARESRLLSSDLRRCW-DEVEAAIWDEGV 163

Query: 771  HKSVMKKFRGCQAHEAVRDYVREEIESIGPTFLELVA 661
             + +M+      A   +  Y++E    +GP+F+   A
Sbjct: 164  CRKLMRINTRKDALMLLGAYLKEAWALMGPSFVAWAA 200


>ref|XP_002519314.1| telomeric repeat binding protein, putative [Ricinus communis]
            gi|223541629|gb|EEF43178.1| telomeric repeat binding
            protein, putative [Ricinus communis]
          Length = 637

 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 58/215 (26%), Positives = 98/215 (45%), Gaps = 36/215 (16%)
 Frame = -3

Query: 1017 SLLLRELESDASRGSISESTLELLEQLKEIDSRRGDVSLSRAMKRAYCAVAVDCMLKI-- 844
            +LLLR ++S  S GS+SE+ L+ LE ++E+D R   + ++ +MK AYCAVAV+C +K   
Sbjct: 47   TLLLRSIDSQISDGSVSETILDSLEAIEELD-RENHIIITDSMKAAYCAVAVECTVKYLW 105

Query: 843  --------NGLGSELL-WVWKDDV-----------------------EATVWDDGVHKSV 760
                     G   E +  +W+D +                       EA + D   +KS+
Sbjct: 106  GNQHKSRSQGKYVEAVKRIWRDRIQNLEMAKKSDLVTDELRKSRQKMEAVLLDSHRYKSL 165

Query: 759  MKKFRGCQAHEAVRDYVREEIESIGPTFLELVA--ERVKGVGGIYPVDGFQASPPSPRVD 586
             +      A     DY+ E +  +GP+FLELVA  ER      +      + +      +
Sbjct: 166  KELNTRNVALLLTGDYIHEAMALMGPSFLELVARTEREAKEKEVRVQKENKENEFMAEGE 225

Query: 585  PAEMKKEEVMLGDELDGVQHSKTMGSANHRGVKLE 481
             A+ +KEE+ +G E +  +       A  + ++ E
Sbjct: 226  MAQREKEEMEVGAERENKEKELRAEMAKEKDLRAE 260


>ref|XP_007026326.1| Uncharacterized protein isoform 4 [Theobroma cacao]
           gi|508781692|gb|EOY28948.1| Uncharacterized protein
           isoform 4 [Theobroma cacao]
          Length = 352

 Score = 61.2 bits (147), Expect = 6e-07
 Identities = 63/216 (29%), Positives = 86/216 (39%), Gaps = 28/216 (12%)
 Frame = -3

Query: 891 MKRAYCAVAVDCMLKI------------------------NGLGSEL----LWVWKDDVE 796
           MK A+CAVA+ C L                          +   SEL    L  W+ +VE
Sbjct: 1   MKAAFCAVALHCTLASLTRSWPCYYDAVQRIWRSRIRILEDSASSELVGPDLDKWRKEVE 60

Query: 795 ATVWDDGVHKSVMKKFRGCQAHEAVRDYVREEIESIGPTFLELVAERVKGVGGIYPVDGF 616
           A +WD    +++++      A   +R YV E   S+ P FL L        G   P  G 
Sbjct: 61  AALWDSEASQTLLRINTRGDALRCLRVYVDEARSSMKPAFLRLTLAASPASGR--PHCGT 118

Query: 615 QASPPSPRVDPAEMKKEEVMLGDELDGVQHSKTMGSANHRGVKLEDSNETTAEPSKRSYN 436
                  RV+P    K          GV H    G      V++ DS E    PS   + 
Sbjct: 119 DKGKGILRVNPQRRCKR---------GVPHRHYKGP-----VEIADSEEE--RPSYSKFG 162

Query: 435 LPGIAEVDKAGEALKSSIMEPRAAVTDPLPEELQQA 328
               +EV+K  EALK+S  +  A VTDPLP+ L+ A
Sbjct: 163 SLSTSEVNKVQEALKTSTADLLAVVTDPLPKVLEVA 198

