BLASTX nr result

ID: Mentha23_contig00008048 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00008048
         (728 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36739.1| hypothetical protein MIMGU_mgv1a007033mg [Mimulus...   280   5e-73
gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlise...   211   2e-52
ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596...   145   1e-32
ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297...   133   5e-29
ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254...   132   9e-29
ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854...   127   5e-27
ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Popu...   118   2e-24
gb|EXB68682.1| putative Golgi transport protein 1 [Morus notabilis]   117   5e-24
ref|XP_006443108.1| hypothetical protein CICLE_v10020134mg [Citr...   113   6e-23
ref|XP_002529769.1| conserved hypothetical protein [Ricinus comm...   112   2e-22
ref|XP_007026508.1| Uncharacterized protein isoform 2, partial [...   108   1e-21
ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma...   108   1e-21
ref|XP_006858644.1| hypothetical protein AMTR_s00066p00051680 [A...   108   2e-21
ref|XP_006294241.1| hypothetical protein CARUB_v10023240mg [Caps...   108   2e-21
ref|XP_006397488.1| hypothetical protein EUTSA_v10001453mg [Eutr...   107   3e-21
ref|XP_007224230.1| hypothetical protein PRUPE_ppa018099mg, part...   106   9e-21
ref|NP_001078044.1| uncharacterized protein [Arabidopsis thalian...   103   8e-20
ref|XP_006606588.1| PREDICTED: uncharacterized protein LOC100814...   102   1e-19
ref|XP_003556568.1| PREDICTED: uncharacterized protein LOC100814...   102   1e-19
ref|XP_007142633.1| hypothetical protein PHAVU_007G004000g [Phas...   102   1e-19

>gb|EYU36739.1| hypothetical protein MIMGU_mgv1a007033mg [Mimulus guttatus]
          Length = 422

 Score =  280 bits (715), Expect = 5e-73
 Identities = 151/245 (61%), Positives = 172/245 (70%), Gaps = 3/245 (1%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQ--NLETKLNQFLDI 175
           RRHHRRRLLKYSP  ++TP    P IFKLSDD LQITL+ PSTSLQ   LETKLNQ +  
Sbjct: 27  RRHHRRRLLKYSPTPANTPI-FAPTIFKLSDDGLQITLRRPSTSLQVQQLETKLNQLIGR 85

Query: 176 GREALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAMRKNGGGE 355
           GREA DDLRTVVAVD + G  VISCRRS+VE             IAFR LF       GE
Sbjct: 86  GREAFDDLRTVVAVDETNGGFVISCRRSSVEFLAALFFSSLVVVIAFRGLFKQISKNSGE 145

Query: 356 VLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRM-IKSSRRKEEL 532
           VLVYKRDRSLGGK V VGK+E              ++++YYY+KK +R  I    RKEEL
Sbjct: 146 VLVYKRDRSLGGKEVVVGKKETNLPTRRKPTPLSSNDADYYYEKKINRTKILGKSRKEEL 205

Query: 533 PQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICKTYGVR 712
           PQWWPQ V+ G  E+ENKEEYQR+AN LI AI+DRKM G+DIS ND+VQLRH+CKTYGV+
Sbjct: 206 PQWWPQAVNLGSPEIENKEEYQRMANQLIGAIVDRKMAGEDISANDIVQLRHLCKTYGVK 265

Query: 713 ASIST 727
            SIST
Sbjct: 266 TSIST 270


>gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlisea aurea]
          Length = 400

 Score =  211 bits (538), Expect = 2e-52
 Identities = 121/247 (48%), Positives = 154/247 (62%), Gaps = 5/247 (2%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPS---QHQP--AIFKLSDDTLQITLKSPSTSLQNLETKLNQF 166
           RRHHRRRLLKYSPN +   S   +  P   I KLSD+ LQITL SPS SL+ +E+KLNQ 
Sbjct: 5   RRHHRRRLLKYSPNRNPETSPLIRSTPPITILKLSDNGLQITLSSPSNSLEKVESKLNQI 64

Query: 167 LDIGREALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAMRKNG 346
           ++ GREA  DLRT+V  D   GRV ISCRRSTVE             +  R++F +RKN 
Sbjct: 65  IECGREAFFDLRTLVTFDEDYGRVSISCRRSTVEFFIGLFISGFLVVLIIRNVFKLRKN- 123

Query: 347 GGEVLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKSSRRKE 526
           G + LVY+RDRSLGG+ V VG                  +   Y+QKK+  +I+   RKE
Sbjct: 124 GRQALVYRRDRSLGGREVLVGTGHSNWSSKLTSNPLDSVSISDYHQKKRG-IIQGMSRKE 182

Query: 527 ELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICKTYG 706
           +LPQWWPQ       E  N E YQR+AN L+Q I+DR++ G+DIS +D+VQLR++CK + 
Sbjct: 183 KLPQWWPQ-FHDSSGEAPNTEGYQRIANQLVQGIVDRRVSGEDISMDDIVQLRYLCKAHR 241

Query: 707 VRASIST 727
           V  SIST
Sbjct: 242 VNVSIST 248


>ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596187 [Solanum tuberosum]
          Length = 455

 Score =  145 bits (366), Expect = 1e-32
 Identities = 99/255 (38%), Positives = 129/255 (50%), Gaps = 13/255 (5%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGR 181
           RRH RRRL K+S     TP   Q   F L+ D L    KS  +    L  KL +FL  GR
Sbjct: 40  RRHLRRRLKKFSTE--DTPPSDQNLHFVLTVDNLPT--KSFYSIKDLLHLKLGEFLHSGR 95

Query: 182 EALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLF----AMRKNGG 349
            A++DLRT++ VD   GR+  SC RSTV+                R++      +R N G
Sbjct: 96  AAIEDLRTLIRVDTDAGRLSFSCTRSTVKFLATLVVSSFLLIFTLRAIVNLVRGIRLNSG 155

Query: 350 GE--VLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKSSRRK 523
                LVYKRDRSLGG+ V V K E              D     +   +D  I  SRR+
Sbjct: 156 NNNVELVYKRDRSLGGREVLVAKNETPTLDRKKPNVLDSDEGNSNWDWDRDSPISFSRRR 215

Query: 524 ------EELPQWWPQVVS-QGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQL 682
                 E+LP+WWP   S    V  EN+EEYQR+AN LI+AI+D +M G+DI  +D++QL
Sbjct: 216 KKKSSVEQLPKWWPVSTSGSDQVGAENQEEYQRMANRLIRAILDNRMTGKDILADDIIQL 275

Query: 683 RHICKTYGVRASIST 727
           R I +   V+ S  T
Sbjct: 276 RRIGRISNVKVSFDT 290


>ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297340 [Fragaria vesca
           subsp. vesca]
          Length = 430

 Score =  133 bits (335), Expect = 5e-29
 Identities = 89/250 (35%), Positives = 131/250 (52%), Gaps = 10/250 (4%)
 Frame = +2

Query: 8   HHRRRLLKYSPNHSSTPSQHQPAIFKLSD-DTLQITLKSPSTSLQNLETKLNQFLDIGRE 184
           +  RR  + +PN  +T    +PA +  SD + LQ T    +T   +  + L  FL    +
Sbjct: 34  NRNRRNRRRNPNTPTTVPTSKPAFYTSSDPENLQATFDL-NTLYYSSHSYLRYFLSSASD 92

Query: 185 ALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAMRKNG------ 346
           A++DL+T+V+VD    R+V+SCR ST+              + FR L  + + G      
Sbjct: 93  AVEDLQTLVSVDADR-RIVVSCRPSTLRFVGNFAVATCAVVLGFRVLVGLVRLGFGSGSG 151

Query: 347 -GGEVLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKSSRRK 523
            G E +V +RDRSLGGK V V + E                 E    KK++ + K +R +
Sbjct: 152 YGREKVVTRRDRSLGGKEVVVARVERPRA------------EEVSVTKKRESVFKKNRVR 199

Query: 524 --EELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICK 697
             E+LPQWWP   SQ  + V+N EE+QR AN L++AI D +M G+DI  +D++ LR IC+
Sbjct: 200 FGEKLPQWWPTTTSQPILGVDN-EEHQREANRLVRAITDNRMSGKDIMEDDIIHLRQICR 258

Query: 698 TYGVRASIST 727
            YGVR S  T
Sbjct: 259 VYGVRVSFDT 268


>ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254735 [Solanum
           lycopersicum]
          Length = 458

 Score =  132 bits (333), Expect = 9e-29
 Identities = 96/259 (37%), Positives = 129/259 (49%), Gaps = 17/259 (6%)
 Frame = +2

Query: 2   RRHHRRR----LLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFL 169
           RRH RRR    L K+SP    TP   Q   F L+ D L    KS  +    +  KL +FL
Sbjct: 41  RRHLRRRRFPFLKKFSPE--DTPPSDQNLHFVLTVDNLPT--KSFYSIKDLIHLKLREFL 96

Query: 170 DIGREALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAMRKN-- 343
             GR A++DL+T++ +D   GRV  SC RSTV+                R++  + +   
Sbjct: 97  HSGRAAIEDLQTLIRIDTDAGRVSFSCTRSTVKFLATLLVSTFLLIFTLRAILNLVRRIP 156

Query: 344 ---GGGEV-LVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKS 511
              G   V LVYKRDRSLGG+ V V K E              D     +    D  I  
Sbjct: 157 LNTGNNNVELVYKRDRSLGGREVLVAKNETPTLDRKKPNVLDRDEGNSNWDL--DTPISF 214

Query: 512 SRRK------EELPQWWPQVVS-QGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRND 670
           SRR+      E+LP+WWP   S    V  EN+EEYQR+A+ LI+AI+D +M G+DI  +D
Sbjct: 215 SRRRKKKSSVEQLPKWWPVSTSGSDQVGTENQEEYQRMADRLIRAILDNRMTGKDILADD 274

Query: 671 VVQLRHICKTYGVRASIST 727
           ++QLR I +   V+ S  T
Sbjct: 275 IIQLRRIGRISNVKVSFDT 293


>ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854590 [Vitis vinifera]
          Length = 436

 Score =  127 bits (318), Expect = 5e-27
 Identities = 93/252 (36%), Positives = 127/252 (50%), Gaps = 10/252 (3%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGR 181
           RR+  ++   Y P+H++ PS   P +  + D      L   S   Q L   LN+ +  G 
Sbjct: 31  RRNALKKPHHYHPHHNNKPSP-DPKLHMVVD------LHRLSDRAQIL---LNRLVSSGA 80

Query: 182 EALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSL----FAMRKN-- 343
           +A+DDLRT+VAVD +T  VVI+CR ST+                FR L      +R+   
Sbjct: 81  DAIDDLRTLVAVDRATQSVVIACRPSTLRFVGGFVVWSLVVVFGFRVLVRLGLRLRREFG 140

Query: 344 -GGGEVLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQ---KKKDRMIKS 511
            G G  +V +RDRSLGGK V VG+ E                            D     
Sbjct: 141 FGSGRGVVVRRDRSLGGKEVVVGRAEESEWRMRNHSRVLGSPLSVVPGIGVNGGDWSPGR 200

Query: 512 SRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHI 691
           SR ++ LP+WWP V    P+EV +K+EYQR AN LI+ IM  +M G+DI  +D++QLR I
Sbjct: 201 SRTEKRLPKWWP-VTLPPPLEVFDKQEYQREANRLIREIMANRMSGKDILEDDMIQLRRI 259

Query: 692 CKTYGVRASIST 727
           C+T G RASI T
Sbjct: 260 CRTSGARASIDT 271


>ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Populus trichocarpa]
           gi|222852726|gb|EEE90273.1| hypothetical protein
           POPTR_0007s02350g [Populus trichocarpa]
          Length = 447

 Score =  118 bits (296), Expect = 2e-24
 Identities = 88/256 (34%), Positives = 136/256 (53%), Gaps = 14/256 (5%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNL-ETKLNQFLDIG 178
           RR  + + L  +PN  S  S        L++++  + L    T +  L  ++ +QFL +G
Sbjct: 36  RRSRKSKTLTNNPNKPS--SLDSDYYITLNNNSQNLKLVLNITQISKLPSSRFHQFLSLG 93

Query: 179 REALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM-----RKN 343
           +EA+DDL+T+V++D    RVV+SC++ST++              + R LF +     RK 
Sbjct: 94  QEAVDDLKTLVSLD-ENNRVVLSCQKSTLQFAGTVLLSGFLLISSIRVLFKLGLGFKRKF 152

Query: 344 GGGEV--LVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKS 511
           G G+    V +RDRSLGGK  +VAV  ++              + S        +R   +
Sbjct: 153 GAGKNPNFVVRRDRSLGGKEVIVAVDDQQREESKRPKRLANPVEISGLVDGLGFERGDWT 212

Query: 512 SRR---KEELPQWWPQVVS-QGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQ 679
             R   +++LP+WWP   S  G V   ++EEYQR AN LI+AI D +  G+D+  +D++Q
Sbjct: 213 RYRVGSQQKLPKWWPDSGSFSGRVVGPDQEEYQREANRLIRAITDYRTRGKDVMEHDIIQ 272

Query: 680 LRHICKTYGVRASIST 727
           LR IC+T GVRAS ST
Sbjct: 273 LRRICRTSGVRASFST 288


>gb|EXB68682.1| putative Golgi transport protein 1 [Morus notabilis]
          Length = 586

 Score =  117 bits (292), Expect = 5e-24
 Identities = 82/259 (31%), Positives = 130/259 (50%), Gaps = 17/259 (6%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGR 181
           RR+ RRR      +  S+ S   P+    + + + + +     SL +  + L + +    
Sbjct: 23  RRNSRRRKTSSFTSKPSSSSSSNPS--SSNSNYVAVVIDLERLSLSSSNSHLRRLIASAD 80

Query: 182 EALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM-----RKNG 346
           +AL DLRT+VA+D + GR+++SCRRST+              + FR+LF +        G
Sbjct: 81  DALTDLRTLVALDDA-GRLLVSCRRSTLRFVANSLLFSCVVVLGFRALFWLLFKRTHSFG 139

Query: 347 GGEVLVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMI-----KS 511
           GG  +V +RDRSLGGK V V +                 +S     K+   ++     + 
Sbjct: 140 GGGHVVVRRDRSLGGKEVVVARTPPGPSSSTRRAL----SSPLSAAKEGVGLVGGTETRV 195

Query: 512 SRRKEELPQWWPQVV-------SQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRND 670
           S R++ LP+WWP +        S     + +K++YQR A+ LI+AI D +M G+DI  +D
Sbjct: 196 SSREKRLPKWWPSLELDKQNWDSDSSDGIFDKQDYQRDADRLIRAITDNRMSGKDIVADD 255

Query: 671 VVQLRHICKTYGVRASIST 727
           ++QLR IC+T GVR S  T
Sbjct: 256 IIQLRRICRTSGVRVSFDT 274


>ref|XP_006443108.1| hypothetical protein CICLE_v10020134mg [Citrus clementina]
           gi|568850296|ref|XP_006478851.1| PREDICTED:
           uncharacterized protein LOC102619110 [Citrus sinensis]
           gi|557545370|gb|ESR56348.1| hypothetical protein
           CICLE_v10020134mg [Citrus clementina]
          Length = 448

 Score =  113 bits (283), Expect = 6e-23
 Identities = 91/263 (34%), Positives = 129/263 (49%), Gaps = 22/263 (8%)
 Frame = +2

Query: 5   RHHRRRLLKYSPNHSST------PSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQF 166
           R HRR L   + N S+T      PS        L  D  QI++ S S+     ++KL+ F
Sbjct: 24  RRHRRHLRNDNNNSSNTYNPLSKPSSFDGENINLVLDFHQISILSSSS-----KSKLHHF 78

Query: 167 LDIGREALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSL------F 328
           L    +A  DL+TV+ +D + GR+++SCR+ST++               FR L      F
Sbjct: 79  LSSAEQAYADLKTVITLDDN-GRLLVSCRKSTLQFVGGVLLSGFVLVFVFRVLVKLGLGF 137

Query: 329 AMRKNGGGEVLVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXXXDN-------SEYYY 481
           + R     +  V +RDRSLGGK  VVAVG+ +              DN       +    
Sbjct: 138 SSRFRFQKQNFVVRRDRSLGGKEVVVAVGRGDDDARLTRNLKNRVLDNPLSEGRDAGSAL 197

Query: 482 QKKKDRMIKSSRRKE-ELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDI 658
             +  R  +  R  E +LP+WW   VS     V +KE YQR AN LI+AI+D++  GQDI
Sbjct: 198 TGRVKRSYRVQRMSEGKLPKWWSVQVSADRTLVVDKE-YQREANRLIRAIIDQRTHGQDI 256

Query: 659 SRNDVVQLRHICKTYGVRASIST 727
             +D+ +LR IC+  GVR SI T
Sbjct: 257 PEDDIYRLRRICRISGVRVSIDT 279


>ref|XP_002529769.1| conserved hypothetical protein [Ricinus communis]
            gi|223530767|gb|EEF32635.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 898

 Score =  112 bits (279), Expect = 2e-22
 Identities = 85/262 (32%), Positives = 127/262 (48%), Gaps = 24/262 (9%)
 Frame = +2

Query: 14   RRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGREALD 193
            +R LL  +  ++S  +       +L  D  QIT  + ST         N+FL +G++A  
Sbjct: 467  KRSLLSNNNYYNSGNNDFNNDNLQLVLDVNQITYLTSST--------FNRFLSLGKDAYY 518

Query: 194  DLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSL-----------FAMRK 340
            DL+T++++D    R+V +CR+STV+              AFR L           F +R 
Sbjct: 519  DLKTLISLD-ENNRIVFTCRKSTVQFTGGVLLCGVVLVSAFRVLIKLGLGFRSWLFRVRN 577

Query: 341  NGGGEVLVYKRDRSLGGKVVAVGKR------EXXXXXXXXXXXXXXDNSEYYYQKKKDRM 502
            N   + +V +RDRSLGGK V V +R      +              DN  + +    +R 
Sbjct: 578  NRKNKDVVVRRDRSLGGKEVVVARRVEEERPKDVKRKRFGVLDNPLDNPSWVFGSGLERD 637

Query: 503  IKSS---RRKEELPQWWPQVVSQGPVE----VENKEEYQRLANHLIQAIMDRKMGGQDIS 661
               S   R    LP+WW   VS GP +    V +K+EYQR AN LI+AI D +  G+D++
Sbjct: 638  DWRSYRVRSASRLPKWWS--VSVGPEQEDMVVVDKQEYQRDANRLIRAITDYRTSGKDVT 695

Query: 662  RNDVVQLRHICKTYGVRASIST 727
              D++QLR IC+T GV+ S  T
Sbjct: 696  EFDIIQLRRICRTSGVQVSFDT 717


>ref|XP_007026508.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508715113|gb|EOY07010.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 325

 Score =  108 bits (271), Expect = 1e-21
 Identities = 84/253 (33%), Positives = 126/253 (49%), Gaps = 12/253 (4%)
 Frame = +2

Query: 5   RHHRRRLLKYSPNHSSTPSQHQPAI-FKLSDDTLQITLKSPSTSLQNLET-KLNQFLDIG 178
           R  RR  L  +PN+ +     + +I F+ S D   + L      + +L + KLN+ +   
Sbjct: 43  RRSRRSRLPRNPNYDNHNLSLRRSIEFQNSPDNPNVKLVLDFDQISSLSSSKLNRLISFS 102

Query: 179 REALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM------RK 340
            +A  DLR +V +D  T  + +SCR+ST++              AF  L  +      R 
Sbjct: 103 TDAFQDLRNLVQIDPDTRTLQLSCRKSTLQFLAAFLTCGFVIVFAFTVLVKLGLGLKARF 162

Query: 341 NGGGEVLVYKRDRSLGGKVVAVG-KREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKSS- 514
               +V+V +RDRSLGG+ V VG KR+                S       K    +   
Sbjct: 163 RPKHKVIV-RRDRSLGGREVIVGTKRDGGDPPSFRALDNPLSLSTARPLSTKTNYPRLQV 221

Query: 515 RRKEELPQWWPQVVSQGPVE--VENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRH 688
           +  ++LP+WWP++ S  P E  V N E YQ  AN LI+AI+D ++GG+DI+  D++QLR 
Sbjct: 222 QLGDKLPKWWPEMDSV-PKEGSVFNSEYYQTQANRLIRAIIDSRLGGKDITEEDIIQLRQ 280

Query: 689 ICKTYGVRASIST 727
           IC+T GVR SI T
Sbjct: 281 ICRTSGVRVSIDT 293


>ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508715112|gb|EOY07009.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 444

 Score =  108 bits (271), Expect = 1e-21
 Identities = 84/253 (33%), Positives = 126/253 (49%), Gaps = 12/253 (4%)
 Frame = +2

Query: 5   RHHRRRLLKYSPNHSSTPSQHQPAI-FKLSDDTLQITLKSPSTSLQNLET-KLNQFLDIG 178
           R  RR  L  +PN+ +     + +I F+ S D   + L      + +L + KLN+ +   
Sbjct: 43  RRSRRSRLPRNPNYDNHNLSLRRSIEFQNSPDNPNVKLVLDFDQISSLSSSKLNRLISFS 102

Query: 179 REALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM------RK 340
            +A  DLR +V +D  T  + +SCR+ST++              AF  L  +      R 
Sbjct: 103 TDAFQDLRNLVQIDPDTRTLQLSCRKSTLQFLAAFLTCGFVIVFAFTVLVKLGLGLKARF 162

Query: 341 NGGGEVLVYKRDRSLGGKVVAVG-KREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKSS- 514
               +V+V +RDRSLGG+ V VG KR+                S       K    +   
Sbjct: 163 RPKHKVIV-RRDRSLGGREVIVGTKRDGGDPPSFRALDNPLSLSTARPLSTKTNYPRLQV 221

Query: 515 RRKEELPQWWPQVVSQGPVE--VENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRH 688
           +  ++LP+WWP++ S  P E  V N E YQ  AN LI+AI+D ++GG+DI+  D++QLR 
Sbjct: 222 QLGDKLPKWWPEMDSV-PKEGSVFNSEYYQTQANRLIRAIIDSRLGGKDITEEDIIQLRQ 280

Query: 689 ICKTYGVRASIST 727
           IC+T GVR SI T
Sbjct: 281 ICRTSGVRVSIDT 293


>ref|XP_006858644.1| hypothetical protein AMTR_s00066p00051680 [Amborella trichopoda]
           gi|548862755|gb|ERN20111.1| hypothetical protein
           AMTR_s00066p00051680 [Amborella trichopoda]
          Length = 447

 Score =  108 bits (269), Expect = 2e-21
 Identities = 77/226 (34%), Positives = 113/226 (50%), Gaps = 13/226 (5%)
 Frame = +2

Query: 89  SDDTLQITLKSPSTSLQNLETKLNQFLDIGREALDDLRTVVAVDGSTGRVVISCRRSTVE 268
           SD  L++ +       Q  E+ LN  L  G+EAL DL+ +V +DG+  R+ +SCRRS++E
Sbjct: 66  SDQKLEMVVDLKRMRTQVSES-LNLLLINGKEALKDLQGLVTIDGND-RITVSCRRSSLE 123

Query: 269 XXXXXXXXXXXXXIAFRSLFAMRKNGG---GEVLVYKRDRSLGGKVVAVGKREXXXXXXX 439
                           R L  +    G      LV +RDRSLGG+ V VG R        
Sbjct: 124 FIAYTFVLALCIVFVIRVLLKLGSRYGLYSNWGLVRRRDRSLGGREVVVGLRTKGKDSSA 183

Query: 440 XXXXXXXDN------SEYYYQKKKDRMIKSSRRKEE----LPQWWPQVVSQGPVEVENKE 589
                   N             K++ M   ++ +EE    LP+WWP   S   + +  K+
Sbjct: 184 KIRVSNSINPLSNVGGALGIISKRNSMNHFNKAEEEDEEKLPKWWPDAGSSVIMALP-KD 242

Query: 590 EYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICKTYGVRASIST 727
           EYQR AN +I+AIMD++M G+D++ +D++QLR ICK  G + SI T
Sbjct: 243 EYQREANRMIRAIMDKRMSGRDVTEDDIIQLRRICKISGAKVSIKT 288


>ref|XP_006294241.1| hypothetical protein CARUB_v10023240mg [Capsella rubella]
           gi|482562949|gb|EOA27139.1| hypothetical protein
           CARUB_v10023240mg [Capsella rubella]
          Length = 437

 Score =  108 bits (269), Expect = 2e-21
 Identities = 67/214 (31%), Positives = 109/214 (50%), Gaps = 5/214 (2%)
 Frame = +2

Query: 92  DDTLQITLKSPSTSLQNLETKLNQFLDIGREALDDLRTVVAVDGSTGRVVISCRRSTVEX 271
           D +L +TL     S +   ++    LD G++A  DL+T++A+D +  RVV+SC++ST++ 
Sbjct: 73  DQSLSLTLDVHGIS-KLANSRFQLLLDSGKDAFSDLQTLIALDDNR-RVVVSCKKSTMQF 130

Query: 272 XXXXXXXXXXXXIAFRSLFAMRKNGGGEV-----LVYKRDRSLGGKVVAVGKREXXXXXX 436
                        A R L  +     G        V +RDRSLGGK V V          
Sbjct: 131 VGGVVVLGLVLGFAIRVLVKLGSALKGNFQSNPKFVVRRDRSLGGKEVVVSVDSIRSSSR 190

Query: 437 XXXXXXXXDNSEYYYQKKKDRMIKSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHL 616
                   D +       ++  +KS   +  LP+WWP  +    ++V +KEEYQR AN +
Sbjct: 191 DSKSFMASDQASQSNSIPRNLQLKS---QNNLPKWWPTSLPSQNLDVVDKEEYQREANRI 247

Query: 617 IQAIMDRKMGGQDISRNDVVQLRHICKTYGVRAS 718
           ++AI+D +  G+DI+ +D++QLR +C+  GV+ S
Sbjct: 248 VRAIVDNRTSGKDITDDDIIQLRRVCRISGVQVS 281


>ref|XP_006397488.1| hypothetical protein EUTSA_v10001453mg [Eutrema salsugineum]
           gi|557098561|gb|ESQ38941.1| hypothetical protein
           EUTSA_v10001453mg [Eutrema salsugineum]
          Length = 437

 Score =  107 bits (268), Expect = 3e-21
 Identities = 74/224 (33%), Positives = 114/224 (50%), Gaps = 14/224 (6%)
 Frame = +2

Query: 92  DDTLQITLKSPSTSLQNLETKLNQFLDIGREALDDLRTVVAVDGSTGRVVISCRRSTVEX 271
           D +L +TL     S     ++   FLD G++A  DL+T++A+D +  R+V+SCR+ST++ 
Sbjct: 78  DQSLSLTLDVHRISAL-ATSRFQLFLDSGKDAFSDLQTLIALDDNR-RIVVSCRKSTMQF 135

Query: 272 XXXXXXXXXXXXIAFRSLF----AMRKNGGGEV-LVYKRDRSLGGKVVAVGKREXXXXXX 436
                       +A R L     A + N  G+  LV +RDRSLGGK V V          
Sbjct: 136 VGGVVLLGFVFGVAIRVLVKLGSAFKGNFQGKPKLVVRRDRSLGGKEVVVA--------- 186

Query: 437 XXXXXXXXDNSEYYYQKKKDRMIKSS---------RRKEELPQWWPQVVSQGPVEVENKE 589
                   DNS           +  S         R +  LP+WWP  +    +EV+ +E
Sbjct: 187 -------VDNSRSSSSSIAPGQVSRSNSVPTNLKLRAQNNLPKWWPTSLPSQSLEVD-RE 238

Query: 590 EYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICKTYGVRASI 721
           +YQR AN +++AI+D +  G+DI+ ND++QLR +C+  GV+ SI
Sbjct: 239 DYQREANKIVRAIVDNRTSGKDITDNDIIQLRRVCRISGVQVSI 282


>ref|XP_007224230.1| hypothetical protein PRUPE_ppa018099mg, partial [Prunus persica]
           gi|462421166|gb|EMJ25429.1| hypothetical protein
           PRUPE_ppa018099mg, partial [Prunus persica]
          Length = 414

 Score =  106 bits (264), Expect = 9e-21
 Identities = 82/246 (33%), Positives = 114/246 (46%), Gaps = 19/246 (7%)
 Frame = +2

Query: 47  SSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNL----ETKLNQFLDIGREALDDLRTVVA 214
           ++T     P+      DTLQ T       LQ L       L QFL    +AL DLRT+V+
Sbjct: 27  TTTKPYFYPSSSPSRPDTLQATF-----DLQYLYHTSHYSLQQFLSSASDALQDLRTLVS 81

Query: 215 VDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSL-------FAMRKNGGGEVLVYKR 373
           VD    RV++SCR ST+              + FR L       F  R   G E  V +R
Sbjct: 82  VDADN-RVIVSCRPSTLRFVGNLVIMTFAVVLGFRVLVGLVRLGFGGRSGYGREGTVVRR 140

Query: 374 DRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKK-----DRMIKSSRR---KEE 529
           DRSLGGK V VG+ E               ++     K+       R++ S  R   K++
Sbjct: 141 DRSLGGKEVVVGRVEKDRVDVRKKKSFGMLDNPLSMPKRTVVDGLGRLLNSRVRVWEKKK 200

Query: 530 LPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLRHICKTYGV 709
           LP WWP  + Q    V +K+ YQ  A+ L++AI D +M G+DI  +D++ LR IC+   V
Sbjct: 201 LPSWWPSSMPQQS-SVVDKDYYQSEADRLVRAITDNRMSGKDIVEDDIIHLRQICRASRV 259

Query: 710 RASIST 727
           R +  T
Sbjct: 260 RVTFDT 265


>ref|NP_001078044.1| uncharacterized protein [Arabidopsis thaliana]
           gi|62320356|dbj|BAD94734.1| hypothetical protein
           [Arabidopsis thaliana] gi|330255140|gb|AEC10234.1|
           uncharacterized protein AT2G43235 [Arabidopsis thaliana]
          Length = 437

 Score =  103 bits (256), Expect = 8e-20
 Identities = 66/215 (30%), Positives = 110/215 (51%), Gaps = 6/215 (2%)
 Frame = +2

Query: 92  DDTLQITLKSPSTS-LQNLETKLNQFLDIGREALDDLRTVVAVDGSTGRVVISCRRSTVE 268
           D +L +TL     S L N   +L  FLD  ++A  DL+T++++D +  RVV+SC++ST++
Sbjct: 73  DQSLSLTLDVHRISTLANYRFQL--FLDSSKDAFSDLQTLISLDDNR-RVVVSCKKSTMQ 129

Query: 269 XXXXXXXXXXXXXIAFRSLFAMRKNGGGEV-----LVYKRDRSLGGKVVAVGKREXXXXX 433
                         A R L  +     G        V +RDRSLGGK V V         
Sbjct: 130 FVGGVVILGFVFGFAIRVLVKLGSALKGNFQSNPKFVVRRDRSLGGKEVVVSVDNIRSSS 189

Query: 434 XXXXXXXXXDNSEYYYQKKKDRMIKSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANH 613
                    D +       ++  +K+   +  LP+WWP  ++    +V +KE+YQR AN 
Sbjct: 190 RDSKSFIASDQASRSNSTPRNLHLKA---QNNLPKWWPTSLTSQSFDVVDKEDYQREANR 246

Query: 614 LIQAIMDRKMGGQDISRNDVVQLRHICKTYGVRAS 718
           +++AI+D +  G+DI+ +D++QLR +C+  GV+ +
Sbjct: 247 IVRAIVDNRTSGKDITDDDIIQLRRVCRISGVQVT 281


>ref|XP_006606588.1| PREDICTED: uncharacterized protein LOC100814523 isoform X2 [Glycine
           max]
          Length = 416

 Score =  102 bits (255), Expect = 1e-19
 Identities = 79/252 (31%), Positives = 127/252 (50%), Gaps = 12/252 (4%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGR 181
           R   RRR  K S   +++ S  +P +  + D T       P T+ Q+   +L +FL  G+
Sbjct: 33  RSRSRRRHNKVSLPTTTSTSSFEPKLEAVIDLT-------PLTAFQS---ELRRFLSSGK 82

Query: 182 EALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM-------RK 340
           +A  DL+T+V +D +  R+V+SCR ST+                F  L  +       R+
Sbjct: 83  DAYRDLQTLVTLDHNR-RLVVSCRPSTLHFLGTSAALTFLAFSVFTVLAQLISRFSSWRR 141

Query: 341 NGGG-EVLVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKS 511
           N    + +V +RDRSLGGK  VVA G+R                  +   +  K++++  
Sbjct: 142 NASSHKPVVVRRDRSLGGKEVVVAWGQRSDTNPLSAPVR-------DSVKRSAKNKVVPF 194

Query: 512 SRRKEELPQWWPQVV--SQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLR 685
            R+   LP+WWP V+  S    +   +EEY+R A  +++AI + ++GG DI  ND++QLR
Sbjct: 195 QRK---LPKWWPTVINSSASVFDANEQEEYKREAYRVVRAITNSRLGGNDIMENDIIQLR 251

Query: 686 HICKTYGVRASI 721
            +C+T GV+ SI
Sbjct: 252 RLCRTSGVQVSI 263


>ref|XP_003556568.1| PREDICTED: uncharacterized protein LOC100814523 isoform X1 [Glycine
           max]
          Length = 431

 Score =  102 bits (255), Expect = 1e-19
 Identities = 79/252 (31%), Positives = 127/252 (50%), Gaps = 12/252 (4%)
 Frame = +2

Query: 2   RRHHRRRLLKYSPNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIGR 181
           R   RRR  K S   +++ S  +P +  + D T       P T+ Q+   +L +FL  G+
Sbjct: 33  RSRSRRRHNKVSLPTTTSTSSFEPKLEAVIDLT-------PLTAFQS---ELRRFLSSGK 82

Query: 182 EALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM-------RK 340
           +A  DL+T+V +D +  R+V+SCR ST+                F  L  +       R+
Sbjct: 83  DAYRDLQTLVTLDHNR-RLVVSCRPSTLHFLGTSAALTFLAFSVFTVLAQLISRFSSWRR 141

Query: 341 NGGG-EVLVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIKS 511
           N    + +V +RDRSLGGK  VVA G+R                  +   +  K++++  
Sbjct: 142 NASSHKPVVVRRDRSLGGKEVVVAWGQRSDTNPLSAPVR-------DSVKRSAKNKVVPF 194

Query: 512 SRRKEELPQWWPQVV--SQGPVEVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQLR 685
            R+   LP+WWP V+  S    +   +EEY+R A  +++AI + ++GG DI  ND++QLR
Sbjct: 195 QRK---LPKWWPTVINSSASVFDANEQEEYKREAYRVVRAITNSRLGGNDIMENDIIQLR 251

Query: 686 HICKTYGVRASI 721
            +C+T GV+ SI
Sbjct: 252 RLCRTSGVQVSI 263


>ref|XP_007142633.1| hypothetical protein PHAVU_007G004000g [Phaseolus vulgaris]
           gi|561015823|gb|ESW14627.1| hypothetical protein
           PHAVU_007G004000g [Phaseolus vulgaris]
          Length = 436

 Score =  102 bits (254), Expect = 1e-19
 Identities = 81/253 (32%), Positives = 122/253 (48%), Gaps = 13/253 (5%)
 Frame = +2

Query: 2   RRHHRRRLLKYS-PNHSSTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDIG 178
           R   RRR  K S P  +ST S  +P    + D           T L  L++ L +F+  G
Sbjct: 35  RSRSRRRHNKVSLPATTSTSSSFEPKFEAVID----------LTPLTTLQSHLRRFILSG 84

Query: 179 REALDDLRTVVAVDGSTGRVVISCRRSTVEXXXXXXXXXXXXXIAFRSLFAM-------- 334
           R++  DL T++ +D +  R+V+SCR ST+                F  L  +        
Sbjct: 85  RDSYLDLETLLTLDHNR-RLVVSCRPSTLHFLGTSAALTLLTFSVFSVLARLISRFSSWR 143

Query: 335 RKNGGGEVLVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMIK 508
           R       LV +RDRSLGGK  VVA G+R                      ++    M++
Sbjct: 144 RNASNNRPLVVRRDRSLGGKEVVVAWGQRSNSNPLSPAVRGSV--------KRSAKNMVR 195

Query: 509 SSRRKEELPQWWPQVVS-QGPV-EVENKEEYQRLANHLIQAIMDRKMGGQDISRNDVVQL 682
             R+   LP+WWP VV+  G V +   +EEY+R A  +++AI + ++GG DI+ ND++QL
Sbjct: 196 FERK---LPEWWPTVVNANGSVFDANEQEEYKREAYRVVRAITNSRLGGNDINENDIIQL 252

Query: 683 RHICKTYGVRASI 721
           R +C+T GV+ SI
Sbjct: 253 RQLCRTSGVQVSI 265


Top