BLASTX nr result

ID: Mentha23_contig00008049

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00008049
         (754 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU36739.1| hypothetical protein MIMGU_mgv1a007033mg [Mimulus...   261   1e-67
gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlise...   189   9e-46
ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297...   125   2e-26
ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854...   125   2e-26
ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596...   122   2e-25
ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254...   114   3e-23
ref|XP_006858644.1| hypothetical protein AMTR_s00066p00051680 [A...   107   3e-21
gb|EXB68682.1| putative Golgi transport protein 1 [Morus notabilis]   104   3e-20
ref|XP_007224230.1| hypothetical protein PRUPE_ppa018099mg, part...   104   3e-20
ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Popu...   104   3e-20
ref|XP_006397488.1| hypothetical protein EUTSA_v10001453mg [Eutr...   103   6e-20
ref|XP_002529769.1| conserved hypothetical protein [Ricinus comm...   103   8e-20
ref|XP_007026508.1| Uncharacterized protein isoform 2, partial [...   102   1e-19
ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma...   102   1e-19
ref|XP_006294241.1| hypothetical protein CARUB_v10023240mg [Caps...   102   2e-19
ref|XP_006443108.1| hypothetical protein CICLE_v10020134mg [Citr...    99   1e-18
ref|XP_004161605.1| PREDICTED: uncharacterized LOC101216122 [Cuc...    99   1e-18
ref|XP_004152289.1| PREDICTED: uncharacterized protein LOC101216...    99   1e-18
ref|NP_001078044.1| uncharacterized protein [Arabidopsis thalian...    98   4e-18
gb|AHA84272.1| sugar porter [Phaseolus vulgaris]                       97   5e-18

>gb|EYU36739.1| hypothetical protein MIMGU_mgv1a007033mg [Mimulus guttatus]
          Length = 422

 Score =  261 bits (668), Expect = 1e-67
 Identities = 147/254 (57%), Positives = 171/254 (67%), Gaps = 4/254 (1%)
 Frame = -3

Query: 752 RHAVVTSXXXXXXXXXXL-KYSPNHSPTPSQHQPAIFKLSDDTLQITLKSPSTSLQ--NL 582
           RHA+VTS          L KYSP  + TP    P IFKLSDD LQITL+ PSTSLQ   L
Sbjct: 17  RHAIVTSAIPRRHHRRRLLKYSPTPANTPI-FAPTIFKLSDDGLQITLRRPSTSLQVQQL 75

Query: 581 ETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSL 402
           ETKLNQ +  GREA DDLRT+VAVD + GG VISCRRS+VE            VIAFR L
Sbjct: 76  ETKLNQLIGRGREAFDDLRTVVAVDETNGGFVISCRRSSVEFLAALFFSSLVVVIAFRGL 135

Query: 401 FMMRKNYGGEVMVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMK 222
           F       GEV+VYKRDRSLGGK V VGK+E              ++++YYY+KK +R K
Sbjct: 136 FKQISKNSGEVLVYKRDRSLGGKEVVVGKKETNLPTRRKPTPLSSNDADYYYEKKINRTK 195

Query: 221 TSSR-RKEELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVEL 45
              + RKEELPQWWPQ V+ G  E+ENKEEYQR+AN LI AI+DRKM GEDIS  D+V+L
Sbjct: 196 ILGKSRKEELPQWWPQAVNLGSPEIENKEEYQRMANQLIGAIVDRKMAGEDISANDIVQL 255

Query: 44  RHICKTYGVRASIS 3
           RH+CKTYGV+ SIS
Sbjct: 256 RHLCKTYGVKTSIS 269


>gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlisea aurea]
          Length = 400

 Score =  189 bits (480), Expect = 9e-46
 Identities = 111/237 (46%), Positives = 144/237 (60%), Gaps = 5/237 (2%)
 Frame = -3

Query: 698 KYSPNHSPTPS---QHQP--AIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALD 534
           KYSPN +P  S   +  P   I KLSD+ LQITL SPS SL+ +E+KLNQ ++ GREA  
Sbjct: 14  KYSPNRNPETSPLIRSTPPITILKLSDNGLQITLSSPSNSLEKVESKLNQIIECGREAFF 73

Query: 533 DLRTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSLFMMRKNYGGEVMVYKR 354
           DLRT+V  D   G V ISCRRSTVE            V+  R++F +RKN G + +VY+R
Sbjct: 74  DLRTLVTFDEDYGRVSISCRRSTVEFFIGLFISGFLVVLIIRNVFKLRKN-GRQALVYRR 132

Query: 353 DRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTSSRRKEELPQWWPQV 174
           DRSLGG+ V VG                  +   Y+QKK+  ++  S RKE+LPQWWPQ 
Sbjct: 133 DRSLGGREVLVGTGHSNWSSKLTSNPLDSVSISDYHQKKRGIIQGMS-RKEKLPQWWPQ- 190

Query: 173 VSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRASIS 3
                 E  N E YQR+AN L+Q I+DR++ GEDIS  D+V+LR++CK + V  SIS
Sbjct: 191 FHDSSGEAPNTEGYQRIANQLVQGIVDRRVSGEDISMDDIVQLRYLCKAHRVNVSIS 247


>ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297340 [Fragaria vesca
           subsp. vesca]
          Length = 430

 Score =  125 bits (313), Expect = 2e-26
 Identities = 87/238 (36%), Positives = 122/238 (51%), Gaps = 10/238 (4%)
 Frame = -3

Query: 692 SPNHSPTPSQHQPAIFKLSD-DTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIV 516
           +PN   T    +PA +  SD + LQ T    +T   +  + L  FL +  +A++DL+T+V
Sbjct: 43  NPNTPTTVPTSKPAFYTSSDPENLQATFDL-NTLYYSSHSYLRYFLSSASDAVEDLQTLV 101

Query: 515 AVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSL-------FMMRKNYGGEVMVYK 357
           +VD     +V+SCR ST+             V+ FR L       F     YG E +V +
Sbjct: 102 SVDADRR-IVVSCRPSTLRFVGNFAVATCAVVLGFRVLVGLVRLGFGSGSGYGREKVVTR 160

Query: 356 RDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRM--KTSSRRKEELPQWW 183
           RDRSLGGK V V + E                 E    KK++ +  K   R  E+LPQWW
Sbjct: 161 RDRSLGGKEVVVARVERPRA------------EEVSVTKKRESVFKKNRVRFGEKLPQWW 208

Query: 182 PQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
           P   SQ  + V+N EE+QR AN L++AI D +M G+DI   D++ LR IC+ YGVR S
Sbjct: 209 PTTTSQPILGVDN-EEHQREANRLVRAITDNRMSGKDIMEDDIIHLRQICRVYGVRVS 265


>ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854590 [Vitis vinifera]
          Length = 436

 Score =  125 bits (313), Expect = 2e-26
 Identities = 89/244 (36%), Positives = 118/244 (48%), Gaps = 14/244 (5%)
 Frame = -3

Query: 695 YSPNH----SPTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDL 528
           Y P+H    SP P  H         D  QI L              N+ + +G +A+DDL
Sbjct: 41  YHPHHNNKPSPDPKLHMVVDLHRLSDRAQILL--------------NRLVSSGADAIDDL 86

Query: 527 RTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSL----FMMRKNYG---GEV 369
           RT+VAVD +T  VVI+CR ST+             V  FR L      +R+ +G   G  
Sbjct: 87  RTLVAVDRATQSVVIACRPSTLRFVGGFVVWSLVVVFGFRVLVRLGLRLRREFGFGSGRG 146

Query: 368 MVYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQ---KKKDRMKTSSRRKEE 198
           +V +RDRSLGGK V VG+ E                            D     SR ++ 
Sbjct: 147 VVVRRDRSLGGKEVVVGRAEESEWRMRNHSRVLGSPLSVVPGIGVNGGDWSPGRSRTEKR 206

Query: 197 LPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGV 18
           LP+WWP V    P+EV +K+EYQR AN LI+ IM  +M G+DI   D+++LR IC+T G 
Sbjct: 207 LPKWWP-VTLPPPLEVFDKQEYQREANRLIREIMANRMSGKDILEDDMIQLRRICRTSGA 265

Query: 17  RASI 6
           RASI
Sbjct: 266 RASI 269


>ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596187 [Solanum tuberosum]
          Length = 455

 Score =  122 bits (305), Expect = 2e-25
 Identities = 85/235 (36%), Positives = 118/235 (50%), Gaps = 13/235 (5%)
 Frame = -3

Query: 674 TPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTG 495
           TP   Q   F L+ D L    KS  +    L  KL +FL +GR A++DLRT++ VD   G
Sbjct: 55  TPPSDQNLHFVLTVDNLPT--KSFYSIKDLLHLKLGEFLHSGRAAIEDLRTLIRVDTDAG 112

Query: 494 GVVISCRRSTVEXXXXXXXXXXXXVIAFRSLFMM----RKNYGGE--VMVYKRDRSLGGK 333
            +  SC RSTV+            +   R++  +    R N G     +VYKRDRSLGG+
Sbjct: 113 RLSFSCTRSTVKFLATLVVSSFLLIFTLRAIVNLVRGIRLNSGNNNVELVYKRDRSLGGR 172

Query: 332 VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTSSRRK------EELPQWWPQVV 171
            V V K E              D     +   +D   + SRR+      E+LP+WWP   
Sbjct: 173 EVLVAKNETPTLDRKKPNVLDSDEGNSNWDWDRDSPISFSRRRKKKSSVEQLPKWWPVST 232

Query: 170 S-QGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
           S    V  EN+EEYQR+AN LI+AI+D +M G+DI   D+++LR I +   V+ S
Sbjct: 233 SGSDQVGAENQEEYQRMANRLIRAILDNRMTGKDILADDIIQLRRIGRISNVKVS 287


>ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254735 [Solanum
           lycopersicum]
          Length = 458

 Score =  114 bits (286), Expect = 3e-23
 Identities = 82/241 (34%), Positives = 119/241 (49%), Gaps = 11/241 (4%)
 Frame = -3

Query: 698 KYSPNHSPTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTI 519
           K+SP    TP   Q   F L+ D L    KS  +    +  KL +FL +GR A++DL+T+
Sbjct: 54  KFSPED--TPPSDQNLHFVLTVDNLPT--KSFYSIKDLIHLKLREFLHSGRAAIEDLQTL 109

Query: 518 VAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSLFMMRK----NYGGE--VMVYK 357
           + +D   G V  SC RSTV+            +   R++  + +    N G     +VYK
Sbjct: 110 IRIDTDAGRVSFSCTRSTVKFLATLLVSTFLLIFTLRAILNLVRRIPLNTGNNNVELVYK 169

Query: 356 RDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTSSRRK----EELPQ 189
           RDRSLGG+ V V K E              D     +        +  R+K    E+LP+
Sbjct: 170 RDRSLGGREVLVAKNETPTLDRKKPNVLDRDEGNSNWDLDTPISFSRRRKKKSSVEQLPK 229

Query: 188 WWPQVVS-QGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRA 12
           WWP   S    V  EN+EEYQR+A+ LI+AI+D +M G+DI   D+++LR I +   V+ 
Sbjct: 230 WWPVSTSGSDQVGTENQEEYQRMADRLIRAILDNRMTGKDILADDIIQLRRIGRISNVKV 289

Query: 11  S 9
           S
Sbjct: 290 S 290


>ref|XP_006858644.1| hypothetical protein AMTR_s00066p00051680 [Amborella trichopoda]
           gi|548862755|gb|ERN20111.1| hypothetical protein
           AMTR_s00066p00051680 [Amborella trichopoda]
          Length = 447

 Score =  107 bits (268), Expect = 3e-21
 Identities = 75/224 (33%), Positives = 109/224 (48%), Gaps = 13/224 (5%)
 Frame = -3

Query: 638 SDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVE 459
           SD  L++ +       Q  E+ LN  L  G+EAL DL+ +V +DG+   + +SCRRS++E
Sbjct: 66  SDQKLEMVVDLKRMRTQVSES-LNLLLINGKEALKDLQGLVTIDGN-DRITVSCRRSSLE 123

Query: 458 XXXXXXXXXXXXVIAFRSLFMMRKNYG---GEVMVYKRDRSLGGKVVAVGKREXXXXXXX 288
                       V   R L  +   YG      +V +RDRSLGG+ V VG R        
Sbjct: 124 FIAYTFVLALCIVFVIRVLLKLGSRYGLYSNWGLVRRRDRSLGGREVVVGLRTKGKDSSA 183

Query: 287 XXXXXXXDN------SEYYYQKKKDRM----KTSSRRKEELPQWWPQVVSQGPVEVENKE 138
                   N             K++ M    K     +E+LP+WWP   S   +    K+
Sbjct: 184 KIRVSNSINPLSNVGGALGIISKRNSMNHFNKAEEEDEEKLPKWWPDAGS-SVIMALPKD 242

Query: 137 EYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRASI 6
           EYQR AN +I+AIMD++M G D++  D+++LR ICK  G + SI
Sbjct: 243 EYQREANRMIRAIMDKRMSGRDVTEDDIIQLRRICKISGAKVSI 286


>gb|EXB68682.1| putative Golgi transport protein 1 [Morus notabilis]
          Length = 586

 Score =  104 bits (260), Expect = 3e-20
 Identities = 77/243 (31%), Positives = 123/243 (50%), Gaps = 13/243 (5%)
 Frame = -3

Query: 698 KYSPNHSPTPSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTI 519
           K S + S  PS        +  D  +++L S ++ L+ L       + +  +AL DLRT+
Sbjct: 37  KPSSSSSSNPSSSNSNYVAVVIDLERLSLSSSNSHLRRL-------IASADDALTDLRTL 89

Query: 518 VAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSLF--MMRKNY---GGEVMVYKR 354
           VA+D + G +++SCRRST+             V+ FR+LF  + ++ +   GG  +V +R
Sbjct: 90  VALDDA-GRLLVSCRRSTLRFVANSLLFSCVVVLGFRALFWLLFKRTHSFGGGGHVVVRR 148

Query: 353 DRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTS-SRRKEELPQWWPQ 177
           DRSLGGK V V +                 ++           +T  S R++ LP+WWP 
Sbjct: 149 DRSLGGKEVVVARTPPGPSSSTRRALSSPLSAAKEGVGLVGGTETRVSSREKRLPKWWPS 208

Query: 176 VV-------SQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGV 18
           +        S     + +K++YQR A+ LI+AI D +M G+DI   D+++LR IC+T GV
Sbjct: 209 LELDKQNWDSDSSDGIFDKQDYQRDADRLIRAITDNRMSGKDIVADDIIQLRRICRTSGV 268

Query: 17  RAS 9
           R S
Sbjct: 269 RVS 271


>ref|XP_007224230.1| hypothetical protein PRUPE_ppa018099mg, partial [Prunus persica]
           gi|462421166|gb|EMJ25429.1| hypothetical protein
           PRUPE_ppa018099mg, partial [Prunus persica]
          Length = 414

 Score =  104 bits (260), Expect = 3e-20
 Identities = 84/246 (34%), Positives = 114/246 (46%), Gaps = 19/246 (7%)
 Frame = -3

Query: 689 PNHSPTPSQHQPAIFKLSDDTLQITLKSPSTSLQNL----ETKLNQFLDTGREALDDLRT 522
           P   P+ S  +P       DTLQ T       LQ L       L QFL +  +AL DLRT
Sbjct: 31  PYFYPSSSPSRP-------DTLQATF-----DLQYLYHTSHYSLQQFLSSASDALQDLRT 78

Query: 521 IVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSL-------FMMRKNYGGEVMV 363
           +V+VD     V++SCR ST+             V+ FR L       F  R  YG E  V
Sbjct: 79  LVSVDADNR-VIVSCRPSTLRFVGNLVIMTFAVVLGFRVLVGLVRLGFGGRSGYGREGTV 137

Query: 362 YKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKK-----DRMKTSSRR--- 207
            +RDRSLGGK V VG+ E               ++     K+       R+  S  R   
Sbjct: 138 VRRDRSLGGKEVVVGRVEKDRVDVRKKKSFGMLDNPLSMPKRTVVDGLGRLLNSRVRVWE 197

Query: 206 KEELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKT 27
           K++LP WWP  + Q    V +K+ YQ  A+ L++AI D +M G+DI   D++ LR IC+ 
Sbjct: 198 KKKLPSWWPSSMPQQS-SVVDKDYYQSEADRLVRAITDNRMSGKDIVEDDIIHLRQICRA 256

Query: 26  YGVRAS 9
             VR +
Sbjct: 257 SRVRVT 262


>ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Populus trichocarpa]
           gi|222852726|gb|EEE90273.1| hypothetical protein
           POPTR_0007s02350g [Populus trichocarpa]
          Length = 447

 Score =  104 bits (260), Expect = 3e-20
 Identities = 73/227 (32%), Positives = 119/227 (52%), Gaps = 14/227 (6%)
 Frame = -3

Query: 641 LSDDTLQITLKSPSTSLQNL-ETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRST 465
           L++++  + L    T +  L  ++ +QFL  G+EA+DDL+T+V++D     VV+SC++ST
Sbjct: 62  LNNNSQNLKLVLNITQISKLPSSRFHQFLSLGQEAVDDLKTLVSLD-ENNRVVLSCQKST 120

Query: 464 VEXXXXXXXXXXXXVIAFRSLFMM----RKNYGGEV---MVYKRDRSLGGKVVAVG---- 318
           ++            + + R LF +    ++ +G       V +RDRSLGGK V V     
Sbjct: 121 LQFAGTVLLSGFLLISSIRVLFKLGLGFKRKFGAGKNPNFVVRRDRSLGGKEVIVAVDDQ 180

Query: 317 -KREXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTSSRRKEELPQWWPQVVS-QGPVEVEN 144
            + E                 +    ++ D  +     +++LP+WWP   S  G V   +
Sbjct: 181 QREESKRPKRLANPVEISGLVDGLGFERGDWTRYRVGSQQKLPKWWPDSGSFSGRVVGPD 240

Query: 143 KEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRASIS 3
           +EEYQR AN LI+AI D +  G+D+   D+++LR IC+T GVRAS S
Sbjct: 241 QEEYQREANRLIRAITDYRTRGKDVMEHDIIQLRRICRTSGVRASFS 287


>ref|XP_006397488.1| hypothetical protein EUTSA_v10001453mg [Eutrema salsugineum]
           gi|557098561|gb|ESQ38941.1| hypothetical protein
           EUTSA_v10001453mg [Eutrema salsugineum]
          Length = 437

 Score =  103 bits (257), Expect = 6e-20
 Identities = 65/215 (30%), Positives = 109/215 (50%), Gaps = 5/215 (2%)
 Frame = -3

Query: 635 DDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEX 456
           D +L +TL     S     ++   FLD+G++A  DL+T++A+D +   +V+SCR+ST++ 
Sbjct: 78  DQSLSLTLDVHRISAL-ATSRFQLFLDSGKDAFSDLQTLIALDDNRR-IVVSCRKSTMQF 135

Query: 455 XXXXXXXXXXXVIAFRSLFMMRKNYGGEV-----MVYKRDRSLGGKVVAVGKREXXXXXX 291
                       +A R L  +   + G       +V +RDRSLGGK V V          
Sbjct: 136 VGGVVLLGFVFGVAIRVLVKLGSAFKGNFQGKPKLVVRRDRSLGGKEVVVAVDNSRSSSS 195

Query: 290 XXXXXXXXDNSEYYYQKKKDRMKTSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHL 111
                    ++      K        R +  LP+WWP  +    +EV+ +E+YQR AN +
Sbjct: 196 SIAPGQVSRSNSVPTNLKL-------RAQNNLPKWWPTSLPSQSLEVD-REDYQREANKI 247

Query: 110 IQAIMDRKMGGEDISRKDVVELRHICKTYGVRASI 6
           ++AI+D +  G+DI+  D+++LR +C+  GV+ SI
Sbjct: 248 VRAIVDNRTSGKDITDNDIIQLRRVCRISGVQVSI 282


>ref|XP_002529769.1| conserved hypothetical protein [Ricinus communis]
            gi|223530767|gb|EEF32635.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 898

 Score =  103 bits (256), Expect = 8e-20
 Identities = 77/234 (32%), Positives = 116/234 (49%), Gaps = 24/234 (10%)
 Frame = -3

Query: 638  SDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVE 459
            ++D LQ+ L     +     T  N+FL  G++A  DL+T++++D     +V +CR+STV+
Sbjct: 485  NNDNLQLVLDVNQITYLTSST-FNRFLSLGKDAYYDLKTLISLD-ENNRIVFTCRKSTVQ 542

Query: 458  XXXXXXXXXXXXVIAFRSL-----------FMMRKNYGGEVMVYKRDRSLGGKVVAVGKR 312
                        V AFR L           F +R N   + +V +RDRSLGGK V V +R
Sbjct: 543  FTGGVLLCGVVLVSAFRVLIKLGLGFRSWLFRVRNNRKNKDVVVRRDRSLGGKEVVVARR 602

Query: 311  ------EXXXXXXXXXXXXXXDNSEYYYQKKKDRMKTSS---RRKEELPQWWPQVVSQGP 159
                  +              DN  + +    +R    S   R    LP+WW   VS GP
Sbjct: 603  VEEERPKDVKRKRFGVLDNPLDNPSWVFGSGLERDDWRSYRVRSASRLPKWWS--VSVGP 660

Query: 158  VE----VENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
             +    V +K+EYQR AN LI+AI D +  G+D++  D+++LR IC+T GV+ S
Sbjct: 661  EQEDMVVVDKQEYQRDANRLIRAITDYRTSGKDVTEFDIIQLRRICRTSGVQVS 714


>ref|XP_007026508.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508715113|gb|EOY07010.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 325

 Score =  102 bits (254), Expect = 1e-19
 Identities = 70/224 (31%), Positives = 112/224 (50%), Gaps = 10/224 (4%)
 Frame = -3

Query: 647 FKLSDDTLQITLKSPSTSLQNLET-KLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRR 471
           F+ S D   + L      + +L + KLN+ +    +A  DLR +V +D  T  + +SCR+
Sbjct: 69  FQNSPDNPNVKLVLDFDQISSLSSSKLNRLISFSTDAFQDLRNLVQIDPDTRTLQLSCRK 128

Query: 470 STVEXXXXXXXXXXXXVIAFRSLFMMRKNYGGEV-----MVYKRDRSLGGKVVAVGKREX 306
           ST++            V AF  L  +             ++ +RDRSLGG+ V VG +  
Sbjct: 129 STLQFLAAFLTCGFVIVFAFTVLVKLGLGLKARFRPKHKVIVRRDRSLGGREVIVGTKRD 188

Query: 305 XXXXXXXXXXXXXDN--SEYYYQKKKDRMKTSSRRKEELPQWWPQVVSQGPVE--VENKE 138
                         +  +      K +  +   +  ++LP+WWP++ S  P E  V N E
Sbjct: 189 GGDPPSFRALDNPLSLSTARPLSTKTNYPRLQVQLGDKLPKWWPEMDSV-PKEGSVFNSE 247

Query: 137 EYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRASI 6
            YQ  AN LI+AI+D ++GG+DI+ +D+++LR IC+T GVR SI
Sbjct: 248 YYQTQANRLIRAIIDSRLGGKDITEEDIIQLRQICRTSGVRVSI 291


>ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508715112|gb|EOY07009.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 444

 Score =  102 bits (254), Expect = 1e-19
 Identities = 70/224 (31%), Positives = 112/224 (50%), Gaps = 10/224 (4%)
 Frame = -3

Query: 647 FKLSDDTLQITLKSPSTSLQNLET-KLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRR 471
           F+ S D   + L      + +L + KLN+ +    +A  DLR +V +D  T  + +SCR+
Sbjct: 69  FQNSPDNPNVKLVLDFDQISSLSSSKLNRLISFSTDAFQDLRNLVQIDPDTRTLQLSCRK 128

Query: 470 STVEXXXXXXXXXXXXVIAFRSLFMMRKNYGGEV-----MVYKRDRSLGGKVVAVGKREX 306
           ST++            V AF  L  +             ++ +RDRSLGG+ V VG +  
Sbjct: 129 STLQFLAAFLTCGFVIVFAFTVLVKLGLGLKARFRPKHKVIVRRDRSLGGREVIVGTKRD 188

Query: 305 XXXXXXXXXXXXXDN--SEYYYQKKKDRMKTSSRRKEELPQWWPQVVSQGPVE--VENKE 138
                         +  +      K +  +   +  ++LP+WWP++ S  P E  V N E
Sbjct: 189 GGDPPSFRALDNPLSLSTARPLSTKTNYPRLQVQLGDKLPKWWPEMDSV-PKEGSVFNSE 247

Query: 137 EYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRASI 6
            YQ  AN LI+AI+D ++GG+DI+ +D+++LR IC+T GVR SI
Sbjct: 248 YYQTQANRLIRAIIDSRLGGKDITEEDIIQLRQICRTSGVRVSI 291


>ref|XP_006294241.1| hypothetical protein CARUB_v10023240mg [Capsella rubella]
           gi|482562949|gb|EOA27139.1| hypothetical protein
           CARUB_v10023240mg [Capsella rubella]
          Length = 437

 Score =  102 bits (253), Expect = 2e-19
 Identities = 64/214 (29%), Positives = 105/214 (49%), Gaps = 5/214 (2%)
 Frame = -3

Query: 635 DDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEX 456
           D +L +TL     S +   ++    LD+G++A  DL+T++A+D +   VV+SC++ST++ 
Sbjct: 73  DQSLSLTLDVHGIS-KLANSRFQLLLDSGKDAFSDLQTLIALDDNRR-VVVSCKKSTMQF 130

Query: 455 XXXXXXXXXXXVIAFRSLFMMRKNYGGEVM-----VYKRDRSLGGKVVAVGKREXXXXXX 291
                        A R L  +     G        V +RDRSLGGK V V          
Sbjct: 131 VGGVVVLGLVLGFAIRVLVKLGSALKGNFQSNPKFVVRRDRSLGGKEVVVSVDSIRSSSR 190

Query: 290 XXXXXXXXDNSEYYYQKKKDRMKTSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHL 111
                     S+   Q          + +  LP+WWP  +    ++V +KEEYQR AN +
Sbjct: 191 DSKSFMA---SDQASQSNSIPRNLQLKSQNNLPKWWPTSLPSQNLDVVDKEEYQREANRI 247

Query: 110 IQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
           ++AI+D +  G+DI+  D+++LR +C+  GV+ S
Sbjct: 248 VRAIVDNRTSGKDITDDDIIQLRRVCRISGVQVS 281


>ref|XP_006443108.1| hypothetical protein CICLE_v10020134mg [Citrus clementina]
           gi|568850296|ref|XP_006478851.1| PREDICTED:
           uncharacterized protein LOC102619110 [Citrus sinensis]
           gi|557545370|gb|ESR56348.1| hypothetical protein
           CICLE_v10020134mg [Citrus clementina]
          Length = 448

 Score = 99.4 bits (246), Expect = 1e-18
 Identities = 82/241 (34%), Positives = 120/241 (49%), Gaps = 19/241 (7%)
 Frame = -3

Query: 671 PSQHQPAIFKLSDDTLQITLKSPSTSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGG 492
           PS        L  D  QI++ S S+     ++KL+ FL +  +A  DL+T++ +D + G 
Sbjct: 47  PSSFDGENINLVLDFHQISILSSSS-----KSKLHHFLSSAEQAYADLKTVITLDDN-GR 100

Query: 491 VVISCRRSTVEXXXXXXXXXXXXVIAFRSL------FMMRKNYGGEVMVYKRDRSLGGK- 333
           +++SCR+ST++            V  FR L      F  R  +  +  V +RDRSLGGK 
Sbjct: 101 LLVSCRKSTLQFVGGVLLSGFVLVFVFRVLVKLGLGFSSRFRFQKQNFVVRRDRSLGGKE 160

Query: 332 -VVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKD-------RMKTSSRRKE----ELPQ 189
            VVAVG+ +              DN      + +D       R+K S R +     +LP+
Sbjct: 161 VVVAVGRGDDDARLTRNLKNRVLDNP---LSEGRDAGSALTGRVKRSYRVQRMSEGKLPK 217

Query: 188 WWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
           WW   VS     V +KE YQR AN LI+AI+D++  G+DI   D+  LR IC+  GVR S
Sbjct: 218 WWSVQVSADRTLVVDKE-YQREANRLIRAIIDQRTHGQDIPEDDIYRLRRICRISGVRVS 276

Query: 8   I 6
           I
Sbjct: 277 I 277


>ref|XP_004161605.1| PREDICTED: uncharacterized LOC101216122 [Cucumis sativus]
          Length = 472

 Score = 99.4 bits (246), Expect = 1e-18
 Identities = 67/206 (32%), Positives = 92/206 (44%), Gaps = 16/206 (7%)
 Frame = -3

Query: 572 LNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSLFMM 393
           L QFL +G +A DDLRT++A D     + +SCRRSTVE            V   + L  +
Sbjct: 106 LRQFLSSGLDAFDDLRTLIAFDDQNRTLTVSCRRSTVEFVGQLVLLSFVVVFVVKFLVGI 165

Query: 392 RKNYGGEVM------VYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKD 231
               G +        V +RDRSLGG+ V VG R                          D
Sbjct: 166 VSRLGNKFSSGYTAPVMRRDRSLGGREVVVGTRRSVVARNKGMGKKNNLLGLLDSPVLAD 225

Query: 230 RM----------KTSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMG 81
            M          K      E LP+WWP  V +      N++EYQ  AN L++A++D +M 
Sbjct: 226 TMALNDVSSEISKNGVWGGERLPKWWPPAVPRRNATA-NRQEYQIEANRLVRALVDNRMS 284

Query: 80  GEDISRKDVVELRHICKTYGVRASIS 3
           G D    D+V+LR IC+  GV+ S +
Sbjct: 285 GRDFMEDDIVQLREICRISGVKVSFN 310


>ref|XP_004152289.1| PREDICTED: uncharacterized protein LOC101216122 [Cucumis sativus]
          Length = 472

 Score = 99.4 bits (246), Expect = 1e-18
 Identities = 67/206 (32%), Positives = 92/206 (44%), Gaps = 16/206 (7%)
 Frame = -3

Query: 572 LNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXVIAFRSLFMM 393
           L QFL +G +A DDLRT++A D     + +SCRRSTVE            V   + L  +
Sbjct: 106 LRQFLSSGLDAFDDLRTLIAFDDQNRTLTVSCRRSTVEFVGQLVLLSFVVVFVVKFLVGI 165

Query: 392 RKNYGGEVM------VYKRDRSLGGKVVAVGKREXXXXXXXXXXXXXXDNSEYYYQKKKD 231
               G +        V +RDRSLGG+ V VG R                          D
Sbjct: 166 VSRLGNKFSSGYTAPVMRRDRSLGGREVVVGTRRSVVARNKGMGKKNNLLGLLDSPVLAD 225

Query: 230 RM----------KTSSRRKEELPQWWPQVVSQGPVEVENKEEYQRLANHLIQAIMDRKMG 81
            M          K      E LP+WWP  V +      N++EYQ  AN L++A++D +M 
Sbjct: 226 TMALNDVSSEISKNGVWGGERLPKWWPPAVPRRNATA-NRQEYQIEANRLVRALVDNRMS 284

Query: 80  GEDISRKDVVELRHICKTYGVRASIS 3
           G D    D+V+LR IC+  GV+ S +
Sbjct: 285 GRDFMEDDIVQLREICRISGVKVSFN 310


>ref|NP_001078044.1| uncharacterized protein [Arabidopsis thaliana]
           gi|62320356|dbj|BAD94734.1| hypothetical protein
           [Arabidopsis thaliana] gi|330255140|gb|AEC10234.1|
           uncharacterized protein AT2G43235 [Arabidopsis thaliana]
          Length = 437

 Score = 97.8 bits (242), Expect = 4e-18
 Identities = 65/220 (29%), Positives = 110/220 (50%), Gaps = 11/220 (5%)
 Frame = -3

Query: 635 DDTLQITLKSPSTS-LQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVE 459
           D +L +TL     S L N   +L  FLD+ ++A  DL+T++++D +   VV+SC++ST++
Sbjct: 73  DQSLSLTLDVHRISTLANYRFQL--FLDSSKDAFSDLQTLISLDDNRR-VVVSCKKSTMQ 129

Query: 458 XXXXXXXXXXXXVIAFRSLFMMRKNYGGEVM-----VYKRDRSLGGKVVAVGKREXXXXX 294
                         A R L  +     G        V +RDRSLGGK V V         
Sbjct: 130 FVGGVVILGFVFGFAIRVLVKLGSALKGNFQSNPKFVVRRDRSLGGKEVVVS-------- 181

Query: 293 XXXXXXXXXDNSEYYYQKKKDRMKTSSRR-----KEELPQWWPQVVSQGPVEVENKEEYQ 129
                    D+  +    +  R  ++ R      +  LP+WWP  ++    +V +KE+YQ
Sbjct: 182 VDNIRSSSRDSKSFIASDQASRSNSTPRNLHLKAQNNLPKWWPTSLTSQSFDVVDKEDYQ 241

Query: 128 RLANHLIQAIMDRKMGGEDISRKDVVELRHICKTYGVRAS 9
           R AN +++AI+D +  G+DI+  D+++LR +C+  GV+ +
Sbjct: 242 REANRIVRAIVDNRTSGKDITDDDIIQLRRVCRISGVQVT 281


>gb|AHA84272.1| sugar porter [Phaseolus vulgaris]
          Length = 432

 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 17/215 (7%)
 Frame = -3

Query: 599 TSLQNLETKLNQFLDTGREALDDLRTIVAVDGSTGGVVISCRRSTVEXXXXXXXXXXXXV 420
           T L  L++ L +F+ +GR+A  DL T++ +D +   +V+SCR ST+              
Sbjct: 66  TPLTTLQSHLRRFILSGRDAYLDLETLLTLDHNRR-LVVSCRPSTLHFLGTSAALTLLTF 124

Query: 419 IAFRSLFMM--------RKNYGGEVMVYKRDRSLGGK--VVAVGKREXXXXXXXXXXXXX 270
             F  L  +        R       +V +RDRSLGGK  VVA G+R              
Sbjct: 125 SVFSVLARLISRFSSWRRNASNNRPLVVRRDRSLGGKEVVVAWGQRS------------- 171

Query: 269 XDNSEYYYQKKKDRMKTSSRRK-----EELPQWWPQVVS-QGPV-EVENKEEYQRLANHL 111
             NS       +  +K S++ K      +LP+WWP VV+  G V +  ++EEY+R A  +
Sbjct: 172 --NSNPLSPAVRGSVKRSAKNKVVRFERKLPEWWPTVVNANGSVFDANDQEEYKREAYRV 229

Query: 110 IQAIMDRKMGGEDISRKDVVELRHICKTYGVRASI 6
           ++AI + ++GG DI+ KD+++LR  C+T GV+ SI
Sbjct: 230 VRAITNSRLGGNDINEKDIIQLRQTCRTSGVQVSI 264

