BLASTX nr result

ID: Mentha22_contig00002488 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00002488
         (881 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22288.1| hypothetical protein MIMGU_mgv1a000316mg [Mimulus...   120   7e-25
ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247...   117   4e-24
ref|XP_006347374.1| PREDICTED: uncharacterized protein LOC102596...   110   7e-22
ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Popu...   110   9e-22
ref|XP_004242147.1| PREDICTED: uncharacterized protein LOC101249...   105   3e-20
ref|XP_007026080.1| Homeodomain-like superfamily protein, putati...   104   5e-20
ref|XP_007026078.1| Homeodomain-like superfamily protein, putati...   104   5e-20
ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citr...   102   1e-19
ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624...   102   2e-19
ref|XP_002518479.1| conserved hypothetical protein [Ricinus comm...    97   6e-18
gb|EXC05724.1| hypothetical protein L484_011305 [Morus notabilis]      95   4e-17
ref|XP_007026079.1| Homeodomain-like superfamily protein, putati...    94   7e-17
ref|XP_004295271.1| PREDICTED: uncharacterized protein LOC101297...    93   2e-16
ref|XP_007213734.1| hypothetical protein PRUPE_ppa000251mg [Prun...    88   5e-15
ref|XP_006594422.1| PREDICTED: uncharacterized protein LOC102661...    83   2e-13
ref|XP_007147729.1| hypothetical protein PHAVU_006G149800g [Phas...    82   2e-13
ref|XP_006597583.1| PREDICTED: uncharacterized protein LOC100794...    82   4e-13
gb|EPS74726.1| hypothetical protein M569_00028, partial [Genlise...    75   3e-11
ref|XP_004486161.1| PREDICTED: uncharacterized protein LOC101502...    72   2e-10
ref|XP_006383930.1| hypothetical protein POPTR_0004s01480g, part...    69   2e-09

>gb|EYU22288.1| hypothetical protein MIMGU_mgv1a000316mg [Mimulus guttatus]
          Length = 1264

 Score =  120 bits (301), Expect = 7e-25
 Identities = 98/249 (39%), Positives = 119/249 (47%), Gaps = 10/249 (4%)
 Frame = +1

Query: 4    GDSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXXG---------KQPQLSLSLFHNPRRIR 156
            GDS LQMHPLLFQ+PQ+                          +QP+LSL LFHNPR I+
Sbjct: 926  GDSVLQMHPLLFQSPQNASSIMPYYPVNSTTSTSSSFTFFSGKQQPKLSLGLFHNPRHIK 985

Query: 157  DAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAP 336
            DAVNFLS SSK P +  A++ GVDFHPLLQR+D+   D+ +A      PSIA S +    
Sbjct: 986  DAVNFLSMSSKTPPQENASSLGVDFHPLLQRSDD--IDTASA------PSIAESSR---- 1033

Query: 337  IQKHPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRN 516
                               S GTK +SL  K NELDLN   SFTS N + +ES N     
Sbjct: 1034 ----------------LERSSGTKVASLKGKVNELDLNFHPSFTS-NSKHSESPN----- 1071

Query: 517  TSRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNM-H 693
                               DSSK             NS +  +V SR +GSRK SD    
Sbjct: 1072 -------------------DSSK-------------NSGETRMVKSRTKGSRKCSDIAGS 1099

Query: 694  DESLPEIVM 720
            +ES+ EIVM
Sbjct: 1100 NESIQEIVM 1108


>ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247051 [Vitis vinifera]
          Length = 1514

 Score =  117 bits (294), Expect = 4e-24
 Identities = 90/253 (35%), Positives = 125/253 (49%), Gaps = 15/253 (5%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXX------GKQPQLSLSLFHNPRRIRDAVN 168
            +SDL MHPLLFQA +DG                     G Q Q++LSLFHNP +    VN
Sbjct: 1055 ESDLHMHPLLFQASEDGRLPYYPFNCSHGPSNSFSFFSGNQSQVNLSLFHNPHQANPKVN 1114

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLP-SIAASRQGCAPIQ- 342
               KS K   K +  + G+DFHPLLQR+D+   D + + P G+L   + + R   A +Q 
Sbjct: 1115 SFYKSLK--SKESTPSCGIDFHPLLQRSDDIDNDLVTSRPTGQLSFDLESFRGKRAQLQN 1172

Query: 343  KHPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTS 522
               +  T+P V+     S GTK S L    NELDL I LS TSK ++   S N  + N  
Sbjct: 1173 SFDAVLTEPRVNSAPPRS-GTKPSCLDGIENELDLEIHLSSTSKTEKVVGSTNVTENNQR 1231

Query: 523  RSLGAPIPG-VIESKNTKDSSKKRD------SAPDAICNELNSSDIPLVASRNRGSRKVS 681
            +S      G  +E++N+     ++       S+P  +  +L S    LV   N     + 
Sbjct: 1232 KSASTLNSGTAVEAQNSSSQYHQQSDHRPSVSSPLEVRGKLISGACALVLPSN----DIL 1287

Query: 682  DNMHDESLPEIVM 720
            DN+ D+SLPEIVM
Sbjct: 1288 DNIGDQSLPEIVM 1300


>ref|XP_006347374.1| PREDICTED: uncharacterized protein LOC102596887 [Solanum tuberosum]
          Length = 1436

 Score =  110 bits (275), Expect = 7e-22
 Identities = 83/247 (33%), Positives = 119/247 (48%), Gaps = 7/247 (2%)
 Frame = +1

Query: 1    RGDSDLQMHPLLFQAPQDG------HXXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDA 162
            + +S L+MHPLLF+AP+DG                     G QP  +LSLFH+PR+    
Sbjct: 1013 KDESGLRMHPLLFRAPEDGPLPYNQSNSSFSTSSSFNFFSGCQP--NLSLFHHPRQSAHT 1070

Query: 163  VNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGAD-SLAAHPNGKLPSIAASRQGCAPI 339
            VNFL KSS P +K  + +SG DFHPLLQRTD+   D  +A+       +   SR  C  +
Sbjct: 1071 VNFLDKSSNPGDK-TSISSGFDFHPLLQRTDDANCDLEVASAVTRPSCTSETSRGWCTQV 1129

Query: 340  QKHPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNT 519
            Q    S++  +   I S+ MG        K NE+DL + LSFTS  Q+   SR  A R  
Sbjct: 1130 QNAVDSSSNVAC-SIPSSPMG--------KSNEVDLEMHLSFTSSKQKAIGSRGVADRFM 1180

Query: 520  SRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDE 699
             RS          +  ++D +   +  P+      +S     + S +  +    D++ D+
Sbjct: 1181 GRS---------PTSASRDQNPLNNGTPNRTTQHSDSGATARILSSDEETGNGVDDLEDQ 1231

Query: 700  SLPEIVM 720
            SL EIVM
Sbjct: 1232 SLVEIVM 1238


>ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Populus trichocarpa]
            gi|550312453|gb|ERP48538.1| hypothetical protein
            POPTR_0021s00740g [Populus trichocarpa]
          Length = 1441

 Score =  110 bits (274), Expect = 9e-22
 Identities = 87/247 (35%), Positives = 118/247 (47%), Gaps = 9/247 (3%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXX------GKQPQLSLSLFHNPRRIRDAVN 168
            DSDLQMHPLLFQAP+ G                     G QPQL+LSLFHNP +    V+
Sbjct: 979  DSDLQMHPLLFQAPEGGCLPYLPLSCSSGTSSSFSFFSGNQPQLNLSLFHNPLQANHVVD 1038

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQG-CAPIQK 345
              +KSSK  +  +A+ S +DFHPLLQRTD E  + + A  N   P+      G  A  Q 
Sbjct: 1039 GFNKSSKSKDSTSASCS-IDFHPLLQRTDEENNNLVMACSN---PNQFVCLSGESAQFQN 1094

Query: 346  HPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSR 525
            H  +    S       ++  K SS + K N+LDL+I LS  S  +    SR+    N  R
Sbjct: 1095 HFGAVQNKSFVNNIPIAVDPKHSSSNEKANDLDLDIHLSSNSAKEVSERSRDVGANNQPR 1154

Query: 526  S-LGAPIPG-VIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDE 699
            S    P  G  +E+        + +  P    N ++ +D   V S N  +  + D + D+
Sbjct: 1155 STTSEPKSGRRMETCKINSPRDQHNEHPTVHSNLVSGADASPVQSNNVSTCNM-DVVGDQ 1213

Query: 700  SLPEIVM 720
            S PEIVM
Sbjct: 1214 SHPEIVM 1220


>ref|XP_004242147.1| PREDICTED: uncharacterized protein LOC101249932 [Solanum
            lycopersicum]
          Length = 1418

 Score =  105 bits (261), Expect = 3e-20
 Identities = 81/247 (32%), Positives = 116/247 (46%), Gaps = 7/247 (2%)
 Frame = +1

Query: 1    RGDSDLQMHPLLFQAPQDG------HXXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDA 162
            + +S L+MHPLLF+AP+DG                     G QP  +LSLFH+P +    
Sbjct: 995  KDESGLRMHPLLFRAPEDGPFPHYQSNSSFSTSSSFNFFSGCQP--NLSLFHHPHQSAHT 1052

Query: 163  VNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGAD-SLAAHPNGKLPSIAASRQGCAPI 339
            VNFL KSS P +K  + +SG DFHPLLQR D+   D  +A+       +   SR  C  +
Sbjct: 1053 VNFLDKSSNPGDK-TSMSSGFDFHPLLQRIDDANCDLEVASTVTRPSCTSETSRGWCTQV 1111

Query: 340  QKHPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNT 519
            Q    S++  +   I S+ MG        K NELDL + LSFT   Q+   SR  A R  
Sbjct: 1112 QNAVDSSSNVAC-AIPSSPMG--------KSNELDLEMHLSFTCSKQKAIGSRGVADRFM 1162

Query: 520  SRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDE 699
             RS          +  ++D +   +  P+      +S     + S +  +    D++ D+
Sbjct: 1163 ERS---------PTSASRDQNPLNNGTPNRTTQHSDSGATARILSSDEETGNGVDDLEDQ 1213

Query: 700  SLPEIVM 720
            SL EIVM
Sbjct: 1214 SLIEIVM 1220


>ref|XP_007026080.1| Homeodomain-like superfamily protein, putative isoform 3 [Theobroma
            cacao] gi|508781446|gb|EOY28702.1| Homeodomain-like
            superfamily protein, putative isoform 3 [Theobroma cacao]
          Length = 1402

 Score =  104 bits (259), Expect = 5e-20
 Identities = 78/244 (31%), Positives = 126/244 (51%), Gaps = 7/244 (2%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQAP+DG                     G QPQL+LSLF+NP++   +V  
Sbjct: 969  TDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQTNHSVES 1028

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
            L++S K  + + + + G+DFHPLLQRTD+  ++ +       L S+    +  AP   +P
Sbjct: 1029 LTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSELVTECSTASL-SVNLDGKSVAPC--NP 1084

Query: 352  SSTTK-PSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
            S+  +  SV   S  +  ++ SS + K NELDL I LS  S  +  A S +AA  + + +
Sbjct: 1085 SNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKNSA 1144

Query: 529  LGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLP 708
            +      ++ S+N  ++     S+ +   +   +S IP     ++ + +  D+  D+S  
Sbjct: 1145 V-----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP-----SKTTGRYMDDTSDQSHL 1194

Query: 709  EIVM 720
            EIVM
Sbjct: 1195 EIVM 1198


>ref|XP_007026078.1| Homeodomain-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508781444|gb|EOY28700.1| Homeodomain-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 1463

 Score =  104 bits (259), Expect = 5e-20
 Identities = 78/244 (31%), Positives = 126/244 (51%), Gaps = 7/244 (2%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQAP+DG                     G QPQL+LSLF+NP++   +V  
Sbjct: 1030 TDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQTNHSVES 1089

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
            L++S K  + + + + G+DFHPLLQRTD+  ++ +       L S+    +  AP   +P
Sbjct: 1090 LTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSELVTECSTASL-SVNLDGKSVAPC--NP 1145

Query: 352  SSTTK-PSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
            S+  +  SV   S  +  ++ SS + K NELDL I LS  S  +  A S +AA  + + +
Sbjct: 1146 SNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKNSA 1205

Query: 529  LGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLP 708
            +      ++ S+N  ++     S+ +   +   +S IP     ++ + +  D+  D+S  
Sbjct: 1206 V-----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP-----SKTTGRYMDDTSDQSHL 1255

Query: 709  EIVM 720
            EIVM
Sbjct: 1256 EIVM 1259


>ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citrus clementina]
            gi|557530393|gb|ESR41576.1| hypothetical protein
            CICLE_v10010907mg [Citrus clementina]
          Length = 1424

 Score =  102 bits (255), Expect = 1e-19
 Identities = 79/247 (31%), Positives = 121/247 (48%), Gaps = 9/247 (3%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXX------GKQPQLSLSLFHNPRRIRDAVN 168
            + DLQMHPLLFQAP+DGH                    G QPQL+LSLFHNPR++  A++
Sbjct: 997  EPDLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNPRQLSHALS 1056

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKH 348
              +KS K  E + + +  +DFHPLL+RT+    ++L   P+    S+ + R+        
Sbjct: 1057 CFNKSLKTKE-STSGSCVIDFHPLLKRTE-VANNNLVTTPSNARISVGSERKSDQHKNPF 1114

Query: 349  PSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
             +  +K SV     A+  +  SS++ K NELDL I LS +S  +    +R  A  N  +S
Sbjct: 1115 DALQSKTSVSNGPFAA-NSVPSSINEKSNELDLEIHLSSSSAKERALGNREMAPHNLMQS 1173

Query: 529  LGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVS---DNMHDE 699
            +       + +   K  ++  D+      +     +   VAS    S + +   D++ D 
Sbjct: 1174 M------TVANSGDKTVTQNNDN-----LHYQYGENYSQVASNGHFSVQTTGNIDDIGDH 1222

Query: 700  SLPEIVM 720
            S PEIVM
Sbjct: 1223 SHPEIVM 1229


>ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624036 isoform X1 [Citrus
            sinensis] gi|568853408|ref|XP_006480351.1| PREDICTED:
            uncharacterized protein LOC102624036 isoform X2 [Citrus
            sinensis]
          Length = 1424

 Score =  102 bits (254), Expect = 2e-19
 Identities = 79/245 (32%), Positives = 120/245 (48%), Gaps = 9/245 (3%)
 Frame = +1

Query: 13   DLQMHPLLFQAPQDGHXXXXXXXXXXXXXX------GKQPQLSLSLFHNPRRIRDAVNFL 174
            DLQMHPLLFQAP+DGH                    G QPQL+LSLFHNPR++  A++  
Sbjct: 999  DLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNPRQLSHALSCF 1058

Query: 175  SKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHPS 354
            +KS K  E + + +  +DFHPLL+RT+    ++L   P+    S+ + R+         +
Sbjct: 1059 NKSLKTKE-STSGSCVIDFHPLLKRTE-VANNNLVTTPSNARISVGSERKSDQHKNPFDA 1116

Query: 355  STTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSLG 534
              +K SV     A+  +  SS++ K NELDL I LS +S  +    +R  A  N  +S+ 
Sbjct: 1117 LQSKTSVSNGPFAA-NSVPSSINEKSNELDLEIHLSSSSAKERALGNREMAPHNLMQSM- 1174

Query: 535  APIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVS---DNMHDESL 705
                  + +   K  ++  D+      +     +   VAS    S + +   D++ D S 
Sbjct: 1175 -----TVANSGDKTVTQNNDN-----LHYQYGENYSQVASNGHFSVQTTGNIDDIGDHSH 1224

Query: 706  PEIVM 720
            PEIVM
Sbjct: 1225 PEIVM 1229


>ref|XP_002518479.1| conserved hypothetical protein [Ricinus communis]
            gi|223542324|gb|EEF43866.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1399

 Score = 97.4 bits (241), Expect = 6e-18
 Identities = 81/259 (31%), Positives = 110/259 (42%), Gaps = 21/259 (8%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVN 168
            +SDLQMHPLLFQ+P+DG                       QPQL+LSLFH+ R     V+
Sbjct: 972  ESDLQMHPLLFQSPEDGRLSYYPLSCSTGASSSFTFFSANQPQLNLSLFHSSRPANHTVD 1031

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGAD---------------SLAAHPNGKLP 303
              +KSSK  E + +A+ G+DFHPLLQR + E  D                 +A P   L 
Sbjct: 1032 CFNKSSKTGE-STSASCGIDFHPLLQRAEEENIDFATSCSIAHQYVCLGGKSAQPQNPLG 1090

Query: 304  SIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQE 483
            ++    Q  +P+   PS+T             G+K  S   K NELDL I LS  S  ++
Sbjct: 1091 AV----QTKSPVNSGPSTT-------------GSKPPSSIEKANELDLEIHLSSMSAVEK 1133

Query: 484  GAESRNAAQRNTSRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNR 663
               SR+    N       P      S NT D  K  D+               +    N 
Sbjct: 1134 TRGSRDVGASNQLE----PSTSAPNSGNTIDKDKSADA---------------IAVQSNN 1174

Query: 664  GSRKVSDNMHDESLPEIVM 720
             +R   ++  D++ PEIVM
Sbjct: 1175 DARCDMEDKGDQAPPEIVM 1193


>gb|EXC05724.1| hypothetical protein L484_011305 [Morus notabilis]
          Length = 1423

 Score = 94.7 bits (234), Expect = 4e-17
 Identities = 81/248 (32%), Positives = 118/248 (47%), Gaps = 10/248 (4%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVN 168
            DSDLQMHPLLFQAP+DG                     G QPQL LSL HNPR+  + V 
Sbjct: 1003 DSDLQMHPLLFQAPEDGRLPYYPLNCSPSNSSSFSFFSGNQPQLHLSLLHNPRQ-ENLVG 1061

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKH 348
              +KS +  + + +++ G+DFHPLLQRTD         + +G L  +    Q  + +   
Sbjct: 1062 SFTKSLQLKD-STSSSYGIDFHPLLQRTD---------YVHGDLIDV----QTESLVNAD 1107

Query: 349  PSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
            P +T+K                    K NELDL I +S  S+ +EG+ +RN    N  RS
Sbjct: 1108 PHTTSK-----------------FVEKANELDLEIHISSASR-KEGSWNRNETAHNPVRS 1149

Query: 529  LGAPIPGVIESKNTKDSSKK----RDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHD 696
                 P    +  T++S++      +S+P  I   ++     ++   N G  +  D+M D
Sbjct: 1150 -ATNAPNSEFTSKTQNSNRSLYLHNESSPSNISRPVSGGHSSVLPGDNIG--RYVDDMGD 1206

Query: 697  ESLPEIVM 720
            +S PEIVM
Sbjct: 1207 QSHPEIVM 1214


>ref|XP_007026079.1| Homeodomain-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508781445|gb|EOY28701.1| Homeodomain-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 1374

 Score = 94.0 bits (232), Expect = 7e-17
 Identities = 72/243 (29%), Positives = 116/243 (47%), Gaps = 6/243 (2%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQAP+DG                     G QPQL+LSLF+NP++   +V  
Sbjct: 969  TDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQTNHSVES 1028

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
            L++S K  + + + + G+DFHPLLQRTD+  ++            +  S   C+P     
Sbjct: 1029 LTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSE------------LMKSVAQCSPF---- 1071

Query: 352  SSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSL 531
                          +  ++ SS + K NELDL I LS  S  +  A S +AA  + + ++
Sbjct: 1072 --------------ATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAATHHKNSAV 1117

Query: 532  GAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLPE 711
                  ++ S+N  ++     S+ +   +   +S IP     ++ + +  D+  D+S  E
Sbjct: 1118 -----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP-----SKTTGRYMDDTSDQSHLE 1167

Query: 712  IVM 720
            IVM
Sbjct: 1168 IVM 1170


>ref|XP_004295271.1| PREDICTED: uncharacterized protein LOC101297625 [Fragaria vesca
            subsp. vesca]
          Length = 1378

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 79/246 (32%), Positives = 112/246 (45%), Gaps = 9/246 (3%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            SDLQMHPLLFQ P+DG                     G QPQL L+L H+P +     N 
Sbjct: 974  SDLQMHPLLFQPPEDGRLPYYPLNCSTSNSGSYSFLSGNQPQLHLTLLHDPHQ----ENQ 1029

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
            +    +  +++   + G+DFHPL+QRT+N   +S+A       P    SR       +HP
Sbjct: 1030 VDGPVRTLKESNVISRGIDFHPLMQRTEN--VNSVAVTKCSTAPLAVGSR------VQHP 1081

Query: 352  SSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSL 531
            S + +  V   + A       S    G ELDL I LS TS+ ++  +SR  +  N  +S 
Sbjct: 1082 SKSFQTEVPEATGAK-----PSPDEGGIELDLEIHLSSTSRKEKTLKSREVSHHNLVKSR 1136

Query: 532  GAPIPG---VIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDES 702
             AP  G   + +S N+       +S+  A  ++  S    LV   N  SR   D M D S
Sbjct: 1137 TAPGTGTTMIAQSVNSPIYIHAENSS--ASSSKFVSGSNTLVIPSNNMSRYNPDEMGDPS 1194

Query: 703  LPEIVM 720
             P+I M
Sbjct: 1195 QPDIEM 1200


>ref|XP_007213734.1| hypothetical protein PRUPE_ppa000251mg [Prunus persica]
            gi|462409599|gb|EMJ14933.1| hypothetical protein
            PRUPE_ppa000251mg [Prunus persica]
          Length = 1395

 Score = 87.8 bits (216), Expect = 5e-15
 Identities = 77/246 (31%), Positives = 111/246 (45%), Gaps = 8/246 (3%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVN 168
            DSDL MHPLLFQAP+DG                       QPQL+LSLFHNP +    V+
Sbjct: 1005 DSDLHMHPLLFQAPEDGRLPYYPLNCSNRNSSTFSFLSANQPQLNLSLFHNPHQ-GSHVD 1063

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKH 348
               KS K     + A   +DFHPL+QRTD              + S+  +    AP+   
Sbjct: 1064 CFDKSLKTSNSTSRA---IDFHPLMQRTD-------------YVSSVPVTTCSTAPLS-- 1105

Query: 349  PSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
             +++  P +      ++GT     + K NELDL I LS TS+ +   + R+    N+ +S
Sbjct: 1106 -NTSQTPLLGNTDPQALGT-----NEKANELDLEIHLSSTSEKENFLKRRDVGVHNSVKS 1159

Query: 529  -LGAPIPGVIESKNTKDSSKKRDSA-PDAICNELNSSDIPLVASRNRGSRKVSDNMHDES 702
               AP  G I      + S  + +       +E  S  + LV   N  SR  +D+  ++S
Sbjct: 1160 RTTAPDSGTIMITQCANGSLYQHAENSSGSGSEPVSGGLTLVIPSNILSRYNADDTGEQS 1219

Query: 703  LPEIVM 720
             P+I M
Sbjct: 1220 QPDIEM 1225


>ref|XP_006594422.1| PREDICTED: uncharacterized protein LOC102661544 isoform X1 [Glycine
            max] gi|571499167|ref|XP_006594423.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X2 [Glycine
            max] gi|571499169|ref|XP_006594424.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X3 [Glycine
            max] gi|571499171|ref|XP_006594425.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X4 [Glycine
            max]
          Length = 1406

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 81/252 (32%), Positives = 115/252 (45%), Gaps = 15/252 (5%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQ  +DG+                    G QPQL+LSLFH+ ++ +  ++ 
Sbjct: 993  TDLQMHPLLFQVTEDGNAPYCPLKFSSGTSSSFSFFSGSQPQLNLSLFHSSQQ-QSHIDC 1051

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
             +KS K  + +   + G+DFHPLLQ++D+                   S      IQ  P
Sbjct: 1052 ANKSLKSKD-STLRSGGIDFHPLLQKSDD-----------------TQSPTSFDAIQ--P 1091

Query: 352  SSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSL 531
             S     V  I++ S G     L+ K NELDL I LS  S  ++  +SR   Q      +
Sbjct: 1092 ESLVNSGVQAIANRSSG-----LNDKSNELDLEIHLSSVSGREKSVKSR---QLKAHDPV 1143

Query: 532  GAPIPGVIESKNTKDSSKKRDSAP---------DAICNELNSSDIPLVASRNRGSRKVSD 684
            G+     I   + K    + D+AP          A   EL SS  PLV S +  +R   D
Sbjct: 1144 GSKKTVAISGTSMK---PQEDTAPYCQHGVENLSAGSCELASS-APLVVSSDNITRYDVD 1199

Query: 685  NMHDESLPEIVM 720
            ++ D+S PEIVM
Sbjct: 1200 DIGDQSHPEIVM 1211


>ref|XP_007147729.1| hypothetical protein PHAVU_006G149800g [Phaseolus vulgaris]
            gi|561020952|gb|ESW19723.1| hypothetical protein
            PHAVU_006G149800g [Phaseolus vulgaris]
          Length = 771

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 76/246 (30%), Positives = 108/246 (43%), Gaps = 9/246 (3%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQ  +DG+                    G QPQL+LSLFH+ ++ +  ++ 
Sbjct: 355  TDLQMHPLLFQVTEDGNVPYYPLKLSSGTSSSFSFFSGSQPQLNLSLFHSSQQ-QSHIDC 413

Query: 172  LSKSSKPPEKNAAATS-GVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKH 348
             +KS K   KN+   S G+DFHPLLQ++D+      A  PN                   
Sbjct: 414  ANKSLK--SKNSILRSGGIDFHPLLQKSDD------AQSPNFD--------------SNQ 451

Query: 349  PSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRS 528
            P S     V  I++ S G      + K NELDL I LS  S  +   +SR    R+ + S
Sbjct: 452  PESLGTSGVSAIANRSSGP-----NDKSNELDLEIHLSSVSGRERSVKSRQPKARDPAGS 506

Query: 529  LGAPIPGVIESKNTKDSSKKRDSAPDAICNELN--SSDIPLVASRNRGSRKVSDNMHDES 702
                    I  +  +DS        + +       +S  PLV   +  +R   D + D+S
Sbjct: 507  KKTVAISRISREPQEDSVPHCQQGGENVSASSRGPASSDPLVVPNDNIARYDVDEIGDQS 566

Query: 703  LPEIVM 720
             PEIVM
Sbjct: 567  HPEIVM 572


>ref|XP_006597583.1| PREDICTED: uncharacterized protein LOC100794351 isoform X1 [Glycine
            max] gi|571517713|ref|XP_006597584.1| PREDICTED:
            uncharacterized protein LOC100794351 isoform X2 [Glycine
            max]
          Length = 1403

 Score = 81.6 bits (200), Expect = 4e-13
 Identities = 82/252 (32%), Positives = 113/252 (44%), Gaps = 15/252 (5%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            SDLQMHPLLFQ  +DG+                    G QPQL+LSLFH+ ++ +  ++ 
Sbjct: 990  SDLQMHPLLFQVTEDGNVPYYPLKFSSGTSSSFSFFSGSQPQLNLSLFHSSQQ-QSHIDC 1048

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
             +KS K  + +   + G+DFHPLLQ++D+                   S      IQ  P
Sbjct: 1049 ANKSLKLKD-STLRSGGIDFHPLLQKSDD-----------------TQSPTSFDAIQ--P 1088

Query: 352  SSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSL 531
             S     V  I+S S G     L+ K NELDL I LS  S  ++  +SR   Q      +
Sbjct: 1089 ESLVNSGVQAIASRSSG-----LNDKSNELDLEIHLSSVSGREKSVKSR---QLKAHDPV 1140

Query: 532  GAPIPGVIESKNTKDSSKKRDSAP---------DAICNELNSSDIPLVASRNRGSRKVSD 684
            G+     I     K    + D+AP          A   EL SS  PLV   +  +R   D
Sbjct: 1141 GSKKTVAISGTAMK---PQEDTAPYCQQGVENLSAGSCELASS-APLVVPNDNITRYDVD 1196

Query: 685  NMHDESLPEIVM 720
            ++ D+S PEIVM
Sbjct: 1197 DIGDQSHPEIVM 1208


>gb|EPS74726.1| hypothetical protein M569_00028, partial [Genlisea aurea]
          Length = 1049

 Score = 75.1 bits (183), Expect = 3e-11
 Identities = 67/225 (29%), Positives = 97/225 (43%), Gaps = 14/225 (6%)
 Frame = +1

Query: 4    GDSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRD-AVNFLSK 180
            GD DL+MHPL F++PQD H               +   LSLSLFH+PR ++D A++FL+ 
Sbjct: 870  GDRDLEMHPLFFRSPQDAHWPYYP----------QNSGLSLSLFHHPRHLQDPAMSFLNH 919

Query: 181  SSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHPSST 360
               PP      +SGV FHPLLQ   N+  ++  A     +P+ A                
Sbjct: 920  GKCPP------SSGVVFHPLLQ--SNKAVETGTAR---AVPTTA---------------- 952

Query: 361  TKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGA-------------ESRN 501
                           K +S S KGNELDL+I LS   +N+E               ++  
Sbjct: 953  ---------------KTASRSSKGNELDLDIHLSVLPENRESTLQKPVAAAVAGRDDNNE 997

Query: 502  AAQRNTSRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSD 636
            AA R  + +   P   V+E +   DS  +     +  C E+  S+
Sbjct: 998  AASREMNDATSFP-DIVMEQEELSDSEDEYGENVEFECEEMADSE 1041


>ref|XP_004486161.1| PREDICTED: uncharacterized protein LOC101502269 isoform X1 [Cicer
            arietinum] gi|502079123|ref|XP_004486162.1| PREDICTED:
            uncharacterized protein LOC101502269 isoform X2 [Cicer
            arietinum]
          Length = 1417

 Score = 72.4 bits (176), Expect = 2e-10
 Identities = 71/243 (29%), Positives = 109/243 (44%), Gaps = 6/243 (2%)
 Frame = +1

Query: 10   SDLQMHPLLFQAPQDGH------XXXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNF 171
            +DLQMHPLLFQ  ++G                     G+QPQL+LSLF +  + +  ++ 
Sbjct: 979  ADLQMHPLLFQVTEEGQTPYYPFKFSSGPSSSFSFFSGRQPQLNLSLFSSSLQ-QGHIDR 1037

Query: 172  LSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHP 351
             +KS K  + ++    G+DFHPLLQ++++  A S                 G   IQ   
Sbjct: 1038 ANKSLK-SKNSSLRLGGIDFHPLLQKSNDTQAQS-----------------GSDDIQ--- 1076

Query: 352  SSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRNAAQRNTSRSL 531
                +  V+         ++S L+ K NELDL+I L   S+  +  +SR   + +   S 
Sbjct: 1077 ---AESLVNNSGVPDTTDRSSGLNDKSNELDLDIHLCSVSEGDKSMKSRQLKEHDPIASC 1133

Query: 532  GAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLPE 711
               I         ++ S  R       C EL S+D PLVA  +  +R   D++ D+S P 
Sbjct: 1134 ETAINAPYCQHGGRNPSPSR-------C-ELASND-PLVAPEDNITRYDVDDVGDQSHPG 1184

Query: 712  IVM 720
            IVM
Sbjct: 1185 IVM 1187


>ref|XP_006383930.1| hypothetical protein POPTR_0004s01480g, partial [Populus trichocarpa]
            gi|550340089|gb|ERP61727.1| hypothetical protein
            POPTR_0004s01480g, partial [Populus trichocarpa]
          Length = 969

 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 54/171 (31%), Positives = 72/171 (42%), Gaps = 6/171 (3%)
 Frame = +1

Query: 7    DSDLQMHPLLFQAPQDGHXXXXXXXXXXXXXX------GKQPQLSLSLFHNPRRIRDAVN 168
            DS+LQMHPLLFQA + G                     G QPQL+LSLFH   +    V+
Sbjct: 691  DSNLQMHPLLFQASESGRLSYLPLSCNIGASSTFSFFSGHQPQLNLSLFHYHHQANHVVD 750

Query: 169  FLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKH 348
              +KS    +  +A+ S +DFHPLLQRTD E ++   +  N              P+   
Sbjct: 751  SFNKSLTSKDSTSASCS-IDFHPLLQRTDEENSNLNKSFVNH------------GPVVVD 797

Query: 349  PSSTTKPSVDGISSASMGTKASSLSRKGNELDLNIQLSFTSKNQEGAESRN 501
            P                  K SS + K N+LD  I LS  S  +     R+
Sbjct: 798  P------------------KQSSSNEKANDLDSEIHLSSNSAKETSERGRD 830

