BLASTX nr result

ID: Mentha25_contig00008557

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00008557
         (1051 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   209   1e-51
ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   205   3e-50
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   194   5e-47
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...   189   2e-45
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   187   5e-45
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   182   2e-43
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   182   2e-43
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   180   8e-43
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   139   2e-30
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   139   3e-30
ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757...   135   4e-29
ref|XP_002466179.1| hypothetical protein SORBIDRAFT_01g003040 [S...   132   3e-28
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       124   6e-26
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         113   1e-22
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   111   4e-22
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   109   2e-21
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   108   3e-21
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   107   6e-21
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   100   2e-18
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...    98   7e-18

>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
            lycopersicum]
          Length = 748

 Score =  209 bits (533), Expect = 1e-51
 Identities = 124/305 (40%), Positives = 163/305 (53%), Gaps = 1/305 (0%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE VA T +K DPAW HCE  K+G RV+LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1    MGSNLEPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TC +V PD+RL M + L G           LA E+  Y  + I  +++A         
Sbjct: 61   ASTCLRVQPDVRLLMQDSLNG-VVMKKRKKQKLAEEITTY--NAIDTSDIAAE------- 110

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
                             +TCG +++        ++ P+S+     S+ +LN         
Sbjct: 111  ---------------FTDTCGLNTQ-------VDLLPMSQAIEHTSSLFLN--------- 139

Query: 678  DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKV-SVVDMAIGRFFF 854
             R+ G              + +                P +N  K+V + V MA+ RF  
Sbjct: 140  -RDQGPNNRKKKSRIRKGASSS-------------NNLPIINQSKRVNNQVHMAVARFLL 185

Query: 855  DVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWG 1034
            D  +P DAVNS YFQPM+D IASQG  V  PSY+DLRSW+LK+SV EVR D++QC+S W 
Sbjct: 186  DARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSSVQEVRTDIDQCSSTWA 245

Query: 1035 RTGCS 1049
            RTGCS
Sbjct: 246  RTGCS 250


>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  205 bits (521), Expect = 3e-50
 Identities = 122/304 (40%), Positives = 156/304 (51%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE V  T +K DPAW HCE  K+G RV+LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1    MGSNLEPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TC +V PD+RL M + L G           LA E+  Y     T    A         
Sbjct: 61   ASTCLRVQPDVRLLMQDSLNG-VVMKKRKKQKLAEEITTYNAGTATSDIAAE-------- 111

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
                             +TCG D++        ++ P+ +     SN +LN     +  G
Sbjct: 112  ---------------FTDTCGLDTQ-------VDLLPMPQAIEHTSNLFLNRDQGPNNIG 149

Query: 678  DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
             R+                 K+        +       P   SK+  + V MA+ RF  D
Sbjct: 150  ARK----------------KKSRIRKGASSSNNNAMLLPINQSKRVNNHVHMAVARFLLD 193

Query: 858  VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
              +P DAVNS YFQPM+D IASQG  V  PSY++LRSW+LK SV EVR D++QC+S W R
Sbjct: 194  ARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELRSWVLKASVQEVRNDIDQCSSTWAR 253

Query: 1038 TGCS 1049
            +GCS
Sbjct: 254  SGCS 257


>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED
            zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  194 bits (493), Expect = 5e-47
 Identities = 122/304 (40%), Positives = 159/304 (52%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE +  T +K DPAW HC+  ++G RV+LKCIYCGK+F+GGGI+R KEHLAGQKGN
Sbjct: 1    MASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TC  V  D+RL M E L G              E+      KI   E  +N + ++ +
Sbjct: 61   ASTCFHVPSDVRLLMRESLDG-------------VEVKKRKKQKI--AEEMSNANQVSSE 105

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
                +  YD    +V  NT     EG   + P             S+  +N  G  + SG
Sbjct: 106  ----IDTYD---NQVDTNTGLLMIEGPDTLQP------------SSSLLVNREGTSNVSG 146

Query: 678  DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
            DR                V  T+                 L +K+  + V +AIGRF FD
Sbjct: 147  DRRKRGKGKSSAAESNALVVNTV----------------GLGAKRVNNHVHVAIGRFLFD 190

Query: 858  VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
            +G P DAVNS YFQPM+DAI S G+GV+ PS  DL+ WILK SV EV+ D ++ T+AW R
Sbjct: 191  IGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKSVEEVKSDNDKVTAAWVR 250

Query: 1038 TGCS 1049
            TGCS
Sbjct: 251  TGCS 254


>ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
            gi|550330253|gb|EEF02443.2| hypothetical protein
            POPTR_0010s20835g [Populus trichocarpa]
          Length = 608

 Score =  189 bits (480), Expect = 2e-45
 Identities = 118/304 (38%), Positives = 158/304 (51%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE +  T +K DPAW HC+  K+G RV+LKC+YCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1    MGSNLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             ATC +V  D+RL M++  +            +A E        IT     ++  G+  D
Sbjct: 61   AATCVQVPSDVRL-MMQQSLDGVVVKKRKKQKIAEE--------ITNLNPVSSEIGV-FD 110

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
            ++V+      +G E+   T   D             P+S + V   +      G+    G
Sbjct: 111  KDVN------TGMELTGVTDAID-------------PVSSLLVTGED------GMGKKGG 145

Query: 678  DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
            +R                   T+             G P    K+K   + MAIGRF +D
Sbjct: 146  ERRKRGRGRGRGSVTNAKAVVTMGS-----------GMPLSGGKRKNDHIHMAIGRFLYD 194

Query: 858  VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
            +G   DAVNSAYFQ M+ AIAS G+ VV PSY+DLR W+LKNSV EV+ DV++  + W R
Sbjct: 195  IGASLDAVNSAYFQLMVQAIASGGSEVVVPSYHDLRGWVLKNSVEEVKNDVDKHIATWER 254

Query: 1038 TGCS 1049
            TGCS
Sbjct: 255  TGCS 258


>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
            gi|508701288|gb|EOX93184.1| HAT transposon superfamily,
            putative [Theobroma cacao]
          Length = 750

 Score =  187 bits (476), Expect = 5e-45
 Identities = 111/304 (36%), Positives = 158/304 (51%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M+ NL  ++ T++KQDPAWNHCE  K+G R+++KC+YCGK+FKGGGI+RFKEHLAG+KG 
Sbjct: 1    MELNLTPISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQ 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
            G  C +V P +R  M E L G           +   +A  G S   G           +D
Sbjct: 61   GPICEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAG----------EID 110

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASG 677
            ++                   +  + N  V P  +  L+ +E        +S  +++  G
Sbjct: 111  KS------------------AYSDDVNNGVKPIQV--LNSLEP-------DSSLVLNGKG 143

Query: 678  DREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFFFD 857
            +   G+            +  +       L         AL S    + V MAIGRF +D
Sbjct: 144  EVSQGIRDSKKRGRDRSLLANSHSCAKSDL---------ALVSIGAENPVHMAIGRFLYD 194

Query: 858  VGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGR 1037
            +G+  DAVNS YFQPM+DAIAS G+G+V PS  DLR WILKN + EV+ D+++  + WG+
Sbjct: 195  IGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVMEEVKDDIDRNKTMWGK 254

Query: 1038 TGCS 1049
            TGCS
Sbjct: 255  TGCS 258


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
            max]
          Length = 729

 Score =  182 bits (463), Expect = 2e-43
 Identities = 112/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE V  T +K DPAW H +  K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1    MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TCS+V  D+RL M + L G               M+    + +  +   NN   ++V+
Sbjct: 61   ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVN 120

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
            + +     + +   V N       EG +     N+    ++   K+    Y NS G++  
Sbjct: 121  QGLQAIGVEHNSSLVVN-----PGEGMSR----NMERRKKMRATKNPAAVYANSEGVIAV 171

Query: 672  SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
              +                                       L  KK  + + MAIGRF 
Sbjct: 172  EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 193

Query: 852  FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
            +D+G P DAVNS YFQ M+DAIAS+G G   P +++LR WILKNSV EV+ D+++C   W
Sbjct: 194  YDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTW 253

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 254  GRTGCS 259


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
            max] gi|571489936|ref|XP_006591345.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X2 [Glycine
            max] gi|571489939|ref|XP_006591346.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X3 [Glycine
            max]
          Length = 759

 Score =  182 bits (463), Expect = 2e-43
 Identities = 112/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE V  T +K DPAW H +  K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1    MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TCS+V  D+RL M + L G               M+    + +  +   NN   ++V+
Sbjct: 61   ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVN 120

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
            + +     + +   V N       EG +     N+    ++   K+    Y NS G++  
Sbjct: 121  QGLQAIGVEHNSSLVVN-----PGEGMSR----NMERRKKMRATKNPAAVYANSEGVIAV 171

Query: 672  SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
              +                                       L  KK  + + MAIGRF 
Sbjct: 172  EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 193

Query: 852  FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
            +D+G P DAVNS YFQ M+DAIAS+G G   P +++LR WILKNSV EV+ D+++C   W
Sbjct: 194  YDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTW 253

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 254  GRTGCS 259


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
            max] gi|571542833|ref|XP_006601996.1| PREDICTED:
            uncharacterized protein LOC100806265 isoform X2 [Glycine
            max]
          Length = 758

 Score =  180 bits (457), Expect = 8e-43
 Identities = 111/306 (36%), Positives = 155/306 (50%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE V  T +K DPAW H +  K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1    MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TCS+V  D+RL M + L G           +  E+ +          + NN   ++V+
Sbjct: 61   ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQR-IEEEIMSVNPLTTVVNSLPNNNQVVDVN 119

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
            + +     + +   V N       EG +     N+    ++  AK+    Y NS  ++  
Sbjct: 120  QGLQAIGVEHNSTLVVN-----PGEGMSR----NMERRKKMRAAKNPAAVYANSEDVVAV 170

Query: 672  SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
              +                                       L  KK  + + MAIGRF 
Sbjct: 171  EKN--------------------------------------GLFPKKMDNHIYMAIGRFL 192

Query: 852  FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
            +D+G P DAVN  +FQ M+DAIAS+G G   PS+++LR WILKNSV EV+ D+++C   W
Sbjct: 193  YDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNSVEEVKNDIDRCKMTW 252

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 253  GRTGCS 258


>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036894|gb|ESW35424.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  139 bits (350), Expect = 2e-30
 Identities = 97/307 (31%), Positives = 148/307 (48%), Gaps = 2/307 (0%)
 Frame = +3

Query: 135  EMDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKG 314
            +M SNLE V  T +K DPAW H +  K+G +V+LKCIYC K+FKGGGI+R KEHLA QKG
Sbjct: 113  KMGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKG 172

Query: 315  NGATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNV 494
            N +TCS+V  D+RL M + L G           +  E+ +          + NN   ++V
Sbjct: 173  NASTCSRVPHDVRLHMQQSLDG-VVVKKRRKQKIEEEIMSVNPLTTVVNSLPNNNQ-VDV 230

Query: 495  DENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMD 668
            ++ +     D +   V N   G            N+    ++  +K+    Y NS G++ 
Sbjct: 231  NQGLQAIGVDHNSSLVVNPGEGMSK---------NMERRKKMRASKNPAAIYANSEGVVA 281

Query: 669  ASGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRF 848
                 ++G+            + + L         ++   + A+NS             +
Sbjct: 282  V---EKNGLFPKRVDNHIHMAIGRFL--------YDIGAPFDAVNSV------------Y 318

Query: 849  FFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSA 1028
            F ++    DA++S            +GAG   PS+++LR WILKNSV EV+ D+++C   
Sbjct: 319  FHEM---VDAISS------------RGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMT 363

Query: 1029 WGRTGCS 1049
            WGRTGCS
Sbjct: 364  WGRTGCS 370


>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036895|gb|ESW35425.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  139 bits (349), Expect = 3e-30
 Identities = 97/306 (31%), Positives = 147/306 (48%), Gaps = 2/306 (0%)
 Frame = +3

Query: 138  MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
            M SNLE V  T +K DPAW H +  K+G +V+LKCIYC K+FKGGGI+R KEHLA QKGN
Sbjct: 1    MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 60

Query: 318  GATCSKVHPDIRLQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVD 497
             +TCS+V  D+RL M + L G           +  E+ +          + NN   ++V+
Sbjct: 61   ASTCSRVPHDVRLHMQQSLDG-VVVKKRRKQKIEEEIMSVNPLTTVVNSLPNNNQ-VDVN 118

Query: 498  ENVHVPVYDISGFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNC--YLNSHGIMDA 671
            + +     D +   V N   G            N+    ++  +K+    Y NS G++  
Sbjct: 119  QGLQAIGVDHNSSLVVNPGEGMSK---------NMERRKKMRASKNPAAIYANSEGVVAV 169

Query: 672  SGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDMAIGRFF 851
                ++G+            + + L         ++   + A+NS             +F
Sbjct: 170  ---EKNGLFPKRVDNHIHMAIGRFL--------YDIGAPFDAVNSV------------YF 206

Query: 852  FDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAW 1031
             ++    DA++S            +GAG   PS+++LR WILKNSV EV+ D+++C   W
Sbjct: 207  HEM---VDAISS------------RGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMTW 251

Query: 1032 GRTGCS 1049
            GRTGCS
Sbjct: 252  GRTGCS 257


>ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757413 [Setaria italica]
          Length = 803

 Score =  135 bits (339), Expect = 4e-29
 Identities = 99/297 (33%), Positives = 134/297 (45%), Gaps = 5/297 (1%)
 Frame = +3

Query: 174  KKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATCSKVHPDIR 353
            +K DPAW HC  ++   RV LKC YCGK F GGGI+RFKEHLA + GN   C KV  D++
Sbjct: 28   QKHDPAWKHCLMVRAEGRVRLKCAYCGKHFLGGGIHRFKEHLARRPGNACCCPKVPRDVQ 87

Query: 354  LQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGTEVANNGSGLNVDENVH-VPVYDIS 530
              M+  L              A           T    A+  SG   D  +H +P+ ++ 
Sbjct: 88   DTMMRSLDAVAAKKMQRKLANALPPGDMRRFAPTDASPASAASGGATDSPIHMIPLNEVL 147

Query: 531  GFEVANNTCGFDSEGNAHVSPCNIPPLSEVEVAKSNCYLNSHGIMDASGDREDGMXXXXX 710
             FE        D +          PPL   E  + +        M ++            
Sbjct: 148  DFE----PVPLDEQR---------PPLP--ETMRGSVSSKKKRKMLSNASTPPLTPPTLQ 192

Query: 711  XXXXXXXVTKTLXXXXXXLNIEVPP----GYPALNSKKKVSVVDMAIGRFFFDVGLPADA 878
                    T  L      ++   P     G+  L+ K++VSV   A+GRF +DVG+P +A
Sbjct: 193  QHVPSTPQTNPLHQVVMAVDAVTPSSGHFGHAGLD-KEQVSV---AVGRFLYDVGVPLEA 248

Query: 879  VNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
            VNS YFQPML+AIAS G      SY+D R  ILK S+ +    +E    +W RTGCS
Sbjct: 249  VNSVYFQPMLEAIASAGGRPEALSYHDFRGHILKKSLDDATSRLEFFKGSWTRTGCS 305


>ref|XP_002466179.1| hypothetical protein SORBIDRAFT_01g003040 [Sorghum bicolor]
            gi|241920033|gb|EER93177.1| hypothetical protein
            SORBIDRAFT_01g003040 [Sorghum bicolor]
          Length = 747

 Score =  132 bits (331), Expect = 3e-28
 Identities = 96/312 (30%), Positives = 140/312 (44%), Gaps = 20/312 (6%)
 Frame = +3

Query: 174  KKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATCSKVHPDIR 353
            +K DPAW HC  ++   R+ LKC YCGK F GGGI+RFKEHLA + GN   C KV  D++
Sbjct: 37   QKHDPAWKHCVMVRSDGRLRLKCAYCGKHFLGGGIHRFKEHLARRPGNACCCPKVPRDVQ 96

Query: 354  LQMLEVLIGXXXXXXXXXXXLAAEMAAYGDSKITGT-------EVANNGSGLNVDENVHV 512
              ML  L             LAA ++     +   T         A+ G+G  +     +
Sbjct: 97   DTMLRSL--DAVAAKKMQRKLAASLSPGDMRRFAATSAPPASVSTASGGTGSPIH---MI 151

Query: 513  PVYDISGFE----------VANNTCGFDSEG---NAHVSPCNIPPLSEVEVAKSNCYLNS 653
            P+ ++  FE          V   +    S G      V+    PPL    + ++      
Sbjct: 152  PLNEVLDFEPVPLEEQRPLVPEGSMRGSSSGKKKRKQVTSATAPPL----IPQTR---PQ 204

Query: 654  HGIMDASGDREDGMXXXXXXXXXXXXVTKTLXXXXXXLNIEVPPGYPALNSKKKVSVVDM 833
            H +     +   G+             T  L      ++   P  Y    +  +   V +
Sbjct: 205  HVLATPQTNLLHGLQHVPPTPH-----TNPLHQVVMAVDAVTPAEYFEHAAPSEKEQVSV 259

Query: 834  AIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRYDVE 1013
            A+GRF +D G+P +AVNS YFQPML+AIA+ G      SY+D+R  +LK S+ +V   +E
Sbjct: 260  AVGRFLYDAGVPLEAVNSVYFQPMLEAIAAAGGRPDVLSYHDVRGHVLKRSLDDVMSHLE 319

Query: 1014 QCTSAWGRTGCS 1049
                +W RTGCS
Sbjct: 320  FFRGSWTRTGCS 331


>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score =  124 bits (311), Expect = 6e-26
 Identities = 57/75 (76%), Positives = 65/75 (86%)
 Frame = +3

Query: 825  VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1004
            V MA+GRFF DVGLPA+A NSAYFQPM++AIASQ AGV+GPSY DLRSWILKN VHE RY
Sbjct: 175  VHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDLRSWILKNLVHETRY 234

Query: 1005 DVEQCTSAWGRTGCS 1049
            DV+Q  +AW RTGC+
Sbjct: 235  DVDQYANAWERTGCT 249



 Score =  108 bits (271), Expect = 3e-21
 Identities = 46/81 (56%), Positives = 61/81 (75%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M+ ++E V  T +K DPAW HC+  K   ++ LKCIYCGK+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MEPHMELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
            +TC +V P+++ QML+ L G
Sbjct: 61  ASTCLRVLPEVKQQMLDSLNG 81


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  113 bits (283), Expect = 1e-22
 Identities = 51/93 (54%), Positives = 68/93 (73%)
 Frame = +3

Query: 771  IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 950
            I +P G   L+S +  + V MAIGRF +D+G   +AVNSAYFQPM+++IA  G G++ PS
Sbjct: 165  IVIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 951  YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
            Y+D+R WILKNSV EVR D ++C + WG TGCS
Sbjct: 225  YHDIRGWILKNSVEEVRGDFDRCKATWGMTGCS 257



 Score =  104 bits (260), Expect = 5e-20
 Identities = 47/81 (58%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S L+ V  T +K DPAW HC+  K+G RV+LKC+YC K+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
            +TC  V P+++  M E L G
Sbjct: 61  ASTCHSVPPEVQNIMQESLDG 81


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  111 bits (278), Expect = 4e-22
 Identities = 49/93 (52%), Positives = 68/93 (73%)
 Frame = +3

Query: 771  IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 950
            I +P G   L+S +  + V MA+GRF +D+G   +AVNSAYFQPM+++IA  G G++ PS
Sbjct: 165  IVIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 951  YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCS 1049
            Y+D+R WILKNS+ EVR D ++C + WG TGCS
Sbjct: 225  YHDIRGWILKNSMEEVRSDFDRCKATWGITGCS 257



 Score =  104 bits (260), Expect = 5e-20
 Identities = 47/81 (58%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S L+ V  T +K DPAW HC+  K+G RV+LKC+YC K+FKGGGI+R KEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
            +TC  V P+++  M E L G
Sbjct: 61  ASTCHSVPPEVQNIMQESLDG 81


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
            subsp. vesca]
          Length = 754

 Score =  109 bits (273), Expect = 2e-21
 Identities = 53/85 (62%), Positives = 65/85 (76%)
 Frame = +3

Query: 795  ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
            AL S+K  S V  AIGRF FD+G P +AVNSAYFQPM+DAIAS G G+  P+ +DLRSWI
Sbjct: 168  ALVSRKVNSYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWI 227

Query: 975  LKNSVHEVRYDVEQCTSAWGRTGCS 1049
            LKNSV E R ++++  + WGRTGCS
Sbjct: 228  LKNSVEEARNNIDKHRATWGRTGCS 252



 Score =  103 bits (258), Expect = 9e-20
 Identities = 45/77 (58%), Positives = 57/77 (74%)
 Frame = +3

Query: 150 LESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGNGATC 329
           +E V  T +K DPAW HC+  K G R++LKCIYC K+F+GGGI+R KEHLAGQKGN +TC
Sbjct: 1   MEPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTC 60

Query: 330 SKVHPDIRLQMLEVLIG 380
            +V PD+R  M + L G
Sbjct: 61  LRVPPDVRGLMQQSLDG 77


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
           gi|223536481|gb|EEF38128.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 753

 Score =  108 bits (271), Expect = 3e-21
 Identities = 49/82 (59%), Positives = 63/82 (76%), Gaps = 1/82 (1%)
 Frame = +3

Query: 138 MDSN-LESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKG 314
           MDS+ LE +  T +K DPAW HC+  K+G RV+LKC+YCGK+FKGGGI+R KEHLAGQKG
Sbjct: 1   MDSDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKG 60

Query: 315 NGATCSKVHPDIRLQMLEVLIG 380
           N +TC +V  D++L M + L G
Sbjct: 61  NASTCLQVPTDVKLIMQQSLDG 82



 Score =  104 bits (260), Expect = 5e-20
 Identities = 51/85 (60%), Positives = 62/85 (72%)
 Frame = +3

Query: 795  ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
            AL +K+    V MAIGRF +D+G P DAVNS YFQPM+DAIAS G  V  PS +DLR WI
Sbjct: 178  ALGAKRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWI 237

Query: 975  LKNSVHEVRYDVEQCTSAWGRTGCS 1049
            LKNSV EV+ +V++  + W RTGCS
Sbjct: 238  LKNSVEEVKTEVDKHMATWARTGCS 262


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  107 bits (268), Expect = 6e-21
 Identities = 48/85 (56%), Positives = 65/85 (76%)
 Frame = +3

Query: 795  ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 974
            +L++ +  + + MA+GRF +D+G P DAVNS YFQPM+DAIAS G     PSY+D+R WI
Sbjct: 170  SLDATRGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWI 229

Query: 975  LKNSVHEVRYDVEQCTSAWGRTGCS 1049
            LKNSV EV+ DV++ T+ WG+TGCS
Sbjct: 230  LKNSVEEVKNDVDRYTTTWGKTGCS 254



 Score =  102 bits (255), Expect = 2e-19
 Identities = 46/81 (56%), Positives = 60/81 (74%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           M S LE +  + +K DPAW HC+  K+G RV+LKC+YC K+F+GGGI+R KEHLA QKGN
Sbjct: 1   MASGLEPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGN 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
            +TCS+V  D+RL M + L G
Sbjct: 61  ASTCSRVPLDVRLAMQQSLDG 81


>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
           gi|79313325|ref|NP_001030742.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
           gi|11994740|dbj|BAB03069.1| transposase-like protein
           [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis
           thaliana] gi|332643084|gb|AEE76605.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
           gi|332643085|gb|AEE76606.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 45/81 (55%), Positives = 59/81 (72%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           MDS+LE VA T +KQD AW HCE  K G RV+++C+YC K+FKGGGI R KEHLAG+KG 
Sbjct: 1   MDSDLEPVALTPQKQDSAWKHCEVYKYGDRVQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           G  C +V  ++RL + + + G
Sbjct: 61  GTICDQVPDEVRLFLQQCIDG 81



 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 42/82 (51%), Positives = 56/82 (68%)
 Frame = +3

Query: 804  SKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKN 983
            SK++   V MA+GRF FD+G   DA NS   QP +DAI S G GV  P++ DLR WILK+
Sbjct: 183  SKEREKTVHMAMGRFLFDIGADFDAANSVNVQPFIDAIVSGGFGVSIPTHEDLRGWILKS 242

Query: 984  SVHEVRYDVEQCTSAWGRTGCS 1049
             V EV+ ++++C + W RTGCS
Sbjct: 243  CVEEVKKEIDECKTLWKRTGCS 264


>ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana]
           gi|240255844|ref|NP_193238.5| hAT transposon superfamily
           [Arabidopsis thaliana] gi|332658140|gb|AEE83540.1| hAT
           transposon superfamily [Arabidopsis thaliana]
           gi|332658141|gb|AEE83541.1| hAT transposon superfamily
           [Arabidopsis thaliana]
          Length = 768

 Score = 97.8 bits (242), Expect = 7e-18
 Identities = 44/81 (54%), Positives = 58/81 (71%)
 Frame = +3

Query: 138 MDSNLESVARTRKKQDPAWNHCEKLKDGARVELKCIYCGKVFKGGGIYRFKEHLAGQKGN 317
           MD+ LE VA T +KQD AW HCE  K G R++++C+YC K+FKGGGI R KEHLAG+KG 
Sbjct: 1   MDAELEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 318 GATCSKVHPDIRLQMLEVLIG 380
           G  C +V  D+RL + + + G
Sbjct: 61  GTICDQVPEDVRLFLQQCIDG 81



 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 43/81 (53%), Positives = 57/81 (70%)
 Frame = +3

Query: 807  KKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNS 986
            + + + + MAIGRF F +G   DAVNS  FQPM+DAIAS G GV  P++ DLR WILKN 
Sbjct: 188  RDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGVSAPTHDDLRGWILKNC 247

Query: 987  VHEVRYDVEQCTSAWGRTGCS 1049
            V E+  ++++C + W RTGCS
Sbjct: 248  VEEMAKEIDECKAMWKRTGCS 268

