BLASTX nr result

ID: Catharanthus23_contig00016208 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00016208
         (720 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   221   2e-55
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   212   8e-53
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...   190   3e-46
gb|EOY18075.1| HAT and BED zinc finger domain-containing protein...   189   6e-46
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   186   5e-45
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         183   4e-44
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   179   1e-42
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       178   1e-42
gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus...   174   3e-41
gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus...   174   3e-41
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   168   1e-39
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   164   3e-38
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   164   3e-38
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   164   3e-38
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   163   5e-38
gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus...   144   2e-32
gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma c...   134   4e-29
ref|NP_188861.2| hAT dimerization domain-containing protein [Ara...   121   3e-25
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...   119   1e-24
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   119   1e-24

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  221 bits (563), Expect = 2e-55
 Identities = 101/144 (70%), Positives = 124/144 (86%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VPVTSQKHDPAWKHC+M+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+CLRVQPDVRLLMQ+SLNGVVM+KRKKQKLA+EI  YN G+ +        + C L+T
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAGTATSDIAAEFTDTCGLDT 120

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           +V+LLP P A++H ++LF++ ++G
Sbjct: 121 QVDLLPMPQAIEHTSNLFLNRDQG 144


>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
           lycopersicum]
          Length = 748

 Score =  212 bits (540), Expect = 8e-53
 Identities = 100/144 (69%), Positives = 121/144 (84%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE V VTSQKHDPAWKHC+M+KNG+R+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+CLRVQPDVRLLMQ+SLNGVVM+KRKKQKLA+EI  YN    S+I      + C LNT
Sbjct: 61  ASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAIDTSDI-AAEFTDTCGLNT 119

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
           +V+LLP   A++H + LF++ ++G
Sbjct: 120 QVDLLPMSQAIEHTSSLFLNRDQG 143


>ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa]
           gi|550330253|gb|EEF02443.2| hypothetical protein
           POPTR_0010s20835g [Populus trichocarpa]
          Length = 608

 Score =  190 bits (483), Expect = 3e-46
 Identities = 93/146 (63%), Positives = 116/146 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE +P+TSQKHDPAWKHCQM+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MGSNLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           A++C++V  DVRL+MQ+SL+GVV++KRKKQK+A+EI N N  S SEI         D+NT
Sbjct: 61  AATCVQVPSDVRLMMQQSLDGVVVKKRKKQKIAEEITNLNPVS-SEIGVFDK----DVNT 115

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGLG 81
            + L    DA+D  + L V  E+G+G
Sbjct: 116 GMELTGVTDAIDPVSSLLVTGEDGMG 141


>gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative
           [Theobroma cacao]
          Length = 749

 Score =  189 bits (481), Expect = 6e-46
 Identities = 92/142 (64%), Positives = 115/142 (80%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           MAS+LE +P+TSQKHDPAWKHCQM++NGER+QLKC+YCGKIF+GGGIHRIKEHLAGQKGN
Sbjct: 1   MASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C  V  DVRLLM+ESL+GV ++KRKKQK+A+E+ N N  S SEIDT   +N  D NT
Sbjct: 61  ASTCFHVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSNANQVS-SEIDTY--DNQVDTNT 117

Query: 158 EVNLLPAPDALDHNTDLFVDSE 93
            + ++  PD L  ++ L V+ E
Sbjct: 118 GLLMIEGPDTLQPSSSLLVNRE 139


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
           gi|223536481|gb|EEF38128.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 753

 Score =  186 bits (473), Expect = 5e-45
 Identities = 87/144 (60%), Positives = 117/144 (81%), Gaps = 1/144 (0%)
 Frame = -2

Query: 515 ASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNA 336
           + DLE +P+TSQKHDPAWKHCQM+KNGER+QLKC+YCGKIFKGGGIHRIKEHLAGQKGNA
Sbjct: 3   SDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNA 62

Query: 335 SSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYN-VGSVSEIDTLADNNHCDLNT 159
           S+CL+V  DV+L+MQ+SL+GVV++KRKKQK+A+EI N N V    EI+  A N+  +++T
Sbjct: 63  STCLQVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNLNPVIGGGEIEVFA-NDQIEVST 121

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
            + L+   + ++ ++ L +  +EG
Sbjct: 122 GMELIGVSNVIEPSSSLLISGQEG 145


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  183 bits (465), Expect = 4e-44
 Identities = 85/144 (59%), Positives = 115/144 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M+S L+ VP+T QKHDPAWKHCQM+KNG+R+QLKCLYC K+FKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C  V P+V+ +MQESL+GV+M+KRK+QKL +E+ N N    +E+D +  +NH D+++
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVN-AMTAEVDAI--SNHMDMDS 117

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
            ++L+   + LD N+ L +  EEG
Sbjct: 118 SIHLIEVAEPLDTNSALLLTHEEG 141


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  179 bits (453), Expect = 1e-42
 Identities = 83/144 (57%), Positives = 114/144 (79%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M+S L+ VP+T QKHDPAWKHCQM+KNG+R+QLKCLYC K+FKGGGIHRIKEHLAGQKGN
Sbjct: 1   MSSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C  V P+V+ +MQESL+GV+M+KRK+QKL +E+ N N     E+D +  +NH D+++
Sbjct: 61  ASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNT-MTGEVDGI--SNHMDMDS 117

Query: 158 EVNLLPAPDALDHNTDLFVDSEEG 87
            ++L+   + L+ N+ L +  E+G
Sbjct: 118 SIHLIEVAEPLETNSVLLLTHEKG 141


>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score =  178 bits (452), Expect = 1e-42
 Identities = 85/146 (58%), Positives = 107/146 (73%), Gaps = 2/146 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M   +ELVP+TSQKHDPAWKHCQM+K  E+I LKC+YCGKIFKGGGIHRIKEHLAGQKGN
Sbjct: 1   MEPHMELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLAD--NNHCDL 165
           AS+CLRV P+V+  M +SLNGV ++K+KK KL +++  Y        D  AD  N H  L
Sbjct: 61  ASTCLRVLPEVKQQMLDSLNGVAVKKKKKLKLTEQLSGY--------DNPADRVNEHSSL 112

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEG 87
           N+E   LP P+ ++H+ D + + EEG
Sbjct: 113 NSEAFFLPGPEIVEHDDDAYEEGEEG 138


>gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  174 bits (440), Expect = 3e-41
 Identities = 84/145 (57%), Positives = 113/145 (77%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QMYKNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C RV  DVRL MQ+SL+GVV++KR+KQK+ +EI++ N    + +++L +NN  D+N 
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVN-PLTTVVNSLPNNNQVDVNQ 119

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGL 84
            +  +     +DHN+ L V+  EG+
Sbjct: 120 GLQAI----GVDHNSSLVVNPGEGM 140


>gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  174 bits (440), Expect = 3e-41
 Identities = 84/145 (57%), Positives = 113/145 (77%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QMYKNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 114 MGSNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGN 173

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNT 159
           AS+C RV  DVRL MQ+SL+GVV++KR+KQK+ +EI++ N    + +++L +NN  D+N 
Sbjct: 174 ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVN-PLTTVVNSLPNNNQVDVNQ 232

Query: 158 EVNLLPAPDALDHNTDLFVDSEEGL 84
            +  +     +DHN+ L V+  EG+
Sbjct: 233 GLQAI----GVDHNSSLVVNPGEGM 253


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
           subsp. vesca]
          Length = 754

 Score =  168 bits (426), Expect = 1e-39
 Identities = 78/121 (64%), Positives = 98/121 (80%)
 Frame = -2

Query: 506 LELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSC 327
           +E VP+TSQKHDPAWKHCQM+K+G+RIQLKC+YC K+F+GGGIHRIKEHLAGQKGNAS+C
Sbjct: 1   MEPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTC 60

Query: 326 LRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNTEVNL 147
           LRV PDVR LMQ+SL+GVV++KR +QKL +EI N       ++D+L      D+N  V L
Sbjct: 61  LRVPPDVRGLMQQSLDGVVVKKRNRQKLDEEITNITPPQDGDVDSLG-GTQSDVNNAVQL 119

Query: 146 L 144
           +
Sbjct: 120 V 120


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
           max]
          Length = 729

 Score =  164 bits (415), Expect = 3e-38
 Identities = 81/147 (55%), Positives = 114/147 (77%), Gaps = 2/147 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH--CDL 165
           AS+C RV  DVRL MQ+SL+GVV++KR+KQ++ +EI++ N    + +++L +NN+   D+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNNRVVDV 119

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEGL 84
           N  +  +     ++HN+ L V+  EG+
Sbjct: 120 NQGLQAI----GVEHNSSLVVNPGEGM 142


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
           max] gi|571489936|ref|XP_006591345.1| PREDICTED:
           uncharacterized protein LOC100817502 isoform X2 [Glycine
           max] gi|571489939|ref|XP_006591346.1| PREDICTED:
           uncharacterized protein LOC100817502 isoform X3 [Glycine
           max]
          Length = 759

 Score =  164 bits (415), Expect = 3e-38
 Identities = 81/147 (55%), Positives = 114/147 (77%), Gaps = 2/147 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH--CDL 165
           AS+C RV  DVRL MQ+SL+GVV++KR+KQ++ +EI++ N    + +++L +NN+   D+
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNNRVVDV 119

Query: 164 NTEVNLLPAPDALDHNTDLFVDSEEGL 84
           N  +  +     ++HN+ L V+  EG+
Sbjct: 120 NQGLQAI----GVEHNSSLVVNPGEGM 142


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
           max] gi|571542833|ref|XP_006601996.1| PREDICTED:
           uncharacterized protein LOC100806265 isoform X2 [Glycine
           max]
          Length = 758

 Score =  164 bits (414), Expect = 3e-38
 Identities = 81/146 (55%), Positives = 113/146 (77%), Gaps = 1/146 (0%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M S+LE VP+TSQKHDPAWKH QM+KNG+++QLKC+YC K+FKGGGIHRIKEHLA QKGN
Sbjct: 1   MGSNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNH-CDLN 162
           AS+C RV  DVRL MQ+SL+GVV++KR+KQ++ +EI++ N    + +++L +NN   D+N
Sbjct: 61  ASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVN-PLTTVVNSLPNNNQVVDVN 119

Query: 161 TEVNLLPAPDALDHNTDLFVDSEEGL 84
             +  +     ++HN+ L V+  EG+
Sbjct: 120 QGLQAI----GVEHNSTLVVNPGEGM 141


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  163 bits (413), Expect = 5e-38
 Identities = 74/100 (74%), Positives = 90/100 (90%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           MAS LE +P++SQKHDPAWKHCQM+KNG+R+QLKCLYC K+F+GGGIHRIKEHLA QKGN
Sbjct: 1   MASGLEPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGN 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYN 219
           AS+C RV  DVRL MQ+SL+GVV++K+KKQK+A+EI N N
Sbjct: 61  ASTCSRVPLDVRLAMQQSLDGVVVKKKKKQKIAEEITNNN 100


>gb|ESW31639.1| hypothetical protein PHAVU_002G255200g [Phaseolus vulgaris]
           gi|561033061|gb|ESW31640.1| hypothetical protein
           PHAVU_002G255200g [Phaseolus vulgaris]
          Length = 172

 Score =  144 bits (364), Expect = 2e-32
 Identities = 72/139 (51%), Positives = 102/139 (73%)
 Frame = -2

Query: 500 LVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSCLR 321
           LVP+TSQKHDP WKH QM+KN +++QLKC+Y  K+F+GGGI RIKEHLA QKGNAS C R
Sbjct: 11  LVPITSQKHDPIWKHVQMFKNSDKVQLKCIYFLKMFEGGGIRRIKEHLACQKGNASICSR 70

Query: 320 VQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCDLNTEVNLLP 141
           +  DV+L MQ+SL+G V++K +KQK+ +EII+ N   ++ ++ L +NN  D+N  +  + 
Sbjct: 71  LPHDVKLNMQQSLDGAVVKKMRKQKI-EEIISVNPLGIA-VNLLPNNNQVDVNQGLQAI- 127

Query: 140 APDALDHNTDLFVDSEEGL 84
               +DHN+ L V+  EG+
Sbjct: 128 ---GVDHNSSLVVNPSEGM 143


>gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao]
          Length = 750

 Score =  134 bits (336), Expect = 4e-29
 Identities = 66/144 (45%), Positives = 95/144 (65%), Gaps = 2/144 (1%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M  +L  + +T QK DPAW HC+ +KNGER+Q+KC+YCGK+FKGGGIHR KEHLAG+KG 
Sbjct: 1   MELNLTPISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGS--VSEIDTLADNNHCDL 165
              C +V P VR LMQESLNGV++++  KQ    E++     S    EID  A ++  D+
Sbjct: 61  GPICEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAGEIDKSAYSD--DV 118

Query: 164 NTEVNLLPAPDALDHNTDLFVDSE 93
           N  V  +   ++L+ ++ L ++ +
Sbjct: 119 NNGVKPIQVLNSLEPDSSLVLNGK 142


>ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana]
           gi|79313325|ref|NP_001030742.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
           gi|11994740|dbj|BAB03069.1| transposase-like protein
           [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis
           thaliana] gi|332643084|gb|AEE76605.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
           gi|332643085|gb|AEE76606.1| hAT dimerization
           domain-containing protein [Arabidopsis thaliana]
          Length = 761

 Score =  121 bits (303), Expect = 3e-25
 Identities = 55/109 (50%), Positives = 79/109 (72%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M SDLE V +T QK D AWKHC++YK G+R+Q++CLYC K+FKGGGI R+KEHLAG+KG 
Sbjct: 1   MDSDLEPVALTPQKQDSAWKHCEVYKYGDRVQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDT 192
            + C +V  +VRL +Q+ ++G V R+RK++K + E +        E++T
Sbjct: 61  GTICDQVPDEVRLFLQQCIDGTVRRQRKRRKSSPEPLPIAYFPPCEVET 109


>ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana]
           gi|240255844|ref|NP_193238.5| hAT transposon superfamily
           [Arabidopsis thaliana] gi|332658140|gb|AEE83540.1| hAT
           transposon superfamily [Arabidopsis thaliana]
           gi|332658141|gb|AEE83541.1| hAT transposon superfamily
           [Arabidopsis thaliana]
          Length = 768

 Score =  119 bits (297), Expect = 1e-24
 Identities = 52/95 (54%), Positives = 73/95 (76%)
 Frame = -2

Query: 518 MASDLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGN 339
           M ++LE V +T QK D AWKHC++YK G+R+Q++CLYC K+FKGGGI R+KEHLAG+KG 
Sbjct: 1   MDAELEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQ 60

Query: 338 ASSCLRVQPDVRLLMQESLNGVVMRKRKKQKLADE 234
            + C +V  DVRL +Q+ ++G V R+RK+ K + E
Sbjct: 61  GTICDQVPEDVRLFLQQCIDGTVRRQRKRHKSSSE 95


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
           gi|223539752|gb|EEF41333.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 854

 Score =  119 bits (297), Expect = 1e-24
 Identities = 65/137 (47%), Positives = 91/137 (66%), Gaps = 1/137 (0%)
 Frame = -2

Query: 491 VTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASSCLRVQP 312
           VT  K D AWK+CQ  K G+R+Q+KC YCGK+FKGGGIHR KEHLAG+KG A  C RV  
Sbjct: 122 VTRHKKDMAWKYCQPSKYGDRVQIKCNYCGKVFKGGGIHRFKEHLAGRKGAAPICDRVPS 181

Query: 311 DVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVS-EIDTLADNNHCDLNTEVNLLPAP 135
           DVRLLMQ+ L+ VV +++K++ + +E IN +   V    DT A  NH     + N   AP
Sbjct: 182 DVRLLMQQCLHEVVPKQKKQKVVIEETINVDSPPVPLNTDTFA--NHFGDEDDDN--GAP 237

Query: 134 DALDHNTDLFVDSEEGL 84
            +++ N++L ++ ++ L
Sbjct: 238 ISVEFNSNLSLEEDDVL 254



 Score =  102 bits (255), Expect = 9e-20
 Identities = 49/114 (42%), Positives = 71/114 (62%)
 Frame = -2

Query: 509 DLELVPVTSQKHDPAWKHCQMYKNGERIQLKCLYCGKIFKGGGIHRIKEHLAGQKGNASS 330
           +++  P    KHD  WK+C+M K GE++ +KC YCGKIFKGGGI R KEHLAG+KG    
Sbjct: 2   EIQTPPSLGHKHDLGWKYCEMIKEGEKVHIKCSYCGKIFKGGGIFRFKEHLAGRKGGGPM 61

Query: 329 CLRVQPDVRLLMQESLNGVVMRKRKKQKLADEIINYNVGSVSEIDTLADNNHCD 168
           CL V  DVRLLM+++L+ V   K+  ++ +  +         E+ +L +N + D
Sbjct: 62  CLNVPADVRLLMEQTLD-VSSAKQSSRRQSSRL-----KMTPELPSLPNNKNSD 109

