BLASTX nr result

ID: Catharanthus23_contig00016043 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00016043
         (1692 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264...   378   e-102
ref|XP_002310902.1| predicted protein [Populus trichocarpa]           350   1e-93
ref|XP_002530377.1| protein dimerization, putative [Ricinus comm...   346   2e-92
gb|EOY26199.1| HAT transposon superfamily protein, putative [The...   341   6e-91
ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589...   332   2e-88
ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247...   322   4e-85
ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247...   322   4e-85
ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251...   311   4e-82
ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580...   308   6e-81
ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250...   306   1e-80
ref|XP_006366951.1| PREDICTED: uncharacterized protein LOC102590...   300   2e-78
ref|XP_002312861.1| predicted protein [Populus trichocarpa]           259   3e-66
ref|NP_187909.1| hAT transposon superfamily protein [Arabidopsis...   238   8e-60
ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250...   234   6e-59
ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis...   227   1e-56
ref|XP_006297473.1| hypothetical protein CARUB_v10013494mg [Caps...   218   8e-54
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   207   1e-50
gb|EOY18075.1| HAT and BED zinc finger domain-containing protein...   204   7e-50
gb|EEC81276.1| hypothetical protein OsI_24379 [Oryza sativa Indi...   204   1e-49
gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma c...   203   2e-49

>ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264734 [Vitis vinifera]
          Length = 714

 Score =  378 bits (970), Expect = e-102
 Identities = 184/345 (53%), Positives = 242/345 (70%), Gaps = 1/345 (0%)
 Frame = +2

Query: 8    KAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXXX 187
            KAK IT+F+H HA+VL+L+R+ TS + LVKPSKI+   P+LTLEN+V EK NL+      
Sbjct: 349  KAKAITKFIHSHATVLKLMRNYTSANTLVKPSKIKLAKPFLTLENIVSEKDNLQNMFVSS 408

Query: 188  XXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYET 367
                       EGK V +LV D +FW GAI+VLKAT PLV+VL  +N SD  Q+G+IY+T
Sbjct: 409  GWNSLIWASREEGKRVADLVVDPAFWTGAIMVLKATIPLVRVLSWINGSDKPQMGYIYDT 468

Query: 368  MDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDVE 547
            MDQAKE I +E KD +S YMPFWE ID IWN++L+SPLH+ GY+LNP+ FY +DF+ D E
Sbjct: 469  MDQAKEAIAKEFKDKKSQYMPFWEVIDEIWNKHLYSPLHSTGYYLNPHFFYSSDFHCDAE 528

Query: 548  VSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGKE 724
            V+SG+LCCIVR+  D   +D I  Q+D Y    GAF  GSA +  + + PV WW  YG++
Sbjct: 529  VASGILCCIVRMVPDLHVQDVIGLQLDKYLWTEGAFAQGSAFDQRTNIPPVLWWSHYGRQ 588

Query: 725  YPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQNF 904
            +PE Q+ A +ILSQTC GAS+Y+LK+S+AE LL   RN IEQQRL+DL F+HYNL LQ F
Sbjct: 589  HPEFQRFATRILSQTCDGASRYELKKSLAEKLLMKGRNPIEQQRLSDLIFLHYNLHLQGF 648

Query: 905  ESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLDCRDST 1039
            +S    DI + E++ ++DWI + A+   S + +  WMDLDC D T
Sbjct: 649  KSRLNADIVLEEIDPMDDWIVEEAKESSSQNGDTAWMDLDCEDRT 693


>ref|XP_002310902.1| predicted protein [Populus trichocarpa]
          Length = 705

 Score =  350 bits (898), Expect = 1e-93
 Identities = 179/342 (52%), Positives = 236/342 (69%), Gaps = 2/342 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAK IT+F++GH  VL+L+R+    +DL+KPSK++  MP+ TLEN++ EK NLE     
Sbjct: 344  EKAKIITKFIYGHKKVLKLMRNHIDDYDLIKPSKMKLAMPFFTLENILSEKKNLEEMFDS 403

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EG  V  LV D SFW+GA +  KAT PL++VL ++N  D  Q+G IYE
Sbjct: 404  FEWKTSVWSSTVEGMRVAHLVGDHSFWSGAEMASKATVPLLRVLCLVNEGDKPQVGFIYE 463

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            TMDQ KETI++E K+ +S Y PFW AID+IW+  LHSPLHAAGY+LNP LFY +DFY D 
Sbjct: 464  TMDQVKETIKKEFKNKKSDYTPFWTAIDDIWDTRLHSPLHAAGYYLNPCLFYSSDFYSDP 523

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANL-HSYVSPVTWWLEYGK 721
            EV+ GLLCC+VR+  DQRT+ +I  Q+D Y++A GAF  G A +  + +SP  WW  YGK
Sbjct: 524  EVTFGLLCCVVRMVADQRTQLKITFQLDEYRHARGAFQEGKAIVKRTNISPAQWWCTYGK 583

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            + PELQ+ A++ILSQTC GAS+Y LKRS+AE LL+ RRN IEQQRL DL FVHYNLQ+QN
Sbjct: 584  QCPELQRFAVRILSQTCDGASRYGLKRSMAEKLLTDRRNPIEQQRLRDLTFVHYNLQVQN 643

Query: 902  FESCFTDDISIYEMNQIEDWIGDNA-QTLVSPSDEPTWMDLD 1024
              S F  D+   E++ ++D + D A Q +V  + +   MD D
Sbjct: 644  KRSGFRSDVISEEIDPMDDRVVDEAPQEVVPENGDRGLMDSD 685


>ref|XP_002530377.1| protein dimerization, putative [Ricinus communis]
            gi|223530094|gb|EEF32010.1| protein dimerization,
            putative [Ricinus communis]
          Length = 698

 Score =  346 bits (888), Expect = 2e-92
 Identities = 173/346 (50%), Positives = 234/346 (67%), Gaps = 1/346 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAK IT+F++G+  VL+L+R+ T+ +DLVK S+++  +P+LTLEN++ EK NLE     
Sbjct: 344  EKAKIITKFIYGNGEVLKLMRNYTNSYDLVKTSRMKFGVPFLTLENIISEKKNLENMFAS 403

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK V  L+ D SFW GA + L+AT PL++VL ++  +D  Q+G IYE
Sbjct: 404  SEWMTSVWASSPEGKRVAHLMGDLSFWTGAEMTLRATVPLLRVLCLIIEADKPQVGFIYE 463

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            TMDQAKETI++E ++ +S Y+PFWE ID IW+ +LHSPLHAAGY+LNP+LFY  DFY D 
Sbjct: 464  TMDQAKETIKEEFRNKKSQYVPFWEIIDEIWDTHLHSPLHAAGYYLNPSLFYSTDFYSDP 523

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGK 721
            EVS GLLCCIVR+  D RT+D I  Q+D Y++A GAF  GSA N  + +SP  WW  YGK
Sbjct: 524  EVSFGLLCCIVRMVQDPRTQDLISLQLDEYRHARGAFKEGSAINKRTNISPAQWWSIYGK 583

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            ++PELQ  AI+ILSQTC GA K+ LKR +AE LL   RN  EQQRL +L +VHYNL LQN
Sbjct: 584  QHPELQNFAIKILSQTCDGAMKFGLKRGLAEKLLLNGRNCNEQQRLDELTYVHYNLHLQN 643

Query: 902  FESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLDCRDST 1039
             +      +   E++ ++DW+ D    +        WM+ DC ++T
Sbjct: 644  TQFGVEGGLGAEEIDPMDDWVVDKTLEIAPKIGGLEWMEADCTEAT 689


>gb|EOY26199.1| HAT transposon superfamily protein, putative [Theobroma cacao]
          Length = 709

 Score =  341 bits (874), Expect = 6e-91
 Identities = 171/344 (49%), Positives = 229/344 (66%), Gaps = 4/344 (1%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            + A+TI++F+HGH +VL LLRD T  HDL+KP+K+RS MP++TLEN++ EK NL+     
Sbjct: 347  ENARTISKFIHGHLTVLNLLRDYTDGHDLIKPTKVRSAMPFVTLENIIAEKKNLKAMFAS 406

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK V +LV D SFW GA  V+K   PL++VL ++N  D  Q+G+IYE
Sbjct: 407  SEWNTSAWASRAEGKRVADLVGDPSFWKGAGRVVKTALPLIRVLCLINGDDKPQMGYIYE 466

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            TMDQ KETI++E     S YMPFWE ID IW+ +LHSPLHAAG+FLNP+LFY  DF  D 
Sbjct: 467  TMDQMKETIKKECNSKESQYMPFWELIDKIWDGHLHSPLHAAGHFLNPSLFYSTDFQSDS 526

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS-ANLHSYVSPVTWWLEYGK 721
            EV+ GLLCC+VR+   Q  +D+I++Q++ Y+N+ GAFG GS     +  S   WW  YG 
Sbjct: 527  EVAFGLLCCMVRMIQSQPIQDKIVQQLEAYRNSEGAFGEGSTVQQRTRFSSTMWWSTYGG 586

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
              PELQ+ A +ILSQTC GASKY+L RS+ E LL+  RN +EQQ L+DL FVHYNLQLQ 
Sbjct: 587  RCPELQRFATRILSQTCVGASKYRLNRSLVEKLLTKGRNPVEQQLLSDLIFVHYNLQLQQ 646

Query: 902  FESC---FTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
             +        DI+  E++ +++WI D+   + S   +  W +LD
Sbjct: 647  QQRSQFGVNYDIAGDEIDAMDEWIVDDTPEIGSRDGDSAWKELD 690


>ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589543 isoform X1 [Solanum
            tuberosum] gi|565402986|ref|XP_006366949.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X2 [Solanum
            tuberosum] gi|565402988|ref|XP_006366950.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X3 [Solanum
            tuberosum]
          Length = 686

 Score =  332 bits (852), Expect = 2e-88
 Identities = 168/341 (49%), Positives = 236/341 (69%), Gaps = 1/341 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAKT+T+F++ HA+ L+LLRD     +LVK SKIRS +P+LTLEN+V +K  L      
Sbjct: 317  EKAKTLTQFIYSHATALKLLRDACP-DELVKSSKIRSIVPFLTLENIVSQKDCLIRMFQS 375

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++ +V D SFW+ A++ +KAT PLV+V+ +++ ++  Q+G IY+
Sbjct: 376  SDWRTSIMASTNEGKRISNMVKDESFWSEALMAVKATIPLVEVMKLLDGTNKPQVGFIYD 435

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQAKETI++E +D +SLY  FW AID+IW++YLHS LHAAGYFLNP LFY +DFY DV
Sbjct: 436  TLDQAKETIKKEFQDKKSLYAKFWIAIDDIWDEYLHSHLHAAGYFLNPTLFYSSDFYTDV 495

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS-ANLHSYVSPVTWWLEYGK 721
            EVS GL CC+VR+  D+  +D I  Q+D Y+   G F  GS  +  S +SP  WW +YG 
Sbjct: 496  EVSCGLCCCVVRMAEDRHIQDLITLQIDEYRMGRGTFHFGSFKDKLSNISPALWWSQYGV 555

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            ++PELQ+LA++ILSQTC+GAS Y+LKRS+ E L +   N IE+QRL DL FVH NLQLQ 
Sbjct: 556  QFPELQRLAVRILSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQDLVFVHCNLQLQA 615

Query: 902  FESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
            F+   ++D + Y ++ +++WI      LV  + + TWMDL+
Sbjct: 616  FDPDGSNDNTDY-VDPMDEWIVGKEPNLVPENTQLTWMDLE 655


>ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247551 isoform 2 [Solanum
            lycopersicum]
          Length = 682

 Score =  322 bits (824), Expect = 4e-85
 Identities = 162/341 (47%), Positives = 226/341 (66%), Gaps = 1/341 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAKT+T+F++ HA+ L+LLRD     +LVK SKIRS +P+LTLEN+V +K  L      
Sbjct: 317  EKAKTLTQFIYNHATALKLLRDACP-DELVKSSKIRSIVPFLTLENIVSQKDCLISMFQS 375

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++E+V + SFW+ A++ +KAT PLVKV+ ++N ++  Q+G IY+
Sbjct: 376  SDWHTSIMASTNEGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLNGTNKPQIGFIYD 435

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQ K TI++E +   SLY  FW AID+IWN YLHS LHAAGYFLNP  FY +DFY D 
Sbjct: 436  TLDQIKVTIKKEFQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNPIYFYSSDFYADA 495

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSY-VSPVTWWLEYGK 721
            EV+SGL CC+VR+  D+  +D I  Q+D Y+     F  GS       +SP  WW +YG 
Sbjct: 496  EVTSGLCCCVVRMTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLINISPALWWSQYGV 555

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            +YPE+Q+ A ++LSQTC+GAS Y+LKRS+ E L +   N IE+QRL DL FVH NLQLQ 
Sbjct: 556  QYPEIQRFAFRLLSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQDLVFVHCNLQLQA 615

Query: 902  FESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
            F+   ++D + Y ++ +++WI      LV  + + TWMDL+
Sbjct: 616  FDPDGSNDNTDYVVDPMDEWIVRKEPNLVHENTQLTWMDLE 656


>ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247551 isoform 1 [Solanum
            lycopersicum]
          Length = 692

 Score =  322 bits (824), Expect = 4e-85
 Identities = 162/341 (47%), Positives = 226/341 (66%), Gaps = 1/341 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAKT+T+F++ HA+ L+LLRD     +LVK SKIRS +P+LTLEN+V +K  L      
Sbjct: 327  EKAKTLTQFIYNHATALKLLRDACP-DELVKSSKIRSIVPFLTLENIVSQKDCLISMFQS 385

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++E+V + SFW+ A++ +KAT PLVKV+ ++N ++  Q+G IY+
Sbjct: 386  SDWHTSIMASTNEGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLNGTNKPQIGFIYD 445

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQ K TI++E +   SLY  FW AID+IWN YLHS LHAAGYFLNP  FY +DFY D 
Sbjct: 446  TLDQIKVTIKKEFQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNPIYFYSSDFYADA 505

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSY-VSPVTWWLEYGK 721
            EV+SGL CC+VR+  D+  +D I  Q+D Y+     F  GS       +SP  WW +YG 
Sbjct: 506  EVTSGLCCCVVRMTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLINISPALWWSQYGV 565

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            +YPE+Q+ A ++LSQTC+GAS Y+LKRS+ E L +   N IE+QRL DL FVH NLQLQ 
Sbjct: 566  QYPEIQRFAFRLLSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQDLVFVHCNLQLQA 625

Query: 902  FESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
            F+   ++D + Y ++ +++WI      LV  + + TWMDL+
Sbjct: 626  FDPDGSNDNTDYVVDPMDEWIVRKEPNLVHENTQLTWMDLE 666


>ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251332 [Vitis vinifera]
          Length = 709

 Score =  311 bits (798), Expect = 4e-82
 Identities = 163/320 (50%), Positives = 204/320 (63%), Gaps = 1/320 (0%)
 Frame = +2

Query: 8    KAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXXX 187
            KAKTITRF++ HA VL L+R+ T VHDLVKPSK +S +P+LTL+N+V EK  LE      
Sbjct: 347  KAKTITRFIYCHAMVLNLMRNHTLVHDLVKPSKSKSAIPFLTLQNIVLEKGRLEKMFISS 406

Query: 188  XXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYET 367
                       EGK V ++V D SFW+GA +VLK T PLV VL  +      Q+ +IYET
Sbjct: 407  EWKTSCWASRREGKRVADIVLDPSFWSGAEMVLKPTIPLVGVLCSIIRGGKGQMCYIYET 466

Query: 368  MDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDVE 547
            MD  KE I +E ++  S YMPFWE ID IWN +LHS LHAA   LNP +FY  D+  D E
Sbjct: 467  MDAVKEDIAEEFENNESQYMPFWELIDEIWNNHLHSALHAAANHLNPAIFYSRDYNFDKE 526

Query: 548  VSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGKE 724
            V  G+ CCI  +  D+  ++EI  Q++ Y++A G FGLG A    +   P  WW  YG  
Sbjct: 527  VFEGINCCIEHMVPDEHIQNEIWLQLEQYKDAEGDFGLGKATERRNIFHPALWWSNYGGH 586

Query: 725  YPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQNF 904
             PELQKLA +ILSQTC GAS+YKLKRS+AENLL+  RN I Q RL DL FVHYNL L+N 
Sbjct: 587  CPELQKLATRILSQTCDGASRYKLKRSLAENLLAKGRNPIGQGRLCDLTFVHYNLHLRNA 646

Query: 905  ESCFTDDISIYEMNQIEDWI 964
            +     D    E++ + DWI
Sbjct: 647  DWSTDTDHEFGEIDPMNDWI 666


>ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580052 [Solanum tuberosum]
          Length = 586

 Score =  308 bits (788), Expect = 6e-81
 Identities = 154/340 (45%), Positives = 227/340 (66%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAK + +F++ HA+VL+LLRD  S  +LVK SKI++ +P+LTLEN+V +K  L      
Sbjct: 251  EKAKMLVQFIYSHATVLKLLRDAFSEAELVKSSKIKAIVPFLTLENIVSQKDGLIRMFQS 310

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++E++ D SFW  A++ +KAT PLV+V+  +N ++ +Q+G I++
Sbjct: 311  STWQTSLLASTSEGKGMSEMIKDESFWTEALMAVKATIPLVEVIKFLNGTNKAQVGFIHD 370

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQAKETIR+E K  R  +   W AID+ WN+YLHSPLH AGY+LNP  F+ +++ ++V
Sbjct: 371  TLDQAKETIRKEFKSTRFCHAKIWNAIDDTWNKYLHSPLHDAGYYLNPTFFHSSNWCLNV 430

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSYVSPVTWWLEYGKE 724
            ++S GL  CI  +  D+R +D I +Q+  +        L S  + S +SP  WW +Y  E
Sbjct: 431  KISDGLCSCITGMAEDRRIKDLITQQIGTFD------FLSSKEILSDISPGHWWSKYEVE 484

Query: 725  YPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQNF 904
            +PEL++LA++ILSQTC+GAS Y+LKRS+ E L    RNQIEQQRL+DL FVH NLQLQ F
Sbjct: 485  FPELERLAVRILSQTCNGASHYRLKRSLVETLHRKGRNQIEQQRLSDLVFVHCNLQLQAF 544

Query: 905  ESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
            +    +DI+   ++ +++WI    + LVS + + TWMDLD
Sbjct: 545  DPEGENDIAEDVVDSMDEWIVGKGENLVSENTQLTWMDLD 584


>ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250835 [Solanum
            lycopersicum]
          Length = 640

 Score =  306 bits (785), Expect = 1e-80
 Identities = 160/368 (43%), Positives = 232/368 (63%), Gaps = 28/368 (7%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAKT+T+F++ HA+VL+LLRD     +LVK SKIR  +P+LTLEN+V +K  L      
Sbjct: 247  EKAKTLTQFIYSHATVLKLLRDACP-DELVKSSKIRFIVPFLTLENIVSQKKCLIRMFQS 305

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++E+V DRSFW   ++ +KAT PLV+V+ +++ ++  Q+G IY+
Sbjct: 306  SDWHSSVLASTIEGKRMSEMVEDRSFWTEGLMAVKATIPLVEVIKLLDCTNKPQVGFIYD 365

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQAKETI++E +  RS Y  FW+AID+IW++Y HS LHA GYFLNP LFY ++FY DV
Sbjct: 366  TLDQAKETIKKEFRHKRSHYARFWKAIDDIWDEYFHSHLHAVGYFLNPTLFYSSNFYTDV 425

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS------------------- 667
            EV+ GL CC+VR+  D+  +  I +Q+D Y+   G F  GS                   
Sbjct: 426  EVTCGLCCCVVRMTEDRHIQHLITQQIDEYRKGRGTFHFGSFKDKLSNISPGGIIYTFSA 485

Query: 668  ----ANLHSYVS-----PVTWWLEYGKEYPELQKLAIQILSQTCSGASKYKLKRSVAENL 820
                   +SY++        WW +YG + PELQ+ A++ILSQTC+GAS Y+LKR++ E L
Sbjct: 486  ILIMLTYNSYINLYVMVAALWWSQYGGQCPELQRFAVRILSQTCNGASHYRLKRNLVETL 545

Query: 821  LSMRRNQIEQQRLTDLAFVHYNLQLQNFESCFTDDISIYEMNQIEDWIGDNAQTLVSPSD 1000
            L+   N IE+QRL DL FVH NLQLQ F+   ++D +   ++ +++WI      ++S + 
Sbjct: 546  LTEGMNLIEKQRLQDLVFVHCNLQLQAFDPDGSNDDTDNVVDPMDEWIVGKGPNVMSVNT 605

Query: 1001 EPTWMDLD 1024
            E TWMDL+
Sbjct: 606  ELTWMDLE 613


>ref|XP_006366951.1| PREDICTED: uncharacterized protein LOC102590309 [Solanum tuberosum]
          Length = 507

 Score =  300 bits (767), Expect = 2e-78
 Identities = 149/340 (43%), Positives = 224/340 (65%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            KK K + +F++ HA+VL+LLRD     +LVK SKI++ +P+LTL N++ +K  L      
Sbjct: 172  KKTKMLVQFIYSHATVLKLLRDAFPEVELVKSSKIKAIVPFLTLGNIISQKNGLIRMFQS 231

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK ++E++ D SFW  A++ +KAT PLV+V+  +N ++ +Q+G I++
Sbjct: 232  STWQTSLLASTSEGKGMSEMIKDESFWTEALMAVKATIPLVEVIKFLNGTNKAQVGFIHD 291

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            T+DQAKET+R+E +  R  +   W AID+ WN+YLHSPLH AGY+LNP  F+ +++ ++V
Sbjct: 292  TLDQAKETVRKEFERTRFCHAKIWNAIDDTWNKYLHSPLHDAGYYLNPTFFHSSNWCLNV 351

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSYVSPVTWWLEYGKE 724
            ++S GL  CI  +  D+R +D I +Q+  +        L S  + S +SP  WW +Y  E
Sbjct: 352  KISDGLCSCITGMAEDRRIKDLITQQIGTFD------FLSSKEILSDISPGHWWSKYEVE 405

Query: 725  YPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQNF 904
            +PEL++LA++ILSQTC+GAS Y+LKRS+ E L    RNQIEQQRL+DL FVH NLQLQ F
Sbjct: 406  FPELERLAVRILSQTCNGASHYRLKRSLVETLHRKGRNQIEQQRLSDLVFVHCNLQLQAF 465

Query: 905  ESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
            +    +DI+   ++ +++WI    + LVS + + TWMDLD
Sbjct: 466  DPEGENDIAEDVVDSMDEWIVGKGENLVSENTQLTWMDLD 505


>ref|XP_002312861.1| predicted protein [Populus trichocarpa]
          Length = 621

 Score =  259 bits (661), Expect = 3e-66
 Identities = 137/300 (45%), Positives = 183/300 (61%), Gaps = 3/300 (1%)
 Frame = +2

Query: 8    KAKTITRFVHGHASVLRLLRDQTSVH--DLVKPSKIRSTMPYLTLENMVFEKVNLEXXXX 181
            +A ++ RFVH +A+VL++ RD T     +L KPSK+RS +P+L LE+++  K  L+    
Sbjct: 292  EATSLVRFVHNNAAVLKMFRDFTGSERENLFKPSKMRSAIPFLILESILSYKEELKEMFT 351

Query: 182  XXXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIY 361
                         EGK    LV   SFW  A +  KAT  L++V+D ++  +   +G IY
Sbjct: 352  SLEWKSCFWSQQVEGKKAAGLVKSSSFWKRAGMASKATTALIRVVDKISADNKPSIGFIY 411

Query: 362  ETMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVD 541
            ETMDQ KE I+ E +D +S ++P WE ID IW+ +LHSPLHAA Y+LNP  FY  +F++D
Sbjct: 412  ETMDQIKEAIQYEFRDSKSGHIPLWELIDEIWDDFLHSPLHAAAYYLNPTFFYNRNFHLD 471

Query: 542  VEVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLH-SYVSPVTWWLEYG 718
             EVSSGL C ++R+  DQR +  I KQ   Y  A G F  G A    +   P  WW  YG
Sbjct: 472  TEVSSGLQCSVIRMENDQRIQYLINKQAAQYCRADGDFENGYAEGEINNAHPDLWWSVYG 531

Query: 719  KEYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQ 898
               PELQKLAI+ILSQTC G+ +Y L RS+AE L+   +NQ EQ RL D  FV YNLQL+
Sbjct: 532  NRCPELQKLAIRILSQTCDGSGRYSLDRSLAEKLVCKEQNQHEQHRLRDQMFVRYNLQLE 591


>ref|NP_187909.1| hAT transposon superfamily protein [Arabidopsis thaliana]
            gi|79313211|ref|NP_001030685.1| hAT transposon
            superfamily protein [Arabidopsis thaliana]
            gi|238479754|ref|NP_001154612.1| hAT transposon
            superfamily protein [Arabidopsis thaliana]
            gi|15795135|dbj|BAB02513.1| transposase-like protein
            [Arabidopsis thaliana] gi|28393338|gb|AAO42094.1| unknown
            protein [Arabidopsis thaliana] gi|28827476|gb|AAO50582.1|
            unknown protein [Arabidopsis thaliana]
            gi|222424407|dbj|BAH20159.1| AT3G13030 [Arabidopsis
            thaliana] gi|332641757|gb|AEE75278.1| hAT transposon
            superfamily protein [Arabidopsis thaliana]
            gi|332641758|gb|AEE75279.1| hAT transposon superfamily
            protein [Arabidopsis thaliana]
            gi|332641759|gb|AEE75280.1| hAT transposon superfamily
            protein [Arabidopsis thaliana]
          Length = 544

 Score =  238 bits (606), Expect = 8e-60
 Identities = 127/316 (40%), Positives = 186/316 (58%), Gaps = 3/316 (0%)
 Frame = +2

Query: 2    FKKAKTITRFVHGHASVLRLLRDQTSVHDL-VKPSKIRSTMPYLTLENMVFEKVNLEXXX 178
            F K   I  F++ + SVL + RDQ    D+ V  S+     PYL LE++   K NL    
Sbjct: 234  FDKVNNIWLFINNNPSVLNIFRDQCHGIDITVSSSEFEFVTPYLILESIFKAKKNLTAMF 293

Query: 179  XXXXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHI 358
                              ++ LV+D SFW     VLK T PL+  L + + +++  LG++
Sbjct: 294  ASSNWNNEQCIA------ISNLVSDSSFWETVESVLKCTSPLIHGLLLFSTANNQHLGYV 347

Query: 359  YETMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYV 538
            Y+TMD  KE+I +E       Y P W+ ID++WN++LH+PLHAAGYFLNP  FY  +F++
Sbjct: 348  YDTMDSIKESIAREFNHKPQFYKPLWDVIDDVWNKHLHNPLHAAGYFLNPTAFYSTNFHL 407

Query: 539  DVEVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS-ANLHSYVSPVTWWLEY 715
            D+EV +GL+  ++ +  D   + +I  Q+DMY+     F   S A+  + +SP  WW   
Sbjct: 408  DIEVVTGLISSLIHMVEDCHVQFKISTQIDMYRLGKDCFNEASQADQITGISPAEWWAHK 467

Query: 716  GKEYPELQKLAIQILSQTCSGASKYKLKRSVAEN-LLSMRRNQIEQQRLTDLAFVHYNLQ 892
              +YPELQ LAI+ILSQTC GASKYKLKRS+AE  LLS   +  E+Q L +L FV YNL 
Sbjct: 468  ASQYPELQSLAIKILSQTCEGASKYKLKRSLAEKLLLSEGMSNRERQHLDELVFVQYNLH 527

Query: 893  LQNFESCFTDDISIYE 940
            LQ++++  +++I +Y+
Sbjct: 528  LQSYKAKLSEEIDVYK 543


>ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250543 [Solanum
            lycopersicum]
          Length = 618

 Score =  234 bits (598), Expect = 6e-59
 Identities = 128/342 (37%), Positives = 198/342 (57%), Gaps = 2/342 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +KAK + +F++ H + ++LL D     +LVK SK+++ +P+LTL+N+V +K  L      
Sbjct: 267  QKAKMLIQFIYSHTTTMKLLSDVFPGVELVKSSKVKAIVPFLTLQNIVSQKDVLIRMFQS 326

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        EGK + E++ D S W+   +  + T PLV+V+  +N ++  Q G I  
Sbjct: 327  SAWGTSQLASTSEGKRIAEMIEDASVWSNFGMAARVTIPLVEVIKYLNGTNKPQAGFISN 386

Query: 365  TMDQAKETIRQELKDLRSLYM--PFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYV 538
             + QAKE I+ E +  R L+     W  I+  W +YLHS LH AGY+LNP  FY +D+  
Sbjct: 387  RLYQAKEIIKMEFRS-RQLWRHEETWNKIEETWKKYLHSDLHGAGYYLNPCYFYSSDWLG 445

Query: 539  DVEVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSYVSPVTWWLEYG 718
              E++ GL   I R+ G    +  I +Q+  +         GS  +   +SP  WWL+Y 
Sbjct: 446  TAEITCGLCKTIDRIAG--HIKGLITQQIKEFDFD------GSREILPDISPAQWWLKYE 497

Query: 719  KEYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQ 898
             EYPEL++ A++ILSQTC GAS Y+LKR + E L +  R++IEQQRL DL FVH NLQLQ
Sbjct: 498  VEYPELERFAVRILSQTCDGASHYRLKRRLVETLHTKGRSEIEQQRLKDLVFVHCNLQLQ 557

Query: 899  NFESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
             F+    +DI+   ++ +++WI  +   +VS + + TWMD++
Sbjct: 558  GFDPEGENDIAEDVVDAMDEWILGDRANVVSENSQCTWMDME 599


>ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis thaliana]
            gi|15795134|dbj|BAB02512.1| transposase-like protein
            [Arabidopsis thaliana] gi|332641756|gb|AEE75277.1| hAT
            transposon superfamily protein [Arabidopsis thaliana]
          Length = 605

 Score =  227 bits (578), Expect = 1e-56
 Identities = 126/302 (41%), Positives = 183/302 (60%), Gaps = 4/302 (1%)
 Frame = +2

Query: 8    KAKTITRFVHGHASVLRLLRDQTSVHDL-VKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            K  TI  F++ + S L++ RDQ+   D+ V  S+     PYL L+++   K NL      
Sbjct: 309  KVNTIWEFINNNPSALKIYRDQSHGKDITVSSSEFEFVKPYLILKSVFKAKKNLAAMFAS 368

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQ-LGHIY 361
                        EGK V+ LV D SFW     +LK T PL   L + +N+D++Q +G+IY
Sbjct: 369  SVWKKE------EGKSVSNLVNDSSFWEAVEEILKCTSPLTDGLRLFSNADNNQHVGYIY 422

Query: 362  ETMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVD 541
            +T+D  K +I++E  D +  Y+  W+ ID++WN++LH+PLHAAGY+LNP  FY  DF++D
Sbjct: 423  DTLDGIKLSIKKEFNDEKKHYLTLWDVIDDVWNKHLHNPLHAAGYYLNPTSFYSTDFHLD 482

Query: 542  VEVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS-ANLHSYVSPVTWWLEYG 718
             EVSSGL   +V +  + + +  I  Q+D Y+     F   S  +  S +SP+ WW E  
Sbjct: 483  PEVSSGLTHSLVHVAKEGQIK--IASQLDRYRLGKDCFNEASQPDQISGISPIDWWTEKA 540

Query: 719  KEYPELQKLAIQILSQTCSGASKYKLKRSVAEN-LLSMRRNQIEQQRLTDLAFVHYNLQL 895
             ++PELQ  AI+ILSQTC GAS+YKLKRS+AE  LL+   +  E++ L +LAFVHYNL L
Sbjct: 541  SQHPELQSFAIKILSQTCEGASRYKLKRSLAEKLLLTEGMSHCERKHLEELAFVHYNLHL 600

Query: 896  QN 901
            Q+
Sbjct: 601  QS 602


>ref|XP_006297473.1| hypothetical protein CARUB_v10013494mg [Capsella rubella]
            gi|482566182|gb|EOA30371.1| hypothetical protein
            CARUB_v10013494mg [Capsella rubella]
          Length = 507

 Score =  218 bits (554), Expect = 8e-54
 Identities = 118/300 (39%), Positives = 173/300 (57%), Gaps = 2/300 (0%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            +K K+I  F++ + SVL++    +   D+   S+     PYLTLE++   K  L      
Sbjct: 204  EKVKSIWEFINNNPSVLKIFNCHSHGKDITISSEFEFVTPYLTLESIFKAKKTLAAMFAS 263

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                        E   ++ LV D SFW     VLK T PL++ L + + +++  +G+IY+
Sbjct: 264  SDWNKK------EAIAISTLVKDPSFWKTVERVLKCTSPLIRGLLLFSTANNQHVGYIYD 317

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
            TMD  KE I +E    +  Y PFW+ +D IWN++LH+PLH+AGYFLNP  FY  DF++D+
Sbjct: 318  TMDSIKECIAREFNYRKHSYKPFWDVLDEIWNKHLHNPLHSAGYFLNPGTFYSTDFHLDL 377

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGS-ANLHSYVSPVTWWLEYGK 721
            EV++GL+  ++ +      + +I  Q+DMY+     F   S A+  S +SP  WW +   
Sbjct: 378  EVATGLISSLLHMVQACHIQVKIATQLDMYRLGKECFNEASQADQISGMSPAEWWAQKAS 437

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAEN-LLSMRRNQIEQQRLTDLAFVHYNLQLQ 898
             +PELQ  A  ILSQTC GAS+YKLKRS+AE  LL+   +  EQ    +L +VHYNLQLQ
Sbjct: 438  HHPELQSFAFMILSQTCEGASRYKLKRSLAEKLLLTEGLSHREQHHQEELVYVHYNLQLQ 497


>ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa]
            gi|550335284|gb|ERP58729.1| hypothetical protein
            POPTR_0006s02210g [Populus trichocarpa]
          Length = 847

 Score =  207 bits (527), Expect = 1e-50
 Identities = 121/347 (34%), Positives = 193/347 (55%), Gaps = 8/347 (2%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            ++AK++TRFV+ +++VL L+R  TS  D+V+    RS   +  L+ M   K+NL+     
Sbjct: 474  EQAKSVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATNFTALKRMANFKLNLQTMVTS 533

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                         G  + +++ +RSFW+  IL+++ T PL++VL ++++   + +G+++ 
Sbjct: 534  QEWMDCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPLLQVLVIVSSEKRAAMGYVFS 593

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
             + +AKETI++EL   R  YM +W  ID+ W Q   +PLHAAG+F NP  FY  +  +  
Sbjct: 594  GIYRAKETIKKELVK-REDYMVYWNIIDHRWEQQWQTPLHAAGFFFNPKFFYSIEGDMHN 652

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGK 721
            ++ S +  CI RL  D   +D+I+K++ +Y+NA G  G   A      + P  WW  YG 
Sbjct: 653  KILSRMFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRARGTMLPTDWWSMYGG 712

Query: 722  EYPELQKLAIQILSQTCS--GASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQL 895
              P L +LAI+ILSQTCS  G S       +    +   RN +++QRLTDL FV YNL+L
Sbjct: 713  SCPNLARLAIRILSQTCSAIGCS----HNHIPFEKVHRTRNFLQRQRLTDLVFVQYNLRL 768

Query: 896  Q-----NFESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDL 1021
            +     N +    D IS  +++ +EDWI  N +  +  S    WM L
Sbjct: 769  RQMVDGNKKQIPEDPISFDDVSLVEDWITQN-ELCLEDSGSSDWMSL 814


>gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  204 bits (520), Expect = 7e-50
 Identities = 118/327 (36%), Positives = 183/327 (55%), Gaps = 4/327 (1%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            ++A++ITRFV+ H+ VL ++R  T  +D+V+P+   S   + TL+ M+  K NL+     
Sbjct: 366  EQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLKNNLQAMVTS 425

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                         G  + +LV++ SFW+ ++L+ + T PL++VL ++ +     +G++Y 
Sbjct: 426  QEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKKRPAMGYVYA 485

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
             M +AKETI++EL   R+ YM +W  ID+ W Q  H PLH AG++LNP  FY  +  +  
Sbjct: 486  GMYRAKETIKKELVK-RNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFYSMEGDMPN 544

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGK 721
            E+ SG+L CI +L  D + +D+I K+++ Y+N  G FG   A      + P  WW  YG 
Sbjct: 545  EMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLPAEWWSTYGG 604

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
              P L +LAI +LSQTCS       + S+    L   RN +EQQR  DL FV  NLQL+ 
Sbjct: 605  SCPNLARLAIHVLSQTCSTLG--LKQNSIPFEKLHETRNFLEQQRFRDLIFVQCNLQLRQ 662

Query: 902  FESCFTDDISIYEMN---QIEDWIGDN 973
                  + +S+  M+    IEDW+  N
Sbjct: 663  IGCESKEQVSMQPMSFDATIEDWVMGN 689


>gb|EEC81276.1| hypothetical protein OsI_24379 [Oryza sativa Indica Group]
          Length = 657

 Score =  204 bits (518), Expect = 1e-49
 Identities = 117/331 (35%), Positives = 174/331 (52%), Gaps = 6/331 (1%)
 Frame = +2

Query: 8    KAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXXX 187
            KA+ ITRF++ HA  + L        +++  S ++    ++TL  +V E++NL       
Sbjct: 327  KAREITRFIYSHAVPMELKGKYIQGGEILSSSNLKFVAMFITLGKLVSERINLVEMFSSP 386

Query: 188  XXXXXXXXXXXEGKMVTELV-ADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                         + V E+V  D +FW+ A  +LK T PL+ VL  +  +D+  +G +Y+
Sbjct: 387  EWASSDLASRSSFRHVYEVVKTDNAFWSAAADILKLTDPLITVLYKLE-ADNCPIGILYD 445

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
             MD AKE I+  L+D    Y   W  +D IW+ YLH+P+HAAGY LNP +FY   F  D 
Sbjct: 446  AMDCAKEDIKCNLRDKHGDY---WPMVDEIWDHYLHTPVHAAGYILNPRIFYTERFSYDT 502

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSANLHSYVSP-VTWWLEYGK 721
            E+ SG   C+ RL  +     ++  Q+D Y+  S  F   SA   +   P V WW  +G 
Sbjct: 503  EIKSGTNACVTRLAKNHYDPKKVAIQMDRYRRKSAPFDSDSAIQQTMEIPQVRWWSAHGT 562

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
            + PELQ  AI+ILSQTC GAS Y + RS++E L  ++R   EQ+R   + +VHYNL+L +
Sbjct: 563  DTPELQTFAIRILSQTCFGASIYNIDRSISEQLHVVKRTYPEQERFRTMEYVHYNLRLAH 622

Query: 902  FESCFTDDISIYE----MNQIEDWIGDNAQT 982
             E C        +     +Q+ DWI     T
Sbjct: 623  CEPCVRGASGAQQHSRLTSQLGDWISSGQTT 653


>gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao]
          Length = 750

 Score =  203 bits (517), Expect = 2e-49
 Identities = 119/346 (34%), Positives = 186/346 (53%), Gaps = 6/346 (1%)
 Frame = +2

Query: 5    KKAKTITRFVHGHASVLRLLRDQTSVHDLVKPSKIRSTMPYLTLENMVFEKVNLEXXXXX 184
            ++AK++TRFV+ H+ VL ++R  T  +D+V+P+  R    + TL+ M   K+ L+     
Sbjct: 370  EQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADLKLKLQAMVNS 429

Query: 185  XXXXXXXXXXXXEGKMVTELVADRSFWAGAILVLKATCPLVKVLDVMNNSDDSQLGHIYE 364
                         G ++ ++V +RSFW   IL+++   PL++VL+++ +   S +G++Y 
Sbjct: 430  QDWSECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSKKRSTMGYVYA 489

Query: 365  TMDQAKETIRQELKDLRSLYMPFWEAIDNIWNQYLHSPLHAAGYFLNPNLFYRNDFYVDV 544
             + +AKETI++EL   +  YM +W  ID+ W Q  H PL+AA +FLNP  FY  +  +  
Sbjct: 490  GIYRAKETIKKELVK-KDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFFYSIEGNIHN 548

Query: 545  EVSSGLLCCIVRLGGDQRTRDEIMKQVDMYQNASGAFGLGSA-NLHSYVSPVTWWLEYGK 721
            ++ S +  CI RL  D   +D+I++++ +Y+NA+G  G   A      + P  WW  YG 
Sbjct: 549  DILSSMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLPGEWWSMYGG 608

Query: 722  EYPELQKLAIQILSQTCSGASKYKLKRSVAENLLSMRRNQIEQQRLTDLAFVHYNLQLQN 901
              P LQ LAI+ILSQTCS       K S+ E  +   RN +E QRL+DL +V YNL L+ 
Sbjct: 609  GCPNLQHLAIRILSQTCSSIGSKPNKISIEE--IHDTRNFLEHQRLSDLVYVRYNLYLRQ 666

Query: 902  F-----ESCFTDDISIYEMNQIEDWIGDNAQTLVSPSDEPTWMDLD 1024
                  +    D +S       +DWI  NA           WM LD
Sbjct: 667  MVLRSQDKDSADPLSFNSKEIRDDWIAYNA-VCEEDYGSSDWMSLD 711

