BLASTX nr result

ID: Rheum21_contig00024761 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00024761
         (1006 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002530377.1| protein dimerization, putative [Ricinus comm...   240   7e-61
ref|XP_002310902.1| predicted protein [Populus trichocarpa]           235   2e-59
ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264...   233   7e-59
gb|EOY26199.1| HAT transposon superfamily protein, putative [The...   224   5e-56
ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589...   208   3e-51
ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247...   208   3e-51
ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247...   208   3e-51
ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251...   201   5e-49
ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250...   191   5e-46
ref|XP_002312861.1| predicted protein [Populus trichocarpa]           187   6e-45
ref|XP_006297473.1| hypothetical protein CARUB_v10013494mg [Caps...   177   7e-42
ref|NP_187909.1| hAT transposon superfamily protein [Arabidopsis...   175   3e-41
ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis...   168   3e-39
ref|XP_006366951.1| PREDICTED: uncharacterized protein LOC102590...   166   1e-38
ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580...   166   1e-38
ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260...   150   1e-33
emb|CAN67823.1| hypothetical protein VITISV_028004 [Vitis vinifera]   149   1e-33
ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Caps...   145   2e-32
emb|CBI29151.3| unnamed protein product [Vitis vinifera]              144   5e-32
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   141   3e-31

>ref|XP_002530377.1| protein dimerization, putative [Ricinus communis]
            gi|223530094|gb|EEF32010.1| protein dimerization,
            putative [Ricinus communis]
          Length = 698

 Score =  240 bits (612), Expect = 7e-61
 Identities = 122/253 (48%), Positives = 164/253 (64%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKRVA L+ D SFW    + ++AT+PL++VL LI     P VGFIYETMDQ        
Sbjct: 416  EGKRVAHLMGDLSFWTGAEMTLRATVPLLRVLCLIIEADKPQVGFIYETMDQAKETIKEE 475

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                K+ Y P+           LH PLHAAGY+LNP   Y+ DF++DPEV+ GLL  IVR
Sbjct: 476  FRNKKSQYVPFWEIIDEIWDTHLHSPLHAAGYYLNPSLFYSTDFYSDPEVSFGLLCCIVR 535

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            MV+D RTQDL++ QL+ YR  +G F+ G+   K+ +I PA WWS +G   P+LQ  A +I
Sbjct: 536  MVQDPRTQDLISLQLDEYRHARGAFKEGSAINKRTNISPAQWWSIYGKQHPELQNFAIKI 595

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+C+GA  +GLK+ +AEKLL N GRN  EQQ+L  L Y+HYN++L+  + G +    A
Sbjct: 596  LSQTCDGAMKFGLKRGLAEKLLLN-GRNCNEQQRLDELTYVHYNLHLQNTQFGVEGGLGA 654

Query: 278  GEIDPKCDWIMDE 240
             EIDP  DW++D+
Sbjct: 655  EEIDPMDDWVVDK 667


>ref|XP_002310902.1| predicted protein [Populus trichocarpa]
          Length = 705

 Score =  235 bits (600), Expect = 2e-59
 Identities = 119/254 (46%), Positives = 162/254 (63%)
 Frame = -3

Query: 1001 VKGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXX 822
            V+G RVA L+ D SFW    +  KAT+PL++VL L+N    P VGFIYETMDQV      
Sbjct: 415  VEGMRVAHLVGDHSFWSGAEMASKATVPLLRVLCLVNEGDKPQVGFIYETMDQVKETIKK 474

Query: 821  XXXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIV 642
                 K+ Y P+           LH PLHAAGY+LNP   Y+ DF++DPEV  GLL  +V
Sbjct: 475  EFKNKKSDYTPFWTAIDDIWDTRLHSPLHAAGYYLNPCLFYSSDFYSDPEVTFGLLCCVV 534

Query: 641  RMVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATR 462
            RMV D RTQ  +T QL+ YR  +G F+ G    K+ +I PA WW ++G  CP+LQ+ A R
Sbjct: 535  RMVADQRTQLKITFQLDEYRHARGAFQEGKAIVKRTNISPAQWWCTYGKQCPELQRFAVR 594

Query: 461  ILSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGE 282
            ILSQ+C+GA  YGLK+ +AEKLL +  RN  EQQ+L +L ++HYN+ ++ ++ G + +  
Sbjct: 595  ILSQTCDGASRYGLKRSMAEKLLTD-RRNPIEQQRLRDLTFVHYNLQVQNKRSGFRSDVI 653

Query: 281  AGEIDPKCDWIMDE 240
            + EIDP  D ++DE
Sbjct: 654  SEEIDPMDDRVVDE 667


>ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264734 [Vitis vinifera]
          Length = 714

 Score =  233 bits (595), Expect = 7e-59
 Identities = 119/253 (47%), Positives = 161/253 (63%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKRVADL+ D +FW    +V+KATIPLV+VLS ING   P +G+IY+TMDQ        
Sbjct: 420  EGKRVADLVVDPAFWTGAIMVLKATIPLVRVLSWINGSDKPQMGYIYDTMDQAKEAIAKE 479

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                K+ Y P+           L+ PLH+ GY+LNP F Y+ DF  D EVA G+L  IVR
Sbjct: 480  FKDKKSQYMPFWEVIDEIWNKHLYSPLHSTGYYLNPHFFYSSDFHCDAEVASGILCCIVR 539

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            MV DL  QD++  QL+ Y   +G F +G+  +++ +IPP  WWS +G   P+ Q+ ATRI
Sbjct: 540  MVPDLHVQDVIGLQLDKYLWTEGAFAQGSAFDQRTNIPPVLWWSHYGRQHPEFQRFATRI 599

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+C+GA  Y LKK +AEKLL   GRN  EQQ+LS+L ++HYN++L+  K     +   
Sbjct: 600  LSQTCDGASRYELKKSLAEKLLMK-GRNPIEQQRLSDLIFLHYNLHLQGFKSRLNADIVL 658

Query: 278  GEIDPKCDWIMDE 240
             EIDP  DWI++E
Sbjct: 659  EEIDPMDDWIVEE 671


>gb|EOY26199.1| HAT transposon superfamily protein, putative [Theobroma cacao]
          Length = 709

 Score =  224 bits (570), Expect = 5e-56
 Identities = 114/256 (44%), Positives = 156/256 (60%), Gaps = 3/256 (1%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKRVADL+ D SFW     VVK  +PL++VL LINGD  P +G+IYETMDQ+       
Sbjct: 419  EGKRVADLVGDPSFWKGAGRVVKTALPLIRVLCLINGDDKPQMGYIYETMDQMKETIKKE 478

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                ++ Y P+           LH PLHAAG+FLNP   Y+ DF +D EVA GLL  +VR
Sbjct: 479  CNSKESQYMPFWELIDKIWDGHLHSPLHAAGHFLNPSLFYSTDFQSDSEVAFGLLCCMVR 538

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            M++    QD + +QLE YR  +G F  G+  +++       WWS++G  CP+LQ+ ATRI
Sbjct: 539  MIQSQPIQDKIVQQLEAYRNSEGAFGEGSTVQQRTRFSSTMWWSTYGGRCPELQRFATRI 598

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQR---KLGGKRE 288
            LSQ+C GA  Y L + + EKLL   GRN  EQQ LS+L ++HYN+ L+Q+   + G   +
Sbjct: 599  LSQTCVGASKYRLNRSLVEKLLTK-GRNPVEQQLLSDLIFVHYNLQLQQQQRSQFGVNYD 657

Query: 287  GEAGEIDPKCDWIMDE 240
                EID   +WI+D+
Sbjct: 658  IAGDEIDAMDEWIVDD 673


>ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589543 isoform X1 [Solanum
            tuberosum] gi|565402986|ref|XP_006366949.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X2 [Solanum
            tuberosum] gi|565402988|ref|XP_006366950.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X3 [Solanum
            tuberosum]
          Length = 686

 Score =  208 bits (529), Expect = 3e-51
 Identities = 107/258 (41%), Positives = 154/258 (59%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKR+++++ D+SFW    + VKATIPLV+V+ L++G   P VGFIY+T+DQ        
Sbjct: 388  EGKRISNMVKDESFWSEALMAVKATIPLVEVMKLLDGTNKPQVGFIYDTLDQAKETIKKE 447

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                K+ Y  +           LH  LHAAGYFLNP   Y+ DF+ D EV+ GL   +VR
Sbjct: 448  FQDKKSLYAKFWIAIDDIWDEYLHSHLHAAGYFLNPTLFYSSDFYTDVEVSCGLCCCVVR 507

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            M ED   QDL+T Q++ YR  +G F  G+  +K ++I PA WWS +G   P+LQ+LA RI
Sbjct: 508  MAEDRHIQDLITLQIDEYRMGRGTFHFGSFKDKLSNISPALWWSQYGVQFPELQRLAVRI 567

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+CNGA  Y LK+ + E L    G N  E+Q+L +L ++H N+ L+     G  +   
Sbjct: 568  LSQTCNGASHYRLKRSLVETLHTE-GMNPIEKQRLQDLVFVHCNLQLQAFDPDGSND-NT 625

Query: 278  GEIDPKCDWIMDEEG*LI 225
              +DP  +WI+ +E  L+
Sbjct: 626  DYVDPMDEWIVGKEPNLV 643


>ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247551 isoform 2 [Solanum
            lycopersicum]
          Length = 682

 Score =  208 bits (529), Expect = 3e-51
 Identities = 103/258 (39%), Positives = 152/258 (58%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKR+++++ ++SFW    + VKATIPLVKV+ L+NG   P +GFIY+T+DQ+       
Sbjct: 388  EGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLNGTNKPQIGFIYDTLDQIKVTIKKE 447

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                ++ Y  +           LH  LHAAGYFLNP + Y+ DF+AD EV  GL   +VR
Sbjct: 448  FQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNPIYFYSSDFYADAEVTSGLCCCVVR 507

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            M ED   QDL+  Q++ YR  +  F  G+  EK  +I PA WWS +G   P++Q+ A R+
Sbjct: 508  MTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLINISPALWWSQYGVQYPEIQRFAFRL 567

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+CNGA  Y LK+ + E L    G N  E+Q+L +L ++H N+ L+     G  +   
Sbjct: 568  LSQTCNGASHYRLKRSLVETLHTE-GMNPIEKQRLQDLVFVHCNLQLQAFDPDGSNDNTD 626

Query: 278  GEIDPKCDWIMDEEG*LI 225
              +DP  +WI+ +E  L+
Sbjct: 627  YVVDPMDEWIVRKEPNLV 644


>ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247551 isoform 1 [Solanum
            lycopersicum]
          Length = 692

 Score =  208 bits (529), Expect = 3e-51
 Identities = 103/258 (39%), Positives = 152/258 (58%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKR+++++ ++SFW    + VKATIPLVKV+ L+NG   P +GFIY+T+DQ+       
Sbjct: 398  EGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLNGTNKPQIGFIYDTLDQIKVTIKKE 457

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                ++ Y  +           LH  LHAAGYFLNP + Y+ DF+AD EV  GL   +VR
Sbjct: 458  FQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNPIYFYSSDFYADAEVTSGLCCCVVR 517

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            M ED   QDL+  Q++ YR  +  F  G+  EK  +I PA WWS +G   P++Q+ A R+
Sbjct: 518  MTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLINISPALWWSQYGVQYPEIQRFAFRL 577

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+CNGA  Y LK+ + E L    G N  E+Q+L +L ++H N+ L+     G  +   
Sbjct: 578  LSQTCNGASHYRLKRSLVETLHTE-GMNPIEKQRLQDLVFVHCNLQLQAFDPDGSNDNTD 636

Query: 278  GEIDPKCDWIMDEEG*LI 225
              +DP  +WI+ +E  L+
Sbjct: 637  YVVDPMDEWIVRKEPNLV 654


>ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251332 [Vitis vinifera]
          Length = 709

 Score =  201 bits (510), Expect = 5e-49
 Identities = 109/253 (43%), Positives = 144/253 (56%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GKRVAD++ D SFW    +V+K TIPLV VL  I   G   + +IYETMD V       
Sbjct: 418  EGKRVADIVLDPSFWSGAEMVLKPTIPLVGVLCSIIRGGKGQMCYIYETMDAVKEDIAEE 477

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                ++ Y P+           LH  LHAA   LNP   Y+ D+  D EV  G+   I  
Sbjct: 478  FENNESQYMPFWELIDEIWNNHLHSALHAAANHLNPAIFYSRDYNFDKEVFEGINCCIEH 537

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            MV D   Q+ +  QLE Y+  +G F  G   E++N   PA WWS++G HCP+LQKLATRI
Sbjct: 538  MVPDEHIQNEIWLQLEQYKDAEGDFGLGKATERRNIFHPALWWSNYGGHCPELQKLATRI 597

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+C+GA  Y LK+ +AE LL   GRN   Q +L +L ++HYN++L+        + E 
Sbjct: 598  LSQTCDGASRYKLKRSLAENLLAK-GRNPIGQGRLCDLTFVHYNLHLRNADWSTDTDHEF 656

Query: 278  GEIDPKCDWIMDE 240
            GEIDP  DWI+ E
Sbjct: 657  GEIDPMNDWIVWE 669


>ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250835 [Solanum
            lycopersicum]
          Length = 640

 Score =  191 bits (484), Expect = 5e-46
 Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 27/279 (9%)
 Frame = -3

Query: 1001 VKGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXX 822
            ++GKR+++++ D+SFW    + VKATIPLV+V+ L++    P VGFIY+T+DQ       
Sbjct: 317  IEGKRMSEMVEDRSFWTEGLMAVKATIPLVEVIKLLDCTNKPQVGFIYDTLDQAKETIKK 376

Query: 821  XXXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIV 642
                 ++ Y  +            H  LHA GYFLNP   Y+ +F+ D EV  GL   +V
Sbjct: 377  EFRHKRSHYARFWKAIDDIWDEYFHSHLHAVGYFLNPTLFYSSNFYTDVEVTCGLCCCVV 436

Query: 641  RMVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPP-------------------- 522
            RM ED   Q L+T+Q++ YR  +G F  G+  +K ++I P                    
Sbjct: 437  RMTEDRHIQHLITQQIDEYRKGRGTFHFGSFKDKLSNISPGGIIYTFSAILIMLTYNSYI 496

Query: 521  -------AAWWSSFGSHCPDLQKLATRILSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQ 363
                   A WWS +G  CP+LQ+ A RILSQ+CNGA  Y LK+++ E LL   G N  E+
Sbjct: 497  NLYVMVAALWWSQYGGQCPELQRFAVRILSQTCNGASHYRLKRNLVETLLTE-GMNLIEK 555

Query: 362  QQLSNLAYIHYNMNLKQRKLGGKREGEAGEIDPKCDWIM 246
            Q+L +L ++H N+ L+     G  +     +DP  +WI+
Sbjct: 556  QRLQDLVFVHCNLQLQAFDPDGSNDDTDNVVDPMDEWIV 594


>ref|XP_002312861.1| predicted protein [Populus trichocarpa]
          Length = 621

 Score =  187 bits (475), Expect = 6e-45
 Identities = 98/240 (40%), Positives = 134/240 (55%), Gaps = 5/240 (2%)
 Frame = -3

Query: 1001 VKGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXX 822
            V+GK+ A L+   SFW R  +  KAT  L++V+  I+ D  P +GFIYETMDQ+      
Sbjct: 364  VEGKKAAGLVKSSSFWKRAGMASKATTALIRVVDKISADNKPSIGFIYETMDQIKEAIQY 423

Query: 821  XXXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIV 642
                 K+ + P            LH PLHAA Y+LNP F Y  +F  D EV+ GL  +++
Sbjct: 424  EFRDSKSGHIPLWELIDEIWDDFLHSPLHAAAYYLNPTFFYNRNFHLDTEVSSGLQCSVI 483

Query: 641  RMVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATR 462
            RM  D R Q L+ KQ   Y    G F  G    + N+  P  WWS +G+ CP+LQKLA R
Sbjct: 484  RMENDQRIQYLINKQAAQYCRADGDFENGYAEGEINNAHPDLWWSVYGNRCPELQKLAIR 543

Query: 461  ILSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNL-----KQRKLGG 297
            ILSQ+C+G+  Y L + +AEKL+C     H EQ +L +  ++ YN+ L     K+RK GG
Sbjct: 544  ILSQTCDGSGRYSLDRSLAEKLVCKEQNQH-EQHRLRDQMFVRYNLQLEEANNKKRKAGG 602


>ref|XP_006297473.1| hypothetical protein CARUB_v10013494mg [Capsella rubella]
           gi|482566182|gb|EOA30371.1| hypothetical protein
           CARUB_v10013494mg [Capsella rubella]
          Length = 507

 Score =  177 bits (448), Expect = 7e-42
 Identities = 89/225 (39%), Positives = 129/225 (57%)
 Frame = -3

Query: 986 VADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXXXXX 807
           ++ L+ D SFW  +  V+K T PL++ L L +   + HVG+IY+TMD +           
Sbjct: 274 ISTLVKDPSFWKTVERVLKCTSPLIRGLLLFSTANNQHVGYIYDTMDSIKECIAREFNYR 333

Query: 806 KAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRMVED 627
           K  YKP+           LH PLH+AGYFLNP   Y+ DF  D EVA GL+++++ MV+ 
Sbjct: 334 KHSYKPFWDVLDEIWNKHLHNPLHSAGYFLNPGTFYSTDFHLDLEVATGLISSLLHMVQA 393

Query: 626 LRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRILSQS 447
              Q  +  QL++YR  +  F   ++ ++ + + PA WW+   SH P+LQ  A  ILSQ+
Sbjct: 394 CHIQVKIATQLDMYRLGKECFNEASQADQISGMSPAEWWAQKASHHPELQSFAFMILSQT 453

Query: 446 CNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQ 312
           C GA  Y LK+ +AEKLL   G +H EQ     L Y+HYN+ L+Q
Sbjct: 454 CEGASRYKLKRSLAEKLLLTEGLSHREQHHQEELVYVHYNLQLQQ 498


>ref|NP_187909.1| hAT transposon superfamily protein [Arabidopsis thaliana]
           gi|79313211|ref|NP_001030685.1| hAT transposon
           superfamily protein [Arabidopsis thaliana]
           gi|238479754|ref|NP_001154612.1| hAT transposon
           superfamily protein [Arabidopsis thaliana]
           gi|15795135|dbj|BAB02513.1| transposase-like protein
           [Arabidopsis thaliana] gi|28393338|gb|AAO42094.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827476|gb|AAO50582.1| unknown protein [Arabidopsis
           thaliana] gi|222424407|dbj|BAH20159.1| AT3G13030
           [Arabidopsis thaliana] gi|332641757|gb|AEE75278.1| hAT
           transposon superfamily protein [Arabidopsis thaliana]
           gi|332641758|gb|AEE75279.1| hAT transposon superfamily
           protein [Arabidopsis thaliana]
           gi|332641759|gb|AEE75280.1| hAT transposon superfamily
           protein [Arabidopsis thaliana]
          Length = 544

 Score =  175 bits (443), Expect = 3e-41
 Identities = 85/227 (37%), Positives = 132/227 (58%)
 Frame = -3

Query: 986 VADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXXXXX 807
           +++L+ D SFW+ +  V+K T PL+  L L +   + H+G++Y+TMD +           
Sbjct: 306 ISNLVSDSSFWETVESVLKCTSPLIHGLLLFSTANNQHLGYVYDTMDSIKESIAREFNHK 365

Query: 806 KAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRMVED 627
              YKP            LH PLHAAGYFLNP   Y+ +F  D EV  GL+++++ MVED
Sbjct: 366 PQFYKPLWDVIDDVWNKHLHNPLHAAGYFLNPTAFYSTNFHLDIEVVTGLISSLIHMVED 425

Query: 626 LRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRILSQS 447
              Q  ++ Q+++YR  +  F   ++ ++   I PA WW+   S  P+LQ LA +ILSQ+
Sbjct: 426 CHVQFKISTQIDMYRLGKDCFNEASQADQITGISPAEWWAHKASQYPELQSLAIKILSQT 485

Query: 446 CNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRK 306
           C GA  Y LK+ +AEKLL + G ++ E+Q L  L ++ YN++L+  K
Sbjct: 486 CEGASKYKLKRSLAEKLLLSEGMSNRERQHLDELVFVQYNLHLQSYK 532


>ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis thaliana]
            gi|15795134|dbj|BAB02512.1| transposase-like protein
            [Arabidopsis thaliana] gi|332641756|gb|AEE75277.1| hAT
            transposon superfamily protein [Arabidopsis thaliana]
          Length = 605

 Score =  168 bits (425), Expect = 3e-39
 Identities = 88/232 (37%), Positives = 132/232 (56%), Gaps = 1/232 (0%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLI-NGDGDPHVGFIYETMDQVXXXXXX 822
            +GK V++L++D SFW+ +  ++K T PL   L L  N D + HVG+IY+T+D +      
Sbjct: 375  EGKSVSNLVNDSSFWEAVEEILKCTSPLTDGLRLFSNADNNQHVGYIYDTLDGIKLSIKK 434

Query: 821  XXXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIV 642
                 K  Y              LH PLHAAGY+LNP   Y+ DF  DPEV+ GL  ++V
Sbjct: 435  EFNDEKKHYLTLWDVIDDVWNKHLHNPLHAAGYYLNPTSFYSTDFHLDPEVSSGLTHSLV 494

Query: 641  RMVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATR 462
             + ++   Q  +  QL+ YR  +  F   ++ ++ + I P  WW+   S  P+LQ  A +
Sbjct: 495  HVAKE--GQIKIASQLDRYRLGKDCFNEASQPDQISGISPIDWWTEKASQHPELQSFAIK 552

Query: 461  ILSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRK 306
            ILSQ+C GA  Y LK+ +AEKLL   G +H E++ L  LA++HYN++L+  K
Sbjct: 553  ILSQTCEGASRYKLKRSLAEKLLLTEGMSHCERKHLEELAFVHYNLHLQSCK 604


>ref|XP_006366951.1| PREDICTED: uncharacterized protein LOC102590309 [Solanum tuberosum]
          Length = 507

 Score =  166 bits (420), Expect = 1e-38
 Identities = 89/251 (35%), Positives = 139/251 (55%)
 Frame = -3

Query: 998 KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
           +GK +++++ D+SFW    + VKATIPLV+V+  +NG     VGFI++T+DQ        
Sbjct: 244 EGKGMSEMIKDESFWTEALMAVKATIPLVEVIKFLNGTNKAQVGFIHDTLDQAKETVRKE 303

Query: 818 XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
               +  +              LH PLH AGY+LNP F ++ ++  + +++ GL + I  
Sbjct: 304 FERTRFCHAKIWNAIDDTWNKYLHSPLHDAGYYLNPTFFHSSNWCLNVKISDGLCSCITG 363

Query: 638 MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
           M ED R +DL+T+Q+       G F   +  E  + I P  WWS +    P+L++LA RI
Sbjct: 364 MAEDRRIKDLITQQI-------GTFDFLSSKEILSDISPGHWWSKYEVEFPELERLAVRI 416

Query: 458 LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
           LSQ+CNGA  Y LK+ + E  L   GRN  EQQ+LS+L ++H N+ L+     G+ +   
Sbjct: 417 LSQTCNGASHYRLKRSLVE-TLHRKGRNQIEQQRLSDLVFVHCNLQLQAFDPEGENDIAE 475

Query: 278 GEIDPKCDWIM 246
             +D   +WI+
Sbjct: 476 DVVDSMDEWIV 486


>ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580052 [Solanum tuberosum]
          Length = 586

 Score =  166 bits (420), Expect = 1e-38
 Identities = 89/251 (35%), Positives = 139/251 (55%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +GK +++++ D+SFW    + VKATIPLV+V+  +NG     VGFI++T+DQ        
Sbjct: 323  EGKGMSEMIKDESFWTEALMAVKATIPLVEVIKFLNGTNKAQVGFIHDTLDQAKETIRKE 382

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                +  +              LH PLH AGY+LNP F ++ ++  + +++ GL + I  
Sbjct: 383  FKSTRFCHAKIWNAIDDTWNKYLHSPLHDAGYYLNPTFFHSSNWCLNVKISDGLCSCITG 442

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            M ED R +DL+T+Q+       G F   +  E  + I P  WWS +    P+L++LA RI
Sbjct: 443  MAEDRRIKDLITQQI-------GTFDFLSSKEILSDISPGHWWSKYEVEFPELERLAVRI 495

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ+CNGA  Y LK+ + E  L   GRN  EQQ+LS+L ++H N+ L+     G+ +   
Sbjct: 496  LSQTCNGASHYRLKRSLVE-TLHRKGRNQIEQQRLSDLVFVHCNLQLQAFDPEGENDIAE 554

Query: 278  GEIDPKCDWIM 246
              +D   +WI+
Sbjct: 555  DVVDSMDEWIV 565


>ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260844 [Vitis vinifera]
          Length = 758

 Score =  150 bits (378), Expect = 1e-33
 Identities = 87/264 (32%), Positives = 128/264 (48%), Gaps = 13/264 (4%)
 Frame = -3

Query: 995  GKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXX 816
            G  VA+++ D +FW      +K + PL+ VL LI+ +  P VG+IY+ M++         
Sbjct: 446  GVEVAEIIVDPTFWSMCDRALKVSKPLLAVLHLIDCEERPSVGYIYDAMEKAKKSIILAF 505

Query: 815  XXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRM 636
               ++ Y PY            H PLHAA Y+LNP   Y   F  +  +  GLL  I  +
Sbjct: 506  DDKESDYSPYLKIIDCIWKEEFHSPLHAAAYYLNPSIFYNPSFSTNKVIQKGLLDCIESL 565

Query: 635  VEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRIL 456
              +L TQ ++T  +  Y    G F R      + S+ PA WWS + +  PDLQ+LA RIL
Sbjct: 566  EPNLSTQVMITSHINYYEEAVGDFSRPVALRGRESLAPATWWSLYAADYPDLQRLAVRIL 625

Query: 455  SQSCNGAE---SYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQ-RKLGGKRE 288
            SQ+C+      S+ + + V  K      RN  E Q+LS+L ++HYN+ L++ R    K  
Sbjct: 626  SQTCSVTRCETSWSMSERVHSK-----QRNRLEHQRLSDLIFVHYNLRLQEKRSESSKGR 680

Query: 287  GEAGEIDPKC---------DWIMD 243
               G  DP C         DW+ D
Sbjct: 681  CMRGTFDPTCLEAIDANMEDWVED 704


>emb|CAN67823.1| hypothetical protein VITISV_028004 [Vitis vinifera]
          Length = 896

 Score =  149 bits (377), Expect = 1e-33
 Identities = 87/262 (33%), Positives = 129/262 (49%), Gaps = 8/262 (3%)
 Frame = -3

Query: 995  GKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXX 816
            G  VA+++ D +FW      +K + PL+ VL LI+ +  P VG+IY+ M++         
Sbjct: 493  GVEVAEIIVDPTFWSMCDRALKVSKPLLAVLHLIDCEERPSVGYIYDAMEKAKKSIILAF 552

Query: 815  XXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRM 636
               ++ Y PY            H PLHAA Y+LNP   Y   F  +  +  GLL  I  +
Sbjct: 553  DDKESDYSPYLKIIDCIWKEEFHSPLHAAAYYLNPSIFYNPSFSTNKVIQKGLLDCIESL 612

Query: 635  VEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRIL 456
              +L TQ ++T  +  Y    G F R      + S+ PA WWS + +  PDLQ+LA RIL
Sbjct: 613  EPNLSTQVMITSHINYYEEAVGDFSRPVALRGRESLAPATWWSLYAADYPDLQRLAVRIL 672

Query: 455  SQSCNGAE---SYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLG----- 300
            SQ+C+      S+ + + V  K      RN  E Q+LS+L ++HYN+ L+++  G     
Sbjct: 673  SQTCSVTRCETSWSMSERVHSK-----QRNRLEHQRLSDLXFVHYNLRLQEKVKGLSYPR 727

Query: 299  GKREGEAGEIDPKCDWIMDEEG 234
            G  EGE  E+   C     E G
Sbjct: 728  GFFEGEGVEVKDVCGGCKVEAG 749


>ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Capsella rubella]
            gi|482567927|gb|EOA32116.1| hypothetical protein
            CARUB_v10015366mg [Capsella rubella]
          Length = 596

 Score =  145 bits (366), Expect = 2e-32
 Identities = 75/224 (33%), Positives = 122/224 (54%)
 Frame = -3

Query: 986  VADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXXXXX 807
            V  L+ D SFW+ +  VV+ T  LV  L  ++   + HVG++Y+ ++ +           
Sbjct: 369  VLSLVSDSSFWESVERVVRCTSALVHGLLRLSTANNMHVGYVYDILNSIKLSTALNFKNE 428

Query: 806  KAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRMVED 627
            K  Y+P            L+ PLH AGYFLNP   Y+ +F    EV  GL  ++V MV++
Sbjct: 429  KQIYQPIWDVVDDVWKHHLYNPLHGAGYFLNPTAYYSGNFHLSQEVYTGLTFSMVHMVKE 488

Query: 626  LRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRILSQS 447
             R Q  +  Q+ +YR  +  F   ++ ++ + I P  WW+  G    +L+  A +ILSQ+
Sbjct: 489  ARLQVTIAAQIGMYRLGKSCFNEASQADQISGIFPVDWWTQNGGQHAELKSFAVKILSQT 548

Query: 446  CNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLK 315
            C GA  Y LK+ +AEKLL   G +H E++ +  +A++HYN++L+
Sbjct: 549  CEGAWKYKLKRGLAEKLLLTEGMSHCEKKHVEEMAFVHYNLHLQ 592


>emb|CBI29151.3| unnamed protein product [Vitis vinifera]
          Length = 718

 Score =  144 bits (363), Expect = 5e-32
 Identities = 78/232 (33%), Positives = 119/232 (51%), Gaps = 3/232 (1%)
 Frame = -3

Query: 995  GKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXXX 816
            G  VA+++ D +FW      +K + PL+ VL LI+ +  P VG+IY+ M++         
Sbjct: 492  GVEVAEIIVDPTFWSMCDRALKVSKPLLAVLHLIDCEERPSVGYIYDAMEKAKKSIILAF 551

Query: 815  XXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVRM 636
               ++ Y PY            H PLHAA Y+LNP   Y   F  +  +  GLL  I  +
Sbjct: 552  DDKESDYSPYLKIIDCIWKEEFHSPLHAAAYYLNPSIFYNPSFSTNKVIQKGLLDCIESL 611

Query: 635  VEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRIL 456
              +L TQ ++T  +  Y    G F R      + S+ PA WWS + +  PDLQ+LA RIL
Sbjct: 612  EPNLSTQVMITSHINYYEEAVGDFSRPVALRGRESLAPATWWSLYAADYPDLQRLAVRIL 671

Query: 455  SQSCNGAE---SYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQR 309
            SQ+C+      S+ + + V  K      RN  E Q+LS+L ++HYN+ L+++
Sbjct: 672  SQTCSVTRCETSWSMSERVHSK-----QRNRLEHQRLSDLIFVHYNLRLQEK 718


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
            gi|223536481|gb|EEF38128.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 753

 Score =  141 bits (356), Expect = 3e-31
 Identities = 81/262 (30%), Positives = 137/262 (52%), Gaps = 8/262 (3%)
 Frame = -3

Query: 998  KGKRVADLLHDKSFWDRITVVVKATIPLVKVLSLINGDGDPHVGFIYETMDQVXXXXXXX 819
            +G  + DLL ++SFW    ++   T PL+++L +++    P +G++Y  + +        
Sbjct: 446  RGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVSSKKRPPMGYVYAGIYRAKEAIKKE 505

Query: 818  XXXXKAPYKPYXXXXXXXXXXILHIPLHAAGYFLNPRFLYTEDFFADPEVAGGLLATIVR 639
                K  Y  Y            ++PLHAAG+FLNP+ LY+ +     E+  G+   I +
Sbjct: 506  LVKRK-DYMVYWNIIDHWWEQQSNLPLHAAGFFLNPKVLYSIEGDLHNEILSGMFDCIEK 564

Query: 638  MVEDLRTQDLVTKQLEVYRGCQGPFRRGTENEKQNSIPPAAWWSSFGSHCPDLQKLATRI 459
            +V D+  QD +TK++  Y+   G F R      + ++ PA WWS++G  CP+L +LA R+
Sbjct: 565  LVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETLLPAEWWSTYGGSCPNLARLAIRV 624

Query: 458  LSQSCNGAESYGLKKDVAEKLLCNGGRNHAEQQQLSNLAYIHYNMNLKQRKLGGKREGEA 279
            LSQ C+   S+G K +       +  +N  E+Q+LS+L ++ YN+ LKQ  + GK E E 
Sbjct: 625  LSQPCS---SFGYKLNHISLEQIHDTKNCLERQRLSDLVFVQYNLRLKQ--MVGKSE-EQ 678

Query: 278  GEIDPKC--------DWIMDEE 237
              +DP          DWI +++
Sbjct: 679  DSVDPLSFDCISILEDWIKEKD 700


Top