BLASTX nr result

ID: Zingiber23_contig00017346

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00017346
         (2194 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264...   493   e-136
ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247...   458   e-126
ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589...   456   e-125
ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247...   455   e-125
ref|XP_002310902.1| predicted protein [Populus trichocarpa]           453   e-124
gb|EOY26199.1| HAT transposon superfamily protein, putative [The...   447   e-123
ref|XP_002530377.1| protein dimerization, putative [Ricinus comm...   444   e-121
ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251...   432   e-118
ref|XP_006656455.1| PREDICTED: uncharacterized protein LOC102710...   394   e-107
ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250...   359   2e-96
ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis...   345   4e-92
gb|EEC81276.1| hypothetical protein OsI_24379 [Oryza sativa Indi...   340   2e-90
ref|NP_001058504.1| Os06g0704000 [Oryza sativa Japonica Group] g...   338   7e-90
ref|XP_004966349.1| PREDICTED: uncharacterized protein LOC101752...   327   1e-86
ref|XP_002437551.1| hypothetical protein SORBIDRAFT_10g029230 [S...   322   4e-85
ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580...   322   6e-85
ref|XP_002512206.1| DNA binding protein, putative [Ricinus commu...   310   1e-81
emb|CAN78444.1| hypothetical protein VITISV_016801 [Vitis vinifera]   306   2e-80
gb|EOY18075.1| HAT and BED zinc finger domain-containing protein...   302   3e-79
ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250...   300   2e-78

>ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264734 [Vitis vinifera]
          Length = 714

 Score =  493 bits (1269), Expect = e-136
 Identities = 263/658 (39%), Positives = 379/658 (57%), Gaps = 1/658 (0%)
 Frame = -3

Query: 1988 IEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLL 1809
            + +HG  +D +K RVQC YC K ++GF+RL++HL  V  DVT C EVP  VK  MK  LL
Sbjct: 10   VHDHGKVVDQQKNRVQCNYCAKLMSGFSRLRYHLGCVKGDVTPCGEVPENVKELMKTKLL 69

Query: 1808 EKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFK 1629
            E ++    KEVG +E+P+LP KR + PS      R KL T   +G  S +          
Sbjct: 70   ELKRGSLGKEVGTLEYPDLPWKRKWYPSPSAIEHR-KLQTTQKAGSDSRKD--------- 119

Query: 1628 LKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGRF 1449
                  Q+   + N +  +++   NG   S    ++      +E +D  +  A +CIGRF
Sbjct: 120  -----VQKDTVSENGVTKEVS-LPNGRRGSQKVEDH------KEREDSSSRQAKKCIGRF 167

Query: 1448 FFDAGIDTTNINLPSFQAMIDAVICCGS-GYSAPGLDELKGVIXXXXXXXXXXXXXXXKQ 1272
            F++ G D +    PSFQ MI A + CG  GY  P   ELKG I               + 
Sbjct: 168  FYELGTDLSAATSPSFQRMITAALGCGQIGYKLPSCQELKGWILKEEVKEMQQYVKDVRN 227

Query: 1271 SWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXX 1092
            SW  TGCSILLDGW D++GR+ ++ L  CP GTI++R                 I     
Sbjct: 228  SWANTGCSILLDGWMDEKGRNLINVLADCPKGTIYIRSCDISAFIADVDALQFFIEQIIE 287

Query: 1091 XXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVL 912
                           SD +  AG+R+ME++R++FWT+ A YCI ++L+KI M+D ++ +L
Sbjct: 288  EVGVENVVQIITYSISDCMAAAGQRLMEKFRTVFWTVSASYCIELMLEKIGMMDPIRGIL 347

Query: 911  DDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNS 732
            D AKAI++FIHS+   L  +R Y   ++LVK S +K   PF+TL+N++S ++ L  +F S
Sbjct: 348  DKAKAITKFIHSHATVLKLMRNYTSANTLVKPSKIKLAKPFLTLENIVSEKDNLQNMFVS 407

Query: 731  PGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYN 552
             GW++   AS+  GK ++ +V D +FW  A+ VLK T PL+ +L  I+  D   MG +Y+
Sbjct: 408  SGWNSLIWASREEGKRVADLVVDPAFWTGAIMVLKATIPLVRVLSWINGSDKPQMGYIYD 467

Query: 551  SLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDA 372
            ++D AKE I +    ++++Y  +W ++D++WN +L+SPLHS GYYLNP  FYSSDF+ DA
Sbjct: 468  TMDQAKEAIAKEFKDKKSQYMPFWEVIDEIWNKHLYSPLHSTGYYLNPHFFYSSDFHCDA 527

Query: 371  EVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGG 192
            EV +G++ C+V+M  D   Q+ + +QLDKY   EG F+   A D R    P +WWS +G 
Sbjct: 528  EVASGILCCIVRMVPDLHVQDVIGLQLDKYLWTEGAFAQGSAFDQRTNIPPVLWWSHYGR 587

Query: 191  HCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
               E QR A +ILSQTC GASRY LKK+++EKL  + R+  EQQR  DL F+HYN  L
Sbjct: 588  QHPEFQRFATRILSQTCDGASRYELKKSLAEKLLMKGRNPIEQQRLSDLIFLHYNLHL 645


>ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247551 isoform 1 [Solanum
            lycopersicum]
          Length = 692

 Score =  458 bits (1179), Expect = e-126
 Identities = 243/670 (36%), Positives = 372/670 (55%)
 Frame = -3

Query: 2027 FLIADMSTSTGQTIEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEV 1848
            F + D  T     I +HG P+D +K +V+C YC K V+GF+RLK HL  +  DVT C + 
Sbjct: 5    FAVEDQMTRDKIDIRQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLKT 64

Query: 1847 PSGVKARMKDLLLEKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEG 1668
            P  VK  ++  +L K+ E  +K+VG+++HP LPLKRN+ P   + +              
Sbjct: 65   PILVKEALEAEILNKKNENLIKKVGQLQHPSLPLKRNWCPRDGEPN-------------- 110

Query: 1667 SIETGNSVRGSFKLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKD 1488
              +T  SV               K  N ++  +A T                     V D
Sbjct: 111  --KTSESVN--------------KKHNGVNSNVAGT--------------------SVVD 134

Query: 1487 EITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXX 1308
              +   ++ IGRFF++AGID   I LPSFQ M+ A +  G     P   ELKG I     
Sbjct: 135  SSSQEISKSIGRFFYEAGIDFDAIRLPSFQRMLKATLSPGKTIKFPSCQELKGWILQDAV 194

Query: 1307 XXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXX 1128
                      ++SW  TGCSILLDGW D +GR+ ++ LV CP GTI+LR           
Sbjct: 195  KEMQQYVTEIRKSWASTGCSILLDGWIDSKGRNLINILVYCPRGTIYLRSSDISSFNGNV 254

Query: 1127 XXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQ 948
                    +                 +S  + EAGKR+ME+ +++FWT+   +C+ ++LQ
Sbjct: 255  DAMLVFFEEVLEEVGVETVVQIVGYSTSACMMEAGKRLMEKCKTVFWTVDVSHCMELMLQ 314

Query: 947  KIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNML 768
            K   ++ +++ L+ AK +++FI+++   L  LR       LVK S ++S++PF+TL+N++
Sbjct: 315  KFTKMNPIQEALEKAKTLTQFIYNHATALKLLRDAC-PDELVKSSKIRSIVPFLTLENIV 373

Query: 767  SNREILVGLFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEIS 588
            S ++ L+ +F S  W  + +AS   GK IS+MVK+ SFW+ A+  +K T PL+ ++  ++
Sbjct: 374  SQKDCLISMFQSSDWHTSIMASTNEGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLN 433

Query: 587  RKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNP 408
              +   +G +Y++LD  K  IK+   G+E+ Y+ +WA +DD+WN YLHS LH+AGY+LNP
Sbjct: 434  GTNKPQIGFIYDTLDQIKVTIKKEFQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNP 493

Query: 407  ILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREK 228
            I FYSSDFY DAEVT+G+  C+V+M++D   Q+ + +Q+D+YR+    F      +    
Sbjct: 494  IYFYSSDFYADAEVTSGLCCCVVRMTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLIN 553

Query: 227  ASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRD 48
             SP +WWS +G    E+QR A ++LSQTC+GAS Y LK+++ E LH E  +  E+QR +D
Sbjct: 554  ISPALWWSQYGVQYPEIQRFAFRLLSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQD 613

Query: 47   LEFVHYNRRL 18
            L FVH N +L
Sbjct: 614  LVFVHCNLQL 623


>ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589543 isoform X1 [Solanum
            tuberosum] gi|565402986|ref|XP_006366949.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X2 [Solanum
            tuberosum] gi|565402988|ref|XP_006366950.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X3 [Solanum
            tuberosum]
          Length = 686

 Score =  456 bits (1172), Expect = e-125
 Identities = 242/657 (36%), Positives = 363/657 (55%)
 Frame = -3

Query: 1988 IEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLL 1809
            I +HG P+D +K +V+C YC K V+GF+RLK HL  +  DVT C E P  VK  ++  +L
Sbjct: 8    IHQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLETPILVKEALEAEIL 67

Query: 1808 EKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFK 1629
             K+    +KEVG+++HP LPLKRN+ P   + +                +T  SV     
Sbjct: 68   NKKNGNLIKEVGQLQHPNLPLKRNWCPRDGEPN----------------KTSESVN---- 107

Query: 1628 LKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGRF 1449
                      K  N ++ ++A T                     V D  +   ++ IGRF
Sbjct: 108  ----------KKHNGVNSKVAGT--------------------SVVDSSSQEISKSIGRF 137

Query: 1448 FFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXXKQS 1269
            F++AGID   I LPSFQ M+ A +  G     P   EL+G I               + S
Sbjct: 138  FYEAGIDLDAIRLPSFQRMVKATLSPGKTVKFPSCQELRGWILQDAVKEMQQYVMEIRNS 197

Query: 1268 WQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXX 1089
            W  TGCSILLDGW D  GR+ ++ LV CP GTI+LR                   +    
Sbjct: 198  WASTGCSILLDGWIDSNGRNLINILVYCPRGTIYLRSSDISSFNGNVDAMLLFFEEVLEE 257

Query: 1088 XXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVLD 909
                         +S  + E GK++ME+ +++FWT+ A +C+ ++LQ    +D +++ L+
Sbjct: 258  VGVETVVQIVAYSTSACMMEVGKKLMEKCKTVFWTVDASHCMELMLQNFTKIDPIQEALE 317

Query: 908  DAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSP 729
             AK +++FI+S+   L  LR       LVK S ++S++PF+TL+N++S ++ L+ +F S 
Sbjct: 318  KAKTLTQFIYSHATALKLLRDAC-PDELVKSSKIRSIVPFLTLENIVSQKDCLIRMFQSS 376

Query: 728  GWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNS 549
             W  + +AS   GK IS MVKD SFW+ A+  +K T PL+ ++  +   +   +G +Y++
Sbjct: 377  DWRTSIMASTNEGKRISNMVKDESFWSEALMAVKATIPLVEVMKLLDGTNKPQVGFIYDT 436

Query: 548  LDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAE 369
            LD AKE IK+    +++ Y+ +W  +DD+W+ YLHS LH+AGY+LNP LFYSSDFY D E
Sbjct: 437  LDQAKETIKKEFQDKKSLYAKFWIAIDDIWDEYLHSHLHAAGYFLNPTLFYSSDFYTDVE 496

Query: 368  VTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGH 189
            V+ G+  C+V+M++D   Q+ + +Q+D+YR   G F      D     SP +WWS +G  
Sbjct: 497  VSCGLCCCVVRMAEDRHIQDLITLQIDEYRMGRGTFHFGSFKDKLSNISPALWWSQYGVQ 556

Query: 188  CAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
              ELQR+AV+ILSQTC+GAS Y LK+++ E LH E  +  E+QR +DL FVH N +L
Sbjct: 557  FPELQRLAVRILSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQDLVFVHCNLQL 613


>ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247551 isoform 2 [Solanum
            lycopersicum]
          Length = 682

 Score =  455 bits (1170), Expect = e-125
 Identities = 240/657 (36%), Positives = 368/657 (56%)
 Frame = -3

Query: 1988 IEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLL 1809
            I +HG P+D +K +V+C YC K V+GF+RLK HL  +  DVT C + P  VK  ++  +L
Sbjct: 8    IRQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLKTPILVKEALEAEIL 67

Query: 1808 EKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFK 1629
             K+ E  +K+VG+++HP LPLKRN+ P   + +                +T  SV     
Sbjct: 68   NKKNENLIKKVGQLQHPSLPLKRNWCPRDGEPN----------------KTSESVN---- 107

Query: 1628 LKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGRF 1449
                      K  N ++  +A T                     V D  +   ++ IGRF
Sbjct: 108  ----------KKHNGVNSNVAGT--------------------SVVDSSSQEISKSIGRF 137

Query: 1448 FFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXXKQS 1269
            F++AGID   I LPSFQ M+ A +  G     P   ELKG I               ++S
Sbjct: 138  FYEAGIDFDAIRLPSFQRMLKATLSPGKTIKFPSCQELKGWILQDAVKEMQQYVTEIRKS 197

Query: 1268 WQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXX 1089
            W  TGCSILLDGW D +GR+ ++ LV CP GTI+LR                   +    
Sbjct: 198  WASTGCSILLDGWIDSKGRNLINILVYCPRGTIYLRSSDISSFNGNVDAMLVFFEEVLEE 257

Query: 1088 XXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVLD 909
                         +S  + EAGKR+ME+ +++FWT+   +C+ ++LQK   ++ +++ L+
Sbjct: 258  VGVETVVQIVGYSTSACMMEAGKRLMEKCKTVFWTVDVSHCMELMLQKFTKMNPIQEALE 317

Query: 908  DAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSP 729
             AK +++FI+++   L  LR       LVK S ++S++PF+TL+N++S ++ L+ +F S 
Sbjct: 318  KAKTLTQFIYNHATALKLLRDAC-PDELVKSSKIRSIVPFLTLENIVSQKDCLISMFQSS 376

Query: 728  GWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNS 549
             W  + +AS   GK IS+MVK+ SFW+ A+  +K T PL+ ++  ++  +   +G +Y++
Sbjct: 377  DWHTSIMASTNEGKRISEMVKNESFWSEALMAVKATIPLVKVIKLLNGTNKPQIGFIYDT 436

Query: 548  LDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAE 369
            LD  K  IK+   G+E+ Y+ +WA +DD+WN YLHS LH+AGY+LNPI FYSSDFY DAE
Sbjct: 437  LDQIKVTIKKEFQGKESLYAKFWAAIDDIWNGYLHSHLHAAGYFLNPIYFYSSDFYADAE 496

Query: 368  VTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGH 189
            VT+G+  C+V+M++D   Q+ + +Q+D+YR+    F      +     SP +WWS +G  
Sbjct: 497  VTSGLCCCVVRMTEDRHIQDLIALQIDEYRKGRSTFHFGSFKEKLINISPALWWSQYGVQ 556

Query: 188  CAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
              E+QR A ++LSQTC+GAS Y LK+++ E LH E  +  E+QR +DL FVH N +L
Sbjct: 557  YPEIQRFAFRLLSQTCNGASHYRLKRSLVETLHTEGMNPIEKQRLQDLVFVHCNLQL 613


>ref|XP_002310902.1| predicted protein [Populus trichocarpa]
          Length = 705

 Score =  453 bits (1165), Expect = e-124
 Identities = 242/656 (36%), Positives = 356/656 (54%), Gaps = 2/656 (0%)
 Frame = -3

Query: 1988 IEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLL 1809
            I +HG  +D +KKRVQC YC K ++GF+RLK+H+  +  DV  C +V   V+   + +LL
Sbjct: 10   IHDHGAALDEKKKRVQCNYCGKVLSGFSRLKYHVGGIRGDVVPCEKVAENVRESFRSMLL 69

Query: 1808 EKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFK 1629
            E ++     EV  +  P+LP KR  SP       R K      +G GS            
Sbjct: 70   ENKRASRDNEVQNLYPPDLPWKRYCSPDLNAAK-RKKRDANQTTGCGS------------ 116

Query: 1628 LKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEI-TLHAARCIGR 1452
                           +  ++   +    T   ++ N +  +    K+ + +  A RCIGR
Sbjct: 117  --------------GMHAEMHSVVEDDMTEHVSVNNRRRAMSSGPKENVMSRQAQRCIGR 162

Query: 1451 FFFDAGIDTTNINLPSFQAMIDAVICCG-SGYSAPGLDELKGVIXXXXXXXXXXXXXXXK 1275
            FF++ G D +   LPSFQ MI+A +  G S Y  P L +LKG I                
Sbjct: 163  FFYETGFDFSASTLPSFQRMINATLDDGHSEYKVPSLQDLKGWILHDEVEEIKTYVNEIS 222

Query: 1274 QSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXX 1095
             SW  TGCS+LLDGW D++GR+ VSF+V CP G  +LR                L+    
Sbjct: 223  HSWASTGCSVLLDGWVDEKGRNLVSFVVECPGGPTYLRSADVSAIIDDVNALQLLLEGVI 282

Query: 1094 XXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKV 915
                           +   +   G++ M+RY  +FW + A +CI ++L+KI  +DS+++ 
Sbjct: 283  EEVGIDNVVQIVAFSTVGWVGAVGEQFMQRYWCVFWCVSASHCIELMLEKIGAMDSIRRT 342

Query: 914  LDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFN 735
            L+ AK I++FI+ +   L  +R +I  + L+K S +K  +PF TL+N+LS ++ L  +F+
Sbjct: 343  LEKAKIITKFIYGHKKVLKLMRNHIDDYDLIKPSKMKLAMPFFTLENILSEKKNLEEMFD 402

Query: 734  SPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLY 555
            S  W  +  +S   G  ++ +V D SFW+ A    K T PLL +L  ++  D   +G +Y
Sbjct: 403  SFEWKTSVWSSTVEGMRVAHLVGDHSFWSGAEMASKATVPLLRVLCLVNEGDKPQVGFIY 462

Query: 554  NSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLD 375
             ++D  KE IK+    +++ Y+ +W  +DD+W+  LHSPLH+AGYYLNP LFYSSDFY D
Sbjct: 463  ETMDQVKETIKKEFKNKKSDYTPFWTAIDDIWDTRLHSPLHAAGYYLNPCLFYSSDFYSD 522

Query: 374  AEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHG 195
             EVT G++ C+V+M  D   Q ++  QLD+YR A G F   +A+  R   SP  WW  +G
Sbjct: 523  PEVTFGLLCCVVRMVADQRTQLKITFQLDEYRHARGAFQEGKAIVKRTNISPAQWWCTYG 582

Query: 194  GHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYN 27
              C ELQR AV+ILSQTC GASRY LK++++EKL  + R+  EQQR RDL FVHYN
Sbjct: 583  KQCPELQRFAVRILSQTCDGASRYGLKRSMAEKLLTDRRNPIEQQRLRDLTFVHYN 638


>gb|EOY26199.1| HAT transposon superfamily protein, putative [Theobroma cacao]
          Length = 709

 Score =  447 bits (1151), Expect = e-123
 Identities = 246/671 (36%), Positives = 364/671 (54%), Gaps = 7/671 (1%)
 Frame = -3

Query: 2009 STSTGQTIEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKA 1830
            S+     + +HG  +D +K+RVQC YC KE++GF RLK+HL  V  DV  C  V   VK 
Sbjct: 3    SSEASINVHDHGKAVDGKKQRVQCNYCGKEMSGFFRLKYHLGGVRGDVIPCEMVSEDVKE 62

Query: 1829 RMKDLLLEKRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGN 1650
              K++L E R  R  +EV  +   +LP KRN  P+S   +   K+   +    GS     
Sbjct: 63   LFKNMLPE-RGGRLSQEVRDLSRQDLPWKRNGCPNS---NVAKKMRRQSCKSSGS----- 113

Query: 1649 SVRGSFKLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEV------KD 1488
                                 + + +I D+++      PAI     IV +        ++
Sbjct: 114  --------------------RSGEDEIIDSMSEDDVKEPAILPSARIVSQSAVTGDPEEE 153

Query: 1487 EITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCG-SGYSAPGLDELKGVIXXXX 1311
                   RCIGRFF++ GID T +N PSFQ MI+   C G + Y  P   ELKG I    
Sbjct: 154  PSCKQNKRCIGRFFYETGIDLTLVNSPSFQRMINDTHCPGQTNYKIPSCQELKGWILKDE 213

Query: 1310 XXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXX 1131
                       +QSW  +GCSILLDGW D++GR+ VSF+V CP G I+L           
Sbjct: 214  VKEMQEYVEKIRQSWASSGCSILLDGWIDEKGRNLVSFIVDCPQGPIYLHSSDVSASVDD 273

Query: 1130 XXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMIL 951
                  L                    +   +   GK+ M R +++FWT+ A +CI ++L
Sbjct: 274  VDALQLLFDRVIDDVGVENVVQIIAFSTEGWVGAVGKQFMGRSKTVFWTVNASHCIELML 333

Query: 950  QKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNM 771
             KI M+  ++  L++A+ IS+FIH +   L+ LR Y   H L+K + ++S +PF+TL+N+
Sbjct: 334  DKIAMMGEIRGTLENARTISKFIHGHLTVLNLLRDYTDGHDLIKPTKVRSAMPFVTLENI 393

Query: 770  LSNREILVGLFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEI 591
            ++ ++ L  +F S  W+ +  AS+  GK ++ +V D SFW  A  V+K   PL+ +L  I
Sbjct: 394  IAEKKNLKAMFASSEWNTSAWASRAEGKRVADLVGDPSFWKGAGRVVKTALPLIRVLCLI 453

Query: 590  SRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLN 411
            +  D   MG +Y ++D  KE IK+    +E++Y  +W L+D +W+ +LHSPLH+AG++LN
Sbjct: 454  NGDDKPQMGYIYETMDQMKETIKKECNSKESQYMPFWELIDKIWDGHLHSPLHAAGHFLN 513

Query: 410  PILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGRE 231
            P LFYS+DF  D+EV  G++ CMV+M +    Q+++V QL+ YR +EG F     V  R 
Sbjct: 514  PSLFYSTDFQSDSEVAFGLLCCMVRMIQSQPIQDKIVQQLEAYRNSEGAFGEGSTVQQRT 573

Query: 230  KASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFR 51
            + S  +WWS +GG C ELQR A +ILSQTC GAS+Y L +++ EKL  + R+  EQQ   
Sbjct: 574  RFSSTMWWSTYGGRCPELQRFATRILSQTCVGASKYRLNRSLVEKLLTKGRNPVEQQLLS 633

Query: 50   DLEFVHYNRRL 18
            DL FVHYN +L
Sbjct: 634  DLIFVHYNLQL 644


>ref|XP_002530377.1| protein dimerization, putative [Ricinus communis]
            gi|223530094|gb|EEF32010.1| protein dimerization,
            putative [Ricinus communis]
          Length = 698

 Score =  444 bits (1141), Expect = e-121
 Identities = 245/666 (36%), Positives = 363/666 (54%), Gaps = 11/666 (1%)
 Frame = -3

Query: 1982 EHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLLEK 1803
            +HGT +  EK RVQC YC K V+G  RLK HL  +  DV  C +VP  VK   +++L E 
Sbjct: 12   DHGTAL--EKNRVQCNYCGKVVSGITRLKCHLGGIRKDVVPCEKVPENVKEAFRNMLQEI 69

Query: 1802 RKERFMKEVGRIEHPELPLKRNFSPS-----------SEQRHCRTKLTTPTDSGEGSIET 1656
            +KE   KE G+   P+LP KRN+SP+           S+   C +      DSG    E 
Sbjct: 70   KKEALAKEFGKQCQPDLPWKRNWSPTPNGVKHIKHEASQTAGCESNKQVDMDSGA---ED 126

Query: 1655 GNSVRGSFKLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITL 1476
            G +            +  P     +D + A  ING                E  +D  + 
Sbjct: 127  GAA------------EYLPVCNRRVDPEFA--ING----------------EAKEDASSR 156

Query: 1475 HAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXX 1296
             A RCIGRFF++ GID +N N PSF+ M++  +  G     P + E KG I         
Sbjct: 157  QAKRCIGRFFYETGIDFSNANSPSFKRMLNTTLGDGQ-VKIPTIHEFKGWILWDELKETQ 215

Query: 1295 XXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXX 1116
                  + SW  TGCS+LLDGW +++G++ VSF+V  P G I+LR               
Sbjct: 216  EYVKKIRNSWASTGCSLLLDGWMNEKGQNLVSFVVEGPEGLIYLRSANVSDIINDLDALQ 275

Query: 1115 SLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQM 936
             L+                   ++  +   GK+ M+R R++FW++ A +CI ++L+KI  
Sbjct: 276  LLLDRVMEEVGVDNVVQIIACSTTGWMGTIGKQFMDRRRTVFWSVSASHCIKLMLEKIGA 335

Query: 935  LDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNRE 756
            +D +K +++ AK I++FI+ N   L  +R Y  ++ LVK S +K  +PF+TL+N++S ++
Sbjct: 336  MDCIKWIIEKAKIITKFIYGNGEVLKLMRNYTNSYDLVKTSRMKFGVPFLTLENIISEKK 395

Query: 755  ILVGLFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDT 576
             L  +F S  W  +  AS   GK ++ ++ D SFW  A   L+ T PLL +L  I   D 
Sbjct: 396  NLENMFASSEWMTSVWASSPEGKRVAHLMGDLSFWTGAEMTLRATVPLLRVLCLIIEADK 455

Query: 575  SPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFY 396
              +G +Y ++D AKE IKE    ++++Y  +W ++D++W+ +LHSPLH+AGYYLNP LFY
Sbjct: 456  PQVGFIYETMDQAKETIKEEFRNKKSQYVPFWEIIDEIWDTHLHSPLHAAGYYLNPSLFY 515

Query: 395  SSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPD 216
            S+DFY D EV+ G++ C+V+M +DP  Q+ + +QLD+YR A G F    A++ R   SP 
Sbjct: 516  STDFYSDPEVSFGLLCCIVRMVQDPRTQDLISLQLDEYRHARGAFKEGSAINKRTNISPA 575

Query: 215  VWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFV 36
             WWS +G    ELQ  A+KILSQTC GA ++ LK+ ++EKL    R+  EQQR  +L +V
Sbjct: 576  QWWSIYGKQHPELQNFAIKILSQTCDGAMKFGLKRGLAEKLLLNGRNCNEQQRLDELTYV 635

Query: 35   HYNRRL 18
            HYN  L
Sbjct: 636  HYNLHL 641


>ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251332 [Vitis vinifera]
          Length = 709

 Score =  432 bits (1110), Expect = e-118
 Identities = 239/659 (36%), Positives = 357/659 (54%), Gaps = 4/659 (0%)
 Frame = -3

Query: 1982 EHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLLEK 1803
            +HG  +D +KK+ QC YC K V+GF RLK+HLA    DV+AC EVP+ VK  MK+ + E 
Sbjct: 12   DHGKAVDEQKKKAQCNYCGKVVSGFTRLKYHLAGKRGDVSACGEVPANVKELMKEKIHEL 71

Query: 1802 RKERFMKEVGRIEHPELPLKRNFSPSSE---QRHCRTKLTTPTDSGEGSIETGNSVRGSF 1632
             + +  K V ++  P+L LKR  S  S+   QR   T  +  +DSG+ +           
Sbjct: 72   ERRKLRKGVEKMNPPDLSLKRKSSLESKNVKQRKVGTIQSAGSDSGKHA----------- 120

Query: 1631 KLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGR 1452
                     P   +N +             S+ +I + +    +E +D     A +CIGR
Sbjct: 121  ------KNDPVSRVNEI----------VSFSVLSIGSKKASSDKEGEDIPVSQAKKCIGR 164

Query: 1451 FFFDAGIDTTNINLPSFQAMIDAVICCGS-GYSAPGLDELKGVIXXXXXXXXXXXXXXXK 1275
            F ++ G D +     S + MI+ +  C    Y  P   ELKG I               +
Sbjct: 165  FLYEMGTDFSAATPTSLRRMINGIHSCHQVEYEFPSHQELKGCILQDEVKEMLHHVHGIR 224

Query: 1274 QSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXX 1095
             +W  TGCSI++DGW D++GR+ ++FLV CP G I LR                L     
Sbjct: 225  DTWATTGCSIVVDGWKDEKGRNLMNFLVDCPWGPICLRLCDISTLSDDVHSLVLLFEQVI 284

Query: 1094 XXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKV 915
                           +S+ +   G  +M++Y ++FWT+ A +CI M+L+KI M+ + +++
Sbjct: 285  AEVGVENVVQIVSHSASECMAAVGNTLMDKYPTLFWTVSASHCIEMMLEKIGMMGTTREI 344

Query: 914  LDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFN 735
            LD AK I+RFI+ +   L+ +R +   H LVK S  KS IPF+TL+N++  +  L  +F 
Sbjct: 345  LDKAKTITRFIYCHAMVLNLMRNHTLVHDLVKPSKSKSAIPFLTLQNIVLEKGRLEKMFI 404

Query: 734  SPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLY 555
            S  W  +  AS+  GK ++ +V D SFW+ A  VLK T PL+G+L  I R     M  +Y
Sbjct: 405  SSEWKTSCWASRREGKRVADIVLDPSFWSGAEMVLKPTIPLVGVLCSIIRGGKGQMCYIY 464

Query: 554  NSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLD 375
             ++D  KE+I E     E++Y  +W L+D++WNN+LHS LH+A  +LNP +FYS D+  D
Sbjct: 465  ETMDAVKEDIAEEFENNESQYMPFWELIDEIWNNHLHSALHAAANHLNPAIFYSRDYNFD 524

Query: 374  AEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHG 195
             EV  G+  C+  M  D   Q ++ +QL++Y+ AEG+F   +A + R    P +WWS +G
Sbjct: 525  KEVFEGINCCIEHMVPDEHIQNEIWLQLEQYKDAEGDFGLGKATERRNIFHPALWWSNYG 584

Query: 194  GHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
            GHC ELQ++A +ILSQTC GASRY LK++++E L A+ R+   Q R  DL FVHYN  L
Sbjct: 585  GHCPELQKLATRILSQTCDGASRYKLKRSLAENLLAKGRNPIGQGRLCDLTFVHYNLHL 643


>ref|XP_006656455.1| PREDICTED: uncharacterized protein LOC102710414 [Oryza brachyantha]
          Length = 740

 Score =  394 bits (1012), Expect = e-107
 Identities = 228/707 (32%), Positives = 372/707 (52%), Gaps = 46/707 (6%)
 Frame = -3

Query: 1985 EEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLLE 1806
            E HG  ID E ++V C YC K V  +NRL+HHLA +  +V+ C +VP  V+  ++ LL +
Sbjct: 22   ENHGKTIDKETQKVSCNYCGKVVTSYNRLEHHLAGIRGNVSPCDQVPESVRQNIRTLLED 81

Query: 1805 KRKERFMKEVGRIEHPELPLKRNFSPSSEQR-------------------------HCRT 1701
            +RK+   + +G+++  ELP  RN S  S Q                          HC  
Sbjct: 82   RRKDWIARRIGKLKSSELPTVRNPSLPSAQACQPTLQPIASSIDRVNSVNGHRCFVHCTN 141

Query: 1700 KLTTPTDSGE----------GSIETGNSVRGSFKLKMPFPQQPPKAMNNLDFQIADTING 1551
             L  P+ + +           S + G     + +L MP  Q P      L+      IN 
Sbjct: 142  NLLQPSTTAQLNANYVCCNASSFQQGGQ---TIELAMPPYQNPSVTNKQLEISSGQRINP 198

Query: 1550 SHTSIPAIENIQPIVKEEVKDE------ITLHAARCIGRFFFDAGIDTTNINLPSFQAMI 1389
               S+   EN  P +++ V         +   A + IG+  F+AG+D   ++LPSF+ M+
Sbjct: 199  LSFSM---ENSSPQMQDSVSSMESNNSYLNSQAGKSIGKLIFEAGLDPGILHLPSFKDMV 255

Query: 1388 DAVICCGSG---YSAPGLDELKGVIXXXXXXXXXXXXXXXKQSWQRTGCSILLDGWTDQR 1218
            D +         Y +   D+LK +                ++ W+ +GCS++LD W  + 
Sbjct: 256  DVLAWAQVSMPTYESIMEDQLKEI---------QYRAGDLRKQWEMSGCSVILDSWESRC 306

Query: 1217 GRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXXXXXXXXXXXXXXESSDL 1038
            G+SF+S LV C  G +FL+                ++                  ++S  
Sbjct: 307  GKSFISVLVHCSKGMLFLKSMDVSEIIDDVDELSLMLLHVVEEVGVLNIAQIITNDASPH 366

Query: 1037 IEEAGKRIMERY-RSIFWTLCAEYCINMILQKIQMLDSVKKVLDDAKAISRFIHSNPHTL 861
            ++ A   +++R+  S F+TLCA++CIN++L+ I  LD V KVL  A+ I+RFI+S+   +
Sbjct: 367  MQAAEHAVLKRFGHSFFFTLCADHCINLLLENIAALDDVSKVLIKARDITRFIYSHAVPM 426

Query: 860  SHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSPGWDAADLASKWRGKCI 681
                 YI    ++   NLK V  FITL+ ++S R  LV LF+SP W ++D AS+   + +
Sbjct: 427  ELKGKYIQGGEILSNCNLKFVAMFITLRELVSERINLVELFSSPEWASSDWASRSTFRHV 486

Query: 680  SKMVK-DSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAKEEIKENMGGE 504
             ++VK D +FW +A ++LK+T PL+ +L ++   D+ P+G+LY+++DCAKE+IK N+   
Sbjct: 487  YEIVKTDDAFWCSAADILKLTDPLVTVLYKLEA-DSCPIGILYDAMDCAKEDIKCNL--- 542

Query: 503  EARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGVVYCMVKMSKD 324
              ++  YW +VD++W++YLH+P+H+AGY LNP +FY+  F  D E+ +G   C+ +++K+
Sbjct: 543  RDKHGDYWPMVDNIWDHYLHTPVHAAGYILNPRIFYTERFSCDTEIKSGTTACVSRLAKN 602

Query: 323  PEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQRIAVKILSQT 144
              +  ++  Q++ Y+     F  +  +    +     WWSAHG    EL+  A++ILSQT
Sbjct: 603  HYDPRKVAAQMEIYQSKSAPFDSDTEIQQIMEIPQVRWWSAHGTSTPELKTFAIRILSQT 662

Query: 143  CSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRLWHPPP 3
            C GASRY +  +ISE+LH   R   EQ++FR +E++HYN RL H  P
Sbjct: 663  CFGASRYNIDWSISEQLHLVKRPYPEQEKFRKMEYIHYNLRLAHSEP 709


>ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250835 [Solanum
            lycopersicum]
          Length = 640

 Score =  359 bits (922), Expect = 2e-96
 Identities = 189/520 (36%), Positives = 286/520 (55%), Gaps = 27/520 (5%)
 Frame = -3

Query: 1496 VKDEITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXX 1317
            V D  +   ++ IGRFF++AGID   I  PSFQ M+ A +  G     P   ELKG I  
Sbjct: 52   VVDSSSQEISKSIGRFFYEAGIDFDAIRSPSFQRMVIATLSLGQTIKFPSCQELKGWILQ 111

Query: 1316 XXXXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXX 1137
                         + SW  TGCSILLDGW D   R+ ++ LV CP GTI+LR        
Sbjct: 112  DAVKEMQQYVTEIRDSWTSTGCSILLDGWIDLNNRNLINILVYCPRGTIYLRSSDISSFN 171

Query: 1136 XXXXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINM 957
                     + +                 ++  + EAGK++ME++R++FW + A +C+ +
Sbjct: 172  GNVGAMLLFLEEILEEVGVETVVQIVTYSTAACMMEAGKKLMEKHRTVFWAVDAYHCMEL 231

Query: 956  ILQKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLK 777
            +LQK   +D + +V++ AK +++FI+S+   L  LR       LVK S ++ ++PF+TL+
Sbjct: 232  MLQKFTKIDPIHEVMEKAKTLTQFIYSHATVLKLLRDAC-PDELVKSSKIRFIVPFLTLE 290

Query: 776  NMLSNREILVGLFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILV 597
            N++S ++ L+ +F S  W ++ LAS   GK +S+MV+D SFW   +  +K T PL+ ++ 
Sbjct: 291  NIVSQKKCLIRMFQSSDWHSSVLASTIEGKRMSEMVEDRSFWTEGLMAVKATIPLVEVIK 350

Query: 596  EISRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYY 417
             +   +   +G +Y++LD AKE IK+    + + Y+ +W  +DD+W+ Y HS LH+ GY+
Sbjct: 351  LLDCTNKPQVGFIYDTLDQAKETIKKEFRHKRSHYARFWKAIDDIWDEYFHSHLHAVGYF 410

Query: 416  LNPILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDG 237
            LNP LFYSS+FY D EVT G+  C+V+M++D   Q  +  Q+D+YR+  G F      D 
Sbjct: 411  LNPTLFYSSNFYTDVEVTCGLCCCVVRMTEDRHIQHLITQQIDEYRKGRGTFHFGSFKDK 470

Query: 236  REKASPD---------------------------VWWSAHGGHCAELQRIAVKILSQTCS 138
                SP                            +WWS +GG C ELQR AV+ILSQTC+
Sbjct: 471  LSNISPGGIIYTFSAILIMLTYNSYINLYVMVAALWWSQYGGQCPELQRFAVRILSQTCN 530

Query: 137  GASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
            GAS Y LK+ + E L  E  ++ E+QR +DL FVH N +L
Sbjct: 531  GASHYRLKRNLVETLLTEGMNLIEKQRLQDLVFVHCNLQL 570


>ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis thaliana]
            gi|15795134|dbj|BAB02512.1| transposase-like protein
            [Arabidopsis thaliana] gi|332641756|gb|AEE75277.1| hAT
            transposon superfamily protein [Arabidopsis thaliana]
          Length = 605

 Score =  345 bits (886), Expect = 4e-92
 Identities = 217/663 (32%), Positives = 321/663 (48%), Gaps = 6/663 (0%)
 Frame = -3

Query: 1988 IEEHGTPIDAEKKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLL 1809
            + EHG  +D +K RV+C YC KE+N F+RLKHHL AVG DVT C +V   ++   + +L+
Sbjct: 8    VREHGICVDKKKSRVKCNYCGKEMNSFHRLKHHLGAVGTDVTHCDQVSLTLRETFRTMLM 67

Query: 1808 EKRK---ERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRG 1638
            E +        K VG+ +  +   +R    SS      +K  +P + G  ++E  N    
Sbjct: 68   EDKSGYTTPKTKRVGKFQMADSRKRRKTEDSS------SKSVSP-EQGNVAVEVDN---- 116

Query: 1637 SFKLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCI 1458
                                                            +D ++  A +CI
Sbjct: 117  ------------------------------------------------QDLLSSKAQKCI 128

Query: 1457 GRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXX 1278
            GRFF++  +D + ++ P F+ M+ A+   G G   P   +L G +               
Sbjct: 129  GRFFYEHCVDLSAVDSPCFKEMMMAL---GVGQKIPDSHDLNGRLLQEAMKEVQDYVKNI 185

Query: 1277 KQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDX 1098
            K SW+ TGCSILLD W D +G   VSF+  CPAG ++L+               SL++  
Sbjct: 186  KDSWKITGCSILLDAWIDPKGHDLVSFVADCPAGPVYLKSIDVSVVKNDVTALLSLVNGL 245

Query: 1097 XXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKK 918
                            +S  + E GK      R +FW++   +C  ++L KI  + S   
Sbjct: 246  VEEVGVHNVTQIIACSTSGWVGELGKLFSGHDREVFWSVSLSHCFELMLVKIGKMRSFGD 305

Query: 917  VLDDAKAISRFIHSNPHTLSHLRTYI-GTHSLVKMSNLKSVIPFITLKNMLSNREILVGL 741
            +LD    I  FI++NP  L   R    G    V  S  + V P++ LK++   ++ L  +
Sbjct: 306  ILDKVNTIWEFINNNPSALKIYRDQSHGKDITVSSSEFEFVKPYLILKSVFKAKKNLAAM 365

Query: 740  FNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSP-MG 564
            F S  W       K  GK +S +V DSSFW A  E+LK T+PL   L   S  D +  +G
Sbjct: 366  FASSVW------KKEEGKSVSNLVNDSSFWEAVEEILKCTSPLTDGLRLFSNADNNQHVG 419

Query: 563  VLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDF 384
             +Y++LD  K  IK+    E+  Y   W ++DDVWN +LH+PLH+AGYYLNP  FYS+DF
Sbjct: 420  YIYDTLDGIKLSIKKEFNDEKKHYLTLWDVIDDVWNKHLHNPLHAAGYYLNPTSFYSTDF 479

Query: 383  YLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWS 204
            +LD EV++G+ + +V ++K  E Q ++  QLD+YR  +  F+     D     SP  WW+
Sbjct: 480  HLDPEVSSGLTHSLVHVAK--EGQIKIASQLDRYRLGKDCFNEASQPDQISGISPIDWWT 537

Query: 203  AHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEK-LHAEVRDVTEQQRFRDLEFVHYN 27
                   ELQ  A+KILSQTC GASRY LK++++EK L  E     E++   +L FVHYN
Sbjct: 538  EKASQHPELQSFAIKILSQTCEGASRYKLKRSLAEKLLLTEGMSHCERKHLEELAFVHYN 597

Query: 26   RRL 18
              L
Sbjct: 598  LHL 600


>gb|EEC81276.1| hypothetical protein OsI_24379 [Oryza sativa Indica Group]
          Length = 657

 Score =  340 bits (872), Expect = 2e-90
 Identities = 192/571 (33%), Positives = 317/571 (55%), Gaps = 7/571 (1%)
 Frame = -3

Query: 1694 TTPTDSGEGSIETGNSVRGSFKLKMPFPQQPPKAMNNLDFQIA-----DTINGSHTSIPA 1530
            TT  ++ + S    +S +G   +++  P      M N   +I+     D ++ S  +  +
Sbjct: 65   TTQVNAHDVSCNASSSQKGGQTIEVTRPPYQNPCMMNKPPEISSGQRIDPLSFSMENSSS 124

Query: 1529 IENIQPIVKEEVKDEITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAP 1350
                    KE   D +   A + IG+  F+AG++   ++LPSF+ M+D  +   +  S P
Sbjct: 125  QMQDSESSKEPTNDYLNSQARKSIGKLIFEAGLEPGILHLPSFKDMVD--VLAWAQVSIP 182

Query: 1349 GLDELKGVIXXXXXXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTI 1170
              +     I               K+ W+  GCS++LD W  + G+SF+S LV C  G +
Sbjct: 183  TYES----IMEEQLREIQCHARDLKKHWEMNGCSVILDTWESRCGKSFISVLVHCSKGML 238

Query: 1169 FLRXXXXXXXXXXXXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYR-SI 993
            F++                ++                  + S  ++ A   +++RY  S 
Sbjct: 239  FIKSMDVSDIIDDVDELAVMLFRVVEEVGVLNIVQVITNDESPYMQAAEHAVLKRYGYSF 298

Query: 992  FWTLCAEYCINMILQKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMS 813
            F+TLCA++CIN++L+ I  LD V +VL  A+ I+RFI+S+   +     YI    ++  S
Sbjct: 299  FFTLCADHCINLLLENIAALDHVNEVLIKAREITRFIYSHAVPMELKGKYIQGGEILSSS 358

Query: 812  NLKSVIPFITLKNMLSNREILVGLFNSPGWDAADLASKWRGKCISKMVK-DSSFWAAAVE 636
            NLK V  FITL  ++S R  LV +F+SP W ++DLAS+   + + ++VK D++FW+AA +
Sbjct: 359  NLKFVAMFITLGKLVSERINLVEMFSSPEWASSDLASRSSFRHVYEVVKTDNAFWSAAAD 418

Query: 635  VLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWN 456
            +LK+T PL+ +L ++   D  P+G+LY+++DCAKE+IK N+     ++  YW +VD++W+
Sbjct: 419  ILKLTDPLITVLYKLEA-DNCPIGILYDAMDCAKEDIKCNL---RDKHGDYWPMVDEIWD 474

Query: 455  NYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQ 276
            +YLH+P+H+AGY LNP +FY+  F  D E+ +G   C+ +++K+  + +++ IQ+D+YR+
Sbjct: 475  HYLHTPVHAAGYILNPRIFYTERFSYDTEIKSGTNACVTRLAKNHYDPKKVAIQMDRYRR 534

Query: 275  AEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEK 96
                F  + A+    +     WWSAHG    ELQ  A++ILSQTC GAS Y + ++ISE+
Sbjct: 535  KSAPFDSDSAIQQTMEIPQVRWWSAHGTDTPELQTFAIRILSQTCFGASIYNIDRSISEQ 594

Query: 95   LHAEVRDVTEQQRFRDLEFVHYNRRLWHPPP 3
            LH   R   EQ+RFR +E+VHYN RL H  P
Sbjct: 595  LHVVKRTYPEQERFRTMEYVHYNLRLAHCEP 625


>ref|NP_001058504.1| Os06g0704000 [Oryza sativa Japonica Group]
            gi|53791924|dbj|BAD54046.1| hAT dimerisation
            domain-containing protein-like [Oryza sativa Japonica
            Group] gi|113596544|dbj|BAF20418.1| Os06g0704000 [Oryza
            sativa Japonica Group] gi|215707068|dbj|BAG93528.1|
            unnamed protein product [Oryza sativa Japonica Group]
            gi|222636187|gb|EEE66319.1| hypothetical protein
            OsJ_22556 [Oryza sativa Japonica Group]
          Length = 657

 Score =  338 bits (866), Expect = 7e-90
 Identities = 190/571 (33%), Positives = 317/571 (55%), Gaps = 7/571 (1%)
 Frame = -3

Query: 1694 TTPTDSGEGSIETGNSVRGSFKLKMPFPQQPPKAMNNLDFQIA-----DTINGSHTSIPA 1530
            TT  ++ + S    +S +G   +++  P      M N   +I+     D ++ S  +  +
Sbjct: 65   TTQVNAHDVSCNASSSQKGGQTIEVTRPPYQNPCMMNKPPEISSGQRIDPLSFSMENSSS 124

Query: 1529 IENIQPIVKEEVKDEITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAP 1350
                    KE   D +   A + IG+  F+AG++   ++LPSF+ M+D  +   +  + P
Sbjct: 125  QMQDSESSKEPTNDYLNSQARKSIGKLIFEAGLEPGILHLPSFKDMVD--VLAWAQVAIP 182

Query: 1349 GLDELKGVIXXXXXXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTI 1170
              +     I               K+ W+  GCS++LD W  + G+SF+S LV C  G +
Sbjct: 183  TYES----IMEEQLREIQCHARDLKKHWEMNGCSVILDTWESRCGKSFISVLVHCSKGML 238

Query: 1169 FLRXXXXXXXXXXXXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYR-SI 993
            F++                ++                  + S  ++ A   +++RY  S 
Sbjct: 239  FIKSMDVSDIIDDVDELAVMLFRVVEEVGVLNIVQVITNDESPYMQAAEHAVLKRYGYSF 298

Query: 992  FWTLCAEYCINMILQKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMS 813
            F+TLCA++CIN++L+ I  LD V +VL  A+ I+RFI+S+   +     YI    ++  S
Sbjct: 299  FFTLCADHCINLLLENIAALDHVNEVLIKAREITRFIYSHAVPMELKGKYIQGGEILSSS 358

Query: 812  NLKSVIPFITLKNMLSNREILVGLFNSPGWDAADLASKWRGKCISKMVK-DSSFWAAAVE 636
            NLK V  FITL  ++S R  LV +F+SP W ++DLAS+   + + ++VK D++FW+AA +
Sbjct: 359  NLKFVAMFITLGKLVSERINLVEMFSSPEWASSDLASRSSFRHVYEVVKTDNAFWSAAAD 418

Query: 635  VLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWN 456
            +LK+T PL+ +L ++   D  P+G+LY+++DCAKE+IK N+     ++  YW +VD++W+
Sbjct: 419  ILKLTDPLITVLYKLEA-DNCPIGILYDAMDCAKEDIKCNL---RDKHGDYWPMVDEIWD 474

Query: 455  NYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQ 276
            +YLH+P+H+AGY LNP +FY+  F  D E+ +G   C+ +++K+  + +++ IQ+D+YR+
Sbjct: 475  HYLHTPVHAAGYILNPRIFYTERFSYDTEIKSGTNACVTRLAKNHYDPKKVAIQMDRYRR 534

Query: 275  AEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEK 96
                F  + A+    +     WWSAHG    ELQ  A++ILSQTC GAS Y + ++ISE+
Sbjct: 535  KSAPFDSDSAIQQTMEIPQVRWWSAHGTDTPELQTFAIRILSQTCFGASIYNIDRSISEQ 594

Query: 95   LHAEVRDVTEQQRFRDLEFVHYNRRLWHPPP 3
            LH   R   EQ+RFR +E++HYN RL H  P
Sbjct: 595  LHVVKRTYPEQERFRTMEYLHYNLRLAHCEP 625


>ref|XP_004966349.1| PREDICTED: uncharacterized protein LOC101752579 [Setaria italica]
          Length = 579

 Score =  327 bits (839), Expect = 1e-86
 Identities = 180/508 (35%), Positives = 288/508 (56%), Gaps = 2/508 (0%)
 Frame = -3

Query: 1535 PAIENIQPIVKEEVKDEITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYS 1356
            P  +  +P       D I    A  IGR  F+AG++   ++LPSF  +ID ++  G   +
Sbjct: 53   PVSQRQEPEPSMGASDNIDSLVANSIGRLIFEAGLEPGFVHLPSFNGVID-LLTRGVRIA 111

Query: 1355 APGLDELKGVIXXXXXXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAG 1176
             P  + +  V                +Q W+++GCS++LD W  + G+ FVS  V C  G
Sbjct: 112  MPSYEYILQV----QIKEVQQRDRALRQHWEKSGCSVILDSWKSRCGKRFVSVFVHCREG 167

Query: 1175 TIFLRXXXXXXXXXXXXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERY-R 999
             +FLR               +++                  + S  ++ A   +++RY +
Sbjct: 168  MLFLRSMDTSTIFDDVDELATMVCHVIEDIGVRNIVQVIINDVSPHMQAAEHAVLKRYEQ 227

Query: 998  SIFWTLCAEYCINMILQKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVK 819
            S  +T+CA++CI+++L+ I  LD+VK VL  AK I+RF++ +   +   R YIG   ++ 
Sbjct: 228  SFIFTVCADHCIDLLLENIAALDNVKDVLTKAKEITRFLYGHALPMELKRLYIGDAEIIS 287

Query: 818  MSNLKSVIPFITLKNMLSNREILVGLFNSPGWDAADLASKWRGKCISKMVK-DSSFWAAA 642
             SNLK V  F TL+ ++S RE LV +FNS  W ++DLAS      I ++V+ +++FW+AA
Sbjct: 288  NSNLKCVAMFDTLEKLVSWRENLVEMFNSADWVSSDLASTNLSMGICEVVQMENAFWSAA 347

Query: 641  VEVLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDV 462
              VLKVT PL+ +L ++   D  P+ VLY+++D AKEEIK+N+G E   +  YW ++D +
Sbjct: 348  AHVLKVTGPLIRVLYKLE-DDKCPVSVLYDAMDNAKEEIKQNLGDE---HDSYWQMIDHI 403

Query: 461  WNNYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKY 282
            W++YLHSP+H+AGY+LNP +FY+  F  DAE+++G+  C+++ +K   +   +  Q+D Y
Sbjct: 404  WDDYLHSPVHAAGYFLNPAIFYTVRFRNDAEISSGITTCILRAAKSHYDALLVAEQMDVY 463

Query: 281  RQAEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAIS 102
             +  G+F  + A++       D+WW  HG     LQ  A  IL QTC G SRY L +++S
Sbjct: 464  LRKSGQFDSDPAIEEAVGTPQDLWWVKHGTGTPALQSFAGLILGQTCYGVSRYNLDRSLS 523

Query: 101  EKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
            E+LH E    TE++RFR +E+V+YN RL
Sbjct: 524  ERLHTEKMAYTERERFRSMEYVYYNLRL 551


>ref|XP_002437551.1| hypothetical protein SORBIDRAFT_10g029230 [Sorghum bicolor]
            gi|241915774|gb|EER88918.1| hypothetical protein
            SORBIDRAFT_10g029230 [Sorghum bicolor]
          Length = 588

 Score =  322 bits (825), Expect = 4e-85
 Identities = 175/499 (35%), Positives = 283/499 (56%), Gaps = 3/499 (0%)
 Frame = -3

Query: 1490 DEITLHAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXX 1311
            D +    A  IGR  F+AG++   ++LPSF  +ID ++  G   + P  + +  V     
Sbjct: 76   DNLDSLVADSIGRLAFEAGVEPDFVHLPSFNGVID-LLTRGVRIAMPSYEYILQV----Q 130

Query: 1310 XXXXXXXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXX 1131
                       +Q W+R GCS++LD W  + GRSF+S  V C  G  FLR          
Sbjct: 131  LNEVQKREKAMRQHWERRGCSLILDSWKSRCGRSFISAFVHCGEGMFFLRSIDISTIFDD 190

Query: 1130 XXXXXSLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIF-WTLCAEYCINMI 954
                 +++                    S  ++     +++++   F +T+CA++CIN++
Sbjct: 191  VDELAAMVCCLIDDIGVHNIVQVITNNVSPHMQATEHAVLKKHDQPFVFTVCADHCINLL 250

Query: 953  LQKIQMLDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHS-LVKMSNLKSVIPFITLK 777
            L+ I  LD VK VL  A+ I+ F++ +   +  ++ +    S ++  SNL+SV  F+TL+
Sbjct: 251  LENIAKLDHVKDVLTKAREITMFLYGHALPMELMKKFFYFDSEIISNSNLRSVAKFLTLE 310

Query: 776  NMLSNREILVGLFNSPGWDAADLASKWRGKCISKMVK-DSSFWAAAVEVLKVTTPLLGIL 600
             ++S RE L+ +F+SP W ++DLA       I ++VK DS+FW AA  VLKVT PL+ +L
Sbjct: 311  TLVSQRENLMEMFSSPNWASSDLACTSLSMHICEVVKTDSAFWRAADNVLKVTGPLISVL 370

Query: 599  VEISRKDTSPMGVLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGY 420
             ++   D  P+ VLY++++ AKE IK+N+G E   Y   W ++D +W NYLHSP+H+AGY
Sbjct: 371  YKLEN-DNCPVSVLYDAMNSAKECIKKNLGHEHGNY---WRMIDRIWENYLHSPIHAAGY 426

Query: 419  YLNPILFYSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVD 240
             LNP LFY+  +  D+E+ +G+  C+++ ++   +  ++  Q+D Y++  G F  + A+ 
Sbjct: 427  ILNPGLFYADRYREDSEIVSGIKTCIIQAARSHYDAFRVGEQMDLYKRRSGLFDSDSAIQ 486

Query: 239  GREKASPDVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQ 60
               +   DVWW  HG    ELQ  A +IL QTC GA+RY L K++SE+LH E R VT+Q+
Sbjct: 487  EATETPQDVWWERHGSGTKELQSFAARILGQTCFGATRYNLNKSLSERLHTEKRTVTDQE 546

Query: 59   RFRDLEFVHYNRRLWHPPP 3
            RFR++E+++YN RL +  P
Sbjct: 547  RFRNMEYIYYNLRLKNAVP 565


>ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580052 [Solanum tuberosum]
          Length = 586

 Score =  322 bits (824), Expect = 6e-85
 Identities = 167/472 (35%), Positives = 265/472 (56%)
 Frame = -3

Query: 1433 IDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXXKQSWQRTG 1254
            ID   I  PSF+ M+ A +  G     P   EL G I               ++SW  TG
Sbjct: 77   IDFDAIRSPSFRRMVKATLSPGQTIKFPSCQELNGWILEDAVQEMQQYVTEIRKSWASTG 136

Query: 1253 CSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXXXXXXX 1074
            CSILLDGW D   R+ ++ LV CP GTI+LR                 + +         
Sbjct: 137  CSILLDGWIDLNNRNLINILVYCPRGTIYLRSSDISSFSRNFDAMLLFLEEILEEVGVEN 196

Query: 1073 XXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVLDDAKAI 894
                    +SD + EAGK++M++ +++FW++ A YC+ ++LQ++  +  +K+ L+ AK +
Sbjct: 197  VVQIVAYTTSDWMMEAGKKLMDKCKTVFWSIDASYCMELMLQEVTKIGWIKEALEKAKML 256

Query: 893  SRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSPGWDAA 714
             +FI+S+   L  LR       LVK S +K+++PF+TL+N++S ++ L+ +F S  W  +
Sbjct: 257  VQFIYSHATVLKLLRDAFSEAELVKSSKIKAIVPFLTLENIVSQKDGLIRMFQSSTWQTS 316

Query: 713  DLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAK 534
             LAS   GK +S+M+KD SFW  A+  +K T PL+ ++  ++  + + +G ++++LD AK
Sbjct: 317  LLASTSEGKGMSEMIKDESFWTEALMAVKATIPLVEVIKFLNGTNKAQVGFIHDTLDQAK 376

Query: 533  EEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGV 354
            E I++        ++  W  +DD WN YLHSPLH AGYYLNP  F+SS++ L+ +++ G+
Sbjct: 377  ETIRKEFKSTRFCHAKIWNAIDDTWNKYLHSPLHDAGYYLNPTFFHSSNWCLNVKISDGL 436

Query: 353  VYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQ 174
              C+  M++D   ++ +  Q+       G F    + +     SP  WWS +     EL+
Sbjct: 437  CSCITGMAEDRRIKDLITQQI-------GTFDFLSSKEILSDISPGHWWSKYEVEFPELE 489

Query: 173  RIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
            R+AV+ILSQTC+GAS Y LK+++ E LH + R+  EQQR  DL FVH N +L
Sbjct: 490  RLAVRILSQTCNGASHYRLKRSLVETLHRKGRNQIEQQRLSDLVFVHCNLQL 541


>ref|XP_002512206.1| DNA binding protein, putative [Ricinus communis]
            gi|223548750|gb|EEF50240.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 739

 Score =  310 bits (795), Expect = 1e-81
 Identities = 190/653 (29%), Positives = 319/653 (48%), Gaps = 2/653 (0%)
 Frame = -3

Query: 1979 HGTPIDAEKKRVQCKYCLKEV--NGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLLE 1806
            HGT ++  +++++CKYC K     G +RLK HLA    +V  C +VP  VK +++  L  
Sbjct: 21   HGTMVNGGRQKIKCKYCHKIFLGGGISRLKQHLAGERGNVAPCEDVPEEVKVQIQQHLGF 80

Query: 1805 KRKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFKL 1626
            K  ER  K+           + N S +S   + R +     + G G  E     R   K 
Sbjct: 81   KVLERLKKQK----------EANGSKNSYMLYLRDREEDDVNLGSGQKEAS---RRRDKE 127

Query: 1625 KMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGRFF 1446
             +    +  K     ++ +A ++             QPI +     E    A   + RFF
Sbjct: 128  VLEGISKRTKRRKKQNYSMATSVI-----------TQPICQSFAPPENIELADVAVARFF 176

Query: 1445 FDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXXKQSW 1266
            ++AGI  T  N   FQ M D +I  G GY  P    L+G +               ++SW
Sbjct: 177  YEAGIPFTAANSYFFQQMADNIIAAGPGYKMPSYTSLRGKLLNRCIQDAEEYCSELRKSW 236

Query: 1265 QRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXXX 1086
            + TGC++L+D W   R R+ ++F V CP GT+FLR               +L  D     
Sbjct: 237  EVTGCTVLVDRWMHGRDRTVINFFVYCPKGTMFLRSVDASGITKSVEALLNLF-DSVVQQ 295

Query: 1085 XXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVLDD 906
                       +S    + AGK + E+Y++ F + C   CIN++L++I   D +K+VL  
Sbjct: 296  VGLKNIVNFVTDSVPTYKNAGKLLAEKYKTFFCSTCGAECINLMLEEIGESDGIKEVLAK 355

Query: 905  AKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSPG 726
            AK +++FI++N   L+ +R   G   +++++  +    F+TL+ ++S ++ L  +F S  
Sbjct: 356  AKRLTQFIYNNSWVLNLMRKRTGGKDIIQLARTRFASIFLTLQTIVSLKDHLHKMFTSAS 415

Query: 725  WDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNSL 546
            W  +    +  G  +++++ D  FW+   + L +  P+L +L  I  +D   MG +Y+++
Sbjct: 416  WMQSSFPKQRAGIEVAEILVDPRFWSLCDQTLTIAKPILSVLHLIDCQDKPSMGYIYDAI 475

Query: 545  DCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAEV 366
            + AK+ I      +E+ Y  Y  ++D VW    HSPLH+A +YLNP + Y+  F  +  +
Sbjct: 476  EKAKKSIVVGFNNKESDYLSYLKVIDHVWQEDFHSPLHAAAHYLNPSVIYNPSFSSNKFI 535

Query: 365  TTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGHC 186
              G++ C+  +  +   Q  +  Q+  Y +A G+F    A+ GRE  +P  WWS +    
Sbjct: 536  QKGLLDCIETLEPNLSAQVTITSQIKFYEEAVGDFGRPMALRGRESLAPATWWSLYAADY 595

Query: 185  AELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYN 27
             +LQR+A++ILSQTCS  +R     ++ E+ H++ R+  E QR  DL FVHYN
Sbjct: 596  PDLQRLAIRILSQTCS-LTRCERNWSMFERTHSKKRNRLEHQRLNDLTFVHYN 647


>emb|CAN78444.1| hypothetical protein VITISV_016801 [Vitis vinifera]
          Length = 689

 Score =  306 bits (784), Expect = 2e-80
 Identities = 197/645 (30%), Positives = 305/645 (47%), Gaps = 2/645 (0%)
 Frame = -3

Query: 1946 VQCKYCLKE-VNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLLLEKRKERFMKEVGR 1770
            ++CK+C +  + G NRLKHHLA   + +  CS+V    +   K+ L   + ++  +    
Sbjct: 27   LRCKFCNQRCMGGVNRLKHHLAGTHHGMNPCSKVSEDARLECKEALANFKDQKTKRN--- 83

Query: 1769 IEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFKLKMPFPQQPPKAM 1590
                EL  +    P+S      +K      SG GS E             P P+ P    
Sbjct: 84   ----ELLQEIGMGPTSMHESALSKTIGTLGSGSGSGE-------------PIPRGPMDKF 126

Query: 1589 NNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAARCIGRFFFDAGIDTTNINL 1410
                           TS P    +    K+E + E+     R IGRF +  G+    +N 
Sbjct: 127  T--------------TSQPRQSTLNSKWKQEERKEV----CRKIGRFMYSKGLPFNTVND 168

Query: 1409 PSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXXXXKQSWQRTGCSILLDGW 1230
              +  MIDAV   G G+  P + EL+  I               K++W++ GCSI+ DGW
Sbjct: 169  RYWFPMIDAVANFGPGFKPPSMHELRTWILKEEVNDLSIIMEDHKKAWKQYGCSIMSDGW 228

Query: 1229 TDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLISDXXXXXXXXXXXXXXXXE 1050
            TD + R  ++FLV+ P GT F++                 + +                 
Sbjct: 229  TDGKSRCLINFLVNSPTGTWFMKSIDASDTIKNGELMFKYLDEVVEEIGEENVVQVITDN 288

Query: 1049 SSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSVKKVLDDAKAISRFIHSNP 870
            +S+ +  AG R+ME+   ++WT C  +CI+++L+ I+ L+     L  A+ + +FI+ + 
Sbjct: 289  ASNYVN-AGMRLMEKRSRLWWTPCVAHCIDLMLEDIRKLNVHATTLSRARQVVKFIYGHT 347

Query: 869  HTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVGLFNSPGWDAADLASKWRG 690
              LS +RT+   H L++ +  +    F+TL+++   ++ L+ +F+S  W ++  A K  G
Sbjct: 348  WVLSLMRTFTKNHELIRPAITRFATAFLTLQSLYKQKQALIAMFSSEKWCSSTWAKKVEG 407

Query: 689  -KCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMGVLYNSLDCAKEEIKENM 513
             K  S ++ D +FW      +K T PL+ +L E+  ++   MG +Y  +D AKE+I  N 
Sbjct: 408  VKTRSTVLFDPNFWPHVAFCIKTTVPLVSVLREVDSEERPAMGYIYELMDSAKEKIAFNC 467

Query: 512  GGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDFYLDAEVTTGVVYCMVKM 333
             G E +Y   W  +D  W   LH PLH+A YYLNP L Y   F    EV  G+  CM +M
Sbjct: 468  RGMERKYGPIWRKIDARWTPQLHRPLHAADYYLNPQLRYGDKFSNVDEVRKGLFECMDRM 527

Query: 332  SKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWSAHGGHCAELQRIAVKIL 153
              D +E+ +  IQLD Y QA GEF    A+D R   SP  WW   GG   ELQ+ A+++L
Sbjct: 528  -LDYQERLKADIQLDSYDQAMGEFGSCIAIDSRTLRSPTSWWMRFGGSTPELQKFAIRVL 586

Query: 152  SQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEFVHYNRRL 18
            S TCS AS      +  E +H + R+  E QR   L +V YN RL
Sbjct: 587  SLTCS-ASGCERNWSTFESIHTKKRNRLEHQRLNALVYVRYNTRL 630


>gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  302 bits (774), Expect = 3e-79
 Identities = 194/663 (29%), Positives = 324/663 (48%), Gaps = 14/663 (2%)
 Frame = -3

Query: 1964 DAEKKRVQCKYCLK--EVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDLL----LEK 1803
            + E+ +++C YC K     G +R+K HLA    + + C  VPS V+  M++ L    ++K
Sbjct: 27   NGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGNASTCFHVPSDVRLLMRESLDGVEVKK 86

Query: 1802 RKERFMKEVGRIEHPELPLKRNFSPSSEQRHCRTKLTTPTDSGEGSIETGNSVRGSFKLK 1623
            RK++ +                    +E+     ++++  D+ +  ++T     G   ++
Sbjct: 87   RKKQKI--------------------AEEMSNANQVSSEIDTYDNQVDTNT---GLLMIE 123

Query: 1622 MPFPQQPPKAM-------NNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEITLHAAR 1464
             P   QP  ++       +N+         G  ++  +   +   V    K  +  H   
Sbjct: 124  GPDTLQPSSSLLVNREGTSNVSGDRRKRGKGKSSAAESNALVVNTVGLGAK-RVNNHVHV 182

Query: 1463 CIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXXXXXX 1284
             IGRF FD G     +N   FQ M+DA+I  GSG   P   +L+G I             
Sbjct: 183  AIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKSVEEVKSDND 242

Query: 1283 XXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXXSLIS 1104
                +W RTGCSIL++ W  Q GR  ++FLV CP GT+FL+                L+ 
Sbjct: 243  KVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINSSDALYELLK 302

Query: 1103 DXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQMLDSV 924
                                  I  AG+R+ E + +++WT CA +CIN+IL+    L+ +
Sbjct: 303  QVVEEVGSKHVLQVITNAEEQYIV-AGRRLAETFPTLYWTPCAAHCINLILEDFAKLEWI 361

Query: 923  KKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNREILVG 744
              +++ A++I+RF++++   L+ +R Y   + +V+ +   S   F TLK M+  +  L  
Sbjct: 362  NVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLKNNLQA 421

Query: 743  LFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDTSPMG 564
            +  S  W     + K  G  +  +V + SFW+++V + ++T PLL +L  +  K    MG
Sbjct: 422  MVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKKRPAMG 481

Query: 563  VLYNSLDCAKEEIKENMGGEEARYSLYWALVDDVWNNYLHSPLHSAGYYLNPILFYSSDF 384
             +Y  +  AKE IK+ +  +   Y +YW ++D  W    H PLH AG+YLNP  FYS + 
Sbjct: 482  YVYAGMYRAKETIKKELV-KRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFYSMEG 540

Query: 383  YLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASPDVWWS 204
             +  E+ +G++ C+ K+  D + Q+++  +++ Y+   G+F  + AV  R+   P  WWS
Sbjct: 541  DMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLPAEWWS 600

Query: 203  AHGGHCAELQRIAVKILSQTCSGASRYMLKKAIS-EKLHAEVRDVTEQQRFRDLEFVHYN 27
             +GG C  L R+A+ +LSQTCS     + + +I  EKLH E R+  EQQRFRDL FV  N
Sbjct: 601  TYGGSCPNLARLAIHVLSQTCSTLG--LKQNSIPFEKLH-ETRNFLEQQRFRDLIFVQCN 657

Query: 26   RRL 18
             +L
Sbjct: 658  LQL 660


>ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250543 [Solanum
            lycopersicum]
          Length = 618

 Score =  300 bits (768), Expect = 2e-78
 Identities = 177/547 (32%), Positives = 280/547 (51%), Gaps = 9/547 (1%)
 Frame = -3

Query: 1631 KLKMPFPQQPPKAMNNLDFQIADTINGSHTSIPAIENIQPIVKEEVKDEI--------TL 1476
            K+K  +  +       L F +A    G   ++     + P+VK+    +I        + 
Sbjct: 23   KVKCKYCAKTVIGFYRLKFHLA----GIRGNVTPCSEVPPLVKQAFYAQIMGKKSCQSSQ 78

Query: 1475 HAARCIGRFFFDAGIDTTNINLPSFQAMIDAVICCGSGYSAPGLDELKGVIXXXXXXXXX 1296
              ++ IGRFF+++G+D   I LPSFQ M  A +  G     P   +LKG I         
Sbjct: 79   EISKSIGRFFYESGLDFDAIRLPSFQMMFKATLSPGQTVKFPSCQDLKGWILQDAVHEMQ 138

Query: 1295 XXXXXXKQSWQRTGCSILLDGWTDQRGRSFVSFLVSCPAGTIFLRXXXXXXXXXXXXXXX 1116
                  + SW RTGCSILLDGW D  GR+ ++ LV CP GTI+LR               
Sbjct: 139  LYVTEIRSSWPRTGCSILLDGWIDSNGRNLINILVYCPRGTIYLRSSDITSFYENPDAML 198

Query: 1115 SLISDXXXXXXXXXXXXXXXXESSDLIEEAGKRIMERYRSIFWTLCAEYCINMILQKIQM 936
              + +                 +S  +  AG+++M+  +++F+++ A  C+ ++LQ +  
Sbjct: 199  VFLEEILEEVGVENVVQIIAHSTSHWMIAAGEKLMDSCKTVFFSIDASRCMGLMLQNVTQ 258

Query: 935  LDSVKKVLDDAKAISRFIHSNPHTLSHLRTYIGTHSLVKMSNLKSVIPFITLKNMLSNRE 756
            +D + + L  AK + +FI+S+  T+  L        LVK S +K+++PF+TL+N++S ++
Sbjct: 259  IDWIGQALQKAKMLIQFIYSHTTTMKLLSDVFPGVELVKSSKVKAIVPFLTLQNIVSQKD 318

Query: 755  ILVGLFNSPGWDAADLASKWRGKCISKMVKDSSFWAAAVEVLKVTTPLLGILVEISRKDT 576
            +L+ +F S  W  + LAS   GK I++M++D+S W+      +VT PL+ ++  ++  + 
Sbjct: 319  VLIRMFQSSAWGTSQLASTSEGKRIAEMIEDASVWSNFGMAARVTIPLVEVIKYLNGTNK 378

Query: 575  SPMGVLYNSLDCAKEEIKENMGGEEA-RYSLYWALVDDVWNNYLHSPLHSAGYYLNPILF 399
               G + N L  AKE IK      +  R+   W  +++ W  YLHS LH AGYYLNP  F
Sbjct: 379  PQAGFISNRLYQAKEIIKMEFRSRQLWRHEETWNKIEETWKKYLHSDLHGAGYYLNPCYF 438

Query: 398  YSSDFYLDAEVTTGVVYCMVKMSKDPEEQEQMVIQLDKYRQAEGEFSGEEAVDGREKASP 219
            YSSD+   AE+T G+   + +++      + ++ Q  K    E +F G   +      SP
Sbjct: 439  YSSDWLGTAEITCGLCKTIDRIA---GHIKGLITQQIK----EFDFDGSREI--LPDISP 489

Query: 218  DVWWSAHGGHCAELQRIAVKILSQTCSGASRYMLKKAISEKLHAEVRDVTEQQRFRDLEF 39
              WW  +     EL+R AV+ILSQTC GAS Y LK+ + E LH + R   EQQR +DL F
Sbjct: 490  AQWWLKYEVEYPELERFAVRILSQTCDGASHYRLKRRLVETLHTKGRSEIEQQRLKDLVF 549

Query: 38   VHYNRRL 18
            VH N +L
Sbjct: 550  VHCNLQL 556



 Score = 59.3 bits (142), Expect = 7e-06
 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
 Frame = -3

Query: 1988 IEEHGTPIDAE--KKRVQCKYCLKEVNGFNRLKHHLAAVGNDVTACSEVPSGVKARMKDL 1815
            I +HG  +  E  K +V+CKYC K V GF RLK HLA +  +VT CSEVP  VK      
Sbjct: 8    IHDHGDKVVDENHKSKVKCKYCAKTVIGFYRLKFHLAGIRGNVTPCSEVPPLVKQAFYAQ 67

Query: 1814 LLEKRKERFMKEVGR 1770
            ++ K+  +  +E+ +
Sbjct: 68   IMGKKSCQSSQEISK 82

