BLASTX nr result

ID: Achyranthes22_contig00019497 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00019497
         (1849 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265...   825   0.0  
emb|CBI22554.3| unnamed protein product [Vitis vinifera]              825   0.0  
ref|XP_002513602.1| protein dimerization, putative [Ricinus comm...   821   0.0  
ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215...   804   0.0  
ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   802   0.0  
gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]     794   0.0  
ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298...   794   0.0  
ref|XP_002318364.1| predicted protein [Populus trichocarpa]           794   0.0  
gb|EOY23434.1| HAT transposon superfamily isoform 4 [Theobroma c...   791   0.0  
gb|EOY23432.1| HAT transposon superfamily isoform 2 [Theobroma c...   791   0.0  
gb|EOY23431.1| HAT transposon superfamily isoform 1 [Theobroma c...   791   0.0  
ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496...   789   0.0  
ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618...   783   0.0  
ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593...   780   0.0  
ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256...   778   0.0  
ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808...   776   0.0  
ref|XP_003602175.1| Protein dimerization [Medicago truncatula] g...   771   0.0  
ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, part...   722   0.0  
ref|NP_178092.4| hAT family dimerization domain-containing prote...   721   0.0  
gb|AAF68117.1|AC010793_12 F20B17.17 [Arabidopsis thaliana] gi|12...   718   0.0  

>ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265581 [Vitis vinifera]
          Length = 723

 Score =  825 bits (2130), Expect = 0.0
 Identities = 404/531 (76%), Positives = 462/531 (87%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMIEA+ KCG  F G SAE LKTTWLER+KS++ +  KD+EKEW  TGCTIIADTW
Sbjct: 194  SSYQLMIEAVSKCGHGFRGPSAEILKTTWLERIKSEVSLQSKDIEKEWATTGCTIIADTW 253

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ++G +NVVQII+D 
Sbjct: 254  TDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKYLADLFDSVIQDLGPDNVVQIIMDS 313

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
             L+Y+G+A+HI+QNYGT+F+SPCAS C+NLILEDFCKIDWVNRCI+QAQTI++FIYN+A+
Sbjct: 314  TLNYTGVASHIVQNYGTVFVSPCASQCLNLILEDFCKIDWVNRCILQAQTISKFIYNNAS 373

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKK TGGQD+I+TGITKSVSNFLSLQ++ K++ +LK M    +Y  NS   NK Q+
Sbjct: 374  MLDLMKKSTGGQDLIRTGITKSVSNFLSLQSMLKQRPRLKHMFGSSEYSTNSYS-NKPQN 432

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AIL+DN+FW+AVEEC+AI EPFL+ LREVS GKPAVG IYELMTKAKESIRTYYIM
Sbjct: 433  ISCIAILEDNDFWRAVEECVAISEPFLKGLREVSGGKPAVGSIYELMTKAKESIRTYYIM 492

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DESKCK FLDIVD +W NQLH+PLH+AAAFLNP IQYNPE+KF+ AIKEDFF VLEK+LP
Sbjct: 493  DESKCKAFLDIVDGRWRNQLHSPLHAAAAFLNPSIQYNPEIKFIGAIKEDFFKVLEKLLP 552

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            T D+R +ITNQILLF RATGMFGCNLA+EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 553  TSDMRRDITNQILLFTRATGMFGCNLAREARDTVPPGLWWEQFGDSAPVLQRVAIRILSQ 612

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCSTSTFE+ WNTFQQIH EKRNKIDKE LNDLV +NYNLKL +  K KSSE DPLQ DD
Sbjct: 613  VCSTSTFERHWNTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKMKSSEADPLQFDD 672

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D SDLNTRQF+ A FG++D  FGL
Sbjct: 673  IDMTSEWVEETENPSPTQWLDRFGSALDGSDLNTRQFNAAIFGSSDTIFGL 723


>emb|CBI22554.3| unnamed protein product [Vitis vinifera]
          Length = 731

 Score =  825 bits (2130), Expect = 0.0
 Identities = 404/531 (76%), Positives = 462/531 (87%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMIEA+ KCG  F G SAE LKTTWLER+KS++ +  KD+EKEW  TGCTIIADTW
Sbjct: 202  SSYQLMIEAVSKCGHGFRGPSAEILKTTWLERIKSEVSLQSKDIEKEWATTGCTIIADTW 261

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ++G +NVVQII+D 
Sbjct: 262  TDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKYLADLFDSVIQDLGPDNVVQIIMDS 321

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
             L+Y+G+A+HI+QNYGT+F+SPCAS C+NLILEDFCKIDWVNRCI+QAQTI++FIYN+A+
Sbjct: 322  TLNYTGVASHIVQNYGTVFVSPCASQCLNLILEDFCKIDWVNRCILQAQTISKFIYNNAS 381

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKK TGGQD+I+TGITKSVSNFLSLQ++ K++ +LK M    +Y  NS   NK Q+
Sbjct: 382  MLDLMKKSTGGQDLIRTGITKSVSNFLSLQSMLKQRPRLKHMFGSSEYSTNSYS-NKPQN 440

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AIL+DN+FW+AVEEC+AI EPFL+ LREVS GKPAVG IYELMTKAKESIRTYYIM
Sbjct: 441  ISCIAILEDNDFWRAVEECVAISEPFLKGLREVSGGKPAVGSIYELMTKAKESIRTYYIM 500

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DESKCK FLDIVD +W NQLH+PLH+AAAFLNP IQYNPE+KF+ AIKEDFF VLEK+LP
Sbjct: 501  DESKCKAFLDIVDGRWRNQLHSPLHAAAAFLNPSIQYNPEIKFIGAIKEDFFKVLEKLLP 560

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            T D+R +ITNQILLF RATGMFGCNLA+EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 561  TSDMRRDITNQILLFTRATGMFGCNLAREARDTVPPGLWWEQFGDSAPVLQRVAIRILSQ 620

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCSTSTFE+ WNTFQQIH EKRNKIDKE LNDLV +NYNLKL +  K KSSE DPLQ DD
Sbjct: 621  VCSTSTFERHWNTFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMKMKSSEADPLQFDD 680

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D SDLNTRQF+ A FG++D  FGL
Sbjct: 681  IDMTSEWVEETENPSPTQWLDRFGSALDGSDLNTRQFNAAIFGSSDTIFGL 731


>ref|XP_002513602.1| protein dimerization, putative [Ricinus communis]
            gi|223547510|gb|EEF49005.1| protein dimerization,
            putative [Ricinus communis]
          Length = 688

 Score =  821 bits (2120), Expect = 0.0
 Identities = 394/530 (74%), Positives = 462/530 (87%), Gaps = 4/530 (0%)
 Frame = -3

Query: 1844 SYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTWT 1665
            SYQLMIEAI KCG  F+G SAE LKTTWLER+KS++ + LKD EKEW  TGCTIIADTWT
Sbjct: 159  SYQLMIEAIEKCGPGFTGPSAEILKTTWLERIKSEVSLQLKDTEKEWTTTGCTIIADTWT 218

Query: 1664 DNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDGA 1485
            DNKSRA+INF VSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ+ G ENVVQII+D +
Sbjct: 219  DNKSRALINFFVSSPSRTFFHKSVDASSYFKNTKCLADLFDSVIQDFGAENVVQIIMDSS 278

Query: 1484 LSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAAL 1305
             +Y+G+ANHI+QNYGTIF+SPCAS C+NLILEDF K+DWVNRCI QAQT+++FIYN++++
Sbjct: 279  FNYTGVANHILQNYGTIFVSPCASQCLNLILEDFSKVDWVNRCISQAQTLSKFIYNNSSM 338

Query: 1304 LEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQSI 1125
            L+ MKKFTGGQ++IKTGITKSVS+FLSLQ++ K++ +LKLM +  +Y  NS+  +K QSI
Sbjct: 339  LDLMKKFTGGQELIKTGITKSVSSFLSLQSMLKQRPRLKLMFSSNEYSANSSYSSKPQSI 398

Query: 1124 SCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIMD 945
            +C  I++D +FW+AVEEC+AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIMD
Sbjct: 399  ACITIVEDGDFWRAVEECVAITEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIMD 458

Query: 944  ESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLPT 765
            ESKCKTFLDIVD+KW +QLH+PLHSAAAFLNPC+QYNPE+KFL  IKEDFF V+EK+LPT
Sbjct: 459  ESKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPCVQYNPEIKFLVNIKEDFFKVIEKLLPT 518

Query: 764  PDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQV 585
            PD+R +ITNQI +F RA+GMFGCNLA EARDTV PG WWEQYGDSAPVLQ+VAIRILSQV
Sbjct: 519  PDMRRDITNQIFIFTRASGMFGCNLAMEARDTVAPGLWWEQYGDSAPVLQRVAIRILSQV 578

Query: 584  CSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKL-GKFKSKSSEVDPLQLDDI 408
            CST TFE+ WNTF+QIH EKRNKIDKE LNDLV +NYNLKL  + ++KSSE DP+Q DDI
Sbjct: 579  CSTFTFERHWNTFRQIHSEKRNKIDKETLNDLVYINYNLKLMRQMRTKSSETDPIQFDDI 638

Query: 407  DMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            DMTSEWVEE +  SPTQWLDRFGS  D SDLNTRQF+ A FGA+D  FGL
Sbjct: 639  DMTSEWVEETDNPSPTQWLDRFGSALDGSDLNTRQFNAAIFGASDPLFGL 688


>ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215128, partial [Cucumis
            sativus]
          Length = 685

 Score =  804 bits (2076), Expect = 0.0
 Identities = 387/526 (73%), Positives = 453/526 (86%), Gaps = 2/526 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AIGKCG  F+G SAE LKTTWLER+K+++ +  KD+EKEW  TGCTII DTW
Sbjct: 156  SSYQLMIDAIGKCGPGFTGPSAETLKTTWLERIKTEVSLQSKDIEKEWTTTGCTIIVDTW 215

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD STYFKN K L DLFDSVIQ+ GHENVVQII+D 
Sbjct: 216  TDNKSRALINFLVSSPSRTFFHKSVDASTYFKNTKCLGDLFDSVIQDFGHENVVQIIMDS 275

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            +L+YSG ANHI+Q YGTIF+SPCAS C+N ILE+F K+DWVNRCI+QAQTI++F+YNS++
Sbjct: 276  SLNYSGTANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSS 335

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ M++FTGGQ++I+TGI+K VS+FLSLQ+I K++S+LK M N  DY  NS   NK QS
Sbjct: 336  LLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPDYTTNSYA-NKPQS 394

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AI++DN+FW+AVEEC+AI EPFLR+LREV  GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 395  ISCIAIIEDNDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIM 454

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVD+KW +QLH+PLH+AAAFLNP IQYNPE+KFLT+IKEDFFNVLEK+LP
Sbjct: 455  DEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLP 514

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             P++R +ITNQI  F +A GMFGC+LA EARDTV P  WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 515  LPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQ 574

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST +FE+ W+ FQQIH EKRNKIDKE LNDLV +NYNLKL +  ++K  E DP+Q DD
Sbjct: 575  VCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFDD 634

Query: 410  IDMTSEWVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND 276
            IDMTSEWVEE E  SPTQWLDRFG S D SDLNTRQF+ A FGAND
Sbjct: 635  IDMTSEWVEESENQSPTQWLDRFGSSLDGSDLNTRQFNAAMFGAND 680


>ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101215128 [Cucumis
            sativus]
          Length = 784

 Score =  802 bits (2071), Expect = 0.0
 Identities = 386/526 (73%), Positives = 452/526 (85%), Gaps = 2/526 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AIGKCG  F+G SAE LKTTWLER+K+++ +  KD+EKEW  TGCTII DTW
Sbjct: 255  SSYQLMIDAIGKCGPGFTGPSAETLKTTWLERIKTEVSLQSKDIEKEWTTTGCTIIVDTW 314

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INF VSSPSRTFFHKSVD STYFKN K L DLFDSVIQ+ GHENVVQII+D 
Sbjct: 315  TDNKSRALINFXVSSPSRTFFHKSVDASTYFKNTKCLGDLFDSVIQDFGHENVVQIIMDS 374

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            +L+YSG ANHI+Q YGTIF+SPCAS C+N ILE+F K+DWVNRCI+QAQTI++F+YNS++
Sbjct: 375  SLNYSGTANHILQTYGTIFVSPCASQCLNSILEEFSKVDWVNRCILQAQTISKFLYNSSS 434

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ M++FTGGQ++I+TGI+K VS+FLSLQ+I K++S+LK M N  DY  NS   NK QS
Sbjct: 435  LLDLMRRFTGGQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPDYTTNSYA-NKPQS 493

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AI++DN+FW+AVEEC+AI EPFLR+LREV  GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 494  ISCIAIIEDNDFWRAVEECVAISEPFLRVLREVCGGKPAVGCIYELMTRAKESIRTYYIM 553

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVD+KW +QLH+PLH+AAAFLNP IQYNPE+KFLT+IKEDFFNVLEK+LP
Sbjct: 554  DEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFNVLEKLLP 613

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             P++R +ITNQI  F +A GMFGC+LA EARDTV P  WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 614  LPEMRRDITNQIFTFTKANGMFGCSLAMEARDTVSPWLWWEQFGDSAPVLQRVAIRILSQ 673

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST +FE+ W+ FQQIH EKRNKIDKE LNDLV +NYNLKL +  ++K  E DP+Q DD
Sbjct: 674  VCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLVYINYNLKLARQMRTKPLESDPIQFDD 733

Query: 410  IDMTSEWVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND 276
            IDMTSEWVEE E  SPTQWLDRFG S D SDLNTRQF+ A FGAND
Sbjct: 734  IDMTSEWVEESENQSPTQWLDRFGSSLDGSDLNTRQFNAAMFGAND 779


>gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]
          Length = 694

 Score =  794 bits (2051), Expect = 0.0
 Identities = 380/531 (71%), Positives = 459/531 (86%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLM++AI KCG  F+G SAE LKTTWLER+KS++ +  KD+EKEW+ TGCTIIADTW
Sbjct: 164  SSYQLMVDAIAKCGPGFTGPSAETLKTTWLERIKSEMSLQSKDIEKEWMTTGCTIIADTW 223

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S YFKN+K LADLFDSVIQ+ G +NVVQ+I+D 
Sbjct: 224  TDNKSRALINFLVSSPSRTFFHKSVDASAYFKNMKCLADLFDSVIQDFGPDNVVQVIMDS 283

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G+ANHI+QNY TIF+SPC S C+NLILE+F K+DWVNRCI+Q QTI++FIYNSA+
Sbjct: 284  SFNYTGVANHILQNYSTIFVSPCVSQCLNLILEEFSKVDWVNRCILQGQTISKFIYNSAS 343

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKK+TGGQ++I+TGITKSVS+FLSLQ+I K+KS+LK M N  +Y  NS  VNK QS
Sbjct: 344  MLDLMKKYTGGQELIRTGITKSVSSFLSLQSILKQKSRLKHMFNSPEYCTNSLYVNKPQS 403

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC +I++D++FW+AVEE +AI EPFL++LREV+ GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 404  ISCISIVEDSDFWRAVEESVAISEPFLKVLREVAGGKPAVGSIYELMTRAKESIRTYYIM 463

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCKTFLDIVD+KW +QLH+PLHSAAAFLNP IQYNPE+KFL++IKEDFF VLEK+LP
Sbjct: 464  DENKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPSIQYNPEIKFLSSIKEDFFKVLEKLLP 523

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             P++R +IT+QI  F +A  MFGC+LA EARD V PG WWEQYGDSAPVLQ+VAIRILSQ
Sbjct: 524  LPEMRRDITSQIFTFTKAMSMFGCSLAMEARDVVSPGLWWEQYGDSAPVLQRVAIRILSQ 583

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKF-KSKSSEVDPLQLDD 411
            VCS+ TFE+ W+ FQQIH EKRNKID+E LNDLV +NYNLKL +  ++KS E DP+Q DD
Sbjct: 584  VCSSFTFERHWSAFQQIHSEKRNKIDRETLNDLVYINYNLKLARHTRTKSIEADPIQFDD 643

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE + SSP+QWLDRFGS  D SDLNTRQ++ A FG+ND  FGL
Sbjct: 644  IDMTSEWVEESDNSSPSQWLDRFGSALDGSDLNTRQYNAAIFGSNDHIFGL 694


>ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298657 [Fragaria vesca
            subsp. vesca]
          Length = 681

 Score =  794 bits (2051), Expect = 0.0
 Identities = 383/531 (72%), Positives = 456/531 (85%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AI KCG  F+G SAE LKTTWLERVK+++ +  KD+EKEW  TGCTIIADTW
Sbjct: 151  SSYQLMIDAITKCGPGFTGPSAETLKTTWLERVKTEMSLQSKDIEKEWTTTGCTIIADTW 210

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S YFKN K LA+LFDSVIQ+ G ENVVQII+D 
Sbjct: 211  TDNKSRALINFLVSSPSRTFFHKSVDASAYFKNTKCLAELFDSVIQDFGPENVVQIIMDS 270

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G+ANHI+ NY TIF+SPCAS C+NLILE+F K+DWVNRC +QAQTI++FIYN+A+
Sbjct: 271  SFNYTGVANHILTNYTTIFVSPCASQCLNLILEEFSKVDWVNRCFLQAQTISKFIYNNAS 330

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MK+FTGGQD+I+TGITKSVS+FLSLQ I K++S+LK M N  ++  NS+  NK QS
Sbjct: 331  MLDLMKRFTGGQDLIRTGITKSVSSFLSLQTILKQRSRLKHMFNSPEFCTNSSYANKTQS 390

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC +I++DN+FW+A EE +AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 391  ISCISIMEDNDFWRAAEESVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIM 450

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCK FLDIVD+KW +QLH+PLH+AAAFLNP IQYNPE+KFLT+IKEDFF VLEK+LP
Sbjct: 451  DENKCKVFLDIVDRKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLTSIKEDFFKVLEKLLP 510

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            +P++R +ITNQI  F +ATGMFGC+LA EARD V PG WWEQYGDSAPVLQ+VAIRILSQ
Sbjct: 511  SPEMRRDITNQIFTFTKATGMFGCSLAMEARDVVSPGLWWEQYGDSAPVLQRVAIRILSQ 570

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST TFEK W+ FQQIH EKRNKID+E LNDLV +NYNL+L K  ++K+ E DP+  DD
Sbjct: 571  VCSTFTFEKHWSAFQQIHSEKRNKIDRETLNDLVYINYNLRLSKQTRNKNVEADPILFDD 630

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE ++ SPTQWLDRFGS  D SDLNTRQF+ A FG+ND  FGL
Sbjct: 631  IDMTSEWVEESDSPSPTQWLDRFGSALDGSDLNTRQFNAAIFGSNDHIFGL 681


>ref|XP_002318364.1| predicted protein [Populus trichocarpa]
          Length = 566

 Score =  794 bits (2050), Expect = 0.0
 Identities = 381/530 (71%), Positives = 457/530 (86%), Gaps = 4/530 (0%)
 Frame = -3

Query: 1844 SYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTWT 1665
            SYQLM++AIGKCGA F+G SA+ L+TTWLER+KS++ +  KD EKEW  TGCTIIADTWT
Sbjct: 38   SYQLMVDAIGKCGAGFTGPSADMLRTTWLERIKSEVSLQTKDAEKEWATTGCTIIADTWT 97

Query: 1664 DNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDGA 1485
            DNKSRA+INFLVSSPSRTFFHKSVD S+ FKN K LADLFDSVIQ+ G ENVVQII+D +
Sbjct: 98   DNKSRALINFLVSSPSRTFFHKSVDASSIFKNTKCLADLFDSVIQDFGAENVVQIIMDSS 157

Query: 1484 LSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAAL 1305
             +Y+G+ANHI+QNYGTIF+SPCAS C+NLILE+F K+DWVN+CI+QAQTI++ IYNS ++
Sbjct: 158  FNYTGIANHILQNYGTIFVSPCASQCLNLILEEFSKVDWVNKCILQAQTISKVIYNSVSI 217

Query: 1304 LEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQSI 1125
            L+ MKKFTGGQ++IKTGITK VSNFLSLQ++ K++S+LK M+N  ++ +NS+  N  ++I
Sbjct: 218  LDLMKKFTGGQELIKTGITKPVSNFLSLQSMLKQRSRLKQMLNSPEFSMNSSYANNPKNI 277

Query: 1124 SCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIMD 945
            +C AI++D +FW+AVEE +AI EPFL+++REVS GKPAVG IYELMT+AKESIRTYYIMD
Sbjct: 278  ACIAIIEDGDFWRAVEESVAISEPFLKVMREVSGGKPAVGSIYELMTRAKESIRTYYIMD 337

Query: 944  ESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLPT 765
            ESKCKTFLDIVD+KW  QLH+PLHSAAAFLNP +QYNPE+KFL +IKEDFF V+EK+LPT
Sbjct: 338  ESKCKTFLDIVDRKWGGQLHSPLHSAAAFLNPSVQYNPEIKFLVSIKEDFFKVIEKLLPT 397

Query: 764  PDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQV 585
            PD+R +ITNQI +F RA+GMFGC+LA EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQV
Sbjct: 398  PDMRRDITNQIFIFTRASGMFGCSLAMEARDTVAPGLWWEQFGDSAPVLQRVAIRILSQV 457

Query: 584  CSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDDI 408
            CST TFEK W+TFQQIH EKRNKIDKE LNDL  +NYNLKL +  ++K  E DP+Q DDI
Sbjct: 458  CSTFTFEKHWSTFQQIHSEKRNKIDKETLNDLAYINYNLKLTRQMRTKPLEADPIQYDDI 517

Query: 407  DMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            DMTSEWVEE +  SPTQWLDRFGS  D SDLNTR F+ A FG+ND  FGL
Sbjct: 518  DMTSEWVEESDNPSPTQWLDRFGSALDGSDLNTR-FNAAIFGSNDHLFGL 566


>gb|EOY23434.1| HAT transposon superfamily isoform 4 [Theobroma cacao]
          Length = 682

 Score =  791 bits (2044), Expect = 0.0
 Identities = 384/531 (72%), Positives = 453/531 (85%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQ MI+A+GK G  F+G S E LKT WLER+KS++ +  KD EKEW  TGCTIIADTW
Sbjct: 153  SSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTW 212

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ+ G ENVVQII+D 
Sbjct: 213  TDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDS 272

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G++NHI+QNYGTIF+SPCAS C+NLILE+F K+DWVNRCI+QAQT+++F+YN+A+
Sbjct: 273  SFNYTGISNHILQNYGTIFVSPCASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNAS 332

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKKFTG Q++I+TGITKSVS+FLSLQ++ K++S+LK M N  +Y  NS+  NK QS
Sbjct: 333  MLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSRLKHMFNSPEYSTNSSYANKPQS 392

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AI++DN+FW+AV+EC+AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 393  ISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIM 452

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVD+KW +QLH+PLHSA AFLNP IQYN E+KFL +IKEDFF VLEK+LP
Sbjct: 453  DEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLP 512

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TP+LR +ITNQI  F RA GMF CNLA EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 513  TPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGLWWEQFGDSAPVLQRVAIRILSQ 572

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST TFE+ W+TFQQIH EKRNKIDKE LNDLV +NYNL+L +  ++KS E DP+Q DD
Sbjct: 573  VCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINYNLRLARQMRTKSVEADPIQFDD 632

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D  DLNTRQF+ A FG ND  FGL
Sbjct: 633  IDMTSEWVEESENPSPTQWLDRFGSALDGGDLNTRQFNAAIFG-NDHIFGL 682


>gb|EOY23432.1| HAT transposon superfamily isoform 2 [Theobroma cacao]
            gi|508776177|gb|EOY23433.1| HAT transposon superfamily
            isoform 2 [Theobroma cacao]
          Length = 678

 Score =  791 bits (2044), Expect = 0.0
 Identities = 384/531 (72%), Positives = 453/531 (85%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQ MI+A+GK G  F+G S E LKT WLER+KS++ +  KD EKEW  TGCTIIADTW
Sbjct: 149  SSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTW 208

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ+ G ENVVQII+D 
Sbjct: 209  TDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDS 268

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G++NHI+QNYGTIF+SPCAS C+NLILE+F K+DWVNRCI+QAQT+++F+YN+A+
Sbjct: 269  SFNYTGISNHILQNYGTIFVSPCASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNAS 328

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKKFTG Q++I+TGITKSVS+FLSLQ++ K++S+LK M N  +Y  NS+  NK QS
Sbjct: 329  MLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSRLKHMFNSPEYSTNSSYANKPQS 388

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AI++DN+FW+AV+EC+AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 389  ISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIM 448

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVD+KW +QLH+PLHSA AFLNP IQYN E+KFL +IKEDFF VLEK+LP
Sbjct: 449  DEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLP 508

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TP+LR +ITNQI  F RA GMF CNLA EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 509  TPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGLWWEQFGDSAPVLQRVAIRILSQ 568

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST TFE+ W+TFQQIH EKRNKIDKE LNDLV +NYNL+L +  ++KS E DP+Q DD
Sbjct: 569  VCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINYNLRLARQMRTKSVEADPIQFDD 628

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D  DLNTRQF+ A FG ND  FGL
Sbjct: 629  IDMTSEWVEESENPSPTQWLDRFGSALDGGDLNTRQFNAAIFG-NDHIFGL 678


>gb|EOY23431.1| HAT transposon superfamily isoform 1 [Theobroma cacao]
          Length = 640

 Score =  791 bits (2044), Expect = 0.0
 Identities = 384/531 (72%), Positives = 453/531 (85%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQ MI+A+GK G  F+G S E LKT WLER+KS++ +  KD EKEW  TGCTIIADTW
Sbjct: 111  SSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTW 170

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKSRA+INFLVSSPSRTFFHKSVD S+YFKN K LADLFDSVIQ+ G ENVVQII+D 
Sbjct: 171  TDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDS 230

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G++NHI+QNYGTIF+SPCAS C+NLILE+F K+DWVNRCI+QAQT+++F+YN+A+
Sbjct: 231  SFNYTGISNHILQNYGTIFVSPCASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNAS 290

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKKFTG Q++I+TGITKSVS+FLSLQ++ K++S+LK M N  +Y  NS+  NK QS
Sbjct: 291  MLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSRLKHMFNSPEYSTNSSYANKPQS 350

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ISC AI++DN+FW+AV+EC+AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 351  ISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIM 410

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVD+KW +QLH+PLHSA AFLNP IQYN E+KFL +IKEDFF VLEK+LP
Sbjct: 411  DEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLP 470

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TP+LR +ITNQI  F RA GMF CNLA EARDTV PG WWEQ+GDSAPVLQ+VAIRILSQ
Sbjct: 471  TPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGLWWEQFGDSAPVLQRVAIRILSQ 530

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST TFE+ W+TFQQIH EKRNKIDKE LNDLV +NYNL+L +  ++KS E DP+Q DD
Sbjct: 531  VCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINYNLRLARQMRTKSVEADPIQFDD 590

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D  DLNTRQF+ A FG ND  FGL
Sbjct: 591  IDMTSEWVEESENPSPTQWLDRFGSALDGGDLNTRQFNAAIFG-NDHIFGL 640


>ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496447 isoform X1 [Cicer
            arietinum] gi|502136218|ref|XP_004502604.1| PREDICTED:
            uncharacterized protein LOC101496447 isoform X2 [Cicer
            arietinum]
          Length = 679

 Score =  789 bits (2037), Expect = 0.0
 Identities = 381/531 (71%), Positives = 459/531 (86%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AIGKCG  F+G SAE LKTTWLER+KS++ +  KDVEKEW  TGCTIIADTW
Sbjct: 149  SSYQLMIDAIGKCGPGFTGPSAEILKTTWLERIKSEVGLQSKDVEKEWATTGCTIIADTW 208

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TD KS+A+INFLVSSPSRTFFHKSVD S YFKN K LADLFDSVIQE G ENVVQII+D 
Sbjct: 209  TDYKSKAIINFLVSSPSRTFFHKSVDASAYFKNTKWLADLFDSVIQEFGPENVVQIIMDS 268

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G+ANHI+QNYGTIF+SPCAS C+NLILE+F K+DW++RCI+QAQTI++ IYN+A+
Sbjct: 269  SFNYTGIANHIVQNYGTIFVSPCASQCLNLILEEFTKVDWISRCILQAQTISKLIYNNAS 328

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ MKK++GGQ++I+TG+TKSVS FLSLQ++ K +++LK M +  +Y  N++  NK QS
Sbjct: 329  LLDLMKKYSGGQELIRTGVTKSVSTFLSLQSMLKLRTRLKHMFHSPEYASNTSYANKPQS 388

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            +SC AI +D +FW+ VEEC+AI EPFL++LREVSEGKP VG IYELMT+AKESIRTYYIM
Sbjct: 389  LSCIAIAEDGDFWRTVEECVAISEPFLKVLREVSEGKPIVGSIYELMTRAKESIRTYYIM 448

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCKTFLDIVDKKW +QLH+PLH+AAAFLNP IQYNPE+KFL++IKEDFFNVLEK+LP
Sbjct: 449  DENKCKTFLDIVDKKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLSSIKEDFFNVLEKLLP 508

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             PD+R +ITNQI  F +A GMFGC+LA+EAR+TV P  WWEQYGDSAP LQ+VAIRILSQ
Sbjct: 509  VPDMRRDITNQIYTFTKAHGMFGCSLAREARNTVAPWLWWEQYGDSAPGLQRVAIRILSQ 568

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST +F++QW+TF+QIH EK+NKID+E LNDLV +NYNLKL K   +KS EVD LQ DD
Sbjct: 569  VCSTFSFQRQWSTFRQIHSEKKNKIDRETLNDLVYINYNLKLTKQVNAKSLEVDLLQSDD 628

Query: 410  IDMTSEWVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE ET+SPTQWLDRFG + D +DLNTRQF ++ FGAND  FGL
Sbjct: 629  IDMTSEWVEENETASPTQWLDRFGPALDGNDLNTRQFGSSIFGANDPIFGL 679


>ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis]
          Length = 764

 Score =  783 bits (2023), Expect = 0.0
 Identities = 382/530 (72%), Positives = 453/530 (85%), Gaps = 3/530 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQ MI+A+GKCG  F+G SAEALKT WL+R+KS+++V  KD+EKEW  TGCTIIADTW
Sbjct: 237  SSYQQMIDAVGKCGPGFTGPSAEALKTMWLDRIKSEVNVQSKDIEKEWAMTGCTIIADTW 296

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNKS+A+INFLVSSPSRTFF KSVD S+ FKN K LAD+FDSVIQ+IG ENVVQII+D 
Sbjct: 297  TDNKSKALINFLVSSPSRTFFLKSVDTSSNFKNTKYLADIFDSVIQDIGPENVVQIIMDS 356

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G+ANHI+QNYGTIF+SPCAS  +N+ILE+F K+DWVNRCI+QAQTI++FIYN+A+
Sbjct: 357  SFNYTGVANHILQNYGTIFVSPCASQSLNIILEEFSKVDWVNRCILQAQTISKFIYNNAS 416

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            +L+ MKKFTGG ++I+TGITK VSNFLSLQ+I K++S+LK M N  +Y  +S   NK QS
Sbjct: 417  MLDLMKKFTGGLELIRTGITKYVSNFLSLQSILKQRSRLKHMFNSPEYSTSSPYANKPQS 476

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            +SC +I++DN+FW+AVEE +AI EPFL++LREVS GKPAVG IYELMT+AKESIRTYYIM
Sbjct: 477  LSCISIVEDNDFWRAVEESVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIM 536

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCK FLDIVD+ W  QLH+PLHSAAAFLNP IQYNPE+KFL +IKEDFFNVLEK+LP
Sbjct: 537  DENKCKIFLDIVDRNWRGQLHSPLHSAAAFLNPSIQYNPEIKFLGSIKEDFFNVLEKLLP 596

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TPD R +IT QIL F RA+GMFGC LA EAR+TV PG WWEQYGDSAPVLQ+VAIRILSQ
Sbjct: 597  TPDTRRDITTQILTFSRASGMFGCKLAMEARETVPPGLWWEQYGDSAPVLQRVAIRILSQ 656

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKFKSKSSEVDPLQLDDI 408
            VCS+ +FE+ W+TFQQIH EKRNKIDKE LNDLV ++YNLKL   ++KS E DPLQ DDI
Sbjct: 657  VCSSFSFERHWSTFQQIHSEKRNKIDKETLNDLVYISYNLKLA--RTKSVEADPLQFDDI 714

Query: 407  DMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            DMTSEWVEE E  SP QWLDRFGS  D SDLNTRQFS + F +ND  FGL
Sbjct: 715  DMTSEWVEESEHHSPHQWLDRFGSALDGSDLNTRQFSASMFSSNDPIFGL 764


>ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593027 isoform X1 [Solanum
            tuberosum] gi|565367925|ref|XP_006350605.1| PREDICTED:
            uncharacterized protein LOC102593027 isoform X2 [Solanum
            tuberosum]
          Length = 675

 Score =  780 bits (2015), Expect = 0.0
 Identities = 376/531 (70%), Positives = 445/531 (83%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSY  MIEA+GKCG+ F G S E LK TWLER+KS++ +  KDVEKEW  TGCT+IA+TW
Sbjct: 145  SSYHQMIEAVGKCGSGFIGPSPETLKATWLERIKSEVSLQSKDVEKEWAMTGCTLIAETW 204

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNK +A+INFLVSSPSRTFF+KSVD S+YFKN+K L++LFDS+IQ+ G ENVVQ+IVD 
Sbjct: 205  TDNKMKALINFLVSSPSRTFFYKSVDASSYFKNLKCLSELFDSIIQDFGPENVVQVIVDN 264

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
             L  +G+ NHI+QNYG +F+SPCAS C+N IL++F K+DWVNRCI+QAQ+I++FIYN++ 
Sbjct: 265  TLHCTGIVNHILQNYGNVFVSPCASQCINAILDEFSKLDWVNRCILQAQSISKFIYNNSP 324

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ MKKFTGGQ+IIKTGITKSVSNFLSLQ + K +S+LK++ N  +   NSA  NK QS
Sbjct: 325  LLDLMKKFTGGQEIIKTGITKSVSNFLSLQCLLKHRSRLKVIFNSPELAANSAYTNKSQS 384

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ++C AILDDN+FW+  EEC+A+ EPFL+++REVS GKPAVG IYEL+T+AKESIRTYYIM
Sbjct: 385  VNCIAILDDNDFWRTAEECVAVSEPFLKVMREVSGGKPAVGTIYELLTRAKESIRTYYIM 444

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVDK W N LH+PLHSAAAFLNP IQYN EVKFL +IKEDFF VLEK+LP
Sbjct: 445  DEIKCKTFLDIVDKNWKNNLHSPLHSAAAFLNPGIQYNREVKFLGSIKEDFFRVLEKLLP 504

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TP+LR +IT QILL+ RA+GMFGCNLAKEA DTV PG WWEQYGD+AP LQ+VAI+ILSQ
Sbjct: 505  TPELRRDITTQILLYTRASGMFGCNLAKEAIDTVPPGIWWEQYGDAAPTLQRVAIKILSQ 564

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKF-KSKSSEVDPLQLDD 411
            VCST TFE+ W+TFQQIH EKRNKIDKE L DLV +NYNLKL ++  SK  E DPLQLDD
Sbjct: 565  VCSTFTFERHWSTFQQIHSEKRNKIDKETLLDLVYINYNLKLARYLVSKPPEEDPLQLDD 624

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D +DLNTRQF+ A FG  D  FGL
Sbjct: 625  IDMTSEWVEEAENPSPTQWLDRFGSGLDGNDLNTRQFTAAIFGPGDNIFGL 675


>ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256946 [Solanum
            lycopersicum]
          Length = 739

 Score =  778 bits (2010), Expect = 0.0
 Identities = 374/531 (70%), Positives = 444/531 (83%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSY  MIEA+GKCG+ F G S E LK TWLER+KS++ +  KDVEKEW  TGCT+IA+TW
Sbjct: 209  SSYHQMIEAVGKCGSGFIGPSPETLKATWLERIKSEVSLQSKDVEKEWAMTGCTLIAETW 268

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TDNK +A+INFLVSSPSRTFF+KSVD S+YFKN+K L++LFDS+IQ+ G ENVVQ+IVD 
Sbjct: 269  TDNKMKALINFLVSSPSRTFFYKSVDASSYFKNLKCLSELFDSIIQDFGPENVVQVIVDN 328

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
             L  +G+ NHI+QNYG +F+SPCAS C+N IL++F K+DWVNRCI+QAQ++++FIYN++ 
Sbjct: 329  TLHCTGIVNHILQNYGNVFVSPCASQCINAILDEFSKLDWVNRCILQAQSLSKFIYNNSP 388

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ MKKFTGGQ+IIKTGITKSVSNFLSLQ + K +S+LK++ N  +   NSA  NK QS
Sbjct: 389  LLDLMKKFTGGQEIIKTGITKSVSNFLSLQCLLKHRSRLKVIFNSPELAANSAYTNKSQS 448

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            ++C  ILDDN+FW+  EEC+A+ EPFL+++REVS GKPAVG IYEL+T+AKESIRTYYIM
Sbjct: 449  VNCITILDDNDFWRTAEECVAVSEPFLKVMREVSGGKPAVGTIYELLTRAKESIRTYYIM 508

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE KCKTFLDIVDK W N LH+PLHSAAAFLNP IQYNPEVKFL +IKEDFF VLEK+LP
Sbjct: 509  DEIKCKTFLDIVDKNWKNNLHSPLHSAAAFLNPGIQYNPEVKFLGSIKEDFFRVLEKLLP 568

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
            TP+LR +IT QILL+ RA+GMFGCNLAKEA DTV PG WWEQYGD+AP LQ+VAI+ILSQ
Sbjct: 569  TPELRRDITTQILLYTRASGMFGCNLAKEAIDTVPPGIWWEQYGDAAPTLQRVAIKILSQ 628

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKF-KSKSSEVDPLQLDD 411
            VCST T E+ W+TFQQIH EKRNKIDKE L DLV +NYNLKL ++  SK  E DPLQLDD
Sbjct: 629  VCSTFTCERHWSTFQQIHSEKRNKIDKETLLDLVYINYNLKLARYLVSKPPEEDPLQLDD 688

Query: 410  IDMTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE E  SPTQWLDRFGS  D +DLNTRQF+ A FG  D  FGL
Sbjct: 689  IDMTSEWVEEAENPSPTQWLDRFGSGLDGNDLNTRQFTAAIFGPGDNIFGL 739


>ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808813 isoform X1 [Glycine
            max] gi|571460166|ref|XP_006581619.1| PREDICTED:
            uncharacterized protein LOC100808813 isoform X2 [Glycine
            max]
          Length = 679

 Score =  776 bits (2003), Expect = 0.0
 Identities = 374/531 (70%), Positives = 453/531 (85%), Gaps = 4/531 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AI KCG  F+G SAE LKT WLER+KS++ +  KDVEKEW  TGCTI+ADTW
Sbjct: 149  SSYQLMIDAIAKCGPGFTGPSAETLKTIWLERMKSEVGLQTKDVEKEWATTGCTILADTW 208

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TD KS+A+INFLVSSPSRTFFHKSVD S YFKN K LADLFDSVIQE G ENVVQII+D 
Sbjct: 209  TDYKSKAIINFLVSSPSRTFFHKSVDASAYFKNTKWLADLFDSVIQEFGPENVVQIIMDS 268

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            +++Y+ +ANHI+Q+YGTIF+SPCAS C+NLILE+F K+DW++RCI+QAQTI++ IYN+A+
Sbjct: 269  SVNYTVIANHIVQSYGTIFVSPCASQCLNLILEEFSKVDWISRCILQAQTISKLIYNNAS 328

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+  KK+TGGQ++I+TGITKSVS FLSLQ++ K +++LK M +  +Y  N++  NK QS
Sbjct: 329  LLDLTKKYTGGQELIRTGITKSVSTFLSLQSMLKLRTRLKNMFHSHEYASNTSYANKPQS 388

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            +SC  I +D +FW+ VEEC+AI EPFL++LRE+SEGKP VG IYELMT+AKESIRTYYIM
Sbjct: 389  LSCITIAEDGDFWRTVEECVAISEPFLKVLREISEGKPTVGSIYELMTRAKESIRTYYIM 448

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCK FLDIVDKKW +QLH+PLH+AAAFLNP IQYNPE+KF+++IKEDFFNVLEK+LP
Sbjct: 449  DENKCKKFLDIVDKKWRDQLHSPLHAAAAFLNPSIQYNPEIKFISSIKEDFFNVLEKLLP 508

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             PD+R +ITNQI  F +A GMFGC+LAKEAR+TV P  WWEQYGDSAP LQ+VAIRILSQ
Sbjct: 509  VPDMRRDITNQIYTFTKAHGMFGCSLAKEARNTVAPWLWWEQYGDSAPGLQRVAIRILSQ 568

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST +F +QW+T +QIH EKRNKID+E LNDLV +NYNLKL +   +KSSEVD LQ DD
Sbjct: 569  VCSTFSFHRQWSTIRQIHSEKRNKIDRETLNDLVYINYNLKLARQMSAKSSEVDLLQFDD 628

Query: 410  IDMTSEWVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE ET+SPTQWLDRFG + D +DLNTRQF ++ FGAND  FGL
Sbjct: 629  IDMTSEWVEENETASPTQWLDRFGPALDGNDLNTRQFGSSIFGANDPIFGL 679


>ref|XP_003602175.1| Protein dimerization [Medicago truncatula]
            gi|355491223|gb|AES72426.1| Protein dimerization
            [Medicago truncatula]
          Length = 786

 Score =  771 bits (1990), Expect = 0.0
 Identities = 375/532 (70%), Positives = 452/532 (84%), Gaps = 5/532 (0%)
 Frame = -3

Query: 1847 SSYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTW 1668
            SSYQLMI+AI KCG  F+G SAE LKT WLER+KS++ +  KDVEKEW  TGCTIIADTW
Sbjct: 255  SSYQLMIDAITKCGPGFTGPSAEILKTIWLERIKSEVGLQSKDVEKEWATTGCTIIADTW 314

Query: 1667 TDNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDG 1488
            TD KS+A+INFLVSSPSR FFHKSVD S YFKN K LADLFDSVIQE G ENVVQII+D 
Sbjct: 315  TDYKSKAIINFLVSSPSRIFFHKSVDASAYFKNTKWLADLFDSVIQEFGPENVVQIIMDS 374

Query: 1487 ALSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAA 1308
            + +Y+G+ NHI+QNYGTIF+SPCAS C+NLILE+F KIDW++RCI+QAQTI++ IYN+A+
Sbjct: 375  SFNYTGIGNHIVQNYGTIFVSPCASQCLNLILEEFTKIDWISRCILQAQTISKLIYNNAS 434

Query: 1307 LLEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQS 1128
            LL+ MK ++GGQ++I+TG TKSVS FLSLQ + K +++LK M +  +Y ++++  NK QS
Sbjct: 435  LLDLMKSYSGGQELIRTGATKSVSTFLSLQTMLKLRTRLKHMFHSPEYALDTSYANKPQS 494

Query: 1127 ISCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIM 948
            +SC AI +D +FW+ VEEC+AI EPFL++LREVSEGKP VG IYELMT+AKESIRTYYIM
Sbjct: 495  LSCIAIAEDGDFWRTVEECVAISEPFLKVLREVSEGKPTVGSIYELMTRAKESIRTYYIM 554

Query: 947  DESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLP 768
            DE+KCKTFLDIVDKKW +QLH+PLH+AAAFLNP IQYNPE+KFL++IKEDF++VLEK+LP
Sbjct: 555  DENKCKTFLDIVDKKWRDQLHSPLHAAAAFLNPSIQYNPEIKFLSSIKEDFYHVLEKLLP 614

Query: 767  TPDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQ 588
             PD+R +ITNQI  F +A GMFGC+LAKEAR+TV P  WWEQYGDSAP LQ+VAIRILSQ
Sbjct: 615  VPDMRRDITNQIYTFTKAHGMFGCSLAKEARNTVAPWLWWEQYGDSAPGLQRVAIRILSQ 674

Query: 587  VCSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGK-FKSKSSEVDPLQLDD 411
            VCST +F++QW+TF+QIH EK+NKID+E LNDLV +NYNLKL +   +KS EVD LQ DD
Sbjct: 675  VCSTFSFQRQWSTFRQIHSEKKNKIDRETLNDLVYINYNLKLNRQMSAKSLEVDLLQFDD 734

Query: 410  IDMTSEWVEEIET-SSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND--FGL 267
            IDMTSEWVEE ET S PTQWLDRFGS  D +DLNTRQF ++ FGAND  FGL
Sbjct: 735  IDMTSEWVEENETVSPPTQWLDRFGSALDGNDLNTRQFGSSIFGANDPIFGL 786


>ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, partial [Capsella rubella]
            gi|482569482|gb|EOA33670.1| hypothetical protein
            CARUB_v10019846mg, partial [Capsella rubella]
          Length = 768

 Score =  722 bits (1864), Expect = 0.0
 Identities = 352/530 (66%), Positives = 431/530 (81%), Gaps = 4/530 (0%)
 Frame = -3

Query: 1844 SYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTWT 1665
            SY  M++AI KCG  F   S  +LKT WL+RVKS+I + LKD EKEWV TGCTIIA+ WT
Sbjct: 244  SYHHMLDAIAKCGPAFFAPSPLSLKTEWLDRVKSEISLQLKDSEKEWVTTGCTIIAEAWT 303

Query: 1664 DNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDGA 1485
            DNKSRA+INF VSSPSR FFHKSVD S+YFKN K LADLFDSVIQ+IG E++VQII+D +
Sbjct: 304  DNKSRALINFSVSSPSRIFFHKSVDASSYFKNTKCLADLFDSVIQDIGQEHIVQIIMDNS 363

Query: 1484 LSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAAL 1305
             SY+G++NHI+QNYG+IF+SPCAS C+++ILE+F K+DWVN+CI QAQ I++F+YN+  +
Sbjct: 364  FSYTGISNHILQNYGSIFVSPCASQCLSIILEEFSKVDWVNQCISQAQVISKFVYNNRPV 423

Query: 1304 LEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQSI 1125
            L+ M+K TGGQDII+TG+T+SVSNFLSLQ++ K+K++LK M N  +Y   +   NK QS+
Sbjct: 424  LDLMRKLTGGQDIIRTGVTRSVSNFLSLQSMMKQKARLKHMFNSSEYTTQA---NKPQSM 480

Query: 1124 SCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIMD 945
            SC  IL+DN+FW+A+EE +AI EP L++LREVS+GKPAVG IYELM+KAKESIRTYYIMD
Sbjct: 481  SCVNILEDNDFWRALEESVAISEPILKVLREVSKGKPAVGSIYELMSKAKESIRTYYIMD 540

Query: 944  ESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLPT 765
            E+K K F +IVD KW + LH+PLH+AAAFLNP IQYNPE+KFLT++KEDFF VLEK+LPT
Sbjct: 541  ENKHKVFSNIVDTKWCDHLHSPLHAAAAFLNPSIQYNPEIKFLTSLKEDFFKVLEKLLPT 600

Query: 764  PDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQV 585
             DLR +ITNQI  F RA GMFGCNLA EARD+V PG WWEQ+GDSAPVLQ+VAIRILSQV
Sbjct: 601  SDLRRDITNQIFTFTRAKGMFGCNLAMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQV 660

Query: 584  CSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKFKSKSSEVDPLQLDDID 405
            CS+   E+QW+TFQQ+H+E+RN ID+E LN+L  +N NLKLG+    + E D + L+DID
Sbjct: 661  CSSYNLERQWSTFQQMHWERRNTIDREILNNLAYVNQNLKLGRM--ITLETDSISLEDID 718

Query: 404  MTSEWVEEIETSSPTQWLDRFGS-YDVSDLNTRQFSTAFFGAND---FGL 267
            M SEWVEE E  SP QWLDRFGS  D  DLNTRQF  A F AND   FGL
Sbjct: 719  MMSEWVEEAENPSPAQWLDRFGSALDGGDLNTRQFGGAIFSANDHNIFGL 768


>ref|NP_178092.4| hAT family dimerization domain-containing protein [Arabidopsis
            thaliana] gi|332198172|gb|AEE36293.1| hAT family
            dimerization domain-containing protein [Arabidopsis
            thaliana]
          Length = 651

 Score =  721 bits (1861), Expect = 0.0
 Identities = 352/530 (66%), Positives = 427/530 (80%), Gaps = 4/530 (0%)
 Frame = -3

Query: 1844 SYQLMIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTWT 1665
            SY  M++A+ KCG    G  A + KT WL+RVKS I + LKD EKEWV TGCTIIA+ WT
Sbjct: 130  SYHHMLDAVAKCGP---GFVAPSPKTEWLDRVKSDISLQLKDTEKEWVTTGCTIIAEAWT 186

Query: 1664 DNKSRAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDGA 1485
            DNKSRA+INF VSSPSR FFHKSVD S+YFKN K LADLFDSVIQ+IG E++VQII+D +
Sbjct: 187  DNKSRALINFSVSSPSRIFFHKSVDASSYFKNSKCLADLFDSVIQDIGQEHIVQIIMDNS 246

Query: 1484 LSYSGLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAAL 1305
              Y+G++NH++QNY TIF+SPCAS C+N+ILE+F K+DWVN+CI QAQ I++F+YN++ +
Sbjct: 247  FCYTGISNHLLQNYATIFVSPCASQCLNIILEEFSKVDWVNQCISQAQVISKFVYNNSPV 306

Query: 1304 LEFMKKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQSI 1125
            L+ ++K TGGQDII++G+T+SVSNFLSLQ++ K+K++LK M NC +Y  N+   NK QSI
Sbjct: 307  LDLLRKLTGGQDIIRSGVTRSVSNFLSLQSMMKQKARLKHMFNCPEYTTNT---NKPQSI 363

Query: 1124 SCAAILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIMD 945
            SC  IL+DN+FW+AVEE +AI EP L++LREVS GKPAVG IYELM+KAKESIRTYYIMD
Sbjct: 364  SCVNILEDNDFWRAVEESVAISEPILKVLREVSTGKPAVGSIYELMSKAKESIRTYYIMD 423

Query: 944  ESKCKTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLPT 765
            E+K K F DIVD  W   LH+PLH+AAAFLNP IQYNPE+KFLT++KEDFF VLEK+LPT
Sbjct: 424  ENKHKVFSDIVDTNWCEHLHSPLHAAAAFLNPSIQYNPEIKFLTSLKEDFFKVLEKLLPT 483

Query: 764  PDLRHNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQV 585
             DLR +ITNQI  F RA GMFGCNLA EARD+V PG WWEQ+GDSAPVLQ+VAIRILSQV
Sbjct: 484  SDLRRDITNQIFTFTRAKGMFGCNLAMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQV 543

Query: 584  CSTSTFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKFKSKSSEVDPLQLDDID 405
            CS    E+QW+TFQQ+H+E+RNKID+E LN L  +N NLKLG+    + E DP+ L+DID
Sbjct: 544  CSGYNLERQWSTFQQMHWERRNKIDREILNKLAYVNQNLKLGRM--ITLETDPIALEDID 601

Query: 404  MTSEWVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND---FGL 267
            M SEWVEE E  SP QWLDRFG + D  DLNTRQF  A F AND   FGL
Sbjct: 602  MMSEWVEEAENPSPAQWLDRFGTALDGGDLNTRQFGGAIFSANDHNIFGL 651


>gb|AAF68117.1|AC010793_12 F20B17.17 [Arabidopsis thaliana] gi|12324578|gb|AAG52239.1|AC011717_7
            hypothetical protein; 97951-99813 [Arabidopsis thaliana]
          Length = 518

 Score =  718 bits (1853), Expect = 0.0
 Identities = 350/526 (66%), Positives = 425/526 (80%), Gaps = 4/526 (0%)
 Frame = -3

Query: 1832 MIEAIGKCGARFSGLSAEALKTTWLERVKSKIDVGLKDVEKEWVRTGCTIIADTWTDNKS 1653
            M++A+ KCG    G  A + KT WL+RVKS I + LKD EKEWV TGCTIIA+ WTDNKS
Sbjct: 1    MLDAVAKCGP---GFVAPSPKTEWLDRVKSDISLQLKDTEKEWVTTGCTIIAEAWTDNKS 57

Query: 1652 RAMINFLVSSPSRTFFHKSVDVSTYFKNIKSLADLFDSVIQEIGHENVVQIIVDGALSYS 1473
            RA+INF VSSPSR FFHKSVD S+YFKN K LADLFDSVIQ+IG E++VQII+D +  Y+
Sbjct: 58   RALINFSVSSPSRIFFHKSVDASSYFKNSKCLADLFDSVIQDIGQEHIVQIIMDNSFCYT 117

Query: 1472 GLANHIMQNYGTIFISPCASHCVNLILEDFCKIDWVNRCIVQAQTITRFIYNSAALLEFM 1293
            G++NH++QNY TIF+SPCAS C+N+ILE+F K+DWVN+CI QAQ I++F+YN++ +L+ +
Sbjct: 118  GISNHLLQNYATIFVSPCASQCLNIILEEFSKVDWVNQCISQAQVISKFVYNNSPVLDLL 177

Query: 1292 KKFTGGQDIIKTGITKSVSNFLSLQAIFKRKSKLKLMINCGDYPINSACVNKQQSISCAA 1113
            +K TGGQDII++G+T+SVSNFLSLQ++ K+K++LK M NC +Y  N+   NK QSISC  
Sbjct: 178  RKLTGGQDIIRSGVTRSVSNFLSLQSMMKQKARLKHMFNCPEYTTNT---NKPQSISCVN 234

Query: 1112 ILDDNEFWQAVEECIAICEPFLRILREVSEGKPAVGFIYELMTKAKESIRTYYIMDESKC 933
            IL+DN+FW+AVEE +AI EP L++LREVS GKPAVG IYELM+KAKESIRTYYIMDE+K 
Sbjct: 235  ILEDNDFWRAVEESVAISEPILKVLREVSTGKPAVGSIYELMSKAKESIRTYYIMDENKH 294

Query: 932  KTFLDIVDKKWHNQLHTPLHSAAAFLNPCIQYNPEVKFLTAIKEDFFNVLEKVLPTPDLR 753
            K F DIVD  W   LH+PLH+AAAFLNP IQYNPE+KFLT++KEDFF VLEK+LPT DLR
Sbjct: 295  KVFSDIVDTNWCEHLHSPLHAAAAFLNPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLR 354

Query: 752  HNITNQILLFQRATGMFGCNLAKEARDTVQPGEWWEQYGDSAPVLQKVAIRILSQVCSTS 573
             +ITNQI  F RA GMFGCNLA EARD+V PG WWEQ+GDSAPVLQ+VAIRILSQVCS  
Sbjct: 355  RDITNQIFTFTRAKGMFGCNLAMEARDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGY 414

Query: 572  TFEKQWNTFQQIHFEKRNKIDKEALNDLVSMNYNLKLGKFKSKSSEVDPLQLDDIDMTSE 393
              E+QW+TFQQ+H+E+RNKID+E LN L  +N NLKLG+    + E DP+ L+DIDM SE
Sbjct: 415  NLERQWSTFQQMHWERRNKIDREILNKLAYVNQNLKLGRM--ITLETDPIALEDIDMMSE 472

Query: 392  WVEEIETSSPTQWLDRFG-SYDVSDLNTRQFSTAFFGAND---FGL 267
            WVEE E  SP QWLDRFG + D  DLNTRQF  A F AND   FGL
Sbjct: 473  WVEEAENPSPAQWLDRFGTALDGGDLNTRQFGGAIFSANDHNIFGL 518

