BLASTX nr result

ID: Akebia23_contig00003496 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00003496
         (2250 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22554.3| unnamed protein product [Vitis vinifera]             1079   0.0  
ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265...  1077   0.0  
gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]    1070   0.0  
ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobr...  1058   0.0  
ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobr...  1058   0.0  
ref|XP_002513602.1| protein dimerization, putative [Ricinus comm...  1050   0.0  
ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298...  1045   0.0  
ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215...  1037   0.0  
ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496...  1035   0.0  
ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  1033   0.0  
ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618...  1021   0.0  
ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808...  1016   0.0  
ref|XP_003602175.1| Protein dimerization [Medicago truncatula] g...  1011   0.0  
ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593...   994   0.0  
ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256...   994   0.0  
ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prun...   980   0.0  
ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobr...   952   0.0  
ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, part...   915   0.0  
ref|NP_178092.4| hAT family dimerization domain-containing prote...   907   0.0  
gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indi...   809   0.0  

>emb|CBI22554.3| unnamed protein product [Vitis vinifera]
          Length = 731

 Score = 1079 bits (2791), Expect = 0.0
 Identities = 530/684 (77%), Positives = 597/684 (87%)
 Frame = -3

Query: 2110 SFAIMVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRD 1931
            SF  MVREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRD
Sbjct: 50   SFISMVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRD 109

Query: 1930 DVTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSST 1751
            DVTDRVRAII+            KQR+AEAKSPGN S+ KALMS+    SP++K +P  T
Sbjct: 110  DVTDRVRAIISSKEDGKETSSAKKQRVAEAKSPGNYSAIKALMSVETP-SPIAKIFPPIT 168

Query: 1750 QVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKI 1571
             + PS+S D ENAERSIALFFFENKLDFSVARS SYQLM+EA++KCGHGFRGPS+E LK 
Sbjct: 169  HMGPSSSNDGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEILKT 228

Query: 1570 TWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEA 1391
            TWLER+KSEVS QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+A
Sbjct: 229  TWLERIKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDA 288

Query: 1390 STYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHC 1211
            S+YFK+ K LADLFDSVIQD GP+NVVQ+IMD +LNY  V S+I+Q+Y ++F +PCAS C
Sbjct: 289  SSYFKNTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCASQC 348

Query: 1210 LNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFL 1031
            LNLILEDF KIDWVNRCILQAQ+ISKFIYN+  +L+LMKK TGGQ+L+RT ITKS SNFL
Sbjct: 349  LNLILEDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQDLIRTGITKSVSNFL 408

Query: 1030 SLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLL 851
            SLQSMLKQR RLKHMF S EYSTN +Y+NKPQ+ISCI ILEDNDFW+AVEECVA+SEP L
Sbjct: 409  SLQSMLKQRPRLKHMFGSSEYSTN-SYSNKPQNISCIAILEDNDFWRAVEECVAISEPFL 467

Query: 850  KVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAA 671
            K LRE+SGGKP+VGSIYE MT+AK+SIRTYYIMDE+KCK FLDIVD RW NQLHSPLHAA
Sbjct: 468  KGLREVSGGKPAVGSIYELMTKAKESIRTYYIMDESKCKAFLDIVDGRWRNQLHSPLHAA 527

Query: 670  AAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLA 491
            AAFLNPSIQYNPE+KF+G IK++F  VLEKLLPT ++R DIT QI LF +A GMFGCNLA
Sbjct: 528  AAFLNPSIQYNPEIKFIGAIKEDFFKVLEKLLPTSDMRRDITNQILLFTRATGMFGCNLA 587

Query: 490  REARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDK 311
            REAR+T+ PGLWWEQ+GDSAP LQRVA+RILSQVCS S FER+W+TFQQIHSEKRN++DK
Sbjct: 588  REARDTVPPGLWWEQFGDSAPVLQRVAIRILSQVCSTSTFERHWNTFQQIHSEKRNKIDK 647

Query: 310  ETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSAL 131
            ETL DL+YINYNLKLA + K K  E DP+  DDIDMTS+WVEETENPSPTQWLDRFGSAL
Sbjct: 648  ETLNDLVYINYNLKLARQMKMKSSEADPLQFDDIDMTSEWVEETENPSPTQWLDRFGSAL 707

Query: 130  DGSDLNTRQFTNAMFGTNDHIFGL 59
            DGSDLNTRQF  A+FG++D IFGL
Sbjct: 708  DGSDLNTRQFNAAIFGSSDTIFGL 731


>ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265581 [Vitis vinifera]
          Length = 723

 Score = 1077 bits (2784), Expect = 0.0
 Identities = 528/683 (77%), Positives = 596/683 (87%)
 Frame = -3

Query: 2107 FAIMVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDD 1928
            F  +VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDD
Sbjct: 43   FLAVVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDD 102

Query: 1927 VTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQ 1748
            VTDRVRAII+            KQR+AEAKSPGN S+ KALMS+    SP++K +P  T 
Sbjct: 103  VTDRVRAIISSKEDGKETSSAKKQRVAEAKSPGNYSAIKALMSVETP-SPIAKIFPPITH 161

Query: 1747 VPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKIT 1568
            + PS+S D ENAERSIALFFFENKLDFSVARS SYQLM+EA++KCGHGFRGPS+E LK T
Sbjct: 162  MGPSSSNDGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEILKTT 221

Query: 1567 WLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEAS 1388
            WLER+KSEVS QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS
Sbjct: 222  WLERIKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDAS 281

Query: 1387 TYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCL 1208
            +YFK+ K LADLFDSVIQD GP+NVVQ+IMD +LNY  V S+I+Q+Y ++F +PCAS CL
Sbjct: 282  SYFKNTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCASQCL 341

Query: 1207 NLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLS 1028
            NLILEDF KIDWVNRCILQAQ+ISKFIYN+  +L+LMKK TGGQ+L+RT ITKS SNFLS
Sbjct: 342  NLILEDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQDLIRTGITKSVSNFLS 401

Query: 1027 LQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLK 848
            LQSMLKQR RLKHMF S EYSTN +Y+NKPQ+ISCI ILEDNDFW+AVEECVA+SEP LK
Sbjct: 402  LQSMLKQRPRLKHMFGSSEYSTN-SYSNKPQNISCIAILEDNDFWRAVEECVAISEPFLK 460

Query: 847  VLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAA 668
             LRE+SGGKP+VGSIYE MT+AK+SIRTYYIMDE+KCK FLDIVD RW NQLHSPLHAAA
Sbjct: 461  GLREVSGGKPAVGSIYELMTKAKESIRTYYIMDESKCKAFLDIVDGRWRNQLHSPLHAAA 520

Query: 667  AFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAR 488
            AFLNPSIQYNPE+KF+G IK++F  VLEKLLPT ++R DIT QI LF +A GMFGCNLAR
Sbjct: 521  AFLNPSIQYNPEIKFIGAIKEDFFKVLEKLLPTSDMRRDITNQILLFTRATGMFGCNLAR 580

Query: 487  EARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKE 308
            EAR+T+ PGLWWEQ+GDSAP LQRVA+RILSQVCS S FER+W+TFQQIHSEKRN++DKE
Sbjct: 581  EARDTVPPGLWWEQFGDSAPVLQRVAIRILSQVCSTSTFERHWNTFQQIHSEKRNKIDKE 640

Query: 307  TLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALD 128
            TL DL+YINYNLKLA + K K  E DP+  DDIDMTS+WVEETENPSPTQWLDRFGSALD
Sbjct: 641  TLNDLVYINYNLKLARQMKMKSSEADPLQFDDIDMTSEWVEETENPSPTQWLDRFGSALD 700

Query: 127  GSDLNTRQFTNAMFGTNDHIFGL 59
            GSDLNTRQF  A+FG++D IFGL
Sbjct: 701  GSDLNTRQFNAAIFGSSDTIFGL 723


>gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]
          Length = 694

 Score = 1070 bits (2767), Expect = 0.0
 Identities = 518/680 (76%), Positives = 592/680 (87%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            +VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD
Sbjct: 16   VVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 75

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVRAIIA            KQ+L E KSPGN+S+SKAL+S T  TSP++K +P+ T V P
Sbjct: 76   RVRAIIASKEDVKETSSTKKQKLVEVKSPGNVSASKALVS-TDTTSPVAKVFPAVTPVAP 134

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
             +   +ENAERSIALFFFENKLDF +ARS SYQLM++AIAKCG GF GPS+E LK TWLE
Sbjct: 135  PSLNSQENAERSIALFFFENKLDFGIARSSSYQLMVDAIAKCGPGFTGPSAETLKTTWLE 194

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSE+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS YF
Sbjct: 195  RIKSEMSLQSKDIEKEWMTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASAYF 254

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ KCLADLFDSVIQDFGP+NVVQVIMD S NY  V ++I+Q+Y++IF +PC S CLNLI
Sbjct: 255  KNMKCLADLFDSVIQDFGPDNVVQVIMDSSFNYTGVANHILQNYSTIFVSPCVSQCLNLI 314

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LE+FSK+DWVNRCILQ Q+ISKFIYN   +L+LMKK+TGGQEL+RT ITKS S+FLSLQS
Sbjct: 315  LEEFSKVDWVNRCILQGQTISKFIYNSASMLDLMKKYTGGQELIRTGITKSVSSFLSLQS 374

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            +LKQ+SRLKHMFNSPEY TN  Y NKPQSISCI I+ED+DFW+AVEE VA+SEP LKVLR
Sbjct: 375  ILKQKSRLKHMFNSPEYCTNSLYVNKPQSISCISIVEDSDFWRAVEESVAISEPFLKVLR 434

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E++GGKP+VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVDR+W +QLHSPLH+AAAFL
Sbjct: 435  EVAGGKPAVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHSAAAFL 494

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NPSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA  MFGC+LA EAR
Sbjct: 495  NPSIQYNPEIKFLSSIKEDFFKVLEKLLPLPEMRRDITSQIFTFTKAMSMFGCSLAMEAR 554

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            + +SPGLWWEQYGDSAP LQRVA+RILSQVCS+  FER+WS FQQIHSEKRN++D+ETL 
Sbjct: 555  DVVSPGLWWEQYGDSAPVLQRVAIRILSQVCSSFTFERHWSAFQQIHSEKRNKIDRETLN 614

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKLA  T+ K IE DPI  DDIDMTS+WVEE++N SP+QWLDRFGSALDGSD
Sbjct: 615  DLVYINYNLKLARHTRTKSIEADPIQFDDIDMTSEWVEESDNSSPSQWLDRFGSALDGSD 674

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQ+  A+FG+NDHIFGL
Sbjct: 675  LNTRQYNAAIFGSNDHIFGL 694


>ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobroma cacao]
            gi|508776178|gb|EOY23434.1| HAT transposon superfamily
            isoform 4 [Theobroma cacao]
          Length = 682

 Score = 1058 bits (2735), Expect = 0.0
 Identities = 509/683 (74%), Positives = 590/683 (86%)
 Frame = -3

Query: 2107 FAIMVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDD 1928
            F  +VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDD
Sbjct: 2    FMAVVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDD 61

Query: 1927 VTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQ 1748
            VTDRVRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K +P+++ 
Sbjct: 62   VTDRVRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKVFPATSP 120

Query: 1747 VPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKIT 1568
            + P +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS E LK  
Sbjct: 121  IAPPSLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTM 180

Query: 1567 WLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEAS 1388
            WLER+KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS
Sbjct: 181  WLERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDAS 240

Query: 1387 TYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCL 1208
            +YFK+ KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CL
Sbjct: 241  SYFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCL 300

Query: 1207 NLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLS 1028
            NLILE+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS S+FLS
Sbjct: 301  NLILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLS 360

Query: 1027 LQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLK 848
            LQSMLKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+SEP LK
Sbjct: 361  LQSMLKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLK 420

Query: 847  VLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAA 668
            VLRE+SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLH+A 
Sbjct: 421  VLREVSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAG 480

Query: 667  AFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAR 488
            AFLNPSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF CNLA 
Sbjct: 481  AFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAM 540

Query: 487  EARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKE 308
            EAR+T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKRN++DKE
Sbjct: 541  EARDTVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKE 600

Query: 307  TLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALD 128
             L DL+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDRFGSALD
Sbjct: 601  ILNDLVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDRFGSALD 660

Query: 127  GSDLNTRQFTNAMFGTNDHIFGL 59
            G DLNTRQF  A+FG NDHIFGL
Sbjct: 661  GGDLNTRQFNAAIFG-NDHIFGL 682


>ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobroma cacao]
            gi|590673575|ref|XP_007038932.1| HAT transposon
            superfamily isoform 2 [Theobroma cacao]
            gi|508776176|gb|EOY23432.1| HAT transposon superfamily
            isoform 2 [Theobroma cacao] gi|508776177|gb|EOY23433.1|
            HAT transposon superfamily isoform 2 [Theobroma cacao]
          Length = 678

 Score = 1058 bits (2735), Expect = 0.0
 Identities = 509/680 (74%), Positives = 589/680 (86%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD
Sbjct: 1    MVREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K +P+++ + P
Sbjct: 61   RVRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKVFPATSPIAP 119

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
             +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS E LK  WLE
Sbjct: 120  PSLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLE 179

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS+YF
Sbjct: 180  RIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYF 239

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CLNLI
Sbjct: 240  KNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLI 299

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LE+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS S+FLSLQS
Sbjct: 300  LEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQS 359

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            MLKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+SEP LKVLR
Sbjct: 360  MLKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLR 419

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLH+A AFL
Sbjct: 420  EVSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFL 479

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NPSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF CNLA EAR
Sbjct: 480  NPSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEAR 539

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            +T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKRN++DKE L 
Sbjct: 540  DTVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILN 599

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDRFGSALDG D
Sbjct: 600  DLVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDRFGSALDGGD 659

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQF  A+FG NDHIFGL
Sbjct: 660  LNTRQFNAAIFG-NDHIFGL 678


>ref|XP_002513602.1| protein dimerization, putative [Ricinus communis]
            gi|223547510|gb|EEF49005.1| protein dimerization,
            putative [Ricinus communis]
          Length = 688

 Score = 1050 bits (2716), Expect = 0.0
 Identities = 506/680 (74%), Positives = 590/680 (86%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            +VREKDVCWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD
Sbjct: 10   VVREKDVCWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 69

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVRAIIA            KQR AEAKSP ++ ++KAL+++    +P +K YP+ T + P
Sbjct: 70   RVRAIIASKEDIKEPSSAKKQRPAEAKSPAHIYATKALVNVE-SVAPAAKVYPTVTSISP 128

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
             +  ++ENAERSIALFFFENKLDFSVARSPSYQLM+EAI KCG GF GPS+E LK TWLE
Sbjct: 129  PSLSNQENAERSIALFFFENKLDFSVARSPSYQLMIEAIEKCGPGFTGPSAEILKTTWLE 188

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEVS Q K+ EKEW  TGCTIIADTWTDNKSRALINF VSSPS TFFHKSV+AS+YF
Sbjct: 189  RIKSEVSLQLKDTEKEWTTTGCTIIADTWTDNKSRALINFFVSSPSRTFFHKSVDASSYF 248

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ KCLADLFDSVIQDFG ENVVQ+IMD S NY  V ++I+Q+Y +IF +PCAS CLNLI
Sbjct: 249  KNTKCLADLFDSVIQDFGAENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQCLNLI 308

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LEDFSK+DWVNRCI QAQ++SKFIYN++ +L+LMKKFTGGQEL++T ITKS S+FLSLQS
Sbjct: 309  LEDFSKVDWVNRCISQAQTLSKFIYNNSSMLDLMKKFTGGQELIKTGITKSVSSFLSLQS 368

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            MLKQR RLK MF+S EYS N +Y++KPQSI+CI I+ED DFW+AVEECVA++EP LKVLR
Sbjct: 369  MLKQRPRLKLMFSSNEYSANSSYSSKPQSIACITIVEDGDFWRAVEECVAITEPFLKVLR 428

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+SGGKP+VGSIYE MTRAK+SIRTYYIMDE+KCKTFLDIVDR+W +QLHSPLH+AAAFL
Sbjct: 429  EVSGGKPAVGSIYELMTRAKESIRTYYIMDESKCKTFLDIVDRKWRDQLHSPLHSAAAFL 488

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NP +QYNPE+KFL NIK++F  V+EKLLPTP++R DIT QIF+F +A GMFGCNLA EAR
Sbjct: 489  NPCVQYNPEIKFLVNIKEDFFKVIEKLLPTPDMRRDITNQIFIFTRASGMFGCNLAMEAR 548

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            +T++PGLWWEQYGDSAP LQRVA+RILSQVCS   FER+W+TF+QIHSEKRN++DKETL 
Sbjct: 549  DTVAPGLWWEQYGDSAPVLQRVAIRILSQVCSTFTFERHWNTFRQIHSEKRNKIDKETLN 608

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKL  + + K  ETDPI  DDIDMTS+WVEET+NPSPTQWLDRFGSALDGSD
Sbjct: 609  DLVYINYNLKLMRQMRTKSSETDPIQFDDIDMTSEWVEETDNPSPTQWLDRFGSALDGSD 668

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQF  A+FG +D +FGL
Sbjct: 669  LNTRQFNAAIFGASDPLFGL 688


>ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298657 [Fragaria vesca
            subsp. vesca]
          Length = 681

 Score = 1045 bits (2703), Expect = 0.0
 Identities = 506/682 (74%), Positives = 587/682 (86%), Gaps = 2/682 (0%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKD CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD
Sbjct: 1    MVREKDTCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQR-LAEAKSPG-NLSSSKALMSMTMETSPMSKTYPSSTQV 1745
            +VR IIA            K++   E KSP  N+S  KALMSM    SP+ K YP+ T +
Sbjct: 61   KVRTIIASKEEVKETSSSSKKKKFVEVKSPPVNVSPVKALMSMETP-SPIQKVYPNVTPM 119

Query: 1744 PPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITW 1565
             P +  ++ENAERSIALFFFENK+DFS+AR+ SYQLM++AI KCG GF GPS+E LK TW
Sbjct: 120  APLSMNNQENAERSIALFFFENKIDFSIARTSSYQLMIDAITKCGPGFTGPSAETLKTTW 179

Query: 1564 LERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEAST 1385
            LERVK+E+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS 
Sbjct: 180  LERVKTEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASA 239

Query: 1384 YFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLN 1205
            YFK+ KCLA+LFDSVIQDFGPENVVQ+IMD S NY  V ++I+ +Y +IF +PCAS CLN
Sbjct: 240  YFKNTKCLAELFDSVIQDFGPENVVQIIMDSSFNYTGVANHILTNYTTIFVSPCASQCLN 299

Query: 1204 LILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSL 1025
            LILE+FSK+DWVNRC LQAQ+ISKFIYN+  +L+LMK+FTGGQ+L+RT ITKS S+FLSL
Sbjct: 300  LILEEFSKVDWVNRCFLQAQTISKFIYNNASMLDLMKRFTGGQDLIRTGITKSVSSFLSL 359

Query: 1024 QSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKV 845
            Q++LKQRSRLKHMFNSPE+ TN +YANK QSISCI I+EDNDFW+A EE VA+SEP LKV
Sbjct: 360  QTILKQRSRLKHMFNSPEFCTNSSYANKTQSISCISIMEDNDFWRAAEESVAISEPFLKV 419

Query: 844  LREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAA 665
            LRE+SGGKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVDR+W +QLHSPLHAAAA
Sbjct: 420  LREVSGGKPAVGSIYELMTRAKESIRTYYIMDENKCKVFLDIVDRKWRDQLHSPLHAAAA 479

Query: 664  FLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLARE 485
            FLNPSIQYNPE+KFL +IK++F  VLEKLLP+PE+R DIT QIF F KA GMFGC+LA E
Sbjct: 480  FLNPSIQYNPEIKFLTSIKEDFFKVLEKLLPSPEMRRDITNQIFTFTKATGMFGCSLAME 539

Query: 484  ARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKET 305
            AR+ +SPGLWWEQYGDSAP LQRVA+RILSQVCS   FE++WS FQQIHSEKRN++D+ET
Sbjct: 540  ARDVVSPGLWWEQYGDSAPVLQRVAIRILSQVCSTFTFEKHWSAFQQIHSEKRNKIDRET 599

Query: 304  LGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDG 125
            L DL+YINYNL+L+ +T+ K +E DPIL DDIDMTS+WVEE+++PSPTQWLDRFGSALDG
Sbjct: 600  LNDLVYINYNLRLSKQTRNKNVEADPILFDDIDMTSEWVEESDSPSPTQWLDRFGSALDG 659

Query: 124  SDLNTRQFTNAMFGTNDHIFGL 59
            SDLNTRQF  A+FG+NDHIFGL
Sbjct: 660  SDLNTRQFNAAIFGSNDHIFGL 681


>ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215128, partial [Cucumis
            sativus]
          Length = 685

 Score = 1037 bits (2682), Expect = 0.0
 Identities = 502/682 (73%), Positives = 589/682 (86%), Gaps = 2/682 (0%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            +VREKD+CWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPS+GV+PCSKVRDDV+D
Sbjct: 5    VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSD 64

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSM-TMET-SPMSKTYPSSTQV 1745
            RVRAI+A            KQ+LAE K+  ++ S     S+ ++ET SP++K +P+ T +
Sbjct: 65   RVRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTPM 124

Query: 1744 PPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITW 1565
             P + ++ ENAE+SIALFFFENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TW
Sbjct: 125  APPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKTTW 184

Query: 1564 LERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEAST 1385
            LER+K+EVS QSK+IEKEW  TGCTII DTWTDNKSRALINFLVSSPS TFFHKSV+AST
Sbjct: 185  LERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSVDAST 244

Query: 1384 YFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLN 1205
            YFK+ KCL DLFDSVIQDFG ENVVQ+IMD SLNY    ++I+Q+Y +IF +PCAS CLN
Sbjct: 245  YFKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQCLN 304

Query: 1204 LILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSL 1025
             ILE+FSK+DWVNRCILQAQ+ISKF+YN + +L+LM++FTGGQEL+RT I+K  S+FLSL
Sbjct: 305  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSL 364

Query: 1024 QSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKV 845
            QS+LKQRSRLKHMFNSP+Y+TN +YANKPQSISCI I+EDNDFW+AVEECVA+SEP L+V
Sbjct: 365  QSILKQRSRLKHMFNSPDYTTN-SYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRV 423

Query: 844  LREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAA 665
            LRE+ GGKP+VG IYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLHAAAA
Sbjct: 424  LREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAA 483

Query: 664  FLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLARE 485
            FLNPSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA E
Sbjct: 484  FLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAME 543

Query: 484  ARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKET 305
            AR+T+SP LWWEQ+GDSAP LQRVA+RILSQVCS  +FER+WS FQQIHSEKRN++DKET
Sbjct: 544  ARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET 603

Query: 304  LGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDG 125
            L DL+YINYNLKLA + + KP+E+DPI  DDIDMTS+WVEE+EN SPTQWLDRFGS+LDG
Sbjct: 604  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG 663

Query: 124  SDLNTRQFTNAMFGTNDHIFGL 59
            SDLNTRQF  AMFG NDHIF L
Sbjct: 664  SDLNTRQFNAAMFGANDHIFNL 685


>ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496447 isoform X1 [Cicer
            arietinum] gi|502136218|ref|XP_004502604.1| PREDICTED:
            uncharacterized protein LOC101496447 isoform X2 [Cicer
            arietinum]
          Length = 679

 Score = 1035 bits (2676), Expect = 0.0
 Identities = 495/680 (72%), Positives = 582/680 (85%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVTD
Sbjct: 1    MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVR IIA            KQ++AE KSPG+LS++KALMS+   TSP  K +P+S  + P
Sbjct: 61   RVRNIIASKDEIKETTSVKKQKVAEVKSPGSLSATKALMSLET-TSPTGKIFPTSNPLTP 119

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
            S++ ++ENAERSIALFFFENKLDFSVARS SYQLM++AI KCG GF GPS+E LK TWLE
Sbjct: 120  SSTNNQENAERSIALFFFENKLDFSVARSSSYQLMIDAIGKCGPGFTGPSAEILKTTWLE 179

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEV  QSK++EKEW  TGCTIIADTWTD KS+A+INFLVSSPS TFFHKSV+AS YF
Sbjct: 180  RIKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRTFFHKSVDASAYF 239

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ K LADLFDSVIQ+FGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CLNLI
Sbjct: 240  KNTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIANHIVQNYGTIFVSPCASQCLNLI 299

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LE+F+K+DW++RCILQAQ+ISK IYN+  +L+LMKK++GGQEL+RT +TKS S FLSLQS
Sbjct: 300  LEEFTKVDWISRCILQAQTISKLIYNNASLLDLMKKYSGGQELIRTGVTKSVSTFLSLQS 359

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            MLK R+RLKHMF+SPEY++N +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVLR
Sbjct: 360  MLKLRTRLKHMFHSPEYASNTSYANKPQSLSCIAIAEDGDFWRTVEECVAISEPFLKVLR 419

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+S GKP VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVD++W +QLHSPLHAAAAFL
Sbjct: 420  EVSEGKPIVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDKKWRDQLHSPLHAAAAFL 479

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NPSIQYNPE+KFL +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LAREAR
Sbjct: 480  NPSIQYNPEIKFLSSIKEDFFNVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAREAR 539

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            NT++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F+R WSTF+QIHSEK+N++D+ETL 
Sbjct: 540  NTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFQRQWSTFRQIHSEKKNKIDRETLN 599

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKL  +   K +E D +  DDIDMTS+WVEE E  SPTQWLDRFG ALDG+D
Sbjct: 600  DLVYINYNLKLTKQVNAKSLEVDLLQSDDIDMTSEWVEENETASPTQWLDRFGPALDGND 659

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQF +++FG ND IFGL
Sbjct: 660  LNTRQFGSSIFGANDPIFGL 679


>ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101215128 [Cucumis
            sativus]
          Length = 784

 Score = 1033 bits (2670), Expect = 0.0
 Identities = 500/682 (73%), Positives = 587/682 (86%), Gaps = 2/682 (0%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            +VREKD+CWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPS+GV+PCSKVRDDV+D
Sbjct: 104  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSD 163

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSM-TMET-SPMSKTYPSSTQV 1745
            RVRAI+A            KQ+LAE K+  ++ S     S+ ++ET SP++K +P+ T +
Sbjct: 164  RVRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTPM 223

Query: 1744 PPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITW 1565
             P + ++ ENAE+SIALF FENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TW
Sbjct: 224  APPSLHNHENAEKSIALFXFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKTTW 283

Query: 1564 LERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEAST 1385
            LER+K+EVS QSK+IEKEW  TGCTII DTWTDNKSRALINF VSSPS TFFHKSV+AST
Sbjct: 284  LERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFXVSSPSRTFFHKSVDAST 343

Query: 1384 YFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLN 1205
            YFK+ KCL DLFDSVIQDFG ENVVQ+IMD SLNY    ++I+Q+Y +IF +PCAS CLN
Sbjct: 344  YFKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQCLN 403

Query: 1204 LILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSL 1025
             ILE+FSK+DWVNRCILQAQ+ISKF+YN + +L+LM++FTGGQEL+RT I+K  S+FLSL
Sbjct: 404  SILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSL 463

Query: 1024 QSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKV 845
            QS+LKQRSRLKHMFNSP+Y+TN +YANKPQSISCI I+EDNDFW+AVEECVA+SEP L+V
Sbjct: 464  QSILKQRSRLKHMFNSPDYTTN-SYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRV 522

Query: 844  LREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAA 665
            LRE+ GGKP+VG IYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLHAAAA
Sbjct: 523  LREVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAA 582

Query: 664  FLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLARE 485
            FLNPSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA E
Sbjct: 583  FLNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAME 642

Query: 484  ARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKET 305
            AR+T+SP LWWEQ+GDSAP LQRVA+RILSQVCS  +FER+WS FQQIHSEKRN++DKET
Sbjct: 643  ARDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKET 702

Query: 304  LGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDG 125
            L DL+YINYNLKLA + + KP+E+DPI  DDIDMTS+WVEE+EN SPTQWLDRFGS+LDG
Sbjct: 703  LNDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG 762

Query: 124  SDLNTRQFTNAMFGTNDHIFGL 59
            SDLNTRQF  AMFG NDHIF L
Sbjct: 763  SDLNTRQFNAAMFGANDHIFNL 784


>ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis]
          Length = 764

 Score = 1021 bits (2640), Expect = 0.0
 Identities = 501/694 (72%), Positives = 586/694 (84%)
 Frame = -3

Query: 2140 LVKHSRREYNSFAIMVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSK 1961
            ++K   + +   + +VREKD+CWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSK
Sbjct: 75   VIKGGYKTFFKSSAVVREKDICWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSK 134

Query: 1960 GVHPCSKVRDDVTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETS 1781
            GV+PCSKVRDDVTDRVRAIIA            KQR+AEAK  G + SSK+LM +    S
Sbjct: 135  GVNPCSKVRDDVTDRVRAIIASKEDVKETPIGKKQRVAEAKPVGIVCSSKSLMPLETP-S 193

Query: 1780 PMSKTYPSSTQVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGF 1601
            P++K + + T +  S+  ++ENAERSIALFFFENKLDF+VARS SYQ M++A+ KCG GF
Sbjct: 194  PVTKVFATMTPMGNSSLNNQENAERSIALFFFENKLDFAVARSSSYQQMIDAVGKCGPGF 253

Query: 1600 RGPSSEALKITWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPS 1421
             GPS+EALK  WL+R+KSEV+ QSK+IEKEW +TGCTIIADTWTDNKS+ALINFLVSSPS
Sbjct: 254  TGPSAEALKTMWLDRIKSEVNVQSKDIEKEWAMTGCTIIADTWTDNKSKALINFLVSSPS 313

Query: 1420 GTFFHKSVEASTYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNS 1241
             TFF KSV+ S+ FK+ K LAD+FDSVIQD GPENVVQ+IMD S NY  V ++I+Q+Y +
Sbjct: 314  RTFFLKSVDTSSNFKNTKYLADIFDSVIQDIGPENVVQIIMDSSFNYTGVANHILQNYGT 373

Query: 1240 IFFTPCASHCLNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRT 1061
            IF +PCAS  LN+ILE+FSK+DWVNRCILQAQ+ISKFIYN+  +L+LMKKFTGG EL+RT
Sbjct: 374  IFVSPCASQSLNIILEEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGGLELIRT 433

Query: 1060 SITKSTSNFLSLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVE 881
             ITK  SNFLSLQS+LKQRSRLKHMFNSPEYST+  YANKPQS+SCI I+EDNDFW+AVE
Sbjct: 434  GITKYVSNFLSLQSILKQRSRLKHMFNSPEYSTSSPYANKPQSLSCISIVEDNDFWRAVE 493

Query: 880  ECVAVSEPLLKVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWE 701
            E VA+SEP LKVLRE+SGGKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVDR W 
Sbjct: 494  ESVAISEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIMDENKCKIFLDIVDRNWR 553

Query: 700  NQLHSPLHAAAAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKK 521
             QLHSPLH+AAAFLNPSIQYNPE+KFLG+IK++F  VLEKLLPTP+ R DIT QI  F +
Sbjct: 554  GQLHSPLHSAAAFLNPSIQYNPEIKFLGSIKEDFFNVLEKLLPTPDTRRDITTQILTFSR 613

Query: 520  AQGMFGCNLAREARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQI 341
            A GMFGC LA EAR T+ PGLWWEQYGDSAP LQRVA+RILSQVCS+ +FER+WSTFQQI
Sbjct: 614  ASGMFGCKLAMEARETVPPGLWWEQYGDSAPVLQRVAIRILSQVCSSFSFERHWSTFQQI 673

Query: 340  HSEKRNRLDKETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPT 161
            HSEKRN++DKETL DL+YI+YNLKLA   + K +E DP+  DDIDMTS+WVEE+E+ SP 
Sbjct: 674  HSEKRNKIDKETLNDLVYISYNLKLA---RTKSVEADPLQFDDIDMTSEWVEESEHHSPH 730

Query: 160  QWLDRFGSALDGSDLNTRQFTNAMFGTNDHIFGL 59
            QWLDRFGSALDGSDLNTRQF+ +MF +ND IFGL
Sbjct: 731  QWLDRFGSALDGSDLNTRQFSASMFSSNDPIFGL 764


>ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808813 isoform X1 [Glycine
            max] gi|571460166|ref|XP_006581619.1| PREDICTED:
            uncharacterized protein LOC100808813 isoform X2 [Glycine
            max]
          Length = 679

 Score = 1016 bits (2628), Expect = 0.0
 Identities = 490/680 (72%), Positives = 576/680 (84%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVTD
Sbjct: 1    MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVR IIA            KQ++AE KSP NLS+SKAL+S+    SP+ K +P+   + P
Sbjct: 61   RVRGIIASKEEVKETSSAKKQKIAEVKSPSNLSASKALVSLDA-ASPVMKIFPTGHPMTP 119

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
            S++ ++E AERSIALFFFENKLDFSVARS SYQLM++AIAKCG GF GPS+E LK  WLE
Sbjct: 120  SSTNNQEIAERSIALFFFENKLDFSVARSSSYQLMIDAIAKCGPGFTGPSAETLKTIWLE 179

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEV  Q+K++EKEW  TGCTI+ADTWTD KS+A+INFLVSSPS TFFHKSV+AS YF
Sbjct: 180  RMKSEVGLQTKDVEKEWATTGCTILADTWTDYKSKAIINFLVSSPSRTFFHKSVDASAYF 239

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ K LADLFDSVIQ+FGPENVVQ+IMD S+NY  + ++I+QSY +IF +PCAS CLNLI
Sbjct: 240  KNTKWLADLFDSVIQEFGPENVVQIIMDSSVNYTVIANHIVQSYGTIFVSPCASQCLNLI 299

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LE+FSK+DW++RCILQAQ+ISK IYN+  +L+L KK+TGGQEL+RT ITKS S FLSLQS
Sbjct: 300  LEEFSKVDWISRCILQAQTISKLIYNNASLLDLTKKYTGGQELIRTGITKSVSTFLSLQS 359

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            MLK R+RLK+MF+S EY++N +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVLR
Sbjct: 360  MLKLRTRLKNMFHSHEYASNTSYANKPQSLSCITIAEDGDFWRTVEECVAISEPFLKVLR 419

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            EIS GKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVD++W +QLHSPLHAAAAFL
Sbjct: 420  EISEGKPTVGSIYELMTRAKESIRTYYIMDENKCKKFLDIVDKKWRDQLHSPLHAAAAFL 479

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NPSIQYNPE+KF+ +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LA+EAR
Sbjct: 480  NPSIQYNPEIKFISSIKEDFFNVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAKEAR 539

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            NT++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F R WST +QIHSEKRN++D+ETL 
Sbjct: 540  NTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFHRQWSTIRQIHSEKRNKIDRETLN 599

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKLA +   K  E D +  DDIDMTS+WVEE E  SPTQWLDRFG ALDG+D
Sbjct: 600  DLVYINYNLKLARQMSAKSSEVDLLQFDDIDMTSEWVEENETASPTQWLDRFGPALDGND 659

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQF +++FG ND IFGL
Sbjct: 660  LNTRQFGSSIFGANDPIFGL 679


>ref|XP_003602175.1| Protein dimerization [Medicago truncatula]
            gi|355491223|gb|AES72426.1| Protein dimerization
            [Medicago truncatula]
          Length = 786

 Score = 1011 bits (2614), Expect = 0.0
 Identities = 487/682 (71%), Positives = 579/682 (84%), Gaps = 1/682 (0%)
 Frame = -3

Query: 2101 IMVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVT 1922
            +MVREKDVCWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVT
Sbjct: 106  LMVREKDVCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVT 165

Query: 1921 DRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVP 1742
            DRVR IIA            KQ+++E  SPG+ S++KAL+S+   T P+ K +PSS  + 
Sbjct: 166  DRVRNIIASKEEVKETSSVKKQKVSEVISPGSHSATKALISLDT-TLPIGKMFPSSNPMT 224

Query: 1741 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1562
            PS++ ++ENAERSIALFFFENKLDFSVARS SYQLM++AI KCG GF GPS+E LK  WL
Sbjct: 225  PSSTNNQENAERSIALFFFENKLDFSVARSSSYQLMIDAITKCGPGFTGPSAEILKTIWL 284

Query: 1561 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1382
            ER+KSEV  QSK++EKEW  TGCTIIADTWTD KS+A+INFLVSSPS  FFHKSV+AS Y
Sbjct: 285  ERIKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRIFFHKSVDASAY 344

Query: 1381 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1202
            FK+ K LADLFDSVIQ+FGPENVVQ+IMD S NY  +G++I+Q+Y +IF +PCAS CLNL
Sbjct: 345  FKNTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIGNHIVQNYGTIFVSPCASQCLNL 404

Query: 1201 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1022
            ILE+F+KIDW++RCILQAQ+ISK IYN+  +L+LMK ++GGQEL+RT  TKS S FLSLQ
Sbjct: 405  ILEEFTKIDWISRCILQAQTISKLIYNNASLLDLMKSYSGGQELIRTGATKSVSTFLSLQ 464

Query: 1021 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 842
            +MLK R+RLKHMF+SPEY+ + +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVL
Sbjct: 465  TMLKLRTRLKHMFHSPEYALDTSYANKPQSLSCIAIAEDGDFWRTVEECVAISEPFLKVL 524

Query: 841  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 662
            RE+S GKP+VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVD++W +QLHSPLHAAAAF
Sbjct: 525  REVSEGKPTVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDKKWRDQLHSPLHAAAAF 584

Query: 661  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 482
            LNPSIQYNPE+KFL +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LA+EA
Sbjct: 585  LNPSIQYNPEIKFLSSIKEDFYHVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAKEA 644

Query: 481  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 302
            RNT++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F+R WSTF+QIHSEK+N++D+ETL
Sbjct: 645  RNTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFQRQWSTFRQIHSEKKNKIDRETL 704

Query: 301  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPS-PTQWLDRFGSALDG 125
             DL+YINYNLKL  +   K +E D +  DDIDMTS+WVEE E  S PTQWLDRFGSALDG
Sbjct: 705  NDLVYINYNLKLNRQMSAKSLEVDLLQFDDIDMTSEWVEENETVSPPTQWLDRFGSALDG 764

Query: 124  SDLNTRQFTNAMFGTNDHIFGL 59
            +DLNTRQF +++FG ND IFGL
Sbjct: 765  NDLNTRQFGSSIFGANDPIFGL 786


>ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593027 isoform X1 [Solanum
            tuberosum] gi|565367925|ref|XP_006350605.1| PREDICTED:
            uncharacterized protein LOC102593027 isoform X2 [Solanum
            tuberosum]
          Length = 675

 Score =  994 bits (2570), Expect = 0.0
 Identities = 479/680 (70%), Positives = 572/680 (84%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKDVCWEY EKL+GNKVRCKFCLR+LNGGISRLKHHLSRLPSKGV+PC+KVRDDVTD
Sbjct: 1    MVREKDVCWEYAEKLDGNKVRCKFCLRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVR II             K +L E K+  N+S  K L+S+   T P+++ +P   Q   
Sbjct: 61   RVRDIIG----SKEPPSTKKHKLIETKALANISPEKLLLSVEPIT-PIARIFPPIGQAIS 115

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
            S+  ++ENAERSIALFFFENK+DF VARS SY  M+EA+ KCG GF GPS E LK TWLE
Sbjct: 116  SSGNNQENAERSIALFFFENKIDFGVARSSSYHQMIEAVGKCGSGFIGPSPETLKATWLE 175

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEVS QSK++EKEW +TGCT+IA+TWTDNK +ALINFLVSSPS TFF+KSV+AS+YF
Sbjct: 176  RIKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALINFLVSSPSRTFFYKSVDASSYF 235

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ KCL++LFDS+IQDFGPENVVQVI+D +L+   + ++I+Q+Y ++F +PCAS C+N I
Sbjct: 236  KNLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVNHILQNYGNVFVSPCASQCINAI 295

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            L++FSK+DWVNRCILQAQSISKFIYN++ +L+LMKKFTGGQE+++T ITKS SNFLSLQ 
Sbjct: 296  LDEFSKLDWVNRCILQAQSISKFIYNNSPLLDLMKKFTGGQEIIKTGITKSVSNFLSLQC 355

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            +LK RSRLK +FNSPE + N AY NK QS++CI IL+DNDFW+  EECVAVSEP LKV+R
Sbjct: 356  LLKHRSRLKVIFNSPELAANSAYTNKSQSVNCIAILDDNDFWRTAEECVAVSEPFLKVMR 415

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+SGGKP+VG+IYE +TRAK+SIRTYYIMDE KCKTFLDIVD+ W+N LHSPLH+AAAFL
Sbjct: 416  EVSGGKPAVGTIYELLTRAKESIRTYYIMDEIKCKTFLDIVDKNWKNNLHSPLHSAAAFL 475

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NP IQYN EVKFLG+IK++F  VLEKLLPTPELR DIT QI L+ +A GMFGCNLA+EA 
Sbjct: 476  NPGIQYNREVKFLGSIKEDFFRVLEKLLPTPELRRDITTQILLYTRASGMFGCNLAKEAI 535

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            +T+ PG+WWEQYGD+AP LQRVA++ILSQVCS   FER+WSTFQQIHSEKRN++DKETL 
Sbjct: 536  DTVPPGIWWEQYGDAAPTLQRVAIKILSQVCSTFTFERHWSTFQQIHSEKRNKIDKETLL 595

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKLA     KP E DP+ +DDIDMTS+WVEE ENPSPTQWLDRFGS LDG+D
Sbjct: 596  DLVYINYNLKLARYLVSKPPEEDPLQLDDIDMTSEWVEEAENPSPTQWLDRFGSGLDGND 655

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQFT A+FG  D+IFGL
Sbjct: 656  LNTRQFTAAIFGPGDNIFGL 675


>ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256946 [Solanum
            lycopersicum]
          Length = 739

 Score =  994 bits (2569), Expect = 0.0
 Identities = 478/680 (70%), Positives = 572/680 (84%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            +VREKDVCWEY EKLEGNKVRCKFCLR+LNGGISRLKHHLSRLPSKGV+PC+KVRDDVTD
Sbjct: 65   VVREKDVCWEYAEKLEGNKVRCKFCLRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTD 124

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVR II             K +L E K+  N+S  K L+S+   T P+++ +P   Q   
Sbjct: 125  RVRDIIG----SKEPPSTKKHKLIETKALANISPEKPLLSVEPIT-PIARIFPPIGQAIS 179

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
            S+  ++ENAERSIALFFFENK+DF VARS SY  M+EA+ KCG GF GPS E LK TWLE
Sbjct: 180  SSGNNQENAERSIALFFFENKIDFGVARSSSYHQMIEAVGKCGSGFIGPSPETLKATWLE 239

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSEVS QSK++EKEW +TGCT+IA+TWTDNK +ALINFLVSSPS TFF+KSV+AS+YF
Sbjct: 240  RIKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALINFLVSSPSRTFFYKSVDASSYF 299

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
            K+ KCL++LFDS+IQDFGPENVVQVI+D +L+   + ++I+Q+Y ++F +PCAS C+N I
Sbjct: 300  KNLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVNHILQNYGNVFVSPCASQCINAI 359

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            L++FSK+DWVNRCILQAQS+SKFIYN++ +L+LMKKFTGGQE+++T ITKS SNFLSLQ 
Sbjct: 360  LDEFSKLDWVNRCILQAQSLSKFIYNNSPLLDLMKKFTGGQEIIKTGITKSVSNFLSLQC 419

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            +LK RSRLK +FNSPE + N AY NK QS++CI IL+DNDFW+  EECVAVSEP LKV+R
Sbjct: 420  LLKHRSRLKVIFNSPELAANSAYTNKSQSVNCITILDDNDFWRTAEECVAVSEPFLKVMR 479

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+SGGKP+VG+IYE +TRAK+SIRTYYIMDE KCKTFLDIVD+ W+N LHSPLH+AAAFL
Sbjct: 480  EVSGGKPAVGTIYELLTRAKESIRTYYIMDEIKCKTFLDIVDKNWKNNLHSPLHSAAAFL 539

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NP IQYNPEVKFLG+IK++F  VLEKLLPTPELR DIT QI L+ +A GMFGCNLA+EA 
Sbjct: 540  NPGIQYNPEVKFLGSIKEDFFRVLEKLLPTPELRRDITTQILLYTRASGMFGCNLAKEAI 599

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            +T+ PG+WWEQYGD+AP LQRVA++ILSQVCS    ER+WSTFQQIHSEKRN++DKETL 
Sbjct: 600  DTVPPGIWWEQYGDAAPTLQRVAIKILSQVCSTFTCERHWSTFQQIHSEKRNKIDKETLL 659

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKLA     KP E DP+ +DDIDMTS+WVEE ENPSPTQWLDRFGS LDG+D
Sbjct: 660  DLVYINYNLKLARYLVSKPPEEDPLQLDDIDMTSEWVEEAENPSPTQWLDRFGSGLDGND 719

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQFT A+FG  D+IFGL
Sbjct: 720  LNTRQFTAAIFGPGDNIFGL 739


>ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prunus persica]
            gi|462415319|gb|EMJ20056.1| hypothetical protein
            PRUPE_ppa002763mg [Prunus persica]
          Length = 636

 Score =  980 bits (2534), Expect = 0.0
 Identities = 486/680 (71%), Positives = 551/680 (81%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD
Sbjct: 1    MVREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1739
            RVR IIA            KQ+L E KSPGN+S+SKALMS    T P+ K +P+ T + P
Sbjct: 61   RVRTIIASKEEVKETSSGKKQKLVEVKSPGNVSASKALMSFDTPT-PIQKVFPNVTPMVP 119

Query: 1738 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1559
                ++ENAER+IALFFFENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TWLE
Sbjct: 120  PPLNNQENAERNIALFFFENKLDFSIARSSSYQLMIDAIEKCGPGFIGPSAETLKTTWLE 179

Query: 1558 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1379
            R+KSE+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFL+                  
Sbjct: 180  RIKSEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFLI------------------ 221

Query: 1378 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1199
                                     IMD S NY  V ++I+Q+Y +IF +PCAS CLNLI
Sbjct: 222  -------------------------IMDSSFNYTGVANHILQNYATIFVSPCASQCLNLI 256

Query: 1198 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1019
            LE+FSK+DWVNRCILQAQ+ISKFIYN+  +L+LMKKFTGGQEL+RT ITKS SNFLSLQS
Sbjct: 257  LEEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGGQELIRTGITKSVSNFLSLQS 316

Query: 1018 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 839
            +LKQRSRLKHMFNSPEY TN +YANK QSISCI I+EDNDFW+AVEE VA+SEP LKVLR
Sbjct: 317  LLKQRSRLKHMFNSPEYCTNSSYANKTQSISCISIVEDNDFWRAVEESVAISEPFLKVLR 376

Query: 838  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 659
            E+SGGKPSVG IYE MTRAK+SIRTYYIMDENKCKTFLDIVDR+W +QLHSPLHAAAAFL
Sbjct: 377  EVSGGKPSVGFIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHAAAAFL 436

Query: 658  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 479
            NP IQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA EAR
Sbjct: 437  NPGIQYNPEIKFLTSIKEDFFKVLEKLLPMPEMRRDITSQIFTFTKATGMFGCSLAMEAR 496

Query: 478  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 299
            + +SPGLWWEQYGDSAP LQRVA+RILSQVCS+  FER+WS FQQIHSEKRN++D+ETL 
Sbjct: 497  DVVSPGLWWEQYGDSAPVLQRVAIRILSQVCSSFMFERHWSAFQQIHSEKRNKIDRETLN 556

Query: 298  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 119
            DL+YINYNLKLA +T+ K +E DPI  DDIDMTS+WVEE++NPSPTQWLDRFGSALDGSD
Sbjct: 557  DLVYINYNLKLARQTRTKTLEADPIQFDDIDMTSEWVEESDNPSPTQWLDRFGSALDGSD 616

Query: 118  LNTRQFTNAMFGTNDHIFGL 59
            LNTRQF  A+FG+NDHIFGL
Sbjct: 617  LNTRQFNAAIFGSNDHIFGL 636


>ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobroma cacao]
            gi|508776175|gb|EOY23431.1| HAT transposon superfamily
            isoform 1 [Theobroma cacao]
          Length = 640

 Score =  952 bits (2462), Expect = 0.0
 Identities = 460/629 (73%), Positives = 539/629 (85%)
 Frame = -3

Query: 1945 SKVRDDVTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKT 1766
            +KVRDDVTDRVRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K 
Sbjct: 14   NKVRDDVTDRVRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKV 72

Query: 1765 YPSSTQVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSS 1586
            +P+++ + P +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS 
Sbjct: 73   FPATSPIAPPSLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSV 132

Query: 1585 EALKITWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFH 1406
            E LK  WLER+KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFH
Sbjct: 133  ETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFH 192

Query: 1405 KSVEASTYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTP 1226
            KSV+AS+YFK+ KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +P
Sbjct: 193  KSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSP 252

Query: 1225 CASHCLNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKS 1046
            CAS CLNLILE+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS
Sbjct: 253  CASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKS 312

Query: 1045 TSNFLSLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAV 866
             S+FLSLQSMLKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+
Sbjct: 313  VSSFLSLQSMLKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAI 372

Query: 865  SEPLLKVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHS 686
            SEP LKVLRE+SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHS
Sbjct: 373  SEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHS 432

Query: 685  PLHAAAAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMF 506
            PLH+A AFLNPSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF
Sbjct: 433  PLHSAGAFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMF 492

Query: 505  GCNLAREARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKR 326
             CNLA EAR+T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKR
Sbjct: 493  ACNLAMEARDTVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKR 552

Query: 325  NRLDKETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDR 146
            N++DKE L DL+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDR
Sbjct: 553  NKIDKEILNDLVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDR 612

Query: 145  FGSALDGSDLNTRQFTNAMFGTNDHIFGL 59
            FGSALDG DLNTRQF  A+FG NDHIFGL
Sbjct: 613  FGSALDGGDLNTRQFNAAIFG-NDHIFGL 640


>ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, partial [Capsella rubella]
            gi|482569482|gb|EOA33670.1| hypothetical protein
            CARUB_v10019846mg, partial [Capsella rubella]
          Length = 768

 Score =  915 bits (2366), Expect = 0.0
 Identities = 452/682 (66%), Positives = 552/682 (80%), Gaps = 2/682 (0%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKD+CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PC+KVRDDVTD
Sbjct: 100  MVREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVTD 159

Query: 1918 RVRAIIALXXXXXXXXXXXKQ-RLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVP 1742
            RVR+I+A             + +  E K P     S +L+ +T+ +   SK +P+S   P
Sbjct: 160  RVRSILAAKDDPKDSPLTTNKYKPPEVKPP----LSASLLPVTVSSG--SKLFPTSILAP 213

Query: 1741 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1562
            P+ +  +  AERSI+LFFFENK+D+ VARSPSY  M++AIAKCG  F  PS  +LK  WL
Sbjct: 214  PTPNA-QVIAERSISLFFFENKIDWCVARSPSYHHMLDAIAKCGPAFFAPSPLSLKTEWL 272

Query: 1561 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1382
            +RVKSE+S Q K+ EKEW  TGCTIIA+ WTDNKSRALINF VSSPS  FFHKSV+AS+Y
Sbjct: 273  DRVKSEISLQLKDSEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASSY 332

Query: 1381 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1202
            FK+ KCLADLFDSVIQD G E++VQ+IMD S +Y  + ++I+Q+Y SIF +PCAS CL++
Sbjct: 333  FKNTKCLADLFDSVIQDIGQEHIVQIIMDNSFSYTGISNHILQNYGSIFVSPCASQCLSI 392

Query: 1201 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1022
            ILE+FSK+DWVN+CI QAQ ISKF+YN+  VL+LM+K TGGQ+++RT +T+S SNFLSLQ
Sbjct: 393  ILEEFSKVDWVNQCISQAQVISKFVYNNRPVLDLMRKLTGGQDIIRTGVTRSVSNFLSLQ 452

Query: 1021 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 842
            SM+KQ++RLKHMFNS EY+T    ANKPQS+SC++ILEDNDFW+A+EE VA+SEP+LKVL
Sbjct: 453  SMMKQKARLKHMFNSSEYTTQ---ANKPQSMSCVNILEDNDFWRALEESVAISEPILKVL 509

Query: 841  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 662
            RE+S GKP+VGSIYE M++AK+SIRTYYIMDENK K F +IVD +W + LHSPLHAAAAF
Sbjct: 510  REVSKGKPAVGSIYELMSKAKESIRTYYIMDENKHKVFSNIVDTKWCDHLHSPLHAAAAF 569

Query: 661  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 482
            LNPSIQYNPE+KFL ++K++F  VLEKLLPT +LR DIT QIF F +A+GMFGCNLA EA
Sbjct: 570  LNPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNLAMEA 629

Query: 481  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 302
            R+++SPGLWWEQ+GDSAP LQRVA+RILSQVCS+ N ER WSTFQQ+H E+RN +D+E L
Sbjct: 630  RDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSSYNLERQWSTFQQMHWERRNTIDREIL 689

Query: 301  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGS 122
             +L Y+N NLKL        +ETD I ++DIDM S+WVEE ENPSP QWLDRFGSALDG 
Sbjct: 690  NNLAYVNQNLKLGRMI---TLETDSISLEDIDMMSEWVEEAENPSPAQWLDRFGSALDGG 746

Query: 121  DLNTRQFTNAMFGTNDH-IFGL 59
            DLNTRQF  A+F  NDH IFGL
Sbjct: 747  DLNTRQFGGAIFSANDHNIFGL 768


>ref|NP_178092.4| hAT family dimerization domain-containing protein [Arabidopsis
            thaliana] gi|332198172|gb|AEE36293.1| hAT family
            dimerization domain-containing protein [Arabidopsis
            thaliana]
          Length = 651

 Score =  907 bits (2345), Expect = 0.0
 Identities = 451/682 (66%), Positives = 545/682 (79%), Gaps = 2/682 (0%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            MVREKD+CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PC+KVRDDVTD
Sbjct: 1    MVREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVTD 60

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSK-TYPSSTQVP 1742
            RVR+I++                 + K P  LS            +P SK  +PSS   P
Sbjct: 61   RVRSILSAKDDPPITN--------KYKPPPPLSPP--------FDAPASKLVFPSS---P 101

Query: 1741 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1562
            P+A   ++ AERSI+LFFFENK+DF+VARSPSY  M++A+AKCG GF  PS    K  WL
Sbjct: 102  PNA---QDIAERSISLFFFENKIDFAVARSPSYHHMLDAVAKCGPGFVAPSP---KTEWL 155

Query: 1561 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1382
            +RVKS++S Q K+ EKEW  TGCTIIA+ WTDNKSRALINF VSSPS  FFHKSV+AS+Y
Sbjct: 156  DRVKSDISLQLKDTEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASSY 215

Query: 1381 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1202
            FK+ KCLADLFDSVIQD G E++VQ+IMD S  Y  + ++++Q+Y +IF +PCAS CLN+
Sbjct: 216  FKNSKCLADLFDSVIQDIGQEHIVQIIMDNSFCYTGISNHLLQNYATIFVSPCASQCLNI 275

Query: 1201 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1022
            ILE+FSK+DWVN+CI QAQ ISKF+YN++ VL+L++K TGGQ+++R+ +T+S SNFLSLQ
Sbjct: 276  ILEEFSKVDWVNQCISQAQVISKFVYNNSPVLDLLRKLTGGQDIIRSGVTRSVSNFLSLQ 335

Query: 1021 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 842
            SM+KQ++RLKHMFN PEY+TN    NKPQSISC++ILEDNDFW+AVEE VA+SEP+LKVL
Sbjct: 336  SMMKQKARLKHMFNCPEYTTN---TNKPQSISCVNILEDNDFWRAVEESVAISEPILKVL 392

Query: 841  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 662
            RE+S GKP+VGSIYE M++AK+SIRTYYIMDENK K F DIVD  W   LHSPLHAAAAF
Sbjct: 393  REVSTGKPAVGSIYELMSKAKESIRTYYIMDENKHKVFSDIVDTNWCEHLHSPLHAAAAF 452

Query: 661  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 482
            LNPSIQYNPE+KFL ++K++F  VLEKLLPT +LR DIT QIF F +A+GMFGCNLA EA
Sbjct: 453  LNPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNLAMEA 512

Query: 481  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 302
            R+++SPGLWWEQ+GDSAP LQRVA+RILSQVCS  N ER WSTFQQ+H E+RN++D+E L
Sbjct: 513  RDSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGYNLERQWSTFQQMHWERRNKIDREIL 572

Query: 301  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGS 122
              L Y+N NLKL        +ETDPI ++DIDM S+WVEE ENPSP QWLDRFG+ALDG 
Sbjct: 573  NKLAYVNQNLKLGRMI---TLETDPIALEDIDMMSEWVEEAENPSPAQWLDRFGTALDGG 629

Query: 121  DLNTRQFTNAMFGTNDH-IFGL 59
            DLNTRQF  A+F  NDH IFGL
Sbjct: 630  DLNTRQFGGAIFSANDHNIFGL 651


>gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indica Group]
          Length = 1045

 Score =  809 bits (2089), Expect = 0.0
 Identities = 402/700 (57%), Positives = 527/700 (75%), Gaps = 20/700 (2%)
 Frame = -3

Query: 2098 MVREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTD 1919
            ++RE+DVCWEYC+K+EGNKVRC+FC +VLNGGISRLK HLS++ SKGV+PC+KV+ DV +
Sbjct: 353  ILRERDVCWEYCDKMEGNKVRCRFCYKVLNGGISRLKFHLSQISSKGVNPCTKVKPDVIE 412

Query: 1918 RVRAIIALXXXXXXXXXXXKQRLAE--------------------AKSPGNLSSSKALMS 1799
            +V+A+IA            +QR  E                    A SP   S+S     
Sbjct: 413  KVKAVIAAKEEHRETQVLKRQRDTELSVRPRRIRDLPSQPTSPERATSPAITSTSDQTQF 472

Query: 1798 MTMETSPMSKTYPSSTQVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIA 1619
            + +E S       S T    SA   +  AER IA FFFENKLD+++A S SY+ MMEA+ 
Sbjct: 473  LALEVSTPVLKLSSVTNKARSAP--QSEAERCIAEFFFENKLDYNIADSVSYRHMMEALG 530

Query: 1618 KCGHGFRGPSSEALKITWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINF 1439
              G GFRGPS+E LK  WL ++KSEV +++KEIEK+W  TGCTI+AD+WTDNKS+ALINF
Sbjct: 531  --GQGFRGPSAEVLKTKWLHKLKSEVLQKTKEIEKDWATTGCTILADSWTDNKSKALINF 588

Query: 1438 LVSSPSGTFFHKSVEASTYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYI 1259
             VSSP GTFF K+V+AS + KS + L +LFD VI++ GP+NVVQ+I D ++NY +V   I
Sbjct: 589  SVSSPLGTFFLKTVDASPHIKSHQ-LYELFDDVIREVGPDNVVQIITDRNINYGSVDKLI 647

Query: 1258 MQSYNSIFFTPCASHCLNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGG 1079
            MQ+YN+IF++PCAS C+N +L+DFSKIDWVNRCI QAQ+I++F+YN+ WVL+LM+K   G
Sbjct: 648  MQNYNTIFWSPCASSCVNSMLDDFSKIDWVNRCICQAQTITRFVYNNKWVLDLMRKCIAG 707

Query: 1078 QELVRTSITKSTSNFLSLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDND 899
            QELV + ITK  S+FL+LQS+L+ R +LK MF+S +Y+++ +YAN+  S SC++IL+D++
Sbjct: 708  QELVCSGITKCVSDFLTLQSLLRYRPKLKQMFHSSDYASS-SYANRSLSSSCVEILDDDE 766

Query: 898  FWKAVEECVAVSEPLLKVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDI 719
            FW+AVEE  AVSEPLL+V+R++SGGK ++G IYE MT+  DSIRTYYIMDE KCK+FLDI
Sbjct: 767  FWRAVEEIAAVSEPLLRVMRDVSGGKAAIGYIYESMTKVMDSIRTYYIMDEGKCKSFLDI 826

Query: 718  VDRRWENQLHSPLHAAAAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQ 539
            V+++W+ +LHSPLH+AAAFLNPSIQYNPEVKF  +IK+EF  VL+K+L  P+ R+ IT +
Sbjct: 827  VEQKWQVELHSPLHSAAAFLNPSIQYNPEVKFFSSIKEEFYHVLDKVLTVPDQRQGITVE 886

Query: 538  IFLFKKAQGMFGCNLAREARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNW 359
            +  F+KAQGMFG N+A+EARN  SPG+WWEQYGDSAP LQ  AVRI+SQVCS   F+R+W
Sbjct: 887  LHAFRKAQGMFGSNIAKEARNNTSPGMWWEQYGDSAPSLQHAAVRIVSQVCSTLTFQRDW 946

Query: 358  STFQQIHSEKRNRLDKETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEET 179
            S   + HSEKRN+LDKE L D  Y++YN  L S +K K  + DPI +D IDMTS WVE++
Sbjct: 947  SIIVRNHSEKRNKLDKEALADQAYVHYNFMLHSDSKMKKGDGDPIALDAIDMTSPWVEDS 1006

Query: 178  ENPSPTQWLDRFGSALDGSDLNTRQFTNAMFGTNDHIFGL 59
            ++P+  QWLDRF SALDG DLNTRQF  ++FGTND +FGL
Sbjct: 1007 DSPNLAQWLDRFPSALDG-DLNTRQFGGSIFGTNDTLFGL 1045

