BLASTX nr result

ID: Akebia25_contig00027424 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00027424
         (2081 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265...  1075   0.0  
emb|CBI22554.3| unnamed protein product [Vitis vinifera]             1075   0.0  
gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]    1070   0.0  
ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobr...  1056   0.0  
ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobr...  1056   0.0  
ref|XP_002513602.1| protein dimerization, putative [Ricinus comm...  1050   0.0  
ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298...  1043   0.0  
ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215...  1037   0.0  
ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496...  1033   0.0  
ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  1032   0.0  
ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618...  1019   0.0  
ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808...  1014   0.0  
ref|XP_003602175.1| Protein dimerization [Medicago truncatula] g...  1008   0.0  
ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256...   993   0.0  
ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593...   992   0.0  
ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prun...   978   0.0  
ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobr...   952   0.0  
ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, part...   914   0.0  
ref|NP_178092.4| hAT family dimerization domain-containing prote...   905   0.0  
gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indi...   808   0.0  

>ref|XP_002267489.2| PREDICTED: uncharacterized protein LOC100265581 [Vitis vinifera]
          Length = 723

 Score = 1075 bits (2779), Expect = 0.0
 Identities = 527/679 (77%), Positives = 594/679 (87%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 47   VREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 106

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAII+            KQR+AEAKSPGN S+ KALMS+    SP++K +P  T + PS
Sbjct: 107  VRAIISSKEDGKETSSAKKQRVAEAKSPGNYSAIKALMSVETP-SPIAKIFPPITHMGPS 165

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +S D ENAERSIALFFFENKLDFSVARS SYQLM+EA++KCGHGFRGPS+E LK TWLER
Sbjct: 166  SSNDGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEILKTTWLER 225

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEVS QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS+YFK
Sbjct: 226  IKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFK 285

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LADLFDSVIQD GP+NVVQ+IMD +LNY  V S+I+Q+Y ++F +PCAS CLNLIL
Sbjct: 286  NTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCASQCLNLIL 345

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            EDF KIDWVNRCILQAQ+ISKFIYN+  +L+LMKK TGGQ+L+RT ITKS SNFLSLQSM
Sbjct: 346  EDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQDLIRTGITKSVSNFLSLQSM 405

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQR RLKHMF S EYSTN +Y+NKPQ+ISCI ILEDNDFW+AVEECVA+SEP LK LRE
Sbjct: 406  LKQRPRLKHMFGSSEYSTN-SYSNKPQNISCIAILEDNDFWRAVEECVAISEPFLKGLRE 464

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MT+AK+SIRTYYIMDE+KCK FLDIVD RW NQLHSPLHAAAAFLN
Sbjct: 465  VSGGKPAVGSIYELMTKAKESIRTYYIMDESKCKAFLDIVDGRWRNQLHSPLHAAAAFLN 524

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KF+G IK++F  VLEKLLPT ++R DIT QI LF +A GMFGCNLAREAR+
Sbjct: 525  PSIQYNPEIKFIGAIKEDFFKVLEKLLPTSDMRRDITNQILLFTRATGMFGCNLAREARD 584

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+ PGLWWEQ+GDSAP LQRVA+RILSQVCS S FER+W+TFQQIHSEKRN++DKETL D
Sbjct: 585  TVPPGLWWEQFGDSAPVLQRVAIRILSQVCSTSTFERHWNTFQQIHSEKRNKIDKETLND 644

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA + K K  E DP+  DDIDMTS+WVEETENPSPTQWLDRFGSALDGSDL
Sbjct: 645  LVYINYNLKLARQMKMKSSEADPLQFDDIDMTSEWVEETENPSPTQWLDRFGSALDGSDL 704

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG++D IFGL
Sbjct: 705  NTRQFNAAIFGSSDTIFGL 723


>emb|CBI22554.3| unnamed protein product [Vitis vinifera]
          Length = 731

 Score = 1075 bits (2779), Expect = 0.0
 Identities = 527/679 (77%), Positives = 594/679 (87%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 55   VREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 114

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAII+            KQR+AEAKSPGN S+ KALMS+    SP++K +P  T + PS
Sbjct: 115  VRAIISSKEDGKETSSAKKQRVAEAKSPGNYSAIKALMSVETP-SPIAKIFPPITHMGPS 173

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +S D ENAERSIALFFFENKLDFSVARS SYQLM+EA++KCGHGFRGPS+E LK TWLER
Sbjct: 174  SSNDGENAERSIALFFFENKLDFSVARSSSYQLMIEAVSKCGHGFRGPSAEILKTTWLER 233

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEVS QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS+YFK
Sbjct: 234  IKSEVSLQSKDIEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFK 293

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LADLFDSVIQD GP+NVVQ+IMD +LNY  V S+I+Q+Y ++F +PCAS CLNLIL
Sbjct: 294  NTKYLADLFDSVIQDLGPDNVVQIIMDSTLNYTGVASHIVQNYGTVFVSPCASQCLNLIL 353

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            EDF KIDWVNRCILQAQ+ISKFIYN+  +L+LMKK TGGQ+L+RT ITKS SNFLSLQSM
Sbjct: 354  EDFCKIDWVNRCILQAQTISKFIYNNASMLDLMKKSTGGQDLIRTGITKSVSNFLSLQSM 413

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQR RLKHMF S EYSTN +Y+NKPQ+ISCI ILEDNDFW+AVEECVA+SEP LK LRE
Sbjct: 414  LKQRPRLKHMFGSSEYSTN-SYSNKPQNISCIAILEDNDFWRAVEECVAISEPFLKGLRE 472

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MT+AK+SIRTYYIMDE+KCK FLDIVD RW NQLHSPLHAAAAFLN
Sbjct: 473  VSGGKPAVGSIYELMTKAKESIRTYYIMDESKCKAFLDIVDGRWRNQLHSPLHAAAAFLN 532

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KF+G IK++F  VLEKLLPT ++R DIT QI LF +A GMFGCNLAREAR+
Sbjct: 533  PSIQYNPEIKFIGAIKEDFFKVLEKLLPTSDMRRDITNQILLFTRATGMFGCNLAREARD 592

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+ PGLWWEQ+GDSAP LQRVA+RILSQVCS S FER+W+TFQQIHSEKRN++DKETL D
Sbjct: 593  TVPPGLWWEQFGDSAPVLQRVAIRILSQVCSTSTFERHWNTFQQIHSEKRNKIDKETLND 652

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA + K K  E DP+  DDIDMTS+WVEETENPSPTQWLDRFGSALDGSDL
Sbjct: 653  LVYINYNLKLARQMKMKSSEADPLQFDDIDMTSEWVEETENPSPTQWLDRFGSALDGSDL 712

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG++D IFGL
Sbjct: 713  NTRQFNAAIFGSSDTIFGL 731


>gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]
          Length = 694

 Score = 1070 bits (2766), Expect = 0.0
 Identities = 518/679 (76%), Positives = 591/679 (87%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 17   VREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 76

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAIIA            KQ+L E KSPGN+S+SKAL+S T  TSP++K +P+ T V P 
Sbjct: 77   VRAIIASKEDVKETSSTKKQKLVEVKSPGNVSASKALVS-TDTTSPVAKVFPAVTPVAPP 135

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +   +ENAERSIALFFFENKLDF +ARS SYQLM++AIAKCG GF GPS+E LK TWLER
Sbjct: 136  SLNSQENAERSIALFFFENKLDFGIARSSSYQLMVDAIAKCGPGFTGPSAETLKTTWLER 195

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSE+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS YFK
Sbjct: 196  IKSEMSLQSKDIEKEWMTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASAYFK 255

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCLADLFDSVIQDFGP+NVVQVIMD S NY  V ++I+Q+Y++IF +PC S CLNLIL
Sbjct: 256  NMKCLADLFDSVIQDFGPDNVVQVIMDSSFNYTGVANHILQNYSTIFVSPCVSQCLNLIL 315

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DWVNRCILQ Q+ISKFIYN   +L+LMKK+TGGQEL+RT ITKS S+FLSLQS+
Sbjct: 316  EEFSKVDWVNRCILQGQTISKFIYNSASMLDLMKKYTGGQELIRTGITKSVSSFLSLQSI 375

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQ+SRLKHMFNSPEY TN  Y NKPQSISCI I+ED+DFW+AVEE VA+SEP LKVLRE
Sbjct: 376  LKQKSRLKHMFNSPEYCTNSLYVNKPQSISCISIVEDSDFWRAVEESVAISEPFLKVLRE 435

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            ++GGKP+VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVDR+W +QLHSPLH+AAAFLN
Sbjct: 436  VAGGKPAVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHSAAAFLN 495

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA  MFGC+LA EAR+
Sbjct: 496  PSIQYNPEIKFLSSIKEDFFKVLEKLLPLPEMRRDITSQIFTFTKAMSMFGCSLAMEARD 555

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
             +SPGLWWEQYGDSAP LQRVA+RILSQVCS+  FER+WS FQQIHSEKRN++D+ETL D
Sbjct: 556  VVSPGLWWEQYGDSAPVLQRVAIRILSQVCSSFTFERHWSAFQQIHSEKRNKIDRETLND 615

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA  T+ K IE DPI  DDIDMTS+WVEE++N SP+QWLDRFGSALDGSDL
Sbjct: 616  LVYINYNLKLARHTRTKSIEADPIQFDDIDMTSEWVEESDNSSPSQWLDRFGSALDGSDL 675

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQ+  A+FG+NDHIFGL
Sbjct: 676  NTRQYNAAIFGSNDHIFGL 694


>ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobroma cacao]
            gi|508776178|gb|EOY23434.1| HAT transposon superfamily
            isoform 4 [Theobroma cacao]
          Length = 682

 Score = 1056 bits (2730), Expect = 0.0
 Identities = 508/679 (74%), Positives = 588/679 (86%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 6    VREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 65

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K +P+++ + P 
Sbjct: 66   VRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKVFPATSPIAPP 124

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS E LK  WLER
Sbjct: 125  SLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLER 184

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS+YFK
Sbjct: 185  IKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFK 244

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 245  NTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLIL 304

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS S+FLSLQSM
Sbjct: 305  EEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSM 364

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+SEP LKVLRE
Sbjct: 365  LKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLRE 424

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLH+A AFLN
Sbjct: 425  VSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLN 484

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF CNLA EAR+
Sbjct: 485  PSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARD 544

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKRN++DKE L D
Sbjct: 545  TVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILND 604

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDRFGSALDG DL
Sbjct: 605  LVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDRFGSALDGGDL 664

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG NDHIFGL
Sbjct: 665  NTRQFNAAIFG-NDHIFGL 682


>ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobroma cacao]
            gi|590673575|ref|XP_007038932.1| HAT transposon
            superfamily isoform 2 [Theobroma cacao]
            gi|508776176|gb|EOY23432.1| HAT transposon superfamily
            isoform 2 [Theobroma cacao] gi|508776177|gb|EOY23433.1|
            HAT transposon superfamily isoform 2 [Theobroma cacao]
          Length = 678

 Score = 1056 bits (2730), Expect = 0.0
 Identities = 508/679 (74%), Positives = 588/679 (86%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 2    VREKDVCWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K +P+++ + P 
Sbjct: 62   VRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKVFPATSPIAPP 120

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS E LK  WLER
Sbjct: 121  SLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLER 180

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS+YFK
Sbjct: 181  IKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFK 240

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 241  NTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLIL 300

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS S+FLSLQSM
Sbjct: 301  EEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSM 360

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+SEP LKVLRE
Sbjct: 361  LKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLRE 420

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLH+A AFLN
Sbjct: 421  VSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLN 480

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF CNLA EAR+
Sbjct: 481  PSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARD 540

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKRN++DKE L D
Sbjct: 541  TVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILND 600

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDRFGSALDG DL
Sbjct: 601  LVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDRFGSALDGGDL 660

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG NDHIFGL
Sbjct: 661  NTRQFNAAIFG-NDHIFGL 678


>ref|XP_002513602.1| protein dimerization, putative [Ricinus communis]
            gi|223547510|gb|EEF49005.1| protein dimerization,
            putative [Ricinus communis]
          Length = 688

 Score = 1050 bits (2715), Expect = 0.0
 Identities = 506/679 (74%), Positives = 589/679 (86%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 11   VREKDVCWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 70

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAIIA            KQR AEAKSP ++ ++KAL+++    +P +K YP+ T + P 
Sbjct: 71   VRAIIASKEDIKEPSSAKKQRPAEAKSPAHIYATKALVNVE-SVAPAAKVYPTVTSISPP 129

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +  ++ENAERSIALFFFENKLDFSVARSPSYQLM+EAI KCG GF GPS+E LK TWLER
Sbjct: 130  SLSNQENAERSIALFFFENKLDFSVARSPSYQLMIEAIEKCGPGFTGPSAEILKTTWLER 189

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEVS Q K+ EKEW  TGCTIIADTWTDNKSRALINF VSSPS TFFHKSV+AS+YFK
Sbjct: 190  IKSEVSLQLKDTEKEWTTTGCTIIADTWTDNKSRALINFFVSSPSRTFFHKSVDASSYFK 249

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCLADLFDSVIQDFG ENVVQ+IMD S NY  V ++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 250  NTKCLADLFDSVIQDFGAENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQCLNLIL 309

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            EDFSK+DWVNRCI QAQ++SKFIYN++ +L+LMKKFTGGQEL++T ITKS S+FLSLQSM
Sbjct: 310  EDFSKVDWVNRCISQAQTLSKFIYNNSSMLDLMKKFTGGQELIKTGITKSVSSFLSLQSM 369

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQR RLK MF+S EYS N +Y++KPQSI+CI I+ED DFW+AVEECVA++EP LKVLRE
Sbjct: 370  LKQRPRLKLMFSSNEYSANSSYSSKPQSIACITIVEDGDFWRAVEECVAITEPFLKVLRE 429

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MTRAK+SIRTYYIMDE+KCKTFLDIVDR+W +QLHSPLH+AAAFLN
Sbjct: 430  VSGGKPAVGSIYELMTRAKESIRTYYIMDESKCKTFLDIVDRKWRDQLHSPLHSAAAFLN 489

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            P +QYNPE+KFL NIK++F  V+EKLLPTP++R DIT QIF+F +A GMFGCNLA EAR+
Sbjct: 490  PCVQYNPEIKFLVNIKEDFFKVIEKLLPTPDMRRDITNQIFIFTRASGMFGCNLAMEARD 549

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T++PGLWWEQYGDSAP LQRVA+RILSQVCS   FER+W+TF+QIHSEKRN++DKETL D
Sbjct: 550  TVAPGLWWEQYGDSAPVLQRVAIRILSQVCSTFTFERHWNTFRQIHSEKRNKIDKETLND 609

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKL  + + K  ETDPI  DDIDMTS+WVEET+NPSPTQWLDRFGSALDGSDL
Sbjct: 610  LVYINYNLKLMRQMRTKSSETDPIQFDDIDMTSEWVEETDNPSPTQWLDRFGSALDGSDL 669

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG +D +FGL
Sbjct: 670  NTRQFNAAIFGASDPLFGL 688


>ref|XP_004308021.1| PREDICTED: uncharacterized protein LOC101298657 [Fragaria vesca
            subsp. vesca]
          Length = 681

 Score = 1043 bits (2698), Expect = 0.0
 Identities = 505/681 (74%), Positives = 586/681 (86%), Gaps = 2/681 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTD+
Sbjct: 2    VREKDTCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDK 61

Query: 1899 VRAIIALXXXXXXXXXXXKQR-LAEAKSPG-NLSSSKALMSMTMETSPMSKTYPSSTQVP 1726
            VR IIA            K++   E KSP  N+S  KALMSM    SP+ K YP+ T + 
Sbjct: 62   VRTIIASKEEVKETSSSSKKKKFVEVKSPPVNVSPVKALMSMETP-SPIQKVYPNVTPMA 120

Query: 1725 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1546
            P +  ++ENAERSIALFFFENK+DFS+AR+ SYQLM++AI KCG GF GPS+E LK TWL
Sbjct: 121  PLSMNNQENAERSIALFFFENKIDFSIARTSSYQLMIDAITKCGPGFTGPSAETLKTTWL 180

Query: 1545 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1366
            ERVK+E+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFHKSV+AS Y
Sbjct: 181  ERVKTEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASAY 240

Query: 1365 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1186
            FK+ KCLA+LFDSVIQDFGPENVVQ+IMD S NY  V ++I+ +Y +IF +PCAS CLNL
Sbjct: 241  FKNTKCLAELFDSVIQDFGPENVVQIIMDSSFNYTGVANHILTNYTTIFVSPCASQCLNL 300

Query: 1185 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1006
            ILE+FSK+DWVNRC LQAQ+ISKFIYN+  +L+LMK+FTGGQ+L+RT ITKS S+FLSLQ
Sbjct: 301  ILEEFSKVDWVNRCFLQAQTISKFIYNNASMLDLMKRFTGGQDLIRTGITKSVSSFLSLQ 360

Query: 1005 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 826
            ++LKQRSRLKHMFNSPE+ TN +YANK QSISCI I+EDNDFW+A EE VA+SEP LKVL
Sbjct: 361  TILKQRSRLKHMFNSPEFCTNSSYANKTQSISCISIMEDNDFWRAAEESVAISEPFLKVL 420

Query: 825  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 646
            RE+SGGKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVDR+W +QLHSPLHAAAAF
Sbjct: 421  REVSGGKPAVGSIYELMTRAKESIRTYYIMDENKCKVFLDIVDRKWRDQLHSPLHAAAAF 480

Query: 645  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 466
            LNPSIQYNPE+KFL +IK++F  VLEKLLP+PE+R DIT QIF F KA GMFGC+LA EA
Sbjct: 481  LNPSIQYNPEIKFLTSIKEDFFKVLEKLLPSPEMRRDITNQIFTFTKATGMFGCSLAMEA 540

Query: 465  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 286
            R+ +SPGLWWEQYGDSAP LQRVA+RILSQVCS   FE++WS FQQIHSEKRN++D+ETL
Sbjct: 541  RDVVSPGLWWEQYGDSAPVLQRVAIRILSQVCSTFTFEKHWSAFQQIHSEKRNKIDRETL 600

Query: 285  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGS 106
             DL+YINYNL+L+ +T+ K +E DPIL DDIDMTS+WVEE+++PSPTQWLDRFGSALDGS
Sbjct: 601  NDLVYINYNLRLSKQTRNKNVEADPILFDDIDMTSEWVEESDSPSPTQWLDRFGSALDGS 660

Query: 105  DLNTRQFTNAMFGTNDHIFGL 43
            DLNTRQF  A+FG+NDHIFGL
Sbjct: 661  DLNTRQFNAAIFGSNDHIFGL 681


>ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215128, partial [Cucumis
            sativus]
          Length = 685

 Score = 1037 bits (2681), Expect = 0.0
 Identities = 502/681 (73%), Positives = 588/681 (86%), Gaps = 2/681 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD+CWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPS+GV+PCSKVRDDV+DR
Sbjct: 6    VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDR 65

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSM-TMET-SPMSKTYPSSTQVP 1726
            VRAI+A            KQ+LAE K+  ++ S     S+ ++ET SP++K +P+ T + 
Sbjct: 66   VRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTPMA 125

Query: 1725 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1546
            P + ++ ENAE+SIALFFFENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TWL
Sbjct: 126  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKTTWL 185

Query: 1545 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1366
            ER+K+EVS QSK+IEKEW  TGCTII DTWTDNKSRALINFLVSSPS TFFHKSV+ASTY
Sbjct: 186  ERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSVDASTY 245

Query: 1365 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1186
            FK+ KCL DLFDSVIQDFG ENVVQ+IMD SLNY    ++I+Q+Y +IF +PCAS CLN 
Sbjct: 246  FKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQCLNS 305

Query: 1185 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1006
            ILE+FSK+DWVNRCILQAQ+ISKF+YN + +L+LM++FTGGQEL+RT I+K  S+FLSLQ
Sbjct: 306  ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQ 365

Query: 1005 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 826
            S+LKQRSRLKHMFNSP+Y+TN +YANKPQSISCI I+EDNDFW+AVEECVA+SEP L+VL
Sbjct: 366  SILKQRSRLKHMFNSPDYTTN-SYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVL 424

Query: 825  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 646
            RE+ GGKP+VG IYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLHAAAAF
Sbjct: 425  REVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAF 484

Query: 645  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 466
            LNPSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA EA
Sbjct: 485  LNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEA 544

Query: 465  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 286
            R+T+SP LWWEQ+GDSAP LQRVA+RILSQVCS  +FER+WS FQQIHSEKRN++DKETL
Sbjct: 545  RDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL 604

Query: 285  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGS 106
             DL+YINYNLKLA + + KP+E+DPI  DDIDMTS+WVEE+EN SPTQWLDRFGS+LDGS
Sbjct: 605  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGS 664

Query: 105  DLNTRQFTNAMFGTNDHIFGL 43
            DLNTRQF  AMFG NDHIF L
Sbjct: 665  DLNTRQFNAAMFGANDHIFNL 685


>ref|XP_004502603.1| PREDICTED: uncharacterized protein LOC101496447 isoform X1 [Cicer
            arietinum] gi|502136218|ref|XP_004502604.1| PREDICTED:
            uncharacterized protein LOC101496447 isoform X2 [Cicer
            arietinum]
          Length = 679

 Score = 1033 bits (2671), Expect = 0.0
 Identities = 494/679 (72%), Positives = 581/679 (85%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVTDR
Sbjct: 2    VREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR IIA            KQ++AE KSPG+LS++KALMS+   TSP  K +P+S  + PS
Sbjct: 62   VRNIIASKDEIKETTSVKKQKVAEVKSPGSLSATKALMSLET-TSPTGKIFPTSNPLTPS 120

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            ++ ++ENAERSIALFFFENKLDFSVARS SYQLM++AI KCG GF GPS+E LK TWLER
Sbjct: 121  STNNQENAERSIALFFFENKLDFSVARSSSYQLMIDAIGKCGPGFTGPSAEILKTTWLER 180

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV  QSK++EKEW  TGCTIIADTWTD KS+A+INFLVSSPS TFFHKSV+AS YFK
Sbjct: 181  IKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRTFFHKSVDASAYFK 240

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LADLFDSVIQ+FGPENVVQ+IMD S NY  + ++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 241  NTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIANHIVQNYGTIFVSPCASQCLNLIL 300

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+F+K+DW++RCILQAQ+ISK IYN+  +L+LMKK++GGQEL+RT +TKS S FLSLQSM
Sbjct: 301  EEFTKVDWISRCILQAQTISKLIYNNASLLDLMKKYSGGQELIRTGVTKSVSTFLSLQSM 360

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LK R+RLKHMF+SPEY++N +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVLRE
Sbjct: 361  LKLRTRLKHMFHSPEYASNTSYANKPQSLSCIAIAEDGDFWRTVEECVAISEPFLKVLRE 420

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +S GKP VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVD++W +QLHSPLHAAAAFLN
Sbjct: 421  VSEGKPIVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDKKWRDQLHSPLHAAAAFLN 480

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KFL +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LAREARN
Sbjct: 481  PSIQYNPEIKFLSSIKEDFFNVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAREARN 540

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F+R WSTF+QIHSEK+N++D+ETL D
Sbjct: 541  TVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFQRQWSTFRQIHSEKKNKIDRETLND 600

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKL  +   K +E D +  DDIDMTS+WVEE E  SPTQWLDRFG ALDG+DL
Sbjct: 601  LVYINYNLKLTKQVNAKSLEVDLLQSDDIDMTSEWVEENETASPTQWLDRFGPALDGNDL 660

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF +++FG ND IFGL
Sbjct: 661  NTRQFGSSIFGANDPIFGL 679


>ref|XP_004160403.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101215128 [Cucumis
            sativus]
          Length = 784

 Score = 1032 bits (2669), Expect = 0.0
 Identities = 500/681 (73%), Positives = 586/681 (86%), Gaps = 2/681 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD+CWEY EKL+GNKV+CKFCLRVLNGGISRLKHHLSRLPS+GV+PCSKVRDDV+DR
Sbjct: 105  VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDR 164

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSM-TMET-SPMSKTYPSSTQVP 1726
            VRAI+A            KQ+LAE K+  ++ S     S+ ++ET SP++K +P+ T + 
Sbjct: 165  VRAILATREEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTPMA 224

Query: 1725 PSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWL 1546
            P + ++ ENAE+SIALF FENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TWL
Sbjct: 225  PPSLHNHENAEKSIALFXFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKTTWL 284

Query: 1545 ERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTY 1366
            ER+K+EVS QSK+IEKEW  TGCTII DTWTDNKSRALINF VSSPS TFFHKSV+ASTY
Sbjct: 285  ERIKTEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFXVSSPSRTFFHKSVDASTY 344

Query: 1365 FKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNL 1186
            FK+ KCL DLFDSVIQDFG ENVVQ+IMD SLNY    ++I+Q+Y +IF +PCAS CLN 
Sbjct: 345  FKNTKCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQCLNS 404

Query: 1185 ILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQ 1006
            ILE+FSK+DWVNRCILQAQ+ISKF+YN + +L+LM++FTGGQEL+RT I+K  S+FLSLQ
Sbjct: 405  ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQ 464

Query: 1005 SMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVL 826
            S+LKQRSRLKHMFNSP+Y+TN +YANKPQSISCI I+EDNDFW+AVEECVA+SEP L+VL
Sbjct: 465  SILKQRSRLKHMFNSPDYTTN-SYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVL 523

Query: 825  REISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAF 646
            RE+ GGKP+VG IYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHSPLHAAAAF
Sbjct: 524  REVCGGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAF 583

Query: 645  LNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREA 466
            LNPSIQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA EA
Sbjct: 584  LNPSIQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEA 643

Query: 465  RNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETL 286
            R+T+SP LWWEQ+GDSAP LQRVA+RILSQVCS  +FER+WS FQQIHSEKRN++DKETL
Sbjct: 644  RDTVSPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETL 703

Query: 285  GDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGS 106
             DL+YINYNLKLA + + KP+E+DPI  DDIDMTS+WVEE+EN SPTQWLDRFGS+LDGS
Sbjct: 704  NDLVYINYNLKLARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGS 763

Query: 105  DLNTRQFTNAMFGTNDHIFGL 43
            DLNTRQF  AMFG NDHIF L
Sbjct: 764  DLNTRQFNAAMFGANDHIFNL 784


>ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis]
          Length = 764

 Score = 1019 bits (2635), Expect = 0.0
 Identities = 500/679 (73%), Positives = 579/679 (85%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD+CWEY EKL+GNKVRCKFCLRVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 90   VREKDICWEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 149

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VRAIIA            KQR+AEAK  G + SSK+LM +    SP++K + + T +  S
Sbjct: 150  VRAIIASKEDVKETPIGKKQRVAEAKPVGIVCSSKSLMPLETP-SPVTKVFATMTPMGNS 208

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +  ++ENAERSIALFFFENKLDF+VARS SYQ M++A+ KCG GF GPS+EALK  WL+R
Sbjct: 209  SLNNQENAERSIALFFFENKLDFAVARSSSYQQMIDAVGKCGPGFTGPSAEALKTMWLDR 268

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV+ QSK+IEKEW +TGCTIIADTWTDNKS+ALINFLVSSPS TFF KSV+ S+ FK
Sbjct: 269  IKSEVNVQSKDIEKEWAMTGCTIIADTWTDNKSKALINFLVSSPSRTFFLKSVDTSSNFK 328

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LAD+FDSVIQD GPENVVQ+IMD S NY  V ++I+Q+Y +IF +PCAS  LN+IL
Sbjct: 329  NTKYLADIFDSVIQDIGPENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQSLNIIL 388

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DWVNRCILQAQ+ISKFIYN+  +L+LMKKFTGG EL+RT ITK  SNFLSLQS+
Sbjct: 389  EEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGGLELIRTGITKYVSNFLSLQSI 448

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQRSRLKHMFNSPEYST+  YANKPQS+SCI I+EDNDFW+AVEE VA+SEP LKVLRE
Sbjct: 449  LKQRSRLKHMFNSPEYSTSSPYANKPQSLSCISIVEDNDFWRAVEESVAISEPFLKVLRE 508

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVDR W  QLHSPLH+AAAFLN
Sbjct: 509  VSGGKPAVGSIYELMTRAKESIRTYYIMDENKCKIFLDIVDRNWRGQLHSPLHSAAAFLN 568

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KFLG+IK++F  VLEKLLPTP+ R DIT QI  F +A GMFGC LA EAR 
Sbjct: 569  PSIQYNPEIKFLGSIKEDFFNVLEKLLPTPDTRRDITTQILTFSRASGMFGCKLAMEARE 628

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+ PGLWWEQYGDSAP LQRVA+RILSQVCS+ +FER+WSTFQQIHSEKRN++DKETL D
Sbjct: 629  TVPPGLWWEQYGDSAPVLQRVAIRILSQVCSSFSFERHWSTFQQIHSEKRNKIDKETLND 688

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YI+YNLKLA   + K +E DP+  DDIDMTS+WVEE+E+ SP QWLDRFGSALDGSDL
Sbjct: 689  LVYISYNLKLA---RTKSVEADPLQFDDIDMTSEWVEESEHHSPHQWLDRFGSALDGSDL 745

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF+ +MF +ND IFGL
Sbjct: 746  NTRQFSASMFSSNDPIFGL 764


>ref|XP_006581618.1| PREDICTED: uncharacterized protein LOC100808813 isoform X1 [Glycine
            max] gi|571460166|ref|XP_006581619.1| PREDICTED:
            uncharacterized protein LOC100808813 isoform X2 [Glycine
            max]
          Length = 679

 Score = 1014 bits (2623), Expect = 0.0
 Identities = 489/679 (72%), Positives = 575/679 (84%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVTDR
Sbjct: 2    VREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR IIA            KQ++AE KSP NLS+SKAL+S+    SP+ K +P+   + PS
Sbjct: 62   VRGIIASKEEVKETSSAKKQKIAEVKSPSNLSASKALVSLDA-ASPVMKIFPTGHPMTPS 120

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            ++ ++E AERSIALFFFENKLDFSVARS SYQLM++AIAKCG GF GPS+E LK  WLER
Sbjct: 121  STNNQEIAERSIALFFFENKLDFSVARSSSYQLMIDAIAKCGPGFTGPSAETLKTIWLER 180

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV  Q+K++EKEW  TGCTI+ADTWTD KS+A+INFLVSSPS TFFHKSV+AS YFK
Sbjct: 181  MKSEVGLQTKDVEKEWATTGCTILADTWTDYKSKAIINFLVSSPSRTFFHKSVDASAYFK 240

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LADLFDSVIQ+FGPENVVQ+IMD S+NY  + ++I+QSY +IF +PCAS CLNLIL
Sbjct: 241  NTKWLADLFDSVIQEFGPENVVQIIMDSSVNYTVIANHIVQSYGTIFVSPCASQCLNLIL 300

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DW++RCILQAQ+ISK IYN+  +L+L KK+TGGQEL+RT ITKS S FLSLQSM
Sbjct: 301  EEFSKVDWISRCILQAQTISKLIYNNASLLDLTKKYTGGQELIRTGITKSVSTFLSLQSM 360

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LK R+RLK+MF+S EY++N +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVLRE
Sbjct: 361  LKLRTRLKNMFHSHEYASNTSYANKPQSLSCITIAEDGDFWRTVEECVAISEPFLKVLRE 420

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            IS GKP+VGSIYE MTRAK+SIRTYYIMDENKCK FLDIVD++W +QLHSPLHAAAAFLN
Sbjct: 421  ISEGKPTVGSIYELMTRAKESIRTYYIMDENKCKKFLDIVDKKWRDQLHSPLHAAAAFLN 480

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KF+ +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LA+EARN
Sbjct: 481  PSIQYNPEIKFISSIKEDFFNVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAKEARN 540

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F R WST +QIHSEKRN++D+ETL D
Sbjct: 541  TVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFHRQWSTIRQIHSEKRNKIDRETLND 600

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA +   K  E D +  DDIDMTS+WVEE E  SPTQWLDRFG ALDG+DL
Sbjct: 601  LVYINYNLKLARQMSAKSSEVDLLQFDDIDMTSEWVEENETASPTQWLDRFGPALDGNDL 660

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF +++FG ND IFGL
Sbjct: 661  NTRQFGSSIFGANDPIFGL 679


>ref|XP_003602175.1| Protein dimerization [Medicago truncatula]
            gi|355491223|gb|AES72426.1| Protein dimerization
            [Medicago truncatula]
          Length = 786

 Score = 1008 bits (2607), Expect = 0.0
 Identities = 486/680 (71%), Positives = 577/680 (84%), Gaps = 1/680 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSR PSKGV+PCSKVRDDVTDR
Sbjct: 108  VREKDVCWEYAEKLDGNKVKCKFCQRVLNGGISRLKHHLSRFPSKGVNPCSKVRDDVTDR 167

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR IIA            KQ+++E  SPG+ S++KAL+S+   T P+ K +PSS  + PS
Sbjct: 168  VRNIIASKEEVKETSSVKKQKVSEVISPGSHSATKALISLDT-TLPIGKMFPSSNPMTPS 226

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            ++ ++ENAERSIALFFFENKLDFSVARS SYQLM++AI KCG GF GPS+E LK  WLER
Sbjct: 227  STNNQENAERSIALFFFENKLDFSVARSSSYQLMIDAITKCGPGFTGPSAEILKTIWLER 286

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEV  QSK++EKEW  TGCTIIADTWTD KS+A+INFLVSSPS  FFHKSV+AS YFK
Sbjct: 287  IKSEVGLQSKDVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRIFFHKSVDASAYFK 346

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + K LADLFDSVIQ+FGPENVVQ+IMD S NY  +G++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 347  NTKWLADLFDSVIQEFGPENVVQIIMDSSFNYTGIGNHIVQNYGTIFVSPCASQCLNLIL 406

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+F+KIDW++RCILQAQ+ISK IYN+  +L+LMK ++GGQEL+RT  TKS S FLSLQ+M
Sbjct: 407  EEFTKIDWISRCILQAQTISKLIYNNASLLDLMKSYSGGQELIRTGATKSVSTFLSLQTM 466

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LK R+RLKHMF+SPEY+ + +YANKPQS+SCI I ED DFW+ VEECVA+SEP LKVLRE
Sbjct: 467  LKLRTRLKHMFHSPEYALDTSYANKPQSLSCIAIAEDGDFWRTVEECVAISEPFLKVLRE 526

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +S GKP+VGSIYE MTRAK+SIRTYYIMDENKCKTFLDIVD++W +QLHSPLHAAAAFLN
Sbjct: 527  VSEGKPTVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDKKWRDQLHSPLHAAAAFLN 586

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            PSIQYNPE+KFL +IK++F  VLEKLLP P++R DIT QI+ F KA GMFGC+LA+EARN
Sbjct: 587  PSIQYNPEIKFLSSIKEDFYHVLEKLLPVPDMRRDITNQIYTFTKAHGMFGCSLAKEARN 646

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T++P LWWEQYGDSAPGLQRVA+RILSQVCS  +F+R WSTF+QIHSEK+N++D+ETL D
Sbjct: 647  TVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFQRQWSTFRQIHSEKKNKIDRETLND 706

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPS-PTQWLDRFGSALDGSD 103
            L+YINYNLKL  +   K +E D +  DDIDMTS+WVEE E  S PTQWLDRFGSALDG+D
Sbjct: 707  LVYINYNLKLNRQMSAKSLEVDLLQFDDIDMTSEWVEENETVSPPTQWLDRFGSALDGND 766

Query: 102  LNTRQFTNAMFGTNDHIFGL 43
            LNTRQF +++FG ND IFGL
Sbjct: 767  LNTRQFGSSIFGANDPIFGL 786


>ref|XP_004234278.1| PREDICTED: uncharacterized protein LOC101256946 [Solanum
            lycopersicum]
          Length = 739

 Score =  993 bits (2568), Expect = 0.0
 Identities = 478/679 (70%), Positives = 571/679 (84%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKLEGNKVRCKFCLR+LNGGISRLKHHLSRLPSKGV+PC+KVRDDVTDR
Sbjct: 66   VREKDVCWEYAEKLEGNKVRCKFCLRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTDR 125

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR II             K +L E K+  N+S  K L+S+   T P+++ +P   Q   S
Sbjct: 126  VRDIIG----SKEPPSTKKHKLIETKALANISPEKPLLSVEPIT-PIARIFPPIGQAISS 180

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +  ++ENAERSIALFFFENK+DF VARS SY  M+EA+ KCG GF GPS E LK TWLER
Sbjct: 181  SGNNQENAERSIALFFFENKIDFGVARSSSYHQMIEAVGKCGSGFIGPSPETLKATWLER 240

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEVS QSK++EKEW +TGCT+IA+TWTDNK +ALINFLVSSPS TFF+KSV+AS+YFK
Sbjct: 241  IKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALINFLVSSPSRTFFYKSVDASSYFK 300

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCL++LFDS+IQDFGPENVVQVI+D +L+   + ++I+Q+Y ++F +PCAS C+N IL
Sbjct: 301  NLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVNHILQNYGNVFVSPCASQCINAIL 360

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            ++FSK+DWVNRCILQAQS+SKFIYN++ +L+LMKKFTGGQE+++T ITKS SNFLSLQ +
Sbjct: 361  DEFSKLDWVNRCILQAQSLSKFIYNNSPLLDLMKKFTGGQEIIKTGITKSVSNFLSLQCL 420

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LK RSRLK +FNSPE + N AY NK QS++CI IL+DNDFW+  EECVAVSEP LKV+RE
Sbjct: 421  LKHRSRLKVIFNSPELAANSAYTNKSQSVNCITILDDNDFWRTAEECVAVSEPFLKVMRE 480

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VG+IYE +TRAK+SIRTYYIMDE KCKTFLDIVD+ W+N LHSPLH+AAAFLN
Sbjct: 481  VSGGKPAVGTIYELLTRAKESIRTYYIMDEIKCKTFLDIVDKNWKNNLHSPLHSAAAFLN 540

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            P IQYNPEVKFLG+IK++F  VLEKLLPTPELR DIT QI L+ +A GMFGCNLA+EA +
Sbjct: 541  PGIQYNPEVKFLGSIKEDFFRVLEKLLPTPELRRDITTQILLYTRASGMFGCNLAKEAID 600

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+ PG+WWEQYGD+AP LQRVA++ILSQVCS    ER+WSTFQQIHSEKRN++DKETL D
Sbjct: 601  TVPPGIWWEQYGDAAPTLQRVAIKILSQVCSTFTCERHWSTFQQIHSEKRNKIDKETLLD 660

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA     KP E DP+ +DDIDMTS+WVEE ENPSPTQWLDRFGS LDG+DL
Sbjct: 661  LVYINYNLKLARYLVSKPPEEDPLQLDDIDMTSEWVEEAENPSPTQWLDRFGSGLDGNDL 720

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQFT A+FG  D+IFGL
Sbjct: 721  NTRQFTAAIFGPGDNIFGL 739


>ref|XP_006350604.1| PREDICTED: uncharacterized protein LOC102593027 isoform X1 [Solanum
            tuberosum] gi|565367925|ref|XP_006350605.1| PREDICTED:
            uncharacterized protein LOC102593027 isoform X2 [Solanum
            tuberosum]
          Length = 675

 Score =  992 bits (2565), Expect = 0.0
 Identities = 478/679 (70%), Positives = 571/679 (84%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFCLR+LNGGISRLKHHLSRLPSKGV+PC+KVRDDVTDR
Sbjct: 2    VREKDVCWEYAEKLDGNKVRCKFCLRILNGGISRLKHHLSRLPSKGVNPCTKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR II             K +L E K+  N+S  K L+S+   T P+++ +P   Q   S
Sbjct: 62   VRDIIG----SKEPPSTKKHKLIETKALANISPEKLLLSVEPIT-PIARIFPPIGQAISS 116

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
            +  ++ENAERSIALFFFENK+DF VARS SY  M+EA+ KCG GF GPS E LK TWLER
Sbjct: 117  SGNNQENAERSIALFFFENKIDFGVARSSSYHQMIEAVGKCGSGFIGPSPETLKATWLER 176

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSEVS QSK++EKEW +TGCT+IA+TWTDNK +ALINFLVSSPS TFF+KSV+AS+YFK
Sbjct: 177  IKSEVSLQSKDVEKEWAMTGCTLIAETWTDNKMKALINFLVSSPSRTFFYKSVDASSYFK 236

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
            + KCL++LFDS+IQDFGPENVVQVI+D +L+   + ++I+Q+Y ++F +PCAS C+N IL
Sbjct: 237  NLKCLSELFDSIIQDFGPENVVQVIVDNTLHCTGIVNHILQNYGNVFVSPCASQCINAIL 296

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            ++FSK+DWVNRCILQAQSISKFIYN++ +L+LMKKFTGGQE+++T ITKS SNFLSLQ +
Sbjct: 297  DEFSKLDWVNRCILQAQSISKFIYNNSPLLDLMKKFTGGQEIIKTGITKSVSNFLSLQCL 356

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LK RSRLK +FNSPE + N AY NK QS++CI IL+DNDFW+  EECVAVSEP LKV+RE
Sbjct: 357  LKHRSRLKVIFNSPELAANSAYTNKSQSVNCIAILDDNDFWRTAEECVAVSEPFLKVMRE 416

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKP+VG+IYE +TRAK+SIRTYYIMDE KCKTFLDIVD+ W+N LHSPLH+AAAFLN
Sbjct: 417  VSGGKPAVGTIYELLTRAKESIRTYYIMDEIKCKTFLDIVDKNWKNNLHSPLHSAAAFLN 476

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            P IQYN EVKFLG+IK++F  VLEKLLPTPELR DIT QI L+ +A GMFGCNLA+EA +
Sbjct: 477  PGIQYNREVKFLGSIKEDFFRVLEKLLPTPELRRDITTQILLYTRASGMFGCNLAKEAID 536

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
            T+ PG+WWEQYGD+AP LQRVA++ILSQVCS   FER+WSTFQQIHSEKRN++DKETL D
Sbjct: 537  TVPPGIWWEQYGDAAPTLQRVAIKILSQVCSTFTFERHWSTFQQIHSEKRNKIDKETLLD 596

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA     KP E DP+ +DDIDMTS+WVEE ENPSPTQWLDRFGS LDG+DL
Sbjct: 597  LVYINYNLKLARYLVSKPPEEDPLQLDDIDMTSEWVEEAENPSPTQWLDRFGSGLDGNDL 656

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQFT A+FG  D+IFGL
Sbjct: 657  NTRQFTAAIFGPGDNIFGL 675


>ref|XP_007218857.1| hypothetical protein PRUPE_ppa002763mg [Prunus persica]
            gi|462415319|gb|EMJ20056.1| hypothetical protein
            PRUPE_ppa002763mg [Prunus persica]
          Length = 636

 Score =  978 bits (2529), Expect = 0.0
 Identities = 485/679 (71%), Positives = 550/679 (81%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKDVCWEY EKL+GNKVRCKFC RVLNGGISRLKHHLSRLPSKGV+PCSKVRDDVTDR
Sbjct: 2    VREKDVCWEYAEKLDGNKVRCKFCQRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPPS 1720
            VR IIA            KQ+L E KSPGN+S+SKALMS    T P+ K +P+ T + P 
Sbjct: 62   VRTIIASKEEVKETSSGKKQKLVEVKSPGNVSASKALMSFDTPT-PIQKVFPNVTPMVPP 120

Query: 1719 ASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLER 1540
               ++ENAER+IALFFFENKLDFS+ARS SYQLM++AI KCG GF GPS+E LK TWLER
Sbjct: 121  PLNNQENAERNIALFFFENKLDFSIARSSSYQLMIDAIEKCGPGFIGPSAETLKTTWLER 180

Query: 1539 VKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYFK 1360
            +KSE+S QSK+IEKEW  TGCTIIADTWTDNKSRALINFL+                   
Sbjct: 181  IKSEMSLQSKDIEKEWTTTGCTIIADTWTDNKSRALINFLI------------------- 221

Query: 1359 SPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLIL 1180
                                    IMD S NY  V ++I+Q+Y +IF +PCAS CLNLIL
Sbjct: 222  ------------------------IMDSSFNYTGVANHILQNYATIFVSPCASQCLNLIL 257

Query: 1179 EDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQSM 1000
            E+FSK+DWVNRCILQAQ+ISKFIYN+  +L+LMKKFTGGQEL+RT ITKS SNFLSLQS+
Sbjct: 258  EEFSKVDWVNRCILQAQTISKFIYNNASMLDLMKKFTGGQELIRTGITKSVSNFLSLQSL 317

Query: 999  LKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLRE 820
            LKQRSRLKHMFNSPEY TN +YANK QSISCI I+EDNDFW+AVEE VA+SEP LKVLRE
Sbjct: 318  LKQRSRLKHMFNSPEYCTNSSYANKTQSISCISIVEDNDFWRAVEESVAISEPFLKVLRE 377

Query: 819  ISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFLN 640
            +SGGKPSVG IYE MTRAK+SIRTYYIMDENKCKTFLDIVDR+W +QLHSPLHAAAAFLN
Sbjct: 378  VSGGKPSVGFIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHAAAAFLN 437

Query: 639  PSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREARN 460
            P IQYNPE+KFL +IK++F  VLEKLLP PE+R DIT QIF F KA GMFGC+LA EAR+
Sbjct: 438  PGIQYNPEIKFLTSIKEDFFKVLEKLLPMPEMRRDITSQIFTFTKATGMFGCSLAMEARD 497

Query: 459  TISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLGD 280
             +SPGLWWEQYGDSAP LQRVA+RILSQVCS+  FER+WS FQQIHSEKRN++D+ETL D
Sbjct: 498  VVSPGLWWEQYGDSAPVLQRVAIRILSQVCSSFMFERHWSAFQQIHSEKRNKIDRETLND 557

Query: 279  LLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSDL 100
            L+YINYNLKLA +T+ K +E DPI  DDIDMTS+WVEE++NPSPTQWLDRFGSALDGSDL
Sbjct: 558  LVYINYNLKLARQTRTKTLEADPIQFDDIDMTSEWVEESDNPSPTQWLDRFGSALDGSDL 617

Query: 99   NTRQFTNAMFGTNDHIFGL 43
            NTRQF  A+FG+NDHIFGL
Sbjct: 618  NTRQFNAAIFGSNDHIFGL 636


>ref|XP_007038930.1| HAT transposon superfamily isoform 1 [Theobroma cacao]
            gi|508776175|gb|EOY23431.1| HAT transposon superfamily
            isoform 1 [Theobroma cacao]
          Length = 640

 Score =  952 bits (2462), Expect = 0.0
 Identities = 460/629 (73%), Positives = 539/629 (85%)
 Frame = -3

Query: 1929 SKVRDDVTDRVRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSKT 1750
            +KVRDDVTDRVRAI++            KQ++AEA+SPGN+S+   ++ +   +SP++K 
Sbjct: 14   NKVRDDVTDRVRAILSSKEEIKETSSVKKQKIAEARSPGNISTCSKIIPLEA-SSPVAKV 72

Query: 1749 YPSSTQVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSS 1570
            +P+++ + P +   +EN ERSIALFFFENKLDFSVARS SYQ M++A+ K G GF GPS 
Sbjct: 73   FPATSPIAPPSLNSQENVERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSV 132

Query: 1569 EALKITWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFH 1390
            E LK  WLER+KSEV  QSK+ EKEW  TGCTIIADTWTDNKSRALINFLVSSPS TFFH
Sbjct: 133  ETLKTMWLERIKSEVCLQSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFH 192

Query: 1389 KSVEASTYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTP 1210
            KSV+AS+YFK+ KCLADLFDSVIQDFGPENVVQ+IMD S NY  + ++I+Q+Y +IF +P
Sbjct: 193  KSVDASSYFKNTKCLADLFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSP 252

Query: 1209 CASHCLNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKS 1030
            CAS CLNLILE+FSK+DWVNRCILQAQ++SKF+YN+  +L+LMKKFTG QEL+RT ITKS
Sbjct: 253  CASQCLNLILEEFSKVDWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKS 312

Query: 1029 TSNFLSLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAV 850
             S+FLSLQSMLKQRSRLKHMFNSPEYSTN +YANKPQSISCI I+EDNDFW+AV+ECVA+
Sbjct: 313  VSSFLSLQSMLKQRSRLKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAI 372

Query: 849  SEPLLKVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHS 670
            SEP LKVLRE+SGGKP+VGSIYE MTRAK+SIRTYYIMDE KCKTFLDIVDR+W +QLHS
Sbjct: 373  SEPFLKVLREVSGGKPAVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHS 432

Query: 669  PLHAAAAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMF 490
            PLH+A AFLNPSIQYN E+KFLG+IK++F  VLEKLLPTPELR DIT QIF F +A+GMF
Sbjct: 433  PLHSAGAFLNPSIQYNQEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMF 492

Query: 489  GCNLAREARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKR 310
             CNLA EAR+T+SPGLWWEQ+GDSAP LQRVA+RILSQVCS   FER+WSTFQQIHSEKR
Sbjct: 493  ACNLAMEARDTVSPGLWWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKR 552

Query: 309  NRLDKETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDR 130
            N++DKE L DL+YINYNL+LA + + K +E DPI  DDIDMTS+WVEE+ENPSPTQWLDR
Sbjct: 553  NKIDKEILNDLVYINYNLRLARQMRTKSVEADPIQFDDIDMTSEWVEESENPSPTQWLDR 612

Query: 129  FGSALDGSDLNTRQFTNAMFGTNDHIFGL 43
            FGSALDG DLNTRQF  A+FG NDHIFGL
Sbjct: 613  FGSALDGGDLNTRQFNAAIFG-NDHIFGL 640


>ref|XP_006300772.1| hypothetical protein CARUB_v10019846mg, partial [Capsella rubella]
            gi|482569482|gb|EOA33670.1| hypothetical protein
            CARUB_v10019846mg, partial [Capsella rubella]
          Length = 768

 Score =  914 bits (2361), Expect = 0.0
 Identities = 451/681 (66%), Positives = 551/681 (80%), Gaps = 2/681 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD+CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PC+KVRDDVTDR
Sbjct: 101  VREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVTDR 160

Query: 1899 VRAIIALXXXXXXXXXXXKQ-RLAEAKSPGNLSSSKALMSMTMETSPMSKTYPSSTQVPP 1723
            VR+I+A             + +  E K P     S +L+ +T+ +   SK +P+S   PP
Sbjct: 161  VRSILAAKDDPKDSPLTTNKYKPPEVKPP----LSASLLPVTVSSG--SKLFPTSILAPP 214

Query: 1722 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1543
            + +  +  AERSI+LFFFENK+D+ VARSPSY  M++AIAKCG  F  PS  +LK  WL+
Sbjct: 215  TPNA-QVIAERSISLFFFENKIDWCVARSPSYHHMLDAIAKCGPAFFAPSPLSLKTEWLD 273

Query: 1542 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1363
            RVKSE+S Q K+ EKEW  TGCTIIA+ WTDNKSRALINF VSSPS  FFHKSV+AS+YF
Sbjct: 274  RVKSEISLQLKDSEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASSYF 333

Query: 1362 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1183
            K+ KCLADLFDSVIQD G E++VQ+IMD S +Y  + ++I+Q+Y SIF +PCAS CL++I
Sbjct: 334  KNTKCLADLFDSVIQDIGQEHIVQIIMDNSFSYTGISNHILQNYGSIFVSPCASQCLSII 393

Query: 1182 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1003
            LE+FSK+DWVN+CI QAQ ISKF+YN+  VL+LM+K TGGQ+++RT +T+S SNFLSLQS
Sbjct: 394  LEEFSKVDWVNQCISQAQVISKFVYNNRPVLDLMRKLTGGQDIIRTGVTRSVSNFLSLQS 453

Query: 1002 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 823
            M+KQ++RLKHMFNS EY+T    ANKPQS+SC++ILEDNDFW+A+EE VA+SEP+LKVLR
Sbjct: 454  MMKQKARLKHMFNSSEYTTQ---ANKPQSMSCVNILEDNDFWRALEESVAISEPILKVLR 510

Query: 822  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 643
            E+S GKP+VGSIYE M++AK+SIRTYYIMDENK K F +IVD +W + LHSPLHAAAAFL
Sbjct: 511  EVSKGKPAVGSIYELMSKAKESIRTYYIMDENKHKVFSNIVDTKWCDHLHSPLHAAAAFL 570

Query: 642  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 463
            NPSIQYNPE+KFL ++K++F  VLEKLLPT +LR DIT QIF F +A+GMFGCNLA EAR
Sbjct: 571  NPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNLAMEAR 630

Query: 462  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 283
            +++SPGLWWEQ+GDSAP LQRVA+RILSQVCS+ N ER WSTFQQ+H E+RN +D+E L 
Sbjct: 631  DSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSSYNLERQWSTFQQMHWERRNTIDREILN 690

Query: 282  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 103
            +L Y+N NLKL        +ETD I ++DIDM S+WVEE ENPSP QWLDRFGSALDG D
Sbjct: 691  NLAYVNQNLKLGRMI---TLETDSISLEDIDMMSEWVEEAENPSPAQWLDRFGSALDGGD 747

Query: 102  LNTRQFTNAMFGTNDH-IFGL 43
            LNTRQF  A+F  NDH IFGL
Sbjct: 748  LNTRQFGGAIFSANDHNIFGL 768


>ref|NP_178092.4| hAT family dimerization domain-containing protein [Arabidopsis
            thaliana] gi|332198172|gb|AEE36293.1| hAT family
            dimerization domain-containing protein [Arabidopsis
            thaliana]
          Length = 651

 Score =  905 bits (2340), Expect = 0.0
 Identities = 450/681 (66%), Positives = 544/681 (79%), Gaps = 2/681 (0%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            VREKD+CWEY EKL+GNKV+CKFC RVLNGGISRLKHHLSRLPSKGV+PC+KVRDDVTDR
Sbjct: 2    VREKDICWEYAEKLDGNKVKCKFCSRVLNGGISRLKHHLSRLPSKGVNPCAKVRDDVTDR 61

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAEAKSPGNLSSSKALMSMTMETSPMSK-TYPSSTQVPP 1723
            VR+I++                 + K P  LS            +P SK  +PSS   PP
Sbjct: 62   VRSILSAKDDPPITN--------KYKPPPPLSPP--------FDAPASKLVFPSS---PP 102

Query: 1722 SASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAKCGHGFRGPSSEALKITWLE 1543
            +A   ++ AERSI+LFFFENK+DF+VARSPSY  M++A+AKCG GF  PS    K  WL+
Sbjct: 103  NA---QDIAERSISLFFFENKIDFAVARSPSYHHMLDAVAKCGPGFVAPSP---KTEWLD 156

Query: 1542 RVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFLVSSPSGTFFHKSVEASTYF 1363
            RVKS++S Q K+ EKEW  TGCTIIA+ WTDNKSRALINF VSSPS  FFHKSV+AS+YF
Sbjct: 157  RVKSDISLQLKDTEKEWVTTGCTIIAEAWTDNKSRALINFSVSSPSRIFFHKSVDASSYF 216

Query: 1362 KSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIMQSYNSIFFTPCASHCLNLI 1183
            K+ KCLADLFDSVIQD G E++VQ+IMD S  Y  + ++++Q+Y +IF +PCAS CLN+I
Sbjct: 217  KNSKCLADLFDSVIQDIGQEHIVQIIMDNSFCYTGISNHLLQNYATIFVSPCASQCLNII 276

Query: 1182 LEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQELVRTSITKSTSNFLSLQS 1003
            LE+FSK+DWVN+CI QAQ ISKF+YN++ VL+L++K TGGQ+++R+ +T+S SNFLSLQS
Sbjct: 277  LEEFSKVDWVNQCISQAQVISKFVYNNSPVLDLLRKLTGGQDIIRSGVTRSVSNFLSLQS 336

Query: 1002 MLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDFWKAVEECVAVSEPLLKVLR 823
            M+KQ++RLKHMFN PEY+TN    NKPQSISC++ILEDNDFW+AVEE VA+SEP+LKVLR
Sbjct: 337  MMKQKARLKHMFNCPEYTTN---TNKPQSISCVNILEDNDFWRAVEESVAISEPILKVLR 393

Query: 822  EISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIVDRRWENQLHSPLHAAAAFL 643
            E+S GKP+VGSIYE M++AK+SIRTYYIMDENK K F DIVD  W   LHSPLHAAAAFL
Sbjct: 394  EVSTGKPAVGSIYELMSKAKESIRTYYIMDENKHKVFSDIVDTNWCEHLHSPLHAAAAFL 453

Query: 642  NPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQIFLFKKAQGMFGCNLAREAR 463
            NPSIQYNPE+KFL ++K++F  VLEKLLPT +LR DIT QIF F +A+GMFGCNLA EAR
Sbjct: 454  NPSIQYNPEIKFLTSLKEDFFKVLEKLLPTSDLRRDITNQIFTFTRAKGMFGCNLAMEAR 513

Query: 462  NTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWSTFQQIHSEKRNRLDKETLG 283
            +++SPGLWWEQ+GDSAP LQRVA+RILSQVCS  N ER WSTFQQ+H E+RN++D+E L 
Sbjct: 514  DSVSPGLWWEQFGDSAPVLQRVAIRILSQVCSGYNLERQWSTFQQMHWERRNKIDREILN 573

Query: 282  DLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETENPSPTQWLDRFGSALDGSD 103
             L Y+N NLKL        +ETDPI ++DIDM S+WVEE ENPSP QWLDRFG+ALDG D
Sbjct: 574  KLAYVNQNLKLGRMI---TLETDPIALEDIDMMSEWVEEAENPSPAQWLDRFGTALDGGD 630

Query: 102  LNTRQFTNAMFGTNDH-IFGL 43
            LNTRQF  A+F  NDH IFGL
Sbjct: 631  LNTRQFGGAIFSANDHNIFGL 651


>gb|EEC70201.1| hypothetical protein OsI_00947 [Oryza sativa Indica Group]
          Length = 1045

 Score =  808 bits (2088), Expect = 0.0
 Identities = 402/699 (57%), Positives = 526/699 (75%), Gaps = 20/699 (2%)
 Frame = -3

Query: 2079 VREKDVCWEYCEKLEGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVHPCSKVRDDVTDR 1900
            +RE+DVCWEYC+K+EGNKVRC+FC +VLNGGISRLK HLS++ SKGV+PC+KV+ DV ++
Sbjct: 354  LRERDVCWEYCDKMEGNKVRCRFCYKVLNGGISRLKFHLSQISSKGVNPCTKVKPDVIEK 413

Query: 1899 VRAIIALXXXXXXXXXXXKQRLAE--------------------AKSPGNLSSSKALMSM 1780
            V+A+IA            +QR  E                    A SP   S+S     +
Sbjct: 414  VKAVIAAKEEHRETQVLKRQRDTELSVRPRRIRDLPSQPTSPERATSPAITSTSDQTQFL 473

Query: 1779 TMETSPMSKTYPSSTQVPPSASYDKENAERSIALFFFENKLDFSVARSPSYQLMMEAIAK 1600
             +E S       S T    SA   +  AER IA FFFENKLD+++A S SY+ MMEA+  
Sbjct: 474  ALEVSTPVLKLSSVTNKARSAP--QSEAERCIAEFFFENKLDYNIADSVSYRHMMEALG- 530

Query: 1599 CGHGFRGPSSEALKITWLERVKSEVSEQSKEIEKEWGVTGCTIIADTWTDNKSRALINFL 1420
             G GFRGPS+E LK  WL ++KSEV +++KEIEK+W  TGCTI+AD+WTDNKS+ALINF 
Sbjct: 531  -GQGFRGPSAEVLKTKWLHKLKSEVLQKTKEIEKDWATTGCTILADSWTDNKSKALINFS 589

Query: 1419 VSSPSGTFFHKSVEASTYFKSPKCLADLFDSVIQDFGPENVVQVIMDGSLNYRAVGSYIM 1240
            VSSP GTFF K+V+AS + KS + L +LFD VI++ GP+NVVQ+I D ++NY +V   IM
Sbjct: 590  VSSPLGTFFLKTVDASPHIKSHQ-LYELFDDVIREVGPDNVVQIITDRNINYGSVDKLIM 648

Query: 1239 QSYNSIFFTPCASHCLNLILEDFSKIDWVNRCILQAQSISKFIYNHTWVLELMKKFTGGQ 1060
            Q+YN+IF++PCAS C+N +L+DFSKIDWVNRCI QAQ+I++F+YN+ WVL+LM+K   GQ
Sbjct: 649  QNYNTIFWSPCASSCVNSMLDDFSKIDWVNRCICQAQTITRFVYNNKWVLDLMRKCIAGQ 708

Query: 1059 ELVRTSITKSTSNFLSLQSMLKQRSRLKHMFNSPEYSTNPAYANKPQSISCIDILEDNDF 880
            ELV + ITK  S+FL+LQS+L+ R +LK MF+S +Y+++ +YAN+  S SC++IL+D++F
Sbjct: 709  ELVCSGITKCVSDFLTLQSLLRYRPKLKQMFHSSDYASS-SYANRSLSSSCVEILDDDEF 767

Query: 879  WKAVEECVAVSEPLLKVLREISGGKPSVGSIYEFMTRAKDSIRTYYIMDENKCKTFLDIV 700
            W+AVEE  AVSEPLL+V+R++SGGK ++G IYE MT+  DSIRTYYIMDE KCK+FLDIV
Sbjct: 768  WRAVEEIAAVSEPLLRVMRDVSGGKAAIGYIYESMTKVMDSIRTYYIMDEGKCKSFLDIV 827

Query: 699  DRRWENQLHSPLHAAAAFLNPSIQYNPEVKFLGNIKQEFLTVLEKLLPTPELREDITGQI 520
            +++W+ +LHSPLH+AAAFLNPSIQYNPEVKF  +IK+EF  VL+K+L  P+ R+ IT ++
Sbjct: 828  EQKWQVELHSPLHSAAAFLNPSIQYNPEVKFFSSIKEEFYHVLDKVLTVPDQRQGITVEL 887

Query: 519  FLFKKAQGMFGCNLAREARNTISPGLWWEQYGDSAPGLQRVAVRILSQVCSASNFERNWS 340
              F+KAQGMFG N+A+EARN  SPG+WWEQYGDSAP LQ  AVRI+SQVCS   F+R+WS
Sbjct: 888  HAFRKAQGMFGSNIAKEARNNTSPGMWWEQYGDSAPSLQHAAVRIVSQVCSTLTFQRDWS 947

Query: 339  TFQQIHSEKRNRLDKETLGDLLYINYNLKLASRTKGKPIETDPILVDDIDMTSDWVEETE 160
               + HSEKRN+LDKE L D  Y++YN  L S +K K  + DPI +D IDMTS WVE+++
Sbjct: 948  IIVRNHSEKRNKLDKEALADQAYVHYNFMLHSDSKMKKGDGDPIALDAIDMTSPWVEDSD 1007

Query: 159  NPSPTQWLDRFGSALDGSDLNTRQFTNAMFGTNDHIFGL 43
            +P+  QWLDRF SALDG DLNTRQF  ++FGTND +FGL
Sbjct: 1008 SPNLAQWLDRFPSALDG-DLNTRQFGGSIFGTNDTLFGL 1045

