BLASTX nr result

ID: Sinomenium21_contig00021061 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00021061
         (1382 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI20940.3| unnamed protein product [Vitis vinifera]              291   4e-76
ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264...   291   4e-76
gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis]     275   4e-71
ref|XP_007013731.1| Enhancer of polycomb-like transcription fact...   268   3e-69
ref|XP_007013730.1| Enhancer of polycomb-like transcription fact...   268   3e-69
ref|XP_007013727.1| Enhancer of polycomb-like transcription fact...   268   3e-69
ref|XP_007225478.1| hypothetical protein PRUPE_ppa000151mg [Prun...   264   6e-68
ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus c...   249   3e-63
ref|XP_004292962.1| PREDICTED: uncharacterized protein LOC101313...   248   6e-63
ref|XP_006601120.1| PREDICTED: uncharacterized protein LOC100789...   247   1e-62
ref|XP_007013729.1| Enhancer of polycomb-like transcription fact...   241   7e-61
ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   235   4e-59
ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Popu...   233   1e-58
ref|XP_006596126.1| PREDICTED: uncharacterized protein LOC100781...   229   2e-57
ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499...   229   2e-57
ref|XP_003545513.1| PREDICTED: uncharacterized protein LOC100781...   229   2e-57
ref|XP_006601123.1| PREDICTED: uncharacterized protein LOC100792...   223   1e-55
ref|XP_006601122.1| PREDICTED: uncharacterized protein LOC100792...   223   1e-55
ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Popu...   222   3e-55
ref|XP_007137088.1| hypothetical protein PHAVU_009G098700g [Phas...   213   1e-52

>emb|CBI20940.3| unnamed protein product [Vitis vinifera]
          Length = 1634

 Score =  291 bits (746), Expect = 4e-76
 Identities = 190/435 (43%), Positives = 245/435 (56%), Gaps = 14/435 (3%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME+SV NS   EISKKSRSLDL+++Y  +S VS+E  + + L RK  S   + E  S   
Sbjct: 1    MEHSVENSGGSEISKKSRSLDLQSIY--RSKVSQEG-DNKILKRKHSS-ENDGEVESGQG 56

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                     VSLSSL+S  + + K L+      +     +    +KK+   +S+  +  S
Sbjct: 57   KKKSNSRKAVSLSSLKSLLKNSHKSLDEVYADGLGSGSSSGLPDSKKKELGLSQKLDDNS 116

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA---QMVKLTGYP 649
              + ++ NLD+NV+ IPKRPR F RR++F    +      S   S D    Q+ KL+   
Sbjct: 117  GLNSISRNLDNNVIRIPKRPRGFVRRRRFDGNHMLQPGRSSPASSKDVFVDQITKLSDDS 176

Query: 650  VTPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY-----------PILKRMRR 796
             T  +  + KRK  FD+FKEN S  ++S    K  D  K             P  K+++R
Sbjct: 177  ATRVVPLKIKRKKGFDDFKENRSSGSSSAPHYKEGDEIKVVDNGNSSLRKRMPRKKQVKR 236

Query: 797  DHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVF 976
             +  SE     ++    E  P+ DN                   AARMLSSRFDP+CT F
Sbjct: 237  KNLSSEGKSIVKE----EAVPLADN---PIKNCDEEDEENLEENAARMLSSRFDPNCTGF 289

Query: 977  PGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQV 1156
              +  A   +S NG SF+ S   D   H ++   GSES S D A RVLRPRK+ K+K   
Sbjct: 290  SSNGKASTPQSTNGLSFLLSPDQDCMIHRMNSLVGSESASVDTAGRVLRPRKQHKQKGLS 349

Query: 1157 RKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEW 1336
            RKRRHFYEIFSRN+DA+WVLNRRIKVFWPLDQ WYFGLV  YDP  KLHHVKYDDR+EEW
Sbjct: 350  RKRRHFYEIFSRNLDAYWVLNRRIKVFWPLDQSWYFGLVKDYDPERKLHHVKYDDRDEEW 409

Query: 1337 IDLQNERFKLLLLPS 1381
            IDL++ERFKLLLLPS
Sbjct: 410  IDLRHERFKLLLLPS 424


>ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264575 [Vitis vinifera]
          Length = 1679

 Score =  291 bits (746), Expect = 4e-76
 Identities = 190/435 (43%), Positives = 245/435 (56%), Gaps = 14/435 (3%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME+SV NS   EISKKSRSLDL+++Y  +S VS+E  + + L RK  S   + E  S   
Sbjct: 1    MEHSVENSGGSEISKKSRSLDLQSIY--RSKVSQEG-DNKILKRKHSS-ENDGEVESGQG 56

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                     VSLSSL+S  + + K L+      +     +    +KK+   +S+  +  S
Sbjct: 57   KKKSNSRKAVSLSSLKSLLKNSHKSLDEVYADGLGSGSSSGLPDSKKKELGLSQKLDDNS 116

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA---QMVKLTGYP 649
              + ++ NLD+NV+ IPKRPR F RR++F    +      S   S D    Q+ KL+   
Sbjct: 117  GLNSISRNLDNNVIRIPKRPRGFVRRRRFDGNHMLQPGRSSPASSKDVFVDQITKLSDDS 176

Query: 650  VTPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY-----------PILKRMRR 796
             T  +  + KRK  FD+FKEN S  ++S    K  D  K             P  K+++R
Sbjct: 177  ATRVVPLKIKRKKGFDDFKENRSSGSSSAPHYKEGDEIKVVDNGNSSLRKRMPRKKQVKR 236

Query: 797  DHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVF 976
             +  SE     ++    E  P+ DN                   AARMLSSRFDP+CT F
Sbjct: 237  KNLSSEGKSIVKE----EAVPLADN---PIKNCDEEDEENLEENAARMLSSRFDPNCTGF 289

Query: 977  PGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQV 1156
              +  A   +S NG SF+ S   D   H ++   GSES S D A RVLRPRK+ K+K   
Sbjct: 290  SSNGKASTPQSTNGLSFLLSPDQDCMIHRMNSLVGSESASVDTAGRVLRPRKQHKQKGLS 349

Query: 1157 RKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEW 1336
            RKRRHFYEIFSRN+DA+WVLNRRIKVFWPLDQ WYFGLV  YDP  KLHHVKYDDR+EEW
Sbjct: 350  RKRRHFYEIFSRNLDAYWVLNRRIKVFWPLDQSWYFGLVKDYDPERKLHHVKYDDRDEEW 409

Query: 1337 IDLQNERFKLLLLPS 1381
            IDL++ERFKLLLLPS
Sbjct: 410  IDLRHERFKLLLLPS 424


>gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis]
          Length = 1690

 Score =  275 bits (702), Expect = 4e-71
 Identities = 176/430 (40%), Positives = 241/430 (56%), Gaps = 9/430 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + +SD  E+ +KSRSLDLK+LY  K  V+K+       N+K +      +   +S 
Sbjct: 1    MENRIESSDGAEVPRKSRSLDLKSLY--KHRVTKD-----VQNKKLKRKASADDGDENSE 53

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    EVSLSSL++ S  + K ++      +   + + +D   +   +++     KS
Sbjct: 54   KKKKKSVKEVSLSSLKNTSSSSKKNVDKDCHKGLSSGLHDSKDLKLEAKQKLNGSIGFKS 113

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGIS-RKVSSDAQMVKLTGYPVT 655
             SS    +L+ +V+ IP+R R F  RKK +   VP + G+S  K+    Q+ KL+G    
Sbjct: 114  ISS---LSLNDDVIQIPRRKRGFVGRKKGEGGHVPRRQGLSCGKLDLVDQISKLSGDDSG 170

Query: 656  PTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPIL-------KRMRRDHGKS 811
              + S + KR   FD+FKEN    +NS R ++ +     + ++       K+ RR   K+
Sbjct: 171  SQVESVKVKRTKGFDDFKENRISESNSARHAEEEHERVNHLVVSNGDSLFKKSRRKRSKT 230

Query: 812  EEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVT 991
            +   P ++    E EP+ DN +                 AA MLSSRFDP+CT F  S  
Sbjct: 231  KNLSPDDKVGAKEAEPLADNSTMMCNDSQEDDEENLEENAAMMLSSRFDPNCTGF-SSNK 289

Query: 992  APVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRH 1171
            A    +++G SF+ S   D  S      +GSES S DAA RVLRPR + KEK   RKRRH
Sbjct: 290  ASAFATVDGLSFLLSSGRDFVSRRSRSLSGSESPSVDAAGRVLRPRIQHKEKGHSRKRRH 349

Query: 1172 FYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQN 1351
            FYE+F  ++DA WVLNRRIKVFWPLDQ WY+GLV  YD  +KLHHVKYDDR+EEWIDLQN
Sbjct: 350  FYEVFFGDLDADWVLNRRIKVFWPLDQSWYYGLVNDYDREKKLHHVKYDDRDEEWIDLQN 409

Query: 1352 ERFKLLLLPS 1381
            ERFKLLLLPS
Sbjct: 410  ERFKLLLLPS 419


>ref|XP_007013731.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 5 [Theobroma cacao] gi|508784094|gb|EOY31350.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 5 [Theobroma cacao]
          Length = 1522

 Score =  268 bits (686), Expect = 3e-69
 Identities = 177/432 (40%), Positives = 235/432 (54%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + NS   EI +KSRSLDLK+LY  KS  SKE ++ ++L RK  S   + E  S + 
Sbjct: 1    MENRIGNSHGAEIPRKSRSLDLKSLY--KSGDSKESSKNKSLKRKDSSQEGDDEKRSSNN 58

Query: 299  XXXXXXXXEVSLSSLES--GSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANL 472
                     + LSS  +  GS  +       +G    GL     DS   +N  +S+    
Sbjct: 59   NKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGL----HDSESLKNLGLSQKLKN 114

Query: 473  KSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA-QMVKLTGYP 649
               ++ ++ +L  +   IP+R R F  R KF+       +G S     D  + VKLT   
Sbjct: 115  GCGANGISLSLGDSETRIPRRKRGFVGRNKFEGGQRLKLAGRSSSTVGDVKEEVKLTSED 174

Query: 650  V-TPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY-------PILKRMRRDHG 805
              T    S+ K+K   D+FKEN +  ++  +  K +DG   Y        +LK+ +R+  
Sbjct: 175  SGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVAAYLAVNDGDSLLKKSQRNPR 234

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K +++    ++   + E +V +                   AARMLSSRFDPSCT F  +
Sbjct: 235  KRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDDEENLEENAARMLSSRFDPSCTGFSSN 294

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                VS S NGFSF+ S  G + S      +GSES S DA+ RVLRPRK  KEK   RKR
Sbjct: 295  SKVSVSPSENGFSFLLS-SGQNASSGSKTFSGSESASVDASGRVLRPRKSHKEKSNSRKR 353

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFYEI+S ++DA WVLNRRIKVFWPLD+ WY+GLV  YD   KLHHVKYDDR+EEWI+L
Sbjct: 354  RHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGLVNEYDKERKLHHVKYDDRDEEWINL 413

Query: 1346 QNERFKLLLLPS 1381
            QNERFKLLL PS
Sbjct: 414  QNERFKLLLFPS 425


>ref|XP_007013730.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 4 [Theobroma cacao] gi|508784093|gb|EOY31349.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 4 [Theobroma cacao]
          Length = 1721

 Score =  268 bits (686), Expect = 3e-69
 Identities = 177/432 (40%), Positives = 235/432 (54%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + NS   EI +KSRSLDLK+LY  KS  SKE ++ ++L RK  S   + E  S + 
Sbjct: 1    MENRIGNSHGAEIPRKSRSLDLKSLY--KSGDSKESSKNKSLKRKDSSQEGDDEKRSSNN 58

Query: 299  XXXXXXXXEVSLSSLES--GSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANL 472
                     + LSS  +  GS  +       +G    GL     DS   +N  +S+    
Sbjct: 59   NKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGL----HDSESLKNLGLSQKLKN 114

Query: 473  KSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA-QMVKLTGYP 649
               ++ ++ +L  +   IP+R R F  R KF+       +G S     D  + VKLT   
Sbjct: 115  GCGANGISLSLGDSETRIPRRKRGFVGRNKFEGGQRLKLAGRSSSTVGDVKEEVKLTSED 174

Query: 650  V-TPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY-------PILKRMRRDHG 805
              T    S+ K+K   D+FKEN +  ++  +  K +DG   Y        +LK+ +R+  
Sbjct: 175  SGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVAAYLAVNDGDSLLKKSQRNPR 234

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K +++    ++   + E +V +                   AARMLSSRFDPSCT F  +
Sbjct: 235  KRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDDEENLEENAARMLSSRFDPSCTGFSSN 294

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                VS S NGFSF+ S  G + S      +GSES S DA+ RVLRPRK  KEK   RKR
Sbjct: 295  SKVSVSPSENGFSFLLS-SGQNASSGSKTFSGSESASVDASGRVLRPRKSHKEKSNSRKR 353

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFYEI+S ++DA WVLNRRIKVFWPLD+ WY+GLV  YD   KLHHVKYDDR+EEWI+L
Sbjct: 354  RHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGLVNEYDKERKLHHVKYDDRDEEWINL 413

Query: 1346 QNERFKLLLLPS 1381
            QNERFKLLL PS
Sbjct: 414  QNERFKLLLFPS 425


>ref|XP_007013727.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao]
            gi|590579224|ref|XP_007013728.1| Enhancer of
            polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508784090|gb|EOY31346.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
            gi|508784091|gb|EOY31347.1| Enhancer of polycomb-like
            transcription factor protein, putative isoform 1
            [Theobroma cacao]
          Length = 1693

 Score =  268 bits (686), Expect = 3e-69
 Identities = 177/432 (40%), Positives = 235/432 (54%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + NS   EI +KSRSLDLK+LY  KS  SKE ++ ++L RK  S   + E  S + 
Sbjct: 1    MENRIGNSHGAEIPRKSRSLDLKSLY--KSGDSKESSKNKSLKRKDSSQEGDDEKRSSNN 58

Query: 299  XXXXXXXXEVSLSSLES--GSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANL 472
                     + LSS  +  GS  +       +G    GL     DS   +N  +S+    
Sbjct: 59   NKRKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGL----HDSESLKNLGLSQKLKN 114

Query: 473  KSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA-QMVKLTGYP 649
               ++ ++ +L  +   IP+R R F  R KF+       +G S     D  + VKLT   
Sbjct: 115  GCGANGISLSLGDSETRIPRRKRGFVGRNKFEGGQRLKLAGRSSSTVGDVKEEVKLTSED 174

Query: 650  V-TPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY-------PILKRMRRDHG 805
              T    S+ K+K   D+FKEN +  ++  +  K +DG   Y        +LK+ +R+  
Sbjct: 175  SGTQNESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVAAYLAVNDGDSLLKKSQRNPR 234

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K +++    ++   + E +V +                   AARMLSSRFDPSCT F  +
Sbjct: 235  KRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDDEENLEENAARMLSSRFDPSCTGFSSN 294

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                VS S NGFSF+ S  G + S      +GSES S DA+ RVLRPRK  KEK   RKR
Sbjct: 295  SKVSVSPSENGFSFLLS-SGQNASSGSKTFSGSESASVDASGRVLRPRKSHKEKSNSRKR 353

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFYEI+S ++DA WVLNRRIKVFWPLD+ WY+GLV  YD   KLHHVKYDDR+EEWI+L
Sbjct: 354  RHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGLVNEYDKERKLHHVKYDDRDEEWINL 413

Query: 1346 QNERFKLLLLPS 1381
            QNERFKLLL PS
Sbjct: 414  QNERFKLLLFPS 425


>ref|XP_007225478.1| hypothetical protein PRUPE_ppa000151mg [Prunus persica]
            gi|462422414|gb|EMJ26677.1| hypothetical protein
            PRUPE_ppa000151mg [Prunus persica]
          Length = 1617

 Score =  264 bits (675), Expect = 6e-68
 Identities = 182/442 (41%), Positives = 237/442 (53%), Gaps = 21/442 (4%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + NS   EI +KSRSLDLK+LY  KS  +KE    ++L RK  +     E G ++ 
Sbjct: 1    MENRIENSHGTEIPRKSRSLDLKSLY--KSRTTKE-VPTKSLKRKGSA-----EDGDENR 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    EVSLSSL++ +  + K L+    S ++    +        +  +  G+    
Sbjct: 53   DKKKKSRKEVSLSSLKNVNTSSKKSLDEVYHSGLNSGSHDPEAVKCGSSQILDSGSGFNG 112

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDV---PNQS-GISRKVSSDAQMVKLTGY 646
             SS    +L +NV+ IP+R R F  RKKF+   V   P+QS G    V  + Q+ KL   
Sbjct: 113  VSS---LSLGNNVIQIPRRKRGFVGRKKFEGGQVLKLPDQSAGKVGLVDQNHQIAKLNVD 169

Query: 647  PV-TPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDH 802
             + T       KRK   D+FKEN     NS   +  +     + +       LK+ RR+ 
Sbjct: 170  DLGTQDELLNVKRKKGRDDFKENIDSELNSAPHADKEGVHTSHSVVSNGDSSLKKSRRNQ 229

Query: 803  GKSEEAGPQEQTHRL---------EIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRF 955
               E    + +   L         E +P+VD+ +                 AARMLSSRF
Sbjct: 230  DNEENRRSRRKRKDLACGSKSAAKEADPLVDSSTKSCHDLQEDDEENLEENAARMLSSRF 289

Query: 956  DPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKK 1135
            DPSCT F  +  A   +S NG SF+ S   D +S      +GSES S D + RVLRPRK+
Sbjct: 290  DPSCTGFSSNNKASALESANGLSFLLSSGQDFDSRRSKSISGSESPSVDNSGRVLRPRKQ 349

Query: 1136 GKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKY 1315
             KEK   RKRRHFYE+F  N+DA+WV NRRIKVFWPLDQ WY+GLV  YD  +KLHHVKY
Sbjct: 350  HKEKGHSRKRRHFYEVFLGNLDAYWVTNRRIKVFWPLDQTWYYGLVNDYDKEKKLHHVKY 409

Query: 1316 DDREEEWIDLQNERFKLLLLPS 1381
            DDR+EEWIDLQNERFKLLLLPS
Sbjct: 410  DDRDEEWIDLQNERFKLLLLPS 431


>ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus communis]
            gi|223544424|gb|EEF45945.1| hypothetical protein
            RCOM_0804080 [Ricinus communis]
          Length = 1705

 Score =  249 bits (635), Expect = 3e-63
 Identities = 180/452 (39%), Positives = 236/452 (52%), Gaps = 31/452 (6%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN + NS   EI KKSRSLDL++LY + S+ SKE  + + L RK  S  +N  F     
Sbjct: 1    MENRIGNSHEAEIPKKSRSLDLRSLY-QSSEGSKE-AQIKNLKRKGGSDVDNSGFEKRKK 58

Query: 299  XXXXXXXXEVSLSSLE----SGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGA 466
                     VS+SS      +GS+   +  N S  S  H     +  S  +Q     R  
Sbjct: 59   SRKA-----VSISSFRKVNGNGSKSLEEVYNGSLSSGSHDTKEIKSGSLNQQ-----RVN 108

Query: 467  NLKSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQS-TDVPNQSGISRKVSSDAQMVKLTG 643
            N  S  S ++ NL+ +   IP+R R F  RKK +  + V   +  SR      Q+ KLT 
Sbjct: 109  NSNSGVSKISQNLEGSFDKIPRRKRGFVGRKKVEKDSQVLKPAEESRDKLETDQISKLTV 168

Query: 644  YPVTPTIYSEG-KRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPIL------------- 781
                  + S   K+K V D+FKEN     +S R  + +DG   + +              
Sbjct: 169  KDTGKVVESSKVKQKKVSDDFKENRISERSSGRHCE-EDGHTGHSVARSVVLSLWKSQTG 227

Query: 782  ------------KRMRRDHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXX 925
                        K +R+   K +    ++++   E EP VD  +                
Sbjct: 228  HSVEIDDDSSKKKSLRKRSRKRKNLISEDKSVAKEAEPSVD--AEVSCDLHDDDEENLEE 285

Query: 926  XAARMLSSRFDPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADA 1105
             AARMLSSRFD SCT F  +  A    S NG SF+ S   +  +H  ++ +GSES S DA
Sbjct: 286  NAARMLSSRFDTSCTGFSSNSKASPVPSTNGLSFLLSSGQEFATHGPNYISGSESASLDA 345

Query: 1106 ACRVLRPRKKGKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYD 1285
            A R+LRPRK+ KEK   RKRRH+YEIFS ++DA+WVLNRRIKVFWPLDQ WY+GLV  YD
Sbjct: 346  AARILRPRKQHKEKGSSRKRRHYYEIFSGDLDAYWVLNRRIKVFWPLDQSWYYGLVNDYD 405

Query: 1286 PHEKLHHVKYDDREEEWIDLQNERFKLLLLPS 1381
               KLHHVKYDDR+EEWI+LQ+ERFKLLLLPS
Sbjct: 406  NVRKLHHVKYDDRDEEWINLQDERFKLLLLPS 437


>ref|XP_004292962.1| PREDICTED: uncharacterized protein LOC101313578 [Fragaria vesca
            subsp. vesca]
          Length = 1673

 Score =  248 bits (632), Expect = 6e-63
 Identities = 169/438 (38%), Positives = 225/438 (51%), Gaps = 17/438 (3%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN V  S   EI ++SRSLD+K+LY  +S    E           +SL  N   G    
Sbjct: 1    MENRVEISHGTEIPRRSRSLDVKSLYRSRSTKEAEN----------QSLKRNGSEGDGDG 50

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRD---STKKQNDRVSRGAN 469
                    EVSLSSL++ +  +       D     GL     D   S    + ++  G+ 
Sbjct: 51   EKKKKSRKEVSLSSLKNVNSSSSSSWKNIDKEYDRGLESGSHDPEASNSGSSQKLDSGSR 110

Query: 470  LKSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA----QMVKL 637
            L S S     +LD++ + IP+R R F  RKKF+       S  S   +S A    Q+ KL
Sbjct: 111  LNSVSQ---LSLDNSGIQIPRRKRGFVGRKKFEGGQALKLSDESAGKASIADQNHQVAKL 167

Query: 638  TGYPVTPTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMR 793
            +G  +       + +R    DE KEN +   N    +K ++  +   +       LK+ R
Sbjct: 168  SGEELDSQAEGWKAERNKGLDECKENLNSELNGALHAKKENALESRSVVSNGNSSLKKSR 227

Query: 794  RDHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTV 973
            R   KS++     +T   + EP+V++ +                 AA MLSSRFDPSCT 
Sbjct: 228  RKSRKSKDLSSDSRTDAKKAEPLVNSSTKACQASHEDEEENLEENAAMMLSSRFDPSCTG 287

Query: 974  FPGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPR--KKGKEK 1147
            F  +  A   +S NG S       D + H+    +GSES S D A R LRPR  K  KEK
Sbjct: 288  FSLNAKACAMQSSNGLS-----GQDFDGHMSKSLSGSESPSIDNAGRTLRPRPRKHHKEK 342

Query: 1148 RQVRKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDRE 1327
            +  RKRRHFYEIF  ++DA WV+NRRIKVFWPLDQ WY+GLV  YD  +KLHH++YDDRE
Sbjct: 343  KGTRKRRHFYEIFFGDLDACWVVNRRIKVFWPLDQSWYYGLVNDYDKDKKLHHIRYDDRE 402

Query: 1328 EEWIDLQNERFKLLLLPS 1381
            EEWIDLQ+ERFKLLLLP+
Sbjct: 403  EEWIDLQHERFKLLLLPT 420


>ref|XP_006601120.1| PREDICTED: uncharacterized protein LOC100789801 isoform X1 [Glycine
            max] gi|571538233|ref|XP_006601121.1| PREDICTED:
            uncharacterized protein LOC100789801 isoform X2 [Glycine
            max]
          Length = 1602

 Score =  247 bits (630), Expect = 1e-62
 Identities = 174/432 (40%), Positives = 223/432 (51%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME   +NS+   I KKSRSLDLK+LY  K       TE  A    +R    N   G D  
Sbjct: 1    MEGRAQNSNDTTIPKKSRSLDLKSLYKSKL------TENTAKKNLKRI--GNSSGGGDEK 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    EVSLSSLE+G           DGS    L ++Q+ S+              S
Sbjct: 53   RKKKKARKEVSLSSLENG-----------DGSSELKLGVSQKLSSSS------------S 89

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQ---STDVPNQSGISRKVSSDAQMVKLTGYP 649
            + + V+ ++  + + IPKR R F  RKK +   ++ V  QSG+  K+  + Q+ KL    
Sbjct: 90   TLNRVSFSVGDDDVQIPKRKRSFVGRKKSELGLASKVVEQSGL--KIGYNDQVPKLGSDD 147

Query: 650  VTPTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDHG 805
            +   + S + KRK  FDEFKEN +  +NS + +K       + +       L + RR H 
Sbjct: 148  LGSGVESFKIKRKKEFDEFKENRNSDSNSVQHAKENGDCASHSVVNSGDSSLSKSRRQHR 207

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K + +         E EP+V   S                 AARMLSSRFDPSCT F   
Sbjct: 208  KRKASAIDSTKVSKEAEPLVS--SSKISDDLQDEEENLEENAARMLSSRFDPSCTGFS-- 263

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                  K  NG SF  S      +H L    GSES SAD A RVLRPRK+ K K   RKR
Sbjct: 264  -----MKGSNGLSFFQSSSQSIVNHSLKSPLGSESTSADTAGRVLRPRKQYKNKSNSRKR 318

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFYEI   ++DA+WVLNRRIK+FWPLDQ WY+GLV  YD   KL+H+KYDDR+ +W++L
Sbjct: 319  RHFYEILLGDVDAYWVLNRRIKIFWPLDQSWYYGLVDNYDEGSKLYHIKYDDRDVKWVNL 378

Query: 1346 QNERFKLLLLPS 1381
            Q ERFKLLLL S
Sbjct: 379  QTERFKLLLLRS 390


>ref|XP_007013729.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 3 [Theobroma cacao] gi|508784092|gb|EOY31348.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 3 [Theobroma cacao]
          Length = 1674

 Score =  241 bits (614), Expect = 7e-61
 Identities = 158/400 (39%), Positives = 213/400 (53%), Gaps = 11/400 (2%)
 Frame = +2

Query: 215  SKEQTEGRALNRKRRSLPENKEFGSDSXXXXXXXXXEVSLSSLES--GSRKNGKFLNASD 388
            SKE ++ ++L RK  S   + E  S +          + LSS  +  GS  +       +
Sbjct: 12   SKESSKNKSLKRKDSSQEGDDEKRSSNNNKRKKSRKALPLSSFRTVDGSNSSKSLTEVYN 71

Query: 389  GSKIHGLILNQRDSTKKQNDRVSRGANLKSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQ 568
            G    GL     DS   +N  +S+       ++ ++ +L  +   IP+R R F  R KF+
Sbjct: 72   GGFSSGL----HDSESLKNLGLSQKLKNGCGANGISLSLGDSETRIPRRKRGFVGRNKFE 127

Query: 569  STDVPNQSGISRKVSSDA-QMVKLTGYPV-TPTIYSEGKRKNVFDEFKENSSDRANSTRG 742
                   +G S     D  + VKLT     T    S+ K+K   D+FKEN +  ++  + 
Sbjct: 128  GGQRLKLAGRSSSTVGDVKEEVKLTSEDSGTQNESSKVKQKKFIDDFKENRNSESSLVQH 187

Query: 743  SKAKDGTKCY-------PILKRMRRDHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXX 901
             K +DG   Y        +LK+ +R+  K +++    ++   + E +V +          
Sbjct: 188  LKEEDGVAAYLAVNDGDSLLKKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKE 247

Query: 902  XXXXXXXXXAARMLSSRFDPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAG 1081
                     AARMLSSRFDPSCT F  +    VS S NGFSF+ S  G + S      +G
Sbjct: 248  DDEENLEENAARMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLS-SGQNASSGSKTFSG 306

Query: 1082 SESNSADAACRVLRPRKKGKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWY 1261
            SES S DA+ RVLRPRK  KEK   RKRRHFYEI+S ++DA WVLNRRIKVFWPLD+ WY
Sbjct: 307  SESASVDASGRVLRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWY 366

Query: 1262 FGLVTGYDPHEKLHHVKYDDREEEWIDLQNERFKLLLLPS 1381
            +GLV  YD   KLHHVKYDDR+EEWI+LQNERFKLLL PS
Sbjct: 367  YGLVNEYDKERKLHHVKYDDRDEEWINLQNERFKLLLFPS 406


>ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228859
            [Cucumis sativus]
          Length = 1466

 Score =  235 bits (599), Expect = 4e-59
 Identities = 181/468 (38%), Positives = 231/468 (49%), Gaps = 44/468 (9%)
 Frame = +2

Query: 110  GS*MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGS 289
            G  MENS+ NS   +I KKSRSLDLK+LY  +S VSKE  + + L RK R+     E G 
Sbjct: 14   GKSMENSLENSHGTDIPKKSRSLDLKSLY--ESKVSKE-VQNKRLKRKGRA-----EDG- 64

Query: 290  DSXXXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGAN 469
            D          +VSLS+  S   ++ K L+    +   GL  +  DS K          N
Sbjct: 65   DVQKNERRNRKKVSLSNFSSIYSRSRKSLDEVYDA---GLGSSGHDSKKALKSESKDKLN 121

Query: 470  LKSSSSDVTPNLDSNVMPIPKRPRD-FSRRKKFQSTDVPNQSG-ISRKVSS-DAQMVKLT 640
              S  ++V   LD NVM IPKR R  F RRKK     +   SG +  K  S DA+   L 
Sbjct: 122  SSSEFNEVPLILDENVMHIPKRKRGGFVRRKKSHDGQILKPSGQLDAKAGSLDAKAGSLD 181

Query: 641  GYPVTPTIYSEGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCY---------------- 772
                T    ++   K+  D+ +   ++R  + +  K K+  +                  
Sbjct: 182  DKAGTVDQIAKSSVKDSSDQVECCKTNRKLAFKDLKEKEPKELRLHLKKEDGQADQLTRE 241

Query: 773  -------------------------PILKRMRRDHGKSEEAGPQEQTHRLEIEPVVDNFS 877
                                     P  K+ +++  K + +    +++  E E  +   +
Sbjct: 242  NELNPASRLKEEGEHIDHSVVKPVSPSSKKSKKNVRKRKISASGSKSNSKEGEASISQST 301

Query: 878  XXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSES 1057
                             AARMLSSRFDP+CT F  S T       NG SF+ S   D+ S
Sbjct: 302  KRRDGFPEDDEENLEENAARMLSSRFDPNCTGFXSSNTKGSLPPTNGLSFLLSSGHDNVS 361

Query: 1058 HLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKVF 1237
              L    G ES S DAA RVLRPRK+ KEK+  RKRRHFY+I   +IDA WVLNRRIKVF
Sbjct: 362  RGLK--PGLESASVDAAGRVLRPRKQRKEKKXSRKRRHFYDILFGDIDAAWVLNRRIKVF 419

Query: 1238 WPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQNERFKLLLLPS 1381
            WPLDQ WY+GLV  YD   KLHHVKYDDR+EEWIDLQNERFKLLLLPS
Sbjct: 420  WPLDQIWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPS 467


>ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Populus trichocarpa]
            gi|550317762|gb|EEF03395.2| hypothetical protein
            POPTR_0018s01030g [Populus trichocarpa]
          Length = 1722

 Score =  233 bits (595), Expect = 1e-58
 Identities = 171/485 (35%), Positives = 228/485 (47%), Gaps = 64/485 (13%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN V  S   EI KKSRSLD K+LY  K+    + +    L RK     ++++      
Sbjct: 1    MENRVGKSHGVEIPKKSRSLDHKSLYESKNPKGDQNSNN--LKRKGGGAGDDEK-----G 53

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDR--VSRGANL 472
                    EVS+SS ++      K +N+S    +  +      S  K++    + R A+ 
Sbjct: 54   HEKKKSRKEVSISSFKN------KNVNSSYSKSLKEVYNRSLSSGLKESKSGLIQRLAD- 106

Query: 473  KSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQ--SGISRKVSSDAQMVKLTGY 646
             +  S V+  LD  V  IP+R R F  RKK  +    ++   G  R+  +  Q  KLTG 
Sbjct: 107  SNGFSGVSLPLDGGVFKIPRRKRGFVGRKKVDNGSEGSKLTGGFGREAGNVDQADKLTGE 166

Query: 647  PVTPTIYSEG------------------------------------KRKNVFDEFKENSS 718
              +  + + G                                    K+K   D+ KEN +
Sbjct: 167  DESKWVENGGRELKAVGISGGEVDDVDQASKLTVEDKGKQVEPLKAKQKKGSDDLKENRN 226

Query: 719  DRANSTRGSKAKDGTKCYPILKRMRRDHGKSEEAGP------------------------ 826
            D  N++R  + +DG + + +  + R    K    GP                        
Sbjct: 227  DELNASRNLEEEDGHEGHSVATK-RDSSSKRPHNGPLVDNNGDLSLKKSLRKRSRKKGMV 285

Query: 827  QEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVTAPVSK 1006
             ++    E +P VD                    AA MLSSRFDPSCT F  +  A  S 
Sbjct: 286  SDKKRTKEDDPTVDTSMKMSGVFHDDEEENLEENAAMMLSSRFDPSCTGFSSNSKASASP 345

Query: 1007 SMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRHFYEIF 1186
            S N F        +  +H  S+ +GSES+S D   RVLRPRK+ KEK   RKRRH+YE+F
Sbjct: 346  SKNDFQ-------EFVAHGSSYVSGSESSSVDTDGRVLRPRKQNKEKGSTRKRRHYYEVF 398

Query: 1187 SRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQNERFKL 1366
            S ++DAHWVLNRRIKVFWPLDQ WY GLV  YD   KLHH+KYDDR+EEWIDLQNERFKL
Sbjct: 399  SGDLDAHWVLNRRIKVFWPLDQRWYHGLVGDYDKERKLHHIKYDDRDEEWIDLQNERFKL 458

Query: 1367 LLLPS 1381
            LLLPS
Sbjct: 459  LLLPS 463


>ref|XP_006596126.1| PREDICTED: uncharacterized protein LOC100781778 isoform X2 [Glycine
            max]
          Length = 1473

 Score =  229 bits (585), Expect = 2e-57
 Identities = 167/430 (38%), Positives = 215/430 (50%), Gaps = 9/430 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME    NS+   I KKSRSLDLK+LY  K       TE  A    +R    N   G    
Sbjct: 1    MEGIAENSNDTTIPKKSRSLDLKSLYKSKL------TENTAKKNLKRI--GNSSGGGGEK 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    EVSLSSL++G           DGS    L ++QR S+   +  ++R      
Sbjct: 53   RKKKKTRKEVSLSSLKNG-----------DGSSELKLGVSQRLSSSSSSSMLNR------ 95

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQ-SGISRKVSSDAQMVKLTGYPVT 655
                V+ ++  +   IPKR R F  RKK +     N    +S K+  D Q+ KL    + 
Sbjct: 96   ----VSFSVGGDDAQIPKRKRSFVGRKKSERGQASNLVEQLSCKIGYD-QVPKLGSADLG 150

Query: 656  PTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDHGKS 811
              + S + K K  FDEFKEN +  +NS +  K       + +       L + RR + K 
Sbjct: 151  SGVESFKIKHKKEFDEFKENRNSDSNSVQHIKEDGDCASHSVVNSGDSSLTKSRRKNRKR 210

Query: 812  EEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVT 991
            + +         E EP+V +                   AARMLSSRFDPSCT F     
Sbjct: 211  KASALDRTKVSKEAEPLVSSCKISDDLQEDEEENLEEN-AARMLSSRFDPSCTGFS---- 265

Query: 992  APVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRH 1171
               +K  NG  F  S      +H L   +GSES SAD A R+LRPRK+ K K   RKRRH
Sbjct: 266  ---TKCSNGLFFFGSSCQSIVNHGLKSKSGSESASADTAGRILRPRKQYKNKGSSRKRRH 322

Query: 1172 FYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQN 1351
            FYEI   ++DA+WVLNRRIK+FWPLDQ WY+GLV  YD   KL+H+KYDDR+ EW++L  
Sbjct: 323  FYEILLGDVDAYWVLNRRIKIFWPLDQSWYYGLVDNYDEGSKLYHIKYDDRDVEWVNLHT 382

Query: 1352 ERFKLLLLPS 1381
            ERFKLLLL S
Sbjct: 383  ERFKLLLLRS 392


>ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499788 [Cicer arietinum]
          Length = 1658

 Score =  229 bits (585), Expect = 2e-57
 Identities = 172/442 (38%), Positives = 220/442 (49%), Gaps = 18/442 (4%)
 Frame = +2

Query: 104  IEGS*MENSVRNSDVPEISKKSRSLDLKTLYVEK--SDVSKEQTEGRALNRKRRSLPENK 277
            +EGS  +NS  N D    SKKSRSLDLK+LY  K   +VSK+ ++     RK    P   
Sbjct: 1    MEGSREDNS--NGDAN--SKKSRSLDLKSLYKSKLTEEVSKKNSK-----RKGSGSPGG- 50

Query: 278  EFGSDSXXXXXXXXXEVSLSSLESGSRKNGKFLN--ASDGSKIHGLILNQRDSTKKQNDR 451
              G +          EVSLSSLE+G     K  +     G    G      D   +    
Sbjct: 51   --GEEKKNKRKKARKEVSLSSLENGEGSGKKVTDEECKQGPSSGG------DDLVELKLG 102

Query: 452  VSRGANLKSSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISR----KVSSD 619
            VS+G    S  S V      +V  IPKR R    RKK   +++   S + R     +  D
Sbjct: 103  VSKGVTSSSGPSRVLLGAGGDVC-IPKRKRTLVGRKK---SEIGQSSNLVRHPSPSIGHD 158

Query: 620  AQMVKLTGYPVTPTIYSEG-KRKNVFDEFKENSSDRANSTRGSKAKDGTKCYP------- 775
             Q+ KL        + S     K   +EFKEN +  +NS      K+     P       
Sbjct: 159  DQVPKLGSDDSGRAVQSSKINLKKHLNEFKENRNSDSNSISVKHVKENGDHAPHSVVNSD 218

Query: 776  --ILKRMRRDHGKSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSS 949
               LK+ ++   K +     +     E EP+ D+                   AARMLSS
Sbjct: 219  HSSLKKSKKKDRKRKTLASDKPRVSKEAEPLNDS-RKISVELQEDDEENLEENAARMLSS 277

Query: 950  RFDPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPR 1129
            RFDPSCT F  S  +    S NG SF+ S   +  +H     +GSES S D A R LRPR
Sbjct: 278  RFDPSCTGFSSSGKSSPLPSANGLSFLLSSSRNIVNHGSKSRSGSESASVDTAGRNLRPR 337

Query: 1130 KKGKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHV 1309
            ++ K+K + RKRRHFYEI   ++DA+WVLNRRIKVFWPLDQ WY+GLV  YD  ++LHH+
Sbjct: 338  QQYKDKEKSRKRRHFYEILPGDVDAYWVLNRRIKVFWPLDQSWYYGLVNDYDEQQRLHHI 397

Query: 1310 KYDDREEEWIDLQNERFKLLLL 1375
            KYDDR+EEWIDLQ ERFKLLLL
Sbjct: 398  KYDDRDEEWIDLQTERFKLLLL 419


>ref|XP_003545513.1| PREDICTED: uncharacterized protein LOC100781778 isoform X1 [Glycine
            max]
          Length = 1603

 Score =  229 bits (585), Expect = 2e-57
 Identities = 167/430 (38%), Positives = 215/430 (50%), Gaps = 9/430 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME    NS+   I KKSRSLDLK+LY  K       TE  A    +R    N   G    
Sbjct: 1    MEGIAENSNDTTIPKKSRSLDLKSLYKSKL------TENTAKKNLKRI--GNSSGGGGEK 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    EVSLSSL++G           DGS    L ++QR S+   +  ++R      
Sbjct: 53   RKKKKTRKEVSLSSLKNG-----------DGSSELKLGVSQRLSSSSSSSMLNR------ 95

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQ-SGISRKVSSDAQMVKLTGYPVT 655
                V+ ++  +   IPKR R F  RKK +     N    +S K+  D Q+ KL    + 
Sbjct: 96   ----VSFSVGGDDAQIPKRKRSFVGRKKSERGQASNLVEQLSCKIGYD-QVPKLGSADLG 150

Query: 656  PTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDHGKS 811
              + S + K K  FDEFKEN +  +NS +  K       + +       L + RR + K 
Sbjct: 151  SGVESFKIKHKKEFDEFKENRNSDSNSVQHIKEDGDCASHSVVNSGDSSLTKSRRKNRKR 210

Query: 812  EEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVT 991
            + +         E EP+V +                   AARMLSSRFDPSCT F     
Sbjct: 211  KASALDRTKVSKEAEPLVSSCKISDDLQEDEEENLEEN-AARMLSSRFDPSCTGFS---- 265

Query: 992  APVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRH 1171
               +K  NG  F  S      +H L   +GSES SAD A R+LRPRK+ K K   RKRRH
Sbjct: 266  ---TKCSNGLFFFGSSCQSIVNHGLKSKSGSESASADTAGRILRPRKQYKNKGSSRKRRH 322

Query: 1172 FYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQN 1351
            FYEI   ++DA+WVLNRRIK+FWPLDQ WY+GLV  YD   KL+H+KYDDR+ EW++L  
Sbjct: 323  FYEILLGDVDAYWVLNRRIKIFWPLDQSWYYGLVDNYDEGSKLYHIKYDDRDVEWVNLHT 382

Query: 1352 ERFKLLLLPS 1381
            ERFKLLLL S
Sbjct: 383  ERFKLLLLRS 392


>ref|XP_006601123.1| PREDICTED: uncharacterized protein LOC100792436 isoform X2 [Glycine
            max]
          Length = 1469

 Score =  223 bits (569), Expect = 1e-55
 Identities = 163/432 (37%), Positives = 215/432 (49%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME    N++   I KKSRSLDLK+LY  K       TE  A    +R    N   G D  
Sbjct: 1    MEGRAENTNDTAILKKSRSLDLKSLYKSKL------TENTAKKNLKRI--GNSSGGGDEK 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    +V LSSLE+G           DGS    L ++QR S+              S
Sbjct: 53   RKKKKARKKVFLSSLENG-----------DGSSELKLGVSQRLSSSS------------S 89

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKK---FQSTDVPNQSGISRKVSSDAQMVKLTGYP 649
            + + ++ ++  + + IPKR R F  RKK    Q++ V  QSG+  K+    Q+ KL    
Sbjct: 90   TLNRISFSVGDDDVQIPKRKRSFVGRKKSELVQASKVVEQSGL--KIGYGDQVPKLGSDD 147

Query: 650  VTPTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDHG 805
            +   + S + K    FDEFKEN +  +NS +  K       + +       L + RR + 
Sbjct: 148  LGSGVESFKIKHTKEFDEFKENRNSDSNSVQHVKEDGDCASHSVVNSGDSSLSKSRRKNR 207

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K + +         E EP+V   S                 AARMLSSRFDPSCT F   
Sbjct: 208  KRKASALDRTKVSKEAEPLVS--SCKIPGDLQDEEENLEENAARMLSSRFDPSCTGFS-- 263

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                  K +NG  F  S      +  L   +GSES SAD A R+LRPRK+ K K   RKR
Sbjct: 264  -----MKGLNGLPFFGSSSQSIVNRGLKSQSGSESASADTAGRILRPRKQYKNKGDSRKR 318

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFY+I   +++A+WVLNRRIK+FWPLDQ WY+G V  YD   KL+H+KYDDR+ EW++L
Sbjct: 319  RHFYKILLGDVNAYWVLNRRIKIFWPLDQSWYYGFVDNYDEGSKLYHIKYDDRDVEWVNL 378

Query: 1346 QNERFKLLLLPS 1381
              ERFKLLLL S
Sbjct: 379  HTERFKLLLLRS 390


>ref|XP_006601122.1| PREDICTED: uncharacterized protein LOC100792436 isoform X1 [Glycine
            max]
          Length = 1594

 Score =  223 bits (569), Expect = 1e-55
 Identities = 163/432 (37%), Positives = 215/432 (49%), Gaps = 11/432 (2%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME    N++   I KKSRSLDLK+LY  K       TE  A    +R    N   G D  
Sbjct: 1    MEGRAENTNDTAILKKSRSLDLKSLYKSKL------TENTAKKNLKRI--GNSSGGGDEK 52

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRDSTKKQNDRVSRGANLKS 478
                    +V LSSLE+G           DGS    L ++QR S+              S
Sbjct: 53   RKKKKARKKVFLSSLENG-----------DGSSELKLGVSQRLSSSS------------S 89

Query: 479  SSSDVTPNLDSNVMPIPKRPRDFSRRKK---FQSTDVPNQSGISRKVSSDAQMVKLTGYP 649
            + + ++ ++  + + IPKR R F  RKK    Q++ V  QSG+  K+    Q+ KL    
Sbjct: 90   TLNRISFSVGDDDVQIPKRKRSFVGRKKSELVQASKVVEQSGL--KIGYGDQVPKLGSDD 147

Query: 650  VTPTIYS-EGKRKNVFDEFKENSSDRANSTRGSKAKDGTKCYPI-------LKRMRRDHG 805
            +   + S + K    FDEFKEN +  +NS +  K       + +       L + RR + 
Sbjct: 148  LGSGVESFKIKHTKEFDEFKENRNSDSNSVQHVKEDGDCASHSVVNSGDSSLSKSRRKNR 207

Query: 806  KSEEAGPQEQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGS 985
            K + +         E EP+V   S                 AARMLSSRFDPSCT F   
Sbjct: 208  KRKASALDRTKVSKEAEPLVS--SCKIPGDLQDEEENLEENAARMLSSRFDPSCTGFS-- 263

Query: 986  VTAPVSKSMNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKR 1165
                  K +NG  F  S      +  L   +GSES SAD A R+LRPRK+ K K   RKR
Sbjct: 264  -----MKGLNGLPFFGSSSQSIVNRGLKSQSGSESASADTAGRILRPRKQYKNKGDSRKR 318

Query: 1166 RHFYEIFSRNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDL 1345
            RHFY+I   +++A+WVLNRRIK+FWPLDQ WY+G V  YD   KL+H+KYDDR+ EW++L
Sbjct: 319  RHFYKILLGDVNAYWVLNRRIKIFWPLDQSWYYGFVDNYDEGSKLYHIKYDDRDVEWVNL 378

Query: 1346 QNERFKLLLLPS 1381
              ERFKLLLL S
Sbjct: 379  HTERFKLLLLRS 390


>ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Populus trichocarpa]
            gi|550337121|gb|EEE93108.2| hypothetical protein
            POPTR_0006s26240g [Populus trichocarpa]
          Length = 1685

 Score =  222 bits (566), Expect = 3e-55
 Identities = 167/483 (34%), Positives = 223/483 (46%), Gaps = 63/483 (13%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            MEN V  S    I KKSRSLDLK+LY  K+  SK       L RK   + ++++   +  
Sbjct: 32   MENRVGKSHGVGIPKKSRSLDLKSLYETKN--SKWYQNSNNLKRKGGGIGDDEKGHKNKK 89

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLN-ASDGSKIHGLILNQRDSTKKQNDRVSRGANLK 475
                    EV +SS ++ +    K L    +GS   GL          +   + R A+  
Sbjct: 90   SRK-----EVCISSFKNVNSSYSKSLKEVYNGSLSSGL-------KDPRTGLIQRLADSN 137

Query: 476  SSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQ--SGISRKVSSDAQMVKLTGYP 649
              S    P L+   + IP+R R F  R+K  +     +   G  R+V +  Q  KLTG  
Sbjct: 138  GFSGASLP-LEDGAVKIPRRKRGFVGRRKVDNGSEGGKLARGFGREVGNADQADKLTGED 196

Query: 650  VTPTI------------------------------------YSEGKRKNVFDEFKENSSD 721
                +                                    +S+ K+K   D+ KEN + 
Sbjct: 197  EGKGVENGSQESKAVVILVSVVGDVDQASKLTGEGKAKQVEHSKAKQKKGSDDLKENRNG 256

Query: 722  RANSTRGSKAKDGTKCYPI------------------------LKRMRRDHGKSEEAGPQ 829
              +++R  K +DG   + +                        LK+  R   + ++    
Sbjct: 257  ELDASRHLKEEDGHDDHSVATKRDSSLKKSDNCPLVVNNGDSSLKKSLRKRSRKKKDMVS 316

Query: 830  EQTHRLEIEPVVDNFSXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVTAPVSKS 1009
             +    E +P VD                    AA MLSSRFDPSCT F  +  A  S S
Sbjct: 317  NKKRTKEADPSVDASIKISDVLHDEDEENLEENAAMMLSSRFDPSCTGFSSNSKASASPS 376

Query: 1010 MNGFSFVSSFHGDSESHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRHFYEIFS 1189
             +GF   ++          S+ +GSES+S D   RVLRPRK+ KEK   RKRRH+YEIFS
Sbjct: 377  KDGFQEFAARES-------SYVSGSESSSVDTDGRVLRPRKQNKEKGNTRKRRHYYEIFS 429

Query: 1190 RNIDAHWVLNRRIKVFWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQNERFKLL 1369
             ++DAHWVLNRRIKVFWPLDQ WY GLV  YD   KLHHVKYDDR+EEWI+LQNERFKLL
Sbjct: 430  GDLDAHWVLNRRIKVFWPLDQSWYHGLVGDYDKDRKLHHVKYDDRDEEWINLQNERFKLL 489

Query: 1370 LLP 1378
            +LP
Sbjct: 490  MLP 492


>ref|XP_007137088.1| hypothetical protein PHAVU_009G098700g [Phaseolus vulgaris]
            gi|561010175|gb|ESW09082.1| hypothetical protein
            PHAVU_009G098700g [Phaseolus vulgaris]
          Length = 1699

 Score =  213 bits (543), Expect = 1e-52
 Identities = 169/469 (36%), Positives = 220/469 (46%), Gaps = 48/469 (10%)
 Frame = +2

Query: 119  MENSVRNSDVPEISKKSRSLDLKTLYVEKSDVSKEQTEGRALNRKRRSLPENKEFGSDSX 298
            ME+   ++    I KKSRSLDLK+LY  K  V KE  E + L RK   L    E    + 
Sbjct: 1    MEDREESTHGTAIPKKSRSLDLKSLY--KPKVRKESPE-KGLKRKGSHLGGVHE----NT 53

Query: 299  XXXXXXXXEVSLSSLESGSRKNGKFLNASDGSKIHGLILNQRD-STKKQNDRVSRGANLK 475
                    EVSLSSLE+    N K +   D     GL    +D   +K   +   G+N  
Sbjct: 54   NKKKKTRKEVSLSSLENADVGNKKVV---DEECQKGLGSGWQDLCEQKLEPKQGSGSNTV 110

Query: 476  SSSSDVTPNLDSNVMPIPKRPRDFSRRKKFQSTDVPNQSGISRKVSSDA-QMVKLTGYPV 652
             +   +    D NV  IPKR RDF  R+K +    P  +G S        Q++KL+   +
Sbjct: 111  LNRGSLC--FDENVH-IPKRRRDFVGRRKIEVGPAPRLAGESSNTGGHGEQILKLSSNVL 167

Query: 653  TPTIYSEG-KRKNVFDEFKENSSDRA-----NSTRGSKAKD-------------GTKCYP 775
               I S   K K  FDE K   S  A     +S++ S  KD              T+  P
Sbjct: 168  DRGIESSKIKHKRDFDECKGTKSKSAVKSGDSSSKKSLKKDRKQKAFAPDRNRVATEVKP 227

Query: 776  ILKRMRRDHGKSEEAGPQEQTHRLEIEPVVDNF--------------------------- 874
             +   +    K +   P  +    E++P++D+                            
Sbjct: 228  PIDSSKASDYKQKAVAPDRRRVAKEVQPLIDDTKTSDYKQKSLAPDRNKVAKEVKPLIDD 287

Query: 875  SXXXXXXXXXXXXXXXXXAARMLSSRFDPSCTVFPGSVTAPVSKSMNGFSFVSSFHGDSE 1054
            +                 AARMLSSRFDP+   F  S       S NG SF+ S   + +
Sbjct: 288  NKISDYLREDEEENLEENAARMLSSRFDPNYAGFCSSSKPSTLPSSNGLSFLLSSSRNID 347

Query: 1055 SHLLSHSAGSESNSADAACRVLRPRKKGKEKRQVRKRRHFYEIFSRNIDAHWVLNRRIKV 1234
            S      +GSES S D A RVLRPRK+  EK + R+RRHFYEI   ++D HW+LN+RIKV
Sbjct: 348  SWASKSQSGSESASVDTAGRVLRPRKQYNEKGRSRRRRHFYEISLGDLDKHWILNQRIKV 407

Query: 1235 FWPLDQCWYFGLVTGYDPHEKLHHVKYDDREEEWIDLQNERFKLLLLPS 1381
            FWPLDQ WY GLV  Y+   K HH+KYDDREEEWI+L+ ERFKLLLLPS
Sbjct: 408  FWPLDQIWYHGLVDDYNKETKCHHIKYDDREEEWINLETERFKLLLLPS 456


Top