BLASTX nr result

ID: Cinnamomum25_contig00002833 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00002833
         (1329 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008775217.1| PREDICTED: uncharacterized protein LOC103695...   160   3e-36
ref|XP_010920082.1| PREDICTED: uncharacterized protein LOC105044...   159   6e-36
ref|XP_010920081.1| PREDICTED: uncharacterized protein LOC105044...   159   6e-36
ref|XP_010266411.1| PREDICTED: uncharacterized protein LOC104603...   138   8e-30
ref|XP_009401240.1| PREDICTED: uncharacterized protein LOC103985...   138   8e-30
ref|XP_009401239.1| PREDICTED: uncharacterized protein LOC103985...   138   8e-30
ref|XP_010657505.1| PREDICTED: uncharacterized protein LOC104880...   125   1e-25
emb|CBI28490.3| unnamed protein product [Vitis vinifera]              125   1e-25
ref|XP_007049458.1| Uncharacterized protein isoform 2 [Theobroma...   123   3e-25
ref|XP_007049457.1| Uncharacterized protein isoform 1 [Theobroma...   116   3e-23
ref|XP_010252972.1| PREDICTED: uncharacterized protein LOC104594...   112   8e-22
ref|XP_010252971.1| PREDICTED: uncharacterized protein LOC104594...   112   8e-22
ref|XP_010252970.1| PREDICTED: uncharacterized protein LOC104594...   112   8e-22
ref|XP_010252969.1| PREDICTED: uncharacterized protein LOC104594...   112   8e-22
gb|EEC67726.1| hypothetical protein OsI_35213 [Oryza sativa Indi...   111   1e-21
ref|XP_006594392.1| PREDICTED: histone-lysine N-methyltransferas...   111   1e-21
ref|NP_001065819.1| Os11g0160700 [Oryza sativa Japonica Group] g...   110   2e-21
ref|XP_007049460.1| Uncharacterized protein isoform 4 [Theobroma...   110   2e-21
ref|XP_010036112.1| PREDICTED: uncharacterized protein LOC104425...   107   2e-20
gb|KCW47629.1| hypothetical protein EUGRSUZ_K01371 [Eucalyptus g...   105   1e-19

>ref|XP_008775217.1| PREDICTED: uncharacterized protein LOC103695626, partial [Phoenix
            dactylifera]
          Length = 715

 Score =  160 bits (404), Expect = 3e-36
 Identities = 128/436 (29%), Positives = 185/436 (42%), Gaps = 3/436 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEGSD-FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S S C + VHE CLG    FE +  FYCPFCSY RA                 LS F+  
Sbjct: 329  SASGCSIGVHESCLGPFVKFEKTGLFYCPFCSYRRATLAYWKARKNLSWAKKGLSVFLGE 388

Query: 1150 DLV--HPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEK 977
            D +  H K+Q          ++E A NA+     H    H + ++   ST   + +Q E 
Sbjct: 389  DPIPGHRKEQSSPKVCRETSQAEVAGNASHQSDAH----HADKLNVSFSTAKEQQQQIEV 444

Query: 976  ASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFC 797
            A +        C +    L S++   P   ER   S           VE + D +V    
Sbjct: 445  AKV--------CDKG--KLHSKQMENPSSVERGDVS--------HNPVENNQDKMVMRV- 485

Query: 796  SNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESL 617
              ++   ++ +T + M++             E+  +S+     EHQQ  +     +N +L
Sbjct: 486  --NFPDAEDADTFINMEN-------------ELGNNSEHVSVAEHQQPIKPQAGIDNGNL 530

Query: 616  PCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPA 437
            PC +  TS           A ++ + +++ N  +      T    N G +      D+  
Sbjct: 531  PCEDRGTS-------HAVHAHDVSNAKQNSNIDSADDHLRTQQLKNQGQVEAFLDHDTGH 583

Query: 436  AEKRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSN 257
                P+   KH      S  +  A   + +   T  +Q+                   SN
Sbjct: 584  TGPSPQTSDKHC----KSRRQDAAVGEACHHQGTVGDQSRSKASPGKNGKYRSRPKRYSN 639

Query: 256  PLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKD 77
            P++P  RR KL WT EEE  LK+ V KF+  G G+LPWTKI++FG  VFH+TR PGDLKD
Sbjct: 640  PIIPSGRRRKLAWTVEEEEALKDVVPKFAANGDGTLPWTKILEFGRHVFHRTRQPGDLKD 699

Query: 76   KWRNIMIKEGAPRRKT 29
            KWRNI IKEG   RKT
Sbjct: 700  KWRNIQIKEGLRTRKT 715


>ref|XP_010920082.1| PREDICTED: uncharacterized protein LOC105044011 isoform X2 [Elaeis
            guineensis]
          Length = 798

 Score =  159 bits (401), Expect = 6e-36
 Identities = 128/436 (29%), Positives = 184/436 (42%), Gaps = 3/436 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEGSD-FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S S C + VHE CLG     E +  FYCPFCS+ RA                ALS+F+ +
Sbjct: 412  SASGCSIGVHESCLGPFVKVEKTGLFYCPFCSFRRAAIAYCKAREKLSSAKNALSAFLGK 471

Query: 1150 DLV--HPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEK 977
            D +  H K+Q          +++ A N +    +     H + ++   ST   E++Q E 
Sbjct: 472  DPIPGHRKEQSSPKVHRETSQADVAGNLSHQSAV----PHADKLNVSFSTAQEEHQQLEV 527

Query: 976  ASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFC 797
            A +        C +A        N + +    EG  ++         VE  HD +     
Sbjct: 528  AKV--------CDKAKLCSKQMDNPSGV----EGCDVS------HNPVENKHDKMFKKVS 569

Query: 796  SNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESL 617
              D              S D  +I    N  E+  +S+     EHQQ  E     +N +L
Sbjct: 570  FPD--------------SEDADTIINVEN--ELGNNSEHVWVAEHQQPTEPQEGIDNGNL 613

Query: 616  PCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPA 437
            PC +  TS    + D  +        +++ N  +     +T    N G +      D+  
Sbjct: 614  PCEDRDTSTGVHAHDVSKA-------KQNSNIDSADDHLQTQHLKNQGQVEAFLDHDTGH 666

Query: 436  AEKRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSN 257
                P+   KH    + S  +  A   + +   T  +Q+                   SN
Sbjct: 667  KSPSPQTSEKH----RESRRQHAAVGEACHTQGTVGDQSWNKVSPGKNLKYRSRPKRYSN 722

Query: 256  PLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKD 77
            P++P  RR KL WT +EE  LK+AV KFS    G+LPWTKI++FG +VFHKTR PGDLKD
Sbjct: 723  PIIPSGRRKKLAWTVQEEEALKDAVQKFSVNSDGTLPWTKILEFGRRVFHKTRQPGDLKD 782

Query: 76   KWRNIMIKEGAPRRKT 29
            KWRNI IKEG   RKT
Sbjct: 783  KWRNIQIKEGLRTRKT 798


>ref|XP_010920081.1| PREDICTED: uncharacterized protein LOC105044011 isoform X1 [Elaeis
            guineensis]
          Length = 799

 Score =  159 bits (401), Expect = 6e-36
 Identities = 128/436 (29%), Positives = 184/436 (42%), Gaps = 3/436 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEGSD-FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S S C + VHE CLG     E +  FYCPFCS+ RA                ALS+F+ +
Sbjct: 413  SASGCSIGVHESCLGPFVKVEKTGLFYCPFCSFRRAAIAYCKAREKLSSAKNALSAFLGK 472

Query: 1150 DLV--HPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEK 977
            D +  H K+Q          +++ A N +    +     H + ++   ST   E++Q E 
Sbjct: 473  DPIPGHRKEQSSPKVHRETSQADVAGNLSHQSAV----PHADKLNVSFSTAQEEHQQLEV 528

Query: 976  ASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFC 797
            A +        C +A        N + +    EG  ++         VE  HD +     
Sbjct: 529  AKV--------CDKAKLCSKQMDNPSGV----EGCDVS------HNPVENKHDKMFKKVS 570

Query: 796  SNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESL 617
              D              S D  +I    N  E+  +S+     EHQQ  E     +N +L
Sbjct: 571  FPD--------------SEDADTIINVEN--ELGNNSEHVWVAEHQQPTEPQEGIDNGNL 614

Query: 616  PCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPA 437
            PC +  TS    + D  +        +++ N  +     +T    N G +      D+  
Sbjct: 615  PCEDRDTSTGVHAHDVSKA-------KQNSNIDSADDHLQTQHLKNQGQVEAFLDHDTGH 667

Query: 436  AEKRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSN 257
                P+   KH    + S  +  A   + +   T  +Q+                   SN
Sbjct: 668  KSPSPQTSEKH----RESRRQHAAVGEACHTQGTVGDQSWNKVSPGKNLKYRSRPKRYSN 723

Query: 256  PLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKD 77
            P++P  RR KL WT +EE  LK+AV KFS    G+LPWTKI++FG +VFHKTR PGDLKD
Sbjct: 724  PIIPSGRRKKLAWTVQEEEALKDAVQKFSVNSDGTLPWTKILEFGRRVFHKTRQPGDLKD 783

Query: 76   KWRNIMIKEGAPRRKT 29
            KWRNI IKEG   RKT
Sbjct: 784  KWRNIQIKEGLRTRKT 799


>ref|XP_010266411.1| PREDICTED: uncharacterized protein LOC104603935 [Nelumbo nucifera]
            gi|720033383|ref|XP_010266413.1| PREDICTED:
            uncharacterized protein LOC104603935 [Nelumbo nucifera]
            gi|720033386|ref|XP_010266414.1| PREDICTED:
            uncharacterized protein LOC104603935 [Nelumbo nucifera]
          Length = 936

 Score =  138 bits (348), Expect = 8e-30
 Identities = 119/431 (27%), Positives = 187/431 (43%), Gaps = 3/431 (0%)
 Frame = -3

Query: 1321 SSCPMAVHEDCLGSLASFEGS-DFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRDL 1145
            ++C +AVHE CLGS ASF+ + +FYCPFC Y  A                + ++F     
Sbjct: 572  TNCQLAVHECCLGSPASFDNNGNFYCPFCFYAEAIVAYHDAKKKISLARKSFAAFTGEK- 630

Query: 1144 VHPKQQRQLPEGANNKESEAAR--NANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
             + +Q ++  E  + +E+++ R      P  I  NK   EII    S +  E+++    S
Sbjct: 631  -NEQQLKKQFENIHYEETQSRRFGEVGFPRNIPDNKHQTEIIHKQ-SVQAMEHQEAANDS 688

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCSN 791
            I   +  +PC+        E+N   L +  +  S  VD N +   +E H     +  CS 
Sbjct: 689  IEYKNGYIPCEGGKAPPVHEKNDIFLTEGDD--SKQVDDNHNNLVMECHLTKSYSNACSG 746

Query: 790  DYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPC 611
            + L  ++ E   + +  D+S    +   V I K        E+ +Q E+     +++ PC
Sbjct: 747  NDLQVRDGEAFAMNEISDISL--TKEKAVVIEKHQSVDWREENNKQVEADLKHKSKNSPC 804

Query: 610  REAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAAE 431
            R                  EMV   + + G   +R+   G  S+                
Sbjct: 805  RRR----------------EMVPLSQRLAGVRTRRQVLQGKFSS---------------- 832

Query: 430  KRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSNPL 251
             RP +  K     +TS+ E    +  +++++++R+Q                     +P 
Sbjct: 833  -RPNITTK-----ETSEAE----EDDISSNDSSRDQKP------------------ISPK 864

Query: 250  MPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKDKW 71
             P  RRTK  WT+EEE +L E V +FS       PW KI++FG  VFHKTRTP DLKDK+
Sbjct: 865  FPSLRRTKNPWTAEEEEILLEGVKRFSSTDGKGFPWKKILEFGCHVFHKTRTPVDLKDKY 924

Query: 70   RNIMIKEGAPR 38
            RNI  K G PR
Sbjct: 925  RNIWTK-GGPR 934


>ref|XP_009401240.1| PREDICTED: uncharacterized protein LOC103985310 isoform X2 [Musa
            acuminata subsp. malaccensis]
          Length = 883

 Score =  138 bits (348), Expect = 8e-30
 Identities = 126/443 (28%), Positives = 188/443 (42%), Gaps = 16/443 (3%)
 Frame = -3

Query: 1324 GSSCPMAVHEDCLGSLASFEGSD-FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRD 1148
            G+ C ++VHE CLGS   F+ S  FYCPFCSYTRA                 LS F+  +
Sbjct: 478  GNGCLISVHESCLGSSPIFDTSGLFYCPFCSYTRAAISYRKVKKNFIQARRVLSEFIGGN 537

Query: 1147 LVHPKQQRQLPEGANNKESEAAR--NANCPDKIHANKQHREIIDNHISTRVTEYRQQEKA 974
             V  +  R++     +KE+   R  + +C +    + Q +    N IS  V E+ + E+ 
Sbjct: 538  FV--RGHRKVSPSGVHKETNQTRVVDNSCSEHSAGSSQCKGNKLNEISVEVNEHSRVERE 595

Query: 973  SIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCS 794
                 DN   C     + +++ +   +V +  G  + V  ++H    EP+     A  C 
Sbjct: 596  --RACDN---CTSLMLNGNADLSKVHIVPQSNGEQVEVAEHQHLR--EPY---AAADNCR 645

Query: 793  NDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLP 614
             D    ++++    ++  D+  I+   N  ++              ++E   L  N  L 
Sbjct: 646  GDSCHARDID----VRQGDIVMIEDHSNVQQL-----------KTPEEEGLMLNGNADL- 689

Query: 613  CREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
               +K      S  E+ + AE     +H+           G   +A DI  ++ DD    
Sbjct: 690  ---SKGHIVPRSNREQVEVAE----HQHLREPYAAAVNCPGDPCHARDINDVRQDDIVMI 742

Query: 433  EKRPRVKRKHAMIQQTSDPETP--ASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXS 260
            E        H+ IQQ   PE    AS   V A E    +  ED                S
Sbjct: 743  EA-------HSNIQQLKTPEEEGHASVDKVLAGEKHGKRQLEDILVVDNNGRNKSSPAKS 795

Query: 259  -----------NPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKV 113
                       NP++P +RRTKL WT EEE  L+EAV +  E   GS+PW KI++ G  V
Sbjct: 796  KRHISRAKRYSNPILPPTRRTKLSWTPEEEEFLREAVHELGEKNDGSIPWVKILELGRHV 855

Query: 112  FHKTRTPGDLKDKWRNIMIKEGA 44
             HKTR PGDLKDKWRN+  KE +
Sbjct: 856  IHKTRQPGDLKDKWRNMKKKEAS 878


>ref|XP_009401239.1| PREDICTED: uncharacterized protein LOC103985310 isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 886

 Score =  138 bits (348), Expect = 8e-30
 Identities = 126/443 (28%), Positives = 188/443 (42%), Gaps = 16/443 (3%)
 Frame = -3

Query: 1324 GSSCPMAVHEDCLGSLASFEGSD-FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRD 1148
            G+ C ++VHE CLGS   F+ S  FYCPFCSYTRA                 LS F+  +
Sbjct: 481  GNGCLISVHESCLGSSPIFDTSGLFYCPFCSYTRAAISYRKVKKNFIQARRVLSEFIGGN 540

Query: 1147 LVHPKQQRQLPEGANNKESEAAR--NANCPDKIHANKQHREIIDNHISTRVTEYRQQEKA 974
             V  +  R++     +KE+   R  + +C +    + Q +    N IS  V E+ + E+ 
Sbjct: 541  FV--RGHRKVSPSGVHKETNQTRVVDNSCSEHSAGSSQCKGNKLNEISVEVNEHSRVERE 598

Query: 973  SIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCS 794
                 DN   C     + +++ +   +V +  G  + V  ++H    EP+     A  C 
Sbjct: 599  --RACDN---CTSLMLNGNADLSKVHIVPQSNGEQVEVAEHQHLR--EPY---AAADNCR 648

Query: 793  NDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLP 614
             D    ++++    ++  D+  I+   N  ++              ++E   L  N  L 
Sbjct: 649  GDSCHARDID----VRQGDIVMIEDHSNVQQL-----------KTPEEEGLMLNGNADL- 692

Query: 613  CREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
               +K      S  E+ + AE     +H+           G   +A DI  ++ DD    
Sbjct: 693  ---SKGHIVPRSNREQVEVAE----HQHLREPYAAAVNCPGDPCHARDINDVRQDDIVMI 745

Query: 433  EKRPRVKRKHAMIQQTSDPETP--ASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXS 260
            E        H+ IQQ   PE    AS   V A E    +  ED                S
Sbjct: 746  EA-------HSNIQQLKTPEEEGHASVDKVLAGEKHGKRQLEDILVVDNNGRNKSSPAKS 798

Query: 259  -----------NPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKV 113
                       NP++P +RRTKL WT EEE  L+EAV +  E   GS+PW KI++ G  V
Sbjct: 799  KRHISRAKRYSNPILPPTRRTKLSWTPEEEEFLREAVHELGEKNDGSIPWVKILELGRHV 858

Query: 112  FHKTRTPGDLKDKWRNIMIKEGA 44
             HKTR PGDLKDKWRN+  KE +
Sbjct: 859  IHKTRQPGDLKDKWRNMKKKEAS 881


>ref|XP_010657505.1| PREDICTED: uncharacterized protein LOC104880931 [Vitis vinifera]
            gi|731410304|ref|XP_010657506.1| PREDICTED:
            uncharacterized protein LOC104880931 [Vitis vinifera]
          Length = 546

 Score =  125 bits (313), Expect = 1e-25
 Identities = 110/454 (24%), Positives = 183/454 (40%), Gaps = 32/454 (7%)
 Frame = -3

Query: 1315 CPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRDLVH 1139
            C +AVHE C+   A+F+   DFYCP+C Y  A                ALS+F+    + 
Sbjct: 129  CRLAVHEKCMNCSAAFDDMGDFYCPYCWYRCAIAKSNEARKRAMSSKKALSTFLDTKALC 188

Query: 1138 PKQQRQLPEGANNKESEAARNANCPDK-------------IHANKQHRE--IIDNHISTR 1004
              QQ++  + +N K+  +    +C +              + A K  ++   +D      
Sbjct: 189  GNQQKEKTKSSNGKKPPSTSERSCNENEYRLDYDEVYNQSVQAEKDQQDGFALDFEQHQI 248

Query: 1003 VTEYRQQEKASIHLSDNNLPCKE-AATSLDSERNGAPLVQEREGASITVDSNKHKTAVEP 827
            V +++   K+S+   D NL  +E   TS D    G    Q+ +G    + + K +  ++ 
Sbjct: 249  VAQHQWHMKSSVDDGDGNLYSREEGTTSADGSFQGFVANQKFDGVK-QLAAVKVREMIQE 307

Query: 826  HHDVVVATFCSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVK----DSQCTREG--- 668
             H   V   C ++ +   + E   +   H      ++ +   + K    D++ T E    
Sbjct: 308  EHSREVGD-CQDEGVAEDQQEAEPLNDCHLEEETTLDGDFSVLTKGKKVDAKMTEENLGR 366

Query: 667  -------EHQQQKESPTLCNNESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQ 509
                   + Q Q+ +  +   +       K +     ID    A  ++ +QRH+    + 
Sbjct: 367  REEEEQMQPQAQETTTAIPGGDPASLVHEKVNIGFRIIDSCRGARTLLTHQRHVGQRAKN 426

Query: 508  RRAETGTNSNAGDIPCIQVDDSPAAEKRPRVKRKHAMIQQTS-DPETPASQPSVNADETA 332
            +      +S     P +  +    AEK      K  ++   S  P  P+ Q +       
Sbjct: 427  KMVSQNVDSQKKSSPDLHNN----AEKNAGDGTKEVIVSSKSIQPRGPSKQLT------- 475

Query: 331  RNQNAEDFXXXXXXXXXXXXXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGS 152
                                    N + P+ RR KLLW ++EE +LKE V KFS  G  +
Sbjct: 476  ------------------------NQIFPNERRKKLLWKTDEEEMLKEGVQKFSATGDKN 511

Query: 151  LPWTKIMDFGNKVFHKTRTPGDLKDKWRNIMIKE 50
            LPW KI++FG  VF  TRTP DLKDKWR ++ KE
Sbjct: 512  LPWRKILEFGRHVFDGTRTPVDLKDKWRKMLAKE 545


>emb|CBI28490.3| unnamed protein product [Vitis vinifera]
          Length = 566

 Score =  125 bits (313), Expect = 1e-25
 Identities = 110/454 (24%), Positives = 183/454 (40%), Gaps = 32/454 (7%)
 Frame = -3

Query: 1315 CPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRDLVH 1139
            C +AVHE C+   A+F+   DFYCP+C Y  A                ALS+F+    + 
Sbjct: 149  CRLAVHEKCMNCSAAFDDMGDFYCPYCWYRCAIAKSNEARKRAMSSKKALSTFLDTKALC 208

Query: 1138 PKQQRQLPEGANNKESEAARNANCPDK-------------IHANKQHRE--IIDNHISTR 1004
              QQ++  + +N K+  +    +C +              + A K  ++   +D      
Sbjct: 209  GNQQKEKTKSSNGKKPPSTSERSCNENEYRLDYDEVYNQSVQAEKDQQDGFALDFEQHQI 268

Query: 1003 VTEYRQQEKASIHLSDNNLPCKE-AATSLDSERNGAPLVQEREGASITVDSNKHKTAVEP 827
            V +++   K+S+   D NL  +E   TS D    G    Q+ +G    + + K +  ++ 
Sbjct: 269  VAQHQWHMKSSVDDGDGNLYSREEGTTSADGSFQGFVANQKFDGVK-QLAAVKVREMIQE 327

Query: 826  HHDVVVATFCSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVK----DSQCTREG--- 668
             H   V   C ++ +   + E   +   H      ++ +   + K    D++ T E    
Sbjct: 328  EHSREVGD-CQDEGVAEDQQEAEPLNDCHLEEETTLDGDFSVLTKGKKVDAKMTEENLGR 386

Query: 667  -------EHQQQKESPTLCNNESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQ 509
                   + Q Q+ +  +   +       K +     ID    A  ++ +QRH+    + 
Sbjct: 387  REEEEQMQPQAQETTTAIPGGDPASLVHEKVNIGFRIIDSCRGARTLLTHQRHVGQRAKN 446

Query: 508  RRAETGTNSNAGDIPCIQVDDSPAAEKRPRVKRKHAMIQQTS-DPETPASQPSVNADETA 332
            +      +S     P +  +    AEK      K  ++   S  P  P+ Q +       
Sbjct: 447  KMVSQNVDSQKKSSPDLHNN----AEKNAGDGTKEVIVSSKSIQPRGPSKQLT------- 495

Query: 331  RNQNAEDFXXXXXXXXXXXXXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGS 152
                                    N + P+ RR KLLW ++EE +LKE V KFS  G  +
Sbjct: 496  ------------------------NQIFPNERRKKLLWKTDEEEMLKEGVQKFSATGDKN 531

Query: 151  LPWTKIMDFGNKVFHKTRTPGDLKDKWRNIMIKE 50
            LPW KI++FG  VF  TRTP DLKDKWR ++ KE
Sbjct: 532  LPWRKILEFGRHVFDGTRTPVDLKDKWRKMLAKE 565


>ref|XP_007049458.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590712761|ref|XP_007049459.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508701719|gb|EOX93615.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508701720|gb|EOX93616.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 487

 Score =  123 bits (309), Expect = 3e-25
 Identities = 103/431 (23%), Positives = 169/431 (39%), Gaps = 1/431 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S + CP+ +HE C+     F+    FYCP+C Y R                  LS+F+C 
Sbjct: 114  SENGCPVTIHEVCMNCNPKFDNMGKFYCPYCWYKRELVRTKELRRKAMLARKELSNFICL 173

Query: 1150 DLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
                  ++ Q+ E    K +  +  A    KI+       + D +               
Sbjct: 174  KRDGGNEEMQVDETETMKAASVSTMAG---KINTGDSENGLNDKN------------NER 218

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCSN 791
            IH      P  E+ +  D ERN      E  G    +             D+  A+   +
Sbjct: 219  IHHDQEETPGVESISKSDEERNSRARGSENFGDGERIQDE----------DIENASDSED 268

Query: 790  DYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPC 611
            D +   + +   I  SH    +++E+  + +         G  ++ KE P L N  ++  
Sbjct: 269  DEIDEDQWQIQPISSSH----LEIEKGALPVSTKETSDNVGVLEENKEEPVLPN--AVGT 322

Query: 610  REAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAAE 431
              A  + D  S     ++ E V                         +P +  +     +
Sbjct: 323  TMALITSDCTSKVPAIESFEFV-------------------------LPDLNTETLVVRQ 357

Query: 430  KRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSNPL 251
            KR +   +    Q+   P+ P+S+PS +A +   NQ  +                  +  
Sbjct: 358  KRVKRTAQKEWPQKVDSPKMPSSEPSTSAKDKKMNQQGKATAAKNSVQCQELNKRFVSSK 417

Query: 250  MPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKDKW 71
            +   +R +L WT+EEE +LKE V +FS     ++PW KI++FG+ VFH TRTP DLKDKW
Sbjct: 418  LGTEKRRRLHWTAEEEDMLKEGVRRFSSIVNKNIPWRKILEFGHHVFHSTRTPVDLKDKW 477

Query: 70   RNIMIKEGAPR 38
            +NI+ KE AP+
Sbjct: 478  KNIIAKE-APK 487


>ref|XP_007049457.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508701718|gb|EOX93614.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 502

 Score =  116 bits (291), Expect = 3e-23
 Identities = 104/446 (23%), Positives = 170/446 (38%), Gaps = 16/446 (3%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S + CP+ +HE C+     F+    FYCP+C Y R                  LS+F+C 
Sbjct: 114  SENGCPVTIHEVCMNCNPKFDNMGKFYCPYCWYKRELVRTKELRRKAMLARKELSNFICL 173

Query: 1150 DLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
                  ++ Q+ E    K +  +  A    KI+       + D +               
Sbjct: 174  KRDGGNEEMQVDETETMKAASVSTMAG---KINTGDSENGLNDKN------------NER 218

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCSN 791
            IH      P  E+ +  D ERN      E  G    +             D+  A+   +
Sbjct: 219  IHHDQEETPGVESISKSDEERNSRARGSENFGDGERIQDE----------DIENASDSED 268

Query: 790  DYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPC 611
            D +   + +   I  SH    +++E+  + +         G  ++ KE P L N  ++  
Sbjct: 269  DEIDEDQWQIQPISSSH----LEIEKGALPVSTKETSDNVGVLEENKEEPVLPN--AVGT 322

Query: 610  REAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAAE 431
              A  + D  S     ++ E V                         +P +  +     +
Sbjct: 323  TMALITSDCTSKVPAIESFEFV-------------------------LPDLNTETLVVRQ 357

Query: 430  KRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAED---------------FXXXX 296
            KR +   +    Q+   P+ P+S+PS +A +   NQ  +                F    
Sbjct: 358  KRVKRTAQKEWPQKVDSPKMPSSEPSTSAKDKKMNQQGKATAAKNSVQCQELNKRFYYYS 417

Query: 295  XXXXXXXXXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNK 116
                        +  +   +R +L WT+EEE +LKE V +FS     ++PW KI++FG+ 
Sbjct: 418  KITLYFHLTCSVSSKLGTEKRRRLHWTAEEEDMLKEGVRRFSSIVNKNIPWRKILEFGHH 477

Query: 115  VFHKTRTPGDLKDKWRNIMIKEGAPR 38
            VFH TRTP DLKDKW+NI+ KE AP+
Sbjct: 478  VFHSTRTPVDLKDKWKNIIAKE-APK 502


>ref|XP_010252972.1| PREDICTED: uncharacterized protein LOC104594379 isoform X4 [Nelumbo
            nucifera]
          Length = 813

 Score =  112 bits (279), Expect = 8e-22
 Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1159 MCRDLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQE 980
            M +D  H   + Q+P  +N        +         +   R  +++H+         Q+
Sbjct: 450  MLKDNQHLTVEEQMPAESNTAGKRNVSSTTVKHMQQMDHNLRNFVEDHL---------QK 500

Query: 979  KASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATF 800
              SI  S  +LPC+E    L    +     ++R  ++ T +  ++    E    V     
Sbjct: 501  VTSIAQSGEDLPCREDFPFLSDTFDKCSTKRKRNHSN-TFEGYRYIREAEQKRHVKFCAT 559

Query: 799  CSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIV--KDSQCTREGEHQQQKESPTLCNN 626
              N   PC+    +VI  +  ++  + +      +  K S    +GEH+Q+ +  T  N 
Sbjct: 560  SENSNSPCEGTICAVINDNDHLTEAEEQMQVKGNIEHKGSSSLSKGEHEQKVDCQTKRNV 619

Query: 625  ESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDD 446
            E    +E  T+   ++         MV+   H+    +Q   +    S    +PC   + 
Sbjct: 620  EDQLKKEQSTAHYGDN------DPNMVEED-HIREDERQPCEKANVWSRNRILPCRDTET 672

Query: 445  SPAAEKR---PRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXX 275
             P+   R    R+K+     Q    PE P+ + S   DE A +QN +             
Sbjct: 673  LPSDSDRYSAQRIKKAILQPQIYDQPEEPSLKASSLVDENAEDQNRKAIASNKSIKYQRA 732

Query: 274  XXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRT 95
                 +    +SRR KL W +EEE  LK+ V  FS     ++PW KI++FG  VF  TRT
Sbjct: 733  ANHSPSSSFSNSRRKKLPWKAEEEETLKKGVQMFSTTVNKNIPWRKILEFGANVFDGTRT 792

Query: 94   PGDLKDKWRNIMIKEG 47
            P DLKDKW+NI   +G
Sbjct: 793  PVDLKDKWKNIKKAKG 808


>ref|XP_010252971.1| PREDICTED: uncharacterized protein LOC104594379 isoform X3 [Nelumbo
            nucifera]
          Length = 815

 Score =  112 bits (279), Expect = 8e-22
 Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1159 MCRDLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQE 980
            M +D  H   + Q+P  +N        +         +   R  +++H+         Q+
Sbjct: 450  MLKDNQHLTVEEQMPAESNTAGKRNVSSTTVKHMQQMDHNLRNFVEDHL---------QK 500

Query: 979  KASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATF 800
              SI  S  +LPC+E    L    +     ++R  ++ T +  ++    E    V     
Sbjct: 501  VTSIAQSGEDLPCREDFPFLSDTFDKCSTKRKRNHSN-TFEGYRYIREAEQKRHVKFCAT 559

Query: 799  CSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIV--KDSQCTREGEHQQQKESPTLCNN 626
              N   PC+    +VI  +  ++  + +      +  K S    +GEH+Q+ +  T  N 
Sbjct: 560  SENSNSPCEGTICAVINDNDHLTEAEEQMQVKGNIEHKGSSSLSKGEHEQKVDCQTKRNV 619

Query: 625  ESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDD 446
            E    +E  T+   ++         MV+   H+    +Q   +    S    +PC   + 
Sbjct: 620  EDQLKKEQSTAHYGDN------DPNMVEED-HIREDERQPCEKANVWSRNRILPCRDTET 672

Query: 445  SPAAEKR---PRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXX 275
             P+   R    R+K+     Q    PE P+ + S   DE A +QN +             
Sbjct: 673  LPSDSDRYSAQRIKKAILQPQIYDQPEEPSLKASSLVDENAEDQNRKAIASNKSIKYQRA 732

Query: 274  XXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRT 95
                 +    +SRR KL W +EEE  LK+ V  FS     ++PW KI++FG  VF  TRT
Sbjct: 733  ANHSPSSSFSNSRRKKLPWKAEEEETLKKGVQMFSTTVNKNIPWRKILEFGANVFDGTRT 792

Query: 94   PGDLKDKWRNIMIKEG 47
            P DLKDKW+NI   +G
Sbjct: 793  PVDLKDKWKNIKKAKG 808


>ref|XP_010252970.1| PREDICTED: uncharacterized protein LOC104594379 isoform X2 [Nelumbo
            nucifera]
          Length = 843

 Score =  112 bits (279), Expect = 8e-22
 Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1159 MCRDLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQE 980
            M +D  H   + Q+P  +N        +         +   R  +++H+         Q+
Sbjct: 450  MLKDNQHLTVEEQMPAESNTAGKRNVSSTTVKHMQQMDHNLRNFVEDHL---------QK 500

Query: 979  KASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATF 800
              SI  S  +LPC+E    L    +     ++R  ++ T +  ++    E    V     
Sbjct: 501  VTSIAQSGEDLPCREDFPFLSDTFDKCSTKRKRNHSN-TFEGYRYIREAEQKRHVKFCAT 559

Query: 799  CSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIV--KDSQCTREGEHQQQKESPTLCNN 626
              N   PC+    +VI  +  ++  + +      +  K S    +GEH+Q+ +  T  N 
Sbjct: 560  SENSNSPCEGTICAVINDNDHLTEAEEQMQVKGNIEHKGSSSLSKGEHEQKVDCQTKRNV 619

Query: 625  ESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDD 446
            E    +E  T+   ++         MV+   H+    +Q   +    S    +PC   + 
Sbjct: 620  EDQLKKEQSTAHYGDN------DPNMVEED-HIREDERQPCEKANVWSRNRILPCRDTET 672

Query: 445  SPAAEKR---PRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXX 275
             P+   R    R+K+     Q    PE P+ + S   DE A +QN +             
Sbjct: 673  LPSDSDRYSAQRIKKAILQPQIYDQPEEPSLKASSLVDENAEDQNRKAIASNKSIKYQRA 732

Query: 274  XXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRT 95
                 +    +SRR KL W +EEE  LK+ V  FS     ++PW KI++FG  VF  TRT
Sbjct: 733  ANHSPSSSFSNSRRKKLPWKAEEEETLKKGVQMFSTTVNKNIPWRKILEFGANVFDGTRT 792

Query: 94   PGDLKDKWRNIMIKEG 47
            P DLKDKW+NI   +G
Sbjct: 793  PVDLKDKWKNIKKAKG 808


>ref|XP_010252969.1| PREDICTED: uncharacterized protein LOC104594379 isoform X1 [Nelumbo
            nucifera]
          Length = 846

 Score =  112 bits (279), Expect = 8e-22
 Identities = 93/376 (24%), Positives = 153/376 (40%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1159 MCRDLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQE 980
            M +D  H   + Q+P  +N        +         +   R  +++H+         Q+
Sbjct: 450  MLKDNQHLTVEEQMPAESNTAGKRNVSSTTVKHMQQMDHNLRNFVEDHL---------QK 500

Query: 979  KASIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATF 800
              SI  S  +LPC+E    L    +     ++R  ++ T +  ++    E    V     
Sbjct: 501  VTSIAQSGEDLPCREDFPFLSDTFDKCSTKRKRNHSN-TFEGYRYIREAEQKRHVKFCAT 559

Query: 799  CSNDYLPCKEVETSVIMKSHDVSSIDVERNCVEIV--KDSQCTREGEHQQQKESPTLCNN 626
              N   PC+    +VI  +  ++  + +      +  K S    +GEH+Q+ +  T  N 
Sbjct: 560  SENSNSPCEGTICAVINDNDHLTEAEEQMQVKGNIEHKGSSSLSKGEHEQKVDCQTKRNV 619

Query: 625  ESLPCREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDD 446
            E    +E  T+   ++         MV+   H+    +Q   +    S    +PC   + 
Sbjct: 620  EDQLKKEQSTAHYGDN------DPNMVEED-HIREDERQPCEKANVWSRNRILPCRDTET 672

Query: 445  SPAAEKR---PRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXX 275
             P+   R    R+K+     Q    PE P+ + S   DE A +QN +             
Sbjct: 673  LPSDSDRYSAQRIKKAILQPQIYDQPEEPSLKASSLVDENAEDQNRKAIASNKSIKYQRA 732

Query: 274  XXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRT 95
                 +    +SRR KL W +EEE  LK+ V  FS     ++PW KI++FG  VF  TRT
Sbjct: 733  ANHSPSSSFSNSRRKKLPWKAEEEETLKKGVQMFSTTVNKNIPWRKILEFGANVFDGTRT 792

Query: 94   PGDLKDKWRNIMIKEG 47
            P DLKDKW+NI   +G
Sbjct: 793  PVDLKDKWKNIKKAKG 808


>gb|EEC67726.1| hypothetical protein OsI_35213 [Oryza sativa Indica Group]
          Length = 951

 Score =  111 bits (277), Expect = 1e-21
 Identities = 111/431 (25%), Positives = 169/431 (39%), Gaps = 5/431 (1%)
 Frame = -3

Query: 1321 SSCPMAVHEDCLGSLASFEGS-DFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR-D 1148
            SSC +A H+ C GSLA+ + S   YCP C YT+A                 LS+F+ R  
Sbjct: 595  SSCLLAAHDTCFGSLATLDDSGQLYCPVCFYTKATEAYQKAKKTYSEARKNLSAFLGRKQ 654

Query: 1147 LVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDN--HISTRVTEYRQQEKA 974
            L    QQ  + + A N E        C D       H+   +N  H     T  R+++K 
Sbjct: 655  LAEQHQQAAVGQRAANNEDHLN---GCNDASKRKDNHQSEGNNLSHRDEDPTRKRKKQKT 711

Query: 973  SIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCS 794
                   N      A  + +E+   P+VQ  + A +    NKH           VA    
Sbjct: 712  -------NATSDACAQEVVTEK--VPVVQNSDVAPM----NKHSVLQNNRKQAQVA---- 754

Query: 793  NDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLP 614
             ++   +E E +     +D S                      H+    S T C      
Sbjct: 755  -EHEQPEENEEASGESGNDNSL---------------------HKTTHSSQTKC------ 786

Query: 613  CREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
                     S ++++  DA +        NG    +++E     ++ +I     +DS   
Sbjct: 787  ---------SPAVNQNVDADKE-------NGLASSQQSE-----DSDEIEATSSNDSTKK 825

Query: 433  EKRPRVKRKHAM-IQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSN 257
               P  K +H   I Q  D   P++   V+      N++                   SN
Sbjct: 826  SSPPWRKLRHRKAIYQDKDTAMPSNSKKVHG-----NRDQHMASPSRKRNYACPPKRYSN 880

Query: 256  PLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKD 77
            P++P  RRTKL WT +EE  L+EA+ KF+    G +PW +I++ G  VFH+TR P DL+ 
Sbjct: 881  PIVPAGRRTKLCWTEKEEITLREAMAKFTPRDNGPIPWVQILEHGRDVFHRTRLPSDLRV 940

Query: 76   KWRNIMIKEGA 44
            KWRN+  K G+
Sbjct: 941  KWRNMKKKSGS 951


>ref|XP_006594392.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-79
            specific-like isoform X1 [Glycine max]
            gi|571499066|ref|XP_006594393.1| PREDICTED:
            histone-lysine N-methyltransferase, H3 lysine-79
            specific-like isoform X2 [Glycine max]
          Length = 451

 Score =  111 bits (277), Expect = 1e-21
 Identities = 109/437 (24%), Positives = 175/437 (40%), Gaps = 11/437 (2%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEGS-DFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            SG  CP+AVH  CL +   F+GS +F CP+C Y RA                 LS F+  
Sbjct: 56   SGRGCPVAVHATCLATGPKFDGSGNFCCPYCWYKRAVDTCRRLREKALEAKGDLSRFL-- 113

Query: 1150 DLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
                          A + +     +    ++     Q ++  D     RV +   +E+  
Sbjct: 114  ------DNHDHARAAAHVDLVVQDSEELMEETGTQAQSKDNKDEEGEARVNQVHDREETE 167

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASIT-VDSNKHKTAVEPHHDVVVATFCS 794
                 N    KE    +   R+   LV+ERE  ++T   S ++K   +   D       S
Sbjct: 168  TEPEGN----KEKEGKV---RDNEELVEERERKTVTEAQSQENKAEEDKFQD------DS 214

Query: 793  NDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLP 614
             + +   E ET V               C E  ++ +     EH ++ E+ T    +   
Sbjct: 215  EELVVETETETEV--------------QCEENKEEGKVRDSEEHVEEMETETGAEAQPEE 260

Query: 613  CREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETG-------TNSNAGDIPCIQ 455
             ++     DSE + E        + Q    G +++++ E G       + S   D   + 
Sbjct: 261  KKDEGKVRDSEKLVE--------ETQTETEGQSEEKKDEEGKVAVMSSSVSETYDSDSVA 312

Query: 454  VDDSPAAEKRPRV--KRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXX 281
            V      +K+ +V   RK   +QQ    +   ++  V  +E   +               
Sbjct: 313  VSMKKRKDKKKKVTSARKSLSLQQEHKNKHYKTRGKVANEEEVTSFKTTSLGQQPQRMKQ 372

Query: 280  XXXXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKT 101
                         ++R +LLWT+EEE VLKE V KFS   Q ++PW KI++FG +VF +T
Sbjct: 373  SSLA---------AKRKRLLWTAEEEKVLKEGVSKFSTENQ-NIPWRKILEFGCRVFDET 422

Query: 100  RTPGDLKDKWRNIMIKE 50
            RTP DLKDKW+NI+ K+
Sbjct: 423  RTPVDLKDKWKNIISKK 439


>ref|NP_001065819.1| Os11g0160700 [Oryza sativa Japonica Group] gi|77548797|gb|ABA91594.1|
            expressed protein [Oryza sativa Japonica Group]
            gi|113644523|dbj|BAF27664.1| Os11g0160700 [Oryza sativa
            Japonica Group] gi|222615565|gb|EEE51697.1| hypothetical
            protein OsJ_33066 [Oryza sativa Japonica Group]
          Length = 951

 Score =  110 bits (276), Expect = 2e-21
 Identities = 111/431 (25%), Positives = 169/431 (39%), Gaps = 5/431 (1%)
 Frame = -3

Query: 1321 SSCPMAVHEDCLGSLASFEGS-DFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR-D 1148
            SSC +A H+ C GSLA+ + S   YCP C YT+A                 LS+F+ R  
Sbjct: 595  SSCLLAAHDTCFGSLATLDDSGQLYCPVCFYTKATEAYQKAKKTYSEARKNLSAFLGRKQ 654

Query: 1147 LVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDN--HISTRVTEYRQQEKA 974
            L    QQ  + + A N E        C D       H+   +N  H     T  R+++K 
Sbjct: 655  LAEQHQQAAVGQRAANNEDHLN---GCNDASKRKDNHQSEGNNLSHRDEDPTRKRKKQKT 711

Query: 973  SIHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCS 794
                   N      A  + +E+   P+VQ  + A +    NKH           VA    
Sbjct: 712  -------NATSDACAQEVVTEK--VPVVQNSDVAPM----NKHSVLQNNRKQAQVA---- 754

Query: 793  NDYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLP 614
             ++   +E E +     +D S                      H+    S T C      
Sbjct: 755  -EHEQPEENEEASGESGNDNSL---------------------HKTTHSSQTKC------ 786

Query: 613  CREAKTSFDSESIDEKEDAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
                     S ++++  DA +        NG    +++E     ++ +I     +DS   
Sbjct: 787  ---------SPAVNQNVDADKE-------NGLASSQQSE-----DSDEIEATSSNDSTKK 825

Query: 433  EKRPRVKRKHAM-IQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSN 257
               P  K +H   I Q  D   P+     N+ +   N++                   SN
Sbjct: 826  SSPPWRKLRHRKAIYQDKDTAMPS-----NSKKVLGNRDQHMASPSRKRNYACPPKRYSN 880

Query: 256  PLMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKD 77
            P++P  RRTKL WT +EE  L+EA+ KF+    G +PW +I++ G  VFH+TR P DL+ 
Sbjct: 881  PIVPAGRRTKLCWTEKEEITLREAMAKFTPRDNGPIPWVQILEHGRDVFHRTRLPSDLRV 940

Query: 76   KWRNIMIKEGA 44
            KWRN+  K G+
Sbjct: 941  KWRNMKKKSGS 951


>ref|XP_007049460.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|590712773|ref|XP_007049461.1| Uncharacterized protein
            isoform 4 [Theobroma cacao] gi|508701721|gb|EOX93617.1|
            Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508701722|gb|EOX93618.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 361

 Score =  110 bits (275), Expect = 2e-21
 Identities = 96/406 (23%), Positives = 157/406 (38%)
 Frame = -3

Query: 1255 FYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCRDLVHPKQQRQLPEGANNKESEAARN 1076
            FYCP+C Y R                  LS+F+C       ++ Q+ E    K +  +  
Sbjct: 13   FYCPYCWYKRELVRTKELRRKAMLARKELSNFICLKRDGGNEEMQVDETETMKAASVSTM 72

Query: 1075 ANCPDKIHANKQHREIIDNHISTRVTEYRQQEKASIHLSDNNLPCKEAATSLDSERNGAP 896
            A    KI+       + D +               IH      P  E+ +  D ERN   
Sbjct: 73   AG---KINTGDSENGLNDKN------------NERIHHDQEETPGVESISKSDEERNSRA 117

Query: 895  LVQEREGASITVDSNKHKTAVEPHHDVVVATFCSNDYLPCKEVETSVIMKSHDVSSIDVE 716
               E  G    +             D+  A+   +D +   + +   I  SH    +++E
Sbjct: 118  RGSENFGDGERIQDE----------DIENASDSEDDEIDEDQWQIQPISSSH----LEIE 163

Query: 715  RNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPCREAKTSFDSESIDEKEDAAEMVDNQ 536
            +  + +         G  ++ KE P L N  ++    A  + D  S     ++ E V   
Sbjct: 164  KGALPVSTKETSDNVGVLEENKEEPVLPN--AVGTTMALITSDCTSKVPAIESFEFV--- 218

Query: 535  RHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAAEKRPRVKRKHAMIQQTSDPETPASQP 356
                                  +P +  +     +KR +   +    Q+   P+ P+S+P
Sbjct: 219  ----------------------LPDLNTETLVVRQKRVKRTAQKEWPQKVDSPKMPSSEP 256

Query: 355  SVNADETARNQNAEDFXXXXXXXXXXXXXXXSNPLMPHSRRTKLLWTSEEEAVLKEAVLK 176
            S +A +   NQ  +                  +  +   +R +L WT+EEE +LKE V +
Sbjct: 257  STSAKDKKMNQQGKATAAKNSVQCQELNKRFVSSKLGTEKRRRLHWTAEEEDMLKEGVRR 316

Query: 175  FSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKDKWRNIMIKEGAPR 38
            FS     ++PW KI++FG+ VFH TRTP DLKDKW+NI+ KE AP+
Sbjct: 317  FSSIVNKNIPWRKILEFGHHVFHSTRTPVDLKDKWKNIIAKE-APK 361


>ref|XP_010036112.1| PREDICTED: uncharacterized protein LOC104425191 [Eucalyptus grandis]
            gi|629081183|gb|KCW47628.1| hypothetical protein
            EUGRSUZ_K01371 [Eucalyptus grandis]
          Length = 492

 Score =  107 bits (268), Expect = 2e-20
 Identities = 104/424 (24%), Positives = 173/424 (40%), Gaps = 2/424 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S  SCP+A+HE CL     F+   +FYCP+CSY R                 ALS+F+  
Sbjct: 116  SEPSCPIAIHEKCLSCKPQFDHLGNFYCPYCSYKRVVAKVHELRRKAMLRKEALSNFLDN 175

Query: 1150 DLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
             +V   Q+ Q+  G + +       +   D  H +         H S  V     + +A 
Sbjct: 176  GVVDKGQREQI-SGVDKRGDLDLSPSPGHDSCHDDVSKENYEGQHQSDPVNPVYSKHEAL 234

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCSN 791
            +    + L     A S+  +RN   LV E EG    +   + + A + H           
Sbjct: 235  VENQTDTL-----APSVHVQRN---LVYE-EG----LQEGESREATDNH----------- 270

Query: 790  DYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPC 611
                CKEV      K+        +    E  +D+  ++  E ++ ++S    +  S   
Sbjct: 271  ----CKEVYDQE--KAETADDHTSKEGAAEAAEDAVLSKSSEGKRHRKSIRRSHGLSRKR 324

Query: 610  REAKTSFDSESIDEKE-DAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
            +  K      ++   + D A  V+ + +M        AE G  +N           SP+A
Sbjct: 325  KRRKVQSGDATVSSIQGDFATDVNPEANMVDQVCDSDAEAGFMNNG---------HSPSA 375

Query: 433  EKRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSNP 254
            + + +     A +++   P   +S+P +      +   AE                   P
Sbjct: 376  KPQGK-----APVEEVVSPRNGSSEPGITITNQGKATLAE-----MSRQQRQSPRKLPKP 425

Query: 253  LMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKDK 74
             +P+ +R +L W+ EEE +LKE V  FS     ++PW KI+++G +VF+ +RTP DLKDK
Sbjct: 426  PLPNVKR-RLRWSPEEEEILKEGVQLFSSNANKNIPWRKILEYGCRVFNPSRTPVDLKDK 484

Query: 73   WRNI 62
            W+NI
Sbjct: 485  WKNI 488


>gb|KCW47629.1| hypothetical protein EUGRSUZ_K01371 [Eucalyptus grandis]
          Length = 485

 Score =  105 bits (261), Expect = 1e-19
 Identities = 103/424 (24%), Positives = 172/424 (40%), Gaps = 2/424 (0%)
 Frame = -3

Query: 1327 SGSSCPMAVHEDCLGSLASFEG-SDFYCPFCSYTRADXXXXXXXXXXXXXXXALSSFMCR 1151
            S  SCP+A+HE CL     F+   +FYCP+CSY R                 ALS+F+  
Sbjct: 109  SEPSCPIAIHEKCLSCKPQFDHLGNFYCPYCSYKRVVAKVHELRRKAMLRKEALSNFLDN 168

Query: 1150 DLVHPKQQRQLPEGANNKESEAARNANCPDKIHANKQHREIIDNHISTRVTEYRQQEKAS 971
             +V   Q+ Q+  G + +       +   D  H +         H S  V     + +A 
Sbjct: 169  GVVDKGQREQI-SGVDKRGDLDLSPSPGHDSCHDDVSKENYEGQHQSDPVNPVYSKHEAL 227

Query: 970  IHLSDNNLPCKEAATSLDSERNGAPLVQEREGASITVDSNKHKTAVEPHHDVVVATFCSN 791
            +    + L     A S+  +RN   LV E EG    +   + + A + H           
Sbjct: 228  VENQTDTL-----APSVHVQRN---LVYE-EG----LQEGESREATDNH----------- 263

Query: 790  DYLPCKEVETSVIMKSHDVSSIDVERNCVEIVKDSQCTREGEHQQQKESPTLCNNESLPC 611
                CKEV      K+        +    E  +D+  ++  E ++ ++S    +  S   
Sbjct: 264  ----CKEVYDQE--KAETADDHTSKEGAAEAAEDAVLSKSSEGKRHRKSIRRSHGLSRKR 317

Query: 610  REAKTSFDSESIDEKE-DAAEMVDNQRHMNGSTQQRRAETGTNSNAGDIPCIQVDDSPAA 434
            +  K      ++   + D A  V+ + +M        AE G  +N           SP+A
Sbjct: 318  KRRKVQSGDATVSSIQGDFATDVNPEANMVDQVCDSDAEAGFMNNG---------HSPSA 368

Query: 433  EKRPRVKRKHAMIQQTSDPETPASQPSVNADETARNQNAEDFXXXXXXXXXXXXXXXSNP 254
            + + +     A +++   P   +S+P +      +   AE                   P
Sbjct: 369  KPQGK-----APVEEVVSPRNGSSEPGITITNQGKATLAE-----MSRQQRQSPRKLPKP 418

Query: 253  LMPHSRRTKLLWTSEEEAVLKEAVLKFSEPGQGSLPWTKIMDFGNKVFHKTRTPGDLKDK 74
             +P+ +R +L W+ EEE +LK  V  FS     ++PW KI+++G +VF+ +RTP DLKDK
Sbjct: 419  PLPNVKR-RLRWSPEEEEILKARVQLFSSNANKNIPWRKILEYGCRVFNPSRTPVDLKDK 477

Query: 73   WRNI 62
            W+NI
Sbjct: 478  WKNI 481

