BLASTX nr result

ID: Akebia27_contig00005148 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00005148
         (1568 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]              348   3e-93
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...   327   7e-87
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...   327   7e-87
ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prun...   313   1e-82
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...   304   8e-80
ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma...   299   3e-78
ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma...   299   3e-78
ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma...   291   4e-76
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...   285   5e-74
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]   278   5e-72
ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [A...   275   4e-71
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...   273   2e-70
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...   268   4e-69
ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227...   266   2e-68
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...   266   2e-68
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...   266   2e-68
ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phas...   263   2e-67
ref|XP_006591981.1| PREDICTED: histone-lysine N-methyltransferas...   254   1e-64
ref|XP_006591980.1| PREDICTED: histone-lysine N-methyltransferas...   254   1e-64
ref|XP_006591977.1| PREDICTED: histone-lysine N-methyltransferas...   254   1e-64

>emb|CBI16022.3| unnamed protein product [Vitis vinifera]
          Length = 1669

 Score =  348 bits (894), Expect = 3e-93
 Identities = 209/424 (49%), Positives = 248/424 (58%), Gaps = 52/424 (12%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDR--------- 153
            FK  P EP RR  DH +F EDLK+F R  HLDS+ V KF +Y+SSSRP DR         
Sbjct: 1252 FKSLP-EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDA 1310

Query: 154  -------VPPGFSHEVGPKLDGSASGAASRYLPPYQPGG----LRPVGPLDDNMRRKTDS 300
                    P GF+++ G K   SA    SR+ PP  PGG     R VG  +DN+ R +D 
Sbjct: 1311 AQGLLDKAPLGFNYDSGFK--SSAGTGTSRFFPPPHPGGDGERSRAVGFHEDNVGR-SDM 1367

Query: 301  IGVHPDFLRNASEPGRHRMDGLPPLRSPGREYHS-------------SRFGPPEDIDVRE 441
               HP+FL +  E GRH MDGL P RSP RE+                R    +DID RE
Sbjct: 1368 ARTHPNFLGSVPEYGRHHMDGLNP-RSPTREFSGIPHRGFGGLSGVPGRQSDLDDIDGRE 1426

Query: 442  SHVFGERGVPFKLSSDGNAFHESRFPTLPSHLRRSELDGPG---------------NLRM 576
            S  FGE    F L SD     ESRFP LPSHLRR EL+GPG               +LR 
Sbjct: 1427 SRRFGEGSKTFNLPSD-----ESRFPVLPSHLRRGELEGPGELVMADPIASRPAPHHLRG 1481

Query: 577  GEKIGSGALPVHFRSGE---PHNLPGHLRMGEPAGFGAFPNHLRAGEVGGPRNLPSNLRI 747
            G+ IG   LP H + GE     N+PG LR GEP  F AF  H R GE+ GP N PS L  
Sbjct: 1482 GDLIGQDILPSHLQRGEHFGSRNIPGQLRFGEPV-FDAFLGHPRMGELSGPGNFPSRLSA 1540

Query: 748  GDSIGGK-LHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVESFDQPRKRKSGTMGWCRI 924
            G+S GG    G  R+GEP F S++ +HGYPND GF   GD+ESFD  RKRK  +M WCRI
Sbjct: 1541 GESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRI 1600

Query: 925  CKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLSSDDHVSHEDANKSRKASF 1104
            C +DCETV+GLDMHSQTREHQ+MAMD+VLSIK+ NAKKQKL+S DH + ED++KS+K   
Sbjct: 1601 CNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQNAKKQKLTSKDHSTPEDSSKSKKGVL 1660

Query: 1105 ESHG 1116
               G
Sbjct: 1661 RGGG 1664


>ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            15-like isoform X1 [Citrus sinensis]
            gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 15-like isoform X3 [Citrus sinensis]
            gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X4 [Citrus sinensis]
          Length = 1392

 Score =  327 bits (839), Expect = 7e-87
 Identities = 192/392 (48%), Positives = 229/392 (58%), Gaps = 22/392 (5%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFPV+P+R ++D  EFEEDLK+F RP HLD+E V K  S++  SRPFDR P G+  ++GP
Sbjct: 1038 PFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGP 1097

Query: 187  -------------KLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLR 327
                         KLD   + A SR+LP Y            D+   ++DS   HPDF R
Sbjct: 1098 RPFERGLSYDPGLKLDPMGASAPSRFLPAYH-----------DDAAGRSDSSHAHPDFPR 1146

Query: 328  NASEPGRHRMDGLPPLRSPGREYHSSRFGPP---------EDIDVRESHVFGERGVPFKL 480
                 GR  M GL P RS  RE+      P          EDI  RE   FG+       
Sbjct: 1147 PGRAYGRRHMGGLSP-RSSFREFCGFGGLPGSLGGSRSVREDIGGREFRRFGD------- 1198

Query: 481  SSDGNAFHESRFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMG 660
               GN+FH+SRFP LPSHLRR E +GPG  R G+ IG   LP H R GEP   P +LR+G
Sbjct: 1199 -PIGNSFHDSRFPVLPSHLRRGEFEGPG--RTGDLIGQEFLPSHLRRGEPLG-PHNLRLG 1254

Query: 661  EPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPND 840
            E  G G FP   R  E+GGP N P                 R+GEP F SSF   G+PND
Sbjct: 1255 ETVGLGGFPGPARMEELGGPGNFPPP---------------RLGEPGFRSSFSRQGFPND 1299

Query: 841  SGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIK 1020
             GF+  GD+ES D  RKRK  +MGWCRICKVDCETV+GLD+HSQTREHQKMAMDMVLSIK
Sbjct: 1300 GGFYT-GDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDLHSQTREHQKMAMDMVLSIK 1358

Query: 1021 KDNAKKQKLSSDDHVSHEDANKSRKASFESHG 1116
            + NAKKQKL+S D  S +DANKSR  +F+  G
Sbjct: 1359 Q-NAKKQKLTSGDRCSTDDANKSRNVNFDGRG 1389


>ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina]
            gi|557526921|gb|ESR38227.1| hypothetical protein
            CICLE_v10027683mg [Citrus clementina]
          Length = 1392

 Score =  327 bits (839), Expect = 7e-87
 Identities = 192/392 (48%), Positives = 229/392 (58%), Gaps = 22/392 (5%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFPV+P+R ++D  EFEEDLK+F RP HLD+E V K  S++  SRPFDR P G+  ++GP
Sbjct: 1038 PFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGP 1097

Query: 187  -------------KLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLR 327
                         KLD   + A SR+LP Y            D+   ++DS   HPDF R
Sbjct: 1098 RPFERGLSYDPGLKLDPMGASAPSRFLPAYH-----------DDAAGRSDSSHAHPDFPR 1146

Query: 328  NASEPGRHRMDGLPPLRSPGREYHSSRFGPP---------EDIDVRESHVFGERGVPFKL 480
                 GR  M GL P RS  RE+      P          EDI  RE   FG+       
Sbjct: 1147 PGRAYGRRHMGGLSP-RSSFREFCGFGGLPGSLGGSRSVREDIGGREFRRFGD------- 1198

Query: 481  SSDGNAFHESRFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMG 660
               GN+FH+SRFP LPSHLRR E +GPG  R G+ IG   LP H R GEP   P +LR+G
Sbjct: 1199 -PIGNSFHDSRFPVLPSHLRRGEFEGPG--RTGDLIGQEFLPSHLRRGEPLG-PHNLRLG 1254

Query: 661  EPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPND 840
            E  G G FP   R  E+GGP N P                 R+GEP F SSF   G+PND
Sbjct: 1255 ETVGLGGFPGPARMEELGGPGNFPPP---------------RLGEPGFRSSFSHQGFPND 1299

Query: 841  SGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIK 1020
             GF+  GD+ES D  RKRK  +MGWCRICKVDCETV+GLD+HSQTREHQKMAMDMVLSIK
Sbjct: 1300 GGFYT-GDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDLHSQTREHQKMAMDMVLSIK 1358

Query: 1021 KDNAKKQKLSSDDHVSHEDANKSRKASFESHG 1116
            + NAKKQKL+S D  S +DANKSR  +F+  G
Sbjct: 1359 Q-NAKKQKLTSGDRCSTDDANKSRNVNFDGRG 1389


>ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica]
            gi|462400592|gb|EMJ06149.1| hypothetical protein
            PRUPE_ppa000292mg [Prunus persica]
          Length = 1334

 Score =  313 bits (802), Expect = 1e-82
 Identities = 188/375 (50%), Positives = 226/375 (60%), Gaps = 13/375 (3%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFPV+P+R ++D  EFE+DLK+FPRP +LDSE V KF +Y  SSRPFDR P GF ++ GP
Sbjct: 985  PFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAKFGNY--SSRPFDRAPHGFKYDSGP 1042

Query: 187  KLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMDGL 366
              D  A  A SR+L PY+ GG   V   D     + +    HPDF+      GR  +DGL
Sbjct: 1043 HTDPLAGTAPSRFLSPYRLGG--SVHGNDAGDFGRMEPTHGHPDFV------GRRLVDGL 1094

Query: 367  PPLRSPGREY-----HSSRFGPPEDIDVRESHVFGERGVPFKLSSDGNAFHESRFPTLPS 531
             P RSP R+Y     H  R   P+D D RE H FG+   P      GN FHE RF  LP 
Sbjct: 1095 AP-RSPVRDYPGLPPHGFRGFGPDDFDGREFHRFGD---PL-----GNQFHEGRFSNLPG 1145

Query: 532  HLRRSELDGPGNLRM-----GEKIGSGALPVHFRSGE---PHNLPGHLRMGEPAGFGAFP 687
            H RR E +GPGNLRM      + IG    P H R G+   PHNL       EP GFG+  
Sbjct: 1146 HFRRGEFEGPGNLRMVDHRRNDFIGQDGHPGHLRRGDHLGPHNLR------EPLGFGSRH 1199

Query: 688  NHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDV 867
            +H+  G++ GP N        +   G      R+GEP F SSF +  +PND  +   GD+
Sbjct: 1200 SHM--GDMAGPGNF-------EPFRGNRPNHPRLGEPGFRSSFSLQRFPNDGTY--TGDL 1248

Query: 868  ESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKL 1047
            ESFD  RKRK  +MGWCRICKVDCETVEGLD+HSQTREHQKMAMDMV SIK+ NAKKQKL
Sbjct: 1249 ESFDHSRKRKPASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVRSIKQ-NAKKQKL 1307

Query: 1048 SSDDHVSHEDANKSR 1092
            +S D    EDANKS+
Sbjct: 1308 TSGDQSLLEDANKSK 1322


>ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis]
            gi|223540292|gb|EEF41863.1| hypothetical protein
            RCOM_0731250 [Ricinus communis]
          Length = 1329

 Score =  304 bits (778), Expect = 8e-80
 Identities = 183/393 (46%), Positives = 229/393 (58%), Gaps = 22/393 (5%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVP-----PGFS 171
            PFP +PS+RIVD REFEEDLK F RP  LD++   KF + +SSSRP DR P      G +
Sbjct: 976  PFPKDPSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGPLDKGLHGPN 1035

Query: 172  HEVGPKLDGSASGAASRYLPPYQPGGL--------RPVGPLDDNMRRKTDSIGVHPDFLR 327
            ++ G KL+       SR+ PPY   GL        R +G  D+ + R+ DS+  HP+F  
Sbjct: 1036 YDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIGFHDNTLGRQPDSVRAHPEFFG 1095

Query: 328  NASEPGRHRMDGLPPLRSPGREYH--SSR-FGPP---EDIDVRESHVFGERGVPFKLSSD 489
                  R   DG+ P RSPGR+Y   SSR FG     +DID RES  FG+          
Sbjct: 1096 PGRRYDRRHRDGMAP-RSPGRDYPGVSSRGFGAIPGLDDIDGRESRRFGD---------- 1144

Query: 490  GNAFHESRFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEP---HNLPGHLRMG 660
              +FH SRFP LPSH+R  E +GP                HFR GE    HN+    R+G
Sbjct: 1145 --SFHGSRFPVLPSHMRMGEFEGPSQ---------DGFSNHFRRGEHLGHHNMRN--RLG 1191

Query: 661  EPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPND 840
            EP GFGAFP     G++ G  N  +                R+GEP F SSF   G+P D
Sbjct: 1192 EPIGFGAFPGPAGMGDLSGTGNFFNP---------------RLGEPGFRSSFSFKGFPGD 1236

Query: 841  SGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIK 1020
             G + AG++ESFD  R+RKS +MGWCRICKVDCETVEGLD+HSQTREHQK AMDMV++IK
Sbjct: 1237 GGIY-AGELESFDNSRRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKRAMDMVVTIK 1295

Query: 1021 KDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            + NAKKQKL+++DH S +DA+KS+  S E  GN
Sbjct: 1296 Q-NAKKQKLANNDHSSVDDASKSKNTSIEGRGN 1327


>ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508786600|gb|EOY33856.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 975

 Score =  299 bits (765), Expect = 3e-78
 Identities = 181/394 (45%), Positives = 215/394 (54%), Gaps = 24/394 (6%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++   R  D  +FEEDLK FPRP HLD+E V KF SY SSSRP DR P GF  ++GP+
Sbjct: 625  FPLDRGHR-GDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPR 683

Query: 190  ----------LDGSASGAASRYLPPYQPG--GLRPVGPLDDNMRRKTDSIGVHPDFLRNA 333
                       D       SR+LPPY P   G RPVG   D + R        PDFL   
Sbjct: 684  AQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGERPVGLPKDTLGR--------PDFLGTV 735

Query: 334  SEPGRHRMDGLPPLRSPGREY-----HSSRFGPPEDIDVRESHVFGERGVPFKLSSDGNA 498
               GRHRMDG    RSPGREY     H     P ++ID RE                   
Sbjct: 736  PSYGRHRMDGFVS-RSPGREYPGISPHGFGGHPGDEIDGRERRF---------------- 778

Query: 499  FHESRFPTLPSHLRRSELDGPG----NLRMGEKIGSGALPVHFRSGEP---HNLPGHLRM 657
                RFP LP HL R   +       +LR  + I     P +FR GE    HN+PGHLR+
Sbjct: 779  --SDRFPGLPGHLHRGGFESSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRL 836

Query: 658  GEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPN 837
            GEP GFG F +H R GE GGP N                   R+GEP F SSF +  +PN
Sbjct: 837  GEPIGFGDFSSHERIGEFGGPGNFR---------------HPRLGEPGFRSSFSLQEFPN 881

Query: 838  DSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSI 1017
            D G +  G ++SF+  RKRK  +MGWCRICK+DCETVEGLD+HSQTREHQKMAMDMV++I
Sbjct: 882  DGGIYTGG-MDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTREHQKMAMDMVVTI 940

Query: 1018 KKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            K+ NAKKQKL+S DH    D +KS+   FE   N
Sbjct: 941  KQ-NAKKQKLTSSDHSIRNDTSKSKNVKFEGRVN 973


>ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590588563|ref|XP_007016233.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
            gi|590588573|ref|XP_007016234.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786595|gb|EOY33851.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786596|gb|EOY33852.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  299 bits (765), Expect = 3e-78
 Identities = 181/394 (45%), Positives = 215/394 (54%), Gaps = 24/394 (6%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++   R  D  +FEEDLK FPRP HLD+E V KF SY SSSRP DR P GF  ++GP+
Sbjct: 1058 FPLDRGHR-GDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPR 1116

Query: 190  ----------LDGSASGAASRYLPPYQPG--GLRPVGPLDDNMRRKTDSIGVHPDFLRNA 333
                       D       SR+LPPY P   G RPVG   D + R        PDFL   
Sbjct: 1117 AQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGERPVGLPKDTLGR--------PDFLGTV 1168

Query: 334  SEPGRHRMDGLPPLRSPGREY-----HSSRFGPPEDIDVRESHVFGERGVPFKLSSDGNA 498
               GRHRMDG    RSPGREY     H     P ++ID RE                   
Sbjct: 1169 PSYGRHRMDGFVS-RSPGREYPGISPHGFGGHPGDEIDGRERRF---------------- 1211

Query: 499  FHESRFPTLPSHLRRSELDGPG----NLRMGEKIGSGALPVHFRSGEP---HNLPGHLRM 657
                RFP LP HL R   +       +LR  + I     P +FR GE    HN+PGHLR+
Sbjct: 1212 --SDRFPGLPGHLHRGGFESSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRL 1269

Query: 658  GEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPN 837
            GEP GFG F +H R GE GGP N                   R+GEP F SSF +  +PN
Sbjct: 1270 GEPIGFGDFSSHERIGEFGGPGNFR---------------HPRLGEPGFRSSFSLQEFPN 1314

Query: 838  DSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSI 1017
            D G +  G ++SF+  RKRK  +MGWCRICK+DCETVEGLD+HSQTREHQKMAMDMV++I
Sbjct: 1315 DGGIYTGG-MDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTREHQKMAMDMVVTI 1373

Query: 1018 KKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            K+ NAKKQKL+S DH    D +KS+   FE   N
Sbjct: 1374 KQ-NAKKQKLTSSDHSIRNDTSKSKNVKFEGRVN 1406


>ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma cacao]
            gi|508786601|gb|EOY33857.1| Uncharacterized protein
            isoform 8 [Theobroma cacao]
          Length = 972

 Score =  291 bits (746), Expect = 4e-76
 Identities = 180/394 (45%), Positives = 213/394 (54%), Gaps = 24/394 (6%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++   R  D  +FEEDLK FPRP HLD+E V KF SY SSSRP DR P GF  ++GP+
Sbjct: 625  FPLDRGHR-GDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPR 683

Query: 190  ----------LDGSASGAASRYLPPYQPG--GLRPVGPLDDNMRRKTDSIGVHPDFLRNA 333
                       D       SR+LPPY P   G RPVG   D + R        PDFL   
Sbjct: 684  AQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGERPVGLPKDTLGR--------PDFLGTV 735

Query: 334  SEPGRHRMDGLPPLRSPGREY-----HSSRFGPPEDIDVRESHVFGERGVPFKLSSDGNA 498
               GRHRMDG    RSPGREY     H     P ++ID RE                   
Sbjct: 736  PSYGRHRMDGFVS-RSPGREYPGISPHGFGGHPGDEIDGRERRF---------------- 778

Query: 499  FHESRFPTLPSHLRRSELDGPG----NLRMGEKIGSGALPVHFRSGEP---HNLPGHLRM 657
                RFP LP HL R   +       +LR  + I     P +FR GE    HN+PGHLR+
Sbjct: 779  --SDRFPGLPGHLHRGGFESSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRL 836

Query: 658  GEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPN 837
            GEP GFG F +H R GE GGP N                   R+GEP F SSF +  +PN
Sbjct: 837  GEPIGFGDFSSHERIGEFGGPGNFR---------------HPRLGEPGFRSSFSLQEFPN 881

Query: 838  DSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSI 1017
            D G +  G ++SF+  RKRK  +MGWCRICK+DCETVEGLD+HSQTREHQKMAMDMV++I
Sbjct: 882  DGGIYTGG-MDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTREHQKMAMDMVVTI 940

Query: 1018 KKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            K+ NAKKQKL   DH    D +KS+   FE   N
Sbjct: 941  KQ-NAKKQKL---DHSIRNDTSKSKNVKFEGRVN 970


>ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa]
            gi|550331020|gb|ERP56830.1| hypothetical protein
            POPTR_0009s04520g [Populus trichocarpa]
          Length = 1315

 Score =  285 bits (728), Expect = 5e-74
 Identities = 189/410 (46%), Positives = 225/410 (54%), Gaps = 39/410 (9%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFP  P+   V   EFEEDLK FPRP HLD+E V K  S++ SSRP DR P GF  +  P
Sbjct: 941  PFPRGPAHHNVHQGEFEEDLKHFPRPSHLDTEPVPKSSSHFPSSRPLDRGPRGFGVDGAP 1000

Query: 187  K-LDGSASG---------------AASRYLPPYQPGGLRPVGPLD--------DNMRRKT 294
            + LD  + G               A  R+ PPY     + + P D        D++  ++
Sbjct: 1001 RPLDKGSHGFNYDSGLNMEPLGGSAPPRFFPPYHHD--KALHPSDAEVSLGYHDSLAGRS 1058

Query: 295  DSIGVHPDFLRNASEPGRHR-MDGLPPLRSPGREYH---SSRFGPP---EDIDVRESHVF 453
            D     P FL        HR MD L P RSP R+Y    + RFG     +DID R+ H F
Sbjct: 1059 DFARTRPGFLGPPIPGYDHRHMDNLAP-RSPVRDYPGMPTRRFGALPGLDDIDGRDPHRF 1117

Query: 454  GERGVPFKLSSDGNAFHESRFPTLPSHLRRSELDGPGNLRMGEKI-----GSGALPVHFR 618
            G+     K SS   +  +SRFP  PSHLRR EL+GPGNL MGE +     G    P H R
Sbjct: 1118 GD-----KFSS---SLRDSRFPVFPSHLRRGELEGPGNLHMGEHLSGDLMGHDGRPAHLR 1169

Query: 619  SGE---PHNLPGHLRMGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRM 789
             GE   P NLP HL +GEP  FGAFP H R GE+ GP N               H Q  +
Sbjct: 1170 RGEHLGPRNLPSHLWVGEPGNFGAFPGHARMGELAGPGNF-------------YHHQ--L 1214

Query: 790  GEPEFNSSFPIHGYPNDSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHS 969
            GEP F SSF         G   AGD++ FD  RKRK  +MGWCRICKVDCETVE LD+HS
Sbjct: 1215 GEPGFRSSF---------GGNYAGDLQFFDNSRKRKP-SMGWCRICKVDCETVEALDLHS 1264

Query: 970  QTREHQKMAMDMVLSIKKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            QTREHQKMA+DMV++IK+ NAKK K +   H S ED +KSR ASFE  GN
Sbjct: 1265 QTREHQKMALDMVVTIKQ-NAKKHKSTPCHHSSLEDKSKSRNASFEGRGN 1313


>emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]
          Length = 1131

 Score =  278 bits (711), Expect = 5e-72
 Identities = 171/373 (45%), Positives = 204/373 (54%), Gaps = 1/373 (0%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            FK  P EP RR  DH +F EDLK+F R  HLDS+ V KF +Y+SSSRP DR   GF  + 
Sbjct: 823  FKSLP-EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDA 881

Query: 181  GPKLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMD 360
                                  GL    PL  N          +    ++++  G  R  
Sbjct: 882  AQ--------------------GLLDKAPLGFN----------YDSGFKSSAGTGTSRQS 911

Query: 361  GLPPLRSPGREYHSSRFGPPEDIDVRESHVFGERGVPFKLSSDGNAFHESRFPTLPSHLR 540
             L                  +DID RES  FGE    F L SD     ESRFP LPSHLR
Sbjct: 912  DL------------------DDIDGRESRRFGEGYQTFNLPSD-----ESRFPVLPSHLR 948

Query: 541  RSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMGEPAGFGAFPNHLRAGEVGGP 720
            R  L  P +L+ GE  GS             N+PG LR GEP  F AF  H R GE+ GP
Sbjct: 949  RDIL--PSHLQRGEHFGS------------RNIPGQLRFGEPV-FDAFLGHPRMGELSGP 993

Query: 721  RNLPSNLRIGDSIGGK-LHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVESFDQPRKRK 897
             N PS L  G+S GG    G  R+GEP F S++ +HGYPND GF   GD+ESFD  RKRK
Sbjct: 994  GNFPSRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRK 1053

Query: 898  SGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLSSDDHVSHED 1077
              +M WCRIC +DCETV+GLDMHSQTREHQ+MAMD+VLSIK+ NAKKQKL+S DH + ED
Sbjct: 1054 PLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQNAKKQKLTSKDHSTPED 1113

Query: 1078 ANKSRKASFESHG 1116
            ++KS+K      G
Sbjct: 1114 SSKSKKGVLRGGG 1126


>ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda]
            gi|548851351|gb|ERN09627.1| hypothetical protein
            AMTR_s00029p00190880 [Amborella trichopoda]
          Length = 1626

 Score =  275 bits (703), Expect = 4e-71
 Identities = 175/405 (43%), Positives = 219/405 (54%), Gaps = 32/405 (7%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            F+P+ ++PSRR +D REFEEDLKKFPR GHLD E   +++ Y+SS  P    P       
Sbjct: 1251 FRPYALDPSRRAIDRREFEEDLKKFPRSGHLDGEPASRYDGYFSSRNPSGHSPRSLERP- 1309

Query: 181  GPKLDGSASGAASRY-----LPPYQPGG---------LRPVGPLDDNMRRKTDSIGVHPD 318
            G  LD      A RY     +PPY+  G          +P G   D + RK D+ G   D
Sbjct: 1310 GLNLD------APRYPEGMSVPPYRGAGGSSLDLGDRSKPGGFHGDLIGRKLDTTGARSD 1363

Query: 319  FLRNASEPGRHRMDGLPPLRSPGREYHSSRFG-----------PPEDIDVRESHVFGERG 465
            +     E  R   DGL P RSP R+Y   R             P + +  RE   FGE+ 
Sbjct: 1364 YGGPFPEVSRSHRDGLGPPRSPVRDYAGVRVSGVRPDYAGIPHPLDGLGGREPLGFGEQR 1423

Query: 466  VPFKLSSDGNAFHESRFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPG 645
                L    +  H  + P+ P   R      P   R+ E  G G  P H R G+P   P 
Sbjct: 1424 ARAFL----DPIHGGKIPSGPFESRL-----PIPSRIAESAGFGDFPGHLRGGDPFG-PS 1473

Query: 646  HLRMGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIH 825
            H R GE       P+HLR  E+ G  NLP +LRIG+++G   H    + EP F     + 
Sbjct: 1474 HFRSGE------LPSHLRGRELAGSGNLPPHLRIGEAMGPGGH----LREPGFG----MQ 1519

Query: 826  GYPNDSGFFNAG-----DVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQK 990
            GYP D GF+N G     DV++ +  RKRK G+ GWCRICKVDCETVEGLD+HSQTREHQK
Sbjct: 1520 GYPKDGGFYNPGSFPPSDVDALEYSRKRKPGSTGWCRICKVDCETVEGLDLHSQTREHQK 1579

Query: 991  MAMDMVLSIKKDNAKKQKL--SSDDHVSHEDANKSRKASFESHGN 1119
            MAMDMVLSIK+D+AKKQKL  SS+DHV  E+  K R+ASFES G+
Sbjct: 1580 MAMDMVLSIKQDSAKKQKLYGSSEDHVPQEEPTKGRRASFESRGS 1624


>ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca
            subsp. vesca]
          Length = 1316

 Score =  273 bits (698), Expect = 2e-70
 Identities = 178/380 (46%), Positives = 219/380 (57%), Gaps = 12/380 (3%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFP +P+R ++    FE+DLK+FPRP  LDSE + K  +Y  SSR FDR P G +++   
Sbjct: 980  PFPGDPTR-VISRVGFEDDLKQFPRPSFLDSEPLPKLGNY--SSRAFDRRPFGVNYDTRL 1036

Query: 187  KLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMDGL 366
             +D  A+G+A R+L PY   GL              D+IG HPDF       GR  MDGL
Sbjct: 1037 NID-PAAGSAPRFLSPYGHAGL----------IHANDTIG-HPDF------GGRRLMDGL 1078

Query: 367  PPLRSPGREYHS--SRFG--PPEDIDVRESHVFGERGVPFKLSSDGNAFHESRFPTLPSH 534
               RSP R+Y    SRF    P+D D RE H FG+   P      G  FH++RFP    H
Sbjct: 1079 -ARRSPIRDYPGIPSRFRGFGPDDFDGREFHRFGD---PL-----GREFHDNRFPN--QH 1127

Query: 535  LRRSELDGPGNLRMGEK-----IGSGALPVHFRSGE---PHNLPGHLRMGEPAGFGAFPN 690
             RR E +GPGN+R+ ++     IG      H + GE   PHNLPGHL M E  GFG  P 
Sbjct: 1128 FRRGEFEGPGNMRVDDRMRNDLIGQDGHLGHLQRGEHLGPHNLPGHLHMREHVGFGVHPR 1187

Query: 691  HLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVE 870
            H   G               +S  G      R+GEP F SSF +  +PND  +  AG++E
Sbjct: 1188 HAGPGSF-------------ESFIGNRANHPRLGEPGFRSSFSLKRFPNDGTY--AGELE 1232

Query: 871  SFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLS 1050
            SFD  RKRK  +MGWCRICKV+CETVEGLD+HSQTREHQ+MAM+MV  I K NAKKQKL+
Sbjct: 1233 SFDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMV-QIIKQNAKKQKLT 1291

Query: 1051 SDDHVSHEDANKSRKASFES 1110
            S D  S EDANKS+  S ES
Sbjct: 1292 SGDQSSIEDANKSKITSSES 1311


>ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa]
            gi|222845587|gb|EEE83134.1| hypothetical protein
            POPTR_0001s25430g [Populus trichocarpa]
          Length = 1327

 Score =  268 bits (686), Expect = 4e-69
 Identities = 180/405 (44%), Positives = 211/405 (52%), Gaps = 34/405 (8%)
 Frame = +1

Query: 7    PFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGP 186
            PFP +P+RR     EFEEDLK F  P  LD++ V K   ++SSSRP DR P GF  +  P
Sbjct: 953  PFPHDPARRNAHQGEFEEDLKHFTAPSCLDTKPVPKSGGHFSSSRPLDRGPHGFGVDGAP 1012

Query: 187  K-LDGSASG---------------AASRYLPPYQPGGL----RPVGPLD--DNMRRKTDS 300
            K LD  + G               A  R+ PP             G L   DN+  +TD 
Sbjct: 1013 KHLDKGSHGLNYDSGLNVEPLGGSAPPRFFPPIHHDRTLHRSEAEGSLGFHDNLAGRTDF 1072

Query: 301  IGVHPDFLRNASEPGRHR-MDGLPPLRSPGREYHS---SRFGPPEDIDVRESHVFGERGV 468
                P  L        HR MD L P RSPGR+Y      RFG    +D  +         
Sbjct: 1073 ARTRPGLLGPPMPGYDHRDMDNLAP-RSPGRDYPGMSMQRFGALPGLDDIDGRAPQRSSD 1131

Query: 469  PFKLSSDGNAFHESRFPTLPSHLRRSELDGPGNLRMGEKI-----GSGALPVHFRSGE-- 627
            P   S      H+SRFP  PSHLRR EL+GPGN  MGE +     G    P H R GE  
Sbjct: 1132 PITSS-----LHDSRFPLFPSHLRRGELNGPGNFHMGEHLSGDLMGHDGWPAHLRRGERL 1186

Query: 628  -PHNLPGHLRMGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEF 804
             P N P HLR+GE  GFG+FP H R GE+ GP NL              H Q  +GEP F
Sbjct: 1187 GPRNPPSHLRLGERGGFGSFPGHARMGELAGPGNL-------------YHQQ--LGEPGF 1231

Query: 805  NSSFPIHGYPNDSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREH 984
             SSF         G   AGD++  +  RKRKS +MGWCRICKVDCET EGLD+HSQTREH
Sbjct: 1232 RSSF---------GGSYAGDLQYSENSRKRKS-SMGWCRICKVDCETFEGLDLHSQTREH 1281

Query: 985  QKMAMDMVLSIKKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            QKMAMDMV++IK+ N KK K +  DH S ED +K R ASFE  GN
Sbjct: 1282 QKMAMDMVVTIKQ-NVKKHKSAPSDHSSLEDTSKLRNASFEGRGN 1325


>ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus]
          Length = 538

 Score =  266 bits (680), Expect = 2e-68
 Identities = 173/382 (45%), Positives = 219/382 (57%), Gaps = 13/382 (3%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++P+RR ++  + E+ L++FPRP HL+SE   +  +Y  S RPFDR   G + + G  
Sbjct: 182  FPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY--SLRPFDRGVHGQNFDTGLT 239

Query: 190  LDGSASGAASRYLPPYQPGGL-------RPVGPLDDNMRRKTDSIGVHPDFLRNASEPGR 348
            +DG+A   ASR LPP   GG        RP+   +D+  +   S G H DF    S  GR
Sbjct: 240  IDGAA---ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQADRSRG-HSDFPAPGSY-GR 294

Query: 349  HRMDGLPPLRSPGREYHSSRFGPP-----EDIDVRE-SHVFGERGVPFKLSSDGNAFHES 510
              +DG  P RSP  EYH   FG       E+ID ++  H FG          D  +F ES
Sbjct: 295  RFVDGFGP-RSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFG----------DPLSFRES 343

Query: 511  RFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMGEPAGFGAFPN 690
            RFP   SHL+R + +  GN RM E + +G L    R   P +LPGHLR+GE   FG+ P 
Sbjct: 344  RFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQDRHFGPRSLPGHLRLGELTAFGSHPG 403

Query: 691  HLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVE 870
            H R G++    N       G   GG      R+GEP F SSF   G  +D  FF AGDVE
Sbjct: 404  HSRIGDLSVLGNFEP---FG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFF-AGDVE 456

Query: 871  SFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLS 1050
            SFD  RKRK  +MGWCRICKVDCETVEGL++HSQTREHQKMAMDMV SIK+ NAKK K++
Sbjct: 457  SFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQ-NAKKHKVT 515

Query: 1051 SDDHVSHEDANKSRKASFESHG 1116
             +DH S +   KS+    ES G
Sbjct: 516  PNDHSSED--GKSKNVGLESRG 535


>ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus]
          Length = 1177

 Score =  266 bits (680), Expect = 2e-68
 Identities = 173/382 (45%), Positives = 219/382 (57%), Gaps = 13/382 (3%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++P+RR ++  + E+ L++FPRP HL+SE   +  +Y  S RPFDR   G + + G  
Sbjct: 821  FPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY--SLRPFDRGVHGQNFDTGLT 878

Query: 190  LDGSASGAASRYLPPYQPGGL-------RPVGPLDDNMRRKTDSIGVHPDFLRNASEPGR 348
            +DG+A   ASR LPP   GG        RP+   +D+  +   S G H DF    S  GR
Sbjct: 879  IDGAA---ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQADRSRG-HSDFPAPGSY-GR 933

Query: 349  HRMDGLPPLRSPGREYHSSRFGPP-----EDIDVRE-SHVFGERGVPFKLSSDGNAFHES 510
              +DG  P RSP  EYH   FG       E+ID ++  H FG          D  +F ES
Sbjct: 934  RFVDGFGP-RSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFG----------DPLSFRES 982

Query: 511  RFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMGEPAGFGAFPN 690
            RFP   SHL+R + +  GN RM E + +G L    R   P +LPGHLR+GE   FG+ P 
Sbjct: 983  RFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQDRHFGPRSLPGHLRLGELTAFGSHPG 1042

Query: 691  HLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVE 870
            H R G++    N       G   GG      R+GEP F SSF   G  +D  FF AGDVE
Sbjct: 1043 HSRIGDLSVLGNFEP---FG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFF-AGDVE 1095

Query: 871  SFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLS 1050
            SFD  RKRK  +MGWCRICKVDCETVEGL++HSQTREHQKMAMDMV SIK+ NAKK K++
Sbjct: 1096 SFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQ-NAKKHKVT 1154

Query: 1051 SDDHVSHEDANKSRKASFESHG 1116
             +DH S +   KS+    ES G
Sbjct: 1155 PNDHSSED--GKSKNVGLESRG 1174


>ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus]
          Length = 1434

 Score =  266 bits (680), Expect = 2e-68
 Identities = 173/382 (45%), Positives = 219/382 (57%), Gaps = 13/382 (3%)
 Frame = +1

Query: 10   FPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEVGPK 189
            FP++P+RR ++  + E+ L++FPRP HL+SE   +  +Y  S RPFDR   G + + G  
Sbjct: 1078 FPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY--SLRPFDRGVHGQNFDTGLT 1135

Query: 190  LDGSASGAASRYLPPYQPGGL-------RPVGPLDDNMRRKTDSIGVHPDFLRNASEPGR 348
            +DG+A   ASR LPP   GG        RP+   +D+  +   S G H DF    S  GR
Sbjct: 1136 IDGAA---ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQADRSRG-HSDFPAPGSY-GR 1190

Query: 349  HRMDGLPPLRSPGREYHSSRFGPP-----EDIDVRE-SHVFGERGVPFKLSSDGNAFHES 510
              +DG  P RSP  EYH   FG       E+ID ++  H FG          D  +F ES
Sbjct: 1191 RFVDGFGP-RSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFG----------DPLSFRES 1239

Query: 511  RFPTLPSHLRRSELDGPGNLRMGEKIGSGALPVHFRSGEPHNLPGHLRMGEPAGFGAFPN 690
            RFP   SHL+R + +  GN RM E + +G L    R   P +LPGHLR+GE   FG+ P 
Sbjct: 1240 RFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQDRHFGPRSLPGHLRLGELTAFGSHPG 1299

Query: 691  HLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAGDVE 870
            H R G++    N       G   GG      R+GEP F SSF   G  +D  FF AGDVE
Sbjct: 1300 HSRIGDLSVLGNFEP---FG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFF-AGDVE 1352

Query: 871  SFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQKLS 1050
            SFD  RKRK  +MGWCRICKVDCETVEGL++HSQTREHQKMAMDMV SIK+ NAKK K++
Sbjct: 1353 SFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQ-NAKKHKVT 1411

Query: 1051 SDDHVSHEDANKSRKASFESHG 1116
             +DH S +   KS+    ES G
Sbjct: 1412 PNDHSSED--GKSKNVGLESRG 1431


>ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
            gi|561004393|gb|ESW03387.1| hypothetical protein
            PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1314

 Score =  263 bits (671), Expect = 2e-67
 Identities = 167/386 (43%), Positives = 214/386 (55%), Gaps = 13/386 (3%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            FKPF V  +++ +D RE+++DLKKF R   +D+E + K+ +Y  S+           HE 
Sbjct: 976  FKPFLVS-NQQTMDRREYDDDLKKFSRLP-MDAESISKYGNYSLSA-----------HE- 1021

Query: 181  GPKLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMD 360
                                  G R VG  DD +++   ++  HP +L      GRH MD
Sbjct: 1022 ---------------------SGKRSVGIHDDVIKKSGSAL--HPGYLGPGPGYGRHHMD 1058

Query: 361  GLPPLRSPGREY---HSSRFGPPEDIDVRESHVFGERG-VPFKLSSDGNAFHESRFPTLP 528
            G+ P RSP  EY    S R GP     + +S +    G VP      G  F +SRFP LP
Sbjct: 1059 GMTP-RSPVGEYAEMSSRRLGPHSGSLIGKSGIDDFDGRVPRHF---GGEFRDSRFPHLP 1114

Query: 529  SHLRRSELDGPGNLRMGEK------IGSGALPVHFRSGEP---HNLPGHLRMGEPAGFGA 681
            SHL R E DG GN R+GE       IG      HFR GEP   HN P HL++GEP GFGA
Sbjct: 1115 SHLHRDEFDGFGNFRIGEHPRSGDFIGQDEYAGHFRRGEPLGPHNFPRHLQLGEPVGFGA 1174

Query: 682  FPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYPNDSGFFNAG 861
             P H+RA E G  R+  S  +      G   G  ++GEP F SSF + G+PND+GF   G
Sbjct: 1175 HPGHMRAVEHGSFRSFESFAK------GSRPGHPQLGEPGFRSSFSLPGFPNDAGFLT-G 1227

Query: 862  DVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNAKKQ 1041
            D+ SFD  R+RK  +MGWCRICK DCETVEGLD+HSQT+EHQKMAMDMV +IK+ NAKKQ
Sbjct: 1228 DIRSFDNLRRRKVSSMGWCRICKADCETVEGLDLHSQTKEHQKMAMDMVKTIKQ-NAKKQ 1286

Query: 1042 KLSSDDHVSHEDANKSRKASFESHGN 1119
            KL   +  + ++ NK+    FE  GN
Sbjct: 1287 KLIPSEQPTVDEGNKTHNTGFEGRGN 1312


>ref|XP_006591981.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X5
            [Glycine max]
          Length = 1299

 Score =  254 bits (648), Expect = 1e-64
 Identities = 168/395 (42%), Positives = 211/395 (53%), Gaps = 22/395 (5%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            FKP      + I + REF++DLKKF R   L+SE V KF +Y              +HE 
Sbjct: 962  FKPLHALNQQNI-ERREFDDDLKKFSRL-PLNSEPVSKFGNYSLG-----------THEA 1008

Query: 181  GPKLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMD 360
            G                       RPVG  DD +++   ++  HP +        RH MD
Sbjct: 1009 GK----------------------RPVGIHDDVIKKSGSAL--HPGYFGPGPGYARHHMD 1044

Query: 361  GLPPLRSPGREY---HSSRFG----------PPEDIDVRESHVFGERGVPFKLSSDGNAF 501
            G+ P RSP  EY    S R G            +D D R +  FGE             F
Sbjct: 1045 GIAP-RSPVSEYAEMSSRRLGLHSGSLVGKSGIDDFDDRVARRFGE-------------F 1090

Query: 502  HESRFPTLPSHLRRSELDGPGNLRMGEK------IGSGALPVHFRSGE---PHNLPGHLR 654
             +SRFP LPSHLRR + DG GN RMGE       +G      HFR GE   PHN P HL+
Sbjct: 1091 RDSRFPHLPSHLRRDDFDGFGNFRMGEYPRSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQ 1150

Query: 655  MGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYP 834
             GEP GFGA P H+RA E+ G R+  S      S GG+  G  ++GEP F SSF + G+P
Sbjct: 1151 HGEPIGFGAHPGHMRAVELDGFRSFES-----FSKGGR-PGHPQLGEPGFRSSFSLTGFP 1204

Query: 835  NDSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLS 1014
            ND+GF   GD+ SFD  R++K+ +MGWCRICKVDCETVEGLD+HSQT+EHQKMAMD+V +
Sbjct: 1205 NDAGFL-TGDIRSFDNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKT 1263

Query: 1015 IKKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            IK+ NAKKQKL   +  S ++ NK+     E  GN
Sbjct: 1264 IKQ-NAKKQKLIPSEEPSMDEGNKTHNTGIEGRGN 1297


>ref|XP_006591980.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X4
            [Glycine max]
          Length = 1335

 Score =  254 bits (648), Expect = 1e-64
 Identities = 168/395 (42%), Positives = 211/395 (53%), Gaps = 22/395 (5%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            FKP      + I + REF++DLKKF R   L+SE V KF +Y              +HE 
Sbjct: 998  FKPLHALNQQNI-ERREFDDDLKKFSRL-PLNSEPVSKFGNYSLG-----------THEA 1044

Query: 181  GPKLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMD 360
            G                       RPVG  DD +++   ++  HP +        RH MD
Sbjct: 1045 GK----------------------RPVGIHDDVIKKSGSAL--HPGYFGPGPGYARHHMD 1080

Query: 361  GLPPLRSPGREY---HSSRFG----------PPEDIDVRESHVFGERGVPFKLSSDGNAF 501
            G+ P RSP  EY    S R G            +D D R +  FGE             F
Sbjct: 1081 GIAP-RSPVSEYAEMSSRRLGLHSGSLVGKSGIDDFDDRVARRFGE-------------F 1126

Query: 502  HESRFPTLPSHLRRSELDGPGNLRMGEK------IGSGALPVHFRSGE---PHNLPGHLR 654
             +SRFP LPSHLRR + DG GN RMGE       +G      HFR GE   PHN P HL+
Sbjct: 1127 RDSRFPHLPSHLRRDDFDGFGNFRMGEYPRSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQ 1186

Query: 655  MGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYP 834
             GEP GFGA P H+RA E+ G R+  S      S GG+  G  ++GEP F SSF + G+P
Sbjct: 1187 HGEPIGFGAHPGHMRAVELDGFRSFES-----FSKGGR-PGHPQLGEPGFRSSFSLTGFP 1240

Query: 835  NDSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLS 1014
            ND+GF   GD+ SFD  R++K+ +MGWCRICKVDCETVEGLD+HSQT+EHQKMAMD+V +
Sbjct: 1241 NDAGFL-TGDIRSFDNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKT 1299

Query: 1015 IKKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            IK+ NAKKQKL   +  S ++ NK+     E  GN
Sbjct: 1300 IKQ-NAKKQKLIPSEEPSMDEGNKTHNTGIEGRGN 1333


>ref|XP_006591977.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X1
            [Glycine max] gi|571491554|ref|XP_006591978.1| PREDICTED:
            histone-lysine N-methyltransferase 2D-like isoform X2
            [Glycine max] gi|571491556|ref|XP_006591979.1| PREDICTED:
            histone-lysine N-methyltransferase 2D-like isoform X3
            [Glycine max]
          Length = 1347

 Score =  254 bits (648), Expect = 1e-64
 Identities = 168/395 (42%), Positives = 211/395 (53%), Gaps = 22/395 (5%)
 Frame = +1

Query: 1    FKPFPVEPSRRIVDHREFEEDLKKFPRPGHLDSEHVGKFESYYSSSRPFDRVPPGFSHEV 180
            FKP      + I + REF++DLKKF R   L+SE V KF +Y              +HE 
Sbjct: 1010 FKPLHALNQQNI-ERREFDDDLKKFSRL-PLNSEPVSKFGNYSLG-----------THEA 1056

Query: 181  GPKLDGSASGAASRYLPPYQPGGLRPVGPLDDNMRRKTDSIGVHPDFLRNASEPGRHRMD 360
            G                       RPVG  DD +++   ++  HP +        RH MD
Sbjct: 1057 GK----------------------RPVGIHDDVIKKSGSAL--HPGYFGPGPGYARHHMD 1092

Query: 361  GLPPLRSPGREY---HSSRFG----------PPEDIDVRESHVFGERGVPFKLSSDGNAF 501
            G+ P RSP  EY    S R G            +D D R +  FGE             F
Sbjct: 1093 GIAP-RSPVSEYAEMSSRRLGLHSGSLVGKSGIDDFDDRVARRFGE-------------F 1138

Query: 502  HESRFPTLPSHLRRSELDGPGNLRMGEK------IGSGALPVHFRSGE---PHNLPGHLR 654
             +SRFP LPSHLRR + DG GN RMGE       +G      HFR GE   PHN P HL+
Sbjct: 1139 RDSRFPHLPSHLRRDDFDGFGNFRMGEYPRSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQ 1198

Query: 655  MGEPAGFGAFPNHLRAGEVGGPRNLPSNLRIGDSIGGKLHGQVRMGEPEFNSSFPIHGYP 834
             GEP GFGA P H+RA E+ G R+  S      S GG+  G  ++GEP F SSF + G+P
Sbjct: 1199 HGEPIGFGAHPGHMRAVELDGFRSFES-----FSKGGR-PGHPQLGEPGFRSSFSLTGFP 1252

Query: 835  NDSGFFNAGDVESFDQPRKRKSGTMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLS 1014
            ND+GF   GD+ SFD  R++K+ +MGWCRICKVDCETVEGLD+HSQT+EHQKMAMD+V +
Sbjct: 1253 NDAGFL-TGDIRSFDNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKT 1311

Query: 1015 IKKDNAKKQKLSSDDHVSHEDANKSRKASFESHGN 1119
            IK+ NAKKQKL   +  S ++ NK+     E  GN
Sbjct: 1312 IKQ-NAKKQKLIPSEEPSMDEGNKTHNTGIEGRGN 1345

