BLASTX nr result

ID: Atropa21_contig00004154 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00004154
         (1468 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006345324.1| PREDICTED: trithorax group protein osa-like ...   674   0.0  
ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249...   658   0.0  
emb|CBI16022.3| unnamed protein product [Vitis vinifera]              263   1e-67
gb|ESW03387.1| hypothetical protein PHAVU_011G009900g [Phaseolus...   221   6e-55
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...   209   2e-51
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...   209   2e-51
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]   207   1e-50
gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus pe...   206   2e-50
gb|ESW03386.1| hypothetical protein PHAVU_011G009900g [Phaseolus...   204   6e-50
ref|XP_003534401.2| PREDICTED: altered inheritance of mitochondr...   203   1e-49
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...   199   2e-48
ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II tra...   199   2e-48
ref|XP_006591981.1| PREDICTED: histone-lysine N-methyltransferas...   196   2e-47
ref|XP_006591980.1| PREDICTED: histone-lysine N-methyltransferas...   196   2e-47
ref|XP_006591977.1| PREDICTED: histone-lysine N-methyltransferas...   196   2e-47
gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao]    187   8e-45
gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma caca...   187   8e-45
gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao]    184   1e-43
gb|ACU17648.1| unknown [Glycine max]                                  182   3e-43
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...   178   6e-42

>ref|XP_006345324.1| PREDICTED: trithorax group protein osa-like isoform X1 [Solanum
            tuberosum]
          Length = 1049

 Score =  674 bits (1739), Expect = 0.0
 Identities = 329/414 (79%), Positives = 346/414 (83%), Gaps = 2/414 (0%)
 Frame = +2

Query: 2    VNPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPWDKRLKAPMGKHLSPL 181
            VNPAE EMFQNQRVNRF+GNQPNPF  GS ++VPFGQPRSMES  DKRLKAPMG+HLSPL
Sbjct: 638  VNPAEAEMFQNQRVNRFEGNQPNPFSSGSFEKVPFGQPRSMESARDKRLKAPMGEHLSPL 697

Query: 182  P--HDQASRPLDKPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSMHFKDSGEREAPLG 355
            P   DQ S P DKPPRGLGY  GSKFEAS GV             SMHFKDSGEREAPLG
Sbjct: 698  PVPRDQGSWPHDKPPRGLGYDSGSKFEASTGVPPNRLLPPHHPPGSMHFKDSGEREAPLG 757

Query: 356  LYDDDRKRAGSGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSG 535
             +DDDRKR GSGFGVHHMDY+SARNPDGE FNIPPRGFVSHS F+DIGGREP QFIEG G
Sbjct: 758  PHDDDRKRGGSGFGVHHMDYLSARNPDGELFNIPPRGFVSHSGFDDIGGREPRQFIEGPG 817

Query: 536  PFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFG 715
             FNLPSNLAGG  L+S+GRFQ+LPG+ HG E DGLGDLR  EHTTFGRPYKHV+SGDLFG
Sbjct: 818  HFNLPSNLAGG--LYSNGRFQALPGHPHGVETDGLGDLRGGEHTTFGRPYKHVQSGDLFG 875

Query: 716  KDVPSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKP 895
            KD+PSHLHH E LDP K+PSHL               YMGELSGFGDIP F ESIGR+KP
Sbjct: 876  KDMPSHLHHDESLDPPKLPSHLRFDKPAGFGSFAGHAYMGELSGFGDIPGFGESIGRNKP 935

Query: 896  GMPLFGEPGFRSRYPSPGFPNHGLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLD 1075
            GMP FGEPGFRSRYP P +PNHGLYAGDVDSFDRPRKRKP SMGWCRICK DCETVEGLD
Sbjct: 936  GMPRFGEPGFRSRYPVPAYPNHGLYAGDVDSFDRPRKRKPTSMGWCRICKVDCETVEGLD 995

Query: 1076 MHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRKT 1237
            MHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKG+TRKAVFE RGRKT
Sbjct: 996  MHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGRTRKAVFESRGRKT 1049


>ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249008 [Solanum
            lycopersicum]
          Length = 1353

 Score =  658 bits (1698), Expect = 0.0
 Identities = 321/414 (77%), Positives = 340/414 (82%), Gaps = 2/414 (0%)
 Frame = +2

Query: 2    VNPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPWDKRLKAPMGKHLSPL 181
            VNPAE EMFQNQRVN F+GNQ NPF  GS ++VPFGQPRSMES  DKRLKAPMG+HL PL
Sbjct: 942  VNPAEAEMFQNQRVNCFEGNQSNPFSSGSFEKVPFGQPRSMESARDKRLKAPMGEHLIPL 1001

Query: 182  P--HDQASRPLDKPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSMHFKDSGEREAPLG 355
            P   DQ SRP DKPP GLGY  GSKFEAS GV             SMHFKDSGEREAPLG
Sbjct: 1002 PVPSDQGSRPHDKPPHGLGYDSGSKFEASTGVPPNRLLPPHHPPGSMHFKDSGEREAPLG 1061

Query: 356  LYDDDRKRAGSGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSG 535
             +DDDRKR GSGFGVHH+DY+SARNPDGE FNIP RGFVSHS F+D GGREP QFIEG G
Sbjct: 1062 PHDDDRKRGGSGFGVHHLDYLSARNPDGELFNIPQRGFVSHSGFDDTGGREPRQFIEGPG 1121

Query: 536  PFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFG 715
             FNLPSNLAGG  L+S+ RFQ+LPG+ HG E DGLGDLR  EHTTFGRPYKHV+SGDLFG
Sbjct: 1122 HFNLPSNLAGG--LYSNSRFQALPGHPHGVETDGLGDLRGGEHTTFGRPYKHVQSGDLFG 1179

Query: 716  KDVPSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKP 895
            KD+PSHLHH E LDP K+PSHL               YMGELSGFGDIP FDES+GR+KP
Sbjct: 1180 KDMPSHLHHDESLDPPKLPSHLRFDKPGGFGSFAGRAYMGELSGFGDIPGFDESVGRNKP 1239

Query: 896  GMPLFGEPGFRSRYPSPGFPNHGLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLD 1075
            GMP FGEPGFRSRYP PG+PNHGLYAGDVDSFDRPRKRKP SMGWCRICK DCETVEGLD
Sbjct: 1240 GMPQFGEPGFRSRYPVPGYPNHGLYAGDVDSFDRPRKRKPTSMGWCRICKVDCETVEGLD 1299

Query: 1076 MHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRKT 1237
            MHSQTREHQDMAMDMVRSIKEQNR KQKTFSDR SVEEKG+TRKAVFE RGRKT
Sbjct: 1300 MHSQTREHQDMAMDMVRSIKEQNRMKQKTFSDRPSVEEKGRTRKAVFESRGRKT 1353


>emb|CBI16022.3| unnamed protein product [Vitis vinifera]
          Length = 1669

 Score =  263 bits (673), Expect = 1e-67
 Identities = 185/495 (37%), Positives = 237/495 (47%), Gaps = 86/495 (17%)
 Frame = +2

Query: 2    VNPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMES----------------- 130
            VNP E+E+F N R N FDG Q +   PGSS+  PFGQP  ++S                 
Sbjct: 1186 VNPVESEIFSNPRPNYFDGRQSDSHIPGSSERGPFGQPSGVQSNMMRMNGGLGIESSLPV 1245

Query: 131  ----------PWDKRLKAPMGKHLSPLP------------------HDQASRPLDKPPRG 226
                      P   R  +  GK    L                   +  +SRPLD+  +G
Sbjct: 1246 GLQDERFKSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQG 1305

Query: 227  --------------LGYHFGSKFEASAGVXXXXXXXXXXXXSSMHFKDSGEREAPLGLYD 364
                          LG+++ S F++SAG                H    GER   +G ++
Sbjct: 1306 FVMDAAQGLLDKAPLGFNYDSGFKSSAGTGTSRFFPPP------HPGGDGERSRAVGFHE 1359

Query: 365  DDRKRAGSG------------FGVHHMDYMSARNPDGEFFNIPPRGFVS-------HSSF 487
            D+  R+               +G HHMD ++ R+P  EF  IP RGF          S  
Sbjct: 1360 DNVGRSDMARTHPNFLGSVPEYGRHHMDGLNPRSPTREFSGIPHRGFGGLSGVPGRQSDL 1419

Query: 488  EDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHT 667
            +DI GRE  +F EGS  FNLPS+         + RF  LP +   GE++G G+L  ++  
Sbjct: 1420 DDIDGRESRRFGEGSKTFNLPSD---------ESRFPVLPSHLRRGELEGPGELVMADPI 1470

Query: 668  TFGRPYKHVRSGDLFGKDV-PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELS 844
                   H+R GDL G+D+ PSHL   E    + IP  L                MGELS
Sbjct: 1471 ASRPAPHHLRGGDLIGQDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGHPR-MGELS 1529

Query: 845  GFGDIPC---FDESIGRS-KPGMPLFGEPGFRSRYPSPGFPN-HGLYA-GDVDSFDRPRK 1006
            G G+ P      ES G S K G P  GEPGFRS Y   G+PN HG    GD++SFD  RK
Sbjct: 1530 GPGNFPSRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRK 1589

Query: 1007 RKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQK-TFSDRASV 1183
            RKP+SM WCRIC  DCETV+GLDMHSQTREHQ MAMD+V SIK+QN KKQK T  D ++ 
Sbjct: 1590 RKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQNAKKQKLTSKDHSTP 1649

Query: 1184 EEKGKTRKAVFEGRG 1228
            E+  K++K V  G G
Sbjct: 1650 EDSSKSKKGVLRGGG 1664


>gb|ESW03387.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1314

 Score =  221 bits (563), Expect = 6e-55
 Identities = 142/341 (41%), Positives = 173/341 (50%), Gaps = 33/341 (9%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   +SG+R   +G++DD  K++GS           G+G HHMD M+ R+P GE+  + 
Sbjct: 1016 SLSAHESGKRS--VGIHDDVIKKSGSALHPGYLGPGPGYGRHHMDGMTPRSPVGEYAEMS 1073

Query: 458  PRGFVSHSS-------FEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNS 616
             R    HS         +D  GR P  F                GG F D RF  LP + 
Sbjct: 1074 SRRLGPHSGSLIGKSGIDDFDGRVPRHF----------------GGEFRDSRFPHLPSHL 1117

Query: 617  HGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXX 793
            H  E DG G+ R  EH          RSGD  G+D    H    EPL P   P HL    
Sbjct: 1118 HRDEFDGFGNFRIGEHP---------RSGDFIGQDEYAGHFRRGEPLGPHNFPRHLQ--- 1165

Query: 794  XXXXXXXXXXXYMGELSGFGDIP------------CFDESIGRSKPGMPLFGEPGFRSRY 937
                        +GE  GFG  P             F+     S+PG P  GEPGFRS +
Sbjct: 1166 ------------LGEPVGFGAHPGHMRAVEHGSFRSFESFAKGSRPGHPQLGEPGFRSSF 1213

Query: 938  PSPGFPNH-GLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAM 1114
              PGFPN  G   GD+ SFD  R+RK  SMGWCRICKADCETVEGLD+HSQT+EHQ MAM
Sbjct: 1214 SLPGFPNDAGFLTGDIRSFDNLRRRKVSSMGWCRICKADCETVEGLDLHSQTKEHQKMAM 1273

Query: 1115 DMVRSIKEQNRKKQKTF-SDRASVEEKGKTRKAVFEGRGRK 1234
            DMV++IK QN KKQK   S++ +V+E  KT    FEGRG K
Sbjct: 1274 DMVKTIK-QNAKKQKLIPSEQPTVDEGNKTHNTGFEGRGNK 1313


>ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            15-like isoform X1 [Citrus sinensis]
            gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 15-like isoform X3 [Citrus sinensis]
            gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X4 [Citrus sinensis]
          Length = 1392

 Score =  209 bits (533), Expect = 2e-51
 Identities = 161/472 (34%), Positives = 199/472 (42%), Gaps = 62/472 (13%)
 Frame = +2

Query: 5    NPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPW------------DKRL 148
            NP E EMF  QR    DG + +   PGS    P G P    S              D+R 
Sbjct: 969  NPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELRDERF 1028

Query: 149  KAPMGKHLSPLPHDQA------------------------------------SRPLDKPP 220
            K+     L+P P D A                                    SRP D+ P
Sbjct: 1029 KSFPDGRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLGSHFLPSRPFDRGP 1088

Query: 221  RGLGYHFGSK-FEASAGVXXXXXXXXXXXXSSMHFKDSGEREAPLGLYDD-----DRKRA 382
             G G   G + FE                 +   F  +   +A  G  D      D  R 
Sbjct: 1089 HGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYHDDAA-GRSDSSHAHPDFPRP 1147

Query: 383  GSGFGVHHMDYMSARNPDGEFFN---IPPRGFVSHSSFEDIGGREPCQFIEGSGPFNLPS 553
            G  +G  HM  +S R+   EF     +P     S S  EDIGGRE  +F     P     
Sbjct: 1148 GRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRSVREDIGGREFRRF---GDPI---- 1200

Query: 554  NLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPS 730
                 G  F D RF  LP +   GE +G G                 R+GDL G++ +PS
Sbjct: 1201 -----GNSFHDSRFPVLPSHLRRGEFEGPG-----------------RTGDLIGQEFLPS 1238

Query: 731  HLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPC---FDESIGRSKPGM 901
            HL   EPL P  +                    +GE  G G  P     +E  G      
Sbjct: 1239 HLRRGEPLGPHNLR-------------------LGETVGLGGFPGPARMEELGGPGNFPP 1279

Query: 902  PLFGEPGFRSRYPSPGFPNHG-LYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDM 1078
            P  GEPGFRS +   GFPN G  Y GD++S D  RKRKP SMGWCRICK DCETV+GLD+
Sbjct: 1280 PRLGEPGFRSSFSRQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDL 1339

Query: 1079 HSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            HSQTREHQ MAMDMV SIK+  +K++ T  DR S ++  K+R   F+GRG+K
Sbjct: 1340 HSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRCSTDDANKSRNVNFDGRGKK 1391


>ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina]
            gi|557526921|gb|ESR38227.1| hypothetical protein
            CICLE_v10027683mg [Citrus clementina]
          Length = 1392

 Score =  209 bits (533), Expect = 2e-51
 Identities = 161/472 (34%), Positives = 199/472 (42%), Gaps = 62/472 (13%)
 Frame = +2

Query: 5    NPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPW------------DKRL 148
            NP E EMF  QR    DG + +   PGS    P G P    S              D+R 
Sbjct: 969  NPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELRDERF 1028

Query: 149  KAPMGKHLSPLPHDQA------------------------------------SRPLDKPP 220
            K+     L+P P D A                                    SRP D+ P
Sbjct: 1029 KSFPDGRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLGSHFLPSRPFDRGP 1088

Query: 221  RGLGYHFGSK-FEASAGVXXXXXXXXXXXXSSMHFKDSGEREAPLGLYDD-----DRKRA 382
             G G   G + FE                 +   F  +   +A  G  D      D  R 
Sbjct: 1089 HGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYHDDAA-GRSDSSHAHPDFPRP 1147

Query: 383  GSGFGVHHMDYMSARNPDGEFFN---IPPRGFVSHSSFEDIGGREPCQFIEGSGPFNLPS 553
            G  +G  HM  +S R+   EF     +P     S S  EDIGGRE  +F     P     
Sbjct: 1148 GRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRSVREDIGGREFRRF---GDPI---- 1200

Query: 554  NLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPS 730
                 G  F D RF  LP +   GE +G G                 R+GDL G++ +PS
Sbjct: 1201 -----GNSFHDSRFPVLPSHLRRGEFEGPG-----------------RTGDLIGQEFLPS 1238

Query: 731  HLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPC---FDESIGRSKPGM 901
            HL   EPL P  +                    +GE  G G  P     +E  G      
Sbjct: 1239 HLRRGEPLGPHNLR-------------------LGETVGLGGFPGPARMEELGGPGNFPP 1279

Query: 902  PLFGEPGFRSRYPSPGFPNHG-LYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDM 1078
            P  GEPGFRS +   GFPN G  Y GD++S D  RKRKP SMGWCRICK DCETV+GLD+
Sbjct: 1280 PRLGEPGFRSSFSHQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRICKVDCETVDGLDL 1339

Query: 1079 HSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            HSQTREHQ MAMDMV SIK+  +K++ T  DR S ++  K+R   F+GRG+K
Sbjct: 1340 HSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRCSTDDANKSRNVNFDGRGKK 1391


>emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]
          Length = 1131

 Score =  207 bits (526), Expect = 1e-50
 Identities = 162/429 (37%), Positives = 205/429 (47%), Gaps = 20/429 (4%)
 Frame = +2

Query: 2    VNPAETEMFQNQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPWDKRLKAPMGKHLSPL 181
            VNP E+E+F N R N FDG Q +   PGSS+  PFGQP   +S    R+   +G   S L
Sbjct: 757  VNPVESEIFSNPRPNYFDGRQSDSHIPGSSERGPFGQPSGXQSNM-MRMNGGLGIE-SSL 814

Query: 182  P---HDQASRPLDKPPRGLGYHFG-----SKFEASAGVXXXXXXXXXXXXSSMHFKDSGE 337
            P    D+  + L +P R    H        +F  S+ +            SS    D G 
Sbjct: 815  PVGLQDERFKSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGS 874

Query: 338  R----EAPLGLYDDDRKRAGSGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGR 505
            +    +A  GL D    +A  GF           N D  F +    G    S  +DI GR
Sbjct: 875  QGFVMDAAQGLLD----KAPLGF-----------NYDSGFKSSAGTGTSRQSDLDDIDGR 919

Query: 506  EPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPY 685
            E  +F EG   FNLPS+         + RF  LP +          D+  S         
Sbjct: 920  ESRRFGEGYQTFNLPSD---------ESRFPVLPSHLRR-------DILPS--------- 954

Query: 686  KHVRSGDLFG-KDVPSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIP 862
             H++ G+ FG +++P  L   EP+                         MGELSG G+ P
Sbjct: 955  -HLQRGEHFGSRNIPGQLRFGEPV----------------FDAFLGHPRMGELSGPGNFP 997

Query: 863  C---FDESIGRS-KPGMPLFGEPGFRSRYPSPGFPN-HGLYA-GDVDSFDRPRKRKPVSM 1024
                  ES G S K G P  GEPGFRS Y   G+PN HG    GD++SFD  RKRKP+SM
Sbjct: 998  SRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRKPLSM 1057

Query: 1025 GWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQK-TFSDRASVEEKGKT 1201
             WCRIC  DCETV+GLDMHSQTREHQ MAMD+V SIK+QN KKQK T  D ++ E+  K+
Sbjct: 1058 AWCRICNIDCETVDGLDMHSQTREHQQMAMDIVLSIKQQNAKKQKLTSKDHSTPEDSSKS 1117

Query: 1202 RKAVFEGRG 1228
            +K V  G G
Sbjct: 1118 KKGVLRGGG 1126


>gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica]
          Length = 1334

 Score =  206 bits (524), Expect = 2e-50
 Identities = 140/403 (34%), Positives = 194/403 (48%), Gaps = 11/403 (2%)
 Frame = +2

Query: 29   QNQRVNRFDGNQPNPFPPGSS----DEVPFGQPRSMESPWDKRLKAPMGKHLSPLP---- 184
            +++R   F G + NPFP   +    D V F          D   + P   +L   P    
Sbjct: 971  RDERFKAFPGERLNPFPVDPTRHVIDRVEFE---------DDLKQFPRPSYLDSEPVAKF 1021

Query: 185  HDQASRPLDKPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSMHFKDSGE--REAPLGL 358
             + +SRP D+ P G  Y  G   +  AG              S+H  D+G+  R  P   
Sbjct: 1022 GNYSSRPFDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLGGSVHGNDAGDFGRMEPTHG 1081

Query: 359  YDDDRKRAGSGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSGP 538
            + D         G   +D ++ R+P  ++  +PP GF      +D  GRE  +F     P
Sbjct: 1082 HPDF-------VGRRLVDGLAPRSPVRDYPGLPPHGFRGFGP-DDFDGREFHRF---GDP 1130

Query: 539  FNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGK 718
                      G  F +GRF +LPG+   GE +G G+LR  +H          R  D  G+
Sbjct: 1131 L---------GNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDH----------RRNDFIGQ 1171

Query: 719  DV-PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKP 895
            D  P HL   + L P  +   L               +MG+++G G+     E    ++P
Sbjct: 1172 DGHPGHLRRGDHLGPHNLREPLGFGSRHS--------HMGDMAGPGNF----EPFRGNRP 1219

Query: 896  GMPLFGEPGFRSRYPSPGFPNHGLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLD 1075
              P  GEPGFRS +    FPN G Y GD++SFD  RKRKP SMGWCRICK DCETVEGLD
Sbjct: 1220 NHPRLGEPGFRSSFSLQRFPNDGTYTGDLESFDHSRKRKPASMGWCRICKVDCETVEGLD 1279

Query: 1076 MHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTR 1204
            +HSQTREHQ MAMDMVRSIK+  +K++ T  D++ +E+  K++
Sbjct: 1280 LHSQTREHQKMAMDMVRSIKQNAKKQKLTSGDQSLLEDANKSK 1322


>gb|ESW03386.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1288

 Score =  204 bits (520), Expect = 6e-50
 Identities = 131/315 (41%), Positives = 158/315 (50%), Gaps = 32/315 (10%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   +SG+R   +G++DD  K++GS           G+G HHMD M+ R+P GE+  + 
Sbjct: 1016 SLSAHESGKRS--VGIHDDVIKKSGSALHPGYLGPGPGYGRHHMDGMTPRSPVGEYAEMS 1073

Query: 458  PRGFVSHSS-------FEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNS 616
             R    HS         +D  GR P  F                GG F D RF  LP + 
Sbjct: 1074 SRRLGPHSGSLIGKSGIDDFDGRVPRHF----------------GGEFRDSRFPHLPSHL 1117

Query: 617  HGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXX 793
            H  E DG G+ R  EH          RSGD  G+D    H    EPL P   P HL    
Sbjct: 1118 HRDEFDGFGNFRIGEHP---------RSGDFIGQDEYAGHFRRGEPLGPHNFPRHLQ--- 1165

Query: 794  XXXXXXXXXXXYMGELSGFGDIP------------CFDESIGRSKPGMPLFGEPGFRSRY 937
                        +GE  GFG  P             F+     S+PG P  GEPGFRS +
Sbjct: 1166 ------------LGEPVGFGAHPGHMRAVEHGSFRSFESFAKGSRPGHPQLGEPGFRSSF 1213

Query: 938  PSPGFPNH-GLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAM 1114
              PGFPN  G   GD+ SFD  R+RK  SMGWCRICKADCETVEGLD+HSQT+EHQ MAM
Sbjct: 1214 SLPGFPNDAGFLTGDIRSFDNLRRRKVSSMGWCRICKADCETVEGLDLHSQTKEHQKMAM 1273

Query: 1115 DMVRSIKEQNRKKQK 1159
            DMV++IK QN KKQK
Sbjct: 1274 DMVKTIK-QNAKKQK 1287


>ref|XP_003534401.2| PREDICTED: altered inheritance of mitochondria protein 3 isoform X1
            [Glycine max] gi|571478903|ref|XP_006587697.1| PREDICTED:
            altered inheritance of mitochondria protein 3 isoform X2
            [Glycine max] gi|571478905|ref|XP_006587698.1| PREDICTED:
            altered inheritance of mitochondria protein 3 isoform X3
            [Glycine max]
          Length = 1300

 Score =  203 bits (517), Expect = 1e-49
 Identities = 125/322 (38%), Positives = 165/322 (51%), Gaps = 14/322 (4%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   D+G+R  P+G++DD  K++GS           G+G HHMD +++R+P  E+  + 
Sbjct: 1003 SLGAHDAGKR--PVGIHDDVIKKSGSALHPGYLEPGPGYGRHHMDGIASRSPVSEYAEMS 1060

Query: 458  PRGFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDG 637
             R    H+             +  +G  +    +A   G F D RF  LP + H  + DG
Sbjct: 1061 SRRLGPHAG----------SLVGKAGIDDFEGRVARRFGEFHDSRFPHLPSHLHRDDFDG 1110

Query: 638  LGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXXXXXXXXX 814
             G+ R  EH          RSGD  G+D    H    E L P   P HL           
Sbjct: 1111 FGNFRMGEHP---------RSGDFIGQDEFGGHFRRGEHLAPHNFPRHLQLGEPIGFGAH 1161

Query: 815  XXXXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPSPGFPNHGLY-AGDVDSF 991
                   EL GF     F +     +PG P  GEPGFRS +  PGFPN   +  GD+   
Sbjct: 1162 PGHMRAVELDGFRGFESFGKG---GRPGHPQLGEPGFRSSFSLPGFPNDARFLTGDIRLL 1218

Query: 992  DRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTF-S 1168
            D  R+RK  SMGWCRICK DCETVEGLD+HSQT+EHQ MAMD+V++IK QN KKQK   S
Sbjct: 1219 DNLRRRKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKTIK-QNAKKQKLIPS 1277

Query: 1169 DRASVEEKGKTRKAVFEGRGRK 1234
            +++S++E  KT     EGRG K
Sbjct: 1278 EQSSIDEGNKTHNTSIEGRGNK 1299


>ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa]
            gi|550331020|gb|ERP56830.1| hypothetical protein
            POPTR_0009s04520g [Populus trichocarpa]
          Length = 1315

 Score =  199 bits (507), Expect = 2e-48
 Identities = 135/360 (37%), Positives = 174/360 (48%), Gaps = 11/360 (3%)
 Frame = +2

Query: 188  DQASRPLDKPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSMHFKDS----GEREAPLG 355
            D A RPLDK   G  Y  G   E   G              ++H  D+    G  ++  G
Sbjct: 997  DGAPRPLDKGSHGFNYDSGLNMEPLGGSAPPRFFPPYHHDKALHPSDAEVSLGYHDSLAG 1056

Query: 356  LYDDDRKRAG------SGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQ 517
              D  R R G       G+   HMD ++ R+P  ++  +P R F +    +DI GR+P +
Sbjct: 1057 RSDFARTRPGFLGPPIPGYDHRHMDNLAPRSPVRDYPGMPTRRFGALPGLDDIDGRDPHR 1116

Query: 518  FIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVR 697
            F +        S+L        D RF   P +   GE++G G+L   EH           
Sbjct: 1117 FGD-----KFSSSLR-------DSRFPVFPSHLRRGELEGPGNLHMGEHL---------- 1154

Query: 698  SGDLFGKDV-PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDE 874
            SGDL G D  P+HL   E L P+ +PSHL                MGEL+G G+      
Sbjct: 1155 SGDLMGHDGRPAHLRRGEHLGPRNLPSHLWVGEPGNFGAFPGHARMGELAGPGNFYHHQ- 1213

Query: 875  SIGRSKPGMPLFGEPGFRSRYPSPGFPNHGLYAGDVDSFDRPRKRKPVSMGWCRICKADC 1054
                        GEPGFRS +        G YAGD+  FD  RKRKP SMGWCRICK DC
Sbjct: 1214 -----------LGEPGFRSSFG-------GNYAGDLQFFDNSRKRKP-SMGWCRICKVDC 1254

Query: 1055 ETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            ETVE LD+HSQTREHQ MA+DMV +IK+  +K + T    +S+E+K K+R A FEGRG K
Sbjct: 1255 ETVEALDLHSQTREHQKMALDMVVTIKQNAKKHKSTPCHHSSLEDKSKSRNASFEGRGNK 1314


>ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            12-like isoform X1 [Cicer arietinum]
            gi|502146144|ref|XP_004506323.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 12-like isoform
            X2 [Cicer arietinum] gi|502146146|ref|XP_004506324.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 12-like isoform X3 [Cicer arietinum]
          Length = 1283

 Score =  199 bits (506), Expect = 2e-48
 Identities = 131/336 (38%), Positives = 170/336 (50%), Gaps = 33/336 (9%)
 Frame = +2

Query: 326  DSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIPPR--- 463
            ++G+R  P+G +DD  K+ GS           G+G+HHMD ++ R+P  E+ ++P R   
Sbjct: 986  ETGKR--PVGYHDDAIKKPGSTLHPGHLGPGPGYGIHHMDGIAPRSPGSEYIDMPSRRSG 1043

Query: 464  ----GFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEI 631
                G VS S  +D  GR   ++ +  G              F DGRF   P + H    
Sbjct: 1044 PLSGGLVSKSGIDDFDGRTASRYGDSVGI------------AFRDGRFPHQPSHLHRDAF 1091

Query: 632  DGLGDLRSSEHTTFGRPYKHVRSGDLFGKDVPS-HLHHAEPLDPQKIPSHLXXXXXXXXX 808
            DG G+ R  EH          R G+  G+D  S H    E L P   P HL         
Sbjct: 1092 DGFGNFRMGEHP---------RRGNFIGRDEFSGHFQRGEHLGPHNFPRHLQ-------- 1134

Query: 809  XXXXXXYMGELSGFGDIP----CFDESIGRS--------KPGMPLFGEPGFRSRYPSPGF 952
                   +GE   FGD P     F+    RS        +PG P  GEPGFRS +   GF
Sbjct: 1135 -------LGERISFGDHPGHMRAFELGSSRSFESFSKGNRPGHPQLGEPGFRSSFSLAGF 1187

Query: 953  PNH-GLYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRS 1129
             N  G   GD+ SFD  R+RK  SMGWCRICK DCETVEGL++HSQTREHQ MA+D+V++
Sbjct: 1188 NNDAGFLTGDIRSFDNLRRRKAASMGWCRICKVDCETVEGLELHSQTREHQKMAVDIVKT 1247

Query: 1130 IKEQNRKKQKTF-SDRASVEEKGKTRKAVFEGRGRK 1234
            IK QN KKQK   S+++SVE+  +T    FEG G K
Sbjct: 1248 IK-QNAKKQKLIPSEQSSVEDGKQTWGTGFEGHGNK 1282


>ref|XP_006591981.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X5
            [Glycine max]
          Length = 1299

 Score =  196 bits (498), Expect = 2e-47
 Identities = 124/322 (38%), Positives = 161/322 (50%), Gaps = 14/322 (4%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   ++G+R  P+G++DD  K++GS           G+  HHMD ++ R+P  E+  + 
Sbjct: 1002 SLGTHEAGKR--PVGIHDDVIKKSGSALHPGYFGPGPGYARHHMDGIAPRSPVSEYAEMS 1059

Query: 458  PRGFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDG 637
             R    HS             +  SG  +    +A   G F D RF  LP +    + DG
Sbjct: 1060 SRRLGLHSG----------SLVGKSGIDDFDDRVARRFGEFRDSRFPHLPSHLRRDDFDG 1109

Query: 638  LGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXXXXXXXXX 814
             G+ R  E+          RSGD  G+D    H    E L P   P HL           
Sbjct: 1110 FGNFRMGEYP---------RSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQHGEPIGFGAH 1160

Query: 815  XXXXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPSPGFPNH-GLYAGDVDSF 991
                   EL GF     F +     +PG P  GEPGFRS +   GFPN  G   GD+ SF
Sbjct: 1161 PGHMRAVELDGFRSFESFSKG---GRPGHPQLGEPGFRSSFSLTGFPNDAGFLTGDIRSF 1217

Query: 992  DRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTF-S 1168
            D  R++K  SMGWCRICK DCETVEGLD+HSQT+EHQ MAMD+V++IK QN KKQK   S
Sbjct: 1218 DNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKTIK-QNAKKQKLIPS 1276

Query: 1169 DRASVEEKGKTRKAVFEGRGRK 1234
            +  S++E  KT     EGRG K
Sbjct: 1277 EEPSMDEGNKTHNTGIEGRGNK 1298


>ref|XP_006591980.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X4
            [Glycine max]
          Length = 1335

 Score =  196 bits (498), Expect = 2e-47
 Identities = 124/322 (38%), Positives = 161/322 (50%), Gaps = 14/322 (4%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   ++G+R  P+G++DD  K++GS           G+  HHMD ++ R+P  E+  + 
Sbjct: 1038 SLGTHEAGKR--PVGIHDDVIKKSGSALHPGYFGPGPGYARHHMDGIAPRSPVSEYAEMS 1095

Query: 458  PRGFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDG 637
             R    HS             +  SG  +    +A   G F D RF  LP +    + DG
Sbjct: 1096 SRRLGLHSG----------SLVGKSGIDDFDDRVARRFGEFRDSRFPHLPSHLRRDDFDG 1145

Query: 638  LGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXXXXXXXXX 814
             G+ R  E+          RSGD  G+D    H    E L P   P HL           
Sbjct: 1146 FGNFRMGEYP---------RSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQHGEPIGFGAH 1196

Query: 815  XXXXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPSPGFPNH-GLYAGDVDSF 991
                   EL GF     F +     +PG P  GEPGFRS +   GFPN  G   GD+ SF
Sbjct: 1197 PGHMRAVELDGFRSFESFSKG---GRPGHPQLGEPGFRSSFSLTGFPNDAGFLTGDIRSF 1253

Query: 992  DRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTF-S 1168
            D  R++K  SMGWCRICK DCETVEGLD+HSQT+EHQ MAMD+V++IK QN KKQK   S
Sbjct: 1254 DNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKTIK-QNAKKQKLIPS 1312

Query: 1169 DRASVEEKGKTRKAVFEGRGRK 1234
            +  S++E  KT     EGRG K
Sbjct: 1313 EEPSMDEGNKTHNTGIEGRGNK 1334


>ref|XP_006591977.1| PREDICTED: histone-lysine N-methyltransferase 2D-like isoform X1
            [Glycine max] gi|571491554|ref|XP_006591978.1| PREDICTED:
            histone-lysine N-methyltransferase 2D-like isoform X2
            [Glycine max] gi|571491556|ref|XP_006591979.1| PREDICTED:
            histone-lysine N-methyltransferase 2D-like isoform X3
            [Glycine max]
          Length = 1347

 Score =  196 bits (498), Expect = 2e-47
 Identities = 124/322 (38%), Positives = 161/322 (50%), Gaps = 14/322 (4%)
 Frame = +2

Query: 311  SMHFKDSGEREAPLGLYDDDRKRAGS-----------GFGVHHMDYMSARNPDGEFFNIP 457
            S+   ++G+R  P+G++DD  K++GS           G+  HHMD ++ R+P  E+  + 
Sbjct: 1050 SLGTHEAGKR--PVGIHDDVIKKSGSALHPGYFGPGPGYARHHMDGIAPRSPVSEYAEMS 1107

Query: 458  PRGFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDG 637
             R    HS             +  SG  +    +A   G F D RF  LP +    + DG
Sbjct: 1108 SRRLGLHSG----------SLVGKSGIDDFDDRVARRFGEFRDSRFPHLPSHLRRDDFDG 1157

Query: 638  LGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQKIPSHLXXXXXXXXXXX 814
             G+ R  E+          RSGD  G+D    H    E L P   P HL           
Sbjct: 1158 FGNFRMGEYP---------RSGDFVGQDEFAGHFRRGEHLGPHNFPRHLQHGEPIGFGAH 1208

Query: 815  XXXXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPSPGFPNH-GLYAGDVDSF 991
                   EL GF     F +     +PG P  GEPGFRS +   GFPN  G   GD+ SF
Sbjct: 1209 PGHMRAVELDGFRSFESFSKG---GRPGHPQLGEPGFRSSFSLTGFPNDAGFLTGDIRSF 1265

Query: 992  DRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTF-S 1168
            D  R++K  SMGWCRICK DCETVEGLD+HSQT+EHQ MAMD+V++IK QN KKQK   S
Sbjct: 1266 DNLRRKKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDIVKTIK-QNAKKQKLIPS 1324

Query: 1169 DRASVEEKGKTRKAVFEGRGRK 1234
            +  S++E  KT     EGRG K
Sbjct: 1325 EEPSMDEGNKTHNTGIEGRGNK 1346


>gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 975

 Score =  187 bits (476), Expect = 8e-45
 Identities = 145/415 (34%), Positives = 194/415 (46%), Gaps = 22/415 (5%)
 Frame = +2

Query: 56   GNQPNPFPPGSSDEVPFGQP-RSMESPWDKRLKA-PMGKHLS--PLP----HDQASRPLD 211
            G +  P     S++ P  +  R     +++ LK  P   HL   P+P    +  +SRPLD
Sbjct: 611  GERLKPVQDECSNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLD 670

Query: 212  KPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSM----HFKDSGEREAPLGLYDDDRKR 379
            + P G G   G + +                 S      H  D+GER  P+GL  D   R
Sbjct: 671  RGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGER--PVGLPKDTLGR 728

Query: 380  AG-----SGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSGPFN 544
                     +G H MD   +R+P  E+  I P GF  H   ++I GRE            
Sbjct: 729  PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGHPG-DEIDGRER----------- 776

Query: 545  LPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKDV 724
                       FSD RF  LPG+ H G  +      SS+     R  +H+RS D+  +D 
Sbjct: 777  ----------RFSD-RFPGLPGHLHRGGFE------SSD-----RMEEHLRSRDMINQDN 814

Query: 725  -PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKPGM 901
             P++    E +    +P HL                +GE  GFGD    +       PG 
Sbjct: 815  RPAYFRRGEHVGHHNMPGHLR---------------LGEPIGFGDFSSHERIGEFGGPGN 859

Query: 902  ---PLFGEPGFRSRYPSPGFPNHG-LYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEG 1069
               P  GEPGFRS +    FPN G +Y G +DSF+  RKRKP+SMGWCRICK DCETVEG
Sbjct: 860  FRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEG 919

Query: 1070 LDMHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            LD+HSQTREHQ MAMDMV +IK+  +K++ T SD +   +  K++   FEGR  K
Sbjct: 920  LDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIRNDTSKSKNVKFEGRVNK 974


>gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786596|gb|EOY33852.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  187 bits (476), Expect = 8e-45
 Identities = 145/415 (34%), Positives = 194/415 (46%), Gaps = 22/415 (5%)
 Frame = +2

Query: 56   GNQPNPFPPGSSDEVPFGQP-RSMESPWDKRLKA-PMGKHLS--PLP----HDQASRPLD 211
            G +  P     S++ P  +  R     +++ LK  P   HL   P+P    +  +SRPLD
Sbjct: 1044 GERLKPVQDECSNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLD 1103

Query: 212  KPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSM----HFKDSGEREAPLGLYDDDRKR 379
            + P G G   G + +                 S      H  D+GER  P+GL  D   R
Sbjct: 1104 RGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGER--PVGLPKDTLGR 1161

Query: 380  AG-----SGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSGPFN 544
                     +G H MD   +R+P  E+  I P GF  H   ++I GRE            
Sbjct: 1162 PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGHPG-DEIDGRER----------- 1209

Query: 545  LPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKDV 724
                       FSD RF  LPG+ H G  +      SS+     R  +H+RS D+  +D 
Sbjct: 1210 ----------RFSD-RFPGLPGHLHRGGFE------SSD-----RMEEHLRSRDMINQDN 1247

Query: 725  -PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKPGM 901
             P++    E +    +P HL                +GE  GFGD    +       PG 
Sbjct: 1248 RPAYFRRGEHVGHHNMPGHLR---------------LGEPIGFGDFSSHERIGEFGGPGN 1292

Query: 902  ---PLFGEPGFRSRYPSPGFPNHG-LYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEG 1069
               P  GEPGFRS +    FPN G +Y G +DSF+  RKRKP+SMGWCRICK DCETVEG
Sbjct: 1293 FRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEG 1352

Query: 1070 LDMHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            LD+HSQTREHQ MAMDMV +IK+  +K++ T SD +   +  K++   FEGR  K
Sbjct: 1353 LDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIRNDTSKSKNVKFEGRVNK 1407


>gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao]
          Length = 972

 Score =  184 bits (466), Expect = 1e-43
 Identities = 148/415 (35%), Positives = 193/415 (46%), Gaps = 22/415 (5%)
 Frame = +2

Query: 56   GNQPNPFPPGSSDEVPFGQP-RSMESPWDKRLKA-PMGKHLS--PLP----HDQASRPLD 211
            G +  P     S++ P  +  R     +++ LK  P   HL   P+P    +  +SRPLD
Sbjct: 611  GERLKPVQDECSNQFPLDRGHRGDRGQFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLD 670

Query: 212  KPPRGLGYHFGSKFEASAGVXXXXXXXXXXXXSSM----HFKDSGEREAPLGLYDDDRKR 379
            + P G G   G + +                 S      H  D+GER  P+GL  D   R
Sbjct: 671  RGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYHPDDTGER--PVGLPKDTLGR 728

Query: 380  AG-----SGFGVHHMDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSGPFN 544
                     +G H MD   +R+P  E+  I P GF  H   ++I GRE            
Sbjct: 729  PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPHGFGGHPG-DEIDGRER----------- 776

Query: 545  LPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKDV 724
                       FSD RF  LPG+ H G  +      SS+     R  +H+RS D+  +D 
Sbjct: 777  ----------RFSD-RFPGLPGHLHRGGFE------SSD-----RMEEHLRSRDMINQDN 814

Query: 725  -PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKPGM 901
             P++    E +    +P HL                +GE  GFGD    +       PG 
Sbjct: 815  RPAYFRRGEHVGHHNMPGHLR---------------LGEPIGFGDFSSHERIGEFGGPGN 859

Query: 902  ---PLFGEPGFRSRYPSPGFPNHG-LYAGDVDSFDRPRKRKPVSMGWCRICKADCETVEG 1069
               P  GEPGFRS +    FPN G +Y G +DSF+  RKRKP+SMGWCRICK DCETVEG
Sbjct: 860  FRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEG 919

Query: 1070 LDMHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRASVEEKGKTRKAVFEGRGRK 1234
            LD+HSQTREHQ MAMDMV +IK QN KKQK   D +   +  K++   FEGR  K
Sbjct: 920  LDLHSQTREHQKMAMDMVVTIK-QNAKKQKL--DHSIRNDTSKSKNVKFEGRVNK 971


>gb|ACU17648.1| unknown [Glycine max]
          Length = 257

 Score =  182 bits (462), Expect = 3e-43
 Identities = 110/279 (39%), Positives = 141/279 (50%), Gaps = 3/279 (1%)
 Frame = +2

Query: 407  MDYMSARNPDGEFFNIPPRGFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSD 586
            MD +++R+P  E+  +  R    H+             +  +G  +    +A   G F D
Sbjct: 1    MDGIASRSPVSEYAEMSSRRLGPHAG----------SLVGKAGIDDFEGRVARRFGEFHD 50

Query: 587  GRFQSLPGNSHGGEIDGLGDLRSSEHTTFGRPYKHVRSGDLFGKD-VPSHLHHAEPLDPQ 763
             RF  LP + H  + DG G+ R  EH          RSGD  G+D    H    E L P 
Sbjct: 51   SRFPHLPSHLHRDDFDGFGNFRMGEHP---------RSGDFIGQDEFGGHFRRGEHLAPH 101

Query: 764  KIPSHLXXXXXXXXXXXXXXXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPS 943
              P HL                  EL GF     F +     +PG P  GEPGFRS +  
Sbjct: 102  NFPRHLQLGEPIGFGAHPGHMRAVELDGFRGFESFGKG---GRPGHPQLGEPGFRSSFSL 158

Query: 944  PGFPNHGLY-AGDVDSFDRPRKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDM 1120
            PGFPN   +  GD+   D  R+RK  SMGWCRICK DCETVEGLD+HSQT+EHQ MAMD+
Sbjct: 159  PGFPNDARFLTGDIRLLDNLRRRKASSMGWCRICKVDCETVEGLDLHSQTKEHQKMAMDI 218

Query: 1121 VRSIKEQNRKKQKTF-SDRASVEEKGKTRKAVFEGRGRK 1234
            V++IK QN KKQK   S+++S++E  KT     EGRG K
Sbjct: 219  VKTIK-QNAKKQKLIPSEQSSIDEGNKTHNTSIEGRGNK 256


>ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa]
            gi|222845587|gb|EEE83134.1| hypothetical protein
            POPTR_0001s25430g [Populus trichocarpa]
          Length = 1327

 Score =  178 bits (451), Expect = 6e-42
 Identities = 144/438 (32%), Positives = 195/438 (44%), Gaps = 37/438 (8%)
 Frame = +2

Query: 32   NQRVNRFDGNQPNPFPPGSSDEVPFGQPRSMESPWDKRLK---APMGKHLSPLP----HD 190
            + R   F     NPFP   +      +  + +  +++ LK   AP      P+P    H 
Sbjct: 940  SDRFRSFPDEHLNPFPHDPA------RRNAHQGEFEEDLKHFTAPSCLDTKPVPKSGGHF 993

Query: 191  QASRPLDKPPRGLG----------------YHFGSKFEASAGVXXXXXXXXXXXXSSMHF 322
             +SRPLD+ P G G                Y  G   E   G              ++H 
Sbjct: 994  SSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDSGLNVEPLGGSAPPRFFPPIHHDRTLH- 1052

Query: 323  KDSGEREAPLGLYDD-------DRKRAG------SGFGVHHMDYMSARNPDGEFFNIPPR 463
                E E  LG +D+        R R G       G+    MD ++ R+P  ++  +  +
Sbjct: 1053 --RSEAEGSLGFHDNLAGRTDFARTRPGLLGPPMPGYDHRDMDNLAPRSPGRDYPGMSMQ 1110

Query: 464  GFVSHSSFEDIGGREPCQFIEGSGPFNLPSNLAGGGGLFSDGRFQSLPGNSHGGEIDGLG 643
             F +    +DI GR P +    S P  + S+L        D RF   P +   GE++G G
Sbjct: 1111 RFGALPGLDDIDGRAPQR---SSDP--ITSSL-------HDSRFPLFPSHLRRGELNGPG 1158

Query: 644  DLRSSEHTTFGRPYKHVRSGDLFGKDV-PSHLHHAEPLDPQKIPSHLXXXXXXXXXXXXX 820
            +    EH           SGDL G D  P+HL   E L P+  PSHL             
Sbjct: 1159 NFHMGEHL----------SGDLMGHDGWPAHLRRGERLGPRNPPSHLRLGERGGFGSFPG 1208

Query: 821  XXYMGELSGFGDIPCFDESIGRSKPGMPLFGEPGFRSRYPSPGFPNHGLYAGDVDSFDRP 1000
               MGEL+G G++  + + +G          EPGFRS +        G YAGD+   +  
Sbjct: 1209 HARMGELAGPGNL--YHQQLG----------EPGFRSSFG-------GSYAGDLQYSENS 1249

Query: 1001 RKRKPVSMGWCRICKADCETVEGLDMHSQTREHQDMAMDMVRSIKEQNRKKQKTFSDRAS 1180
            RKRK  SMGWCRICK DCET EGLD+HSQTREHQ MAMDMV +IK+  +K +   SD +S
Sbjct: 1250 RKRKS-SMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVVTIKQNVKKHKSAPSDHSS 1308

Query: 1181 VEEKGKTRKAVFEGRGRK 1234
            +E+  K R A FEGRG K
Sbjct: 1309 LEDTSKLRNASFEGRGNK 1326

