BLASTX nr result
ID: Sinomenium21_contig00008790
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00008790
         (1652 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]             340  1e-90
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...  302  3e-79
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...  302  3e-79
ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prun...  300  1e-78
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]  279  2e-72
ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [A...  275  4e-71
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...  273  2e-70
ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227...  270  1e-69
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...  270  1e-69
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...  270  1e-69
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...  270  1e-69
ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma...  261  9e-67
ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma...  261  9e-67
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...  258  7e-66
ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma...  253  1e-64
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...  239  3e-60
ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phas...  219  2e-54
gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]    214  7e-53
ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II tra...
210 1e-51 ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249... 207 1e-50 >emb|CBI16022.3| unnamed protein product [Vitis vinifera] Length = 1669 Score = 340 bits (871), Expect = 1e-90 Identities = 218/508 (42%), Positives = 269/508 (52%), Gaps = 61/508 (12%) Frame = -1 Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 DGRQ DSH+PGS+E FGQPS + N M++NGG G + S+P GLQ+ER+K + Sbjct: 1203 DGRQSDSHIPGSSERGPFGQPSGVQSNMMRMNGGLGI-------ESSLPVGLQDERFKSL 1255 Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293 P EPGR +F EDLKQF RS+ LDS+ VPKF Sbjct: 1256 P-------------------------EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 1290 Query: 1292 SYFS--RP-----------------DRASHGFNHDVGLKLDGNDNAPRLLPPYQPGS--- 1179 +YFS RP D+A GFN+D G K R PP PG Sbjct: 1291 NYFSSSRPLDRGSQGFVMDAAQGLLDKAPLGFNYDSGFKSSAGTGTSRFFPPPHPGGDGE 1350 Query: 1178 -LRPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSSRF-- 1014 R + +DN+ R D+A P+FL S GR+ +DG RSP RE+ P F Sbjct: 1351 RSRAVGFHEDNVGRS-DMAR-THPNFLGSVPEYGRHHMDGLNPRSPTREFSGIPHRGFGG 1408 Query: 1013 --------RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS 858 L+D DGRE F E SK+FNLPS+ E+RFP+LPSHLR+GE +G G Sbjct: 1409 LSGVPGRQSDLDDIDGRESRRFGEGSKTFNLPSD-----ESRFPVLPSHLRRGELEGPGE 1463 Query: 857 L-----------PARLRGGDLIGSNVPPGRLQSGEPIGHRNLPN--------------HL 753 L P LRGGDLIG ++ P LQ GE G RN+P H Sbjct: 1464 LVMADPIASRPAPHHLRGGDLIGQDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGHP 1523 Query: 752 HRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG-NLPSRARGAESGFSSGFPIHGY 576 G+++G G F +R ++ FGG N R E GF S + +HGY Sbjct: 1524 RMGELSGPGNFPSRLSAGES--------------FGGSNKSGHPRIGEPGFRSTYSLHGY 1569 Query: 575 QNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVL 396 ND GF GD+ESFD SRKRK SM WCRIC +DCETV+GLDMHSQTREHQ+MAMD+VL Sbjct: 1570 PNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVL 1629 Query: 395 SIKKDNVKKQKVSSDDHKSHEDGSKSSK 312 SIK+ N KKQK++S DH + ED SKS K Sbjct: 1630 SIKQQNAKKQKLTSKDHSTPEDSSKSKK 1657 >ref|XP_006488440.1| 
PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X1 [Citrus sinensis] gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X3 [Citrus sinensis] gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of RNA polymerase II transcription subunit 15-like isoform X4 [Citrus sinensis] Length = 1392 Score = 302 bits (774), Expect = 3e-79 Identities = 206/480 (42%), Positives = 251/480 (52%), Gaps = 27/480 (5%) Frame = -1 Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 DGR+ DSH PGS + G PS N M++NGGPG L Sbjct: 985 DGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELR-------------------- 1024 Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293 DERFK P+ R PF ++P R +I R EFEEDLKQF R + LD+E VPK Sbjct: 1025 -DERFKSFPD--------GRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLG 1075 Query: 1292 SYF--SRP-DRASHGF-------------NHDVGLKLD--GNDNAPRLLPPYQPGSLRPL 1167 S+F SRP DR HG+ ++D GLKLD G R LP Y Sbjct: 1076 SHFLPSRPFDRGPHGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYH------- 1128 Query: 1166 DLCDDNMDRRVDIAAGVPPDFLRS--ASGRNRIDGFPLRSPGREY-------PSHPSSRF 1014 D+ R D ++ PDF R A GR + G RS RE+ S SR Sbjct: 1129 ----DDAAGRSD-SSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRS 1183 Query: 1013 RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGG 834 R ED GRE F + GN+FH++RFP+LPSHLR+GE +G G R G Sbjct: 1184 VR-EDIGGREFRRFGDPI--------GNSFHDSRFPVLPSHLRRGEFEGPG------RTG 1228 Query: 833 DLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXX 654 DLIG P L+ GEP+G P++L G+ G GGF A++ + Sbjct: 1229 DLIGQEFLPSHLRRGEPLG----PHNLRLGETVGLGGFPGPARMEELGGP---------- 1274 Query: 653 SFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKV 474 GN P G E GF S F G+ NDGGF+ GD+ES D SRKRK SMGWCRICKV Sbjct: 1275 ---GNFPPPRLG-EPGFRSSFSRQGFPNDGGFYT-GDMESIDNSRKRKPPSMGWCRICKV 
1329 Query: 473 DCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 DCETV+GLD+HSQTREHQKMAMDMVLSIK+ N KKQK++S D S +D +KS F+ R Sbjct: 1330 DCETVDGLDLHSQTREHQKMAMDMVLSIKQ-NAKKQKLTSGDRCSTDDANKSRNVNFDGR 1388 >ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] gi|557526921|gb|ESR38227.1| hypothetical protein CICLE_v10027683mg [Citrus clementina] Length = 1392 Score = 302 bits (774), Expect = 3e-79 Identities = 206/480 (42%), Positives = 251/480 (52%), Gaps = 27/480 (5%) Frame = -1 Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 DGR+ DSH PGS + G PS N M++NGGPG L Sbjct: 985 DGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELR-------------------- 1024 Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293 DERFK P+ R PF ++P R +I R EFEEDLKQF R + LD+E VPK Sbjct: 1025 -DERFKSFPD--------GRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLG 1075 Query: 1292 SYF--SRP-DRASHGF-------------NHDVGLKLD--GNDNAPRLLPPYQPGSLRPL 1167 S+F SRP DR HG+ ++D GLKLD G R LP Y Sbjct: 1076 SHFLPSRPFDRGPHGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYH------- 1128 Query: 1166 DLCDDNMDRRVDIAAGVPPDFLRS--ASGRNRIDGFPLRSPGREY-------PSHPSSRF 1014 D+ R D ++ PDF R A GR + G RS RE+ S SR Sbjct: 1129 ----DDAAGRSD-SSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRS 1183 Query: 1013 RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGG 834 R ED GRE F + GN+FH++RFP+LPSHLR+GE +G G R G Sbjct: 1184 VR-EDIGGREFRRFGDPI--------GNSFHDSRFPVLPSHLRRGEFEGPG------RTG 1228 Query: 833 DLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXX 654 DLIG P L+ GEP+G P++L G+ G GGF A++ + Sbjct: 1229 DLIGQEFLPSHLRRGEPLG----PHNLRLGETVGLGGFPGPARMEELGGP---------- 1274 Query: 653 SFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKV 474 GN P G E GF S F G+ NDGGF+ GD+ES D SRKRK SMGWCRICKV Sbjct: 1275 ---GNFPPPRLG-EPGFRSSFSHQGFPNDGGFYT-GDMESIDNSRKRKPPSMGWCRICKV 1329 Query: 473 DCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 DCETV+GLD+HSQTREHQKMAMDMVLSIK+ N 
KKQK++S D S +D +KS F+ R Sbjct: 1330 DCETVDGLDLHSQTREHQKMAMDMVLSIKQ-NAKKQKLTSGDRCSTDDANKSRNVNFDGR 1388 >ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica] gi|462400592|gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica] Length = 1334 Score = 300 bits (769), Expect = 1e-78 Identities = 204/467 (43%), Positives = 249/467 (53%), Gaps = 18/467 (3%) Frame = -1 Query: 1637 DSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERF 1458 DSH + GQPS + PN +++NG PG D S G ++ER+K P Sbjct: 931 DSHGGMMSRAAPIGQPSGIHPNMLRMNGTPGL-------DSSSTHGPRDERFKAFP---- 979 Query: 1457 KRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSR 1278 G+R PF ++P RH+I R EFE+DLKQFPR + LDSE V KF +Y SR Sbjct: 980 ------------GERLNPFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAKFGNYSSR 1027 Query: 1277 P-DRASHGFNHDVGLKLDG-NDNAP-RLLPPYQ-PGSLRPLDLCDDNMDRRVDIAAGVPP 1110 P DRA HGF +D G D AP R L PY+ GS+ D D R++ G P Sbjct: 1028 PFDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLGGSVHGNDAGDFG---RMEPTHG-HP 1083 Query: 1109 DFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRL--EDSDGRELHVFSEQSKSFNLPSE 936 DF+ GR +DG RSP R+YP P FR +D DGRE H F + Sbjct: 1084 DFV----GRRLVDGLAPRSPVRDYPGLPPHGFRGFGPDDFDGREFHRFGDPL-------- 1131 Query: 935 GNAFHENRFPILPSHLRKGESDGSGSLP-ARLRGGDLIGSNVPPGRLQSGEPIGHRNL-- 765 GN FHE RF LP H R+GE +G G+L R D IG + PG L+ G+ +G NL Sbjct: 1132 GNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDHRRNDFIGQDGHPGHLRRGDHLGPHNLRE 1191 Query: 764 -----PNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFS 600 H H G +AG G F F GN P+ R E GF Sbjct: 1192 PLGFGSRHSHMGDMAGPGNFE---------------------PFRGNRPNHPRLGEPGFR 1230 Query: 599 SGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQ 420 S F + + NDG + GD+ESFD SRKRK SMGWCRICKVDCETVEGLD+HSQTREHQ Sbjct: 1231 SSFSLQRFPNDGTY--TGDLESFDHSRKRKPASMGWCRICKVDCETVEGLDLHSQTREHQ 1288 Query: 419 KMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSS----KAGFESRD 291 KMAMDMV SIK+ N KKQK++S D ED +KS +AG +S D Sbjct: 1289 KMAMDMVRSIKQ-NAKKQKLTSGDQSLLEDANKSKIPVLRAGEKSID 1334 >emb|CAN64434.1| hypothetical protein 
VITISV_000937 [Vitis vinifera] Length = 1131 Score = 279 bits (714), Expect = 2e-72 Identities = 187/451 (41%), Positives = 232/451 (51%), Gaps = 4/451 (0%) Frame = -1 Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 DGRQ DSH+PGS+E FGQPS N M++NGG G + S+P GLQ+ER+K + Sbjct: 774 DGRQSDSHIPGSSERGPFGQPSGXQSNMMRMNGGLGI-------ESSLPVGLQDERFKSL 826 Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293 P EPGR +F EDLKQF RS+ LDS+ VPKF Sbjct: 827 P-------------------------EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 861 Query: 1292 SYFS--RP-DRASHGFNHDVGLKLDGNDNAPRLLPPYQPGSLRPLDLCDDNMDRRVDIAA 1122 +YFS RP DR S GF D L D AP Sbjct: 862 NYFSSSRPLDRGSQGFVMDAAQGL--LDKAPL---------------------------- 891 Query: 1121 GVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRLEDSDGRELHVFSEQSKSFNLP 942 GF S + +SR L+D DGRE F E ++FNLP Sbjct: 892 -----------------GFNYDSGFKSSAGTGTSRQSDLDDIDGRESRRFGEGYQTFNLP 934 Query: 941 SEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGGDLIGSNVPPGRLQSGEPIGHRNLP 762 S+ E+RFP+LPSHLR+ LP+ L+ G+ GS PG+L+ GEP+ L Sbjct: 935 SD-----ESRFPVLPSHLRRD------ILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFL- 982 Query: 761 NHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG-NLPSRARGAESGFSSGFPI 585 H G+++G G F +R ++ FGG N R E GF S + + Sbjct: 983 GHPRMGELSGPGNFPSRLSAGES--------------FGGSNKSGHPRIGEPGFRSTYSL 1028 Query: 584 HGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMD 405 HGY ND GF GD+ESFD SRKRK SM WCRIC +DCETV+GLDMHSQTREHQ+MAMD Sbjct: 1029 HGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMD 1088 Query: 404 MVLSIKKDNVKKQKVSSDDHKSHEDGSKSSK 312 +VLSIK+ N KKQK++S DH + ED SKS K Sbjct: 1089 IVLSIKQQNAKKQKLTSKDHSTPEDSSKSKK 1119 >ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda] gi|548851351|gb|ERN09627.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda] Length = 1626 Score = 275 bits (703), Expect = 4e-71 Identities = 198/490 (40%), Positives = 251/490 (51%), Gaps = 37/490 (7%) Frame = -1 Query: 1652 
DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 DGRQPD H PS+ P + +NG GK S + + P GL EER+ + Sbjct: 1182 DGRQPDVHQ---------SLPSDRAPYGL-VNGAAGK--GSNVPESAFPHGLPEERFGPL 1229 Query: 1472 PDERFKRLPEEGFNM-LPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKF 1296 P++RFK LPE+G LP D F+P+ ++P R I RREFEEDLK+FPRS LD E ++ Sbjct: 1230 PEDRFKHLPEDGLKKPLPDDHFRPYALDPSRRAIDRREFEEDLKKFPRSGHLDGEPASRY 1289 Query: 1295 ESYFSRPDRASHGFN--HDVGLKLDGNDNAPRL-----LPPYQPGSLRPLDLCD------ 1155 + YFS + + H GL LD APR +PPY+ LDL D Sbjct: 1290 DGYFSSRNPSGHSPRSLERPGLNLD----APRYPEGMSVPPYRGAGGSSLDLGDRSKPGG 1345 Query: 1154 ---DNMDRRVDIAAGVPPDFLRSAS--GRNRIDGF-PLRSPGREYPSHPSSRFRR----- 1008 D + R++D G D+ R+ DG P RSP R+Y S R Sbjct: 1346 FHGDLIGRKLD-TTGARSDYGGPFPEVSRSHRDGLGPPRSPVRDYAGVRVSGVRPDYAGI 1404 Query: 1007 ---LEDSDGRELHVFSEQ-SKSFNLPSEGNAFHENRFPI-LPSHLRKGESDGSGSLPARL 843 L+ GRE F EQ +++F P G F LP R ES G G P L Sbjct: 1405 PHPLDGLGGREPLGFGEQRARAFLDPIHGGKIPSGPFESRLPIPSRIAESAGFGDFPGHL 1464 Query: 842 RGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXX 663 RGGD G P +SGE LP+HL ++AG G ++ +A Sbjct: 1465 RGGDPFG----PSHFRSGE------LPSHLRGRELAGSGNLPPHLRIGEAMGP------- 1507 Query: 662 XXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAG-----DVESFDLSRKRKLGSM 498 GG+L GF + GY DGGF+N G DV++ + SRKRK GS Sbjct: 1508 -----GGHLRE----------PGFGMQGYPKDGGFYNPGSFPPSDVDALEYSRKRKPGST 1552 Query: 497 GWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKV--SSDDHKSHEDGS 324 GWCRICKVDCETVEGLD+HSQTREHQKMAMDMVLSIK+D+ KKQK+ SS+DH E+ + Sbjct: 1553 GWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQDSAKKQKLYGSSEDHVPQEEPT 1612 Query: 323 KSSKAGFESR 294 K +A FESR Sbjct: 1613 KGRRASFESR 1622 >ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca subsp. 
vesca] Length = 1316 Score = 273 bits (697), Expect = 2e-70 Identities = 190/441 (43%), Positives = 237/441 (53%), Gaps = 6/441 (1%) Frame = -1 Query: 1598 GQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERFKRLPEEGFNMLPG 1419 GQPS + N +++NG PG +S GL++ER+K +PD R N PG Sbjct: 939 GQPSGIISNMLRMNGNPGFESSS-------TLGLRDERFKALPDGRL--------NPFPG 983 Query: 1418 DRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSRP-DRASHGFNHDV 1242 D P R +I+R FE+DLKQFPR + LDSE +PK +Y SR DR G N+D Sbjct: 984 D--------PTR-VISRVGFEDDLKQFPRPSFLDSEPLPKLGNYSSRAFDRRPFGVNYDT 1034 Query: 1241 GLKLD-GNDNAPRLLPPYQPGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNRIDGF 1065 L +D +APR L PY L +D + PDF GR +DG Sbjct: 1035 RLNIDPAAGSAPRFLSPYGHAGLIH---ANDTIGH---------PDF----GGRRLMDGL 1078 Query: 1064 PLRSPGREYPSHPSSRFRRL--EDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSH 891 RSP R+YP PS RFR +D DGRE H F + G FH+NRFP H Sbjct: 1079 ARRSPIRDYPGIPS-RFRGFGPDDFDGREFHRFGDPL--------GREFHDNRFP--NQH 1127 Query: 890 LRKGESDGSGSLPA--RLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFH 717 R+GE +G G++ R+R DLIG + G LQ GE +G NLP HLH + GFG Sbjct: 1128 FRRGEFEGPGNMRVDDRMRN-DLIGQDGHLGHLQRGEHLGPHNLPGHLHMREHVGFGVHP 1186 Query: 716 NRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVE 537 A SF GN + R E GF S F + + NDG + AG++E Sbjct: 1187 RHA------------GPGSFESFIGNRANHPRLGEPGFRSSFSLKRFPNDGTY--AGELE 1232 Query: 536 SFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVS 357 SFD SRKRK SMGWCRICKV+CETVEGLD+HSQTREHQ+MAM+MV I K N KKQK++ Sbjct: 1233 SFDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMV-QIIKQNAKKQKLT 1291 Query: 356 SDDHKSHEDGSKSSKAGFESR 294 S D S ED +KS ES+ Sbjct: 1292 SGDQSSIEDANKSKITSSESQ 1312 >ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus] Length = 538 Score = 270 bits (691), Expect = 1e-69 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%) Frame = -1 Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467 DSHLPG+ EH P ++ PPN + LNG PG D S GL+ D Sbjct: 128 
DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 168 Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287 ERFK L EE N P ++P R I + + E+ L+QFPR + L+SE + +Y Sbjct: 169 ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 220 Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131 RP DR HG N D GL +DG A R+LPP Y + RP+ +D+ + D Sbjct: 221 SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 278 Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960 + G + GR +DGF RSP EY F +E+ DG++ H F + Sbjct: 279 RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 338 Query: 959 KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786 +F E+RFPI SHL++G+ + SG+ + LR GDLIG + Sbjct: 339 ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 379 Query: 785 PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606 G R+LP HL G++ FG +++ D GG+ P+ R E G Sbjct: 380 HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 434 Query: 605 FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426 F S F G +DG FF AGDVESFD SRKRK SMGWCRICKVDCETVEGL++HSQTRE Sbjct: 435 FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 493 Query: 425 HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS G ESR Sbjct: 494 HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 534 >ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus] Length = 1177 Score = 270 bits (691), Expect = 1e-69 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%) Frame = -1 Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467 DSHLPG+ EH P ++ PPN + LNG PG D S GL+ D Sbjct: 767 DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 807 Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287 ERFK L EE N P ++P R I + + E+ L+QFPR + L+SE + +Y Sbjct: 808 
ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 859 Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131 RP DR HG N D GL +DG A R+LPP Y + RP+ +D+ + D Sbjct: 860 SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 917 Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960 + G + GR +DGF RSP EY F +E+ DG++ H F + Sbjct: 918 RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 977 Query: 959 KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786 +F E+RFPI SHL++G+ + SG+ + LR GDLIG + Sbjct: 978 ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 1018 Query: 785 PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606 G R+LP HL G++ FG +++ D GG+ P+ R E G Sbjct: 1019 HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 1073 Query: 605 FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426 F S F G +DG FF AGDVESFD SRKRK SMGWCRICKVDCETVEGL++HSQTRE Sbjct: 1074 FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 1132 Query: 425 HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS G ESR Sbjct: 1133 HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 1173 >ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus] Length = 1434 Score = 270 bits (691), Expect = 1e-69 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%) Frame = -1 Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467 DSHLPG+ EH P ++ PPN + LNG PG D S GL+ D Sbjct: 1024 DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 1064 Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287 ERFK L EE N P ++P R I + + E+ L+QFPR + L+SE + +Y Sbjct: 1065 ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 1116 Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131 RP DR HG N D GL +DG A R+LPP Y + RP+ +D+ + D Sbjct: 1117 
SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 1174 Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960 + G + GR +DGF RSP EY F +E+ DG++ H F + Sbjct: 1175 RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 1234 Query: 959 KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786 +F E+RFPI SHL++G+ + SG+ + LR GDLIG + Sbjct: 1235 ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 1275 Query: 785 PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606 G R+LP HL G++ FG +++ D GG+ P+ R E G Sbjct: 1276 HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 1330 Query: 605 FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426 F S F G +DG FF AGDVESFD SRKRK SMGWCRICKVDCETVEGL++HSQTRE Sbjct: 1331 FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 1389 Query: 425 HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS G ESR Sbjct: 1390 HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 1430 >ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis] gi|223540292|gb|EEF41863.1| hypothetical protein RCOM_0731250 [Ricinus communis] Length = 1329 Score = 270 bits (690), Expect = 1e-69 Identities = 181/462 (39%), Positives = 233/462 (50%), Gaps = 25/462 (5%) Frame = -1 Query: 1598 GQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERFKRLPEEGFNMLPG 1419 GQ S M NAM++NG PG D S GL+++R++ DE Sbjct: 935 GQQSGMHSNAMRMNGAPG-------MDSSSALGLRDDRFRPFSDEYMN------------ 975 Query: 1418 DRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFS--RP------DRAS 1263 PF +P + I+ RREFEEDLK F R + LD++ KF + FS RP D+ Sbjct: 976 ----PFPKDPSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGPLDKGL 1031 Query: 1262 HGFNHDVGLKLDGNDNAP--RLLPPYQ-PGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSA 1092 HG N+D G+KL+ P R PPY G + P D+ + ++ D G PD +R+ Sbjct: 1032 HGPNYDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIGFH-DNTLGRQPDSVRAH 1090 Query: 1091 S---------GRNRIDGFPLRSPGREYPSHPSSRFRR---LEDSDGRELHVFSEQSKSFN 948 R DG RSPGR+YP S F L+D 
DGRE F Sbjct: 1091 PEFFGPGRRYDRRHRDGMAPRSPGRDYPGVSSRGFGAIPGLDDIDGRESRRF-------- 1142 Query: 947 LPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGEPIGH 774 G++FH +RFP+LPSH+R GE +G R G+ +G + RL GEPIG Sbjct: 1143 ----GDSFHGSRFPVLPSHMRMGEFEGPSQDGFSNHFRRGEHLGHHNMRNRL--GEPIGF 1196 Query: 773 RNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSG 594 P G ++G G F N R E GF S Sbjct: 1197 GAFPGPAGMGDLSGTGNFFN-----------------------------PRLGEPGFRSS 1227 Query: 593 FPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKM 414 F G+ DGG + AG++ESFD SR+RK SMGWCRICKVDCETVEGLD+HSQTREHQK Sbjct: 1228 FSFKGFPGDGGIY-AGELESFDNSRRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKR 1286 Query: 413 AMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288 AMDMV++IK+ N KKQK++++DH S +D SKS E R N Sbjct: 1287 AMDMVVTIKQ-NAKKQKLANNDHSSVDDASKSKNTSIEGRGN 1327 >ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508786600|gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 975 Score = 261 bits (666), Expect = 9e-67 Identities = 175/427 (40%), Positives = 218/427 (51%), Gaps = 22/427 (5%) Frame = -1 Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323 GL + ER K + +E N P DR H R +FEEDLK FPR + Sbjct: 600 GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 650 Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188 LD+E VPKF SY S RP DR HGF D+G + + R LPPY Sbjct: 651 LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 710 Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020 P RP+ L D + R PDFL + GR+R+DGF RSPGREYP Sbjct: 711 PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 761 Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846 F ++ DGRE FS+ RFP LP HL +G + S + Sbjct: 762 GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 803 Query: 845 LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666 LR D+I + P + GE +GH N+P HL G+ GFG F + ++ 
+ Sbjct: 804 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 853 Query: 665 XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489 FGG R R E GF S F + + NDGG + G ++SF+ RKRK SMGWC Sbjct: 854 -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 907 Query: 488 RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309 RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK++S DH D SKS Sbjct: 908 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKLTSSDHSIRNDTSKSKNV 966 Query: 308 GFESRDN 288 FE R N Sbjct: 967 KFEGRVN 973 >ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590588563|ref|XP_007016233.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590588573|ref|XP_007016234.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786595|gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786596|gb|EOY33852.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1408 Score = 261 bits (666), Expect = 9e-67 Identities = 175/427 (40%), Positives = 218/427 (51%), Gaps = 22/427 (5%) Frame = -1 Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323 GL + ER K + +E N P DR H R +FEEDLK FPR + Sbjct: 1033 GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 1083 Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188 LD+E VPKF SY S RP DR HGF D+G + + R LPPY Sbjct: 1084 LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 1143 Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020 P RP+ L D + R PDFL + GR+R+DGF RSPGREYP Sbjct: 1144 PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 1194 Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846 F ++ DGRE FS+ RFP LP HL +G + S + Sbjct: 1195 GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 1236 Query: 845 
LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666 LR D+I + P + GE +GH N+P HL G+ GFG F + ++ + Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 1286 Query: 665 XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489 FGG R R E GF S F + + NDGG + G ++SF+ RKRK SMGWC Sbjct: 1287 -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 1340 Query: 488 RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309 RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK++S DH D SKS Sbjct: 1341 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKLTSSDHSIRNDTSKSKNV 1399 Query: 308 GFESRDN 288 FE R N Sbjct: 1400 KFEGRVN 1406 >ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] gi|550331020|gb|ERP56830.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa] Length = 1315 Score = 258 bits (658), Expect = 7e-66 Identities = 174/417 (41%), Positives = 220/417 (52%), Gaps = 34/417 (8%) Frame = -1 Query: 1436 FNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYF--SRP---- 1275 F+ LP + PF P H + + EFEEDLK FPR + LD+E VPK S+F SRP Sbjct: 931 FSSLPDEHLNPFPRGPAHHNVHQGEFEEDLKHFPRPSHLDTEPVPKSSSHFPSSRPLDRG 990 Query: 1274 -------------DRASHGFNHDVGLKLD--GNDNAPRLLPPYQPG-SLRPLDL-----C 1158 D+ SHGFN+D GL ++ G PR PPY +L P D Sbjct: 991 PRGFGVDGAPRPLDKGSHGFNYDSGLNMEPLGGSAPPRFFPPYHHDKALHPSDAEVSLGY 1050 Query: 1157 DDNMDRRVDIAAGVPPDFLRS---ASGRNRIDGFPLRSPGREYPSHPSSRFRRL---EDS 996 D++ R D A P FL +D RSP R+YP P+ RF L +D Sbjct: 1051 HDSLAGRSDFAR-TRPGFLGPPIPGYDHRHMDNLAPRSPVRDYPGMPTRRFGALPGLDDI 1109 Query: 995 DGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLP-ARLRGGDLIGS 819 DGR+ H F ++ S + ++RFP+ PSHLR+GE +G G+L GDL+G Sbjct: 1110 DGRDPHRFGDKFSS--------SLRDSRFPVFPSHLRRGELEGPGNLHMGEHLSGDLMGH 1161 Query: 818 NVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGN 639 + P L+ GE +G RNLP+HL G+ FG F A++ + GN Sbjct: 1162 DGRPAHLRRGEHLGPRNLPSHLWVGEPGNFGAFPGHARMGELAGP-------------GN 1208 Query: 638 
LPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETV 459 G E GF S F GG + AGD++ FD SRKRK SMGWCRICKVDCETV Sbjct: 1209 FYHHQLG-EPGFRSSF--------GGNY-AGDLQFFDNSRKRK-PSMGWCRICKVDCETV 1257 Query: 458 EGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288 E LD+HSQTREHQKMA+DMV++IK+ N KK K + H S ED SKS A FE R N Sbjct: 1258 EALDLHSQTREHQKMALDMVVTIKQ-NAKKHKSTPCHHSSLEDKSKSRNASFEGRGN 1313 >ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma cacao] gi|508786601|gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao] Length = 972 Score = 253 bits (647), Expect = 1e-64 Identities = 174/427 (40%), Positives = 216/427 (50%), Gaps = 22/427 (5%) Frame = -1 Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323 GL + ER K + +E N P DR H R +FEEDLK FPR + Sbjct: 600 GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 650 Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188 LD+E VPKF SY S RP DR HGF D+G + + R LPPY Sbjct: 651 LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 710 Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020 P RP+ L D + R PDFL + GR+R+DGF RSPGREYP Sbjct: 711 PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 761 Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846 F ++ DGRE FS+ RFP LP HL +G + S + Sbjct: 762 GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 803 Query: 845 LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666 LR D+I + P + GE +GH N+P HL G+ GFG F + ++ + Sbjct: 804 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 853 Query: 665 XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489 FGG R R E GF S F + + NDGG + G ++SF+ RKRK SMGWC Sbjct: 854 -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 907 Query: 488 RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309 RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK+ DH D SKS Sbjct: 908 
RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKL---DHSIRNDTSKSKNV 963 Query: 308 GFESRDN 288 FE R N Sbjct: 964 KFEGRVN 970 >ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] gi|222845587|gb|EEE83134.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa] Length = 1327 Score = 239 bits (610), Expect = 3e-60 Identities = 185/499 (37%), Positives = 236/499 (47%), Gaps = 45/499 (9%) Frame = -1 Query: 1649 GRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLAS--------GFQDPSIPFGLQ 1494 GR P H+P +G P +A G+R +S G Q PS P G Q Sbjct: 872 GRLPPGHMPSH-----YGPPQGPYTHAPT---SQGERTSSYVHETSMFGNQRPSYPGGRQ 923 Query: 1493 EERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDS 1314 + + + F P + PF +P R + EFEEDLK F + LD+ Sbjct: 924 GILSNAVGTNGAQDPNSDRFRSFPDEHLNPFPHDPARRNAHQGEFEEDLKHFTAPSCLDT 983 Query: 1313 EGVPKFESYFS--RP-----------------DRASHGFNHDVGLKLD--GNDNAPRLLP 1197 + VPK +FS RP D+ SHG N+D GL ++ G PR P Sbjct: 984 KPVPKSGGHFSSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDSGLNVEPLGGSAPPRFFP 1043 Query: 1196 PYQ----------PGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNR--IDGFPLRS 1053 P GSL DN+ R D A P G + +D RS Sbjct: 1044 PIHHDRTLHRSEAEGSLG----FHDNLAGRTDFARTRPGLLGPPMPGYDHRDMDNLAPRS 1099 Query: 1052 PGREYPSHPSSRFRRL---EDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRK 882 PGR+YP RF L +D DGR S+ S + H++RFP+ PSHLR+ Sbjct: 1100 PGRDYPGMSMQRFGALPGLDDIDGRAPQRSSDPITS--------SLHDSRFPLFPSHLRR 1151 Query: 881 GESDGSGSLP-ARLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAK 705 GE +G G+ GDL+G + P L+ GE +G RN P+HL G+ GFG F A+ Sbjct: 1152 GELNGPGNFHMGEHLSGDLMGHDGWPAHLRRGERLGPRNPPSHLRLGERGGFGSFPGHAR 1211 Query: 704 LSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDL 525 + + GNL + G E GF S F GG + AGD++ + Sbjct: 1212 MGELAGP-------------GNLYHQQLG-EPGFRSSF--------GGSY-AGDLQYSEN 1248 Query: 524 SRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDH 345 SRKRK SMGWCRICKVDCET EGLD+HSQTREHQKMAMDMV++IK+ NVKK K + DH Sbjct: 1249 SRKRK-SSMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVVTIKQ-NVKKHKSAPSDH 1306 Query: 344 
KSHEDGSKSSKAGFESRDN 288 S ED SK A FE R N Sbjct: 1307 SSLEDTSKLRNASFEGRGN 1325 >ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris] gi|561004393|gb|ESW03387.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris] Length = 1314 Score = 219 bits (559), Expect = 2e-54 Identities = 167/489 (34%), Positives = 218/489 (44%), Gaps = 44/489 (8%) Frame = -1 Query: 1622 GSAE--HVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSI----PFGLQEERYKQMPDER 1461 GSA H N PP K L GFQ S+ PF E + Sbjct: 877 GSAHDPHTGHASAENFPPTMFKQPQDSDITLGRGFQPQSLGPPQPFNQVHEPPFRAGTSN 936 Query: 1460 FKRLPEEGFNM-LPGD----------------------RFKPFLIEPGRHIIARREFEED 1350 F RL F LPGD RFKPFL+ + + RRE+++D Sbjct: 937 FSRLGGPQFGAPLPGDMHGRMAANLPPHGTEGLGLHDERFKPFLVS-NQQTMDRREYDDD 995 Query: 1349 LKQFPRSAQLDSEGVPKFESYF---SRPDRASHGFNHDVGLKLDGNDNAPRLLPPYQPGS 1179 LK+F R +D+E + K+ +Y + S G + DV +K G+ P L P PG Sbjct: 996 LKKFSR-LPMDAESISKYGNYSLSAHESGKRSVGIHDDV-IKKSGSALHPGYLGP-GPGY 1052 Query: 1178 LRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRF----- 1014 GR+ +DG RSP EY S R Sbjct: 1053 ------------------------------GRHHMDGMTPRSPVGEYAEMSSRRLGPHSG 1082 Query: 1013 -----RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--L 855 ++D DGR F G F ++RFP LPSHL + E DG G+ + Sbjct: 1083 SLIGKSGIDDFDGRVPRHF------------GGEFRDSRFPHLPSHLHRDEFDGFGNFRI 1130 Query: 854 PARLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXX 675 R GD IG + G + GEP+G N P HL G+ GFG + + Sbjct: 1131 GEHPRSGDFIGQDEYAGHFRRGEPLGPHNFPRHLQLGEPVGFGAHPGHMRAVEHGSFRSF 1190 Query: 674 XXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMG 495 G+ P + E GF S F + G+ ND GF GD+ SFD R+RK+ SMG Sbjct: 1191 ESFAK-----GSRPGHPQLGEPGFRSSFSLPGFPNDAGFLT-GDIRSFDNLRRRKVSSMG 1244 Query: 494 WCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSS 315 WCRICK DCETVEGLD+HSQT+EHQKMAMDMV +IK+ N KKQK+ + + ++G+K+ Sbjct: 1245 WCRICKADCETVEGLDLHSQTKEHQKMAMDMVKTIKQ-NAKKQKLIPSEQPTVDEGNKTH 1303 Query: 314 KAGFESRDN 288 GFE R N Sbjct: 1304 NTGFEGRGN 1312 
>gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis] Length = 1320 Score = 214 bits (546), Expect = 7e-53 Identities = 177/475 (37%), Positives = 220/475 (46%), Gaps = 20/475 (4%) Frame = -1 Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473 D R PD H GS EH Q + PN ++N G D G ++ER Sbjct: 924 DSRGPDPHFAGSLEHGAHSQSFGIHPNMTRMNDSHGF-------DSLSTLGPRDER---- 972 Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293 F PF P R EFE+DLKQFPR Sbjct: 973 --------------------FNPFPAGPN----PRAEFEDDLKQFPRPF----------- 997 Query: 1292 SYFSRPDRASHGFNHDVGLKLD-GNDNAP-RLLPPYQPGSLRPLDLCDDNMDR----RVD 1131 DR HG + GLK+D G + P R L PY G +D DR R D Sbjct: 998 ------DRGLHGLKYHTGLKMDSGVGSVPSRSLSPYNGGG------ANDGGDRLGWHRGD 1045 Query: 1130 IAAGVPP-----DFLRSASG--RNRIDGFPLRSPGREYPSHPSSRF--RRLEDSDGRELH 978 + P DFL G R R+D RSP RE+P F +D GREL Sbjct: 1046 AFGRMDPTRGHLDFLGPGLGYDRRRMDSLASRSPIREHPGISLRGFVGPGPDDIHGRELR 1105 Query: 977 VFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPA--RLRGGDLIGSNVPPG 804 F E S +FHE+RF +LP HLR+GE +G ++ LR DLIG + G Sbjct: 1106 RFGEPFDS--------SFHESRFSMLPGHLRRGEFEGPRNMGMGDHLRN-DLIGRDGLSG 1156 Query: 803 RLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFG-GNLPSR 627 L+ GE +G + H H G+ GFG A++ + FG G+ PS Sbjct: 1157 PLRWGEHMG--DFHGHFHLGEPVGFGAHSRHARIREIGGPGSFDS------FGRGDGPSF 1208 Query: 626 ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLD 447 E GF S F HG+ G F + +FD SRKRKL +MGWCRICKVDCETVEGL+ Sbjct: 1209 PHLGEPGFRSRFSSHGFPTGDGIFT--EDLAFDKSRKRKLPTMGWCRICKVDCETVEGLE 1266 Query: 446 MHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFE--SRDN 288 +HSQTREHQKMAMDMV++IK+ N KKQK++ D S D S+ AG E +DN Sbjct: 1267 LHSQTREHQKMAMDMVVAIKQ-NAKKQKLTFGDQSSLGDASQPRSAGTEGHGKDN 1320 >ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II transcription subunit 12-like isoform X1 [Cicer arietinum] gi|502146144|ref|XP_004506323.1| PREDICTED: mediator of RNA polymerase II transcription subunit 12-like isoform X2 [Cicer arietinum] 
gi|502146146|ref|XP_004506324.1| PREDICTED: mediator of RNA polymerase II transcription subunit 12-like isoform X3 [Cicer arietinum] Length = 1283 Score = 210 bits (535), Expect = 1e-51 Identities = 154/399 (38%), Positives = 197/399 (49%), Gaps = 14/399 (3%) Frame = -1 Query: 1442 EGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSRPDRAS 1263 EGF + +RFK F +H I RREFE DLK+FPR D+E PKF +Y P Sbjct: 936 EGFGV-QDERFKSF-----QHNIDRREFENDLKKFPRHP-FDAEPGPKFGNYQLGP---- 984 Query: 1262 HGFNHDVGLKLDG-NDNAPRLLPPYQPGS-LRPLDLCDDNMDRRVDIAAGVPPDFLRSAS 1089 H+ G + G +D+A + +PGS L P L G P + Sbjct: 985 ----HETGKRPVGYHDDAIK-----KPGSTLHPGHL-------------GPGPGY----- 1017 Query: 1088 GRNRIDGFPLRSPGREYPSHPSSRFRRL----------EDSDGRELHVFSEQSKSFNLPS 939 G + +DG RSPG EY PS R L +D DGR + + S Sbjct: 1018 GIHHMDGIAPRSPGSEYIDMPSRRSGPLSGGLVSKSGIDDFDGRTASRYGD--------S 1069 Query: 938 EGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGEPIGHRNL 765 G AF + RFP PSHL + DG G+ + R G+ IG + G Q GE +G N Sbjct: 1070 VGIAFRDGRFPHQPSHLHRDAFDGFGNFRMGEHPRRGNFIGRDEFSGHFQRGEHLGPHNF 1129 Query: 764 PNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPI 585 P HL G+ FG + + GN P + E GF S F + Sbjct: 1130 PRHLQLGERISFGDHPGHMRAFELGSSRSFESFSK-----GNRPGHPQLGEPGFRSSFSL 1184 Query: 584 HGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMD 405 G+ ND GF GD+ SFD R+RK SMGWCRICKVDCETVEGL++HSQTREHQKMA+D Sbjct: 1185 AGFNNDAGFLT-GDIRSFDNLRRRKAASMGWCRICKVDCETVEGLELHSQTREHQKMAVD 1243 Query: 404 MVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288 +V +IK+ N KKQK+ + S EDG ++ GFE N Sbjct: 1244 IVKTIKQ-NAKKQKLIPSEQSSVEDGKQTWGTGFEGHGN 1281 >ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249008 [Solanum lycopersicum] Length = 1353 Score = 207 bits (527), Expect = 1e-50 Identities = 148/356 (41%), Positives = 179/356 (50%), Gaps = 26/356 (7%) Frame = -1 Query: 1283 SRP-DRASHGFNHDVGLKLDGNDNAP--RLLPPYQP-GSLR---------PLDLCDDNMD 1143 SRP D+ HG +D G K + + P RLLPP+ P GS+ PL DD+ Sbjct: 1009 
SRPHDKPPHGLGYDSGSKFEASTGVPPNRLLPPHHPPGSMHFKDSGEREAPLGPHDDDRK 1068 Query: 1142 RRVDIAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRLEDSDGRELHVFSEQ 963 R +G L S RN DG P R + SH +D+ GRE F E Sbjct: 1069 RG---GSGFGVHHLDYLSARNP-DGELFNIPQRGFVSHSG-----FDDTGGREPRQFIEG 1119 Query: 962 SKSFNLPSE--GNAFHENRFPILPSHLRKGESDGSGSLPA-----------RLRGGDLIG 822 FNLPS G + +RF LP H E+DG G L ++ GDL G Sbjct: 1120 PGHFNLPSNLAGGLYSNSRFQALPGHPHGVETDGLGDLRGGEHTTFGRPYKHVQSGDLFG 1179 Query: 821 SNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG 642 ++P L E + LP+HL K GFG F RA + + G Sbjct: 1180 KDMP-SHLHHDESLDPPKLPSHLRFDKPGGFGSFAGRAYMGELSGFGDIPGFDESV--GR 1236 Query: 641 NLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCET 462 N P + E GF S +P+ GY N G + AGDV+SFD RKRK SMGWCRICKVDCET Sbjct: 1237 NKPGMPQFGEPGFRSRYPVPGYPNHGLY--AGDVDSFDRPRKRKPTSMGWCRICKVDCET 1294 Query: 461 VEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294 VEGLDMHSQTREHQ MAMDMV SIK+ N KQK S D S E+ ++ KA FESR Sbjct: 1295 VEGLDMHSQTREHQDMAMDMVRSIKEQNRMKQKTFS-DRPSVEEKGRTRKAVFESR 1349