BLASTX nr result
ID: Rauwolfia21_contig00003333
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00003333
         (1880 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   265   5e-68
ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   263   1e-67
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   263   2e-67
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...   260   1e-66
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   186   4e-44
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   186   4e-44
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     174   9e-41
ref|XP_002327318.1| predicted protein [Populus trichocarpa] gi|5...   170   2e-39
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...   168   9e-39
ref|XP_006376346.1| hypothetical protein POPTR_0013s12230g [Popu...   167   2e-38
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...   166   4e-38
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...   164   1e-37
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   150   1e-33
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...   145   8e-32
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...   143   3e-31
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   142   4e-31
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   142   7e-31
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...
137 2e-29 ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Popu... 134 1e-28 gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca... 134 1e-28 >ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum lycopersicum] Length = 421 Score = 265 bits (677), Expect = 5e-68 Identities = 170/423 (40%), Positives = 244/423 (57%), Gaps = 28/423 (6%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLKG+AW+G+IY KFEAMCLEME+ MYQDT +YVENQVQTVGASVK+FYS+V+ DL P+ Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQKSSSK---KDPCENTEKLTDDFKVISGKSKT-GAY 1305 ++D VKVAAADL+LNPYAH E+ +K ++ P ++L DD +VI GKSK+ G Y Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAQLKGGHPRVINKELIDDTQVIKGKSKSGGVY 120 Query: 1304 KRPVARRRGHSSANYCPAVLG-LTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAV 1128 +R + N+ P+ +SGN +SS S+ RG EVA + +T A+V Sbjct: 121 RRQSVGMKEIVRDNHPPSKKSDALCLVSGNTIKLSSDSKVRGGFEVASDHMTMTSPLASV 180 Query: 1127 EG-NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAE------SGDNSY 969 +G S E +++ N +I VP +G S++ ++T L G+ QA+ GD Sbjct: 181 KGLKSTETGKEVSNHIIKTEVPAAGISINIAASDTSLSVDCVGQNQADLRNTFSVGDLQS 240 Query: 968 SSCLATGTP-----------AAGSYTNRVVVSETDRDIKADAGL---SISGEEDVMVSHK 831 S + GT ++ + N + E + K + +I+GEE + S K Sbjct: 241 DSHVDRGTRKELAGDTGLKISSNTGDNNIASKEVNNIAKISSNTDDNNIAGEE-IKESCK 299 Query: 830 ERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKHKSYKKK 651 R D+ + ++I+ + EI+ + +E L ETCVLVE + LH V G+ K KSYKKK Sbjct: 300 ARSDKSCSPPPDKYDLIESDVEIVERYDEPKLEETCVLVEAEKLH-VPQGSVKRKSYKKK 358 Query: 650 LREAFSTKKRSTRKEYEKLATQYKEQNSKQESAERLMPAIGDSS--AKISPGHTLPDSDW 477 LR+ FS KK+STR EYE+L Y +Q + E+ M + +S K+S +S+W Sbjct: 359 LRQVFSMKKKSTRTEYEQLGALYGDQQPNLQPEEKQMQVLSKNSNPKKLSSADDHSESEW 418 Query: 476 ELL 468 ELL Sbjct: 419 ELL 421 >ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum tuberosum] Length = 420 Score = 263 bits (673), Expect = 1e-67 Identities = 
172/421 (40%), Positives = 244/421 (57%), Gaps = 26/421 (6%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLKG+AW+G+IY KFEAMCLEME+ MYQDT +YVENQVQTVGASVK+FYS+V+ DL P+ Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQKSSSK---KDPCENTEKLTDDFKVISGKSKT-GAY 1305 ++D VKVAAADL+LNPYAH E+ +K +K P ++L DD +VI GKSK+ G Y Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVY 120 Query: 1304 KRPVARRRGHSSANYCPAVLG-LTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAV 1128 +R + N+ P+ +SGN +SS S+ RG EVA + +T A+V Sbjct: 121 RRQSVGIKEIVRDNHPPSKKSDALCLVSGNAIKLSSDSKVRGGFEVASDHMTMTSPLASV 180 Query: 1127 EG-NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLL-------------TLSSGRKQA 990 +G +S E +++ N +I V +G S++ ++ L T S G Q+ Sbjct: 181 KGRSSAETGKEVSNHIIKTDVSAAGISINVAASDRSLSVDCVGQNQADLRNTSSVGDLQS 240 Query: 989 ESGDNSYSSCLA--TGTPAAGSYTNRVVVSETDRD---IKADAGLSISGEEDVMVSHKER 825 +S D LA TG + + + + SE + I ++ G + E++ S KER Sbjct: 241 DSHDRGTCKELAGDTGLKISSNTGDNNIASEEINNIAKISSNTGDNNITGEEINESCKER 300 Query: 824 LDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKHKSYKKKLR 645 D+ E ++I+ + EI+ +ES L ETCVLVE + LH V + K KSYKKKLR Sbjct: 301 SDKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLR 359 Query: 644 EAFSTKKRSTRKEYEKLATQYKEQNSKQESAERLMPAIGDSS--AKISPGHTLPDSDWEL 471 + FS KK+STRKEYE+L + +Q E E+ M + +S K+S +S+WEL Sbjct: 360 QVFSMKKKSTRKEYEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEWEL 419 Query: 470 L 468 L Sbjct: 420 L 420 >ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED: uncharacterized protein LOC102601397 isoform X2 [Solanum tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED: uncharacterized protein LOC102601397 isoform X3 [Solanum tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED: uncharacterized protein LOC102601397 isoform X4 [Solanum tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED: 
uncharacterized protein LOC102601397 isoform X5 [Solanum tuberosum] Length = 421 Score = 263 bits (672), Expect = 2e-67 Identities = 171/431 (39%), Positives = 243/431 (56%), Gaps = 36/431 (8%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLKG+AW+G+IY KFEAMCLEME+ MYQDT +YVENQVQTVGASVK+FYS+V+ DL P+ Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQKSSSK---KDPCENTEKLTDDFKVISGKSKT-GAY 1305 ++D VKVAAADL+LNPYAH E+ +K +K P ++L DD +VI GKSK+ G Y Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVY 120 Query: 1304 KRPVARRRGHSSANYCPAVLG-LTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAV 1128 +R + N+ P+ +SGN +SS S+ RG EVA + +T A+V Sbjct: 121 RRQSVGIKEIVRDNHPPSKKSDALCLVSGNAIKLSSDSKVRGGFEVASDHMTMTSPLASV 180 Query: 1127 EG-NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLAT 951 +G +S E +++ N +I V +G S++ ++ L G+ QA+ + S + Sbjct: 181 KGRSSAETGKEVSNHIIKTDVSAAGISINVAASDRSLSVDCVGQNQADLRNTS-----SV 235 Query: 950 GTPAAGSYTNRVVVSETDRDIKADAGLSISGE---------------------------- 855 G + S+ +R T +++ D GL IS Sbjct: 236 GDLQSDSHADR----GTCKELAGDTGLKISSNTGDNNIASEEINNIAKISSNTGDNNITG 291 Query: 854 EDVMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGND 675 E++ S KER D+ E ++I+ + EI+ +ES L ETCVLVE + LH V + Sbjct: 292 EEINESCKERSDKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESV 350 Query: 674 KHKSYKKKLREAFSTKKRSTRKEYEKLATQYKEQNSKQESAERLMPAIGDSS--AKISPG 501 K KSYKKKLR+ FS KK+STRKEYE+L + +Q E E+ M + +S K+S Sbjct: 351 KQKSYKKKLRQVFSMKKKSTRKEYEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSA 410 Query: 500 HTLPDSDWELL 468 +S+WELL Sbjct: 411 DDHSESEWELL 421 >ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum lycopersicum] Length = 374 Score = 260 bits (665), Expect = 1e-66 Identities = 166/400 (41%), Positives = 234/400 (58%), Gaps = 5/400 (1%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLK ++W+GNIY KFE MCLEMEE MYQDTVKYVENQ+ 
TVG +VK+F SEVMQD+ P+ Sbjct: 1 MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQ--KSSSKKDPCENTEKLTDDFKVISGKSKT-GAYK 1302 ++D VKVAAADL+LNPYAH E+ + K++ K + KL DD +VI GKSK+ G YK Sbjct: 61 CNIDPVKVAAADLSLNPYAHYEIDKKLKANLKGSARGFSNKLNDDTQVIKGKSKSGGVYK 120 Query: 1301 RPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAVEG 1122 R + ++ SG+ +SS ++ RG E+A + +T + A+V+G Sbjct: 121 RQNVGIKEIVRDSHLTKKPNAICLASGDALKLSSSAEVRGGFELASDHVTLTSALASVKG 180 Query: 1121 -NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLATGT 945 +S E K+ N VI V T+ S+ + S G+KQ ++ LA T Sbjct: 181 SDSGEVASKVSNHVIQTNVSTADTSIT-SEASVMMSVESVGKKQTDTCTKE----LACNT 235 Query: 944 PAAGSYTNRVVVSETDRDIKADAGLSISGEEDVMVSHKERLDECSRDAAENDNIIDQEAE 765 +T D++ + E++ SH+E+ D + + I+ + E Sbjct: 236 R-----------FKTSSDVRNNL-----ANEEIDESHEEKSD----NLLSKYDSIESDLE 275 Query: 764 IIGKVNESMLAETCVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRKEYEKLATQ 585 I+ K +E L ETCVLVEED +H V G K KSYKKKLR+AFSTKKR TRKEYE+L Sbjct: 276 IVEKFDEFQLNETCVLVEEDRIH-VPQGPVKQKSYKKKLRDAFSTKKRLTRKEYEQLGAL 334 Query: 584 YKEQNSKQESAERLMPAIG-DSSAKISPGHTLPDSDWELL 468 Y +Q K ES +++MP + +S+ K+ + P+S+WE+L Sbjct: 335 YGDQQIKVESEDKVMPVLAMNSNTKMLSANDHPESEWEIL 374 >ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED: uncharacterized protein LOC102611541 isoform X2 [Citrus sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED: uncharacterized protein LOC102611541 isoform X3 [Citrus sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED: uncharacterized protein LOC102611541 isoform X4 [Citrus sinensis] Length = 416 Score = 186 bits (471), Expect = 4e-44 Identities = 153/447 (34%), Positives = 220/447 (49%), Gaps = 52/447 (11%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLKG+ W+G++Y KFEAMCLE+EE+MYQDTVKYVENQVQTVG++VKKFYS+V++DL P Sbjct: 1 MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 
Query: 1472 SHVDLVKVA-AADLTLNPYAHVEMIQKS--SSKKDPCE-NTEKLTD-------------- 1347 VDLVK A A++L L A V + +K K++ + N E+L++ Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMKVNNEQLSESSLATTDLDKGAGG 120 Query: 1346 ------------DFKVISGKSKTG----AYKRPVARRRGHSSANYCPAVLGL-----TAS 1230 F+ G + G AY + R GH+ ++ C + + Sbjct: 121 GQSFCRFHIEDTSFQPSLGNTLKGVFSDAYPKEYDIRSGHNQSSICMQKISKEDNLPPSE 180 Query: 1229 MSGNVTSMSSFSQRRGS--------HEVACGSLDVTLSSAAVEGNSEEAKEKICNGVIDK 1074 MSG M +R S EV+ + V +S E S ++ E+I + + Sbjct: 181 MSGAGPHMERGLRRASSSCELLDKIQEVSDDQVVVDPTSVTTEVASCKSFEEIYDELEKA 240 Query: 1073 GVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDR 894 SGA LT S K + ++++SSC + G TN VVS Sbjct: 241 SKGASGA-----------LTSSPAAKNCDESESAHSSCSSLSAELNGICTNDGVVSLVGS 289 Query: 893 DIKADAGLSISGEEDVMVSH---KERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETC 723 + EDV S R D + DA E++ ++Q E + +V+ + ETC Sbjct: 290 FV----------NEDVQPSEFPDPGRSDYSTVDATESNIDVEQGYETVQRVDNIQVEETC 339 Query: 722 VLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRK-EYEKLATQYKE-QNSKQESAE 549 VLV D L FV DKH+ KKK+++A S++ RSTRK EY++LA Y E + SKQ++AE Sbjct: 340 VLVNGDELCFVPCREDKHRPCKKKIQDAISSRMRSTRKHEYKQLAVWYNEDEKSKQQNAE 399 Query: 548 RLMPAIGDSSAKISPGHTLPDSDWELL 468 K P H + +WELL Sbjct: 400 ----------TKGKPSHGYCELEWELL 416 >ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|567908905|ref|XP_006446766.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549376|gb|ESR60005.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549377|gb|ESR60006.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] Length = 416 Score = 186 bits (471), Expect = 4e-44 Identities = 152/447 (34%), Positives = 217/447 (48%), Gaps = 52/447 (11%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLKG+ W+G++Y KFEAMCLE+EE+MYQDTVKYVENQVQTVG++VKKFYS+V++DL P Sbjct: 1 MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 Query: 1472 
SHVDLVKVA-AADLTLNPYAHVEMIQKSS---SKKDPCENTEKLTD-------------- 1347 VDLVK A A++L L A V + +K ++ N E+L++ Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGG 120 Query: 1346 ------------DFKVISGKSKTG----AYKRPVARRRGHSSANYCPAVLGL-----TAS 1230 F+ G + G AY + R GH+ ++ C + + Sbjct: 121 GQSFCRFHIEDTSFQPSLGDTLKGVFSDAYSKEYDIRSGHNQSSICMQKISKEDNLPPSE 180 Query: 1229 MSGNVTSMSSFSQRRGS--------HEVACGSLDVTLSSAAVEGNSEEAKEKICNGVIDK 1074 MSG M +R S EV+ + V + E S ++ E+I + + Sbjct: 181 MSGAGPHMERGLRRASSSCELLDKIQEVSDDQVVVDPTPVTTEVASCKSFEEIYDELEKA 240 Query: 1073 GVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDR 894 SGA LT S K + +N++SSC + G TN VVS Sbjct: 241 SKGASGA-----------LTSSPAAKNCDESENAHSSCSSLSAELNGICTNDGVVSLVGS 289 Query: 893 DIKADAGLSISGEEDVMVSH---KERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETC 723 + EDV S R D + DA E++ ++Q E + +V+ + ETC Sbjct: 290 FV----------NEDVQPSEFPDPGRSDYSTVDATESNIDVEQGYETVQRVDNIQVEETC 339 Query: 722 VLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRK-EYEKLATQYKE-QNSKQESAE 549 VLV D L FV KH+ YKKK+++A S++ RSTRK EY++LA Y E + SKQ++AE Sbjct: 340 VLVNGDELCFVPCREGKHRPYKKKIQDAISSRMRSTRKHEYKQLAVWYNEDEKSKQQNAE 399 Query: 548 RLMPAIGDSSAKISPGHTLPDSDWELL 468 K P H + +WELL Sbjct: 400 ----------MKGKPSHGYCELEWELL 416 >gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis] Length = 443 Score = 174 bits (442), Expect = 9e-41 Identities = 140/448 (31%), Positives = 208/448 (46%), Gaps = 53/448 (11%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MD+KG+ W+GN+Y KFEAMCLE+EE+MYQDTVKYVENQVQTVGASVK+FYS+VMQDL P Sbjct: 1 MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60 Query: 1472 SHVDLVKVAA--------ADLTLNPYAHVEMIQKSSSKKD-PCENTEKLTDDFKVI---- 1332 S D KV+ +D ++ +V +K + D T K+T D K + Sbjct: 61 SSQDSEKVSLCGFIGKQDSDDGISKKPNVAKKEKPAKADDEQLIRTLKVTSDSKDVYLAP 120 Query: 1331 --------------SGKSKTGAYKRPVARRR-----GHSSANY---------------CP 1254 SG+ GA +R++ HSS+N Sbjct: 121 
SIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLIPPETS 180 Query: 1253 AVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAVEGNSEEAKEKICNGVIDK 1074 + +S ++S S F HE++ T + + E S ++ + C+ + + Sbjct: 181 CAITREKHLSRPLSSYSEFVNE--IHEISLDQTGTTKAPSVNEDTSSDSIVESCDEIENS 238 Query: 1073 GVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRV----VVS 906 + S F + +L S G + + S A G YT++ + S Sbjct: 239 SECMADLSSSFHASSEIILVKSVG---YDGNEMDVPSGGGLSEQANGDYTSKCSSNSLAS 295 Query: 905 ETDRDIKADAGLSISGEEDVMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAET 726 +A +EDV VS + D+ + + E++ + E I + ++ L ET Sbjct: 296 TGGSSQNEEARNDKYADEDVFVSLPRKFDDWNLNITESEIATEHGTETIQQRDKVKLEET 355 Query: 725 CVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRK-EYEKLATQYKEQNS-KQESA 552 CVLV ED LH + K + YKKK+R+A ++ RS RK EYE+L QY + Q+ Sbjct: 356 CVLVNEDELHILPQRGGKWRPYKKKIRDALYSRMRSARKEEYEQLVLQYGDNKKLNQDFG 415 Query: 551 ERLMPAIGDSSAKISPGHTLPDSDWELL 468 E L P + K P +S+WELL Sbjct: 416 EALAPTLIVKERKKLPHLDSCESEWELL 443 >ref|XP_002327318.1| predicted protein [Populus trichocarpa] gi|566200863|ref|XP_006376347.1| hypothetical protein POPTR_0013s12230g [Populus trichocarpa] gi|550325623|gb|ERP54144.1| hypothetical protein POPTR_0013s12230g [Populus trichocarpa] Length = 418 Score = 170 bits (431), Expect = 2e-39 Identities = 142/437 (32%), Positives = 216/437 (49%), Gaps = 42/437 (9%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDL-DP 1476 MDLKG+ W+G+ Y KFEA LE+EE+M ++ VKYVENQ+QTV +V+KFYS+VMQDL P Sbjct: 1 MDLKGITWVGDFYQKFEARLLEVEEIMCEEAVKYVENQMQTVSGNVRKFYSDVMQDLCSP 60 Query: 1475 ESHVDL--------VKVAAADLTLNPYAHVEMIQKSSSKKDPCENTEKLTDDFKVISGKS 1320 +S V V + AAD+ ++ ++ K+ CE DD ++++G S Sbjct: 61 DSEVPANGAVSKLPVDLGAADVGVH-------LKPDDGAKETCEK----ADDLRLLTGYS 109 Query: 1319 KT----GAYKRPVARR-------RGHSSA-----------------NYCP-AVLGLTASM 1227 K G + PV R R HS N P G+T Sbjct: 110 KMTTDHGPDRLPVRERISIRRISRQHSKGSLSNKSNLDMHGNSNCKNVSPKETSGITTPS 169 Query: 1226 SGNVTSMSSFSQRRGSH-EVACGSLDVTLSSAAVEGNSEEAKEKICNGVIDKGVPTSGAS 1050 S ++ S+ S+ + E 
+C ++ +VE + EK + + S Sbjct: 170 SKHLIGYSTISEHSDQNLEASCDWNARLITPGSVEVTEHFSIEKSKKEIENTREHMLDIS 229 Query: 1049 VDFPRTETKLLTLSSGRKQAESGDNSYSSC-LATGTPAAGSYTNRVVVSETDRDIKADAG 873 P + +T +GR E D SS L + AAG N +VS TD + Sbjct: 230 FYKPSLDMGNIT-ETGRH--EGTDRRPSSINLLEESNAAGVCLNNGLVSMTDFYANGNMQ 286 Query: 872 LSISGEEDVMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHF 693 + E+ VS+ DE D+ ++ +I+++ EII +V+++ L ETCVL+ D L Sbjct: 287 TNKFAYEEDFVSNS---DEWGIDSDKDGTLIEEDMEIIQQVDKAQLEETCVLMNGDELDA 343 Query: 692 VHNGNDKHKSYKKKLREAFSTKKRSTRKEYEKLATQYKE--QNSKQESAERLMPAIGDSS 519 G K+K YKKK+R+ FS++KRS RKEYE+LA Q++ +++++ES LM Sbjct: 344 SREG--KNKPYKKKIRDVFSSRKRSVRKEYEQLAVQFRSDPKSNQEESKTSLMATPSIKE 401 Query: 518 AKISPGHTLPDSDWELL 468 AK S H +S+WEL+ Sbjct: 402 AKRSSSHDPSESEWELV 418 >gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700922|gb|EOX92818.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 397 Score = 168 bits (425), Expect = 9e-39 Identities = 131/429 (30%), Positives = 205/429 (47%), Gaps = 33/429 (7%) Frame = -1 Query: 1655 TMDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQD--- 1485 +MDLKG+ W+G++Y KFEAMCLE+EEVMYQDTVKYVEN+VQTVGASVKKFYS +MQD Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1484 --LDPESHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDPCE-NTEKLTDDFKVISGKSKT 1314 L P S + VAA+DL + YA K+D + ++E+LT+D +VI+ ++ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIADVNEN 122 Query: 1313 GAYKRPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSA 1134 A+ S+ V + S SG+ +S G H C TL+ Sbjct: 123 AAHV---------PSSCQLHMVDNIFESCSGSFVERASSDLLSGEHNNRC-----TLNKT 168 Query: 1133 AVE-------------------------GNSEEAKEKICNGVIDKGVPTSGASVDFPRTE 1029 VE GN+ E C+ + P S D Sbjct: 169 NVEHLLPAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPATLTPVSVEEDDCD--- 225 Query: 1028 TKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEED 849 ++ + +S +S L G G +V + + +++ + + S E + Sbjct: 226 ----SIEESSNEIKSASDSVPEILPDGLHLVG------IVEKNEMEMRCSSSIIESEESN 
275 Query: 848 VMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKH 669 ++ ++DA+ + + +E E + ++++ + E+C +V LHF KH Sbjct: 276 GKLN-------WTKDASGSSTVGRKEIETVQQLDKIRVDESCFMVNGAELHFHPQREGKH 328 Query: 668 KSYKKKLREAFSTKKRSTR-KEYEKLATQYKEQ-NSKQESAERLMPAIGDSSAKISPGHT 495 K+Y++K+R+A S++ RS R KEYE+L Y + S Q+S A+ + + H Sbjct: 329 KTYQRKIRDAISSRMRSARKKEYEQLPLWYGDDVKSDQDSEGSSTSALTREDTRRTLNHD 388 Query: 494 LPDSDWELL 468 DS+WELL Sbjct: 389 DLDSEWELL 397 >ref|XP_006376346.1| hypothetical protein POPTR_0013s12230g [Populus trichocarpa] gi|550325622|gb|ERP54143.1| hypothetical protein POPTR_0013s12230g [Populus trichocarpa] Length = 416 Score = 167 bits (422), Expect = 2e-38 Identities = 137/436 (31%), Positives = 214/436 (49%), Gaps = 41/436 (9%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDL-DP 1476 MDLKG+ W+G+ Y KFEA LE+EE+M ++ VKYVENQ+QTV +V+KFYS+VMQDL P Sbjct: 1 MDLKGITWVGDFYQKFEARLLEVEEIMCEEAVKYVENQMQTVSGNVRKFYSDVMQDLCSP 60 Query: 1475 ESHVDL--------VKVAAADLTLNPYAHVEMIQKSSSKKDPCENTEKLTDDFKVISGKS 1320 +S V V + AAD+ ++ ++ K+ CE DD ++++G S Sbjct: 61 DSEVPANGAVSKLPVDLGAADVGVH-------LKPDDGAKETCEK----ADDLRLLTGYS 109 Query: 1319 KT----GAYKRPVARR-------RGHSSA-----------------NYCP-AVLGLTASM 1227 K G + PV R R HS N P G+T Sbjct: 110 KMTTDHGPDRLPVRERISIRRISRQHSKGSLSNKSNLDMHGNSNCKNVSPKETSGITTPS 169 Query: 1226 SGNVTSMSSFSQRRGSH-EVACGSLDVTLSSAAVEGNSEEAKEKICNGVIDKGVPTSGAS 1050 S ++ S+ S+ + E +C ++ +VE + EK + + S Sbjct: 170 SKHLIGYSTISEHSDQNLEASCDWNARLITPGSVEVTEHFSIEKSKKEIENTREHMLDIS 229 Query: 1049 VDFPRTETKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGL 870 P + +T +GR + G + S + + G N +VS TD + Sbjct: 230 FYKPSLDMGNIT-ETGRHE---GTDRRPSSINLLEESNGVCLNNGLVSMTDFYANGNMQT 285 Query: 869 SISGEEDVMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFV 690 + E+ VS+ DE D+ ++ +I+++ EII +V+++ L ETCVL+ D L Sbjct: 286 NKFAYEEDFVSNS---DEWGIDSDKDGTLIEEDMEIIQQVDKAQLEETCVLMNGDELDAS 342 Query: 689 HNGNDKHKSYKKKLREAFSTKKRSTRKEYEKLATQYKE--QNSKQESAERLMPAIGDSSA 516 G K+K 
YKKK+R+ FS++KRS RKEYE+LA Q++ +++++ES LM A Sbjct: 343 REG--KNKPYKKKIRDVFSSRKRSVRKEYEQLAVQFRSDPKSNQEESKTSLMATPSIKEA 400 Query: 515 KISPGHTLPDSDWELL 468 K S H +S+WEL+ Sbjct: 401 KRSSSHDPSESEWELV 416 >ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum tuberosum] Length = 260 Score = 166 bits (419), Expect = 4e-38 Identities = 99/224 (44%), Positives = 140/224 (62%), Gaps = 4/224 (1%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MDLK ++W+GNIY KFE MCLEMEE MYQDTVKYVENQV TVG +VK+F SEVMQD+ P+ Sbjct: 1 MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQ--KSSSKKDPCENTEKLTDDFKVISGKSKT-GAYK 1302 ++D VKVAAADL++NPYAH E+ + K++ K + KL DD +VI GKSK+ G YK Sbjct: 61 CNIDPVKVAAADLSINPYAHYEIDKKLKANLKGSARRFSNKLNDDTQVIKGKSKSGGVYK 120 Query: 1301 RPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAVEG 1122 R + ++ SG+ +SS ++ RG E+A + +T + A+V+G Sbjct: 121 RQNVGIKEIVRDSHPAKKPNAICLASGDALKLSSSAEVRGGFEMASDHVTLTSALASVKG 180 Query: 1121 -NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQ 993 +S EA K+ + I V + S+ + T +++ S RK+ Sbjct: 181 SDSGEAASKVRDHFIQTNVSAADTSITSEASVT--MSVESVRKK 222 >ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera] gi|302143402|emb|CBI21963.3| unnamed protein product [Vitis vinifera] Length = 451 Score = 164 bits (415), Expect = 1e-37 Identities = 144/466 (30%), Positives = 217/466 (46%), Gaps = 71/466 (15%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVEN-------QVQTVGASVKKFYSEV 1494 MD KG+ W+GN+Y KFE +CLE+E++MYQDTVKY EN QV+TVG SVKKF SE+ Sbjct: 1 MDFKGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEI 60 Query: 1493 MQDLDPESHVDLVKVAAADLTLNPYAHVEMIQKSSS----------KKDP---------- 1374 +QDL D ++V ++L+L+ + +V++ +K K++P Sbjct: 61 VQDL---LLPDSLEVTDSNLSLDQHDNVKLCKKPKVGIKEEAKVGFKEEPKVSIKEEFIK 117 Query: 1373 ------CENTE--KLTDD----------------FKVISGKSKTGAYKR----------- 1299 E++E L +D F+ SG S TGA Sbjct: 118 
FDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLFQSYSGNSVTGACSDLHLVQNDDGVM 177 Query: 1298 -----PVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSA 1134 +R + + V G+ A +SG+V+ + S ++ C + +T S A Sbjct: 178 CKNLDAGIKRNPVKVSQFPIEVSGVIAPISGDVSRLPSSLNENCENK--CNQMAITSSPA 235 Query: 1133 AVEGNSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAESGDNSYSSCLA 954 +VE + ICN + D + SVD P GR+ S SS L Sbjct: 236 SVEITDCNLEGAICNEIAD----VTAISVDLPSVPLVESVGKEGREMVFSSRGGLSSELN 291 Query: 953 TGTPAAGSYTNRVVVSETDRDIKADAGLSISGEEDVMVSHKERLDECSRDAAENDNIIDQ 774 G + ++ S RDI+ + + E+ ++SH E D + DA E +++I+Q Sbjct: 292 AGNIPMDNGVGSLIGSF--RDIQQNE----TAEKKDLLSHSEGSDGWNIDAIEINDVIEQ 345 Query: 773 EAEIIGKVNESM-LAETCVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRKEYEK 597 E + + M L + CV+V+ D LH V + K KKKLR AF +K+R RKEYE+ Sbjct: 346 GIETTKDLLDKMKLEDACVMVDGDELHVVSHREGKVWLVKKKLRNAFYSKRRLARKEYER 405 Query: 596 LATQYK--EQNSKQESAERLMPAIG-DSSAKISPGHTLPDSDWELL 468 LA ++ + S Q AE L P+ DS + SP S+WELL Sbjct: 406 LAVWHRVIDSESNQPGAEGLTPSPSTDSDKRTSPDDDFCQSEWELL 451 >ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca subsp. 
vesca] Length = 389 Score = 150 bits (380), Expect = 1e-33 Identities = 128/412 (31%), Positives = 190/412 (46%), Gaps = 16/412 (3%) Frame = -1 Query: 1655 TMDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDP 1476 TMD+KG+ W+G +Y KFE+MCLE+EE MY+DTVK+VE+QVQTVG SVKKFY++VMQDL Sbjct: 3 TMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLC 62 Query: 1475 ESHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDP--CENTEKLTDDFKVISGKSK----T 1314 +S +D V+A + Y+ V+ + KK E++ D +VIS K T Sbjct: 63 DSSLDRDDVSAGGFPVEHYSDVDNSKSKIRKKKEHVKAGVEEVKGDSEVISAVLKDVDHT 122 Query: 1313 GAYKRPVARRRGHSSANYCPAVL------GLTASMSGNVTSMSSFSQRRGSHEVACGSLD 1152 G + R S+ C + G+ + V + R A G Sbjct: 123 GLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIKDRLPGANTAVGKDF 182 Query: 1151 VTLSSAAVEGNSEEAKEKIC---NGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAESG 981 S ++ S E ++ C + VI P G D +E+ ++ +S + Sbjct: 183 SRESLSSCSEFSNEDRDTSCDQPDEVITPSKPPEGMRCD-SMSESCVVANASQCTGDDVS 241 Query: 980 DNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEEDVMVSHKERLDECSRDA 801 N SS + + G N ++ S G SI+ D + S Sbjct: 242 VNCQSSDMIVLDNSDGKRWNELLDSSIGGLSTELNGGSINPSMDAIES------------ 289 Query: 800 AENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKR 621 NI EII + ++ L ETCV+V + LHFVH+ +K YKKK+ +AF+++ Sbjct: 290 ----NIGTHGTEIIQQSDKPKLEETCVMVSGEDLHFVHHTVANYKPYKKKIPKAFTSRTS 345 Query: 620 STRK-EYEKLATQYKEQNSKQESAERLMPAIGDSSAKISPGHTLPDSDWELL 468 S RK EYE+LA + G +K SP H +S+WE+L Sbjct: 346 SARKQEYEQLALWHGHHTKSILE--------GGEESKKSPTHDFCESEWEIL 389 >gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 343 Score = 145 bits (365), Expect = 8e-32 Identities = 111/374 (29%), Positives = 178/374 (47%), Gaps = 31/374 (8%) Frame = -1 Query: 1655 TMDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQD--- 1485 +MDLKG+ W+G++Y KFEAMCLE+EEVMYQDTVKYVEN+VQTVGASVKKFYS +MQD Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1484 --LDPESHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDPCE-NTEKLTDDFKVISGKSKT 1314 L P S + VAA+DL + YA K+D + 
++E+LT+D +VI+ ++ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIADVNEN 122 Query: 1313 GAYKRPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSA 1134 A+ S+ V + S SG+ +S G H C TL+ Sbjct: 123 AAHV---------PSSCQLHMVDNIFESCSGSFVERASSDLLSGEHNNRC-----TLNKT 168 Query: 1133 AVE-------------------------GNSEEAKEKICNGVIDKGVPTSGASVDFPRTE 1029 VE GN+ E C+ + P S D Sbjct: 169 NVEHLLPAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPATLTPVSVEEDDCD--- 225 Query: 1028 TKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEED 849 ++ + +S +S L G G +V + + +++ + + S E + Sbjct: 226 ----SIEESSNEIKSASDSVPEILPDGLHLVG------IVEKNEMEMRCSSSIIESEESN 275 Query: 848 VMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKH 669 ++ ++DA+ + + +E E + ++++ + E+C +V LHF KH Sbjct: 276 GKLN-------WTKDASGSSTVGRKEIETVQQLDKIRVDESCFMVNGAELHFHPQREGKH 328 Query: 668 KSYKKKLREAFSTK 627 K+Y++K+R+A S++ Sbjct: 329 KTYQRKIRDAISSR 342 >ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus] Length = 379 Score = 143 bits (360), Expect = 3e-31 Identities = 125/411 (30%), Positives = 199/411 (48%), Gaps = 16/411 (3%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MD+KG+AW+G +Y KFE MCLE+E+++ QDTVKYVENQV+ VGASVK+FYS+VMQD P Sbjct: 1 MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQKSS--SKKDPCE-NTEKLTDDFKVISGKSKTGAYK 1302 S + KVA + L Y +V + +K + K + + + EK ++ KV + + A K Sbjct: 61 SELSDEKVAVCNSALENYENVVICKKPTMGMKIERSKFSEEKSNENSKVTADAKRDIACK 120 Query: 1301 RPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAVEG 1122 P RGH+ ANY L + + N + +S+++ + ++ Sbjct: 121 LP----RGHNHANY--LYLVSSPYSAANRAQIDGYSRKKDDENI----------HHKIDL 164 Query: 1121 NSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTLSSGRKQAESG-DNSYSSCLATGT 945 + E+ + C + + PT+ + + T+ + + +A S + + L T Sbjct: 165 DGRESTTRGCKSLTETS-PTN-LEKKYENDASSCCTILNRKSEASSELAGNMETMLVKDT 222 Query: 944 PAAGSYTNRVVVSETDRDIKADAGLSISGEEDVMVSHKE-----------RLDECSRDAA 798 
N V+ S + +IK D L + ++ + KE LD S + Sbjct: 223 RC-----NSVMQSANETEIKTDNILPDTPSSAIVDTEKETRLLSYGDSSAELDGRSDSWS 277 Query: 797 ENDNIIDQEAEIIGKVNESML-AETCVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKR 621 +D ++Q I + +E+ L E CVLV+ D LHF N K + Y KK+ AFS K+ Sbjct: 278 LDDIELEQGTHNIQQADETKLDEEACVLVKGDDLHFDFNEEVKQRHY-KKIAGAFSFTKK 336 Query: 620 STRKEYEKLATQYKEQNSKQESAERLMPAIGDSSAKISPGHTLPDSDWELL 468 S RK+ +YKE K +P D K++ L + DW+LL Sbjct: 337 SKRKQ------EYKELAMKHGYGFGTIPNQQDEQ-KLTAEDVL-EQDWQLL 379 >ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6 [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2| expressed protein [Arabidopsis thaliana] gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6 [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1| uncharacterized protein AT2G31130 [Arabidopsis thaliana] Length = 419 Score = 142 bits (359), Expect = 4e-31 Identities = 125/436 (28%), Positives = 204/436 (46%), Gaps = 41/436 (9%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MD KG+ W+GN+Y KFEAMCLE+EE++ QDT KYVENQVQTVG SVKKF S+V+ DL P+ Sbjct: 1 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDPC-ENTEKLTDDFKVISGK--------- 1323 VD K + L+ YA V +K KKD T+ +T + +V GK Sbjct: 61 ESVDSGKPLPVSM-LHEYAPVYSFKK---KKDSMNRKTKDVTQEQEVTEGKKDGFAKKLR 116 Query: 1322 ----------------SKTGAYKRPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQ 1191 S G Y+R R+ V + + ++TS+S Sbjct: 117 GLDADDYDICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQV--IRPYIQKDLTSLSMVHS 174 Query: 1190 RRGSHE---VACGSLDVTLSSAAVE--GNSEEAKEKICNGVIDKGVPTSGASVDFPRTET 1026 R + V SL + S+ + G + + + K + S D P E Sbjct: 175 ARVKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGTVKSSDSPPGEV 234 Query: 1025 KLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGL---SISGE 855 + L +K+ + D + + T + S + V+V + + + AD + + + Sbjct: 235 EKLI---SKKKCQKDDKAKNQQSLTVVNSVKSNDSEVIV-DNEHGLSADKSVRSQDLEIQ 290 Query: 854 
EDVMVSHKERLDECSRDA---AENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHN 684 + S D+C ++ + ++ + ++EI+ ++ + E+C+LV+ D H V Sbjct: 291 PSLATSLPAESDDCRKETNVETSSSSVSEPKSEILQHLSGRSVEESCILVDRDEFHSVFP 350 Query: 683 G---NDKHKSYKKKLREAFSTK-KRSTRKEYEKLATQYKEQNSKQESAERLMPAIGDSSA 516 NDKHK Y KK+R+A S++ K++ KEY++LA Q+ ++ + GD+ Sbjct: 351 DKMENDKHKPY-KKIRDAISSRMKQNREKEYKRLARQWYAEDVENGR------ECGDNPK 403 Query: 515 KISPGHTLPDSDWELL 468 I + +S+WELL Sbjct: 404 PIEENQSSEESEWELL 419 >ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] Length = 418 Score = 142 bits (357), Expect = 7e-31 Identities = 119/430 (27%), Positives = 198/430 (46%), Gaps = 35/430 (8%) Frame = -1 Query: 1652 MDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQDLDPE 1473 MD KG+ W+GN+Y KFEAMCLE+EE++ QDT KYVENQVQTVG SVKKF S+V+QDL P+ Sbjct: 1 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60 Query: 1472 SHVDLVKVAAADLTLNPYAHVEMIQK------------------SSSKKDPCENTEK--L 1353 VD K + L+ YA V +K + KKD C + Sbjct: 61 DSVDSGKPLPVSM-LHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKFRGLD 119 Query: 1352 TDDFKVISGK---SKTGAYKRP-VARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRR 1185 DD+ + + S G Y+R V R++ S +++ + S + Sbjct: 120 ADDYDICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRPYMQKDSSSLSMVHSARVKD 179 Query: 1184 GSHEVACGSLDVTLSSAAVE--GNSEEAKEKICNGVIDKGVPTSGASVDFPRTETKLLTL 1011 V SL + S+ + G + + + K + S D P E + L Sbjct: 180 DVGTVNSSSLSMVHSARVKDDVGTVNSSSLTMVHSARIKDDVGTVKSSDSPPGEVEKLIY 239 Query: 1010 SSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEEDVMVSHK 831 +K+ + D + + T + + + + + + + D+ + V S Sbjct: 240 ---KKECQKDDKTKNQQSLTVVNSVKRNDSEIRI-DNEHGLMGDSSQDSEIQPSVATSLA 295 Query: 830 ERLDECSRDA-----AENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNG---ND 675 D+C ++ + ++ +Q++EI+ ++ + E+C+LV+ D H V ND Sbjct: 296 AGSDDCRKETNVDTKTSSSSVSEQKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMEND 355 Query: 674 
KHKSYKKKLREAFSTK-KRSTRKEYEKLATQYKEQNSKQESAERLMPAIGDSSAKISPGH 498 KHK Y KK+R+A S++ K++ KEY++LA Q+ ++ + GD + Sbjct: 356 KHKPY-KKIRDAISSRMKQNREKEYKRLARQWYAEDVENGR------ECGDDPKPLEENQ 408 Query: 497 TLPDSDWELL 468 + +S+WELL Sbjct: 409 SPEESEWELL 418 >gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 341 Score = 137 bits (345), Expect = 2e-29 Identities = 110/374 (29%), Positives = 176/374 (47%), Gaps = 31/374 (8%) Frame = -1 Query: 1655 TMDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQD--- 1485 +MDLKG+ W+G++Y KFEAMCLE+EEVMYQDTVKYVEN+VQTVGASVKKFYS +MQD Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1484 --LDPESHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDPCE-NTEKLTDDFKVISGKSKT 1314 L P S + VAA+DL + YA K+D + ++E+LT+D +VI+ ++ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIADVNEN 122 Query: 1313 GAYKRPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSA 1134 A+ S+ V + S SG+ +S G H C TL+ Sbjct: 123 AAHV---------PSSCQLHMVDNIFESCSGSFVERASSDLLSGEHNNRC-----TLNKT 168 Query: 1133 AVE-------------------------GNSEEAKEKICNGVIDKGVPTSGASVDFPRTE 1029 VE GN+ E C+ + P S D Sbjct: 169 NVEHLLPAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPATLTPVSVEEDDCD--- 225 Query: 1028 TKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEED 849 ++ + +S +S L G G +V + + +++ + + S E + Sbjct: 226 ----SIEESSNEIKSASDSVPEILPDGLHLVG------IVEKNEMEMRCSSSIIESEESN 275 Query: 848 VMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKH 669 ++ ++DA+ + + +E E + ++++ + E+C +V LHF KH Sbjct: 276 GKLN-------WTKDASGSSTVGRKEIETVQQLDKIRVDESCFMVNGAELHFHPQREGKH 328 Query: 668 KSYKKKLREAFSTK 627 K+Y ++R+A S++ Sbjct: 329 KTY--QIRDAISSR 340 >ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Populus trichocarpa] gi|550317324|gb|EEE99961.2| hypothetical protein POPTR_0019s11960g [Populus trichocarpa] Length = 442 Score = 134 bits (338), Expect = 1e-28 Identities = 133/456 (29%), Positives = 203/456 (44%), Gaps = 61/456 (13%) Frame = -1 Query: 1652 
MDLKGLAWIGNIYGKFEAMCLEMEEVMY-----------------------------QDT 1560 MDLKG+ W+G+IY KFEA LE+EE+M Q+ Sbjct: 4 MDLKGITWVGDIYLKFEARLLEVEEIMREAAEFEWPARAVQFPPKLQMLGCCGCCFGQEA 63 Query: 1559 VKYVENQVQTVGASVKKFYSEVMQDLDPESHVDLVKVAAADLTLNPYAHVEMIQK----S 1392 VKYVENQ+QTV +V+KFYS+VMQDL D A + ++ A V + K Sbjct: 64 VKYVENQMQTVSNNVRKFYSDVMQDLCSPDSEDPANGAVSKFPVDSGADVGIYMKPEDGM 123 Query: 1391 SSKKDPCENTEKLTDDFKVI--SGKSKTGAYKRPVARR--RGHSSA-------------- 1266 K ++ E+L +D K+ SG +R RR R HS Sbjct: 124 EEKCGKADDPEQLAEDPKMTADSGSDCLPLRRRITVRRISRQHSKGSLSNKSNLDTDKNS 183 Query: 1265 ---NYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSAAVE----GNSEEA 1107 N P + T ++S +S S + + E +C + VE + EE+ Sbjct: 184 NCNNVSPNEISGTTTLSSKFSSNVELSDQ--NLEASCDQTARLATPGCVEVTDHFSMEES 241 Query: 1106 KEKICNGVIDKGVPTSGASVDFPRTETKLLTLS-SGRKQAESGDNSYSSCLATGTPAAGS 930 K +I N K VP + F + ++ ++ +GR + G +S S + G Sbjct: 242 KNEIKNA--SKHVP----EISFNKPSLDMVNITETGRHE---GTDSRPSSRNLLEESNGV 292 Query: 929 YTNRVVVSETDRDIKADAGLSISGEEDVMVSHKERLDECSRDAAENDNIIDQEAEIIGKV 750 + VS + + + E+ VS+ DE ++ E+ IID+ EII + Sbjct: 293 CISNEFVSMIESAANGNMQTNKFAYEEDFVSNS---DEWGIESDEDGTIIDEGMEII-RA 348 Query: 749 NESMLAETCVLVEEDVLHFVHNGNDKHKSYKKKLREAFSTKKRSTRKEYEKLATQYK--E 576 +++ L E CVLV D H V K++ Y KK+R+ F ++KRS KEYE+LA Q Sbjct: 349 DKARLEEVCVLVNVDEFHHVPR-EGKNRPY-KKIRDVFRSRKRSVMKEYEQLAAQCSSDS 406 Query: 575 QNSKQESAERLMPAIGDSSAKISPGHTLPDSDWELL 468 ++ ++ES LMP + A S H +S+WEL+ Sbjct: 407 KSKEEESITSLMPTLSIKEANRSLSHDPSESEWELV 442 >gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508700926|gb|EOX92822.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 334 Score = 134 bits (338), Expect = 1e-28 Identities = 107/364 (29%), Positives = 169/364 (46%), Gaps = 31/364 (8%) Frame = -1 Query: 1655 TMDLKGLAWIGNIYGKFEAMCLEMEEVMYQDTVKYVENQVQTVGASVKKFYSEVMQD--- 1485 +MDLKG+ W+G++Y KFEAMCLE+EEVMYQDTVKYVEN+VQTVGASVKKFYS +MQD Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1484 
--LDPESHVDLVKVAAADLTLNPYAHVEMIQKSSSKKDPCE-NTEKLTDDFKVISGKSKT 1314 L P S + VAA+DL + YA K+D + ++E+LT+D +VI+ ++ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIADVNEN 122 Query: 1313 GAYKRPVARRRGHSSANYCPAVLGLTASMSGNVTSMSSFSQRRGSHEVACGSLDVTLSSA 1134 A+ S+ V + S SG+ +S G H C TL+ Sbjct: 123 AAHV---------PSSCQLHMVDNIFESCSGSFVERASSDLLSGEHNNRC-----TLNKT 168 Query: 1133 AVE-------------------------GNSEEAKEKICNGVIDKGVPTSGASVDFPRTE 1029 VE GN+ E C+ + P S D Sbjct: 169 NVEHLLPAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPATLTPVSVEEDDCD--- 225 Query: 1028 TKLLTLSSGRKQAESGDNSYSSCLATGTPAAGSYTNRVVVSETDRDIKADAGLSISGEED 849 ++ + +S +S L G G +V + + +++ + + S E + Sbjct: 226 ----SIEESSNEIKSASDSVPEILPDGLHLVG------IVEKNEMEMRCSSSIIESEESN 275 Query: 848 VMVSHKERLDECSRDAAENDNIIDQEAEIIGKVNESMLAETCVLVEEDVLHFVHNGNDKH 669 ++ ++DA+ + + +E E + ++++ + E+C +V LHF KH Sbjct: 276 GKLN-------WTKDASGSSTVGRKEIETVQQLDKIRVDESCFMVNGAELHFHPQREGKH 328 Query: 668 KSYK 657 K+Y+ Sbjct: 329 KTYQ 332