BLASTX nr result
ID: Catharanthus23_contig00012237
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00012237 (2225 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI30576.3| unnamed protein product [Vitis vinifera]               251   1e-63
ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261...    245   7e-62
emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]    241   1e-60
ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292...    188   1e-44
gb|EOY07557.1| Uncharacterized protein isoform 2 [Theobroma cacao]     178   1e-41
gb|EOY07556.1| Uncharacterized protein isoform 1 [Theobroma cacao]     178   1e-41
gb|EOY07558.1| Uncharacterized protein isoform 3 [Theobroma cacao]     174   1e-40
gb|EOY07559.1| Uncharacterized protein isoform 4 [Theobroma cacao]     173   3e-40
gb|EMJ07008.1| hypothetical protein PRUPE_ppa010713mg [Prunus pe...    162   4e-37
gb|EOY07560.1| Uncharacterized protein isoform 5 [Theobroma cacao]     109   4e-21
ref|XP_006853660.1| hypothetical protein AMTR_s00056p00105160 [A...
61 2e-06 >emb|CBI30576.3| unnamed protein product [Vitis vinifera] Length = 693 Score = 251 bits (640), Expect = 1e-63 Identities = 195/629 (31%), Positives = 307/629 (48%), Gaps = 17/629 (2%) Frame = +1 Query: 49 CTADNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAV 228 C+ + VELEAMRK+D SWHPC VS STG GL+V++ + D +DII + EEA+ Sbjct: 122 CSMGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEAL 179 Query: 229 ARLRIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFT 402 ARLRIRS PL+G CS+++ G+RVLA + + L FDA V + +RVRHS RI CRCTF Sbjct: 180 ARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFV 239 Query: 403 VKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMD 582 +KW+H L+G T I+PSS +MK++T+SI +HP + AF + T +C V E +D Sbjct: 240 IKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVD 299 Query: 583 CEMDIDE-LEKQIEQISNSADACRMKITK-VLSG-EVDVDERSQCNLIPASEVCDTYIHL 753 CE+D+ + LEKQIE+ISN ADA + +I++ +L G + D+ E+ C+ + S++ ++ + Sbjct: 300 CEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQV 359 Query: 754 PPSQ-NSITGSTGGAQLTRPVETEFKHP-PPHSWFTEEASVEGRSRCNPIAACAALASLM 927 P Q N ST + R V E K P PP S +E S E R+ +P+A+ AALAS+M Sbjct: 360 PHEQENHFKRSTRSSSKLR-VNMEVKDPLPPDSSIQKELS-ENRAYLSPLASRAALASIM 417 Query: 928 SKSSENTPTMSFISNSSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVA 1107 S + + S +EN +T+ +L K KD + + Sbjct: 418 SNLPQK------LEFSIYHEEENGFACAPDNITNKHVTMDLLNGTKPVKDKLSSEIEAAF 471 Query: 1108 LNWE-------SKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNA 1266 + E ++ S + + + + + K A+ S S + + P + Sbjct: 472 IPAEIFKSLITTEKGASRRPLLVEASSEIANPKSQNDASPSLSGLIEE--RELRQPAKES 529 Query: 1267 RRLTRSVIR--AIKEKQTVETKNVTEEILCSTFEQNLSVQKMDTLPDMEMVASVIDK-ES 1437 R T S I+ A+ E K EEI K L + + S + K E Sbjct: 530 -RFTSSAIQKHAVSSTSNAEMKTHAEEI------------KSVALTNKRLTRSAVHKQEE 576 Query: 1438 GLTISVKSKTQMDSKGICENNGSVMRSNGVGIQGLNNSRRLTRSAVKGKSKDSNGELPKG 1617 L + VK +++++ N+++ + ++ +G + + PK Sbjct: 577 NLAMEVKQRSEVN----------------------NSAQDIESNSSEGNVTIPDRKAPKK 614 Query: 1618 
LEECSYPGHSKSDLEGNITSEREVPEMEKTISTSPVHETCPNPSIAEAYEKVQLSASIKT 1797 + S P ++S SPV E E +K ++ ++++T Sbjct: 615 KKPVSLPPAAQS---------------------SPVTE--------ERNKKRKMPSAVET 645 Query: 1798 TRKTEGNATDNVSLKQGVKRKSSASKNQE 1884 KTEG + N + K KS++SK QE Sbjct: 646 ASKTEGKVSRNGGNSESQKSKSTSSKKQE 674 >ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261386 [Vitis vinifera] Length = 552 Score = 245 bits (625), Expect = 7e-62 Identities = 168/482 (34%), Positives = 260/482 (53%), Gaps = 47/482 (9%) Frame = +1 Query: 73 EVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSA 252 + VELEAMRK+D SWHPC VS STG GL+V++ + D +DII + EEA+ARLRIRS Sbjct: 7 DATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEALARLRIRSV 64 Query: 253 PLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGL 426 PL+G CS+++ G+RVLA + + L FDA V + +RVRHS RI CRCTF +KW+H L Sbjct: 65 PLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRCTFVIKWLHQDL 124 Query: 427 EGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE- 603 +G T I+PSS +MK++T+SI +HP + AF + T +C V E +DCE+D+ + Sbjct: 125 KGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVDCEVDLHKL 184 Query: 604 LEKQIEQISNSADACRMKITK-VLSG-EVDVDERSQCNLIPASEVCDTYIHLPPSQ-NSI 774 LEKQIE+ISN ADA + +I++ +L G + D+ E+ C+ + S++ ++ +P Q N Sbjct: 185 LEKQIEEISNLADASKKEISEDILFGIKADIKEQMDCSPVAESKITSSHFQVPHEQENHF 244 Query: 775 TGSTGGAQLTRPVETEFKHP-PPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951 ST + R V E K P PP S +E S E R+ +P+A+ AALAS+MS + Sbjct: 245 KRSTRSSSKLR-VNMEVKDPLPPDSSIQKELS-ENRAYLSPLASRAALASIMSNLPQKLE 302 Query: 952 -----------------------TMSFISNSSKINDENFLGTESGTVTSISTVKELFPSK 1062 TM ++ + + D+ E+ + + + K Sbjct: 303 FSIYHEEENGFACAPDNITNKHVTMDLLNGTKPVKDKLSSEIEAAFIPAEIFKSLITTEK 362 Query: 1063 KLFKDPETLNASSVALNWESKNKESSQEVASCVGNHVR-----------STKKMCLAARS 1209 + P + ASS N +S+N S +R + +K +++ S Sbjct: 363 GASRRPLLVEASSEIANPKSQNDASPSLSGLIEERELRQPAKESRFTSSAIQKHAVSSTS 422 Query: 1210 SSAMNRSPFENVEMPVTNARRLTRSVIR------AIKEKQTVETKNVTEEILCSTFEQNL 1371 ++ M E + +TN +RLTRS + A++ KQ 
E N ++I ++ E N+ Sbjct: 423 NAEMKTHAEEIKSVALTN-KRLTRSAVHKQEENLAMEVKQRSEVNNSAQDIESNSSEGNV 481 Query: 1372 SV 1377 ++ Sbjct: 482 TI 483 >emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera] Length = 1508 Score = 241 bits (615), Expect = 1e-60 Identities = 180/568 (31%), Positives = 278/568 (48%), Gaps = 76/568 (13%) Frame = +1 Query: 49 CTADNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAV 228 C+ + VELEAMRK+D SWHPC VS STG GL+V++ + D +DII + EEA+ Sbjct: 27 CSMGTGTGDATVELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQ--DLEDIISNEEEAL 84 Query: 229 ARLRIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEV------------------- 345 ARLRIRS PL+G CS+++ G+RVLA + + L FDA V Sbjct: 85 ARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKEMSHEFXIECDLIDWGI 144 Query: 346 ---IEVVRVRHSKRIHCRCTFTVKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFF 516 + +RVRHS RI CRCTF +KW+H L+G T I+PSS +MK++T+SI +HP + AF Sbjct: 145 XVNVVALRVRHSTRISCRCTFVIKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFL 204 Query: 517 NTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEKQIEQISNSADACRMKITK-VLSG-EVD 687 + T +C V E +DCE+D+ + LEKQIE+ISN ADA + +I++ +L G + D Sbjct: 205 KPIKTLNCSAAPSFSTVFEDVDCEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKAD 264 Query: 688 VDERSQCNLIPASEVCDTYIHLPPSQ-NSITGSTGGAQLTRPVETEFKHP-PPHSWFTEE 861 + E+ C+ + S++ ++ +P Q N ST + R V E K P PP S EE Sbjct: 265 IKEQMDCSPVAESKITSSHFQVPHEQENHFKRSTRSSSKLR-VNMEVKDPLPPDSSIQEE 323 Query: 862 ASVEGRSRCNPIAACAALASLMSKSSENTP-----------------------TMSFISN 972 S E R+ +P+A+ AALAS+MS + TM ++ Sbjct: 324 LS-ENRAYLSPLASRAALASIMSNLPQKLEFSIXHEEENGFACAPDNITNKHVTMDLLNG 382 Query: 973 SSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVALNWESKNKESSQEVA 1152 + + D+ E+ + + + K + P + ASS N +S+N S Sbjct: 383 TKPVKDKLSSEIEAAFIPAEIFKSLITTEKGASRRPLLVEASSEIANPKSQNDASPSLSG 442 Query: 1153 SCVGNHVRSTKKMC----------LAARSSSAMNRSPFENVEMPVTNARRLTRSVIR--- 1293 +R K + +S+A ++ E ++ +RLTRS + Sbjct: 443 LIEERELRQPAKESRFTSSAIQKHAVSSTSNAEMKTHAEEIKSVALXNKRLTRSAVHKQE 502 Query: 1294 ---AIKEKQTVETKNVTEEILCSTFEQNLSV--------QKMDTLPDMEMVASVIDKESG 1440 A++ KQ E N ++I ++ E N+++ 
+K +LP +S + +E Sbjct: 503 ENLAMEVKQRSEVNNSAQDIESNSSEGNVTIPDRKAPKKKKPVSLPPAAQTSSPVTEERN 562 Query: 1441 LTISVKSKTQMDSKGICENNGSVMRSNG 1524 + S + SK G V R+ G Sbjct: 563 KKRKMPSAVETASK----TEGKVSRNGG 586 >ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292719 [Fragaria vesca subsp. vesca] Length = 580 Score = 188 bits (477), Expect = 1e-44 Identities = 172/615 (27%), Positives = 296/615 (48%), Gaps = 10/615 (1%) Frame = +1 Query: 70 AEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRS 249 AE ELEA+ K+D SW+PC VS ST L+V++ + + +D++L+ +EA+ RLR RS Sbjct: 7 AENATELEALCKQDSSWYPCHVSLSSTEDSLIVDFG--RQELEDMVLNKDEALMRLRFRS 64 Query: 250 APLEGVACSIVQPGDRVLARRNR--MNLFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNG 423 PL+G CS ++ G+ VLA + +DA+V +V RVRHS R++CRC+F + W+H Sbjct: 65 GPLQGDDCSHIE-GEHVLAIHKSPFKSYLYDAKVEKVTRVRHSTRVYCRCSFMILWLHPD 123 Query: 424 LEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE 603 +G+ I SS +MK++++SI+ HPT+ A F ++ + L + E +D E D+++ Sbjct: 124 FKGQMVTITSSSIMKLASKSINSHPTVAALFKSVKQMGLYTAPLLPIMHEDIDVEFDLNK 183 Query: 604 -LEKQIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASEVCDTYIHLPPSQNSITG 780 L KQIE+I+ SA+ +IT + V D H S+ Sbjct: 184 LLGKQIEEINISANRVTNEITVDIIEGVKADSSGHVTESKIGTSKAQVSHDQDQLKSVAN 243 Query: 781 STGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTPTMS 960 +G ++ + E E HPP S +E E R +P+AA AALASL+S + ++ Sbjct: 244 RSGNLEVNK--EDEDPHPPFLS--KQEEHSEHRCHISPLAARAALASLVSLTHKHIAI-- 297 Query: 961 FISNSSKINDENFLGTESGTVTSISTVKELFPSKKLFKDPETLNASSVALNWESKNKESS 1140 SGT ELF S +++ +++ S ES Sbjct: 298 -----------------SGT--------ELFKSS---------DSTDLSIKVSSDRTESP 323 Query: 1141 QEVASCVGNHVRSTKKMCLAA--RSSSAMNRSPFENVEMPVTNARRLTRSVIRAIKEKQT 1314 + + +G+ R+T+ L + +S ++ S VTN LTRS ++ K+ + Sbjct: 324 KNGNANLGSGARTTRSRGLKGFEKQNSDLHDSAEAIKLRAVTNRGWLTRSAVKEEKDISS 383 Query: 1315 VETKNVTEEILCSTFEQNLSVQKMDTLPDMEMVASVIDKESGLT-ISVKSKTQMDSKGIC 1491 V +K+ +EE + ++ S D + + V+ K++G++ +V S +S G Sbjct: 384 VASKHGSEESESAQSTESYSSDGTDIVHGNK----VLTKKNGISKKAVSSPLHSESNGHK 439 Query: 1492 
ENNGSVMRSNGVGIQGLNNSRRLTRSAVKGKSKDSNGELPKGLEECSYPGHSKSDLEGN- 1668 EN + S +G+ + ++ T++ +KD+N + L + S+ + N Sbjct: 440 EN----LTSGDLGV--IQDAYVQTKTC----AKDTNSSVSTNLRRLT---RSRVSCQDNL 486 Query: 1669 ITSEREVPEMEKTISTSPVHETCPNPSIAEAYE---KVQLSASIKTTRKTEGNATDNVSL 1839 I E E E S + + + + + E + S ++ +R+TEG + + Sbjct: 487 IVPECHAVEKENRESKKKKAGSASSQNYSTSGEDGNRQHNSGVVRNSRQTEGKMSGSGDN 546 Query: 1840 KQGVKRKSSASKNQE 1884 QG KRKS++S QE Sbjct: 547 SQGRKRKSNSSSRQE 561 >gb|EOY07557.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 611 Score = 178 bits (451), Expect = 1e-41 Identities = 140/419 (33%), Positives = 207/419 (49%), Gaps = 22/419 (5%) Frame = +1 Query: 82 VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261 VELEA RKED SWHPC V S+G L+V + + + DD++L EE + LR RS PL+ Sbjct: 11 VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68 Query: 262 GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435 C ++ G+RVLA R LF DA V++V RVRHSKR CRCTF +KW+ LEG+ Sbjct: 69 VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127 Query: 436 TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612 T +PSS +MK++T+SI HP I SPL + EG D E+D+++ L+K Sbjct: 128 TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187 Query: 613 QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771 QIEQISN ADA + I + + + Q P +E V D + HL Sbjct: 188 QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243 Query: 772 ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951 T ST Q + E ++ H+ +EA ++ RS +P+A+ AALAS + + + Sbjct: 244 TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 296 Query: 952 TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101 + +SS G +S + ++S E+ P D P+ SS Sbjct: 297 CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 356 Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272 WE++NK +S E+ S K+ + +S E+P++ A++ Sbjct: 357 
CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 410 >gb|EOY07556.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 567 Score = 178 bits (451), Expect = 1e-41 Identities = 140/419 (33%), Positives = 207/419 (49%), Gaps = 22/419 (5%) Frame = +1 Query: 82 VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261 VELEA RKED SWHPC V S+G L+V + + + DD++L EE + LR RS PL+ Sbjct: 11 VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68 Query: 262 GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435 C ++ G+RVLA R LF DA V++V RVRHSKR CRCTF +KW+ LEG+ Sbjct: 69 VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127 Query: 436 TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612 T +PSS +MK++T+SI HP I SPL + EG D E+D+++ L+K Sbjct: 128 TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187 Query: 613 QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771 QIEQISN ADA + I + + + Q P +E V D + HL Sbjct: 188 QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243 Query: 772 ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951 T ST Q + E ++ H+ +EA ++ RS +P+A+ AALAS + + + Sbjct: 244 TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 296 Query: 952 TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101 + +SS G +S + ++S E+ P D P+ SS Sbjct: 297 CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 356 Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272 WE++NK +S E+ S K+ + +S E+P++ A++ Sbjct: 357 CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 410 >gb|EOY07558.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 409 Score = 174 bits (442), Expect = 1e-40 Identities = 117/290 (40%), Positives = 161/290 (55%), Gaps = 10/290 (3%) Frame = +1 Query: 82 VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261 VELEA RKED SWHPC V S+G L+V + + + DD++L EE + LR RS PL+ Sbjct: 11 
VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68 Query: 262 GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435 C ++ G+RVLA R LF DA V++V RVRHSKR CRCTF +KW+ LEG+ Sbjct: 69 VDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQDLEGQ 127 Query: 436 TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612 T +PSS +MK++T+SI HP I SPL + EG D E+D+++ L+K Sbjct: 128 TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 187 Query: 613 QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771 QIEQISN ADA + I + + + Q P +E V D + HL Sbjct: 188 QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 243 Query: 772 ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALAS 921 T ST Q + E ++ H+ +EA ++ RS +P+A+ AALAS Sbjct: 244 TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALAS 289 >gb|EOY07559.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 565 Score = 173 bits (438), Expect = 3e-40 Identities = 139/419 (33%), Positives = 206/419 (49%), Gaps = 22/419 (5%) Frame = +1 Query: 82 VELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARLRIRSAPLE 261 VELEA RKED SWHPC V S+G L+V + + + DD++L EE + LR RS PL+ Sbjct: 11 VELEAKRKEDSSWHPCRVYLSSSGDSLIVNFG--RQELDDMLLQKEEVLMHLRFRSMPLQ 68 Query: 262 GVACSIVQPGDRVLARRNRMN--LFFDAEVIEVVRVRHSKRIHCRCTFTVKWIHNGLEGE 435 C ++ G+RVLA R LF DA V++ RVRHSKR CRCTF +KW+ LEG+ Sbjct: 69 VDDCFHIEEGERVLADRKSQFKILFHDAVVVD--RVRHSKR-GCRCTFMIKWLDQDLEGQ 125 Query: 436 TEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVSPLRAVAEGMDCEMDIDE-LEK 612 T +PSS +MK++T+SI HP I SPL + EG D E+D+++ L+K Sbjct: 126 TFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNKLLQK 185 Query: 613 QIEQISNSADACRMKITKVLSGEVDVDERSQCNLIPASE-------VCDTYIHLPPSQNS 771 QIEQISN ADA + I + + + Q P +E V D + HL Sbjct: 186 QIEQISNLADASKKDIPEDIPWRNKGVNKGQSPHKPTAESNACVPAVADHHNHL----KR 241 Query: 772 ITGSTGGAQLTRPVETEFKHPPPHSWFTEEASVEGRSRCNPIAACAALASLMSKSSENTP 951 T ST Q + E ++ H+ +EA ++ RS +P+A+ AALAS + + + Sbjct: 242 
TTRSTRKLQ----INIEAENQSGHTISMKEAFIQSRSHLSPLASRAALASSLLTAKK--- 294 Query: 952 TMSFISNSSKINDENFLGTESGTVTSIS------TVKELFPSKKLFKD----PETLNASS 1101 + +SS G +S + ++S E+ P D P+ SS Sbjct: 295 CLDMDLSSSMTASMFMKGKDSSDILAVSIPLVSEASHEISPHISTQGDASCEPQPTKPSS 354 Query: 1102 V--ALNWESKNKESSQEVASCVGNHVRSTKKMCLAARSSSAMNRSPFENVEMPVTNARR 1272 WE++NK +S E+ S K+ + +S E+P++ A++ Sbjct: 355 CIPTKGWENENK-TSDEINCTAEQRTYSPVKITAESVTSGVAT----STAELPISRAKK 408 >gb|EMJ07008.1| hypothetical protein PRUPE_ppa010713mg [Prunus persica] Length = 238 Score = 162 bits (411), Expect = 4e-37 Identities = 90/207 (43%), Positives = 134/207 (64%), Gaps = 5/207 (2%) Frame = +1 Query: 58 DNDAAEVEVELEAMRKEDFSWHPCTVSPCSTGVGLMVEYSNDKSDPDDIILSTEEAVARL 237 D AE ELEAM KED SWHPC VS ST L+V++ + + D++L+T+EA+ RL Sbjct: 3 DTSEAENVTELEAMCKEDSSWHPCQVSLSSTKDSLIVDFGGQELE--DMVLNTDEALTRL 60 Query: 238 RIRSAPLEGVACSIVQPGDRVLA--RRNRMNLFFDAEVIEVVRVRHSKRIHCRCTFTVKW 411 R R APL+G C+ ++ G+ VLA + + FFDA+V +V+RVRHS R++CRCTF +KW Sbjct: 61 RFRCAPLQGDDCTRIE-GEHVLAINKSQSKSHFFDAKVEKVLRVRHSTRVYCRCTFMIKW 119 Query: 412 IHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLTSSCFDVS--PLRAVAEGMDC 585 +H L+G+ +PSS +MK++ ++I++HPT+ AF ++ S P+ E Sbjct: 120 LHQDLKGQMVTVPSSSIMKLTGKNINVHPTVSAFLKSVKQMGLDSASSVPVMLEVEDFAV 179 Query: 586 EMDIDE-LEKQIEQISNSADACRMKIT 663 E+D+++ LEKQIE I+ SA+ R IT Sbjct: 180 ELDLNKFLEKQIEDITVSANEFRKAIT 206 >gb|EOY07560.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 468 Score = 109 bits (273), Expect = 4e-21 Identities = 98/327 (29%), Positives = 151/327 (46%), Gaps = 20/327 (6%) Frame = +1 Query: 352 VVRVRHSKRIHCRCTFTVKWIHNGLEGETEIIPSSGLMKMSTESIHLHPTIFAFFNTLLT 531 V RVRHSKR CRCTF +KW+ LEG+T +PSS +MK++T+SI HP I Sbjct: 2 VDRVRHSKR-GCRCTFMIKWLDQDLEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKH 60 Query: 532 SSCFDVSPLRAVAEGMDCEMDIDE-LEKQIEQISNSADACRMKITKVLSGEVDVDERSQC 708 SPL + EG D E+D+++ L+KQIEQISN ADA + I + + + Q Sbjct: 61 RGLSYSSPLLTILEGTDSEIDLNKLLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 120 Query: 709 
NLIPASE-------VCDTYIHLPPSQNSITGSTGGAQLTRPVETEFKHPPPHSWFTEEAS 867 P +E V D + HL T ST Q + E ++ H+ +EA Sbjct: 121 PHKPTAESNACVPAVADHHNHL----KRTTRSTRKLQ----INIEAENQSGHTISMKEAF 172 Query: 868 VEGRSRCNPIAACAALASLMSKSSENTPTMSFISNSSKINDENFLGTESGTVTSIS---- 1035 ++ RS +P+A+ AALAS + + + + +SS G +S + ++S Sbjct: 173 IQSRSHLSPLASRAALASSLLTAKK---CLDMDLSSSMTASMFMKGKDSSDILAVSIPLV 229 Query: 1036 --TVKELFPSKKLFKD----PETLNASSV--ALNWESKNKESSQEVASCVGNHVRSTKKM 1191 E+ P D P+ SS WE++NK +S E+ S K+ Sbjct: 230 SEASHEISPHISTQGDASCEPQPTKPSSCIPTKGWENENK-TSDEINCTAEQRTYSPVKI 288 Query: 1192 CLAARSSSAMNRSPFENVEMPVTNARR 1272 + +S E+P++ A++ Sbjct: 289 TAESVTSGVAT----STAELPISRAKK 311 >ref|XP_006853660.1| hypothetical protein AMTR_s00056p00105160 [Amborella trichopoda] gi|548857321|gb|ERN15127.1| hypothetical protein AMTR_s00056p00105160 [Amborella trichopoda] Length = 228 Score = 61.2 bits (147), Expect = 2e-06 Identities = 46/124 (37%), Positives = 64/124 (51%), Gaps = 7/124 (5%) Frame = +1 Query: 79 EVELEAMRKEDFSWHPCTVSPC-----STGVGLMVEYSNDKSDPDDIILSTEEAVARLRI 243 E+E EA +D +W+ + S + V ++ ++ D+ + + + AV R Sbjct: 97 ELEFEARSAKDGAWYDVALFLTHRILHSGEPEVRVRFTGFGAEEDEWV-NVKRAVRR--- 152 Query: 244 RSAPLEGVACSIVQPGDRVLARRNRMNL--FFDAEVIEVVRVRHSKRIHCRCTFTVKWIH 417 RS PLE C V PGD VL R NL +FDA VIEV R RH R CRCTF V++ H Sbjct: 153 RSIPLESSECGKVMPGDLVLCFREGENLATYFDAHVIEVQRRRHDLR-GCRCTFLVRYDH 211 Query: 418 NGLE 429 + E Sbjct: 212 DQAE 215