BLASTX nr result
ID: Rehmannia22_contig00017349
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00017349
         (686 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CBI19835.3| unnamed protein product [Vitis vinifera]             163   5e-38
gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus pe...  147   4e-33
ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-lik...  145   1e-32
ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-lik...  144   3e-32
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]    142   7e-32
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...  134   3e-29
ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-lik...  133   5e-29
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...  133   6e-29
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...  132   8e-29
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...  132   8e-29
gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theob...  131   2e-28
gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]   131   2e-28
gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao]   131   2e-28
gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]   131   2e-28
gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao]   131   2e-28
gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]   131   2e-28
gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theob...  131   2e-28
gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]   131   2e-28
ref|XP_002893071.1| hypothetical protein ARALYDRAFT_335233 [Arab... 
125 9e-27 ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ... 125 9e-27 >emb|CBI19835.3| unnamed protein product [Vitis vinifera] Length = 993 Score = 163 bits (412), Expect = 5e-38 Identities = 117/287 (40%), Positives = 156/287 (54%), Gaps = 59/287 (20%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQL----------RANAQVSTGGFSSQKVSDQ--- 141 A RNSELQASR+ICA+TASKLQNLEAQL ++N Q+ G SQ S+ Sbjct: 381 AKRNSELQASRNICAKTASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSM 440 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 MSEDGNDD VSCA S AT +S LS +KE N NHL+LMDDFLEMEK+A Sbjct: 441 TSMSEDGNDDAVSCAESWATGLVSGLSQFKKE----------NANHLELMDDFLEMEKLA 490 Query: 319 CLPHGSNGAVSSSDV--------------------SVNTG-----------NTGSELVKH 405 CL + SNGA S D+ +TG +T L +H Sbjct: 491 CLSNNSNGAFSKHDLDSLANQLRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQH 550 Query: 406 E---DDAKINVNSCI----------DTVQTNDQALEMAISGIYDFVMILGKEAKALPGSS 546 +DA + I DT+ Q L AIS I++FV+ LGKEA A+ G+S Sbjct: 551 SACPEDAGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGAS 610 Query: 547 -DEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 D +G +K++ FSA + + ++++ DF+ D+S+VL+KA+ L+FN Sbjct: 611 PDGNGWSRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFN 657 >gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica] Length = 993 Score = 147 bits (370), Expect = 4e-33 Identities = 97/235 (41%), Positives = 131/235 (55%), Gaps = 15/235 (6%) Frame = +1 Query: 7 RNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ----L 144 RNSELQ SR +CAQT SKLQ LEAQL+ N Q++T G SSQ S+ Sbjct: 305 RNSELQTSRGMCAQTVSKLQTLEAQLQINNQQKGSPKSVVQITTEGSSSQNASNPPSLTS 364 Query: 145 MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACL 324 +SEDGNDD+ SCA S AT S+LS+I+KEK+ +K+EN NHL+LMDDFLEMEK+ACL Sbjct: 365 LSEDGNDDDRSCAESWATTLGSDLSHIRKEKSNQKSNKAENQNHLNLMDDFLEMEKLACL 424 Query: 325 PHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYDFV 504 P+ SNGAVS +S N SE H+ + I + Q D +S + Sbjct: 425 
PNDSNGAVS---ISSGPNNKTSERENHDASGDVTAEKDIQSEQQQD------LSPLEGDQ 475 Query: 505 MILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + L SDE+ L + KL + + E ++ + + + DI HV+ +A Sbjct: 476 ASSNVKLSGLSPESDENQLPLVKLRSKISMLLELLSKDTDFGKVIEDIKHVVQEA 530 >ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-like [Solanum lycopersicum] Length = 1091 Score = 145 bits (366), Expect = 1e-32 Identities = 88/166 (53%), Positives = 110/166 (66%), Gaps = 17/166 (10%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANA------------QVSTGGFSSQKVSDQL 144 A+RNSELQASRSICA+T+SKLQ+LEAQL+AN Q S G FS + ++ L Sbjct: 384 AHRNSELQASRSICAKTSSKLQSLEAQLQANLEQKSPQKSTIRRQPSEGSFSHE--ANHL 441 Query: 145 -----MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEME 309 MSEDGNDDNVSCA S T MS+LS ++KEKN DSPHKSE +HLDLMDDFLEME Sbjct: 442 PRLASMSEDGNDDNVSCASSWTTALMSDLSNVKKEKNFDSPHKSECASHLDLMDDFLEME 501 Query: 310 KIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDT 447 K+A +NGAVSS D+ N +++ D ++V++ DT Sbjct: 502 KLAYQSSDTNGAVSSPDIPRNARPETTKV-----DTSVHVSTSPDT 542 Score = 81.6 bits (200), Expect = 2e-13 Identities = 68/229 (29%), Positives = 114/229 (49%), Gaps = 5/229 (2%) Frame = +1 Query: 13 SELQASRS--ICAQTASKLQNLEAQLRANAQVSTGGFSSQKVSD-QLMSEDGNDDNVSCA 183 SE QAS+ + +Q+ L + ++ +++ST S K +D Q + ED + Sbjct: 553 SEDQASQQEEVSSQSHQPLLDASISMKLQSRISTVLESLSKEADIQRIQEDLRE------ 606 Query: 184 GSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDV 363 +Q+ +NA P +++ + L S + S Sbjct: 607 ------------IVQEMRNAVVPQSTKSIVEITL----------------SPKTATESQA 638 Query: 364 SVNTGNTGSEL-VKHEDDAKINVNSCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPG 540 S++ G E + +D+K SC +++ + L A+S I+DFV+ LGKEAKA+ G Sbjct: 639 SLDDGEANLEKEIPVSEDSK----SCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQG 694 Query: 541 SS-DEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 ++ D G+ +KLD FSA Y E I++ +++ +FVLD+SHVLS A+ L FN Sbjct: 695 TAPDGSGINEKLDDFSATYVEVISNRLSMVNFVLDLSHVLSNASQLHFN 743 >ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-like [Solanum tuberosum] 
Length = 1093 Score = 144 bits (363), Expect = 3e-32 Identities = 82/141 (58%), Positives = 100/141 (70%), Gaps = 17/141 (12%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANA------------QVSTGGFSSQKVSDQL 144 A+RNSELQASRSICA+T+SKLQ+LEAQL+AN Q S G S + ++ L Sbjct: 387 AHRNSELQASRSICAKTSSKLQSLEAQLQANVEQKSPQKSTIRRQPSEGSLSHE--ANHL 444 Query: 145 -----MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEME 309 MSEDGNDDNVSCA S T MS+L++++KEKN DSPHKSE+ +HLDLMDDFLEME Sbjct: 445 PRLASMSEDGNDDNVSCASSWTTALMSDLTHVKKEKNFDSPHKSESASHLDLMDDFLEME 504 Query: 310 KIACLPHGSNGAVSSSDVSVN 372 K+A +NGAVSS D+ N Sbjct: 505 KLAYQSSDTNGAVSSPDIPNN 525 Score = 84.0 bits (206), Expect = 4e-14 Identities = 66/215 (30%), Positives = 111/215 (51%), Gaps = 11/215 (5%) Frame = +1 Query: 73 EAQLRANAQVSTGGFSSQKVSDQLMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSP 252 ++QL+ + + S G Q ++ +S + + S+ S K+AD Sbjct: 544 DSQLKEHNETSVSG--DQASRNEEVSSQSHQPLSDTSISMKLQSRISTVLESLSKDADIQ 601 Query: 253 HKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVN 432 E DL + EM A +P + V ++++++ NT +E DD + N+ Sbjct: 602 RIQE-----DLREIVQEMRN-ALIPQSTKSIV---EITLSS-NTATESQPSLDDGEANLE 651 Query: 433 ----------SCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-DEDGLIKKLDT 579 SC +++ + L A+S I+DFV+ LGKEAKA+ G++ D G+ +KLD Sbjct: 652 KEIPVSEDSKSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGSGINEKLDD 711 Query: 580 FSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 FSA Y E I++++++ +FVLD+SHVLS A+ L FN Sbjct: 712 FSATYVEVISNKLSMVNFVLDLSHVLSNASQLHFN 746 >gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis] Length = 1087 Score = 142 bits (359), Expect = 7e-32 Identities = 93/235 (39%), Positives = 135/235 (57%), Gaps = 14/235 (5%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141 A RNSELQ SRS+CA+T+SKLQ+LEAQ+++N Q+S G SQ S+ Sbjct: 383 AKRNSELQVSRSMCAKTSSKLQSLEAQIQSNNQHKTTPKSIVQISAEGSFSQNASNPPSL 442 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 MSEDGNDD+ SCA S T +SE+S ++KEK+ + +++E NHL+LMDDFLEMEK+A 
Sbjct: 443 TSMSEDGNDDDRSCAESWTTTLISEVSQVKKEKSNEKTNRAEKPNHLNLMDDFLEMEKLA 502 Query: 319 CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498 CL + SNGA+S SD + + SE V H DA V + +N A + S Sbjct: 503 CLSNESNGAISVSD---SMSSKISETVNH--DASEVVMRKEEQCDSNSLANQQLTSN--- 554 Query: 499 FVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSK 663 GK + PGS+ E + KL + + E+++ + ++ + DI H + + Sbjct: 555 -----GKSPELRPGSNSEQLPLMKLQSRISVLLESVSKDSDVGTILEDIKHAIQE 604 Score = 57.8 bits (138), Expect = 3e-06 Identities = 32/73 (43%), Positives = 49/73 (67%), Gaps = 1/73 (1%) Frame = +1 Query: 469 LEMAISGIYDFVMILGKEAKALPGSSDEDG-LIKKLDTFSAKYAEAINSEINLFDFVLDI 645 L AIS I+DFV+ LGKEA + +S E ++++ FS + I+S+++L DFVLD+ Sbjct: 663 LAAAISQIHDFVLFLGKEAMGVHDTSTEGSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDL 722 Query: 646 SHVLSKATLLDFN 684 S VL+KA+ L F+ Sbjct: 723 SSVLAKASELRFS 735 >ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED: filament-like plant protein 4-like isoform X2 [Citrus sinensis] Length = 1091 Score = 134 bits (337), Expect = 3e-29 Identities = 90/235 (38%), Positives = 133/235 (56%), Gaps = 16/235 (6%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSELQASR++CA+TASKLQ+LEAQ++ + Q ++ G++SQ S+ Sbjct: 384 AKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSL 443 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 MSED NDD VSCA S AT +SELS I+KEKN + +K+E HL+LMDDFLEMEK+A Sbjct: 444 TSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLA 503 Query: 319 CLPH--GSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 CL + SNG +++S+ N S++V H DA V S D + + + ++ Sbjct: 504 CLSNDTNSNGTITASN---GPNNKTSDIVNH--DASGAVTSGEDLLSEQQRDMNPSV--- 555 Query: 493 YDFVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVL 657 D + + + P + + KL + + E I+ + ++ V DI V+ Sbjct: 556 -DKLSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVV 609 Score = 73.2 bits (178), Expect 
= 7e-11 Identities = 66/237 (27%), Positives = 109/237 (45%), Gaps = 28/237 (11%) Frame = +1 Query: 58 KLQNLEAQLRANAQVSTGGFSSQKVSD--------------QLMSEDGNDDNVSCAGSLA 195 KL L +N ++ + K SD L+SE D N S Sbjct: 501 KLACLSNDTNSNGTITASNGPNNKTSDIVNHDASGAVTSGEDLLSEQQRDMNPSVD---K 557 Query: 196 TVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEK-IACLPHGSNGAVSSSDVSVN 372 S +E S + E +A P + + + ++ + + + + + V V+++ Sbjct: 558 LSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLH 617 Query: 373 --TGNTGSELVKHED----------DAKINVNSCID-TVQTNDQALEMAISGIYDFVMIL 513 + N SE VK D DA++N ID TVQ Q L AI+ I+DFV+ L Sbjct: 618 QHSANCISEEVKCSDVSCSAEAYPGDARLNTERKIDLTVQVISQELVAAITQIHDFVLFL 677 Query: 514 GKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 GKEA+A+ +++E+G +K++ F + + I+S L DFV +S+VL+KA+ L N Sbjct: 678 GKEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRIN 734 >ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-like [Fragaria vesca subsp. vesca] Length = 1091 Score = 133 bits (335), Expect = 5e-29 Identities = 89/236 (37%), Positives = 127/236 (53%), Gaps = 14/236 (5%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 + RNSELQASRSICA+T SKLQ LEAQL+ Q +ST G S+ S Sbjct: 400 SKRNSELQASRSICAKTVSKLQTLEAQLQITGQQKGSPKSVVHISTEGSLSRNASIPPSF 459 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 MSEDGNDD+ SCA S T S+LS+ +KEKN + K+EN NHL+LMDDFLEMEK+A Sbjct: 460 ASMSEDGNDDDRSCAESWGTTLNSDLSHSKKEKNNEKSSKAENQNHLNLMDDFLEMEKLA 519 Query: 319 CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498 CLP+ SNG V +S++ +N E ++ I + Q ++ + Sbjct: 520 CLPNDSNG-VKTSEIEIN-----------EASGEVTATKDIHSEQQHEASFN-------- 559 Query: 499 FVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 G + PG+++ + KL + + E ++ + + + DI HV+ +A Sbjct: 560 -----GDLSVLSPGANENKLPLVKLRSRISVLLELLSKDTDFVKVIEDIKHVVQEA 610 Score = 63.5 bits (153), Expect = 6e-08 Identities = 42/149 (28%), Positives = 82/149 (55%), Gaps = 7/149 (4%) Frame = +1 Query: 259 
SENTNHLDLMDDF---LEMEKIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINV 429 S++T+ + +++D ++ + A PH V+S +++ + + H +D+ + Sbjct: 591 SKDTDFVKVIEDIKHVVQEAQDALQPH----TVNSVSEEIHSADAICDTQAHPEDSVFST 646 Query: 430 N---SCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPGS-SDEDGLIKKLDTFSAKYA 597 + +T+ + L AIS I+DFV+ LGKE + + D + L +K++ FS ++ Sbjct: 647 EKETTAKETMSAISEELASAISLIHDFVVFLGKEVVGVHDTFPDSNELSQKIEEFSGTFS 706 Query: 598 EAINSEINLFDFVLDISHVLSKATLLDFN 684 + I+ ++L D VLD+SHVL+ A+ L FN Sbjct: 707 KVIHGNLSLVDLVLDLSHVLANASELKFN 735 >ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|567885183|ref|XP_006435150.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537271|gb|ESR48389.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537272|gb|ESR48390.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] Length = 1091 Score = 133 bits (334), Expect = 6e-29 Identities = 89/235 (37%), Positives = 133/235 (56%), Gaps = 16/235 (6%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSELQASR++CA+TASKLQ+LEAQ++ + Q ++ G++SQ S+ Sbjct: 384 AKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSL 443 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 MSED NDD VSCA S AT +SELS I+KEKN + +K+E HL+LMDDFLEMEK+A Sbjct: 444 TSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLA 503 Query: 319 CLPH--GSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 CL + SNG +++S+ N S+++ H DA V S D + + + ++ Sbjct: 504 CLSNDTNSNGTITASN---GPNNKTSDILNH--DASGAVTSGEDLLSEQQRDMNPSV--- 555 Query: 493 YDFVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVL 657 D + + + P + + KL + + E I+ + ++ V DI V+ Sbjct: 556 -DKLSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVV 609 Score = 73.6 bits (179), Expect = 6e-11 Identities = 67/237 (28%), Positives = 108/237 (45%), Gaps = 28/237 (11%) Frame = +1 Query: 58 KLQNLEAQLRANAQVSTGGFSSQKVSD--------------QLMSEDGNDDNVSCAGSLA 195 KL L +N ++ + K SD L+SE D N S Sbjct: 
501 KLACLSNDTNSNGTITASNGPNNKTSDILNHDASGAVTSGEDLLSEQQRDMNPSVD---K 557 Query: 196 TVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEK-IACLPHGSNGAVSSSDVSVN 372 S +E S + E +A P + + + ++ + + + + + V V+++ Sbjct: 558 LSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLH 617 Query: 373 --TGNTGSELVKHED----------DAKINVNSCID-TVQTNDQALEMAISGIYDFVMIL 513 + N SE VK D DA +N ID TVQ Q L AIS I+DFV+ L Sbjct: 618 QHSANCISEEVKCSDVSCSAEAYPGDASLNTERKIDLTVQVISQELVAAISQIHDFVLFL 677 Query: 514 GKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 GKEA+A+ +++E+G +K++ F + + I+S L DFV +S+VL+KA+ L N Sbjct: 678 GKEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRIN 734 >ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344134|gb|EEE81259.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 1063 Score = 132 bits (333), Expect = 8e-29 Identities = 86/237 (36%), Positives = 135/237 (56%), Gaps = 15/237 (6%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141 A RNSELQASR++CA+TASKLQ+LEAQ + N QV G+SSQ +S+ Sbjct: 375 AKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSL 434 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD SCA S AT S+S++S+ +K+ + + +K+EN HL+LMDDFLEMEK+A Sbjct: 435 TSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLA 494 Query: 319 CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498 CL A S++ +S + N SE + A++++ D + + L+ + + Sbjct: 495 CL-----NADSATTISSSPNNKASETANTDALAEVSLQK-EDALSEEKRDLDPLANHV-- 546 Query: 499 FVMILGKEAKALPGSSDED-GLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 K++ A+ SD D KL + + E+++ E+++ + +I V+ A Sbjct: 547 ---SCNKDSSAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDA 600 Score = 58.5 bits (140), Expect = 2e-06 Identities = 64/225 (28%), Positives = 108/225 (48%), Gaps = 11/225 (4%) Frame = +1 Query: 43 AQTASKLQNLEAQLRANAQVSTGGFSSQKVSDQLMSEDGND-----DNVSC---AGSLAT 198 A T S N +A AN + S QK + +SE+ D ++VSC + ++ + Sbjct: 501 
ATTISSSPNNKASETANTD-ALAEVSLQK--EDALSEEKRDLDPLANHVSCNKDSSAINS 557 Query: 199 VSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTG 378 S ++LS K ++ S + +D+ D LE +I + H + A S V Sbjct: 558 GSDADLSSFGKLQSRISMLLESVSKEVDV-DKILE--EIKQVVHDAETAASCGSKEV--- 611 Query: 379 NTGSELVKHEDDAKINVNSCID--TVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-D 549 H DA + +C + + + + S I+DFV++LGKEA A+ +S D Sbjct: 612 --------HHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCD 663 Query: 550 EDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 GL +K++ FS + + + S+ +L DF+ D+S VL+ A+ L FN Sbjct: 664 SIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFN 708 >ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344133|gb|ERP63976.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 991 Score = 132 bits (333), Expect = 8e-29 Identities = 86/237 (36%), Positives = 135/237 (56%), Gaps = 15/237 (6%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141 A RNSELQASR++CA+TASKLQ+LEAQ + N QV G+SSQ +S+ Sbjct: 303 AKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSL 362 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD SCA S AT S+S++S+ +K+ + + +K+EN HL+LMDDFLEMEK+A Sbjct: 363 TSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLA 422 Query: 319 CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498 CL A S++ +S + N SE + A++++ D + + L+ + + Sbjct: 423 CL-----NADSATTISSSPNNKASETANTDALAEVSLQK-EDALSEEKRDLDPLANHV-- 474 Query: 499 FVMILGKEAKALPGSSDED-GLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 K++ A+ SD D KL + + E+++ E+++ + +I V+ A Sbjct: 475 ---SCNKDSSAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDA 528 Score = 58.5 bits (140), Expect = 2e-06 Identities = 64/225 (28%), Positives = 108/225 (48%), Gaps = 11/225 (4%) Frame = +1 Query: 43 AQTASKLQNLEAQLRANAQVSTGGFSSQKVSDQLMSEDGND-----DNVSC---AGSLAT 198 A T S N +A AN + S QK + +SE+ D ++VSC + ++ + Sbjct: 429 
ATTISSSPNNKASETANTD-ALAEVSLQK--EDALSEEKRDLDPLANHVSCNKDSSAINS 485 Query: 199 VSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTG 378 S ++LS K ++ S + +D+ D LE +I + H + A S V Sbjct: 486 GSDADLSSFGKLQSRISMLLESVSKEVDV-DKILE--EIKQVVHDAETAASCGSKEV--- 539 Query: 379 NTGSELVKHEDDAKINVNSCID--TVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-D 549 H DA + +C + + + + S I+DFV++LGKEA A+ +S D Sbjct: 540 --------HHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCD 591 Query: 550 EDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 GL +K++ FS + + + S+ +L DF+ D+S VL+ A+ L FN Sbjct: 592 SIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFN 636 >gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] Length = 951 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 385 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 445 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 505 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 559 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682 Query: 502 
VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742 Query: 679 FN 684 N Sbjct: 743 VN 744 >gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 1107 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 389 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 448 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 449 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 508 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 509 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 562 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 563 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 616 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 633 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 686 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 687 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 746 Query: 679 FN 684 N Sbjct: 747 VN 748 >gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 837 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 
ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 230 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 289 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 290 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 349 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 350 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 403 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 404 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 457 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 474 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 527 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 528 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 587 Query: 679 FN 684 N Sbjct: 588 VN 589 >gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 992 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 385 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 445 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504 Query: 319 
CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 505 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 559 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742 Query: 679 FN 684 N Sbjct: 743 VN 744 >gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 947 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 230 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 289 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 290 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 349 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 350 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 403 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 404 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 457 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), 
Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 474 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 527 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 528 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 587 Query: 679 FN 684 N Sbjct: 588 VN 589 >gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1106 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 389 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 448 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 449 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 508 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 509 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 562 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 563 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 616 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 633 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 686 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 687 
VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 746 Query: 679 FN 684 N Sbjct: 747 VN 748 >gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 992 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 385 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 445 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 505 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 559 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742 Query: 679 FN 684 N Sbjct: 743 VN 744 >gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1102 Score = 131 bits (329), Expect = 2e-28 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141 A RNSEL ASR++CA+T+SKLQ LEAQL ++Q + +SSQ VS+ Sbjct: 385 
AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444 Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318 +SEDGNDD+ SCA S AT MSELS +KEKN + P+K+EN HLDLMDDFLEMEK+A Sbjct: 445 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504 Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492 C + S NG ++ SD +T N SE V + +I SC + L +++ + Sbjct: 505 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558 Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666 + + SD D L + KL T + ++++ + ++ + DI + A Sbjct: 559 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612 Score = 63.9 bits (154), Expect = 4e-08 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%) Frame = +1 Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501 HGS+G + + G + E + I+ + + VQT Q L AIS I+DF Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682 Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678 V+ LGKEA+A+ SD + L K++ FS Y + + S ++L DF+ D+S +L+KA+ L Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742 Query: 679 FN 684 N Sbjct: 743 VN 744 >ref|XP_002893071.1| hypothetical protein ARALYDRAFT_335233 [Arabidopsis lyrata subsp. lyrata] gi|297338913|gb|EFH69330.1| hypothetical protein ARALYDRAFT_335233 [Arabidopsis lyrata subsp. 
lyrata] Length = 986 Score = 125 bits (315), Expect = 9e-27 Identities = 100/288 (34%), Positives = 138/288 (47%), Gaps = 63/288 (21%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRANAQVSTGG------FSSQKVSDQ----LMS 150 A RNSELQ SR+ICA+TA++LQ LEAQ+ + G FS Q S+ MS Sbjct: 376 AKRNSELQVSRNICAKTANRLQTLEAQMVNKSPTKRGFEMPAEIFSRQNASNPPSMASMS 435 Query: 151 EDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPH 330 EDGN+D S AGSL MSELS K+KN K+E+ N L+LMDDFLEMEK+ACLP+ Sbjct: 436 EDGNEDARSVAGSL----MSELSQSNKDKNNAKIKKTESANQLELMDDFLEMEKLACLPN 491 Query: 331 GSNG-----------------------------------------------AVSSSDVSV 369 GSN AV + V + Sbjct: 492 GSNANGTTDHSSADSDGEILPATQLKKRISTVLQSLPKDAAFEKILAEIQCAVKDAGVKL 551 Query: 370 NTGNTGSELVKHEDDAKINVNS-----CIDTVQTNDQALEMAISGIYDFVMILGKEAKAL 534 + G+ L ++ +I +++ + V+ Q L A+S IY FV L KEA A Sbjct: 552 PSKCHGANLNGVTEEKEIAMSNETTEEKVTIVEVITQELSDALSQIYQFVSYLAKEATAC 611 Query: 535 PGSSDEDGLI-KKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLL 675 + E+ +K++ FS + + E L DF+ D+S VL +A+ L Sbjct: 612 QDTFSENRTFSQKVEEFSVTFERVLAKEKTLVDFLFDLSRVLVEASEL 659 >ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] Length = 1041 Score = 125 bits (315), Expect = 9e-27 Identities = 101/300 (33%), Positives = 148/300 (49%), Gaps = 72/300 (24%) Frame = +1 Query: 1 ANRNSELQASRSICAQTASKLQNLEAQLRAN--------AQVSTGGFSSQKVSDQ----L 144 A RNSELQASR++CA+TAS+LQ+LEAQ+ QV G+SSQ +S+ Sbjct: 383 AKRNSELQASRNLCAKTASRLQSLEAQVSNQQKSSPTSVVQVPIEGYSSQNMSNPPSLTS 442 Query: 145 MSEDGNDDNVSCAGSLATVSMSELSYIQKEKN---------------------------- 240 MSEDGNDD+ SCA S AT +SELS ++KEK+ Sbjct: 443 MSEDGNDDDRSCADSWATSLISELSQLKKEKSTEKLNKTKNTQHLELMDDFLEMEKLACL 502 Query: 241 ------------------ADSPHKSENTNHLDLMDDFLE--------MEKIACLPHGSNG 342 AD P + + + ++ + + +E + + ++G Sbjct: 503 NANVNLVSSMSAANSGSEADQPCLVKLRSRISMLLESISQDADMGKILEDVQRIVQDTHG 562 Query: 343 
AVSSSDVSVN-TGNTGSELVKHEDDAKINV----NSCIDTVQTNDQALEMAISGIYDFVM 507 AVSS V T T E D +I + N+ DTV++ +Q L A+S I+DFV+ Sbjct: 563 AVSSVSEDVRATDATCPEYASITGDKEITLFQDTNAATDTVRSVNQELATAVSSIHDFVL 622 Query: 508 ILGKEAKAL-PGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684 LGKEA A+ SSD L +K++ FS + + +N +L DF+ +S VL+KA+ L FN Sbjct: 623 FLGKEAMAVHDTSSDGSDLSQKIEHFSVTFNKVLNGNTSLIDFIFYLSCVLAKASELRFN 682
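The hit summary near the top of this report ("Sequences producing significant alignments") has one entry per subject sequence: a database tag and accession, a truncated description, the bit score, and the E value. The following is a minimal sketch of how one such entry could be read programmatically. It assumes each entry sits on its own line. The names `HIT_LINE` and `parse_hit_line` are hypothetical helpers introduced only for illustration; they are not part of BLAST or of any library.

```python
import re

# Hypothetical pattern (an assumption, not a BLAST-defined grammar) for one
# entry of the hit summary, e.g.
#   emb|CBI19835.3| unnamed protein product [Vitis vinifera] 163 5e-38
HIT_LINE = re.compile(
    r"^(?P<db>[a-z]+)\|(?P<acc>[^|\s]+)\|\s+"              # database tag and accession
    r"(?P<desc>.+?)\s+"                                    # description (may end in "...")
    r"(?P<bits>\d+(?:\.\d+)?)\s+"                          # bit score
    r"(?P<evalue>\d*(?:\.\d+)?e-?\d+|\d+(?:\.\d+)?)\s*$"   # E value
)


def parse_hit_line(line):
    """Return a dict for one summary entry, or None if the line does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    evalue = m.group("evalue")
    # Older BLAST versions may print very small E values as "e-100";
    # prefix a 1 so that float() accepts the string.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "db": m.group("db"),
        "accession": m.group("acc"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": float(evalue),
    }


if __name__ == "__main__":
    top = parse_hit_line(
        "emb|CBI19835.3| unnamed protein product [Vitis vinifera] 163 5e-38"
    )
    print(top["accession"], top["bits"], top["evalue"])
```

The description is matched lazily and the two numeric columns are anchored at the end of the line, so digits inside a description (for example "isoform 8, partial") are not mistaken for the score. For production use, an established parser for BLAST reports is likely a better choice than a hand-written pattern.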