BLASTX nr result
ID: Cocculus23_contig00033965
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00033965
         (1075 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulga...   220   9e-55
emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga...   218   3e-54
ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...   215   2e-53
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...   214   4e-53
emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulga...   211   4e-52
emb|CAN78583.1| hypothetical protein VITISV_029931 [Vitis vinifera]   210   7e-52
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   209   1e-51
ref|XP_007214027.1| hypothetical protein PRUPE_ppa016677mg [Prun...   208   3e-51
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   208   4e-51
ref|XP_007202950.1| hypothetical protein PRUPE_ppa016504mg, part...   207   6e-51
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   206   1e-50
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   206   1e-50
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   206   1e-50
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   1e-50
emb|CCA66198.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   206   2e-50
gb|AAD12028.1| putative non-LTR retroelement reverse transcripta...
205 3e-50 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 204 4e-50 ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom... 204 7e-50 >emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1379 Score = 220 bits (560), Expect = 9e-55 Identities = 123/354 (34%), Positives = 184/354 (51%) Frame = -1 Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884 FTW R S LDR+L N +W + FPS + G+SDH + + T+ N+GP Sbjct: 183 FTWFR-----GRSKSVLDRLLLNPEWINEFPSMRLSLLQRGLSDHCPLLTNIHTQ-NWGP 236 Query: 883 KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704 KPF+F N WL P L +V + W E T PM KL+ VK L +WN++ FG + T Sbjct: 237 KPFRFQNCWLTDPHCLEIVNKTWLES-TNMPM---IDKLRRVKIRLKAWNRDEFGHIDTN 292 Query: 703 TRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDS 524 ++ E ++Q + + + +E +A+ L + ++R+E ++ Q SR+ WLK GD Sbjct: 293 IKIMEDEIQKFDTISNERELDEQEIERRKEAQSDLWMWMKRKELYWAQNSRILWLKHGDR 352 Query: 523 NTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFP 344 NTKFFH + N I ++V IE + +K FK I + Sbjct: 353 NTKFFHMVASNKKRRNFIASIKVNGRRIEKPNQIKEEAVTFFKEIFTEEFTERPTLEGLQ 412 Query: 343 VPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIK 164 + + D L+ + EID V DKAPGPDGFN F K W I DV + ++ Sbjct: 413 FNQLSQNQADSLIQPFSDEEIDYAVNSCASDKAPGPDGFNFKFIKNAWETIKEDVYTLVR 472 Query: 163 DFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 +F+A ++ +G N+TF+ LIPK N N DFRPIS+ +YKII+K++ R++ Sbjct: 473 EFWATSKLPKGSNSTFITLIPKIDNPENFKDFRPISMVGCVYKIIAKLMAKRIQ 526 >emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 218 bits (556), Expect = 3e-54 Identities = 125/358 (34%), Positives = 184/358 (51%), Gaps = 3/358 (0%) Frame = -1 Query: 1066 KFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFG 887 KFTW Q S+LDR+ + QW D+FP+ + + +SDH I V+ K + N+G Sbjct: 182 KFTWFRGQSK-----SKLDRMFIHPQWLDLFPTLQISLLKRTLSDHCPILVQTKLK-NWG 235 Query: 886 PKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCA---KLKNVKEALVSWNKNCFGE 716 P+PF+FI+ WL HP L L+ + W E C+ KLK VK +L+ WN FG Sbjct: 236 PRPFRFIDAWLSHPGCLKLISKTWLEA-------HDCSFSEKLKKVKSSLLKWNAEEFGC 288 Query: 715 VQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLK 536 + + E ++Q+ + A D AN LEE +++ L + ++R+E + Q+SRV+W+K Sbjct: 289 IDEKIQSLENKIQEMDRIADDRNLEANELEERRKSQMDLWIWMKRKEVLWAQQSRVKWIK 348 Query: 535 GGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIP 356 GD NT++FH R N I L + I+ LK +F + + Sbjct: 349 EGDRNTRYFHIMATMRRKKNAIESLIIEQKQIDSPEDLKAAAVSYFSELFTEELSPRPVF 408 Query: 355 TDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVS 176 D + ++ L T+ EID+ V K+PGPDGFN F K+ W +I DV Sbjct: 409 GDLNFKQLNDSHREILTSQFTRSEIDEAVSSCDGSKSPGPDGFNFKFVKQAWEVIKEDVY 468 Query: 175 SAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 + +F+ R+ RG N + LIPK N DFRPIS+ +YKIISKIL RL+ Sbjct: 469 GIVNEFWHSSRLPRGCNTALIALIPKISNPEGFKDFRPISMVGCVYKIISKILARRLQ 526 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 215 bits (548), Expect = 2e-53 Identities = 126/363 (34%), Positives = 194/363 (53%), Gaps = 7/363 (1%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMF--PSCEALVEDSGISDHHHITVKLKTE 899 G +TW+N SR+WS+LDR L N+ W + F +CE + E ISDH + V + Sbjct: 552 GPLYTWTN-----SRVWSKLDRALCNQAWFNSFGNSACEVM-EFISISDHTPLVVTTELV 605 Query: 898 VNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFG 719 V G PFKF N+ +DHP+FL +V + W++ + G M +VC KLK +K L + K F Sbjct: 606 VPRGNSPFKFNNLIVDHPNFLRIVADGWKQNIHGCSMFKVCKKLKALKAPLKNLFKQEFS 665 Query: 718 
EVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWL 539 + LAEA+ + +P + ++L ++ R ++ + E + Q + ++L Sbjct: 666 NISNRVELAEAEYNSVLNSIKQNPQDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYL 725 Query: 538 KGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSS-----ALKNHVTQHFKGILGSAN 374 D +KFFHA +K +S I +R+ D S A NH F + Sbjct: 726 LQADKCSKFFHALIKRNKHSRFIAAIRLEDGHNTSSQDEIALAFVNHFRNFFSAHELTQT 785 Query: 373 DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194 I I P +P D L+ ++ ++ ++ + +KAPGPDGFN FFK+ W+I Sbjct: 786 PSISICNRGP--KVPTDCFAALLCPTSKQKVWNIISVMANNKAPGPDGFNVLFFKKAWNI 843 Query: 193 IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14 +G+D+ +A+ +FF +I + LN + LIPK AS +N FRPIS CN+LYKI+SKIL Sbjct: 844 VGDDIFAAVNEFFTTGKILKQLNHAIIVLIPKHDQASQVNHFRPISCCNLLYKIVSKILA 903 Query: 13 NRL 5 NR+ Sbjct: 904 NRI 906 >emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1380 Score = 214 bits (546), Expect = 4e-53 Identities = 119/358 (33%), Positives = 189/358 (52%), Gaps = 2/358 (0%) Frame = -1 Query: 1069 DKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNF 890 ++FTW S+LDR N +W +P+ + + + G+SDH + + N+ Sbjct: 181 ERFTWFRGNSK-----SKLDRCFVNPEWLTHYPTLKLSLLNRGLSDHCPLLLNSSVR-NW 234 Query: 889 GPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQ 710 GPKPFKF N WL P + LVK+ W++ PM V KLK VK+ L WN+ FG ++ Sbjct: 235 GPKPFKFQNCWLSDPRCMRLVKDTWQKSS---PMGLV-QKLKTVKKDLKDWNEKVFGNIE 290 Query: 709 TTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGG 530 + E ++ K + + ++ LE++ +A+ L ++ +E ++ Q+SR++WLK G Sbjct: 291 ANIKQLEHEINQLDKISNERDLDSFELEKKKKAQVDLWSWMKTKESYWSQQSRIKWLKQG 350 Query: 529 DSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILG--SANDQIIIP 356 D NTKFFH R + N IT + V D I + +K ++F+ S N ++ Sbjct: 351 DRNTKFFHVVASIRKHRNSITSIEVNGDKISEPEKIKLEAMKYFRKAFKEESYNRPLLEG 410 Query: 355 TDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVS 176 DF T L+ + EID+ V DKAPGPDGFN F K+ W +I ++ Sbjct: 411 LDFKHLTEAQSAD--LIAPFSHEEIDKAVASCSSDKAPGPDGFNFTFIKKAWDVIKEEIY 468 
Query: 175 SAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 +++F+ R+ +G N F+ LIPK + DFRPIS+ +YKI++K+L RL+ Sbjct: 469 ETVQEFWNSSRLPKGCNMAFIALIPKTDSPKGFQDFRPISMVGCVYKIVAKLLTMRLQ 526 >emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1381 Score = 211 bits (537), Expect = 4e-52 Identities = 118/340 (34%), Positives = 176/340 (51%) Frame = -1 Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842 S LDR+ N +W P+ + G+SDH + V K E+++GPKPF+F N WL P+ Sbjct: 192 SILDRLFVNPEWITNLPNLRVSLLQRGLSDHCPLLVHNK-ELDWGPKPFRFQNCWLSDPE 250 Query: 841 FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662 L +VK W++ + + KLK VK+ L SWN FG + + + E+++Q Sbjct: 251 CLKIVKAVWQDAEALHTI----GKLKEVKKRLKSWNLTEFGNIDSKIKKFESEIQHLDSI 306 Query: 661 AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482 + LE +A+ L ++R E ++ Q SRV WLK GD NT FFHA + Sbjct: 307 NNTRDLDTQELENRKEAQVELWKWIKRREMYWAQNSRVTWLKEGDRNTMFFHAIASNKRR 366 Query: 481 SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFPVPTIPMDLKDGLMV 302 N IT + V I++ S +K T +FK I + + D + + + L + Sbjct: 367 KNSITTVEVDGLKIDEPSRIKWEATTYFKKIFKEEHGCRPLFEDLNFKCVTHEQAEQLTL 426 Query: 301 DITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNA 122 + EID+ V DKAPGPDGFN F K W II +D+ + F+ R+ +G N Sbjct: 427 PFSCEEIDEAVSTCSSDKAPGPDGFNFKFIKSAWGIIKHDIYEMVHKFWESSRLPQGSNV 486 Query: 121 TFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 ++ LIPK N N D+RPIS+ LYKII+K++ RL+ Sbjct: 487 AYIALIPKMSNPKNFKDYRPISMVGCLYKIIAKVMAKRLQ 526 >emb|CAN78583.1| hypothetical protein VITISV_029931 [Vitis vinifera] Length = 1875 Score = 210 bits (535), Expect = 7e-52 Identities = 120/361 (33%), Positives = 191/361 (52%), Gaps = 4/361 (1%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVE---DSGISDHHHITVKLKT 902 G F+WS ++ ++ W+RLDR L ++ W D C +V+ ISDH I +K Sbjct: 915 GGVFSWSGGRN--NQAWARLDRFLVSQCWLD---KCCGVVQCRLPRPISDHFPIMLK-GG 968 Query: 901 EVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVT-GYPMQRVCAKLKNVKEALVSWNKNC 725 + GP PF+F 
N+WL F +L++E W+ V G R+ +KLK +K+ + WN+ Sbjct: 969 GLRRGPSPFRFENMWLKVDGFKDLLREWWQGTVVRGKASFRLASKLKVLKQKIKEWNREV 1028 Query: 724 FGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQ 545 FG ++ LA Q++ + + + + E + +A+ K + EE +RQ SR Sbjct: 1029 FGRLEVNKSLALQQVEFWDRVESERSLSVSETEMKKEAKEXFKKWVLLEETHWRQMSREL 1088 Query: 544 WLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQI 365 WLK GD N+ FFH A +N + R+++ W + ++ + Q+F+ +L Sbjct: 1089 WLKEGDKNSGFFHRMANAHRRTNSMDRIKINGVWRTEEQEVREGIVQNFQQLLTEEPSWR 1148 Query: 364 IIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGN 185 +P + +GL V T EI + D+ DKAPGPDGF G F++ W + Sbjct: 1149 ADIEGLHLPRLNTCEAEGLEVPFTMEEIHSALMDMNGDKAPGPDGFTGAFWQTCWEFVKE 1208 Query: 184 DVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5 ++ K+FF ++ + LN TFL LIPKK A ++ +FRPISL LYK+++K+L NRL Sbjct: 1209 EIMDLFKEFFVQKSFAKSLNTTFLVLIPKKGGAEDLGEFRPISLLGGLYKLVAKVLANRL 1268 Query: 4 K 2 K Sbjct: 1269 K 1269 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 209 bits (533), Expect = 1e-51 Identities = 125/363 (34%), Positives = 190/363 (52%), Gaps = 7/363 (1%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G+ FTW+N + ++ RLDRV+ N +W F S + SDH + + T Sbjct: 763 GNSFTWTN-----NHMFQRLDRVVYNPEWAHCFSSTRVQHLNRDGSDHCPLLISCATASQ 817 Query: 892 FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713 GP F+F++ W H DFL V+ W+ + + K + +K L WNK FG++ Sbjct: 818 KGPSTFRFLHAWTKHHDFLPFVERSWQVPLNSSGLTAFWIKQQRLKRDLKWWNKQIFGDI 877 Query: 712 QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533 + AE + + K Q DPS+ N ++A L L EE F++QKS V+WL Sbjct: 878 FEKLKRAEIEAEKREKEFQQDPSSIN-RNLMNKAYAKLNRQLSIEELFWQQKSGVKWLVE 936 Query: 532 GDSNTKFFHAKMKARWNSNQITRLRVGDDWI-EDSSALKNHVTQHFKGILGSAN------ 374 G+ NTKFFH +M+ + N I R++ + I ED ++N Q+F+ +L + Sbjct: 937 
GERNTKFFHLRMRKKRVRNNIFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRF 996 Query: 373 DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194 D +IP TI + + L + EI +VV ++ D GPDGF+ F++ W I Sbjct: 997 DPSLIPR-----TISITDNEFLCAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDI 1051 Query: 193 IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14 I D+ A+ DFF + +G+ +T L L+PKKPN+ +DFRPISLC +L KI++K L Sbjct: 1052 IKQDLLEAVLDFFNGTPMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLA 1111 Query: 13 NRL 5 NRL Sbjct: 1112 NRL 1114 >ref|XP_007214027.1| hypothetical protein PRUPE_ppa016677mg [Prunus persica] gi|462409892|gb|EMJ15226.1| hypothetical protein PRUPE_ppa016677mg [Prunus persica] Length = 1421 Score = 208 bits (530), Expect = 3e-51 Identities = 125/360 (34%), Positives = 191/360 (53%), Gaps = 6/360 (1%) Frame = -1 Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884 FTWSN ++ + + RLDR L + W++ FP SDH I + + V +GP Sbjct: 450 FTWSNLRE--NAVCRRLDRFLVSGSWEEHFPHYRHKALPRITSDHCPIELD-SSRVKWGP 506 Query: 883 KPFKFINVWLDHPDFLNLVKEKW-EEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQT 707 PF+F N+WL+HPDF +K W E+Q+ G+ + +LK +K L W+K FG+V+ Sbjct: 507 SPFRFENMWLNHPDFKRKIKLWWGEDQIPGWEGYKFMTRLKMLKSKLKVWSKEEFGDVER 566 Query: 706 TTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527 R AEA+L +R + + + E + + +REE +RQ+ +V+W + GD Sbjct: 567 DLREAEARLLVLDQREGTEGLDHLLRSERDNLLLKIGDLAQREEVKWRQRGKVKWAREGD 626 Query: 526 SNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350 NTKFFH N I +L V D IE + ++ V + FKG+ S N + + Sbjct: 627 GNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDANIEREVIRFFKGLY-SRNKNVGWGVE 685 Query: 349 ----FPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182 P+ + D L E+ + V D DK+PGPDGF+ FF+ W ++ D Sbjct: 686 GLNWCPISQVEADW---LERPFDLEEVQKAVFDCGKDKSPGPDGFSMSFFQSCWEVVKGD 742 Query: 181 VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 + ++DFF ++ N TF+CLIPKK N+ + D+RPISL LYK+ISK+L +RL+ Sbjct: 743 LMKVMQDFFQSGIVNGVTNETFICLIPKKANSVKVTDYRPISLVTSLYKVISKVLASRLR 802 
>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 208 bits (529), Expect = 4e-51 Identities = 132/377 (35%), Positives = 186/377 (49%), Gaps = 20/377 (5%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G FTW N++D IW +LDRV+ NE WK ++P + E G SDH + L Sbjct: 599 GPLFTWCNKRDNDP-IWKKLDRVMVNEAWKMVYPQSYNVFEAGGCSDHLRCRINLNMNSG 657 Query: 892 F---GPKPFKFINVWLDHPDFLNLVKEKWEE----QVTGYPMQRVCAKLKNVKEALVSWN 734 G KPFKF+N D +F LV+ W E ++ + R KLK +K L Sbjct: 658 AQVRGNKPFKFVNAVADMEEFKPLVENFWRETEPIHMSTSSLFRFTKKLKALKPKLRGLA 717 Query: 733 KNCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKS 554 K G + TR A L + +PS +E ES+A + EEK+ +Q S Sbjct: 718 KEKMGNLVKRTREAYLSLCQAQQSNSQNPSQ-RAMEIESEAYVRWDRIASIEEKYLKQVS 776 Query: 553 RVQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSA 377 ++ WLK GD N K FH AR N I ++ D +KN + F+ L Sbjct: 777 KLHWLKVGDKNNKTFHRAATARAAQNSIREIQKEDGSTATTKDDIKNETERFFQEFLQ-- 834 Query: 376 NDQIIIPTDFPVPTI------------PMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPD 233 +IP D+ T+ P + KD L ++ EI + + DK+PGPD Sbjct: 835 ----LIPNDYEGITVEKLTSLLPYHCSPAE-KDMLTASVSAKEIRGALFSMPNDKSPGPD 889 Query: 232 GFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISL 53 G+ +F+KR W IIG + A+K FF + + +G+N T L LIPKK A + D+RPIS Sbjct: 890 GYTSEFYKRAWDIIGAEFVLAVKSFFEKGFLPKGVNTTILALIPKKLEAKEMKDYRPISC 949 Query: 52 CNILYKIISKILVNRLK 2 CN++YK+ISKI+ NRLK Sbjct: 950 CNVIYKVISKIIANRLK 966 >ref|XP_007202950.1| hypothetical protein PRUPE_ppa016504mg, partial [Prunus persica] gi|462398481|gb|EMJ04149.1| hypothetical protein PRUPE_ppa016504mg, partial [Prunus persica] Length = 1162 Score = 207 bits (527), Expect = 6e-51 Identities = 123/356 (34%), Positives = 187/356 (52%), Gaps = 2/356 (0%) Frame = -1 Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884 FTWSN ++ + + RLDR L + W+D FP SDH I + + V +GP Sbjct: 168 FTWSNLRE--NAVCRRLDRFLVSGSWEDHFPHYRHKALPRITSDHCPIELDT-SRVKWGP 224 Query: 
883 KPFKFINVWLDHPDFLNLVKEKW-EEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQT 707 PF+F N+WL+HPDF+ +K W E+Q+ G+ + +LK +K L W+K FG+V+ Sbjct: 225 SPFRFENMWLNHPDFMRKIKLWWGEDQIPGWEGYKFMTRLKMLKSKLKVWSKEEFGDVER 284 Query: 706 TTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527 R AEA+L +R + + + E + + ++EE +RQ+ +V+W + GD Sbjct: 285 DLREAEARLLVLDQREGTEGLDHLLRSERDNLLLKIGDLAQKEEVKWRQRGKVKWAREGD 344 Query: 526 SNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350 NTKFFH N I +L V D IE + ++ V + FKG+ S + Sbjct: 345 GNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDANIEREVIRFFKGLYSSNKNVGWGVEG 404 Query: 349 FPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSA 170 I D L E+ + V + DK+PGPDGF+ FF+ W ++ D+ Sbjct: 405 LNWCPISQVEADWLERPFDLEEVQKAVFECGKDKSPGPDGFSMSFFQSCWEVVKGDLMKV 464 Query: 169 IKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 ++DFF ++ N TF+CLIPKK N+ + D RPISL LYK+ISK+L +RL+ Sbjct: 465 MQDFFQSGIVNGVTNETFICLIPKKANSVKVTDNRPISLVTSLYKVISKVLASRLR 520 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 206 bits (525), Expect = 1e-50 Identities = 118/358 (32%), Positives = 197/358 (55%), Gaps = 2/358 (0%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G+ FTW+N +R++ RLDRV+ N QW +MFP + SDH + + Sbjct: 1060 GNPFTWTN-----NRMFQRLDRVVYNHQWINMFPITRIQHLNRDGSDHCPLLISCFISSE 1114 Query: 892 FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713 P F+F + W+ H DF V+ W + G +Q K +K+ L WNK FG++ Sbjct: 1115 KSPSSFRFQHAWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDI 1174 Query: 712 QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533 + + AE ++++C Q + + + + +++ L L EE F++QKS V+W+ Sbjct: 1175 FSKLKEAEKRVEECEILHQQEQTVGSRINL-NKSYAQLNKQLNVEEIFWKQKSGVKWVVE 1233 Query: 532 GDSNTKFFHAKMKARWNSNQITRLRVGDD-WIEDSSALKNHVTQHFKGILGSANDQIIIP 356 G+ NTKFFH +M+ + + I +++ D WIED LK ++F +L + I Sbjct: 1234 
GERNTKFFHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLLKAEPCDISRF 1293 Query: 355 TDFPVPTIPMDLKDGLM-VDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDV 179 + +P+I + ++ L+ + E+ V D+ + A GPDGF+ F+++ W+ I +D+ Sbjct: 1294 QNSLIPSIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDL 1353 Query: 178 SSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5 A++DFF I RG+ +T L L+PKK +AS ++FRPISLC ++ KII+K+L NRL Sbjct: 1354 LDAVRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRL 1411 >gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H; Endonuclease/exonuclease/phosphatase [Medicago truncatula] Length = 1246 Score = 206 bits (525), Expect = 1e-50 Identities = 122/366 (33%), Positives = 190/366 (51%), Gaps = 9/366 (2%) Frame = -1 Query: 1075 VGDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMF--PSCEALVEDSGI---SDHHHITVK 911 +G +TWSN + + RLDR + NE+W + + SC AL + + SDHH + + Sbjct: 176 LGAFYTWSNGRLGSDNVALRLDRAICNEEWVNFWRSSSCSALGNSALVRHQSDHHPLLMS 235 Query: 910 LKTEVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNK 731 + + FKF W +H D +V E W + G+ M R+ AKLK++K+ WN+ Sbjct: 236 MDFCTSQRSGNFKFFKTWTEHEDCRRIVAENWSKHTRGHGMTRLQAKLKHMKQVFRHWNR 295 Query: 730 NCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSR 551 FG+V R+A ++ + + + +E +A L L +++ +R+K R Sbjct: 296 TVFGDVDRKVRMAVEEVNRIQQIIDSVGFSDQLYAQELEAHLILTKALHYQDELWREKLR 355 Query: 550 VQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSAND 371 Q GD NT +FH K R N I+ L+ GD I D + ++ HV +F+ I + D Sbjct: 356 DQRFIHGDRNTAYFHRISKVRATKNTISFLQDGDAVITDPARIEVHVLNYFQAIF--SVD 413 Query: 370 QIIIPTDFPVPTIPMDL----KDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRT 203 I D V TIP + + L+ GE+ V L D APGP+GF G F++ Sbjct: 414 NSCIQNDLVVDTIPSLVSNVDNNSLLRLPLWGEVKNAVFTLNGDGAPGPNGFGGHFYQTY 473 Query: 202 WHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISK 23 W I+G DV +++DFF ++ + +N+ + LIPK P A + D+RPI+L N +KIISK Sbjct: 474 WDIVGADVIQSVQDFFISGQLAQNINSNLIVLIPKVPGARVMGDYRPIALANFQFKIISK 533 Query: 22 ILVNRL 5 IL +RL Sbjct: 534 ILADRL 539 
>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 206 bits (524), Expect = 1e-50 Identities = 118/363 (32%), Positives = 197/363 (54%), Gaps = 7/363 (1%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G+ FTW+N +R++ RLDR++ N W + FP + SDH + + Sbjct: 1230 GNPFTWTN-----NRMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNSSE 1284 Query: 892 FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713 P F+F + W+ H DF V+ W + G +Q +K +K+ L WNK FG++ Sbjct: 1285 KAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFGDI 1344 Query: 712 QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533 + + AE ++++C Q++ + ++++ +++ L L EE F++QKS V+W+ Sbjct: 1345 FSKLKEAEKRVEECEILHQNEQTVESIIKL-NKSYAQLNKQLNIEEIFWKQKSGVKWVVE 1403 Query: 532 GDSNTKFFHAKMKARWNSNQITRLRVGDD-WIEDSSALKNHVTQHFKGIL------GSAN 374 G+ NTKFFH +M+ + + I +++ D WIED LK ++F +L S Sbjct: 1404 GERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYFSSLLKFEPCDDSRF 1463 Query: 373 DQIIIPTDFPVPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHI 194 + +IP+ I + L + E+ V + + A GPDGF+ F+++ W+I Sbjct: 1464 QRSLIPS-----IISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNI 1518 Query: 193 IGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILV 14 I +D+ A++DFF I RG+ +T L L+PKKP+AS +DFRPISLC ++ KII+K+L Sbjct: 1519 IAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLS 1578 Query: 13 NRL 5 NRL Sbjct: 1579 NRL 1581 >emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1383 Score = 206 bits (524), Expect = 1e-50 Identities = 115/342 (33%), Positives = 172/342 (50%), Gaps = 2/342 (0%) Frame = -1 Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842 S LDR+L + +W P+ + + G+SDH + V + +GPKPF+F N WL P Sbjct: 193 SLLDRLLVSPEWVSHCPNIKVSILQRGLSDHCPLLVHSHIQ-EWGPKPFRFNNCWLTDPK 251 Query: 841 FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662 + +V+ W P V KLK K+ L WN N FG + R E + + K Sbjct: 252 CMKIVEASWSSS----PKISVVEKLKETKKRLKEWNLNEFGSIDANIRKLEDCIANFDKE 307 Query: 661 AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482 A + + LE+ +A+ L ++R+E ++ Q+SR+ WLK GD NTKFFHA + Sbjct: 308 ADERELDKEELEKRREAQADLWKWMKRKEIYWAQRSRITWLKAGDKNTKFFHAIASNKKR 367 Query: 481 SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPT--DFPVPTIPMDLKDGL 308 N + + D S +K FK I D + PT + + + + + L Sbjct: 368 KNMMACIETDGQSTNDPSQIKKEARAFFKKIF--KEDHVKRPTLENLHLKRLSQNQANSL 425 Query: 307 MVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGL 128 + T EID V DKAPGPDGFN F K W II D+ + DF+ + +G Sbjct: 426 ITPFTTEEIDTAVSSCASDKAPGPDGFNFKFVKSAWDIIKTDIYGIVNDFWETGCLPQGC 485 Query: 127 NATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 N ++ LIPK N S++ D+RPIS+ +YKI++K+L RL+ Sbjct: 486 NTAYIALIPKIDNPSSLKDYRPISMVGFIYKIVAKLLAKRLQ 527 >emb|CCA66198.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 206 bits (523), Expect = 2e-50 Identities = 116/340 (34%), Positives = 178/340 (52%) Frame = -1 Query: 1021 SRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGPKPFKFINVWLDHPD 842 S+LDRVL +W + FP+ + + ISDH + ++ + V++GP+PFKF +VWL H Sbjct: 192 SKLDRVLVQAEWIEKFPALAVSILNRSISDHCPLLLQ-SSIVDWGPRPFKFQDVWLSHKG 250 Query: 841 FLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTTTRLAEAQLQDCSKR 662 + +V++ W + MQ KLK VK L +WN FG + L EA++Q Sbjct: 251 CMEIVEKAWIQSKELTLMQ----KLKKVKLDLKTWNSESFGNIDANILLREAEIQKWDSE 306 Query: 661 AQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDSNTKFFHAKMKARWN 482 A ++ +QA+ L L+++E ++ Q+SR++WLK GD NTKFFH R + Sbjct: 307 ANSRDLEPEEIKTRAQAQLELWEWLKKKEIYWAQQSRIKWLKSGDRNTKFFHICASIRRS 366 Query: 481 SNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFPVPTIPMDLKDGLMV 302 N I+ + + IED +K ++FK + + T+ + + Sbjct: 367 KNNISSILLQGKKIEDPIIIKEEAVKYFKNLFTEDFKERPTFTNLSFKKLSESQAFSISA 426 Query: 301 DITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNA 122 + EID+ V K+PGPDGFN F K +W +I +D S I++F+ + RG N Sbjct: 427 PFSTTEIDEAVASCNPSKSPGPDGFNFKFIKASWDLIKHDFYSIIQEFWHTGILPRGSNV 486 Query: 121 TFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 F+ LI K + S DFRPIS+ +YKIISK+L RLK Sbjct: 487 AFIALIAKIESPSGFKDFRPISMVGCVYKIISKLLAGRLK 526 >emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 206 bits (523), Expect = 2e-50 Identities = 113/354 (31%), Positives = 175/354 (49%) Frame = -1 Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884 FTW + Q S+LDR+L N +W +FPS + + +SDH + VK E+N+GP Sbjct: 183 FTWFSGQAK-----SKLDRLLVNPEWVSLFPSLQVSILRRNLSDHCPLLVK-SDELNWGP 236 Query: 883 KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704 +PF+F N WL HP L ++K+ W +G + KLK K+ L WN + FG + Sbjct: 237 RPFRFQNCWLSHPGCLQIIKDVWASHTSG----NLTDKLKETKKRLKIWNSSEFGHIDRN 292 Query: 703 TRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKGGDS 524 E ++ + + L E ++ L + L R+E F+ Q SR +W+K GD Sbjct: 293 IEELEDRIHNLDLISNGRDLQLEELAERRSSQMELWVWLRRKEAFWAQNSRAKWIKEGDK 352 Query: 523 NTKFFHAKMKARWNSNQITRLRVGDDWIEDSSALKNHVTQHFKGILGSANDQIIIPTDFP 344 NTK+FH R N I L + + D + + + FK I + Sbjct: 353 NTKYFHTLASTRKKKNTIPALITNNGVVSDPAGIHHEAVSFFKSIFKEDFSSRPVFNGLQ 412 Query: 343 VPTIPMDLKDGLMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDVSSAIK 164 ++ + L + E+D+ V KAPGPDG+N F K +W II DV + ++ Sbjct: 413 FRSLSCEQVSQLTEPFSHKEVDEAVESCDPQKAPGPDGYNFRFIKDSWDIIKLDVYNIVE 472 Query: 163 DFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 +F+ + +G N F+ LI K+ +NDFRPIS+ +YKII+K+L RL+ Sbjct: 473 NFWNSGSLPKGSNVAFIALIAKREVPEGLNDFRPISMVGCIYKIIAKLLARRLQ 526 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 206 bits (523), Expect = 2e-50 Identities = 122/360 (33%), Positives = 195/360 (54%), Gaps = 6/360 (1%) Frame = -1 Query: 1063 FTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVNFGP 884 ++W+N+ RI SR+D+ N W + +P ++GISDH + L T+ + G Sbjct: 183 YSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPLIFNLATQHDEGG 242 Query: 883 KPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEVQTT 704 +PFKF+N D F+ +VKE W + M+ + +L+ VK AL S++ F + Sbjct: 243 RPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKAHC- 301 Query: 703 TRLAEAQLQDCSKRAQDDPSNANVL-EEESQARCHLKLMLEREEKFYRQKSRVQWLKGGD 527 ++ E + + + +A + S + L EEE L+ +E +QKSR+QWL GD Sbjct: 302 -QVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLGD 360 Query: 526 SNTKFFHAKMKARWNSNQITRLRVG-DDWIEDSSALKNHVTQHFKGILGSANDQIIIPTD 350 SN+KFF +K R N+I L+ D + +++ ++N + ++ +LG+++ Q+ D Sbjct: 361 SNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEA-ID 419 Query: 349 FPVPTIPMDLKDG----LMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182 V + L L+ IT EIDQ + D+ KAPG DGFN FFK++W +I + Sbjct: 420 LHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQE 479 Query: 181 VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRLK 2 + I DFF +H+ +N T + LIPK A + D+RPI+ C+ LYKIISKIL RL+ Sbjct: 480 IYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQ 539 >gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1447 Score = 205 bits (521), Expect = 3e-50 Identities = 129/377 (34%), Positives = 197/377 (52%), Gaps = 20/377 (5%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKT--- 902 G +TWSN+++ I +LDRV+ N+ W FP ++ E G DH + L Sbjct: 592 GPLYTWSNKREH-DLIAKKLDRVMVNDVWTQSFPQSYSVFEAGGCLDHLRGRINLNDGPG 650 Query: 901 EVNFGPKPFKFINVWLDHPDFLNLVKEKWEEQ----VTGYPMQRVCAKLKNVKEALVSWN 734 + G +PFKF+NV + DF V W+E ++ + R KLK++K L + Sbjct: 651 SIVRGKRPFKFVNVLTEMEDFKPTVDSYWKETEPIFLSTSSLFRFSKKLKSLKPLLRNLA 710 Query: 733 
KNCFGEVQTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKS 554 K G + TR A L + ++P+ N ++EE +A + + EEKF ++KS Sbjct: 711 KERLGNLVKKTREAYDTLCKKQESTLNNPT-PNAMKEEVEAHDRWEHVAGLEEKFLKKKS 769 Query: 553 RVQWLKGGDSNTKFFHAKMKARWNSNQITRLRVGDDWIE-DSSALKNHVTQHFKGILGSA 377 ++ WL GGD N K FH + R N I+ ++ D + +K + + F+ L Sbjct: 770 KLHWLDGGDKNNKAFHRAVVTREAQNSISEIQCQDGSVTAKGDEIKAYAERFFREFLQ-- 827 Query: 376 NDQIIIPTDFPVPTIPMDLKDGLMVD------------ITQGEIDQVVRDLKIDKAPGPD 233 +IP ++ T+ DL+D L +T EI +V+ + DK+PGPD Sbjct: 828 ----LIPNEYEGVTMA-DLQDLLPFRCSETEHELLTRVVTAEEIKKVLFSMPNDKSPGPD 882 Query: 232 GFNGDFFKRTWHIIGNDVSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISL 53 GF +FFK TW I+GN+ AI+ FFA+ + +G+N T L LIPKK A + D+RPIS Sbjct: 883 GFTSEFFKATWEILGNEFILAIQSFFAKGFLPKGINTTILALIPKKKEAKEMKDYRPISC 942 Query: 52 CNILYKIISKILVNRLK 2 CN++YK+ISKI+ NRLK Sbjct: 943 CNVIYKVISKIIANRLK 959 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 204 bits (520), Expect = 4e-50 Identities = 119/359 (33%), Positives = 189/359 (52%), Gaps = 3/359 (0%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G+ FTW+N +R++ RLDRV+ N++W + F S + SDH + + Sbjct: 1024 GNSFTWTN-----NRMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDHCPLLISCSNTNQ 1078 Query: 892 FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713 GP F+F++ W H DF++ V++ W + + K + +K L WNK+ FG++ Sbjct: 1079 RGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRDLKWWNKHIFGDI 1138 Query: 712 QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533 RLAE + + Q +PS AN E +A L L EE F++QKS V+WL Sbjct: 1139 FKILRLAEVEAEQRELNFQQNPSAAN-RELMHKAYAKLNRQLSIEELFWQQKSGVKWLVE 1197 Query: 532 GDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIP 356 G+ NTKFFH +M+ + N I R++ + + +E+ ++N + F+ +L + I Sbjct: 1198 GERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNLLKAEQCDISRF 1257 Query: 355 
TDFPVPTIPMDLKDGLMVDITQG--EIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGND 182 P I + D + T E+ + V ++ D GPDGF+ F++ W II D Sbjct: 1258 DPSITPRI-ISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQD 1316 Query: 181 VSSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5 + A+ DFF + RG+ +T L L+PK N S ++FRPISLC +L KI++K+L NRL Sbjct: 1317 LFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVLNKIVTKLLANRL 1375 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 204 bits (518), Expect = 7e-50 Identities = 116/358 (32%), Positives = 195/358 (54%), Gaps = 2/358 (0%) Frame = -1 Query: 1072 GDKFTWSNRQDPPSRIWSRLDRVLANEQWKDMFPSCEALVEDSGISDHHHITVKLKTEVN 893 G+ FTW+N +R++ RLDR++ N+QW + FP + SDH + + Sbjct: 1023 GNPFTWTN-----NRMFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHCPLLLSCSNSSE 1077 Query: 892 FGPKPFKFINVWLDHPDFLNLVKEKWEEQVTGYPMQRVCAKLKNVKEALVSWNKNCFGEV 713 P F+F++ W H +F V+ W + G + +K K +K+ L WNK FG++ Sbjct: 1078 KAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDI 1137 Query: 712 QTTTRLAEAQLQDCSKRAQDDPSNANVLEEESQARCHLKLMLEREEKFYRQKSRVQWLKG 533 + + AE ++++C Q + + + ++ +++ L L EE F++QKS V+W+ Sbjct: 1138 FSNIKEAEKRVEECEILHQQEQTIGSRIQL-NKSYAQLNKQLSMEEIFWKQKSGVKWVVE 1196 Query: 532 GDSNTKFFHAKMKARWNSNQITRLRVGD-DWIEDSSALKNHVTQHFKGILGSANDQIIIP 356 G+ NTKFFH +M+ + + I +++ D +WIED L+ F +L + + Sbjct: 1197 GERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRF 1256 Query: 355 TDFPVPTIPMDLKDG-LMVDITQGEIDQVVRDLKIDKAPGPDGFNGDFFKRTWHIIGNDV 179 P+I D +G L + T E+ + V + + A GPDGF+ F+++ W II +D+ Sbjct: 1257 QSSLCPSIISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDL 1316 Query: 178 SSAIKDFFARERIHRGLNATFLCLIPKKPNASNINDFRPISLCNILYKIISKILVNRL 5 A+K+FF I +G+ +T L LIPK +AS ++FRPISLC ++ KII+KIL NRL Sbjct: 1317 FEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRL 1374