BLASTX nr result
ID: Ephedra27_contig00007360
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00007360
         (1819 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002308172.1| predicted protein [Populus trichocarpa]          158   8e-36
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...  154   1e-34
emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga...  144   2e-31
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...  139   5e-30
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]  138   9e-30
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]  135   6e-29
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]  135   8e-29
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]  134   1e-28
ref|XP_002303192.1| predicted protein [Populus trichocarpa]          133   2e-28
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...  131   8e-28
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...  130   1e-27
emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulga...  130   2e-27
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]  130   2e-27
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...  130   2e-27
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]  129   4e-27
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]  127   2e-26
ref|XP_004299997.1| PREDICTED: putative ribonuclease H protein A... 
125 5e-26 gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] 125 8e-26 gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] 124 1e-25 ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ... 124 1e-25 >ref|XP_002308172.1| predicted protein [Populus trichocarpa] Length = 670 Score = 158 bits (399), Expect = 8e-36 Identities = 126/448 (28%), Positives = 196/448 (43%), Gaps = 22/448 (4%) Frame = +3 Query: 3 KMIKFWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYA----- 167 K IKFW D W G+LAD+FP F R N E ++ + +G+A Sbjct: 229 KRIKFWLDDWTATGSLADQFPALF-----------RLTNDKEASLDKMGIWDGHAWHWLF 277 Query: 168 ---NLSRNRDQSWKEIKQLVLSELPPLLDNKRDSLIWSAHPQGKFTVKS-CYLLESNRNL 335 R R+ + +LS++ L + D LIW A+ G+F++KS C LL + Sbjct: 278 TWSRPLRGRNYGLLDRMTAILSKVQ-LDKDAEDRLIWKANSTGRFSIKSLCGLLSPKPPM 336 Query: 336 PRPIRVSTGYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQ--VDPLCPLCGQQEESL 509 TG W + PK+ F W+ + K + T+ +L+RRGI CP+C +EES+ Sbjct: 337 DTSFSF-TGIWRGIVPPKVEVFCWMAIIKKINTRSMLVRRGILDISAAACPICLAEEESV 395 Query: 510 THLFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCW 689 H+ C +W +W W ++ ++++ W S K+ W + L + W Sbjct: 396 DHILLHCHKHWIVWSKIINWWGLAWCCPKNLAALFSQWDSLVYGKFQKKAWLMLLFSVAW 455 Query: 690 SIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKC---QVSDSETSIIK--EWHIP 854 S+WL RN+ +F + N T+F I+ W K S S + +++ E + Sbjct: 456 SLWLHRNDVIFKQSTPNYDTLFI----LIITRLCFWIKAIEPDFSYSASDLLRSAEGLLR 511 Query: 855 YSPQEDSFVEVRWFPPTGNGIKFNFDGSYRG-GNVVGCGGIFRDSEGRFLYGFSFKA--- 1022 ++ ++ V V W P N K+N DGS G G GG+ R+ G L FS Sbjct: 512 WTNSKNQRVVVVWSPLMLNSFKWNVDGSSLGKSGPSGIGGVLRNHNGIILGIFSLSVGIL 571 Query: 1023 -SGNSALIAEAKSLHFGSRLIRRFMAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWE 1199 S + L A K++ + R I IE DS ++I PW N I Sbjct: 572 DSNVAELKAVVKAIELSAFNCRLHHKHIIIESDSANVISWMNSPHNRPWRHHNLFSSIQR 631 Query: 1200 NLAGFDT-RFQHTFREGNKVADLLANHG 1280 + F + F H+ RE N +AD +A G Sbjct: 632 AASCFGSLTFTHSLRESNHMADHMAKQG 659 >emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1380 Score = 154 bits (389), Expect = 1e-34 Identities = 123/446 (27%), Positives = 184/446 (41%), Gaps = 19/446 (4%) Frame = +3 Query: 15 FWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELL--DLELSVSEGYANLSRNRD 188 FW D+WL L +FPR + + ++ P L + S +A R RD Sbjct: 938 FWHDQWLGPKPLKAQFPRLYLLATNKM-----APVASHCFWDGLAWAWSFSWARHHRARD 992 Query: 189 QSWKEIKQLVLSELPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYW 368 KE K L L ++ L + +DSL+WS H G F+ S + NLP G W Sbjct: 993 LDEKE-KLLELLDMVHLDPSNQDSLVWSYHKSGSFSTSSFTAEMAKANLPPHTDAIKGVW 1051 Query: 369 NIDLIP-KINSFWWLCLKKCLPTKDLLLRRGI--QVDPLCPLCGQQEESLTHLFFFCSFL 539 + L+P ++ F W+ L + T+ L GI Q + +C LC E HL C F Sbjct: 1052 -VGLVPHRVEIFVWMALLGRINTRCKLASIGIIPQSENICVLCNTSPEQHNHLLLHCPFS 1110 Query: 540 NDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCWSIWLERNNRV 719 +W + W+ WV + +++ W SP + K++W + + WSIW ERN+R+ Sbjct: 1111 LSLWNWWLDLWRLKWVLPETLRGLFDQWLSPIKTPFFKKVWAATFFIISWSIWKERNSRI 1170 Query: 720 FGRPHSNPST----IFFRARRFI--MDNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFV 881 F S PS+ I R +I D A + + + ++ IP+ Q Sbjct: 1171 FENTSSPPSSLHDLILLRLGWWISGWDEAFPYSPTDIQRNPQCLVWGGKIPHPLQAPHPS 1230 Query: 882 EVRWFPPTGNGIKFNFDGSYRGGN-VVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKS 1058 W PP +K+N D SY N GG+ R+ G F+ FS AE + Sbjct: 1231 SAIWTPPDHGSLKWNVDASYNPLNHRAAVGGVLRNHLGHFICVFSVPVPPMEINFAEVLA 1290 Query: 1059 LHFGSRL----IRRFMAPITIEGDSLSIIKMAKKEWEHPWYLS---NFLEGIWENLAGFD 1217 +H + I + + IE DS + + + PW L NF+ G Sbjct: 1291 IHRALSISHSDITLQSSLLVIESDSANAVSWCNAKQGGPWNLGFQLNFIRSAGSR--GLK 1348 Query: 1218 TRFQHTFREGNKVADLLANHGYEEMD 1295 H R N+VAD LA G D Sbjct: 1349 IEIIHKGRSSNQVADALAKQGLSRRD 1374 >emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 144 bits (362), Expect = 2e-31 Identities = 123/459 (26%), Positives = 190/459 (41%), Gaps = 32/459 (6%) Frame = +3 Query: 15 FWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSRNRDQS 194 FW D W+ L + PR F + D +S+ + L Sbjct: 938 FWHDVWVGANPLKTECPRLF--------------RLSLQQDAYVSLCGFWDGLCWRWSLL 983 Query: 195 W-KEIKQLVLSELPPLLD---------NKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRP 344 W + ++Q L E LL+ + +D LIW+ G F+VKS L +N R Sbjct: 984 WSRPLRQRDLHEQATLLNIINRAVLQKDGKDHLIWAPSKSGIFSVKSFSLELANMEESRS 1043 Query: 345 IRVSTGYWNIDLIP-KINSFWWLCLKKCLPTKDLLLRRGI--QVDPLCPLCGQQEESLTH 515 + W L+P +I F W + L TK+ LL + D C C ES H Sbjct: 1044 FEATKELWK-GLVPFRIEIFVWFVILGRLNTKEKLLNLKLISNEDSSCIFCSSSIESTNH 1102 Query: 516 LFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCWSI 695 LF CS+ ++W +++ W WV I ++ W P+ K++W + W+I Sbjct: 1103 LFLECSYSKELWHWWFQIWNVAWVLPSSIKELFTHWIPPFKGKFFKKVWMSCFFIILWTI 1162 Query: 696 WLERNNRVF-GRPHSNPSTIFFRARRFIMDNAMCWKKC---QVSDSETSIIK-----EWH 848 W ERN+R+F +P+S + + I+ W K S I++ W Sbjct: 1163 WKERNSRIFQEKPNSK-----LQLKELILLRLGWWIKGWNEPFPYSAEDIVRNPLCLNWL 1217 Query: 849 IPYSPQE---DSFVEVRWFPPTGNGIKFNFDGSYRGG-NVVGCGGIFRDSEGRFLYGFS- 1013 P PQ+ + W PP+ +K+N D S + GG+ RD +G F+ FS Sbjct: 1218 TPVKPQKAIMPAPFPQHWSPPSIGSLKWNVDASIKSSLQKSSIGGVLRDHKGNFICMFSS 1277 Query: 1014 ---FKASGNSALIAEAKSLHFGSRLIRRFMAPITIEGDSLSIIKMAKKEWEHPWYLSNFL 1184 F N+ ++A ++L + R + + I +E DS + + KK+ PW L NF+ Sbjct: 1278 PIPFMEINNAEVLAIHRALKISAACPRIWGSHIIVESDSSNAVSWCKKDASGPWNL-NFI 1336 Query: 1185 EGIWENLAGFDTRFQHTF--REGNKVADLLANHGYEEMD 1295 N A D + T+ RE N VAD LA G D Sbjct: 1337 LNFIRNSASKDPKVSITYKGRETNMVADALAKQGLSRWD 1375 >emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 139 bits (349), Expect = 5e-30 Identities = 122/451 (27%), Positives = 187/451 (41%), Gaps = 24/451 (5%) Frame = +3 Query: 15 FWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSRNRD-Q 191 FW D WL L +FPR F+ V + + E + ++ + R RD + Sbjct: 938 FWLDTWLGDSPLKLRFPRLFTIVDNPMAYIA---SCGSWCGREWVWNFSWSRVFRPRDAE 994 Query: 192 SWKEIKQLVLSE-LPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLP--RPIRVSTG 362 W+E++ L+ S L P D D LIW+ H G F+VKSC +N L IR+ Sbjct: 995 EWEELQGLLGSVCLSPSTD---DRLIWTPHKSGAFSVKSCSKELTNTALKPQSKIRIWGR 1051 Query: 363 YWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGI--QVDPLCPLCGQQEESLTHLFFFCSF 536 W + P+I F W+ L L ++ L I D +C +C E+ HL C F Sbjct: 1052 LWRGLIPPRIEVFSWVALLGKLNSRQKLATLNIIPPDDAVCIMCNGAPETSDHLLLHCPF 1111 Query: 537 LNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCWSIWLERNNR 716 + IW + W +WVF ++ + W + +++W + W+IW ERN R Sbjct: 1112 ASSIWLWWLGIWNVSWVFPKNLFEAFEQWYCHKKNPFFRKVWCSIFSIIIWTIWKERNAR 1171 Query: 717 VFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQED-------- 872 +F R S S + + ++ M W K SI++ P D Sbjct: 1172 IF-RGISCSSN---KLQDLVIIRLMWWIKGWGEAFPYSIVEVLRHPQCLSWDYLKAAPAA 1227 Query: 873 ---SFVEVRWFPPTGNGIKFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALI 1043 S + W PP +K+N D S G GG+ R+S+G F+ FS Sbjct: 1228 TAVSVDGMLWSPPNDGVMKWNVDASVNAGR-SAIGGVLRNSQGIFVCVFSCPIPSIEINS 1286 Query: 1044 AEAKSLHFGSRLIRRF----MAPITIEGDSLSIIKMAKKEWEHPWYLS---NFLEGIWEN 1202 AE +++ ++ F AP+ +E DS + + + + PW L+ NF+ Sbjct: 1287 AEIIAIYRAMQICYSFEFLKRAPLVLESDSANAVMWSNENEGGPWNLNFQLNFIRN--AR 1344 Query: 1203 LAGFDTRFQHTFREGNKVADLLANHGYEEMD 1295 AG + H R N VAD LA G D Sbjct: 1345 KAGLNISIVHKKRSSNAVADALAKQGLSRTD 1375 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 138 bits (347), Expect = 9e-30 Identities = 108/417 (25%), Positives = 192/417 (46%), Gaps = 7/417 (1%) Frame = +3 Query: 144 LSVSEGYANLSRNRDQSWKEIKQLVLSELP--PLLDNKRDSLIWSAHPQGKFTVKSCYLL 317 + V + + N S N ++ ++Q V+ E+ P+ +D W+ P G F+ KS + L Sbjct: 1838 
VQVCDFFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQL 1897 Query: 318 ESNRNLPRPIRVSTGYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQ 497 R + P V W+ + + F W L +P + + +G+Q+ C C + Sbjct: 1898 IRKRKVVNP--VFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCC-KS 1954 Query: 498 EESLTHLFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLK--RIWKIS 671 EES+ H+ + +W F K ++ + I I AW +S D K I + Sbjct: 1955 EESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGAWF--YSGDYCKPGHIRTLV 2012 Query: 672 LPHLCWSIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVS-DSETSIIKEWH 848 + W +W+ERN+ P+ + +R + I ++ + + + I +EW Sbjct: 2013 PLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWG 2072 Query: 849 IPYSPQEDSFVEV-RWFPPTGNGIKFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKAS 1025 I + + + +V W P+ K N DGS + + GGI RD G ++GFS Sbjct: 2073 IIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLG 2132 Query: 1026 GNSALIAEAKSLHFGSRLIRRF-MAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWEN 1202 ++L AE +L+ G L R + + + IE D++S+I++ + P + + + + Sbjct: 2133 TQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQL 2192 Query: 1203 LAGFDTRFQHTFREGNKVADLLANHGYEEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 L+ F RF H FREGN+ AD LAN G+E ++Q F A ++ L D+ ++R Sbjct: 2193 LSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQGKLRGMLCLDQTSFPYVR 2249 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 135 bits (340), Expect = 6e-29 Identities = 102/387 (26%), Positives = 181/387 (46%), Gaps = 7/387 (1%) Frame = +3 Query: 234 PLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWNIDLIPKINSFWWLC 413 P ++ D W+ G F+ +S + R + + W+ + I+ F W Sbjct: 751 PFDKSREDVAYWTLTSNGDFSTRSAGEMIRQRQTSNAL--CSFIWHRSIPLSISFFLWKT 808 Query: 414 LKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWVFS 593 L +P + + +GIQ+ C +C EESL H+ + +W F K ++ + Sbjct: 809 LHNWIPVELRMKEKGIQLASKC-VCCNSEESLIHVLWENPVAKQVWNFFAKLFQIYILNP 867 Query: 594 GDIISIWNAWKSPWSSDGLKR-IWKISLP-HLCWSIWLERNNRVFGRPHSNPSTIFFRAR 767 + I AW S D +++ +++ LP +CW +WLERN+ P + +R Sbjct: 868 
RHVSQIIWAWYV--SGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTM 925 Query: 768 ---RFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFVEV-RWFPPTGNGIKFNFDG 935 R + D ++ + D++ + + + P PQ+ + ++ W P+ K N DG Sbjct: 926 KHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFP--PQQHASPQIIYWKKPSIGEYKLNVDG 983 Query: 936 SYRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLIR-RFMAPITIE 1112 S R G GG+ RD G+ ++GFS ++L AE ++L G L + R + + IE Sbjct: 984 SSRNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIE 1043 Query: 1113 GDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLANHGYEEM 1292 D+L+ I++ + + P+ + LE I L+ F R HTFREGNK AD L+N G++ Sbjct: 1044 MDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQ 1103 Query: 1293 DIQFFDNAPVFIKPALFDDKIGTKFLR 1373 ++ F A + L D++ ++R Sbjct: 1104 NLCVFTEAQGQLHGMLKLDRLNLPYVR 1130 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 135 bits (339), Expect = 8e-29 Identities = 106/415 (25%), Positives = 187/415 (45%), Gaps = 5/415 (1%) Frame = +3 Query: 144 LSVSEGYANLSRNRDQSWKEIKQLVLSELP--PLLDNKRDSLIWSAHPQGKFTVKSCYLL 317 + V + + N S N ++ ++Q V+ E+ P+ +D W+ P G F+ KS + L Sbjct: 497 VQVCDFFMNNSWNVEKLKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQL 556 Query: 318 ESNRNLPRPIRVSTGYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQ 497 R + P V W+ + + F W L +P + + +G+Q+ C C + Sbjct: 557 IRKRKVVNP--VFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCC-KS 613 Query: 498 EESLTHLFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLP 677 EES+ H+ + +W F K ++ + I I AW I + Sbjct: 614 EESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGAWFHSGDYCKPGHIRTLVPL 673 Query: 678 HLCWSIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVS-DSETSIIKEWHIP 854 + W +W+ERN+ P+ + +R + I ++ + + + I +EW I Sbjct: 674 FILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGII 733 Query: 855 YSPQEDSFVEV-RWFPPTGNGIKFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKASGN 1031 + + +V W PT K N DGS + + GGI RD G ++GFS Sbjct: 734 LQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGILRDHAGVMVFGFSENLGIQ 793 Query: 1032 
SALIAEAKSLHFGSRLIRRF-MAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLA 1208 ++L AE +L+ G L R + + + IE D++S+I++ + P + + + + L+ Sbjct: 794 NSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLS 853 Query: 1209 GFDTRFQHTFREGNKVADLLANHGYEEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 F RF H FREGN+ AD LAN G+E ++Q F A ++ L D+ ++R Sbjct: 854 HFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQGKLRGMLRLDQTSFPYVR 908 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 134 bits (338), Expect = 1e-28 Identities = 105/393 (26%), Positives = 177/393 (45%), Gaps = 11/393 (2%) Frame = +3 Query: 228 LPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWNIDLIPKINSFWW 407 L P ++D W+ G+F S + E+ R + + W+ + I+ F W Sbjct: 496 LIPFNRTQQDVAYWTLTSNGEFATWSAW--ETIRQRKSSNALCSFIWHRSIPLSISFFLW 553 Query: 408 LCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWV 587 L +P + + +GIQ+ C +C EESL H+ + S +W F K+++ + Sbjct: 554 RALNNWIPVELRMKEKGIQLASKC-VCCNSEESLMHVLWGNSVAKQVWAFFGKFFQIYVL 612 Query: 588 FSGDIISIWNAWKSPWSSDGLKR--IWKISLPHLCWSIWLERNNRVFGRPHSNPSTIFFR 761 + I AW +S D +K+ I + +CW +WLERN+ NP + +R Sbjct: 613 NPQHVSQILWAWF--FSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWR 670 Query: 762 ARRFI---MDNAMC----WKKCQVSDSETSIIKEW-HIPYSPQEDSFVEVRWFPPTGNGI 917 + + +D ++ WK +T I W H S + W P Sbjct: 671 IMKLLRQLLDGSLLHQWQWK------GDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEY 724 Query: 918 KFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLIR-RFM 1094 K N DGS R G++ GGI RD G+ ++GFS ++L AE ++L G L + R + Sbjct: 725 KLNVDGSSRNGHLAASGGILRDHTGKLIFGFSENIGLCNSLQAELRALLRGLLLCKERHI 784 Query: 1095 APITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLAN 1274 + IE D+L++I++ + + + LE I + L+ R H FREGN+ AD LAN Sbjct: 785 ENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLAN 844 Query: 1275 HGYEEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 G+ ++ A + L D++ ++R Sbjct: 845 EGHSHQNLCVITEAQGELHGMLKLDRLNLPYVR 877 >ref|XP_002303192.1| predicted protein [Populus trichocarpa] Length = 677 Score = 133 bits (335), Expect = 2e-28 Identities 
= 115/400 (28%), Positives = 182/400 (45%), Gaps = 23/400 (5%) Frame = +3 Query: 3 KMIKFWTDKWLKMGTLADKFPR--HFSSVSARVPNTVRGPNIPELLDLELSVSEGYA--- 167 K FW D WL LAD+FP H S+ + + + +D ++ + +G+ Sbjct: 284 KRTVFWHDTWLANYCLADRFPTLYHLSNDKDASIDKMGMWDGDASID-KMGMWDGFEWTW 342 Query: 168 --NLSRN-RDQSWKEIKQLVLSELPPLLDNKRDS-LIWSAHPQGKFTVKSCYLLESNRNL 335 + +R R Q+ ++QL + LDN+ D LIW + G+F+VKS L S + Sbjct: 343 FFSWTRPLRGQNIGLLEQLYVVLSTMHLDNEADDRLIWKDNKSGRFSVKSLCGLLSPTHY 402 Query: 336 PRPIRVSTGYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGI--QVDPLCPLCGQQEESL 509 P G W + PK+ F W+ + L T+ +L+RRG+ + CP+C +EES+ Sbjct: 403 PNNGFSFAGIWKGVVPPKVEIFCWMVIINILNTRGVLVRRGVLDSSNSNCPICLVEEESV 462 Query: 510 THLFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCW 689 HL C IW K+W +W ++ +++ W K+ W + + W Sbjct: 463 DHLILLCYKHLTIWSKIIKWWGLSWCCPKNLSGLFSQWTFMVHGKFQKKAWLMLFFSVAW 522 Query: 690 SIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVSD---SETSIIK--EWHIP 854 S+WL RN+ +F + N ++FF I+ W K D S + +++ E I Sbjct: 523 SLWLLRNDLIFQQKSPNYDSVFF----LIITRLCLWLKAFHPDFPYSPSDLLRSVEGLIR 578 Query: 855 YSPQEDSFVEVRWFPPTGNGIKFNFDGSYRG-GNVVGCGGIFRDSEGRFLYGFSFKASGN 1031 +S + + V W PPT K+N DGS G + G GG+ R+ G L FS Sbjct: 579 WSNVQITRTGVIWSPPTIGSFKWNVDGSSLGKPGLSGIGGVLRNHHGHLLGIFSLPVGIL 638 Query: 1032 SALIAE------AKSLHFGSRLIRRFMAPITIEGDSLSII 1133 + IAE A L +RL+ ITIE DS ++I Sbjct: 639 DSNIAELRAVVKAVELSASNRLLHH--KHITIESDSANVI 676 >emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1389 Score = 131 bits (330), Expect = 8e-28 Identities = 122/447 (27%), Positives = 195/447 (43%), Gaps = 17/447 (3%) Frame = +3 Query: 9 IKFWTDKWLKMGTLADKFPRHFSSVSARVP---NTVRGPNIPELLDLELSVSEGYANLSR 179 I FWTD W+ L K+ S + +V N + G +IP+LL L Sbjct: 948 ISFWTDNWIFQYPLNSKYVPTVGSENIKVAECFNGLGGWDIPKLLTLVPP---------- 997 Query: 180 NRDQSWKEIKQLVLSELPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVST 359 I + + S P +++D L+W P G+++VKS L N +V Sbjct: 998 -------NIVKAISSVFIPS-SSQQDRLLWGLTPTGQYSVKSGASLIREVNGGTIEKVEF 1049 Query: 360 GY-WNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSF 536 + W I PKI +F W L T L R I V C C E++ HL F C F Sbjct: 1050 NWIWGIHAPPKIKNFLWKACNDGLATTSRLERSHIFVPQNCCFCDCPSETICHLCFQCPF 1109 Query: 537 LNDIW-----KLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCWSIWL 701 DI+ K W + +W + + S + ++ + L+ + K+S+ + W +W Sbjct: 1110 TLDIYSHLEDKFQWPAY-PSWFSTLQLSSFRSVLEACHINLTLEYLTKLSI--VWWHVWY 1166 Query: 702 ERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKC--QVSDSETSIIKEWHIPYSPQEDS 875 RN +F +N ST F +A I W+K ++ T + K+ +P S Sbjct: 1167 FRNKLIF----NNESTSFSQASFIIHSFMGKWEKANLEIPSFNTPLPKDCKLPVR----S 1218 Query: 876 FVEVRWFPPTGNGIKFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKASG--NSALIAE 1049 + W PP + +K NFDGS G + R+S G L + KA G S L+AE Sbjct: 1219 GKNLIWSPPNEDVLKVNFDGSKLDNGQAAYGFVIRNSNGEVLMARA-KALGVYPSILMAE 1277 Query: 1050 AKSLHFGSR---LIRRFMAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGF-D 1217 A L G + ++ + I EGD++++I PW ++N + L F + Sbjct: 1278 AMGLLEGIKGAISLQNWSRKIIFEGDNIAVINAMSPSATGPWTIANIILDAGALLGHFQE 1337 Query: 1218 TRFQHTFREGNKVADLLANHGYEEMDI 1298 +FQH +RE N++AD +A+ G+ ++ Sbjct: 1338 VKFQHCYREANRLADFMAHKGHSHPEV 1364 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 130 bits (328), Expect = 1e-27 Identities = 113/451 (25%), Positives = 190/451 (42%), Gaps = 15/451 (3%) Frame = +3 Query: 9 IKFWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSRNRD 188 I FW D W+ L + FP S+ + V+ + + + + D Sbjct: 897 
IFFWHDAWMGDEPLVNSFPSFSQSM--------------------MKVNYFFNDDAWDVD 936 Query: 189 QSWKEIKQLVLSELP--PLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTG 362 + I ++ E+ P+ K D W+ G F++KS + L R V Sbjct: 937 KLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVN--LVGQL 994 Query: 363 YWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLN 542 W+ + ++ F W L LP + + +GIQ+ C LC + EESL H+ + Sbjct: 995 IWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKC-LCCKSEESLLHVLWESPVAQ 1053 Query: 543 DIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLK--RIWKISLPHLCWSIWLERNNR 716 +W F K+++ +I+ I N+W +S D K I + L + W +W+ERN+ Sbjct: 1054 QVWNYFSKFFQIYVHNPQNILQILNSWY--YSGDFTKPGHIRTLILLFIFWFVWVERNDA 1111 Query: 717 VFGRPHSNPSTIFFRA----RRFIMDNAMC---WKKCQVSDSETSIIKEWHIPYSPQEDS 875 P I +R R+ +C WK + I W ++ + + Sbjct: 1112 KHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWK------GDLDIAIHWGFNFAQERQA 1165 Query: 876 FVEV-RWFPPTGNGIKFNFDGSYRGG--NVVGCGGIFRDSEGRFLYGFSFKASGNSALIA 1046 ++ W P +K N DGS + N G GG+ RD G ++GFS ++L A Sbjct: 1166 RPKIINWIKPLIGELKLNVDGSSKDEFQNAAG-GGVLRDHTGNLIFGFSENFGYQNSLQA 1224 Query: 1047 EAKSLHFGSRLIRRF-MAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTR 1223 E +LH G L + ++ + IE D+ +I+M + + + + LE I + L R Sbjct: 1225 ELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVR 1284 Query: 1224 FQHTFREGNKVADLLANHGYEEMDIQFFDNA 1316 H REGN+ AD L+ HG+ ++ F A Sbjct: 1285 ISHIHREGNQAADFLSKHGHTHQNLHVFTEA 1315 >emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1381 Score = 130 bits (327), Expect = 2e-27 Identities = 119/449 (26%), Positives = 185/449 (41%), Gaps = 22/449 (4%) Frame = +3 Query: 12 KFWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSRNRDQ 191 +FW D WL +L +FPR FS ++ +V E + S S + + R +D Sbjct: 937 RFWLDSWLSSSSLKSEFPRLFS-ITMNPNASVESLGFWEGYNWVWSFS--WKRILRPQDA 993 Query: 192 SWKEIKQLVLSELPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWN 371 K +L ++ P +D LIW+ G F+ KS P G W Sbjct: 994 IEKARLDNLLLQVCPARQ-AQDHLIWAFSKSGSFSTKSVSRQLVKLQHPHYQDAIRGVW- 1051 Query: 372 IDLIP-KINSFWWLCLKKCLPTKDLLLRRGIQVDP--LCPLCGQQEESLTHLFFFCSFLN 542 + L+P +I F WL L + T+D L GI +CPLC + E+ HL C + Sbjct: 1052 VGLVPHRIELFVWLALLGKINTRDKLASLGIIHGDCNICPLCMTEPETAEHLLLHCPVAS 1111 Query: 543 DIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLCWSIWLERNNRVF 722 IW + W+ W F + + W P +S K++W + W++W ERN R+F Sbjct: 1112 QIWSWWIGLWRIKWAFPLSLREAFTQWFWPKNSPFFKKVWSAVFFIIVWTLWKERNQRIF 1171 Query: 723 GRPHSNPSTIFFRARRFIMDNAMC---WKKCQVSDSETSIIK-----EWH-IPYSPQEDS 875 +NPST+ +M WK + + T I++ +W I + D Sbjct: 1172 S---NNPSTVKVLKDMVLMRLGWWISGWKD-EFPYNPTDIMRNPSCLQWSGIKDDSKADL 1227 Query: 876 FVE--VRWFPPTGNGIKFNFDGSYRGGNV-VGCGGIFRDSEGRFLYGFS----FKASGNS 1034 ++ V W PP IK+N D S + GG+ R+ G F+ FS F + Sbjct: 1228 VIKSSVSWCPPPSQIIKWNVDASVHTCSARSAIGGVLRNHSGNFMCLFSSPIPFMEINCA 1287 Query: 1035 ALIAEAKSLHFGSRLIRRFMAPITIEGDSLSIIKMAKKEWEHPWYLS---NFLEGIWENL 1205 ++A +++ S A I +E DS + + + PW L+ NF+ Sbjct: 1288 EILAIHRAVKISSAKEELKGAKIILESDSKNAVLWCNSDSGGPWNLNFQLNFIRN--TRK 1345 Query: 1206 AGFDTRFQHTFREGNKVADLLANHGYEEM 1292 G D H R N VAD +A G + Sbjct: 1346 GGLDISIVHRSRSANVVADSMAKQGLHRL 1374 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 130 bits (326), Expect = 2e-27 Identities = 108/417 (25%), Positives = 187/417 (44%), Gaps = 7/417 (1%) Frame = +3 Query: 144 LSVSEGYANLSRNRDQSWKEIKQLVLSELP--PLLDNKRDSLIWSAHPQGKFTVKSCYLL 317 + V + + N S + ++ ++Q V+ E+ P+ +D W+ P G+F+ KS + L Sbjct: 1836 
VQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQL 1895 Query: 318 ESNRNLPRPIRVSTGYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQ 497 R + P V W+ + I+ F W L +P + + +G Q+ C C + Sbjct: 1896 IRKREVVNP--VFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCC-KS 1952 Query: 498 EESLTHLFFFCSFLNDIWKLFWKYWKTNWVFSGDIISIWNAWKSPWSSDGLK--RIWKIS 671 EES+ H+ + +W F K+++ + I I AW +S D K I + Sbjct: 1953 EESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWF--YSGDYCKPGHIRTLV 2010 Query: 672 LPHLCWSIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVS-DSETSIIKEWH 848 W +W+ERN+ P+ I +R + I ++ + + + I +EW Sbjct: 2011 PIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWG 2070 Query: 849 IPYSPQEDSFVEV-RWFPPTGNGIKFNFDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKAS 1025 I + + +V W P+ K N DGS + GG+ RD G ++GFS Sbjct: 2071 ITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRDHAGVMVFGFSENLG 2130 Query: 1026 GNSALIAEAKSLHFGSRLIRRF-MAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWEN 1202 ++L AE +L+ G L R + + + IE D+ S+I++ + P + L I + Sbjct: 2131 IQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQL 2190 Query: 1203 LAGFDTRFQHTFREGNKVADLLANHGYEEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 L+ F R H FREGN+ AD LAN G+E +Q A ++ L D+ ++R Sbjct: 2191 LSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQGKLRGMLRLDQTSLPYVR 2247 >ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 364 Score = 130 bits (326), Expect = 2e-27 Identities = 106/368 (28%), Positives = 162/368 (44%), Gaps = 10/368 (2%) Frame = +3 Query: 255 DSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTG--YWNIDLIPKINSFWWLCLKKCL 428 D LIW G+ + K + PR + G W+ +IP+I+ W L+ + Sbjct: 3 DKLIWVPLSSGELSAKEAFQFLR----PRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRV 58 Query: 429 PTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWVFSGDIIS 608 ++DLL RRGI + C LCG+ ESL H+F CSF +W ++ + + Sbjct: 59 LSEDLLQRRGIALASRCVLCGRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQNLVDL 118 Query: 609 IWNAWKSPWSSDGLKRIWKISLPHLCWSIWLERNNRVFGRPHSNPSTIFFRARRFIMDNA 788 ++ + S LK IW I W IW RN H N + + R+ IM + Sbjct: 119 LY--YGGVGRSHQLKEIWLICYTTTLWFIWKARNK----MRHDNCTIVVDAVRQLIMGHV 172 Query: 789 MCWKKCQV-----SDSETSIIKEWHIPYSP-QEDSFVEVRWFPPTGNGIKFNFDGSY-RG 947 K + S +E ++K++ + P + EV W PP IK N DG++ + Sbjct: 173 KTASKLALGCMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKT 232 Query: 948 GNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLI-RRFMAPITIEGDSL 1124 G GGIFRD G FL F+ +++ AE ++ L R I +E DS+ Sbjct: 233 TGKSGYGGIFRDFHGSFLGAFASNLEILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDSI 292 Query: 1125 SIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLANHGYEEMDIQF 1304 ++ + PW L ++ + R H FREGN+VAD LAN G + + Sbjct: 293 IVLNFLQDPHLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLSMSALSW 352 Query: 1305 FDNAPVFI 1328 +D P FI Sbjct: 353 WDEPPHFI 360 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 129 bits (324), Expect = 4e-27 Identities = 100/388 (25%), Positives = 177/388 (45%), Gaps = 6/388 (1%) Frame = +3 Query: 228 LPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWNIDLIPKINSFWW 407 L P ++D W+ G+F+ KS + E+ R + + W+ + I+ F W Sbjct: 1832 LIPFDRTQQDVAYWTLTSNGEFSTKSAW--ETIRQQQSHNTLGSLIWHRSIPLSISFFIW 1889 Query: 408 LCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWV 587 L +P + + +GI + C +C EESL H+ + S +W F K+++ + Sbjct: 1890 RALNNWIPVELRMKGKGIHLASKC-VCCNSEESLMHVLWGNSVAKQVWAFFAKFFQIYVL 1948 Query: 588 
FSGDIISIWNAWKSPWSSDGLKR--IWKISLPHLCWSIWLERNNRVFGRPHSNPSTIFFR 761 + I AW +S D +KR I + +CW +WLERN+ + N I +R Sbjct: 1949 NPKHVSHILWAWF--YSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWR 2006 Query: 762 ARRFIM---DNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFVEVRWFPPTGNGIKFNFD 932 + + D ++ + D++ + + +++ + + V W P+ K N D Sbjct: 2007 IMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQI-VYWRKPSTGEYKLNVD 2065 Query: 933 GSYRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLIR-RFMAPITI 1109 GS R G GG+ RD G+ ++GFS ++L AE ++L G L + R + + I Sbjct: 2066 GSSRHGQHAASGGVLRDHTGKLIFGFSENIGTCNSLQAELRALLRGLLLCKERHIEKLWI 2125 Query: 1110 EGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLANHGYEE 1289 E D+L+ I++ + + LE I + L R H REGN+VAD L+N G+ Sbjct: 2126 EMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNH 2185 Query: 1290 MDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 ++ F A + L D++ ++R Sbjct: 2186 QNLHVFTEAQGKLHGMLKLDRLNLPYVR 2213 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 127 bits (318), Expect = 2e-26 Identities = 116/462 (25%), Positives = 194/462 (41%), Gaps = 9/462 (1%) Frame = +3 Query: 15 FWTDKWLKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSRNRDQS 194 FW D W+ L ++ SS++ VS+ + N S N ++ Sbjct: 1778 FWHDCWMGEEPLVNRNQAFASSMA--------------------QVSDFFLNNSWNVEKL 1817 Query: 195 WKEIKQLVLSELP--PLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYW 368 ++Q V+ E+ P+ + D W+ P G F+ KS + L NR + P V W Sbjct: 1818 KTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENP--VFNFIW 1875 Query: 369 NIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDI 548 + + + F W L +P + + +G Q+ C C + EESL H+ + N + Sbjct: 1876 HKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCC-KSEESLMHVMWKNPVANQV 1934 Query: 549 WKLFWKYWKTNWVFSGDIISIWNAW--KSPWSSDGLKRIWKISLPHLCWSIWLERNNRVF 722 W F K ++ + I I AW +S G I + W +W+ERN+ Sbjct: 1935 WSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPG--HIRTLVPLFTLWFLWVERNDAKH 1992 Query: 723 GRPHSNPSTIFFRARRFIMDNAMCWKKCQVSD--SETSIIKEWHIPYSPQEDSFVEVR-W 893 P+ + ++ + ++ K+ Q + I +EW I S ++ W Sbjct: 1993 
RNLGMYPNRVVWKILK-LLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFW 2051 Query: 894 FPPTGNGIKFNFDGSYRGG-NVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFG 1070 P+ +K N DGS + GG+ RD G ++GFS +L AE +LH G Sbjct: 2052 LKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRG 2111 Query: 1071 SRL-IRRFMAPITIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREG 1247 L I ++ + IE D+ ++M K+ + L I L+G R H FREG Sbjct: 2112 LLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREG 2171 Query: 1248 NKVADLLANHGYEEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 N+ AD L+N G+ ++Q A ++ L +KI ++R Sbjct: 2172 NQAADHLSNQGHTHQNLQVISQAEGQLRGILRLEKINLAYVR 2213 >ref|XP_004299997.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 448 Score = 125 bits (315), Expect = 5e-26 Identities = 121/476 (25%), Positives = 198/476 (41%), Gaps = 18/476 (3%) Frame = +3 Query: 3 KMIKFWTDKW-LKMGTLADKFPRHFSSVSARVPNTVRGPNIPELLDLELSVSEGYANLSR 179 KM+KFW+D W L + L P ++S+ V + G+ N+ Sbjct: 13 KMVKFWSDTWVLSVPLLQFALPHAVINLSSTV--------------CDFWCDTGW-NIEM 57 Query: 180 NRDQSWKEIKQLVLSELPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVST 359 D E+ ++S D+ D LIW A G F+VKS Y+ + P+ Sbjct: 58 LSDVVPPEVVNQIISFPTGFEDSGNDQLIWKATSNGVFSVKSAYISSFDMAEPQHHYWKV 117 Query: 360 GYWNIDLIPKINSFWWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESL-----THL-- 518 W ++ +PK+ +F+W L K + T +RR +CP+C +ESL T L Sbjct: 118 -VWKLNCLPKLKTFFWTVLHKKILTNVQRVRRRFTTSAVCPICNSADESLHSETVTGLKR 176 Query: 519 --FFFCSFLNDIWKLFWK--YWKTNWVFSGDIISIWNAWKSPWSSDGLKRIWKISLPHLC 686 F L + +L W W + +F + +G+K W +C Sbjct: 177 FGNLFAPLLIFLTRLSWAGIIWISAQLFCQSKV-----------KNGIK--WCNLFVFVC 223 Query: 687 WSIWLERNNRVFGRPHSNPSTIFFRARRFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQ 866 W +W RN VF P ++ + W Q + S +++ +++ Y Sbjct: 224 WFLWKWRNKIVFDSSFVMPGDPALVIWNYVEE----WTSAQSNPSSSNM---FNVTY--- 273 Query: 867 EDSFVEVRWFPPTGNGIKFNFDGSYRGGN-VVGCGGIFRDSEGRFLYGFSFKASGNSALI 1043 + W P N +K N DG+ + +G GG+ RD G +++G L Sbjct: 274 ------LSWLRPPANCLKLNIDGTRSSSSGKIGAGGVLRDHAGNWIFGCQINLGVGEVLY 327 Query: 1044 
AEAKSLHFGSRLIRRF-MAPITIEGDSLSIIK-MAKKEWE-HPWYLSNFLEGIWENLAGF 1214 AEA L FG +L+ +F + + +E DS +++ M K+ +E HP L + L ++ Sbjct: 328 AEAWGLLFGLKLVAKFYCSDLEVESDSAVLVQLMQKRSFELHP--LGSLLSACSSFMSKM 385 Query: 1215 -DTRFQHTFREGNKVADLLANHGY-EEMDIQFFDNAPVFIKPALFDDKIGTKFLRR 1376 + + H FRE N VAD LA ++ + F++ PV A DD G RR Sbjct: 386 PNVKLSHIFRECNMVADSLAKCSITHDLGLVTFNSPPVHAVQAYLDDLDGVVRARR 441 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 125 bits (313), Expect = 8e-26 Identities = 97/386 (25%), Positives = 173/386 (44%), Gaps = 6/386 (1%) Frame = +3 Query: 234 PLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWNIDLIPKINSFWWLC 413 P ++ D W+ G+F+ S + E+ R P + + W+ + I+ F W Sbjct: 633 PFDRSQDDIAYWALTSDGEFSTWSAW--EAVRQRQSPNTLCSFIWHKSIPLTISFFLWRV 690 Query: 414 LKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWVFS 593 L +P + L +G + C +C EESL H+ + +W F +++ N Sbjct: 691 LNNWIPVELRLKEKGFHLASKC-VCCNSEESLIHVLWDNPVAKQVWNFFADFFQINISNP 749 Query: 594 GDIISIWNAWKSPWSSDGLKR--IWKISLPHLCWSIWLERNN---RVFGRPHSNPSTIFF 758 + I AW +S D +++ I + +CW +WLERN+ R G Sbjct: 750 QHVSQIIWAWY--YSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIM 807 Query: 759 RARRFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFVEVRWFPPTGNGIKFNFDGS 938 + R + D ++ K D++ + + + +P +E + + W P K N DGS Sbjct: 808 KVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQI-IHWVKPVTGEYKLNVDGS 866 Query: 939 YRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLIR-RFMAPITIEG 1115 R GG+ RD G ++GFS +++L AE ++L G L + R + + IE Sbjct: 867 SRHNQSAATGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKDRNIEKLWIEM 926 Query: 1116 DSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLANHGYEEMD 1295 D+L +I+M ++ + + L I + L+ F R H FREGN+ AD L+N G+ + Sbjct: 927 DALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQN 986 Query: 1296 IQFFDNAPVFIKPALFDDKIGTKFLR 1373 +Q A + L D++ +++ Sbjct: 987 LQVISEAQGKLHGMLKLDRLNLPYVK 1012 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 124 bits (312), Expect = 1e-25 Identities = 
100/390 (25%), Positives = 181/390 (46%), Gaps = 8/390 (2%) Frame = +3 Query: 228 LPPLLDNKRDSLIWSAHPQGKFTVKSCYLLESNRNLPRPIRVSTGYWNIDLIPKINSFWW 407 L P ++D W G+F+ +S + E+ R + + W+ + I+ F W Sbjct: 544 LIPFDRTQQDVAYWILTSNGEFSTRSAW--ETIRKRQPHNTLGSLIWHRSIPLSISFFIW 601 Query: 408 LCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTNWV 587 L +P + + +GI + C +C EESL H+ + S +W F +++ ++ Sbjct: 602 RALNNWIPVELRMKEKGIHLASKC-VCCNSEESLMHVLWGNSVAKQVWAFFANFFQI-YI 659 Query: 588 FSGDIIS--IWNAWKSPWSSDGLKR--IWKISLPHLCWSIWLERNN---RVFGRPHSNPS 746 F+ +S +W AW +S D +KR I + +CW +WLERN+ R G Sbjct: 660 FNPQHVSHILW-AWF--YSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVV 716 Query: 747 TIFFRARRFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFVEVRWFPPTGNGIKFN 926 + R + D ++ + D++ + + ++++ + + V W P+ K N Sbjct: 717 WRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQI-VYWRKPSTGEYKLN 775 Query: 927 FDGSYRGGNVVGCGGIFRDSEGRFLYGFSFKASGNSALIAEAKSLHFGSRLIR-RFMAPI 1103 DGS R G GG+ RD G+ ++GFS ++L AE ++L G L + R + + Sbjct: 776 VDGSSRHGQHAASGGVLRDHTGKLIFGFSENIGNCNSLQAELRALLRGLLLCKERHIEQL 835 Query: 1104 TIEGDSLSIIKMAKKEWEHPWYLSNFLEGIWENLAGFDTRFQHTFREGNKVADLLANHGY 1283 IE D+L++I++ + + LE I + L R H REGN+VAD L+N G+ Sbjct: 836 WIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGH 895 Query: 1284 EEMDIQFFDNAPVFIKPALFDDKIGTKFLR 1373 +++ F A + L D++ ++R Sbjct: 896 NHQNLRVFTEAQGKLHGMLKLDRLNLPYVR 925 >ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana] gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR reverse transcriptase [Arabidopsis thaliana] gi|332641254|gb|AEE74775.1| RNase H domain-containing protein [Arabidopsis thaliana] Length = 484 Score = 124 bits (311), Expect = 1e-25 Identities = 107/364 (29%), Positives = 163/364 (44%), Gaps = 20/364 (5%) Frame = +3 Query: 249 KRDSLIWSAHPQGKFTVKSCYLL---ESNRNLPR------PIRVSTGYWNIDLIPKINSF 401 K D +IW+ + G++TV+S Y L + + N+P I + T WN+ ++PK+ F Sbjct: 115 KPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWNLPIMPKLKHF 174 Query: 402 
WWLCLKKCLPTKDLLLRRGIQVDPLCPLCGQQEESLTHLFFFCSFLNDIWKLFWKYWKTN 581 W L + L T + L RG+++DP CP C ++ ES+ H F C F W+L N Sbjct: 175 LWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIRN 234 Query: 582 WVFSGD----IISIWNAWKSPWSSD--GLKRIWKISLPHLCWSIWLERNNRVFGRPHSNP 743 + S D I +I N + SD L +W L W IW RNN VF + +P Sbjct: 235 QLMSNDFEENISNILNFVQDTTMSDFHKLLPVW------LIWRIWKARNNVVFNKFRESP 288 Query: 744 STIFFRARRFIMDNAMCWKKCQVSDSETSIIKEWHIPYSPQEDSFVEVRWFPPTGNGIKF 923 S A+ D W S +T P ++ + ++ W P +K Sbjct: 289 SKTVLSAKAETHD----WLNATQSHKKT--------PSPTRQIAENKIEWRNPPATYVKC 336 Query: 924 NFDGSYRGGNVVGCGG-IFRDSEGRFLYGFSFK-ASGNSALIAEAKSLHFG-SRLIRRFM 1094 NFD + + GG I R+ G + S K A ++ L AE K+L + R Sbjct: 337 NFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGY 396 Query: 1095 APITIEGDSLSIIKMAKKEWEHPWYLSNFLEGI--WENLAGFDTRFQHTFREGNKVADLL 1268 + +EGD ++I + H L+N LE I W N +F R+GNK+A +L Sbjct: 397 TQVFMEGDCQTLINLINGISFHS-SLANHLEDISFWANKFA-SIQFGFIRRKGNKLAHVL 454 Query: 1269 ANHG 1280 A +G Sbjct: 455 AKYG 458