BLASTX nr result
ID: Rehmannia22_contig00021327
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia22_contig00021327 (1183 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] 164 3e-51 gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] 164 3e-51 gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] 160 2e-50 gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] 166 3e-50 gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] 165 6e-50 gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] 163 1e-49 gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] 159 2e-49 gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] 165 3e-49 gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] 154 4e-49 gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] 158 9e-49 gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] 156 3e-48 gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] 148 6e-48 gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] 155 6e-48 gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] 156 8e-47 gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob... 152 3e-46 gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao] 151 9e-36 ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A... 109 1e-31 gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao] 140 1e-30 gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] 137 7e-30 ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258... 111 5e-29 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 164 bits (414), Expect(2) = 3e-51 Identities = 88/247 (35%), Positives = 138/247 (55%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P ILWF+W+ERN KH + G R++ ++ I+ L K QK W+G Sbjct: 3258 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQ 3317 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +GI + + W+KP +G++KLN+DGS GG+LRD G++I Sbjct: 3318 IAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIF 3377 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ F DS AE+ ALH+ L L + ++WIE D++V +Q+I+ G ++ Sbjct: 3378 GFSENF-GSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRY 3436 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL+ I + + G ++ISHIFREGN+ AD L+ G + QN Q+RG+ R+D+ Sbjct: 3437 LLASIHRCLSGISFRISHIFREGNQAADHLSNQGYTHQNLQ--VISQAEGQLRGILRLDK 3494 Query: 1046 LNLPSFR 1066 +NL R Sbjct: 3495 INLAYVR 3501 Score = 66.6 bits (161), Expect(2) = 3e-51 Identities = 32/94 (34%), Positives = 55/94 (58%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 NP + + + + S F+WRL+ + +PV+ K+ +GF LA +C CC+ S ESL+H+ Sbjct: 3156 NPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCK--SEESLMHVM 3213 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N +++VW +FA + + + IN IS+W Sbjct: 3214 WDNPVANQVWSYFAKVFQIHIINPCTINHIISAW 3247 Score = 150 bits (380), Expect(2) = 2e-46 Identities = 85/230 (36%), Positives = 125/230 (54%), Gaps = 1/230 (0%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP I WF+WLERN KH G RVI R+ + L + L ++ WKG + Sbjct: 1464 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTD 1523 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +G + Q + W KP +G+YKLN+DGS + G GGVLRD G + Sbjct: 1524 IATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAF 1582 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 AF++ P S AE+ AL + L L + + +WIE D+ V +Q++ + G ++ Sbjct: 1583 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1641 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNF-VQFFADDFP 1012 LL IR + F ++ISHI+REGN+ AD L+ G + Q+ V A +FP Sbjct: 1642 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVVSEAQEFP 1691 Score = 63.5 bits (153), Expect(2) = 2e-46 Identities = 29/80 (36%), Positives = 47/80 (58%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+WR++ N +PV+ ++ D+G LA KC CC S ESL+H+ N + +VW FA Sbjct: 1376 SISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCR--SEESLIHVLWENPVAKQVWNFFA 1433 Query: 237 NMLHFSLPDTENINTFISSW 296 + ++I+ I +W Sbjct: 1434 KSFQIYVSKPKHISQIIWAW 1453 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 164 bits (414), Expect(2) = 3e-51 Identities = 89/244 (36%), Positives = 140/244 (57%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP I WF+WLERN KH G + RV+ ++ + L+ L +K WKG + Sbjct: 770 HIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTD 829 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +AA +G + + + +HW KP G+YKLN+DGS S GG+LRD G ++ Sbjct: 830 IAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQS-AATGGLLRDHTGTLVF 888 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL + L L + + K+WIE D+ V++Q+I + G ++ Sbjct: 889 GFSEN-IGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRY 947 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL+ IRK + F ++ISHIFREGN+ AD L+ G + QN + ++ G+ ++D+ Sbjct: 948 LLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQNLQ--VISEAQGKLHGMLKLDR 1005 Query: 1046 LNLP 1057 LNLP Sbjct: 1006 LNLP 1009 Score = 66.6 bits (161), Expect(2) = 3e-51 Identities = 28/80 (35%), Positives = 51/80 (63%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 ++S F+WR++ N +PV+ +L ++GF LA KC CC S ESL+H+ N + +VW FA Sbjct: 682 TISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCN--SEESLIHVLWDNPVAKQVWNFFA 739 Query: 237 NMLHFSLPDTENINTFISSW 296 + ++ + ++++ I +W Sbjct: 740 DFFQINISNPQHVSQIIWAW 759 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 160 bits (404), Expect(2) = 2e-50 Identities = 90/249 (36%), Positives = 139/249 (55%), Gaps = 2/249 (0%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P LWF+W+ERN KH + G RV+ ++ ++ L K QK W+G Sbjct: 1970 HIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQ 2029 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS--KNPISSCGGIGGVLRDWQGNV 679 +A +GI + + W KP +G+ KLN+DGS NP S+ G GG+LRD G++ Sbjct: 2030 IAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAG--GGLLRDHTGSM 2087 Query: 680 ILAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSY 859 I F++ F DS AE+ ALH+ L L + ++WIE D++V +Q+I G Sbjct: 2088 IFGFSENF-GPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRT 2146 Query: 860 QHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARV 1039 ++LL+ I + + G ++ISHIFREGN+ AD L+ G + QN Q+RG+ R+ Sbjct: 2147 RYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHTHQNLQ--VISQAEGQLRGILRL 2204 Query: 1040 DQLNLPSFR 1066 +++NL R Sbjct: 2205 EKINLAYVR 2213 Score = 67.8 bits (164), Expect(2) = 2e-50 Identities = 32/94 (34%), Positives = 55/94 (58%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 NP+F + + + S F+WRL+ + +PV+ K+ +GF LA +C CC+ S ESL+H+ Sbjct: 1868 NPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCK--SEESLMHVM 1925 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N +++VW +FA + + + IN I +W Sbjct: 1926 WKNPVANQVWSYFAKVFQIQIINPCTINQIICAW 1959 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 166 bits (420), Expect(2) = 3e-50 Identities = 95/250 (38%), Positives = 134/250 (53%), Gaps = 1/250 (0%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 H ++P I WF+WLERN KH G RVI R H L L Q+ WKG + Sbjct: 888 HFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 947 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682 +AA G F ++W KP +G+YKLN+DGS +N + + GGVLRD G +I Sbjct: 948 IAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 1005 Query: 683 LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862 F++ C +S AE+ AL + L L + K+WIE D+ +QLI + G + + Sbjct: 1006 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIR 1064 Query: 863 HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042 +LL IR + F +++SH FREGNK AD L+ G QN F + Q+ G+ ++D Sbjct: 1065 YLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 1122 Query: 1043 QLNLPSFRTR 1072 +LNLP R R Sbjct: 1123 RLNLPYVRFR 1132 Score = 60.8 bits (146), Expect(2) = 3e-50 Identities = 26/80 (32%), Positives = 47/80 (58%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+W+ + N +PV+ ++ ++G LA KC CC S ESL+H+ N + +VW FA Sbjct: 800 SISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCN--SEESLIHVLWENPVAKQVWNFFA 857 Query: 237 NMLHFSLPDTENINTFISSW 296 + + + +++ I +W Sbjct: 858 KLFQIYILNPRHVSQIIWAW 877 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 165 bits (418), Expect(2) = 6e-50 Identities = 94/248 (37%), Positives = 135/248 (54%), Gaps = 1/248 (0%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 H ++P I WF+WLERN KH G A RVI R H L L Q+ WKG + Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 1943 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682 +A G F ++W KP +G+YKLN+DGS +N + + GGVLRD G +I Sbjct: 1944 IATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 2001 Query: 683 LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862 F++ C +S AE+ AL + L L + K+WIE D+ V +QLI + G ++ + Sbjct: 2002 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLR 2060 Query: 863 HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042 +LL IR + F +++SHI REGN+ AD L+ G QN F + Q+ G+ ++D Sbjct: 2061 YLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 2118 Query: 1043 QLNLPSFR 1066 +LNLP R Sbjct: 2119 RLNLPYVR 2126 Score = 60.5 bits (145), Expect(2) = 6e-50 Identities = 26/80 (32%), Positives = 47/80 (58%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+W+ + N +PV+ ++ ++G LA KC CC S ESL+H+ N + +VW FA Sbjct: 1796 SISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCN--SEESLIHVLWENPVAKQVWNFFA 1853 Query: 237 NMLHFSLPDTENINTFISSW 296 + + + +++ I +W Sbjct: 1854 QLFQIYIWNPRHVSQIIWAW 1873 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 163 bits (412), Expect(2) = 1e-49 Identities = 91/247 (36%), Positives = 136/247 (55%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P I WF+WLERN KH G RV+ R+ + L L Q+ WKG + Sbjct: 683 HIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTD 742 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +AA + ++ + V+W KP G+YKLN+DGS GGVLRD G +I Sbjct: 743 IAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 801 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ +C +S AE+ AL + L L + ++WIE D+ ++QLI H+ G ++ Sbjct: 802 GFSENIGNC-NSLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRY 860 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IRK + ++ISHI REGN+VAD L+ G + QN F + ++ G+ ++D+ Sbjct: 861 LLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVF--TEAQGKLHGMLKLDR 918 Query: 1046 LNLPSFR 1066 LNLP R Sbjct: 919 LNLPYVR 925 Score = 62.0 bits (149), Expect(2) = 1e-49 Identities = 27/80 (33%), Positives = 49/80 (61%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+WR + N +PV+ ++ ++G LA KC CC S ESL+H+ N+ + +VW FA Sbjct: 595 SISFFIWRALNNWIPVELRMKEKGIHLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFA 652 Query: 237 NMLHFSLPDTENINTFISSW 296 N + + ++++ + +W Sbjct: 653 NFFQIYIFNPQHVSHILWAW 672 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 159 bits (402), Expect(2) = 2e-49 Identities = 90/249 (36%), Positives = 136/249 (54%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP I WF+WLERN KH G RVI R+ + L + L ++ WKG + Sbjct: 1707 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTD 1766 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +G F ++W KP +G+YKLN+DGS + G GGVLRD G + Sbjct: 1767 IATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG-GGVLRDHTGKLAF 1825 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 AF++ P S AE+ AL + L L + + +WIE D+ V +Q++ + G ++ Sbjct: 1826 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1884 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IR + F ++ISHI+REGN+ AD L+ G + Q+ F + ++ G+ ++D+ Sbjct: 1885 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF--SEAQGELIGILKLDK 1942 Query: 1046 LNLPSFRTR 1072 LNLP R R Sbjct: 1943 LNLPYVRFR 1951 Score = 65.1 bits (157), Expect(2) = 2e-49 Identities = 31/94 (32%), Positives = 51/94 (54%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 N +F + + S+S F+WR++ N +PV+ ++ D+G LA KC CC S ESL+H+ Sbjct: 1605 NALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCR--SEESLIHVL 1662 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N + +VW FA + +I+ I +W Sbjct: 1663 WENPVATQVWFFFAKSFQIYVSKPNHISQIIWAW 1696 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 165 bits (418), Expect(2) = 3e-49 Identities = 91/247 (36%), Positives = 137/247 (55%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P I WF+WLERN K+ G + R++ R+ + LK L Q+ WKG + Sbjct: 1971 HIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTD 2030 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +AA + F++ + V+W KP G+YKLN+DGS GGVLRD G +I Sbjct: 2031 IAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 2089 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ C +S AE+ AL + L L + K+WIE D+ +QL+ H+ G ++ Sbjct: 2090 GFSENIGTC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRY 2148 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IRK + ++ISHI REGN+VAD L+ G + QN F + ++ G+ ++D+ Sbjct: 2149 LLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVF--TEAQGKLHGMLKLDR 2206 Query: 1046 LNLPSFR 1066 LNLP R Sbjct: 2207 LNLPYVR 2213 Score = 58.2 bits (139), Expect(2) = 3e-49 Identities = 26/80 (32%), Positives = 47/80 (58%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+WR + N +PV+ ++ +G LA KC CC S ESL+H+ N+ + +VW FA Sbjct: 1883 SISFFIWRALNNWIPVELRMKGKGIHLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFA 1940 Query: 237 NMLHFSLPDTENINTFISSW 296 + + ++++ + +W Sbjct: 1941 KFFQIYVLNPKHVSHILWAW 1960 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 154 bits (390), Expect(2) = 4e-49 Identities = 87/247 (35%), Positives = 138/247 (55%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP I WF+WLERN KH G + RV+ ++ + L+ L + WKG + Sbjct: 1710 HIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKD 1769 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 A +G++ +HW KP G++KLN+DGS + IGGVLRD G ++ Sbjct: 1770 FATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQ-NQTAAIGGVLRDHTGTLVF 1828 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL + L L + + K+W+E D+ V +Q+I + G ++ Sbjct: 1829 DFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRY 1887 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL+ IRK + F ++ISHIFREGN+ AD L+ G + Q+ F + ++ G+ ++D+ Sbjct: 1888 LLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVF--TEAQGKLYGMLKLDR 1945 Query: 1046 LNLPSFR 1066 LNLP R Sbjct: 1946 LNLPYVR 1952 Score = 68.6 bits (166), Expect(2) = 4e-49 Identities = 31/80 (38%), Positives = 48/80 (60%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+WR+ N +PVD +L ++GF LA KC CC S ESL+H+ N + +VW FA Sbjct: 1622 SISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCN--SEESLIHVLWDNPIAKQVWNFFA 1679 Query: 237 NMLHFSLPDTENINTFISSW 296 N + +N++ + +W Sbjct: 1680 NSFQIYISKPQNVSQILWTW 1699 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 158 bits (400), Expect(2) = 9e-49 Identities = 87/247 (35%), Positives = 135/247 (54%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P ILWF+W+ERN KH + G RV+ RV I L + K WKG Sbjct: 2007 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 2066 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +GI F+ + WHKP LG++KLN+DGS + G GG+LRD G ++ Sbjct: 2067 IAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVF 2125 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL++ L L + ++WIE D+ +++L+ N G + ++ Sbjct: 2126 GFSEN-LGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 2184 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 L+ +R+ + F ++ SHIFREGN+ AD LA G QN F ++RG+ +DQ Sbjct: 2185 LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLCLDQ 2242 Query: 1046 LNLPSFR 1066 + P R Sbjct: 2243 TSFPYVR 2249 Score = 63.5 bits (153), Expect(2) = 9e-49 Identities = 30/94 (31%), Positives = 53/94 (56%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 NP+F + + + S F+WRL+ + +PV+ K+ +G LA +C CC+ S ES++H+ Sbjct: 1905 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCK--SEESIMHVM 1962 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N + +VW +FA + + + IN I +W Sbjct: 1963 WDNPVAMQVWNYFAKLFQILIINPCTINQIIGAW 1996 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 156 bits (395), Expect(2) = 3e-48 Identities = 86/247 (34%), Positives = 136/247 (55%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P LWF+W+ERN KH + G R++ R+ I L + K WKG Sbjct: 2005 HIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQ 2064 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +GI F+ + WHKP +G++KLN+DGS + G GGVLRD G ++ Sbjct: 2065 IAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVF 2123 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL++ L L + ++WIE D+ +++L+ N G + ++ Sbjct: 2124 GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRY 2182 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IR+ + F +++SHIFREGN+ AD LA G Q+ ++RG+ R+DQ Sbjct: 2183 LLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQ--GKLRGMLRLDQ 2240 Query: 1046 LNLPSFR 1066 +LP R Sbjct: 2241 TSLPYVR 2247 Score = 63.5 bits (153), Expect(2) = 3e-48 Identities = 29/94 (30%), Positives = 54/94 (57%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 NP+F + + ++S F+WRL+ + +PV+ K+ +GF LA +C CC+ S ES++H+ Sbjct: 1903 NPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCK--SEESIMHVM 1960 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N + +VW +F+ + + IN + +W Sbjct: 1961 WDNPVATQVWNYFSKFFQILVINPCTINQILGAW 1994 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 148 bits (373), Expect(2) = 6e-48 Identities = 83/223 (37%), Positives = 125/223 (56%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP I WF+WLERN K G + RV+ ++ + L+ + + WKG + Sbjct: 1290 HIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMD 1349 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +AA +G F + HW K G++KLN+DGS S IGG+LRD G ++ Sbjct: 1350 IAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA-IGGLLRDHTGTLVF 1408 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL + L L + + K+WIE D+ V +Q+I + G Q+ Sbjct: 1409 GFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQY 1467 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 994 LL+ IRK + F ++ISHIFREGN+VAD L+ G + QN + F Sbjct: 1468 LLASIRKCLSFFSFRISHIFREGNQVADFLSNKGHTQQNLLVF 1510 Score = 71.2 bits (173), Expect(2) = 6e-48 Identities = 35/92 (38%), Positives = 54/92 (58%), Gaps = 4/92 (4%) Frame = +3 Query: 33 LCSHF----LTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLH 200 LCS F + S+S F+WR+ N +PVD +L D+GF LA KC CC S E+L+H+ Sbjct: 1190 LCSLFWHKSIPLSISFFLWRVFHNWIPVDLRLKDKGFHLASKCACCN--SEETLIHVLWD 1247 Query: 201 NTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N + +VW FAN + + +N++ + +W Sbjct: 1248 NPVAKQVWNFFANFFQIYVSNPQNVSQILWAW 1279 Score = 106 bits (265), Expect = 2e-20 Identities = 58/168 (34%), Positives = 95/168 (56%) Frame = +2 Query: 563 VHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILAFADGFMDCPDSTYAEISA 742 ++W +P +G++KLN+DG GGV RD +I F++ F +ST AE+ A Sbjct: 1536 IYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPY-NSTQAELMA 1594 Query: 743 LHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRKQMEGFDWKISHI 922 LH+ L L + ++WIE D++ ++Q++ G Q+LLS I + + G ++ISHI Sbjct: 1595 LHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISHI 1654 Query: 923 FREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1066 RE N+ AD L+ G + Q+ F + ++RG+ R+D+ NLP R Sbjct: 1655 HRESNQAADYLSNQGHTHQSLQVFSKAE--GELRGMIRLDKSNLPYVR 1700 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 155 bits (392), Expect(2) = 6e-48 Identities = 86/247 (34%), Positives = 134/247 (54%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P ILWF+W+ERN KH + G RV+ RV I L + K WKG Sbjct: 666 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 725 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +GI + + WHKP G++KLN+DGS + G GG+LRD G ++ Sbjct: 726 IAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVF 784 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL++ L L + ++WIE D+ +++L+ N G + ++ Sbjct: 785 GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 843 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 L+ +R+ + F ++ SHIFREGN+ AD LA G QN F ++RG+ R+DQ Sbjct: 844 LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLRLDQ 901 Query: 1046 LNLPSFR 1066 + P R Sbjct: 902 TSFPYVR 908 Score = 63.9 bits (154), Expect(2) = 6e-48 Identities = 30/94 (31%), Positives = 53/94 (56%) Frame = +3 Query: 15 NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194 NP+F + + + S F+WRL+ + +PV+ K+ +G LA +C CC+ S ES++H+ Sbjct: 564 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCK--SEESIMHVM 621 Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296 N + +VW +FA + + + IN I +W Sbjct: 622 WDNPVAMQVWNYFAKLFQICIINPCTINQIIGAW 655 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 156 bits (394), Expect(2) = 8e-47 Identities = 89/247 (36%), Positives = 132/247 (53%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI ++P I WF+WLERN KH + RV+ R+ + L L + WKG + Sbjct: 635 HIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTD 694 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A+ +G F+ ++W KP G+YKLN+DGS GG+LRD G +I Sbjct: 695 IASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRN-GHLAASGGILRDHTGKLIF 753 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ C +S AE+ AL + L L + +WIE D+ ++QLI H+ G ++ Sbjct: 754 GFSENIGLC-NSLQAELRALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRY 812 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IRK + ++ISHIFREGN+ AD LA G S QN + ++ G+ ++D+ Sbjct: 813 LLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLC--VITEAQGELHGMLKLDR 870 Query: 1046 LNLPSFR 1066 LNLP R Sbjct: 871 LNLPYVR 877 Score = 59.3 bits (142), Expect(2) = 8e-47 Identities = 25/80 (31%), Positives = 47/80 (58%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 S+S F+WR + N +PV+ ++ ++G LA KC CC S ESL+H+ N+ + +VW F Sbjct: 547 SISFFLWRALNNWIPVELRMKEKGIQLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFG 604 Query: 237 NMLHFSLPDTENINTFISSW 296 + + ++++ + +W Sbjct: 605 KFFQIYVLNPQHVSQILWAW 624 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 152 bits (384), Expect(2) = 3e-46 Identities = 91/245 (37%), Positives = 137/245 (55%), Gaps = 1/245 (0%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +I I WF+W+ERN KH G R+I R+ + L L K WKG + Sbjct: 1091 HIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLD 1150 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682 +A +G F + ++W KP +G+ KLN+DGS K+ + G GGVLRD GN+I Sbjct: 1151 IAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAG-GGVLRDHTGNLI 1209 Query: 683 LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862 F++ F +S AE+ ALH+ L L + ++WIE D+QV++Q+I ++ G + Q Sbjct: 1210 FGFSENF-GYQNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQ 1268 Query: 863 HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042 +LL IRK ++ +ISHI REGN+ AD L+K G + QN F + ++RG V+ Sbjct: 1269 YLLESIRKCLQVISVRISHIHREGNQAADFLSKHGHTHQNLHVF--TEAQGELRGRTLVN 1326 Query: 1043 QLNLP 1057 ++ P Sbjct: 1327 RVEHP 1331 Score = 61.2 bits (147), Expect(2) = 3e-46 Identities = 27/80 (33%), Positives = 48/80 (60%) Frame = +3 Query: 57 SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236 ++S F+WR + N LPV+ ++ +G LA KC CC+ S ESL+H+ + + +VW +F+ Sbjct: 1003 TVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCK--SEESLLHVLWESPVAQQVWNYFS 1060 Query: 237 NMLHFSLPDTENINTFISSW 296 + + +NI ++SW Sbjct: 1061 KFFQIYVHNPQNILQILNSW 1080 >gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao] Length = 1275 Score = 151 bits (381), Expect(2) = 9e-36 Identities = 86/229 (37%), Positives = 125/229 (54%), Gaps = 2/229 (0%) Frame = +2 Query: 314 AHTAHISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWK 493 A I ++P I WF+WLERN KH G RV+ R+ + L+ L Q+ WK Sbjct: 883 AKQGRIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWK 942 Query: 494 GFGNVAASFGIYFRISVVQKCIP--VHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDW 667 G ++AA + F++ Q+ P V+W KP G+YKLN+DGS GGVLRD Sbjct: 943 GDTDIAAMWRYNFQLK--QRAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAAS-GGVLRDH 999 Query: 668 QGNVILAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATG 847 +I F++ + +S AE+ ALH+ L L + K+WIE D+ ++QLI H+ G Sbjct: 1000 TSKLIFCFSEN-IGTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKG 1058 Query: 848 KWSYQHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 994 ++LL I+K + ++ISHIFREGN+ AD L+ G + QN F Sbjct: 1059 SHDIRYLLESIKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVF 1107 Score = 27.3 bits (59), Expect(2) = 9e-36 Identities = 14/39 (35%), Positives = 21/39 (53%) Frame = +3 Query: 90 NRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNT 206 N L + + ++G L KC CC S ESL+H+ N+ Sbjct: 845 NTLALSFGIEEKGIHLVSKCVCCN--SEESLMHVLWGNS 881 >ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 872 Score = 109 bits (273), Expect(2) = 1e-31 Identities = 76/242 (31%), Positives = 112/242 (46%), Gaps = 2/242 (0%) Frame = +2 Query: 353 ILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN--VAASFGI 526 ILW+IW RN+ + +S+ FS V V HI SS+L + + SFG Sbjct: 636 ILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQA--SSRLATGHMHNTIHDLCILKSFGA 693 Query: 527 YFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILAFADGFM 706 R + + + V WH P +G K+N DG+ GG G V R ++G + AFA + Sbjct: 694 CCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFRYYKGQFVGAFAS-HI 752 Query: 707 DCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRK 886 D P S A++ + A+ L + +W+E D +L I + W + Sbjct: 753 DIPSSIAAKVMVVITAIELAWVRDWKHVWLEVDFSTVLDYIRSPSLVPWQLRVRWLNCLY 812 Query: 887 QMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1066 ++ +K SHIFREGN+VAD LA G S V + D P + D L +P+FR Sbjct: 813 RISTMTFKSSHIFREGNRVADALANHGTSMSEEVWW--DVPPSFILSYYERDLLGMPNFR 870 Query: 1067 TR 1072 R Sbjct: 871 FR 872 Score = 55.1 bits (131), Expect(2) = 1e-31 Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 1/88 (1%) Frame = +3 Query: 6 SPTNPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLV 185 SP P PL S F+ P MS+ W++++ + L RG +L +C C + S ESL Sbjct: 522 SPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRRGVALVSRCEFCGN-STESLD 580 Query: 186 HLFLHNTHSHKVWMHFANMLHFSL-PDT 266 H+FLH + + VW HF + L P+T Sbjct: 581 HIFLHCSFAASVWNHFIYIFEIGLVPNT 608 >gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao] Length = 458 Score = 140 bits (353), Expect = 1e-30 Identities = 83/243 (34%), Positives = 126/243 (51%) Frame = +2 Query: 329 ISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGNV 508 IS +IP I WF+WLERN KH G RV+ + L ++ WK ++ Sbjct: 218 ISALIPLFICWFLWLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDI 277 Query: 509 AASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILA 688 AA + F +HW KP G+YKLN+DGS S GG+LRD G ++ Sbjct: 278 AAMWSFLFPPKHGTTPQIIHWVKPFTGEYKLNVDGSSRNCQSATS-GGLLRDHIGKLVFG 336 Query: 689 FADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHL 868 F++ C +S AE+ AL + L L + ++WIE D+ V++Q+I G ++L Sbjct: 337 FSENIGRC-NSLQAELRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYL 395 Query: 869 LSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQL 1048 L+ IRK + ++I HIFREGN+ A L+ G + QN + ++ G+ ++D+L Sbjct: 396 LTSIRKGLSSISYRILHIFREGNQAAYFLSNQGYTHQNLC--LITEAQGELHGMLKLDRL 453 Query: 1049 NLP 1057 NLP Sbjct: 454 NLP 456 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 137 bits (346), Expect = 7e-30 Identities = 82/247 (33%), Positives = 128/247 (51%) Frame = +2 Query: 326 HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505 HI +IP LWF+W+ERN KH + G + + WKG Sbjct: 2143 HIRTLIPIFTLWFLWVERNDAKHRNLG--------------------QQLLEWQWKGDKQ 2182 Query: 506 VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685 +A +GI F+ + WHKP G++KLN+DGS + G GGVLRD G +I Sbjct: 2183 IAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIF 2241 Query: 686 AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865 F++ + +S AE+ AL++ L L + ++WIE D+ +++L+ N G + ++ Sbjct: 2242 GFSEN-LGIQNSLKAELLALYRGLILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRY 2300 Query: 866 LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045 LL IR+ + F ++++HIFREGN+ AD LA G Q+ ++RG+ R+DQ Sbjct: 2301 LLGSIRQLLSHFSFRLTHIFREGNQAADFLANRGHEHQSLQVITVAQ--GKLRGMLRLDQ 2358 Query: 1046 LNLPSFR 1066 +LP R Sbjct: 2359 TSLPYVR 2365 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 111 bits (277), Expect(2) = 5e-29 Identities = 77/252 (30%), Positives = 126/252 (50%), Gaps = 5/252 (1%) Frame = +2 Query: 338 IIPCVILWFIWLERNKNKHESKGFSAYRV---IGRVEHHIYLLKSSKLFQKSTWKGFGNV 508 I+P I W +W R K+ K S YRV I + + + + +++W N+ Sbjct: 1203 ILPNFICWNLWKNRCAVKYGLKNSSIYRVQYGIFKNIMQVITIVFPSIPWQTSWNNLINI 1262 Query: 509 AASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILA 688 +++I +V+ W+KPDLG+YKLN DGS S G GG+LRD QG +I A Sbjct: 1263 VEQCKQHYKILIVK------WNKPDLGKYKLNTDGSALQNSGKIGGGGILRDNQGKIIYA 1316 Query: 689 FADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHL 868 F+ F + +AEI A L E + KI +E DS++L I+ N W Y+ L Sbjct: 1317 FSLPF-GFGTNNFAEIKAALHGLDWCEQHGYKKIELEVDSKLLCNWINSNINIPWRYEEL 1375 Query: 869 LSKIRKQMEGFD-WKISHIFREGNKVADGLAKLGCSSQNFVQFFAD-DFPRQVRGLARVD 1042 + +I + + D ++ HI+RE N AD L+K + + +F+ +RG ++ Sbjct: 1376 IQQIHQIIRKMDQFQCHHIYREANCTADLLSKWSHNLEILQKFYTTRQLKEPIRGSYLLE 1435 Query: 1043 QLNLPSFRTRTI 1078 ++ + +FR R + Sbjct: 1436 KMGVQNFRRRKL 1447 Score = 44.7 bits (104), Expect(2) = 5e-29 Identities = 19/89 (21%), Positives = 46/89 (51%) Frame = +3 Query: 60 MSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFAN 239 +S F+WR ++ +LP ++ L G +L+ C+CC + + + H+ ++ + +W +++ Sbjct: 1111 VSFFIWRALRGKLPTNENLQRIGKNLS-DCYCCYNKGKDDINHILINGNFAKYIWKIYSS 1169 Query: 240 MLHFSLPDTENINTFISSWKNFTPLHTLH 326 + LP + + W+N + +H Sbjct: 1170 AVGV-LPINTTLRDLLLQWRNQQYTNEVH 1197