BLASTX nr result
ID: Rehmannia22_contig00027300
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00027300 (982 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]    168   3e-39
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    166   1e-38
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]    164   5e-38
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]    164   5e-38
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]    162   1e-37
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]    161   3e-37
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]    161   4e-37
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    158   3e-36
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]    157   6e-36
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]    155   2e-35
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]    155   2e-35
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...    155   2e-35
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    152   1e-34
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]    149   2e-33
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]    144   4e-32
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]    128   4e-27
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...    127   9e-27
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...    126   1e-26
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...    124   4e-26
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    124   6e-26

>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 168 bits (425), Expect = 3e-39 Identities = 108/324 (33%), Positives = 156/324 (48%), Gaps = 3/324 (0%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N +VW FA + + + H+S Sbjct: 1822 LASKCVCC------------NSEESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQI 1869 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + +K H VLLP + W++W ERN +H + R+I H + L Sbjct: 1870 IWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYD 1929 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L Q WKG D AT LG F + I W+KPS KLN+DGS ++ L AA Sbjct: 1930 GSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAA- 1988 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETALTYSYTQSIDHLWIETDSLI 715 GG++R+H I+ FS I GP L AE AL L + I+ LWIE D+L+ Sbjct: 1989 TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALV 2044 Query: 716 LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895 ++ G ++ RY L I L + +R++HI REGN+ AD+L++ G + Sbjct: 2045 AIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF 2104 Query: 896 TAISLPHSAKGIARLDQLEIPSFR 967 T G+ +LD+L +P R Sbjct: 2105 T--EAQGQLHGMLKLDRLNLPYVR 2126 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 166 bits (421), Expect = 1e-38 Identities = 104/321 (32%), Positives = 160/321 (49%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N+ +VW FA + + + + +H+S Sbjct: 1909 LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHI 1956 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + ++ HI LLP + W++W ERN ++ ++ + RI+ + ++ L Sbjct: 1957
LWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKD 2016 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L Q WKG D A F+ R+ + W+KPS+ KLN+DGS + H A Sbjct: 2017 GSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSR-HGQHAA 2075 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GG++R+H I+ FS I L AE AL L + I+ LWIE D+L Sbjct: 2076 SGGVLRDHTGKLIFGFSENIGTCNS-LQAELRALLRGLLLCKERHIEKLWIEMDALAAIQ 2134 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 ++ + G RY L I L++ +RI+HIHREGN+VADFL++ G + +T Sbjct: 2135 LLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVFT-- 2192 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 G+ +LD+L +P R Sbjct: 2193 EAQGKLHGMLKLDRLNLPYVR 2213 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 164 bits (415), Expect = 5e-38 Identities = 108/324 (33%), Positives = 155/324 (47%), Gaps = 3/324 (0%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC EE+L H+ N +VW FA + + HIS Sbjct: 1645 LASKCVCC------------RSEESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHISQI 1692 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + + HI +L+P + W++W ERN +H + R+I + + L Sbjct: 1693 IWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYA 1752 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L + WKG D AT G +F + I W KP KLN+DGS KS+L AAG Sbjct: 1753 GSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG 1812 Query: 545 IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715 GG++R+H +AFS G +P + LH AL L ++I +LWIE D+L+ Sbjct: 1813 -GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGLLLCKERNITNLWIEMDALV 1867 Query: 716 LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895 MV G RY L I L + +RI+HI+REGN+ ADFL++ G + + Sbjct: 1868 AVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF 1927 Query: 896 TAISLPHSAKGIARLDQLEIPSFR 967 + GI +LD+L +P R Sbjct: 1928 
S--EAQGELIGILKLDKLNLPYVR 1949 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 164 bits (415), Expect = 5e-38 Identities = 107/324 (33%), Positives = 152/324 (46%), Gaps = 3/324 (0%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N +VW FA + + + H+S Sbjct: 826 LASKCVCC------------NSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQI 873 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + +K H VLLP + W++W ERN +H + R+I H + L Sbjct: 874 IWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYD 933 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L Q WKG D A LG F + I W+KPS KLN+DGS ++ L AA Sbjct: 934 GSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA- 992 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETALTYSYTQSIDHLWIETDSLI 715 GG++R+H I+ FS I GP L AE AL L + I+ LWIE D+L Sbjct: 993 TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALA 1048 Query: 716 LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895 ++ G + RY L I L + +R++H REGNK AD+L++ G + Sbjct: 1049 AIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF 1108 Query: 896 TAISLPHSAKGIARLDQLEIPSFR 967 T G+ +LD+L +P R Sbjct: 1109 T--EAQGQLHGMLKLDRLNLPYVR 1130 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 162 bits (411), Expect = 1e-37 Identities = 102/322 (31%), Positives = 158/322 (49%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N+ +VW FA + + + + +H+S Sbjct: 621 LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHI 668 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + ++ HI LLP + W++W ERN +H + R++ + ++ L Sbjct: 669 LWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHD 728 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L Q WKG D A + R+ + W+KPS+ KLN+DGS + H A Sbjct: 
729 GSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSR-HGQHAA 787 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GG++R+H I+ FS I L AE AL L + I+ LWIE D+L + Sbjct: 788 SGGVLRDHTGKLIFGFSENIGNCNS-LQAELRALLRGLLLCKERHIEQLWIEMDALAVIQ 846 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 ++ + G RY L I L++ +RI+HI REGN+VADFL++ G + +T Sbjct: 847 LIPHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVFT-- 904 Query: 905 SLPHSAKGIARLDQLEIPSFRI 970 G+ +LD+L +P R+ Sbjct: 905 EAQGKLHGMLKLDRLNLPYVRL 926 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 161 bits (408), Expect = 3e-37 Identities = 100/318 (31%), Positives = 150/318 (47%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N +VW FA + + + + +H+S Sbjct: 708 LASKCVCC------------NSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQI 755 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + +K HI L+P + W++W ERN +H + R++ + ++ L Sbjct: 756 IWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQD 815 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L + WKG D A G R I W KP + KLN+DGS + H +A Sbjct: 816 GSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSR-HNQSAA 874 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GG++R+H ++ FS I S L AE AL L ++I+ LWIE D+L++ Sbjct: 875 TGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQ 933 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 M+ G RY L I L FRI+HI REGN+ ADFL++ G + Sbjct: 934 MIQQSKKGSHDIRYLLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHT--HQNLQVIS 991 Query: 905 SLPHSAKGIARLDQLEIP 958 G+ +LD+L +P Sbjct: 992 EAQGKLHGMLKLDRLNLP 1009 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 161 bits (407), Expect = 4e-37 Identities = 105/322 (32%), Positives = 158/322 (49%), Gaps = 1/322 (0%) Frame = +2 Query: 5 
LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC + EE+L H+ N+ +VW F + + + + +H+S Sbjct: 573 LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQI 620 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + +K HI LLP + W++W ERN +H + + R++ + ++ L Sbjct: 621 LWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLD 680 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKS-HLGAA 541 L WKG D A+ G F+ R+ I W+KP + KLN+DGS ++ HL A+ Sbjct: 681 GSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRNGHLAAS 740 Query: 542 GIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILC 721 GGI+R+H I+ FS I L AE AL L + I++LWIE D+L + Sbjct: 741 --GGILRDHTGKLIFGFSENIGLCNS-LQAELRALLRGLLLCKERHIENLWIEMDALAVI 797 Query: 722 NMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTA 901 ++ + G RY L I L +RI+HI REGN+ AD+LA+ G S T Sbjct: 798 QLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLCVIT- 856 Query: 902 ISLPHSAKGIARLDQLEIPSFR 967 G+ +LD+L +P R Sbjct: 857 -EAQGELHGMLKLDRLNLPYVR 877 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 158 bits (400), Expect = 3e-36 Identities = 102/321 (31%), Positives = 156/321 (48%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKC+CC + EE+L H+ N +VW FA + + +++S Sbjct: 1648 LASKCICC------------NSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQI 1695 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + +K HI +L+P + W++W ERN +H + R++ + ++ L Sbjct: 1696 LWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQD 1755 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L + WKG D AT GL +R+ + W KP KLN+DGS + + AA Sbjct: 1756 GYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQNQTAA- 1814 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 IGG++R+H ++ FS I S L AE AL L ++I+ LW+E D+L+ Sbjct: 1815 
IGGVLRDHTGTLVFDFSENIGPSNS-LQAELRALLRGLLLCKERNIEKLWVEMDALVAIQ 1873 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 M+ G RY L I L+ FRI+HI REGN+ ADFL++ G + +T Sbjct: 1874 MIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVFT-- 1931 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 G+ +LD+L +P R Sbjct: 1932 EAQGKLYGMLKLDRLNLPYVR 1952 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 157 bits (397), Expect = 6e-36 Identities = 101/321 (31%), Positives = 151/321 (47%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LAS+C CC EE++ H+ N M+VW +FA + + + I+ Sbjct: 1945 LASRCRCC------------KSEESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQI 1992 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + + HI L+P +LW++W ERN +H N R++ V IQ LS Sbjct: 1993 IGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 2052 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 + WKG A G+ F+ S + W KPS KLN+DGS K AAG Sbjct: 2053 GQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG 2112 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GGI+R+H + ++ FS + ++ L AE +AL L +I LWIE D++ + Sbjct: 2113 -GGILRDHAGEMVFGFSENL-GTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIR 2170 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 ++ G + RY +V + L + FR +HI REGN+ ADFLA+ G +T Sbjct: 2171 LLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVA 2230 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 +G+ LDQ P R Sbjct: 2231 Q--GKLRGMLCLDQTSFPYVR 2249 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 155 bits (393), Expect = 2e-35 Identities = 100/321 (31%), Positives = 151/321 (47%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LAS+C CC EE++ H+ N M+VW +FA + + + I+ Sbjct: 604 LASRCRCC------------KSEESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQI 651 Query: 185 
LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W ++ + HI L+P +LW++W ERN +H N R++ V IQ LS Sbjct: 652 IGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 711 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 + WKG A G+ + S + W KP++ KLN+DGS K AAG Sbjct: 712 GQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG 771 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GGI+R+H ++ FS + + L AE +AL L +I LWIE D++ + Sbjct: 772 -GGILRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIR 829 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 ++ G + RY +V + L + FR +HI REGN+ ADFLA+ G +T Sbjct: 830 LLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVA 889 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 +G+ RLDQ P R Sbjct: 890 Q--GKLRGMLRLDQTSFPYVR 908 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 155 bits (393), Expect = 2e-35 Identities = 99/291 (34%), Positives = 140/291 (48%), Gaps = 3/291 (1%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKCVCC EE+L H+ N +VW FA + + +HIS Sbjct: 1402 LASKCVCC------------RSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQI 1449 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + + HI +L+P + W++W ERN +H + R+I + + L Sbjct: 1450 IWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHA 1509 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L + WKG D AT G ++ I W KP KLN+DGS KS AAG Sbjct: 1510 GSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG 1569 Query: 545 IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715 GG++R+H +AFS G +P + LH AL L ++I +LWIE D+L+ Sbjct: 1570 -GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGLLLCKERNITNLWIEMDALV 1624 Query: 716 LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868 MV G RY L I L + +RI+HI+REGN+ ADFL++ G Sbjct: 1625 
AVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKG 1675 Score = 154 bits (390), Expect = 4e-35 Identities = 99/321 (30%), Positives = 148/321 (46%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LAS+C CC EE+L H+ N +VW +FA + + + I+ Sbjct: 3196 LASRCRCC------------KSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHI 3243 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 +S W + ++ HI L+P +LW++W ERN +H N RI+ + I L + Sbjct: 3244 ISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQ 3303 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 K Q W+G A G+ + + S + W KPS KLN+DGS K +L A Sbjct: 3304 GKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAA 3363 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GG++R+H I+ FS S+ L AE +AL L ++ LWIE D+ + Sbjct: 3364 GGGLLRDHTGSMIFGFSENF-GSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQ 3422 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 M+ G RY L I L FRI+HI REGN+ AD L++ G++ Sbjct: 3423 MINEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGYT--HQNLQVIS 3480 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 +GI RLD++ + R Sbjct: 3481 QAEGQLRGILRLDKINLAYVR 3501 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 155 bits (392), Expect = 2e-35 Identities = 91/291 (31%), Positives = 142/291 (48%), Gaps = 3/291 (1%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKC+CC EE+L H+ + +VW +F+ + + + + ++I Sbjct: 1029 LASKCLCC------------KSEESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQI 1076 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L+ W + + HI L+ + W++W ERN +H + RII + ++ L + Sbjct: 1077 LNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQ 1136 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 L WKG +D A G F + ++R I W KP +KLN+DGS K A Sbjct: 1137 
GGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAA 1196 Query: 545 IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715 GG++R+H + I+ FS GY + L AE +AL L ++ +WIE D+ + Sbjct: 1197 GGGVLRDHTGNLIFGFSENFGY----QNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQV 1252 Query: 716 LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868 + M+ N G + +Y L I L RI+HIHREGN+ ADFL+ G Sbjct: 1253 VIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSKHG 1303 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 152 bits (385), Expect = 1e-34 Identities = 102/321 (31%), Positives = 148/321 (46%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LAS+C CC EE++ H+ N +VW +F+ + + + + I+ Sbjct: 1943 LASRCRCC------------KSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQI 1990 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + + HI L+P LW++W ERN +H N RI+ + IQ LS Sbjct: 1991 LGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSL 2050 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 + WKG A G+ F+ S W KPS KLN+DGS K AAG Sbjct: 2051 GQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG 2110 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 GG++R+H ++ FS + + L AE +AL L +I LWIE D+ + Sbjct: 2111 -GGVLRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIR 2168 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904 ++ G + RY LV I L + FR++HI REGN+ ADFLA+ G T Sbjct: 2169 LLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVA 2228 Query: 905 SLPHSAKGIARLDQLEIPSFR 967 +G+ RLDQ +P R Sbjct: 2229 Q--GKLRGMLRLDQTSLPYVR 2247 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 149 bits (375), Expect = 2e-33 Identities = 93/288 (32%), Positives = 140/288 (48%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LASKC CC + EETL H+ N +VW FA + + + + +++S Sbjct: 1228 
LASKCACC------------NSEETLIHVLWDNPVAKQVWNFFANFFQIYVSNPQNVSQI 1275 Query: 185 LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 L W + +K HI L+P + W++W ERN + + R++ + ++ L Sbjct: 1276 LWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQD 1335 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 + + WKG +D A G F ++ W K S KLN+DGS + + AA Sbjct: 1336 GYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA- 1394 Query: 545 IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724 IGG++R+H ++ FS I S L AE AL L ++I+ LWIE D+L+ Sbjct: 1395 IGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLCKERNIEKLWIEMDALVAIQ 1453 Query: 725 MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868 M+ G +Y L I L FRI+HI REGN+VADFL++ G Sbjct: 1454 MIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKG 1501 Score = 82.8 bits (203), Expect = 2e-13 Identities = 56/185 (30%), Positives = 87/185 (47%), Gaps = 3/185 (1%) Frame = +2 Query: 422 GLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGY 601 GL++ + S I W +P KLN+DG K A GG+ R+H I+ FS Sbjct: 1522 GLRYEQDSHGHPKIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFS-- 1579 Query: 602 IPRSEGPLH---AESVALETALTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSL 772 + GP + AE +AL L +I +WIE D+ + M+ G+ +Y L Sbjct: 1580 --ENFGPYNSTQAELMALHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLL 1637 Query: 773 VQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAISLPHSAKGIARLDQLE 952 I L +RI+HIHRE N+ AD+L++ G + ++ +G+ RLD+ Sbjct: 1638 SFICQCLSGISYRISHIHRESNQAADYLSNQGHTHQSLQVFS--KAEGELRGMIRLDKSN 1695 Query: 953 IPSFR 967 +P R Sbjct: 1696 LPYVR 1700 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 144 bits (364), Expect = 4e-32 Identities = 95/322 (29%), Positives = 147/322 (45%), Gaps = 1/322 (0%) Frame = +2 Query: 5 LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184 LAS+C CC EE+L H+ N +VW +FA + + + I+ Sbjct: 1908 LASRCRCC------------KSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQI 1955 Query: 185 
LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364 + W + ++ HI L+P LW++W ERN +H N R++ + + L + Sbjct: 1956 ICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQ 2015 Query: 365 AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544 K Q W+G A G+ + + S + W KPS +KLN+DGS K + +A Sbjct: 2016 GKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAA 2075 Query: 545 IGGIIRNHE*DTIWAFS-GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILC 721 GG++R+H I+ FS + P+ L AE +AL L +I LWIE D+ + Sbjct: 2076 GGGLLRDHTGSMIFGFSENFGPQDS--LQAELMALHRGLLLCIEHNISRLWIEMDAKVAV 2133 Query: 722 NMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTA 901 M+ G RY L I L FRI+HI REGN+ AD L++ G + Sbjct: 2134 QMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHT--HQNLQVI 2191 Query: 902 ISLPHSAKGIARLDQLEIPSFR 967 +GI RL+++ + R Sbjct: 2192 SQAEGQLRGILRLEKINLAYVR 2213 >gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao] Length = 1275 Score = 128 bits (321), Expect = 4e-27 Identities = 86/289 (29%), Positives = 141/289 (48%), Gaps = 5/289 (1%) Frame = +2 Query: 74 ETLSHLFLHNTQVMKVWM-----HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITV 238 ET+ HNT + + H + +C ++E S+ W N+ +A++ I Sbjct: 836 ETIRQWQSHNTLALSFGIEEKGIHLVS--KCVCCNSEE-SLMHVLWGNS--VAKQGRIRT 890 Query: 239 LLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATR 418 LLP + W++W ERN +H ++ R++ + ++ L L Q WKG D A Sbjct: 891 LLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDIAAM 950 Query: 419 LGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSG 598 F+ R+ + W+KP + KLN+DGS ++ AA GG++R+H I+ FS Sbjct: 951 WRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAAS-GGVLRDHTSKLIFCFSE 1009 Query: 599 YIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQ 778 I + L AE AL L + I+ LWIE D+L + ++ + G RY L Sbjct: 1010 NI-GTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLES 1068 Query: 779 IANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAISLPHSAK 925 I L++ +RI+HI REGN+ ADFL++ G + +T P +++ Sbjct: 1069 
IKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVFTKAQGPPNSE 1117 >ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 885 Score = 127 bits (318), Expect = 9e-27 Identities = 87/325 (26%), Positives = 159/325 (48%), Gaps = 2/325 (0%) Frame = +2 Query: 2 SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISI 181 ++ S+C CC EET++HLF K+W +FA + + + Sbjct: 558 NIVSRCWCC----------DRKKEETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQL 607 Query: 182 FLSFWKN-TTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358 +S+WK+ TP Q I +P +++W +WK RN +H++ S S R+++ V ++ + Sbjct: 608 IISWWKHEATPKLQG--IYKAIPAIIMWTLWKRRNALKHDS-SISWERMVEMVIEVVRKM 664 Query: 359 SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538 K++ + N + + Q++R + + W+ P ++K N DG+ + + G Sbjct: 665 VKSQFPWIKNMRWTWQAIIQRLNQYKR--KIHVLRVTWKPPDDHYVKSNTDGACRGNPGL 722 Query: 539 AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718 + G IR+ + D I+A + I + + AE+VA+ TAL + + + IETDSL L Sbjct: 723 SSFGFCIRDDKGDLIYAKAKGIGIATN-MEAETVAILTALRECSNRKMQKVIIETDSLSL 781 Query: 719 CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYT 898 ++ + W + +I + +ITHI REGN +AD LA++ + +Y+ Sbjct: 782 KKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFREGNSLADSLANIAIESQAEHQYS 841 Query: 899 AI-SLPHSAKGIARLDQLEIPSFRI 970 LP + I +D+ +IP+ RI Sbjct: 842 CFQELPLKERRILNIDKAQIPTLRI 866 >ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 364 Score = 126 bits (317), Expect = 1e-26 Identities = 83/292 (28%), Positives = 137/292 (46%), Gaps = 1/292 (0%) Frame = +2 Query: 2 SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRC-TLPHTEHIS 178 +LAS+CV C E+L H+FL + +W + A LP Sbjct: 70 ALASRCVLC-----------GRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQN---L 115 Query: 179 IFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358 + L ++ Q I ++ LW+IWK RN RH+N + + + + H++ Sbjct: 116 VDLLYYGGVGRSHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKTA 175 Query: 359 SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538 SK L + N + + GL R R T + W P WIK+N DG+++ G Sbjct: 176 SKLALGCMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKTTGK 235 Query: 539 AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718 +G GGI R+ + AF+ + + AE +A+ A+ ++ + +H+W+E DS+I+ Sbjct: 236 SGYGGIFRDFHGSFLGAFASNL-EILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDSIIV 294 Query: 719 CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFS 874 N + + W R +R+ +FR +HI REGN+VAD LA++G S Sbjct: 295 LNFLQDPHLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLS 346 >ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 872 Score = 124 bits (312), Expect = 4e-26 Identities = 81/292 (27%), Positives = 133/292 (45%), Gaps = 1/292 (0%) Frame = +2 Query: 2 SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTL-PHTEHIS 178 +L S+C C + E+L H+FLH + VW HF L P+T Sbjct: 564 ALVSRCEFC-----------GNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPNTIAEV 612 Query: 179 IFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358 L + +P Q + ++ +LWYIW RN R ++ +FS + + V HIQ Sbjct: 613 FSLGLAMDRSP--QLKELWLICFTSILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQAS 670 Query: 359 SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538 S+ +HN + G R R ++W PS WIK+N DG++K G Sbjct: 671 SRLATGHMHNTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGI 730 Query: 539 AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718 G G + R ++ + AF+ +I + A+ + + TA+ ++ + H+W+E D + Sbjct: 731 GGFGAVFRYYKGQFVGAFASHIDIPSS-IAAKVMVVITAIELAWVRDWKHVWLEVDFSTV 789 Query: 719 CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFS 874 + + + W R + R+ F+ +HI REGN+VAD LA+ G S Sbjct: 790 LDYIRSPSLVPWQLRVRWLNCLYRISTMTFKSSHIFREGNRVADALANHGTS 841 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 124 bits (311), Expect = 6e-26 Identities = 92/274 (33%), Positives = 122/274 (44%) Frame = +2 Query: 146 RCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRI 325 RC +E SI W N + Q HI L+P LW++W ERN +H N Sbjct: 2118 RCRCCRSEE-SIIHVMWDNPVAV-QPGHIRTLIPIFTLWFLWVERNDAKHRNLGQ----- 2170 Query: 326 IKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLN 505 QLL WKG A G+ F+ S W KPS+ KLN Sbjct: 2171 --------QLLE-------WQWKGDKQIAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLN 2215 Query: 506 IDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSID 685 +DGS K AAG GG++R+H I+ FS + + L AE +AL L +I Sbjct: 2216 VDGSAKLSQNAAG-GGVLRDHAGVMIFGFSENLG-IQNSLKAELLALYRGLILCRDYNIR 2273 Query: 686 HLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASL 865 LWIE D+ + ++ G + RY L I L + FR+THI REGN+ ADFLA+ Sbjct: 2274 
RLWIEMDATSVIRLLQGNHRGPHAIRYLLGSIRQLLSHFSFRLTHIFREGNQAADFLANR 2333 Query: 866 GFSTLGSTKYTAISLPHSAKGIARLDQLEIPSFR 967 G T +G+ RLDQ +P R Sbjct: 2334 GHEHQSLQVITVAQ--GKLRGMLRLDQTSLPYVR 2365