BLASTX nr result
ID: Rehmannia25_contig00013918
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00013918
         (1286 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]  162   7e-44
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]  165   3e-43
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]  159   4e-43
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]  159   6e-43
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]  156   7e-43
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...  165   7e-43
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]  156   7e-43
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]  160   2e-42
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]  154   8e-42
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]  152   1e-41
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]  152   1e-41
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]  159   5e-41
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]  152   7e-41
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]  151   2e-40
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]  141   2e-35
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...  100   9e-27
gb|EOY13984.1| RNase H family protein [Theobroma cacao]              125   3e-26
gb|ABI34321.1| RNase H family protein [Solanum demissum]              93   8e-24
ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein A...
88 3e-23 ref|XP_004253443.1| PREDICTED: putative ribonuclease H protein A... 88 9e-23 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 162 bits (410), Expect(2) = 7e-44 Identities = 89/269 (33%), Positives = 144/269 (53%) Frame = +2 Query: 17 LSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLW 196 ++++PI S D W +PNG FS SAW+++++ V+ +F IW+K + + S FLW Sbjct: 525 IAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 584 Query: 197 RLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 RLL++ + V+LKM+++ L S+C CC ES+ H+ N ++W +FA++F + Sbjct: 585 RLLHDWIPVELKMKSKGLQLASRCRCCKSE-ESIMHVMWDNPVAMQVWNYFAKLFQICII 643 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 556 + +I+ + W + + HI ++P ILWF W+ERN +KH N+ ++ +V Sbjct: 644 NPCTINQIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVL 703 Query: 557 AHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGAT 736 ++ L L WKG IA + WHKP K+NVDG+ Sbjct: 704 KLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSA 763 Query: 737 KGLINQAGLGGVLRDHEGNILWICYGFAE 823 K N AG GG+LRDH G ++ +GF+E Sbjct: 764 KHSHNAAG-GGILRDHAGVMV---FGFSE 788 Score = 43.5 bits (101), Expect(2) = 7e-44 Identities = 25/93 (26%), Positives = 47/93 (50%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++S+ ++L + +L+ +R + + +FSH REGNQ AD +A+ G + Sbjct: 820 IEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHE 879 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 ++ + ++ G+ R DQ P R K Sbjct: 880 HQNLQVFTVAQ--GKLRGMLRLDQTSFPYVRFK 910 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 165 bits (417), Expect(2) = 3e-43 Identities = 87/269 (32%), Positives = 146/269 (54%) Frame = +2 Query: 17 LSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLW 196 ++++PI S D W +PNG FS SAW++ ++ V+ + IW+K + + S FLW Sbjct: 3117 IAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLW 3176 Query: 197 
RLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 RLL++ + V+LKM+++ F L S+C CC ESL H+ N ++W +FA++F + Sbjct: 3177 RLLHDWVPVELKMKSKGFQLASRCRCCKSE-ESLMHVMWDNPVANQVWSYFAKVFQIHII 3235 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 556 + +I+ + W ++ HI ++P ILWF W+ERN +KH N+ I+ ++ Sbjct: 3236 NPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKIL 3295 Query: 557 AHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGAT 736 + L+Q L W+G IA + + W+KP K+NVDG++ Sbjct: 3296 KLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSS 3355 Query: 737 KGLINQAGLGGVLRDHEGNILWICYGFAE 823 K + A GG+LRDH G+++ +GF+E Sbjct: 3356 KYNLQTAAGGGLLRDHTGSMI---FGFSE 3381 Score = 38.5 bits (88), Expect(2) = 3e-43 Identities = 22/93 (23%), Positives = 47/93 (50%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D+ Q++ + Q + +L+ I + ++ + SH REGNQ AD +++ G Sbjct: 3413 IEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGYT 3472 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 ++ ++ + ++ G+ R D++ L R K Sbjct: 3473 HQNLQVISQAE--GQLRGILRLDKINLAYVRFK 3503 Score = 154 bits (389), Expect(2) = 2e-39 Identities = 88/267 (32%), Positives = 134/267 (50%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 Q+P D W + NG+FS SAW++++Q + L W++ I S+S FLWR+ Sbjct: 1325 QIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRV 1384 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L N + V+L+M+ + L S+C CC ESL H+ N K++W FA+ F + Sbjct: 1385 LNNWIPVELRMKDKGIHLASKCVCCRSE-ESLIHVLWENPVAKQVWNFFAKSFQIYVSKP 1443 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 + I + W +T + HI ++P I WF WLERN +KH ++ +I ++ Sbjct: 1444 KHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKL 1503 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L L+ +L WKG IA+ + + W KP K+NVDG++K Sbjct: 1504 
LNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKS 1563 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 N AG GGVLRDH G + + F+E Sbjct: 1564 SQNAAG-GGVLRDHTGK---LAFAFSE 1586 Score = 37.0 bits (84), Expect(2) = 2e-39 Identities = 18/58 (31%), Positives = 33/58 (56%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFG 1083 +E D++ Q++ Q ++ +L+ IR+ S + + SH REGNQ AD +++ G Sbjct: 1618 IEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKG 1675 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 159 bits (401), Expect(2) = 4e-43 Identities = 90/269 (33%), Positives = 144/269 (53%) Frame = +2 Query: 17 LSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLW 196 ++++PI S D W +PNG+FS SAW+++++ V+ +F IW+K + ++S FLW Sbjct: 1864 IAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLW 1923 Query: 197 RLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 RLL++ + V+LKM+++ F L S+C CC ES+ H+ N ++W +F++ F + Sbjct: 1924 RLLHDWIPVELKMKSKGFQLASRCRCCKSE-ESIMHVMWDNPVATQVWNYFSKFFQILVI 1982 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 556 + +I+ L W + HI ++P LWF W+ERN +KH N+ I+ ++ Sbjct: 1983 NPCTINQILGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRIL 2042 Query: 557 AHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGAT 736 ++ L L WKG IA + WHKP K+NVDG+ Sbjct: 2043 KLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSA 2102 Query: 737 KGLINQAGLGGVLRDHEGNILWICYGFAE 823 K N AG GGVLRDH G ++ +GF+E Sbjct: 2103 KLSQNAAG-GGVLRDHAGVMV---FGFSE 2127 Score = 44.3 bits (103), Expect(2) = 4e-43 Identities = 26/93 (27%), Positives = 49/93 (52%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D+ S+ ++L ++ +L+ IR + + + SH REGNQ AD +A+ G + Sbjct: 2159 IEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHE 2218 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 +++ ++ ++ G+ R DQ LP R K Sbjct: 2219 
HQSLQVVTVAQ--GKLRGMLRLDQTSLPYVRFK 2249 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 159 bits (402), Expect(2) = 6e-43 Identities = 90/267 (33%), Positives = 134/267 (50%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 Q+P D W + NG FS+ SAW+ ++Q + LF IW++ I S+S FLWR+ Sbjct: 1568 QIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQRQTPNALFSLIWHRSIPLSISFFLWRV 1627 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L N + V+L+M+ + L S+C CC ESL H+ N ++W FA+ F + Sbjct: 1628 LNNWIPVELRMKDKGIHLASKCVCCRSE-ESLIHVLWENPVATQVWFFFAKSFQIYVSKP 1686 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 I + W +T + HI ++P I WF WLERN +KH ++ +I ++ Sbjct: 1687 NHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKL 1746 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L LY +L WKG IA+ + + W KP K+NVDG++K Sbjct: 1747 LNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKS 1806 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 +N AG GGVLRDH G + + F+E Sbjct: 1807 NLNAAG-GGVLRDHTGK---LAFAFSE 1829 Score = 43.5 bits (101), Expect(2) = 6e-43 Identities = 24/91 (26%), Positives = 49/91 (53%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++ Q++ Q ++ +L+ IR+ S + + SH REGNQ AD +++ G Sbjct: 1861 IEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQT 1920 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 +++ + + ++G+ + D+L LP R Sbjct: 1921 HQSLCVF--SEAQGELIGILKLDKLNLPYVR 1949 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 156 bits (395), Expect(2) = 7e-43 Identities = 86/267 (32%), Positives = 133/267 (49%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 QVP D W + NG FS SAW++++Q + L IW++ I S+S FLW+ Sbjct: 1745 QVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLWKT 1804 Query: 203 
LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L+N + V+L+M+ + L S+C CC+ ESL H+ N K++W FA++F + + Sbjct: 1805 LHNWIPVELRMKEKGIQLASKCVCCNSE-ESLIHVLWENPVAKQVWNFFAQLFQIYIWNP 1863 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 + + W + H +LP I WF WLERN +KH + +I + H Sbjct: 1864 RHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKH 1923 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 + LY +L WKG IA+ + W KP K+NVDG+++ Sbjct: 1924 CRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRN 1983 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 ++ A GGVLRDH G ++ +GF+E Sbjct: 1984 GLH-AATGGVLRDHTGKLI---FGFSE 2006 Score = 45.8 bits (107), Expect(2) = 7e-43 Identities = 25/91 (27%), Positives = 51/91 (56%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++ Q++ K+ + +L+ IR+ +S + + SH LREGNQ AD +++ G + Sbjct: 2038 IEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHK 2097 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 ++ + + ++ G+ + D+L LP R Sbjct: 2098 HQNLCVF--TEAQGQLHGMLKLDRLNLPYVR 2126 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 165 bits (417), Expect(2) = 7e-43 Identities = 91/267 (34%), Positives = 144/267 (53%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 ++PI D W + NG FSI SAW++++Q V+ + IW+K I ++S FLWR Sbjct: 952 KIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFLWRT 1011 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L+N L V+++M+A+ L S+C CC ESL H+ + +++W +F++ F + + Sbjct: 1012 LHNWLPVEVRMKAKGIQLASKCLCCKSE-ESLLHVLWESPVAQQVWNYFSKFFQIYVHNP 1070 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 ++I L W FT HI ++ I WF W+ERN +KH ++ II ++ Sbjct: 1071 QNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKI 1130 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L+ L+Q +L 
WKG L IA + + + W KP +K+NVDG++K Sbjct: 1131 LRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKD 1190 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 A GGVLRDH GN++ +GF+E Sbjct: 1191 EFQNAAGGGVLRDHTGNLI---FGFSE 1214 Score = 37.4 bits (85), Expect(2) = 7e-43 Identities = 18/70 (25%), Positives = 36/70 (51%) Frame = +1 Query: 898 SSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAH 1077 S +E D+ + Q++ + + +L+ IR ++++ SH REGNQ AD ++ Sbjct: 1242 SRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSK 1301 Query: 1078 FGCQYRTYHM 1107 G ++ H+ Sbjct: 1302 HGHTHQNLHV 1311 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 156 bits (395), Expect(2) = 7e-43 Identities = 88/270 (32%), Positives = 134/270 (49%) Frame = +2 Query: 26 VPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLL 205 +P D WI + NG+FS SAW+ +++ P +TL IW++ I S+S F+WR L Sbjct: 545 IPFDRTQQDVAYWILTSNGEFSTRSAWETIRKRQPHNTLGSLIWHRSIPLSISFFIWRAL 604 Query: 206 YNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTE 385 N + V+L+M+ + L S+C CC+ ESL H+ N K++W FA F + + + Sbjct: 605 NNWIPVELRMKEKGIHLASKCVCCNSE-ESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQ 663 Query: 386 SIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHL 565 + L W + HI +LP I WF WLERN +KH ++ ++ L Sbjct: 664 HVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLL 723 Query: 566 KLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKGL 745 + L+ +L WKG IA+ + V W KP K+NVDG+++ Sbjct: 724 RQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH- 782 Query: 746 INQAGLGGVLRDHEGNILWICYGFAEECDN 835 A GGVLRDH G ++ +GF+E N Sbjct: 783 GQHAASGGVLRDHTGKLI---FGFSENIGN 809 Score = 45.8 bits (107), Expect(2) = 7e-43 Identities = 25/91 (27%), Positives = 51/91 (56%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++++ Q++ ++ +L+ IR S++ + SH LREGNQVAD +++ G Sbjct: 837 IEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGHN 896 Query: 1090 
YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 ++ + + ++ G+ + D+L LP R Sbjct: 897 HQNLRVF--TEAQGKLHGMLKLDRLNLPYVR 925 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 160 bits (405), Expect(2) = 2e-42 Identities = 88/269 (32%), Positives = 143/269 (53%) Frame = +2 Query: 17 LSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLW 196 ++++PI + D W +PNG FS SAW+++++ V+ +F IW+K + + S FLW Sbjct: 1866 IAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 1925 Query: 197 RLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 RLL++ + V+LKM+++ L S+C CC ES+ H+ N ++W +FA++F + Sbjct: 1926 RLLHDWIPVELKMKSKGLQLASRCRCCKSE-ESIMHVMWDNPVAMQVWNYFAKLFQILII 1984 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 556 + +I+ + W + HI ++P ILWF W+ERN +KH N+ ++ +V Sbjct: 1985 NPCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVL 2044 Query: 557 AHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGAT 736 ++ L L WKG IA + WHKP K+NVDG+ Sbjct: 2045 KLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSA 2104 Query: 737 KGLINQAGLGGVLRDHEGNILWICYGFAE 823 K N AG GG+LRDH G ++ +GF+E Sbjct: 2105 KQSHNAAG-GGILRDHAGEMV---FGFSE 2129 Score = 40.4 bits (93), Expect(2) = 2e-42 Identities = 24/93 (25%), Positives = 46/93 (49%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++S+ ++L + +L+ +R + + +FSH REGNQ AD +A+ G + Sbjct: 2161 IEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHE 2220 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 ++ + ++ G+ DQ P R K Sbjct: 2221 HQNLQVFTVAQ--GKLRGMLCLDQTSFPYVRFK 2251 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 154 bits (389), Expect(2) = 8e-42 Identities = 83/261 (31%), Positives = 135/261 (51%), Gaps = 1/261 (0%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 Q+PI D W + NG+FS SAW+ ++ + L +W+K I S+S FLWR+ Sbjct: 1571 
QIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWRV 1630 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 +N + VD++++ + F L S+C CC+ ESL H+ N K++W FA F + Sbjct: 1631 FHNWIPVDIRLKEKGFHLASKCICCNSE-ESLIHVLWDNPIAKQVWNFFANSFQIYISKP 1689 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 +++ L W + HI ++P I WF WLERN +KH ++ ++ ++ Sbjct: 1690 QNVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKL 1749 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L+ L ++L + WKG A+ + + W KP P K+NVDG+++ Sbjct: 1750 LRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQ 1809 Query: 743 LINQ-AGLGGVLRDHEGNILW 802 NQ A +GGVLRDH G +++ Sbjct: 1810 --NQTAAIGGVLRDHTGTLVF 1828 Score = 44.7 bits (104), Expect(2) = 8e-42 Identities = 25/91 (27%), Positives = 48/91 (52%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 VE D++ Q++ Q ++ +L+ IR + + SH REGNQ AD +++ G Sbjct: 1864 VEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHT 1923 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 +++ H+ + ++ G+ + D+L LP R Sbjct: 1924 HQSLHVF--TEAQGKLYGMLKLDRLNLPYVR 1952 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 152 bits (383), Expect(2) = 1e-41 Identities = 86/266 (32%), Positives = 133/266 (50%) Frame = +2 Query: 26 VPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLL 205 +P D W + NG+FS SAW+ ++Q +TL IW++ I S+S F+WR L Sbjct: 1833 IPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSISFFIWRAL 1892 Query: 206 YNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTE 385 N + V+L+M+ + L S+C CC+ ESL H+ N K++W FA+ F + + + Sbjct: 1893 NNWIPVELRMKGKGIHLASKCVCCNSE-ESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPK 1951 Query: 386 SIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHL 565 + L W + HI +LP I WF WLERN +K+ + + I+ ++ L Sbjct: 1952 HVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLL 2011 Query: 566 
KLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKGL 745 + L +L WKG IA+ + V W KP K+NVDG+++ Sbjct: 2012 RQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH- 2070 Query: 746 INQAGLGGVLRDHEGNILWICYGFAE 823 A GGVLRDH G ++ +GF+E Sbjct: 2071 GQHAASGGVLRDHTGKLI---FGFSE 2093 Score = 46.2 bits (108), Expect(2) = 1e-41 Identities = 26/91 (28%), Positives = 50/91 (54%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D+++ Q+L ++ +L+ IR S++ + SH REGNQVAD +++ G Sbjct: 2125 IEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHN 2184 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 ++ H+ + ++ G+ + D+L LP R Sbjct: 2185 HQNLHVF--TEAQGKLHGMLKLDRLNLPYVR 2213 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 152 bits (384), Expect(2) = 1e-41 Identities = 85/267 (31%), Positives = 132/267 (49%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 QVP D W + NG FS SA ++++Q + L IW++ I S+S FLW+ Sbjct: 749 QVPFDKSREDVAYWTLTSNGDFSTRSAGEMIRQRQTSNALCSFIWHRSIPLSISFFLWKT 808 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L+N + V+L+M+ + L S+C CC+ ESL H+ N K++W FA++F + + Sbjct: 809 LHNWIPVELRMKEKGIQLASKCVCCNSE-ESLIHVLWENPVAKQVWNFFAKLFQIYILNP 867 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 + + W + H +LP I WF WLERN +KH + +I + H Sbjct: 868 RHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKH 927 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 + LY +L WKG IA+ + W KP K+NVDG+++ Sbjct: 928 CRQLYDGSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRN 987 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 ++ A GGVLRDH G ++ +GF+E Sbjct: 988 GLH-AATGGVLRDHTGKLI---FGFSE 1010 Score = 45.8 bits (107), Expect(2) = 1e-41 Identities = 24/91 (26%), Positives = 52/91 (57%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D+++ Q++ K+ + +L+ IR+ +S + + 
SHT REGN+ AD +++ G + Sbjct: 1042 IEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHK 1101 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFR 1182 ++ + + ++ G+ + D+L LP R Sbjct: 1102 HQNLCVF--TEAQGQLHGMLKLDRLNLPYVR 1130 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 159 bits (403), Expect(2) = 5e-41 Identities = 86/267 (32%), Positives = 140/267 (52%) Frame = +2 Query: 23 QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 ++PI S D W +PNG FS SAW++++ + +F IW+K + + S FLWRL Sbjct: 1831 KIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRL 1890 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L++ + V+LKM+ + F L S+C CC ESL H+ N ++W +FA++F + + Sbjct: 1891 LHDWIPVELKMKTKGFQLASRCRCCKSE-ESLMHVMWKNPVANQVWSYFAKVFQIQIINP 1949 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 +I+ + W ++ HI ++P LWF W+ERN +KH N+ ++ ++ Sbjct: 1950 CTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKL 2009 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L L+Q L W+G IA + + W KP +K+NVDG+ K Sbjct: 2010 LHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKH 2069 Query: 743 LINQAGLGGVLRDHEGNILWICYGFAE 823 A GG+LRDH G+++ +GF+E Sbjct: 2070 NPQSAAGGGLLRDHTGSMI---FGFSE 2093 Score = 36.6 bits (83), Expect(2) = 5e-41 Identities = 22/97 (22%), Positives = 48/97 (49%) Frame = +1 Query: 898 SSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAH 1077 S +E D+ Q++ + Q + +L+ I + ++ + SH REGNQ AD +++ Sbjct: 2121 SRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSN 2180 Query: 1078 FGCQYRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 G ++ ++ + ++ G+ R +++ L R K Sbjct: 2181 QGHTHQNLQVISQAE--GQLRGILRLEKINLAYVRFK 2215 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 152 bits (384), Expect(2) = 7e-41 Identities = 87/268 (32%), Positives = 137/268 (51%), Gaps = 1/268 (0%) Frame = +2 Query: 23 
QVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRL 202 Q+P D W + +G+FS SAW+ V+Q +TL IW+K I ++S FLWR+ Sbjct: 631 QIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQRQSPNTLCSFIWHKSIPLTISFFLWRV 690 Query: 203 LYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 382 L N + V+L+++ + F L S+C CC+ ESL H+ N K++W FA F + + Sbjct: 691 LNNWIPVELRLKEKGFHLASKCVCCNSE-ESLIHVLWDNPVAKQVWNFFADFFQINISNP 749 Query: 383 ESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAH 562 + + + W F HI ++P I WF WLERN +KH ++ ++ ++ Sbjct: 750 QHVSQIIWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKV 809 Query: 563 LKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKG 742 L+ L +L WKG IA+ + + W KP K+NVDG+++ Sbjct: 810 LRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRH 869 Query: 743 LINQ-AGLGGVLRDHEGNILWICYGFAE 823 NQ A GG+LRDH G ++ +GF+E Sbjct: 870 --NQSAATGGLLRDHTGTLV---FGFSE 892 Score = 43.5 bits (101), Expect(2) = 7e-41 Identities = 26/89 (29%), Positives = 50/89 (56%), Gaps = 1/89 (1%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASL-NIQFSHTLREGNQVADAVAHFGC 1086 +E D++ + Q++ Q K+ +L+ IR KC S + + SH REGNQ AD +++ G Sbjct: 924 IEMDALVVIQMIQQSKKGSHDIRYLLASIR-KCLSFFSFRISHIFREGNQAADFLSNKGH 982 Query: 1087 QYRTYHMLRPPDIPRRILGLARTDQLELP 1173 ++ ++ + ++ G+ + D+L LP Sbjct: 983 THQNLQVI--SEAQGKLHGMLKLDRLNLP 1009 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 151 bits (381), Expect(2) = 2e-40 Identities = 83/266 (31%), Positives = 134/266 (50%) Frame = +2 Query: 26 VPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLL 205 +P D W + NG+F+ SAW+ ++Q + L IW++ I S+S FLWR L Sbjct: 497 IPFNRTQQDVAYWTLTSNGEFATWSAWETIRQRKSSNALCSFIWHRSIPLSISFFLWRAL 556 Query: 206 YNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTE 385 N + V+L+M+ + L S+C CC+ ESL H+ N K++W F + F + + + Sbjct: 557 NNWIPVELRMKEKGIQLASKCVCCNSE-ESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQ 615 Query: 386 SIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHL 565 + L W + HI 
+LP I WF WLERN +KH + + + ++ ++ L Sbjct: 616 HVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLL 675 Query: 566 KLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKGL 745 + L +L+ WKG IAS + + W KP K+NVDG+++ Sbjct: 676 RQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRN- 734 Query: 746 INQAGLGGVLRDHEGNILWICYGFAE 823 + A GG+LRDH G ++ +GF+E Sbjct: 735 GHLAASGGILRDHTGKLI---FGFSE 757 Score = 42.7 bits (99), Expect(2) = 2e-40 Identities = 24/93 (25%), Positives = 50/93 (53%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAVAHFGCQ 1089 +E D++++ Q++ ++ +L+ IR + ++ + SH REGNQ AD +A+ G Sbjct: 789 IEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLANEGHS 848 Query: 1090 YRTYHMLRPPDIPRRILGLARTDQLELPSFRRK 1188 ++ ++ + + G+ + D+L LP R K Sbjct: 849 HQNLCVI--TEAQGELHGMLKLDRLNLPYVRFK 879 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 141 bits (355), Expect(2) = 2e-35 Identities = 82/274 (29%), Positives = 135/274 (49%), Gaps = 1/274 (0%) Frame = +2 Query: 5 WAATLSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMS 184 W +P D W + NG+FS SAW+ ++ + L W+K I S+S Sbjct: 1145 WMGDQPLIPFDRSQDDIAYWALTSNGEFSTWSAWEALRLRQSPNVLCSLFWHKSIPLSIS 1204 Query: 185 IFLWRLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFN 364 FLWR+ +N + VDL+++ + F L S+C CC+ E+L H+ N K++W FA F Sbjct: 1205 FFLWRVFHNWIPVDLRLKDKGFHLASKCACCNSE-ETLIHVLWDNPVAKQVWNFFANFFQ 1263 Query: 365 CTLPHTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHII 544 + + +++ L W + HI ++P I WF WLERN +K ++ ++ Sbjct: 1264 IYVSNPQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVV 1323 Query: 545 CQVEAHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNV 724 ++ L+ L ++L WKG + IA+ + W K K+NV Sbjct: 1324 WKIMKLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNV 1383 Query: 725 DGATKGLINQ-AGLGGVLRDHEGNILWICYGFAE 823 DG+++ NQ A +GG+LRDH G ++ +GF+E Sbjct: 1384 DGSSRQ--NQSAAIGGLLRDHTGTLV---FGFSE 1412 Score = 36.6 bits (83), Expect(2) = 2e-35 
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Frame = +1 Query: 910 VETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASL-NIQFSHTLREGNQVADAVAHFG 1083 +E D++ Q++ Q ++ +L+ IR KC S + + SH REGNQVAD +++ G Sbjct: 1444 IEMDALVAIQMIQQSQKGSHDIQYLLASIR-KCLSFFSFRISHIFREGNQVADFLSNKG 1501 >ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 885 Score = 100 bits (249), Expect(2) = 9e-27 Identities = 63/257 (24%), Positives = 117/257 (45%), Gaps = 6/257 (2%) Frame = +2 Query: 50 DAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLLYNRLLVDL 229 D + W+ + G F++ SAW++ + V IWNK + ++ F+WR+ R+ D Sbjct: 491 DVVWWMANAQGIFTVKSAWQITRNKQEVRRDCEVIWNKELPFKINFFMWRVWKRRIATDD 550 Query: 230 KMQARNFSLTSQCYCCS-CHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQ 406 ++ ++ S+C+CC E++ HLF K+W +FA + + + Sbjct: 551 NLKKMRINIVSRCWCCDRKKEETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQLIIS 610 Query: 407 FWSN-FTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQS 583 +W + TP I +P +I+W W RN KH++ S+ ++ V ++ + +S Sbjct: 611 WWKHEATP--KLQGIYKAIPAIIMWTLWKRRNALKHDS-SISWERMVEMVIEVVRKMVKS 667 Query: 584 H---MLNAN-VWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKGLIN 751 + N W+ + + Y I + V W P H VK N DGA +G Sbjct: 668 QFPWIKNMRWTWQAIIQRLNQY------KRKIHVLRVTWKPPDDHYVKSNTDGACRGNPG 721 Query: 752 QAGLGGVLRDHEGNILW 802 + G +RD +G++++ Sbjct: 722 LSSFGFCIRDDKGDLIY 738 Score = 48.1 bits (113), Expect(2) = 9e-27 Identities = 26/106 (24%), Positives = 53/106 (50%), Gaps = 1/106 (0%) Frame = +1 Query: 868 SLNTLSSTGYSSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLRE 1047 +L S+ ++ETDS+SL +++ Q + W V +IR + + +H RE Sbjct: 760 ALRECSNRKMQKVIIETDSLSLKKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFRE 819 Query: 1048 GNQVADAVAHFGCQYRTYHMLRP-PDIPRRILGLARTDQLELPSFR 1182 GN +AD++A+ + + H ++P + + D+ ++P+ R Sbjct: 820 GNSLADSLANIAIESQAEHQYSCFQELPLKERRILNIDKAQIPTLR 865 >gb|EOY13984.1| RNase H family protein [Theobroma cacao] Length = 429 Score = 125 bits (315), Expect = 3e-26 Identities = 74/262 (28%), Positives = 
121/262 (46%) Frame = +2 Query: 17 LSQVPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLW 196 + ++PI W + +GKF+ SAW++V+Q ++ +F IW++ I S+S FLW Sbjct: 74 IMKIPIDESRIYEAYWAPTSDGKFTTKSAWEIVRQRHSINFVFYSIWHRSIPLSISFFLW 133 Query: 197 RLLYNRLLVDLKMQARNFSLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 RL + + VDL+++++ F L +C C+ ESL H+ ++W +FA+ F + Sbjct: 134 RLFQDWIPVDLRLKSKGFQLVFKCQHCNSK-ESLFHVMWECPLASQVWNYFAKFFQIYII 192 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 556 H +SI+ + W + +T HI ++P I WF W+ERN +KH N+ Sbjct: 193 HRKSIYQIIWAWLFSSDYTKKGHIHILIPLFIFWFLWVERNDAKHRNLGM---------- 242 Query: 557 AHLKLLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGAT 736 Y K+ S W KP K+NVDG + Sbjct: 243 --------------------------YPNRKPSLPKPKVFS--WQKPLTGEFKLNVDGGS 274 Query: 737 KGLINQAGLGGVLRDHEGNILW 802 K A G +LRDH G +++ Sbjct: 275 KYDCQSAAGGRLLRDHTGTLIF 296 >gb|ABI34321.1| RNase H family protein [Solanum demissum] Length = 945 Score = 93.2 bits (230), Expect(2) = 8e-24 Identities = 67/270 (24%), Positives = 118/270 (43%), Gaps = 6/270 (2%) Frame = +2 Query: 47 ADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLLYNRLLVD 226 +D WI S NG F+ SA+ + + KIW+ MS WRL+ N+L Sbjct: 535 SDYAIWIPSENGHFTTKSAYVDCSNTREKNDMRNKIWHGKFPFKMSFLTWRLVQNKLPFY 594 Query: 227 LKMQARNFSLTSQCYCC-SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFL 403 + ++ S C CC + E++ H+FL +D +W+ F + +I++ Sbjct: 595 DTVGKFVDNIDSNCVCCKNMKTETINHVFLNSDVASYLWKKFGGTLGIDTRASSTINLLK 654 Query: 404 QFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYW-----HIICQVEAHLK 568 +W+ T + H+ I LP LI W W R K+ + K ++ H+ ++ L+ Sbjct: 655 TWWNVQTHNSIHNVIIHTLPILIFWEIWKRRCACKYGDQKKMWYRTMENHVWWNLKMSLR 714 Query: 569 LLYQSHMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVDGATKGLI 748 + + S + N W+ L+ K V W+ P + VK+N DG+ Sbjct: 715 MTFPSFEI-GNSWRDLLNKVESLRPYP------KWKIVHWNTPNINCVKINTDGSFSS-- 765 Query: 749 NQAGLGGVLRDHEGNILWICYGFAEECDNS 838 AGLG ++RDH ++ + + C ++ Sbjct: 766 GNAGLGWIVRDHTRRMI-MAFSIPSSCSSN 794 Score = 45.4 bits (106), Expect(2) = 8e-24 
Identities = 28/98 (28%), Positives = 49/98 (50%), Gaps = 1/98 (1%) Frame = +1 Query: 892 GYSSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIRIKCASLNIQFSHTLREGNQVADAV 1071 G+ + +E DS + ++ + ++ +V I A +N + +H RE NQVADA+ Sbjct: 813 GFHNCYLELDSKLVVDMVRNGQATNLKIKGVVEDIIQVVAKMNCEVNHCYREANQVADAL 872 Query: 1072 AHFGCQYRTYHMLRP-PDIPRRILGLARTDQLELPSFR 1182 A HM DIP+ +G + D++++PS R Sbjct: 873 AKHAVISNEAHMYHDWRDIPKLAVGSYQLDKMQMPSIR 910 >ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 775 Score = 88.2 bits (217), Expect(2) = 3e-23 Identities = 61/264 (23%), Positives = 109/264 (41%), Gaps = 6/264 (2%) Frame = +2 Query: 26 VPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLL 205 +P D W GKFS SAW+ ++ + +W+ FI S LWR+L Sbjct: 380 IPQQQYQQDQPVWKLHSQGKFSCHSAWEEIRNKKAKNRFLSFLWHNFIPFKTSFLLWRIL 439 Query: 206 YNRLLVDLKMQARNFSLT-SQCYCC--SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 ++ + K+ NF + S CYCC ++S+ H+F + ++W+ FA Sbjct: 440 KGKIPTNEKLT--NFGIEPSPCYCCVDRAGMDSINHIFNTGNFAGRVWKSFAAGAGLQQD 497 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQV- 553 Q+W+ + H + P I W W R K+ + + V Sbjct: 498 QQTLQARLKQWWTAKSCNAGHQLLLQATPIFICWNLWKNRCACKYGGKATNISRVKYAVY 557 Query: 554 EAHLKLLYQS--HMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVD 727 + + K++ + H+ W +H + K+ V+W++PP +K+N D Sbjct: 558 KDNFKMMKNAFPHIQWPAHWTALIHTSE------KCKHDTKVCQVVWNRPPEEWIKINTD 611 Query: 728 GATKGLINQAGLGGVLRDHEGNIL 799 G+ G GG++R+ EG ++ Sbjct: 612 GSALTNPGNIGAGGIIRNKEGKLV 635 Score = 48.5 bits (114), Expect(2) = 3e-23 Identities = 29/109 (26%), Positives = 55/109 (50%), Gaps = 7/109 (6%) Frame = +1 Query: 892 GYSSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIR-IKCASLNIQFSHTLREGNQVADA 1068 GY + ++E DS + Q ++++ HW+ + + +++ + + N + H RE N VADA Sbjct: 666 GYRNIIMELDSQLIVQWISKKSVHHWSVSNQIERLQYLIMQTQNFKCQHIFREANWVADA 725 Query: 1069 VAHFGCQYRTYHMLRPP------DIPRRILGLARTDQLELPSFRRKVVK 1197 ++ ++H+ P +P+ R D L +PSFRR+ K Sbjct: 726 LSK-----HSHHITSPQLYFDSNQLPKEANAYYRMDLLNMPSFRRRKTK 769 
>ref|XP_004253443.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 775 Score = 88.2 bits (217), Expect(2) = 9e-23 Identities = 61/264 (23%), Positives = 110/264 (41%), Gaps = 6/264 (2%) Frame = +2 Query: 26 VPILHGSADAMSWIHSPNGKFSISSAWKVVKQPLPVSTLFPKIWNKFITPSMSIFLWRLL 205 +P D W GKFS SAW+ ++ + +W+ FI S LWR+L Sbjct: 380 IPQQQHQQDQPVWKLHSQGKFSCHSAWEEIRNKKAKNRFLSFLWHNFIPFKTSFLLWRIL 439 Query: 206 YNRLLVDLKMQARNFSLT-SQCYCC--SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLP 376 ++ + K+ NF + S CYCC ++S+ H+F + ++W+ FA Sbjct: 440 KGKIPTNEKLT--NFGIEPSPCYCCVDRAGMDSINHIFNTGNFAGRVWKSFAAGAGLQED 497 Query: 377 HTESIHIFLQFWSNFTPFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQV- 553 Q+W+ + H + P I W W R K+ + + V Sbjct: 498 QQTLQARLKQWWTAKSCNAGHQLLLQATPIFICWNLWKNRCACKYGGKATNISRVKYVVY 557 Query: 554 EAHLKLLYQS--HMLNANVWKGFLHIASGYXXXXXXXXXIKISSVLWHKPPPHLVKVNVD 727 + + K++ + H+ W +H + K+ V+W++PP +K+N D Sbjct: 558 KDNFKMMKNAFPHIQWPAHWTALIHTSE------KCKHDTKVCQVVWNRPPEEWIKINTD 611 Query: 728 GATKGLINQAGLGGVLRDHEGNIL 799 G+ + G GG++R+ EG ++ Sbjct: 612 GSALTNPGKIGAGGIIRNKEGKLV 635 Score = 47.0 bits (110), Expect(2) = 9e-23 Identities = 28/109 (25%), Positives = 54/109 (49%), Gaps = 7/109 (6%) Frame = +1 Query: 892 GYSSAMVETDSMSLYQVLTQQKQSHWTSLHLVYKIR-IKCASLNIQFSHTLREGNQVADA 1068 GY + ++E DS + Q ++++ HW+ + + +++ + + N + H +E N VADA Sbjct: 666 GYRNIIMELDSQLIVQWISKKSVHHWSVSNQIERLQYLIMQTQNFKCQHIFKEANWVADA 725 Query: 1069 VAHFGCQYRTYHMLRPP------DIPRRILGLARTDQLELPSFRRKVVK 1197 ++ +H+ P +P+ R D L +PSFRR+ K Sbjct: 726 LSK-----HNHHITSPQLYFDSNQLPKEANAYYRMDLLNMPSFRRRKTK 769