BLASTX nr result
ID: Rehmannia23_contig00032294
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00032294 (760 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   118   1e-35
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   113   6e-34
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   117   2e-33
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   117   3e-33
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   110   5e-33
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   108   2e-32
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   110   2e-32
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   105   3e-32
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   115   4e-32
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   110   4e-32
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   111   4e-32
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   108   7e-32
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   103   3e-31
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   113   5e-31
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   112   3e-30
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    88   3e-22
gb|EOY13984.1| RNase H family protein [Theobroma cacao]               101   2e-19
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...    74   7e-16
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...
86 1e-14 ref|XP_004253338.1| PREDICTED: putative ribonuclease H protein A... 70 2e-14 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 118 bits (296), Expect(2) = 1e-35 Identities = 60/161 (37%), Positives = 89/161 (55%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG+FSL SAW++I + LFS +W+ + S+S FLWR+L N +PV+ +++D Sbjct: 1581 WALTSNGDFSLWSAWEAIRQRQTPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKD 1640 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ NP QVW F+ + + + +I Sbjct: 1641 KGIHLASKCVCC----------RSEESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHIS 1690 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + HI +IP I WF+WLERN+ KH + Sbjct: 1691 QIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRH 1731 Score = 58.5 bits (140), Expect(2) = 1e-35 Identities = 33/84 (39%), Positives = 45/84 (53%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ +IW++ L+ L L K WKG DIA+ +GF F + + W KP G Sbjct: 1736 PNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGE 1795 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K+ + G GGVLRD Sbjct: 1796 YKLNVDGSSKSNLNAAG-GGVLRD 1818 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 111 bits (278), Expect(2) = 6e-34 Identities = 58/161 (36%), Positives = 85/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG FS SAW+ I + L S W+ + S+S FLWR+L N +PV+ +++D Sbjct: 1338 WALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKD 1397 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ NP QVW F+ + + + ++I Sbjct: 1398 KGIHLASKCVCC----------RSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHIS 1447 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + HI +IP I WF+WLERN+ KH + Sbjct: 1448 QIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRH 1488 Score = 59.7 bits (143), Expect(2) = 6e-34 
Identities = 32/84 (38%), Positives = 45/84 (53%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ +IW++ L+ L L K WKG DIA+ +GF + + W KP G Sbjct: 1493 PNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGE 1552 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K++ + G GGVLRD Sbjct: 1553 YKLNVDGSSKSSQNAAG-GGVLRD 1575 Score = 113 bits (283), Expect(2) = 8e-31 Identities = 55/161 (34%), Positives = 87/161 (54%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+ NG+FS SAW+ P ++ +W+ + + S FLWRLL + +PV+ K++ Sbjct: 3132 WTPTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKS 3191 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LAS+C CC S+ H+ +NP +QVW +F+ + + I I Sbjct: 3192 KGFQLASRCRCC----------KSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTIN 3241 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 +SAW ++ HI ++P ILWF+W+ERN+ KH N Sbjct: 3242 HIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRN 3282 Score = 47.4 bits (111), Expect(2) = 8e-31 Identities = 27/84 (32%), Positives = 37/84 (44%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ I+WK+ + L K + W+G IA +G S + W KP G Sbjct: 3287 PNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGE 3346 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K GG+LRD Sbjct: 3347 FKLNVDGSSKYNLQTAAGGGLLRD 3370 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 117 bits (293), Expect(2) = 2e-33 Identities = 57/161 (35%), Positives = 88/161 (54%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+ NG+FS SAW+ I + P+F+ +W+ + + S FLWRLL + +PV+ K++ Sbjct: 1881 WTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKS 1940 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G+ LAS+C CC SI H+ +NP QVW +F+ + + I I Sbjct: 1941 KGLQLASRCRCC----------KSEESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTIN 1990 Query: 394 
LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + HI ++P ILWF+W+ERN+ KH N Sbjct: 1991 QIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRN 2031 Score = 52.0 bits (123), Expect(2) = 2e-33 Identities = 28/84 (33%), Positives = 40/84 (47%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ ++W+V + L + WKG IA +G F + W KP G Sbjct: 2036 PNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGE 2095 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K +H+ G GG+LRD Sbjct: 2096 FKLNVDGSAKQSHNAAG-GGILRD 2118 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 117 bits (294), Expect(2) = 3e-33 Identities = 57/161 (35%), Positives = 88/161 (54%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+ NG+FS SAW+ I + P+F+ +W+ + + S FLWRLL + +PV+ K++ Sbjct: 540 WTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKS 599 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G+ LAS+C CC SI H+ +NP QVW +F+ + + I I Sbjct: 600 KGLQLASRCRCC----------KSEESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTIN 649 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + HI ++P ILWF+W+ERN+ KH N Sbjct: 650 QIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRN 690 Score = 51.2 bits (121), Expect(2) = 3e-33 Identities = 28/84 (33%), Positives = 40/84 (47%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ ++W+V + L + WKG IA +G + W KP TG Sbjct: 695 PNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGE 754 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K +H+ G GG+LRD Sbjct: 755 FKLNVDGSAKHSHNAAG-GGILRD 777 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 110 bits (275), Expect(2) = 5e-33 Identities = 55/161 (34%), Positives = 85/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG+FS SAW+ I + S L S +W+ + S+S FLW+ L N +PV+ ++++ Sbjct: 1758 
WTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKE 1817 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ NP QVW F+ + + I ++ Sbjct: 1818 KGIQLASKCVCC----------NSEESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVS 1867 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + H ++P I WF+WLERN+ KH + Sbjct: 1868 QIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRH 1908 Score = 57.8 bits (138), Expect(2) = 5e-33 Identities = 31/81 (38%), Positives = 41/81 (50%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKL 72 +IW+ H L L + WKG DIA+ GF+F+ + + WKKP G KL Sbjct: 1916 VIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKL 1975 Query: 71 NCDGAKKTAHSHGGIGGVLRD 9 N DG+ + H GGVLRD Sbjct: 1976 NVDGSSRNG-LHAATGGVLRD 1995 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 108 bits (271), Expect(2) = 2e-32 Identities = 54/161 (33%), Positives = 86/161 (53%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W ++NG+FS+ SAW+ + + + +W+ + ++S FLWR L N LPV+ +++ Sbjct: 965 WALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKA 1024 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ +P QVW +FS + + +NIL Sbjct: 1025 KGIQLASKCLCC----------KSEESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNIL 1074 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 L++W F HI +I I WF+W+ERN+ KH + Sbjct: 1075 QILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRD 1115 Score = 57.0 bits (136), Expect(2) = 2e-32 Identities = 34/84 (40%), Positives = 43/84 (51%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P IIW++ L L L WKG LDIA +GFNF+ + + W KP G Sbjct: 1120 PDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGE 1179 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 +KLN DG+ K + GGVLRD Sbjct: 1180 LKLNVDGSSKDEFQNAAGGGVLRD 1203 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] 
Length = 926 Score = 110 bits (276), Expect(2) = 2e-32 Identities = 56/159 (35%), Positives = 86/159 (54%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W+ +SNG FS SAW++I + P L S +W+ + S+S F+WR L N +PV+ ++++ Sbjct: 557 WILTSNGEFSTRSAWETIRKRQPHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKE 616 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ N QVW F++ + I +++ Sbjct: 617 KGIHLASKCVCC----------NSEESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVS 666 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKH 278 L AW + HI ++P I WF+WLERN+ KH Sbjct: 667 HILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKH 705 Score = 55.1 bits (131), Expect(2) = 2e-32 Identities = 29/81 (35%), Positives = 42/81 (51%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKL 72 ++W++ L L L + WKG DIA+ + +N + + V W+KP TG KL Sbjct: 715 VVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKL 774 Query: 71 NCDGAKKTAHSHGGIGGVLRD 9 N DG+ + H GGVLRD Sbjct: 775 NVDGSSRHG-QHAASGGVLRD 794 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 105 bits (263), Expect(2) = 3e-32 Identities = 54/161 (33%), Positives = 84/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG+FS SA + I + S L S +W+ + S+S FLW+ L N +PV+ ++++ Sbjct: 762 WTLTSNGDFSTRSAGEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKE 821 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ NP QVW F+ + + I ++ Sbjct: 822 KGIQLASKCVCC----------NSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVS 871 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + H ++P I WF+WLERN+ KH + Sbjct: 872 QIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRH 912 Score = 59.7 bits (143), Expect(2) = 3e-32 Identities = 32/84 (38%), Positives = 41/84 (48%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P +IW+ H L L + WKG DIA+ GF+F + + WKKP G Sbjct: 917 
PDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGE 976 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ + H GGVLRD Sbjct: 977 YKLNVDGSSRNG-LHAATGGVLRD 999 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 115 bits (289), Expect(2) = 4e-32 Identities = 57/161 (35%), Positives = 85/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+ NG FS SAW+ I + P+F+ +W+ + ++S FLWRLL + +PV+ K++ Sbjct: 1879 WAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKS 1938 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LAS+C CC SI H+ +NP QVW +FS + + I Sbjct: 1939 KGFQLASRCRCC----------KSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTIN 1988 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 L AW + HI ++P LWF+W+ERN+ KH N Sbjct: 1989 QILGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRN 2029 Score = 49.3 bits (116), Expect(2) = 4e-32 Identities = 28/84 (33%), Positives = 38/84 (45%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ I+W++ + L + WKG IA +G F W KP G Sbjct: 2034 PNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGE 2093 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ K + + G GGVLRD Sbjct: 2094 FKLNVDGSAKLSQNAAG-GGVLRD 2116 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 110 bits (274), Expect(2) = 4e-32 Identities = 56/161 (34%), Positives = 83/161 (51%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG FS SAW+++ L S W+ + S+S FLWR+ N +PVD +L+D Sbjct: 1164 WALTSNGEFSTWSAWEALRLRQSPNVLCSLFWHKSIPLSISFFLWRVFHNWIPVDLRLKD 1223 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LASKC CC ++ H+ +NP QVW F++ + + +N+ Sbjct: 1224 KGFHLASKCACC----------NSEETLIHVLWDNPVAKQVWNFFANFFQIYVSNPQNVS 1273 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 L AW + HI +IP I WF+WLERN+ K + Sbjct: 1274 
QILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRH 1314 Score = 55.1 bits (131), Expect(2) = 4e-32 Identities = 32/81 (39%), Positives = 45/81 (55%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKL 72 ++WK+ L L+ + K WKG +DIA+ +GFNFS I + W K +G KL Sbjct: 1322 VVWKIMKLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKL 1381 Query: 71 NCDGAKKTAHSHGGIGGVLRD 9 N DG+ + S IGG+LRD Sbjct: 1382 NVDGSSRQNQS-AAIGGLLRD 1401 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 111 bits (278), Expect(2) = 4e-32 Identities = 55/161 (34%), Positives = 86/161 (53%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +S+G FS SAW+++ + L S +W+ + ++S FLWR+L N +PV+ +L++ Sbjct: 644 WALTSDGEFSTWSAWEAVRQRQSPNTLCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKE 703 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LASKC CC S+ H+ +NP QVW F+ + I +++ Sbjct: 704 KGFHLASKCVCC----------NSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVS 753 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW F HI +IP I WF+WLERN+ KH + Sbjct: 754 QIIWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRH 794 Score = 53.5 bits (127), Expect(2) = 4e-32 Identities = 31/81 (38%), Positives = 41/81 (50%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKL 72 ++WK+ L L+ L K WKG DIA+ +GF I + W KP TG KL Sbjct: 802 VVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKL 861 Query: 71 NCDGAKKTAHSHGGIGGVLRD 9 N DG+ + S GG+LRD Sbjct: 862 NVDGSSRHNQS-AATGGLLRD 881 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 108 bits (269), Expect(2) = 7e-32 Identities = 55/165 (33%), Positives = 85/165 (51%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG F+ SAW++I + S L S +W+ + S+S FLWR L N +PV+ ++++ Sbjct: 509 WTLTSNGEFATWSAWETIRQRKSSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKE 568 Query: 574 
RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ N QVW F + + +++ Sbjct: 569 KGIQLASKCVCC----------NSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVS 618 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHENANFS 260 L AW + HI ++P I WF+WLERN+ KH + + Sbjct: 619 QILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLN 663 Score = 56.2 bits (134), Expect(2) = 7e-32 Identities = 30/84 (35%), Positives = 41/84 (48%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P ++W++ L L L WKG DIAS +G F + + W+KP TG Sbjct: 664 PDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGE 723 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 KLN DG+ + H GG+LRD Sbjct: 724 YKLNVDGSSRNGHL-AASGGILRD 746 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 103 bits (258), Expect(2) = 3e-31 Identities = 53/165 (32%), Positives = 85/165 (51%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG FS SAW++I + L S +W+ + S+S F+WR L N +PV+ +++ Sbjct: 1845 WTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKG 1904 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +GI LASKC CC S+ H+ N QVW F+ + + +++ Sbjct: 1905 KGIHLASKCVCC----------NSEESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVS 1954 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHENANFS 260 L AW + HI ++P I WF+WLERN+ K+ ++ + Sbjct: 1955 HILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLN 1999 Score = 58.5 bits (140), Expect(2) = 3e-31 Identities = 31/81 (38%), Positives = 44/81 (54%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKL 72 I+W++ L L+ L + WKG DIA+ + +NF + + V W+KP TG KL Sbjct: 2003 IVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKL 2062 Query: 71 NCDGAKKTAHSHGGIGGVLRD 9 N DG+ + H GGVLRD Sbjct: 2063 NVDGSSRHG-QHAASGGVLRD 2082 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 113 bits (283), Expect(2) = 5e-31 Identities = 
59/161 (36%), Positives = 85/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W +SNG FS SAW++I L S LW+ + S+S FLWR+ N +PVD +L++ Sbjct: 1584 WSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKE 1643 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LASKC CC S+ H+ +NP QVW F++ + I + +N+ Sbjct: 1644 KGFHLASKCICC----------NSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVS 1693 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 L W + HI +IP I WF+WLERN+ KH + Sbjct: 1694 QILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRH 1734 Score = 48.1 bits (113), Expect(2) = 5e-31 Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 1/82 (1%) Frame = -3 Query: 251 IIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVL-WKKPDTGVVK 75 ++WK+ L L+ L K+ WKG D A+ +G FS T + +L W KP G K Sbjct: 1742 VVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGL-FSPPKTRAAPQILHWVKPVPGEHK 1800 Query: 74 LNCDGAKKTAHSHGGIGGVLRD 9 LN DG+ + + IGGVLRD Sbjct: 1801 LNVDGSSR-QNQTAAIGGVLRD 1821 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 112 bits (279), Expect(2) = 3e-30 Identities = 54/161 (33%), Positives = 85/161 (52%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W + NG+FS SAW+ I P+F+ +W+ + + S FLWRLL + +PV+ K++ Sbjct: 1844 WTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKT 1903 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LAS+C CC S+ H+ NP +QVW +F+ + + I I Sbjct: 1904 KGFQLASRCRCC----------KSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTIN 1953 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW ++ HI ++P LWF+W+ERN+ KH N Sbjct: 1954 QIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRN 1994 Score = 47.0 bits (110), Expect(2) = 3e-30 Identities = 27/84 (32%), Positives = 38/84 (45%) Frame = -3 Query: 260 PHHIIWKVEHHLSLLRFCKLFKAKNWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGV 81 P+ ++WK+ L L K + W+G IA +G S + W KP G Sbjct: 1999 
PNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGE 2058 Query: 80 VKLNCDGAKKTAHSHGGIGGVLRD 9 +KLN DG+ K GG+LRD Sbjct: 2059 LKLNVDGSCKHNPQSAAGGGLLRD 2082 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 88.2 bits (217), Expect(2) = 3e-22 Identities = 50/167 (29%), Positives = 74/167 (44%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+ NG FS SAW+ I + P+F+ +W+ + + S FLWRLL + +PV+ +++ Sbjct: 2051 WAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKS 2110 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G LAS+C CC SI H+ +NP Q Sbjct: 2111 KGFQLASRCRCC----------RSEESIIHVMWDNPVAVQPG------------------ 2142 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHENANFSLI 254 HI +IP LWF+W+ERN+ KH N L+ Sbjct: 2143 ----------------HIRTLIPIFTLWFLWVERNDAKHRNLGQQLL 2173 Score = 43.9 bits (102), Expect(2) = 3e-22 Identities = 24/59 (40%), Positives = 28/59 (47%) Frame = -3 Query: 185 WKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKLNCDGAKKTAHSHGGIGGVLRD 9 WKG IA +G F W KP G KLN DG+ K + + G GGVLRD Sbjct: 2177 WKGDKQIAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAG-GGVLRD 2234 >gb|EOY13984.1| RNase H family protein [Theobroma cacao] Length = 429 Score = 101 bits (252), Expect = 2e-19 Identities = 51/161 (31%), Positives = 82/161 (50%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W P+S+G F+ SAW+ + + +F +W+ + S+S FLWRL ++ +PVD +L+ Sbjct: 89 WAPTSDGKFTTKSAWEIVRQRHSINFVFYSIWHRSIPLSISFFLWRLFQDWIPVDLRLKS 148 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 +G L KC C S+ H+ P QVW +F+ + I ++I Sbjct: 149 KGFQLVFKCQHC----------NSKESLFHVMWECPLASQVWNYFAKFFQIYIIHRKSIY 198 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHEN 272 + AW + + HI +IP I WF+W+ERN+ KH N Sbjct: 199 QIIWAWLFSSDYTKKGHIHILIPLFIFWFLWVERNDAKHRN 239 >ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 872 Score = 74.3 bits (181), Expect(2) = 7e-16 Identities = 51/171 (29%), Positives = 76/171 (44%), Gaps = 5/171 (2%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 W SS G + A+ + + SP P LW+ F+ P MS+ W+++R + LQ Sbjct: 501 WQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQR 560 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILR-----HTIPE 410 RG+AL S+C C S+ H+FL+ VW HF I +TI E Sbjct: 561 RGVALVSRCEFC---------GNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPNTIAE 611 Query: 409 TENILLYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKHENANFSL 257 ++ L + QL I S ILW+IW RN + ++ FS+ Sbjct: 612 VFSLGLAMDRSPQLKELWLICFTS------ILWYIWHARNQIRFDSRTFSV 656 Score = 36.2 bits (82), Expect(2) = 7e-16 Identities = 20/54 (37%), Positives = 24/54 (44%) Frame = -3 Query: 173 LDIASSFGFNFSGTITSISIPVLWKKPDTGVVKLNCDGAKKTAHSHGGIGGVLR 12 L I SFG + V+W P G +K+N DGA K GG G V R Sbjct: 685 LCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFR 738 >ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum] Length = 1135 Score = 86.3 bits (212), Expect = 1e-14 Identities = 45/159 (28%), Positives = 75/159 (47%) Frame = -1 Query: 754 WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 WM S+ G F++ SAW+ + + +W + M+ FLWRL + R+ D L+ Sbjct: 872 WMGSTQGIFTVKSAWELMRHKQERRTDYQLIWTKDVPFKMNFFLWRLWKRRIATDDNLKR 931 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 I + S+CWCC ++ H+FL P +++W FS+ I Sbjct: 932 MKIQIVSRCWCC--------SETEEETMTHIFLTAPIANRLWRQFSNFAGIQIESMHLQQ 983 Query: 394 LYLSAWKQLTPFAHIVHISFIIPCLILWFIWLERNNTKH 278 L ++ WK + A + + +P +I+W +W RNN KH Sbjct: 984 LIINWWKH-SDNAKLKVVMRAMPTIIMWTLWKRRNNFKH 1021 >ref|XP_004253338.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 655 Score = 70.5 bits (171), Expect(2) = 2e-14 Identities = 47/161 (29%), Positives = 74/161 (45%), Gaps = 2/161 (1%) Frame = -1 Query: 754 
WMPSSNGNFSLSSAWKSITRHSPSPPLFSDLWNPFLTPSMSIFLWRLLRNRLPVDQKLQD 575 WMP+ G FS+SSAW+ I + + + +WN L ++ F+WR L+ +LP ++ LQ Sbjct: 333 WMPTETGIFSISSAWECIRKKRIIDNISTIIWNKHLPFKIAFFIWRALKGKLPTNEFLQR 392 Query: 574 RGIALASKCWCCXXXXXXXXXXXXXXSIAHLFLNNPKVHQVWMHFSSILRHTIPETENIL 395 G + S C CC I H+ +N +W ++ L IP N+ Sbjct: 393 IGSNI-SDCSCC--------YRKGKDDINHILINGNFAKYIWKIHAATL-GIIPVNTNLR 442 Query: 394 LYLSAWKQLTPFAHIVH--ISFIIPCLILWFIWLERNNTKH 278 L W+ + VH + I+P LI W +W R K+ Sbjct: 443 AQLLHWRN-QKVNNEVHKLLIHILPNLICWNLWKNRCAVKY 482 Score = 35.4 bits (80), Expect(2) = 2e-14 Identities = 19/60 (31%), Positives = 27/60 (45%) Frame = -3 Query: 188 NWKGMLDIASSFGFNFSGTITSISIPVLWKKPDTGVVKLNCDGAKKTAHSHGGIGGVLRD 9 NW +++I + + + S W KP G KLN DG+ G GG+LRD Sbjct: 516 NWNNLVNIIENCSQQYKIVLVS------WNKPAFGTYKLNTDGSAIQNSGKTGGGGILRD 569
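
The per-alignment summary fields in this report (Score, Expect, Identities, Positives, Gaps, Frame) can be extracted from the text with a regular expression. The following is a minimal sketch, assuming the legacy BLAST 2.2.x text layout seen above; the names `HSP_RE` and `parse_hsps` are hypothetical helpers introduced here for illustration and are not part of BLAST or any library.

```python
import re

# One HSP summary as printed by legacy BLAST text output, for example:
# "Score = 118 bits (296), Expect(2) = 1e-35 Identities = 60/161 (37%),
#  Positives = 89/161 (55%) Frame = -1"
# The "Gaps" field is optional and only appears for gapped alignments.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsps(text):
    """Return a list of dicts, one per HSP summary found in `text`."""
    hsps = []
    for m in HSP_RE.finditer(text):
        evalue = m.group("evalue")
        # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m.group("pos")),
            "align_len": length,
            # Recomputed from the fraction, so it is not limited to whole percent.
            "pct_identity": round(100.0 * ident / length, 1),
            "gaps": int(m.group("gaps") or 0),
            "frame": int(m.group("frame")),
        })
    return hsps


if __name__ == "__main__":
    # Two summaries copied from the report above (one ungapped, one gapped).
    sample = (
        "Score = 118 bits (296), Expect(2) = 1e-35 Identities = 60/161 (37%), "
        "Positives = 89/161 (55%) Frame = -1 "
        "Score = 74.3 bits (181), Expect(2) = 7e-16 Identities = 51/171 (29%), "
        "Positives = 76/171 (44%), Gaps = 5/171 (2%) Frame = -1"
    )
    for hsp in parse_hsps(sample):
        print(hsp)
```

Because the pattern tolerates arbitrary whitespace between fields, it should work on both the flattened text here and the original multi-line layout. It reads only the summary lines and does not attempt to recover the Query/Sbjct alignment rows.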