BLASTX nr result
ID: Rehmannia22_contig00040118
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00040118
         (529 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...    86   2e-22
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]    91   6e-22
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]    92   1e-21
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]    89   2e-21
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]    87   1e-20
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]    81   2e-19
ref|XP_002524702.1| nuclease, putative [Ricinus communis] gi|223...    74   3e-17
gb|EOX93822.1| Uncharacterized protein TCM_002766 [Theobroma cacao]    73   6e-17
ref|XP_003543094.1| PREDICTED: putative ribonuclease H protein A...    74   8e-17
ref|XP_002304990.2| hypothetical protein POPTR_0004s03265g [Popu...    73   1e-16
ref|XP_002317250.1| predicted protein [Populus trichocarpa]            74   2e-16
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]    90   3e-16
ref|XP_004146631.1| PREDICTED: putative ribonuclease H protein A...    77   3e-16
emb|CAN67514.1| hypothetical protein VITISV_012081 [Vitis vinifera]    70   6e-16
gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]    75   7e-16
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    88   1e-15
gb|EOY28460.1| Polynucleotidyl transferase [Theobroma cacao]           69   3e-15
ref|XP_002267980.2| PREDICTED: putative ribonuclease H protein A...    68   4e-15
gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]    85   9e-15
gb|EOX96781.1| Uncharacterized protein TCM_005952 [Theobroma cacao]    60   1e-14

>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 85.9 bits (211), Expect(2) = 2e-22
 Identities = 42/106 (39%), Positives = 70/106 (66%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GVLRDH G +I+GF+++ +SLQAE+ ALH+ L + + +WIE D+Q ++ +I
Sbjct: 1199 GVLRDHTGNLIFGFSENFGY-QNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMI 1257

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            + +++ Q++L ++ +Q I+++ISHI REGNQ AD L+ G
Sbjct: 1258 QNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSKHG 1303

 Score = 45.4 bits (106), Expect(2) = 2e-22
 Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
 Frame = +3

Query: 9    LLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKK--- 179
            L +GG + K+Q W+G L+ A H+G +F Q + + W KP G KLN+DG+ K
Sbjct: 1134 LFQGGLLCKWQ-WKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEF 1192

Query: 180  ANSIHGGI 203
            N+ GG+
Sbjct: 1193 QNAAGGGV 1200

>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 90.9 bits (224), Expect(2) = 6e-22
 Identities = 45/107 (42%), Positives = 69/107 (64%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            G+LRDH G +I+GF+++ S DSLQAE+ ALH+ L + + LWIE D++ + +I
Sbjct: 3366 GLLRDHTGSMIFGFSENFGS-QDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMI 3424

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIGF 526
            + R +++L + + GI+ +ISHIFREGNQ AD L+N G+
Sbjct: 3425 NEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGY 3471

 Score = 38.9 bits (89), Expect(2) = 6e-22
 Identities = 21/69 (30%), Positives = 33/69 (47%)
 Frame = +3

Query: 3    LHLLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKA 182
            +H L GK + W+G A +G+ + + + + + W KP G KLN+DG+ K
Sbjct: 3298 IHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKY 3357

Query: 183  NSIHGGIGG 209
            N GG
Sbjct: 3358 NLQTAAGGG 3366

 Score = 75.1 bits (183), Expect(2) = 4e-17
 Identities = 37/106 (34%), Positives = 65/106 (61%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GVLRDH G++ + F++++ SLQAE+ AL + L + + NLWIE D+ + ++
Sbjct: 1571 GVLRDHTGKLAFAFSENLGPL-PSLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMV 1629

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            + + +++L ++ ++ + +ISHI+REGNQ AD L+N G
Sbjct: 1630 QQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKG 1675

 Score = 38.1 bits (87), Expect(2) = 4e-17
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
 Frame = +3

Query: 3    LHLLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKA 182
            L+ L G + K W+G + A +G + + S + + W KP G KLN+DG+ K+
Sbjct: 1504 LNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKS 1563

Query: 183  --NSIHGGI 203
            N+ GG+
Sbjct: 1564 SQNAAGGGV 1572

>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 92.0 bits (227), Expect(2) = 1e-21
 Identities = 45/106 (42%), Positives = 71/106 (66%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           G+LRDH G++I+GF+++I C+ SLQAE+ AL + L + + NLWIE D+ ++ +I
Sbjct: 742 GILRDHTGKLIFGFSENIGLCN-SLQAELRALLRGLLLCKERHIENLWIEMDALAVIQLI 800

Query: 386 LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           H + + +++L ++ + I+ +ISHIFREGNQ AD LAN G
Sbjct: 801 QHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLANEG 846

 Score = 37.0 bits (84), Expect(2) = 1e-21
 Identities = 21/67 (31%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
 Frame = +3

Query: 9   LLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKANS 188
           LL G + ++Q W+G + A +G F+ + + + + W+KP G KLN+DG+ +
Sbjct: 678 LLDGSLLHQWQ-WKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRNGH 736

Query: 189 I--HGGI 203
           + GGI
Sbjct: 737 LAASGGI 743

>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 89.0 bits (219), Expect(2) = 2e-21
 Identities = 44/106 (41%), Positives = 68/106 (64%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            G+LRDH G +I+GF+++ DSLQAE+ ALH+ L + ++ LWIE D++ + +I
Sbjct: 2078 GLLRDHTGSMIFGFSENFGP-QDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMI 2136

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            + R +++L + + GI+ +ISHIFREGNQ AD L+N G
Sbjct: 2137 KEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQG 2182

 Score = 39.3 bits (90), Expect(2) = 2e-21
 Identities = 22/69 (31%), Positives = 33/69 (47%)
 Frame = +3

Query: 3    LHLLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKA 182
            LH L GK + W+G A +G+ + + + + + W KP G KLN+DG+ K
Sbjct: 2010 LHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKH 2069

Query: 183  NSIHGGIGG 209
            N GG
Sbjct: 2070 NPQSAAGGG 2078

>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 86.7 bits (213), Expect(2) = 1e-20
 Identities = 42/106 (39%), Positives = 70/106 (66%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GVLRDH ++I+ F+++I + ++SLQAE+ ALH+ L + + LWIE D+ ++ +I
Sbjct: 994  GVLRDHTSKLIFCFSENIGT-YNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLI 1052

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            H + + +++L ++ + I+ +ISHIFREGNQ AD L+N G
Sbjct: 1053 PHSQKGSHDIRYLLESIKKCLNSISYRISHIFREGNQAADFLSNEG 1098

 Score = 38.5 bits (88), Expect(2) = 1e-20
 Identities = 20/69 (28%), Positives = 35/69 (50%)
 Frame = +3

Query: 3   LHLLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKA 182
           L L+ + + W+G + A + +F+ + + + V W+KP G KLN+DG+ +
Sbjct: 927 LRQLQDDSLLQQWQWKGDTDIAAMWRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGSSR- 985

Query: 183 NSIHGGIGG 209
           N H GG
Sbjct: 986 NGQHAASGG 994

>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 81.3 bits (199), Expect(2) = 2e-19
 Identities = 41/106 (38%), Positives = 66/106 (62%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            G+LRDH G +++GF+++I + SLQAE+ AL + L + + LWIE D+ + +I
Sbjct: 1397 GLLRDHTGTLVFGFSENIGPSN-SLQAELRALLRGLLLCKERNIEKLWIEMDALVAIQMI 1455

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            + + Q++L ++ + + +ISHIFREGNQVAD L+N G
Sbjct: 1456 QQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKG 1501

 Score = 39.7 bits (91), Expect(2) = 2e-19
 Identities = 23/69 (33%), Positives = 35/69 (50%)
 Frame = +3

Query: 3    LHLLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKA 182
            L L+ G V K W+G ++ A +G +F + Q + + W K G KLN+DG+ +
Sbjct: 1330 LRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQ 1389

Query: 183  NSIHGGIGG 209
            N IGG
Sbjct: 1390 NQ-SAAIGG 1397

 Score = 73.6 bits (179), Expect(2) = 3e-15
 Identities = 38/106 (35%), Positives = 64/106 (60%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GV RDH +I+GF+++ ++S QAE+ ALH+ L + ++ +WIE D++ ++ ++
Sbjct: 1565 GVPRDHTSTMIFGFSENFGP-YNSTQAELMALHRGLLLCNEYNISRVWIEIDAKAIVQML 1623

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            R Q++L + + GI+ +ISHI RE NQ AD L+N G
Sbjct: 1624 HEGHKGYSRTQYLLSFICQCLSGISYRISHIHRESNQAADYLSNQG 1669

 Score = 33.5 bits (75), Expect(2) = 3e-15
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
 Frame = +3

Query: 72   HFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKK---ANSIHGGI 203
            H+GL + S + + W +P G KLN+DG K N+ GG+
Sbjct: 1520 HWGLRYEQDSHGHPKIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGV 1566

>ref|XP_002524702.1| nuclease, putative [Ricinus communis]
 gi|223536063|gb|EEF37721.1| nuclease, putative [Ricinus communis]
          Length = 201

 Score = 74.3 bits (181), Expect(2) = 3e-17
 Identities = 41/108 (37%), Positives = 69/108 (63%), Gaps = 2/108 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV R+H+ E + G+A+SI ++ AE+AAL + L++V +N + N+W+E D++ LL II
Sbjct: 61  GVFRNHKAEFLLGYAESIGRSTSTI-AELAALRRGLELVLENGWSNVWLEGDAKTLLEII 119

Query: 386 L-HQKASNWRYQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIG 523
           + +K + Q + + +I + N +SH++REGN+ AD LA IG
Sbjct: 120 VKRRKVRCAQMQRHVSDINLIIPELDNCIVSHVYREGNRAADKLAQIG 167

 Score = 39.7 bits (91), Expect(2) = 3e-17
 Identities = 16/30 (53%), Positives = 19/30 (63%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           V W+KP+ GW KLN DG+ K G IGG
Sbjct: 32  VAWEKPQVGWTKLNFDGSCKGREGKGSIGG 61

>gb|EOX93822.1| Uncharacterized protein TCM_002766 [Theobroma cacao]
          Length = 241

 Score = 73.2 bits (178), Expect(2) = 6e-17
 Identities = 38/106 (35%), Positives = 65/106 (61%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           G+LRDH G +I+GF+ + + SLQAE+ ALH+ L + + +WIE +++ ++ +I
Sbjct: 127 GLLRDHTGIVIFGFSKNFR-LYISLQAELMALHRGLLLCIEYNVSRIWIEMNAKVVVQMI 185

Query: 386 LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           + + +++L ++ + I+ ISHI REGNQV D L+N G
Sbjct: 186 HEGNKGSSQTRYLLASIRKCLNAISYCISHIHREGNQVVDHLSNQG 231

 Score = 39.7 bits (91), Expect(2) = 6e-17
 Identities = 22/53 (41%), Positives = 28/53 (52%)
 Frame = +3

Query: 27 VFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKAN 185
          +FK WRG L A +GL F+ S S + W KP G KLN+D + K N
Sbjct: 67 LFKRWQWRGDLQIAQAWGLMFQRASPPSPKIFSWHKPLTGEFKLNVDDSSKHN 119

>ref|XP_003543094.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Glycine max]
          Length = 201

 Score = 74.3 bits (181), Expect(2) = 8e-17
 Identities = 39/110 (35%), Positives = 68/110 (61%), Gaps = 2/110 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV+R+H E + G+A+SI + ++ AE+ AL K L++V +N + ++W+E D++ L+ II
Sbjct: 61  GVVRNHNAEFLLGYAESIGQANSTI-AELTALRKGLELVLENGWNDIWLEGDAKTLVEII 119

Query: 386 L-HQKASNWRYQHILLQVQTLIQGIN-MKISHIFREGNQVADALANIGFH 529
           + +K Q + + T++ N +SHI+REGN+ AD A +G H
Sbjct: 120 VKRRKVRCTEVQRHINHINTILPEFNNFFVSHIYREGNRAADKFAQMGHH 169

 Score = 38.1 bits (87), Expect(2) = 8e-17
 Identities = 17/30 (56%), Positives = 18/30 (60%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           V WKKP GW KLN DG+ K S IGG
Sbjct: 32  VAWKKPRIGWTKLNFDGSCKCLSGKASIGG 61

>ref|XP_002304990.2| hypothetical protein POPTR_0004s03265g [Populus trichocarpa]
 gi|550340224|gb|EEE85501.2| hypothetical protein POPTR_0004s03265g
 [Populus trichocarpa]
          Length = 202

 Score = 73.2 bits (178), Expect(2) = 1e-16
 Identities = 42/111 (37%), Positives = 68/111 (61%), Gaps = 5/111 (4%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV R+H+ E + G+A+ I ++ AE+AAL + L++V +N + N+W+E DS+ L+ II
Sbjct: 62  GVFRNHEAEFLLGYAEPIGGTTSTI-AELAALRRGLELVLENGWSNVWLEGDSKSLVDII 120

Query: 386 LHQ-----KASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           + + K + + HI L + L N ++H+FREGN+ AD LA IG
Sbjct: 121 VKRKQVRCKEAQRQVSHINLIMPEL---QNCVVTHVFREGNRAADKLARIG 168

 Score = 38.5 bits (88), Expect(2) = 1e-16
 Identities = 16/30 (53%), Positives = 19/30 (63%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           V WKKP+ GW KLN DG+ K + IGG
Sbjct: 33  VSWKKPQIGWTKLNFDGSCKGTAGKASIGG 62

>ref|XP_002317250.1| predicted protein [Populus trichocarpa]
          Length = 171

 Score = 73.9 bits (180), Expect(2) = 2e-16
 Identities = 41/109 (37%), Positives = 69/109 (63%), Gaps = 2/109 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV R+H+ E + G+A+SI S+ AE+AAL + L++V +N + N+W+E DS+ L+ II
Sbjct: 33  GVFRNHEAEFLLGYAESIGRT-TSMIAELAALRRGLELVLENGWGNVWLEGDSKSLVDII 91

Query: 386 LHQKASNWR-YQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIGF 526
           + +K + Q + + +I + N ++H+FREGN+ AD LA I +
Sbjct: 92  VKRKLVRCKEAQRQVSYINLIIPELKNCLVTHVFREGNRAADKLARIAY 140

 Score = 37.0 bits (84), Expect(2) = 2e-16
 Identities = 15/30 (50%), Positives = 20/30 (66%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           V W+KP+ GW KLN DG+ K ++ IGG
Sbjct: 4   VAWEKPQIGWTKLNFDGSCKDSAGKASIGG 33

>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 90.1 bits (222), Expect = 3e-16
 Identities = 44/106 (41%), Positives = 71/106 (66%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GVLRDH G++I+GF+++I +C+ SLQAE+ AL + L + + LWIE D+ ++ +I
Sbjct: 790 GVLRDHTGKLIFGFSENIGNCN-SLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLI 848

Query: 386 LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           H + + +++L ++ + I+ +ISHI REGNQVAD L+N G
Sbjct: 849 PHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEG 894

>ref|XP_004146631.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Cucumis sativus]
 gi|449488498|ref|XP_004158057.1| PREDICTED: putative ribonuclease H protein
 At1g65750-like [Cucumis sativus]
          Length = 205

 Score = 76.6 bits (187), Expect(2) = 3e-16
 Identities = 41/108 (37%), Positives = 72/108 (66%), Gaps = 2/108 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GVLRDH+ + + G+A+SI + S+ AE+ AL K L++V +N + ++W+E D++ L+ I+
Sbjct: 65  GVLRDHKAQFLLGYAESIGRAYSSM-AELKALTKGLELVLENGWKDVWVEGDAKGLVEIL 123

Query: 386 L-HQKASNWRYQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIG 523
           +++ + L +++L+ N K+SHI+REGN+VAD A+IG
Sbjct: 124 AENREVKCMEARSYLRHIKSLLLDFDNCKVSHIYREGNKVADRFASIG 171

 Score = 33.9 bits (76), Expect(2) = 3e-16
 Identities = 14/36 (38%), Positives = 20/36 (55%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGGFSEIIK 227
           V W +PE GW KLN DG+ K I G+ +++
Sbjct: 34  VAWTRPEFGWTKLNFDGSSK-GEIGPGVASIGGVLR 68

>emb|CAN67514.1| hypothetical protein VITISV_012081 [Vitis vinifera]
          Length = 697

 Score = 69.7 bits (169), Expect(2) = 6e-16
 Identities = 39/108 (36%), Positives = 66/108 (61%), Gaps = 2/108 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV+RDH + G+A+SI ++ AE+AAL + L++V +N + +W+E D Q L+ II
Sbjct: 500 GVIRDHNAAFLLGYAESIGHAXSTI-AEMAALRRGLELVVENGWSQVWLEGDLQSLVEII 558

Query: 386 LH-QKASNWRYQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIG 523
           + ++ + Q + ++ LI + N I+HI+REGN+VA A +G
Sbjct: 559 MQGRRVRSAEAQKQVSHIKLLIPELDNFLITHIYREGNRVAHTFAQMG 606

 Score = 39.7 bits (91), Expect(2) = 6e-16
 Identities = 21/50 (42%), Positives = 28/50 (56%)
 Frame = +3

Query: 60  NRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           N G GL R + + + V W+KP+ GW KLN DG+ K +S IGG
Sbjct: 452 NPVGPVGLLSRNWHENAIQ-VAWEKPQIGWTKLNFDGSCKCSSGRASIGG 500

>gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]
          Length = 228

 Score = 75.1 bits (183), Expect(2) = 7e-16
 Identities = 37/109 (33%), Positives = 66/109 (60%)
 Frame = +2

Query: 197 GNRGVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLL 376
           G G+LRDH +++ F++++ + +SLQAE+ ALH+ L + +N LWIE D+ ++
Sbjct: 74  GGGGLLRDHTSTLVFVFSENLGA-KNSLQAELLALHRGLLLCQENNISRLWIEMDAMIVI 132

Query: 377 TIILHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           ++ + +++ ++ ++ + +ISHI REGNQ AD LAN G
Sbjct: 133 QMLKEGHIGSHDSRYLWASIRQQLKLFSFRISHIHREGNQAADWLANRG 181

 Score = 34.3 bits (77), Expect(2) = 7e-16
 Identities = 19/54 (35%), Positives = 28/54 (51%)
 Frame = +3

Query: 48 RGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
          RG + A +GL F + + + + W KP G KLN+DG+ N + G GG
Sbjct: 24 RGDIQTAQMWGLTFPRKVISLPKVISWHKPSTGEFKLNVDGSSINNFQNAGGGG 77

>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 88.2 bits (217), Expect = 1e-15
 Identities = 43/106 (40%), Positives = 70/106 (66%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GVLRDH G++I+GF+++I +C+ SLQAE+ AL + L + + LWIE D+ + ++
Sbjct: 2078 GVLRDHTGKLIFGFSENIGTCN-SLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLL 2136

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            H + + +++L ++ + I+ +ISHI REGNQVAD L+N G
Sbjct: 2137 PHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEG 2182

>gb|EOY28460.1| Polynucleotidyl transferase [Theobroma cacao]
          Length = 285

 Score = 69.3 bits (168), Expect(2) = 3e-15
 Identities = 37/108 (34%), Positives = 68/108 (62%), Gaps = 2/108 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV R+H+ E + G+A+SI ++ AE+AAL + L++V +N + ++W+E D++ L+ +I
Sbjct: 62  GVFRNHKAEFLLGYAESIGRSTSTI-AELAALRRGLELVLENGWTDVWLEGDAKTLVDVI 120

Query: 386 LHQKASNW-RYQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIG 523
           + ++ Q + + +I + N ++HI+REGN+ AD LA IG
Sbjct: 121 VQRRQVKCAELQRHVSHINLIIPELNNCIVTHIYREGNRAADKLAQIG 168

 Score = 37.7 bits (86), Expect(2) = 3e-15
 Identities = 16/30 (53%), Positives = 18/30 (60%)
 Frame = +3

Query: 120 VLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
           V W+KPE GW KLN DG+ K IGG
Sbjct: 33  VSWEKPEIGWTKLNFDGSCKGRGGKASIGG 62

>ref|XP_002267980.2| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Vitis vinifera]
          Length = 205

 Score = 68.2 bits (165), Expect(2) = 4e-15
 Identities = 39/108 (36%), Positives = 66/108 (61%), Gaps = 2/108 (1%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GV+RDH + G+A+SI ++ AE+AAL + L++V +N + +W+E D Q L+ II
Sbjct: 65  GVIRDHNAVFLLGYAESIGHTTSTI-AEMAALRRGLELVLENGWSQVWLEGDLQSLVEII 123

Query: 386 LH-QKASNWRYQHILLQVQTLIQGI-NMKISHIFREGNQVADALANIG 523
           + ++ + Q + ++ LI + N I+HI+REGN+VA A +G
Sbjct: 124 MQGRRVRSAEAQKQVSHIKLLIPELDNFLITHIYREGNRVAHTFAQMG 171

 Score = 38.5 bits (88), Expect(2) = 4e-15
 Identities = 20/50 (40%), Positives = 28/50 (56%)
 Frame = +3

Query: 60 NRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKKANSIHGGIGG 209
          N G GL R + + + V W+KP+ GW KLN DG+ K ++ IGG
Sbjct: 17 NPVGPVGLLSRNWHENAIQ-VAWEKPQIGWTKLNFDGSCKCSTGRASIGG 65

>gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]
          Length = 1176

 Score = 85.1 bits (209), Expect = 9e-15
 Identities = 41/106 (38%), Positives = 68/106 (64%)
 Frame = +2

Query: 206  GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
            GVLRDH G++I+GF+++I + ++SLQ E+ ALH+ L + LWIE D+ ++ +I
Sbjct: 1040 GVLRDHTGKLIFGFSENIGT-YNSLQGELRALHRGLLLCKDCHIEKLWIEMDALAVIQLI 1098

Query: 386  LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
            H + + +++L ++ + I+ +I HIFREGNQ D L+N G
Sbjct: 1099 PHSQKGSHDIRYLLESIRKCLNNISYRILHIFREGNQTVDFLSNRG 1144

>gb|EOX96781.1| Uncharacterized protein TCM_005952 [Theobroma cacao]
          Length = 445

 Score = 60.5 bits (145), Expect(2) = 1e-14
 Identities = 37/106 (34%), Positives = 53/106 (50%)
 Frame = +2

Query: 206 GVLRDHQGEIIWGFADSINSCHDSLQAEIAALHKALQIVDKNQYPNLWIETDSQCLLTII 385
           GVLRDH G +I+GF+++ +SLQAE+ ALHK L + + +WIE D+Q
Sbjct: 88  GVLRDHTGNLIFGFSENFGY-QNSLQAELLALHKGLCLCMEYNVSRVWIEMDAQV----- 141

Query: 386 LHQKASNWRYQHILLQVQTLIQGINMKISHIFREGNQVADALANIG 523
           I+++ISHI +EGNQ D L+ G
Sbjct: 142 -----------------------ISVRISHIHKEGNQATDFLSKCG 164

 Score = 44.7 bits (104), Expect(2) = 1e-14
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
 Frame = +3

Query: 9   LLRGGKVFKYQTWRGFLNRAGHFGLHFRCRSQTSCRSVLWKKPEQGWAKLNIDGAKK--- 179
           L +GG + K+Q W+ L+ A H+G +F Q + + W KP G KLN+DG+ K
Sbjct: 23  LFQGGLLCKWQ-WKTDLDIAIHWGFNFAQERQARPKIIHWTKPLIGELKLNVDGSSKDEF 81

Query: 180 ANSIHGGI 203
           N++ GG+
Sbjct: 82  QNAVGGGV 89