BLASTX nr result
ID: Rehmannia22_contig00017304
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00017304
         (1833 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   385   e-104
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   384   e-104
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   382   e-103
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   380   e-102
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   379   e-102
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   377   e-102
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   376   e-101
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   371   e-100
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   363   1e-97
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   361   7e-97
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   352   2e-94
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   334   9e-89
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   317   1e-83
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   298   5e-78
ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein A...   264   8e-68
ref|XP_004253443.1| PREDICTED: putative ribonuclease H protein A...   255   4e-65
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   254   1e-64
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...
251 1e-63 ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A... 251 1e-63 ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein A... 242 4e-61 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 385 bits (989), Expect = e-104 Identities = 208/610 (34%), Positives = 315/610 (51%), Gaps = 1/610 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVLAS+ +L VL PP +L + ++ F WG + K +HW Sbjct: 1663 ENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWA 1722 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW I EGGL IRS+ E AFS KLWWRFR +SLW RF++ KYC+ P + Sbjct: 1723 SWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMQTQP 1782 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS WKRM + + +M W +G G FWH+ WMG+ PLI ++ + V Sbjct: 1783 KLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLIS--SNQEFTSSMVQV 1840 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+TN W+ KL VL V++I IPI+ +D +W + NG F+T SAW +RK Sbjct: 1841 CDFFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRK 1900 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K + +F +W+ T S FLWRL + IPV+ K++SKG+ LAS+C CC Sbjct: 1901 RKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE----- 1955 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ++ VW +F+ LF+ + + ++ + Sbjct: 1956 --------------------ESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGA 1995 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P + WF WVERND KHR LG R++W V + +LSL + Sbjct: 1996 WFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQ 2055 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG IA + S+ + W KP +G KLN D + Sbjct: 2056 LLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGI 2115 Query: 1439 
IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ GE+V F + NSL+AE+ AL G+ IL R R WIE+D+++++ +++ Sbjct: 2116 LRDHAGEMVFGFSENLGTQNSLQAELLALYRGL-ILCRDYNIRRLWIEMDAISVIRLLQG 2174 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 G +++ + ++ +H FREGN AD+LAN G + Q Q F + G Sbjct: 2175 NHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--G 2232 Query: 1799 KLKGLIRVDK 1828 KL+G++ +D+ Sbjct: 2233 KLRGMLCLDQ 2242 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 384 bits (986), Expect = e-104 Identities = 206/612 (33%), Positives = 312/612 (50%), Gaps = 2/612 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++S L+S+ +L VL PP +L + ++ F WG S K +HW Sbjct: 2914 ENKILSPGGRITLLRSTLSSLPIYLLQVLKPPIIVLERINRLFNNFLWGGSASSKRIHWA 2973 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW I EGGL IR++ + AFS KLWWRFR NSLW +F++AKYC P V+ Sbjct: 2974 SWGKIALPIAEGGLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCGGQLPTHVQP 3033 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS WKRM + + + N+ W +G GK FWH+ WMG++PL + ++ + V Sbjct: 3034 KLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPL--VIRNQEFASSMAQV 3091 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 + F+ N WD KL VL VE+I IPIN + D +W + NG F+T SAW R+ Sbjct: 3092 SDFFLNNSWDIEKLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRE 3151 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K + + +W+ T S FLWRL + +PV+ K++SKG LAS+C CC Sbjct: 3152 RKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSE----- 3206 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ++ VW +F+ +F+ + + ++ + Sbjct: 3207 --------------------ESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISA 3246 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S +S H+ L+P + WF WVERND KHR LG RI+W + + +L + Sbjct: 3247 
WFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQ 3306 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD--XXXXXXXXXXXX 1435 W+G IA ++ + L+FW+KP +G KLN D Sbjct: 3307 LQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGG 3366 Query: 1436 VIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIK 1615 ++R+ G ++ F +F + +SL+AE+ AL G+ +L R WIE+D+ V MI Sbjct: 3367 LLRDHTGSMIFGFSENFGSQDSLQAELMALHRGL-LLCIDHNVTRLWIEMDAKVAVQMIN 3425 Query: 1616 NRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLP 1795 G R ++ I I+H FREGN AD+L+N G Q Q S Sbjct: 3426 EGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGYTHQNLQVI--SQAE 3483 Query: 1796 GKLKGLIRVDKM 1831 G+L+G++R+DK+ Sbjct: 3484 GQLRGILRLDKI 3495 Score = 374 bits (961), Expect = e-101 Identities = 203/599 (33%), Positives = 311/599 (51%), Gaps = 7/599 (1%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S +L VL PP T++ ++E++ F WG K +HW Sbjct: 1120 ENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERLFNSFLWGDSCDGKKLHWT 1179 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W I EGGL IR++ + AFS KLWWRF+ NSLW RFL+ KYC P +V+ Sbjct: 1180 AWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCLGRIPHLVQP 1239 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS +WKRM RD+ N+ W +G G+ FWH+ WMGDQPL +F + + + HV Sbjct: 1240 KLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFPSFHN--DMSHV 1297 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 + F+ +WD KL L V++I IP + +Q D +W L+SNG+F+ SAW +R+ Sbjct: 1298 HKFYNGDEWDIVKLNSYLPTSLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQ 1357 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 +T + + W+ +IS FLWR+ N IPV+ +++ KGI LAS+C CC Sbjct: 1358 RQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE----- 1412 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ E+ K VW F+ F+ + + + Sbjct: 1413 
--------------------ESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWA 1452 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S ++ H+ ILIP + WF W+ERND KHR +G R+IW + L++L + Sbjct: 1453 WFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSL 1512 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA F P + ++ W KP +G KLN D V Sbjct: 1513 LKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAGGGV 1572 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G++ AF + + SL+AE+ AL G+ +L + WIE+D++ V M++ Sbjct: 1573 LRDHTGKLAFAFSENLGPLPSLQAELHALLRGL-LLCKERNITNLWIEMDALVAVQMVQQ 1631 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLG------CDEQRTQEF 1777 + G +++ I+ + I+H +REGN AD+L+N G C QEF Sbjct: 1632 SQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVVSEAQEF 1690 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 382 bits (981), Expect = e-103 Identities = 210/610 (34%), Positives = 313/610 (51%), Gaps = 1/610 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVLAS+ +L VL PP +L + +I F WG + K +HW Sbjct: 1661 ENKILSPGGRITLLRSVLASLPIYLLQVLKPPICVLERVNRIFNSFLWGGSAASKKIHWA 1720 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW I +EGGL IR++ E AFS KLWWRFR +SLW RF++ KYC+ P + Sbjct: 1721 SWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMHTQP 1780 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS WKRM + + NM W +G GK FWH+ WMG+ PL ++ + V Sbjct: 1781 KLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTS--SNQELSLSMVQV 1838 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+ N WD KL VL V++I IPI+ +D +W + NG+F+T SAW +RK Sbjct: 1839 CDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRK 1898 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 + + +F +W+ TIS FLWRL + IPV+ K++SKG LAS+C CC Sbjct: 1899 
REVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKSE----- 1953 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ++ VW +FS F+ + + ++ + Sbjct: 1954 --------------------ESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQILGA 1993 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P WF WVERND KHR LG RI+W + + +LSL + Sbjct: 1994 WFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQ 2053 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG IA S+ + W KP +G KLN D V Sbjct: 2054 LLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGV 2113 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G +V F + NSL+AE+ AL G+ IL R R WIE+D+ +++ +++ Sbjct: 2114 LRDHAGVMVFGFSENLGIQNSLQAELLALYRGL-ILCRDYNIRRLWIEMDAASVIRLLQG 2172 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ + I+ ++H FREGN AD+LAN G + Q Q + G Sbjct: 2173 NQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQ--G 2230 Query: 1799 KLKGLIRVDK 1828 KL+G++R+D+ Sbjct: 2231 KLRGMLRLDQ 2240 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 380 bits (976), Expect = e-102 Identities = 206/611 (33%), Positives = 318/611 (52%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S +L VL PP T++ ++E+I F WG K +HW Sbjct: 1363 ENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERIFNSFLWGDSNDGKKLHWT 1422 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 W I EGGL IR++ + AFS KLWWRF+ NSLW +FL+ KYC P V+ Sbjct: 1423 VWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLGRIPHFVQP 1482 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS +WKRM RD+ N+ W +G G+ FWH+ WMGDQPL + + + + HV Sbjct: 1483 KLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFHN--DMSHV 1540 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 + 
F+ WD KL L V++I IP + +Q D +W L+SNG F+ SAW ++R+ Sbjct: 1541 HKFYNGDVWDIEKLSSCLPTSLVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQ 1600 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 +T +F+ +W+ +IS FLWR+ N IPV+ +++ KGI LAS+C CC Sbjct: 1601 RQTPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE----- 1655 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ E+ VW F+ F+ + + + Sbjct: 1656 --------------------ESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWA 1695 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S ++ H+ ILIP + WF W+ERND KHR +G R+IW + L++L + Sbjct: 1696 WFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSL 1755 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA F P +++W KP +G KLN D V Sbjct: 1756 LKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAGGGV 1815 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G++ AF + + SL+AE+ AL G+ +L + WIE+D++ V M++ Sbjct: 1816 LRDHTGKLAFAFSENLGPLPSLQAELHALLRGL-LLCKERNITNLWIEMDALVAVQMVQQ 1874 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ I+ + I+H +REGN AD+L+N G Q F S G Sbjct: 1875 SQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF--SEAQG 1932 Query: 1799 KLKGLIRVDKM 1831 +L G++++DK+ Sbjct: 1933 ELIGILKLDKL 1943 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 379 bits (973), Expect = e-102 Identities = 206/615 (33%), Positives = 313/615 (50%), Gaps = 5/615 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++S L+S+ +L VL PP +L + +++ F WG + K +HW Sbjct: 1626 ENKTLSPGGRITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTASKRIHWA 1685 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW I EGGL IR++ + AFS KLWWRFR NSLW +F++AKYC P V+ Sbjct: 1686 SWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCGGQLPTDVQP 1745 Query: 362 
SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLI---QIFGAEKDKWGV 532 +HDS WKRM + + + N+ W +G G+ FWH+ WMG++PL+ Q F + + Sbjct: 1746 KLHDSQTWKRMVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNRNQAFAS-----SM 1800 Query: 533 YHVNHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLS 712 V+ F+ N W+ KL VL VE+I IPI+ + D +W + NG F+T SAW Sbjct: 1801 AQVSDFFLNNSWNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQL 1860 Query: 713 LRKEKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXX 892 +R K +F +W+ T S FLWRL + IPV+ K+++KG LAS+C CC Sbjct: 1861 IRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSE-- 1918 Query: 893 XXXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLM 1072 E++ H+ ++ VW +F+ +F+ + + ++ + Sbjct: 1919 -----------------------ESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQI 1955 Query: 1073 FQFWKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSL 1252 W S +S H+ L+P WF WVERND KHR LG R++W + L +L Sbjct: 1956 ICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQ 2015 Query: 1253 SNKFDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD--XXXXXXXXX 1426 + W+G IA ++ + L+FW KP +G LKLN D Sbjct: 2016 GKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAA 2075 Query: 1427 XXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVW 1606 ++R+ G ++ F +F +SL+AE+ AL G+ +L R WIE+D+ V Sbjct: 2076 GGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGL-LLCIEHNISRLWIEMDAKVAVQ 2134 Query: 1607 MIKNRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPS 1786 MIK G R ++ I I+H FREGN AD+L+N G Q Q S Sbjct: 2135 MIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHTHQNLQVI--S 2192 Query: 1787 NLPGKLKGLIRVDKM 1831 G+L+G++R++K+ Sbjct: 2193 QAEGQLRGILRLEKI 2207 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 377 bits (968), Expect = e-102 Identities = 201/611 (32%), Positives = 315/611 (51%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S+ +L VL PP T++ ++++ F WG K MHW Sbjct: 1540 
ENKILSPGGRITLLRSVLSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDSTECKKMHWA 1599 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 W I EGGLGIR + + AF+ KLWWRF+ NSLW +FL+ KYC P ++ Sbjct: 1600 EWAKISFPCAEGGLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQP 1659 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS +WKRM R+M N+ W +G G FWH+ WMGD+PL F ++ + H Sbjct: 1660 KLHDSHVWKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQN--DMSHG 1717 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 HF+ WD KL L VE+I +P ++++ D +W L+SNG F+T SAW +R+ Sbjct: 1718 YHFYNGDTWDVDKLRSFLPTILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQ 1777 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 +T + + +W+ +IS FLW+ N IPV+ +++ KGI LAS+C CC Sbjct: 1778 RQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSE----- 1832 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ E+ K VW F+ LF+ + + V + Sbjct: 1833 --------------------ESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWA 1872 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W +S + H +L+P + WF W+ERND KHR G +R+IW + +L + Sbjct: 1873 WYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSL 1932 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA L F +++W KP +G KLN D V Sbjct: 1933 LQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGV 1992 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G+++ F + NSL+AE++AL G+ + E + WIE+D++ + +I+ Sbjct: 1993 LRDHTGKLIFGFSENIGPCNSLQAELRALLRGLLLCKERHIE-KLWIEMDALVAIQLIQP 2051 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 K G + L++ I+ ++H REGN ADYL+N G Q F + G Sbjct: 2052 SKKGPYNLRYLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF--TEAQG 2109 Query: 1799 KLKGLIRVDKM 1831 +L G++++D++ Sbjct: 2110 QLHGMLKLDRL 2120 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score 
= 376 bits (965), Expect = e-101 Identities = 204/611 (33%), Positives = 311/611 (50%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S+ +L VL PP ++ ++E++ F WG + K +HW Sbjct: 1366 ENKTLSPGGRITLLRSVLSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTNDKRIHWA 1425 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W + EGGL IR + + AFS KLWWRF LW +FLK KYC P V Sbjct: 1426 AWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHP 1485 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS +WKRM + R++ N W +G G FWH+ WMGDQPL+ F ++ H Sbjct: 1486 KLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFPHFRNDMSTVH- 1544 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 +F+ WD KL L V++I IPI+ +Q D +W L+SNG+F+T SAW ++R Sbjct: 1545 -NFFNGHNWDVDKLNLYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRL 1603 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K+ + + LW+ +IS FLWR+F N IPVD +++ KG LAS+C CC Sbjct: 1604 RKSPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCNSE----- 1658 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ++ K VW F++ F+ + +V + Sbjct: 1659 --------------------ESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQILWT 1698 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W LS + H+ ILIP + WF W+ERND KHR LG +R++W + L +L Sbjct: 1699 WYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYL 1758 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG A P + ++ W KP G KLN D V Sbjct: 1759 LKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQNQTAAIGGV 1818 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G +V F + NSL+AE++AL G+ + E + W+E+D++ + MI+ Sbjct: 1819 LRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLCKERNIE-KLWVEMDALVAIQMIQQ 1877 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ I+ I+H FREGN 
AD+L+N G Q F + G Sbjct: 1878 SQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVF--TEAQG 1935 Query: 1799 KLKGLIRVDKM 1831 KL G++++D++ Sbjct: 1936 KLYGMLKLDRL 1946 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 371 bits (952), Expect = e-100 Identities = 205/612 (33%), Positives = 310/612 (50%), Gaps = 2/612 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+SM +L VL PP ++ ++E++ F WG+ +HW Sbjct: 747 ENKILSPGGRITLLRSVLSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDSTRIHWT 806 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W +I EGGLGIRS+ + AFS KLWWRF SLW R+++ KYC + Sbjct: 807 AWHNITFPSSEGGLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKYCTGQIHHNIAP 866 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 HDS WK + R + W +G G FWH+ WMGD+PL+ F + + V Sbjct: 867 KPHDSATWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQ--SMMKV 924 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 N+F+ + WD KL + VE+I IPI+ + D +W L++NG F+ SAW LR+ Sbjct: 925 NYFFNDDAWDVDKLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQ 984 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K + + +W+ T+S FLWR N +PV+ ++++KGI LAS+C CC Sbjct: 985 RKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSE----- 1039 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ES + VW +FS F+ + + ++ + Sbjct: 1040 --------------------ESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQILNS 1079 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S F+ H+ LI +FWF WVERND KHR LG +RIIW + L KL Sbjct: 1080 WYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGL 1139 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD--XXXXXXXXXXXX 1435 WKG IA H F + + ++ W KP +G LKLN D Sbjct: 1140 LCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGG 1199 Query: 1436 
VIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIK 1615 V+R+ G ++ F +F NSL+AE+ AL G+ + + R WIEVD+ ++ MI+ Sbjct: 1200 VLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYNV-SRVWIEVDAQVVIQMIQ 1258 Query: 1616 NRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLP 1795 N G +++Q+ I+ + V I+H REGN AD+L+ G Q F + Sbjct: 1259 NHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSKHGHTHQNLHVF--TEAQ 1316 Query: 1796 GKLKGLIRVDKM 1831 G+L+G V+++ Sbjct: 1317 GELRGRTLVNRV 1328 >gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 363 bits (933), Expect = 1e-97 Identities = 196/611 (32%), Positives = 313/611 (51%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S+ +L VL PP ++ ++E++ F WG K MHW Sbjct: 291 ENKILSPGGRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWA 350 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W+ I EGGL IR++ + AF+ KLWWRF+ +SLW FLK KYC P V Sbjct: 351 AWNKITFPCSEGGLDIRNLNDVFEAFTLKLWWRFQTCDSLWTHFLKTKYCLGRIPHYVHP 410 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS +WKRM + R++ N+ W +G G FWH+ WMG+QPL+ F + ++ + H Sbjct: 411 KLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPLVMSFPSLRNDMSLVH- 469 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 +F+ WD KL L +++I IP N Q+D +W L+SNG+F T SAW ++R+ Sbjct: 470 -NFYNGDTWDVDKLKAYLPMNLIDEILLIPFNRTQQDVAYWTLTSNGEFATWSAWETIRQ 528 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K+ + + +W+ +IS FLWR N IPV+ +++ KGI LAS+C CC Sbjct: 529 RKSSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCCNSE----- 583 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ + K VW F F+ + + V + Sbjct: 584 --------------------ESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWA 623 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P + WF W+ERND KHR +R++W + L +L + Sbjct: 624 
WFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSL 683 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA+ +++W KP G KLN D + Sbjct: 684 LHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRNGHLAASGGI 743 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G+++ F + NSL+AE++AL G+ + E WIE+D++A++ +I++ Sbjct: 744 LRDHTGKLIFGFSENIGLCNSLQAELRALLRGLLLCKERHIE-NLWIEMDALAVIQLIQH 802 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ I+ I+H FREGN ADYLAN G Q + G Sbjct: 803 SQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLCVI--TEAQG 860 Query: 1799 KLKGLIRVDKM 1831 +L G++++D++ Sbjct: 861 ELHGMLKLDRL 871 >gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 361 bits (926), Expect = 7e-97 Identities = 198/611 (32%), Positives = 317/611 (51%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S+ +L VL PP ++ ++E++ F WG K MHW Sbjct: 339 ENKILSPGGRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWA 398 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W+ I EGGL IR++ + AF+ KLWWRF +SLW FLK KYC P V+ Sbjct: 399 AWNKITFPSSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLGRIPHYVQP 458 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +H+S IWKR+ RD+ N W +G G+ FWH+ WMGDQPL+ F + ++ + H Sbjct: 459 KLHNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSLVH- 517 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+ WD KL L V++I IP + Q+D +W L+SNG+F+T SAW ++RK Sbjct: 518 -KFYKGDSWDVDKLRLFLPVNLVDEILLIPFDRTQQDVAYWILTSNGEFSTRSAWETIRK 576 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 + + + +W+ +IS F+WR N IPV+ +++ KGI LAS+C CC Sbjct: 577 RQPHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCCNSE----- 631 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ + K VW 
F++ F+ + + V + Sbjct: 632 --------------------ESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHILWA 671 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P + WF W+ERND KHR G +R++W + L +L + Sbjct: 672 WFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSL 731 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA + + +V+W KP G KLN D V Sbjct: 732 LQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGV 791 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G+++ F + NSL+AE++AL G+ + E + WIE+D++A++ +I + Sbjct: 792 LRDHTGKLIFGFSENIGNCNSLQAELRALLRGLLLCKERHIE-QLWIEMDALAVIQLIPH 850 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ I+ I+H REGN AD+L+N G + Q + F + G Sbjct: 851 SQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVF--TEAQG 908 Query: 1799 KLKGLIRVDKM 1831 KL G++++D++ Sbjct: 909 KLHGMLKLDRL 919 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 352 bits (904), Expect = 2e-94 Identities = 195/611 (31%), Positives = 311/611 (50%), Gaps = 1/611 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ G R+ L++SVL+S+ +L VL PP ++ ++E++ F WG K MHW Sbjct: 1627 ENKILSPGSRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWA 1686 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W+ I EGGL IR++ + AF+ KLWWRF +SLW FLK KYC P V+ Sbjct: 1687 AWNKINFPCSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQP 1746 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 IH S IWKR+ RD+ N W +G G+ FWH+ WMGDQPL+ F + ++ H Sbjct: 1747 KIHSSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSFVH- 1805 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+ WD KL L + +I IP + Q+D +W L+SNG+F+T SAW ++R+ Sbjct: 1806 -KFYKGDSWDVDKLRLFLPVNLIYEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQ 1864 Query: 722 
EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 +++ + + +W+ +IS F+WR N IPV+ +++ KGI LAS+C CC Sbjct: 1865 QQSHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCCNSE----- 1919 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ + K VW F+ F+ + + V + Sbjct: 1920 --------------------ESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWA 1959 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P + WF W+ERND K+R G +RI+W + L +L + Sbjct: 1960 WFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSL 2019 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG + IA + +V+W KP G KLN D V Sbjct: 2020 LQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGV 2079 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G+++ F + NSL+AE++AL G+ + E + WIE+D++A + ++ + Sbjct: 2080 LRDHTGKLIFGFSENIGTCNSLQAELRALLRGLLLCKERHIE-KLWIEMDALAAIQLLPH 2138 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 + G +++ I+ I+H REGN AD+L+N G + Q F + G Sbjct: 2139 SQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVF--TEAQG 2196 Query: 1799 KLKGLIRVDKM 1831 KL G++++D++ Sbjct: 2197 KLHGMLKLDRL 2207 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 334 bits (856), Expect = 9e-89 Identities = 195/610 (31%), Positives = 297/610 (48%), Gaps = 1/610 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVLAS+ +L VL PP IL + Sbjct: 359 ENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCILERV--------------------- 397 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +S+ + FE AFS KLWWRFR +SLW RF++ KYC+ P + Sbjct: 398 --NSLAEVFE--------------AFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMQTQP 441 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS WKRM + +M W +G G FWH+ WMGD PLI ++ + V Sbjct: 442 
KLHDSQTWKRMLTSSATTEQHMRWRVGQGNLFFWHDCWMGDAPLIS--SNQEFTSSMVQV 499 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+ N W+ KL VL V++I IPI+ +D +W + NG F+T SAW +RK Sbjct: 500 CDFFMNNSWNVEKLKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQLIRK 559 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXX 901 K + +F +W+ T S FLWRL + IPV+ K++SKG+ LAS+C CC Sbjct: 560 RKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE----- 614 Query: 902 XXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQF 1081 E++ H+ ++ VW +F+ LF+ + + ++ + Sbjct: 615 --------------------ESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGA 654 Query: 1082 WKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNK 1261 W S + H+ L+P + WF WVERND KHR LG R++W V + +LSL + Sbjct: 655 WFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQ 714 Query: 1262 FDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXV 1438 WKG IA ++ S+ + W KP G KLN D + Sbjct: 715 LLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGI 774 Query: 1439 IRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKN 1618 +R+ G +V F + NSL+AE+ AL G+ IL R R WIE+D+++++ +++ Sbjct: 775 LRDHAGVMVFGFSENLGIQNSLQAELLALYRGL-ILCRDYNIRRLWIEMDAISVIRLLQG 833 Query: 1619 RKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPG 1798 G +++ + ++ +H FREGN AD+LAN G + Q Q F + G Sbjct: 834 NHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--G 891 Query: 1799 KLKGLIRVDK 1828 KL+G++R+D+ Sbjct: 892 KLRGMLRLDQ 901 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 317 bits (811), Expect = 1e-83 Identities = 174/540 (32%), Positives = 276/540 (51%), Gaps = 1/540 (0%) Frame = +2 Query: 215 GGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKVSIHDSPIWKRM 394 GGL IR + + AF+ KLWWRF+ + LW FLK KYC P V+ +HDS +WKRM Sbjct: 497 GGLDIRRLNDVSDAFTMKLWWRFQTCDGLWTNFLKTKYCMGQIPHYVQSKLHDSQVWKRM 556 Query: 395 CKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHVNHFWTNGQWDP 574 + RD+ N W +G G FWH+ 
WMG++PL+ F + ++ + V+ F+ WD Sbjct: 557 VRGRDVAIQNTRWRIGKGNLFFWHDCWMGNKPLVTSFPSFRN--DMTFVHKFYNGDNWDV 614 Query: 575 YKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRKEKTIQKIFTNL 754 L L +++I IP + +Q D +W L+S+G+F+T SAW ++R+ ++ + + + Sbjct: 615 NTLKLYLPMNLIDEILQIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQRQSPNTLCSFI 674 Query: 755 WNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXXXXXXXXXXXQR 934 W+ TIS FLWR+ N IPV+ +++ KG LAS+C CC Sbjct: 675 WHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCNSE---------------- 718 Query: 935 LSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQFWKLSSPFSHVS 1114 E++ H+ ++ K VW F+ F+ ++ + V + W S F Sbjct: 719 ---------ESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKG 769 Query: 1115 HVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSH 1294 H+ LIP + WF W+ERND KHR LG +R++W + L +L + WKG + Sbjct: 770 HIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTD 829 Query: 1295 IANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXVIRNDRGEVVRA 1471 IA F +P + ++ W KP G KLN D ++R+ G +V Sbjct: 830 IAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQSAATGGLLRDHTGTLVFG 889 Query: 1472 FQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHD 1651 F + NSL+AE++AL G+ +L + + WIE+D++ ++ MI+ K G +++ Sbjct: 890 FSENIGPSNSLQAELRALLRGL-LLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYL 948 Query: 1652 FLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKM 1831 I+ I+H FREGN AD+L+N G Q Q S GKL G++++D++ Sbjct: 949 LASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQNLQVI--SEAQGKLHGMLKLDRL 1006 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 298 bits (763), Expect = 5e-78 Identities = 177/595 (29%), Positives = 271/595 (45%), Gaps = 3/595 (0%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L++SVL+S+ +L VL PP ++ ++E++ F WG + K +HW+ Sbjct: 988 ENKTLSPGGRITLLRSVLSSLPMYLLQVLKPPMVVIEKIERLFNSFLWGDSTNGKRIHWV 1047 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 +W + EGGL IR +++ AFS KLWWRF+ + LW FL+ KYC 
P V+ Sbjct: 1048 AWHKLTFPCSEGGLDIRRLIDMFDAFSMKLWWRFQTCDGLWTNFLRTKYCMGQIPHYVQP 1107 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKD--KWGVY 535 +HDS +WKRM K R++ N W +G G FW++ WMGDQPLI ++ D W + Sbjct: 1108 KLHDSQVWKRMVKSREVAIQNTRWRIGKGNLFFWYDCWMGDQPLIPFDRSQDDIAYWALT 1167 Query: 536 HVNHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSL 715 F T W+ +L Sbjct: 1168 SNGEFSTWSAWE----------------------------------------------AL 1181 Query: 716 RKEKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXX 895 R ++ + + W+ +IS FLWR+F N IPVD +++ KG LAS+C CC Sbjct: 1182 RLRQSPNVLCSLFWHKSIPLSISFFLWRVFHNWIPVDLRLKDKGFHLASKCACCNSE--- 1238 Query: 896 XXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMF 1075 ET+ H+ ++ K VW F++ F+ + + +V + Sbjct: 1239 ----------------------ETLIHVLWDNPVAKQVWNFFANFFQIYVSNPQNVSQIL 1276 Query: 1076 QFWKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLS 1255 W S + H+ LIP + WF W+ERND K R LG +R++W + L +L Sbjct: 1277 WAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQDG 1336 Query: 1256 NKFDFSIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXX 1432 WKG IA F + W K G KLN D Sbjct: 1337 YVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAAIG 1396 Query: 1433 XVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMI 1612 ++R+ G +V F + NSL+AE++AL G+ + E + WIE+D++ + MI Sbjct: 1397 GLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKERNIE-KLWIEMDALVAIQMI 1455 Query: 1613 KNRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEF 1777 + + G +Q+ I+ I+H FREGN AD+L+N G +Q F Sbjct: 1456 QQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKGHTQQNLLVF 1510 Score = 77.0 bits (188), Expect = 2e-11 Identities = 52/162 (32%), Positives = 79/162 (48%), Gaps = 2/162 (1%) Frame = +2 Query: 1349 LVFWDKPPVGCLKLNTDXXXXXXXXXXXX--VIRNDRGEVVRAFQAHFPAVNSLEAEVKA 1522 +++W +P +G KLN D V R+ ++ F +F NS +AE+ A Sbjct: 1535 IIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELMA 1594 Query: 1523 LAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKNKFKDKDVVITH 1702 L 
G+ + + + R WIE+D+ A+V M+ G+ R Q+ I I+H Sbjct: 1595 LHRGLLLCNEYNIS-RVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISH 1653 Query: 1703 NFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDK 1828 RE N ADYL+N G Q Q F S G+L+G+IR+DK Sbjct: 1654 IHRESNQAADYLSNQGHTHQSLQVF--SKAEGELRGMIRLDK 1693 >ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 775 Score = 264 bits (675), Expect = 8e-68 Identities = 179/621 (28%), Positives = 283/621 (45%), Gaps = 13/621 (2%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 +T+ L+FGG+ +L K VL ++ HL + PP TI+ +++ ++A FFWG + K HW Sbjct: 171 QTKQLSFGGKAVLSKYVLQALPIHLLSAVTPPNTIIKQIQMLIADFFWGWQNNSKKYHWS 230 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW ++ +EEGG+G+R++ + +F +K WW FR + +LW FL+AKYC++S P K Sbjct: 231 SWKNLSYPYEEGGVGMRNLNDVCKSFQFKQWWTFRTKQTLWGDFLRAKYCQRSNPVSKKW 290 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 S WK M +R V+ ++ W L G SFW ++WMG PL Q + V Sbjct: 291 DTGQSLTWKHMLAIRQQVEQHIQWQLQAGNCSFWWDNWMGTGPLAQ-HTCNNIRLNNSKV 349 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKI--CSIPINENQRDTMHWKLSSNGQFTTTSAWLSL 715 FW NG W+ KL + + I +IP + Q+D WKL S G+F+ SAW + Sbjct: 350 ADFWENGVWNYRKLVEQAPASQLANIMAIAIPQQQYQQDQPVWKLHSQGKFSCHSAWEEI 409 Query: 716 RKEKTIQKIFTNLWNPCFIP-TISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXX 892 R +K + + LW+ FIP S LWR+ + +IP + K+ + GI S CYCC Sbjct: 410 RNKKAKNRFLSFLWHN-FIPFKTSFLLWRILKGKIPTNEKLTNFGIE-PSPCYCCVDRAG 467 Query: 893 XXXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLM 1072 ++++ H+F + VW F+ + Sbjct: 468 ----------------------MDSINHIFNTGNFAGRVWKSFAAGAGLQQDQQTLQARL 505 Query: 1073 FQFWKLSSPFSHVSHVSIL--IPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKL 1246 Q+W S + H +L P + W W R CK+ G R+ + V+ K+ Sbjct: 506 KQWWTAKS--CNAGHQLLLQATPIFICWNLWKNRCACKYGGKATNISRVKYAVYKDNFKM 563 Query: 1247 SLSNKFDFSIWKGFSHIANHLNFLVPKSSV----KKAILVFWDKPPVGCLKLNTD--XXX 1408 + N F W H L+ S K V W++PP +K+NTD Sbjct: 564 
-MKNAFPHIQWPA------HWTALIHTSEKCKHDTKVCQVVWNRPPEEWIKINTDGSALT 616 Query: 1409 XXXXXXXXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVD 1588 +IRN G++V AF ++ +AE +A +G+ G +E+D Sbjct: 617 NPGNIGAGGIIRNKEGKLVMAFATSLGEGSNNKAETEAALIGLVHALELGYR-NIIMELD 675 Query: 1589 SMALVWMIKNRKIGHWRLQHDFLRIKNK-FKDKDVVITHNFREGNAPADYLANLGCDEQR 1765 S +V I + + HW + + R++ + ++ H FRE N AD L+ Sbjct: 676 SQLIVQWISKKSVHHWSVSNQIERLQYLIMQTQNFKCQHIFREANWVADALSKHSHHITS 735 Query: 1766 TQ-EFDPSNLPGKLKGLIRVD 1825 Q FD + LP + R+D Sbjct: 736 PQLYFDSNQLPKEANAYYRMD 756 >ref|XP_004253443.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 775 Score = 255 bits (652), Expect = 4e-65 Identities = 176/621 (28%), Positives = 281/621 (45%), Gaps = 13/621 (2%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 +T+ L+FGG+ +L K VL ++ HL V+ PP TI+ +++ +A FFWG + K HW Sbjct: 171 QTKQLSFGGKAVLSKYVLQALPIHLLSVVTPPNTIIKQIQMFIADFFWGWQNNSKKYHWS 230 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW ++ +EEGG+G+R++ + +F +K WW F+ + +LW FL+AKYC++S P K Sbjct: 231 SWKNLSYPYEEGGVGMRNLNDVCKSFQFKQWWTFQTKQTLWGDFLRAKYCQRSNPVSKKW 290 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 S WK M +R V+ ++ W L G SFW ++ MG PL Q + V Sbjct: 291 DTGQSLTWKHMLAIRQQVEQHIQWQLQAGNCSFWWDNCMGTGPLAQ-HTCSNIRLNNSKV 349 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKI--CSIPINENQRDTMHWKLSSNGQFTTTSAWLSL 715 FW NG W+ KL + + I +IP ++Q+D WKL S G+F+ SAW + Sbjct: 350 ADFWENGVWNCRKLVEQAPASQLANIMAIAIPQQQHQQDQPVWKLHSQGKFSCHSAWEEI 409 Query: 716 RKEKTIQKIFTNLWNPCFIP-TISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXX 892 R +K + + LW+ FIP S LWR+ + +IP + K+ + GI S CYCC Sbjct: 410 RNKKAKNRFLSFLWHN-FIPFKTSFLLWRILKGKIPTNEKLTNFGIE-PSPCYCCVDRAG 467 Query: 893 XXXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLM 1072 ++++ H+F + VW F+ + Sbjct: 468 ----------------------MDSINHIFNTGNFAGRVWKSFAAGAGLQEDQQTLQARL 505 Query: 1073 
FQFWKLSSPFSHVSHVSIL--IPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKL 1246 Q+W S + H +L P + W W R CK+ G R+ + V+ K+ Sbjct: 506 KQWWTAKS--CNAGHQLLLQATPIFICWNLWKNRCACKYGGKATNISRVKYVVYKDNFKM 563 Query: 1247 SLSNKFDFSIWKGFSHIANHLNFLVPKSSV----KKAILVFWDKPPVGCLKLNTD--XXX 1408 + N F W H L+ S K V W++PP +K+NTD Sbjct: 564 -MKNAFPHIQWPA------HWTALIHTSEKCKHDTKVCQVVWNRPPEEWIKINTDGSALT 616 Query: 1409 XXXXXXXXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVD 1588 +IRN G++V AF +A+ +A +G+ G +E+D Sbjct: 617 NPGKIGAGGIIRNKEGKLVMAFATSLGEGTKNKAKTEAALIGLVHALELGYR-NIIMELD 675 Query: 1589 SMALVWMIKNRKIGHWRLQHDFLRIKNK-FKDKDVVITHNFREGNAPADYLANLGCDEQR 1765 S +V I + + HW + + R++ + ++ H F+E N AD L+ Sbjct: 676 SQLIVQWISKKSVHHWSVSNQIERLQYLIMQTQNFKCQHIFKEANWVADALSKHNHHITS 735 Query: 1766 TQ-EFDPSNLPGKLKGLIRVD 1825 Q FD + LP + R+D Sbjct: 736 PQLYFDSNQLPKEANAYYRMD 756 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 254 bits (648), Expect = 1e-64 Identities = 122/292 (41%), Positives = 169/292 (57%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 E + L+ GGR+ L+KSVL S+ +LF VL PP +L + +I F WG + K +HW Sbjct: 1833 ENKILSPGGRITLLKSVLTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSAASKKIHWT 1892 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKV 361 SW I +EGGL IRS+ E AFS KLWWRFR +SLW RF++ KYC+ P + Sbjct: 1893 SWAKISLPVKEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMHTQP 1952 Query: 362 SIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYHV 541 +HDS WKRM + + NM W +G G FWH+ WMG+ PLI + + V Sbjct: 1953 KLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGETPLIS--SNHEFSLSMVQV 2010 Query: 542 NHFWTNGQWDPYKLGKVLDGFWVEKICSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRK 721 F+ N WD KL VL V++I IPI+ +D +W + NG+F+T SAW +RK Sbjct: 2011 CDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRK 2070 Query: 722 EKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCC 877 + + +F +W+ T S FLWRL + IPV+ +++SKG LAS+C CC Sbjct: 2071 
REVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQLASRCRCC 2122 Score = 108 bits (271), Expect = 6e-21 Identities = 75/239 (31%), Positives = 109/239 (45%), Gaps = 1/239 (0%) Frame = +2 Query: 1115 HVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSH 1294 H+ LIP WF WVERND KHR LG + + W WKG Sbjct: 2143 HIRTLIPIFTLWFLWVERNDAKHRNLG--QQLLEWQ------------------WKGDKQ 2182 Query: 1295 IANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD-XXXXXXXXXXXXVIRNDRGEVVRA 1471 IA S+ + W KP G KLN D V+R+ G ++ Sbjct: 2183 IAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAGGGVLRDHAGVMIFG 2242 Query: 1472 FQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHD 1651 F + NSL+AE+ AL G+ IL R R WIE+D+ +++ +++ G +++ Sbjct: 2243 FSENLGIQNSLKAELLALYRGL-ILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRYL 2301 Query: 1652 FLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDK 1828 I+ +TH FREGN AD+LAN G + Q Q + GKL+G++R+D+ Sbjct: 2302 LGSIRQLLSHFSFRLTHIFREGNQAADFLANRGHEHQSLQVITVAQ--GKLRGMLRLDQ 2358 >ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 885 Score = 251 bits (640), Expect = 1e-63 Identities = 174/618 (28%), Positives = 284/618 (45%), Gaps = 9/618 (1%) Frame = +2 Query: 2 ETRNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWI 181 + R L+FGGR +LI +VL S+ ++ ++PP ++ +L +I A+FFW K HW+ Sbjct: 274 QNRLLSFGGRYVLIANVLQSLPIYVVSAMNPPACVITQLHRIFAKFFWANTAGAKNKHWV 333 Query: 182 SWDSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQ-NSLWARFLKAKYCKKSFPGIVK 358 WD +C EGG+G RS+ + A KLWW FR N+LWA F+ KYCKK P I+ Sbjct: 334 GWDKMCYPRGEGGMGWRSLHDISKALFAKLWWNFRTSTNTLWASFMWNKYCKKHHP-IIA 392 Query: 359 VSIHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFGAEKDKWGVYH 538 S +W+RM +R+ V+ ++W + G SFW ++W L I E K Sbjct: 393 QGYGSSHVWRRMISIREEVEHEIWWQIKAGNSSFWFDNWTKQGALYHI--EENAKEEEVE 450 Query: 539 VNHFWTNGQWDPYKLGKVLDGFWVEKI---CSIPINENQRDTMHWKLSSNGQFTTTSAWL 709 V F T WD KL + L + I S P D + W ++ G FT SAW Sbjct: 451 VKEFCTGEGWDKEKLLQNLSLEMTDHIMENISPPNTLFGNDVVWWMANAQGIFTVKSAWQ 510 Query: 710 
SLRKEKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXX 889 R ++ +++ +WN I+ F+WR+++ RI D ++ I++ S+C+CC Sbjct: 511 ITRNKQEVRRDCEVIWNKELPFKINFFMWRVWKRRIATDDNLKKMRINIVSRCWCCDRKK 570 Query: 890 XXXXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRL 1069 ET+THLF + +W +F+H ++ +L Sbjct: 571 E-----------------------ETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQL 607 Query: 1070 MFQFWKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLS 1249 + +WK + + + IP ++ W W RN KH ER++ V + K+ Sbjct: 608 IISWWKHEAT-PKLQGIYKAIPAIIMWTLWKRRNALKHDS-SISWERMVEMVIEVVRKM- 664 Query: 1250 LSNKFDF--SIWKGFSHIANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD--XXXXXX 1417 + ++F + ++ + I LN K V + V W P +K NTD Sbjct: 665 VKSQFPWIKNMRWTWQAIIQRLNQYKRKIHV---LRVTWKPPDDHYVKSNTDGACRGNPG 721 Query: 1418 XXXXXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMA 1597 IR+D+G+++ A ++EAE A+ + S + + IE DS++ Sbjct: 722 LSSFGFCIRDDKGDLIYAKAKGIGIATNMEAETVAILTALRECSNRKMQ-KVIIETDSLS 780 Query: 1598 LVWMIKNRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEF 1777 L +I+ W++ I+ + ITH FREGN+ AD LAN+ + Q ++ Sbjct: 781 LKKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFREGNSLADSLANIAIESQAEHQY 840 Query: 1778 DP-SNLPGKLKGLIRVDK 1828 LP K + ++ +DK Sbjct: 841 SCFQELPLKERRILNIDK 858 >ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 955 Score = 251 bits (640), Expect = 1e-63 Identities = 178/627 (28%), Positives = 283/627 (45%), Gaps = 21/627 (3%) Frame = +2 Query: 14 LTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWISWDS 193 L FGG++ L+K VL S+ HL + PPKT L ++ ++A FFWG K HW SW++ Sbjct: 356 LNFGGKITLVKHVLQSIPIHLLAAVSPPKTTLKYIKNVIADFFWGMDKDGKKYHWASWET 415 Query: 194 ICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQNSLWARFLKAKYCKKSFPGIVKVSIHD 373 + EGG+G+R++ + +AF YK WW FR +NSLW++FLKAKYCK++ P K + Sbjct: 416 LAYPTNEGGIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYCKRANPVAKKYDTGN 475 Query: 374 SPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLI-QIFGAEKDKWGVYHVNHF 550 S +W+ + R V++ + W + +G SFW ++W+G++ L Q+ HV+ F Sbjct: 476 
SLVWRYFTRNRQAVESYIKWNIHSGSSSFWWDNWLGNEALANQVINI--SSLNNIHVSDF 533 Query: 551 WTNGQWDPYKLGKVLDGFWVEKI--CSIPINENQRDTMHWKLSSNGQFTTTSAWLSLRKE 724 TNG W+ + + + V I N N DT W NG+FT SAW +RK+ Sbjct: 534 LTNGIWNERYVRQHVPPTMVPDIMQTQFKYNINIEDTAIWTPEENGKFTIASAWEVIRKK 593 Query: 725 KTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXXXXXXX 904 K+ I ++W+ IS F+WR + ++P +Q G S A+ CYCC Sbjct: 594 KSTDIINNSVWHKHIPFKISFFIWRALRGKLPTYDYLQKFG-SNATDCYCCNRKG----- 647 Query: 905 XXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRLMFQFW 1084 I+ + H+ + + +W +++ F + + L+ Q+ Sbjct: 648 ------------------IDDINHILITGNFANYIWKYYAPTFGITQINIDLRSLLLQWT 689 Query: 1085 KLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKF 1264 L S + ++P + W W +N C + Y +K+S + Sbjct: 690 NLPSSNQVYKLLISILPNFICWHLW--KNMCAVK---------------YGNKISSIQRV 732 Query: 1265 DFSIWKG-------------FSHIANHLNFLVPKSSVK-KAILVFWDKPPVGCLKLNTD- 1399 + I+K + H L LV + + K I+V W KP G KLNTD Sbjct: 733 QYGIFKDVMQTIKIVFPNIPWQHSWYRLINLVEQCQQQLKVIMVSWRKPQFGIYKLNTDG 792 Query: 1400 -XXXXXXXXXXXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWW 1576 ++R+ G++ AF F + AE++A G+D + G + Sbjct: 793 SALPESGKIGGGGILRDYTGKLHYAFSIPFGLGTNNIAEMEAARYGLDWCEQHGYKS-IL 851 Query: 1577 IEVDSMALVWMIKNRKIGHWRLQHDFLRIKNKFKDKD-VVITHNFREGNAPADYLANLGC 1753 +EVDS L I N WR Q I++ + D H +RE N AD L+ Sbjct: 852 LEVDSEILQKWISNTIAIPWRYQQTIEHIQDIGRKMDHFECQHVYREVNGTADLLSKWSH 911 Query: 1754 DEQRTQEFDPS-NLPGKLKGLIRVDKM 1831 Q F S L G ++G +DK+ Sbjct: 912 KLDILQHFYTSQQLIGSIRGSYILDKL 938 >ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 1035 Score = 242 bits (617), Expect = 4e-61 Identities = 176/622 (28%), Positives = 278/622 (44%), Gaps = 15/622 (2%) Frame = +2 Query: 8 RNLTFGGRLLLIKSVLASMTCHLFHVLHPPKTILHELEQIMARFFWGTYGSRKAMHWISW 187 R LTFGG+ +LI +VL SM ++ L PPK +L ++ QI A+FFWG G K HW++W Sbjct: 246 RFLTFGGKWILINNVLQSMPVYMLSALKPPKKVLDQIHQIFAKFFWGNLGGIKGKHWVAW 305 Query: 188 
DSICKKFEEGGLGIRSMLEGVMAFSYKLWWRFRLQ-NSLWARFLKAKYCKKSFPGIVKVS 364 +C EGGLG RS+ A KLWW FR+ SLW +++ KYCKK P +V S Sbjct: 306 GDLCYPKTEGGLGFRSLHNMNKALFAKLWWNFRVSTTSLWVKYMWNKYCKKLHP-VVATS 364 Query: 365 IHDSPIWKRMCKVRDMVQANMFWTLGTGKFSFWHEHWMGDQPLIQIFG--AEKDKWGVYH 538 + S +W++M +R+ V+ +++W + G SFW ++W L G A++++ Sbjct: 365 LGASQVWRKMISIREEVEHDIWWQIKAGNSSFWFDNWTRQGALYYTEGDCAQEEE---LE 421 Query: 539 VNHFWTNGQWDPYKLGKVLDGFWVEKI---CSIPINENQRDTMHWKLSSNGQFTTTSAWL 709 V +F TN WD KL +L VE I +E D W + G FT SA+ Sbjct: 422 VQYFITNDGWDETKLKDLLSEEMVEHIILNIRPKTSEEGIDKAWWCGNLTGLFTVKSAYH 481 Query: 710 SLRKEKTIQKIFTNLWNPCFIPTISIFLWRLFQNRIPVDTKIQSKGISLASQCYCCXXXX 889 +R K ++ +W IS FLWR+++ +I ++ I + S+CYCC Sbjct: 482 RIRGRKEEEEWRRYMWIKGMPIKISFFLWRVWRRKIATYDNLKRMKIPVVSKCYCCKEGE 541 Query: 890 XXXXXXXXXXXXXQRLSNDQLYFIETVTHLFLESDQVKLVWIHFSHLFKFSLPHTVSVRL 1069 +ET+THL L + + +W F+ + +L Sbjct: 542 -----------------------METMTHLLLTAPIAQKLWKQFASYAGIIINGLNLQQL 578 Query: 1070 MFQFWKLSSPFSHVSHVSILIPCLVFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLS 1249 +F++W + + +S + + ++ W W RN +H G+ +N +Y +L Sbjct: 579 IFKWWDYKAS-NKLSQILKAVLAVIMWELWKRRNSYRH------GKETTYNNMYYQCQLI 631 Query: 1250 LSN--KFDFSIWKGFSH----IANHLNFLVPKSSVKKAILVFWDKPPVGCLKLNTD--XX 1405 L F KG ++ + L P K +V W KP G + NTD Sbjct: 632 LYQLVTIKFPWIKGLTYHWPQVVGMLQNYKPPLHYK---VVRWRKPSEGWVTCNTDGASK 688 Query: 1406 XXXXXXXXXXVIRNDRGEVVRAFQAHFPAVNSLEAEVKALAMGIDILSRFGTEGRWWIEV 1585 IR+ G+++ A + ++EAE + + G + +E Sbjct: 689 GNPRMSSYGYCIRDKNGDLLYAEAHNIGETTNMEAEATTVWKALQFCYENGLR-KVRLET 747 Query: 1586 DSMALVWMIKNRKIGHWRLQHDFLRIKNKFKDKDVVITHNFREGNAPADYLANLGCDEQR 1765 DS+AL MI W L I + DV + H +RE N AD++AN + + Sbjct: 748 DSLALQNMITRSWKIPWELVEKLEEIHEIMQQIDVQVCHVYREVNQLADFIANTTINTEH 807 Query: 1766 TQEFDP-SNLPGKLKGLIRVDK 1828 + F LP K L+ +DK Sbjct: 808 KKVFHHFHQLPSLGKKLLNIDK 829