BLASTX nr result
ID: Mentha22_contig00011808
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00011808
         (1160 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   290   6e-76
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   290   1e-75
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   276   2e-71
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   273   8e-71
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   273   1e-70
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   270   1e-69
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   266   9e-69
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   266   1e-68
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   264   6e-68
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   262   2e-67
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...   259   2e-66
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   251   4e-64
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...   229   2e-57
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...   207   7e-51
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   201   4e-49
ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein A...   200   8e-49
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...   200   1e-48
ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A...
198 3e-48 ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein A... 198 4e-48 ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258... 198 4e-48 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 290 bits (743), Expect = 6e-76 Identities = 148/383 (38%), Positives = 220/383 (57%), Gaps = 2/383 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++S LA++P+++ V++P VL ++ ++ F WG +A Sbjct: 1657 ERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAAS 1716 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC--IL 369 K++HW SW +I PV EGGL IRS E+ AFS KLWWRFR DSLW RF KYC L Sbjct: 1717 KRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQL 1776 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P + T+ +HDS W+R+ + ++RW VGQG+ FW D W G+ L S Sbjct: 1777 PMQ---TQPKLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLIS-SNQE 1832 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 VQV ++ +SW++E+L + + VVD +++IPID +D W T +G Sbjct: 1833 FTSSMVQVCDFFTNNSWNIEKLKTV-----LQQEVVDEIAKIPIDTMNKDEAYWTPTPNG 1887 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 +FS SAW+ + + +P+ IW+ + T S FLWR LHDWIPV+ +++ + + S Sbjct: 1888 DFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLAS 1947 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFH 1089 RC CC+S ES+ H+ +P +W +FA F L I+ CT I +G W+ + Sbjct: 1948 RCRCCKSE---ESIMHVMWDNPVAMQVWNYFAKLFQIL--IINPCT-INQIIGAWFYSGD 2001 Query: 1090 KAPTTHISFLIPCFIVWFIWTER 1158 HI L+P FI+WF+W ER Sbjct: 2002 YCKPGHIRTLVPLFILWFLWVER 2024 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 290 bits (741), Expect = 1e-75 Identities = 149/383 
(38%), Positives = 219/383 (57%), Gaps = 2/383 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++S LA++P+++ V++P VL ++ +I F WG +A Sbjct: 1655 ERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPICVLERVNRIFNSFLWGGSAAS 1714 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC--IL 369 KK+HW SW +I P++EGGL IR+ E+ AFS KLWWRFR DSLW RF KYC L Sbjct: 1715 KKIHWASWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQL 1774 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P T+ +HDS W+R+ + +RW VGQG FW D W G+ L S Sbjct: 1775 PMH---TQPKLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTS-SNQE 1830 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 + VQV ++ +SW +E+L + + VVD +++IPID +D WA T +G Sbjct: 1831 LSLSMVQVCDFFMNNSWDIEKLKTV-----LQQEVVDEIAKIPIDAMSKDEAYWAPTPNG 1885 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 EFS SAW+ + + +P+ IW+ + T+S FLWR LHDWIPV+ +++ + S Sbjct: 1886 EFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLAS 1945 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFH 1089 RC CC+S ES+ H+ +P +W +F+ +F L I+ CT I LG W+ + Sbjct: 1946 RCRCCKSE---ESIMHVMWDNPVATQVWNYFSKFFQIL--VINPCT-INQILGAWFYSGD 1999 Query: 1090 KAPTTHISFLIPCFIVWFIWTER 1158 HI L+P F +WF+W ER Sbjct: 2000 YCKPGHIRTLVPIFTLWFLWVER 2022 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 276 bits (705), Expect = 2e-71 Identities = 136/383 (35%), Positives = 215/383 (56%), Gaps = 2/383 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++STL+++P+++ V++P VL ++ +++ F WG + Sbjct: 1620 ERITGWENKTLSPGGRITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTAS 1679 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC--IL 369 K++HW SW +I P+ EGGL IR+ E++ AFS KLWWRFR +SLW +F KYC 
L Sbjct: 1680 KRIHWASWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCGGQL 1739 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P V + +HDS W+R+ I + +RW +G G+ FW D W G+ L + Sbjct: 1740 PTDV---QPKLHDSQTWKRMVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNR-NQA 1795 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 QV ++ +SW++E+L + + VV+ + +IPID D W T +G Sbjct: 1796 FASSMAQVSDFFLNNSWNVEKLKTV-----LQQEVVEEIVKIPIDTSSNDKAYWTTTPNG 1850 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 +FS SAW+ + N+ +P+ IW+ + T S FLWR LHDWIPV+ +++ + S Sbjct: 1851 DFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLAS 1910 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFH 1089 RC CC+S ES+ H+ +P +W +FA F I+ CT + +++ + Sbjct: 1911 RCRCCKSE---ESLMHVMWKNPVANQVWSYFAKVFQI--QIINPCTINQIICAWFYSGDY 1965 Query: 1090 KAPTTHISFLIPCFIVWFIWTER 1158 P HI L+P F +WF+W ER Sbjct: 1966 SKP-GHIRTLVPLFTLWFLWVER 1987 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 273 bits (699), Expect = 8e-71 Identities = 133/381 (34%), Positives = 208/381 (54%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S L+++PL++ V++P V+ ++E++ F WG + Sbjct: 1360 DRISGWENKTLSPGGRITLLRSVLSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTND 1419 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCILPF 375 K++HW +W ++ P EGGL IR ++ +AFS KLWWRF + LW +F KYC+ Sbjct: 1420 KRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQI 1479 Query: 376 RVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGIE 555 Y +HDS +W+R+ R V RW +G+G FW D W GD L + P Sbjct: 1480 PHY-VHPKLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVT-SFPHFR 1537 Query: 556 PRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGEF 735 V + + H+W +++L+ +PM++VD + +IPID DV W++T +GEF Sbjct: 1538 
NDMSTVHNFFNGHNWDVDKLN-----LYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEF 1592 Query: 736 SVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSRC 915 S SAWE + + + L + +W+ I ++S FLWR H+WIPVD ++ + S+C Sbjct: 1593 STRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKC 1652 Query: 916 LCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFHKA 1095 +CC S ES+ H+ +P + +W FA F + + I L W+ + Sbjct: 1653 ICCNSE---ESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQI---LWTWYLSGDYV 1706 Query: 1096 PTTHISFLIPCFIVWFIWTER 1158 HI LIP FI WF+W ER Sbjct: 1707 RKGHIRILIPLFICWFLWLER 1727 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 273 bits (697), Expect = 1e-70 Identities = 138/385 (35%), Positives = 217/385 (56%), Gaps = 4/385 (1%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++STL+++P+++ V++P VL ++ ++ F WG +A+ Sbjct: 2908 ERITGWENKILSPGGRITLLRSTLSSLPIYLLQVLKPPIIVLERINRLFNNFLWGGSASS 2967 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC--IL 369 K++HW SW +I P+ EGGL IR+ E++ AFS KLWWRFR +SLW +F KYC L Sbjct: 2968 KRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCGGQL 3027 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGD--CTLRSYCL 543 P V + +HDS W+R+ I + +RW VG G FW D W G+ +R+ Sbjct: 3028 PTHV---QPKLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPLVIRN--- 3081 Query: 544 PGIEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTC 723 QV ++ +SW +E+L + + VV+ +++IPI+ D W T Sbjct: 3082 QEFASSMAQVSDFFLNNSWDIEKLKSV-----LQQEVVEEIAKIPINASSNDRAYWTPTP 3136 Query: 724 HGEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSF 903 +G+FS SAW+ + +P + IW+ + T S FLWR LHDW+PV+ +++ + Sbjct: 3137 NGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQL 3196 Query: 904 VSRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRA 1083 SRC CC+S ES+ H+ +P +W +FA F +H I+ CT I + W+ + Sbjct: 3197 
ASRCRCCKSE---ESLMHVMWDNPVANQVWSYFAKVF-QIHI-INPCT-INHIISAWFYS 3250 Query: 1084 FHKAPTTHISFLIPCFIVWFIWTER 1158 + HI L+P FI+WF+W ER Sbjct: 3251 GDYSKPGHIRTLVPLFILWFLWVER 3275 Score = 266 bits (680), Expect = 1e-68 Identities = 134/384 (34%), Positives = 213/384 (55%), Gaps = 3/384 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S L++ P+++ V++P V+ ++E++ F WG + Sbjct: 1114 DRISGWENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERLFNSFLWGDSCDG 1173 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCI--L 369 KK+HW +W +I PV EGGL IR+ ++ AFS KLWWRF+ +SLW RF KYC+ + Sbjct: 1174 KKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCLGRI 1233 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P + + +HDS +W+R+ V +RW +G+G+ FW D W GD L + P Sbjct: 1234 P---HLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLAT-LFPS 1289 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 V ++ + W + +L+ +P +VD + +IP D DV WA+T +G Sbjct: 1290 FHNDMSHVHKFYNGDEWDIVKLNSY-----LPTSLVDEILQIPFDRSQEDVAYWALTSNG 1344 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 EFS SAWE + + + L + W+ I ++S FLWR L++WIPV+ ++ + + S Sbjct: 1345 EFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLAS 1404 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAF 1086 +C+CC+S ES+ H+ +P + +W FA F + P H I+ + W+ + Sbjct: 1405 KCVCCRSE---ESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKH----ISQIIWAWFFSG 1457 Query: 1087 HKAPTTHISFLIPCFIVWFIWTER 1158 HI LIP FI WF+W ER Sbjct: 1458 DYTRNGHIRILIPLFICWFLWLER 1481 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 270 bits (689), Expect = 1e-69 Identities = 136/384 (35%), Positives = 214/384 (55%), Gaps = 3/384 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S 
L++ P+++ V++P V+ ++E+I F WG + Sbjct: 1357 DRISGWENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERIFNSFLWGDSNDG 1416 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCI--L 369 KK+HW W +I PV EGGL IR+ ++ AFS KLWWRF+ +SLW +F KYC+ + Sbjct: 1417 KKLHWTVWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLGRI 1476 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P + + +HDS +W+R+ V +RW +G+G+ FW D W GD L + C P Sbjct: 1477 P---HFVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLC-PS 1532 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 V ++ + W +E+L +P +VD + +IP D DV WA+T +G Sbjct: 1533 FHNDMSHVHKFYNGDVWDIEKLSSC-----LPTSLVDEILQIPFDRSQEDVAYWALTSNG 1587 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 +FS+ SAWE + + + L + IW+ I ++S FLWR L++WIPV+ ++ + + S Sbjct: 1588 DFSLWSAWEAIRQRQTPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLAS 1647 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAF 1086 +C+CC+S ES+ H+ +P +W FA F + P H I+ + W+ + Sbjct: 1648 KCVCCRSE---ESLIHVLWENPVATQVWFFFAKSFQIYVSKPNH----ISQIIWAWFFSG 1700 Query: 1087 HKAPTTHISFLIPCFIVWFIWTER 1158 HI LIP FI WF+W ER Sbjct: 1701 DYTRNGHIRILIPLFICWFLWLER 1724 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 266 bits (681), Expect = 9e-69 Identities = 130/381 (34%), Positives = 210/381 (55%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++S L+++P+++ V++P V+ ++E++ F WG++ Sbjct: 741 ERITGWENKILSPGGRITLLRSVLSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDS 800 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCILPF 375 ++HW +W I P EGGLGIRS ++ +AFS KLWWRF SLW R+ KYC Sbjct: 801 TRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKYCTGQI 860 Query: 376 RVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGIE 555 + HDS W+ L S +RW 
+G+GD FW D W GD L + P Sbjct: 861 H-HNIAPKPHDSATWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVN-SFPSFS 918 Query: 556 PRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGEF 735 ++V+ + ++ +W +++L + +P +V+ + +IPI D+ WA+T +G+F Sbjct: 919 QSMMKVNYFFNDDAWDVDKL-----KTFIPNAIVEEILKIPISREKEDIAYWALTANGDF 973 Query: 736 SVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSRC 915 S+ SAWE + + + + IW+ I TVS FLWR LH+W+PV+ ++ + + S+C Sbjct: 974 SIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKC 1033 Query: 916 LCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFHKA 1095 LCC+S ES+ H+ SP + +W +F+ +F +H I L W+ + Sbjct: 1034 LCCKSE---ESLLHVLWESPVAQQVWNYFSKFF---QIYVHNPQNILQILNSWYYSGDFT 1087 Query: 1096 PTTHISFLIPCFIVWFIWTER 1158 HI LI FI WF+W ER Sbjct: 1088 KPGHIRTLILLFIFWFVWVER 1108 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 266 bits (680), Expect = 1e-68 Identities = 146/389 (37%), Positives = 207/389 (53%), Gaps = 8/389 (2%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L+KS L ++P+++F V++P VL ++ +I F WG +A Sbjct: 1827 ERITGWENKILSPGGRITLLKSVLTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSAAS 1886 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC--IL 369 KK+HW SW +I PV+EGGL IRS E+ AFS KLWWRFR DSLW RF KYC L Sbjct: 1887 KKIHWTSWAKISLPVKEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQL 1946 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P T+ +HDS W+R+ + +RW VGQG+ FW D W G+ P Sbjct: 1947 PMH---TQPKLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGE-------TPL 1996 Query: 550 IEPRH------VQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRW 711 I H VQV ++ +SW +E+L + + VVD +++IPID +D W Sbjct: 1997 ISSNHEFSLSMVQVCDFFMNNSWDIEKLKTV-----LQQEVVDEIAKIPIDAMSKDEAYW 2051 Query: 712 AMTCHGEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPR 891 A T +GEFS SAW+ + + +P+ IW+ I T S FLWR 
LHDWIPV+ ++ + Sbjct: 2052 APTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSK 2111 Query: 892 RVSFVSRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGF 1071 SRC CC+S ES+ H+ +W++ +A++ G Sbjct: 2112 GFQLASRCRCCRSE---ESIIHV---------MWDN----------------PVAVQPG- 2142 Query: 1072 WWRAFHKAPTTHISFLIPCFIVWFIWTER 1158 HI LIP F +WF+W ER Sbjct: 2143 -----------HIRTLIPIFTLWFLWVER 2160 >ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao] gi|508710337|gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 264 bits (674), Expect = 6e-68 Identities = 131/382 (34%), Positives = 212/382 (55%), Gaps = 1/382 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S L+++P+++ V++P V+ ++E++ F WG + Sbjct: 333 DRISGWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEG 392 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCILPF 375 K++HW +W +I P EGGL IR+ +++ +AF+ KLWWRF DSLW F KYC+ Sbjct: 393 KRMHWAAWNKITFPSSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLGRI 452 Query: 376 RVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGIE 555 Y + +H+S IW+R+ V RW +G+G+ FW D W GD L P Sbjct: 453 PHY-VQPKLHNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPL-VISFPSFR 510 Query: 556 PRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGEF 735 V ++ SW +++L R +P+++VD + IP D +DV W +T +GEF Sbjct: 511 NDMSLVHKFYKGDSWDVDKL-----RLFLPVNLVDEILLIPFDRTQQDVAYWILTSNGEF 565 Query: 736 SVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSRC 915 S SAWE + + P + L + IW+ I ++S F+WR L++WIPV+ ++ + + S+C Sbjct: 566 STRSAWETIRKRQPHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKC 625 Query: 916 LCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAFHK 1092 +CC S ES+ H+ + + +W FA +F + P H ++ L W+ + Sbjct: 626 VCCNSE---ESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQH----VSHILWAWFYSGDY 678 Query: 1093 APTTHISFLIPCFIVWFIWTER 1158 HI L+P FI WF+W ER Sbjct: 679 VKRGHIRTLLPIFICWFLWLER 700 >ref|XP_007031313.1| 
Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 262 bits (670), Expect = 2e-67 Identities = 126/384 (32%), Positives = 213/384 (55%), Gaps = 3/384 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S L+++P+++ V++P V+ +++++ F WG + Sbjct: 1534 DRISGWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDSTEC 1593 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCI--L 369 KK+HW W +I P EGGLGIR E++ AF+ KLWWRF+ +SLW +F KYC+ + Sbjct: 1594 KKMHWAEWAKISFPCAEGGLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRI 1653 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P + + +HDS +W+R+ + +RW +G+GD FW D W GD L + P Sbjct: 1654 PHHI---QPKLHDSHVWKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAA-SFPE 1709 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 + + + +W +++L R +P +V+ + ++P D+ DV W +T +G Sbjct: 1710 FQNDMSHGYHFYNGDTWDVDKL-----RSFLPTILVEEILQVPFDKSREDVAYWTLTSNG 1764 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 +FS SAWE + + + L + IW+ I ++S FLW+ LH+WIPV+ ++ + + S Sbjct: 1765 DFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLAS 1824 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAF 1086 +C+CC S ES+ H+ +P + +W FA F + P H ++ + W+ + Sbjct: 1825 KCVCCNSE---ESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRH----VSQIIWAWYVSG 1877 Query: 1087 HKAPTTHISFLIPCFIVWFIWTER 1158 H L+P FI WF+W ER Sbjct: 1878 DYVRKGHFRVLLPLFICWFLWLER 1901 >ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao] gi|508778195|gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 259 bits (662), Expect = 2e-66 Identities = 128/384 (33%), Positives = 213/384 (55%), Gaps = 3/384 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS GGR+ L++S L+++P+++ V++P V+ ++E++ F WG 
+ Sbjct: 285 DRISGWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEG 344 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCI--L 369 K++HW +W +I P EGGL IR+ ++ AF+ KLWWRF+ DSLW F KYC+ + Sbjct: 345 KRMHWAAWNKITFPCSEGGLDIRNLNDVFEAFTLKLWWRFQTCDSLWTHFLKTKYCLGRI 404 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P V+ +HDS +W+R+ R V +RW +G+GD FW D W G+ L P Sbjct: 405 PHYVH---PKLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPL-VMSFPS 460 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 + V + + +W +++L +PM+++D + IP + +DV W +T +G Sbjct: 461 LRNDMSLVHNFYNGDTWDVDKLKAY-----LPMNLIDEILLIPFNRTQQDVAYWTLTSNG 515 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 EF+ SAWE + + + L + IW+ I ++S FLWR L++WIPV+ ++ + + S Sbjct: 516 EFATWSAWETIRQRKSSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLAS 575 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAF 1086 +C+CC S ES+ H+ + + +W F +F + P H ++ L W+ + Sbjct: 576 KCVCCNSE---ESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQH----VSQILWAWFFSG 628 Query: 1087 HKAPTTHISFLIPCFIVWFIWTER 1158 HI L+P FI WF+W ER Sbjct: 629 DYVKKGHIRSLLPIFICWFLWLER 652 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 251 bits (641), Expect = 4e-64 Identities = 127/382 (33%), Positives = 208/382 (54%), Gaps = 1/382 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 DRI W ++ LS G R+ L++S L+++P+++ V++P V+ ++E++ F WG + Sbjct: 1621 DRISGWENKILSPGSRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEG 1680 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCILPF 375 K++HW +W +I P EGGL IR+ +++ +AF+ KLWWRF DSLW F KYC+ Sbjct: 1681 KRMHWAAWNKINFPCSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLGRI 1740 Query: 376 RVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGIE 555 Y + +H S IW+R+ V RW +G+G+ FW D W GD L P Sbjct: 
1741 PHY-VQPKIHSSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPL-VISFPSFR 1798 Query: 556 PRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGEF 735 V ++ SW +++L R +P++++ + IP D +DV W +T +GEF Sbjct: 1799 NDMSFVHKFYKGDSWDVDKL-----RLFLPVNLIYEILLIPFDRTQQDVAYWTLTSNGEF 1853 Query: 736 SVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSRC 915 S SAWE + + + L + IW+ I ++S F+WR L++WIPV+ ++ + + S+C Sbjct: 1854 STKSAWETIRQQQSHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKC 1913 Query: 916 LCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAFHK 1092 +CC S ES+ H+ + + +W FA +F + P H ++ L W+ + Sbjct: 1914 VCCNSE---ESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKH----VSHILWAWFYSGDY 1966 Query: 1093 APTTHISFLIPCFIVWFIWTER 1158 HI L+P FI WF+W ER Sbjct: 1967 VKRGHIRTLLPIFICWFLWLER 1988 >ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao] gi|508787492|gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 229 bits (583), Expect = 2e-57 Identities = 129/383 (33%), Positives = 194/383 (50%), Gaps = 2/383 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +RI W ++ LS GGR+ L++S LA++P+++ V++P +L + Sbjct: 353 ERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCILER---------------- 396 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCI--L 369 + S E+ AFS KLWWRFR DSLW RF KYC L Sbjct: 397 ---------------------VNSLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQL 435 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 P + T+ +HDS W+R+ ++RW VGQG+ FW D W GD L S Sbjct: 436 PMQ---TQPKLHDSQTWKRMLTSSATTEQHMRWRVGQGNLFFWHDCWMGDAPLIS-SNQE 491 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHG 729 VQV ++ +SW++E+L + + VVD +++IPID +D W T +G Sbjct: 492 FTSSMVQVCDFFMNNSWNVEKLKTV-----LQQEVVDEIAKIPIDTMSKDEAYWTPTPNG 546 Query: 730 EFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVS 909 +FS SAW+ + + +P+ IW+ + T S FLWR LHDWIPV+ +++ + + S Sbjct: 547 
DFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLAS 606 Query: 910 RCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFH 1089 RC CC+S ES+ H+ +P +W +FA F I+ CT I +G W+ + Sbjct: 607 RCRCCKSE---ESIMHVMWDNPVAMQVWNYFAKLFQI--CIINPCT-INQIIGAWFHSGD 660 Query: 1090 KAPTTHISFLIPCFIVWFIWTER 1158 HI L+P FI+WF+W ER Sbjct: 661 YCKPGHIRTLVPLFILWFLWVER 683 >ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 885 Score = 207 bits (527), Expect = 7e-51 Identities = 122/384 (31%), Positives = 183/384 (47%), Gaps = 1/384 (0%) Frame = +1 Query: 10 MMDRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTA 189 +M RI SW +R LSFGGR LI + L ++P+++ M P V+ QL +I A+FFW TA Sbjct: 266 VMKRISSWQNRLLSFGGRYVLIANVLQSLPIYVVSAMNPPACVITQLHRIFAKFFWANTA 325 Query: 190 TQKKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFR-AQDSLWARFTARKYCI 366 K HW+ W ++C+P EGG+G RS ++ A KLWW FR + ++LWA F KYC Sbjct: 326 GAKNKHWVGWDKMCYPRGEGGMGWRSLHDISKALFAKLWWNFRTSTNTLWASFMWNKYCK 385 Query: 367 LPFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLP 546 + S +WRR+ I + + W + G+ SFW D W L + Sbjct: 386 KHHPIIAQ--GYGSSHVWRRMISIREEVEHEIWWQIKAGNSSFWFDNWTKQGAL-YHIEE 442 Query: 547 GIEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCH 726 + V+V E+ W E+L ++ + H+++ +S P G DV+ W Sbjct: 443 NAKEEEVEVKEFCTGEGWDKEKLL-QNLSLEMTDHIMENISP-PNTLFGNDVVWWMANAQ 500 Query: 727 GEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFV 906 G F+V SAW+ NK IWN + ++ F+WR I D ++ R++ V Sbjct: 501 GIFTVKSAWQITRNKQEVRRDCEVIWNKELPFKINFFMWRVWKRRIATDDNLKKMRINIV 560 Query: 907 SRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAF 1086 SRC CC E++ HLF +P +W +FA + +H I WW+ Sbjct: 561 SRCWCCDRKKE-ETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQLII----SWWKHE 615 Query: 1087 HKAPTTHISFLIPCFIVWFIWTER 1158 I IP I+W +W R Sbjct: 616 ATPKLQGIYKAIPAIIMWTLWKRR 639 >ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum] Length = 1135 Score = 201 bits (512), Expect = 4e-49 
Identities = 110/382 (28%), Positives = 183/382 (47%), Gaps = 1/382 (0%) Frame = +1 Query: 16 DRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTATQ 195 +R+++W ++ +SFG R LI L +IP+++ M P K ++ QL ++ A FFW ++ Sbjct: 644 NRMNTWQNKLMSFGERYILIAHVLQSIPVYLLAAMNPPKSIIDQLHKLFAIFFWSNSSGA 703 Query: 196 KKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQ-DSLWARFTARKYCILP 372 + HW++W ++C+P EGGLG RS ++ AF KLWW FR SLWA F KYC Sbjct: 704 RNKHWVAWDKMCYPKVEGGLGFRSLHDVSKAFFAKLWWNFRTDTSSLWASFMWNKYCKKM 763 Query: 373 FRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGI 552 S +WR++ + + + W + G+ SFW D W L Sbjct: 764 HPTVARGQGA--SHVWRKMITVREEVEHNIWWQIKAGNSSFWFDNWTKQGALWYVEENNA 821 Query: 553 EPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGE 732 ++V + H+ +W E+L + + ++++ + P++E DV W + G Sbjct: 822 VEEKIEVKYFTHQGAWDREKLLN-KISEEMTDYIMESIKP-PLEEYINDVAWWMGSTQGI 879 Query: 733 FSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSR 912 F+V SAWE + +K R + IW + ++ FLWR I D ++ ++ VSR Sbjct: 880 FTVKSAWELMRHKQERRTDYQLIWTKDVPFKMNFFLWRLWKRRIATDDNLKRMKIQIVSR 939 Query: 913 CLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRAFHK 1092 C CC S E++ H+FL +P +W F+ + +H I WW+ Sbjct: 940 CWCC-SETEEETMTHIFLTAPIANRLWRQFSNFAGIQIESMHLQQLII----NWWKHSDN 994 Query: 1093 APTTHISFLIPCFIVWFIWTER 1158 A + +P I+W +W R Sbjct: 995 AKLKVVMRAMPTIIMWTLWKRR 1016 >ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 440 Score = 200 bits (509), Expect = 8e-49 Identities = 106/360 (29%), Positives = 173/360 (48%), Gaps = 2/360 (0%) Frame = +1 Query: 10 MMDRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTA 189 ++ RI W + LS+GG+ L K L +P+H+ + P ++ Q++ ++A FFWG Sbjct: 73 VVSRITGWQTKQLSYGGKAVLSKHVLQALPIHLLLAVTPPTTIIRQIQMLIADFFWGWKN 132 Query: 190 TQKKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCIL 369 +KK HW SW+ + +P EEGG+G+R+ +++ +F FK WW FR + +LW F KYC Sbjct: 133 DRKKYHWSSWKNLSYPYEEGGIGMRNLQDVCKSFQFKQWWVFRTKQTLWGEFLRAKYCQR 192 Query: 370 
PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 V C + +S W+ + + ++ W++ G+ SFW D W G L + Sbjct: 193 SNPV-CKKWDTGESLTWKHMLDTRQQVEQHIHWNLQAGNCSFWWDNWLGTGPLAQHTTSS 251 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGL--SEIPIDEGGRDVMRWAMTC 723 ++ V E+L W +L P+ + + + IP + D W Sbjct: 252 NRFNNITVAEFLENGEWKWSKL-----MKHAPVTQLSSILATRIPQHQHRPDQAIWKPNT 306 Query: 724 HGEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSF 903 HG FS TSAWE + +K ++ ++ IW+ I S LWR L +P + ++ + Sbjct: 307 HGRFSCTSAWEEIRSKKAKNNFNSLIWHKSIPFKTSFLLWRTLKGKLPTNEKLFNFGIE- 365 Query: 904 VSRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRA 1083 S C CC +++VEH+F P +W FA S T +AL + WW A Sbjct: 366 PSPCFCCFDRAGMDTVEHIFNSGPFAAKVWRFFAA---SAGLQADHSTLLAL-IKQWWTA 421 >ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao] gi|508787491|gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 200 bits (508), Expect = 1e-48 Identities = 104/305 (34%), Positives = 161/305 (52%), Gaps = 1/305 (0%) Frame = +1 Query: 247 GGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCILPFRVYCTRLSVHDSPIWRR 426 GGL IR ++ +AF+ KLWWRF+ D LW F KYC+ Y + +HDS +W+R Sbjct: 497 GGLDIRRLNDVSDAFTMKLWWRFQTCDGLWTNFLKTKYCMGQIPHY-VQSKLHDSQVWKR 555 Query: 427 LCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPGIEPRHVQVDEYLHEHSWSL 606 + R V RW +G+G+ FW D W G+ L + P V ++ + +W + Sbjct: 556 MVRGRDVAIQNTRWRIGKGNLFFWHDCWMGNKPLVT-SFPSFRNDMTFVHKFYNGDNWDV 614 Query: 607 ERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTCHGEFSVTSAWEHVHNKLPRHP 786 L + +PM+++D + +IP D D+ WA+T GEFS SAWE V + + Sbjct: 615 NTL-----KLYLPMNLIDEILQIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQRQSPNT 669 Query: 787 LHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFVSRCLCCQSSPSVESVEHLFL 966 L + IW+ I T+S FLWR L++WIPV+ ++ + S+C+CC S ES+ H+ Sbjct: 670 LCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCNSE---ESLIHVLW 726 Query: 967 LSPAVRDIWEHFAGWFP-SLHTPIHTCTAIALRLGFWWRAFHKAPTTHISFLIPCFIVWF 1143 +P + +W FA +F ++ P H ++ + W+ + HI LIP FI WF Sbjct: 727 
DNPVAKQVWNFFADFFQINISNPQH----VSQIIWAWYYSGDFVRKGHIRTLIPLFICWF 782 Query: 1144 IWTER 1158 +W ER Sbjct: 783 LWLER 787 >ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 955 Score = 198 bits (504), Expect = 3e-48 Identities = 108/389 (27%), Positives = 193/389 (49%), Gaps = 7/389 (1%) Frame = +1 Query: 4 QHMMDRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGT 183 + ++ +I W + L+FGG++ L+K L +IP+H+ + P K L ++ ++A FFWG Sbjct: 342 EKIIRKISGWHAKILNFGGKITLVKHVLQSIPIHLLAAVSPPKTTLKYIKNVIADFFWGM 401 Query: 184 TATQKKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC 363 KK HW SW + +P EGG+G+R+ E++ AF +K WW FR ++SLW++F KYC Sbjct: 402 DKDGKKYHWASWETLAYPTNEGGIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYC 461 Query: 364 ILPFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCL 543 V + +S +WR R + +Y++W++ G SFW D W G+ L + + Sbjct: 462 KRANPV-AKKYDTGNSLVWRYFTRNRQAVESYIKWNIHSGSSSFWWDNWLGNEALANQVI 520 Query: 544 PGIEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGL--SEIPIDEGGRDVMRWAM 717 ++ V ++L W+ ER +VR VP +V + ++ + D W Sbjct: 521 NISSLNNIHVSDFLTNGIWN-ER----YVRQHVPPTMVPDIMQTQFKYNINIEDTAIWTP 575 Query: 718 TCHGEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRV 897 +G+F++ SAWE + K ++ +W+ I +S F+WR L +P +Q + Sbjct: 576 EENGKFTIASAWEVIRKKKSTDIINNSVWHKHIPFKISFFIWRALRGKLPTYDYLQ-KFG 634 Query: 898 SFVSRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWW 1077 S + C CC + ++ + H+ + IW+++A P T I + L Sbjct: 635 SNATDCYCC-NRKGIDDINHILITGNFANYIWKYYA--------PTFGITQINIDLRSLL 685 Query: 1078 RAFHKAPTTH-----ISFLIPCFIVWFIW 1149 + P+++ + ++P FI W +W Sbjct: 686 LQWTNLPSSNQVYKLLISILPNFICWHLW 714 >ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial [Solanum lycopersicum] Length = 451 Score = 198 bits (503), Expect = 4e-48 Identities = 103/357 (28%), Positives = 173/357 (48%), Gaps = 1/357 (0%) Frame = +1 Query: 10 MMDRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGTTA 189 ++ RI W + LSFGG+ L K L +P+H+ + P K 
++ Q++ ++A FFWG Sbjct: 104 VISRITGWQTKQLSFGGKAVLSKHVLQALPIHLLTAVTPPKTIIKQIQMLIADFFWGWQN 163 Query: 190 TQKKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYCIL 369 ++K HW SW+ + +P EEGG+G+R+ ++ +F FK WW FR + +LW F KYC Sbjct: 164 NRRKYHWSSWKNLSYPYEEGGIGMRNLHDICKSFQFKQWWTFRTKHTLWGDFLKAKYCQR 223 Query: 370 PFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCLPG 549 V + +S W+ + Y++W + G+ SFW D W G +L + Sbjct: 224 SNPV-SKKWDTGESIAWKHMLATRQQGEQYIQWQLNSGNCSFWWDNWLGTGSLAQHTNRN 282 Query: 550 IEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGL-SEIPIDEGGRDVMRWAMTCH 726 I + +V ++ +W+ +L H+ + + + IP + D W + H Sbjct: 283 IRFNNSKVADFWENGNWNWRKLE----EQAPTTHLTNIMATAIPSQQQKPDQAVWRLDSH 338 Query: 727 GEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSFV 906 G+FS SAWE + +K P++ +W++ I S LWR + +P + ++ + Sbjct: 339 GKFSCHSAWEEIRSKKPKNRFFNLLWHNSIPFKASFLLWRAIKRKLPTNEKLTNIGIE-P 397 Query: 907 SRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWW 1077 S C CC ++S+EH+F +W FA S I +I RL WW Sbjct: 398 SHCFCCIDRAGMDSIEHIFNSGQFASRVWSFFAA---SAGLEIEQ-PSIQARLRQWW 450 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 198 bits (503), Expect = 4e-48 Identities = 115/387 (29%), Positives = 192/387 (49%), Gaps = 2/387 (0%) Frame = +1 Query: 4 QHMMDRIHSWSHRHLSFGGRLALIKSTLATIPLHIFHVMEPLKGVLHQLEQIMARFFWGT 183 + ++ +I W + L+FGG++ L+K L ++P+H + P K +L+ +++++A FFWG Sbjct: 842 EKVIKKIAGWHLKILNFGGKVTLVKHVLQSMPIHTLSAISPPKTILNSIKKVIADFFWGI 901 Query: 184 TATQKKVHWISWRQICHPVEEGGLGIRSFEELVNAFSFKLWWRFRAQDSLWARFTARKYC 363 KK HW SW + P EGG+G+R E++ AF +K WW FR +SLW++F KY Sbjct: 902 EKDGKKYHWSSWNNMAFPTNEGGIGVRLIEDMCTAFQYKQWWAFRTNNSLWSKFLKAKYN 961 Query: 364 ILPFRVYCTRLSVHDSPIWRRLCRIWPVMSTYVRWSVGQGDFSFWDDVWFGDCTLRSYCL 543 V + + DS +WR L R + + ++W + G SFW D W D L C Sbjct: 962 QRANPV-AKKYNTGDSIVWRYLTRNRQKVESLIKWHIQSGTCSFWWDCWL-DKPLAMQCD 1019 Query: 544 PGIEPRHVQVDEYLHEHSWSLERLHGLHVRYGVPMHVVDGLSEIPIDEGGRDVMRWAMTC 723 + V ++L +W+ ERL HV + +++ ++I G D W T 
Sbjct: 1020 HVSSLNNSVVADFLINGNWN-ERLLRQHVPPQLVPYILQ--TKINYQAGNIDTSIWTPTE 1076 Query: 724 HGEFSVTSAWEHVHNKLPRHPLHAQIWNDCITPTVSVFLWRFLHDWIPVDTEVQPRRVSF 903 G+F+++SAW+ + K + P++ IW+ I VS F+WR L +P + +Q R Sbjct: 1077 SGQFTISSAWDSIRKKRNKDPINNIIWHKQIPFKVSFFIWRALRGKLPTNENLQ-RIGKN 1135 Query: 904 VSRCLCCQSSPSVESVEHLFLLSPAVRDIWEHFAGWFPSLHTPIHTCTAIALRLGFWWRA 1083 +S C CC + + + H+ + + IW+ ++ L PI+T L WR Sbjct: 1136 LSDCYCCYNK-GKDDINHILINGNFAKYIWKIYSSAVGVL--PINTTLRDLL---LQWRN 1189 Query: 1084 FHKAPTTH--ISFLIPCFIVWFIWTER 1158 H + ++P FI W +W R Sbjct: 1190 QQYTNEVHKLLIHILPNFICWNLWKNR 1216