BLASTX nr result
ID: Mentha25_contig00034940
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00034940
         (947 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...   110   9e-28
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   107   6e-27
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   125   2e-26
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   106   2e-26
ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobrom...   108   5e-26
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   122   2e-25
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   122   2e-25
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   105   2e-25
ref|XP_007043747.1| Uncharacterized protein TCM_008287 [Theobrom...   102   7e-25
ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein A...   105   7e-25
ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601...   110   8e-25
ref|XP_004253338.1| PREDICTED: putative ribonuclease H protein A...   108   4e-24
ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobrom...   105   7e-24
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   116   1e-23
ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein A...   109   2e-23
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   115   3e-23
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   114   4e-23
ref|XP_007043999.1| Uncharacterized protein TCM_008793 [Theobrom...   114   4e-23
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...
114 6e-23 ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom... 114 7e-23 >ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao] gi|508778195|gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 110 bits (276), Expect(2) = 9e-28 Identities = 57/160 (35%), Positives = 84/160 (52%), Gaps = 5/160 (3%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 E K++HW +W I P +EGGL IR L+D+ AF+ KLWWRF DSLW +L KY Sbjct: 343 EGKRMHWAAWNKITFPCSEGGLDIRNLNDVFEAFTLKLWWRFQTCDSLWTHFLKTKYCLG 402 Query: 443 LTAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 V H S +W+R++ A IRW +G G + FW D W + PL P Sbjct: 403 RIPHYVHPKLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPLVMSFPSLR 462 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRT 159 + S V + ++ WD+D+L A +++ ++ +RT Sbjct: 463 NDMSLVHNFYNGDTWDVDKLKAYLPMNLIDEILLIPFNRT 502 Score = 40.4 bits (93), Expect(2) = 9e-28 Identities = 18/46 (39%), Positives = 27/46 (58%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWNPCLTPTIS 5 +DV W+LT +GEF SAW +R R+ +L IW+ + +IS Sbjct: 504 QDVAYWTLTSNGEFATWSAWETIRQRKSSNALCSFIWHRSIPLSIS 549 >ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao] gi|508710337|gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 107 bits (266), Expect(2) = 6e-27 Identities = 56/160 (35%), Positives = 78/160 (48%), Gaps = 5/160 (3%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 E K++HW +W I P +EGGL IR L D+ AF+ KLWWRF DSLW +L KY Sbjct: 391 EGKRMHWAAWNKITFPSSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLG 450 Query: 443 LTAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 V H+S IW+R+ RW +G G + FW D W D PL P Sbjct: 451 RIPHYVQPKLHNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFR 510 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRT 159 + S V + +WD+D+L +++ ++ RT Sbjct: 511 NDMSLVHKFYKGDSWDVDKLRLFLPVNLVDEILLIPFDRT 
550 Score = 41.6 bits (96), Expect(2) = 6e-27 Identities = 19/46 (41%), Positives = 28/46 (60%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWNPCLTPTIS 5 +DV W LT +GEF+ SAW +R R+P +L IW+ + +IS Sbjct: 552 QDVAYWILTSNGEFSTRSAWETIRKRQPHNTLGSLIWHRSIPLSIS 597 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 125 bits (315), Expect = 2e-26 Identities = 66/186 (35%), Positives = 104/186 (55%), Gaps = 5/186 (2%) Frame = -1 Query: 620 AKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY---S 450 +K+IHW SW I LPV EGGL IR L+++ AFS KLWWRF DSLW +++ KY Sbjct: 1716 SKRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQ 1775 Query: 449 YPL-TAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPL-AEYRPVGA 276 P+ T +H S W+R++ SS +RW +G G V FW D W + PL + + + Sbjct: 1776 LPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTS 1835 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHC 96 VCD +++++W++++L +VL + V +++ P+ ++ E+YW + Sbjct: 1836 SMVQVCDFFTNNSWNIEKL-----KTVLQQEVVDEIAKIPI---DTMNKDEAYWTPTPNG 1887 Query: 95 DFCMAS 78 DF S Sbjct: 1888 DFSTKS 1893 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 106 bits (265), Expect(2) = 2e-26 Identities = 54/139 (38%), Positives = 75/139 (53%), Gaps = 5/139 (3%) Frame = -1 Query: 617 KKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYPLT 438 K+IHW +W + P +EGGL IRRL+DM AFS KLWWRF + LW ++L KY Sbjct: 1420 KRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQI 1479 Query: 437 AFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGAPN 270 V H S +W+R++ A RW +G G + FW D W D PL P + Sbjct: 1480 PHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFPHFRND 1539 Query: 269 PS-VCDLWSDSAWDMDRLH 216 S V + ++ WD+D+L+ Sbjct: 1540 MSTVHNFFNGHNWDVDKLN 1558 Score = 
40.0 bits (92), Expect(2) = 2e-26 Identities = 18/46 (39%), Positives = 27/46 (58%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWNPCLTPTIS 5 +DV WSLT +GEF+ SAW +R R+ L +W+ + +IS Sbjct: 1579 DDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSIS 1624 >ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobroma cacao] gi|508725618|gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao] Length = 1176 Score = 108 bits (269), Expect(2) = 5e-26 Identities = 55/160 (34%), Positives = 78/160 (48%), Gaps = 5/160 (3%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSY- 447 E K +HW +W I P +EGGL IR L ++ AF+ KLWWRF DSLW +L KY Sbjct: 789 EGKMMHWAAWNKITFPSSEGGLDIRNLKNVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLG 848 Query: 446 ---PLTAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVG 279 +H S IW+R++ A IRW +G G + FW D W D PL + Sbjct: 849 QIPQYVQPKLHDSSIWKRMIGGRDVAIQNIRWKIGKGELFFWHDCWMGDQPLVISFPSFR 908 Query: 278 APNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRT 159 SV + +WD+D+L +++ ++ RT Sbjct: 909 NDMSSVHKFYKGDSWDVDKLRLFLPVNLIDEILLIPFDRT 948 Score = 37.4 bits (85), Expect(2) = 5e-26 Identities = 16/38 (42%), Positives = 24/38 (63%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWN 29 +DV W+LT +GEF+ SAW +R R+ +L IW+ Sbjct: 950 QDVAYWTLTPNGEFSTWSAWETIRQRQSHNTLGSLIWH 987 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 122 bits (307), Expect = 2e-25 Identities = 66/186 (35%), Positives = 100/186 (53%), Gaps = 5/186 (2%) Frame = -1 Query: 620 AKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY---S 450 +KKIHW SW I LPV EGGL IR L+++ AFS KLWWRF DSLW +++ KY Sbjct: 1886 SKKIHWTSWAKISLPVKEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQ 1945 Query: 449 YPL-TAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPL-AEYRPVGA 276 P+ T +H S W+R++ SS +RW +G G + FW D W + PL + Sbjct: 1946 
LPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGETPLISSNHEFSL 2005 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHC 96 VCD + +++WD+++L +VL + V +++ P+ + E+YW + Sbjct: 2006 SMVQVCDFFMNNSWDIEKL-----KTVLQQEVVDEIAKIPI---DAMSKDEAYWAPTPNG 2057 Query: 95 DFCMAS 78 +F S Sbjct: 2058 EFSTKS 2063 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 122 bits (307), Expect = 2e-25 Identities = 64/186 (34%), Positives = 103/186 (55%), Gaps = 5/186 (2%) Frame = -1 Query: 620 AKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY---S 450 +KKIHW SW I LP+ EGGL IR L+++ AFS KLWWRF DSLW +++ KY Sbjct: 1714 SKKIHWASWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQ 1773 Query: 449 YPL-TAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPL-AEYRPVGA 276 P+ T +H S W+R++ +S +RW +G G++ FW D W + PL + + + Sbjct: 1774 LPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTSSNQELSL 1833 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHC 96 VCD + +++WD+++L +VL + V +++ P+ + E+YW + Sbjct: 1834 SMVQVCDFFMNNSWDIEKL-----KTVLQQEVVDEIAKIPI---DAMSKDEAYWAPTPNG 1885 Query: 95 DFCMAS 78 +F S Sbjct: 1886 EFSTKS 1891 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 105 bits (261), Expect(2) = 2e-25 Identities = 55/160 (34%), Positives = 77/160 (48%), Gaps = 5/160 (3%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 E K++HW +W I P +EGGL IR L D+ AF+ KLWWRF DSLW +L KY Sbjct: 1679 EGKRMHWAAWNKINFPCSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLG 1738 Query: 443 LTAF----SVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 +H S IW+R+ RW +G G + FW D W D PL P Sbjct: 1739 RIPHYVQPKIHSSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFR 1798 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRT 159 + S V + 
+WD+D+L +++ ++ RT Sbjct: 1799 NDMSFVHKFYKGDSWDVDKLRLFLPVNLIYEILLIPFDRT 1838 Score = 38.5 bits (88), Expect(2) = 2e-25 Identities = 17/46 (36%), Positives = 28/46 (60%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWNPCLTPTIS 5 +DV W+LT +GEF+ SAW +R ++ +L IW+ + +IS Sbjct: 1840 QDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSIS 1885 >ref|XP_007043747.1| Uncharacterized protein TCM_008287 [Theobroma cacao] gi|508707682|gb|EOX99578.1| Uncharacterized protein TCM_008287 [Theobroma cacao] Length = 499 Score = 102 bits (255), Expect(2) = 7e-25 Identities = 51/153 (33%), Positives = 84/153 (54%), Gaps = 5/153 (3%) Frame = -1 Query: 599 SWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSY----PLTAF 432 +W I LP +EGGL I+ L D+ AFS KLWW+F +++W++++ KY Y T Sbjct: 268 TWNKITLPSSEGGLDIKGLEDVFEAFSMKLWWKFQTCNNIWSKFMRAKYCYGRIPGYTQP 327 Query: 431 SVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPV-GAPNPSVCD 255 H S +W+R++ + +RW +G G + FW D W D PL PV + VC Sbjct: 328 KRHDSQMWKRMLACYLVTEQHMRWKIGKGELFFWYDCWMGDEPLINRFPVFSSSMTQVCY 387 Query: 254 LWSDSAWDMDRLHALCATSVLSPDQVVTLSRTP 156 ++++ WD+D+L+ ++L + VV + + P Sbjct: 388 FFNNNEWDVDKLN-----TMLPEEMVVEILKIP 415 Score = 38.9 bits (89), Expect(2) = 7e-25 Identities = 19/45 (42%), Positives = 24/45 (53%) Frame = -2 Query: 139 DVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWNPCLTPTIS 5 DV W T G+FT SAW +R R S+F IW+ C+ T S Sbjct: 422 DVAYWVPTSDGDFTTKSAWEIIRQRDLVNSVFNLIWHRCIPLTTS 466 >ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial [Solanum lycopersicum] Length = 451 Score = 105 bits (263), Expect(2) = 7e-25 Identities = 50/149 (33%), Positives = 78/149 (52%), Gaps = 6/149 (4%) Frame = -1 Query: 617 KKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY---SY 447 +K HW SWK++ P EGG+G+R L D+ +F +K WW F + +LW +L KY S Sbjct: 166 RKYHWSSWKNLSYPYEEGGIGMRNLHDICKSFQFKQWWTFRTKHTLWGDFLKAKYCQRSN 225 Query: 446 PLT-AFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEY--RPVGA 276 P++ + S W+ ++ + +I+W L SG SFW D W G LA++ R + Sbjct: 226 
PVSKKWDTGESIAWKHMLATRQQGEQYIQWQLNSGNCSFWWDNWLGTGSLAQHTNRNIRF 285 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLS 189 N V D W + W+ +L T+ L+ Sbjct: 286 NNSKVADFWENGNWNWRKLEEQAPTTHLT 314 Score = 35.8 bits (81), Expect(2) = 7e-25 Identities = 12/33 (36%), Positives = 20/33 (60%) Frame = -2 Query: 127 WSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWN 29 W L HG+F+ SAW +R ++P+ F +W+ Sbjct: 333 WRLDSHGKFSCHSAWEEIRSKKPKNRFFNLLWH 365 >ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601483 [Solanum tuberosum] Length = 2019 Score = 110 bits (275), Expect(2) = 8e-25 Identities = 53/141 (37%), Positives = 77/141 (54%), Gaps = 6/141 (4%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY--- 453 E KK HW SWK++ P EGG+G+R L D+ AF YK WW F ++ +LW +L KY Sbjct: 399 EKKKYHWASWKNLSFPYEEGGIGMRNLKDVCIAFQYKQWWCFRSKQTLWGDFLKAKYCQR 458 Query: 452 SYPLT-AFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVG- 279 S P++ + S W+ LMH+ + I+W L SG SFW D W GPLA + Sbjct: 459 SNPISKKWDTGDSLTWKHLMHNKHKVEEHIQWKLNSGSCSFWWDNWLGVGPLARFSTDSN 518 Query: 278 -APNPSVCDLWSDSAWDMDRL 219 N +V + + W++++L Sbjct: 519 RLNNTTVAEFLVEGQWNVNKL 539 Score = 30.8 bits (68), Expect(2) = 8e-25 Identities = 12/33 (36%), Positives = 18/33 (54%) Frame = -2 Query: 127 WSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWN 29 W L G FT +SAW+ +R +R + IW+ Sbjct: 568 WKLNSDGNFTYSSAWNAIREKRTKTIFNTFIWH 600 >ref|XP_004253338.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 655 Score = 108 bits (270), Expect(2) = 4e-24 Identities = 52/137 (37%), Positives = 74/137 (54%), Gaps = 6/137 (4%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY--- 453 + KK HW SW+++ P++EGG+G+R L D+ TAF YK WW F + SLW+Q+L KY Sbjct: 164 DGKKYHWSSWENLAYPISEGGIGVRLLEDVCTAFQYKQWWDFRTKKSLWSQFLQAKYCQR 223 Query: 452 SYPLT-AFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRP--V 282 + P+ + S IWR L + + FI+W++ SG SFW D W + LA Sbjct: 224 ANPVAKKYDTGDSLIWRYLTRNRLKVESFIKWNINSGTCSFWWDNWLDIENLASQNEHIS 283 Query: 
281 GAPNPSVCDLWSDSAWD 231 N V D D W+ Sbjct: 284 SLNNSMVADFLKDGKWN 300 Score = 30.4 bits (67), Expect(2) = 4e-24 Identities = 14/40 (35%), Positives = 22/40 (55%) Frame = -2 Query: 148 GGEDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWN 29 G +D W T G F+++SAW +R +R ++ IWN Sbjct: 326 GKDDTAIWMPTETGIFSISSAWECIRKKRIIDNISTIIWN 365 >ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobroma cacao] gi|508778191|gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao] Length = 1275 Score = 105 bits (263), Expect(2) = 7e-24 Identities = 54/160 (33%), Positives = 77/160 (48%), Gaps = 5/160 (3%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSY- 447 E K++HW +W I P +EGGL IR L D+ AF+ KLWWRF DSLW +L KY Sbjct: 655 EGKRMHWATWNKITFPSSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLG 714 Query: 446 ---PLTAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVG 279 +H+S IW+R+ IRW +G G + W D W D PL + Sbjct: 715 RIPQYMQPKLHNSSIWKRMTGGQDVVIQNIRWKIGKGELFSWHDCWMGDQPLVISFPSFR 774 Query: 278 APNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRT 159 SV + +WD+D+L ++++ + RT Sbjct: 775 NDMSSVHKFYKGDSWDVDKLRLFLPVNLINEILPIPFDRT 814 Score = 32.3 bits (72), Expect(2) = 7e-24 Identities = 12/24 (50%), Positives = 17/24 (70%) Frame = -2 Query: 142 EDVMRWSLTGHGEFTVTSAWHHVR 71 +DV W+LT +GEF+ SAW +R Sbjct: 816 QDVAYWTLTSNGEFSTWSAWETIR 839 Score = 104 bits (259), Expect = 6e-20 Identities = 61/179 (34%), Positives = 87/179 (48%), Gaps = 5/179 (2%) Frame = -1 Query: 599 SWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYS----YPLTAF 432 +W +I P +EGGL I L D AFS KLWWRF SLWA+Y+ KY + A Sbjct: 377 AWHNITFPSSEGGLDICSLKDFFDAFSTKLWWRFDTCQSLWARYMRLKYCTGQIHHNIAP 436 Query: 431 SVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGAPN-PSVCD 255 H S W+RL+ + A IRW +G G + FW D W D PL P + + V Sbjct: 437 KPHDSATWKRLIDGRVTASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQSMMKVNY 496 Query: 254 LWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHCDFCMAS 78 ++D AWD+D+L + +++ + +SR +E +YW + DF S Sbjct: 497 
FFNDDAWDVDKLKTVIPNAIVDEILKIPISRE--------NEDIAYWALTPNGDFSTKS 547 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 116 bits (291), Expect = 1e-23 Identities = 69/201 (34%), Positives = 98/201 (48%), Gaps = 10/201 (4%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 + KK+HW W I PV+EGGL IR L D+ AFS KLWWRF +SLW ++L KY Sbjct: 1415 DGKKLHWTVWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLG 1474 Query: 443 LTAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 V H S +W+R++ A IRW +G G + FW D W D PLA P Sbjct: 1475 RIPHFVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFH 1534 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVH 99 + S V ++ WD+++L + TS++ + R+ E +YW + Sbjct: 1535 NDMSHVHKFYNGDVWDIEKLSSCLPTSLVDEILQIPFDRS--------QEDVAYWALTSN 1586 Query: 98 CDFCM-----ASRQVQAPSGI 51 DF + A RQ Q P+ + Sbjct: 1587 GDFSLWSAWEAIRQRQTPNAL 1607 >ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 601 Score = 109 bits (272), Expect(2) = 2e-23 Identities = 54/137 (39%), Positives = 78/137 (56%), Gaps = 6/137 (4%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKY--- 453 + KK HW SW+++ P+NEGG+G+R L D+ TAF YK WW F + SLW+Q+L KY Sbjct: 50 DRKKYHWSSWENLSYPINEGGIGVRLLEDVCTAFQYKQWWEFRTKKSLWSQFLKAKYCQR 109 Query: 452 SYPLT-AFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEY-RPVG 279 + P+ + S +WR L + I+WS+ SG SFW D W E+ LA + + Sbjct: 110 ANPVAKKYDSGDSIVWRYLTKNRHKFESLIKWSIRSGTYSFWLDNWLENDSLANHCDHIS 169 Query: 278 APNPS-VCDLWSDSAWD 231 + N S + D W D W+ Sbjct: 170 SLNKSRLDDFWIDGKWN 186 Score = 27.7 bits (60), Expect(2) = 2e-23 Identities = 13/40 (32%), Positives = 20/40 (50%) Frame = -2 Query: 148 GGEDVMRWSLTGHGEFTVTSAWHHVRYRRPQVSLFGDIWN 29 G ED W +FT++SAW +R +R + IW+ Sbjct: 212 GKEDTAIWIPDETVKFTISSAWKVIRKKRSHDPINNIIWH 
251 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 115 bits (288), Expect = 3e-23 Identities = 70/216 (32%), Positives = 104/216 (48%), Gaps = 10/216 (4%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSY- 447 + KK+HW +W I PV+EGGL IR L D+ AFS KLWWRF +SLW ++L KY Sbjct: 1172 DGKKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCLG 1231 Query: 446 ---PLTAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 L +H S +W+R++ A IRW +G G + FW D W D PLA P Sbjct: 1232 RIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFPSFH 1291 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVH 99 + S V ++ WD+ +L++ TS++ + R+ E +YW + Sbjct: 1292 NDMSHVHKFYNGDEWDIVKLNSYLPTSLVDEILQIPFDRS--------QEDVAYWALTSN 1343 Query: 98 CDFCMAS-----RQVQAPSGISFW*YLEPLPYPYHF 6 +F S RQ Q P+ + + + +P F Sbjct: 1344 GEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISF 1379 Score = 115 bits (288), Expect = 3e-23 Identities = 64/186 (34%), Positives = 94/186 (50%), Gaps = 5/186 (2%) Frame = -1 Query: 620 AKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYPL 441 +K+IHW SW I LP+ EGGL IR L D+ AFS KLWWRF +SLW Q++ KY Sbjct: 2967 SKRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCGGQ 3026 Query: 440 TAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVGA 276 V H S W+R++ S IRW +G G++ FW D W + PL + + Sbjct: 3027 LPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPLVIRNQEFAS 3086 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHC 96 V D + +++WD+++L SVL + V +++ P+ +YW + Sbjct: 3087 SMAQVSDFFLNNSWDIEKL-----KSVLQQEVVEEIAKIPI---NASSNDRAYWTPTPNG 3138 Query: 95 DFCMAS 78 DF S Sbjct: 3139 DFSTKS 3144 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 114 bits (286), Expect = 4e-23 
Identities = 64/187 (34%), Positives = 91/187 (48%), Gaps = 5/187 (2%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYS-- 450 ++ +IHW +W +I P +EGGLGIR L D AFS KLWWRF SLW +Y+ KY Sbjct: 799 DSTRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKYCTG 858 Query: 449 --YPLTAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 + A H S W+ L+ A IRW +G G + FW D W D PL P + Sbjct: 859 QIHHNIAPKPHDSATWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFS 918 Query: 275 PN-PSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVH 99 + V ++D AWD+D+L +++ + +SR E +YW + Sbjct: 919 QSMMKVNYFFNDDAWDVDKLKTFIPNAIVEEILKIPISRE--------KEDIAYWALTAN 970 Query: 98 CDFCMAS 78 DF + S Sbjct: 971 GDFSIKS 977 >ref|XP_007043999.1| Uncharacterized protein TCM_008793 [Theobroma cacao] gi|508707934|gb|EOX99830.1| Uncharacterized protein TCM_008793 [Theobroma cacao] Length = 270 Score = 114 bits (286), Expect = 4e-23 Identities = 67/198 (33%), Positives = 101/198 (51%), Gaps = 8/198 (4%) Frame = -1 Query: 620 AKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSY-- 447 +KKIHW +W I LP +EGGL IR L DM AFS KLWWRF +S W++++ KY Y Sbjct: 29 SKKIHWAAWNKITLPSSEGGLDIRGLGDMFEAFSMKLWWRFQTCNSSWSKFMKAKYCYGR 88 Query: 446 --PLTAFSVHHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPV-GA 276 T H S W+R++ +R +G G + FW D W +D PL + P + Sbjct: 89 IPRYTQPKRHDSQTWKRMLACCPVIEQHMRCKIGKGELFFWHDCWMDDEPLINHFPAFSS 148 Query: 275 PNPSVCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVHC 96 VC ++++ WD+D+L+ ++LS V + + P + +M +YW Sbjct: 149 SMTQVCYFFNNNEWDVDKLN-----TMLSEKMVAEILKIP--FNTSSTDM-AYWVPTSDG 200 Query: 95 DFCMASR---QVQAPSGI 51 DF S+ Q+ A G+ Sbjct: 201 DFTTKSKHNCQIAAGGGL 218 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 114 bits (285), Expect = 6e-23 Identities = 69/216 (31%), Positives = 99/216 (45%), Gaps = 10/216 (4%) Frame = -1 Query: 623 
EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 E KK+HW W I P EGGLGIR+L D+ AF+ KLWWRF +SLW Q+L KY Sbjct: 1592 ECKKMHWAEWAKISFPCAEGGLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLG 1651 Query: 443 LTAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 + H S +W+R++ A IRW +G G + FW D W D PLA P Sbjct: 1652 RIPHHIQPKLHDSHVWKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQ 1711 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVH 99 + S ++ WD+D+L + T ++ V ++ E +YW + Sbjct: 1712 NDMSHGYHFYNGDTWDVDKLRSFLPTILVEEILQVPFDKS--------REDVAYWTLTSN 1763 Query: 98 CDFCMAS-----RQVQAPSGISFW*YLEPLPYPYHF 6 DF S RQ Q + + + + +P F Sbjct: 1764 GDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISF 1799 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 114 bits (284), Expect = 7e-23 Identities = 64/187 (34%), Positives = 93/187 (49%), Gaps = 5/187 (2%) Frame = -1 Query: 623 EAKKIHWVSWKHICLPVNEGGLGIRRLSDMVTAFSYKLWWRF*ARDSLWAQYLWRKYSYP 444 + KK+HW +W I PV+EGGLGIR L D+ AFS KLWWRF +SLW ++L KY Sbjct: 1356 DGKKLHWTAWSKITFPVSEGGLGIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLKTKYCLG 1415 Query: 443 LTAFSV----HHSPIWRRLMHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGA 276 V H S +W+R++ A IRW +G G + FW D W D PL+ P Sbjct: 1416 RIPHFVQPKLHDSQVWKRMIFGRDVALQNIRWGIGKGELFFWHDCWMGDLPLSNLFPSFH 1475 Query: 275 PNPS-VCDLWSDSAWDMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPRRVH 99 + S V ++ WD+ +L++ S++ + R+ E +YW + Sbjct: 1476 NDMSHVHKFYNGDGWDIVKLNSCLPMSLIDEILQIPFDRS--------QEDIAYWALTSN 1527 Query: 98 CDFCMAS 78 DF + S Sbjct: 1528 GDFSLWS 1534