BLASTX nr result
ID: Mentha23_contig00012413
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00012413 (727 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   127   4e-35
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   122   8e-34
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   121   3e-33
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...   110   6e-32
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   112   3e-31
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   104   7e-31
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   115   2e-30
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   103   3e-30
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   112   4e-30
ref|XP_007043747.1| Uncharacterized protein TCM_008287 [Theobrom...   102   8e-27
ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261...   102   4e-26
ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601...   110   8e-26
ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein A...   111   1e-25
ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein A...   107   1e-25
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   100   5e-25
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   120   6e-25
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    92   2e-24
ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A...   101   4e-24
ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein A...
105 7e-24 ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom... 116 9e-24 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 127 bits (319), Expect(2) = 4e-35 Identities = 65/186 (34%), Positives = 105/186 (56%), Gaps = 5/186 (2%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYS 178 +K+IHW SW +I LP+ EGGL IR L+++ AFS KLWWRF+ DSLW +++ +C+ Sbjct: 1716 SKRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQ 1775 Query: 179 YPLTTF-SVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGTP 355 P+ T +H S W +++ SS +RW +G G V FW D W + PL T Sbjct: 1776 LPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTS 1835 Query: 356 N-PSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVHC 532 + VCD +++++WN+++L +VL + V +++ P+ ++ E+YW + Sbjct: 1836 SMVQVCDFFTNNSWNIEKL-----KTVLQQEVVDEIAKIPI---DTMNKDEAYWTPTPNG 1887 Query: 533 DFCLAS 550 DF S Sbjct: 1888 DFSTKS 1893 Score = 47.8 bits (112), Expect(2) = 4e-35 Identities = 22/48 (45%), Positives = 32/48 (66%) Frame = +1 Query: 580 LFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +F IW+ + T S FLWRLL +PV++ ++S+ + LASRC CC S Sbjct: 1907 VFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKS 1954 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 122 bits (305), Expect(2) = 8e-34 Identities = 61/174 (35%), Positives = 101/174 (58%), Gaps = 5/174 (2%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYS 178 +KKIHW SW +I LPI EGGL IR L+++ AFS KLWWRF+ DSLW +++ +C+ Sbjct: 1714 SKKIHWASWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQ 1773 Query: 179 YPL-TTFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPL-AEYRPVGT 352 P+ T +H S W ++V +S +RW +G G++ FW D W + PL + + + Sbjct: 1774 
LPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTSSNQELSL 1833 Query: 353 PNPSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYW 514 VCD + +++W++++L +VL + V +++ P+ + E+YW Sbjct: 1834 SMVQVCDFFMNNSWDIEKL-----KTVLQQEVVDEIAKIPI---DAMSKDEAYW 1879 Score = 48.9 bits (115), Expect(2) = 8e-34 Identities = 23/48 (47%), Positives = 32/48 (66%) Frame = +1 Query: 580 LFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +F IW+ + TIS FLWRLL +PV++ ++S+ LASRC CC S Sbjct: 1905 VFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKS 1952 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 121 bits (303), Expect(2) = 3e-33 Identities = 61/174 (35%), Positives = 98/174 (56%), Gaps = 5/174 (2%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYS 178 +KKIHW SW +I LP+ EGGL IR L+++ AFS KLWWRF+ DSLW +++ +C+ Sbjct: 1886 SKKIHWTSWAKISLPVKEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQ 1945 Query: 179 YPL-TTFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPL-AEYRPVGT 352 P+ T +H S W ++V SS +RW +G G + FW D W + PL + Sbjct: 1946 LPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGETPLISSNHEFSL 2005 Query: 353 PNPSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYW 514 VCD + +++W++++L +VL + V +++ P+ + E+YW Sbjct: 2006 SMVQVCDFFMNNSWDIEKL-----KTVLQQEVVDEIAKIPI---DAMSKDEAYW 2051 Score = 47.8 bits (112), Expect(2) = 3e-33 Identities = 22/48 (45%), Positives = 31/48 (64%) Frame = +1 Query: 580 LFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +F IW+ + T S FLWRLL +PV++ ++S+ LASRC CC S Sbjct: 2077 VFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQLASRCRCCRS 2124 >ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao] gi|508778195|gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 110 bits (275), Expect(3) = 6e-32 Identities = 56/160 (35%), Positives = 85/160 (53%), Gaps = 5/160 (3%) Frame = +2 Query: 5 
EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 E K++HW +W +I P +EGGL IR L+D+ AF+ KLWWRFQ DSLW +L KY Sbjct: 343 EGKRMHWAAWNKITFPCSEGGLDIRNLNDVFEAFTLKLWWRFQTCDSLWTHFLKTKYCLG 402 Query: 185 LTTFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 V H S +W +++ A IRW +G G + FW D W + PL P Sbjct: 403 RIPHYVHPKLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPLVMSFPSLR 462 Query: 353 PNPS-VCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRT 469 + S V + ++ W++D+L A +++ ++ +RT Sbjct: 463 NDMSLVHNFYNGDTWDVDKLKAYLPMNLIDEILLIPFNRT 502 Score = 42.7 bits (99), Expect(3) = 6e-32 Identities = 19/44 (43%), Positives = 29/44 (65%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 IW+ + +IS FLWR L +PV++ ++ + I LAS+C CC S Sbjct: 539 IWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCCNS 582 Score = 31.6 bits (70), Expect(3) = 6e-32 Identities = 12/29 (41%), Positives = 18/29 (62%) Frame = +3 Query: 486 EDVMRWSLTGHGKFTVTFAWHHVRYRRPS 572 +DV W+LT +G+F AW +R R+ S Sbjct: 504 QDVAYWTLTSNGEFATWSAWETIRQRKSS 532 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 112 bits (281), Expect(2) = 3e-31 Identities = 64/191 (33%), Positives = 90/191 (47%), Gaps = 9/191 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 ++ +IHW +W I P +EGGLGIR L D AFS KLWWRF SLW +Y+ KY Sbjct: 799 DSTRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKY--- 855 Query: 185 LTTFSVHH--------SPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYR 340 T +HH S W L+ A IRW +G G + FW D W D PL Sbjct: 856 -CTGQIHHNIAPKPHDSATWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSF 914 Query: 341 PVGTPN-PSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWP 517 P + + V ++D AW++D+L +++ + +SR E +YW Sbjct: 915 PSFSQSMMKVNYFFNDDAWDVDKLKTFIPNAIVEEILKIPISRE--------KEDIAYWA 966 Query: 518 WQVHCDFCLAS 550 + DF + S Sbjct: 967 LTANGDFSIKS 977 Score = 49.7 bits (117), Expect(2) = 
3e-31 Identities = 25/52 (48%), Positives = 35/52 (67%), Gaps = 1/52 (1%) Frame = +1 Query: 571 QVSLFGD-IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 QV+L G IW+ + T+S FLWR L LPV+V ++++ I LAS+C CC S Sbjct: 987 QVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKS 1038 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 104 bits (260), Expect(3) = 7e-31 Identities = 52/139 (37%), Positives = 74/139 (53%), Gaps = 5/139 (3%) Frame = +2 Query: 11 KKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYPLT 190 K+IHW +W ++ P +EGGL IRRL+DM AFS KLWWRF + LW ++L KY Sbjct: 1420 KRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQI 1479 Query: 191 TFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRP-VGTP 355 V H S +W ++V A RW +G G + FW D W D PL P Sbjct: 1480 PHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFPHFRND 1539 Query: 356 NPSVCDLWSDSAWNMDRLH 412 +V + ++ W++D+L+ Sbjct: 1540 MSTVHNFFNGHNWDVDKLN 1558 Score = 43.9 bits (102), Expect(3) = 7e-31 Identities = 18/44 (40%), Positives = 28/44 (63%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +W+ + +IS FLWR+ +PVD+ L+ + LAS+C CC S Sbjct: 1614 LWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCNS 1657 Score = 32.7 bits (73), Expect(3) = 7e-31 Identities = 12/31 (38%), Positives = 19/31 (61%) Frame = +3 Query: 486 EDVMRWSLTGHGKFTVTFAWHHVRYRRPSGI 578 +DV WSLT +G+F+ AW +R R+ + Sbjct: 1579 DDVAYWSLTSNGEFSTRSAWEAIRLRKSPNV 1609 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 114 bits (286), Expect(2) = 2e-30 Identities = 65/186 (34%), Positives = 97/186 (52%), Gaps = 5/186 (2%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKY-SYP 184 +K+IHW SW +I LPI EGGL IR L D+ AFS KLWWRF+ +SLW Q++ KY Sbjct: 2967 
SKRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCGGQ 3026 Query: 185 LTTF---SVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVGT 352 L T +H S W ++V S IRW +G G++ FW D W + PL + + Sbjct: 3027 LPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPLVIRNQEFAS 3086 Query: 353 PNPSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVHC 532 V D + +++W++++L SVL + V +++ P+ +YW + Sbjct: 3087 SMAQVSDFFLNNSWDIEKL-----KSVLQQEVVEEIAKIPI---NASSNDRAYWTPTPNG 3138 Query: 533 DFCLAS 550 DF S Sbjct: 3139 DFSTKS 3144 Score = 45.1 bits (105), Expect(2) = 2e-30 Identities = 21/44 (47%), Positives = 29/44 (65%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 IW+ + T S FLWRLL +PV++ ++S+ LASRC CC S Sbjct: 3162 IWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKS 3205 Score = 115 bits (287), Expect = 2e-23 Identities = 70/216 (32%), Positives = 105/216 (48%), Gaps = 10/216 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSY- 181 + KK+HW +W +I P++EGGL IR L D+ AFS KLWWRFQ +SLW ++L KY Sbjct: 1172 DGKKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCLG 1231 Query: 182 ---PLTTFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 L +H S +W +++ A IRW +G G + FW D W D PLA P Sbjct: 1232 RIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFPSFH 1291 Query: 353 PNPS-VCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVH 529 + S V ++ W++ +L++ TS++ + R+ E +YW + Sbjct: 1292 NDMSHVHKFYNGDEWDIVKLNSYLPTSLVDEILQIPFDRS--------QEDVAYWALTSN 1343 Query: 530 CDFCLAS-----RQVQAPLRYLSLVIFGTLALPLPF 622 +F S RQ Q P LS ++ L + F Sbjct: 1344 GEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISF 1379 >ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao] gi|508710337|gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 103 bits (258), Expect(3) = 3e-30 Identities = 54/160 (33%), Positives = 78/160 (48%), Gaps = 5/160 (3%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 E K++HW +W +I P 
+EGGL IR L D+ AF+ KLWWRF DSLW +L KY Sbjct: 391 EGKRMHWAAWNKITFPSSEGGLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLG 450 Query: 185 LTTFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 V H+S IW ++ RW +G G + FW D W D PL P Sbjct: 451 RIPHYVQPKLHNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFR 510 Query: 353 PNPS-VCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRT 469 + S V + +W++D+L +++ ++ RT Sbjct: 511 NDMSLVHKFYKGDSWDVDKLRLFLPVNLVDEILLIPFDRT 550 Score = 43.5 bits (101), Expect(3) = 3e-30 Identities = 20/52 (38%), Positives = 32/52 (61%) Frame = +1 Query: 568 PQVSLFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 P +L IW+ + +IS F+WR L +PV++ ++ + I LAS+C CC S Sbjct: 579 PHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCCNS 630 Score = 32.0 bits (71), Expect(3) = 3e-30 Identities = 12/28 (42%), Positives = 18/28 (64%) Frame = +3 Query: 486 EDVMRWSLTGHGKFTVTFAWHHVRYRRP 569 +DV W LT +G+F+ AW +R R+P Sbjct: 552 QDVAYWILTSNGEFSTRSAWETIRKRQP 579 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 112 bits (281), Expect(2) = 4e-30 Identities = 63/186 (33%), Positives = 94/186 (50%), Gaps = 5/186 (2%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYPL 187 +K+IHW SW +I LPI EGGL IR + D+ AFS KLWWRF+ +SLW Q++ KY Sbjct: 1679 SKRIHWASWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCGGQ 1738 Query: 188 TTFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVGT 352 V H S W ++V S IRW +G G + FW D W + PL + + Sbjct: 1739 LPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNRNQAFAS 1798 Query: 353 PNPSVCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVHC 532 V D + +++WN+++L +VL + V + + P+ ++YW + Sbjct: 1799 SMAQVSDFFLNNSWNVEKL-----KTVLQQEVVEEIVKIPI---DTSSNDKAYWTTTPNG 1850 Query: 533 DFCLAS 550 DF S Sbjct: 1851 DFSTKS 1856 Score = 45.8 bits (107), Expect(2) = 4e-30 Identities = 21/48 (43%), Positives = 31/48 (64%) Frame = +1 Query: 580 
LFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +F IW+ + T S FLWRLL +PV++ ++++ LASRC CC S Sbjct: 1870 VFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKS 1917 >ref|XP_007043747.1| Uncharacterized protein TCM_008287 [Theobroma cacao] gi|508707682|gb|EOX99578.1| Uncharacterized protein TCM_008287 [Theobroma cacao] Length = 499 Score = 102 bits (253), Expect(2) = 8e-27 Identities = 50/153 (32%), Positives = 85/153 (55%), Gaps = 5/153 (3%) Frame = +2 Query: 29 SWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSY----PLTTF 196 +W +I LP +EGGL I+ L D+ AFS KLWW+FQ +++W++++ KY Y T Sbjct: 268 TWNKITLPSSEGGLDIKGLEDVFEAFSMKLWWKFQTCNNIWSKFMRAKYCYGRIPGYTQP 327 Query: 197 SVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPV-GTPNPSVCD 373 H S +W +++ + +RW +G G + FW D W D PL PV + VC Sbjct: 328 KRHDSQMWKRMLACYLVTEQHMRWKIGKGELFFWYDCWMGDEPLINRFPVFSSSMTQVCY 387 Query: 374 LWSDSAWNMDRLHALCATSVLSPDQVVTLSRTP 472 ++++ W++D+L+ ++L + VV + + P Sbjct: 388 FFNNNEWDVDKLN-----TMLPEEMVVEILKIP 415 Score = 45.4 bits (106), Expect(2) = 8e-27 Identities = 23/49 (46%), Positives = 30/49 (61%) Frame = +1 Query: 577 SLFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 S+F IW+ C+ T S FLWRLL PVD+ L+ + LAS+C C S Sbjct: 451 SVFNLIWHRCIPLTTSFFLWRLLQNWSPVDLRLKIKGFQLASKCQYCNS 499 >ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum lycopersicum] Length = 1246 Score = 102 bits (253), Expect(3) = 4e-26 Identities = 49/137 (35%), Positives = 72/137 (52%), Gaps = 6/137 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKY 175 + KK HW SW+ + P +EGG+G+R L D+ TAF Y WW F+ ++SLW+Q+L +C+ Sbjct: 710 DGKKYHWSSWENMAYPTSEGGIGVRLLEDVCTAFQYMQWWDFRTKNSLWSQFLKAKYCQR 769 Query: 176 SYPLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRP--V 346 + PL + S +W L + + I+W + SG SFW D W ++ LA Sbjct: 770 ANPLAKKYDSGDSLVWRYLTRNRLKVESLIKWQIHSGTSSFWWDNWLDNENLASQSDHIS 829 Query: 347 GTPNPSVCDLWSDSAWN 397 N V D D WN Sbjct: 830 SLNNGVVTDFIKDGKWN 846 Score = 33.5 bits (75), Expect(3) = 
4e-26 Identities = 19/44 (43%), Positives = 25/44 (56%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 IW+ L I+ F+WR L G+LP + LQ R S+C CC S Sbjct: 909 IWHKHLPFKIAFFIWRALKGKLPTNELLQ-RFGSAISKCYCCYS 951 Score = 29.6 bits (65), Expect(3) = 4e-26 Identities = 13/30 (43%), Positives = 16/30 (53%) Frame = +3 Query: 480 GGEDVMRWSLTGHGKFTVTFAWHHVRYRRP 569 G ED W T G FT+ AW +R +RP Sbjct: 872 GKEDNAIWIPTETGNFTIASAWECIRNKRP 901 >ref|XP_006367184.1| PREDICTED: uncharacterized protein LOC102601483 [Solanum tuberosum] Length = 2019 Score = 110 bits (276), Expect(3) = 8e-26 Identities = 52/141 (36%), Positives = 77/141 (54%), Gaps = 6/141 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKY 175 E KK HW SWK + P EGG+G+R L D+ AF YK WW F+++ +LW +L +C+ Sbjct: 399 EKKKYHWASWKNLSFPYEEGGIGMRNLKDVCIAFQYKQWWCFRSKQTLWGDFLKAKYCQR 458 Query: 176 SYPLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 S P++ + S W L+H+ + I+W L SG SFW D W GPLA + Sbjct: 459 SNPISKKWDTGDSLTWKHLMHNKHKVEEHIQWKLNSGSCSFWWDNWLGVGPLARFSTDSN 518 Query: 353 --PNPSVCDLWSDSAWNMDRL 409 N +V + + WN+++L Sbjct: 519 RLNNTTVAEFLVEGQWNVNKL 539 Score = 26.6 bits (57), Expect(3) = 8e-26 Identities = 10/26 (38%), Positives = 14/26 (53%) Frame = +3 Query: 501 WSLTGHGKFTVTFAWHHVRYRRPSGI 578 W L G FT + AW+ +R +R I Sbjct: 568 WKLNSDGNFTYSSAWNAIREKRTKTI 593 Score = 26.6 bits (57), Expect(3) = 8e-26 Identities = 16/44 (36%), Positives = 22/44 (50%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 IW+ + S LWR L G+LP + L S A+ C CC + Sbjct: 598 IWHKSIPFKTSFLLWRTLRGKLPTNEKLISFGNEPAN-CFCCCN 640 Score = 40.0 bits (92), Expect(3) = 5e-06 Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 2/58 (3%) Frame = +2 Query: 242 IHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT--PNPSVCDLWSDSAWNMDRL 409 I A I+W + SG SFW D W GPLA Y N SV + + WN+ ++ Sbjct: 797 IGAGSNIQWRIRSGSCSFWWDNWLGVGPLAHYTSNSNRFNNDSVSEFIEEGHWNIPKV 854 Score = 29.6 bits (65), Expect(3) = 5e-06 Identities = 
17/43 (39%), Positives = 22/43 (51%) Frame = +1 Query: 595 WNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 W+P + S LWR + G+LP + L S I S C CC S Sbjct: 896 WHPKIPFKCSFLLWRAIRGKLPTNEKLLSFGI-EPSDCHCCHS 937 Score = 26.2 bits (56), Expect(3) = 5e-06 Identities = 9/22 (40%), Positives = 14/22 (63%) Frame = +3 Query: 501 WSLTGHGKFTVTFAWHHVRYRR 566 W L G F+V+ AW+ +R +R Sbjct: 865 WKLNSSGLFSVSSAWNSIREKR 886 >ref|XP_004237273.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 601 Score = 111 bits (277), Expect(3) = 1e-25 Identities = 54/137 (39%), Positives = 78/137 (56%), Gaps = 6/137 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKY 175 + KK HW SW+ + PINEGG+G+R L D+ TAF YK WW F+ + SLW+Q+L +C+ Sbjct: 50 DRKKYHWSSWENLSYPINEGGIGVRLLEDVCTAFQYKQWWEFRTKKSLWSQFLKAKYCQR 109 Query: 176 SYPLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEY-RPVG 349 + P+ + S +W L + I+WS+ SG SFW D W E+ LA + + Sbjct: 110 ANPVAKKYDSGDSIVWRYLTKNRHKFESLIKWSIRSGTYSFWLDNWLENDSLANHCDHIS 169 Query: 350 TPNPS-VCDLWSDSAWN 397 + N S + D W D WN Sbjct: 170 SLNKSRLDDFWIDGKWN 186 Score = 28.1 bits (61), Expect(3) = 1e-25 Identities = 16/44 (36%), Positives = 22/44 (50%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 IW+ + IS F+W L G+LP + LQ + C CC S Sbjct: 249 IWHKHIPFKISFFIWGALTGKLPTNEILQRLGRDIVD-CYCCYS 291 Score = 24.3 bits (51), Expect(3) = 1e-25 Identities = 11/29 (37%), Positives = 15/29 (51%) Frame = +3 Query: 480 GGEDVMRWSLTGHGKFTVTFAWHHVRYRR 566 G ED W KFT++ AW +R +R Sbjct: 212 GKEDTAIWIPDETVKFTISSAWKVIRKKR 240 >ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial [Solanum lycopersicum] Length = 451 Score = 107 bits (267), Expect(3) = 1e-25 Identities = 50/149 (33%), Positives = 78/149 (52%), Gaps = 6/149 (4%) Frame = +2 Query: 11 KKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYSY 181 +K HW SWK + P EGG+G+R L D+ +F +K WW F+ + +LW +L +C+ S Sbjct: 166 
RKYHWSSWKNLSYPYEEGGIGMRNLHDICKSFQFKQWWTFRTKHTLWGDFLKAKYCQRSN 225 Query: 182 PLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEY--RPVGT 352 P++ + S W ++ + +I+W L SG SFW D W G LA++ R + Sbjct: 226 PVSKKWDTGESIAWKHMLATRQQGEQYIQWQLNSGNCSFWWDNWLGTGSLAQHTNRNIRF 285 Query: 353 PNPSVCDLWSDSAWNMDRLHALCATSVLS 439 N V D W + WN +L T+ L+ Sbjct: 286 NNSKVADFWENGNWNWRKLEEQAPTTHLT 314 Score = 30.0 bits (66), Expect(3) = 1e-25 Identities = 10/23 (43%), Positives = 14/23 (60%) Frame = +3 Query: 501 WSLTGHGKFTVTFAWHHVRYRRP 569 W L HGKF+ AW +R ++P Sbjct: 333 WRLDSHGKFSCHSAWEEIRSKKP 355 Score = 26.2 bits (56), Expect(3) = 1e-25 Identities = 15/50 (30%), Positives = 23/50 (46%) Frame = +1 Query: 568 PQVSLFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CC 717 P+ F +W+ + S LWR + +LP + L + I S C CC Sbjct: 355 PKNRFFNLLWHNSIPFKASFLLWRAIKRKLPTNEKLTNIGI-EPSHCFCC 403 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 100 bits (250), Expect(3) = 5e-25 Identities = 52/137 (37%), Positives = 72/137 (52%), Gaps = 6/137 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 + KK HW SW + P NEGG+G+R + DM TAF YK WW F+ +SLW+++L KY+ Sbjct: 904 DGKKYHWSSWNNMAFPTNEGGIGVRLIEDMCTAFQYKQWWAFRTNNSLWSKFLKAKYNQR 963 Query: 185 LT----TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLA-EYRPVG 349 ++ S +W L + I+W + SG SFW D W D PLA + V Sbjct: 964 ANPVAKKYNTGDSIVWRYLTRNRQKVESLIKWHIQSGTCSFWWDCWL-DKPLAMQCDHVS 1022 Query: 350 TPNPS-VCDLWSDSAWN 397 + N S V D + WN Sbjct: 1023 SLNNSVVADFLINGNWN 1039 Score = 32.0 bits (71), Expect(3) = 5e-25 Identities = 17/42 (40%), Positives = 23/42 (54%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CC 717 IW+ + +S F+WR L G+LP + LQ L S C CC Sbjct: 1102 IWHKQIPFKVSFFIWRALRGKLPTNENLQRIGKNL-SDCYCC 1142 Score = 28.5 bits (62), Expect(3) = 5e-25 Identities = 11/29 (37%), Positives = 17/29 (58%) Frame = +3 Query: 480 GGEDVMRWSLTGHGKFTVTFAWHHVRYRR 566 G D W+ T G+FT++ AW +R +R Sbjct: 1065 
GNIDTSIWTPTESGQFTISSAWDSIRKKR 1093 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 120 bits (300), Expect = 6e-25 Identities = 72/216 (33%), Positives = 105/216 (48%), Gaps = 10/216 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 + KK+HW W +I P++EGGL IR L D+ AFS KLWWRFQ +SLW ++L KY Sbjct: 1415 DGKKLHWTVWSKITFPVSEGGLDIRNLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLG 1474 Query: 185 LTTFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 V H S +W +++ A IRW +G G + FW D W D PLA P Sbjct: 1475 RIPHFVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFH 1534 Query: 353 PNPS-VCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVH 529 + S V ++ W++++L + TS++ + R+ E +YW + Sbjct: 1535 NDMSHVHKFYNGDVWDIEKLSSCLPTSLVDEILQIPFDRS--------QEDVAYWALTSN 1586 Query: 530 CDFCL-----ASRQVQAPLRYLSLVIFGTLALPLPF 622 DF L A RQ Q P SL+ ++ L + F Sbjct: 1587 GDFSLWSAWEAIRQRQTPNALFSLIWHRSIPLSISF 1622 >ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao] gi|508787492|gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 91.7 bits (226), Expect(2) = 2e-24 Identities = 50/164 (30%), Positives = 85/164 (51%), Gaps = 5/164 (3%) Frame = +2 Query: 74 IRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYSYPLTTF-SVHHSPIWCKLVHSS 241 + L+++ AFS KLWWRF+ DSLW +++ +C+ P+ T +H S W +++ SS Sbjct: 397 VNSLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLTSS 456 Query: 242 IHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGTPN-PSVCDLWSDSAWNMDRLHAL 418 +RW +G G + FW D W D PL T + VCD + +++WN+++L Sbjct: 457 ATTEQHMRWRVGQGNLFFWHDCWMGDAPLISSNQEFTSSMVQVCDFFMNNSWNVEKL--- 513 Query: 419 CATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVHCDFCLAS 550 +VL + V +++ P+ + E+YW + DF S Sbjct: 514 --KTVLQQEVVDEIAKIPI---DTMSKDEAYWTPTPNGDFSTKS 552 Score = 47.8 bits (112), Expect(2) = 2e-24 Identities = 22/48 (45%), Positives = 32/48 (66%) Frame = +1 Query: 
580 LFGDIWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CCAS 723 +F IW+ + T S FLWRLL +PV++ ++S+ + LASRC CC S Sbjct: 566 VFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKS 613 >ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 955 Score = 101 bits (251), Expect(3) = 4e-24 Identities = 47/137 (34%), Positives = 73/137 (53%), Gaps = 6/137 (4%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKY 175 + KK HW SW+ + P NEGG+G+R L D+ AF YK WW F+ ++SLW+++L +CK Sbjct: 404 DGKKYHWASWETLAYPTNEGGIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYCKR 463 Query: 176 SYPLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAE--YRPV 346 + P+ + +S +W + +I+W++ SG SFW D W + LA Sbjct: 464 ANPVAKKYDTGNSLVWRYFTRNRQAVESYIKWNIHSGSSSFWWDNWLGNEALANQVINIS 523 Query: 347 GTPNPSVCDLWSDSAWN 397 N V D ++ WN Sbjct: 524 SLNNIHVSDFLTNGIWN 540 Score = 28.5 bits (62), Expect(3) = 4e-24 Identities = 11/31 (35%), Positives = 18/31 (58%) Frame = +3 Query: 486 EDVMRWSLTGHGKFTVTFAWHHVRYRRPSGI 578 ED W+ +GKFT+ AW +R ++ + I Sbjct: 568 EDTAIWTPEENGKFTIASAWEVIRKKKSTDI 598 Score = 28.5 bits (62), Expect(3) = 4e-24 Identities = 16/42 (38%), Positives = 23/42 (54%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CC 717 +W+ + IS F+WR L G+LP LQ + A+ C CC Sbjct: 603 VWHKHIPFKISFFIWRALRGKLPTYDYLQ-KFGSNATDCYCC 643 >ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 775 Score = 105 bits (263), Expect(3) = 7e-24 Identities = 48/140 (34%), Positives = 74/140 (52%), Gaps = 6/140 (4%) Frame = +2 Query: 8 AKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYL---WCKYS 178 +KK HW SWK + P EGG+G+R L+D+ +F +K WW F+ + +LW +L +C+ S Sbjct: 224 SKKYHWSSWKNLSYPYEEGGVGMRNLNDVCKSFQFKQWWTFRTKQTLWGDFLRAKYCQRS 283 Query: 179 YPLT-TFSVHHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEY--RPVG 349 P++ + S W ++ I+W L +G SFW D W GPLA++ + Sbjct: 284 NPVSKKWDTGQSLTWKHMLAIRQQVEQHIQWQLQAGNCSFWWDNWMGTGPLAQHTCNNIR 343 Query: 350 
TPNPSVCDLWSDSAWNMDRL 409 N V D W + WN +L Sbjct: 344 LNNSKVADFWENGVWNYRKL 363 Score = 26.9 bits (58), Expect(3) = 7e-24 Identities = 14/42 (33%), Positives = 22/42 (52%) Frame = +1 Query: 592 IWNPCLTPTISVFLWRLLLGRLPVDVGLQSRRIFLASRC*CC 717 +W+ + S LWR+L G++P + L + I S C CC Sbjct: 422 LWHNFIPFKTSFLLWRILKGKIPTNEKLTNFGI-EPSPCYCC 462 Score = 24.6 bits (52), Expect(3) = 7e-24 Identities = 9/27 (33%), Positives = 14/27 (51%) Frame = +3 Query: 486 EDVMRWSLTGHGKFTVTFAWHHVRYRR 566 +D W L GKF+ AW +R ++ Sbjct: 387 QDQPVWKLHSQGKFSCHSAWEEIRNKK 413 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 116 bits (290), Expect = 9e-24 Identities = 69/202 (34%), Positives = 102/202 (50%), Gaps = 8/202 (3%) Frame = +2 Query: 5 EAKKIHWISWKQICLPINEGGLGIRRLSDMVTAFSYKLWWRFQARDSLWAQYLWCKYSYP 184 + KK+HW +W +I P++EGGLGIR L D+ AFS KLWWRFQ +SLW ++L KY Sbjct: 1356 DGKKLHWTAWSKITFPVSEGGLGIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLKTKYCLG 1415 Query: 185 LTTFSV----HHSPIWCKLVHSSIHAHDFIRWSLGSGRVSFWDDTWFEDGPLAEYRPVGT 352 V H S +W +++ A IRW +G G + FW D W D PL+ P Sbjct: 1416 RIPHFVQPKLHDSQVWKRMIFGRDVALQNIRWGIGKGELFFWHDCWMGDLPLSNLFPSFH 1475 Query: 353 PNPS-VCDLWSDSAWNMDRLHALCATSVLSPDQVVTLSRTPVLWGGGCHEMESYWPWQVH 529 + S V ++ W++ +L++ S++ + R+ E +YW + Sbjct: 1476 NDMSHVHKFYNGDGWDIVKLNSCLPMSLIDEILQIPFDRS--------QEDIAYWALTSN 1527 Query: 530 CDFCLAS---RQVQAPLRYLSL 586 DF L S ++QA LR L L Sbjct: 1528 GDFSLWSAWEAELQALLRGLLL 1549