BLASTX nr result
ID: Sinomenium22_contig00020797
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium22_contig00020797 (1760 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom... 194 1e-46 ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom... 185 5e-44 ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom... 185 5e-44 ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom... 184 8e-44 ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom... 184 1e-43 ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom... 182 3e-43 ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom... 179 4e-42 ref|XP_007010390.1| Retrotransposon, unclassified-like protein [... 174 1e-40 ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron sp... 173 2e-40 ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom... 168 6e-39 ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom... 168 8e-39 ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom... 164 1e-37 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 162 6e-37 ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom... 159 5e-36 ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A... 153 2e-34 ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein A... 150 2e-33 ref|XP_007014716.1| Uncharacterized protein TCM_040145 [Theobrom... 150 2e-33 ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262... 148 7e-33 ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein A... 148 7e-33 ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein A... 147 2e-32 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 194 bits (493), Expect = 1e-46 Identities = 98/265 (36%), Positives = 149/265 (56%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F LV KI ++ W+ ++LS GGR+ LLR LSS+PI+LL V Sbjct: 2882 ITYLGAPLFKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSTLSSLPIYLLQV 2941 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P+ RIN + NFLWG +++HW +W K+ P E GL +R+ ++V ++ M Sbjct: 2942 LKPPIIVLERINRLFNNFLWGGSASSKRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSM 3001 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F RAKY G+ + + DSQ K ++ S N+ + Sbjct: 3002 KLWWRFRTTNSLWMQFMRAKYCGGQLPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGH 3061 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL +R +E ++ +++D +++ +WD+E LK ++ + V EI + Sbjct: 3062 GKLFFWHDCWMGEEPLVIRNQEFASSMAQVSDFFLNNSWDIEKLKSVLQQEVVEEIAKIP 3121 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA 1669 I D W P+ +G+FST +A Sbjct: 3122 -INASSNDRAYWTPTPNGDFSTKSA 3145 Score = 177 bits (449), Expect = 1e-41 Identities = 92/268 (34%), Positives = 145/268 (54%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 +TYLGAPL G+ +F+ L+ KIR++++ W+ ++LS GGR+ LLR VLSS P++LL V Sbjct: 1088 VTYLGAPLHKGQKKVILFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQV 1147 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV +I + +FLWG+ +K+HW AWSK+ P E GL +R+ ++V + + Sbjct: 1148 LKPPVTVIEKIERLFNSFLWGDSCDGKKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSL 1207 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY GR + DSQ+ K ++ + N+ + Sbjct: 1208 KLWWRFQTCNSLWTRFLRTKYCLGRIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGK 1267 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL+ N+ ++ + WD+ L + ++ +EI+Q Sbjct: 1268 GELFFWHDCWMGDQPLATLFPSFHNDMSHVHKFYNGDEWDIVKLNSYLPTSLVDEILQIP 1327 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*EV 1678 F R D+ W + +G FS +A E+ Sbjct: 1328 F-DRSQEDVAYWALTSNGEFSFWSAWEI 1354 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 185 bits (470), Expect = 5e-44 Identities = 97/265 (36%), Positives = 148/265 (55%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F LV KI ++ W+ ++LS GGR+ LLR VL+S+PI+LL V Sbjct: 1629 ITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQV 1688 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P+ R+N I +FLWG +K+HW +W+K+ P KE GL +R+ EV + M Sbjct: 1689 LKPPICVLERVNRIFNSFLWGGSAASKKIHWASWAKISLPIKEGGLDIRNLAEVFEAFSM 1748 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY +G+ + DSQ K ++ + N+ + Sbjct: 1749 KLWWRFRTIDSLWTRFMRMKYCRGQLPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVGQ 1808 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL+ +E+ + +++ D +++ +WD+E LK ++ + V +EI + Sbjct: 1809 GKLFFWHDCWMGETPLTSSNQELSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIP 1868 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA 1669 I D W P+ +G FST +A Sbjct: 1869 -IDAMSKDEAYWAPTPNGEFSTKSA 1892 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 185 bits (470), Expect = 5e-44 Identities = 97/265 (36%), Positives = 146/265 (55%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F LV KI ++ W+ + LS GGR+ LLR LSS+PI+LL V Sbjct: 1594 ITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPGGRITLLRSTLSSLPIYLLQV 1653 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV RIN ++ NFLWG +++HW +W K+ P E GL +R+ ++V + M Sbjct: 1654 LKPPVIVLERINRLLNNFLWGGSTASKRIHWASWGKIALPIAEGGLDIRNVEDVCEAFSM 1713 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F RAKY G+ + + DSQ K ++ S N+ + Sbjct: 1714 KLWWRFRTTNSLWTQFMRAKYCGGQLPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIGH 1773 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL R + ++ +++D +++ +W+VE LK ++ + V EI++ Sbjct: 1774 GELFFWHDCWMGEEPLVNRNQAFASSMAQVSDFFLNNSWNVEKLKTVLQQEVVEEIVKIP 1833 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA 1669 I D W + +G+FST +A Sbjct: 1834 -IDTSSNDKAYWTTTPNGDFSTKSA 1857 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 184 bits (468), Expect = 8e-44 Identities = 96/265 (36%), Positives = 148/265 (55%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F LV KI ++ W+ ++LS GGR+ LLR VL+S+PI+LL V Sbjct: 1631 ITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQV 1690 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV R+N + +FLWG +++HW +W+K+ P E GL +R EV + M Sbjct: 1691 LKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSM 1750 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY +G+ Q + DSQ K +L + + ++ + Sbjct: 1751 KLWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQ 1810 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 GN FFWHD W+ PL +E ++ +++ D + + +W++E LK ++ + V +EI + Sbjct: 1811 GNVFFWHDCWMGEAPLISSNQEFTSSMVQVCDFFTNNSWNIEKLKTVLQQEVVDEIAKIP 1870 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA 1669 I D W P+ +G+FST +A Sbjct: 1871 -IDTMNKDEAYWTPTPNGDFSTKSA 1894 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 184 bits (467), Expect = 1e-43 Identities = 98/265 (36%), Positives = 145/265 (54%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F LV KI ++ W+ ++LS GGR+ LL+ VL+S+PI+L V Sbjct: 1801 ITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLKSVLTSLPIYLFQV 1860 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV RIN I +FLWG +K+HW +W+K+ P KE GL +R EV + M Sbjct: 1861 LKPPVCVLERINRIFNSFLWGGSAASKKIHWTSWAKISLPVKEGGLDIRSLAEVFEAFSM 1920 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY +G+ + DSQ K ++ + + N+ + Sbjct: 1921 KLWWRFRTTDSLWTRFMRMKYCRGQLPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVGQ 1980 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 GN FFWHD W+ PL E + +++ D +++ +WD+E LK ++ + V +EI + Sbjct: 1981 GNLFFWHDCWMGETPLISSNHEFSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIP 2040 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA 1669 I D W P+ +G FST +A Sbjct: 2041 -IDAMSKDEAYWAPTPNGEFSTKSA 2064 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 182 bits (463), Expect = 3e-43 Identities = 95/268 (35%), Positives = 148/268 (55%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 +TYLGAPL G +F+ L+ KIR++++ W+ ++LS GGR+ LLR VLSS+P++LL V Sbjct: 1508 VTYLGAPLHKGPKKVLLFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSLPMYLLQV 1567 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV RI+ + +FLWG+ +K+HW W+K+ P E GLG+R ++V + + Sbjct: 1568 LKPPVTVIERIDRLFNSFLWGDSTECKKMHWAEWAKISFPCAEGGLGIRKLEDVCAAFTL 1627 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY GR + DS + K ++ + N+ + Sbjct: 1628 KLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQPKLHDSHVWKRMISGREMALQNIRWKIGK 1687 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G+ FFWHD W+ PL+ E QN+ + TWDV+ L+ + + EI+Q Sbjct: 1688 GDLFFWHDCWMGDKPLAASFPEFQNDMSHGYHFYNGDTWDVDKLRSFLPTILVEEILQVP 1747 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*EV 1678 F + D+ W + +G+FST +A E+ Sbjct: 1748 F-DKSREDVAYWTLTSNGDFSTRSAWEM 1774 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 179 bits (454), Expect = 4e-42 Identities = 93/267 (34%), Positives = 145/267 (54%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 +TYLGAPL G +F+ L+ KIR++++ W+ ++LS GGR+ LLR VLSS P++LL V Sbjct: 1331 VTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQV 1390 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV +I I +FLWG+ + +K+HW WSK+ P E GL +R+ ++V + + Sbjct: 1391 LKPPVTVIEKIERIFNSFLWGDSNDGKKLHWTVWSKITFPVSEGGLDIRNLRDVFEAFSL 1450 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F R KY GR + DSQ+ K ++ + N+ + Sbjct: 1451 KLWWRFQTCNSLWTKFLRTKYCLGRIPHFVQPKLHDSQVWKRMIVGRDVALQNIRWRIGK 1510 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL+ N+ ++ + WD+E L + ++ +EI+Q Sbjct: 1511 GELFFWHDCWMGDQPLATLCPSFHNDMSHVHKFYNGDVWDIEKLSSCLPTSLVDEILQIP 1570 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*E 1675 F R D+ W + +G+FS +A E Sbjct: 1571 F-DRSQEDVAYWALTSNGDFSLWSAWE 1596 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 174 bits (441), Expect = 1e-40 Identities = 91/268 (33%), Positives = 139/268 (51%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLGAPL G +F+ L+ KIR ++ W+ ++LS GGR+ LLR VLSS+PI+LL V Sbjct: 715 ITYLGAPLFKGPKKVMLFDSLINKIRERITGWENKILSPGGRITLLRSVLSSMPIYLLQV 774 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P +I + +FLWG ++HW AW + P+ E GLG+R K+ + Sbjct: 775 LKPPACVIQKIERLFNSFLWGSSMDSTRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFSA 834 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R +SLW + R KY G+ + DS K +L + + + Sbjct: 835 KLWWRFDTCQSLWVRYMRLKYCTGQIHHNIAPKPHDSATWKPLLAGRATASQQIRWRIGK 894 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G+ FFWHD W+ PL + +++N + D WDV+ LK + + EI++ Sbjct: 895 GDIFFWHDAWMGDEPLVNSFPSFSQSMMKVNYFFNDDAWDVDKLKTFIPNAIVEEILKIP 954 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*EV 1678 I RE D+ W + +G+FS +A E+ Sbjct: 955 -ISREKEDIAYWALTANGDFSIKSAWEL 981 >ref|XP_002280704.1| PREDICTED: chloroplastic group IIA intron splicing facilitator CRS1, chloroplastic-like [Vitis vinifera] Length = 1184 Score = 173 bits (439), Expect = 2e-40 Identities = 99/218 (45%), Positives = 134/218 (61%), Gaps = 1/218 (0%) Frame = +1 Query: 139 PKQLHDSNKPLKFYLLSHTSPLHQTATKNPNPQTPTLPFFQKPFLSTNDYVKFPTDPWMA 318 P+++H P P+ T T N P +P T+ +K PT PWM Sbjct: 35 PQRIHSFKPP----------PISATTTATTNH--PDHSISSQPVSGTDAAIKMPTAPWMK 82 Query: 319 GSLLLPSNEVLNLSTLRTKKGKNRVQGLERTDLSLTEKISGGRGRRAMRRIVESITKLQE 498 G LLL NEVL+LS R KK G E+ D SLTEK+SGGRG +AM++I++SI KLQE Sbjct: 83 GPLLLQPNEVLDLSKARPKKVAGSA-GAEKPDRSLTEKVSGGRGAKAMKKIMQSIVKLQE 141 Query: 499 CANLEEPQKAVEKFELRVPLKPVFEEENSNSEVRMPWMTEEKIVFRR-MKERVVTKSELI 675 +E Q+ E+FE V L+ + +ENS +MPW+ EK+VFRR KE+VVT +EL Sbjct: 142 THTSDETQENTEEFEFGVSLEGIGGDENSRIGGKMPWLKTEKVVFRRTKKEKVVTAAELT 201 Query: 676 LSERVLKRLRNDAVKVTKWVEVKKAGVTEAIVDEIERI 789 L +L+RLR +AVK+ KWV+VKKAGVTE++VD+I + Sbjct: 202 LDPMLLERLRGEAVKMRKWVKVKKAGVTESVVDQIHMV 239 >ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao] gi|508710337|gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 168 bits (426), Expect = 6e-39 Identities = 89/267 (33%), Positives = 145/267 (54%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 +TYLGAPL G +F+ L+ KIR++++ W+ ++LS GGR+ LLR VLSS+P++LL V Sbjct: 307 VTYLGAPLHKGPKKVYLFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSLPMYLLQV 366 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P +I + +FLWG+ + +++HW AW+K+ P+ E GL +R+ K+V + + Sbjct: 367 LKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKITFPSSEGGLDIRNLKDVFDAFTL 426 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F + KY GR + +S + K I + N + Sbjct: 427 KLWWRFYTCDSLWTHFLKTKYCLGRIPHYVQPKLHNSSIWKRITGGRDVTIQNTRWKIGR 486 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL + +N+ ++ + +WDV+ L+ + + +EI+ Sbjct: 487 GELFFWHDCWMGDQPLVISFPSFRNDMSLVHKFYKGDSWDVDKLRLFLPVNLVDEILLIP 546 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*E 1675 F R D+ W + +G FST +A E Sbjct: 547 F-DRTQQDVAYWILTSNGEFSTRSAWE 572 >ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao] gi|508778195|gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao] Length = 879 Score = 168 bits (425), Expect = 8e-39 Identities = 88/267 (32%), Positives = 147/267 (55%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 + YLGAPL G +F+ L+ KIR++++ W+ ++LS GGR+ LLR VLSS+P++LL V Sbjct: 259 VIYLGAPLHKGPKKVFLFDSLITKIRDRISGWENKILSPGGRITLLRSVLSSLPMYLLQV 318 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P +I + +FLWG+ + +++HW AW+K+ P E GL +R+ +V + + Sbjct: 319 LKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKITFPCSEGGLDIRNLNDVFEAFTL 378 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F + KY GR + DS + K ++ N+ + Sbjct: 379 KLWWRFQTCDSLWTHFLKTKYCLGRIPHYVHPKLHDSLVWKRMIRGREVAFRNIRWKIGK 438 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G+ FFWHD W+ + PL + ++N+ +++ + TWDV+ LK + + +EI+ Sbjct: 439 GDLFFWHDCWMGNQPLVMSFPSLRNDMSLVHNFYNGDTWDVDKLKAYLPMNLIDEILLIP 498 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*E 1675 F R D+ W + +G F+T +A E Sbjct: 499 F-NRTQQDVAYWTLTSNGEFATWSAWE 524 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 164 bits (415), Expect = 1e-37 Identities = 87/267 (32%), Positives = 141/267 (52%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 + YLGAPL G +F+ L+ KIR++++ W+ + LS GGR+ LLR VLSS+P++LL V Sbjct: 1334 VIYLGAPLHKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRITLLRSVLSSLPLYLLQV 1393 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L PV +I + +FLWG+ +++HW AW K+ P E GL +R ++ + + Sbjct: 1394 LKPPVVVIEKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFSL 1453 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R + LW F + KY G+ + DSQ+ K ++ + N + Sbjct: 1454 KLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIGK 1513 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G+ FFWHD W+ PL +N+ +++ + WDV+ L + + +EI+Q Sbjct: 1514 GSLFFWHDCWMGDQPLVTSFPHFRNDMSTVHNFFNGHNWDVDKLNLYLPMNLVDEILQIP 1573 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*E 1675 I R D+ W + +G FST +A E Sbjct: 1574 -IDRSQDDVAYWSLTSNGEFSTRSAWE 1599 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 162 bits (409), Expect = 6e-37 Identities = 88/267 (32%), Positives = 141/267 (52%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 +TYLGAPL G +F+ L+ KIR++++ W+ ++LS G R+ LLR VLSS+P++LL V Sbjct: 1595 VTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSVLSSLPMYLLQV 1654 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 L P +I + +FLWG+ + +++HW AW+K+ P E GL +R+ K+V + + Sbjct: 1655 LKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKINFPCSEGGLDIRNLKDVFDAFTL 1714 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K +R SLW F + KY GR + S + K I + N + Sbjct: 1715 KLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQPKIHSSSIWKRITGGRDVTIQNTRWKIGR 1774 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTR 1594 G FFWHD W+ PL + +N+ ++ + +WDV+ L+ + + EI+ Sbjct: 1775 GELFFWHDCWMGDQPLVISFPSFRNDMSFVHKFYKGDSWDVDKLRLFLPVNLIYEILLIP 1834 Query: 1595 FIGREGPDLCVWKPSRDGNFSTATA*E 1675 F R D+ W + +G FST +A E Sbjct: 1835 F-DRTQQDVAYWTLTSNGEFSTKSAWE 1860 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 159 bits (401), Expect = 5e-36 Identities = 82/248 (33%), Positives = 135/248 (54%) Frame = +2 Query: 938 VEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSVLDIPVATYSRINSIMTNFLWG 1117 + +IR++++ W+ ++LS GGR+ LLR VLSS P++LL V+ PV +I + +FLWG Sbjct: 1293 IPQIRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQVIKPPVTVIEKIERLFNSFLWG 1352 Query: 1118 EVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHMKFAYRLLMAKSLWADFFRAKY 1297 + + +K+HW AWSK+ P E GLG+R+ ++V + +K +R SLW F + KY Sbjct: 1353 DSNDGKKLHWTAWSKITFPVSEGGLGIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLKTKY 1412 Query: 1298 LKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRGGNSFFWHDRWLASGPLSVRVE 1477 GR + DSQ+ K ++F + N+ + G FFWHD W+ PLS Sbjct: 1413 CLGRIPHFVQPKLHDSQVWKRMIFGRDVALQNIRWGIGKGELFFWHDCWMGDLPLSNLFP 1472 Query: 1478 EIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTRFIGREGPDLCVWKPSRDGNFS 1657 N+ ++ + WD+ L + ++ +EI+Q F R D+ W + +G+FS Sbjct: 1473 SFHNDMSHVHKFYNGDGWDIVKLNSCLPMSLIDEILQIPF-DRSQEDIAYWALTSNGDFS 1531 Query: 1658 TATA*EVK 1681 +A E + Sbjct: 1532 LWSAWEAE 1539 >ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 955 Score = 153 bits (387), Expect = 2e-34 Identities = 86/273 (31%), Positives = 143/273 (52%), Gaps = 2/273 (0%) Frame = +2 Query: 866 NFSITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHL 1045 N ITYLG PL G F +VEKI K++ W ++L+ GG++ L++HVL S+PIHL Sbjct: 317 NSPITYLGCPLYVGGQRIIYFSGIVEKIIRKISGWHAKILNFGGKITLVKHVLQSIPIHL 376 Query: 1046 LSVLDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRS 1225 L+ + P T I +++ +F WG +K HW +W + PT E G+G+R+ ++V + Sbjct: 377 LAAVSPPKTTLKYIKNVIADFFWGMDKDGKKYHWASWETLAYPTNEGGIGVRNLEDVCIA 436 Query: 1226 LHMKFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VF 1405 K + SLW+ F +AKY K + + +S + + V + + Sbjct: 437 FQYKQWWEFRTKNSLWSKFLKAKYCKRANPVAKKYDTGNSLVWRYFTRNRQAVESYIKWN 496 Query: 1406 MRGGNSFFWHDRWLASGPLSVRVEEIQN-NNLRINDCWIDQTWDVELLKDLVGETVANEI 1582 + G+S FW D WL + L+ +V I + NN+ ++D + W+ ++ V T+ +I Sbjct: 497 IHSGSSSFWWDNWLGNEALANQVINISSLNNIHVSDFLTNGIWNERYVRQHVPPTMVPDI 556 Query: 1583 IQTRF-IGREGPDLCVWKPSRDGNFSTATA*EV 1678 +QT+F D +W P +G F+ A+A EV Sbjct: 557 MQTQFKYNINIEDTAIWTPEENGKFTIASAWEV 589 >ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 440 Score = 150 bits (379), Expect = 2e-33 Identities = 82/269 (30%), Positives = 138/269 (51%), Gaps = 2/269 (0%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLG PL GR + F L+ K+ +++ W+ + LS GG+ +L +HVL ++PIHLL Sbjct: 49 ITYLGCPLFVGRPRNVYFSDLINKVVSRITGWQTKQLSYGGKAVLSKHVLQALPIHLLLA 108 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 + P +I ++ +F WG + ++K HW +W + P +E G+G+R+ ++V +S Sbjct: 109 VTPPTTIIRQIQMLIADFFWGWKNDRKKYHWSSWKNLSYPYEEGGIGMRNLQDVCKSFQF 168 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K + ++LW +F RAKY + + + +S K +L +V ++ ++ Sbjct: 169 KQWWVFRTKQTLWGEFLRAKYCQRSNPVCKKWDTGESLTWKHMLDTRQQVEQHIHWNLQA 228 Query: 1415 GNSFFWHDRWLASGPLSVRVEEIQN-NNLRINDCWIDQTWDVELLKDLVGETVANEIIQT 1591 GN FW D WL +GPL+ NN+ + + + W L T + I+ T Sbjct: 229 GNCSFWWDNWLGTGPLAQHTTSSNRFNNITVAEFLENGEWKWSKLMKHAPVTQLSSILAT 288 Query: 1592 RFIGRE-GPDLCVWKPSRDGNFSTATA*E 1675 R + PD +WKP+ G FS +A E Sbjct: 289 RIPQHQHRPDQAIWKPNTHGRFSCTSAWE 317 >ref|XP_007014716.1| Uncharacterized protein TCM_040145 [Theobroma cacao] gi|508785079|gb|EOY32335.1| Uncharacterized protein TCM_040145 [Theobroma cacao] Length = 249 Score = 150 bits (378), Expect = 2e-33 Identities = 80/245 (32%), Positives = 128/245 (52%) Frame = +2 Query: 923 MFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSVLDIPVATYSRINSIMT 1102 +F+ LV KI++++A W+ ++LS GGR+ LLR V SS+PI+LL V P RI+ + Sbjct: 2 LFDDLVAKIQDRIAEWENKVLSPGGRITLLRSVFSSLPIYLLQVFKPPTCVIERIDRLFN 61 Query: 1103 NFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHMKFAYRLLMAKSLWADF 1282 +FLW G RK+HW +W K+ P+ E GL +R +V ++ MK +R L S+W F Sbjct: 62 SFLWEGSTGTRKIHWASWHKITLPSNEGGLDIRGLGDVMQAFSMKLWWRFLTCNSIWTHF 121 Query: 1283 FRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRGGNSFFWHDRWLASGPL 1462 +KY + + DSQ K +L + S + + G FFWHD W+ PL Sbjct: 122 IWSKYCASQIPRNVKSKLWDSQTWKWMLASCSVIEQFTRCRIGKGELFFWHDCWMGEAPL 181 Query: 1463 SVRVEEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQTRFIGREGPDLCVWKPSR 1642 R ++ R+ + + WD+ L +++ E V +I++ I D W P+ Sbjct: 182 VSRYPSFASSTTRVCYFYDNGKWDLGKLNNVLPEEVVAKILKIS-IDPLSVDTAFWVPTS 240 Query: 1643 DGNFS 1657 +G F+ Sbjct: 241 NGQFT 245 >ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262707 [Solanum lycopersicum] Length = 764 Score = 148 bits (374), Expect = 7e-33 Identities = 78/273 (28%), Positives = 139/273 (50%), Gaps = 2/273 (0%) Frame = +2 Query: 866 NFSITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHL 1045 N I YLG PL G F +V+K+ K++ W+ ++L+ GG++ L++HVL S+PIH Sbjct: 304 NSPINYLGCPLYIGGQRIIYFFEVVDKVIKKISGWQSKILNFGGKITLIKHVLQSIPIHT 363 Query: 1046 LSVLDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRS 1225 L+ + P T + I +M +F WG +K HW +W + PT E G+G+R ++ ++ Sbjct: 364 LAAISPPKTTINHIKKLMADFFWGIDKEGKKYHWASWDTMAYPTNEGGIGVRLLDDICKA 423 Query: 1226 LHMKFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VF 1405 K + SLW++F +KY + H + DS + + + EV ++ Sbjct: 424 FQYKHWWDFRTKHSLWSNFLMSKYCQRAHPVAKKYNTGDSLMWRYLTRNRIEVEVHIRWH 483 Query: 1406 MRGGNSFFWHDRWLASGPLSVRVEEIQN-NNLRINDCWIDQTWDVELLKDLVGETVANEI 1582 ++ G S W D W +G ++ + + + NN+ + +C + W+ ++ V + I Sbjct: 484 IQSGTSSLWWDNWTGNGAIANYCDHVSSLNNMVLAECLTNGKWNERFIRQHVPAILIPHI 543 Query: 1583 IQTRFIGREG-PDLCVWKPSRDGNFSTATA*EV 1678 +QT +EG D +W P G FS A+A ++ Sbjct: 544 LQTCINYKEGAEDTAIWLPEESGKFSIASAWDI 576 >ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial [Solanum lycopersicum] Length = 451 Score = 148 bits (374), Expect = 7e-33 Identities = 81/269 (30%), Positives = 136/269 (50%), Gaps = 2/269 (0%) Frame = +2 Query: 875 ITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRHVLSSVPIHLLSV 1054 ITYLG PL GR + F L+ K+ +++ W+ + LS GG+ +L +HVL ++PIHLL+ Sbjct: 80 ITYLGCPLFVGRPRNTYFSNLINKVISRITGWQTKQLSFGGKAVLSKHVLQALPIHLLTA 139 Query: 1055 LDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGLRDFKEVQRSLHM 1234 + P +I ++ +F WG + +RK HW +W + P +E G+G+R+ ++ +S Sbjct: 140 VTPPKTIIKQIQMLIADFFWGWQNNRRKYHWSSWKNLSYPYEEGGIGMRNLHDICKSFQF 199 Query: 1235 KFAYRLLMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAVSEVMNNV*VFMRG 1414 K + +LW DF +AKY + + + +S K +L + + + Sbjct: 200 KQWWTFRTKHTLWGDFLKAKYCQRSNPVSKKWDTGESIAWKHMLATRQQGEQYIQWQLNS 259 Query: 1415 GNSFFWHDRWLASGPLSVRV-EEIQNNNLRINDCWIDQTWDVELLKDLVGETVANEIIQT 1591 GN FW D WL +G L+ I+ NN ++ D W + W+ L++ T I+ T Sbjct: 260 GNCSFWWDNWLGTGSLAQHTNRNIRFNNSKVADFWENGNWNWRKLEEQAPTTHLTNIMAT 319 Query: 1592 RFIG-REGPDLCVWKPSRDGNFSTATA*E 1675 ++ PD VW+ G FS +A E Sbjct: 320 AIPSQQQKPDQAVWRLDSHGKFSCHSAWE 348 >ref|XP_006367640.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum tuberosum] Length = 1035 Score = 147 bits (371), Expect = 2e-32 Identities = 89/281 (31%), Positives = 134/281 (47%), Gaps = 4/281 (1%) Frame = +2 Query: 839 RNMDRAREGNFSITYLGAPLVSGRLTSCMFEPLVEKIRNKVA*WKFRLLSQGGRLILLRH 1018 R + R+GNF +TYLG P+ GR S + +V+KI ++ W R L+ GG+ IL+ + Sbjct: 200 RRLTGIRQGNFPLTYLGCPVFYGRRKSSYYVEMVQKIAKRILTWHNRFLTFGGKWILINN 259 Query: 1019 VLSSVPIHLLSVLDIPVATYSRINSIMTNFLWGEVDGKRKVHWCAWSKVCKPTKEWGLGL 1198 VL S+P+++LS L P +I+ I F WG + G + HW AW +C P E GLG Sbjct: 260 VLQSMPVYMLSALKPPKKVLDQIHQIFAKFFWGNLGGIKGKHWVAWGDLCYPKTEGGLGF 319 Query: 1199 RDFKEVQRSLHMKFAYRL-LMAKSLWADFFRAKYLKGRHISQYTRRSTDSQL*KSILFAV 1375 R + ++L K + + SLW + KY K H T SQ+ + ++ Sbjct: 320 RSLHNMNKALFAKLWWNFRVSTTSLWVKYMWNKYCKKLHPVVATSLGA-SQVWRKMISIR 378 Query: 1376 SEVMNNV*VFMRGGNSFFWHDRWLASGPLSVRVEE-IQNNNLRINDCWIDQTWDVELLKD 1552 EV +++ ++ GNS FW D W G L + Q L + + WD LKD Sbjct: 379 EEVEHDIWWQIKAGNSSFWFDNWTRQGALYYTEGDCAQEEELEVQYFITNDGWDETKLKD 438 Query: 1553 LVGETVANEIIQT--RFIGREGPDLCVWKPSRDGNFSTATA 1669 L+ E + II EG D W + G F+ +A Sbjct: 439 LLSEEMVEHIILNIRPKTSEEGIDKAWWCGNLTGLFTVKSA 479