BLASTX nr result
ID: Atropa21_contig00037945
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atropa21_contig00037945 (611 letters) Database: nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004239567.1| PREDICTED: uncharacterized protein LOC101262... 115 1e-23 ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247... 108 1e-21 ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250... 100 4e-19 ref|XP_004253220.1| PREDICTED: uncharacterized protein LOC101264... 100 4e-19 ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260... 100 4e-19 gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] 98 2e-18 ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581... 96 6e-18 ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268... 96 8e-18 ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein A... 96 1e-17 ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258... 96 1e-17 gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] 94 3e-17 gb|AAD29058.1| putative non-LTR retroelement reverse transcripta... 94 4e-17 gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] 93 7e-17 ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261... 93 7e-17 gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] 92 9e-17 gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob... 92 1e-16 gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao] 92 1e-16 gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] 91 2e-16 ref|XP_004231462.1| PREDICTED: uncharacterized protein LOC101258... 91 2e-16 emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabid... 90 4e-16 >ref|XP_004239567.1| PREDICTED: uncharacterized protein LOC101262916 [Solanum lycopersicum] Length = 895 Score = 115 bits (287), Expect = 1e-23 Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 RR+KL I KI +G I+ E +A+E + F + N + ND L I MV+ D Sbjct: 280 RRRKLFIHKIATENGDWIQGENNIAQEACEHFHTIFTGE-NRYINDHNLECIPRMVNVDQ 338 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N QL K+ M+E+K +F +N S +GP+ +G F+ W+II++D+ + FF + Sbjct: 339 NTQLTKLPDMDEIKEVVFAMNPNSTAGPDGMNGYFFQKCWNIIKSDLIEVQHAFFSGQMI 398 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 SH PK +KL ++ + F + I K + +RL +LP LIS NQ F Sbjct: 399 PKYFSHSCIVLLPKVNNPNKLTEFRLISLSNFTSKIISKLVSNRLSPILPSLISTNQFGF 458 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN++L QEII I+K Sbjct: 459 VKGRSISENIMLAQEIIHQIKK 480 >ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247390 [Solanum lycopersicum] Length = 612 Score = 108 bits (270), Expect = 1e-21 Identities = 72/202 (35%), Positives = 109/202 (53%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 RR+KL I KI +G I+ E +A+ + + F +N H N+ L I MV++D Sbjct: 342 RRRKLFIHKIITENGDWIQGENNIAQNACDHFNAIFT-SENKHINEQNLECIPRMVNKDQ 400 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N QL K+ M+E+K +F +N SA+GP+ +G F+ +II+ D+ ++ PFF + Sbjct: 401 NTQLTKLPDMDELKEVVFSMNPNSAAGPDGMNGYFFKKCLNIIKNDLVEVLHPFFSGQMI 460 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 SH PK +KL + + F + I K + +RL +L LIS NQ F Sbjct: 461 PKYFSHSCIVLLPKVNNTNKLTEFRPISLSNFTSKIISKLVSNRLSPILLSLISTNQSGF 520 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN++ QEII I+K Sbjct: 521 VKGRSISENIMHAQEIIHQIKK 542 >ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250876, partial [Solanum lycopersicum] Length = 445 Score = 100 bits (248), Expect = 4e-19 Identities = 70/202 (34%), Positives = 107/202 (52%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 +R ++ I K++D G I E +A++ +Y E F K N+ + L I M+ Q+ Sbjct: 79 KRNRMAIHKLKDDRGNWIIGEEDIAKKACEYYEEIFTGK-NETIKEDILQCITPMITQEQ 137 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N+ L ++ M+E++ I +N SA GP+ F G FY +DII+ D+ + V F+ Sbjct: 138 NDGLDRLPDMDELRRIIMSMNPHSAPGPDGFGGKFYQVCFDIIKKDLLDAVNHFYIGNSM 197 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H PK + KL + + F + I K L RL ++LP +IS NQ F Sbjct: 198 PRYMTHACLILLPKIDHPCKLKDFRPISLSNFVNKIISKILSTRLASILPGVISENQPGF 257 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN+LL QEII I+K Sbjct: 258 VKGRSIAENILLAQEIIHGIKK 279 >ref|XP_004253220.1| PREDICTED: uncharacterized protein LOC101264807 [Solanum lycopersicum] Length = 934 Score = 100 bits (248), Expect = 4e-19 Identities = 67/197 (34%), Positives = 104/197 (52%), Gaps = 1/197 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 RRK++ I K++ +G I+ E + + ++Y + F K N DS L I ++ ++ Sbjct: 93 RRKRMCITKLESENGEWIQGEENIVKTACDYYKQIFTGKNEVINEDS-LQCISKIIIEEQ 151 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N +L +M +M+E+K I +N SA GP+ G F+ +DII+ D+ V FF Sbjct: 152 NSKLEQMPNMDELKNVIMNMNPNSAPGPDGIGGKFFQVCFDIIKDDLLAAVQHFFNGFDM 211 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H PK E +KL + + F + I K + RL +LP +IS NQ F Sbjct: 212 PKYMTHACLVLIPKVEYPNKLKDFRPISLSNFTNKIISKIMSTRLAPILPTIISKNQSGF 271 Query: 539 IKGRSIIENVLLTQEII 589 +KGRSI EN++L QEII Sbjct: 272 VKGRSISENIMLAQEII 288 >ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum lycopersicum] Length = 1531 Score = 100 bits (248), Expect = 4e-19 Identities = 67/197 (34%), Positives = 103/197 (52%), Gaps = 1/197 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 RR K+ I KI + GV I+ E +A+E ++Y F K ++ + L +I ++ + Sbjct: 658 RRNKMIIYKIMNDSGVWIQGEDNVAKEACDYYQNMFTGK-SEKIKEELLQNIPELITLEQ 716 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N L K+ ++EE+K I +N SA GP+ G FY +DII+ DM V FF + Sbjct: 717 NSDLDKLPTVEELKNTIMSMNPNSAPGPDGIGGKFYQECFDIIQEDMLAAVNSFFSGNIM 776 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H K ++L ++ + F + I K L RL ++LP +IS NQ F Sbjct: 777 PRYMTHACLVLLLKINHPNQLKDYRLMSLSNFTNKIISKILSTRLASILPNIISTNQYGF 836 Query: 539 IKGRSIIENVLLTQEII 589 +KGR I EN+LL QE+I Sbjct: 837 VKGRRISENILLAQEVI 853 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 98.2 bits (243), Expect = 2e-18 Identities = 68/217 (31%), Positives = 107/217 (49%), Gaps = 15/217 (6%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQ 175 +R + I +IQD +G ++E+ + V F+ ++ D + + S I+ D Sbjct: 1212 KRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNLLKAEQCDISRFDPSITPRIISTTD- 1270 Query: 176 DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKA- 352 NE LC S++EVK A+F +N S +GP+ FS LFY WDII+ D++ VL FFK Sbjct: 1271 --NEFLCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFKGS 1328 Query: 353 ------------ILSLSQHSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLE 496 +L +Q+ S P + + L ++ + K L +RL Sbjct: 1329 PLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVLNKI------------VTKLLANRLS 1376 Query: 497 NVLPRLISPNQD*FIKGRSIIENVLLTQEIITDIRKR 607 +LP +IS NQ F+ GR I +N+LL QE++ I R Sbjct: 1377 KILPSIISENQSGFVNGRLISDNILLAQELVDKINAR 1413 >ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum] Length = 1946 Score = 96.3 bits (238), Expect = 6e-18 Identities = 63/186 (33%), Positives = 99/186 (53%), Gaps = 3/186 (1%) Frame = +2 Query: 56 EDEGQ--MAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCKMSSMEEVKVA 229 E++G +A+ ++ E F +N N ++ L I MV ++ N+ L + + EE K Sbjct: 157 EEQGDENIAKAACVYFQETFTGHEN-RNAENILQCITRMVTEEQNQNLKALPTKEESKQV 215 Query: 230 IFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH*PSAAAPK-E 406 ++ +N SA GP+ F G FY + WDII+ ++ VL +F + SH PK E Sbjct: 216 VYSMNPNSAPGPDGFGGKFYQACWDIIQDELLEAVLAYFSGHIMPKFMSHSCLVVLPKVE 275 Query: 407 IGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSIIENVLLTQEI 586 ++ + F + I K L RL +LP LIS NQ F++GRSI +N++L QEI Sbjct: 276 HPNRFNEFRPISLTNFTSKIISKILCLRLAPILPHLISENQSGFVRGRSITDNIMLAQEI 335 Query: 587 ITDIRK 604 I +I+K Sbjct: 336 IHNIKK 341 >ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum lycopersicum] Length = 1333 Score = 95.9 bits (237), Expect = 8e-18 Identities = 65/202 (32%), Positives = 104/202 (51%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 +R ++ I K+ D G I+ E ++A+ ++Y + F N + L I M+ Q+ Sbjct: 314 KRNRMSIHKLMDESGNWIKGEEEIAKHACDYYEKIFTGM-NGKIKEDILQCINPMITQEQ 372 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N+ L ++ M+E++ I +N SA GP+ F G FY +DII+ D+ V F+ + Sbjct: 373 NKDLDRIPDMDELRRTIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHFYVGNIM 432 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H PK + +L + + F + I K L RL +LP ++S NQ F Sbjct: 433 PRYLTHACLTLIPKIDHPCRLKDFRPISLSNFTNKIISKILSTRLALILPSIVSANQSGF 492 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN+LL QEI I+K Sbjct: 493 VKGRSIAENILLAQEIFHGIKK 514 >ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum] Length = 1010 Score = 95.5 bits (236), Expect = 1e-17 Identities = 64/190 (33%), Positives = 99/190 (52%), Gaps = 1/190 (0%) Frame = +2 Query: 35 DSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCKMSSME 214 + +G I+ + +A+E ++Y + F+ N+ L I MV D N+ L K+ ME Sbjct: 2 NDNGEWIQGDDNIAKEACDYYKDMFSGSSL-RVNEEILQCIPNMVTADQNDVLDKLPDME 60 Query: 215 EVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH*PSAA 394 E++ + +N SA GP+ G FY +DII+ D+ V FF + +H Sbjct: 61 ELRKVVMSMNPNSAPGPDGIGGKFYQFCFDIIKDDLLAAVQDFFNGEIMPRYMTHACLVL 120 Query: 395 APK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSIIENVL 571 PK E +K + + F + I K + RL +++P+LIS NQ F+KGRSI EN+L Sbjct: 121 LPKIEHPNKHKDFRPISLSNFSNKIISKVMSMRLASIIPKLISDNQSGFVKGRSISENIL 180 Query: 572 LTQEIITDIR 601 L QEII I+ Sbjct: 181 LAQEIIHGIK 190 >ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum lycopersicum] Length = 1454 Score = 95.5 bits (236), Expect = 1e-17 Identities = 66/202 (32%), Positives = 105/202 (51%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 +R ++ I K+ D +G I+ E ++A+ ++Y + F K ++ L I MV Q Sbjct: 436 KRNRIAIHKLMDDNGNWIQGEDKIAKLACDYYEQNFTGKAEKIKEEN-LHCINKMVTQAQ 494 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N+ L ++ +E++ I +N SA GP+ F G FY + +DII+ D+ V F+ Sbjct: 495 NDDLDRLPDEDELRRIIMSMNPNSAPGPDGFGGKFYQTCFDIIKKDLLAAVNYFYIGNSM 554 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H PK E KL + + F + I K + RL ++LP ++S NQ F Sbjct: 555 PKYMTHACLILLPKVEHPCKLKEFRPISLSNFSNKIISKIMSTRLASILPCVVSENQSGF 614 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN+LL EII I+K Sbjct: 615 VKGRSISENILLAHEIIHGIKK 636 >gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 94.0 bits (232), Expect = 3e-17 Identities = 68/220 (30%), Positives = 109/220 (49%), Gaps = 17/220 (7%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQ 175 +R + I +IQDS+G + +D + + +F+ + + D + + S + I+ D Sbjct: 1125 KRVRSHIFQIQDSEGNVFDDIHSIQKSATDFFRDLMQAENCDLSRFDPSLIPRIISSAD- 1183 Query: 176 DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAI 355 NE LC ++E+K A+F +N S +GP+ FS LFY WDII+ D+ + VL FF+ Sbjct: 1184 --NEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFR-- 1239 Query: 356 LSLSQHSH*PSAAAPKEIGS-KLFRLKVHQSA*FH*QY------------IVKGLHDRLE 496 + P+ + S L L +A +Y + K L +RL Sbjct: 1240 ----------GSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLS 1289 Query: 497 NVLPRLISPNQD*FIKGRSIIENVLLTQEII--TDIRKRG 610 +LP +IS NQ F+ GR I +N+LL QE+I D + RG Sbjct: 1290 KILPSIISENQSGFVNGRLISDNILLAQELIGKIDAKSRG 1329 >gb|AAD29058.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1229 Score = 93.6 bits (231), Expect = 4e-17 Identities = 61/201 (30%), Positives = 109/201 (54%), Gaps = 1/201 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 RR + + ++D +GV +E Q+++ +++ Y +Q ++D + I MV Q Sbjct: 257 RRTQNRLTVMEDINGVAQHEEHQISQ-IISGYFQQIFTSESDGDFSVVDEAIEPMVSQGD 315 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N+ L ++ + EEVK A+F +N++ A GP+ F+ FYHSYW II D+ + FF + Sbjct: 316 NDFLTRIPNDEEVKDAVFSINASKAPGPDGFTAGFYHSYWHIISTDVGREIRLFFTSKNF 375 Query: 362 LSQHSH*PSAAAPKEIG-SKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 + + PK++G K+ + + + K + R++ +LP+LIS NQ F Sbjct: 376 PRRMNETHIRLIPKDLGPRKVADYRPIALCNIFYKIVAKIMTKRMQLILPKLISENQSAF 435 Query: 539 IKGRSIIENVLLTQEIITDIR 601 + GR I +NVL+T E++ +R Sbjct: 436 VPGRVISDNVLITHEVLHFLR 456 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 92.8 bits (229), Expect = 7e-17 Identities = 62/202 (30%), Positives = 100/202 (49%), Gaps = 5/202 (2%) Frame = +2 Query: 20 IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQDTNEQL 193 I +IQDS+G + ED + V ++ ++ D + + S + + + D NE L Sbjct: 957 IFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRFDPSLIPRTISITD---NEFL 1013 Query: 194 CKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQH 373 C S++E+K +F ++ S +GP+ FS LFY WDII+ D+ VL FF Sbjct: 1014 CAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQGV 1073 Query: 374 SH*PSAAAPKEIGS-KLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGR 550 + PK+ S + + + + K L +RL +LP +IS NQ F+ GR Sbjct: 1074 TSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNGR 1133 Query: 551 SIIENVLLTQEII--TDIRKRG 610 I +N+LL QE++ D + RG Sbjct: 1134 LISDNILLAQELVGKLDAKARG 1155 >ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum lycopersicum] Length = 1246 Score = 92.8 bits (229), Expect = 7e-17 Identities = 64/202 (31%), Positives = 102/202 (50%), Gaps = 1/202 (0%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181 +R ++ I K+ D G I E +A++ ++Y F KN+ + L I ++ Q+ Sbjct: 270 KRNRMAIHKLMDDSGNWITGEENIAKQACDYYEGIFT-AKNEKIKEDILQCIKPIITQER 328 Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361 N+ L ++ M+E++ I +N SA GP+ F G FY +DII+ D+ V F+ Sbjct: 329 NDSLDRLPDMDELRGVIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKYFYIGNSM 388 Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538 +H PK + +L + + F + I K + R +LP +I NQ F Sbjct: 389 PRYLTHASLILLPKTDHPCRLKDFRPISLSNFANKIISKIISTRFGLILPGIIFENQSGF 448 Query: 539 IKGRSIIENVLLTQEIITDIRK 604 +KGRSI EN+LL QEII I+K Sbjct: 449 VKGRSIAENILLAQEIINGIKK 470 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 92.4 bits (228), Expect = 9e-17 Identities = 66/193 (34%), Positives = 96/193 (49%), Gaps = 3/193 (1%) Frame = +2 Query: 20 IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILV--MVDQDTNEQL 193 I K+QD +G IED+ Q+ + ++ K + DS L+ ++ NE L Sbjct: 1252 IFKVQDPEGRWIEDQEQLKHSAIEYFSSLL---KVEPCYDSRFQSSLIPSIISNSENELL 1308 Query: 194 CKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFK-AILSLSQ 370 C S++EVK A+F +NS SA+GP+ FS FY W+II D+ + V FF A + Sbjct: 1309 CAEPSLQEVKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGV 1368 Query: 371 HSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGR 550 S K SK + + I K L +RL VLP +I+ NQ F+ GR Sbjct: 1369 TSTTLILLPKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGR 1428 Query: 551 SIIENVLLTQEII 589 I +N+LL QE+I Sbjct: 1429 LISDNILLAQELI 1441 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 92.0 bits (227), Expect = 1e-16 Identities = 68/201 (33%), Positives = 96/201 (47%), Gaps = 4/201 (1%) Frame = +2 Query: 20 IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT-NEQLC 196 I KIQDS+G L+E+ G + V F+ K +++ F A + + D N LC Sbjct: 338 IFKIQDSEGTLMEEPGLIESSAVEFFENLL--KAENYDLSRFKAEFIPQMLSDADNNLLC 395 Query: 197 KMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFK-AILSLSQH 373 ++EVK A+F ++ S GP+ FS FY W II D+ V FFK A+ Sbjct: 396 AEPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFKGAVFPRGVT 455 Query: 374 SH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRS 553 S A K + + + + K L +RL VLP LIS NQ F+ GR Sbjct: 456 STTLVLLAKKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISENQSGFVSGRL 515 Query: 554 IIENVLLTQEII--TDIRKRG 610 I +N+LL QE+I D + RG Sbjct: 516 INDNILLAQELIGKIDYKARG 536 >gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao] Length = 1707 Score = 92.0 bits (227), Expect = 1e-16 Identities = 66/220 (30%), Positives = 105/220 (47%), Gaps = 17/220 (7%) Frame = +2 Query: 2 RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFY--LEQFNHKKNDHNNDSFLAHILVMVDQ 175 +R + + +IQDS+G + +D + + +F+ L Q + N + S + I+ D Sbjct: 1082 KRVRSHVFQIQDSEGNVFDDTHSIQKSATDFFRNLMQAENCDNSRFDPSLIPRIISSAD- 1140 Query: 176 DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAI 355 NE LC S++EVK +F +N S +G + FS LFY WDII+ D+ + VL FF+ Sbjct: 1141 --NEFLCAAPSLQEVKETVFNINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVLDFFR-- 1196 Query: 356 LSLSQHSH*PSAAAPKEIGSKLFRLKVHQSA*FH-------------*QYIVKGLHDRLE 496 + P+ + S L + H + + K L +RL Sbjct: 1197 ----------GSPLPRGVTSTTLVLLPKKPNACHWSDYSPISLCTVLNKIVTKLLANRLS 1246 Query: 497 NVLPRLISPNQD*FIKGRSIIENVLLTQEII--TDIRKRG 610 +LP +IS NQ F+ GR I +N+LL E+I D + RG Sbjct: 1247 KILPLIISENQSGFVNGRLISDNILLAHELIGKIDAKSRG 1286 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 91.3 bits (225), Expect = 2e-16 Identities = 66/201 (32%), Positives = 98/201 (48%), Gaps = 4/201 (1%) Frame = +2 Query: 20 IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT-NEQLC 196 I KIQ+ DG IED Q+ + ++F+ + D F + + + DT N LC Sbjct: 1217 IFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTR--FQSSLCPSIISDTDNGFLC 1274 Query: 197 KMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHS 376 +++EVK A+F ++ SA+GP+ FS FY WDII D++ V FF + Sbjct: 1275 AEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMT 1334 Query: 377 H*PSAAAPKEI-GSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRS 553 PK SK + + I K L +RL +LP +I+ NQ F+ GR Sbjct: 1335 STTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRL 1394 Query: 554 IIENVLLTQEII--TDIRKRG 610 I +N+LL QE+I D + RG Sbjct: 1395 ISDNILLAQELIGKLDQKNRG 1415 >ref|XP_004231462.1| PREDICTED: uncharacterized protein LOC101258709 [Solanum lycopersicum] Length = 845 Score = 90.9 bits (224), Expect = 2e-16 Identities = 62/191 (32%), Positives = 97/191 (50%), Gaps = 1/191 (0%) Frame = +2 Query: 20 IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCK 199 I K++ +G I+ E + + ++Y + F K N DS ++ D+ N +L + Sbjct: 3 ITKLESENGEWIQGEENIVKTACDYYKQIFTGKNEAINEDSLQCISRIITDEQ-NIKLEQ 61 Query: 200 MSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH 379 M +++E+K I +N SA GP+ G F+ +DII+ D+ V FF +H Sbjct: 62 MPNIDELKNVIMNMNPNSAPGPDGIGGKFFQVCFDIIKDDLLAAVQHFFNGFDMPKYMTH 121 Query: 380 *PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSI 556 PK E +KL + + F + I K + RL +LP +IS NQ F+KGRSI Sbjct: 122 ACLVLIPKVEHPNKLKDFRPISLSNFTNKIISKIMSTRLAPILPSIISKNQSGFVKGRSI 181 Query: 557 IENVLLTQEII 589 EN++L QEII Sbjct: 182 SENIMLAQEII 192 >emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana] gi|7267666|emb|CAB78094.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana] Length = 1274 Score = 90.1 bits (222), Expect = 4e-16 Identities = 63/203 (31%), Positives = 106/203 (52%), Gaps = 9/203 (4%) Frame = +2 Query: 29 IQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAH--ILVMVDQDTNEQLCKM 202 I+D G +E Q+A + +++ F +N+D + + ++ NE+L K+ Sbjct: 312 IEDGSGQEFHEEEQIASTISSYFQNIFT---TSNNSDLQVVQEALSPIISSHCNEELIKI 368 Query: 203 SSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFF-----KAILSLS 367 SS+ E+K A+F +++ A GP+ FS F+H+YWDII AD+ + FF L+ + Sbjct: 369 SSLLEIKEALFSISADKAPGPDGFSASFFHAYWDIIEADVSRDIRSFFVDSCLSPRLNET 428 Query: 368 QHSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QY--IVKGLHDRLENVLPRLISPNQD*FI 541 + P +AP+++ A + QY + K L RL+ L LIS +Q F+ Sbjct: 429 HVTLIPKISAPRKVSD------YRPIALCNVQYKIVAKILTRRLQPWLSELISLHQSAFV 482 Query: 542 KGRSIIENVLLTQEIITDIRKRG 610 GR+I +NVL+T EI+ +R G Sbjct: 483 PGRAIADNVLITHEILHFLRVSG 505