BLASTX nr result

ID: Atropa21_contig00037945

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00037945
         (611 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004239567.1| PREDICTED: uncharacterized protein LOC101262...   115   1e-23
ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247...   108   1e-21
ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250...   100   4e-19
ref|XP_004253220.1| PREDICTED: uncharacterized protein LOC101264...   100   4e-19
ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260...   100   4e-19
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    98   2e-18
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    96   6e-18
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...    96   8e-18
ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein A...    96   1e-17
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...    96   1e-17
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]    94   3e-17
gb|AAD29058.1| putative non-LTR retroelement reverse transcripta...    94   4e-17
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    93   7e-17
ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261...    93   7e-17
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    92   9e-17
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...    92   1e-16
gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]    92   1e-16
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]    91   2e-16
ref|XP_004231462.1| PREDICTED: uncharacterized protein LOC101258...    91   2e-16
emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabid...    90   4e-16

>ref|XP_004239567.1| PREDICTED: uncharacterized protein LOC101262916 [Solanum
           lycopersicum]
          Length = 895

 Score =  115 bits (287), Expect = 1e-23
 Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           RR+KL I KI   +G  I+ E  +A+E    +   F  + N + ND  L  I  MV+ D 
Sbjct: 280 RRRKLFIHKIATENGDWIQGENNIAQEACEHFHTIFTGE-NRYINDHNLECIPRMVNVDQ 338

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N QL K+  M+E+K  +F +N  S +GP+  +G F+   W+II++D+  +   FF   + 
Sbjct: 339 NTQLTKLPDMDEIKEVVFAMNPNSTAGPDGMNGYFFQKCWNIIKSDLIEVQHAFFSGQMI 398

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               SH      PK    +KL   ++   + F  + I K + +RL  +LP LIS NQ  F
Sbjct: 399 PKYFSHSCIVLLPKVNNPNKLTEFRLISLSNFTSKIISKLVSNRLSPILPSLISTNQFGF 458

Query: 539 IKGRSIIENVLLTQEIITDIRK 604
           +KGRSI EN++L QEII  I+K
Sbjct: 459 VKGRSISENIMLAQEIIHQIKK 480


>ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247390 [Solanum
           lycopersicum]
          Length = 612

 Score =  108 bits (270), Expect = 1e-21
 Identities = 72/202 (35%), Positives = 109/202 (53%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           RR+KL I KI   +G  I+ E  +A+   + +   F   +N H N+  L  I  MV++D 
Sbjct: 342 RRRKLFIHKIITENGDWIQGENNIAQNACDHFNAIFT-SENKHINEQNLECIPRMVNKDQ 400

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N QL K+  M+E+K  +F +N  SA+GP+  +G F+    +II+ D+  ++ PFF   + 
Sbjct: 401 NTQLTKLPDMDELKEVVFSMNPNSAAGPDGMNGYFFKKCLNIIKNDLVEVLHPFFSGQMI 460

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               SH      PK    +KL   +    + F  + I K + +RL  +L  LIS NQ  F
Sbjct: 461 PKYFSHSCIVLLPKVNNTNKLTEFRPISLSNFTSKIISKLVSNRLSPILLSLISTNQSGF 520

Query: 539 IKGRSIIENVLLTQEIITDIRK 604
           +KGRSI EN++  QEII  I+K
Sbjct: 521 VKGRSISENIMHAQEIIHQIKK 542


>ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250876, partial [Solanum
           lycopersicum]
          Length = 445

 Score =  100 bits (248), Expect = 4e-19
 Identities = 70/202 (34%), Positives = 107/202 (52%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           +R ++ I K++D  G  I  E  +A++   +Y E F  K N+   +  L  I  M+ Q+ 
Sbjct: 79  KRNRMAIHKLKDDRGNWIIGEEDIAKKACEYYEEIFTGK-NETIKEDILQCITPMITQEQ 137

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N+ L ++  M+E++  I  +N  SA GP+ F G FY   +DII+ D+ + V  F+     
Sbjct: 138 NDGLDRLPDMDELRRIIMSMNPHSAPGPDGFGGKFYQVCFDIIKKDLLDAVNHFYIGNSM 197

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               +H      PK +   KL   +    + F  + I K L  RL ++LP +IS NQ  F
Sbjct: 198 PRYMTHACLILLPKIDHPCKLKDFRPISLSNFVNKIISKILSTRLASILPGVISENQPGF 257

Query: 539 IKGRSIIENVLLTQEIITDIRK 604
           +KGRSI EN+LL QEII  I+K
Sbjct: 258 VKGRSIAENILLAQEIIHGIKK 279


>ref|XP_004253220.1| PREDICTED: uncharacterized protein LOC101264807 [Solanum
           lycopersicum]
          Length = 934

 Score =  100 bits (248), Expect = 4e-19
 Identities = 67/197 (34%), Positives = 104/197 (52%), Gaps = 1/197 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           RRK++ I K++  +G  I+ E  + +   ++Y + F  K    N DS L  I  ++ ++ 
Sbjct: 93  RRKRMCITKLESENGEWIQGEENIVKTACDYYKQIFTGKNEVINEDS-LQCISKIIIEEQ 151

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N +L +M +M+E+K  I  +N  SA GP+   G F+   +DII+ D+   V  FF     
Sbjct: 152 NSKLEQMPNMDELKNVIMNMNPNSAPGPDGIGGKFFQVCFDIIKDDLLAAVQHFFNGFDM 211

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               +H      PK E  +KL   +    + F  + I K +  RL  +LP +IS NQ  F
Sbjct: 212 PKYMTHACLVLIPKVEYPNKLKDFRPISLSNFTNKIISKIMSTRLAPILPTIISKNQSGF 271

Query: 539 IKGRSIIENVLLTQEII 589
           +KGRSI EN++L QEII
Sbjct: 272 VKGRSISENIMLAQEII 288


>ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum
            lycopersicum]
          Length = 1531

 Score =  100 bits (248), Expect = 4e-19
 Identities = 67/197 (34%), Positives = 103/197 (52%), Gaps = 1/197 (0%)
 Frame = +2

Query: 2    RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
            RR K+ I KI +  GV I+ E  +A+E  ++Y   F  K ++   +  L +I  ++  + 
Sbjct: 658  RRNKMIIYKIMNDSGVWIQGEDNVAKEACDYYQNMFTGK-SEKIKEELLQNIPELITLEQ 716

Query: 182  NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
            N  L K+ ++EE+K  I  +N  SA GP+   G FY   +DII+ DM   V  FF   + 
Sbjct: 717  NSDLDKLPTVEELKNTIMSMNPNSAPGPDGIGGKFYQECFDIIQEDMLAAVNSFFSGNIM 776

Query: 362  LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
                +H       K    ++L   ++   + F  + I K L  RL ++LP +IS NQ  F
Sbjct: 777  PRYMTHACLVLLLKINHPNQLKDYRLMSLSNFTNKIISKILSTRLASILPNIISTNQYGF 836

Query: 539  IKGRSIIENVLLTQEII 589
            +KGR I EN+LL QE+I
Sbjct: 837  VKGRRISENILLAQEVI 853


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 68/217 (31%), Positives = 107/217 (49%), Gaps = 15/217 (6%)
 Frame = +2

Query: 2    RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQ 175
            +R +  I +IQD +G ++E+   +    V F+      ++ D +  + S    I+   D 
Sbjct: 1212 KRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNLLKAEQCDISRFDPSITPRIISTTD- 1270

Query: 176  DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKA- 352
              NE LC   S++EVK A+F +N  S +GP+ FS LFY   WDII+ D++  VL FFK  
Sbjct: 1271 --NEFLCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFKGS 1328

Query: 353  ------------ILSLSQHSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLE 496
                        +L  +Q+    S   P  + + L ++            + K L +RL 
Sbjct: 1329 PLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVLNKI------------VTKLLANRLS 1376

Query: 497  NVLPRLISPNQD*FIKGRSIIENVLLTQEIITDIRKR 607
             +LP +IS NQ  F+ GR I +N+LL QE++  I  R
Sbjct: 1377 KILPSIISENQSGFVNGRLISDNILLAQELVDKINAR 1413


>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score = 96.3 bits (238), Expect = 6e-18
 Identities = 63/186 (33%), Positives = 99/186 (53%), Gaps = 3/186 (1%)
 Frame = +2

Query: 56  EDEGQ--MAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCKMSSMEEVKVA 229
           E++G   +A+    ++ E F   +N  N ++ L  I  MV ++ N+ L  + + EE K  
Sbjct: 157 EEQGDENIAKAACVYFQETFTGHEN-RNAENILQCITRMVTEEQNQNLKALPTKEESKQV 215

Query: 230 IFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH*PSAAAPK-E 406
           ++ +N  SA GP+ F G FY + WDII+ ++   VL +F   +     SH      PK E
Sbjct: 216 VYSMNPNSAPGPDGFGGKFYQACWDIIQDELLEAVLAYFSGHIMPKFMSHSCLVVLPKVE 275

Query: 407 IGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSIIENVLLTQEI 586
             ++    +      F  + I K L  RL  +LP LIS NQ  F++GRSI +N++L QEI
Sbjct: 276 HPNRFNEFRPISLTNFTSKIISKILCLRLAPILPHLISENQSGFVRGRSITDNIMLAQEI 335

Query: 587 ITDIRK 604
           I +I+K
Sbjct: 336 IHNIKK 341


>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum
           lycopersicum]
          Length = 1333

 Score = 95.9 bits (237), Expect = 8e-18
 Identities = 65/202 (32%), Positives = 104/202 (51%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           +R ++ I K+ D  G  I+ E ++A+   ++Y + F    N    +  L  I  M+ Q+ 
Sbjct: 314 KRNRMSIHKLMDESGNWIKGEEEIAKHACDYYEKIFTGM-NGKIKEDILQCINPMITQEQ 372

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N+ L ++  M+E++  I  +N  SA GP+ F G FY   +DII+ D+   V  F+   + 
Sbjct: 373 NKDLDRIPDMDELRRTIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHFYVGNIM 432

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               +H      PK +   +L   +    + F  + I K L  RL  +LP ++S NQ  F
Sbjct: 433 PRYLTHACLTLIPKIDHPCRLKDFRPISLSNFTNKIISKILSTRLALILPSIVSANQSGF 492

Query: 539 IKGRSIIENVLLTQEIITDIRK 604
           +KGRSI EN+LL QEI   I+K
Sbjct: 493 VKGRSIAENILLAQEIFHGIKK 514


>ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 1010

 Score = 95.5 bits (236), Expect = 1e-17
 Identities = 64/190 (33%), Positives = 99/190 (52%), Gaps = 1/190 (0%)
 Frame = +2

Query: 35  DSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCKMSSME 214
           + +G  I+ +  +A+E  ++Y + F+       N+  L  I  MV  D N+ L K+  ME
Sbjct: 2   NDNGEWIQGDDNIAKEACDYYKDMFSGSSL-RVNEEILQCIPNMVTADQNDVLDKLPDME 60

Query: 215 EVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH*PSAA 394
           E++  +  +N  SA GP+   G FY   +DII+ D+   V  FF   +     +H     
Sbjct: 61  ELRKVVMSMNPNSAPGPDGIGGKFYQFCFDIIKDDLLAAVQDFFNGEIMPRYMTHACLVL 120

Query: 395 APK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSIIENVL 571
            PK E  +K    +    + F  + I K +  RL +++P+LIS NQ  F+KGRSI EN+L
Sbjct: 121 LPKIEHPNKHKDFRPISLSNFSNKIISKVMSMRLASIIPKLISDNQSGFVKGRSISENIL 180

Query: 572 LTQEIITDIR 601
           L QEII  I+
Sbjct: 181 LAQEIIHGIK 190


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score = 95.5 bits (236), Expect = 1e-17
 Identities = 66/202 (32%), Positives = 105/202 (51%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2    RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
            +R ++ I K+ D +G  I+ E ++A+   ++Y + F  K      ++ L  I  MV Q  
Sbjct: 436  KRNRIAIHKLMDDNGNWIQGEDKIAKLACDYYEQNFTGKAEKIKEEN-LHCINKMVTQAQ 494

Query: 182  NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
            N+ L ++   +E++  I  +N  SA GP+ F G FY + +DII+ D+   V  F+     
Sbjct: 495  NDDLDRLPDEDELRRIIMSMNPNSAPGPDGFGGKFYQTCFDIIKKDLLAAVNYFYIGNSM 554

Query: 362  LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
                +H      PK E   KL   +    + F  + I K +  RL ++LP ++S NQ  F
Sbjct: 555  PKYMTHACLILLPKVEHPCKLKEFRPISLSNFSNKIISKIMSTRLASILPCVVSENQSGF 614

Query: 539  IKGRSIIENVLLTQEIITDIRK 604
            +KGRSI EN+LL  EII  I+K
Sbjct: 615  VKGRSISENILLAHEIIHGIKK 636


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 94.0 bits (232), Expect = 3e-17
 Identities = 68/220 (30%), Positives = 109/220 (49%), Gaps = 17/220 (7%)
 Frame = +2

Query: 2    RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQ 175
            +R +  I +IQDS+G + +D   + +   +F+ +    +  D +  + S +  I+   D 
Sbjct: 1125 KRVRSHIFQIQDSEGNVFDDIHSIQKSATDFFRDLMQAENCDLSRFDPSLIPRIISSAD- 1183

Query: 176  DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAI 355
              NE LC    ++E+K A+F +N  S +GP+ FS LFY   WDII+ D+ + VL FF+  
Sbjct: 1184 --NEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFR-- 1239

Query: 356  LSLSQHSH*PSAAAPKEIGS-KLFRLKVHQSA*FH*QY------------IVKGLHDRLE 496
                       +  P+ + S  L  L    +A    +Y            + K L +RL 
Sbjct: 1240 ----------GSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLS 1289

Query: 497  NVLPRLISPNQD*FIKGRSIIENVLLTQEII--TDIRKRG 610
             +LP +IS NQ  F+ GR I +N+LL QE+I   D + RG
Sbjct: 1290 KILPSIISENQSGFVNGRLISDNILLAQELIGKIDAKSRG 1329


>gb|AAD29058.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 1229

 Score = 93.6 bits (231), Expect = 4e-17
 Identities = 61/201 (30%), Positives = 109/201 (54%), Gaps = 1/201 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           RR +  +  ++D +GV   +E Q+++ +++ Y +Q    ++D +       I  MV Q  
Sbjct: 257 RRTQNRLTVMEDINGVAQHEEHQISQ-IISGYFQQIFTSESDGDFSVVDEAIEPMVSQGD 315

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N+ L ++ + EEVK A+F +N++ A GP+ F+  FYHSYW II  D+   +  FF +   
Sbjct: 316 NDFLTRIPNDEEVKDAVFSINASKAPGPDGFTAGFYHSYWHIISTDVGREIRLFFTSKNF 375

Query: 362 LSQHSH*PSAAAPKEIG-SKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
             + +       PK++G  K+   +         + + K +  R++ +LP+LIS NQ  F
Sbjct: 376 PRRMNETHIRLIPKDLGPRKVADYRPIALCNIFYKIVAKIMTKRMQLILPKLISENQSAF 435

Query: 539 IKGRSIIENVLLTQEIITDIR 601
           + GR I +NVL+T E++  +R
Sbjct: 436 VPGRVISDNVLITHEVLHFLR 456


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 92.8 bits (229), Expect = 7e-17
 Identities = 62/202 (30%), Positives = 100/202 (49%), Gaps = 5/202 (2%)
 Frame = +2

Query: 20   IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHN--NDSFLAHILVMVDQDTNEQL 193
            I +IQDS+G + ED   +    V ++      ++ D +  + S +   + + D   NE L
Sbjct: 957  IFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRFDPSLIPRTISITD---NEFL 1013

Query: 194  CKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQH 373
            C   S++E+K  +F ++  S +GP+ FS LFY   WDII+ D+   VL FF         
Sbjct: 1014 CAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQGV 1073

Query: 374  SH*PSAAAPKEIGS-KLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGR 550
            +       PK+  S +    +         + + K L +RL  +LP +IS NQ  F+ GR
Sbjct: 1074 TSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNGR 1133

Query: 551  SIIENVLLTQEII--TDIRKRG 610
             I +N+LL QE++   D + RG
Sbjct: 1134 LISDNILLAQELVGKLDAKARG 1155


>ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum
           lycopersicum]
          Length = 1246

 Score = 92.8 bits (229), Expect = 7e-17
 Identities = 64/202 (31%), Positives = 102/202 (50%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT 181
           +R ++ I K+ D  G  I  E  +A++  ++Y   F   KN+   +  L  I  ++ Q+ 
Sbjct: 270 KRNRMAIHKLMDDSGNWITGEENIAKQACDYYEGIFT-AKNEKIKEDILQCIKPIITQER 328

Query: 182 NEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILS 361
           N+ L ++  M+E++  I  +N  SA GP+ F G FY   +DII+ D+   V  F+     
Sbjct: 329 NDSLDRLPDMDELRGVIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKYFYIGNSM 388

Query: 362 LSQHSH*PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*F 538
               +H      PK +   +L   +    + F  + I K +  R   +LP +I  NQ  F
Sbjct: 389 PRYLTHASLILLPKTDHPCRLKDFRPISLSNFANKIISKIISTRFGLILPGIIFENQSGF 448

Query: 539 IKGRSIIENVLLTQEIITDIRK 604
           +KGRSI EN+LL QEII  I+K
Sbjct: 449 VKGRSIAENILLAQEIINGIKK 470


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 92.4 bits (228), Expect = 9e-17
 Identities = 66/193 (34%), Positives = 96/193 (49%), Gaps = 3/193 (1%)
 Frame = +2

Query: 20   IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILV--MVDQDTNEQL 193
            I K+QD +G  IED+ Q+    + ++       K +   DS     L+  ++    NE L
Sbjct: 1252 IFKVQDPEGRWIEDQEQLKHSAIEYFSSLL---KVEPCYDSRFQSSLIPSIISNSENELL 1308

Query: 194  CKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFK-AILSLSQ 370
            C   S++EVK A+F +NS SA+GP+ FS  FY   W+II  D+ + V  FF  A +    
Sbjct: 1309 CAEPSLQEVKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGV 1368

Query: 371  HSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGR 550
             S        K   SK    +         + I K L +RL  VLP +I+ NQ  F+ GR
Sbjct: 1369 TSTTLILLPKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGR 1428

Query: 551  SIIENVLLTQEII 589
             I +N+LL QE+I
Sbjct: 1429 LISDNILLAQELI 1441


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 92.0 bits (227), Expect = 1e-16
 Identities = 68/201 (33%), Positives = 96/201 (47%), Gaps = 4/201 (1%)
 Frame = +2

Query: 20  IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT-NEQLC 196
           I KIQDS+G L+E+ G +    V F+      K  +++   F A  +  +  D  N  LC
Sbjct: 338 IFKIQDSEGTLMEEPGLIESSAVEFFENLL--KAENYDLSRFKAEFIPQMLSDADNNLLC 395

Query: 197 KMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFK-AILSLSQH 373
               ++EVK A+F ++  S  GP+ FS  FY   W II  D+   V  FFK A+      
Sbjct: 396 AEPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFKGAVFPRGVT 455

Query: 374 SH*PSAAAPKEIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRS 553
           S      A K   +     +         + + K L +RL  VLP LIS NQ  F+ GR 
Sbjct: 456 STTLVLLAKKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISENQSGFVSGRL 515

Query: 554 IIENVLLTQEII--TDIRKRG 610
           I +N+LL QE+I   D + RG
Sbjct: 516 INDNILLAQELIGKIDYKARG 536


>gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
          Length = 1707

 Score = 92.0 bits (227), Expect = 1e-16
 Identities = 66/220 (30%), Positives = 105/220 (47%), Gaps = 17/220 (7%)
 Frame = +2

Query: 2    RRKKL*IKKIQDSDGVLIEDEGQMAEEVVNFY--LEQFNHKKNDHNNDSFLAHILVMVDQ 175
            +R +  + +IQDS+G + +D   + +   +F+  L Q  +  N   + S +  I+   D 
Sbjct: 1082 KRVRSHVFQIQDSEGNVFDDTHSIQKSATDFFRNLMQAENCDNSRFDPSLIPRIISSAD- 1140

Query: 176  DTNEQLCKMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAI 355
              NE LC   S++EVK  +F +N  S +G + FS LFY   WDII+ D+ + VL FF+  
Sbjct: 1141 --NEFLCAAPSLQEVKETVFNINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVLDFFR-- 1196

Query: 356  LSLSQHSH*PSAAAPKEIGSKLFRLKVHQSA*FH-------------*QYIVKGLHDRLE 496
                       +  P+ + S    L   +    H              + + K L +RL 
Sbjct: 1197 ----------GSPLPRGVTSTTLVLLPKKPNACHWSDYSPISLCTVLNKIVTKLLANRLS 1246

Query: 497  NVLPRLISPNQD*FIKGRSIIENVLLTQEII--TDIRKRG 610
             +LP +IS NQ  F+ GR I +N+LL  E+I   D + RG
Sbjct: 1247 KILPLIISENQSGFVNGRLISDNILLAHELIGKIDAKSRG 1286


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 91.3 bits (225), Expect = 2e-16
 Identities = 66/201 (32%), Positives = 98/201 (48%), Gaps = 4/201 (1%)
 Frame = +2

Query: 20   IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDT-NEQLC 196
            I KIQ+ DG  IED  Q+ +  ++F+      +  D     F + +   +  DT N  LC
Sbjct: 1217 IFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTR--FQSSLCPSIISDTDNGFLC 1274

Query: 197  KMSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHS 376
               +++EVK A+F ++  SA+GP+ FS  FY   WDII  D++  V  FF         +
Sbjct: 1275 AEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMT 1334

Query: 377  H*PSAAAPKEI-GSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRS 553
                   PK    SK    +         + I K L +RL  +LP +I+ NQ  F+ GR 
Sbjct: 1335 STTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRL 1394

Query: 554  IIENVLLTQEII--TDIRKRG 610
            I +N+LL QE+I   D + RG
Sbjct: 1395 ISDNILLAQELIGKLDQKNRG 1415


>ref|XP_004231462.1| PREDICTED: uncharacterized protein LOC101258709 [Solanum
           lycopersicum]
          Length = 845

 Score = 90.9 bits (224), Expect = 2e-16
 Identities = 62/191 (32%), Positives = 97/191 (50%), Gaps = 1/191 (0%)
 Frame = +2

Query: 20  IKKIQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAHILVMVDQDTNEQLCK 199
           I K++  +G  I+ E  + +   ++Y + F  K    N DS      ++ D+  N +L +
Sbjct: 3   ITKLESENGEWIQGEENIVKTACDYYKQIFTGKNEAINEDSLQCISRIITDEQ-NIKLEQ 61

Query: 200 MSSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFFKAILSLSQHSH 379
           M +++E+K  I  +N  SA GP+   G F+   +DII+ D+   V  FF         +H
Sbjct: 62  MPNIDELKNVIMNMNPNSAPGPDGIGGKFFQVCFDIIKDDLLAAVQHFFNGFDMPKYMTH 121

Query: 380 *PSAAAPK-EIGSKLFRLKVHQSA*FH*QYIVKGLHDRLENVLPRLISPNQD*FIKGRSI 556
                 PK E  +KL   +    + F  + I K +  RL  +LP +IS NQ  F+KGRSI
Sbjct: 122 ACLVLIPKVEHPNKLKDFRPISLSNFTNKIISKIMSTRLAPILPSIISKNQSGFVKGRSI 181

Query: 557 IENVLLTQEII 589
            EN++L QEII
Sbjct: 182 SENIMLAQEII 192


>emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
           gi|7267666|emb|CAB78094.1| RNA-directed DNA
           polymerase-like protein [Arabidopsis thaliana]
          Length = 1274

 Score = 90.1 bits (222), Expect = 4e-16
 Identities = 63/203 (31%), Positives = 106/203 (52%), Gaps = 9/203 (4%)
 Frame = +2

Query: 29  IQDSDGVLIEDEGQMAEEVVNFYLEQFNHKKNDHNNDSFLAH--ILVMVDQDTNEQLCKM 202
           I+D  G    +E Q+A  + +++   F      +N+D  +    +  ++    NE+L K+
Sbjct: 312 IEDGSGQEFHEEEQIASTISSYFQNIFT---TSNNSDLQVVQEALSPIISSHCNEELIKI 368

Query: 203 SSMEEVKVAIFKLNSTSASGPNRFSGLFYHSYWDIIRADMYNMVLPFF-----KAILSLS 367
           SS+ E+K A+F +++  A GP+ FS  F+H+YWDII AD+   +  FF        L+ +
Sbjct: 369 SSLLEIKEALFSISADKAPGPDGFSASFFHAYWDIIEADVSRDIRSFFVDSCLSPRLNET 428

Query: 368 QHSH*PSAAAPKEIGSKLFRLKVHQSA*FH*QY--IVKGLHDRLENVLPRLISPNQD*FI 541
             +  P  +AP+++            A  + QY  + K L  RL+  L  LIS +Q  F+
Sbjct: 429 HVTLIPKISAPRKVSD------YRPIALCNVQYKIVAKILTRRLQPWLSELISLHQSAFV 482

Query: 542 KGRSIIENVLLTQEIITDIRKRG 610
            GR+I +NVL+T EI+  +R  G
Sbjct: 483 PGRAIADNVLITHEILHFLRVSG 505

