BLASTX nr result

ID: Atropa21_contig00034058

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00034058
         (926 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]    91   6e-16
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]      89   2e-15
gb|AAT39954.1| Putative integrase, identical [Solanum demissum]        89   2e-15
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]    86   2e-14
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    85   3e-14
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...    85   4e-14
gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]              85   4e-14
ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599...    84   8e-14
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]    84   8e-14
gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    84   8e-14
gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]     83   1e-13
gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]               83   1e-13
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...    83   2e-13
gb|EOY26288.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    82   2e-13
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...    82   2e-13

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score = 90.9 bits (224), Expect = 6e-16
 Identities = 48/93 (51%), Positives = 57/93 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS+FWR  Q ELGTRV+LSTAFHPQ D Q +  I  L        +        F    
Sbjct: 1332 FTSNFWRTFQDELGTRVDLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLA 1391

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 1392 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 1424


>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 47/93 (50%), Positives = 56/93 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FWR  Q+ELGTRV LST+FHPQ D Q +  I  L        +        F    
Sbjct: 1488 FTSSFWRAFQEELGTRVHLSTSFHPQTDGQSERTIQVLEDMLRACVMDFGGQWEQFLPLA 1547

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 1548 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 1580


>gb|AAT39954.1| Putative integrase, identical [Solanum demissum]
          Length = 1609

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 46/93 (49%), Positives = 56/93 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FWR  Q +LGTRV+LST FHPQ D Q +  I  L    +   +        F    
Sbjct: 1185 FTSSFWRTFQDDLGTRVDLSTTFHPQTDGQSERTIQVLEDMLQACVMDFGGQWDQFLPLA 1244

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNN+YYS+I+MAPF+ +YGRR RS VGW E
Sbjct: 1245 EFAYNNNYYSSIQMAPFEALYGRRCRSPVGWFE 1277


>gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]
          Length = 421

 Score = 85.9 bits (211), Expect = 2e-14
 Identities = 52/139 (37%), Positives = 75/139 (53%), Gaps = 5/139 (3%)
 Frame = +1

Query: 1   LSREGNFISVQNLK*V-----FYGDYLISVISKKKSCLMDHNCM*FTSHFWRYIQKELGT 165
           L++  +F+SV+           Y D ++ +     S + D     FTS FW  +Q+ LGT
Sbjct: 84  LTKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQ-FTSRFWGKLQEALGT 142

Query: 166 RVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWRSFFYNNSYYSNIEMA 345
           +++ STAFHPQ D Q +  I  L        + L V    +     F YNNS+ ++I+MA
Sbjct: 143 KLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMA 202

Query: 346 PFKVMYGRRPRSLVGWLEV 402
           PFK +YGRR RS +GWLEV
Sbjct: 203 PFKALYGRRCRSPIGWLEV 221


>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 48/100 (48%), Positives = 59/100 (59%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTSHFW+  Q+ LGTRV+L+TAFHPQ D Q +  I  L        L L     +     
Sbjct: 1663 FTSHFWKSFQRGLGTRVKLTTAFHPQTDGQAERTIQTLEDMLRACVLELKGSWEDHLPLI 1722

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEVVFIVFL 420
             F YNNSY+S+I MAPF+ +YGRR RS VG  EV  +  L
Sbjct: 1723 EFSYNNSYHSSIGMAPFEALYGRRCRSSVGLFEVGEVALL 1762


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 1168 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRACVIDLGVRWEQYLPLV 1227

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1228 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1261


>gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 367

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 46/98 (46%), Positives = 63/98 (64%), Gaps = 5/98 (5%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLR--ICCEPA*LTLLVIGINFYS 294
           FTS FW+ +  ELGTR++LSTAFHPQ D Q +  I  L   IC       ++  G ++ S
Sbjct: 68  FTSKFWKILHAELGTRLDLSTAFHPQTDGQSERTIQVLEDMICA-----CVIEFGGHWDS 122

Query: 295 W---RSFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
           +     F YNNSY+S+I+MAPF+ +YGRR RS +GW +
Sbjct: 123 FLPLAEFSYNNSYHSSIDMAPFEALYGRRCRSPIGWFD 160


>ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599406 [Solanum tuberosum]
          Length = 859

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 45/93 (48%), Positives = 53/93 (56%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FWR  Q ELGTRV+L T FHPQ D Q +  I  L        +             
Sbjct: 544 FTSSFWRTFQDELGTRVDLCTTFHPQTDGQSERTIKVLEDMLRACVMDFGGQWDQHLPLA 603

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
            F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 604 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 636


>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 400 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 459

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 460 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 493


>gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 448

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 169 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 228

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 229 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 262


>gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
          Length = 415

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 136 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLV 195

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 196 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 229


>gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 562

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 349 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 408

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 409 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 442


>gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1502

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 1270 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 1329

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1330 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1363


>gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
           cacao]
          Length = 521

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 242 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 301

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 302 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 335


>gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 937

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 630 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSEWTIQTLEDMLRACVIDLGVRWEQYLPLV 689

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 690 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 723


>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 43/94 (45%), Positives = 57/94 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+ + STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 1239 FTSRFWGKLQEALGTKFDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 1298

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1299 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1332


>gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]
          Length = 545

 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 46/93 (49%), Positives = 54/93 (58%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS F R  Q+ELGTRV LSTAFHPQ D Q +  I  L        +        F    
Sbjct: 94  FTSSFLRAFQEELGTRVHLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLA 153

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
            F YNNSY+S+I+MAPF+ +YGRR  S VGW E
Sbjct: 154 EFAYNNSYHSSIQMAPFEALYGRRCHSPVGWFE 186


>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 50/139 (35%), Positives = 74/139 (53%), Gaps = 5/139 (3%)
 Frame = +1

Query: 1   LSREGNFISVQNLK*V-----FYGDYLISVISKKKSCLMDHNCM*FTSHFWRYIQKELGT 165
           L++  +F+SV+           Y D ++ +     S + D     FTS FW  +Q+ LGT
Sbjct: 450 LTKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQ-FTSRFWGKLQEALGT 508

Query: 166 RVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWRSFFYNNSYYSNIEMA 345
           +++ ST FHPQ D Q +  I  L        + L V    +     F YNNS+ ++I+MA
Sbjct: 509 KLDFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSIQMA 568

Query: 346 PFKVMYGRRPRSLVGWLEV 402
           PF+ +YGRR RS +GWLEV
Sbjct: 569 PFEALYGRRCRSPIGWLEV 587


>gb|EOY26288.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
           cacao]
          Length = 308

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 42/94 (44%), Positives = 57/94 (60%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ ST FHPQ D Q +  I  L        + L V    +    
Sbjct: 128 FTSRFWGKLQEALGTKLDFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 187

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 188 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 221


>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 42/93 (45%), Positives = 57/93 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +    
Sbjct: 777  FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 836

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLE
Sbjct: 837  EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 869

