BLASTX nr result

ID: Cocculus23_contig00026807

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00026807
         (298 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom...    92   1e-16
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...    91   2e-16
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...    90   3e-16
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                  87   3e-15
ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ...    85   9e-15
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...    85   1e-14
gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medi...    82   6e-14
ref|XP_007050047.1| Uncharacterized protein TCM_003699 [Theobrom...    79   7e-13
ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221...    75   7e-12
ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, part...    75   1e-11
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                  75   1e-11
ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part...    74   2e-11
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...    69   2e-11
ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594...    74   3e-11
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...    74   3e-11
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...    74   3e-11
ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prun...    72   6e-11
ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prun...    72   1e-10
gb|ABD33247.2| RNA-directed DNA polymerase (Reverse transcriptas...    72   1e-10
ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223...    70   4e-10

>ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
           gi|508716797|gb|EOY08694.1| Uncharacterized protein
           TCM_023754 [Theobroma cacao]
          Length = 440

 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 39/63 (61%), Positives = 50/63 (79%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDG 211
           E+     KL WL KG+EV V++RC V FSIGS+Y+D+V CDV+PMDACHLLLG+PWQ+D 
Sbjct: 200 EVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGSKYEDEVWCDVIPMDACHLLLGRPWQYDR 259

Query: 212 GSH 220
            +H
Sbjct: 260 RAH 262


>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
           gi|508704828|gb|EOX96724.1| Gag-pol polyprotein,
           putative [Theobroma cacao]
          Length = 794

 Score = 90.5 bits (223), Expect = 2e-16
 Identities = 38/63 (60%), Positives = 50/63 (79%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDG 211
           E+     KL WL KG+EV V++RC V FSIG++Y+D+V CDV+PMDACHLLLG+PWQ+D 
Sbjct: 416 EVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQYDR 475

Query: 212 GSH 220
            +H
Sbjct: 476 RAH 478


>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
           gi|508718388|gb|EOY10285.1| Uncharacterized protein
           TCM_025656 [Theobroma cacao]
          Length = 505

 Score = 90.1 bits (222), Expect = 3e-16
 Identities = 37/63 (58%), Positives = 50/63 (79%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDG 211
           E+     KL WL KG+EV V++RC V FSIG++Y+D+V CD++PMDACHLLLG+PWQ+D 
Sbjct: 265 EVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDIIPMDACHLLLGRPWQYDR 324

Query: 212 GSH 220
            +H
Sbjct: 325 RAH 327


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score = 86.7 bits (213), Expect = 3e-15
 Identities = 37/55 (67%), Positives = 43/55 (78%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDGGS 217
           KL WLNKG+EV V ++C V FSIG  Y D+  CDVLPMDACHLLLG+PW+FD  S
Sbjct: 433 KLRWLNKGAEVRVDKQCLVTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFDRDS 487


>ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
           gi|508712364|gb|EOY04261.1| Uncharacterized protein
           TCM_019516, partial [Theobroma cacao]
          Length = 215

 Score = 85.1 bits (209), Expect = 9e-15
 Identities = 35/56 (62%), Positives = 46/56 (82%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDGGSH 220
           KL WL KG+EV V++ C V FSIG++Y+D+V CDV+PMDAC LLLG+PWQ+D  +H
Sbjct: 99  KLQWLRKGNEVKVTKHCCVQFSIGNKYEDEVWCDVIPMDACQLLLGRPWQYDRRAH 154


>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
           gi|508726763|gb|EOY18660.1| Uncharacterized protein
           TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 84.7 bits (208), Expect = 1e-14
 Identities = 35/63 (55%), Positives = 48/63 (76%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDG 211
           E+     KL WL KG+EV V++RC + F I ++Y+D+V CDV+PMDACHLLLG+PWQ+D 
Sbjct: 385 EVHPHPYKLQWLRKGNEVKVTKRCCIQFFIRNKYEDEVWCDVIPMDACHLLLGRPWQYDR 444

Query: 212 GSH 220
            +H
Sbjct: 445 RAH 447


>gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medicago truncatula]
          Length = 187

 Score = 82.4 bits (202), Expect = 6e-14
 Identities = 34/52 (65%), Positives = 43/52 (82%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           KL WL KG+EV VS+ C V+FSIG +Y+D V CDV+ MDACH+LLG+PWQ+D
Sbjct: 17  KLQWLKKGNEVRVSKCCLVSFSIGQKYKDNVWCDVISMDACHMLLGRPWQYD 68


>ref|XP_007050047.1| Uncharacterized protein TCM_003699 [Theobroma cacao]
           gi|508702308|gb|EOX94204.1| Uncharacterized protein
           TCM_003699 [Theobroma cacao]
          Length = 258

 Score = 79.0 bits (193), Expect = 7e-13
 Identities = 33/56 (58%), Positives = 43/56 (76%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDGGSH 220
           KL WL KG+EV V + C V F IG++YQD++ CDV+PMDACHL LG+P Q+D  +H
Sbjct: 127 KLQWLRKGNEVKVMKHCCVQFYIGNKYQDEIWCDVIPMDACHLFLGRPCQYDCQAH 182


>ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221019 [Cucumis sativus]
          Length = 390

 Score = 75.5 bits (184), Expect = 7e-12
 Identities = 35/76 (46%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFDG 211
           E   T  K+ W+ KG E TVS  C V  SIG+ Y+D++ CDV+ MD CHLLLG+PWQ+D 
Sbjct: 81  EAHPTPYKIGWVKKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDT 140

Query: 212 GS-HMIVAKIHKVSWL 256
            S H      ++  W+
Sbjct: 141 QSLHKGRENTYEFQWM 156


>ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, partial [Capsella rubella]
           gi|482559124|gb|EOA23315.1| hypothetical protein
           CARUB_v10019144mg, partial [Capsella rubella]
          Length = 110

 Score = 75.1 bits (183), Expect = 1e-11
 Identities = 31/51 (60%), Positives = 40/51 (78%)
 Frame = +2

Query: 56  LYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           L WLN  ++  +S+RC V F IG+ Y+D V CD+LPMDACHLLLG+PWQ+D
Sbjct: 44  LAWLNSSTDSRLSKRCRVPFLIGANYKDLVICDILPMDACHLLLGRPWQYD 94


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 32/53 (60%), Positives = 43/53 (81%), Gaps = 1/53 (1%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CD-VLPMDACHLLLGKPWQFD 208
           KL WL+K S V V ++C ++FSIG  Y+D+V CD V+PMDACHLLLG+PW++D
Sbjct: 444 KLRWLSKDSGVRVDKQCIISFSIGKMYKDEVLCDVVVPMDACHLLLGRPWEYD 496


>ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema
           salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical
           protein EUTSA_v10015409mg, partial [Eutrema salsugineum]
          Length = 367

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 31/52 (59%), Positives = 38/52 (73%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           KL WL +G E+ +  RC V+FSIGS Y+DK+ CDV  MD  HLLLG PWQ+D
Sbjct: 266 KLTWLKQGVEIRIEHRCLVSFSIGSHYKDKIYCDVALMDVSHLLLGTPWQYD 317


>ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
           gi|557089351|gb|ESQ30059.1| hypothetical protein
           EUTSA_v10012229mg [Eutrema salsugineum]
          Length = 382

 Score = 68.9 bits (167), Expect(2) = 2e-11
 Identities = 26/51 (50%), Positives = 39/51 (76%)
 Frame = +2

Query: 56  LYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           L W+ +G++V ++ R  V+FSIG+ Y+D + CD+ PMD  HL+LG+PWQFD
Sbjct: 258 LAWITEGTDVKITHRALVSFSIGAFYKDTIYCDIAPMDVSHLILGRPWQFD 308



 Score = 25.0 bits (53), Expect(2) = 2e-11
 Identities = 13/36 (36%), Positives = 20/36 (55%)
 Frame = +1

Query: 184 VGKTMAI*RGVTYDSCKNT*SFMVGSVHIVLMPNKE 291
           +G+     R   ++  KNT SF+  +  IVL+PN E
Sbjct: 301 LGRPWQFDRDTCHNGKKNTYSFVFENRKIVLLPNPE 336


>ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594328 [Solanum tuberosum]
          Length = 1191

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 29/55 (52%), Positives = 44/55 (80%)
 Frame = +2

Query: 44  TTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           T  +L WLN   EV V+++C ++F++G RY+D++ CDV+PM ACH+LLG+PWQ+D
Sbjct: 490 TPYRLQWLNDCGEVKVNKQCMISFNVG-RYEDEILCDVVPMQACHVLLGRPWQYD 543


>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 29/55 (52%), Positives = 44/55 (80%)
 Frame = +2

Query: 44  TTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           T  +L WLN   EV V+++C ++F++G RY+D++ CDV+PM ACH+LLG+PWQ+D
Sbjct: 464 TPYRLQWLNDCGEVQVNKQCMISFNVG-RYEDEILCDVVPMQACHVLLGRPWQYD 517


>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 29/55 (52%), Positives = 44/55 (80%)
 Frame = +2

Query: 44  TTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           T  +L WLN   EV V+++C ++F++G RY+D++ CDV+PM ACH+LLG+PWQ+D
Sbjct: 464 TPYRLQWLNDCGEVQVNKQCMISFNVG-RYEDEILCDVVPMQACHVLLGRPWQYD 517


>ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
           gi|462405925|gb|EMJ11389.1| hypothetical protein
           PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 32/59 (54%), Positives = 40/59 (67%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           E  V+   L W+ KG  V V+  C V  SIG  Y+D+V CDV+ MDACH+LLG+PWQFD
Sbjct: 462 EPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDVIDMDACHILLGRPWQFD 520


>ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
           gi|462402874|gb|EMJ08431.1| hypothetical protein
           PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 32/59 (54%), Positives = 39/59 (66%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           E  V+   L W+ KG  V V+  C V  SIG  Y+D V CDV+ MDACH+LLG+PWQFD
Sbjct: 473 EPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDVIDMDACHILLGRPWQFD 531


>gb|ABD33247.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
           truncatula]
          Length = 386

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 36/76 (47%), Positives = 49/76 (64%), Gaps = 8/76 (10%)
 Frame = +2

Query: 53  KLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD-------- 208
           KL WL +G E+ V ++  VNFSIG +Y D++ CDV+PM+A H+LLG+PWQFD        
Sbjct: 1   KLQWLKEGVELLVDKQVLVNFSIG-KYNDEILCDVVPMEAGHILLGRPWQFDRNVFHDGK 59

Query: 209 GGSHMIVAKIHKVSWL 256
             +   V K HK+S L
Sbjct: 60  ANTFTFVHKKHKISLL 75


>ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223713 [Cucumis sativus]
          Length = 645

 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 30/59 (50%), Positives = 40/59 (67%)
 Frame = +2

Query: 32  EIEVTTEKLYWLNKGSEVTVSRRCFVNFSIGSRYQDKV*CDVLPMDACHLLLGKPWQFD 208
           E   T+ K+ W+ K  E TVS  C V  SI + Y+D++ CDV+ MD CHLLLG+PWQ+D
Sbjct: 329 EAHPTSYKIGWVRKEGEATVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD 387
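
The per-alignment statistics in this report (the "Score = ... Expect = ..." and "Identities = ..." lines) follow a regular layout, so they can be collected with a short script. The following is a minimal sketch, assuming the legacy plain-text layout shown above; `parse_hsp_stats` and its regular expressions are hypothetical helpers written for illustration and are not part of BLAST. In this report the printed identity percentages look truncated rather than rounded (39/63 is shown as 61%), so the sketch recomputes the percentage with integer division.

```python
import re

# Hypothetical helper (not part of BLAST): collects per-HSP statistics from
# legacy plain-text BLAST report lines like the ones in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+) bits \((\d+)\), Expect(?:\(\d+\))?\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an Expect string to float; legacy BLAST may print 'e-100'."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(report_text):
    """Return one dict per 'Score = ...' line, in report order."""
    hsps = []
    for line in report_text.splitlines():
        score = SCORE_RE.search(line)
        if score:
            hsps.append({
                "bits": float(score.group(1)),
                "raw_score": int(score.group(2)),
                "evalue": parse_evalue(score.group(3)),
            })
            continue
        ident = IDENT_RE.search(line)
        if ident and hsps:
            identical, aligned = int(ident.group(1)), int(ident.group(2))
            hsps[-1]["identical"] = identical
            hsps[-1]["aligned"] = aligned
            # Integer division mirrors the truncated percentages seen above.
            hsps[-1]["percent_identity"] = 100 * identical // aligned
    return hsps


# Two statistics blocks copied from the report above.
sample = """\
 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 39/63 (61%), Positives = 50/63 (79%)
 Score = 68.9 bits (167), Expect(2) = 2e-11
 Identities = 26/51 (50%), Positives = 39/51 (76%)
"""

for hsp in parse_hsp_stats(sample):
    print(hsp)
```

For anything beyond a quick look, an established parser or a tabular BLAST output format is likely a more robust choice than hand-written regular expressions.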

