BLASTX nr result

ID: Sinomenium22_contig00047104 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00047104
         (398 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medi...    83   4e-14
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...    79   5e-13
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...    78   1e-12
ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part...    76   5e-12
ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ...    74   2e-11
ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221...    74   3e-11
gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea bat...    73   5e-11
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...    72   8e-11
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                  72   8e-11
gb|ABD33247.2| RNA-directed DNA polymerase (Reverse transcriptas...    72   8e-11
ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom...    72   1e-10
ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, part...    71   1e-10
ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223...    70   4e-10
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                  69   5e-10
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...    69   7e-10
ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624...    68   1e-09
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...    67   3e-09
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...    67   3e-09
ref|XP_006341306.1| PREDICTED: uncharacterized protein LOC102594...    67   3e-09
emb|CAN65853.1| hypothetical protein VITISV_004966 [Vitis vinifera]    67   3e-09

>gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medicago truncatula]
          Length = 187

 Score = 82.8 bits (203), Expect = 4e-14
 Identities = 32/48 (66%), Positives = 40/48 (83%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           ++FSIG KY+D V CDV+ MD CH+LLGRPWQY RH ++DG ANTY+F
Sbjct: 35  VSFSIGQKYKDNVWCDVISMDACHMLLGRPWQYDRHALYDGHANTYTF 82


>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
           gi|508718388|gb|EOY10285.1| Uncharacterized protein
           TCM_025656 [Theobroma cacao]
          Length = 505

 Score = 79.3 bits (194), Expect = 5e-13
 Identities = 48/120 (40%), Positives = 66/120 (55%), Gaps = 4/120 (3%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           + FSIG+KY D V CD++ MD CHLLLGRPWQY R   HDG  NTYSF     K    L 
Sbjct: 290 VQFSIGNKYEDEVWCDIIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIKDGAK--IMLT 347

Query: 186 GVKTLQNP----QARILTSLPKLCWSRRWRNKVWCMLGWGKQVKSEASATSEPVRALINE 353
            +K    P    + + L ++P L  +    N + C+L   K+ K  +S +++    LIN+
Sbjct: 348 PLKPENRPKRQEEDKALITVPSLSKAYCESNHL-CLLLVSKENKVSSSLSNDGQTKLINQ 406


>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
           gi|508704828|gb|EOX96724.1| Gag-pol polyprotein,
           putative [Theobroma cacao]
          Length = 794

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 45/122 (36%), Positives = 60/122 (49%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           + FSIG+KY D V CDV+ MD CHLLLGRPWQY R   HDG  NTYSF     K   +  
Sbjct: 441 VQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIKDGAKIMLTPL 500

Query: 186 GVKTLQNPQARILTSLPKLCWSRRWRNKVWCMLGWGKQVKSEASATSEPVRALINELFDV 365
             +     Q +    +     ++ +R      L    +    +S  S+ V+ +I E  DV
Sbjct: 501 KPEDCPKKQEKDKALITMSGLNKAFRKSSLLYLLLVCEENEVSSPLSKDVKPIIEEFCDV 560

Query: 366 FP 371
            P
Sbjct: 561 VP 562


>ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema
           salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical
           protein EUTSA_v10015409mg, partial [Eutrema salsugineum]
          Length = 367

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 32/48 (66%), Positives = 38/48 (79%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           ++FSIGS Y+D + CDV  MDV HLLLG PWQY R V+HDGR N+YSF
Sbjct: 284 VSFSIGSHYKDKIYCDVALMDVSHLLLGTPWQYDRSVMHDGRRNSYSF 331


>ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
           gi|508712364|gb|EOY04261.1| Uncharacterized protein
           TCM_019516, partial [Theobroma cacao]
          Length = 215

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 32/48 (66%), Positives = 35/48 (72%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           + FSIG+KY D V CDV+ MD C LLLGRPWQY R   HDG  NTYSF
Sbjct: 117 VQFSIGNKYEDEVWCDVIPMDACQLLLGRPWQYDRRAHHDGYKNTYSF 164


>ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221019 [Cucumis sativus]
          Length = 390

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 30/49 (61%), Positives = 37/49 (75%)
 Frame = +3

Query: 3   TINFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           T+  SIG+ Y+D ++CDV+ MDVCHLLLGRPWQY    +H GR NTY F
Sbjct: 105 TVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEF 153


>gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas]
          Length = 1358

 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 34/71 (47%), Positives = 46/71 (64%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           I+ SIG KY+D VLCDV+ M  CH+LLGRPWQY R  +H G+ N Y+ + G +KY  +  
Sbjct: 467 ISISIG-KYQDDVLCDVIPMHACHILLGRPWQYDRDTLHHGKTNKYTIHKGGKKYTLTPL 525

Query: 186 GVKTLQNPQAR 218
             K + N Q +
Sbjct: 526 APKEVYNLQVQ 536


>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
           gi|508726763|gb|EOY18660.1| Uncharacterized protein
           TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 47/120 (39%), Positives = 64/120 (53%), Gaps = 4/120 (3%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           I F I +KY D V CDV+ MD CHLLLGRPWQY R   +DG  NTYSF     K    L 
Sbjct: 410 IQFFIRNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNTYSFIKDGVK--IMLT 467

Query: 186 GVKTLQNP----QARILTSLPKLCWSRRWRNKVWCMLGWGKQVKSEASATSEPVRALINE 353
            +K    P    + + L ++P L  +    N + C+L   K+ K  +S +++    LIN+
Sbjct: 468 PLKPEDRPKRQEEDKALITVPSLSKAYCESNHL-CLLLVSKENKVSSSLSNDGQTKLINQ 526


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 30/54 (55%), Positives = 37/54 (68%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQK 167
           + FSIG  Y D  LCDV+ MD CHLLLGRPW++ R  +H GR NTY+F    +K
Sbjct: 451 VTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTYTFKFRSRK 504


>gb|ABD33247.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
           truncatula]
          Length = 386

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 31/48 (64%), Positives = 40/48 (83%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           +NFSIG KY D +LCDVV M+  H+LLGRPWQ+ R+V HDG+ANT++F
Sbjct: 19  VNFSIG-KYNDEILCDVVPMEAGHILLGRPWQFDRNVFHDGKANTFTF 65


>ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
           gi|508716797|gb|EOY08694.1| Uncharacterized protein
           TCM_023754 [Theobroma cacao]
          Length = 440

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 47/120 (39%), Positives = 64/120 (53%), Gaps = 4/120 (3%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           + FSIGSKY D V CDV+ MD CHLLLGRPWQY R   +DG  N  SF     K    L 
Sbjct: 225 VQFSIGSKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNISSFIKDGVK--IMLT 282

Query: 186 GVKTLQNP----QARILTSLPKLCWSRRWRNKVWCMLGWGKQVKSEASATSEPVRALINE 353
            +K    P    + + L ++P L  +    N + C+L   K+ K  +S +++    LIN+
Sbjct: 283 PLKPEDRPKRQEEDKALITVPTLSKTYCESNHL-CLLLVSKKNKVSSSLSNDGQTKLINQ 341


>ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, partial [Capsella rubella]
           gi|482559124|gb|EOA23315.1| hypothetical protein
           CARUB_v10019144mg, partial [Capsella rubella]
          Length = 110

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 29/48 (60%), Positives = 37/48 (77%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           + F IG+ Y+D V+CD++ MD CHLLLGRPWQY R ++HDG ANT  F
Sbjct: 61  VPFLIGANYKDLVICDILPMDACHLLLGRPWQYDRRIMHDGFANTIIF 108


>ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223713 [Cucumis sativus]
          Length = 645

 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 29/51 (56%), Positives = 36/51 (70%)
 Frame = +3

Query: 3   TINFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNL 155
           T+  SI + Y+D ++CDV+ MDVCHLLLGRPWQY    +H GR NTY   L
Sbjct: 353 TVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQL 403


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 31/49 (63%), Positives = 36/49 (73%), Gaps = 1/49 (2%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVC-MDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           I+FSIG  Y+D VLCDVV  MD CHLLLGRPW+Y R+  H G+ N Y F
Sbjct: 462 ISFSIGKMYKDEVLCDVVVPMDACHLLLGRPWEYDRNTTHQGKDNVYIF 510


>ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
           gi|557089351|gb|ESQ30059.1| hypothetical protein
           EUTSA_v10012229mg [Eutrema salsugineum]
          Length = 382

 Score = 68.9 bits (167), Expect = 7e-10
 Identities = 27/48 (56%), Positives = 37/48 (77%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSF 149
           ++FSIG+ Y+D + CD+  MDV HL+LGRPWQ+ R   H+G+ NTYSF
Sbjct: 275 VSFSIGAFYKDTIYCDIAPMDVSHLILGRPWQFDRDTCHNGKKNTYSF 322


>ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624489 [Citrus sinensis]
          Length = 1083

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 32/54 (59%), Positives = 39/54 (72%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQK 167
           ++FSIG +Y+D VLCDVV M   H+LLGRPWQY R V HDG  N YSF +  +K
Sbjct: 295 VSFSIG-RYKDDVLCDVVPMHAGHILLGRPWQYDRRVTHDGYLNRYSFVINKRK 347


>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 33/78 (42%), Positives = 45/78 (57%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           I+F++G +Y D +LCDVV M  CH+LLGRPWQY R   H GR N YS     +KY  +  
Sbjct: 485 ISFNVG-RYEDEILCDVVPMQACHVLLGRPWQYDRDTTHHGRKNRYSLLHNGKKYTLAPL 543

Query: 186 GVKTLQNPQARILTSLPK 239
               +   Q R+  ++ K
Sbjct: 544 SPSQVFEDQKRLRETMGK 561


>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 33/78 (42%), Positives = 45/78 (57%)
 Frame = +3

Query: 6   INFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLA 185
           I+F++G +Y D +LCDVV M  CH+LLGRPWQY R   H GR N YS     +KY  +  
Sbjct: 485 ISFNVG-RYEDEILCDVVPMQACHVLLGRPWQYDRDTTHHGRKNRYSLLHNGKKYTLAPL 543

Query: 186 GVKTLQNPQARILTSLPK 239
               +   Q R+  ++ K
Sbjct: 544 SPSQVFEDQKRLRETMGK 561


>ref|XP_006341306.1| PREDICTED: uncharacterized protein LOC102594621 [Solanum tuberosum]
          Length = 631

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 30/56 (53%), Positives = 42/56 (75%)
 Frame = +3

Query: 3   TINFSIGSKYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKY 170
           +I FS+G KY + ++CDVV M   HLLLGRPWQ+GR ++H GR+N Y+F +  +KY
Sbjct: 508 SIQFSVG-KYNEELVCDVVPMLAYHLLLGRPWQFGRDIMHQGRSNKYTFVIEGKKY 562


>emb|CAN65853.1| hypothetical protein VITISV_004966 [Vitis vinifera]
          Length = 556

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 36/72 (50%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
 Frame = +3

Query: 27  KYRDGVLCDVVCMDVCHLLLGRPWQYGRHVIHDGRANTYSFNLGLQKYCCSLAGVKTLQN 206
           KY D VLCDVV M V HLLLGRPWQ+ RHV HD   N YSF L            K +  
Sbjct: 299 KYEDEVLCDVVLMQVGHLLLGRPWQFDRHVKHDDFTNKYSFVLNQMTITLVPLSPKQVYE 358

Query: 207 PQAR-ILTSLPK 239
            Q R ++ S PK
Sbjct: 359 DQVRSLMMSFPK 370

