BLASTX nr result

ID: Cornus23_contig00028991 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00028991
         (256 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_013694964.1| PREDICTED: uncharacterized protein LOC106399...    88   2e-15
ref|XP_013694892.1| PREDICTED: uncharacterized protein LOC106398...    85   2e-14
ref|XP_013746037.1| PREDICTED: uncharacterized protein LOC106448...    84   4e-14
ref|XP_013634635.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    83   7e-14
ref|XP_010412456.1| PREDICTED: uncharacterized protein LOC104698...    81   3e-13
ref|XP_013745520.1| PREDICTED: uncharacterized protein LOC106448...    80   5e-13
emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]    77   5e-12
ref|XP_010278719.1| PREDICTED: uncharacterized protein LOC104612...    76   9e-12
ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [The...    75   1e-11
ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobrom...    75   2e-11
ref|XP_007023626.1| Uncharacterized protein TCM_046829 [Theobrom...    74   4e-11
ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [The...    74   4e-11
ref|XP_007049888.1| DNA/RNA polymerases superfamily protein, par...    74   4e-11
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...    74   6e-11
ref|XP_010266336.1| PREDICTED: uncharacterized protein LOC104603...    73   9e-11
ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom...    72   1e-10
ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobrom...    72   1e-10
ref|XP_008234059.1| PREDICTED: uncharacterized protein LOC103333...    71   3e-10
ref|XP_013624326.1| PREDICTED: uncharacterized protein LOC106330...    71   4e-10
ref|XP_012704376.1| PREDICTED: uncharacterized protein LOC105915...    70   8e-10

>ref|XP_013694964.1| PREDICTED: uncharacterized protein LOC106399026 [Brassica napus]
           gi|923823351|ref|XP_013695069.1| PREDICTED:
           uncharacterized protein LOC106399147 [Brassica napus]
          Length = 674

 Score = 88.2 bits (217), Expect = 2e-15
 Identities = 39/81 (48%), Positives = 55/81 (67%)
 Frame = -3

Query: 245 CSLGEKIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGD 66
           CSL  ++I E H  GH GRD++L ++ N  +W  L RD+ R+VE   VCQ+ +G A+N  
Sbjct: 588 CSLRLQLISETHGEGHVGRDRTLHLLSNSYFWPTLHRDIERYVERCVVCQQAKGSASNAG 647

Query: 65  LYMPLLIPNRPWVAISMDFVL 3
           LY+PL +P +PW  ISMDFV+
Sbjct: 648 LYLPLPVPTQPWTDISMDFVV 668


>ref|XP_013694892.1| PREDICTED: uncharacterized protein LOC106398939 [Brassica napus]
          Length = 1353

 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 39/80 (48%), Positives = 56/80 (70%)
 Frame = -3

Query: 242  SLGEKIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDL 63
            SL  ++I E HN GH GRD++L+++ +  +W +LRRDV R +    +CQ  +G A+N  L
Sbjct: 1064 SLRLQVIREFHNEGHVGRDRTLQLVTSSYFWPSLRRDVERFIVRCGICQASKGHASNAGL 1123

Query: 62   YMPLLIPNRPWVAISMDFVL 3
            Y+PL IP++PW  ISMDFVL
Sbjct: 1124 YLPLPIPSQPWTDISMDFVL 1143


>ref|XP_013746037.1| PREDICTED: uncharacterized protein LOC106448737 [Brassica napus]
            gi|923808461|ref|XP_013690050.1| PREDICTED:
            uncharacterized protein LOC106393961 [Brassica napus]
          Length = 1236

 Score = 84.0 bits (206), Expect = 4e-14
 Identities = 40/81 (49%), Positives = 53/81 (65%)
 Frame = -3

Query: 245  CSLGEKIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGD 66
            CSL  +II E H  GH GRD++L+++    +W  LRRDV R V     CQ+ +G A+N  
Sbjct: 1111 CSLRLQIIKELHGEGHIGRDRTLKLVAESYFWPTLRRDVERFVARCTSCQQGKGQASNAG 1170

Query: 65   LYMPLLIPNRPWVAISMDFVL 3
            LY+PL +P +PW  ISMDFVL
Sbjct: 1171 LYLPLPVPTQPWSDISMDFVL 1191


>ref|XP_013634635.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC106340299 [Brassica oleracea var. oleracea]
          Length = 543

 Score = 83.2 bits (204), Expect = 7e-14
 Identities = 39/76 (51%), Positives = 50/76 (65%)
 Frame = -3

Query: 230 KIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDLYMPL 51
           +II E HN GH GRD++L+++    +W  LRRDV R V    VCQ+ +G   N  LYMPL
Sbjct: 54  QIITELHNEGHVGRDRTLKLVAESYFWPTLRRDVERFVAWCTVCQQGKGQTLNAGLYMPL 113

Query: 50  LIPNRPWVAISMDFVL 3
            IP +PW  +SMDFVL
Sbjct: 114 PIPTQPWTDLSMDFVL 129


>ref|XP_010412456.1| PREDICTED: uncharacterized protein LOC104698758 [Camelina sativa]
          Length = 1103

 Score = 81.3 bits (199), Expect = 3e-13
 Identities = 36/76 (47%), Positives = 51/76 (67%)
 Frame = -3

Query: 230  KIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDLYMPL 51
            K+I E H+ GH GRD++L+++    YW +LRR+V R VE    CQ  +G  +N  LY+PL
Sbjct: 798  KMIEELHSEGHVGRDRTLQLVSASYYWPSLRRNVERCVERCRACQLAKGTTSNAVLYLPL 857

Query: 50   LIPNRPWVAISMDFVL 3
             IP +PW  +SMDF+L
Sbjct: 858  PIPTKPWTDVSMDFIL 873


>ref|XP_013745520.1| PREDICTED: uncharacterized protein LOC106448142 [Brassica napus]
          Length = 701

 Score = 80.5 bits (197), Expect = 5e-13
 Identities = 38/76 (50%), Positives = 50/76 (65%)
 Frame = -3

Query: 230 KIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDLYMPL 51
           +II E H  GH GRD++L+++ +  YW +LRRDV R V     CQ  +G  +N  LY+PL
Sbjct: 338 QIIQELHKEGHVGRDRTLKLVADSYYWPSLRRDVERFVARCTTCQEGKGHVSNAGLYLPL 397

Query: 50  LIPNRPWVAISMDFVL 3
            IP +PW  ISMDFVL
Sbjct: 398 PIPTQPWSDISMDFVL 413


>emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]
          Length = 665

 Score = 77.0 bits (188), Expect = 5e-12
 Identities = 38/83 (45%), Positives = 53/83 (63%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242 SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
           SL E++I E H+ G   H GRDK++ M  +  YW +L+RDVT++V     CQ  +G   N
Sbjct: 349 SLREQVIWELHSRGXAXHFGRDKTIAMTEDHFYWPSLKRDVTKNVSKCRTCQPSKGRKKN 408

Query: 71  GDLYMPLLIPNRPWVAISMDFVL 3
             LYMPL +P+ PW  +S+DFVL
Sbjct: 409 TGLYMPLPVPHEPWQELSIDFVL 431


>ref|XP_010278719.1| PREDICTED: uncharacterized protein LOC104612828 [Nelumbo nucifera]
          Length = 925

 Score = 76.3 bits (186), Expect = 9e-12
 Identities = 41/83 (49%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242 SLGEKIIIE*HN---LGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
           SL EKII + H     GH GRDK++E +    YW  LRRDVT  V   ++CQ  +G A N
Sbjct: 622 SLREKIIKDLHGGGLAGHLGRDKTIEAVKGRYYWPKLRRDVTTIVSRCYICQTAKGQAQN 681

Query: 71  GDLYMPLLIPNRPWVAISMDFVL 3
             LYMPL IP   W  + MDFVL
Sbjct: 682 TGLYMPLPIPTAIWEDLPMDFVL 704


>ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508700944|gb|EOX92840.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 647

 Score = 75.5 bits (184), Expect = 1e-11
 Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 3/86 (3%)
 Frame = -3

Query: 251 LKCSLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGV 81
           L+ SL E+II E H  G   H GRDK+L M+ +  YW  + RDV R V+    C   +G 
Sbjct: 280 LEGSLREQIIGELHGNGLGGHFGRDKTLAMVADRYYWPKMHRDVERLVKRCSTCLFGKGS 339

Query: 80  ATNGDLYMPLLIPNRPWVAISMDFVL 3
           A N  LY+PLL P+ PW+ +SMDFVL
Sbjct: 340 AQNTGLYVPLLEPDAPWIHLSMDFVL 365


>ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobroma cacao]
           gi|508724940|gb|EOY16837.1| Uncharacterized protein
           TCM_035725 [Theobroma cacao]
          Length = 499

 Score = 74.7 bits (182), Expect = 2e-11
 Identities = 41/85 (48%), Positives = 52/85 (61%), Gaps = 3/85 (3%)
 Frame = -3

Query: 248 KCSLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVA 78
           K SL E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A
Sbjct: 60  KGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSA 119

Query: 77  TNGDLYMPLLIPNRPWVAISMDFVL 3
            N  LY+PL  P+ PW+ +SMDFVL
Sbjct: 120 QNTGLYVPLPEPDAPWIHLSMDFVL 144


>ref|XP_007023626.1| Uncharacterized protein TCM_046829 [Theobroma cacao]
           gi|508778992|gb|EOY26248.1| Uncharacterized protein
           TCM_046829 [Theobroma cacao]
          Length = 672

 Score = 73.9 bits (180), Expect = 4e-11
 Identities = 40/83 (48%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242 SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
           SL E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A N
Sbjct: 286 SLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQN 345

Query: 71  GDLYMPLLIPNRPWVAISMDFVL 3
             LY+PL  P+ PW+ +SMDFVL
Sbjct: 346 TGLYVPLPEPDAPWIHLSMDFVL 368


>ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508709261|gb|EOY01158.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 786

 Score = 73.9 bits (180), Expect = 4e-11
 Identities = 40/83 (48%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242 SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
           SL E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A N
Sbjct: 467 SLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQN 526

Query: 71  GDLYMPLLIPNRPWVAISMDFVL 3
             LY+PL  P+ PW+ +SMDFVL
Sbjct: 527 TGLYVPLPEPDAPWIHLSMDFVL 549


>ref|XP_007049888.1| DNA/RNA polymerases superfamily protein, partial [Theobroma cacao]
           gi|508702149|gb|EOX94045.1| DNA/RNA polymerases
           superfamily protein, partial [Theobroma cacao]
          Length = 624

 Score = 73.9 bits (180), Expect = 4e-11
 Identities = 40/83 (48%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242 SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
           SL E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A N
Sbjct: 467 SLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQN 526

Query: 71  GDLYMPLLIPNRPWVAISMDFVL 3
             LY+PL  P+ PW+ +SMDFVL
Sbjct: 527 TGLYVPLPEPDAPWIHLSMDFVL 549


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1452

 Score = 73.6 bits (179), Expect = 6e-11
 Identities = 40/83 (48%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242  SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
            SL E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A N
Sbjct: 1015 SLREQIIRELHGNGLGGHFGRDKTLVMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQN 1074

Query: 71   GDLYMPLLIPNRPWVAISMDFVL 3
              LY+PL  P+ PW+ +SMDFVL
Sbjct: 1075 TGLYVPLPEPDAPWIHLSMDFVL 1097


>ref|XP_010266336.1| PREDICTED: uncharacterized protein LOC104603865 [Nelumbo nucifera]
          Length = 338

 Score = 72.8 bits (177), Expect = 9e-11
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 3/80 (3%)
 Frame = -3

Query: 233 EKIIIE*HN---LGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDL 63
           EKII + H    +GH GRDK++E +    YW  LRRDV+  V   +VCQ  +G   N  L
Sbjct: 254 EKIIRDLHGGGLVGHLGRDKTVEAVKGRYYWPKLRRDVSTIVSRCYVCQIAKGKTQNTGL 313

Query: 62  YMPLLIPNRPWVAISMDFVL 3
           YMPL +P+  W  +SMDFVL
Sbjct: 314 YMPLPVPSAIWEDLSMDFVL 333


>ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
            gi|508727408|gb|EOY19305.1| Uncharacterized protein
            TCM_044370 [Theobroma cacao]
          Length = 1306

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 39/82 (47%), Positives = 50/82 (60%), Gaps = 3/82 (3%)
 Frame = -3

Query: 239  LGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNG 69
            L E+II E H  G   H GRDK+L M+ +  YW  +RRDV R V+    C   +G A N 
Sbjct: 912  LREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLVKRCPTCLFGKGSAQNT 971

Query: 68   DLYMPLLIPNRPWVAISMDFVL 3
             LY+PL  P+ PW+ +SMDFVL
Sbjct: 972  GLYVPLPEPDAPWIHLSMDFVL 993


>ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobroma cacao]
            gi|508724802|gb|EOY16699.1| Uncharacterized protein
            TCM_035549 [Theobroma cacao]
          Length = 1392

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 39/83 (46%), Positives = 51/83 (61%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242  SLGEKIIIE*HNLG---HSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
            SL E+II E H  G   H GRDK+L M+ +  YW  +R+DV R V+    C   +G A N
Sbjct: 955  SLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRQDVERLVKRCPTCLFGKGSAQN 1014

Query: 71   GDLYMPLLIPNRPWVAISMDFVL 3
              LY+PL  P+ PW+ +SMDFVL
Sbjct: 1015 TGLYVPLPEPDAPWIHLSMDFVL 1037


>ref|XP_008234059.1| PREDICTED: uncharacterized protein LOC103333039 [Prunus mume]
          Length = 1268

 Score = 71.2 bits (173), Expect = 3e-10
 Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
 Frame = -3

Query: 242  SLGEKIIIE*HN---LGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATN 72
            SL EK+I + H     GH GRDK++ ++ +  YW  L+RDV   V   ++CQ  +G   N
Sbjct: 1124 SLREKLIRDLHGGGLSGHLGRDKTIALLEDRFYWPQLKRDVGTIVRKCYICQTSKGQVQN 1183

Query: 71   GDLYMPLLIPNRPWVAISMDFVL 3
              LYMPL +PN  W  ++MDFVL
Sbjct: 1184 TGLYMPLPVPNDIWQDLAMDFVL 1206


>ref|XP_013624326.1| PREDICTED: uncharacterized protein LOC106330401 [Brassica oleracea
           var. oleracea]
          Length = 273

 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 34/71 (47%), Positives = 46/71 (64%)
 Frame = -3

Query: 242 SLGEKIIIE*HNLGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVATNGDL 63
           SL  K+I E H  GH GRD++L+++    +W +LRRDV R V    VCQR +G A+N  L
Sbjct: 55  SLRLKVIKELHEEGHVGRDRTLQLVMESYFWPSLRRDVYRFVARCVVCQRSKGHASNSGL 114

Query: 62  YMPLLIPNRPW 30
           Y+ L IP +PW
Sbjct: 115 YLSLPIPTQPW 125


>ref|XP_012704376.1| PREDICTED: uncharacterized protein LOC105915107 [Setaria italica]
          Length = 1399

 Score = 69.7 bits (169), Expect = 8e-10
 Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 3/84 (3%)
 Frame = -3

Query: 245  CSLGEKIIIE*HN---LGHSGRDKSLEMIGNELYWLNLRRDVTRHVEH*HVCQRERGVAT 75
            CS+   ++ E H     GH G  K+L+M+ +  +W ++RRDV RHVE    C + +    
Sbjct: 1056 CSIRHVLLQEAHAGGLAGHFGMKKTLDMLADHFFWPHMRRDVQRHVERCITCLKAKSRLN 1115

Query: 74   NGDLYMPLLIPNRPWVAISMDFVL 3
               LY+PL IPN PW  ISMDF+L
Sbjct: 1116 PHGLYIPLPIPNVPWEDISMDFIL 1139

