BLASTX nr result

ID: Rehmannia26_contig00032013 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00032013
         (719 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...    63   1e-07
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]    62   1e-07
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]    62   1e-07
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]    62   2e-07
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]    62   2e-07
gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]    60   7e-07
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]    60   7e-07
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]    58   3e-06
gb|EOY32248.1| Uncharacterized protein TCM_039895 [Theobroma cacao]    58   3e-06
ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein A...    57   6e-06
ref|XP_004301067.1| PREDICTED: uncharacterized protein LOC101309...    57   6e-06
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]    57   8e-06
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    57   8e-06

>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 45/129 (34%), Positives = 62/129 (48%), Gaps = 8/129 (6%)
 Frame = +3

Query: 33   NWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHALP 191
            NW KP       N++GS     Q A GGGV+RDH   L   F       +S   E+ AL 
Sbjct: 1171 NWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALH 1230

Query: 192  IGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHILC 368
             GL + M+ + + V IE+DA  ++ +I + H+G  +IQ+                SHI  
Sbjct: 1231 RGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHR 1290

Query: 369  VGNQPADFL 395
             GNQ ADFL
Sbjct: 1291 EGNQAADFL 1299


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 62.4 bits (150), Expect = 1e-07
 Identities = 46/157 (29%), Positives = 71/157 (45%), Gaps = 8/157 (5%)
 Frame = +3

Query: 6    IVVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHA 185
            I+ W+       K N++GS    +Q A GGGV+RDH  +L   F        S   E+HA
Sbjct: 1542 IISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHA 1600

Query: 186  LPIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHI 362
            L  GL +  + + T++ IE+DA   V ++  S +G   I++                SHI
Sbjct: 1601 LLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHI 1660

Query: 363  LCVGNQPADFLVGR-------VLITPTQQIFQSPSAH 452
               GNQ ADFL  +        +++  Q+    P+ H
Sbjct: 1661 YREGNQAADFLSNKGQTHQSLCVVSEAQEFPSLPTMH 1697



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 40/128 (31%), Positives = 59/128 (46%), Gaps = 8/128 (6%)
 Frame = +3

Query: 36   WDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHALPI 194
            W+KP       N++GS  +  Q A GGG++RDH   +   F     +  S   E+ AL  
Sbjct: 3339 WNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHR 3398

Query: 195  GLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHILCV 371
            GL + +  + T + IE+DA   V +I+  HQG  + ++                SHI   
Sbjct: 3399 GLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFRE 3458

Query: 372  GNQPADFL 395
            GNQ AD L
Sbjct: 3459 GNQAADHL 3466


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 62.4 bits (150), Expect = 1e-07
 Identities = 47/162 (29%), Positives = 73/162 (45%), Gaps = 1/162 (0%)
 Frame = +3

Query: 6    IVVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHA 185
            I+ W+       K N++GS    N  A GGGV+RDH  +L   F        S   E+HA
Sbjct: 1785 IIYWIKPFIGEYKLNVDGS-SKSNLNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHA 1843

Query: 186  LPIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHI 362
            L  GL +  + + T++ IE+DA   V ++  S +G   I++                SHI
Sbjct: 1844 LLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHI 1903

Query: 363  LCVGNQPADFLVGRVLITPTQQIFQSPSAHKCLVALVWIDQL 488
               GNQ ADFL  +     +  +F        L+ ++ +D+L
Sbjct: 1904 YREGNQAADFLSNKGQTHQSLCVFSEAQGE--LIGILKLDKL 1943


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 41/133 (30%), Positives = 63/133 (47%), Gaps = 8/133 (6%)
 Frame = +3

Query: 30   YNWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHAL 188
            ++W KP       N++GS  H +  A GGG++RDH   +   F       +S   E+ AL
Sbjct: 745  FSWHKPTTGEFKLNVDGSAKHSHN-AAGGGILRDHAGVMVFGFSENLGIQNSLQAELLAL 803

Query: 189  PIGLQMAMQISTH-V*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHIL 365
              GL +    +   + IE+DA +++ L+  +H+GP  I++                SHI 
Sbjct: 804  YRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIF 863

Query: 366  CVGNQPADFLVGR 404
              GNQ ADFL  R
Sbjct: 864  REGNQAADFLANR 876


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 41/133 (30%), Positives = 63/133 (47%), Gaps = 8/133 (6%)
 Frame = +3

Query: 30   YNWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHAL 188
            ++W KP       N++GS    +  A GGG++RDH  E+   F       +S   E+ AL
Sbjct: 2086 FSWHKPSLGEFKLNVDGSAKQSHN-AAGGGILRDHAGEMVFGFSENLGTQNSLQAELLAL 2144

Query: 189  PIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHIL 365
              GL +    +   + IE+DA +++ L+  +H+GP  I++                SHI 
Sbjct: 2145 YRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIF 2204

Query: 366  CVGNQPADFLVGR 404
              GNQ ADFL  R
Sbjct: 2205 REGNQAADFLANR 2217


>gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]
          Length = 228

 Score = 60.1 bits (144), Expect = 7e-07
 Identities = 43/140 (30%), Positives = 65/140 (46%), Gaps = 8/140 (5%)
 Frame = +3

Query: 9   VVWLSTEYNWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSF 167
           V+ L    +W KP       N++GS  +  Q AGGGG++RDH   L  VF     A +S 
Sbjct: 41  VISLPKVISWHKPSTGEFKLNVDGSSINNFQNAGGGGLLRDHTSTLVFVFSENLGAKNSL 100

Query: 168 DVEIHALPIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXX 344
             E+ AL  GL +  + + + + IE+DA  ++ ++   H G    ++             
Sbjct: 101 QAELLALHRGLLLCQENNISRLWIEMDAMIVIQMLKEGHIGSHDSRYLWASIRQQLKLFS 160

Query: 345 XXXSHILCVGNQPADFLVGR 404
              SHI   GNQ AD+L  R
Sbjct: 161 FRISHIHREGNQAADWLANR 180


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 60.1 bits (144), Expect = 7e-07
 Identities = 40/131 (30%), Positives = 59/131 (45%), Gaps = 1/131 (0%)
 Frame = +3

Query: 6    IVVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHA 185
            ++ WL       K N++GS  H  Q A GGG++RDH   +   F        S   E+ A
Sbjct: 2048 LLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMA 2107

Query: 186  LPIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHI 362
            L  GL + ++ + + + IE+DA   V +I   HQG  + ++                SHI
Sbjct: 2108 LHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHI 2167

Query: 363  LCVGNQPADFL 395
               GNQ AD L
Sbjct: 2168 FREGNQAADHL 2178


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 1/131 (0%)
 Frame = +3

Query: 6    IVVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHA 185
            IV W        K N++GS  HG Q A  GGV+RDH  +L   F       +S   E+ A
Sbjct: 761  IVYWRKPSTGEYKLNVDGSSRHG-QHAASGGVLRDHTGKLIFGFSENIGNCNSLQAELRA 819

Query: 186  LPIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHI 362
            L  GL +  +     + IE+DA  ++ LI  S +G   I++                SHI
Sbjct: 820  LLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHI 879

Query: 363  LCVGNQPADFL 395
            L  GNQ ADFL
Sbjct: 880  LREGNQVADFL 890


>gb|EOY32248.1| Uncharacterized protein TCM_039895 [Theobroma cacao]
          Length = 206

 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 42/142 (29%), Positives = 65/142 (45%), Gaps = 8/142 (5%)
 Frame = +3

Query: 33  NWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHALP 191
           +W KP       N +GS     Q A GGG++RDH   L   F      ++    ++ AL 
Sbjct: 44  SWHKPLIGEFKLNADGSSKDAFQNAAGGGLLRDHTGNLIFGFSENFGPANLLQAKLMALH 103

Query: 192 IGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHILC 368
            GL + ++ + + + IE+DA  +V +IH  HQG  Q ++                SHI  
Sbjct: 104 RGLFLCIEYNISSIWIEMDAKIVVQMIHEGHQGSYQTRYLLAFIRKCLSGFTFRFSHIHR 163

Query: 369 VGNQPADFLVGRVLITPTQQIF 434
            GNQ AD+L  +  +    Q+F
Sbjct: 164 EGNQAADYLFNQGHMHHNLQVF 185


>ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 409

 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 1/131 (0%)
 Frame = +3

Query: 9   VVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHAL 188
           V W    + W K N +G++      +G GG+ RD H      F    +  +S D E+ A+
Sbjct: 243 VNWHPPLFGWIKVNTDGAWQKTTGKSGYGGIFRDFHGSFLGAFASNLEIPNSVDAEVMAV 302

Query: 189 PIGLQMA-MQISTHV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHIL 365
              +++A ++   H+ +E+D+A ++  +H  H  P +++                 SHI 
Sbjct: 303 IQAIELAWVRDWKHILLEVDSAIVLNFLHDPHLVPWRLRVACGNCLHRISQMNFRSSHIF 362

Query: 366 CVGNQPADFLV 398
             GNQ AD LV
Sbjct: 363 REGNQVADTLV 373


>ref|XP_004301067.1| PREDICTED: uncharacterized protein LOC101309260 [Fragaria vesca
           subsp. vesca]
          Length = 209

 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 50/172 (29%), Positives = 76/172 (44%), Gaps = 5/172 (2%)
 Frame = +3

Query: 9   VVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVF----HMPCQASSSFDVE 176
           V W+   ++  K N NG+F+H +  AG  GV R+   E   VF     +PC      D E
Sbjct: 43  VNWIPPLFDCIKINTNGAFNHASGKAGFRGVFRNFKGEFIGVFACNLDIPCS-----DAE 97

Query: 177 IHALPIGLQMA-MQISTHV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXX 353
           + A+   + +A ++   H+ +E+D+A ++  IHS H  P + +                 
Sbjct: 98  VMAVIKAIDLAWVREWRHIWLEVDSAIVLNFIHSPHLIPWRFRVAWDNCMYRVSQMQFQC 157

Query: 354 SHILCVGNQPADFLVGRVLITPTQQIFQSPSAHKCLVALVWIDQLGYSSFGF 509
           SHI   GNQ AD L    L   +  +    S  + L+ L   DQLG   F F
Sbjct: 158 SHIFREGNQVADALANFDL--SSSSLVYWDSIPQFLMNLCLQDQLGLPHFRF 207


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 1/131 (0%)
 Frame = +3

Query: 6    IVVWLSTEYNWDKPNINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHA 185
            I+ W+       K N++GS  H NQ A  GG++RDH   L   F      S+S   E+ A
Sbjct: 848  IIHWVKPVTGEYKLNVDGSSRH-NQSAATGGLLRDHTGTLVFGFSENIGPSNSLQAELRA 906

Query: 186  LPIGLQMAMQISTH-V*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHI 362
            L  GL +    +   + IE+DA  ++ +I  S +G   I++                SHI
Sbjct: 907  LLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFRISHI 966

Query: 363  LCVGNQPADFL 395
               GNQ ADFL
Sbjct: 967  FREGNQAADFL 977


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 42/133 (31%), Positives = 62/133 (46%), Gaps = 8/133 (6%)
 Frame = +3

Query: 30   YNWDKP-------NINGSFDHGNQLAGGGGVIRDHHRELTNVFHMPCQASSSFDVEIHAL 188
            + W KP       N++GS    +Q A GGGV+RDH   +   F       +S   E+ AL
Sbjct: 2084 FPWHKPSIGEFKLNVDGSAKL-SQNAAGGGVLRDHAGVMVFGFSENLGIQNSLQAELLAL 2142

Query: 189  PIGLQMAMQIS-THV*IELDAATMVTLIHSSHQGP*QIQHXXXXXXXXXXXXXXXXSHIL 365
              GL +    +   + IE+DAA+++ L+  + +GP  I++                SHI 
Sbjct: 2143 YRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIF 2202

Query: 366  CVGNQPADFLVGR 404
              GNQ ADFL  R
Sbjct: 2203 REGNQAADFLANR 2215

