BLASTX nr result

ID: Rehmannia31_contig00019155 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00019155
         (601 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense]            98   5e-20
dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte...    97   7e-20
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]            95   4e-19
gb|PNX71533.1| ribonuclease H [Trifolium pratense]                     94   1e-18
dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte...    94   2e-18
gb|EEC79285.1| hypothetical protein OsI_20087 [Oryza sativa Indi...    89   4e-18
gb|PNX72264.1| ribonuclease H [Trifolium pratense]                     92   6e-18
gb|OMO84064.1| reverse transcriptase [Corchorus capsularis]            90   3e-17
emb|CDP09717.1| unnamed protein product [Coffea canephora]             89   5e-17
ref|XP_017221408.1| PREDICTED: uncharacterized protein LOC108198...    89   5e-17
dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifo...    89   5e-17
gb|PNX85647.1| hypothetical protein L195_g041717, partial [Trifo...    86   8e-17
gb|OMO55679.1| reverse transcriptase [Corchorus capsularis]            89   8e-17
dbj|GAU33706.1| hypothetical protein TSUD_148570 [Trifolium subt...    88   9e-17
gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]            88   1e-16
dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subt...    88   1e-16
dbj|GAU50334.1| hypothetical protein TSUD_243120 [Trifolium subt...    86   1e-16
dbj|GAU41924.1| hypothetical protein TSUD_25650 [Trifolium subte...    86   1e-16
dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subt...    88   1e-16
ref|XP_023634342.1| uncharacterized protein LOC17879006 [Capsell...    88   2e-16

>gb|PNY01158.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1068

 Score = 97.8 bits (242), Expect = 5e-20
 Identities = 44/112 (39%), Positives = 69/112 (61%)
 Frame = +1

Query: 262  LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
            L  R+ +DK +W+WE+ G YSV++AY  L +  +   D PE S+    N+  W   WK Q
Sbjct: 961  LSWRLPDDKKVWSWERNGNYSVRSAYHLLKE--ETLRDIPEPSTAG--NTGIWKSIWKVQ 1016

Query: 442  VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
               ++K+FLW+V+   +PT   L ++G+ +DP+CP+C +  ET EHLF  C+
Sbjct: 1017 APQRVKNFLWRVVKRILPTRCRLEQKGVALDPICPLCHDGEETQEHLFMHCQ 1068


>dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum]
          Length = 1475

 Score = 97.4 bits (241), Expect = 7e-20
 Identities = 43/112 (38%), Positives = 67/112 (59%)
 Frame = +1

Query: 262  LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
            L +R+ +D  +WNWEK G YSV++AY  +   ++     P  S      +  W + WK  
Sbjct: 1102 LSLRLPSDTLVWNWEKDGAYSVRSAYHLICDEKERSLPGPSVS----RKNKVWKEIWKAP 1157

Query: 442  VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
            V NK+K+F+W++  N +PT   L+K+G+ +D  CP+C  EVE+  HLF QC+
Sbjct: 1158 VPNKIKNFMWRLTKNILPTRANLHKKGISLDLQCPLCHHEVESTNHLFLQCD 1209


>gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]
          Length = 894

 Score = 95.1 bits (235), Expect = 4e-19
 Identities = 45/111 (40%), Positives = 68/111 (61%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L  R+  DK IW+WEK G++SV++AY  L +I+   ++ PE+SS   H    W   WK +
Sbjct: 590 LSWRLPADKLIWHWEKNGEFSVRSAYHMLSEIRN--QNSPEASSSRDH--LLWKAIWKVK 645

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
           V N +K+FLW++    +PT   L K+G+ +D  CP+C  ++E  EHLF  C
Sbjct: 646 VPNCIKNFLWRLAKAILPTRSRLEKKGITLDTTCPLCFNDIECNEHLFMHC 696


>gb|PNX71533.1| ribonuclease H [Trifolium pratense]
          Length = 798

 Score = 94.0 bits (232), Expect = 1e-18
 Identities = 42/112 (37%), Positives = 67/112 (59%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L  R+ +D  IWNWEK G+YSV++AY  L   +   +  P         S  W + W+  
Sbjct: 424 LSFRLPHDLLIWNWEKDGEYSVRSAYHLLCDEKARFQPGPSCPQ----RSKLWKEIWRAP 479

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
           V NK+K+F+W++  N +PT   L+K+G+ +D +CP+C  E E+ +HLF +C+
Sbjct: 480 VPNKIKNFMWRLAKNILPTRSNLHKKGITLDLLCPLCSSEEESSQHLFLKCD 531


>dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum]
          Length = 1626

 Score = 93.6 bits (231), Expect = 2e-18
 Identities = 44/108 (40%), Positives = 66/108 (61%)
 Frame = +1

Query: 271  RMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQVKN 450
            R+  DK IW+WEK G++SV++AY  L + +   ++ PE+SS    +   W   WK  V N
Sbjct: 1266 RLPADKLIWHWEKNGEFSVRSAYHMLSEDRN--KNSPEASSS--RDQLLWKTIWKVNVPN 1321

Query: 451  KLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
             +K+FLW+V    +PT   L K+G+ +D  CP+C  ++E  EHLF QC
Sbjct: 1322 CIKNFLWRVAKAILPTRGRLEKKGITLDTTCPLCFNDIECNEHLFMQC 1369


>gb|EEC79285.1| hypothetical protein OsI_20087 [Oryza sativa Indica Group]
 gb|EEC79286.1| hypothetical protein OsI_20088 [Oryza sativa Indica Group]
          Length = 216

 Score = 88.6 bits (218), Expect = 4e-18
 Identities = 42/111 (37%), Positives = 67/111 (60%), Gaps = 4/111 (3%)
 Frame = +1

Query: 277 LNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNS----TNWGKCWKFQV 444
           L D   W+++ KG +SVK+AYK  +QI++ KE   ++S  +++ S      W K W  +V
Sbjct: 58  LEDWPTWHFDSKGLFSVKSAYKLAVQIRE-KEKCRDASGSSLNTSHADTLQWEKIWNMEV 116

Query: 445 KNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
            NK+K F+W++ +N +P    + +RGME D +CPMC    E   HLF +C+
Sbjct: 117 PNKIKMFVWRLAHNSLPVRCNIRRRGMESDNLCPMCNRFDEDCGHLFLKCK 167


>gb|PNX72264.1| ribonuclease H [Trifolium pratense]
          Length = 854

 Score = 91.7 bits (226), Expect = 6e-18
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 1/129 (0%)
 Frame = +1

Query: 214 INSCVWRMLI-TFCK*KLLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESS 390
           I SC  R +  T     L  R+  D  IWNWEK G+YSV++AY  L   +   +  P   
Sbjct: 464 IFSCFNRQVAQTIISIPLSFRLPQDTLIWNWEKDGEYSVRSAYHLLCDEKARLQPGPSCP 523

Query: 391 SDAIHNSTNWGKCWKFQVKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVET 570
                 S  W + W+  V NK+K+FLW++  N +PT   L+ +G+ +D  CP+C  E E+
Sbjct: 524 ----RRSKLWKEIWRAPVPNKVKNFLWRLAKNILPTRTNLHNKGITLDLQCPLCFREEES 579

Query: 571 VEHLFFQCE 597
            +HLF +C+
Sbjct: 580 SQHLFLKCD 588


>gb|OMO84064.1| reverse transcriptase [Corchorus capsularis]
          Length = 951

 Score = 89.7 bits (221), Expect = 3e-17
 Identities = 40/104 (38%), Positives = 61/104 (58%)
 Frame = +1

Query: 283 DKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQVKNKLKH 462
           D+++W ++K+G++SV+ AY  L+  +    +W  S        T W   WK +V  KL  
Sbjct: 648 DRFVWKFDKRGKFSVRAAY-DLISKENHGGEWDYSKE------TEWKTLWKLKVPYKLVI 700

Query: 463 FLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
           FLWK+ NNC+P    L +RG  ID  C +C E +ET++HLF+ C
Sbjct: 701 FLWKICNNCLPVRAELKRRGFNIDDRCNLCHEGMETIDHLFWHC 744


>emb|CDP09717.1| unnamed protein product [Coffea canephora]
          Length = 613

 Score = 89.0 bits (219), Expect = 5e-17
 Identities = 43/109 (39%), Positives = 61/109 (55%), Gaps = 3/109 (2%)
 Frame = +1

Query: 283 DKWIWNWEKKGQYSVKTAYKQLLQI---QQWKEDWPESSSDAIHNSTNWGKCWKFQVKNK 453
           D   W     G Y+V + YK L Q     + + D    +S A  N   W   WK +VK+K
Sbjct: 253 DSNYWLHSGSGTYTVNSGYKALCQETSQHKGRRDNEAGTSSANSNEKQWKWLWKLKVKSK 312

Query: 454 LKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCER 600
           +KHF+W+ +N  +P   L+ KR  + DP+C  CGE+ E++EHLFFQC R
Sbjct: 313 IKHFIWRSLNGLLPVNDLVFKRIHQGDPICDGCGEQEESIEHLFFQCSR 361


>ref|XP_017221408.1| PREDICTED: uncharacterized protein LOC108198150 [Daucus carota
           subsp. sativus]
          Length = 632

 Score = 89.0 bits (219), Expect = 5e-17
 Identities = 44/110 (40%), Positives = 59/110 (53%)
 Frame = +1

Query: 265 LMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQV 444
           L R + D W W  EK GQYSV++AY  L +    K D   SS     NS  W + W  ++
Sbjct: 387 LQRSMEDSWYWRREKMGQYSVRSAYAALSE----KRDVNHSSD----NSGFWRRIWNLKI 438

Query: 445 KNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
             K+KHFLW+ I  C+PT   L  R +E+   CP+C  E E+V H+   C
Sbjct: 439 PLKVKHFLWRAITGCLPTKEQLISRRVEVIEQCPLCNLEPESVAHVLTSC 488


>dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifolium subterraneum]
          Length = 734

 Score = 89.0 bits (219), Expect = 5e-17
 Identities = 40/109 (36%), Positives = 63/109 (57%)
 Frame = +1

Query: 271 RMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQVKN 450
           R+  D  IWNWEK G YSV++AY  L   +   +  P         S  W + W+  V N
Sbjct: 256 RLPQDTLIWNWEKDGVYSVRSAYHLLCDEKARLQPGPSCPK----RSKLWKEIWRAPVPN 311

Query: 451 KLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
           K+K+F+W++  N +PT   L+ +G+++D  CP+C  E E+ +HLF +C+
Sbjct: 312 KVKNFIWRLAKNILPTRTNLHNKGIKLDLQCPLCFREEESSQHLFLKCD 360


>gb|PNX85647.1| hypothetical protein L195_g041717, partial [Trifolium pratense]
          Length = 276

 Score = 86.3 bits (212), Expect = 8e-17
 Identities = 44/111 (39%), Positives = 67/111 (60%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L  R+ +DK +W+ EK G YSV++ Y  L   ++  +  PESSS   HN   W K W   
Sbjct: 3   LSWRLPDDKLLWHGEKDGLYSVRSTYHLLGTDKRVNQ--PESSSSN-HNKM-WSKIWSLP 58

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
           + NK+K+F W++  + IPT   L +R + +DP+CP+C  + E+ +HLF QC
Sbjct: 59  LPNKIKNFTWRLAKHIIPTCGNLQRRRVVLDPICPLCFMDQESDDHLFMQC 109


>gb|OMO55679.1| reverse transcriptase [Corchorus capsularis]
          Length = 1701

 Score = 88.6 bits (218), Expect = 8e-17
 Identities = 41/105 (39%), Positives = 60/105 (57%)
 Frame = +1

Query: 283  DKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQVKNKLKH 462
            DK IW + K GQYSVK+ Y +L       +   ++S+  + +   W   W      K+K 
Sbjct: 1325 DKRIWPFTKTGQYSVKSGYYKLKNFDGCIQIGQKASTSHLIDRKIWKFMWSINCPPKIKV 1384

Query: 463  FLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQCE 597
            FLW+ + N IPT W L +RG  ++ +C +CG+EVETVEHL   C+
Sbjct: 1385 FLWRCVRNVIPTLWGLYRRGCHLNGVCGICGQEVETVEHLLLTCD 1429


>dbj|GAU33706.1| hypothetical protein TSUD_148570 [Trifolium subterraneum]
          Length = 527

 Score = 88.2 bits (217), Expect = 9e-17
 Identities = 40/111 (36%), Positives = 66/111 (59%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L  R+ NDK IW+WEK G +SV++A+  + +      + PE+SS   +N   W   WK +
Sbjct: 187 LSWRLPNDKLIWHWEKDGNFSVRSAHHMIKETDNL--NIPEASSS--NNQEIWEAVWKIK 242

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
               +K+FLW++  + +PT   L ++G+ +D +CP+C E  E  +HLF +C
Sbjct: 243 APPSVKNFLWRLAKDILPTRGRLKRKGLSLDTICPLCFEVEENRDHLFMRC 293


>gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1348

 Score = 88.2 bits (217), Expect = 1e-16
 Identities = 41/104 (39%), Positives = 62/104 (59%)
 Frame = +1

Query: 283  DKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQVKNKLKH 462
            DK +W  EK G YSVK+AY  L    + +   PESSS A    + W   W   + N++K+
Sbjct: 1090 DKLVWTGEKNGGYSVKSAYHLLCNESEVRH--PESSSSAA--GSFWKHLWAIPLPNRIKN 1145

Query: 463  FLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
            F+W++  N +PT   L +R + +DP+CP+C  ++E+ EHLF  C
Sbjct: 1146 FMWRLAKNILPTRGNLLRRRVPLDPVCPLCFNDLESTEHLFLHC 1189


>dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subterraneum]
          Length = 482

 Score = 87.8 bits (216), Expect = 1e-16
 Identities = 42/110 (38%), Positives = 63/110 (57%)
 Frame = +1

Query: 265 LMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQV 444
           L R+  DK IW WEK   YSV++AY  L   +   +  P S    I   + WGK WK  V
Sbjct: 120 LKRLPQDKIIWCWEKNVVYSVRSAYHLLDDRKFCNQPSPSS----IFQESLWGKIWKAPV 175

Query: 445 KNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
            N +++FLW+++ + +P+   L K+G+ +DP  P+C ++ E  EHLF  C
Sbjct: 176 PNVIRNFLWRLVKHILPSRARLAKKGLTLDPYFPLCYQQAEDYEHLFMSC 225


>dbj|GAU50334.1| hypothetical protein TSUD_243120 [Trifolium subterraneum]
          Length = 303

 Score = 86.3 bits (212), Expect = 1e-16
 Identities = 38/111 (34%), Positives = 65/111 (58%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L  R+ NDK IW+WEK G +SV++ +  + +      + PE+SS   +N   W   WK +
Sbjct: 99  LSWRLPNDKLIWHWEKDGNFSVRSTHHMIKEADNL--NIPEASSS--NNQEIWKAVWKIK 154

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
               +K+FLW++  + +PT   + ++G+ +D +CP+C E  E  +HLF +C
Sbjct: 155 APPSVKNFLWRLAKDILPTRGRIKRKGLSLDTICPLCFEAEENRDHLFMRC 205


>dbj|GAU41924.1| hypothetical protein TSUD_25650 [Trifolium subterraneum]
          Length = 279

 Score = 85.9 bits (211), Expect = 1e-16
 Identities = 40/111 (36%), Positives = 60/111 (54%)
 Frame = +1

Query: 262 LLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKCWKFQ 441
           L+  +  DK +W  E+   YSVK  YK  ++            SD  H + NW   WK Q
Sbjct: 60  LVSSVREDKVVWEEERNECYSVKYGYKLAMRYI--------IGSDKYHVAGNWNGIWKAQ 111

Query: 442 VKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
             +K +H LW++   C+PT + L +R +E +P CP+C EE+E   H+FF+C
Sbjct: 112 APHKARHLLWRLCRGCLPTRYRLLERRVECNPNCPVCDEEIEDELHIFFRC 162


>dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subterraneum]
          Length = 1229

 Score = 87.8 bits (216), Expect = 1e-16
 Identities = 43/115 (37%), Positives = 60/115 (52%)
 Frame = +1

Query: 250  CK*KLLMRMLNDKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWGKC 429
            C+  L  R L+D  IW     G Y+VK+AYK  LQ+         +S D+ + S +W K 
Sbjct: 865  CRIPLHSRALHDSIIWKSSPNGNYTVKSAYKLCLQL---------TSHDSFNVSGDWRKI 915

Query: 430  WKFQVKNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
            W  Q+  KLKHF W+++  C+PT   L+ RG+     C +C    E   HLFF C
Sbjct: 916  WTMQIPPKLKHFCWRMLRYCLPTRLKLHIRGVNCQTTCAVCSNATEDELHLFFDC 970


>ref|XP_023634342.1| uncharacterized protein LOC17879006 [Capsella rubella]
          Length = 1758

 Score = 87.8 bits (216), Expect = 2e-16
 Identities = 41/110 (37%), Positives = 63/110 (57%), Gaps = 6/110 (5%)
 Frame = +1

Query: 283  DKWIWNWEKKGQYSVKTAYKQLLQIQQWKEDWPESSSDAIHNSTNWG------KCWKFQV 444
            D+ IW++ K G+YSV++ Y  ++          E  +   ++S   G      + WK +V
Sbjct: 1389 DRLIWHYNKSGEYSVRSGYWLIMH---------EPHTMIANHSVPSGSLVLKNQIWKLKV 1439

Query: 445  KNKLKHFLWKVINNCIPTTWLLNKRGMEIDPMCPMCGEEVETVEHLFFQC 594
              K+KHFLW+++   +PT   LN RGM +DP+CP C    ET+EH+FFQC
Sbjct: 1440 IPKIKHFLWRILTKALPTITRLNSRGMNLDPICPRCFRADETIEHIFFQC 1489


Top