BLASTX nr result

ID: Astragalus22_contig00035844 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00035844
         (482 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifo...    68   4e-10
dbj|GAU34105.1| hypothetical protein TSUD_256010 [Trifolium subt...    66   1e-09
gb|PNY12392.1| ribonuclease H, partial [Trifolium pratense]            65   5e-09
dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subt...    64   1e-08
dbj|GAU42748.1| hypothetical protein TSUD_77850 [Trifolium subte...    63   2e-08
gb|PNX68691.1| ribonuclease H, partial [Trifolium pratense]            60   2e-08
gb|PNX66973.1| hypothetical protein L195_g055376, partial [Trifo...    60   8e-08
gb|PKI60526.1| hypothetical protein CRG98_019002 [Punica granatum]     59   2e-07
gb|KYP36270.1| Putative ribonuclease H protein At1g65750 family ...    59   5e-07
dbj|GAU10577.1| hypothetical protein TSUD_420970, partial [Trifo...    58   7e-07
gb|KHN18498.1| Putative ribonuclease H protein [Glycine soja]          56   8e-07
dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subte...    58   9e-07
ref|XP_016162229.1| uncharacterized protein LOC107605009 [Arachi...    58   9e-07
dbj|GAU19047.1| hypothetical protein TSUD_193840 [Trifolium subt...    56   1e-06
gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family,...    58   1e-06
gb|KYP46788.1| Putative ribonuclease H protein At1g65750 family,...    55   2e-06
gb|PNY06182.1| ribonuclease H [Trifolium pratense]                     57   2e-06
gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas...    57   2e-06
gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family,...    56   2e-06
dbj|GAU44619.1| hypothetical protein TSUD_378970 [Trifolium subt...    57   3e-06

>dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifolium subterraneum]
          Length = 609

 Score = 67.8 bits (164), Expect = 4e-10
 Identities = 29/89 (32%), Positives = 49/89 (55%)
 Frame = +1

Query: 193 SNLHMRHFSSKSIAMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPDCLICSRNVE 372
           + +H+R +  K   +W+ +   R++V +W+V+HD++ TN          P C  C    E
Sbjct: 276 TGVHLREYEKKWFKIWRLETTERIRVFMWQVLHDRILTNWRTAKWNLTDPYCSYCEHMEE 335

Query: 373 TMLHVVRDFPMATRMWLGLVKFEKQTLFF 459
           T LHV+RD P+A  +W  L++ E +  FF
Sbjct: 336 TTLHVLRDCPLAVEVWQHLLEEEHRGRFF 364


>dbj|GAU34105.1| hypothetical protein TSUD_256010 [Trifolium subterraneum]
          Length = 679

 Score = 66.2 bits (160), Expect = 1e-09
 Identities = 36/99 (36%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
 Frame = +1

Query: 184 TLASNLHMRHFSSKSI-----AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PD 345
           T+ S     + SS SI     A+W WKD  R+Q  +W   H++L TN           P 
Sbjct: 279 TVKSAYDSHNTSSHSIEGDWKALWNWKDPHRIQTFMWMAAHERLLTNYRRSKWGVGVSPL 338

Query: 346 CLICSRNVETMLHVVRDFPMATRMWLGLVKFEKQTLFFS 462
           C  C R+ ET +HV+R+ P+AT++W+ LV   + + FFS
Sbjct: 339 CSACDRDNETTIHVLRECPLATQIWIRLVPSNQISNFFS 377


>gb|PNY12392.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1594

 Score = 64.7 bits (156), Expect = 5e-09
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
 Frame = +1

Query: 232  AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMA 408
            A+W WK   R+Q  +W   H++L TN           P C  C R+ ET LHV+RD P A
Sbjct: 1391 ALWSWKGPHRIQTFMWMAAHERLLTNYRRSKWGVGISPMCPDCDRDNETTLHVLRDCPKA 1450

Query: 409  TRMWLGLVKFEKQTLFFS 462
            T++W+ LV   + T FFS
Sbjct: 1451 TQIWIRLVPSNQITNFFS 1468


>dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subterraneum]
          Length = 1025

 Score = 63.5 bits (153), Expect = 1e-08
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 6/102 (5%)
 Frame = +1

Query: 184 TLASNLHMRHFSSKSI-----AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PD 345
           T+ S   ++H +   I      +W WK   R+Q  +W   H +L TN           P 
Sbjct: 669 TIQSTYDLQHGNGHHINGDWNKIWAWKGPHRIQTFMWIAAHARLLTNVRRSKWGVGVSPT 728

Query: 346 CLICSRNVETMLHVVRDFPMATRMWLGLVKFEKQTLFFSEFD 471
           C IC  + ETM+H +RD   AT +WL LV   + T FFS FD
Sbjct: 729 CSICGNDDETMIHTLRDCIYATGIWLRLVSSNQITNFFSSFD 770


>dbj|GAU42748.1| hypothetical protein TSUD_77850 [Trifolium subterraneum]
          Length = 821

 Score = 63.2 bits (152), Expect = 2e-08
 Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
 Frame = +1

Query: 232 AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMA 408
           A+W WK   R+Q  +W   H++L TN           P C  C ++ ET +HV+RD P+A
Sbjct: 610 ALWSWKGPHRIQTFMWMAAHERLLTNYRRSKWGVGVSPLCSACDKDNETTIHVLRDCPLA 669

Query: 409 TRMWLGLVKFEKQTLFFS 462
           T++W+ LV   + + FFS
Sbjct: 670 TQIWIRLVPSNQISNFFS 687


>gb|PNX68691.1| ribonuclease H, partial [Trifolium pratense]
          Length = 138

 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMAT 411
           +W WK   R+Q  +W   H+++ TN           P C  C R  ET LHV+RD   AT
Sbjct: 57  LWGWKGPHRIQTFIWLAAHERILTNAGRSKWGVGISPTCASCVREDETTLHVLRDCVHAT 116

Query: 412 RMWLGLVKFEKQTLFFS 462
           R+W+ LV     T+FFS
Sbjct: 117 RVWVRLVPSNYITIFFS 133


>gb|PNX66973.1| hypothetical protein L195_g055376, partial [Trifolium pratense]
          Length = 235

 Score = 60.1 bits (144), Expect = 8e-08
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
 Frame = +1

Query: 232 AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMA 408
           A+W WK   R+Q  +W   HD+L TN           P C  C R  ET++HV+RD P+A
Sbjct: 168 AVWSWKGPHRIQTFMWMATHDRLLTNFRRSKWGVGASPICSRCDRVNETLIHVLRDCPVA 227

Query: 409 TRMWLGLV 432
           T+ W+ LV
Sbjct: 228 TQTWIRLV 235


>gb|PKI60526.1| hypothetical protein CRG98_019002 [Punica granatum]
          Length = 222

 Score = 58.5 bits (140), Expect = 2e-07
 Identities = 30/78 (38%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPD-CLICSRNVETMLHVVRDFPMAT 411
           +W+W+  +R+Q+ LW V H+KL TN             C  CS  VET+LHV+RD P   
Sbjct: 59  VWQWRRPQRIQIFLWLVAHEKLLTNSLRVSRHLAVSGLCDFCSSTVETILHVLRDCPSTR 118

Query: 412 RMWLGLVKFEKQTLFFSE 465
             W  LV    +  FF E
Sbjct: 119 SAWYRLVPMNIRGRFFLE 136


>gb|KYP36270.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 360

 Score = 58.5 bits (140), Expect = 5e-07
 Identities = 26/63 (41%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTN-XXXXXXXXXXPDCLICSRNVETMLHVVRDFPMAT 411
           +WKW  L R+   LW+VVHD L TN           P C IC + +E  +HV+RD P A 
Sbjct: 235 VWKWTGLERIHTFLWRVVHDSLLTNLARFERNLGPDPTCPICLQGIEDSIHVLRDCPFAR 294

Query: 412 RMW 420
            +W
Sbjct: 295 EVW 297


>dbj|GAU10577.1| hypothetical protein TSUD_420970, partial [Trifolium subterraneum]
          Length = 426

 Score = 58.2 bits (139), Expect = 7e-07
 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMAT 411
           +W WK   R+Q  +W   H++L TN           P C  C    ET++H +RD   AT
Sbjct: 253 IWSWKGPHRIQTFIWIAAHERLLTNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHAT 312

Query: 412 RMWLGLVKFEKQTLFFSEFD 471
           R+WL LV   + T FFS  +
Sbjct: 313 RIWLRLVCHNQITNFFSSLN 332


>gb|KHN18498.1| Putative ribonuclease H protein [Glycine soja]
          Length = 165

 Score = 56.2 bits (134), Expect = 8e-07
 Identities = 31/76 (40%), Positives = 42/76 (55%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPDCLICSRNVETMLHVVRDFPMATR 414
           +W+    +R ++ LW V++DKLPT           P C  C R  ET+LHV+RD P A  
Sbjct: 6   LWRLHLPQRCKIFLWLVLYDKLPTEVPRIADFAFSP-CPFC-RGHETLLHVLRDCPRAAS 63

Query: 415 MWLGLVKFEKQTLFFS 462
           +WL LV  + Q  FFS
Sbjct: 64  VWLPLVAPQHQRAFFS 79


>dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subterraneum]
          Length = 1178

 Score = 58.2 bits (139), Expect = 9e-07
 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
 Frame = +1

Query: 235  MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMAT 411
            +W WK   R+Q  +W   H++L TN           P C  C    ET++H +RD   AT
Sbjct: 961  IWSWKGPHRIQTFIWIAAHERLITNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHAT 1020

Query: 412  RMWLGLVKFEKQTLFFSEFD 471
            R+WL LV   + T FFS  +
Sbjct: 1021 RIWLRLVCHNQITNFFSSLN 1040


>ref|XP_016162229.1| uncharacterized protein LOC107605009 [Arachis ipaensis]
          Length = 1371

 Score = 58.2 bits (139), Expect = 9e-07
 Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 4/95 (4%)
 Frame = +1

Query: 187  LASNLHMRHFSSKSIAMWK----WKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPDCLI 354
            LA  + + H S ++  +WK    WK   R++V +W+  H +L T           P+C  
Sbjct: 1020 LAYRILINHSSMETKRIWKVIWRWKGPERIRVFMWQAAHGRLLTASRKSRMMRTDPNCHR 1079

Query: 355  CSRNVETMLHVVRDFPMATRMWLGLVKFEKQTLFF 459
            C R +ET LH +RD P A  +W+ LV+     +FF
Sbjct: 1080 CHRILETGLHALRDCPYAASIWVELVQPSAIAVFF 1114


>dbj|GAU19047.1| hypothetical protein TSUD_193840 [Trifolium subterraneum]
          Length = 159

 Score = 55.8 bits (133), Expect = 1e-06
 Identities = 27/83 (32%), Positives = 39/83 (46%)
 Frame = +1

Query: 193 SNLHMRHFSSKSIAMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPDCLICSRNVE 372
           +  HM H       +W+ +   R++V LW+  HD+L  N          P C  C    E
Sbjct: 46  AGFHMHHDLQSWKQIWRIESAGRVKVFLWQDFHDRLLNNWRMARWNLKSPYCSYCGHLEE 105

Query: 373 TMLHVVRDFPMATRMWLGLVKFE 441
           T  HV+RDFP+A  +W  LV  +
Sbjct: 106 TTCHVLRDFPLANIIWSHLVDIQ 128


>gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
            cajan]
          Length = 1039

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 27/64 (42%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
 Frame = +1

Query: 232  AMWKWKDLRRMQVHLWKVVHDKLPTN-XXXXXXXXXXPDCLICSRNVETMLHVVRDFPMA 408
            A+WKW  L R++  LW+VVHD L  N           P C IC + VE  +HV+RD P A
Sbjct: 959  AVWKWTGLERIRTFLWRVVHDILLNNLARFERNLGPDPTCPICLQGVEDSIHVLRDCPFA 1018

Query: 409  TRMW 420
              +W
Sbjct: 1019 REVW 1022


>gb|KYP46788.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 172

 Score = 55.5 bits (132), Expect = 2e-06
 Identities = 30/95 (31%), Positives = 42/95 (44%), Gaps = 1/95 (1%)
 Frame = +1

Query: 184 TLASNLHMRHFSSKSIAMWKWKDLRRMQVHLWKVVHDKLPTN-XXXXXXXXXXPDCLICS 360
           T+AS+ H         A+W+W D  R++V LW+VVH  L  N           P C +C 
Sbjct: 69  TIASSFHTVSHPPAFKAIWRWNDPERIRVLLWRVVHGSLMINKVRVDRGLGIDPTCPVCV 128

Query: 361 RNVETMLHVVRDFPMATRMWLGLVKFEKQTLFFSE 465
           +  E  LH +RD   A  +W          LFF +
Sbjct: 129 QGTENNLHALRDCKFAAEIWSRASGDSLPRLFFED 163


>gb|PNY06182.1| ribonuclease H [Trifolium pratense]
          Length = 686

 Score = 57.0 bits (136), Expect = 2e-06
 Identities = 28/87 (32%), Positives = 45/87 (51%)
 Frame = +1

Query: 202 HMRHFSSKSIAMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXXPDCLICSRNVETML 381
           H   F S   ++WK     R++  +W++++ +LPTN          P C  C    ETM+
Sbjct: 349 HNLLFDSLWRSIWKLDAPERIRCFVWQLMYGRLPTNSACSRWGHTVPQCDYCVGIEETMI 408

Query: 382 HVVRDFPMATRMWLGLVKFEKQTLFFS 462
           HV+RD P+A  +W  LV  + +  FF+
Sbjct: 409 HVMRDCPVAHEIWNNLVPLQGRLAFFT 435


>gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase);
           Polynucleotidyl transferase, Ribonuclease H fold
           [Medicago truncatula]
          Length = 729

 Score = 57.0 bits (136), Expect = 2e-06
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
 Frame = +1

Query: 235 MWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMAT 411
           +W WK   R+Q  +W   H ++ TN           P C  C+R  ET++HV+RD   +T
Sbjct: 467 LWNWKGPHRIQTFIWLAAHGRILTNYRRSKWGVGISPTCPCCAREDETVIHVLRDCVHST 526

Query: 412 RMWLGLVKFEKQTLFFS 462
           ++WL L+     T FFS
Sbjct: 527 QVWLRLIPHNYITNFFS 543


>gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 255

 Score = 56.2 bits (134), Expect = 2e-06
 Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
 Frame = +1

Query: 184 TLASNLHMRHFSSKSIAMWKWKDLRRMQVHLWKVVHDKLPTN-XXXXXXXXXXPDCLICS 360
           T+AS+ H         A+W+W    R++V LW+VVH  L TN           P C +C 
Sbjct: 52  TIASSSHTISHPPAFKAIWRWNGPERIRVLLWRVVHGSLMTNQVRVDRGLGTDPTCPVCM 111

Query: 361 RNVETMLHVVRDFPMATRMW 420
           +  E+ LH +RD   AT +W
Sbjct: 112 QGTESNLHALRDCKFATEIW 131


>dbj|GAU44619.1| hypothetical protein TSUD_378970 [Trifolium subterraneum]
          Length = 440

 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
 Frame = +1

Query: 232 AMWKWKDLRRMQVHLWKVVHDKLPTNXXXXXXXXXX-PDCLICSRNVETMLHVVRDFPMA 408
           +MW WK   R+Q  +W   H+ L TN           P C  C    ET++HV+RD   A
Sbjct: 110 SMWSWKGPHRIQTFMWIAAHECLLTNYRRSKWRSGISPTCPACGNEDETIIHVLRDCMHA 169

Query: 409 TRMWLGLVKFEKQTLFFS 462
           T++W+ LV     T FFS
Sbjct: 170 TQIWIRLVTSNHITNFFS 187


Top