BLASTX nr result

ID: Astragalus23_contig00022327 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00022327
         (538 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN39772.1| Putative ribonuclease H protein [Glycine soja]          67   4e-11
dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...    64   5e-11
gb|KHN04118.1| Putative ribonuclease H protein [Glycine soja]          67   7e-11
dbj|GAU20014.1| hypothetical protein TSUD_273490 [Trifolium subt...    63   3e-09
gb|KHN08094.1| Serine carboxypeptidase-like 50 [Glycine soja]          60   2e-07
gb|PNX80037.1| ribonuclease H [Trifolium pratense]                     60   4e-07
gb|PNX75801.1| ribonuclease H, partial [Trifolium pratense] >gi|...    57   4e-07
gb|KYP59936.1| Putative ribonuclease H protein At1g65750 family ...    56   5e-07
gb|KYP76251.1| Putative ribonuclease H protein At1g65750 [Cajanu...    56   6e-07
gb|PNX98086.1| ribonuclease H [Trifolium pratense]                     58   1e-06
gb|KYP69434.1| Putative ribonuclease H protein At1g65750 family ...    56   1e-06
gb|KYP45881.1| Putative ribonuclease H protein At1g65750 family ...    57   1e-06
gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family,...    58   2e-06
gb|KYP56001.1| Putative ribonuclease H protein At1g65750 family,...    57   2e-06
gb|KYP35215.1| Putative ribonuclease H protein At1g65750 family,...    54   3e-06
gb|KYP77485.1| Putative ribonuclease H protein At1g65750 family ...    55   3e-06
dbj|GAU34226.1| hypothetical protein TSUD_210080 [Trifolium subt...    53   3e-06
gb|PNX89293.1| ribonuclease H, partial [Trifolium pratense]            53   4e-06
gb|PNX98235.1| ribonuclease H [Trifolium pratense]                     56   4e-06
gb|KYP37594.1| Putative ribonuclease H protein At1g65750 family,...    56   5e-06

>gb|KHN39772.1| Putative ribonuclease H protein [Glycine soja]
          Length = 137

 Score = 67.4 bits (163), Expect = 4e-11
 Identities = 31/79 (39%), Positives = 55/79 (69%)
 Frame = +3

Query: 87  RATVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSL 266
           + +V+E E+KGIL  +++A S+GY ++ ++SDS+VA+K +  GC  +HP+Y+L+  I ++
Sbjct: 30  KGSVLEEELKGILFGLKLAWSRGYRRIRVDSDSLVAIKLMSKGCCMRHPFYNLVQEIHAV 89

Query: 267 LDSCDEVDISHILR*ANQV 323
                 +  +H+LR ANQV
Sbjct: 90  HGYQGNIYWNHVLREANQV 108


>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 64.3 bits (155), Expect(2) = 5e-11
 Identities = 29/88 (32%), Positives = 53/88 (60%)
 Frame = +3

Query: 60   ILSYFKRESRATVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYY 239
            ++++  R    +VV  E+ GI+  +++A +KG  ++ +ESDSM+A+  ++ GC+ +HP +
Sbjct: 1121 LVAFSARAGSVSVVHAELWGIINGLELAKNKGLKRIRVESDSMIAINLIRNGCNKEHPAF 1180

Query: 240  HLISFIPSLLDSCDEVDISHILR*ANQV 323
            HL+     L +  + V   H  R ANQV
Sbjct: 1181 HLVQTALRLTEGMESVLWQHTWREANQV 1208



 Score = 30.4 bits (67), Expect(2) = 5e-11
 Identities = 14/27 (51%), Positives = 16/27 (59%)
 Frame = +1

Query: 373  VFFVDAPSFVVEAYKADFVGIVFPRGF 453
            + F   P F+V    AD VGI FPRGF
Sbjct: 1224 IIFDHVPGFLVMPLMADSVGIGFPRGF 1250


>gb|KHN04118.1| Putative ribonuclease H protein [Glycine soja]
          Length = 128

 Score = 66.6 bits (161), Expect = 7e-11
 Identities = 34/89 (38%), Positives = 51/89 (57%)
 Frame = +3

Query: 60  ILSYFKRESRATVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYY 239
           I ++         +  E+ GILK + IA  +G+N + +ESDS +A+K +  GC + H  Y
Sbjct: 28  IFAFSSNIGNCNFLRAELWGILKGLTIARDRGFNYIVVESDSKIAIKLITYGCPTIHSNY 87

Query: 240 HLISFIPSLLDSCDEVDISHILR*ANQVT 326
           +L+  I  ++DS  EV  SHI R ANQ T
Sbjct: 88  NLVQQIKEIVDSRLEVTFSHIFREANQTT 116


>dbj|GAU20014.1| hypothetical protein TSUD_273490 [Trifolium subterraneum]
          Length = 159

 Score = 63.2 bits (152), Expect = 3e-09
 Identities = 33/77 (42%), Positives = 45/77 (58%)
 Frame = +3

Query: 93  TVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLD 272
           TV+  E+ GIL  +Q+   KGY  +++ESDS + V  +  GC   HPY  ++S I  L  
Sbjct: 50  TVLMAELWGILTTLQMVWDKGYRYVSLESDSAIVVSLINKGCPPSHPYVSIVSLINRLKM 109

Query: 273 SCDEVDISHILR*ANQV 323
              +V ISHI R ANQV
Sbjct: 110 KDWQVQISHIYRQANQV 126


>gb|KHN08094.1| Serine carboxypeptidase-like 50 [Glycine soja]
          Length = 443

 Score = 60.1 bits (144), Expect = 2e-07
 Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
 Frame = +3

Query: 57  GILSYFKRESRATVVETEIKG-ILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHP 233
           GI+ +   E +   V  E+ G IL  +++A S+GY ++ ++SDS VA+  V  GC   HP
Sbjct: 330 GIVEFLNAERKIWKVNGELAGAILLGIKLAWSRGYRRIRVDSDSPVAINLVTKGCCMLHP 389

Query: 234 YYHLISFIPSLLDSCDEVDISHILR*ANQV 323
           Y++ +  I ++     +V  SHILR ANQV
Sbjct: 390 YFNHVKEIHAVHSYEGDVTWSHILREANQV 419


>gb|PNX80037.1| ribonuclease H [Trifolium pratense]
          Length = 670

 Score = 59.7 bits (143), Expect = 4e-07
 Identities = 34/77 (44%), Positives = 44/77 (57%)
 Frame = +3

Query: 93  TVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLD 272
           TV+  E+ GIL  +Q    KGY  +++ESDS VAV  +  GC   HP   ++S I  L  
Sbjct: 550 TVLMAELWGILTTLQWVWDKGYQNISLESDSSVAVALINKGCPPSHPCATIVSLINRLKM 609

Query: 273 SCDEVDISHILR*ANQV 323
              +V ISHI R ANQV
Sbjct: 610 RDWQVQISHIYRQANQV 626


>gb|PNX75801.1| ribonuclease H, partial [Trifolium pratense]
 gb|PNX80452.1| ribonuclease H, partial [Trifolium pratense]
          Length = 171

 Score = 56.6 bits (135), Expect(2) = 4e-07
 Identities = 27/64 (42%), Positives = 39/64 (60%)
 Frame = +3

Query: 132 VQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDSCDEVDISHILR* 311
           +Q+AI KG+ K+ IESDS+ A+  +  GCS  HP + L++ I +      EV   H+ R 
Sbjct: 66  LQVAIGKGFQKILIESDSLAAINIIAGGCSISHPCFGLVNDIKNKATQVTEVRFMHMFRE 125

Query: 312 ANQV 323
           ANQV
Sbjct: 126 ANQV 129



 Score = 25.0 bits (53), Expect(2) = 4e-07
 Identities = 10/21 (47%), Positives = 13/21 (61%)
 Frame = +1

Query: 391 PSFVVEAYKADFVGIVFPRGF 453
           P F+     AD +G +FPRGF
Sbjct: 151 PDFISFPLLADRIGTLFPRGF 171


>gb|KYP59936.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 116

 Score = 56.2 bits (134), Expect = 5e-07
 Identities = 30/72 (41%), Positives = 45/72 (62%)
 Frame = +3

Query: 108 EIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDSCDEV 287
           E+  I   +  A ++G+ KL +ESDS++AV  +Q GCSS HP + L+  I S + +  + 
Sbjct: 3   ELWAIYIGITTAWNRGFTKLVVESDSLLAVGLLQNGCSSCHPCFSLVQSILSFVAAGGDF 62

Query: 288 DISHILR*ANQV 323
           +  HILR ANQV
Sbjct: 63  ECRHILREANQV 74


>gb|KYP76251.1| Putative ribonuclease H protein At1g65750 [Cajanus cajan]
          Length = 125

 Score = 56.2 bits (134), Expect = 6e-07
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +3

Query: 165 LAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDSCDEVDISHILR*ANQV 323
           + IESDS +AVK++  GCS +HPYY L++ I  +      +D +H+LR ANQV
Sbjct: 31  IIIESDSALAVKFLNEGCSREHPYYSLVNHIVRMAGDFPSIDCTHVLREANQV 83


>gb|PNX98086.1| ribonuclease H [Trifolium pratense]
          Length = 287

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 30/74 (40%), Positives = 45/74 (60%)
 Frame = +3

Query: 108 EIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDSCDEV 287
           E+ G+L  +Q+A  KG+  + +ESDSM+AV  V  GC   HP Y +++ I  L     +V
Sbjct: 172 ELWGMLTTLQLAWDKGFRLVNLESDSMLAVSLVVKGCPPTHPCYSIVALINCLKMRDWQV 231

Query: 288 DISHILR*ANQVTL 329
            ++HI R ANQV +
Sbjct: 232 SVNHIYRQANQVAI 245


>gb|KYP69434.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 147

 Score = 55.8 bits (133), Expect = 1e-06
 Identities = 29/75 (38%), Positives = 48/75 (64%)
 Frame = +3

Query: 96  VVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDS 275
           ++  E+ GIL+ ++IA   G +++  ++DS+VAVK++Q G S  H Y +L+  I  LLD 
Sbjct: 31  IIGVELLGILQGLRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYSNLVQKIHKLLDK 90

Query: 276 CDEVDISHILR*ANQ 320
              V ISH+L+  N+
Sbjct: 91  DWAVSISHVLKECNK 105


>gb|KYP45881.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 194

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 29/78 (37%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
 Frame = +3

Query: 93  TVVETEIKGILKVVQIAISKGYN-KLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLL 269
           +VV+ E+  I   +++   +     + IESDS +AVK++  GCS +HPYY L++ I  + 
Sbjct: 94  SVVQAELCAIFHGLRLLKERSLMVDIIIESDSALAVKFLNEGCSREHPYYSLVNHIVRMA 153

Query: 270 DSCDEVDISHILR*ANQV 323
                +D +H+LR ANQV
Sbjct: 154 GDFPSIDCTHVLREANQV 171


>gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
            cajan]
          Length = 1356

 Score = 57.8 bits (138), Expect(2) = 2e-06
 Identities = 34/89 (38%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +3

Query: 60   ILSYFKRESRATVVETEIKGILKVVQIAISKGY-NKLAIESDSMVAVKYVQ*GCSSQHPY 236
            I ++  R    +VV+ E+  I   ++I I +G  + + IESDS +AVK++  GCS ++P 
Sbjct: 1226 IFAFAGRLGTCSVVQAELWAIFHELRIFIERGLADPIIIESDSALAVKFLNEGCSRENPC 1285

Query: 237  YHLISFIPSLLDSCDEVDISHILR*ANQV 323
            Y L++ I ++     EV   HILR ANQV
Sbjct: 1286 YSLVNLIVNMTGDNLEVKCVHILREANQV 1314



 Score = 21.6 bits (44), Expect(2) = 2e-06
 Identities = 9/28 (32%), Positives = 13/28 (46%)
 Frame = +1

Query: 370  VVFFVDAPSFVVEAYKADFVGIVFPRGF 453
            +  F   P +      AD   ++FPRGF
Sbjct: 1329 IQIFSSPPPWARAPLFADSSNVIFPRGF 1356


>gb|KYP56001.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 414

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 31/87 (35%), Positives = 52/87 (59%)
 Frame = +3

Query: 60  ILSYFKRESRATVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYY 239
           I  ++     + ++  E+ GIL+ ++IA   G +++  ++DS+VAVK++Q G S  H Y 
Sbjct: 286 IHGFYGNVDGSDIIGVELLGILQGLRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYS 345

Query: 240 HLISFIPSLLDSCDEVDISHILR*ANQ 320
           +L+  I  LLD    V ISH+LR  N+
Sbjct: 346 NLVQEIHKLLDKDWAVSISHVLRECNK 372


>gb|KYP35215.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
 gb|KYP35220.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 170

 Score = 54.3 bits (129), Expect(2) = 3e-06
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
 Frame = +3

Query: 93  TVVETEIKGILKVVQIAISKG-YNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLL 269
           +VV+ E+  I   +QI   KG ++ + IESDS +AVK++  GCS ++P Y L++ I ++ 
Sbjct: 51  SVVQAELWAIFHGLQIINEKGIFDPIIIESDSALAVKFLNEGCSRENPCYSLVNLIVNMT 110

Query: 270 DSCDEVDISHILR*ANQV 323
                VD +HI   ANQV
Sbjct: 111 GDNLAVDCNHIFCEANQV 128



 Score = 24.6 bits (52), Expect(2) = 3e-06
 Identities = 10/28 (35%), Positives = 15/28 (53%)
 Frame = +1

Query: 370 VVFFVDAPSFVVEAYKADFVGIVFPRGF 453
           +  F   P +V+    AD   ++FPRGF
Sbjct: 143 IQIFSSPPPWVMAPLFADSSNVIFPRGF 170


>gb|KYP77485.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 152

 Score = 55.1 bits (131), Expect = 3e-06
 Identities = 30/75 (40%), Positives = 47/75 (62%)
 Frame = +3

Query: 96  VVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDS 275
           ++  E+ GIL+ ++IA   G +++  + DS+VAVK++Q G S  H Y +L+  I  LLD 
Sbjct: 36  IIGVELLGILQGLRIAQRLGLSRVYCQIDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDK 95

Query: 276 CDEVDISHILR*ANQ 320
              V ISH+LR  N+
Sbjct: 96  DWIVSISHVLRECNK 110


>dbj|GAU34226.1| hypothetical protein TSUD_210080 [Trifolium subterraneum]
          Length = 964

 Score = 53.1 bits (126), Expect(2) = 3e-06
 Identities = 29/76 (38%), Positives = 46/76 (60%)
 Frame = +3

Query: 96   VVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDS 275
            V+  E+ GIL  +QIA  + +  + ++SDS VA+  ++ GC+  H  Y L+     L +S
Sbjct: 847  VLPAELWGILHGLQIAKDRRHMLVRVDSDSFVAINLIKKGCTRVHTTYQLVHSTHLLAES 906

Query: 276  CDEVDISHILR*ANQV 323
             + V+ +HILR ANQV
Sbjct: 907  FEIVEWNHILREANQV 922



 Score = 25.4 bits (54), Expect(2) = 3e-06
 Identities = 11/25 (44%), Positives = 13/25 (52%)
 Frame = +1

Query: 379  FVDAPSFVVEAYKADFVGIVFPRGF 453
            F   P  +   + AD   IVFPRGF
Sbjct: 940  FTYVPDLISVHFMADLACIVFPRGF 964


>gb|PNX89293.1| ribonuclease H, partial [Trifolium pratense]
          Length = 89

 Score = 53.1 bits (126), Expect = 4e-06
 Identities = 19/52 (36%), Positives = 38/52 (73%)
 Frame = +3

Query: 93  TVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLI 248
           +VV  E+ GI+  +++A +KG  ++ ++SDSM+A+  ++ GC+ +HP +HL+
Sbjct: 31  SVVHAELWGIINALELAKNKGLERIRVDSDSMIAINLIRNGCNKEHPPFHLV 82


>gb|PNX98235.1| ribonuclease H [Trifolium pratense]
          Length = 245

 Score = 55.8 bits (133), Expect = 4e-06
 Identities = 32/82 (39%), Positives = 43/82 (52%)
 Frame = +3

Query: 78  RESRATVVETEIKGILKVVQIAISKGYNKLAIESDSMVAVKYVQ*GCSSQHPYYHLISFI 257
           R    T +  E+ GIL  +Q A  KGY+K+ +ESDS+V V  +   C   HP   +IS I
Sbjct: 162 RHVLGTALMAELWGILSALQFATEKGYSKVILESDSIVVVDLIVKRCPDNHPCASIISSI 221

Query: 258 PSLLDSCDEVDISHILR*ANQV 323
             L     E+ + H  R ANQV
Sbjct: 222 NCLKMQEWEISLQHTYRQANQV 243


>gb|KYP37594.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 522

 Score = 56.2 bits (134), Expect = 5e-06
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +3

Query: 165 LAIESDSMVAVKYVQ*GCSSQHPYYHLISFIPSLLDSCDEVDISHILR*ANQV 323
           + IESDS +AVK++  GCS +HPYY L++ I  +      +D +H+LR ANQV
Sbjct: 428 IIIESDSALAVKFLNEGCSREHPYYSLVNHIVRMAGDFPSIDCTHVLREANQV 480

