BLASTX nr result

ID: Astragalus22_contig00033550 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00033550
         (420 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX71251.1| ribonuclease H, partial [Trifolium pratense]            64   6e-11
dbj|GAU24540.1| hypothetical protein TSUD_156530 [Trifolium subt...    65   9e-11
gb|PNX74720.1| ribonuclease H, partial [Trifolium pratense]            61   5e-10
gb|PNX67808.1| ribonuclease H, partial [Trifolium pratense]            59   7e-10
dbj|GAU43217.1| hypothetical protein TSUD_301040 [Trifolium subt...    63   8e-10
gb|PNX94180.1| ribonuclease H [Trifolium pratense]                     65   1e-09
dbj|GAU48830.1| hypothetical protein TSUD_190600 [Trifolium subt...    62   1e-09
gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense]            62   2e-09
dbj|GAU47359.1| hypothetical protein TSUD_403620 [Trifolium subt...    62   2e-09
dbj|GAU49781.1| hypothetical protein TSUD_188300 [Trifolium subt...    62   2e-09
dbj|GAU48983.1| hypothetical protein TSUD_245740 [Trifolium subt...    63   2e-09
gb|PNX57966.1| ribonuclease H [Trifolium pratense]                     62   2e-09
gb|PNY06444.1| ribonuclease H [Trifolium pratense]                     61   6e-09
gb|PNX86123.1| ribonuclease H [Trifolium pratense]                     61   7e-09
gb|KYP78163.1| hypothetical protein KK1_049646 [Cajanus cajan]         59   1e-08
gb|KHN31021.1| Putative ribonuclease H protein [Glycine soja]          61   1e-08
dbj|GAU30135.1| hypothetical protein TSUD_360280 [Trifolium subt...    63   1e-08
gb|PNY12901.1| S-adenosylmethionine-dependent methyltransferase ...    59   1e-08
dbj|GAU24479.1| hypothetical protein TSUD_319560 [Trifolium subt...    59   2e-08
gb|PNX79078.1| ribonuclease H, partial [Trifolium pratense]            62   2e-08

>gb|PNX71251.1| ribonuclease H, partial [Trifolium pratense]
          Length = 384

 Score = 63.5 bits (153), Expect(2) = 6e-11
 Identities = 27/55 (49%), Positives = 38/55 (69%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++RD +G FL GFY   ++  +++AEI  +L GL LCW  G+RN +C SDSL A
Sbjct: 246 GLIRDTNGRFLKGFYGTASEASVLFAEILAVLNGLDLCWVNGFRNIVCFSDSLQA 300



 Score = 31.2 bits (69), Expect(2) = 6e-11
 Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 8/57 (14%)
 Frame = +3

Query: 66  FNFREIKCAATLHQVQSAYAIVGGTGATSLVSE--------ISWSRPQAGTVVLNVD 212
           FN +      ++ Q+QS  +       T +++         ++WSRP+ GTV LNVD
Sbjct: 178 FNNKRTTVQESITQIQSLLSACTAAFGTHVLASPHTGTARLVAWSRPREGTVCLNVD 234


>dbj|GAU24540.1| hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score = 65.1 bits (157), Expect(2) = 9e-11
 Identities = 28/55 (50%), Positives = 39/55 (70%)
 Frame = +2

Query: 251  GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
            G++R++ G+FL GFY   ++  ++YAEI  IL GL LCW  GYR+ +C SDSL A
Sbjct: 1009 GLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQA 1063



 Score = 28.9 bits (63), Expect(2) = 9e-11
 Identities = 13/26 (50%), Positives = 15/26 (57%)
 Frame = +3

Query: 135  GTGATSLVSEISWSRPQAGTVVLNVD 212
            G+G  S    + W RP  GTV LNVD
Sbjct: 970  GSGGNSEHRLVVWPRPDEGTVCLNVD 995


>gb|PNX74720.1| ribonuclease H, partial [Trifolium pratense]
          Length = 177

 Score = 61.2 bits (147), Expect(2) = 5e-10
 Identities = 25/54 (46%), Positives = 38/54 (70%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLI 412
           G++R+  G+FL GFY   ++  I+Y EI  +L GL LCW +G+R+ +C SDSL+
Sbjct: 39  GLIRNNVGAFLEGFYGTASQSNILYVEIMAVLHGLELCWNKGFRDVVCFSDSLL 92



 Score = 30.4 bits (67), Expect(2) = 5e-10
 Identities = 11/17 (64%), Positives = 14/17 (82%)
 Frame = +3

Query: 162 EISWSRPQAGTVVLNVD 212
           ++SWSRP  GT+ LNVD
Sbjct: 9   QVSWSRPAEGTICLNVD 25


>gb|PNX67808.1| ribonuclease H, partial [Trifolium pratense]
          Length = 199

 Score = 59.3 bits (142), Expect(3) = 7e-10
 Identities = 24/53 (45%), Positives = 34/53 (64%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++RD +G F+ GFY   N   I++ E+  +L GL +CWE G+R   C SDSL
Sbjct: 95  GLIRDHNGVFISGFYGAANVQSILFVELMAVLHGLHICWESGFRRVTCYSDSL 147



 Score = 27.7 bits (60), Expect(3) = 7e-10
 Identities = 11/16 (68%), Positives = 12/16 (75%)
 Frame = +3

Query: 165 ISWSRPQAGTVVLNVD 212
           +SWS P  GTV LNVD
Sbjct: 66  VSWSPPMEGTVCLNVD 81



 Score = 23.5 bits (49), Expect(3) = 7e-10
 Identities = 7/16 (43%), Positives = 9/16 (56%)
 Frame = +2

Query: 8  IILWTLWCARINFFLR 55
          I LW +WC R  F  +
Sbjct: 11 IALWVIWCGRNEFVFK 26


>dbj|GAU43217.1| hypothetical protein TSUD_301040 [Trifolium subterraneum]
          Length = 565

 Score = 63.2 bits (152), Expect(2) = 8e-10
 Identities = 28/55 (50%), Positives = 36/55 (65%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++RD  G F+ G+Y       I+YAEI  I +GL LCWE GYR  +C SDSL+A
Sbjct: 472 GLLRDNHGDFIWGYYGVAAAQNILYAEIMAIYQGLKLCWENGYRKVLCCSDSLLA 526



 Score = 27.7 bits (60), Expect(2) = 8e-10
 Identities = 11/16 (68%), Positives = 13/16 (81%)
 Frame = +3

Query: 165 ISWSRPQAGTVVLNVD 212
           +SWSRP AG + LNVD
Sbjct: 443 MSWSRPAAGIMCLNVD 458


>gb|PNX94180.1| ribonuclease H [Trifolium pratense]
          Length = 644

 Score = 65.5 bits (158), Expect(2) = 1e-09
 Identities = 29/55 (52%), Positives = 38/55 (69%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++R++SG+FL GFY       ++YAEI  IL GL LCW  GYR+ +C SDSL A
Sbjct: 504 GLIRNSSGAFLKGFYGAATLSSVLYAEIMAILHGLQLCWSNGYRSIVCYSDSLQA 558



 Score = 25.0 bits (53), Expect(2) = 1e-09
 Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 10/76 (13%)
 Frame = +3

Query: 12  SCGLCGVLGSISF*GQNNFNFREIKCAA--------TLHQVQSAYAIVGGTGATSLVSE- 164
           +CGL  VL S      N+  F   K +         ++H  ++A       GA     + 
Sbjct: 414 ACGLGHVLPSAIDTDLNDVVFNNTKASVYDSVAKVHSMHTFRTAAFETEMLGADGSSEQR 473

Query: 165 -ISWSRPQAGTVVLNV 209
            ++W+RP  GT  LNV
Sbjct: 474 LVTWTRPAEGTACLNV 489


>dbj|GAU48830.1| hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score = 62.0 bits (149), Expect(2) = 1e-09
 Identities = 25/53 (47%), Positives = 35/53 (66%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++RD++G FL GFY       I++AE+  +L GL +CWE G+R   C SDSL
Sbjct: 160 GLIRDSNGVFLSGFYGTATVQSILFAELMAVLHGLQICWESGFRRITCFSDSL 212



 Score = 28.1 bits (61), Expect(2) = 1e-09
 Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
 Frame = +3

Query: 84  KCAATLHQVQSAYAIVGGTGATSLVSE-ISWSRPQAGTVVLNVD 212
           K  + LH  ++ +     + AT+     ++W++P  GTV LNVD
Sbjct: 103 KIYSLLHSCEAVFTPPHSSMATTAKPRLVTWTKPAEGTVCLNVD 146


>gb|PNX61255.1| ribonuclease H, partial [Trifolium pratense]
          Length = 146

 Score = 62.4 bits (150), Expect = 2e-09
 Identities = 27/55 (49%), Positives = 37/55 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++R+  G+FL GFY   ++  ++YAEI  +L GL LCW  G+RN  C SDSL A
Sbjct: 31  GLIRNHFGAFLKGFYGTASQSSVLYAEIMAVLHGLELCWVNGFRNIACYSDSLQA 85


>dbj|GAU47359.1| hypothetical protein TSUD_403620 [Trifolium subterraneum]
          Length = 330

 Score = 61.6 bits (148), Expect(2) = 2e-09
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R+ +G F+ GFY   +   I++AEI  +L GL +CWE GYR   C+SDSL
Sbjct: 216 GLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTICWENGYRKINCLSDSL 268



 Score = 28.1 bits (61), Expect(2) = 2e-09
 Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
 Frame = +3

Query: 111 QSAYAIVGGTGATSLVSE----ISWSRPQAGTVVLNVD 212
           Q+  A  G T   +  S     ++W+RP  GT+ LNVD
Sbjct: 165 QACAAAFGSTQTIATQSSNPRLVTWARPMEGTICLNVD 202


>dbj|GAU49781.1| hypothetical protein TSUD_188300 [Trifolium subterraneum]
          Length = 221

 Score = 61.6 bits (148), Expect(2) = 2e-09
 Identities = 25/53 (47%), Positives = 36/53 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R+ +G F+ GFY   +   I++AEI  +L GL +CWE GYR   C+SDSL
Sbjct: 83  GLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTICWENGYRKINCLSDSL 135



 Score = 28.1 bits (61), Expect(2) = 2e-09
 Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
 Frame = +3

Query: 111 QSAYAIVGGTGATSLVSE----ISWSRPQAGTVVLNVD 212
           Q+  A  G T   +  S     ++W+RP  GT+ LNVD
Sbjct: 32  QACAAAFGSTQTIATQSSNPRLVTWARPMEGTICLNVD 69


>dbj|GAU48983.1| hypothetical protein TSUD_245740 [Trifolium subterraneum]
          Length = 1103

 Score = 62.8 bits (151), Expect(2) = 2e-09
 Identities = 27/55 (49%), Positives = 38/55 (69%)
 Frame = +2

Query: 251  GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
            G++R++  +FL GFY   ++  ++YAEI  IL GL LCW  GYR+ +C SDSL A
Sbjct: 965  GLIRNSFSAFLKGFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQA 1019



 Score = 26.6 bits (57), Expect(2) = 2e-09
 Identities = 12/25 (48%), Positives = 14/25 (56%)
 Frame = +3

Query: 135  GTGATSLVSEISWSRPQAGTVVLNV 209
            G+G  S    + W RP  GTV LNV
Sbjct: 926  GSGGNSEQRLVVWPRPAEGTVCLNV 950


>gb|PNX57966.1| ribonuclease H [Trifolium pratense]
          Length = 192

 Score = 62.4 bits (150), Expect(2) = 2e-09
 Identities = 28/55 (50%), Positives = 37/55 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++R+ +G+FL GFY   +   I+YAEI  +L GL LCW  GY N +C SDSL A
Sbjct: 54  GLIRNNAGAFLGGFYGVASMPSILYAEIMAVLHGLELCWNNGYTNLVCFSDSLQA 108



 Score = 26.9 bits (58), Expect(2) = 2e-09
 Identities = 10/16 (62%), Positives = 12/16 (75%)
 Frame = +3

Query: 165 ISWSRPQAGTVVLNVD 212
           ++WSRP  GT  LNVD
Sbjct: 25  VAWSRPTEGTFCLNVD 40


>gb|PNY06444.1| ribonuclease H [Trifolium pratense]
          Length = 547

 Score = 60.8 bits (146), Expect(2) = 6e-09
 Identities = 28/55 (50%), Positives = 35/55 (63%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++RD +G FL GFY  +    I+ AE+  IL GL +CWE GYR   C SDSL A
Sbjct: 409 GLLRDNNGVFLLGFYGAVTVPSILLAELMAILHGLQICWENGYRRITCFSDSLQA 463



 Score = 26.9 bits (58), Expect(2) = 6e-09
 Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
 Frame = +3

Query: 60  NNFNFREIKCAATLHQVQSAYAIVGGTGATSLVSE-ISWSRPQAGTVVLNVD 212
           +N +    K  + L   ++A++    T   S+ S  + WSRP  G V LNVD
Sbjct: 344 DNIHTSVTKNFSLLKSCEAAFSSPPTTSNISVTSRSVVWSRPVEGFVCLNVD 395


>gb|PNX86123.1| ribonuclease H [Trifolium pratense]
          Length = 149

 Score = 60.8 bits (146), Expect = 7e-09
 Identities = 25/53 (47%), Positives = 40/53 (75%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R++ GSFL GFY   ++  I+Y EI  +L GL LCW++G+++ +C+SDSL
Sbjct: 36  GLIRNSFGSFLGGFYGVASQASILYGEIMAMLHGLELCWDKGFKHVICLSDSL 88


>gb|KYP78163.1| hypothetical protein KK1_049646 [Cajanus cajan]
          Length = 76

 Score = 58.5 bits (140), Expect = 1e-08
 Identities = 27/52 (51%), Positives = 34/52 (65%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDS 406
           G+ RD +G FL  FY N     I++AEI  +L+GL LCW  GYRN +C SDS
Sbjct: 10  GLCRDHNGHFLMRFYSNAGSVTILHAEILVLLQGLELCWNVGYRNVICYSDS 61


>gb|KHN31021.1| Putative ribonuclease H protein [Glycine soja]
          Length = 172

 Score = 60.8 bits (146), Expect = 1e-08
 Identities = 26/53 (49%), Positives = 37/53 (69%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G+ R+ +G+FL GFY  +   E++YAEI  +L+GL LCWE  Y+  +C SDSL
Sbjct: 34  GLCRNHNGAFLLGFYGAVEISEVLYAEILALLKGLELCWEARYKKLVCYSDSL 86


>dbj|GAU30135.1| hypothetical protein TSUD_360280 [Trifolium subterraneum]
          Length = 479

 Score = 62.8 bits (151), Expect = 1e-08
 Identities = 27/55 (49%), Positives = 37/55 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSLIA 415
           G++R+  G F+CGFY       I++AEI  I  GL LCWERG+R  +C SDSL++
Sbjct: 287 GLLRNKDGDFICGFYGVAAIPNILFAEIMAIWHGLELCWERGFRKVLCYSDSLLS 341


>gb|PNY12901.1| S-adenosylmethionine-dependent methyltransferase [Trifolium
           pratense]
          Length = 315

 Score = 58.9 bits (141), Expect(2) = 1e-08
 Identities = 26/53 (49%), Positives = 37/53 (69%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R+  G+FL GFY   ++  I+YAEI  +L GL LCWE+G+ +  C SDSL
Sbjct: 247 GLIRNNDGTFLGGFYGVASQPSILYAEIMAMLHGLELCWEKGFWSVSCFSDSL 299



 Score = 28.1 bits (61), Expect(2) = 1e-08
 Identities = 10/16 (62%), Positives = 13/16 (81%)
 Frame = +3

Query: 165 ISWSRPQAGTVVLNVD 212
           ++WSRP  GT+ LNVD
Sbjct: 218 VTWSRPLEGTICLNVD 233


>dbj|GAU24479.1| hypothetical protein TSUD_319560 [Trifolium subterraneum]
          Length = 227

 Score = 58.5 bits (140), Expect(2) = 2e-08
 Identities = 24/53 (45%), Positives = 36/53 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R+ +G F+ GFY   +   I++AEI  +L GL +CWE GYR   C+S+SL
Sbjct: 104 GLLRNHNGEFILGFYGTTSLKSILFAEIMVVLHGLTICWENGYRKINCLSNSL 156



 Score = 27.7 bits (60), Expect(2) = 2e-08
 Identities = 9/16 (56%), Positives = 13/16 (81%)
 Frame = +3

Query: 165 ISWSRPQAGTVVLNVD 212
           ++W+RP  GT+ LNVD
Sbjct: 75  VTWARPMEGTICLNVD 90


>gb|PNX79078.1| ribonuclease H, partial [Trifolium pratense]
          Length = 548

 Score = 62.0 bits (149), Expect = 2e-08
 Identities = 26/53 (49%), Positives = 36/53 (67%)
 Frame = +2

Query: 251 GVVRDASGSFLCGFYCNLNKDEIIYAEIFGILRGLLLCWERGYRNFMCVSDSL 409
           G++R+  G+FL GFY   ++  ++YAEI  +L GL LCW  G+RN  C SDSL
Sbjct: 410 GLIRNNFGAFLKGFYGTASQSSVLYAEIMAVLHGLELCWVNGFRNIACYSDSL 462


Top