BLASTX nr result

ID: Astragalus23_contig00030248 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00030248
         (423 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subt...    70   3e-11
gb|PNY14301.1| ribonuclease H [Trifolium pratense]                     67   5e-10
dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subt...    67   6e-10
dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subt...    66   7e-10
gb|PNX72264.1| ribonuclease H [Trifolium pratense]                     64   1e-09
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]            64   5e-09
gb|PNX85341.1| hypothetical protein L195_g041409, partial [Trifo...    63   6e-09
gb|PNY15111.1| ribonuclease H [Trifolium pratense]                     60   8e-09
gb|PNX71533.1| ribonuclease H [Trifolium pratense]                     62   1e-08
dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subt...    61   4e-08
gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]            61   6e-08
dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte...    60   8e-08
gb|PNY04967.1| ribonuclease H [Trifolium pratense]                     55   2e-07
dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subt...    51   2e-06
dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte...    54   4e-06
gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense]            52   5e-06

>dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subterraneum]
          Length = 317

 Score = 69.7 bits (169), Expect = 3e-11
 Identities = 36/102 (35%), Positives = 50/102 (49%)
 Frame = +3

Query: 36  LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215
           LLWK W+ RNQ+IFK V  +P+ +      +VHEFN              +  S    A 
Sbjct: 99  LLWKFWYGRNQVIFKGVVLDPIALAAEAALYVHEFNEANPRRCSQVVLQQASVSRLDDAN 158

Query: 216 HSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
             ++  DAGCF  G  GWG  + N  G    +ACK E+I ++
Sbjct: 159 MQLMFTDAGCFNNGYTGWGIVLRNVDGTTSFSACKREEIEVE 200


>gb|PNY14301.1| ribonuclease H [Trifolium pratense]
          Length = 1196

 Score = 66.6 bits (161), Expect = 5e-10
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
 Frame = +3

Query: 36   LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXT--HLVSPKSTTAS 209
            LLWK W+ RNQ+IFK+   +P+ +      +VHEFN             H+ +P+   ++
Sbjct: 978  LLWKFWYGRNQVIFKDAVFDPILLAADAIEYVHEFNEANPRRCNQVVLQHISAPRLDDSN 1037

Query: 210  AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
                ++  DAGCF  G  GWG  + N  G    +ACK E+I ++
Sbjct: 1038 M--QLMFTDAGCFNNGYTGWGLVLRNVDGTTSFSACKRENIEVE 1079


>dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subterraneum]
          Length = 1610

 Score = 66.6 bits (161), Expect = 6e-10
 Identities = 35/100 (35%), Positives = 48/100 (48%)
 Frame = +3

Query: 33   SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212
            ++LWK WFARNQ +F      P+++  S   FV EFN                 + +AS 
Sbjct: 1478 TILWKFWFARNQYVFNGYPIEPLRLAQSALLFVQEFNEANNLSRSTHVATRVHNTNSASP 1537

Query: 213  AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
                + VDAGCF   R GWG  + +  G +   AC+ EDI
Sbjct: 1538 CQFSMFVDAGCFSNARTGWGLVLKDQRGNVTWNACRREDI 1577


>dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subterraneum]
          Length = 482

 Score = 66.2 bits (160), Expect = 7e-10
 Identities = 38/100 (38%), Positives = 51/100 (51%), Gaps = 2/100 (2%)
 Frame = +3

Query: 39  LWKIWFARNQLIFKNVN--PNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212
           LWKIWF RN+LIF+     P+  +V  S  SF  EF+          T  V   S   S 
Sbjct: 270 LWKIWFHRNKLIFEQQAFVPHEYEVASSASSFGAEFSPTFLREIDMNTSDVLEASQVVSP 329

Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
             + + VDAGCF  G  GWG  V +  G ++ +AC+ E+I
Sbjct: 330 ICNRICVDAGCFSNGSTGWGLIVKDHEGSVIFSACRFEEI 369


>gb|PNX72264.1| ribonuclease H [Trifolium pratense]
          Length = 854

 Score = 64.3 bits (155), Expect(2) = 1e-09
 Identities = 38/113 (33%), Positives = 50/113 (44%), Gaps = 11/113 (9%)
 Frame = +3

Query: 36  LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215
           LLWK W  RN  +F  V  +P ++     SFVH+FN               P+   A  A
Sbjct: 631 LLWKFWAGRNAAVFNGVQLDPGRLAIDAMSFVHDFNEANP-----------PRCRRAPVA 679

Query: 216 HSVVK-----------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
           H  ++           VDAGC   G   WG  + N+ GE V +ACK ED  +D
Sbjct: 680 HVPIQPGMTNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVD 732



 Score = 26.2 bits (56), Expect(2) = 1e-09
 Identities = 9/15 (60%), Positives = 12/15 (80%)
 Frame = +2

Query: 341 LDPLTAEILGIRWCM 385
           +DPL AE LG+RW +
Sbjct: 731 VDPLMAEALGVRWAL 745


>gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]
          Length = 894

 Score = 63.9 bits (154), Expect = 5e-09
 Identities = 37/101 (36%), Positives = 46/101 (45%)
 Frame = +3

Query: 30   SSLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTAS 209
            S+ LW IW  RN+LIFKNV   P+ V  +   FV EFN                      
Sbjct: 738  STTLWMIWKGRNKLIFKNVKFCPIYVAAASSDFVAEFNSGSCCNESNIVRENPDSWEPPE 797

Query: 210  AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
             A   V +DAGCF  G  GWG  + N +G +  AA  +E I
Sbjct: 798  QAKFKVNIDAGCFSNGTTGWGMIMRNHLGMVDFAATHLEKI 838


>gb|PNX85341.1| hypothetical protein L195_g041409, partial [Trifolium pratense]
          Length = 382

 Score = 63.2 bits (152), Expect(2) = 6e-09
 Identities = 34/100 (34%), Positives = 46/100 (46%)
 Frame = +3

Query: 33  SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212
           +LLWK W  RN +IF     +P ++     +FVHEFN              +      S 
Sbjct: 261 TLLWKFWAGRNAVIFNGWQMDPTRLALDAMNFVHEFNEANPSRNRRVLVSQAISDPPRST 320

Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
           + + + VDAGC   G   WG  + N  GE   +ACK EDI
Sbjct: 321 SLNSMFVDAGCCNSGHTVWGLVLRNMNGETTFSACKREDI 360



 Score = 24.6 bits (52), Expect(2) = 6e-09
 Identities = 9/17 (52%), Positives = 12/17 (70%)
 Frame = +2

Query: 335 IGLDPLTAEILGIRWCM 385
           I  +PL AE LG+RW +
Sbjct: 360 ITAEPLLAEALGVRWAL 376


>gb|PNY15111.1| ribonuclease H [Trifolium pratense]
          Length = 1334

 Score = 60.1 bits (144), Expect(2) = 8e-09
 Identities = 32/102 (31%), Positives = 47/102 (46%)
 Frame = +3

Query: 36   LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215
            L+WKIW ARN L+F N   +P+ +      F+ E +                 +    +A
Sbjct: 1119 LMWKIWNARNNLVFNNKLVDPIAIAQEAMYFMQELSPSPHEHNATPMQDAVLAAQPMPSA 1178

Query: 216  HSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
              V  VDAGCF     GWG  + N  G +V +AC+ E I ++
Sbjct: 1179 PHVFYVDAGCFSGNATGWGMVIYNQSGRVVLSACRKELIDVE 1220



 Score = 27.3 bits (59), Expect(2) = 8e-09
 Identities = 8/17 (47%), Positives = 14/17 (82%)
 Frame = +2

Query: 335  IGLDPLTAEILGIRWCM 385
            I ++P+ AE +G+RWC+
Sbjct: 1217 IDVEPVLAEAIGVRWCL 1233


>gb|PNX71533.1| ribonuclease H [Trifolium pratense]
          Length = 798

 Score = 62.4 bits (150), Expect(2) = 1e-08
 Identities = 37/103 (35%), Positives = 49/103 (47%), Gaps = 3/103 (2%)
 Frame = +3

Query: 33  SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXT---HLVSPKSTT 203
           +LLWK W  RN +IF     +P  +     SFV EFN               +  P  +T
Sbjct: 573 TLLWKFWAGRNAVIFNGWQMDPTFLALDALSFVQEFNEANPSRNRRALVSQSISEPSRST 632

Query: 204 ASAAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
            ++ +S+  VDAGC   G   WG  + N  GE V +ACK EDI
Sbjct: 633 CTSMNSMF-VDAGCCNSGHTVWGLVLRNLNGETVFSACKREDI 674



 Score = 24.6 bits (52), Expect(2) = 1e-08
 Identities = 9/17 (52%), Positives = 12/17 (70%)
 Frame = +2

Query: 335 IGLDPLTAEILGIRWCM 385
           I  +PL AE LG+RW +
Sbjct: 674 ITAEPLLAEALGVRWAL 690


>dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subterraneum]
          Length = 335

 Score = 60.8 bits (146), Expect(2) = 4e-08
 Identities = 36/96 (37%), Positives = 47/96 (48%)
 Frame = +3

Query: 45  KIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSV 224
           KIWF RN+LIFK     P +V  S  SFV EF+          T  V   S   S   + 
Sbjct: 127 KIWFHRNKLIFKQQAFVPHEVASSASSFVAEFSPTFLREIYMNTSDVLEASQVVSPVCNR 186

Query: 225 VKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
           + VDAG F  G  GWG  V +    ++ +AC+ E+I
Sbjct: 187 ICVDAGSFSNGSTGWGLIVKDHESSVILSACRFEEI 222



 Score = 24.3 bits (51), Expect(2) = 4e-08
 Identities = 10/25 (40%), Positives = 15/25 (60%)
 Frame = +2

Query: 347 PLTAEILGIRWCMS*SCQILYFYLT 421
           P+ AE LGIRW +  +  + Y  +T
Sbjct: 226 PILAEALGIRWAIQTAIDLNYNQVT 250


>gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1348

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 10/112 (8%)
 Frame = +3

Query: 39   LWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAH 218
            LWKIWF RNQ IFKN+  +P++V  + ++FV EF+                         
Sbjct: 1234 LWKIWFFRNQTIFKNLAFDPIRVSCAAQNFVSEFSVSSTPREQSTGQQPRCDWVAPPPDF 1293

Query: 219  SVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACK----------VEDIGLDW 344
              + VDAGC   G+  WG  + N   E+V AA K           E +GL W
Sbjct: 1294 FKLNVDAGCGSMGQVSWGLVIRNHNAEVVFAATKKTEFVAEAVVAEALGLRW 1345


>dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum]
          Length = 1626

 Score = 60.5 bits (145), Expect = 8e-08
 Identities = 36/101 (35%), Positives = 46/101 (45%)
 Frame = +3

Query: 30   SSLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTAS 209
            S+ LW IW  RN+LIFKN    P+ V  +   FV EFN          +     K     
Sbjct: 1411 STTLWMIWKGRNKLIFKNEKFCPIYVAAASSDFVAEFNSGTCSFENIPSCDNPGKWEHPE 1470

Query: 210  AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332
                 V +DAGCF  G  GWG  + N +G +  AA  +E I
Sbjct: 1471 QGKLKVNIDAGCFSNGTTGWGMIMRNHLGMVEFAATHLEKI 1511


>gb|PNY04967.1| ribonuclease H [Trifolium pratense]
          Length = 207

 Score = 54.7 bits (130), Expect(2) = 2e-07
 Identities = 27/98 (27%), Positives = 49/98 (50%)
 Frame = +3

Query: 48  IWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVV 227
           +WF RNQ++F+   P P  +  +    VHEFN              + +      +H ++
Sbjct: 1   MWFFRNQVVFQQKIPTPPDIAIAALDIVHEFNLAVPKKSKQRQQHAASEPAATLCSH-LI 59

Query: 228 KVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
           +VDAGCF +G   +G  + +  G +  +AC+ E++ +D
Sbjct: 60  QVDAGCFPDGYTTFGCVIKDCSGMISFSACRKENLLVD 97



 Score = 27.7 bits (60), Expect(2) = 2e-07
 Identities = 10/15 (66%), Positives = 12/15 (80%)
 Frame = +2

Query: 341 LDPLTAEILGIRWCM 385
           +DPL AE L IRWC+
Sbjct: 96  VDPLLAEALAIRWCL 110


>dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subterraneum]
          Length = 246

 Score = 51.2 bits (121), Expect(2) = 2e-06
 Identities = 31/108 (28%), Positives = 46/108 (42%), Gaps = 11/108 (10%)
 Frame = +3

Query: 51  WFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVVK 230
           W  RN  +F  +  +P ++     SFVH+FN               P+   A  AH  ++
Sbjct: 5   WNGRNATVFNGIKLDPGRLALDVTSFVHDFNEANP-----------PRCRRAPVAHVSIQ 53

Query: 231 -----------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
                      VDAGC   G   WG  + N+ GE + + CK E+I +D
Sbjct: 54  PSLVTPIFSLFVDAGCSMSGPIVWGLVLRNSDGETILSVCKREEISVD 101



 Score = 27.7 bits (60), Expect(2) = 2e-06
 Identities = 10/17 (58%), Positives = 13/17 (76%)
 Frame = +2

Query: 335 IGLDPLTAEILGIRWCM 385
           I +DPL AE LG+RW +
Sbjct: 98  ISVDPLMAETLGVRWAL 114


>dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum]
          Length = 1475

 Score = 53.5 bits (127), Expect(2) = 4e-06
 Identities = 32/103 (31%), Positives = 48/103 (46%)
 Frame = +3

Query: 33   SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212
            +LLWK W  RN ++F+    +PV +     SFV EFN          +  ++  + + S 
Sbjct: 1251 TLLWKFWATRNNVVFRGDKLDPVCLVDEVMSFVQEFNEANPPRQGRVSLPLTTVTPSISR 1310

Query: 213  AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
                V VDAGC   G   WG  + N       +ACK +DI ++
Sbjct: 1311 PSFSVFVDAGCNLNGPTVWGLVLKNHDRITTFSACKYDDIAVE 1353



 Score = 24.6 bits (52), Expect(2) = 4e-06
 Identities = 8/17 (47%), Positives = 13/17 (76%)
 Frame = +2

Query: 335  IGLDPLTAEILGIRWCM 385
            I ++P+ AE LG+RW +
Sbjct: 1350 IAVEPVMAEALGVRWAI 1366


>gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense]
          Length = 217

 Score = 51.6 bits (122), Expect(2) = 5e-06
 Identities = 33/105 (31%), Positives = 45/105 (42%), Gaps = 11/105 (10%)
 Frame = +3

Query: 60  RNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVVK--- 230
           RN  +F  V  +P ++     SFVH+FN               P+   A  AH  ++   
Sbjct: 2   RNAAVFNGVQLDPGRLAIDAMSFVHDFNEANP-----------PRCRRAPVAHVPIQPGM 50

Query: 231 --------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341
                   VDAGC   G   WG  + N+ GE V +ACK ED  +D
Sbjct: 51  TNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVD 95



 Score = 26.2 bits (56), Expect(2) = 5e-06
 Identities = 9/15 (60%), Positives = 12/15 (80%)
 Frame = +2

Query: 341 LDPLTAEILGIRWCM 385
           +DPL AE LG+RW +
Sbjct: 94  VDPLMAEALGVRWAL 108


Top