BLASTX nr result

ID: Astragalus22_contig00038631 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00038631
         (428 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family ...    87   1e-17
gb|KYP65942.1| Putative ribonuclease H protein At1g65750 family ...    84   3e-16
gb|KYP65976.1| Putative ribonuclease H protein At1g65750 family,...    79   4e-16
dbj|GAU32642.1| hypothetical protein TSUD_71900 [Trifolium subte...    77   2e-15
gb|AFK45241.1| unknown [Lotus japonicus]                               77   6e-15
dbj|GAU33152.1| hypothetical protein TSUD_206110 [Trifolium subt...    77   1e-14
dbj|GAU10638.1| hypothetical protein TSUD_421140, partial [Trifo...    77   2e-14
gb|KYP60192.1| Putative ribonuclease H protein At1g65750 family,...    76   3e-14
gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]            74   4e-14
dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subt...    74   5e-14
gb|KYP54027.1| Putative ribonuclease H protein At1g65750 family ...    78   6e-14
gb|KYP63365.1| Putative ribonuclease H protein At1g65750 family ...    73   2e-13
ref|XP_007158841.1| hypothetical protein PHAVU_002G186500g [Phas...    75   2e-13
ref|XP_007146655.1| hypothetical protein PHAVU_006G058500g [Phas...    75   3e-13
gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family ...    76   3e-13
gb|KYP44518.1| Putative ribonuclease H protein At1g65750 family,...    76   3e-13
dbj|GAU44820.1| hypothetical protein TSUD_400390 [Trifolium subt...    73   5e-13
gb|KRH65176.1| hypothetical protein GLYMA_03G018300 [Glycine max]      72   7e-13
gb|KRH65175.1| hypothetical protein GLYMA_03G018200 [Glycine max]      72   7e-13
gb|OIS97451.1| putative ribonuclease h protein [Nicotiana attenu...    73   8e-13

>gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 363

 Score = 87.4 bits (215), Expect = 1e-17
 Identities = 51/109 (46%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W P   G  KLNCD ++   GR  GCGG+IR++ G F++ FSC L  C+ L+AEL  I H
Sbjct: 150 WQPPPLGSIKLNCDGAVRGVGRKVGCGGIIRNYLGGFIMGFSCKLGQCSILQAELWAIFH 209

Query: 169 GVRIASLKGFLE-LDVETDSSQAIDLLVGGVGDLKS-QTLIREIQEFGD 29
           G+RI   KGF E + VE DSS AI  L  G     S   LI  I E  D
Sbjct: 210 GLRIIKEKGFKEDIIVELDSSLAIKFLNEGCSASHSCAPLINSIVELAD 258


>gb|KYP65942.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 457

 Score = 84.3 bits (207), Expect = 3e-16
 Identities = 44/87 (50%), Positives = 55/87 (63%), Gaps = 1/87 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W P   G  KLNCD ++   GR AGCGG+I+D+ G F+  F C L  C+ L+AEL  I H
Sbjct: 290 WQPPPLGSIKLNCDRAVHGVGRKAGCGGIIKDYLGGFITGFPCKLGQCSILQAELWTIFH 349

Query: 169 GVRIASLKGFLE-LDVETDSSQAIDLL 92
           G+RI   KGF E + VE+DSS AI  L
Sbjct: 350 GLRIIKDKGFKEDIIVESDSSLAIKFL 376


>gb|KYP65976.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 108

 Score = 78.6 bits (192), Expect = 4e-16
 Identities = 41/78 (52%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
 Frame = -2

Query: 322 KLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKHGVRIASLKG 143
           KLNCD ++   GR AGCGG+I+D+ G F+  F C L  C+ L+AEL  I HG+RI   KG
Sbjct: 2   KLNCDRAVHGVGRKAGCGGIIKDYLGGFITGFPCKLGQCSILQAELWTIFHGLRIIKDKG 61

Query: 142 FLE-LDVETDSSQAIDLL 92
           F E + VE+DSS AI  L
Sbjct: 62  FKEDIIVESDSSLAIKFL 79


>dbj|GAU32642.1| hypothetical protein TSUD_71900 [Trifolium subterraneum]
          Length = 109

 Score = 76.6 bits (187), Expect = 2e-15
 Identities = 39/103 (37%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
 Frame = -2

Query: 331 GRW-KLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKHGVRIA 155
           G+W KLNCD +  E+   AGCGG+ RD +G ++  ++  + AC++L AE+ GI  G+++A
Sbjct: 5   GKWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGACDALHAEMWGIYTGMQMA 64

Query: 154 SLKGFLELDVETDSSQAIDLLVGGVG-DLKSQTLIREIQEFGD 29
             +GF  + VE+DS   ID++ G    + K+  L+R I++F +
Sbjct: 65  RRQGFTHIIVESDSKLLIDMVTGSCKLNGKTPILVRRIRDFAN 107


>gb|AFK45241.1| unknown [Lotus japonicus]
          Length = 165

 Score = 77.0 bits (188), Expect = 6e-15
 Identities = 43/116 (37%), Positives = 60/116 (51%), Gaps = 1/116 (0%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W PT     KLN D S + +   A CGGV RDHHG F+L F+  +  C+ L AEL G+ H
Sbjct: 29  WTPTLESWVKLNTDGSYSVDEDCAACGGVPRDHHGNFLLGFTMKVGVCSILHAELWGLVH 88

Query: 169 GVRIASLKGFLELDVETDSSQAIDLLVGGVGDL-KSQTLIREIQEFGDDGVNVTWK 5
           G+R    +GF ++ +E DS+  I+ L  G   +     L+RE          + WK
Sbjct: 89  GLRFVLGRGFSKILIEADSAGTIEFLNKGCPVVHPCFPLVREFHHLVGQNCYIHWK 144


>dbj|GAU33152.1| hypothetical protein TSUD_206110 [Trifolium subterraneum]
          Length = 190

 Score = 77.0 bits (188), Expect = 1e-14
 Identities = 40/111 (36%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
 Frame = -2

Query: 331 GRW-KLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKHGVRIA 155
           G W KLNCD +  E+   AGCGG+ RD +G ++  ++  + AC++L AE+ G+  G+++A
Sbjct: 32  GEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGACDALHAEMWGMYTGMQMA 91

Query: 154 SLKGFLELDVETDSSQAIDLLVGGVG-DLKSQTLIREIQEFGDDGVNVTWK 5
             +GF  + VE+DS   ID++ G    + K+  L+R I++F +   ++T+K
Sbjct: 92  RRQGFTHIIVESDSKLLIDMVTGSCKLNGKTPILVRRIRDFANLQWHITFK 142


>dbj|GAU10638.1| hypothetical protein TSUD_421140, partial [Trifolium subterraneum]
          Length = 236

 Score = 77.0 bits (188), Expect = 2e-14
 Identities = 40/111 (36%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
 Frame = -2

Query: 331 GRW-KLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKHGVRIA 155
           G W KLNCD +  E+   AGCGG+ RD +G ++  ++  + AC++L AE+ G+  G+++A
Sbjct: 70  GEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGACDALHAEMWGMYTGMQMA 129

Query: 154 SLKGFLELDVETDSSQAIDLLVGGVG-DLKSQTLIREIQEFGDDGVNVTWK 5
             +GF  + VE+DS   ID++ G    + K+  L+R I++F +   ++T+K
Sbjct: 130 RRQGFTHIIVESDSKLLIDMVTGSCKLNGKTPILVRRIRDFANLQWHITFK 180


>gb|KYP60192.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 192

 Score = 75.9 bits (185), Expect = 3e-14
 Identities = 48/121 (39%), Positives = 67/121 (55%), Gaps = 2/121 (1%)
 Frame = -2

Query: 364 DLGESWLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAEL 185
           D    W    TG  KLNCD ++A NG  A CGGV+R+ +G FV+AF   L  C+ LEAEL
Sbjct: 22  DRNTQWTYPPTGALKLNCDGAVARNGE-ASCGGVVRNSNGKFVVAFFGRLGRCSILEAEL 80

Query: 184 MGIKHGVRIASLKGFLE-LDVETDSSQAIDLLVGGVGDLKSQT-LIREIQEFGDDGVNVT 11
             I  G RI   +   + + VE+DSS+AI ++  G   +     L+REI++       V+
Sbjct: 81  RAILQGTRIILERNVGQVILVESDSSEAIRIINEGCSRVHPCCHLVREIKDLSTQLSRVS 140

Query: 10  W 8
           W
Sbjct: 141 W 141


>gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]
          Length = 146

 Score = 74.3 bits (181), Expect = 4e-14
 Identities = 38/116 (32%), Positives = 64/116 (55%), Gaps = 1/116 (0%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W+    G   LN D ++    + AGCGGVIR+  G +V  F+  L  C++  AEL GI  
Sbjct: 29  WMRPQQGYLSLNTDGAVKNGSQQAGCGGVIRNDSGNWVCGFAKALGPCSAFVAELWGILE 88

Query: 169 GVRIASLKGFLELDVETDSSQAIDLLVGGV-GDLKSQTLIREIQEFGDDGVNVTWK 5
           G+ IA  +  + ++V+ DS+  +  L     G ++ + L+R+I+E  + G+ V +K
Sbjct: 89  GIIIAKDRNIMRIEVQVDSTAVLQCLTSSKNGSVRGRRLVRKIRELIEQGIEVQFK 144


>dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subterraneum]
          Length = 153

 Score = 74.3 bits (181), Expect = 5e-14
 Identities = 43/107 (40%), Positives = 60/107 (56%), Gaps = 2/107 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W     G  KLNCD +  E G  AGCGG+ RD  G ++  F+  + AC++L AE+ G+  
Sbjct: 44  WKKPQDGWVKLNCDRACKELGETAGCGGLFRDSDGRWIKGFTRKIGACDALHAEMWGMYL 103

Query: 169 GVRIASLKGFLELDVETDSSQAIDLLVGGVGDLKSQT--LIREIQEF 35
           G+ IA   G   L VE+DS   I+++     ++K  T  LIR IQEF
Sbjct: 104 GIDIAWRDGLSHLIVESDSKVLINMVTNNC-NIKGHTPLLIRRIQEF 149


>gb|KYP54027.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 459

 Score = 77.8 bits (190), Expect = 6e-14
 Identities = 45/114 (39%), Positives = 64/114 (56%), Gaps = 2/114 (1%)
 Frame = -2

Query: 364 DLGESWLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAEL 185
           +L   W        KLNCD S++  G  A CGG++R+  GAFVLA++C L +C+   AE+
Sbjct: 291 NLSSFWSRPPANHLKLNCDGSVSVRG-LAVCGGIVRESAGAFVLAYACKLGSCSITNAEI 349

Query: 184 MGIKHGVRIASLKGFL-ELDVETDSSQAIDLLVGGVGDLKSQ-TLIREIQEFGD 29
             I HG+RI      L  + VETDS  A++L+  G         L++EIQE G+
Sbjct: 350 WAILHGLRIIRNNNLLGRILVETDSLTAVNLISHGCDHSHPNFNLVKEIQELGN 403


>gb|KYP63365.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 171

 Score = 73.2 bits (178), Expect = 2e-13
 Identities = 43/109 (39%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W        KLNCD S++  G  AGC G++RD  G FVLA++C L +C+   AE+  I H
Sbjct: 8   WSRPPANHLKLNCDGSVSARG-LAGCCGIVRDAAGTFVLAYACKLGSCSITNAEIWAILH 66

Query: 169 GVRIASLKGFL-ELDVETDSSQAIDLLVGGVG-DLKSQTLIREIQEFGD 29
           G+RI      L  + VETDS  +I+L+  G      S  L++EIQ+  +
Sbjct: 67  GLRIIRNNNLLGRILVETDSLTSINLISHGCDPSHPSFNLVKEIQDLSN 115


>ref|XP_007158841.1| hypothetical protein PHAVU_002G186500g [Phaseolus vulgaris]
 gb|ESW30835.1| hypothetical protein PHAVU_002G186500g [Phaseolus vulgaris]
          Length = 252

 Score = 74.7 bits (182), Expect = 2e-13
 Identities = 47/120 (39%), Positives = 61/120 (50%), Gaps = 6/120 (5%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W P   G  K+NCD +   +G  AG GGV+RD  G F+  FS  L   + L AEL  IK 
Sbjct: 81  WTPPFEGFVKINCDGAFTMHGNKAGAGGVVRDWRGEFIFGFSSGLKNYSVLMAELEAIKI 140

Query: 169 GVRIASLKGFLELDVETDSSQAIDLLVGGV-----GDL-KSQTLIREIQEFGDDGVNVTW 8
           G+ IA  KG+  L VE+DS  AID++   V     GD  +SQ  +  I E       + W
Sbjct: 141 GIEIAISKGYKNLMVESDSKVAIDIITSLVVQQSNGDTSQSQNDMSSIIEISKTANKIHW 200


>ref|XP_007146655.1| hypothetical protein PHAVU_006G058500g [Phaseolus vulgaris]
 gb|ESW18649.1| hypothetical protein PHAVU_006G058500g [Phaseolus vulgaris]
          Length = 277

 Score = 74.7 bits (182), Expect = 3e-13
 Identities = 43/110 (39%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKH 170
           W P   G  K+NCD +   +G  AG GGV+RD  G F+  FS  L  C+ L AEL  IK 
Sbjct: 106 WRPPFEGFVKINCDGAFTMHGNKAGAGGVVRDWRGNFIFGFSSGLANCSVLTAELEAIKI 165

Query: 169 GVRIASLKGFLELDVETDSSQAIDLLVGGVGD------LKSQTLIREIQE 38
           G+     KG+  L VE+DS  A+D++   V         +SQ +I  I E
Sbjct: 166 GIETTISKGYKNLMVESDSKVAVDIITSLVAQQSNDDTRQSQDVISSIME 215


>gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 506

 Score = 75.9 bits (185), Expect = 3e-13
 Identities = 37/81 (45%), Positives = 56/81 (69%), Gaps = 1/81 (1%)
 Frame = -2

Query: 352 SWLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIK 173
           SW    TG +KLNCD ++   G  A CGG+IRDHHG+FV+AFSC +  C+ ++AEL  + 
Sbjct: 390 SWTLPPTGAFKLNCDGAVVA-GSGAACGGIIRDHHGSFVVAFSCKIGLCSVVQAELWAVY 448

Query: 172 HGVRIA-SLKGFLELDVETDS 113
           +G+++A  ++   +L VE+DS
Sbjct: 449 YGLKLAHDIRISGDLFVESDS 469


>gb|KYP44518.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 584

 Score = 75.9 bits (185), Expect = 3e-13
 Identities = 48/121 (39%), Positives = 67/121 (55%), Gaps = 2/121 (1%)
 Frame = -2

Query: 364 DLGESWLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAEL 185
           D    W    TG  KLNCD ++A NG  A CGGV+R+ +G FV+AFS  L  C+ LEAEL
Sbjct: 414 DRNTQWTYPPTGALKLNCDGAVARNGE-ASCGGVVRNSNGKFVVAFSGRLGRCSILEAEL 472

Query: 184 MGIKHGVRIASLKGFLE-LDVETDSSQAIDLLVGGVGDLKSQT-LIREIQEFGDDGVNVT 11
             I  G RI   +   + + VE+DS +AI ++  G   +     L+REI++       V+
Sbjct: 473 RAILQGTRIILERNVGQVILVESDSLEAIRIINEGCSRVHPCCHLVREIKDLSTQLSRVS 532

Query: 10  W 8
           W
Sbjct: 533 W 533


>dbj|GAU44820.1| hypothetical protein TSUD_400390 [Trifolium subterraneum]
          Length = 198

 Score = 72.8 bits (177), Expect = 5e-13
 Identities = 38/111 (34%), Positives = 69/111 (62%), Gaps = 2/111 (1%)
 Frame = -2

Query: 331 GRW-KLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNSLEAELMGIKHGVRIA 155
           G W KLNCD +  E+   AGCGG+ RD +G ++  ++  + AC++L AE+ G+  G+++A
Sbjct: 32  GEWIKLNCDGAYKESMGLAGCGGLFRDSNGRWLKGYAQKIGACDALHAEMWGMYTGMQMA 91

Query: 154 SLKGFLELDVETDSSQAIDLLVGGVG-DLKSQTLIREIQEFGDDGVNVTWK 5
             +GF  + V++DS   ID++      + K+  L+R I++F +   ++T+K
Sbjct: 92  RRQGFTHIIVQSDSKLLIDMVTESCKLNGKTPILVRRIRDFANLQWHITFK 142


>gb|KRH65176.1| hypothetical protein GLYMA_03G018300 [Glycine max]
          Length = 180

 Score = 72.0 bits (175), Expect = 7e-13
 Identities = 40/93 (43%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNS-LEAELMGIK 173
           W P   G +KLNCD +    G     GGV+RD  G F+L FS  L  C+S LEAEL  IK
Sbjct: 18  WTPPPRGFFKLNCDGAFTVYGNKGAAGGVLRDWKGEFILGFSDALIECSSALEAELWAIK 77

Query: 172 HGVRIASLKGFLELDVETDSSQAIDLLVGGVGD 74
            G++    +G+  L VE+DS +AI ++    GD
Sbjct: 78  IGMQTVVARGYRNLIVESDSLKAIQIINAHKGD 110


>gb|KRH65175.1| hypothetical protein GLYMA_03G018200 [Glycine max]
          Length = 180

 Score = 72.0 bits (175), Expect = 7e-13
 Identities = 40/93 (43%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
 Frame = -2

Query: 349 WLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLAFSCPLHACNS-LEAELMGIK 173
           W P   G +KLNCD +    G     GGV+RD  G F+L FS  L  C+S LEAEL  IK
Sbjct: 18  WTPPPRGFFKLNCDGAFTVYGNKGAAGGVLRDWKGEFILGFSDALIECSSALEAELWAIK 77

Query: 172 HGVRIASLKGFLELDVETDSSQAIDLLVGGVGD 74
            G++    +G+  L VE+DS +AI ++    GD
Sbjct: 78  IGMQTVVARGYRNLIVESDSLKAIQIINAHKGD 110


>gb|OIS97451.1| putative ribonuclease h protein [Nicotiana attenuata]
          Length = 252

 Score = 73.2 bits (178), Expect = 8e-13
 Identities = 42/110 (38%), Positives = 59/110 (53%)
 Frame = -2

Query: 409 QCVSLLSPQIQLATDDLGESWLPTATGRWKLNCDESIAENGRFAGCGGVIRDHHGAFVLA 230
           Q V ++ PQ+ +    L   W   + G +KLN D     N   AGCGGV+RDH G  ++A
Sbjct: 116 QLVDIIKPQLHI----LPVRWNKPSVGEFKLNVDGCSKGNPGNAGCGGVLRDHLGRLIMA 171

Query: 229 FSCPLHACNSLEAELMGIKHGVRIASLKGFLELDVETDSSQAIDLLVGGV 80
           F+  L +C++  AE   IK GVR     GF  L VE+DS   I ++ G +
Sbjct: 172 FTVYLGSCSNNSAEAQAIKTGVRWCLDHGFNRLTVESDSLVVIQMIRGEI 221


Top