BLASTX nr result

ID: Astragalus23_contig00008072 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00008072
         (400 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFK41923.1| unknown [Lotus japonicus]                              122   7e-32
ref|XP_004496386.1| PREDICTED: uncharacterized protein LOC101497...   121   9e-32
gb|AFK39341.1| unknown [Lotus japonicus]                              120   2e-31
gb|AFK34055.1| unknown [Lotus japonicus]                              112   4e-28
gb|PNX67145.1| hypothetical protein L195_g055469, partial [Trifo...   102   3e-25
gb|AFK42656.1| unknown [Lotus japonicus]                              105   8e-25
dbj|GAU15977.1| hypothetical protein TSUD_42000 [Trifolium subte...   100   2e-23
dbj|GAU15978.1| hypothetical protein TSUD_42010 [Trifolium subte...    99   5e-23
gb|KYP69785.1| Putative ribonuclease H protein At1g65750 family ...    92   1e-20
ref|NP_001236017.1| uncharacterized protein LOC100526921 [Glycin...    90   2e-19
gb|PNX66128.1| hypothetical protein L195_g054921, partial [Trifo...    88   3e-19
dbj|BAT99172.1| hypothetical protein VIGAN_10056800 [Vigna angul...    88   1e-18
gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]            85   3e-18
ref|XP_013469552.1| transmembrane protein, putative [Medicago tr...    82   3e-17
gb|KHN29953.1| Putative ribonuclease H protein [Glycine soja]          83   1e-16
ref|XP_019163602.1| PREDICTED: uncharacterized protein LOC109159...    82   2e-15
gb|PNX88481.1| ribonuclease H, partial [Trifolium pratense]            79   5e-15
gb|KYP69787.1| Putative ribonuclease H protein At1g65750 family ...    77   5e-15
ref|NP_001240873.1| uncharacterized protein LOC100812666 [Glycin...    79   6e-15
ref|XP_019162021.1| PREDICTED: uncharacterized protein LOC109158...    80   1e-14

>gb|AFK41923.1| unknown [Lotus japonicus]
          Length = 229

 Score =  122 bits (305), Expect = 7e-32
 Identities = 62/103 (60%), Positives = 74/103 (71%), Gaps = 1/103 (0%)
 Frame = +1

Query: 94  VWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSR-EDETEKKG 270
           VWTKP+ GWLKLNVDGSLL +  SAGCGGVLR+ SG WI GFA KL P     DETEK+ 
Sbjct: 67  VWTKPRRGWLKLNVDGSLLPDPLSAGCGGVLRDSSGKWISGFAVKLEPRRHYPDETEKEA 126

Query: 271 ILKGLQWAKEKGFMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
           I +GL+WA+ +   +V VESDN GIV  V + SR+ +PLI  I
Sbjct: 127 IFRGLRWARGRRVKKVVVESDNRGIVNLVKNGSRTINPLICQI 169


>ref|XP_004496386.1| PREDICTED: uncharacterized protein LOC101497185 [Cicer arietinum]
 ref|XP_004496387.1| PREDICTED: uncharacterized protein LOC101497515 [Cicer arietinum]
          Length = 215

 Score =  121 bits (303), Expect = 9e-32
 Identities = 60/91 (65%), Positives = 70/91 (76%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGILKGLQWAKEKG 306
           LNVDGSLLR+I SAGCGGVL + SG W+ GFAQKL+P+ REDETEK+ IL+GL W KEKG
Sbjct: 70  LNVDGSLLRQIPSAGCGGVLTDSSGKWLCGFAQKLNPNLREDETEKEAILRGLIWVKEKG 129

Query: 307 FMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
             +V  ++DNEGI   VNS  RS DPLI  I
Sbjct: 130 KRKVTAKTDNEGIEISVNSGRRSNDPLICRI 160


>gb|AFK39341.1| unknown [Lotus japonicus]
          Length = 229

 Score =  120 bits (302), Expect = 2e-31
 Identities = 62/103 (60%), Positives = 73/103 (70%), Gaps = 1/103 (0%)
 Frame = +1

Query: 94  VWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSR-EDETEKKG 270
           VWTKP+ GWLKLNVDGSLL +  SAGCG VLR+ SG WI GFA KL P     DETEK+ 
Sbjct: 67  VWTKPRRGWLKLNVDGSLLPDPLSAGCGDVLRDSSGKWISGFAVKLEPRRHYPDETEKEA 126

Query: 271 ILKGLQWAKEKGFMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
           I +GLQWA+ +   +V VESDN GIV  V + SR+ +PLI  I
Sbjct: 127 IFRGLQWARGRRVKKVVVESDNRGIVNLVKNGSRTINPLICQI 169


>gb|AFK34055.1| unknown [Lotus japonicus]
          Length = 217

 Score =  112 bits (279), Expect = 4e-28
 Identities = 58/107 (54%), Positives = 76/107 (71%), Gaps = 3/107 (2%)
 Frame = +1

Query: 88  ISVWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSR---EDET 258
           + VWTKPK+GW+KLNVDGSL  E  SAGCGGVLR+ SG WI GFA+KL+         + 
Sbjct: 55  LKVWTKPKMGWIKLNVDGSLQSE--SAGCGGVLRDSSGKWISGFAKKLAHPGGGHCPHKP 112

Query: 259 EKKGILKGLQWAKEKGFMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
           E++ + +GLQWA E+G  RVEVESD +G+V+ V S S ++D +I  I
Sbjct: 113 EEEALCRGLQWAWERGENRVEVESDRQGLVDSVKSGSTTSDLVIREI 159


>gb|PNX67145.1| hypothetical protein L195_g055469, partial [Trifolium pratense]
          Length = 156

 Score =  102 bits (255), Expect = 3e-25
 Identities = 53/91 (58%), Positives = 62/91 (68%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGILKGLQWAKEKG 306
           L VDGSLLR   SAGCGG L + S  WI GFAQ+L+P  REDETEK+ IL+GL W KEKG
Sbjct: 12  LKVDGSLLRGSNSAGCGGFLSSASEKWICGFAQRLNPDLREDETEKEAILRGLVWVKEKG 71

Query: 307 FMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
             +V V++DN GI   VNS  R  D +I  I
Sbjct: 72  KKKVVVKTDNRGIENLVNSGRRCNDSVICEI 102


>gb|AFK42656.1| unknown [Lotus japonicus]
          Length = 283

 Score =  105 bits (261), Expect = 8e-25
 Identities = 55/105 (52%), Positives = 72/105 (68%), Gaps = 3/105 (2%)
 Frame = +1

Query: 94  VWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSR---EDETEK 264
           +WTKPK+GW+KLNVDGSL  E  SAGCGGVLR+  G WI GFA K +         + E+
Sbjct: 123 LWTKPKMGWIKLNVDGSLRSE--SAGCGGVLRDSFGKWISGFAVKFAHPGGGHCPHKPEE 180

Query: 265 KGILKGLQWAKEKGFMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
           + + +GLQWA E+G  RVEVESD +G+V+ V S S ++D +I  I
Sbjct: 181 EALCRGLQWAWERGENRVEVESDRKGLVDSVKSGSTTSDLVICEI 225


>dbj|GAU15977.1| hypothetical protein TSUD_42000 [Trifolium subterraneum]
          Length = 219

 Score =  100 bits (248), Expect = 2e-23
 Identities = 55/91 (60%), Positives = 63/91 (69%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGILKGLQWAKEKG 306
           L VDGS L    SAGCGG L + S  WI GFAQKL+ + REDETEK+ IL+GL W KEKG
Sbjct: 78  LKVDGSRLPS--SAGCGGYLSSASKKWICGFAQKLNANLREDETEKEAILRGLLWVKEKG 135

Query: 307 FMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
             +V V++DNEGI   VNS  RS DPLI  I
Sbjct: 136 KKKVVVKTDNEGIKILVNSGRRSNDPLICGI 166


>dbj|GAU15978.1| hypothetical protein TSUD_42010 [Trifolium subterraneum]
          Length = 201

 Score = 98.6 bits (244), Expect = 5e-23
 Identities = 51/93 (54%), Positives = 66/93 (70%), Gaps = 2/93 (2%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKL--SPSSREDETEKKGILKGLQWAKE 300
           L VDGS+LR++ SAGCGG L + S +WI GF QKL  +P+ +E ETEK+ IL+G++W K 
Sbjct: 60  LKVDGSVLRKVPSAGCGGYLSSRSQNWICGFVQKLKFTPNLKEHETEKEAILRGMRWVKN 119

Query: 301 KGFMRVEVESDNEGIVEKVNSWSRSTDPLILAI 399
           KG   V V+SD + +VE VNS  RS D LI AI
Sbjct: 120 KGMRNVVVKSDCKNVVEFVNSGRRSNDRLICAI 152


>gb|KYP69785.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 185

 Score = 92.0 bits (227), Expect = 1e-20
 Identities = 45/90 (50%), Positives = 63/90 (70%), Gaps = 1/90 (1%)
 Frame = +1

Query: 97  WTKPKVGWLKLNVDGSLLR-EIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGI 273
           W KP++GW+K+NVDGS  + E  +AGCGGV+R+  G+W+ GF+QKL P ++  +TE + I
Sbjct: 19  WKKPEIGWVKVNVDGSRDQYEAPTAGCGGVVRDAWGTWLIGFSQKLDPKNKPHQTELEAI 78

Query: 274 LKGLQWAKEKGFMRVEVESDNEGIVEKVNS 363
           LKGL+ A +  F +V VESD E  V  V S
Sbjct: 79  LKGLKLALKMNFDKVVVESDCESAVRMVES 108


>ref|NP_001236017.1| uncharacterized protein LOC100526921 [Glycine max]
 ref|XP_006574355.1| PREDICTED: uncharacterized protein LOC100526921 isoform X1 [Glycine
           max]
 gb|ACU15942.1| unknown [Glycine max]
          Length = 247

 Score = 90.1 bits (222), Expect = 2e-19
 Identities = 45/100 (45%), Positives = 65/100 (65%), Gaps = 1/100 (1%)
 Frame = +1

Query: 88  ISVWTKPKVGWLKLNVDGSL-LREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEK 264
           +S W KP++GW+KLNVDGS    +  SAGCGGVLR+ S  W+ GFA+KL+P+    +TE 
Sbjct: 76  VSRWKKPEIGWVKLNVDGSRDPYKSSSAGCGGVLRDASAKWLRGFAKKLNPTYAVHQTEL 135

Query: 265 KGILKGLQWAKEKGFMRVEVESDNEGIVEKVNSWSRSTDP 384
           + IL GL+ A E    ++ VESD++ +V  V +  +   P
Sbjct: 136 EAILTGLKVASEMNVKKLIVESDSDSVVSMVENGVKPNHP 175


>gb|PNX66128.1| hypothetical protein L195_g054921, partial [Trifolium pratense]
          Length = 184

 Score = 88.2 bits (217), Expect = 3e-19
 Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 2/91 (2%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKL--SPSSREDETEKKGILKGLQWAKE 300
           L VDGS L +I SAGCGG L + S  WI GF QKL  +P+   DETE++ IL+GL W KE
Sbjct: 36  LKVDGSRLPKISSAGCGGYLSSASQDWICGFVQKLMFTPTLTSDETEREAILRGLLWVKE 95

Query: 301 KGFMRVEVESDNEGIVEKVNSWSRSTDPLIL 393
           K   +V   +DNEG+   VNS  R  DPL L
Sbjct: 96  KEKKKVIAYTDNEGVENLVNSGRRCKDPLNL 126


>dbj|BAT99172.1| hypothetical protein VIGAN_10056800 [Vigna angularis var.
           angularis]
          Length = 236

 Score = 88.2 bits (217), Expect = 1e-18
 Identities = 44/93 (47%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
 Frame = +1

Query: 88  ISVWTKPKVGWLKLNVDGSL-LREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEK 264
           ++ W KP+ GW+KLNVDGS  L +  SAGCGGV+R+ SG W+ GFA+KL  S +  +TE 
Sbjct: 64  VAKWKKPEAGWVKLNVDGSQDLFKGPSAGCGGVVRDSSGKWVGGFAKKLGSSLKAHQTEL 123

Query: 265 KGILKGLQWAKEKGFMRVEVESDNEGIVEKVNS 363
           + IL GLQ+A      +V +ESD+   V  V +
Sbjct: 124 QAILTGLQFASRMKLKKVVLESDHNSAVRMVEN 156


>gb|PNX61593.1| ribonuclease H, partial [Trifolium pratense]
          Length = 146

 Score = 84.7 bits (208), Expect = 3e-18
 Identities = 40/89 (44%), Positives = 60/89 (67%)
 Frame = +1

Query: 97  WTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGIL 276
           W +P+ G+L LN DG++    Q AGCGGV+RNDSG+W+ GFA+ L P S     E  GIL
Sbjct: 29  WMRPQQGYLSLNTDGAVKNGSQQAGCGGVIRNDSGNWVCGFAKALGPCS-AFVAELWGIL 87

Query: 277 KGLQWAKEKGFMRVEVESDNEGIVEKVNS 363
           +G+  AK++  MR+EV+ D+  +++ + S
Sbjct: 88  EGIIIAKDRNIMRIEVQVDSTAVLQCLTS 116


>ref|XP_013469552.1| transmembrane protein, putative [Medicago truncatula]
 gb|KEH43590.1| transmembrane protein, putative [Medicago truncatula]
          Length = 139

 Score = 82.0 bits (201), Expect = 3e-17
 Identities = 46/107 (42%), Positives = 61/107 (57%), Gaps = 2/107 (1%)
 Frame = +1

Query: 79  TKEISVWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDET 258
           T+   VWT+P +GW+KLNV GSL  + +SAGCGGVLR+ S  W+ GFAQKL P+ + +E 
Sbjct: 52  TEPEPVWTEPMIGWVKLNVSGSLFPQNRSAGCGGVLRDSSRKWLCGFAQKLKPNLKANEI 111

Query: 259 EKKGILKGLQWAKEKGFMRVEVESDNEGIVEKVNSW--SRSTDPLIL 393
           EK                      DN+ IV  VN+    +S DPLI+
Sbjct: 112 EK----------------------DNKEIVNFVNNGPKGKSIDPLII 136


>gb|KHN29953.1| Putative ribonuclease H protein [Glycine soja]
          Length = 259

 Score = 83.2 bits (204), Expect = 1e-16
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 4/93 (4%)
 Frame = +1

Query: 97  WTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPS--SREDETEKKG 270
           W KP+ GW+KLNVDGS + E  SAGCGGV+R++ G+W  GF QKL P+   +   TE + 
Sbjct: 87  WKKPESGWVKLNVDGSRIHEEPSAGCGGVIRDEWGTWCVGFDQKLDPNICRQAHYTELQA 146

Query: 271 ILKGLQWAKEK--GFMRVEVESDNEGIVEKVNS 363
           IL GL+ A+E      ++ VESD+E  V  V S
Sbjct: 147 ILTGLKVAREDMINVEKLVVESDSEPAVNMVKS 179


>ref|XP_019163602.1| PREDICTED: uncharacterized protein LOC109159944 [Ipomoea nil]
          Length = 1610

 Score = 81.6 bits (200), Expect = 2e-15
 Identities = 43/91 (47%), Positives = 57/91 (62%)
 Frame = +1

Query: 76   QTKEISVWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDE 255
            Q+ ++  W KP  G LKLN+DGS+     +AGCGGV+RN SG WI GF  KL  +    E
Sbjct: 1433 QSWKMLTWKKPPPGTLKLNIDGSVAPLSLTAGCGGVIRNSSGEWITGFIAKLG-TCTPLE 1491

Query: 256  TEKKGILKGLQWAKEKGFMRVEVESDNEGIV 348
             E   ILKG+Q+A  KG+  V +ESD+  +V
Sbjct: 1492 AEAWSILKGIQFAIAKGYSNVLIESDSSDVV 1522


>gb|PNX88481.1| ribonuclease H, partial [Trifolium pratense]
          Length = 261

 Score = 79.0 bits (193), Expect = 5e-15
 Identities = 47/95 (49%), Positives = 54/95 (56%), Gaps = 4/95 (4%)
 Frame = +1

Query: 127 LNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGILKGLQWAKEKG 306
           L VDGS   +I  A CGG L N SG WI GF QKL P+ R D+ EK+ IL GL W +  G
Sbjct: 21  LYVDGSHKPDIPLAACGGFLCNTSGKWICGFTQKLDPNLRLDQVEKQAILTGLLWVQGMG 80

Query: 307 FMRVEVESDNEGIVEKVN----SWSRSTDPLILAI 399
              V V+SD E  V+ VN    S S   DPLI  I
Sbjct: 81  KRNVLVKSDREEAVKSVNNPVISKSTKDDPLICDI 115


>gb|KYP69787.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 184

 Score = 77.4 bits (189), Expect = 5e-15
 Identities = 40/89 (44%), Positives = 57/89 (64%)
 Frame = +1

Query: 97  WTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDETEKKGIL 276
           W KP++GW+K+NVDGS   E       GV+R+  G+W+ GF Q+L P+++  ETE + IL
Sbjct: 19  WKKPEIGWVKVNVDGSRDHEGP-----GVVRDACGTWLIGFFQELDPTNKPHETELQAIL 73

Query: 277 KGLQWAKEKGFMRVEVESDNEGIVEKVNS 363
            GL+ A +  F +V VESD+E  V  V S
Sbjct: 74  TGLELALKMNFDKVVVESDSEPAVRMVKS 102


>ref|NP_001240873.1| uncharacterized protein LOC100812666 [Glycine max]
 gb|ACU18716.1| unknown [Glycine max]
          Length = 254

 Score = 78.6 bits (192), Expect = 6e-15
 Identities = 45/94 (47%), Positives = 60/94 (63%), Gaps = 5/94 (5%)
 Frame = +1

Query: 97  WTKPKVGWLKLNVDGSLLREIQ-SAGCGGVLRNDSGSWIYGFAQKLSPS--SREDETEKK 267
           W KP+ GW+KLNVDGS + E   SAGCGGV+R++ G+W  GF QKL P+   +   TE +
Sbjct: 81  WKKPESGWVKLNVDGSRIHEEPASAGCGGVIRDEWGTWCVGFDQKLDPNICRQAHYTELQ 140

Query: 268 GILKGLQWAKEK--GFMRVEVESDNEGIVEKVNS 363
            IL GL+ A+E      ++ VESD+E  V  V S
Sbjct: 141 AILTGLKVAREDMINVEKLVVESDSEPAVNMVKS 174


>ref|XP_019162021.1| PREDICTED: uncharacterized protein LOC109158590 [Ipomoea nil]
          Length = 1268

 Score = 79.7 bits (195), Expect = 1e-14
 Identities = 40/96 (41%), Positives = 56/96 (58%)
 Frame = +1

Query: 76   QTKEISVWTKPKVGWLKLNVDGSLLREIQSAGCGGVLRNDSGSWIYGFAQKLSPSSREDE 255
            Q K++ VW KP  GW+KLNVDG      + AGCGGVLR+  G+W  GF+  +  +    E
Sbjct: 1088 QNKKVFVWDKPADGWIKLNVDGCCKGREKRAGCGGVLRDTQGNWRRGFSHNIG-ACEAKE 1146

Query: 256  TEKKGILKGLQWAKEKGFMRVEVESDNEGIVEKVNS 363
             E   IL GL+ A   G  ++ VESD+  ++  +NS
Sbjct: 1147 AEAWAILVGLRMAAHYGASKIVVESDSIAVIRALNS 1182


Top