BLASTX nr result

ID: Astragalus23_contig00022914 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00022914
         (598 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU44992.1| hypothetical protein TSUD_184930 [Trifolium subt...   182   2e-55
gb|PNX82343.1| hypothetical protein L195_g038372 [Trifolium prat...   183   4e-54
ref|XP_004515829.1| PREDICTED: uncharacterized protein LOC101491...   177   6e-52
ref|XP_003608867.1| hypothetical protein MTR_4g103840 [Medicago ...   166   1e-47
ref|XP_020217427.1| uncharacterized protein LOC109800914 [Cajanu...   148   1e-40
gb|KHN14087.1| hypothetical protein glysoja_033314 [Glycine soja]     140   3e-37
ref|XP_016190366.1| uncharacterized protein LOC107631433 [Arachi...   139   5e-37
ref|XP_003549841.1| PREDICTED: uncharacterized protein LOC100812...   139   8e-37
gb|KHN23271.1| hypothetical protein glysoja_040240 [Glycine soja]     132   6e-35
ref|XP_006582520.2| PREDICTED: uncharacterized protein LOC102660...   132   7e-35
ref|XP_006579592.1| PREDICTED: uncharacterized protein LOC102662...   133   9e-35
ref|XP_019462844.1| PREDICTED: uncharacterized protein LOC109361...   129   4e-33
ref|XP_020210755.1| uncharacterized protein LOC109795676 [Cajanu...   124   6e-32
gb|OMO97825.1| hypothetical protein COLO4_14335 [Corchorus olito...   125   8e-32
gb|KYP69540.1| hypothetical protein KK1_008733 [Cajanus cajan]        124   9e-32
ref|XP_021908839.1| uncharacterized protein LOC110822924 [Carica...   125   1e-31
gb|OMO95069.1| hypothetical protein CCACVL1_05610 [Corchorus cap...   123   6e-31
ref|XP_021286511.1| uncharacterized protein LOC110418184 [Herran...   123   7e-31
ref|XP_007137937.1| hypothetical protein PHAVU_009G168000g [Phas...   122   7e-31
ref|XP_006578696.1| PREDICTED: uncharacterized protein LOC102664...   122   7e-31

>dbj|GAU44992.1| hypothetical protein TSUD_184930 [Trifolium subterraneum]
          Length = 157

 Score =  182 bits (461), Expect = 2e-55
 Identities = 84/94 (89%), Positives = 88/94 (93%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTWINGAYREMMGEGIV LVMKV+ V+L PSFTCRVRVVQFACGSGRE
Sbjct: 64  TCPGFISDGYGRVTWINGAYREMMGEGIVALVMKVNGVILYPSFTCRVRVVQFACGSGRE 123

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLGC 283
           + SLTVPCDVWRM+CGGFAWRLDVK ALSL LGC
Sbjct: 124 RNSLTVPCDVWRMNCGGFAWRLDVKAALSLRLGC 157


>gb|PNX82343.1| hypothetical protein L195_g038372 [Trifolium pratense]
 gb|PNX83679.1| hypothetical protein L195_g039723 [Trifolium pratense]
 gb|PNX83978.1| hypothetical protein L195_g040029 [Trifolium pratense]
          Length = 292

 Score =  183 bits (465), Expect = 4e-54
 Identities = 84/94 (89%), Positives = 88/94 (93%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTWINGAYREMMGEG+V LVMKV+ V+L PSFTCRVRVVQFACGSGRE
Sbjct: 199 TCPGFISDGYGRVTWINGAYREMMGEGVVALVMKVNGVILYPSFTCRVRVVQFACGSGRE 258

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLGC 283
           + SLTVPCDVWRMDCGGFAWRLDVK ALSL LGC
Sbjct: 259 RNSLTVPCDVWRMDCGGFAWRLDVKAALSLRLGC 292


>ref|XP_004515829.1| PREDICTED: uncharacterized protein LOC101491129 [Cicer arietinum]
          Length = 292

 Score =  177 bits (450), Expect = 6e-52
 Identities = 83/94 (88%), Positives = 86/94 (91%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW NGAYREMMGEG+V LVMKVS VVL PSFTCRVRVVQFACGSGRE
Sbjct: 199 TCPGFISDGYGRVTWTNGAYREMMGEGVVVLVMKVSGVVLYPSFTCRVRVVQFACGSGRE 258

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLGC 283
           + SLTVPCDVWRM+ GGFAWRLDVK ALSL LGC
Sbjct: 259 RNSLTVPCDVWRMESGGFAWRLDVKAALSLRLGC 292


>ref|XP_003608867.1| hypothetical protein MTR_4g103840 [Medicago truncatula]
 gb|AES91064.1| hypothetical protein MTR_4g103840 [Medicago truncatula]
          Length = 266

 Score =  166 bits (420), Expect = 1e-47
 Identities = 76/94 (80%), Positives = 83/94 (88%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW NGAYREMMG+G++ LVMK++ VVL PSFTCRVRVVQFAC  GRE
Sbjct: 175 TCPGFISDGYGRVTWTNGAYREMMGDGVIALVMKINGVVLYPSFTCRVRVVQFAC--GRE 232

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLGC 283
           +   T+PCDVWRMDCGGFAWRLDVK ALSL LGC
Sbjct: 233 RNLFTLPCDVWRMDCGGFAWRLDVKAALSLRLGC 266


>ref|XP_020217427.1| uncharacterized protein LOC109800914 [Cajanus cajan]
 gb|KYP65312.1| hypothetical protein KK1_011545 [Cajanus cajan]
          Length = 274

 Score =  148 bits (374), Expect = 1e-40
 Identities = 76/100 (76%), Positives = 79/100 (79%), Gaps = 7/100 (7%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMM-------GEGIVTLVMKVSAVVLSPSFTCRVRVVQF 160
           TCPGFISDGYGRVTW N AY EMM       G+G V LVMKVSAVV  PSFTCRVRVVQ+
Sbjct: 176 TCPGFISDGYGRVTWTNEAYGEMMKGEGEGQGQGRVLLVMKVSAVVAHPSFTCRVRVVQY 235

Query: 161 ACGSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
            CG  RE+ SLTVPCDVWRMD GGFAWRLDVKTALSL  G
Sbjct: 236 TCG--RERSSLTVPCDVWRMDSGGFAWRLDVKTALSLRFG 273


>gb|KHN14087.1| hypothetical protein glysoja_033314 [Glycine soja]
          Length = 290

 Score =  140 bits (352), Expect = 3e-37
 Identities = 72/101 (71%), Positives = 78/101 (77%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMM--------GEGIVTLVMKVSAVVLSPSFTCRVRVVQ 157
           TCPGFISDGYGRVTW N AY +MM        G+G V LV KV+ VV   SFTC VRVVQ
Sbjct: 191 TCPGFISDGYGRVTWTNEAYGKMMMMKGEGDEGQGPVLLVNKVNTVVPHASFTCLVRVVQ 250

Query: 158 FACGSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
           ++CG  RE+ SLTVPCDVWRMDCGGFAWRLDVKTALSL LG
Sbjct: 251 YSCG--RERNSLTVPCDVWRMDCGGFAWRLDVKTALSLRLG 289


>ref|XP_016190366.1| uncharacterized protein LOC107631433 [Arachis ipaensis]
          Length = 295

 Score =  139 bits (351), Expect = 5e-37
 Identities = 72/98 (73%), Positives = 79/98 (80%), Gaps = 5/98 (5%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGE----GIVTLVMKVSAVVLSP-SFTCRVRVVQFAC 166
           TCPGFISDGYGRVTW NGAYREM+G+     +V L MKV AVV  P SFTCRVRVVQF  
Sbjct: 199 TCPGFISDGYGRVTWTNGAYREMVGQKNEGAVVVLAMKVGAVVPYPCSFTCRVRVVQFH- 257

Query: 167 GSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
            +G+E+ +LTVPCDVWRMD GGFAWRLDVK ALSL LG
Sbjct: 258 -AGKERSALTVPCDVWRMDFGGFAWRLDVKAALSLRLG 294


>ref|XP_003549841.1| PREDICTED: uncharacterized protein LOC100812834 [Glycine max]
 gb|KRH03914.1| hypothetical protein GLYMA_17G127900 [Glycine max]
          Length = 290

 Score =  139 bits (349), Expect = 8e-37
 Identities = 71/101 (70%), Positives = 78/101 (77%), Gaps = 8/101 (7%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMM--------GEGIVTLVMKVSAVVLSPSFTCRVRVVQ 157
           TCPGFISDGYGRVTW N AY +MM        G+G V LV KV+ VV   SFTC VRVVQ
Sbjct: 191 TCPGFISDGYGRVTWTNEAYGKMMMMKGEGDEGQGPVLLVNKVNTVVPHASFTCLVRVVQ 250

Query: 158 FACGSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
           ++CG  +E+ SLTVPCDVWRMDCGGFAWRLDVKTALSL LG
Sbjct: 251 YSCG--KERNSLTVPCDVWRMDCGGFAWRLDVKTALSLRLG 289


>gb|KHN23271.1| hypothetical protein glysoja_040240 [Glycine soja]
          Length = 245

 Score =  132 bits (333), Expect = 6e-35
 Identities = 67/92 (72%), Positives = 71/92 (77%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW NGAYRE+MGEG V L MKV+   L   FTCRVR VQ+ACG  R 
Sbjct: 158 TCPGFISDGYGRVTWTNGAYREIMGEGGVWLAMKVTVPCLYRGFTCRVR-VQYACGKER- 215

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGL 277
               TVPCDVWRM+ GGFAWRLDVK ALSL L
Sbjct: 216 ----TVPCDVWRMNSGGFAWRLDVKAALSLSL 243


>ref|XP_006582520.2| PREDICTED: uncharacterized protein LOC102660960 [Glycine max]
 gb|KRH54218.1| hypothetical protein GLYMA_06G172200 [Glycine max]
          Length = 246

 Score =  132 bits (333), Expect = 7e-35
 Identities = 67/92 (72%), Positives = 71/92 (77%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW NGAYRE+MGEG V L MKV+   L   FTCRVR VQ+ACG  R 
Sbjct: 159 TCPGFISDGYGRVTWTNGAYREIMGEGGVWLAMKVTVPCLYRGFTCRVR-VQYACGKER- 216

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGL 277
               TVPCDVWRM+ GGFAWRLDVK ALSL L
Sbjct: 217 ----TVPCDVWRMNSGGFAWRLDVKAALSLSL 244


>ref|XP_006579592.1| PREDICTED: uncharacterized protein LOC102662292 [Glycine max]
 gb|KHN34397.1| hypothetical protein glysoja_016725 [Glycine soja]
 gb|KRH57200.1| hypothetical protein GLYMA_05G045800 [Glycine max]
          Length = 289

 Score =  133 bits (335), Expect = 9e-35
 Identities = 70/97 (72%), Positives = 76/97 (78%), Gaps = 4/97 (4%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMM----GEGIVTLVMKVSAVVLSPSFTCRVRVVQFACG 169
           TCPGFISDGYGRVT  N AY +MM    G+G V LV KV+ VV   SFTC VRVVQ+ACG
Sbjct: 194 TCPGFISDGYGRVTGTNEAYEKMMEGDEGQGPVLLVNKVNTVVPHASFTCLVRVVQYACG 253

Query: 170 SGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
             RE+ SLTVPCDVWRMD GGFAWRLDV+TALSL LG
Sbjct: 254 --RERSSLTVPCDVWRMDSGGFAWRLDVETALSLRLG 288


>ref|XP_019462844.1| PREDICTED: uncharacterized protein LOC109361759 [Lupinus
           angustifolius]
 gb|OIW00280.1| hypothetical protein TanjilG_27531 [Lupinus angustifolius]
          Length = 288

 Score =  129 bits (324), Expect = 4e-33
 Identities = 66/97 (68%), Positives = 78/97 (80%), Gaps = 4/97 (4%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEG---IVTLVMKVSAVVLSP-SFTCRVRVVQFACG 169
           TCPGFI+DGYG+VTW+NGAYREM+GEG   ++T+   VSA V +P SFTCRVRVV++   
Sbjct: 193 TCPGFITDGYGKVTWMNGAYREMVGEGACVLLTMKKNVSATVTNPLSFTCRVRVVEYDT- 251

Query: 170 SGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
            G+E+ S TVPCDVWRMD  GF WRLDVKTALSL LG
Sbjct: 252 CGKERSSFTVPCDVWRMDF-GFTWRLDVKTALSLSLG 287


>ref|XP_020210755.1| uncharacterized protein LOC109795676 [Cajanus cajan]
          Length = 233

 Score =  124 bits (312), Expect = 6e-32
 Identities = 63/93 (67%), Positives = 66/93 (70%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYG VTW NGAYRE +G G V L MK S       FTCRVR VQ+ACG  R 
Sbjct: 144 TCPGFISDGYGSVTWTNGAYRETVGGGGVWLAMKASVAYPYGGFTCRVR-VQYACGKER- 201

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
               TVPCD WRMD GGFAWRLDVK AL+L LG
Sbjct: 202 ----TVPCDAWRMDSGGFAWRLDVKAALTLSLG 230


>gb|OMO97825.1| hypothetical protein COLO4_14335 [Corchorus olitorius]
          Length = 283

 Score =  125 bits (315), Expect = 8e-32
 Identities = 63/96 (65%), Positives = 74/96 (77%), Gaps = 5/96 (5%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMG----EGIVTLVMKVSAVVLS-PSFTCRVRVVQFAC 166
           TCPGFISDG GRVTW NGAY+EM+G    E +V LVMK    +++ P+FTCRVRV Q+ C
Sbjct: 189 TCPGFISDGLGRVTWTNGAYKEMVGGAGGETMVWLVMKERLPMITYPAFTCRVRVQQYTC 248

Query: 167 GSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLG 274
           G  +E  SLT+PCDVWRMD GGFAWRLD+  ALSLG
Sbjct: 249 G--KESSSLTLPCDVWRMDGGGFAWRLDINAALSLG 282


>gb|KYP69540.1| hypothetical protein KK1_008733 [Cajanus cajan]
          Length = 248

 Score =  124 bits (312), Expect = 9e-32
 Identities = 63/93 (67%), Positives = 66/93 (70%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYG VTW NGAYRE +G G V L MK S       FTCRVR VQ+ACG  R 
Sbjct: 159 TCPGFISDGYGSVTWTNGAYRETVGGGGVWLAMKASVAYPYGGFTCRVR-VQYACGKER- 216

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGLG 280
               TVPCD WRMD GGFAWRLDVK AL+L LG
Sbjct: 217 ----TVPCDAWRMDSGGFAWRLDVKAALTLSLG 245


>ref|XP_021908839.1| uncharacterized protein LOC110822924 [Carica papaya]
          Length = 300

 Score =  125 bits (315), Expect = 1e-31
 Identities = 64/94 (68%), Positives = 75/94 (79%), Gaps = 3/94 (3%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEG---IVTLVMKVSAVVLSPSFTCRVRVVQFACGS 172
           TCPGFISDG GRVTW NGAYREM+G+    +V LVMK  A++  P+FTCRVR+ Q+ACG 
Sbjct: 209 TCPGFISDGMGRVTWTNGAYREMVGQCGGMMVWLVMKERAMLTYPAFTCRVRL-QYACG- 266

Query: 173 GREKISLTVPCDVWRMDCGGFAWRLDVKTALSLG 274
            RE+ SL +PCDVWRM+ GGFAWRLDVK AL LG
Sbjct: 267 -RERNSLILPCDVWRMEGGGFAWRLDVKAALCLG 299


>gb|OMO95069.1| hypothetical protein CCACVL1_05610 [Corchorus capsularis]
          Length = 283

 Score =  123 bits (309), Expect = 6e-31
 Identities = 62/96 (64%), Positives = 73/96 (76%), Gaps = 5/96 (5%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMG----EGIVTLVMKVSAVVLS-PSFTCRVRVVQFAC 166
           TCPGFISDG GRVTW NGAY+EM+G    E +V LVMK    +++ P+ TCRVRV Q+ C
Sbjct: 189 TCPGFISDGLGRVTWTNGAYKEMVGGVGGETMVWLVMKERLPMITYPALTCRVRVQQYTC 248

Query: 167 GSGREKISLTVPCDVWRMDCGGFAWRLDVKTALSLG 274
           G  +E  SLT+PCDVWRMD GGFAWRLD+  ALSLG
Sbjct: 249 G--KESSSLTLPCDVWRMDGGGFAWRLDINAALSLG 282


>ref|XP_021286511.1| uncharacterized protein LOC110418184 [Herrania umbratica]
          Length = 276

 Score =  123 bits (308), Expect = 7e-31
 Identities = 62/93 (66%), Positives = 75/93 (80%), Gaps = 2/93 (2%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMG-EGIVTLVMKVSAVVLS-PSFTCRVRVVQFACGSG 175
           TCPGFISDG+GRVTW NGAY+EM+G E +V LVMK    +++ P+FTCRVRV Q+ CG  
Sbjct: 186 TCPGFISDGFGRVTWTNGAYKEMVGGEMMVWLVMKERLPMITCPAFTCRVRV-QYTCG-- 242

Query: 176 REKISLTVPCDVWRMDCGGFAWRLDVKTALSLG 274
           +E+ SLT+PCDVWRMD GGFAWRLD+  AL LG
Sbjct: 243 KERSSLTLPCDVWRMDGGGFAWRLDINAALCLG 275


>ref|XP_007137937.1| hypothetical protein PHAVU_009G168000g [Phaseolus vulgaris]
 gb|ESW09931.1| hypothetical protein PHAVU_009G168000g [Phaseolus vulgaris]
          Length = 262

 Score =  122 bits (307), Expect = 7e-31
 Identities = 64/92 (69%), Positives = 68/92 (73%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW NGAYRE++GEG V L MKVS       FT  VR VQ+A G  R 
Sbjct: 175 TCPGFISDGYGRVTWTNGAYREVVGEGGVWLAMKVSVAYPYRGFTGWVR-VQYASGKER- 232

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGL 277
               TVPCDVWRMDCGGFAWRLDVK AL+L L
Sbjct: 233 ----TVPCDVWRMDCGGFAWRLDVKAALTLTL 260


>ref|XP_006578696.1| PREDICTED: uncharacterized protein LOC102664115 [Glycine max]
 gb|KRH63730.1| hypothetical protein GLYMA_04G193900 [Glycine max]
          Length = 248

 Score =  122 bits (306), Expect = 7e-31
 Identities = 63/92 (68%), Positives = 67/92 (72%)
 Frame = +2

Query: 2   TCPGFISDGYGRVTWINGAYREMMGEGIVTLVMKVSAVVLSPSFTCRVRVVQFACGSGRE 181
           TCPGFISDGYGRVTW N AYRE +G G V L MKV+       FTCRVR V++ACG    
Sbjct: 161 TCPGFISDGYGRVTWTNEAYRETVGAGGVWLAMKVAVPYPYRGFTCRVR-VRYACG---- 215

Query: 182 KISLTVPCDVWRMDCGGFAWRLDVKTALSLGL 277
            I  TVPCDVWRMD GGFAWRLDVK ALSL L
Sbjct: 216 -IERTVPCDVWRMDSGGFAWRLDVKAALSLSL 246

