BLASTX nr result

ID: Astragalus24_contig00025088 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00025088
         (332 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX98576.1| hypothetical protein L195_g021826, partial [Trifo...   186   3e-57
gb|PNX98738.1| hypothetical protein L195_g021995, partial [Trifo...   186   4e-57
dbj|GAU20348.1| hypothetical protein TSUD_338300 [Trifolium subt...   185   2e-56
dbj|GAU20347.1| hypothetical protein TSUD_338290 [Trifolium subt...   185   3e-56
ref|XP_003625710.1| PWWP domain protein [Medicago truncatula] >g...   180   4e-54
ref|XP_004517305.1| PREDICTED: uncharacterized protein LOC101495...   176   2e-52
ref|XP_004494013.1| PREDICTED: uncharacterized protein LOC101508...   175   3e-52
ref|XP_004494012.1| PREDICTED: uncharacterized protein LOC101508...   175   4e-52
ref|XP_019459283.1| PREDICTED: uncharacterized protein LOC109359...   159   5e-46
ref|XP_019459282.1| PREDICTED: uncharacterized protein LOC109359...   159   9e-46
ref|XP_019459281.1| PREDICTED: uncharacterized protein LOC109359...   159   9e-46
ref|XP_020960159.1| uncharacterized protein LOC107648727 isoform...   155   2e-45
ref|XP_016208035.1| uncharacterized protein LOC107648727 isoform...   155   3e-44
ref|XP_020980377.1| uncharacterized protein LOC107493853 isoform...   152   5e-44
ref|XP_007162799.1| hypothetical protein PHAVU_001G181700g [Phas...   154   6e-44
gb|KHN18693.1| hypothetical protein glysoja_021444 [Glycine soja]     154   1e-43
ref|XP_003521421.1| PREDICTED: uncharacterized protein LOC100801...   154   1e-43
ref|XP_015970372.1| uncharacterized protein LOC107493853 isoform...   152   7e-43
ref|XP_006604592.1| PREDICTED: uncharacterized protein LOC100776...   146   1e-41
ref|XP_020211227.1| PWWP domain-containing protein 2A-like [Caja...   147   3e-41

>gb|PNX98576.1| hypothetical protein L195_g021826, partial [Trifolium pratense]
          Length = 245

 Score =  186 bits (472), Expect = 3e-57
 Identities = 90/110 (81%), Positives = 93/110 (84%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PG+VVWAR +CQMWWPAEIIEERCALSD VSDGHVLVQF
Sbjct: 50  KMSESTPRESSVSDNSSLAVTPGNVVWARTSCQMWWPAEIIEERCALSDCVSDGHVLVQF 109

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDPATDISIFEDSFEERSNNPSNDFQEAL+QALQRKAQL SC
Sbjct: 110 YGNHPSAWIDPATDISIFEDSFEERSNNPSNDFQEALQQALQRKAQLSSC 159


>gb|PNX98738.1| hypothetical protein L195_g021995, partial [Trifolium pratense]
          Length = 258

 Score =  186 bits (472), Expect = 4e-57
 Identities = 90/110 (81%), Positives = 93/110 (84%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PG+VVWAR +CQMWWPAEIIEERCALSD VSDGHVLVQF
Sbjct: 68  KMSESTPRESSVSDNSSLAVTPGNVVWARTSCQMWWPAEIIEERCALSDCVSDGHVLVQF 127

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDPATDISIFEDSFEERSNNPSNDFQEAL+QALQRKAQL SC
Sbjct: 128 YGNHPSAWIDPATDISIFEDSFEERSNNPSNDFQEALQQALQRKAQLSSC 177


>dbj|GAU20348.1| hypothetical protein TSUD_338300 [Trifolium subterraneum]
          Length = 275

 Score =  185 bits (469), Expect = 2e-56
 Identities = 89/110 (80%), Positives = 92/110 (83%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PG+VVWAR +CQMWWPAEIIEERCALSD  SDGHVLVQF
Sbjct: 85  KMSESTPRESSVSDNSSLAVTPGNVVWARTSCQMWWPAEIIEERCALSDCASDGHVLVQF 144

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDPATDISIFEDSFEERSNNPSNDFQ+ALKQALQRKAQL SC
Sbjct: 145 YGNHPSAWIDPATDISIFEDSFEERSNNPSNDFQDALKQALQRKAQLSSC 194


>dbj|GAU20347.1| hypothetical protein TSUD_338290 [Trifolium subterraneum]
          Length = 299

 Score =  185 bits (469), Expect = 3e-56
 Identities = 89/110 (80%), Positives = 92/110 (83%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PG+VVWAR +CQMWWPAEIIEERCALSD  SDGHVLVQF
Sbjct: 109 KMSESTPRESSVSDNSSLAVTPGNVVWARTSCQMWWPAEIIEERCALSDCASDGHVLVQF 168

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDPATDISIFEDSFEERSNNPSNDFQ+ALKQALQRKAQL SC
Sbjct: 169 YGNHPSAWIDPATDISIFEDSFEERSNNPSNDFQDALKQALQRKAQLSSC 218


>ref|XP_003625710.1| PWWP domain protein [Medicago truncatula]
 gb|AES81928.1| PWWP domain protein [Medicago truncatula]
          Length = 306

 Score =  180 bits (456), Expect = 4e-54
 Identities = 86/110 (78%), Positives = 92/110 (83%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PG+VVWAR ACQMWWPAEI+EE CALSDRV+DG+VLVQF
Sbjct: 116 KMSESTPRESSVSDNSSLAVTPGTVVWARTACQMWWPAEIMEESCALSDRVNDGNVLVQF 175

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDPATDISIFEDSFEERSNNPS+DFQ+ALKQALQRK QL SC
Sbjct: 176 YGNHPSAWIDPATDISIFEDSFEERSNNPSSDFQDALKQALQRKTQLSSC 225


>ref|XP_004517305.1| PREDICTED: uncharacterized protein LOC101495660 [Cicer arietinum]
          Length = 319

 Score =  176 bits (446), Expect = 2e-52
 Identities = 84/110 (76%), Positives = 90/110 (81%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR +CQ WWPAEI+EERCA SD V DGHVLVQF
Sbjct: 129 KMSESTPRESSVSDSSSLAVTPGSVVWARTSCQTWWPAEIMEERCAPSDYVRDGHVLVQF 188

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAW+DP+T+ISIFEDSFEERSNNPSNDFQ+ALKQALQRKAQL SC
Sbjct: 189 YGNHPSAWLDPSTNISIFEDSFEERSNNPSNDFQDALKQALQRKAQLSSC 238


>ref|XP_004494013.1| PREDICTED: uncharacterized protein LOC101508955 isoform X2 [Cicer
           arietinum]
          Length = 309

 Score =  175 bits (444), Expect = 3e-52
 Identities = 84/110 (76%), Positives = 89/110 (80%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR +CQ WWPAEI+EERCA SD   DGHVLVQF
Sbjct: 119 KMSESTPRESSVSDSSSLAVTPGSVVWARTSCQTWWPAEIMEERCAPSDDARDGHVLVQF 178

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDP+T+ISIFEDSFEERSNNPSNDFQ+ALKQALQRKAQL SC
Sbjct: 179 YGNHPSAWIDPSTNISIFEDSFEERSNNPSNDFQDALKQALQRKAQLSSC 228


>ref|XP_004494012.1| PREDICTED: uncharacterized protein LOC101508955 isoform X1 [Cicer
           arietinum]
          Length = 319

 Score =  175 bits (444), Expect = 4e-52
 Identities = 84/110 (76%), Positives = 89/110 (80%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR +CQ WWPAEI+EERCA SD   DGHVLVQF
Sbjct: 129 KMSESTPRESSVSDSSSLAVTPGSVVWARTSCQTWWPAEIMEERCAPSDDARDGHVLVQF 188

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGNH SAWIDP+T+ISIFEDSFEERSNNPSNDFQ+ALKQALQRKAQL SC
Sbjct: 189 YGNHPSAWIDPSTNISIFEDSFEERSNNPSNDFQDALKQALQRKAQLSSC 238


>ref|XP_019459283.1| PREDICTED: uncharacterized protein LOC109359169 isoform X3 [Lupinus
           angustifolius]
          Length = 302

 Score =  159 bits (402), Expect = 5e-46
 Identities = 79/109 (72%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTPH             PGSVVWAR ACQMWWPAEI+EER  LSD   DGHVLVQF
Sbjct: 112 KMSESTPHGSPVSDSNYFAVTPGSVVWARTACQMWWPAEIMEERSTLSDSACDGHVLVQF 171

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGN  SAWIDP T+IS FED FEERS NPS DFQEALKQALQRK QL S
Sbjct: 172 YGNRPSAWIDPRTNISAFEDCFEERSTNPSCDFQEALKQALQRKEQLSS 220


>ref|XP_019459282.1| PREDICTED: uncharacterized protein LOC109359169 isoform X2 [Lupinus
           angustifolius]
          Length = 328

 Score =  159 bits (402), Expect = 9e-46
 Identities = 79/109 (72%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTPH             PGSVVWAR ACQMWWPAEI+EER  LSD   DGHVLVQF
Sbjct: 139 KMSESTPHGSPVSDSNYFAVTPGSVVWARTACQMWWPAEIMEERSTLSDSACDGHVLVQF 198

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGN  SAWIDP T+IS FED FEERS NPS DFQEALKQALQRK QL S
Sbjct: 199 YGNRPSAWIDPRTNISAFEDCFEERSTNPSCDFQEALKQALQRKEQLSS 247


>ref|XP_019459281.1| PREDICTED: uncharacterized protein LOC109359169 isoform X1 [Lupinus
           angustifolius]
 gb|OIW01573.1| hypothetical protein TanjilG_21153 [Lupinus angustifolius]
          Length = 329

 Score =  159 bits (402), Expect = 9e-46
 Identities = 79/109 (72%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTPH             PGSVVWAR ACQMWWPAEI+EER  LSD   DGHVLVQF
Sbjct: 139 KMSESTPHGSPVSDSNYFAVTPGSVVWARTACQMWWPAEIMEERSTLSDSACDGHVLVQF 198

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGN  SAWIDP T+IS FED FEERS NPS DFQEALKQALQRK QL S
Sbjct: 199 YGNRPSAWIDPRTNISAFEDCFEERSTNPSCDFQEALKQALQRKEQLSS 247


>ref|XP_020960159.1| uncharacterized protein LOC107648727 isoform X2 [Arachis ipaensis]
          Length = 233

 Score =  155 bits (392), Expect = 2e-45
 Identities = 74/109 (67%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMS+S PHE            PG+V+WAR  CQ WWPAEI+EER ALS  VSDG VLVQF
Sbjct: 40  KMSKSIPHETSVSDSSSLALTPGTVIWARTTCQTWWPAEIMEERSALSKPVSDGQVLVQF 99

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGNH S WIDP TDIS FED FEER +NPSNDFQEALKQA+Q+K QL S
Sbjct: 100 YGNHSSVWIDPMTDISTFEDCFEERCSNPSNDFQEALKQAIQKKEQLSS 148


>ref|XP_016208035.1| uncharacterized protein LOC107648727 isoform X1 [Arachis ipaensis]
          Length = 331

 Score =  155 bits (392), Expect = 3e-44
 Identities = 74/109 (67%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMS+S PHE            PG+V+WAR  CQ WWPAEI+EER ALS  VSDG VLVQF
Sbjct: 138 KMSKSIPHETSVSDSSSLALTPGTVIWARTTCQTWWPAEIMEERSALSKPVSDGQVLVQF 197

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGNH S WIDP TDIS FED FEER +NPSNDFQEALKQA+Q+K QL S
Sbjct: 198 YGNHSSVWIDPMTDISTFEDCFEERCSNPSNDFQEALKQAIQKKEQLSS 246


>ref|XP_020980377.1| uncharacterized protein LOC107493853 isoform X2 [Arachis
           duranensis]
          Length = 233

 Score =  152 bits (383), Expect = 5e-44
 Identities = 72/109 (66%), Positives = 80/109 (73%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMS+S PHE            PG+V+WAR  CQ WWPAEI+EER ALS  VSDG VLVQF
Sbjct: 40  KMSKSIPHETSVSDSSSLALTPGTVIWARTTCQTWWPAEIMEERSALSKPVSDGQVLVQF 99

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGNH S WIDP TDIS FED F+ER +NPS DFQEALKQA+Q+K QL S
Sbjct: 100 YGNHSSVWIDPMTDISTFEDCFQERCSNPSKDFQEALKQAIQKKEQLSS 148


>ref|XP_007162799.1| hypothetical protein PHAVU_001G181700g [Phaseolus vulgaris]
 gb|ESW34793.1| hypothetical protein PHAVU_001G181700g [Phaseolus vulgaris]
          Length = 331

 Score =  154 bits (390), Expect = 6e-44
 Identities = 77/110 (70%), Positives = 82/110 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR  CQ+WWPAEI+EE  ALS   SDGHVLV F
Sbjct: 141 KMSESTPLESSISDSSSQAATPGSVVWARTDCQLWWPAEIMEETSALSKPGSDGHVLVHF 200

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGN  SAWIDP TDIS FE+SFE RSNNPS DFQ+ALKQALQ+KAQL SC
Sbjct: 201 YGNLPSAWIDPMTDISTFEESFEARSNNPSEDFQQALKQALQKKAQLSSC 250


>gb|KHN18693.1| hypothetical protein glysoja_021444 [Glycine soja]
          Length = 328

 Score =  154 bits (388), Expect = 1e-43
 Identities = 77/110 (70%), Positives = 83/110 (75%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR   Q+WWPAEI+EE   LS+  +DGHVLVQF
Sbjct: 138 KMSESTPLESSVSDSSSLAVTPGSVVWARTDSQVWWPAEIMEETSVLSNPGNDGHVLVQF 197

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGN  SAWIDP TDIS FEDSFE+RSNNPS DFQ+ALKQALQRKAQL SC
Sbjct: 198 YGNLPSAWIDPMTDISTFEDSFEDRSNNPSEDFQKALKQALQRKAQLSSC 247


>ref|XP_003521421.1| PREDICTED: uncharacterized protein LOC100801494 [Glycine max]
 gb|KRH67778.1| hypothetical protein GLYMA_03G186700 [Glycine max]
          Length = 328

 Score =  154 bits (388), Expect = 1e-43
 Identities = 77/110 (70%), Positives = 83/110 (75%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSESTP E            PGSVVWAR   Q+WWPAEI+EE   LS+  +DGHVLVQF
Sbjct: 138 KMSESTPLESSVSDSSSLAVTPGSVVWARTDSQVWWPAEIMEETSVLSNPGNDGHVLVQF 197

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGN  SAWIDP TDIS FEDSFE+RSNNPS DFQ+ALKQALQRKAQL SC
Sbjct: 198 YGNLPSAWIDPMTDISTFEDSFEDRSNNPSEDFQKALKQALQRKAQLSSC 247


>ref|XP_015970372.1| uncharacterized protein LOC107493853 isoform X1 [Arachis
           duranensis]
          Length = 331

 Score =  152 bits (383), Expect = 7e-43
 Identities = 72/109 (66%), Positives = 80/109 (73%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMS+S PHE            PG+V+WAR  CQ WWPAEI+EER ALS  VSDG VLVQF
Sbjct: 138 KMSKSIPHETSVSDSSSLALTPGTVIWARTTCQTWWPAEIMEERSALSKPVSDGQVLVQF 197

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGS 329
           YGNH S WIDP TDIS FED F+ER +NPS DFQEALKQA+Q+K QL S
Sbjct: 198 YGNHSSVWIDPMTDISTFEDCFQERCSNPSKDFQEALKQAIQKKEQLSS 246


>ref|XP_006604592.1| PREDICTED: uncharacterized protein LOC100776360 isoform X4 [Glycine
           max]
 gb|KRG96060.1| hypothetical protein GLYMA_19G186800 [Glycine max]
          Length = 252

 Score =  146 bits (369), Expect = 1e-41
 Identities = 74/110 (67%), Positives = 82/110 (74%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSEST  E            PG+VVWAR   Q+WWPAEI+EE  ALS+  S GHVLVQF
Sbjct: 62  KMSESTRLESYVCDSSSLAATPGNVVWARTDGQVWWPAEILEETSALSNPGSGGHVLVQF 121

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGN  SAWIDP TDIS FEDSFE++SNNPS DFQ+ALK+ALQRKAQL SC
Sbjct: 122 YGNLPSAWIDPMTDISTFEDSFEDKSNNPSEDFQQALKKALQRKAQLSSC 171


>ref|XP_020211227.1| PWWP domain-containing protein 2A-like [Cajanus cajan]
 gb|KYP71003.1| hypothetical protein KK1_010246 [Cajanus cajan]
          Length = 326

 Score =  147 bits (372), Expect = 3e-41
 Identities = 74/110 (67%), Positives = 80/110 (72%)
 Frame = +3

Query: 3   KMSESTPHEXXXXXXXXXXXXPGSVVWARKACQMWWPAEIIEERCALSDRVSDGHVLVQF 182
           KMSE TP E             GSVVWAR ACQ+WWPAEI+EE   +S+  S GHVLV F
Sbjct: 136 KMSEFTPLESSASDNSSLAVTLGSVVWARTACQVWWPAEIMEETSEISNPGSGGHVLVHF 195

Query: 183 YGNHLSAWIDPATDISIFEDSFEERSNNPSNDFQEALKQALQRKAQLGSC 332
           YGN  SAWIDP TDIS FEDSF+ERSNNPS DFQ+ALKQALQRK QL SC
Sbjct: 196 YGNLPSAWIDPMTDISTFEDSFKERSNNPSEDFQQALKQALQRKTQLSSC 245

