BLASTX nr result

ID: Astragalus22_contig00020937 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00020937
         (415 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004490525.1| PREDICTED: uncharacterized protein LOC101505...   131   2e-36
dbj|GAU25346.1| hypothetical protein TSUD_216840 [Trifolium subt...   124   1e-33
gb|PNX54572.1| hypothetical protein L195_g048192 [Trifolium prat...   122   6e-33
ref|XP_003615493.1| Kazal-type serine protease inhibitor [Medica...   118   1e-31
ref|XP_003544858.1| PREDICTED: uncharacterized protein LOC100777...   107   4e-27
ref|XP_019431555.1| PREDICTED: uncharacterized protein LOC109338...   105   1e-26
ref|XP_006578865.1| PREDICTED: uncharacterized protein LOC100797...   105   1e-26
ref|XP_007143607.1| hypothetical protein PHAVU_007G085900g [Phas...   105   2e-26
ref|XP_016167513.1| uncharacterized protein LOC107609963 [Arachi...   104   4e-26
ref|XP_015932144.1| uncharacterized protein LOC107458454 [Arachi...   103   8e-26
ref|XP_007136371.1| hypothetical protein PHAVU_009G039700g [Phas...   103   9e-26
ref|XP_020209493.1| uncharacterized protein LOC109794451 [Cajanu...   101   4e-25
ref|XP_003519227.1| PREDICTED: uncharacterized protein LOC100780...   100   3e-24
ref|XP_019416640.1| PREDICTED: uncharacterized protein LOC109327...   100   4e-24
ref|XP_019460776.1| PREDICTED: uncharacterized protein LOC109360...   100   4e-24
ref|XP_017421372.1| PREDICTED: uncharacterized protein LOC108331...    98   2e-23
ref|XP_010248153.1| PREDICTED: uncharacterized protein LOC104591...    97   3e-23
ref|XP_008439348.1| PREDICTED: uncharacterized protein LOC103484...    96   1e-22
ref|XP_022922853.1| uncharacterized protein LOC111430710 [Cucurb...    96   1e-22
ref|XP_004140750.1| PREDICTED: uncharacterized protein LOC101210...    95   2e-22

>ref|XP_004490525.1| PREDICTED: uncharacterized protein LOC101505835 [Cicer arietinum]
          Length = 139

 Score =  131 bits (329), Expect = 2e-36
 Identities = 64/101 (63%), Positives = 68/101 (67%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXX 183
           ATAE EE S+L LPSAT G+  +LC G+T SSCPAKCFRTDPVCGADGVTYW        
Sbjct: 34  ATAEHEEPSVLRLPSATAGDEQTLCSGTTASSCPAKCFRTDPVCGADGVTYWCGCAEAAC 93

Query: 184 XXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
               VAKLGFCEVGNGGSA FPGQA            GFSV
Sbjct: 94  AGAKVAKLGFCEVGNGGSATFPGQALLLVHIVWLIVLGFSV 134


>dbj|GAU25346.1| hypothetical protein TSUD_216840 [Trifolium subterraneum]
          Length = 137

 Score =  124 bits (310), Expect = 1e-33
 Identities = 63/100 (63%), Positives = 66/100 (66%)
 Frame = +1

Query: 7   TAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXX 186
           TAE EE S++ LPSA  GE  SLC  +TPSSCPAKCFRTDPVCGADGVTYW         
Sbjct: 35  TAEHEELSVIRLPSA--GEEQSLCSRTTPSSCPAKCFRTDPVCGADGVTYWCGCAEAACA 92

Query: 187 XXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
              VAKLGFCEVGNGGSA FPGQA            GFSV
Sbjct: 93  GAKVAKLGFCEVGNGGSATFPGQALLLVHIVWLIVLGFSV 132


>gb|PNX54572.1| hypothetical protein L195_g048192 [Trifolium pratense]
          Length = 137

 Score =  122 bits (305), Expect = 6e-33
 Identities = 62/100 (62%), Positives = 66/100 (66%)
 Frame = +1

Query: 7   TAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXX 186
           TAE E+ S++ LPSA  GE  SLC  +TPSSCPAKCFRTDPVCGADGVTYW         
Sbjct: 35  TAEHEQFSVIRLPSA--GEDQSLCSRTTPSSCPAKCFRTDPVCGADGVTYWCGCAEAACA 92

Query: 187 XXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
              VAKLGFCEVGNGGSA FPGQA            GFSV
Sbjct: 93  GAKVAKLGFCEVGNGGSATFPGQALLLVHIVWLIVLGFSV 132


>ref|XP_003615493.1| Kazal-type serine protease inhibitor [Medicago truncatula]
 gb|AES98451.1| Kazal-type serine protease inhibitor [Medicago truncatula]
 gb|AFK33704.1| unknown [Medicago truncatula]
 gb|AFK37056.1| unknown [Medicago truncatula]
          Length = 120

 Score =  118 bits (295), Expect = 1e-31
 Identities = 61/102 (59%), Positives = 64/102 (62%)
 Frame = +1

Query: 1   MATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           + TAE EESS+L LPS  V      C  +TPSSCPAKCFRTDPVCGADGVTYW       
Sbjct: 20  LTTAENEESSVLRLPSQNV------CSVTTPSSCPAKCFRTDPVCGADGVTYWCGCAEAA 73

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAKLGFCEVGNGGSA FPGQA            GFSV
Sbjct: 74  CAGAKVAKLGFCEVGNGGSATFPGQALLLVHIVWLIVLGFSV 115


>ref|XP_003544858.1| PREDICTED: uncharacterized protein LOC100777832 [Glycine max]
 gb|KRH16966.1| hypothetical protein GLYMA_14G189000 [Glycine max]
          Length = 124

 Score =  107 bits (266), Expect = 4e-27
 Identities = 56/101 (55%), Positives = 60/101 (59%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXX 183
           A A+ E+  +L LPS       SLC  +TPSSCPAKCFR DPVCGADGVTYW        
Sbjct: 25  AAADLEDPGVLRLPS------DSLCGKTTPSSCPAKCFRADPVCGADGVTYWCGCAEAAC 78

Query: 184 XXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
               VAKLGFCEVGNGGSA  PGQA            GFSV
Sbjct: 79  AGVEVAKLGFCEVGNGGSAPIPGQALLLVHIVWLIVLGFSV 119


>ref|XP_019431555.1| PREDICTED: uncharacterized protein LOC109338718 [Lupinus
           angustifolius]
 gb|OIW16533.1| hypothetical protein TanjilG_32204 [Lupinus angustifolius]
          Length = 131

 Score =  105 bits (263), Expect = 1e-26
 Identities = 54/102 (52%), Positives = 61/102 (59%), Gaps = 2/102 (1%)
 Frame = +1

Query: 7   TAEREESSILLLPSATVGER--HSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           TAE +E SIL LPS        H LC G++P SCPAKCFRTDPVCG + VTYW       
Sbjct: 25  TAEHDEPSILRLPSQVPSGDGLHDLCAGTSPLSCPAKCFRTDPVCGVNSVTYWCGCAEAA 84

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                V+KLGFCEVGNGGS++ PGQA            GFSV
Sbjct: 85  CAGVEVSKLGFCEVGNGGSSSLPGQALLLVHIVWLIVLGFSV 126


>ref|XP_006578865.1| PREDICTED: uncharacterized protein LOC100797240 [Glycine max]
 gb|KRH64271.1| hypothetical protein GLYMA_04G226300 [Glycine max]
          Length = 131

 Score =  105 bits (263), Expect = 1e-26
 Identities = 51/101 (50%), Positives = 58/101 (57%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXX 183
           AT++   S++L LPS   GE  +LC  + PSSCP KCFRTDPVC  DGVTYW        
Sbjct: 26  ATSDNVASAVLGLPSHVAGEGKNLCSAAAPSSCPVKCFRTDPVCSVDGVTYWCGCSEAAY 85

Query: 184 XXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
               +AKLGFCEVGNGGS    GQA            GFSV
Sbjct: 86  ASAQIAKLGFCEVGNGGSVTLSGQALLLVHIVWLIVLGFSV 126


>ref|XP_007143607.1| hypothetical protein PHAVU_007G085900g [Phaseolus vulgaris]
 gb|ESW15601.1| hypothetical protein PHAVU_007G085900g [Phaseolus vulgaris]
          Length = 127

 Score =  105 bits (262), Expect = 2e-26
 Identities = 55/102 (53%), Positives = 60/102 (58%)
 Frame = +1

Query: 1   MATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           ++TA+ EE  +L LPS        LC  + PSSCPAKCFR DPVCGADGVTYW       
Sbjct: 27  VSTADLEEPGVLQLPS------QRLCGQTMPSSCPAKCFRADPVCGADGVTYWCGCAEAA 80

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAKLGFCEVGNGGSA  PGQA            GFSV
Sbjct: 81  CSGVEVAKLGFCEVGNGGSAPIPGQALLLVHIVWLILLGFSV 122


>ref|XP_016167513.1| uncharacterized protein LOC107609963 [Arachis ipaensis]
          Length = 125

 Score =  104 bits (259), Expect = 4e-26
 Identities = 55/102 (53%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGS-TPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           A +E + SS++ LPS + G    LC G+ +PSSCPAKCFRTDPVCGADGVTYW       
Sbjct: 23  AESEGKSSSVIRLPSQSAG----LCAGTPSPSSCPAKCFRTDPVCGADGVTYWCGCAEAA 78

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAKLGFCEVG+GGSA  PGQA            GFS+
Sbjct: 79  CAGAEVAKLGFCEVGSGGSAALPGQALLLLHIVWLIVLGFSM 120


>ref|XP_015932144.1| uncharacterized protein LOC107458454 [Arachis duranensis]
          Length = 125

 Score =  103 bits (257), Expect = 8e-26
 Identities = 54/102 (52%), Positives = 64/102 (62%), Gaps = 1/102 (0%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGS-TPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           A +E + SS++ LPS + G    +C G+ +PSSCPAKCFRTDPVCGADGVTYW       
Sbjct: 23  AESEGKSSSVIRLPSQSAG----ICAGTPSPSSCPAKCFRTDPVCGADGVTYWCGCAEAA 78

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAKLGFCEVG+GGSA  PGQA            GFS+
Sbjct: 79  CAGAEVAKLGFCEVGSGGSAALPGQALLLLHIVWLIVLGFSM 120


>ref|XP_007136371.1| hypothetical protein PHAVU_009G039700g [Phaseolus vulgaris]
 gb|ESW08365.1| hypothetical protein PHAVU_009G039700g [Phaseolus vulgaris]
          Length = 129

 Score =  103 bits (257), Expect = 9e-26
 Identities = 53/102 (51%), Positives = 58/102 (56%), Gaps = 1/102 (0%)
 Frame = +1

Query: 4   ATAEREESSILLLPS-ATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           ATA   +S++L LPS   V E   +C G  PSSCP KCFRTDPVCG DGVTYW       
Sbjct: 23  ATAHDVDSTVLRLPSQVAVDEGQKVCSGVAPSSCPVKCFRTDPVCGVDGVTYWCGCSEAA 82

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAK+GFCEVGNGGS    GQA            GFSV
Sbjct: 83  CAGAQVAKMGFCEVGNGGSVPLSGQALLLVHIVWLIVLGFSV 124


>ref|XP_020209493.1| uncharacterized protein LOC109794451 [Cajanus cajan]
 gb|KYP72772.1| hypothetical protein KK1_005372 [Cajanus cajan]
          Length = 120

 Score =  101 bits (252), Expect = 4e-25
 Identities = 53/102 (51%), Positives = 58/102 (56%)
 Frame = +1

Query: 1   MATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXX 180
           +  A+ E+  +L LPS        LC  +T SSCPAKCFR DPVCGADGVTYW       
Sbjct: 20  VVAADLEDRGVLRLPS------EGLCGKTTASSCPAKCFRADPVCGADGVTYWCGCAEAA 73

Query: 181 XXXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
                VAKLGFCEVGNGGSA  PGQA            GFSV
Sbjct: 74  CAGVEVAKLGFCEVGNGGSAPIPGQALLLVHIVWLIVLGFSV 115


>ref|XP_003519227.1| PREDICTED: uncharacterized protein LOC100780930 [Glycine max]
 gb|KRH72590.1| hypothetical protein GLYMA_02G221700 [Glycine max]
          Length = 125

 Score = 99.8 bits (247), Expect = 3e-24
 Identities = 52/101 (51%), Positives = 56/101 (55%)
 Frame = +1

Query: 4   ATAEREESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXX 183
           A  + E+  +L LPS       SLC  + P SCPAKCFR DPVCGADGVTYW        
Sbjct: 26  AAVDLEDPGVLQLPS------ESLCGKTMPLSCPAKCFRADPVCGADGVTYWCGCAEAAC 79

Query: 184 XXXXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
               VAK GFCEVGNGGSA  PGQA            GFSV
Sbjct: 80  AGVEVAKFGFCEVGNGGSAPIPGQALLLVHIVWLIVLGFSV 120


>ref|XP_019416640.1| PREDICTED: uncharacterized protein LOC109327918 [Lupinus
           angustifolius]
 gb|OIV96902.1| hypothetical protein TanjilG_00484 [Lupinus angustifolius]
          Length = 134

 Score = 99.8 bits (247), Expect = 4e-24
 Identities = 49/93 (52%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
 Frame = +1

Query: 31  ILLLPSATVGERHSLCDGST-PSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXXXXVAKL 207
           + +LPS + GER +LC G+  P+SCP KCFR DPVCGA+GVTYW            VAK+
Sbjct: 37  VTVLPSQSAGERQNLCVGAVWPTSCPVKCFRADPVCGANGVTYWCGCVEAACEGAKVAKV 96

Query: 208 GFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
           GFCEVGNGGSA F GQA            G SV
Sbjct: 97  GFCEVGNGGSAPFSGQALLLVHILWHIVLGLSV 129


>ref|XP_019460776.1| PREDICTED: uncharacterized protein LOC109360379 [Lupinus
           angustifolius]
 gb|OIW02513.1| hypothetical protein TanjilG_12827 [Lupinus angustifolius]
          Length = 136

 Score = 99.8 bits (247), Expect = 4e-24
 Identities = 52/99 (52%), Positives = 57/99 (57%), Gaps = 5/99 (5%)
 Frame = +1

Query: 25  SSILLLPSATVGE-----RHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXX 189
           SS+L LPS T         + LC G+  SSCPAKCFRTDPVCG +GVTYW          
Sbjct: 33  SSVLRLPSQTAASVAGEGPYRLCAGTKTSSCPAKCFRTDPVCGVNGVTYWCGCAEAACDG 92

Query: 190 XXVAKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
             VAKL FCEVGNGGSA+ PGQA            GFSV
Sbjct: 93  VEVAKLSFCEVGNGGSASLPGQALLLVHIVWLIVLGFSV 131


>ref|XP_017421372.1| PREDICTED: uncharacterized protein LOC108331236 [Vigna angularis]
 gb|KOM40968.1| hypothetical protein LR48_Vigan04g116600 [Vigna angularis]
 dbj|BAT78950.1| hypothetical protein VIGAN_02171500 [Vigna angularis var.
           angularis]
          Length = 132

 Score = 97.8 bits (242), Expect = 2e-23
 Identities = 51/102 (50%), Positives = 58/102 (56%), Gaps = 2/102 (1%)
 Frame = +1

Query: 7   TAEREESSILLLPSATVGER-HSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXX 183
           TA    S++L +PS   G++   +C G TPSSCP KCFRTDPVCG DGVTYW        
Sbjct: 26  TANDVVSTVLRMPSQVAGDQGQKICGGITPSSCPVKCFRTDPVCGVDGVTYWCGCSEAAC 85

Query: 184 XXXXVAKLGFCEVGNGGS-ANFPGQAXXXXXXXXXXXXGFSV 306
               VAK+GFCEVGNGGS     GQA            GFSV
Sbjct: 86  AGAQVAKMGFCEVGNGGSVVPLSGQALLLVHIVWLIVLGFSV 127


>ref|XP_010248153.1| PREDICTED: uncharacterized protein LOC104591052 [Nelumbo nucifera]
          Length = 133

 Score = 97.4 bits (241), Expect = 3e-23
 Identities = 49/95 (51%), Positives = 52/95 (54%)
 Frame = +1

Query: 22  ESSILLLPSATVGERHSLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXXXXVA 201
           +SS + LPS        LC GS+P SCP  CFRTDPVCG DGVTYW            VA
Sbjct: 34  DSSAIRLPSDDAVHADDLCAGSSPPSCPVNCFRTDPVCGEDGVTYWCGCADAMCAGTRVA 93

Query: 202 KLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
           KLGFCEVGNGGS    GQA            GFSV
Sbjct: 94  KLGFCEVGNGGSGPVSGQALLLVHIVWLIVLGFSV 128


>ref|XP_008439348.1| PREDICTED: uncharacterized protein LOC103484162, partial [Cucumis
           melo]
          Length = 142

 Score = 96.3 bits (238), Expect = 1e-22
 Identities = 50/96 (52%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
 Frame = +1

Query: 25  SSILLLPSATVGERHS--LCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXXXXV 198
           SS + LPS          LC  S PSSCP KCFRTDPVCG DGVTYW            V
Sbjct: 42  SSAIRLPSEATNNDGDVDLCPVSVPSSCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKV 101

Query: 199 AKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
           AK+GFCEVGNGGSA+ PGQA            G SV
Sbjct: 102 AKMGFCEVGNGGSASIPGQALLLVHILWLIILGVSV 137


>ref|XP_022922853.1| uncharacterized protein LOC111430710 [Cucurbita moschata]
 ref|XP_022985062.1| uncharacterized protein LOC111483145 [Cucurbita maxima]
 ref|XP_023552497.1| uncharacterized protein LOC111810142 [Cucurbita pepo subsp. pepo]
          Length = 130

 Score = 95.9 bits (237), Expect = 1e-22
 Identities = 52/96 (54%), Positives = 56/96 (58%), Gaps = 2/96 (2%)
 Frame = +1

Query: 25  SSILLLPS-ATVGERH-SLCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXXXXV 198
           SS + LPS AT  +R   LC  S PSSCP KCFRTDPVCG DG+TYW            V
Sbjct: 30  SSAIRLPSEATNNDRDLDLCPVSLPSSCPVKCFRTDPVCGVDGLTYWCGCADALCSGVKV 89

Query: 199 AKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
           AK+GFCEVGNGGSA  PGQA            G SV
Sbjct: 90  AKMGFCEVGNGGSAPIPGQALLLVHILWLIILGVSV 125


>ref|XP_004140750.1| PREDICTED: uncharacterized protein LOC101210178 [Cucumis sativus]
 gb|KGN57420.1| hypothetical protein Csa_3G184020 [Cucumis sativus]
          Length = 130

 Score = 95.1 bits (235), Expect = 2e-22
 Identities = 50/96 (52%), Positives = 52/96 (54%), Gaps = 2/96 (2%)
 Frame = +1

Query: 25  SSILLLPSATVGERHS--LCDGSTPSSCPAKCFRTDPVCGADGVTYWXXXXXXXXXXXXV 198
           SS + LPS          LC  S PSSCP KCFRTDPVCG DGVTYW            V
Sbjct: 30  SSAIRLPSEATNNDGDVDLCPVSVPSSCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKV 89

Query: 199 AKLGFCEVGNGGSANFPGQAXXXXXXXXXXXXGFSV 306
           AK+GFCEVGNGGSA  PGQA            G SV
Sbjct: 90  AKMGFCEVGNGGSAPIPGQALLLVHILWLIILGVSV 125
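
The hit table and the per-alignment statistics in this report follow a regular layout, so they can be read programmatically. The following is a minimal Python sketch, assuming the legacy plain-text layout shown above. The names `HIT_ROW`, `STATS`, `parse_hit_rows` and `parse_alignment_stats` are hypothetical helpers introduced here for illustration and are not part of BLAST. The percentages printed in this report appear to be truncated rather than rounded (for example, 49/93 is printed as 52%), and the sketch checks that assumption.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing
# significant alignments" table: identifier, truncated description,
# bit score, E-value.
HIT_ROW = re.compile(
    r"^(?P<id>[a-z]+\|[^|\s]+\|)\s*"   # e.g. ref|XP_004490525.1|
    r"(?P<desc>.*?)\s+"                # description (may end in "...")
    r"(?P<bits>\d+(?:\.\d+)?)\s+"      # bit score
    r"(?P<evalue>\S+)\s*$"             # E-value as printed
)

# Hypothetical pattern for the "Identities = a/b (p%)" style fields.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hit_rows(lines):
    """Return (identifier, bit_score, e_value) for each hit-table row."""
    hits = []
    for line in lines:
        m = HIT_ROW.match(line)
        if not m:
            continue  # skip headers, alignment lines, blank lines
        evalue = m.group("evalue")
        # Assumption: some legacy reports print "e-100" with no leading digit.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append((m.group("id"), float(m.group("bits")), float(evalue)))
    return hits


def parse_alignment_stats(line):
    """Return {field: (count, aligned_length, printed_percent)}."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STATS.findall(line)
    }


def percent_truncated(count, length):
    """Integer percentage, truncated as the report appears to print it."""
    return count * 100 // length
```

For example, passing the first table row to `parse_hit_rows` yields the identifier `ref|XP_004490525.1|` with a bit score of 131 and an E-value of 2e-36, as printed in the report.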

