BLASTX nr result

ID: Astragalus23_contig00026637 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00026637
         (387 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003588477.1| appr-1-P processing enzyme family protein [M...   114   6e-29
ref|XP_004498498.1| PREDICTED: O-acetyl-ADP-ribose deacetylase M...   113   2e-28
gb|PNX91865.1| Appr-1-p processing enzyme family protein [Trifol...   106   2e-26
gb|PNY11496.1| Appr-1-p processing enzyme family protein [Trifol...   106   3e-25
ref|XP_022637993.1| uncharacterized protein LOC106762897 isoform...   103   4e-25
ref|XP_014502501.1| uncharacterized protein LOC106762897 isoform...   103   9e-25
ref|XP_017431097.1| PREDICTED: macro domain-containing protein X...    98   1e-22
dbj|GAU21143.1| hypothetical protein TSUD_10590 [Trifolium subte...    97   3e-22
ref|XP_015971807.1| uncharacterized protein LOC107495209 [Arachi...    97   6e-22
ref|XP_020960731.1| uncharacterized protein LOC107605213 isoform...    94   3e-21
ref|XP_016162485.1| uncharacterized protein LOC107605213 isoform...    94   6e-21
gb|KYP72090.1| UPF0189 protein XCC3184 family [Cajanus cajan]          92   1e-20
ref|XP_020223433.1| uncharacterized protein LOC109805670 [Cajanu...    92   2e-20
dbj|GAU21142.1| hypothetical protein TSUD_10600 [Trifolium subte...    91   4e-20
ref|XP_007161504.1| hypothetical protein PHAVU_001G074800g [Phas...    90   8e-20
ref|XP_003544674.1| PREDICTED: macro domain-containing protein X...    88   1e-18
gb|OIW20025.1| hypothetical protein TanjilG_31943 [Lupinus angus...    84   7e-18
ref|XP_019430248.1| PREDICTED: uncharacterized protein LOC109337...    86   9e-18
ref|XP_023872487.1| uncharacterized protein LOC111985090 [Quercu...    85   3e-17
ref|XP_020988809.1| uncharacterized protein LOC107470226 isoform...    82   4e-17

>ref|XP_003588477.1| appr-1-P processing enzyme family protein [Medicago truncatula]
 gb|AES58728.1| appr-1-P processing enzyme family protein [Medicago truncatula]
 gb|AFK37701.1| unknown [Medicago truncatula]
          Length = 233

 Score =  114 bits (285), Expect = 6e-29
 Identities = 63/90 (70%), Positives = 70/90 (77%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASS 91
           MTT+F  SR        TSHSN+   NLR+FR+   MN SA  SSNGN GVVRFPLS+S+
Sbjct: 1   MTTIFTTSRFLLKTLKLTSHSNA--VNLRSFRL-SAMNTSAMASSNGNGGVVRFPLSSSN 57

Query: 90  ALVIQKGDITKWSIDGSTDAIVNPANERML 1
           AL+IQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 58  ALIIQKGDITKWSIDGSTDAIVNPANERML 87


>ref|XP_004498498.1| PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD2 [Cicer
           arietinum]
          Length = 230

 Score =  113 bits (282), Expect = 2e-28
 Identities = 63/90 (70%), Positives = 68/90 (75%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASS 91
           MTT+  GS         +SHS      LR+FR FG MN SARVSSNGN GVV FPLS+SS
Sbjct: 1   MTTISSGSHFLLKTLKFSSHSK-----LRSFRSFG-MNTSARVSSNGNGGVVHFPLSSSS 54

Query: 90  ALVIQKGDITKWSIDGSTDAIVNPANERML 1
           AL+IQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 55  ALIIQKGDITKWSIDGSTDAIVNPANERML 84


>gb|PNX91865.1| Appr-1-p processing enzyme family protein [Trifolium pratense]
          Length = 195

 Score =  106 bits (265), Expect = 2e-26
 Identities = 62/91 (68%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLR-NFRVFGRMNASARVSSNGNDGVVRFPLSAS 94
           M T+F  SR        TS SN+   NLR +FR  G MN SARVSSNG+ G VRFPLS+S
Sbjct: 2   MATIFSSSRFLLNTLKFTSRSNA--VNLRRSFRPSG-MNTSARVSSNGSGGAVRFPLSSS 58

Query: 93  SALVIQKGDITKWSIDGSTDAIVNPANERML 1
           +AL+IQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 59  NALIIQKGDITKWSIDGSTDAIVNPANERML 89


>gb|PNY11496.1| Appr-1-p processing enzyme family protein [Trifolium pratense]
          Length = 318

 Score =  106 bits (265), Expect = 3e-25
 Identities = 62/91 (68%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLR-NFRVFGRMNASARVSSNGNDGVVRFPLSAS 94
           M T+F  SR        TS SN+   NLR +FR  G MN SARVSSNG+ G VRFPLS+S
Sbjct: 2   MATIFSSSRFLLNTLKFTSRSNA--VNLRRSFRPSG-MNTSARVSSNGSGGAVRFPLSSS 58

Query: 93  SALVIQKGDITKWSIDGSTDAIVNPANERML 1
           +AL+IQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 59  NALIIQKGDITKWSIDGSTDAIVNPANERML 89


>ref|XP_022637993.1| uncharacterized protein LOC106762897 isoform X2 [Vigna radiata var.
           radiata]
          Length = 199

 Score =  103 bits (257), Expect = 4e-25
 Identities = 59/90 (65%), Positives = 65/90 (72%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASS 91
           M ++F GSR        +   +    NLR FRV+    A+ARVS NG  GVVRFPLSASS
Sbjct: 1   MCSIFSGSRFILKFGVNSKSKH----NLRRFRVYAMAAAAARVS-NGGGGVVRFPLSASS 55

Query: 90  ALVIQKGDITKWSIDGSTDAIVNPANERML 1
           ALVIQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 56  ALVIQKGDITKWSIDGSTDAIVNPANERML 85


>ref|XP_014502501.1| uncharacterized protein LOC106762897 isoform X1 [Vigna radiata var.
           radiata]
          Length = 231

 Score =  103 bits (257), Expect = 9e-25
 Identities = 59/90 (65%), Positives = 65/90 (72%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASS 91
           M ++F GSR        +   +    NLR FRV+    A+ARVS NG  GVVRFPLSASS
Sbjct: 1   MCSIFSGSRFILKFGVNSKSKH----NLRRFRVYAMAAAAARVS-NGGGGVVRFPLSASS 55

Query: 90  ALVIQKGDITKWSIDGSTDAIVNPANERML 1
           ALVIQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 56  ALVIQKGDITKWSIDGSTDAIVNPANERML 85


>ref|XP_017431097.1| PREDICTED: macro domain-containing protein XCC3184 [Vigna
           angularis]
 gb|KOM48347.1| hypothetical protein LR48_Vigan07g205100 [Vigna angularis]
 dbj|BAT81934.1| hypothetical protein VIGAN_03185200 [Vigna angularis var.
           angularis]
          Length = 236

 Score = 98.2 bits (243), Expect = 1e-22
 Identities = 57/94 (60%), Positives = 64/94 (68%), Gaps = 4/94 (4%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVS----SNGNDGVVRFPL 103
           M ++F GSR        +   +    NLR FRV+    A+A  +    SNG  GVVRFPL
Sbjct: 1   MCSIFSGSRFILKFGVNSKSKH----NLRRFRVYAMDAAAAAAAAARVSNGGGGVVRFPL 56

Query: 102 SASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           SASSALVIQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 57  SASSALVIQKGDITKWSIDGSTDAIVNPANERML 90


>dbj|GAU21143.1| hypothetical protein TSUD_10590 [Trifolium subterraneum]
          Length = 207

 Score = 96.7 bits (239), Expect = 3e-22
 Identities = 46/54 (85%), Positives = 51/54 (94%)
 Frame = -1

Query: 162 MNASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           MN SARVSSNG+ GV RFP+S+S+AL+IQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 1   MNTSARVSSNGDGGVARFPISSSNALIIQKGDITKWSIDGSTDAIVNPANERML 54


>ref|XP_015971807.1| uncharacterized protein LOC107495209 [Arachis duranensis]
          Length = 244

 Score = 96.7 bits (239), Expect = 6e-22
 Identities = 56/75 (74%), Positives = 61/75 (81%), Gaps = 5/75 (6%)
 Frame = -1

Query: 210 SNSHTANLRNFRVFGRMNASARVSSN---GNDGVVRFPLS--ASSALVIQKGDITKWSID 46
           SNSH A++RN R F   ++S RVSS+   G DG VRFPLS  ASSALVIQKGDITKWSID
Sbjct: 25  SNSH-ASVRNCRAFAMDSSSVRVSSSNGGGGDGEVRFPLSSAASSALVIQKGDITKWSID 83

Query: 45  GSTDAIVNPANERML 1
           GSTDAIVNPANERML
Sbjct: 84  GSTDAIVNPANERML 98


>ref|XP_020960731.1| uncharacterized protein LOC107605213 isoform X2 [Arachis ipaensis]
          Length = 211

 Score = 94.0 bits (232), Expect = 3e-21
 Identities = 54/75 (72%), Positives = 60/75 (80%), Gaps = 5/75 (6%)
 Frame = -1

Query: 210 SNSHTANLRNFRVFGRMNASARVSSN---GNDGVVRFPLS--ASSALVIQKGDITKWSID 46
           SNSH A++RN R F   ++S RVSS+   G DG VRFPLS  AS AL+IQKGDITKWSID
Sbjct: 25  SNSH-ASVRNCRPFAMDSSSVRVSSSNGGGGDGEVRFPLSSAASGALIIQKGDITKWSID 83

Query: 45  GSTDAIVNPANERML 1
           GSTDAIVNPANERML
Sbjct: 84  GSTDAIVNPANERML 98


>ref|XP_016162485.1| uncharacterized protein LOC107605213 isoform X1 [Arachis ipaensis]
          Length = 244

 Score = 94.0 bits (232), Expect = 6e-21
 Identities = 54/75 (72%), Positives = 60/75 (80%), Gaps = 5/75 (6%)
 Frame = -1

Query: 210 SNSHTANLRNFRVFGRMNASARVSSN---GNDGVVRFPLS--ASSALVIQKGDITKWSID 46
           SNSH A++RN R F   ++S RVSS+   G DG VRFPLS  AS AL+IQKGDITKWSID
Sbjct: 25  SNSH-ASVRNCRPFAMDSSSVRVSSSNGGGGDGEVRFPLSSAASGALIIQKGDITKWSID 83

Query: 45  GSTDAIVNPANERML 1
           GSTDAIVNPANERML
Sbjct: 84  GSTDAIVNPANERML 98


>gb|KYP72090.1| UPF0189 protein XCC3184 family [Cajanus cajan]
          Length = 213

 Score = 92.4 bits (228), Expect = 1e-20
 Identities = 51/65 (78%), Positives = 55/65 (84%)
 Frame = -1

Query: 195 ANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPA 16
           ANLR FR    M+A+ RVS NG   VVRFPLS+SSALVIQKGDIT+WSIDGSTDAIVNPA
Sbjct: 20  ANLRIFRACA-MDAAGRVS-NGGGSVVRFPLSSSSALVIQKGDITRWSIDGSTDAIVNPA 77

Query: 15  NERML 1
           NERML
Sbjct: 78  NERML 82


>ref|XP_020223433.1| uncharacterized protein LOC109805670 [Cajanus cajan]
          Length = 228

 Score = 92.4 bits (228), Expect = 2e-20
 Identities = 51/65 (78%), Positives = 55/65 (84%)
 Frame = -1

Query: 195 ANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPA 16
           ANLR FR    M+A+ RVS NG   VVRFPLS+SSALVIQKGDIT+WSIDGSTDAIVNPA
Sbjct: 20  ANLRIFRACA-MDAAGRVS-NGGGSVVRFPLSSSSALVIQKGDITRWSIDGSTDAIVNPA 77

Query: 15  NERML 1
           NERML
Sbjct: 78  NERML 82


>dbj|GAU21142.1| hypothetical protein TSUD_10600 [Trifolium subterraneum]
          Length = 219

 Score = 91.3 bits (225), Expect = 4e-20
 Identities = 51/83 (61%), Positives = 59/83 (71%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGRMNASARVSSNGNDGVVRFPLSASS 91
           MTT+F  SR        TS SN+   NLR       MN SARVSSNG+ GV RFP+S+S+
Sbjct: 1   MTTIFSSSRFLLNTVKLTSRSNA--VNLRRSFRPSEMNTSARVSSNGDGGVARFPISSSN 58

Query: 90  ALVIQKGDITKWSIDGSTDAIVN 22
           AL+IQKGDITKWSIDGSTDAIV+
Sbjct: 59  ALIIQKGDITKWSIDGSTDAIVS 81


>ref|XP_007161504.1| hypothetical protein PHAVU_001G074800g [Phaseolus vulgaris]
 gb|ESW33498.1| hypothetical protein PHAVU_001G074800g [Phaseolus vulgaris]
          Length = 201

 Score = 90.1 bits (222), Expect = 8e-20
 Identities = 48/52 (92%), Positives = 48/52 (92%)
 Frame = -1

Query: 156 ASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           A ARVS NG  GVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML
Sbjct: 5   AVARVS-NGGGGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 55


>ref|XP_003544674.1| PREDICTED: macro domain-containing protein XCC3184 [Glycine max]
 gb|KHN48484.1| Macro domain-containing protein [Glycine soja]
 gb|KRH16274.1| hypothetical protein GLYMA_14G145000 [Glycine max]
          Length = 236

 Score = 87.8 bits (216), Expect = 1e-18
 Identities = 56/93 (60%), Positives = 64/93 (68%), Gaps = 3/93 (3%)
 Frame = -1

Query: 270 MTTLFRGSRXXXXXXXXTSHSNSHTANLRNFRVFGR-MNASARVS--SNGNDGVVRFPLS 100
           M  +F GSR        T+   +   +LR FR+    M+A+A V   SNG  GVVRFPLS
Sbjct: 1   MRAIFSGSRFALKFGWNTTSKGN--LSLRRFRLCASAMDAAATVGRVSNGG-GVVRFPLS 57

Query: 99  ASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           ASSAL +QKGDITKWSIDGSTDAIVNPANERML
Sbjct: 58  ASSALFMQKGDITKWSIDGSTDAIVNPANERML 90


>gb|OIW20025.1| hypothetical protein TanjilG_31943 [Lupinus angustifolius]
          Length = 155

 Score = 84.0 bits (206), Expect = 7e-18
 Identities = 41/51 (80%), Positives = 44/51 (86%)
 Frame = -1

Query: 153 SARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           +A   SNGN+G VRFPLS SS LVIQKGDITKW IDGS+DAIVNPANERML
Sbjct: 6   NASTLSNGNNGAVRFPLSPSSTLVIQKGDITKWFIDGSSDAIVNPANERML 56


>ref|XP_019430248.1| PREDICTED: uncharacterized protein LOC109337680 [Lupinus
           angustifolius]
          Length = 233

 Score = 85.5 bits (210), Expect = 9e-18
 Identities = 45/60 (75%), Positives = 48/60 (80%)
 Frame = -1

Query: 180 FRVFGRMNASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           FR+    NAS    SNGN+G VRFPLS SS LVIQKGDITKW IDGS+DAIVNPANERML
Sbjct: 30  FRMAATNNASTL--SNGNNGAVRFPLSPSSTLVIQKGDITKWFIDGSSDAIVNPANERML 87


>ref|XP_023872487.1| uncharacterized protein LOC111985090 [Quercus suber]
 gb|POE85831.1| macro domain-containing protein [Quercus suber]
          Length = 268

 Score = 84.7 bits (208), Expect = 3e-17
 Identities = 40/53 (75%), Positives = 47/53 (88%)
 Frame = -1

Query: 159 NASARVSSNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           +++ARVS+ G DGVVRFPLS S+ALVIQKGDITKW +D  TDAIVNPANE+ML
Sbjct: 70  SSTARVSNEGGDGVVRFPLSPSAALVIQKGDITKWFVDSKTDAIVNPANEQML 122


>ref|XP_020988809.1| uncharacterized protein LOC107470226 isoform X3 [Arachis
           duranensis]
          Length = 174

 Score = 82.4 bits (202), Expect = 4e-17
 Identities = 40/46 (86%), Positives = 42/46 (91%)
 Frame = -1

Query: 138 SNGNDGVVRFPLSASSALVIQKGDITKWSIDGSTDAIVNPANERML 1
           SNG   VVRFPLS SS+LVIQKGDITKWSIDGS+DAIVNPANERML
Sbjct: 8   SNGESSVVRFPLSPSSSLVIQKGDITKWSIDGSSDAIVNPANERML 53

