BLASTX nr result
ID: Astragalus23_contig00030971
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00030971 (389 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] 109 4e-27 gb|PNY05892.1| ribonuclease H [Trifolium pratense] 104 9e-24 gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] 99 1e-23 gb|PNY01502.1| ribonuclease H [Trifolium pratense] 104 2e-23 gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] 98 2e-22 gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Gly... 89 4e-20 dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt... 94 1e-19 dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subt... 91 1e-18 gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan] 86 4e-18 ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanu... 86 8e-18 gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan] 84 1e-17 gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo... 85 2e-17 dbj|GAU10013.1| hypothetical protein TSUD_415800, partial [Trifo... 82 5e-17 gb|PNX99671.1| cysteine-rich receptor-like protein kinase, parti... 85 1e-16 gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly... 78 6e-16 gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly... 79 2e-15 dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subt... 80 2e-15 gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family ... 80 3e-15 dbj|GAU37587.1| hypothetical protein TSUD_395600 [Trifolium subt... 78 4e-15 dbj|GAU45061.1| hypothetical protein TSUD_198490 [Trifolium subt... 80 4e-15 >gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] Length = 229 Score = 109 bits (273), Expect = 4e-27 Identities = 44/119 (36%), Positives = 74/119 (62%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C LC E + KH+F C + +K+WQ+V W++V V +++ F G+L+KG+ R Sbjct: 100 CVLCSSEDETVKHIFLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKR 159 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDW 379 V+H+ W A +W +WL RN+V+F+ A I +++ IK +W WF++R+GR ++DW Sbjct: 160 VKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMARQGRTCWDGWSDW 218 >gb|PNY05892.1| ribonuclease H [Trifolium pratense] Length = 455 Score = 104 bits (260), Expect = 9e-24 Identities = 41/119 (34%), Positives = 67/119 (56%) Frame = +2 Query: 32 CFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIARVQH 211 CF E+ HLF C ++++VW+ ++ W+ E F G +VK + +V+H Sbjct: 329 CFTEIEDCMHLFFNCKLMQQVWRSIYKWLGCAYYNYGEGWKHFNFFGGIVKSKKGEKVKH 388 Query: 212 VFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWCSN 388 + W T W +W RN ++F+G +A+ + +V IK +SW WFI R GR+ ++DWC N Sbjct: 389 LIWLVTTWCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFIGRSGRHCPYLYSDWCVN 447 >gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] Length = 160 Score = 99.0 bits (245), Expect = 1e-23 Identities = 41/116 (35%), Positives = 71/116 (61%) Frame = +2 Query: 29 LCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIARVQ 208 LC E + KH+ C + +K+WQ+V W++V V +++ F G+L+KG+ RV+ Sbjct: 33 LCSSEDETVKHILLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKKQKRVK 92 Query: 209 HVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTD 376 H+ W A +W +WL RN+V+F+ A I +++ IK +W WF +R+GR + ++D Sbjct: 93 HLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFKARQGRICWVGWSD 148 >gb|PNY01502.1| ribonuclease H [Trifolium pratense] Length = 554 Score = 104 bits (259), Expect = 2e-23 Identities = 47/120 (39%), Positives = 65/120 (54%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C CFR HLF CY + VW+ V W+ + + E + F G+L K + R Sbjct: 425 CVFCFRHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKVKDKGR 484 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 V+H+ W AT W +W RN+V+F+G I S+++ SIK SW WF R GRN F+ WC Sbjct: 485 VRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMWFNGRYGRNVCCPFSSWC 544 >gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] Length = 255 Score = 98.2 bits (243), Expect = 2e-22 Identities = 44/120 (36%), Positives = 62/120 (51%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C CF HLF CY + VW+ V W+ + + E + F GE K + Sbjct: 126 CVFCFWHREDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGIDHFLLFGEFFKVKDKGH 185 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 V+H+ W AT W +W RN+V+F+G I + + ++ SIK SW WF R GRN F+ WC Sbjct: 186 VRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIWFNGRYGRNVCCPFSSWC 245 >gb|KHN20429.1| hypothetical protein glysoja_044415, partial [Glycine soja] Length = 118 Score = 88.6 bits (218), Expect = 4e-20 Identities = 37/114 (32%), Positives = 60/114 (52%) Frame = +2 Query: 41 EVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIARVQHVFW 220 +V HLF C K+W V AW+ V V+ ++ +F +G V+ R + R+ +FW Sbjct: 1 KVEDKNHLFVNCSFNSKIWYVVLAWLGVSVVLPNDAKSLFIWMGGFVRVRRVKRLIFIFW 60 Query: 221 AATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 TVW +W RN+++F+ + + IK +SW W S+ G + L F+ WC Sbjct: 61 HVTVWCLWNLRNQIIFKSDSIEFLACMAHIKIISWQWLFSKNGVKTSLFFSSWC 114 >dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 93.6 bits (231), Expect = 1e-19 Identities = 41/120 (34%), Positives = 62/120 (51%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C + HLF C ++++VW++VF W + + F G L+K + + Sbjct: 637 CVFCSSYDEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKTKRFEK 696 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 V+H+ W AT W +W RN V+F G + SS+V IK +S W R G S ++F DWC Sbjct: 697 VRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLWLSGRYGHISSISFPDWC 756 >dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subterraneum] Length = 1653 Score = 90.9 bits (224), Expect = 1e-18 Identities = 42/120 (35%), Positives = 59/120 (49%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C HLF C V VW+ V+ W+ + E FK G++V I R Sbjct: 1523 CVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDYHAGAEGWSHFKVFGDMVNSTNIER 1582 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 V+H+ W AT W +W RN V+F G + SS++ IK +S W R G S ++F+ WC Sbjct: 1583 VRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAWVSGRYGHKSCISFSLWC 1642 >gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan] Length = 223 Score = 86.3 bits (212), Expect = 4e-18 Identities = 39/120 (32%), Positives = 57/120 (47%), Gaps = 1/120 (0%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C HLF C + VW + W+ + D V + V G+ AR Sbjct: 94 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 153 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSG-LTFTDW 379 V W ATVW +WL RN+++F G+ ++ +V S+KY SW W + G N G + F +W Sbjct: 154 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA--GMNHGNIPFVNW 211 >ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanus cajan] Length = 265 Score = 86.3 bits (212), Expect = 8e-18 Identities = 39/120 (32%), Positives = 57/120 (47%), Gaps = 1/120 (0%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C HLF C + VW + W+ + D V + V G+ AR Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 195 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSG-LTFTDW 379 V W ATVW +WL RN+++F G+ ++ +V S+KY SW W + G N G + F +W Sbjct: 196 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA--GMNHGNIPFVNW 253 >gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan] Length = 186 Score = 84.0 bits (206), Expect = 1e-17 Identities = 35/120 (29%), Positives = 57/120 (47%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C + S++HLF C R VW VF W + + + +F + + + + Sbjct: 57 CPFCSTTLESSQHLFLECEFSRNVWHNVFTWTGIRLELPSSLGHLFFLLRSMFLDKVKRK 116 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 + +FW AT+W +W RN ++F+ + IK +SW W++ R G G TF WC Sbjct: 117 WRDIFWHATIWVLWTNRNEIVFRNKTVSHFDFPYQIKIISWHWWMYRNGCRPGFTFAAWC 176 >gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense] Length = 248 Score = 84.7 bits (208), Expect = 2e-17 Identities = 42/114 (36%), Positives = 62/114 (54%), Gaps = 6/114 (5%) Frame = +2 Query: 59 HLFCRCYMVRKVWQKVFAWMNVEEVIQDEV----MLMFKEVGELVKGRTIARVQHVFWAA 226 HLF C + VW+ V+ W+ ++ + L+F ++ KG RV+H+FW A Sbjct: 128 HLFFHCAFSKGVWESVYRWLGMKSISAGAEGWNHFLLFDDMITAKKGE---RVRHLFWLA 184 Query: 227 TVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNS--GLTFTDWC 382 T W +W RN V+F G I N SS+V I SW WF R G +S ++F++WC Sbjct: 185 TTWNIWKLRNNVVFNGVIPNASSLVEDIIANSWLWFNGRYGHHSCTSMSFSNWC 238 >dbj|GAU10013.1| hypothetical protein TSUD_415800, partial [Trifolium subterraneum] Length = 169 Score = 82.0 bits (201), Expect = 5e-17 Identities = 37/120 (30%), Positives = 61/120 (50%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C V S+ HLF C ++WQ VF W+ + VI + ++F + + I + Sbjct: 42 CVHCHGSVESSLHLFLFCSFSVQIWQAVFRWLGLVVVIPPNMFVLFDCLIGAASNKKIRK 101 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 + W AT+W +W +RN ++F G+ + + IK +SW W +SR + L F +WC Sbjct: 102 GYALIWHATIWMLWKSRNEIIFSNGVKDSEKVFDEIKLLSWRWGLSRHSIPTCL-FYEWC 160 >gb|PNX99671.1| cysteine-rich receptor-like protein kinase, partial [Trifolium pratense] Length = 1007 Score = 85.1 bits (209), Expect = 1e-16 Identities = 42/119 (35%), Positives = 62/119 (52%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C +V ++ HLF C V VW ++F W+ V VI + +F+ + E K I R Sbjct: 880 CMGCIGKVETSTHLFLHCPCVMMVWSEIFKWLGVLVVIPPSIASLFEVLKEAAKNVKIRR 939 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDW 379 + W AT+W +W ARN +F G+ N IV IK +SW W ++R + S F +W Sbjct: 940 GFVMIWHATLWSIWKARNNAIFATGVFNPRMIVEDIKVLSWKWCLARL-KVSPCLFYEW 997 >gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja] Length = 114 Score = 77.8 bits (190), Expect = 6e-16 Identities = 28/107 (26%), Positives = 56/107 (52%) Frame = +2 Query: 62 LFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIARVQHVFWAATVWQV 241 LF C ++VWQ + W+ + + + ++ ++G ++G+ R +H+ W T W + Sbjct: 1 LFVFCPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSI 60 Query: 242 WLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 W RN V+F+ ++++ + IK +SW W + + G F+ WC Sbjct: 61 WCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGKPGFFFSSWC 107 >gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja] Length = 211 Score = 79.0 bits (193), Expect = 2e-15 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 1/120 (0%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIA- 199 C+LC + HLF C + +W+++ +W+ + +VI + F E L+K T Sbjct: 85 CSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRN 144 Query: 200 RVQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDW 379 +V +FW AT+W +W RN +F+ +I + IK++ W WF+ + G +G +DW Sbjct: 145 KVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNISDW 204 >dbj|GAU35033.1| hypothetical protein TSUD_103560 [Trifolium subterraneum] Length = 311 Score = 80.5 bits (197), Expect = 2e-15 Identities = 33/108 (30%), Positives = 58/108 (53%) Frame = +2 Query: 20 VCALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIA 199 VC LC V ++ HLF C+ +VW+++ W+ + ++ ++ F E G+ Sbjct: 183 VCVLCGNCVETSVHLFVYCHFATQVWEQIITWLGMVFMLPQSLVSFFSFFAETSGGKKRR 242 Query: 200 RVQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISR 343 + + W A VW +W RNR++F+ G +++ +V IK SW W+I R Sbjct: 243 QGLIMIWNAVVWALWRQRNRIIFENGTGDLNGVVEEIKVSSWKWWIGR 290 >gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 346 Score = 80.5 bits (197), Expect = 3e-15 Identities = 36/122 (29%), Positives = 60/122 (49%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C ++ + H+FC C MV +VW++ +W+N + ++ F + ++ + A Sbjct: 215 CIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNESEAE 274 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDWC 382 H W AT+W +W RN+ +F GG SIV I W W +S + +F+ W Sbjct: 275 TWHTVWLATMWTLWRYRNKCIFDGGTFEQGSIVRDILIFCWRW-LSTLKPSFAYSFSQWS 333 Query: 383 SN 388 SN Sbjct: 334 SN 335 >dbj|GAU37587.1| hypothetical protein TSUD_395600 [Trifolium subterraneum] Length = 209 Score = 78.2 bits (191), Expect = 4e-15 Identities = 38/107 (35%), Positives = 53/107 (49%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 C C V S+ HLF C VW VF W+ V V ++L+F+ + + + I Sbjct: 82 CVGCVGNVESSSHLFLHCPSAMMVWYDVFRWLGVIIVTPPTMLLLFEVMRGSTRNKKIRL 141 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISR 343 + W AT+W +W ARN+ F G N IV IK VSW W ++R Sbjct: 142 GYLMIWHATLWCIWKARNKACFANGTFNPKVIVEDIKVVSWKWCLAR 188 >dbj|GAU45061.1| hypothetical protein TSUD_198490 [Trifolium subterraneum] Length = 310 Score = 79.7 bits (195), Expect = 4e-15 Identities = 37/119 (31%), Positives = 59/119 (49%) Frame = +2 Query: 23 CALCFREVGSAKHLFCRCYMVRKVWQKVFAWMNVEEVIQDEVMLMFKEVGELVKGRTIAR 202 CA C + S HLF C + VW +F W+ V + +F+ GR I Sbjct: 186 CAFCGASLKSVDHLFVTCDSISPVWYSLFRWLGFRFVSPPSISSVFQGFLGFRVGRKIRL 245 Query: 203 VQHVFWAATVWQVWLARNRVMFQGGIANISSIVTSIKYVSWGWFISRRGRNSGLTFTDW 379 + W ATVW +W +RN V+F G ++ S+V +K+ SW W+++ + S +F +W Sbjct: 246 EWLLIWHATVWTIWNSRNDVIFARGTVSVESLVDKVKFSSWKWYLA-KNLGSPCSFYEW 303