BLASTX nr result
ID: Astragalus22_contig00030842
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00030842
         (372 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]     84    4e-17
gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Gly...   80    1e-16
gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly...   76    2e-15
gb|PNY05892.1| ribonuclease H [Trifolium pratense]                    80    4e-15
gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]     77    4e-15
gb|KRH35933.1| hypothetical protein GLYMA_10G272900 [Glycine max]     69    8e-13
gb|PNY01502.1| ribonuclease H [Trifolium pratense]                    72    5e-12
gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]           69    2e-11
gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]        67    4e-11
dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subt...   69    5e-11
dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt...   68    9e-11
gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Gly...   66    1e-10
gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo...   66    2e-10
dbj|GAU46742.1| hypothetical protein TSUD_286020 [Trifolium subt...   66    3e-10
gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]        65    3e-10
ref|XP_022024524.1| uncharacterized protein LOC110924846 [Helian...   65    4e-10
ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanu...   65    5e-10
gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]           66    6e-10
dbj|GAU48811.1| hypothetical protein TSUD_406440 [Trifolium subt...   65    7e-10
gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine...   65    9e-10

>gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja]
          Length = 229

 Score = 83.6 bits (205), Expect = 4e-17
 Identities = 43/109 (39%), Positives = 59/109 (54%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           D +C  C    E  +H FL C  A  IWQQV   L V    E   +Q H    G L KG
Sbjct: 97  DRRCVLCSSEDETVKHIFLDCRVAKKIWQQVCLWLDVPV-VEGEDIQAHFMAFGKLIKGK 155

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
           K++++K+LIW+   W IWL+ N ++FK E A I  +I  IK  +W+ F+
Sbjct: 156 KQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFM 204

>gb|KHN10514.1| hypothetical protein glysoja_040669, partial [Glycine soja]
          Length = 132

 Score = 80.1 bits (196), Expect = 1e-16
 Identities = 35/98 (35%), Positives = 54/98 (55%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           +D  C FC    ED +H F  CNFA  +W  +Y  LG     + N ++ H    G +C+G
Sbjct: 36  DDVGCVFCEHDWEDVDHLFPGCNFAYNVWIAIYSWLGFV-MIQHNQVKYHYVQHGLVCRG 94

Query: 187 SKERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLI 74
            +  K+ + IW  TCWC+WL  N I+F+ E A++  ++
Sbjct: 95  KRLSKVCHFIWHATCWCLWLHRNRIIFQEEQADVQLVV 132

>gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja]
          Length = 114

 Score = 76.3 bits (186), Expect = 2e-15
 Identities = 35/94 (37%), Positives = 56/94 (59%)
 Frame = -3

Query: 313 FLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKIKYLIWLTTCWCI 134
           F+ C FA  +WQ +   LG + +   N +QE    LG   +G K+R+ K+L+W  TCW I
Sbjct: 2   FVFCPFAKQVWQGILNWLGYSFSLP-NNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSI 60

Query: 133 WLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYK 32
           W   N ++F+N   ++++ I+ IKS+SW   +YK
Sbjct: 61  WCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYK 94

>gb|PNY05892.1| ribonuclease H [Trifolium pratense]
          Length = 455

 Score = 80.5 bits (197), Expect = 4e-15
 Identities = 39/104 (37%), Positives = 54/104 (51%)
 Frame = -3

Query: 349 FCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKI 170
           FC    EDC H F  C     +W+ +Y  LG A    +    +H  + G + K  K  K+
Sbjct: 328 FCFTEIEDCMHLFFNCKLMQQVWRSIYKWLGCA-YYNYGEGWKHFNFFGGIVKSKKGEKV 386

Query: 169 KYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
           K+LIWL T WCIW   N I+F+   A+   L+ +IK +SW  FI
Sbjct: 387 KHLIWLVTTWCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFI 430

>gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja]
          Length = 160

 Score = 76.6 bits (187), Expect = 4e-15
 Identities = 41/108 (37%), Positives = 56/108 (51%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           D +   C    E  +H  L C  A  IWQQV   L V    E   +Q H    G L KG
Sbjct: 28  DRRYVLCSSEDETVKHILLDCRVAKKIWQQVCLWLDVPV-VEGEDIQAHFMAFGKLIKGK 86

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*F 41
           K++++K+LIW+   W IWL+ N ++FK E A I  +I  IK  +W+ F
Sbjct: 87  KQKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWF 134

>gb|KRH35933.1| hypothetical protein GLYMA_10G272900 [Glycine max]
          Length = 79

 Score = 68.6 bits (166), Expect = 8e-13
 Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
 Frame = -3

Query: 343 MQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEF-NTLQEHCYWLGTLCKGSKERKIK 167
           M+  ED  H F  C   + IW+QV   +GV  N     T  +     G L  G K R++K
Sbjct: 1   MEEEEDVHHLFYACQVTASIWRQVIAWVGV--NLVMPQTFGQLFKQFGGLLPGRKSRRVK 58

Query: 166 YLIWLTTCWCIWLSLNGIVFK 104
           +++W  TCWC+WLS N I+FK
Sbjct: 59  HILWHATCWCVWLSRNAIIFK 79

>gb|PNY01502.1| ribonuclease H [Trifolium pratense]
          Length = 554

 Score = 71.6 bits (174), Expect = 5e-12
 Identities = 34/108 (31%), Positives = 56/108 (51%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +  C FC +HRED  H F  C F+  +W+ V   LG++   +   + +H    G L K
Sbjct: 422 ELSCVFCFRHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGI-DHFMLFGDLFKVK 480

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*F 41
            + ++++L+WL T W +W   N ++FK +     +L+  IK  SW  F
Sbjct: 481 DKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMWF 528

>gb|PNX71113.1| pantothenate synthetase [Trifolium pratense]
          Length = 255

 Score = 68.6 bits (166), Expect = 2e-11
 Identities = 32/105 (30%), Positives = 52/105 (49%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +  C FC  HRED  H F  C F+  +W+ V   LG++   +   + +H    G   K
Sbjct: 123 ELSCVFCFWHREDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGI-DHFLLFGEFFKVK 181

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            +  +++L+WL T W +W   N ++FK +  +   L+  IK  SW
Sbjct: 182 DKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSW 226

>gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan]
          Length = 186

 Score = 67.0 bits (162), Expect = 4e-11
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 1/117 (0%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWL-GTLCKGSKE 179
           CPFC    E  +H FL C F+  +W  V+   G+    E  +   H ++L  ++     +
Sbjct: 57  CPFCSTTLESSQHLFLECEFSRNVWHNVFTWTGI--RLELPSSLGHLFFLLRSMFLDKVK 114

Query: 178 RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSNFS 8
           RK + + W  T W +W + N IVF+N+  +      +IK +SW  ++Y+  C   F+
Sbjct: 115 RKWRDIFWHATIWVLWTNRNEIVFRNKTVSHFDFPYQIKIISWHWWMYRNGCRPGFT 171

>dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subterraneum]
          Length = 333

 Score = 68.6 bits (166), Expect = 5e-11
 Identities = 40/120 (33%), Positives = 58/120 (48%), Gaps = 1/120 (0%)
 Frame = -3

Query: 370 QEDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCK 191
           +EDA CP C +  E   H FL C FAS +W +V   LG       + +  H   +G  C
Sbjct: 201 REDALCPTCGETIETVRHLFLHCRFASAVWYRVNRWLGTMVVIPHDIIMSHGLLVG--CG 258

Query: 190 GSKE-RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSN 14
           G+K+ RK   ++WL   W IW   N  VF N    ++  +  I+ LSW  ++ K    S+
Sbjct: 259 GNKKVRKGYSIVWLAFVWVIWRFRNDRVFNNINGEVEDAMDSIQRLSWQWYLLKTAKGSS 318

>dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score = 68.2 bits (165), Expect = 9e-11
 Identities = 34/105 (32%), Positives = 53/105 (50%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           +D  C FC  + ED  H F  C+    +W++V+   G +   E +    H    G+L K
Sbjct: 633 QDLHCVFCSSYDEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWN-HFNIFGSLLKT 691

Query: 187 SKERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLS 53
            +  K+++LIWL T W IW   N +VF     +  SL+  IK++S
Sbjct: 692 KRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTIS 736

>gb|KHN39553.1| hypothetical protein glysoja_045723, partial [Glycine soja]
          Length = 211

 Score = 65.9 bits (159), Expect = 1e-10
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 2/121 (1%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +++C  C    E+  H F  C+F+  IW+++   +G+        +Q    +   L   +
Sbjct: 82  NSRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNT 141

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKV--VCNSNF 11
              K+ ++ WL T W IW   N  +FK E  +I   I +IK + W+ F+ KV  V  SN
Sbjct: 142 SRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVTGSNI 201

Query: 10  S 8
           S
Sbjct: 202 S 202

>gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense]
          Length = 248

 Score = 65.9 bits (159), Expect = 2e-10
 Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
 Frame = -3

Query: 343 MQHR-EDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKIK 167
           ++HR EDC H F  C F+ G+W+ VY  LG+           H      +    K  +++
Sbjct: 119 LEHRQEDCSHLFFHCAFSKGVWESVYRWLGMKSISAGAEGWNHFLLFDDMITAKKGERVR 178

Query: 166 YLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           +L WL T W IW   N +VF     N  SL+  I + SW
Sbjct: 179 HLFWLATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSW 217

>dbj|GAU46742.1| hypothetical protein TSUD_286020 [Trifolium subterraneum]
          Length = 258

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 35/119 (29%), Positives = 59/119 (49%), Gaps = 1/119 (0%)
 Frame = -3

Query: 367 EDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKG 188
           ED +C  C +  E   H FL C F + +W  V   LGV      + +  +   +G  C G
Sbjct: 127 EDTRCSLCGELAETSCHLFLHCRFVAAVWYAVIKWLGVVVVLPADPIMSYGILVG--CGG 184

Query: 187 SKE-RKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKVVCNSN 14
           +K+ RK   ++W+   W +W   N  VF N   +++ +I RI+ +SW  +++K    S+
Sbjct: 185 NKKIRKSLSIVWMDFVWVLWRVRNDRVFNNVDGSVEDVIERIQRISWQWYLHKTTMGSS 243

>gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan]
          Length = 223

 Score = 65.1 bits (157), Expect = 3e-10
 Identities = 31/102 (30%), Positives = 50/102 (49%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
           C FC    ED  H FL C+ AS +W  +   LG + +C  ++++     +     G K
Sbjct: 94  CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFS-SCLSDSVEGQLLTMSGFVAGKKPA 152

Query: 175 KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           ++   IW+ T W +WL  N IVF N   ++  ++  +K  SW
Sbjct: 153 RVVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSW 194

>ref|XP_022024524.1| uncharacterized protein LOC110924846 [Helianthus annuus]
          Length = 203

 Score = 64.7 bits (156), Expect = 4e-10
 Identities = 34/100 (34%), Positives = 52/100 (52%)
 Frame = -3

Query: 349 FCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKERKI 170
           FC ++ E C+H F+ C+FA  +WQ V   L +     F    +    L  L  GS+ +K+
Sbjct: 81  FCGEYVESCDHIFVSCHFAQMVWQNVARWLRIQSIIAFGI--QDLLTLHGLSSGSRRKKV 138

Query: 169 KYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            + I L   WCIW + N IVF+N   N    +  IKS+++
Sbjct: 139 IHAIVLVVLWCIWKTRNDIVFRNVVPNYARTLDEIKSMAF 178

>ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanus cajan]
          Length = 265

 Score = 65.1 bits (157), Expect = 5e-10
 Identities = 31/102 (30%), Positives = 50/102 (49%)
 Frame = -3

Query: 355 CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
           C FC    ED  H FL C+ AS +W  +   LG + +C  ++++     +     G K
Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFS-SCLSDSVEGQLLTMSGFVAGKKPA 194

Query: 175 KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
           ++   IW+ T W +WL  N IVF N   ++  ++  +K  SW
Sbjct: 195 RVVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSW 236

>gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1375

 Score = 65.9 bits (159), Expect = 6e-10
 Identities = 35/102 (34%), Positives = 52/102 (50%)
 Frame = -3

Query: 355  CPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGSKER 176
            C  C+   E   H FL C+FAS IW +++  LG+      N  Q    ++G    G K R
Sbjct: 1248 CAICVGVDESSVHLFLHCDFASCIWYEIFRWLGLVIVLPANLFQCFDSFIGAAV-GKKCR 1306

Query: 175  KIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSW 50
            K+  +IW T  W IW + N ++F N    ++ ++  IK LSW
Sbjct: 1307 KMFRMIWHTIVWLIWKNRNDVIFSNSSKEVNEVVDDIKQLSW 1348

>dbj|GAU48811.1| hypothetical protein TSUD_406440 [Trifolium subterraneum]
          Length = 563

 Score = 65.5 bits (158), Expect = 7e-10
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 1/112 (0%)
 Frame = -3

Query: 370 QEDAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCK 191
           +E   CP C + RE   H FL C  A+ IW  +   LGV         Q +  ++G  C
Sbjct: 431 EEVVFCPLCEEERETSCHLFLHCRVAASIWYGLTRWLGVVVVLPPLVAQSYAGFVG--CG 488

Query: 190 GSKERKIKY-LIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FI 38
            +K+RK  + ++WL   W +W + N  VF N+   ++ ++V I+ LSW  F+
Sbjct: 489 SNKKRKKGFSIVWLAFVWALWQARNDRVFNNKEVKVEEVVVYIQRLSWRWFL 540

>gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja]
          Length = 363

 Score = 65.1 bits (157), Expect = 9e-10
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 2/121 (1%)
 Frame = -3

Query: 364 DAQCPFCMQHREDCEHTFLVCNFASGIWQQVYC*LGVAGNCEFNTLQEHCYWLGTLCKGS 185
           +++C  C    E+  H F  C+F+  IW+++   +G+        +Q    +   L   +
Sbjct: 234 NSRCSLCDSCDENVVHLFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNT 293

Query: 184 KERKIKYLIWLTTCWCIWLSLNGIVFKNE*ANIDSLIVRIKSLSWS*FIYKV--VCNSNF 11
              K+ ++ WL T W IW   N  +FK E  +I   I +IK + W+ F+ KV  V  SN
Sbjct: 294 SRNKVPFMFWLATLWIIWQVRNNSIFKEEEKDIPKTINQIKHICWAWFMGKVGGVGGSNI 353

Query: 10  S 8
           S
Sbjct: 354 S 354