BLASTX nr result
ID: Astragalus23_contig00030177
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00030177 (407 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNY01502.1| ribonuclease H [Trifolium pratense] 99 1e-21 gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] 96 2e-21 gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] 92 9e-21 gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan] 91 8e-20 ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanu... 91 2e-19 gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] 84 6e-18 gb|PNY05892.1| ribonuclease H [Trifolium pratense] 87 2e-17 gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Gly... 82 2e-17 gb|AFK37936.1| unknown [Lotus japonicus] 82 4e-17 gb|KYP46173.1| Putative ribonuclease H protein At1g65750 family,... 86 5e-17 gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan] 82 1e-16 gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family ... 84 3e-16 gb|KYP34033.1| Putative ribonuclease H protein At1g65750 family,... 84 4e-16 gb|KYP59667.1| Putative ribonuclease H protein At1g65750 family ... 83 5e-16 ref|XP_022030815.1| uncharacterized protein LOC110931740 [Helian... 79 1e-15 dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subt... 82 1e-15 dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subt... 79 3e-15 ref|XP_020239954.1| uncharacterized protein LOC109818836 [Cajanu... 78 1e-14 gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifo... 77 2e-14 ref|XP_021996043.1| uncharacterized protein LOC110893235 [Helian... 75 2e-14 >gb|PNY01502.1| ribonuclease H [Trifolium pratense] Length = 554 Score = 99.4 bits (246), Expect = 1e-21 Identities = 40/108 (37%), Positives = 66/108 (61%) Frame = -1 Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228 P++LSCVFCFR E H+F C F + VWR V WL ++ + + ++ F+ + Sbjct: 420 PFELSCVFCFRHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKV 479 Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 K R++H+ W+AT W LW +RN++IF+G + ++++ S+K SW W Sbjct: 480 KDKGRVRHLVWLATTWNLWKLRNKVIFKGDIPETSALLDSIKLSSWMW 527 >gb|PNX71113.1| pantothenate synthetase [Trifolium pratense] Length = 255 Score = 95.5 bits (236), Expect = 2e-21 Identities = 40/108 (37%), Positives = 64/108 (59%) Frame = -1 Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228 P++LSCVFCF E H+F C F + VWR V WL ++ L + ++ FL + + Sbjct: 121 PFELSCVFCFWHREDGAHLFFSCYFSKVVWRNVLKWLGLSSPLDVEGIDHFLLFGEFFKV 180 Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 K ++H+ W+AT W LW +RN++IF+G + + ++ S+K SW W Sbjct: 181 KDKGHVRHLVWLATTWNLWKMRNKVIFKGDIPDTAVLLDSIKLFSWIW 228 >gb|KHN04350.1| hypothetical protein glysoja_030944 [Glycine soja] Length = 229 Score = 92.0 bits (227), Expect(2) = 9e-21 Identities = 38/109 (34%), Positives = 62/109 (56%) Frame = -1 Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKS 222 D CV C E E H+F C + +W+ V LWL+V +D+ F+ K I+GK Sbjct: 97 DRRCVLCSSEDETVKHIFLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKK 156 Query: 221 ARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75 +R+KH+ W+A +W +WL RN++IF+ + +I +K +W+W +A Sbjct: 157 QKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAWFMA 205 Score = 35.8 bits (81), Expect(2) = 9e-21 Identities = 14/25 (56%), Positives = 18/25 (72%) Frame = -3 Query: 75 SRKGRNSGVTFSDWCNCPMGCIASL 1 +R+GR +SDW NCPMGC+ SL Sbjct: 205 ARQGRTCWDGWSDWYNCPMGCLLSL 229 >gb|KYP39460.1| hypothetical protein KK1_039239 [Cajanus cajan] Length = 223 Score = 90.9 bits (224), Expect = 8e-20 Identities = 43/106 (40%), Positives = 55/106 (51%) Frame = -1 Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213 CVFC E H+F C SVW + WL + L D V L S + GK R Sbjct: 94 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 153 Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75 + WVAT+W LWL RN+I+F GV +V V+ S+K+ SW W A Sbjct: 154 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA 199 >ref|XP_020203066.1| uncharacterized protein LOC109788691 [Cajanus cajan] Length = 265 Score = 90.9 bits (224), Expect = 2e-19 Identities = 43/106 (40%), Positives = 55/106 (51%) Frame = -1 Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213 CVFC E H+F C SVW + WL + L D V L S + GK R Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 195 Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVA 75 + WVAT+W LWL RN+I+F GV +V V+ S+K+ SW W A Sbjct: 196 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVESVKYRSWKWLTA 241 >gb|KHN35644.1| hypothetical protein glysoja_030996 [Glycine soja] Length = 160 Score = 84.3 bits (207), Expect = 6e-18 Identities = 35/106 (33%), Positives = 58/106 (54%) Frame = -1 Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKS 222 D V C E E H+ C + +W+ V LWL+V +D+ F+ K I+GK Sbjct: 28 DRRYVLCSSEDETVKHILLDCRVAKKIWQQVCLWLDVPVVEGEDIQAHFMAFGKLIKGKK 87 Query: 221 ARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 +R+KH+ W+A +W +WL RN++IF+ + +I +K +W+W Sbjct: 88 QKRVKHLIWMAVIWNIWLTRNKVIFKEEAAAIPVMISGIKDCAWAW 133 >gb|PNY05892.1| ribonuclease H [Trifolium pratense] Length = 455 Score = 87.4 bits (215), Expect(2) = 2e-17 Identities = 35/110 (31%), Positives = 59/110 (53%) Frame = -1 Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228 P+DL VFCF E+E H+F C ++ VWR ++ WL + F ++ Sbjct: 321 PHDLPRVFCFTEIEDCMHLFFNCKLMQQVWRSIYKWLGCAYYNYGEGWKHFNFFGGIVKS 380 Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSV 78 K ++KH+ W+ T W +W +RN IIF+G + + ++ +K +SW W + Sbjct: 381 KKGEKVKHLIWLVTTWCIWRLRNNIIFRGALADCAQLVDQIKLISWVWFI 430 Score = 29.3 bits (64), Expect(2) = 2e-17 Identities = 11/24 (45%), Positives = 15/24 (62%) Frame = -3 Query: 72 RKGRNSGVTFSDWCNCPMGCIASL 1 R GR+ +SDWC P+ CI S+ Sbjct: 432 RSGRHCPYLYSDWCVNPLECILSM 455 >gb|KHN03945.1| hypothetical protein glysoja_022631, partial [Glycine soja] Length = 114 Score = 81.6 bits (200), Expect = 2e-17 Identities = 34/98 (34%), Positives = 54/98 (55%) Frame = -1 Query: 353 VFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARRIKHIFWVATMWQL 174 +F C F + VW+G+ WL + L +++ +LQ I+GK RR KH+ W T W + Sbjct: 1 LFVFCPFAKQVWQGILNWLGYSFSLPNNIQELYLQLGMNIRGKKKRRFKHLLWHNTCWSI 60 Query: 173 WLVRNRIIFQGGVTNVNSVIYSLKFVSWSWSVALGRGE 60 W RN +IF+ +VN+ I +K +SW W + G+ Sbjct: 61 WCHRNNVIFRNAEVDVNNTILFIKSMSWQWVLYKSSGK 98 >gb|AFK37936.1| unknown [Lotus japonicus] Length = 138 Score = 81.6 bits (200), Expect = 4e-17 Identities = 39/105 (37%), Positives = 55/105 (52%) Frame = -1 Query: 398 LSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSA 219 L+C FC + E +DH+FC C F ++WR V W V+ L V F+Q + S Sbjct: 8 LACSFCQLQDETSDHLFCTCAFSMAIWRMVLGWFGVSIALPSLVKALFVQFPVFGRCSSK 67 Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 R W+AT W LWL+RNR+IF G + V+ ++ SW W Sbjct: 68 REALVTVWMATCWSLWLMRNRVIFDNGELDTGLVLDLIQVRSWHW 112 >gb|KYP46173.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 353 Score = 85.5 bits (210), Expect = 5e-17 Identities = 41/113 (36%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Frame = -1 Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213 CVFC E H+F C SVW + WL + L V L S + GK R Sbjct: 224 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSGSVEGQLLTMSGFVVGKKPAR 283 Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW-SVALGRGEI 57 + WVAT+W LWL RN+++F GV +V V+ S K+ +W W ++ + G I Sbjct: 284 VVVTIWVATVWSLWLHRNKMVFNNGVCDVLEVVESAKYRAWKWLTIGMSHGSI 336 >gb|KYP55738.1| hypothetical protein KK1_001963 [Cajanus cajan] Length = 186 Score = 81.6 bits (200), Expect = 1e-16 Identities = 37/105 (35%), Positives = 52/105 (49%) Frame = -1 Query: 398 LSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSA 219 LSC FC +E + H+F C F R+VW VF W + L + + F K Sbjct: 55 LSCPFCSTTLESSQHLFLECEFSRNVWHNVFTWTGIRLELPSSLGHLFFLLRSMFLDKVK 114 Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 R+ + IFW AT+W LW RN I+F+ + Y +K +SW W Sbjct: 115 RKWRDIFWHATIWVLWTNRNEIVFRNKTVSHFDFPYQIKIISWHW 159 >gb|KYP44105.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 346 Score = 83.6 bits (205), Expect = 3e-16 Identities = 36/107 (33%), Positives = 51/107 (47%) Frame = -1 Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225 ++ C+FC ++E H+FC C V VW+ WLN L + + F IQ + Sbjct: 211 HESRCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 270 Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 S H W+ATMW LW RN+ IF GG S++ + W W Sbjct: 271 SEAETWHTVWLATMWTLWRYRNKCIFDGGTFEQGSIVRDILIFCWRW 317 >gb|KYP34033.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 443 Score = 83.6 bits (205), Expect = 4e-16 Identities = 36/107 (33%), Positives = 51/107 (47%) Frame = -1 Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225 ++ C+FC ++E H+FC C V VW+ WLN L + + F IQ + Sbjct: 308 HESRCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 367 Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 S H W+ATMW LW RN+ IF GG S++ + W W Sbjct: 368 SEAETWHTVWLATMWTLWRYRNKCIFDGGTFEQGSIVRDILIFCWRW 414 >gb|KYP59667.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 409 Score = 83.2 bits (204), Expect = 5e-16 Identities = 36/107 (33%), Positives = 51/107 (47%) Frame = -1 Query: 404 YDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGK 225 ++ C+FC ++E H+FC C V VW+ WLN L + + F IQ + Sbjct: 274 HESKCIFCKVDIETTTHIFCTCPMVDRVWKQCLSWLNCPAPLPRQIFDHFSFLPAPIQNE 333 Query: 224 SARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 S H W+ATMW LW RN+ IF GG S++ + W W Sbjct: 334 SEAETWHTVWLATMWTLWRYRNKCIFVGGTFEQGSIVRDILIFCWRW 380 >ref|XP_022030815.1| uncharacterized protein LOC110931740 [Helianthus annuus] Length = 157 Score = 78.6 bits (192), Expect = 1e-15 Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 1/99 (1%) Frame = -1 Query: 395 SCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQ-GKSA 219 +CV C ++VE ADH+ C VW + LW+N+ GL ++ LQS + K+ Sbjct: 31 TCVLCEQDVESADHILLNCRVAEEVWHRLSLWMNIPPGLNQSTVDEMLQSVNGLNVSKNR 90 Query: 218 RRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLK 102 +RI H ++ TMW +W RNR IF+G + N ++ +K Sbjct: 91 KRIIHAIYIITMWSIWKARNRKIFEGIIVNRYKLVEDIK 129 >dbj|GAU48210.1| hypothetical protein TSUD_404970 [Trifolium subterraneum] Length = 1653 Score = 82.4 bits (202), Expect = 1e-15 Identities = 35/108 (32%), Positives = 56/108 (51%) Frame = -1 Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228 P DL CVFC E H+F C FV VW V+ W+ + + + F + Sbjct: 1518 PQDLPCVFCSVFYEDCVHLFFHCSFVNCVWEAVYNWIGKDYHAGAEGWSHFKVFGDMVNS 1577 Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 + R++H+ W+AT W LW +RN +IF G + +S++ +K +S +W Sbjct: 1578 TNIERVRHLIWLATTWNLWKLRNNVIFNGATPSASSLLNDIKAISCAW 1625 >dbj|GAU42970.1| hypothetical protein TSUD_188430 [Trifolium subterraneum] Length = 767 Score = 78.6 bits (192), Expect(2) = 3e-15 Identities = 34/108 (31%), Positives = 56/108 (51%) Frame = -1 Query: 407 PYDLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQG 228 P DL CVFC E + H+F C ++ VW VF W + D N F ++ Sbjct: 632 PQDLHCVFCSSYDEDSAHLFFHCSVLKRVWEEVFKWFGKSYQAEADGWNHFNIFGSLLKT 691 Query: 227 KSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 K +++H+ W+AT W +W +RN ++F G + +S++ +K +S W Sbjct: 692 KRFEKVRHLIWLATTWSIWKLRNNVVFNGVTLSSSSLVNDIKTISCLW 739 Score = 30.4 bits (67), Expect(2) = 3e-15 Identities = 12/26 (46%), Positives = 15/26 (57%) Frame = -3 Query: 78 SSRKGRNSGVTFSDWCNCPMGCIASL 1 S R G S ++F DWC PM C S+ Sbjct: 741 SGRYGHISSISFPDWCFDPMTCFQSI 766 >ref|XP_020239954.1| uncharacterized protein LOC109818836 [Cajanus cajan] gb|KYP41807.1| hypothetical protein KK1_036829 [Cajanus cajan] Length = 237 Score = 77.8 bits (190), Expect = 1e-14 Identities = 38/95 (40%), Positives = 48/95 (50%) Frame = -1 Query: 392 CVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNVNEGLCDDVLNTFLQSSKCIQGKSARR 213 CVFC E H+F C SVW + WL + L D V L S + GK R Sbjct: 136 CVFCNNREEDVTHLFLHCDVASSVWYSLCRWLGFSSCLSDSVEGQLLTMSGFVAGKKPAR 195 Query: 212 IKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYS 108 + WVAT+W LWL RN+I+F GV +V V+ S Sbjct: 196 VVVTIWVATVWSLWLHRNKIVFNNGVCDVLEVVES 230 >gb|PNX84435.1| hypothetical protein L195_g040495, partial [Trifolium pratense] Length = 248 Score = 77.4 bits (189), Expect = 2e-14 Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 1/96 (1%) Frame = -1 Query: 368 EVADHVFCRCVFVRSVWRGVFLWLNVNE-GLCDDVLNTFLQSSKCIQGKSARRIKHIFWV 192 E H+F C F + VW V+ WL + + N FL I K R++H+FW+ Sbjct: 124 EDCSHLFFHCAFSKGVWESVYRWLGMKSISAGAEGWNHFLLFDDMITAKKGERVRHLFWL 183 Query: 191 ATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 AT W +W +RN ++F G + N +S++ + SW W Sbjct: 184 ATTWNIWKLRNNVVFNGVIPNASSLVEDIIANSWLW 219 >ref|XP_021996043.1| uncharacterized protein LOC110893235 [Helianthus annuus] Length = 149 Score = 75.1 bits (183), Expect = 2e-14 Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 4/110 (3%) Frame = -1 Query: 401 DLSCVFCFREVEVADHVFCRCVFVRSVWRGVFLWLNV----NEGLCDDVLNTFLQSSKCI 234 D C C E E ADH+F +C+ RSVW +F WL V N D+L F S Sbjct: 19 DALCANCGFEEESADHLFAKCLTARSVWWNIFSWLKVPWPSNVDSLKDLLEVFYNSP--- 75 Query: 233 QGKSARRIKHIFWVATMWQLWLVRNRIIFQGGVTNVNSVIYSLKFVSWSW 84 K +R+ H+ V T+W++W RNR +F+G +V ++ S+K S+ W Sbjct: 76 GSKVWKRLAHMVAVDTVWRIWNARNRKVFEGDGISVRKIVDSIKEESFIW 125