BLASTX nr result
ID: Astragalus23_contig00030248
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00030248 (423 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subt...     70    3e-11
gb|PNY14301.1| ribonuclease H [Trifolium pratense]                      67    5e-10
dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subt...     67    6e-10
dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subt...     66    7e-10
gb|PNX72264.1| ribonuclease H [Trifolium pratense]                      64    1e-09
gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense]             64    5e-09
gb|PNX85341.1| hypothetical protein L195_g041409, partial [Trifo...     63    6e-09
gb|PNY15111.1| ribonuclease H [Trifolium pratense]                      60    8e-09
gb|PNX71533.1| ribonuclease H [Trifolium pratense]                      62    1e-08
dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subt...     61    4e-08
gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense]             61    6e-08
dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte...     60    8e-08
gb|PNY04967.1| ribonuclease H [Trifolium pratense]                      55    2e-07
dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subt...     51    2e-06
dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte...
54 4e-06 gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense] 52 5e-06 >dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subterraneum] Length = 317 Score = 69.7 bits (169), Expect = 3e-11 Identities = 36/102 (35%), Positives = 50/102 (49%) Frame = +3 Query: 36 LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215 LLWK W+ RNQ+IFK V +P+ + +VHEFN + S A Sbjct: 99 LLWKFWYGRNQVIFKGVVLDPIALAAEAALYVHEFNEANPRRCSQVVLQQASVSRLDDAN 158 Query: 216 HSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 ++ DAGCF G GWG + N G +ACK E+I ++ Sbjct: 159 MQLMFTDAGCFNNGYTGWGIVLRNVDGTTSFSACKREEIEVE 200 >gb|PNY14301.1| ribonuclease H [Trifolium pratense] Length = 1196 Score = 66.6 bits (161), Expect = 5e-10 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 2/104 (1%) Frame = +3 Query: 36 LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXT--HLVSPKSTTAS 209 LLWK W+ RNQ+IFK+ +P+ + +VHEFN H+ +P+ ++ Sbjct: 978 LLWKFWYGRNQVIFKDAVFDPILLAADAIEYVHEFNEANPRRCNQVVLQHISAPRLDDSN 1037 Query: 210 AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 ++ DAGCF G GWG + N G +ACK E+I ++ Sbjct: 1038 M--QLMFTDAGCFNNGYTGWGLVLRNVDGTTSFSACKRENIEVE 1079 >dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subterraneum] Length = 1610 Score = 66.6 bits (161), Expect = 6e-10 Identities = 35/100 (35%), Positives = 48/100 (48%) Frame = +3 Query: 33 SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212 ++LWK WFARNQ +F P+++ S FV EFN + +AS Sbjct: 1478 TILWKFWFARNQYVFNGYPIEPLRLAQSALLFVQEFNEANNLSRSTHVATRVHNTNSASP 1537 Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 + VDAGCF R GWG + + G + AC+ EDI Sbjct: 1538 CQFSMFVDAGCFSNARTGWGLVLKDQRGNVTWNACRREDI 1577 >dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subterraneum] Length = 482 Score = 66.2 bits (160), Expect = 7e-10 Identities = 38/100 (38%), Positives = 51/100 (51%), Gaps = 2/100 (2%) Frame = +3 Query: 39 LWKIWFARNQLIFKNVN--PNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212 LWKIWF RN+LIF+ P+ +V S SF EF+ T V S S Sbjct: 270 
LWKIWFHRNKLIFEQQAFVPHEYEVASSASSFGAEFSPTFLREIDMNTSDVLEASQVVSP 329 Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 + + VDAGCF G GWG V + G ++ +AC+ E+I Sbjct: 330 ICNRICVDAGCFSNGSTGWGLIVKDHEGSVIFSACRFEEI 369 >gb|PNX72264.1| ribonuclease H [Trifolium pratense] Length = 854 Score = 64.3 bits (155), Expect(2) = 1e-09 Identities = 38/113 (33%), Positives = 50/113 (44%), Gaps = 11/113 (9%) Frame = +3 Query: 36 LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215 LLWK W RN +F V +P ++ SFVH+FN P+ A A Sbjct: 631 LLWKFWAGRNAAVFNGVQLDPGRLAIDAMSFVHDFNEANP-----------PRCRRAPVA 679 Query: 216 HSVVK-----------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 H ++ VDAGC G WG + N+ GE V +ACK ED +D Sbjct: 680 HVPIQPGMTNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVD 732 Score = 26.2 bits (56), Expect(2) = 1e-09 Identities = 9/15 (60%), Positives = 12/15 (80%) Frame = +2 Query: 341 LDPLTAEILGIRWCM 385 +DPL AE LG+RW + Sbjct: 731 VDPLMAEALGVRWAL 745 >gb|PNY16580.1| ribonuclease H, partial [Trifolium pratense] Length = 894 Score = 63.9 bits (154), Expect = 5e-09 Identities = 37/101 (36%), Positives = 46/101 (45%) Frame = +3 Query: 30 SSLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTAS 209 S+ LW IW RN+LIFKNV P+ V + FV EFN Sbjct: 738 STTLWMIWKGRNKLIFKNVKFCPIYVAAASSDFVAEFNSGSCCNESNIVRENPDSWEPPE 797 Query: 210 AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 A V +DAGCF G GWG + N +G + AA +E I Sbjct: 798 QAKFKVNIDAGCFSNGTTGWGMIMRNHLGMVDFAATHLEKI 838 >gb|PNX85341.1| hypothetical protein L195_g041409, partial [Trifolium pratense] Length = 382 Score = 63.2 bits (152), Expect(2) = 6e-09 Identities = 34/100 (34%), Positives = 46/100 (46%) Frame = +3 Query: 33 SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212 +LLWK W RN +IF +P ++ +FVHEFN + S Sbjct: 261 TLLWKFWAGRNAVIFNGWQMDPTRLALDAMNFVHEFNEANPSRNRRVLVSQAISDPPRST 320 Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 + + + VDAGC G WG + N GE +ACK EDI Sbjct: 321 SLNSMFVDAGCCNSGHTVWGLVLRNMNGETTFSACKREDI 360 Score = 24.6 bits 
(52), Expect(2) = 6e-09 Identities = 9/17 (52%), Positives = 12/17 (70%) Frame = +2 Query: 335 IGLDPLTAEILGIRWCM 385 I +PL AE LG+RW + Sbjct: 360 ITAEPLLAEALGVRWAL 376 >gb|PNY15111.1| ribonuclease H [Trifolium pratense] Length = 1334 Score = 60.1 bits (144), Expect(2) = 8e-09 Identities = 32/102 (31%), Positives = 47/102 (46%) Frame = +3 Query: 36 LLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAA 215 L+WKIW ARN L+F N +P+ + F+ E + + +A Sbjct: 1119 LMWKIWNARNNLVFNNKLVDPIAIAQEAMYFMQELSPSPHEHNATPMQDAVLAAQPMPSA 1178 Query: 216 HSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 V VDAGCF GWG + N G +V +AC+ E I ++ Sbjct: 1179 PHVFYVDAGCFSGNATGWGMVIYNQSGRVVLSACRKELIDVE 1220 Score = 27.3 bits (59), Expect(2) = 8e-09 Identities = 8/17 (47%), Positives = 14/17 (82%) Frame = +2 Query: 335 IGLDPLTAEILGIRWCM 385 I ++P+ AE +G+RWC+ Sbjct: 1217 IDVEPVLAEAIGVRWCL 1233 >gb|PNX71533.1| ribonuclease H [Trifolium pratense] Length = 798 Score = 62.4 bits (150), Expect(2) = 1e-08 Identities = 37/103 (35%), Positives = 49/103 (47%), Gaps = 3/103 (2%) Frame = +3 Query: 33 SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXT---HLVSPKSTT 203 +LLWK W RN +IF +P + SFV EFN + P +T Sbjct: 573 TLLWKFWAGRNAVIFNGWQMDPTFLALDALSFVQEFNEANPSRNRRALVSQSISEPSRST 632 Query: 204 ASAAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 ++ +S+ VDAGC G WG + N GE V +ACK EDI Sbjct: 633 CTSMNSMF-VDAGCCNSGHTVWGLVLRNLNGETVFSACKREDI 674 Score = 24.6 bits (52), Expect(2) = 1e-08 Identities = 9/17 (52%), Positives = 12/17 (70%) Frame = +2 Query: 335 IGLDPLTAEILGIRWCM 385 I +PL AE LG+RW + Sbjct: 674 ITAEPLLAEALGVRWAL 690 >dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subterraneum] Length = 335 Score = 60.8 bits (146), Expect(2) = 4e-08 Identities = 36/96 (37%), Positives = 47/96 (48%) Frame = +3 Query: 45 KIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSV 224 KIWF RN+LIFK P +V S SFV EF+ T V S S + Sbjct: 127 KIWFHRNKLIFKQQAFVPHEVASSASSFVAEFSPTFLREIYMNTSDVLEASQVVSPVCNR 186 Query: 225 
VKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 + VDAG F G GWG V + ++ +AC+ E+I Sbjct: 187 ICVDAGSFSNGSTGWGLIVKDHESSVILSACRFEEI 222 Score = 24.3 bits (51), Expect(2) = 4e-08 Identities = 10/25 (40%), Positives = 15/25 (60%) Frame = +2 Query: 347 PLTAEILGIRWCMS*SCQILYFYLT 421 P+ AE LGIRW + + + Y +T Sbjct: 226 PILAEALGIRWAIQTAIDLNYNQVT 250 >gb|PNX95041.1| ribonuclease H, partial [Trifolium pratense] Length = 1348 Score = 60.8 bits (146), Expect = 6e-08 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 10/112 (8%) Frame = +3 Query: 39 LWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAH 218 LWKIWF RNQ IFKN+ +P++V + ++FV EF+ Sbjct: 1234 LWKIWFFRNQTIFKNLAFDPIRVSCAAQNFVSEFSVSSTPREQSTGQQPRCDWVAPPPDF 1293 Query: 219 SVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACK----------VEDIGLDW 344 + VDAGC G+ WG + N E+V AA K E +GL W Sbjct: 1294 FKLNVDAGCGSMGQVSWGLVIRNHNAEVVFAATKKTEFVAEAVVAEALGLRW 1345 >dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum] Length = 1626 Score = 60.5 bits (145), Expect = 8e-08 Identities = 36/101 (35%), Positives = 46/101 (45%) Frame = +3 Query: 30 SSLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTAS 209 S+ LW IW RN+LIFKN P+ V + FV EFN + K Sbjct: 1411 STTLWMIWKGRNKLIFKNEKFCPIYVAAASSDFVAEFNSGTCSFENIPSCDNPGKWEHPE 1470 Query: 210 AAHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDI 332 V +DAGCF G GWG + N +G + AA +E I Sbjct: 1471 QGKLKVNIDAGCFSNGTTGWGMIMRNHLGMVEFAATHLEKI 1511 >gb|PNY04967.1| ribonuclease H [Trifolium pratense] Length = 207 Score = 54.7 bits (130), Expect(2) = 2e-07 Identities = 27/98 (27%), Positives = 49/98 (50%) Frame = +3 Query: 48 IWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVV 227 +WF RNQ++F+ P P + + VHEFN + + +H ++ Sbjct: 1 MWFFRNQVVFQQKIPTPPDIAIAALDIVHEFNLAVPKKSKQRQQHAASEPAATLCSH-LI 59 Query: 228 KVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 +VDAGCF +G +G + + G + +AC+ E++ +D Sbjct: 60 QVDAGCFPDGYTTFGCVIKDCSGMISFSACRKENLLVD 97 Score = 27.7 bits (60), Expect(2) = 2e-07 Identities = 10/15 (66%), Positives = 12/15 
(80%) Frame = +2 Query: 341 LDPLTAEILGIRWCM 385 +DPL AE L IRWC+ Sbjct: 96 VDPLLAEALAIRWCL 110 >dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subterraneum] Length = 246 Score = 51.2 bits (121), Expect(2) = 2e-06 Identities = 31/108 (28%), Positives = 46/108 (42%), Gaps = 11/108 (10%) Frame = +3 Query: 51 WFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVVK 230 W RN +F + +P ++ SFVH+FN P+ A AH ++ Sbjct: 5 WNGRNATVFNGIKLDPGRLALDVTSFVHDFNEANP-----------PRCRRAPVAHVSIQ 53 Query: 231 -----------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 VDAGC G WG + N+ GE + + CK E+I +D Sbjct: 54 PSLVTPIFSLFVDAGCSMSGPIVWGLVLRNSDGETILSVCKREEISVD 101 Score = 27.7 bits (60), Expect(2) = 2e-06 Identities = 10/17 (58%), Positives = 13/17 (76%) Frame = +2 Query: 335 IGLDPLTAEILGIRWCM 385 I +DPL AE LG+RW + Sbjct: 98 ISVDPLMAETLGVRWAL 114 >dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum] Length = 1475 Score = 53.5 bits (127), Expect(2) = 4e-06 Identities = 32/103 (31%), Positives = 48/103 (46%) Frame = +3 Query: 33 SLLWKIWFARNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASA 212 +LLWK W RN ++F+ +PV + SFV EFN + ++ + + S Sbjct: 1251 TLLWKFWATRNNVVFRGDKLDPVCLVDEVMSFVQEFNEANPPRQGRVSLPLTTVTPSISR 1310 Query: 213 AHSVVKVDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 V VDAGC G WG + N +ACK +DI ++ Sbjct: 1311 PSFSVFVDAGCNLNGPTVWGLVLKNHDRITTFSACKYDDIAVE 1353 Score = 24.6 bits (52), Expect(2) = 4e-06 Identities = 8/17 (47%), Positives = 13/17 (76%) Frame = +2 Query: 335 IGLDPLTAEILGIRWCM 385 I ++P+ AE LG+RW + Sbjct: 1350 IAVEPVMAEALGVRWAI 1366 >gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense] Length = 217 Score = 51.6 bits (122), Expect(2) = 5e-06 Identities = 33/105 (31%), Positives = 45/105 (42%), Gaps = 11/105 (10%) Frame = +3 Query: 60 RNQLIFKNVNPNPVQVDFSPESFVHEFNXXXXXXXXXXTHLVSPKSTTASAAHSVVK--- 230 RN +F V +P ++ SFVH+FN P+ A AH ++ Sbjct: 2 RNAAVFNGVQLDPGRLAIDAMSFVHDFNEANP-----------PRCRRAPVAHVPIQPGM 50 Query: 231 
--------VDAGCFGEGRAGWGFTV*NAVGELVAAACKVEDIGLD 341 VDAGC G WG + N+ GE V +ACK ED +D Sbjct: 51 TNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVD 95 Score = 26.2 bits (56), Expect(2) = 5e-06 Identities = 9/15 (60%), Positives = 12/15 (80%) Frame = +2 Query: 341 LDPLTAEILGIRWCM 385 +DPL AE LG+RW + Sbjct: 94 VDPLMAEALGVRWAL 108