BLASTX nr result
ID: Astragalus23_contig00033606
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00033606
         (453 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...    75   8e-14
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta...    70   2e-11
ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa c...    67   5e-10
gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family,...    65   7e-10
ref|XP_018510856.1| PREDICTED: uncharacterized protein LOC103844...    67   8e-10
ref|XP_020871723.1| uncharacterized protein LOC9299799 [Arabidop...    66   9e-10
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ...    66   9e-10
gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family,...    65   1e-09
ref|XP_022553441.1| uncharacterized protein LOC106384431 [Brassi...    66   1e-09
gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family ...    64   1e-09
ref|XP_013745405.2| uncharacterized protein LOC106448012 [Brassi...    66   1e-09
ref|XP_018435759.1| PREDICTED: uncharacterized protein LOC108808...    65   4e-09
dbj|GAU10577.1| hypothetical protein TSUD_420970, partial [Trifo...    64   6e-09
ref|XP_013658112.1| uncharacterized protein LOC106362816 [Brassi...    64   7e-09
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...    61   8e-09
dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subt...    61   8e-09
dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subte...    63   1e-08
ref|XP_013751841.2| uncharacterized protein LOC106454232 [Brassi...    63   1e-08
gb|PRQ37815.1| putative RNA-directed DNA polymerase [Rosa chinen...    62   2e-08
sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr...    62   3e-08

>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 74.7 bits (182), Expect(2) = 8e-14
 Identities = 36/79 (45%), Positives = 47/79 (59%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ FLWK  H  LLTN +R RR M  SK+C  C   +E+L H  R+C+ S SIW  L+V
Sbjct: 933 IKLFLWKATHACLLTNMERLRRKMTASKVCSRCNLQDESLLHVFRDCNFSKSIWQNLNVQ 992

Query: 288 NPNQFLSCVNWNSWLYTNL 344
           N   F    +W+ WL TNL
Sbjct: 993 NRRSFFHENDWHQWLLTNL 1011

 Score = 29.6 bits (65), Expect(2) = 8e-14
 Identities = 11/29 (37%), Positives = 19/29 (65%)
 Frame = +1

Query: 364  EEEDYWHILFGAVLDQIW*NRNNVEFSQR 450
            ++E  W + F  +LD+IW +RN+  FS +
Sbjct: 1018 KDEATWSLKFAIILDKIWYSRNSFIFSHK 1046

>gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 321

 Score = 70.5 bits (171), Expect = 2e-11
 Identities = 30/85 (35%), Positives = 52/85 (61%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW +  Q+++TN +R+RR ++ +++C+IC    ET+ H +R+C A   IWS L
Sbjct: 6   VRVFLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSRLVPR 65

Query: 288 NPNQFLSCVNWNSWLYTNLRKKGVW 362
           +  +     +   W+Y NLR++G W
Sbjct: 66  DQIRQFFTASLLEWIYKNLRERGSW 90

>ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa chinensis]
          Length = 1296

 Score = 67.0 bits (162), Expect = 5e-10
 Identities = 33/95 (34%), Positives = 50/95 (52%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           L+ FLW L H  LLTNA R +R +     C IC  ++E+L H  ++C A+L++W+   +
Sbjct: 970 LKTFLWVLCHGKLLTNAHRVKRNLTDDDTCPICRCNSESLSHLFKDCPAALNVWNSFTLP 1029

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSF 392
             P +F   ++W  WL  NL  K     G     +F
Sbjct: 1030 QPVKFTFSMSWEGWLQANLFCKAKCNAGNPWCSTF 1064

>gb|KYP71397.1| Putative
ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 510

 Score = 65.5 bits (158), Expect(2) = 7e-10
 Identities = 31/83 (37%), Positives = 45/83 (54%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW+L H  LLTN  R  R M    LC +C +  ETL H +REC+ + S+W  +
Sbjct: 285 IRTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRECNVARSVWINIFNG 344

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     ++W  WL  NL ++G
Sbjct: 345 RLHTIFFTMDWMLWLEWNLLQQG 367

 Score = 25.4 bits (54), Expect(2) = 7e-10
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 369 ILFVVALDAIWNMRNNVVF 387

>ref|XP_018510856.1| PREDICTED: uncharacterized protein LOC103844431 [Brassica rapa]
          Length = 1833

 Score = 66.6 bits (161), Expect = 8e-10
 Identities = 29/79 (36%), Positives = 49/79 (62%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + HQ+++TN +R RR ++ + +C++C + NET+ H +R+C AS+ +W  L
Sbjct: 1513 VRVFLWLVSHQVIMTNMERKRRHLSDNGMCQLCKSGNETILHTLRDCPASMGLWRRLVDP 1572

Query: 288  NPNQFLSCVNWNSWLYTNL 344
            +  Q     +   WLY NL
Sbjct: 1573 SRQQRFFDQSLLQWLYENL 1591

>ref|XP_020871723.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
 ref|XP_020871724.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
 ref|XP_020871725.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
 ref|XP_020871727.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
 ref|XP_020871728.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
 ref|XP_020871729.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp. lyrata]
          Length = 592

 Score = 66.2 bits (160), Expect = 9e-10
 Identities = 28/79 (35%), Positives = 45/79 (56%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW + HQ ++TNA+R+RR +  +++C++C    ET+ H +R+C A   IW+
Sbjct: 267 IRLFLWLVAHQAIMTNAERYRRHLGDTEICQVCKGGTETIIHALRDCPAMEGIWTRTVPL 326

Query: 288 NPNQFLSCVNWNSWLYTNL 344
              Q     +   WLY NL
Sbjct: 327 RKRQSFFASSLLEWLYANL 345

>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana]
          Length = 676

 Score = 66.2 bits (160), Expect = 9e-10
 Identities = 32/82 (39%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
 Frame = +3

Query: 111 RRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIW-SILHV* 287
           R FLW +G+Q++LTNA+R RR MA S +C +C  ++E+L H +R+C A + IW  ++ V
Sbjct: 357 RIFLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVM 416

Query: 288 NPNQFLSCVNWNSWLYTNLRKK 353
              +F    +   W+Y NL+++
Sbjct: 417 EQRRFFE-TSLLEWMYGNLKER 437

>gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 812

 Score = 65.1 bits (157), Expect(2) = 1e-09
 Identities = 31/83 (37%), Positives = 46/83 (55%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW+L H  LLTN  R RR M    LC +C +  ETL H +R+C+ + S+W  +
Sbjct: 498 IRTFLWRLAHNSLLTNDLRMRRGMTMDPLCPVCHDELETLIHAMRDCNVARSVWINIFNG 557

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     ++W  WL  NL ++G
Sbjct: 558 RLHTNFFTMDWMLWLEWNLLQQG 580

 Score = 25.4 bits (54), Expect(2) = 1e-09
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 582 ILFVVALDAIWTMRNNVVF 600

>ref|XP_022553441.1| uncharacterized protein LOC106384431 [Brassica napus]
          Length = 1859

 Score = 66.2 bits (160), Expect = 1e-09
 Identities = 32/99 (32%), Positives = 53/99 (53%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW  G Q+++TN +R+RR +  +  CE+C  + ET+ H +R+C A   IW+ +
Sbjct: 1538 VRCFLWLAGQQVIMTNCERYRRHLGATNTCEVCKGAPETVLHVLRDCPAMEGIWNRVVPM 1597

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSFWSSF 404
               Q     +   WL+TNL         +++  S WS+F
Sbjct: 1598 GKRQTFFTQSLLQWLFTNL------GDNQMVGESTWSTF 1630

>gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 506

 Score = 63.5 bits (153), Expect(2) = 1e-09
 Identities = 30/83 (36%), Positives = 45/83 (54%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ FLW+L H  LLTN  R  R M    LC +C +  ETL H +R+C+ + S+W  +
Sbjct: 243 IQTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRDCNVARSVWINIFNG 302

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     +NW  WL  NL ++G
Sbjct: 303 RLHTNFFTMNWMLWLEWNLLQQG 325

 Score = 26.6 bits (57), Expect(2) = 1e-09
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 327 ILFAVALDAIWTMRNNVVF 345

>ref|XP_013745405.2| uncharacterized protein LOC106448012 [Brassica napus]
 ref|XP_022544051.1| uncharacterized protein LOC111198962 [Brassica napus]
          Length = 1826

 Score = 65.9 bits (159), Expect = 1e-09
 Identities = 27/79 (34%), Positives = 48/79 (60%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + HQ+++TN +R RR ++ + +C++C N +ET+ H +R+C A++ +W  + +
Sbjct: 1506 VRVFLWLVAHQVIMTNMERKRRHLSDNGMCQLCKNGDETIIHVLRDCPAAMGLWRKIVIL 1565

Query: 288  NPNQFLSCVNWNSWLYTNL 344
               Q         WLY NL
Sbjct: 1566 RKQQRFFNQPLLEWLYENL 1584

>ref|XP_018435759.1| PREDICTED: uncharacterized protein LOC108808055 [Raphanus sativus]
          Length = 1802

 Score = 64.7 bits (156), Expect = 4e-09
 Identities = 30/87 (34%), Positives = 48/87 (55%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW +G+Q ++TNA+R +R ++ + +C++C    ET+ H +R+C A   IW
Sbjct: 1482 VRMFLWLVGNQAIMTNAERFQRHLSGTNVCQVCRGGIETILHVLRDCPAMKGIWDRFVPA 1541

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRG 368
               Q    +    WLY NL +K V  G
Sbjct: 1542 TRRQTFFSMTLYEWLYWNLCEKDVGSG 1568

>dbj|GAU10577.1| hypothetical protein
TSUD_420970, partial [Trifolium subterraneum]
          Length = 426

 Score = 63.5 bits (153), Expect(2) = 6e-09
 Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ F+W   H+ LLTN +R +  +  S  C  CGN +ET+ H +R+C+ +  IW  L
Sbjct: 262 IQTFIWIAAHERLLTNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHATRIWLRLVCH 321

Query: 288 NP-NQFLSCVNWNSWLYTNLRKK--GVWRG 368
           N    F S +N   W++ NL  K  GV +G
Sbjct: 322 NQITNFFSSLNCRDWIFMNLNSKEFGVQQG 351

 Score = 24.3 bits (51), Expect(2) = 6e-09
 Identities = 11/32 (34%), Positives = 15/32 (46%)
 Frame = +1

Query: 352 KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
           KEFG ++  W  +F      IW  RN   F +
Sbjct: 344 KEFGVQQGNWQSIFMVACWHIWTWRNKSIFEE 375

>ref|XP_013658112.1| uncharacterized protein LOC106362816 [Brassica napus]
          Length = 1707

 Score = 63.9 bits (154), Expect = 7e-09
 Identities = 32/99 (32%), Positives = 51/99 (51%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW  G Q+++TN +R+RR +  +  CE+C  + ET+ H +R+C A   IW+ +
Sbjct: 1389 VRCFLWLAGQQVIMTNCERYRRHLGATNTCEVCKGAPETVLHVLRDCPAMEGIWNRVVPM 1448

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSFWSSF 404
               Q     +   WL+TNL    +         S WS+F
Sbjct: 1449 GKRQTFFTQSLLQWLFTNLGDNQM---------SIWSTF 1478

>ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 61.2 bits (147), Expect(2) = 8e-09
 Identities = 28/85 (32%), Positives = 42/85 (49%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + H  +LTN+++ RR +     C  C +  E+  H +R+C  ++SIW+ L
Sbjct: 1587 IRTFLWLVTHNAILTNSEKRRRHLTNDDTCPRCRSHEESTIHVLRDCPYAMSIWNRLIPP 1646

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVW 362
            N          N WLY NL     W
Sbjct: 1647 NGRSSFFNTELNEWLYQNLTTNKNW 1671

 Score = 26.2 bits (56), Expect(2) = 8e-09
 Identities = 10/22 (45%), Positives = 13/22 (59%)
 Frame = +1

Query: 379  WHILFGAVLDQIW*NRNNVEFS 444
            W+ LFG  L  IW  RN + F+
Sbjct: 1671 WNCLFGVALSSIWYLRNKLVFN 1692

>dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subterraneum]
          Length = 1025

 Score = 61.2 bits (147), Expect(2) = 8e-09
 Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 3/85 (3%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ F+W   H  LLTN +R +  +  S  C ICGN +ET+ H +R+C  +  IW  L +
Sbjct: 700 IQTFMWIAAHARLLTNVRRSKWGVGVSPTCSICGNDDETMIHTLRDCIYATGIW--LRLV 757

Query: 288 NPNQ---FLSCVNWNSWLYTNLRKK 353
           + NQ   F S  +   W++ NL  K
Sbjct: 758 SSNQITNFFSSFDCREWIFLNLNTK 782

 Score = 26.2 bits (56), Expect(2) = 8e-09
 Identities = 11/32 (34%), Positives = 16/32 (50%)
 Frame = +1

Query: 352 KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
           K FG +++ W  +F  V   IW  RN   F +
Sbjct: 782 KNFGNQQESWKSIFMVVCWHIWTWRNKAIFEE 813

>dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subterraneum]
          Length = 1178

 Score = 62.8 bits (151), Expect(2) = 1e-08
 Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ F+W   H+ L+TN +R +  +  S  C  CGN +ET+ H +R+C+ +  IW  L
Sbjct: 970 IQTFIWIAAHERLITNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHATRIWLRLVCH 1029

Query: 288  NP-NQFLSCVNWNSWLYTNLRKK--GVWRG 368
            N    F S +N   W++ NL  K  GV +G
Sbjct: 1030 NQITNFFSSLNCRDWIFMNLNSKEFGVQQG 1059

 Score = 24.3 bits (51), Expect(2) = 1e-08
 Identities = 11/32 (34%), Positives = 15/32 (46%)
 Frame = +1

Query: 352  KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
            KEFG ++  W  +F      IW  RN   F +
Sbjct: 1052 KEFGVQQGNWQSIFMVACWHIWTWRNKSIFEE 1083

>ref|XP_013751841.2| uncharacterized protein LOC106454232 [Brassica napus]
          Length = 1893

 Score = 63.2 bits (152), Expect = 1e-08
 Identities = 24/54 (44%), Positives = 38/54 (70%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIW 269
            +R FLW + HQ+++TN +R RR M+ + +C +C N NET+ H +R+C A+  IW
Sbjct: 1567 VRVFLWLVSHQVIMTNMERKRRHMSDNGMCTLCRNGNETILHALRDCQAAAGIW 1620

>gb|PRQ37815.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 760

 Score = 62.4 bits (150), Expect = 2e-08
 Identities = 29/80 (36%), Positives = 47/80 (58%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           L+ F W + H  LLTN +R +R M++   C +C N+ ET+ H +R+CS + SIW+ +
Sbjct: 440 LKSFFWLICHGKLLTNVERVKRRMSSDPSCPLCHNAPETIMHLLRDCSHASSIWNKIICL 499

Query: 288 NPNQFLSCVNWNSWLYTNLR 347
           +       ++W SWL  N+R
Sbjct: 500 DTITRAMHLDWMSWLAANIR 519

>sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750
          Length = 620

 Score = 62.0 bits (149), Expect = 3e-08
 Identities = 27/79 (34%), Positives = 45/79 (56%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ FLW +G+Q ++T  +RHRR ++ S +C++C    E++ H +R+C A L IW  +
Sbjct: 300 VKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQ 359

Query: 288 NPNQFLSCVNWNSWLYTNL 344
              Q     +   WLY NL
Sbjct: 360 RRQQGFFSKSLFEWLYDNL 378