BLASTX nr result
ID: Astragalus23_contig00023975
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00023975
         (369 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|GAU48919.1| hypothetical protein TSUD_301740 [Trifolium subt...    65   9e-20
dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subt...    70   6e-16
gb|PNX68200.1| pentatricopeptide repeat-containing protein, part...    69   1e-15
dbj|GAU40391.1| hypothetical protein TSUD_265350 [Trifolium subt...    72   5e-14
gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca...    61   1e-13
gb|KYP48093.1| Transposon TX1 uncharacterized [Cajanus cajan]          59   1e-13
gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family ...    56   3e-13
gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]         59   2e-12
gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]         59   2e-12
gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ...    49   6e-12
gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family ...    59   8e-12
dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt...    64   9e-12
dbj|GAU19712.1| hypothetical protein TSUD_78370 [Trifolium subte...    68   2e-11
dbj|GAU13516.1| hypothetical protein TSUD_128130 [Trifolium subt...    67   2e-11
dbj|GAU20190.1| hypothetical protein TSUD_352540 [Trifolium subt...    70   2e-11
gb|KHN20416.1| Putative ribonuclease H protein [Glycine soja]          67   3e-11
dbj|GAU10838.1| hypothetical protein TSUD_425970, partial [Trifo...    67   3e-11
gb|PNY11585.1| pentatricopeptide repeat-containing protein [Trif...    69   3e-11
dbj|GAU41598.1| hypothetical protein TSUD_196670 [Trifolium subt...    68   3e-11
gb|PNX71723.1| ribonuclease H [Trifolium pratense]                     68   4e-11

>dbj|GAU48919.1| hypothetical protein TSUD_301740 [Trifolium subterraneum]
          Length = 640

 Score = 65.5 bits (158), Expect(3) = 9e-20
 Identities = 25/48 (52%), Positives = 32/48 (66%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCE 153
           GNW K+WKL VP K+K+ LW + RGCLP++  L  K V CD  C  C+
Sbjct: 299 GNWTKLWKLNVPNKVKIFLWRSLRGCLPVKERLIPKGVQCDSKCICCD 346

 Score = 40.0 bits (92), Expect(3) = 9e-20
 Identities = 13/45 (28%), Positives = 25/45 (55%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNL 4
           EWH  F C   + +W+ES+ W  + Q +   + F + +FS + ++
Sbjct: 352 EWHCFFGCKAAQEVWIESEFWESLHQKIEAAVGFKQLVFSLIESM 396

 Score = 38.9 bits (89), Expect(3) = 9e-20
 Identities = 17/24 (70%), Positives = 20/24 (83%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLKE 296
           G Y+VRSAYY LME  I+NNHL+E
Sbjct: 274 GKYSVRSAYYQLMEVIIDNNHLRE 297

>dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subterraneum]
          Length = 1688

 Score = 69.7 bits (169), Expect(3) = 6e-16
 Identities = 27/54 (50%), Positives = 39/54 (72%)
 Frame = -2

Query: 296  GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEIDTN 135
            GNW KIW+L++PQK+K+ LW AARGCLP R  L+ K V+C   C +C++  + +
Sbjct: 1414 GNWGKIWELKIPQKMKVFLWRAARGCLPTRYRLQQKGVNCPHTCAYCQNNFEND 1467

 Score = 33.5 bits (75), Expect(3) = 6e-16
 Identities = 15/46 (32%), Positives = 24/46 (52%)
 Frame = -1

Query: 138  EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
            +WH+ F C K + IW E+ +W+ ++    +   F    FS L  LS
Sbjct: 1467 DWHVFFGCVKAQEIWEEAGLWSFIEGMFESTEGFVSLFFSLLELLS 1512

 Score = 28.1 bits (61), Expect(3) = 6e-16
 Identities = 11/23 (47%), Positives = 17/23 (73%)
 Frame = -3

Query: 367  GFYTVRSAYYSLMEDFINNNHLK 299
            G Y+V+SAYY  ME+ ++N  L+
Sbjct: 1389 GEYSVKSAYYYTMENLVDNTGLR 1411

>gb|PNX68200.1| pentatricopeptide repeat-containing protein, partial [Trifolium pratense]
          Length = 220

 Score = 68.6 bits (166), Expect(3) = 1e-15
 Identities = 27/54 (50%), Positives = 38/54 (70%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEIDTN 135
           GNW KIW L++PQK+K+ LW AARGCLP R  L+ K V+C   C +C++  + +
Sbjct: 39  GNWGKIWGLKIPQKMKVFLWRAARGCLPTRYRLQRKGVNCPHTCAYCQNNFEND 92

 Score = 33.5 bits (75), Expect(3) = 1e-15
 Identities = 15/46 (32%), Positives = 24/46 (52%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           +WH+ F C K + IW E+ +W+ ++    +   F    FS L  LS
Sbjct: 92  DWHVFFGCVKAQEIWEEAGLWSLIEGMFESAEGFVSLFFSLLELLS 137

 Score = 28.1 bits (61), Expect(3) = 1e-15
 Identities = 11/23 (47%), Positives = 17/23 (73%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLK 299
           G Y+V+SAYY  ME+ ++N  L+
Sbjct: 14  GEYSVKSAYYYTMENLVDNTGLR 36

>dbj|GAU40391.1| hypothetical protein TSUD_265350 [Trifolium subterraneum]
          Length = 176

 Score = 72.0 bits (175), Expect(2) = 5e-14
 Identities = 28/47 (59%), Positives = 33/47 (70%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           GNW K+WKL+VP K+K+ LW   RGCLP+R  LR K V CD  CP C
Sbjct: 3   GNWKKLWKLKVPNKVKIFLWRVLRGCLPVRARLRSKGVQCDTKCPCC 49

 Score = 33.1 bits (74), Expect(2) = 5e-14
 Identities = 13/45 (28%), Positives = 22/45 (48%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNL 4
           EWH    C   + +W+E++ W  V++   N + F   I + L  L
Sbjct: 56  EWHCFVGCMSAQEVWMETECWPAVEKYSTNAMSFVSMILNMLEEL 100

>gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan]
          Length = 816

 Score = 61.2 bits (147), Expect(3) = 1e-13
 Identities = 22/47 (46%), Positives = 30/47 (63%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           G+WMK+W L++P   ++ LW   RGC+P R NL+ K V C   CP C
Sbjct: 497 GDWMKLWSLKIPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHC 543

 Score = 31.6 bits (70), Expect(3) = 1e-13
 Identities = 11/46 (23%), Positives = 24/46 (52%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           EWH+ + C    +IW++S  W  +   +   + F ++ +  L +L+
Sbjct: 550 EWHLFYSCPAALSIWIDSGCWPRIAHIVEQGISFIDTTWKLLGHLT 595

 Score = 30.4 bits (67), Expect(3) = 1e-13
 Identities = 13/23 (56%), Positives = 18/23 (78%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLK 299
           G Y+VR+AY+ LME  I+NN L+
Sbjct: 472 GSYSVRTAYHHLMEHVISNNTLR 494

>gb|KYP48093.1| Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 748

 Score = 59.3 bits (142), Expect(3) = 1e-13
 Identities = 22/47 (46%), Positives = 30/47 (63%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           GNW ++W L+VP  +K+ LW  ARGCLP R NL+ + + C   C  C
Sbjct: 670 GNWKQLWSLKVPNTMKIFLWRIARGCLPSRMNLQQRGIPCTSLCAHC 716

 Score = 32.3 bits (72), Expect(3) = 1e-13
 Identities = 10/24 (41%), Positives = 15/24 (62%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEV 67
           EWH+ F C  TK+ W+ S +W  +
Sbjct: 723 EWHIFFGCQTTKSYWMTSGLWPSI 746

 Score = 31.6 bits (70), Expect(3) = 1e-13
 Identities = 13/22 (59%), Positives = 17/22 (77%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHL 302
           G Y+V+SAYY +ME  I+N HL
Sbjct: 645 GSYSVKSAYYYVMESLISNAHL 666

>gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 606

 Score = 56.2 bits (134), Expect(3) = 3e-13
 Identities = 22/52 (42%), Positives = 29/52 (55%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  +W ++ P   K+ LW   RGCLP R NL+ + V C   CP C   I+
Sbjct: 286 GNWSMLWSMKAPNTKKIFLWRVLRGCLPTRLNLQRRHVPCTMLCPTCSAGIE 337

 Score = 37.0 bits (84), Expect(3) = 3e-13
 Identities = 16/46 (34%), Positives = 24/46 (52%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           EWH+ F C + K IW  S  W ++ Q + +     ++IF  L  LS
Sbjct: 339 EWHIFFECVEAKDIWAASGFWPKISQIIADSDGIQQAIFQLLQCLS 384

 Score = 28.9 bits (63), Expect(3) = 3e-13
 Identities = 12/23 (52%), Positives = 15/23 (65%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLK 299
           G YT++SAYY LME    N  L+
Sbjct: 261 GMYTIKSAYYQLMEHLTPNVDLR 283

>gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]
          Length = 547

 Score = 58.5 bits (140), Expect(3) = 2e-12
 Identities = 21/47 (44%), Positives = 29/47 (61%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           G+WMK+W L++P   ++ LW   RGC+P   NL+ K V C   CP C
Sbjct: 353 GDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHC 399

 Score = 31.6 bits (70), Expect(3) = 2e-12
 Identities = 11/46 (23%), Positives = 25/46 (54%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           EWH+ + C    +IW++S  W  + + +   + F ++ +  L +L+
Sbjct: 406 EWHLFYSCPAAISIWIDSGCWPRIARIVEQGISFIDTTWKLLGHLT 451

 Score = 28.9 bits (63), Expect(3) = 2e-12
 Identities = 12/23 (52%), Positives = 18/23 (78%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLK 299
           G ++VR+AY+ LME  I+NN L+
Sbjct: 328 GSFSVRTAYHHLMEHVISNNTLR 350

>gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]
          Length = 536

 Score = 58.5 bits (140), Expect(3) = 2e-12
 Identities = 21/47 (44%), Positives = 29/47 (61%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           G+WMK+W L++P   ++ LW   RGC+P   NL+ K V C   CP C
Sbjct: 342 GDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHC 388

 Score = 31.6 bits (70), Expect(3) = 2e-12
 Identities = 11/46 (23%), Positives = 25/46 (54%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           EWH+ + C    +IW++S  W  + + +   + F ++ +  L +L+
Sbjct: 395 EWHLFYSCPAAISIWIDSGCWPRIARIVEQGISFIDTTWKLLGHLT 440

 Score = 28.9 bits (63), Expect(3) = 2e-12
 Identities = 12/23 (52%), Positives = 18/23 (78%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHLK 299
           G ++VR+AY+ LME  I+NN L+
Sbjct: 317 GSFSVRTAYHHLMEHVISNNTLR 339

>gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 406

 Score = 48.5 bits (114), Expect(3) = 6e-12
 Identities = 20/47 (42%), Positives = 27/47 (57%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFC 156
           GNW +IW L+V   +K+ LW  AR CLP R NL+ + +     C  C
Sbjct: 181 GNWKQIWSLKVLNTMKIFLWRIARRCLPSRMNLQQRGIPRTSLCAHC 227

 Score = 37.7 bits (86), Expect(3) = 6e-12
 Identities = 15/45 (33%), Positives = 25/45 (55%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNL 4
           EWH+ F C   ++IW+   +W      + N   F ++IFS ++NL
Sbjct: 234 EWHIFFGCQTAESIWMTFGLWPSTNAYIDNGEDFKDTIFSLISNL 278

 Score = 31.2 bits (69), Expect(3) = 6e-12
 Identities = 13/22 (59%), Positives = 16/22 (72%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNNHL 302
           G Y V+SAYY +ME  I+N HL
Sbjct: 156 GSYYVKSAYYYVMESLISNTHL 177

>gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 507

 Score = 58.9 bits (141), Expect(3) = 8e-12
 Identities = 23/54 (42%), Positives = 31/54 (57%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEIDTN 135
           G+W K+W L +P  +K+ LW   R CLP R+ L+ K V C   CP CE   + N
Sbjct: 282 GDWKKLWALPIPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENN 335

 Score = 32.0 bits (71), Expect(3) = 8e-12
 Identities = 13/45 (28%), Positives = 23/45 (51%)
 Frame = -1

Query: 135 WHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           WH+ F C + + +W  + IW  ++  +       E IFS L ++S
Sbjct: 336 WHIFFGCQEAQTVWQATGIWQHIKSLIDVGEGIVEVIFSLLGSIS 380

 Score = 26.2 bits (56), Expect(3) = 8e-12
 Identities = 10/20 (50%), Positives = 15/20 (75%)
 Frame = -3

Query: 367 GFYTVRSAYYSLMEDFINNN 308
           G Y+V+SAYY +ME  + N+
Sbjct: 257 GSYSVKSAYYLIMESLLCNS 276

>dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum]
          Length = 372

 Score = 63.9 bits (154), Expect(2) = 9e-12
 Identities = 25/51 (49%), Positives = 33/51 (64%)
 Frame = -2

Query: 287 MKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEIDTN 135
           M+IW +++PQKIK+ LW AARGCLP R  LR + V C   C  CE   + +
Sbjct: 1   MQIWNMKIPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFEND 51

 Score = 33.5 bits (75), Expect(2) = 9e-12
 Identities = 9/28 (32%), Positives = 19/28 (67%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQAL 55
           +WH+ F C+K + +W E+ +W+ ++  L
Sbjct: 51  DWHVFFGCNKVEEVWAEAGLWSFIRDKL 78

>dbj|GAU19712.1| hypothetical protein TSUD_78370 [Trifolium subterraneum]
          Length = 191

 Score = 67.8 bits (164), Expect = 2e-11
 Identities = 27/52 (51%), Positives = 35/52 (67%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK Q P K + LLW   RGCLP R  L  +RV+C+ +CP C++EI+
Sbjct: 14  GNWNDIWKAQAPHKARHLLWRLCRGCLPTRYRLLERRVECNFNCPVCDEEIE 65

>dbj|GAU13516.1| hypothetical protein TSUD_128130 [Trifolium subterraneum]
          Length = 140

 Score = 66.6 bits (161), Expect = 2e-11
 Identities = 27/52 (51%), Positives = 34/52 (65%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK Q P K + LLW   RGCLP R  L  +RV+C  +CP C++EI+
Sbjct: 2   GNWNGIWKAQAPHKARHLLWRLCRGCLPTRSRLLERRVECTLNCPVCDEEIE 53

>dbj|GAU20190.1| hypothetical protein TSUD_352540 [Trifolium subterraneum]
          Length = 407

 Score = 69.7 bits (169), Expect = 2e-11
 Identities = 28/52 (53%), Positives = 35/52 (67%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK Q P K + LLWH  RGCLP R  L  +RV+C  +CP C++EI+
Sbjct: 2   GNWNGIWKAQAPHKARHLLWHLCRGCLPTRSRLLERRVECTLNCPVCDEEIE 53

>gb|KHN20416.1| Putative ribonuclease H protein [Glycine soja]
          Length = 249

 Score = 67.4 bits (163), Expect(2) = 3e-11
 Identities = 26/52 (50%), Positives = 34/52 (65%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           G W  +WKLQVP K+K+ +W A RGCLP R  L+ K V C G CP C + ++
Sbjct: 14  GRWKDLWKLQVPNKVKVFIWRAVRGCLPTRLRLQTKGVVCTGICPLCLNNLE 65

 Score = 28.5 bits (62), Expect(2) = 3e-11
 Identities = 12/46 (26%), Positives = 21/46 (45%)
 Frame = -1

Query: 138 EWHMVFVCHKTKAIWVESKIWNEVQQALINVL*FNESIFSALNNLS 1
           EWH +  C      W  +  WN ++  + +   F++ IF  L  +S
Sbjct: 67  EWHCLVACPSNLVCWKLAGFWNVIRVQVDSADSFDDLIFRLLARIS 112

>dbj|GAU10838.1| hypothetical protein TSUD_425970, partial [Trifolium subterraneum]
          Length = 170

 Score = 67.0 bits (162), Expect = 3e-11
 Identities = 27/52 (51%), Positives = 35/52 (67%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK Q P K + LLW   RGCLP R  L  +RV+C+ +CP C++EI+
Sbjct: 112 GNWNGIWKAQAPHKARHLLWRLCRGCLPTRYRLLERRVECNLNCPVCDEEIE 163

>gb|PNY11585.1| pentatricopeptide repeat-containing protein [Trifolium pratense]
          Length = 300

 Score = 68.9 bits (167), Expect = 3e-11
 Identities = 27/52 (51%), Positives = 36/52 (69%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK Q P K++ LLW   RGCLP R  L  +RV+C+ +CP C++EI+
Sbjct: 123 GNWNDIWKAQAPHKVRHLLWRLCRGCLPTRYRLLERRVECNLNCPVCDEEIE 174

>dbj|GAU41598.1| hypothetical protein TSUD_196670 [Trifolium subterraneum]
          Length = 244

 Score = 68.2 bits (165), Expect = 3e-11
 Identities = 28/52 (53%), Positives = 35/52 (67%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW  IWK QVP K + LLW   RGCLP R  L  +RV+C  +CP C++EI+
Sbjct: 14  GNWNGIWKAQVPHKARHLLWRLCRGCLPTRSRLLERRVECTLNCPVCDEEIE 65

>gb|PNX71723.1| ribonuclease H [Trifolium pratense]
          Length = 232

 Score = 67.8 bits (164), Expect = 4e-11
 Identities = 28/52 (53%), Positives = 33/52 (63%)
 Frame = -2

Query: 296 GNWMKIWKLQVPQKIKLLLWHAARGCLPMRRNLRIKRVDCDGHCPFCEDEID 141
           GNW +IWK   P K + LLW   RGCLP RRN     VDCD HC  CE+E++
Sbjct: 119 GNWKEIWKAHAPHKARHLLWRLCRGCLPTRRN-----VDCDVHCSLCEEEVE 165