BLASTX nr result
ID: Astragalus22_contig00029192
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00029192
         (319 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...   90   1e-18
ref|XP_015935830.1| uncharacterized protein LOC107461787 [Arachi...   74   5e-13
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...   72   3e-12
dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subte...   71   4e-12
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta...   70   1e-11
gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family ...   69   1e-11
gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family,...   67   9e-11
ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanu...   67   9e-11
ref|XP_018474025.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   67   9e-11
ref|XP_019086326.1| PREDICTED: uncharacterized protein LOC109126...   67   1e-10
ref|XP_018460645.1| PREDICTED: uncharacterized protein LOC108831...   66   3e-10
gb|KYP44638.1| Putative ribonuclease H protein At1g65750 family,...   65   4e-10
gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family,...   65   4e-10
gb|KHN24231.1| Putative ribonuclease H protein [Glycine soja]         65   6e-10
gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus...   65   6e-10
gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family ...   64   1e-09
gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family,...   64   1e-09
gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family,...   63   1e-09
ref|XP_006304881.2| LOW QUALITY PROTEIN: uncharacterized protein...   64   2e-09
ref|XP_021611887.1| uncharacterized protein LOC110614619 [Maniho...   63   2e-09

>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 90.1 bits (222), Expect = 1e-18
 Identities = 47/93 (50%), Positives = 55/93 (59%), Gaps = 3/93 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F W G ERI+LFLWKA L N ER RR +T K+C CN QD+ LLH FRDC
Sbjct: 921 FNLVWKWRGPERIKLFLWKATHACLLTNMERLRRKMTASKVCSRCNLQDESLLHVFRDCN 980

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLS 23
           IW NL + +SFF ++DW WLL NLS
Sbjct: 981 FSKSIWQNL-NVQNRRSFFHENDWHQWLLTNLS 1012

>ref|XP_015935830.1| uncharacterized protein LOC107461787 [Arachis duranensis]
          Length = 1370

 Score = 73.9 bits (180), Expect = 5e-13
 Identities = 40/96 (41%), Positives = 53/96 (55%), Gaps = 3/96 (3%)
 Frame = -1

Query: 295 SFQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
           +F+ W+G ERIR FLW A L N+ER RRHLTN CP C ++ +H RDC
Sbjct: 950 NFRLVWRWQGPERIRTFLWLATHNVILTNSERKRRHLTNDDSCPRCRCHEESTIHVLRDC 1009

Query: 124  KVRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
            IW L + SFF + D ++WLL NL ++
Sbjct: 1010 FYAKSIWRKLFPPIGINSFF-NTDLNEWLLQNLKSN 1044

>ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 71.6 bits (174), Expect = 3e-12
 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 3/98 (3%)
 Frame = -1

Query: 295  SFQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
            +F+ W+G ERIR FLW L N+E+ RRHLTN CP C ++ +H RDC
Sbjct: 1574 NFRLVWNWQGPERIRTFLWLVTHNAILTNSEKRRRHLTNDDTCPRCRSHEESTIHVLRDC 1633

Query: 124  KVRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHRN 11
            IW L+ SFF + + ++WL NL+ ++N
Sbjct: 1634 PYAMSIWNRLIPPNGRSSFF-NTELNEWLYQNLTTNKN 1670

>dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subterraneum]
          Length = 482

 Score = 71.2 bits (173), Expect = 4e-12
 Identities = 36/96 (37%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W+G RI+ FLWK L N ER R++TN +CP C D + ++HC RDC+
Sbjct: 178 FEKVWHWKGPNRIKAFLWKLSQGRLLTNEERRHRNMTNSDLCPRCQDYPESIMHCLRDCE 237

Query: 121 VRSPIWYNLMGYVHSQSFFQD--HDWSDWLL*NLSN 20
           W N++ FF ++W DW NLSN
Sbjct: 238 DAREFWTNIINPEVWSKFFSIGLNNWLDW---NLSN 270

>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 970

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
 Frame = -1

Query: 262 ERIRLFLWKAGSLA---NAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
           ER+R+F+W + N ER RRHL++I C VCN D+ +LH RDC +PIW L+
Sbjct: 654 ERVRVFIWLVSHMVIMTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLL 713

Query: 91  GYVHSQSFFQDHDW 50
           FF +W
Sbjct: 714 PQRRQNEFFSQFEW 727

>gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 289

 Score = 68.9 bits (167), Expect = 1e-11
 Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W G +RIRL LW+ L N R RR + +CPVC Q K H RDC
Sbjct: 73  FKLIWKWPGPQRIRLLLWRIVHNALLTNENRSRRRMAKCNLCPVCQSQPKTTFHVLRDCP 132

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
           +W L+ H ++FF D D W+L NL ++
Sbjct: 133 PTELLWRKLLFQSH-ETFFDDMDIQLWILHNLDDY 166

>gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 930

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 34/95 (35%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W GL+RIRL LW+ L N R RR + +CPVC Q + H RDC
Sbjct: 689 FKLIWKWPGLQRIRLLLWRILHNALLTNENRSRRRMAQCNLCPVCQSQPETTFHVLRDCP 748

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
           +W L+ H ++FF D D W+L N +
Sbjct: 749 PTELLWRKLLFQSH-ETFFGDMDIQLWILHNFDGY 782

>ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanus cajan]
          Length = 1032

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 34/95 (35%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W GL+RIRL LW+ L N R RR + +CPVC Q + H RDC
Sbjct: 791 FKLIWKWPGLQRIRLLLWRILHNALLTNENRSRRRMAQCNLCPVCQSQPETTFHVLRDCP 850

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
           +W L+ H ++FF D D W+L N +
Sbjct: 851 PTELLWRKLLFQSH-ETFFGDMDIQLWILHNFDGY 884

>ref|XP_018474025.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC108845292 [Raphanus sativus]
          Length = 1593

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
 Frame = -1

Query: 283  CLAWEGL--ERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKV 119
            C W + ER++LFLW GS + NAER+RRHL+ +C VC + +LH RDC
Sbjct: 1262 CSMWRVVAPERVKLFLWLVGSHAIMTNAERYRRHLSGTDVCQVCRGGVETILHVLRDCPA 1321

Query: 118  RSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHR 14
            IW L+ +FF +WL NLS+++
Sbjct: 1322 MEGIWNKLVPRTKRDAFF-SMPLFEWLYRNLSDNK 1355

>ref|XP_019086326.1| PREDICTED: uncharacterized protein LOC109126886 [Camelina sativa]
          Length = 1556

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 33/86 (38%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
 Frame = -1

Query: 262  ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
            ER+R+FLW K + N ERHRRHL++ +C VC ++ ++H RDC S +W ++
Sbjct: 1393 ERVRVFLWMVVKQVIMTNVERHRRHLSDSGVCIVCKAGEETIIHILRDCPAISGVWMRII 1452

Query: 91   GYVHSQSFFQDHDWSDWLL*NLSNHR 14
            H QS F +W+ NLSN++
Sbjct: 1453 PPRH-QSLFFHQSLLEWVFTNLSNNQ 1477

>ref|XP_018460645.1| PREDICTED: uncharacterized protein LOC108831621 [Raphanus sativus]
          Length = 1963

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 33/85 (38%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
 Frame = -1

Query: 262  ERIRLFLWKAGSLA---NAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
            ER+++FLW G+ A NAER+RRHL+ +C VC + +LH RDC IW +
Sbjct: 1434 ERVKIFLWLVGNQAIMTNAERYRRHLSGTDVCQVCKGGIETILHVLRDCPAMEGIWSRTV 1493

Query: 91   GYVHSQSFFQDHDWSDWLL*NLSNH 17
            Q+FF +W+ NLS+H
Sbjct: 1494 QATKRQAFF-SMPLFEWIYRNLSDH 1517

>gb|KYP44638.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 260

 Score = 64.7 bits (156), Expect = 4e-10
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W+GLER+R+FLW+ + NA R RR +T CP+C+ + + H F C
Sbjct: 91  FKLIWNWKGLERVRIFLWRVAHESLMINAFRVRRRITTYSACPICSHDYEDMKHVFLYCP 150

Query: 121 VRSPIWYNLMGYVHSQSFFQDH--DWSDWLL*NLSNHRNQ 8
           +W L YV + FQ H D S WL +LS RNQ
Sbjct: 151 YARQVWSRLPSYVQA---FQSHNSDISIWLTHHLS-RRNQ 186

>gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 510

 Score = 65.5 bits (158), Expect = 4e-10
 Identities = 36/93 (38%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W G ERIR FLW+ L N R R +T +CPVC+D+ + L+H R+C
Sbjct: 273 FKLLWNWRGPERIRTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRECN 332

Query: 121 VRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           V +W N+ G +H + F DW WL NL
Sbjct: 333 VARSVWINIFNGRLH--TIFFTMDWMLWLEWNL 363

>gb|KHN24231.1| Putative ribonuclease H protein [Glycine soja]
          Length = 317

 Score = 64.7 bits (156), Expect = 6e-10
 Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F +W+G ER+R+ LWK G L N R R + CP C+ Q + +LHC RDC
Sbjct: 133 FNLIWSWKGPERMRILLWKIANEGLLTNKSRVTRAMAESSECPRCHLQPESILHCLRDCF 192

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHRN 11
           +W L G F HD WL+ NL + +N
Sbjct: 193 YAKQVWNTLSGN-SLNHLFCAHDCPQWLVSNLRSPQN 228

>gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus officinalis]
          Length = 320

 Score = 64.7 bits (156), Expect = 6e-10
 Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 3/82 (3%)
 Frame = -1

Query: 262 ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
           ER+R F W + G L NAER RRHLT CP C+ + + LH FRDC V + IW L
Sbjct: 68  ERVRTFAWLVVRGGVLTNAERWRRHLTEDDACPCCSSEPELALHLFRDCGVVTDIWTKLK 127

Query: 91  GYVHSQSFFQDHDWSDWLL*NL 26
           S + F +++ WL NL
Sbjct: 128 P-PFSWTEFYGSNYAQWLRLNL 148

>gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 506

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+ W G ERI+ FLW+ L N R R +T +CPVC+D+ + L+H RDC
Sbjct: 231 FKLLWNWRGPERIQTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRDCN 290

Query: 121 VRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           V +W N+ G +H+ F +W WL NL
Sbjct: 291 VARSVWINIFNGRLHTNFFTM--NWMLWLEWNL 321

>gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 812

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 38/96 (39%), Positives = 50/96 (52%), Gaps = 11/96 (11%)
 Frame = -1

Query: 280 LAWEG-------LERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFR 131
           LAW+ L RIR FLW+ L N R RR +T +CPVC+D+ + L+H R
Sbjct: 483 LAWKNAADGEFSLRRIRTFLWRLAHNSLLTNDLRMRRGMTMDPLCPVCHDELETLIHAMR 542

Query: 130 DCKVRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           DC V +W N+ G +H+ F DW WL NL
Sbjct: 543 DCNVARSVWINIFNGRLHTNFFTM--DWMLWLEWNL 576

>gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 255

 Score = 63.2 bits (152), Expect = 1e-09
 Identities = 37/87 (42%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
 Frame = -1

Query: 295 SFQACLAWEGLERIRLFLWKA--GSL-ANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
           +F+A W G ERIR+ LW+ GSL N R R L CPVC + LH RDC
Sbjct: 65  AFKAIWRWNGPERIRVLLWRVVHGSLMTNQVRVDRGLGTDPTCPVCMQGTESNLHALRDC 124

Query: 124 KVRSPIWYNLMGYVHSQSFFQD--HDW 50
           K + IWY G +SF +D HDW
Sbjct: 125 KFATEIWYRASGGSLPRSFAEDNIHDW 151

>ref|XP_006304881.2| LOW QUALITY PROTEIN: uncharacterized protein LOC17899005 [Capsella rubella]
          Length = 1833

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
 Frame = -1

Query: 274  WEGL--ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSP 110
            W L ER RLFLW + N ERHRRHL++ +C VC ++ +LH RDC +
Sbjct: 1505 WRALIPERTRLFLWLVVNRALMTNVERHRRHLSDTSVCSVCRSGEETILHILRDCPAMAG 1564

Query: 109  IWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHR 14
            +W L+ ++FF +W+ NLS+ R
Sbjct: 1565 LWERLVPRGKVRTFF-SLSLFEWVYENLSDTR 1595

>ref|XP_021611887.1| uncharacterized protein LOC110614619 [Manihot esculenta]
          Length = 243

 Score = 62.8 bits (151), Expect = 2e-09
 Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
 Frame = -1

Query: 274 WEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIW 104
           W G +RIR FLW L N ER RRH++ CP+C + + LLH FRDC +W
Sbjct: 15  WPGPQRIRTFLWLVDYKAILTNQERSRRHISAPDTCPICKREVESLLHVFRDCDHVRSLW 74

Query: 103 YNLMGYVHSQS-FFQDHDWSDWLL*NL 26
           NL + + + FF + +WL+ N+
Sbjct: 75  INLSPSLSAGTIFFSISNVREWLVDNI 101