BLASTX nr result

ID: Astragalus22_contig00029192 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00029192
         (319 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...    90   1e-18
ref|XP_015935830.1| uncharacterized protein LOC107461787 [Arachi...    74   5e-13
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...    72   3e-12
dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subte...    71   4e-12
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta...    70   1e-11
gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family ...    69   1e-11
gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family,...    67   9e-11
ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanu...    67   9e-11
ref|XP_018474025.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    67   9e-11
ref|XP_019086326.1| PREDICTED: uncharacterized protein LOC109126...    67   1e-10
ref|XP_018460645.1| PREDICTED: uncharacterized protein LOC108831...    66   3e-10
gb|KYP44638.1| Putative ribonuclease H protein At1g65750 family,...    65   4e-10
gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family,...    65   4e-10
gb|KHN24231.1| Putative ribonuclease H protein [Glycine soja]          65   6e-10
gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus...    65   6e-10
gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family ...    64   1e-09
gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family,...    64   1e-09
gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family,...    63   1e-09
ref|XP_006304881.2| LOW QUALITY PROTEIN: uncharacterized protein...    64   2e-09
ref|XP_021611887.1| uncharacterized protein LOC110614619 [Maniho...    63   2e-09

>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 90.1 bits (222), Expect = 1e-18
 Identities = 47/93 (50%), Positives = 55/93 (59%), Gaps = 3/93 (3%)
 Frame = -1

Query: 292  FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
            F     W G ERI+LFLWKA     L N ER RR +T  K+C  CN QD+ LLH FRDC 
Sbjct: 921  FNLVWKWRGPERIKLFLWKATHACLLTNMERLRRKMTASKVCSRCNLQDESLLHVFRDCN 980

Query: 121  VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLS 23
                IW NL    + +SFF ++DW  WLL NLS
Sbjct: 981  FSKSIWQNL-NVQNRRSFFHENDWHQWLLTNLS 1012


>ref|XP_015935830.1| uncharacterized protein LOC107461787 [Arachis duranensis]
          Length = 1370

 Score = 73.9 bits (180), Expect = 5e-13
 Identities = 40/96 (41%), Positives = 53/96 (55%), Gaps = 3/96 (3%)
 Frame = -1

Query: 295  SFQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
            +F+    W+G ERIR FLW A     L N+ER RRHLTN   CP C   ++  +H  RDC
Sbjct: 950  NFRLVWRWQGPERIRTFLWLATHNVILTNSERKRRHLTNDDSCPRCRCHEESTIHVLRDC 1009

Query: 124  KVRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
                 IW  L   +   SFF + D ++WLL NL ++
Sbjct: 1010 FYAKSIWRKLFPPIGINSFF-NTDLNEWLLQNLKSN 1044


>ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 71.6 bits (174), Expect = 3e-12
 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 3/98 (3%)
 Frame = -1

Query: 295  SFQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
            +F+    W+G ERIR FLW       L N+E+ RRHLTN   CP C   ++  +H  RDC
Sbjct: 1574 NFRLVWNWQGPERIRTFLWLVTHNAILTNSEKRRRHLTNDDTCPRCRSHEESTIHVLRDC 1633

Query: 124  KVRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHRN 11
                 IW  L+      SFF + + ++WL  NL+ ++N
Sbjct: 1634 PYAMSIWNRLIPPNGRSSFF-NTELNEWLYQNLTTNKN 1670


>dbj|GAU18772.1| hypothetical protein TSUD_80610 [Trifolium subterraneum]
          Length = 482

 Score = 71.2 bits (173), Expect = 4e-12
 Identities = 36/96 (37%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W+G  RI+ FLWK      L N ER  R++TN  +CP C D  + ++HC RDC+
Sbjct: 178 FEKVWHWKGPNRIKAFLWKLSQGRLLTNEERRHRNMTNSDLCPRCQDYPESIMHCLRDCE 237

Query: 121 VRSPIWYNLMGYVHSQSFFQD--HDWSDWLL*NLSN 20
                W N++       FF    ++W DW   NLSN
Sbjct: 238 DAREFWTNIINPEVWSKFFSIGLNNWLDW---NLSN 270


>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 970

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
 Frame = -1

Query: 262 ERIRLFLWKAGSLA---NAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
           ER+R+F+W    +    N ER RRHL++I  C VCN  D+ +LH  RDC   +PIW  L+
Sbjct: 654 ERVRVFIWLVSHMVIMTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLL 713

Query: 91  GYVHSQSFFQDHDW 50
                  FF   +W
Sbjct: 714 PQRRQNEFFSQFEW 727


>gb|KYP34286.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 289

 Score = 68.9 bits (167), Expect = 1e-11
 Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W G +RIRL LW+      L N  R RR +    +CPVC  Q K   H  RDC 
Sbjct: 73  FKLIWKWPGPQRIRLLLWRIVHNALLTNENRSRRRMAKCNLCPVCQSQPKTTFHVLRDCP 132

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
               +W  L+   H ++FF D D   W+L NL ++
Sbjct: 133 PTELLWRKLLFQSH-ETFFDDMDIQLWILHNLDDY 166


>gb|KYP64774.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 930

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 34/95 (35%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W GL+RIRL LW+      L N  R RR +    +CPVC  Q +   H  RDC 
Sbjct: 689 FKLIWKWPGLQRIRLLLWRILHNALLTNENRSRRRMAQCNLCPVCQSQPETTFHVLRDCP 748

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
               +W  L+   H ++FF D D   W+L N   +
Sbjct: 749 PTELLWRKLLFQSH-ETFFGDMDIQLWILHNFDGY 782


>ref|XP_020219748.1| uncharacterized protein LOC109802758 [Cajanus cajan]
          Length = 1032

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 34/95 (35%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
 Frame = -1

Query: 292  FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
            F+    W GL+RIRL LW+      L N  R RR +    +CPVC  Q +   H  RDC 
Sbjct: 791  FKLIWKWPGLQRIRLLLWRILHNALLTNENRSRRRMAQCNLCPVCQSQPETTFHVLRDCP 850

Query: 121  VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNH 17
                +W  L+   H ++FF D D   W+L N   +
Sbjct: 851  PTELLWRKLLFQSH-ETFFGDMDIQLWILHNFDGY 884


>ref|XP_018474025.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC108845292
            [Raphanus sativus]
          Length = 1593

 Score = 67.4 bits (163), Expect = 9e-11
 Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
 Frame = -1

Query: 283  CLAWEGL--ERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKV 119
            C  W  +  ER++LFLW  GS   + NAER+RRHL+   +C VC    + +LH  RDC  
Sbjct: 1262 CSMWRVVAPERVKLFLWLVGSHAIMTNAERYRRHLSGTDVCQVCRGGVETILHVLRDCPA 1321

Query: 118  RSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHR 14
               IW  L+      +FF      +WL  NLS+++
Sbjct: 1322 MEGIWNKLVPRTKRDAFF-SMPLFEWLYRNLSDNK 1355


>ref|XP_019086326.1| PREDICTED: uncharacterized protein LOC109126886 [Camelina sativa]
          Length = 1556

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 33/86 (38%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
 Frame = -1

Query: 262  ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
            ER+R+FLW   K   + N ERHRRHL++  +C VC   ++ ++H  RDC   S +W  ++
Sbjct: 1393 ERVRVFLWMVVKQVIMTNVERHRRHLSDSGVCIVCKAGEETIIHILRDCPAISGVWMRII 1452

Query: 91   GYVHSQSFFQDHDWSDWLL*NLSNHR 14
               H QS F      +W+  NLSN++
Sbjct: 1453 PPRH-QSLFFHQSLLEWVFTNLSNNQ 1477


>ref|XP_018460645.1| PREDICTED: uncharacterized protein LOC108831621 [Raphanus sativus]
          Length = 1963

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 33/85 (38%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
 Frame = -1

Query: 262  ERIRLFLWKAGSLA---NAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
            ER+++FLW  G+ A   NAER+RRHL+   +C VC    + +LH  RDC     IW   +
Sbjct: 1434 ERVKIFLWLVGNQAIMTNAERYRRHLSGTDVCQVCKGGIETILHVLRDCPAMEGIWSRTV 1493

Query: 91   GYVHSQSFFQDHDWSDWLL*NLSNH 17
                 Q+FF      +W+  NLS+H
Sbjct: 1494 QATKRQAFF-SMPLFEWIYRNLSDH 1517


>gb|KYP44638.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 260

 Score = 64.7 bits (156), Expect = 4e-10
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W+GLER+R+FLW+      + NA R RR +T    CP+C+   + + H F  C 
Sbjct: 91  FKLIWNWKGLERVRIFLWRVAHESLMINAFRVRRRITTYSACPICSHDYEDMKHVFLYCP 150

Query: 121 VRSPIWYNLMGYVHSQSFFQDH--DWSDWLL*NLSNHRNQ 8
               +W  L  YV +   FQ H  D S WL  +LS  RNQ
Sbjct: 151 YARQVWSRLPSYVQA---FQSHNSDISIWLTHHLS-RRNQ 186


>gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 510

 Score = 65.5 bits (158), Expect = 4e-10
 Identities = 36/93 (38%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W G ERIR FLW+      L N  R  R +T   +CPVC+D+ + L+H  R+C 
Sbjct: 273 FKLLWNWRGPERIRTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRECN 332

Query: 121 VRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           V   +W N+  G +H  + F   DW  WL  NL
Sbjct: 333 VARSVWINIFNGRLH--TIFFTMDWMLWLEWNL 363


>gb|KHN24231.1| Putative ribonuclease H protein [Glycine soja]
          Length = 317

 Score = 64.7 bits (156), Expect = 6e-10
 Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F    +W+G ER+R+ LWK    G L N  R  R +     CP C+ Q + +LHC RDC 
Sbjct: 133 FNLIWSWKGPERMRILLWKIANEGLLTNKSRVTRAMAESSECPRCHLQPESILHCLRDCF 192

Query: 121 VRSPIWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHRN 11
               +W  L G       F  HD   WL+ NL + +N
Sbjct: 193 YAKQVWNTLSGN-SLNHLFCAHDCPQWLVSNLRSPQN 228


>gb|ONK68084.1| uncharacterized protein A4U43_C05F7260 [Asparagus officinalis]
          Length = 320

 Score = 64.7 bits (156), Expect = 6e-10
 Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 3/82 (3%)
 Frame = -1

Query: 262 ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIWYNLM 92
           ER+R F W   + G L NAER RRHLT    CP C+ + +  LH FRDC V + IW  L 
Sbjct: 68  ERVRTFAWLVVRGGVLTNAERWRRHLTEDDACPCCSSEPELALHLFRDCGVVTDIWTKLK 127

Query: 91  GYVHSQSFFQDHDWSDWLL*NL 26
               S + F   +++ WL  NL
Sbjct: 128 P-PFSWTEFYGSNYAQWLRLNL 148


>gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 506

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
 Frame = -1

Query: 292 FQACLAWEGLERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCK 122
           F+    W G ERI+ FLW+      L N  R  R +T   +CPVC+D+ + L+H  RDC 
Sbjct: 231 FKLLWNWRGPERIQTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRDCN 290

Query: 121 VRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           V   +W N+  G +H+  F    +W  WL  NL
Sbjct: 291 VARSVWINIFNGRLHTNFFTM--NWMLWLEWNL 321


>gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 812

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 38/96 (39%), Positives = 50/96 (52%), Gaps = 11/96 (11%)
 Frame = -1

Query: 280 LAWEG-------LERIRLFLWKAGS---LANAERHRRHLTNIKICPVCNDQDKFLLHCFR 131
           LAW+        L RIR FLW+      L N  R RR +T   +CPVC+D+ + L+H  R
Sbjct: 483 LAWKNAADGEFSLRRIRTFLWRLAHNSLLTNDLRMRRGMTMDPLCPVCHDELETLIHAMR 542

Query: 130 DCKVRSPIWYNLM-GYVHSQSFFQDHDWSDWLL*NL 26
           DC V   +W N+  G +H+  F    DW  WL  NL
Sbjct: 543 DCNVARSVWINIFNGRLHTNFFTM--DWMLWLEWNL 576


>gb|KYP52103.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 255

 Score = 63.2 bits (152), Expect = 1e-09
 Identities = 37/87 (42%), Positives = 45/87 (51%), Gaps = 5/87 (5%)
 Frame = -1

Query: 295 SFQACLAWEGLERIRLFLWKA--GSL-ANAERHRRHLTNIKICPVCNDQDKFLLHCFRDC 125
           +F+A   W G ERIR+ LW+   GSL  N  R  R L     CPVC    +  LH  RDC
Sbjct: 65  AFKAIWRWNGPERIRVLLWRVVHGSLMTNQVRVDRGLGTDPTCPVCMQGTESNLHALRDC 124

Query: 124 KVRSPIWYNLMGYVHSQSFFQD--HDW 50
           K  + IWY   G    +SF +D  HDW
Sbjct: 125 KFATEIWYRASGGSLPRSFAEDNIHDW 151


>ref|XP_006304881.2| LOW QUALITY PROTEIN: uncharacterized protein LOC17899005 [Capsella
            rubella]
          Length = 1833

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 5/92 (5%)
 Frame = -1

Query: 274  WEGL--ERIRLFLW---KAGSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSP 110
            W  L  ER RLFLW       + N ERHRRHL++  +C VC   ++ +LH  RDC   + 
Sbjct: 1505 WRALIPERTRLFLWLVVNRALMTNVERHRRHLSDTSVCSVCRSGEETILHILRDCPAMAG 1564

Query: 109  IWYNLMGYVHSQSFFQDHDWSDWLL*NLSNHR 14
            +W  L+     ++FF      +W+  NLS+ R
Sbjct: 1565 LWERLVPRGKVRTFF-SLSLFEWVYENLSDTR 1595


>ref|XP_021611887.1| uncharacterized protein LOC110614619 [Manihot esculenta]
          Length = 243

 Score = 62.8 bits (151), Expect = 2e-09
 Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
 Frame = -1

Query: 274 WEGLERIRLFLWKA---GSLANAERHRRHLTNIKICPVCNDQDKFLLHCFRDCKVRSPIW 104
           W G +RIR FLW       L N ER RRH++    CP+C  + + LLH FRDC     +W
Sbjct: 15  WPGPQRIRTFLWLVDYKAILTNQERSRRHISAPDTCPICKREVESLLHVFRDCDHVRSLW 74

Query: 103 YNLMGYVHSQS-FFQDHDWSDWLL*NL 26
            NL   + + + FF   +  +WL+ N+
Sbjct: 75  INLSPSLSAGTIFFSISNVREWLVDNI 101

