BLASTX nr result

ID: Astragalus23_contig00033606 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00033606
         (453 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...    75   8e-14
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta...    70   2e-11
ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa c...    67   5e-10
gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family,...    65   7e-10
ref|XP_018510856.1| PREDICTED: uncharacterized protein LOC103844...    67   8e-10
ref|XP_020871723.1| uncharacterized protein LOC9299799 [Arabidop...    66   9e-10
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ...    66   9e-10
gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family,...    65   1e-09
ref|XP_022553441.1| uncharacterized protein LOC106384431 [Brassi...    66   1e-09
gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family ...    64   1e-09
ref|XP_013745405.2| uncharacterized protein LOC106448012 [Brassi...    66   1e-09
ref|XP_018435759.1| PREDICTED: uncharacterized protein LOC108808...    65   4e-09
dbj|GAU10577.1| hypothetical protein TSUD_420970, partial [Trifo...    64   6e-09
ref|XP_013658112.1| uncharacterized protein LOC106362816 [Brassi...    64   7e-09
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...    61   8e-09
dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subt...    61   8e-09
dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subte...    63   1e-08
ref|XP_013751841.2| uncharacterized protein LOC106454232 [Brassi...    63   1e-08
gb|PRQ37815.1| putative RNA-directed DNA polymerase [Rosa chinen...    62   2e-08
sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr...    62   3e-08

>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 74.7 bits (182), Expect(2) = 8e-14
 Identities = 36/79 (45%), Positives = 47/79 (59%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            ++ FLWK  H  LLTN +R RR M  SK+C  C   +E+L H  R+C+ S SIW  L+V 
Sbjct: 933  IKLFLWKATHACLLTNMERLRRKMTASKVCSRCNLQDESLLHVFRDCNFSKSIWQNLNVQ 992

Query: 288  NPNQFLSCVNWNSWLYTNL 344
            N   F    +W+ WL TNL
Sbjct: 993  NRRSFFHENDWHQWLLTNL 1011



 Score = 29.6 bits (65), Expect(2) = 8e-14
 Identities = 11/29 (37%), Positives = 19/29 (65%)
 Frame = +1

Query: 364  EEEDYWHILFGAVLDQIW*NRNNVEFSQR 450
            ++E  W + F  +LD+IW +RN+  FS +
Sbjct: 1018 KDEATWSLKFAIILDKIWYSRNSFIFSHK 1046


>gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 321

 Score = 70.5 bits (171), Expect = 2e-11
 Identities = 30/85 (35%), Positives = 52/85 (61%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW +  Q+++TN +R+RR ++ +++C+IC    ET+ H +R+C A   IWS L   
Sbjct: 6   VRVFLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSRLVPR 65

Query: 288 NPNQFLSCVNWNSWLYTNLRKKGVW 362
           +  +     +   W+Y NLR++G W
Sbjct: 66  DQIRQFFTASLLEWIYKNLRERGSW 90


>ref|XP_024190061.1| uncharacterized protein LOC112194030 [Rosa chinensis]
          Length = 1296

 Score = 67.0 bits (162), Expect = 5e-10
 Identities = 33/95 (34%), Positives = 50/95 (52%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            L+ FLW L H  LLTNA R +R +     C IC  ++E+L H  ++C A+L++W+   + 
Sbjct: 970  LKTFLWVLCHGKLLTNAHRVKRNLTDDDTCPICRCNSESLSHLFKDCPAALNVWNSFTLP 1029

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSF 392
             P +F   ++W  WL  NL  K     G     +F
Sbjct: 1030 QPVKFTFSMSWEGWLQANLFCKAKCNAGNPWCSTF 1064


>gb|KYP71397.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 510

 Score = 65.5 bits (158), Expect(2) = 7e-10
 Identities = 31/83 (37%), Positives = 45/83 (54%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW+L H  LLTN  R  R M    LC +C +  ETL H +REC+ + S+W  +   
Sbjct: 285 IRTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRECNVARSVWINIFNG 344

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     ++W  WL  NL ++G
Sbjct: 345 RLHTIFFTMDWMLWLEWNLLQQG 367



 Score = 25.4 bits (54), Expect(2) = 7e-10
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 369 ILFVVALDAIWNMRNNVVF 387


>ref|XP_018510856.1| PREDICTED: uncharacterized protein LOC103844431 [Brassica rapa]
          Length = 1833

 Score = 66.6 bits (161), Expect = 8e-10
 Identities = 29/79 (36%), Positives = 49/79 (62%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + HQ+++TN +R RR ++ + +C++C + NET+ H +R+C AS+ +W  L   
Sbjct: 1513 VRVFLWLVSHQVIMTNMERKRRHLSDNGMCQLCKSGNETILHTLRDCPASMGLWRRLVDP 1572

Query: 288  NPNQFLSCVNWNSWLYTNL 344
            +  Q     +   WLY NL
Sbjct: 1573 SRQQRFFDQSLLQWLYENL 1591


>ref|XP_020871723.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
 ref|XP_020871724.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
 ref|XP_020871725.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
 ref|XP_020871727.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
 ref|XP_020871728.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
 ref|XP_020871729.1| uncharacterized protein LOC9299799 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 592

 Score = 66.2 bits (160), Expect = 9e-10
 Identities = 28/79 (35%), Positives = 45/79 (56%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW + HQ ++TNA+R+RR +  +++C++C    ET+ H +R+C A   IW+     
Sbjct: 267 IRLFLWLVAHQAIMTNAERYRRHLGDTEICQVCKGGTETIIHALRDCPAMEGIWTRTVPL 326

Query: 288 NPNQFLSCVNWNSWLYTNL 344
              Q     +   WLY NL
Sbjct: 327 RKRQSFFASSLLEWLYANL 345


>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
           thaliana]
          Length = 676

 Score = 66.2 bits (160), Expect = 9e-10
 Identities = 32/82 (39%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
 Frame = +3

Query: 111 RRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIW-SILHV* 287
           R FLW +G+Q++LTNA+R RR MA S +C +C  ++E+L H +R+C A + IW  ++ V 
Sbjct: 357 RIFLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVM 416

Query: 288 NPNQFLSCVNWNSWLYTNLRKK 353
              +F    +   W+Y NL+++
Sbjct: 417 EQRRFFE-TSLLEWMYGNLKER 437


>gb|KYP42050.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 812

 Score = 65.1 bits (157), Expect(2) = 1e-09
 Identities = 31/83 (37%), Positives = 46/83 (55%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           +R FLW+L H  LLTN  R RR M    LC +C +  ETL H +R+C+ + S+W  +   
Sbjct: 498 IRTFLWRLAHNSLLTNDLRMRRGMTMDPLCPVCHDELETLIHAMRDCNVARSVWINIFNG 557

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     ++W  WL  NL ++G
Sbjct: 558 RLHTNFFTMDWMLWLEWNLLQQG 580



 Score = 25.4 bits (54), Expect(2) = 1e-09
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 582 ILFVVALDAIWTMRNNVVF 600


>ref|XP_022553441.1| uncharacterized protein LOC106384431 [Brassica napus]
          Length = 1859

 Score = 66.2 bits (160), Expect = 1e-09
 Identities = 32/99 (32%), Positives = 53/99 (53%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW  G Q+++TN +R+RR +  +  CE+C  + ET+ H +R+C A   IW+ +   
Sbjct: 1538 VRCFLWLAGQQVIMTNCERYRRHLGATNTCEVCKGAPETVLHVLRDCPAMEGIWNRVVPM 1597

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSFWSSF 404
               Q     +   WL+TNL         +++  S WS+F
Sbjct: 1598 GKRQTFFTQSLLQWLFTNL------GDNQMVGESTWSTF 1630


>gb|KYP62996.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 506

 Score = 63.5 bits (153), Expect(2) = 1e-09
 Identities = 30/83 (36%), Positives = 45/83 (54%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ FLW+L H  LLTN  R  R M    LC +C +  ETL H +R+C+ + S+W  +   
Sbjct: 243 IQTFLWRLAHNSLLTNDLRMHRGMTMDPLCPVCHDELETLIHAMRDCNVARSVWINIFNG 302

Query: 288 NPNQFLSCVNWNSWLYTNLRKKG 356
             +     +NW  WL  NL ++G
Sbjct: 303 RLHTNFFTMNWMLWLEWNLLQQG 325



 Score = 26.6 bits (57), Expect(2) = 1e-09
 Identities = 12/19 (63%), Positives = 12/19 (63%)
 Frame = +1

Query: 385 ILFGAVLDQIW*NRNNVEF 441
           ILF   LD IW  RNNV F
Sbjct: 327 ILFAVALDAIWTMRNNVVF 345


>ref|XP_013745405.2| uncharacterized protein LOC106448012 [Brassica napus]
 ref|XP_022544051.1| uncharacterized protein LOC111198962 [Brassica napus]
          Length = 1826

 Score = 65.9 bits (159), Expect = 1e-09
 Identities = 27/79 (34%), Positives = 48/79 (60%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + HQ+++TN +R RR ++ + +C++C N +ET+ H +R+C A++ +W  + + 
Sbjct: 1506 VRVFLWLVAHQVIMTNMERKRRHLSDNGMCQLCKNGDETIIHVLRDCPAAMGLWRKIVIL 1565

Query: 288  NPNQFLSCVNWNSWLYTNL 344
               Q         WLY NL
Sbjct: 1566 RKQQRFFNQPLLEWLYENL 1584


>ref|XP_018435759.1| PREDICTED: uncharacterized protein LOC108808055 [Raphanus sativus]
          Length = 1802

 Score = 64.7 bits (156), Expect = 4e-09
 Identities = 30/87 (34%), Positives = 48/87 (55%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW +G+Q ++TNA+R +R ++ + +C++C    ET+ H +R+C A   IW      
Sbjct: 1482 VRMFLWLVGNQAIMTNAERFQRHLSGTNVCQVCRGGIETILHVLRDCPAMKGIWDRFVPA 1541

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRG 368
               Q    +    WLY NL +K V  G
Sbjct: 1542 TRRQTFFSMTLYEWLYWNLCEKDVGSG 1568


>dbj|GAU10577.1| hypothetical protein TSUD_420970, partial [Trifolium subterraneum]
          Length = 426

 Score = 63.5 bits (153), Expect(2) = 6e-09
 Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ F+W   H+ LLTN +R +  +  S  C  CGN +ET+ H +R+C+ +  IW  L   
Sbjct: 262 IQTFIWIAAHERLLTNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHATRIWLRLVCH 321

Query: 288 NP-NQFLSCVNWNSWLYTNLRKK--GVWRG 368
           N    F S +N   W++ NL  K  GV +G
Sbjct: 322 NQITNFFSSLNCRDWIFMNLNSKEFGVQQG 351



 Score = 24.3 bits (51), Expect(2) = 6e-09
 Identities = 11/32 (34%), Positives = 15/32 (46%)
 Frame = +1

Query: 352 KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
           KEFG ++  W  +F      IW  RN   F +
Sbjct: 344 KEFGVQQGNWQSIFMVACWHIWTWRNKSIFEE 375


>ref|XP_013658112.1| uncharacterized protein LOC106362816 [Brassica napus]
          Length = 1707

 Score = 63.9 bits (154), Expect = 7e-09
 Identities = 32/99 (32%), Positives = 51/99 (51%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW  G Q+++TN +R+RR +  +  CE+C  + ET+ H +R+C A   IW+ +   
Sbjct: 1389 VRCFLWLAGQQVIMTNCERYRRHLGATNTCEVCKGAPETVLHVLRDCPAMEGIWNRVVPM 1448

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVWRGGRLLAHSFWSSF 404
               Q     +   WL+TNL    +         S WS+F
Sbjct: 1449 GKRQTFFTQSLLQWLFTNLGDNQM---------SIWSTF 1478


>ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 61.2 bits (147), Expect(2) = 8e-09
 Identities = 28/85 (32%), Positives = 42/85 (49%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            +R FLW + H  +LTN+++ RR +     C  C +  E+  H +R+C  ++SIW+ L   
Sbjct: 1587 IRTFLWLVTHNAILTNSEKRRRHLTNDDTCPRCRSHEESTIHVLRDCPYAMSIWNRLIPP 1646

Query: 288  NPNQFLSCVNWNSWLYTNLRKKGVW 362
            N          N WLY NL     W
Sbjct: 1647 NGRSSFFNTELNEWLYQNLTTNKNW 1671



 Score = 26.2 bits (56), Expect(2) = 8e-09
 Identities = 10/22 (45%), Positives = 13/22 (59%)
 Frame = +1

Query: 379  WHILFGAVLDQIW*NRNNVEFS 444
            W+ LFG  L  IW  RN + F+
Sbjct: 1671 WNCLFGVALSSIWYLRNKLVFN 1692


>dbj|GAU36844.1| hypothetical protein TSUD_213680 [Trifolium subterraneum]
          Length = 1025

 Score = 61.2 bits (147), Expect(2) = 8e-09
 Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 3/85 (3%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ F+W   H  LLTN +R +  +  S  C ICGN +ET+ H +R+C  +  IW  L + 
Sbjct: 700 IQTFMWIAAHARLLTNVRRSKWGVGVSPTCSICGNDDETMIHTLRDCIYATGIW--LRLV 757

Query: 288 NPNQ---FLSCVNWNSWLYTNLRKK 353
           + NQ   F S  +   W++ NL  K
Sbjct: 758 SSNQITNFFSSFDCREWIFLNLNTK 782



 Score = 26.2 bits (56), Expect(2) = 8e-09
 Identities = 11/32 (34%), Positives = 16/32 (50%)
 Frame = +1

Query: 352 KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
           K FG +++ W  +F  V   IW  RN   F +
Sbjct: 782 KNFGNQQESWKSIFMVVCWHIWTWRNKAIFEE 813


>dbj|GAU11804.1| hypothetical protein TSUD_75550 [Trifolium subterraneum]
          Length = 1178

 Score = 62.8 bits (151), Expect(2) = 1e-08
 Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 3/90 (3%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
            ++ F+W   H+ L+TN +R +  +  S  C  CGN +ET+ H +R+C+ +  IW  L   
Sbjct: 970  IQTFIWIAAHERLITNFRRSKWGVGVSPACSSCGNGDETIIHTLRDCAHATRIWLRLVCH 1029

Query: 288  NP-NQFLSCVNWNSWLYTNLRKK--GVWRG 368
            N    F S +N   W++ NL  K  GV +G
Sbjct: 1030 NQITNFFSSLNCRDWIFMNLNSKEFGVQQG 1059



 Score = 24.3 bits (51), Expect(2) = 1e-08
 Identities = 11/32 (34%), Positives = 15/32 (46%)
 Frame = +1

Query: 352  KEFGEEEDYWHILFGAVLDQIW*NRNNVEFSQ 447
            KEFG ++  W  +F      IW  RN   F +
Sbjct: 1052 KEFGVQQGNWQSIFMVACWHIWTWRNKSIFEE 1083


>ref|XP_013751841.2| uncharacterized protein LOC106454232 [Brassica napus]
          Length = 1893

 Score = 63.2 bits (152), Expect = 1e-08
 Identities = 24/54 (44%), Positives = 38/54 (70%)
 Frame = +3

Query: 108  LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIW 269
            +R FLW + HQ+++TN +R RR M+ + +C +C N NET+ H +R+C A+  IW
Sbjct: 1567 VRVFLWLVSHQVIMTNMERKRRHMSDNGMCTLCRNGNETILHALRDCQAAAGIW 1620


>gb|PRQ37815.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 760

 Score = 62.4 bits (150), Expect = 2e-08
 Identities = 29/80 (36%), Positives = 47/80 (58%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           L+ F W + H  LLTN +R +R M++   C +C N+ ET+ H +R+CS + SIW+ +   
Sbjct: 440 LKSFFWLICHGKLLTNVERVKRRMSSDPSCPLCHNAPETIMHLLRDCSHASSIWNKIICL 499

Query: 288 NPNQFLSCVNWNSWLYTNLR 347
           +       ++W SWL  N+R
Sbjct: 500 DTITRAMHLDWMSWLAANIR 519


>sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750
          Length = 620

 Score = 62.0 bits (149), Expect = 3e-08
 Identities = 27/79 (34%), Positives = 45/79 (56%)
 Frame = +3

Query: 108 LRRFLWKLGHQMLLTNAKRHRRFMATSKLCEICGNSNETLFH*VRECSASLSIWSILHV* 287
           ++ FLW +G+Q ++T  +RHRR ++ S +C++C    E++ H +R+C A L IW  +   
Sbjct: 300 VKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQ 359

Query: 288 NPNQFLSCVNWNSWLYTNL 344
              Q     +   WLY NL
Sbjct: 360 RRQQGFFSKSLFEWLYDNL 378


Top