BLASTX nr result

ID: Astragalus22_contig00036985

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00036985
         (405 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...    72   8e-12
gb|EOY25817.1| Non-LTR retroelement reverse transcriptase [Theob...    70   2e-11
gb|KHN06694.1| Putative ribonuclease H protein [Glycine soja]          66   6e-11
gb|PNX89278.1| ribonuclease H, partial [Trifolium pratense]            65   6e-11
gb|KYP76974.1| Putative ribonuclease H protein At1g65750, partia...    66   1e-10
dbj|GAU48278.1| hypothetical protein TSUD_405240 [Trifolium subt...    65   2e-10
gb|KRH65176.1| hypothetical protein GLYMA_03G018300 [Glycine max]      65   2e-10
gb|KRH65175.1| hypothetical protein GLYMA_03G018200 [Glycine max]      65   2e-10
gb|KRH65170.1| hypothetical protein GLYMA_03G018000 [Glycine max...    65   5e-10
gb|KYP46735.1| Putative ribonuclease H protein At1g65750 family,...    64   5e-10
gb|PNX65197.1| ribonuclease H, partial [Trifolium pratense]            62   6e-10
dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subt...    63   1e-09
ref|XP_015936169.1| uncharacterized protein LOC107462117 [Arachi...    65   1e-09
ref|XP_012084521.1| uncharacterized protein LOC105643892 [Jatrop...    63   2e-09
gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family ...    64   2e-09
gb|KYP33975.1| Putative ribonuclease H protein At1g65750 family ...    64   2e-09
gb|ESR53983.1| hypothetical protein CICLE_v10021474mg [Citrus cl...    63   6e-09
gb|KDO65745.1| hypothetical protein CISIN_1g039495mg, partial [C...    63   7e-09
ref|XP_009344908.1| PREDICTED: uncharacterized protein LOC103936...    63   7e-09
dbj|GAU12898.1| hypothetical protein TSUD_73850 [Trifolium subte...    63   8e-09

>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 71.6 bits (174), Expect = 8e-12
 Identities = 33/87 (37%), Positives = 55/87 (63%)
 Frame = -3

Query: 370  PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
            P +F+ L+CDGA T      GCGGV RN    F++AF     + S++  ELWGI  G+++
Sbjct: 1088 PENFIALNCDGAVTGLTGLAGCGGVLRNCHGGFLVAFSARAGSVSVVHAELWGIINGLEL 1147

Query: 190  GCDRGMRKLVVCTDSMEAVQLLQSPLN 110
              ++G++++ V +DSM A+ L+++  N
Sbjct: 1148 AKNKGLKRIRVESDSMIAINLIRNGCN 1174


>gb|EOY25817.1| Non-LTR retroelement reverse transcriptase [Theobroma cacao]
          Length = 874

 Score = 70.5 bits (171), Expect = 2e-11
 Identities = 34/109 (31%), Positives = 63/109 (57%)
 Frame = -3

Query: 370  PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
            P D++ ++ DGA   A R+T  GGV R+   ++I+ +   LE  S+   ELWG+Y G+++
Sbjct: 756  PEDWITVNLDGAFKSAARTTAAGGVLRDAHGTWIVGYACKLETSSVFRAELWGVYKGLQL 815

Query: 190  GCDRGMRKLVVCTDSMEAVQLLQSPLNYVSNSLVKDILDTINGVCVMLD 44
              +RG RK+ + +D+   VQ +     +  ++L  D++  I G C++ +
Sbjct: 816  AWERGFRKVKLQSDNKAVVQAISFSSVHPCSNL--DLIRAIKGPCLLTE 862


>gb|KHN06694.1| Putative ribonuclease H protein [Glycine soja]
          Length = 139

 Score = 65.9 bits (159), Expect = 6e-11
 Identities = 36/105 (34%), Positives = 55/105 (52%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P DF+ L CDGA   +   T CGGV R+    FI AFL  L +  ILE ELW + +G+ +
Sbjct: 33  PVDFIKLKCDGAMVQSQSMTSCGGVIRDYKGHFIRAFLRKLRSCLILEAELWSLLFGMMM 92

Query: 190 GCDRGMRKLVVCTDSMEAVQLLQSPLNYVSNSLVKDILDTINGVC 56
             D  M  +++ +D +E V L+        +S     +D ++ +C
Sbjct: 93  LKDAEMPNVIIESDCLEVVHLVNG-----GSSPGHTFVDLVDEIC 132


>gb|PNX89278.1| ribonuclease H, partial [Trifolium pratense]
          Length = 99

 Score = 64.7 bits (156), Expect = 6e-11
 Identities = 30/86 (34%), Positives = 51/86 (59%)
 Frame = -3

Query: 376 LVPNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGI 197
           + P DF+ L CD A T      GCGGV RN    F++AF   + + S++  ELWGI +G+
Sbjct: 13  VAPADFIALSCDWAVTSQTGLAGCGGVLRNCHGGFLVAFFAKVGSVSVVRAELWGIIHGL 72

Query: 196 KIGCDRGMRKLVVCTDSMEAVQLLQS 119
            +  ++  +++ V  DS+ A+ L+++
Sbjct: 73  NLAKNKWHKRVRVEYDSLVAINLMKT 98


>gb|KYP76974.1| Putative ribonuclease H protein At1g65750, partial [Cajanus cajan]
          Length = 189

 Score = 66.2 bits (160), Expect = 1e-10
 Identities = 34/82 (41%), Positives = 46/82 (56%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P +F+ ++CDGA T  G     GGV RN +  FI AF   LE  SI+E ELW I  G++ 
Sbjct: 27  PQNFIKINCDGAFTSHGNKAAAGGVVRNWEGRFIFAFASALENCSIVEAELWAIKIGMEE 86

Query: 190 GCDRGMRKLVVCTDSMEAVQLL 125
              R    L+V  DS  A++L+
Sbjct: 87  AISRRFLNLIVENDSYSAIELV 108


>dbj|GAU48278.1| hypothetical protein TSUD_405240 [Trifolium subterraneum]
          Length = 161

 Score = 64.7 bits (156), Expect = 2e-10
 Identities = 32/80 (40%), Positives = 49/80 (61%)
 Frame = -3

Query: 361 FVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKIGCD 182
           FV L+ DG+   +  + G GG+ RN D  FI  F  D    SIL  E+  I+YG+K+G  
Sbjct: 15  FVCLNVDGSLLGSTNTAGYGGLLRNIDGEFIWGFYGDAAIQSILFAEIMAIWYGLKLGWK 74

Query: 181 RGMRKLVVCTDSMEAVQLLQ 122
           RG RK++ C+DS+ ++ L++
Sbjct: 75  RGFRKVLRCSDSLLSINLIK 94


>gb|KRH65176.1| hypothetical protein GLYMA_03G018300 [Glycine max]
          Length = 180

 Score = 65.1 bits (157), Expect = 2e-10
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDL-EAFSILECELWGIYYGIK 194
           P  F  L+CDGA T  G     GGV R+    FIL F   L E  S LE ELW I  G++
Sbjct: 22  PRGFFKLNCDGAFTVYGNKGAAGGVLRDWKGEFILGFSDALIECSSALEAELWAIKIGMQ 81

Query: 193 IGCDRGMRKLVVCTDSMEAVQLLQSPLNYVSNSLVKDILDTINGVC 56
               RG R L+V +DS++A+Q++ +       S ++ +   ++ +C
Sbjct: 82  TVVARGYRNLIVESDSLKAIQIINAHKGDFLRSSIQHMTRMVDRIC 127


>gb|KRH65175.1| hypothetical protein GLYMA_03G018200 [Glycine max]
          Length = 180

 Score = 65.1 bits (157), Expect = 2e-10
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDL-EAFSILECELWGIYYGIK 194
           P  F  L+CDGA T  G     GGV R+    FIL F   L E  S LE ELW I  G++
Sbjct: 22  PRGFFKLNCDGAFTVYGNKGAAGGVLRDWKGEFILGFSDALIECSSALEAELWAIKIGMQ 81

Query: 193 IGCDRGMRKLVVCTDSMEAVQLLQSPLNYVSNSLVKDILDTINGVC 56
               RG R L+V +DS++A+Q++ +       S ++ +   ++ +C
Sbjct: 82  TVVARGYRNLIVESDSLKAIQIINAHKGDFLRSSIQHMTRMVDRIC 127


>gb|KRH65170.1| hypothetical protein GLYMA_03G018000 [Glycine max]
 gb|KRH65171.1| hypothetical protein GLYMA_03G018000 [Glycine max]
 gb|KRH65172.1| hypothetical protein GLYMA_03G018000 [Glycine max]
 gb|KRH65173.1| hypothetical protein GLYMA_03G018000 [Glycine max]
          Length = 223

 Score = 65.1 bits (157), Expect = 5e-10
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDL-EAFSILECELWGIYYGIK 194
           P  F  L+CDGA T  G     GGV R+    FIL F   L E  S LE ELW I  G++
Sbjct: 65  PRGFFKLNCDGAFTVYGNKGAAGGVLRDWKGEFILGFSDALIECSSALEAELWAIKIGMQ 124

Query: 193 IGCDRGMRKLVVCTDSMEAVQLLQSPLNYVSNSLVKDILDTINGVC 56
               RG R L+V +DS++A+Q++ +       S ++ +   ++ +C
Sbjct: 125 TVVARGYRNLIVESDSLKAIQIINAHKGDFLRSSIQHMTRMVDRIC 170


>gb|KYP46735.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 166

 Score = 63.9 bits (154), Expect = 5e-10
 Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 1/84 (1%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P   + L+ DGA ++ G    CGGV R+   +F+LAF   L   SIL+ ELW IY G+ I
Sbjct: 9   PIGHIKLNGDGAVSNDGIGA-CGGVVRDSSGNFLLAFSKKLGCISILKAELWAIYQGLLI 67

Query: 190 GCDRGMRKLVVC-TDSMEAVQLLQ 122
             DR  R L++C +DS EAV+L++
Sbjct: 68  IKDRYSRSLIICESDSAEAVKLIE 91


>gb|PNX65197.1| ribonuclease H, partial [Trifolium pratense]
          Length = 94

 Score = 62.0 bits (149), Expect = 6e-10
 Identities = 30/78 (38%), Positives = 46/78 (58%)
 Frame = -3

Query: 376 LVPNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGI 197
           + P DF+ L CDGA T      GCGGV RN    F++AF   + + S++  EL GI YG+
Sbjct: 12  VAPADFIALSCDGAVTSQTGLAGCGGVLRNCHGGFLVAFSAKVGSVSVVRAELCGIIYGL 71

Query: 196 KIGCDRGMRKLVVCTDSM 143
            +  ++G +++ V  DS+
Sbjct: 72  DLAKNKGHKRIHVEFDSL 89


>dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subterraneum]
          Length = 153

 Score = 62.8 bits (151), Expect = 1e-09
 Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
 Frame = -3

Query: 370 PND-FVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIK 194
           P D +V L+CD A  + G + GCGG+FR+ D  +I  F   + A   L  E+WG+Y GI 
Sbjct: 47  PQDGWVKLNCDRACKELGETAGCGGLFRDSDGRWIKGFTRKIGACDALHAEMWGMYLGID 106

Query: 193 IGCDRGMRKLVVCTDSMEAVQLLQSPLN 110
           I    G+  L+V +DS   + ++ +  N
Sbjct: 107 IAWRDGLSHLIVESDSKVLINMVTNNCN 134


>ref|XP_015936169.1| uncharacterized protein LOC107462117 [Arachis duranensis]
          Length = 1250

 Score = 65.5 bits (158), Expect = 1e-09
 Identities = 30/88 (34%), Positives = 50/88 (56%)
 Frame = -3

Query: 370  PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
            P D++ ++ DGAA       GCGG+ RN    +I  F+ ++   +    ELWG+YYG+K 
Sbjct: 1085 PEDWMKVNTDGAAKGNPGMAGCGGLIRNYQGRWIAGFVANIGYCTAYYAELWGVYYGLKT 1144

Query: 190  GCDRGMRKLVVCTDSMEAVQLLQSPLNY 107
              + GMRK+++  DS   V +++   N+
Sbjct: 1145 AWELGMRKIILEVDSKAVVDVIKGATNF 1172


>ref|XP_012084521.1| uncharacterized protein LOC105643892 [Jatropha curcas]
          Length = 177

 Score = 62.8 bits (151), Expect = 2e-09
 Identities = 29/86 (33%), Positives = 50/86 (58%)
 Frame = -3

Query: 352 LHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKIGCDRGM 173
           L+CDG+           GV RN   ++   F  +L + SIL  ELWG++ G+ +  D+G+
Sbjct: 81  LNCDGSLITQSHRASTWGVIRNDVGNWCYGFACNLGSCSILLAELWGVFLGLSLAWDKGV 140

Query: 172 RKLVVCTDSMEAVQLLQSPLNYVSNS 95
           R L+V  D+++A +L+  P+ Y S++
Sbjct: 141 RNLIVEVDNVQACELINQPITYPSSA 166


>gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 363

 Score = 64.3 bits (155), Expect = 2e-09
 Identities = 33/83 (39%), Positives = 50/83 (60%), Gaps = 1/83 (1%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P   + L+CDGA    GR  GCGG+ RN    FI+ F   L   SIL+ ELW I++G++I
Sbjct: 154 PLGSIKLNCDGAVRGVGRKVGCGGIIRNYLGGFIMGFSCKLGQCSILQAELWAIFHGLRI 213

Query: 190 GCDRGMRK-LVVCTDSMEAVQLL 125
             ++G ++ ++V  DS  A++ L
Sbjct: 214 IKEKGFKEDIIVELDSSLAIKFL 236


>gb|KYP33975.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 300

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 1/84 (1%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P   + L+ DGA ++ G    CGGV R+   +F+LAF   L   SIL+ ELW IY G+ I
Sbjct: 143 PIGHIKLNGDGAVSNDGIGA-CGGVVRDSSGNFLLAFSKKLGCISILKAELWAIYQGLLI 201

Query: 190 GCDRGMRKLVVC-TDSMEAVQLLQ 122
             DR  R L++C +DS EAV+L++
Sbjct: 202 IKDRYSRSLIICESDSAEAVKLIE 225


>gb|ESR53983.1| hypothetical protein CICLE_v10021474mg [Citrus clementina]
          Length = 290

 Score = 62.8 bits (151), Expect = 6e-09
 Identities = 30/89 (33%), Positives = 56/89 (62%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P ++V L+ +G+++ A  S G GG+ R+    +IL +  +L   + L  ELW +Y+G+ +
Sbjct: 127 PTNWVKLNIEGSSSRAQGSAGAGGIVRDESGKWILGYSKNLGTSNSLASELWALYHGLNL 186

Query: 190 GCDRGMRKLVVCTDSMEAVQLLQSPLNYV 104
             +RG RK++V  +S EAV+ L+ P +++
Sbjct: 187 VWERGFRKVLVECNSHEAVKCLELPASFL 215


>gb|KDO65745.1| hypothetical protein CISIN_1g039495mg, partial [Citrus sinensis]
          Length = 327

 Score = 62.8 bits (151), Expect = 7e-09
 Identities = 30/89 (33%), Positives = 56/89 (62%)
 Frame = -3

Query: 370 PNDFVFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKI 191
           P ++V L+ +G+++ A  S G GG+ R+    +IL +  +L   + L  ELW +Y+G+ +
Sbjct: 164 PTNWVKLNIEGSSSRAQGSAGAGGIVRDESGKWILGYSKNLGTSNSLASELWALYHGLNL 223

Query: 190 GCDRGMRKLVVCTDSMEAVQLLQSPLNYV 104
             +RG RK++V  +S EAV+ L+ P +++
Sbjct: 224 VWERGFRKVLVECNSHEAVKCLELPASFL 252


>ref|XP_009344908.1| PREDICTED: uncharacterized protein LOC103936764 [Pyrus x
            bretschneideri]
          Length = 1365

 Score = 63.2 bits (152), Expect = 7e-09
 Identities = 29/80 (36%), Positives = 47/80 (58%)
 Frame = -3

Query: 352  LHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKIGCDRGM 173
            ++ DG+  D  R    GG+ RN +  +I  F  +L   +I+E ELWG++ G+ I  D G 
Sbjct: 1207 INTDGSCNDPFRHISAGGLIRNSEGDWIKGFAANLGRGTIMEAELWGVFMGLSIAWDEGC 1266

Query: 172  RKLVVCTDSMEAVQLLQSPL 113
            R +++  DS +AV L+Q P+
Sbjct: 1267 RDVILECDSWDAVTLIQKPI 1286


>dbj|GAU12898.1| hypothetical protein TSUD_73850 [Trifolium subterraneum]
          Length = 360

 Score = 62.8 bits (151), Expect = 8e-09
 Identities = 31/83 (37%), Positives = 45/83 (54%)
 Frame = -3

Query: 358 VFLHCDGAATDAGRSTGCGGVFRNRDLSFILAFLHDLEAFSILECELWGIYYGIKIGCDR 179
           V L+ DGA  + G   GCGG+FR+ D  +I  F   + AF  L  E+WG+Y GI I    
Sbjct: 197 VKLNYDGACKELGEFAGCGGLFRDSDGRWIKGFTRKIGAFDALHVEMWGMYLGIDIAWRN 256

Query: 178 GMRKLVVCTDSMEAVQLLQSPLN 110
           G+  L V +DS   + ++ +  N
Sbjct: 257 GLSHLTVESDSKVLINMITNKCN 279

