BLASTX nr result

ID: Astragalus24_contig00025810 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00025810
         (443 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP77347.1| Retrovirus-related Pol polyprotein from transposo...   131   4e-35
gb|KYP40818.1| Retrovirus-related Pol polyprotein from transposo...   135   5e-35
gb|PNX56974.1| hypothetical protein L195_g058464, partial [Trifo...   125   4e-34
gb|PNX67669.1| hypothetical protein L195_g055750 [Trifolium prat...   124   7e-33
gb|PNX58558.1| copia-type polyprotein, partial [Trifolium pratense]   122   9e-33
gb|PNX85850.1| hypothetical protein L195_g041924 [Trifolium prat...   129   2e-32
gb|PNX89656.1| type I inositol 145-trisphosphate 5-phosphatase 1...   120   9e-32
gb|PNX79941.1| hypothetical protein L195_g035933 [Trifolium prat...   122   1e-31
gb|PNX88062.1| hypothetical protein L195_g044162, partial [Trifo...   117   2e-31
gb|PNY00510.1| copia-type polyprotein [Trifolium pratense]            122   5e-31
emb|CAN74443.1| hypothetical protein VITISV_031468 [Vitis vinifera]   117   5e-31
dbj|GAU16794.1| hypothetical protein TSUD_200370 [Trifolium subt...   126   1e-30
gb|KYP50279.1| Retrovirus-related Pol polyprotein from transposo...   118   1e-30
dbj|GAU32111.1| hypothetical protein TSUD_357950 [Trifolium subt...   125   2e-30
ref|XP_018816630.1| PREDICTED: uncharacterized protein LOC108988...   122   2e-30
gb|PNX56920.1| endoribonuclease dicer 2-like protein, partial [T...   118   2e-30
dbj|GAU42085.1| hypothetical protein TSUD_200730 [Trifolium subt...   124   3e-30
gb|PNX86749.1| copia-type polyprotein [Trifolium pratense]            121   4e-30
gb|PNY15642.1| copia-type polyprotein [Trifolium pratense]            124   4e-30
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...   124   6e-30
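
The one-line hit summaries above share a fixed layout: a database-prefixed identifier, a description (truncated with "..." when long), a bit score, and an E-value. As a rough sketch only, the lines could be parsed as follows. The function name `parse_hit_line` and the returned keys are hypothetical and not part of BLAST; the pattern is an assumption based on the lines in this report. Legacy BLAST can print E-values such as `e-100` with no leading digit, so the sketch normalises those before converting.

```python
import re

# Assumed layout of a legacy BLAST summary line:
#   <db>|<accession>| <description>   <bits>   <evalue>
HIT_LINE = re.compile(
    r"^(?P<id>\S+\|\S*)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_line(line):
    """Return a dict for one summary line, or None if it does not match."""
    m = HIT_LINE.match(line.rstrip())
    if m is None:
        return None
    db, accession = m.group("id").split("|")[:2]
    evalue = m.group("evalue")
    if evalue.startswith("e"):
        # Legacy BLAST abbreviates 1e-100 as e-100.
        evalue = "1" + evalue
    return {
        "db": db,
        "accession": accession,
        "description": m.group("desc"),  # may still end in "..."
        "bits": float(m.group("bits")),
        "evalue": float(evalue),
    }


if __name__ == "__main__":
    line = ("gb|KYP77347.1| Retrovirus-related Pol polyprotein "
            "from transposo...   131   4e-35")
    print(parse_hit_line(line))
```

Descriptions stay truncated exactly as printed; the full names appear only in the per-hit alignment headers further down.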

>gb|KYP77347.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 257

 Score =  131 bits (330), Expect = 4e-35
 Identities = 60/96 (62%), Positives = 80/96 (83%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIW+KRILE  G+K   S+++ CDNSS+IKLSKNP++HG +KHIDVRF FLRNLC++G+I
Sbjct: 154 GIWIKRILEELGVKLEESLSILCDNSSAIKLSKNPVMHGRSKHIDVRFLFLRNLCKEGII 213

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
           ELKYC++ EQ AD++ KALK E+F   ++R+ LGMC
Sbjct: 214 ELKYCTTEEQLADVLTKALKKESF--MRIRSNLGMC 247
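
The query coordinates in a BLASTX alignment are nucleotide positions, while the identity counts are in amino-acid columns. As a small consistency check, sketched under the assumption that reverse frame -1 begins at the last base of the query, the span 442..155 on the 443-letter query corresponds to 96 codons in frame -2, which matches the 96 alignment columns reported above. The helper names `translated_span` and `reverse_frame` are hypothetical.

```python
def translated_span(q_start, q_end):
    """Number of codons covered by a nucleotide span (inclusive ends)."""
    nt = abs(q_start - q_end) + 1
    if nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return nt // 3


def reverse_frame(query_len, q_start):
    """Frame of a minus-strand hit whose first codon starts at q_start.

    Assumes frame -1 starts at the last base of the query, -2 one base
    before it, and -3 two bases before it.
    """
    return -(((query_len - q_start) % 3) + 1)


if __name__ == "__main__":
    print(translated_span(442, 155))  # codons covered by the alignment
    print(reverse_frame(443, 442))    # frame implied by the start position
```

The subject side covers residues 154 to 247 (94 residues); the two gap characters in the subject line bring the alignment to the same 96 columns.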


>gb|KYP40818.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 468

 Score =  135 bits (341), Expect = 5e-35
 Identities = 61/96 (63%), Positives = 81/96 (84%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIW+KRILE  G+K   S+++ CDNSS+IKLSKNP++HG +KHIDVRFHFLRNLC++G+I
Sbjct: 365 GIWIKRILEELGVKLEESLSILCDNSSAIKLSKNPVMHGRSKHIDVRFHFLRNLCKEGII 424

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
           ELKYC++ EQ AD++ KALK E+F   ++R+ LGMC
Sbjct: 425 ELKYCTTEEQLADVLTKALKKESF--MRIRSNLGMC 458


>gb|PNX56974.1| hypothetical protein L195_g058464, partial [Trifolium pratense]
          Length = 123

 Score =  125 bits (313), Expect = 4e-34
 Identities = 60/99 (60%), Positives = 75/99 (75%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL+RIL N G +    + V CDNSS+IKLSKNP+LHG +KHID+RFHFLRNLC DG +
Sbjct: 23  GIWLQRILVNMGFELKKCLIVHCDNSSTIKLSKNPVLHGRSKHIDIRFHFLRNLCCDGKV 82

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRP 146
           EL +C+S +Q AD+M KALK E F   KLR+ LG+ + P
Sbjct: 83  ELVHCASQDQVADIMTKALKLEAF--EKLRSMLGVQMNP 119


>gb|PNX67669.1| hypothetical protein L195_g055750 [Trifolium pratense]
          Length = 200

 Score =  124 bits (311), Expect = 7e-33
 Identities = 59/95 (62%), Positives = 75/95 (78%)
 Frame = -2

Query: 439 IWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVIE 260
           +WLK +L+   +K+  SIT+ CDNSS+IKLSKNPI+HG +KHIDVRFHFLR+L +DGVIE
Sbjct: 104 LWLKNVLDYLHIKQDGSITINCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRDLSKDGVIE 163

Query: 259 LKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
           LK+C S EQ AD+M K LK ++F   KLR  +GMC
Sbjct: 164 LKFCKSQEQLADIMTKPLKLDSF--CKLREGIGMC 196


>gb|PNX58558.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 149

 Score =  122 bits (306), Expect = 9e-33
 Identities = 57/96 (59%), Positives = 73/96 (76%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL+RILE      ++  T+ CDN+SSIKLSKNP++HG  KHIDVR+HFLR+L +DG +
Sbjct: 50  GIWLRRILEQLKQTHMTGTTILCDNTSSIKLSKNPVMHGRCKHIDVRYHFLRDLIKDGTV 109

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
           E+ +CSS +Q AD+M KALK ETF    LR +LGMC
Sbjct: 110 EMSFCSSQDQIADIMTKALKLETF--CNLRIRLGMC 143


>gb|PNX85850.1| hypothetical protein L195_g041924 [Trifolium pratense]
          Length = 477

 Score =  129 bits (323), Expect = 2e-32
 Identities = 62/102 (60%), Positives = 76/102 (74%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIW++RILE  G  +L S TV CDNSS+IKLSKNP+LHG +KHIDVRFHFLR+L +DG +
Sbjct: 378 GIWMRRILEKLGHTQLGSTTVYCDNSSAIKLSKNPVLHGRSKHIDVRFHFLRDLTKDGTL 437

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           EL + +SH+Q AD+M K LK E F   KLR  LGMC    +N
Sbjct: 438 ELAHSNSHDQIADIMTKPLKFEAF--EKLRGLLGMCFLSDIN 477


>gb|PNX89656.1| type I inositol 145-trisphosphate 5-phosphatase 12-like protein
           [Trifolium pratense]
          Length = 156

 Score =  120 bits (300), Expect = 9e-32
 Identities = 58/102 (56%), Positives = 77/102 (75%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL+ +L+    ++ +  T+ CDNSSSIKLSKNP++HG +KHIDV++HFLR+L  DGVI
Sbjct: 57  GIWLRNVLKQLLQEQTTCTTIYCDNSSSIKLSKNPVMHGRSKHIDVKYHFLRDLNNDGVI 116

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           ELK+C ++EQ AD+M KALK +TF   KLR  LG+C   S N
Sbjct: 117 ELKHCRTNEQLADIMTKALKVDTF--CKLREGLGICDSSSFN 156


>gb|PNX79941.1| hypothetical protein L195_g035933 [Trifolium pratense]
          Length = 253

 Score =  122 bits (306), Expect = 1e-31
 Identities = 60/102 (58%), Positives = 77/102 (75%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL RIL     +E S IT+ CDNSSSIKLSKNP++HG +KHIDVRFHFLR+L ++GVI
Sbjct: 154 GIWLSRILTAIEAREKSCITIYCDNSSSIKLSKNPVMHGRSKHIDVRFHFLRDLTKEGVI 213

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           +L +CSS EQ AD+M K L  ++FS++  R +LG+C    VN
Sbjct: 214 QLVHCSSFEQVADIMTKPLSFDSFSRN--RDKLGLCTLELVN 253


>gb|PNX88062.1| hypothetical protein L195_g044162, partial [Trifolium pratense]
          Length = 107

 Score =  117 bits (293), Expect = 2e-31
 Identities = 56/102 (54%), Positives = 75/102 (73%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL RIL     +E   IT+ CDNSSSIKLSK+P++HG +KHIDVRFHFLR+L ++G I
Sbjct: 8   GIWLSRILTQIDAREKDCITIYCDNSSSIKLSKHPVMHGRSKHIDVRFHFLRDLTREGKI 67

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           +L +CSS EQ AD+M K L  E+F+++  R +LG+C    +N
Sbjct: 68  QLVHCSSFEQVADIMTKPLSFESFNRN--RDKLGLCTLELIN 107


>gb|PNY00510.1| copia-type polyprotein [Trifolium pratense]
          Length = 291

 Score =  122 bits (305), Expect = 5e-31
 Identities = 61/102 (59%), Positives = 74/102 (72%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL RIL     K    IT+ CDNSSSIKLSKNP++HG +KHIDVRFHFLR+L +DG I
Sbjct: 192 GIWLSRILAQIYTKGKDFITIYCDNSSSIKLSKNPVMHGRSKHIDVRFHFLRDLTKDGTI 251

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           +L +CSS EQ AD+M KAL  E FS++  R +LG+C    VN
Sbjct: 252 QLVHCSSFEQVADIMTKALSFENFSRN--RDKLGLCTLELVN 291


>emb|CAN74443.1| hypothetical protein VITISV_031468 [Vitis vinifera]
          Length = 147

 Score =  117 bits (294), Expect = 5e-31
 Identities = 57/102 (55%), Positives = 71/102 (69%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           G+W+KRIL+  G  +    T+ CDNSS+IKLSKNPI+HG  KHIDVRFHFLRNL ++G I
Sbjct: 48  GVWMKRILKELGHLDEGCTTMMCDNSSTIKLSKNPIMHGRNKHIDVRFHFLRNLAKEGTI 107

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           EL +C S +Q AD+M K LK E F   K R  LG+C    +N
Sbjct: 108 ELVHCGSQDQVADIMTKPLKLEVF--QKFRKLLGVCEISGIN 147


>dbj|GAU16794.1| hypothetical protein TSUD_200370 [Trifolium subterraneum]
          Length = 1102

 Score =  126 bits (316), Expect = 1e-30
 Identities = 60/96 (62%), Positives = 76/96 (79%)
 Frame = -2

Query: 442  GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
            G WLKRILEN GL++   + V CDN+S+IKLSKNP+LHG +KHID+RFHFLRNL  +G+I
Sbjct: 1003 GSWLKRILENLGLEQKQCLDVFCDNNSTIKLSKNPVLHGRSKHIDIRFHFLRNLSGEGMI 1062

Query: 262  ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
            ELK+C+S  Q AD+M KALK E F   +LR +LG+C
Sbjct: 1063 ELKHCTSQNQLADIMTKALKLEAF--ERLRERLGVC 1096


>gb|KYP50279.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 195

 Score =  118 bits (296), Expect = 1e-30
 Identities = 57/101 (56%), Positives = 73/101 (72%)
 Frame = -2

Query: 439 IWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVIE 260
           +W++RIL+  G  +    T+ CDNSS+IKLSKNPI+HG +KHIDVRFHFLRNL +DG IE
Sbjct: 97  VWMRRILKELGHNQEGCTTLMCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLSRDGCIE 156

Query: 259 LKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           L  C + EQ ADL+ K LK + F   KLR Q+G+C  P +N
Sbjct: 157 LVQCGTKEQVADLLTKPLKLDCF--LKLREQMGVCEIPKIN 195


>dbj|GAU32111.1| hypothetical protein TSUD_357950 [Trifolium subterraneum]
          Length = 1193

 Score =  125 bits (314), Expect = 2e-30
 Identities = 60/96 (62%), Positives = 74/96 (77%)
 Frame = -2

Query: 442  GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
            GIWLKRILE+ GLK+   + V CDN S+IKLSKNP+LHG +KHID+RFHFLRNL  DG I
Sbjct: 1093 GIWLKRILESMGLKQQRCLDVFCDNISTIKLSKNPVLHGRSKHIDIRFHFLRNLSYDGTI 1152

Query: 262  ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMC 155
            E+K+C+S  Q AD+M KALK E+F    L+  LG+C
Sbjct: 1153 EMKHCTSQNQIADIMTKALKLESF--ENLKRMLGVC 1186


>ref|XP_018816630.1| PREDICTED: uncharacterized protein LOC108988006 [Juglans regia]
          Length = 372

 Score =  122 bits (305), Expect = 2e-30
 Identities = 57/102 (55%), Positives = 76/102 (74%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           G+W++R+LE FG  +    TV CDNSS+IKLSKNP++HG +KHIDVRFHFL +L +DGV+
Sbjct: 273 GVWMRRVLEKFGHSQGKCTTVLCDNSSTIKLSKNPVMHGRSKHIDVRFHFLCDLTRDGVV 332

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           ELK+C + EQ AD+M K LK + F   KL   +G+C+ P VN
Sbjct: 333 ELKHCVTQEQVADIMTKPLKLDVF--LKLCESMGVCVVPRVN 372


>gb|PNX56920.1| endoribonuclease dicer 2-like protein, partial [Trifolium pratense]
          Length = 226

 Score =  118 bits (296), Expect = 2e-30
 Identities = 58/102 (56%), Positives = 77/102 (75%)
 Frame = -2

Query: 442 GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
           GIWL RIL     ++ S IT+ CDNSSSIKLSKNP++HG +KHIDVRFHFLR+L ++G +
Sbjct: 127 GIWLSRILAQIDSRKNSCITIYCDNSSSIKLSKNPVMHGRSKHIDVRFHFLRDLTKEGAV 186

Query: 262 ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           +L +CSS EQ AD+M KAL  E+FS++  R +LG+    +VN
Sbjct: 187 QLVHCSSFEQVADIMTKALSFESFSRN--RDKLGLFNLETVN 226


>dbj|GAU42085.1| hypothetical protein TSUD_200730 [Trifolium subterraneum]
          Length = 1236

 Score =  124 bits (312), Expect = 3e-30
 Identities = 58/88 (65%), Positives = 71/88 (80%)
 Frame = -2

Query: 442  GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
            GIWLKRIL+N GLK+   + V CDNSS+IKLSKNP+LHG +KHID+RFHFLRNL  +G +
Sbjct: 980  GIWLKRILDNMGLKQSKCLDVFCDNSSTIKLSKNPVLHGRSKHIDIRFHFLRNLSCEGSV 1039

Query: 262  ELKYCSSHEQAADLMAKALKHETFSKSK 179
            ELK+C+S  Q AD+M KALK E+F K K
Sbjct: 1040 ELKHCTSQNQIADIMTKALKLESFEKLK 1067


>gb|PNX86749.1| copia-type polyprotein [Trifolium pratense]
          Length = 395

 Score =  121 bits (304), Expect = 4e-30
 Identities = 60/101 (59%), Positives = 75/101 (74%)
 Frame = -2

Query: 439 IWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVIE 260
           IWLK IL +  +++   +++ CDNSSSIKLSKNPI+HG  KHIDVR+HFLR+L +DGVIE
Sbjct: 296 IWLKNILSHLLVEQAGCVSINCDNSSSIKLSKNPIMHGRCKHIDVRYHFLRDLSRDGVIE 355

Query: 259 LKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
           LKYC S +Q AD+M K LK E+F   +LR  LGM I   VN
Sbjct: 356 LKYCKSQDQLADIMTKPLKLESF--CRLREGLGMSIAQDVN 394


>gb|PNY15642.1| copia-type polyprotein [Trifolium pratense]
          Length = 822

 Score =  124 bits (311), Expect = 4e-30
 Identities = 59/102 (57%), Positives = 79/102 (77%)
 Frame = -2

Query: 442  GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
            GIWL+ +L+    ++++  T+ CDNSSSIKLSKNP++HG +KHIDVR+HFLR+L  DGVI
Sbjct: 723  GIWLRNVLKQLRQEQVTCTTIYCDNSSSIKLSKNPVMHGRSKHIDVRYHFLRDLNNDGVI 782

Query: 262  ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
            ELK+C ++EQ AD+M KALK +TF   KLR +LG+C   S N
Sbjct: 783  ELKHCRTNEQLADIMTKALKVDTF--CKLREELGICDSSSFN 822


>gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1316

 Score =  124 bits (310), Expect = 6e-30
 Identities = 58/102 (56%), Positives = 73/102 (71%)
 Frame = -2

Query: 442  GIWLKRILENFGLKELSSITVQCDNSSSIKLSKNPILHGWTKHIDVRFHFLRNLCQDGVI 263
            G+W++RI E  G  +    TV CDNSS+IKLSKNP++HG +KHIDVRFHFLR L +DG +
Sbjct: 1217 GVWMRRIFERLGHAQRGCTTVYCDNSSTIKLSKNPVMHGRSKHIDVRFHFLRQLTKDGTV 1276

Query: 262  ELKYCSSHEQAADLMAKALKHETFSKSKLRAQLGMCIRPSVN 137
            EL YC++ +Q AD M K LK E F   KLR  LGMC+ P +N
Sbjct: 1277 ELVYCNTQDQIADAMTKPLKLEVF--EKLRDLLGMCLVPGIN 1316

