BLASTX nr result

ID: Astragalus24_contig00018114 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00018114
         (471 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX95321.1| retroelement pol polyprotein-like [Trifolium prat...    54   2e-17
gb|PNX96222.1| retrovirus-related Pol polyprotein from transposo...    55   3e-15
gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposo...    49   1e-12
gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposo...    49   8e-12
gb|KYP65664.1| Retrovirus-related Pol polyprotein from transposo...    44   1e-10
dbj|GAU35639.1| hypothetical protein TSUD_394790 [Trifolium subt...    47   2e-10
gb|KYP36220.1| Copia protein [Cajanus cajan] >gi|1012357536|gb|K...    52   2e-10
gb|KYP33474.1| Retrovirus-related Pol polyprotein from transposo...    46   4e-09
gb|KYP52818.1| Retrovirus-related Pol polyprotein from transposo...    43   4e-08
gb|KYP53968.1| Retrovirus-related Pol polyprotein from transposo...    42   1e-07
gb|PKH48991.1| hypothetical protein CRG98_050339, partial [Punic...    48   8e-07
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi...    45   2e-06

>gb|PNX95321.1| retroelement pol polyprotein-like [Trifolium pratense]
          Length = 1433

 Score = 53.9 bits (128), Expect(3) = 2e-17
 Identities = 22/38 (57%), Positives = 30/38 (78%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLWG+Y T+S+ GCHYFLTI+ DY  RGT  YL+
Sbjct: 553 LIHCDLWGKYRTSSHSGCHYFLTIVDDY-SRGTWVYLL 589



 Score = 46.2 bits (108), Expect(3) = 2e-17
 Identities = 19/26 (73%), Positives = 21/26 (80%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH SKQCRLPF++S NK E  FDLIH
Sbjct: 530 CHRSKQCRLPFHISYNKAENPFDLIH 555



 Score = 36.2 bits (82), Expect(3) = 2e-17
 Identities = 17/37 (45%), Positives = 25/37 (67%)
 Frame = -3

Query: 301 EVNKVI*HARLGHRLAKVV*LLSQLLNCSFNSNKVHC 191
           E N  + HAR+GH   +V+  +SQL+N +F SNK+ C
Sbjct: 490 EDNTALWHARMGHPSPQVMQRISQLVNFNFCSNKLRC 526


>gb|PNX96222.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 1369

 Score = 55.1 bits (131), Expect(3) = 3e-15
 Identities = 23/38 (60%), Positives = 31/38 (81%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           ++++DLWGRYHTA++ G HYFLTI+ DY  RGT  YL+
Sbjct: 484 LIHSDLWGRYHTAAHDGSHYFLTIVDDY-SRGTWVYLL 520



 Score = 40.4 bits (93), Expect(3) = 3e-15
 Identities = 18/26 (69%), Positives = 19/26 (73%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH SKQCRLPF  S NK E+ F LIH
Sbjct: 461 CHKSKQCRLPFPQSINKAEEPFSLIH 486



 Score = 33.1 bits (74), Expect(3) = 3e-15
 Identities = 14/28 (50%), Positives = 20/28 (71%)
 Frame = -3

Query: 280 HARLGHRLAKVV*LLSQLLNCSFNSNKV 197
           H+RLGH  ++ +  LS LL C+FN NK+
Sbjct: 428 HSRLGHPSSQALQHLSHLLRCNFNFNKI 455


>gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 769

 Score = 48.9 bits (115), Expect(3) = 1e-12
 Identities = 21/38 (55%), Positives = 28/38 (73%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLWG+YHTAS+ G HYFLT I+D F R    YL+
Sbjct: 482 LIHCDLWGKYHTASHNGSHYFLT-IVDDFTRAVWIYLL 518



 Score = 42.4 bits (98), Expect(3) = 1e-12
 Identities = 18/26 (69%), Positives = 20/26 (76%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH SKQCRLPF ++ NK  K FDLIH
Sbjct: 459 CHRSKQCRLPFSLNYNKVSKVFDLIH 484



 Score = 28.9 bits (63), Expect(3) = 1e-12
 Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
 Frame = -3

Query: 280 HARLGHRLAKVV*LLSQLLNCSFN-SNKVHC 191
           HAR+GH   +V+  LS +++ SFN +NK+ C
Sbjct: 425 HARMGHPSDQVLSKLSTIISFSFNANNKMEC 455


>gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 967

 Score = 49.3 bits (116), Expect(3) = 8e-12
 Identities = 22/38 (57%), Positives = 28/38 (73%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLWG+Y+TAS  G HYFLTI+ DY  R T  YLM
Sbjct: 85  LIHCDLWGKYNTASQNGSHYFLTIVDDY-SRATWVYLM 121



 Score = 39.3 bits (90), Expect(3) = 8e-12
 Identities = 17/26 (65%), Positives = 19/26 (73%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH SKQC+LPF  S+NK E  F LIH
Sbjct: 62  CHKSKQCKLPFNHSNNKAEAPFHLIH 87



 Score = 28.5 bits (62), Expect(3) = 8e-12
 Identities = 12/30 (40%), Positives = 18/30 (60%)
 Frame = -3

Query: 280 HARLGHRLAKVV*LLSQLLNCSFNSNKVHC 191
           H+RLGH   + +  +S +  CSF +NK  C
Sbjct: 29  HSRLGHPSFEAIQKISYIDKCSFITNKEEC 58


>gb|KYP65664.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 529

 Score = 44.3 bits (103), Expect(3) = 1e-10
 Identities = 20/38 (52%), Positives = 26/38 (68%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLWGRY T S+ G HYFLT I+D F R    YL+
Sbjct: 200 LIHCDLWGRYATKSHNGSHYFLT-IVDDFTRAVWVYLI 236



 Score = 40.4 bits (93), Expect(3) = 1e-10
 Identities = 16/26 (61%), Positives = 21/26 (80%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH SKQC LPF +S+NK  ++F+LIH
Sbjct: 177 CHRSKQCMLPFSISNNKAVQAFELIH 202



 Score = 28.5 bits (62), Expect(3) = 1e-10
 Identities = 14/31 (45%), Positives = 20/31 (64%)
 Frame = -3

Query: 289 VI*HARLGHRLAKVV*LLSQLLNCSFNSNKV 197
           V+ HAR+GH   KV+  +S  LN S + NK+
Sbjct: 141 VLWHARMGHPSNKVLLKMSSSLNFSLDENKL 171


>dbj|GAU35639.1| hypothetical protein TSUD_394790 [Trifolium subterraneum]
          Length = 960

 Score = 46.6 bits (109), Expect(3) = 2e-10
 Identities = 20/38 (52%), Positives = 28/38 (73%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLWG+Y+T S  G HYFLT++ D+  RGT  YLM
Sbjct: 270 LIHCDLWGKYNTTSSNGSHYFLTLVDDH-TRGTWVYLM 306



 Score = 38.9 bits (89), Expect(3) = 2e-10
 Identities = 17/26 (65%), Positives = 18/26 (69%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH S QCRL F  S NK E+ FDLIH
Sbjct: 247 CHKSNQCRLSFSQSMNKAERPFDLIH 272



 Score = 26.9 bits (58), Expect(3) = 2e-10
 Identities = 11/30 (36%), Positives = 17/30 (56%)
 Frame = -3

Query: 280 HARLGHRLAKVV*LLSQLLNCSFNSNKVHC 191
           H R+GH   + +  LS L+ C  + NK+ C
Sbjct: 214 HCRMGHPSTQFLQQLSCLIKCCSDFNKIKC 243


>gb|KYP36220.1| Copia protein [Cajanus cajan]
 gb|KYP68721.1| Copia protein [Cajanus cajan]
          Length = 585

 Score = 51.6 bits (122), Expect(2) = 2e-10
 Identities = 26/40 (65%), Positives = 30/40 (75%)
 Frame = -2

Query: 122 HLILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           HLI Y DLWG+Y+TAS+ G HYFLTI+ DY  R T  YLM
Sbjct: 361 HLIHY-DLWGKYNTASHNGSHYFLTIVDDY-SRATWVYLM 398



 Score = 41.2 bits (95), Expect(2) = 2e-10
 Identities = 30/95 (31%), Positives = 41/95 (43%), Gaps = 13/95 (13%)
 Frame = -1

Query: 351 EE*IGLCDLDDGAYMLRKSTRS--------SSMHDWDT-----V*PKLYSFFLXXXXXXX 211
           +E IGL D+ +G Y+LR+ T+S        +    W +     +   L            
Sbjct: 270 KEMIGLGDMHEGVYILRRPTKSIYFTAFLKNMAGTWHSRLGHPLFEALQKISNIVKCSFI 329

Query: 210 XXXXXXXXVCH*SKQCRLPFYVSSNKTEKSFDLIH 106
                   +CH SKQCR PF  S+NK E  F LIH
Sbjct: 330 TNKEECCDICHKSKQCRFPFNRSNNKAEAPFHLIH 364


>gb|KYP33474.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 339

 Score = 45.8 bits (107), Expect(2) = 4e-09
 Identities = 33/95 (34%), Positives = 41/95 (43%), Gaps = 16/95 (16%)
 Frame = -1

Query: 342 IGLCDLDDGAYMLRKSTRSSSM----------------HDWDTV*PKLYSFFLXXXXXXX 211
           IGL D  DG Y+L+ +T  SS+                H  D +   L    +       
Sbjct: 111 IGLGDSCDGIYILKSTTMGSSLVAVHEDATNLWHARMGHPSDQI---LSQLSIPLNFHFD 167

Query: 210 XXXXXXXXVCH*SKQCRLPFYVSSNKTEKSFDLIH 106
                   VCH SKQCRLPF +S+NK E  F LIH
Sbjct: 168 KNKLECFDVCHRSKQCRLPFSLSNNKAETPFSLIH 202



 Score = 42.7 bits (99), Expect(2) = 4e-09
 Identities = 17/38 (44%), Positives = 26/38 (68%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ DLW +YHT ++ G HYF+TI+ DY  R    YL+
Sbjct: 200 LIHYDLWWKYHTMAHNGAHYFITIVDDY-TRAVWVYLL 236


>gb|KYP52818.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 180

 Score = 43.1 bits (100), Expect(2) = 4e-08
 Identities = 16/28 (57%), Positives = 22/28 (78%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDY 33
           +++ DLWGRY T S+ G HYFLTI+ D+
Sbjct: 125 LIHCDLWGRYATKSHNGSHYFLTIVDDF 152



 Score = 42.0 bits (97), Expect(2) = 4e-08
 Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 16/126 (12%)
 Frame = -1

Query: 435 LNCFSFMMEFCSNTIVLCNDVLCWGPSYEE*IGLCDLDDGAYMLRKSTRSSSM------- 277
           LNC   M+ + S+  V+ +  +      +  IG  DL DG Y+LR + + SS+       
Sbjct: 14  LNC---MVTYFSDNCVIQDQAM------KRKIGSGDLCDGVYVLRMANQGSSLAAQPQDA 64

Query: 276 ---------HDWDTV*PKLYSFFLXXXXXXXXXXXXXXXVCH*SKQCRLPFYVSSNKTEK 124
                    H  + V  K+ S                  VCH SKQC LPF +S+NK  +
Sbjct: 65  IVLWHAIMGHPSNKVLLKMSSSL---NFSLDENKLEGCDVCHQSKQCMLPFSLSNNKAVQ 121

Query: 123 SFDLIH 106
           +F+LIH
Sbjct: 122 AFELIH 127


>gb|KYP53968.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 207

 Score = 42.4 bits (98), Expect(2) = 1e-07
 Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 16/95 (16%)
 Frame = -1

Query: 342 IGLCDLDDGAYMLRKSTRSSSM----------------HDWDTV*PKLYSFFLXXXXXXX 211
           IGL D  DG Y+L+ +T  +S+                H  D +   L    +       
Sbjct: 22  IGLGDSCDGIYILKSTTMGTSLVAIHEDATNLWHARMGHPSDQI---LSQLSISLNFHFD 78

Query: 210 XXXXXXXXVCH*SKQCRLPFYVSSNKTEKSFDLIH 106
                   VCH SKQCR PF +S+NK E  F LIH
Sbjct: 79  KNKLECCDVCHRSKQCRFPFSLSNNKAETPFALIH 113



 Score = 41.2 bits (95), Expect(2) = 1e-07
 Identities = 14/28 (50%), Positives = 22/28 (78%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDY 33
           +++ DLW +Y+T ++ G HYFLTI+ DY
Sbjct: 111 LIHCDLWRKYYTMTHNGAHYFLTIVDDY 138


>gb|PKH48991.1| hypothetical protein CRG98_050339, partial [Punica granatum]
          Length = 1053

 Score = 48.1 bits (113), Expect(2) = 8e-07
 Identities = 21/38 (55%), Positives = 27/38 (71%)
 Frame = -2

Query: 116 ILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           +++ D+WG YHTAS  G HYFLTI+ D+  R T  YLM
Sbjct: 360 LIHCDIWGPYHTASLSGAHYFLTIVDDH-SRATWVYLM 396



 Score = 32.3 bits (72), Expect(2) = 8e-07
 Identities = 15/26 (57%), Positives = 17/26 (65%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           C  +KQ R PF  S NK+E  FDLIH
Sbjct: 337 CFRAKQTRFPFPPSINKSENIFDLIH 362


>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1333

 Score = 45.4 bits (106), Expect(3) = 2e-06
 Identities = 23/43 (53%), Positives = 27/43 (62%)
 Frame = -2

Query: 131 LRNHLILYTDLWGRYHTASYQGCHYFLTIIIDYFKRGTCEYLM 3
           LR   ++Y DLWG Y T S+ G  YFLTII DY  RG   YL+
Sbjct: 444 LRIFELIYCDLWGPYRTPSHTGARYFLTIIDDY-SRGVWLYLL 485



 Score = 28.1 bits (61), Expect(3) = 2e-06
 Identities = 13/26 (50%), Positives = 18/26 (69%)
 Frame = -1

Query: 183 CH*SKQCRLPFYVSSNKTEKSFDLIH 106
           CH +KQ R  F +S NKT + F+LI+
Sbjct: 426 CHRAKQTRNSFPLSINKTLRIFELIY 451



 Score = 24.6 bits (52), Expect(3) = 2e-06
 Identities = 11/25 (44%), Positives = 17/25 (68%)
 Frame = -3

Query: 304 EEVNKVI*HARLGHRLAKVV*LLSQ 230
           EE N  + H+R+GH  A+VV L+ +
Sbjct: 385 EEKNYELWHSRMGHPAARVVSLIPE 409


Top