BLASTX nr result

ID: Astragalus22_contig00023913 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00023913
         (758 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP44593.1| Copia protein [Cajanus cajan]                           74   2e-12
gb|KYP48176.1| Copia protein, partial [Cajanus cajan]                  69   7e-11
gb|PNX56197.1| putative copia-type protein, partial [Trifolium p...    71   1e-10
gb|KYP69304.1| Copia protein [Cajanus cajan]                           68   2e-10
gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposo...    70   4e-10
gb|PNX89752.1| putative copia-type protein [Trifolium pratense]        70   5e-10
gb|PNX95321.1| retroelement pol polyprotein-like [Trifolium prat...    69   1e-09
dbj|GAU34863.1| hypothetical protein TSUD_19430 [Trifolium subte...    69   1e-09
gb|PNY16822.1| retrovirus-related Pol polyprotein from transposo...    67   6e-09
gb|PNX96222.1| retrovirus-related Pol polyprotein from transposo...    67   7e-09
gb|KYP78471.1| Copia protein [Cajanus cajan]                           64   2e-08
gb|KYP33475.1| Copia protein [Cajanus cajan]                           63   4e-08
dbj|GAU46547.1| hypothetical protein TSUD_402640 [Trifolium subt...    64   5e-08
gb|PKI51811.1| hypothetical protein CRG98_027784 [Punica granatum]     62   1e-07
gb|KYP42517.1| Copia protein [Cajanus cajan]                           62   2e-07
gb|PKI39675.1| hypothetical protein CRG98_039932 [Punica granatum]     59   1e-06
gb|KZV23217.1| Cysteine-rich RLK (receptor-like protein kinase) ...    60   1e-06
gb|OWM83836.1| hypothetical protein CDL15_Pgr004267 [Punica gran...    57   2e-06
gb|PKI34309.1| hypothetical protein CRG98_045307 [Punica granatum]     57   2e-06
ref|XP_020420854.1| uncharacterized protein LOC109949504 [Prunus...    59   2e-06

>gb|KYP44593.1| Copia protein [Cajanus cajan]
          Length = 195

 Score = 73.9 bits (180), Expect = 2e-12
 Identities = 39/75 (52%), Positives = 51/75 (68%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++  +NDLQLV FCDSDWASC +T++S  G L +      +W +      SR SSEAEY
Sbjct: 27  IILPSENDLQLVAFCDSDWASCPLTRRSVFGYLMKLGSVLVSWKTKKQTIVSRSSSEAEY 86

Query: 670 HAMAHATSEILCLQS 714
            +MAHATSEIL L++
Sbjct: 87  RSMAHATSEILWLRN 101


>gb|KYP48176.1| Copia protein, partial [Cajanus cajan]
          Length = 140

 Score = 68.6 bits (166), Expect = 7e-11
 Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++L+DNDLQL+ + DSDWASC  T+KS  G L +      +W +      SR SSEAEY
Sbjct: 17  ILMLKDNDLQLMGYYDSDWASCPTTRKSVSGFLMKLKAILVSWKAKKQATVSRSSSEAEY 76

Query: 670 HAMAHATSEILCLQS 714
             +AHATSEI+ L+S
Sbjct: 77  KVLAHATSEIVWLRS 91


>gb|PNX56197.1| putative copia-type protein, partial [Trifolium pratense]
          Length = 340

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 38/74 (51%), Positives = 51/74 (68%), Gaps = 6/74 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I+I +DNDL+LV +CDSD+ASC +T++S  G L +      +W +      SR SSEAEY
Sbjct: 172 IIIPKDNDLKLVAYCDSDYASCPLTRRSISGYLMKLGAAPISWKTKKQTTVSRSSSEAEY 231

Query: 670 HAMAHATSEILCLQ 711
            AMAHATSEI+ L+
Sbjct: 232 RAMAHATSEIIWLR 245


>gb|KYP69304.1| Copia protein [Cajanus cajan]
          Length = 168

 Score = 68.2 bits (165), Expect = 2e-10
 Identities = 38/77 (49%), Positives = 50/77 (64%), Gaps = 6/77 (7%)
 Frame = +1

Query: 511 VILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEYH 672
           +IL +N+LQL  FCDSDWASC  T++S  G + +      +W +      SR SSEAEY 
Sbjct: 1   MILVNNNLQLEDFCDSDWASCPTTRRSISGYIVKLGSVPISWKTKKQTTISRSSSEAEYR 60

Query: 673 AMAHATSEILCLQSPVR 723
           AMAHATSEI+ L+  +R
Sbjct: 61  AMAHATSEIIWLRGLLR 77


>gb|KYP64168.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 967

 Score = 70.5 bits (171), Expect = 4e-10
 Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508  IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
            I++   NDL+LV +CDSDWASC +T+KS  G L +      +W +      SR SSEAEY
Sbjct: 799  IILPNTNDLRLVGYCDSDWASCPLTRKSISGYLMKLGPTPISWKTKKQSTVSRSSSEAEY 858

Query: 670  HAMAHATSEILCLQS 714
             A+AHATSEI+ L+S
Sbjct: 859  RAIAHATSEIIWLRS 873


>gb|PNX89752.1| putative copia-type protein [Trifolium pratense]
          Length = 403

 Score = 69.7 bits (169), Expect = 5e-10
 Identities = 36/75 (48%), Positives = 51/75 (68%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++  +N+L+LV FCDSDWASC +T++S  G L +      +W +   +  SR SSEAEY
Sbjct: 235 IILPRENNLELVGFCDSDWASCPLTRRSTSGYLMKLGEAPVSWKTKKQVTVSRSSSEAEY 294

Query: 670 HAMAHATSEILCLQS 714
            AMAHA SEI+ L++
Sbjct: 295 RAMAHAASEIIWLRN 309


>gb|PNX95321.1| retroelement pol polyprotein-like [Trifolium pratense]
          Length = 1433

 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 36/74 (48%), Positives = 50/74 (67%), Gaps = 6/74 (8%)
 Frame = +1

Query: 508  IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
            I+I  DNDL+LV +CDSD+ASC +T++S  G + +      +W +      SR SSEAEY
Sbjct: 1265 IIIPRDNDLRLVAYCDSDYASCPLTRRSISGYVMKLGTAPISWKTKKQTTVSRSSSEAEY 1324

Query: 670  HAMAHATSEILCLQ 711
             AMAHATSE++ L+
Sbjct: 1325 RAMAHATSEVIWLR 1338


>dbj|GAU34863.1| hypothetical protein TSUD_19430 [Trifolium subterraneum]
          Length = 1312

 Score = 68.9 bits (167), Expect = 1e-09
 Identities = 38/75 (50%), Positives = 49/75 (65%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508  IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
            IV+ +DNDLQLV FCDSDWASC +T++S  G L +      +W +      S+ SSEAEY
Sbjct: 1189 IVLPKDNDLQLVGFCDSDWASCPLTRRSTTGYLMKLGAAPISWKTKKQTTVSKSSSEAEY 1248

Query: 670  HAMAHATSEILCLQS 714
             AM  A SEI+ L+S
Sbjct: 1249 RAMNQAVSEIIWLRS 1263


>gb|PNY16822.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 834

 Score = 67.0 bits (162), Expect = 6e-09
 Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++ ++NDLQLV +CDSDWASC +T++S  G L +      +W +      S+ SSEAEY
Sbjct: 666 IILPKENDLQLVAYCDSDWASCPLTRRSTSGYLMKLGSAPISWKTKKQSTVSKSSSEAEY 725

Query: 670 HAMAHATSEILCLQS 714
            AM  A SE++ L+S
Sbjct: 726 RAMGQAVSEVIWLRS 740


>gb|PNX96222.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1369

 Score = 67.0 bits (162), Expect = 7e-09
 Identities = 37/74 (50%), Positives = 49/74 (66%), Gaps = 6/74 (8%)
 Frame = +1

Query: 508  IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
            IV+  +N+L+LV + DSDWASC +T++S  G L +      +W +      SR SSEAEY
Sbjct: 1198 IVLPRENELKLVAYSDSDWASCPLTRRSISGYLLKLGAAPISWKTKKQSTVSRSSSEAEY 1257

Query: 670  HAMAHATSEILCLQ 711
             AMAHATSEIL L+
Sbjct: 1258 RAMAHATSEILWLR 1271


>gb|KYP78471.1| Copia protein [Cajanus cajan]
          Length = 253

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 36/87 (41%), Positives = 56/87 (64%), Gaps = 6/87 (6%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++ ++NDL LV + DSDWASC IT++S  G L +      +W +      S+ S+EAEY
Sbjct: 85  IILPKENDLNLVGYSDSDWASCPITRRSITGYLMKLGPILISWKTKKQATVSKSSTEAEY 144

Query: 670 HAMAHATSEILCLQSPVRASTMGVPIR 750
            AM+HA SE++ L+S +  +T+ VP +
Sbjct: 145 RAMSHAASEVVWLRSLL--ATLQVPCK 169


>gb|KYP33475.1| Copia protein [Cajanus cajan]
          Length = 253

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 33/75 (44%), Positives = 49/75 (65%), Gaps = 6/75 (8%)
 Frame = +1

Query: 508 IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
           I++ ++NDL LV + DSDWASC IT++S  G L +      +W + N    S+ S+EAEY
Sbjct: 85  IILPKENDLNLVGYSDSDWASCPITRRSITGYLMKLGPVPISWKTKNQAIVSKSSTEAEY 144

Query: 670 HAMAHATSEILCLQS 714
            AM+H  SE++ L+S
Sbjct: 145 RAMSHVASEVVWLRS 159


>dbj|GAU46547.1| hypothetical protein TSUD_402640 [Trifolium subterraneum]
          Length = 1212

 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 33/71 (46%), Positives = 46/71 (64%), Gaps = 6/71 (8%)
 Frame = +1

Query: 520  EDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEYHAMA 681
            ++NDL LV +CDSDWASC +T+KS  G L +      +W +      S+ SSEAEY AM 
Sbjct: 1005 KENDLTLVAYCDSDWASCPLTRKSTTGFLMKLGSAPISWKTKKQTTVSKSSSEAEYRAMN 1064

Query: 682  HATSEILCLQS 714
             ATSE++ ++S
Sbjct: 1065 QATSEVIWIRS 1075


>gb|PKI51811.1| hypothetical protein CRG98_027784 [Punica granatum]
          Length = 242

 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 35/83 (42%), Positives = 49/83 (59%), Gaps = 6/83 (7%)
 Frame = +1

Query: 511 VILEDNDLQLVCFCDSDWASCSITKKSC*GVL------NEAWYSPNILEDSRFSSEAEYH 672
           + L    L+L  FCDSDWASC +T++S  G        + +W +      SR S+EAEY 
Sbjct: 75  IFLRPTSLELEAFCDSDWASCPLTRRSITGYFIMLGGCSVSWKTKKQTTVSRSSAEAEYR 134

Query: 673 AMAHATSEILCLQSPVRASTMGV 741
           AMA   SEI+CL+S +  S++GV
Sbjct: 135 AMAVTVSEIVCLRSLL--SSLGV 155


>gb|KYP42517.1| Copia protein [Cajanus cajan]
          Length = 358

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 36/81 (44%), Positives = 52/81 (64%), Gaps = 6/81 (7%)
 Frame = +1

Query: 520 EDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEYHAMA 681
           +++DL+L  FCDSDWA+C  T++S  G L +      +W        SR SSEAEY A+A
Sbjct: 239 KNSDLRLQGFCDSDWAACPTTRRSVTGYLMKLGPTPISWKMKKQTTISRSSSEAEYRAIA 298

Query: 682 HATSEILCLQSPVRASTMGVP 744
           HATSEI+ L++ +  +T+ VP
Sbjct: 299 HATSEIIWLRNLL--TTLQVP 317


>gb|PKI39675.1| hypothetical protein CRG98_039932 [Punica granatum]
          Length = 209

 Score = 58.5 bits (140), Expect = 1e-06
 Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 6/74 (8%)
 Frame = +1

Query: 511 VILEDNDLQLVCFCDSDWASCSITKKSC*GVL------NEAWYSPNILEDSRFSSEAEYH 672
           + L    L+L  FCDSDWASC +T++S  G        + +W +      SR S+EAEY 
Sbjct: 75  IFLRPTSLELEAFCDSDWASCPMTRRSITGYFITLGGCSVSWKTKKQTTVSRSSAEAEYR 134

Query: 673 AMAHATSEILCLQS 714
           AMA A SEI+ L+S
Sbjct: 135 AMAAAVSEIIWLRS 148


>gb|KZV23217.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras
            hygrometricum]
          Length = 1406

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 33/74 (44%), Positives = 45/74 (60%), Gaps = 6/74 (8%)
 Frame = +1

Query: 508  IVILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEY 669
            I +   NDLQL  +CDSDWASC +T+KS  G L +      +W +      SR S+EAEY
Sbjct: 1239 IFLPASNDLQLKAYCDSDWASCPMTRKSVTGYLIQLGPASISWKTKQQNTVSRSSAEAEY 1298

Query: 670  HAMAHATSEILCLQ 711
             AMA A+ E++ L+
Sbjct: 1299 RAMASASCEVIWLR 1312


>gb|OWM83836.1| hypothetical protein CDL15_Pgr004267 [Punica granatum]
          Length = 169

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 6/74 (8%)
 Frame = +1

Query: 511 VILEDNDLQLVCFCDSDWASCSITKKSC*GVLNE------AWYSPNILEDSRFSSEAEYH 672
           + L    ++L+ +CDSDWASC +T++S  G          +W +      SR S+EAEY 
Sbjct: 18  IFLRPQSMELIAYCDSDWASCPMTQRSVTGYFITLGGSPISWKTKKQTTVSRSSAEAEYR 77

Query: 673 AMAHATSEILCLQS 714
           AM  A SE+L LQS
Sbjct: 78  AMTAAVSEVLWLQS 91


>gb|PKI34309.1| hypothetical protein CRG98_045307 [Punica granatum]
          Length = 182

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 6/83 (7%)
 Frame = +1

Query: 511 VILEDNDLQLVCFCDSDWASCSITKKSC*GVL------NEAWYSPNILEDSRFSSEAEYH 672
           + L    LQL  FCDSDWASC +T++S  G          +W +      SR S+EAEY 
Sbjct: 15  IFLRPTSLQLEAFCDSDWASCPLTRRSVTGYFIMLGGCPISWKTKKQTTVSRSSAEAEYR 74

Query: 673 AMAHATSEILCLQSPVRASTMGV 741
           AMA   SE++ L+S +  S++GV
Sbjct: 75  AMAATVSEVIWLRSLL--SSLGV 95


>ref|XP_020420854.1| uncharacterized protein LOC109949504 [Prunus persica]
          Length = 345

 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 34/69 (49%), Positives = 43/69 (62%), Gaps = 6/69 (8%)
 Frame = +1

Query: 523 DNDLQLVCFCDSDWASCSITKKSC*G---VLNEA---WYSPNILEDSRFSSEAEYHAMAH 684
           +N+L+L  FCDSDWASC  T++S  G    L +A   W +      SR S+EAEY AMAH
Sbjct: 182 ENNLKLTAFCDSDWASCPTTRRSTTGYCTFLGDALISWKTKKQNVVSRSSAEAEYRAMAH 241

Query: 685 ATSEILCLQ 711
           AT EI  L+
Sbjct: 242 ATCEITWLR 250

