BLASTX nr result

ID: Astragalus23_contig00028627 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00028627
         (529 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX90867.1| putative copia-type protein, partial [Trifolium p...   130   6e-36
gb|KYP74152.1| Copia protein [Cajanus cajan]                          131   1e-34
gb|KYP62480.1| Copia protein, partial [Cajanus cajan]                 126   5e-33
gb|KYP70937.1| Copia protein [Cajanus cajan]                          125   8e-33
gb|KYP64927.1| Retrovirus-related Pol polyprotein from transposo...   129   6e-32
gb|KYP48058.1| Retrovirus-related Pol polyprotein from transposo...   129   1e-31
gb|KYP78810.1| Copia protein, partial [Cajanus cajan]                 120   1e-31
dbj|GAU47982.1| hypothetical protein TSUD_87860 [Trifolium subte...   127   2e-31
gb|KYP75334.1| Copia protein [Cajanus cajan]                          121   2e-31
gb|KYP59729.1| Retrovirus-related Pol polyprotein from transposo...   122   4e-31
gb|KYP75232.1| Copia protein [Cajanus cajan]                          121   7e-31
gb|KHN41724.1| Copia protein, partial [Glycine soja]                  121   1e-30
gb|PNX92571.1| histone deacetylase [Trifolium pratense]               127   2e-30
gb|PNX93473.1| retrovirus-related Pol polyprotein from transposo...   126   2e-30
gb|PNY08092.1| histone deacetylase [Trifolium pratense]               126   3e-30
gb|KYP30996.1| Copia protein [Cajanus cajan]                          118   3e-30
gb|KYP53273.1| Copia protein [Cajanus cajan]                          118   3e-30
gb|KYP40911.1| Retrovirus-related Pol polyprotein from transposo...   120   3e-30
ref|XP_020234407.1| uncharacterized protein LOC109814403 [Cajanu...   120   3e-30
dbj|GAU31266.1| hypothetical protein TSUD_153410 [Trifolium subt...   125   7e-30

>gb|PNX90867.1| putative copia-type protein, partial [Trifolium pratense]
          Length = 129

 Score =  130 bits (328), Expect = 6e-36
 Identities = 65/103 (63%), Positives = 79/103 (76%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA  TSELLWIQSLLT+L  P   PV++CDN+S V +AHNPVLHARTKH+ELDL  +RE+
Sbjct: 23  LAHTTSELLWIQSLLTDLHIPIHTPVLFCDNISAVMIAHNPVLHARTKHLELDLHFVRER 82

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNSQ 310
           VL K  +++ +PG  Q ADALTKPLPT RF  LR KL + +SQ
Sbjct: 83  VLAKALNIQHVPGVDQIADALTKPLPTSRFLTLRDKLKVFSSQ 125


>gb|KYP74152.1| Copia protein [Cajanus cajan]
          Length = 258

 Score =  131 bits (330), Expect = 1e-34
 Identities = 67/100 (67%), Positives = 80/100 (80%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA AT+E+LWI++LL ELG  F  PVVYCDN STVSLAHNPVLH+RTKHME++LF +REK
Sbjct: 150 LAQATAEVLWIETLLHELGISFSVPVVYCDNQSTVSLAHNPVLHSRTKHMEINLFFVREK 209

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLV 301
           VL KQ S++ IP + Q ADALTKPL + RF  LR KL +V
Sbjct: 210 VLAKQLSVQHIPAQDQWADALTKPLSSARFLFLRDKLKVV 249


>gb|KYP62480.1| Copia protein, partial [Cajanus cajan]
          Length = 233

 Score =  126 bits (317), Expect = 5e-33
 Identities = 61/102 (59%), Positives = 77/102 (75%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALATSE+LWIQSLL EL      PV+YCDN ST+SL+HNPVLH+RTKHMELD+F +REK
Sbjct: 124 LALATSEILWIQSLLNELQVQIPTPVLYCDNQSTISLSHNPVLHSRTKHMELDIFFVREK 183

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           VL K   + ++P + Q AD LTKPL  ++F + R KL ++ S
Sbjct: 184 VLNKSLIVSYVPTQAQIADILTKPLSKVQFCNFRDKLKVLGS 225


>gb|KYP70937.1| Copia protein [Cajanus cajan]
          Length = 220

 Score =  125 bits (315), Expect = 8e-33
 Identities = 64/100 (64%), Positives = 76/100 (76%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA ATSE+LWIQ+LL EL  PF  P +YCDN STVSLAHNPVLH+ TKHME++LF +REK
Sbjct: 112 LAQATSEVLWIQTLLHELRVPFSTPTIYCDNQSTVSLAHNPVLHSITKHMEINLFFVREK 171

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLV 301
           VL  Q +++ IP   Q ADALTKPL + RF  LR KL +V
Sbjct: 172 VLANQLTVQHIPASDQWADALTKPLSSTRFLILRDKLKVV 211


>gb|KYP64927.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 470

 Score =  129 bits (323), Expect = 6e-32
 Identities = 65/100 (65%), Positives = 77/100 (77%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA ATSE+LWIQ+LL EL  PF  P +YCDN STVSLAHNPVLH+RTKHME++LF +REK
Sbjct: 362 LAQATSEVLWIQTLLHELRVPFSTPTIYCDNQSTVSLAHNPVLHSRTKHMEINLFFVREK 421

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLV 301
           VL  Q +++ IP   Q ADALTKPL + RF  LR KL +V
Sbjct: 422 VLANQLTVQHIPASDQWADALTKPLSSTRFLILRDKLKVV 461


>gb|KYP48058.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 539

 Score =  129 bits (323), Expect = 1e-31
 Identities = 65/100 (65%), Positives = 77/100 (77%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA ATSE+LWIQ+LL EL  PF  P +YCDN STVSLAHNPVLH+RTKHME++LF +REK
Sbjct: 431 LAQATSEVLWIQTLLHELRVPFSTPTIYCDNQSTVSLAHNPVLHSRTKHMEINLFFVREK 490

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLV 301
           VL  Q +++ IP   Q ADALTKPL + RF  LR KL +V
Sbjct: 491 VLANQLTVQHIPASDQWADALTKPLSSTRFLILRDKLKVV 530


>gb|KYP78810.1| Copia protein, partial [Cajanus cajan]
          Length = 145

 Score =  120 bits (301), Expect = 1e-31
 Identities = 58/102 (56%), Positives = 75/102 (73%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALATSE+LWIQSLL EL      P+ YCDN ST+SL++NP LH+RTKHMELD+F +REK
Sbjct: 36  LALATSEILWIQSLLNELQVQIPTPIHYCDNQSTISLSYNPFLHSRTKHMELDIFFVREK 95

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           VL K   + ++P + Q AD LTKPL  ++F + R KL ++ S
Sbjct: 96  VLNKSLIVSYVPTQAQIADILTKPLSKVQFCNFRDKLKVLGS 137


>dbj|GAU47982.1| hypothetical protein TSUD_87860 [Trifolium subterraneum]
          Length = 423

 Score =  127 bits (318), Expect = 2e-31
 Identities = 63/103 (61%), Positives = 75/103 (72%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA  T+E+LW+QSLLTEL   F  P + CDNLSTVSLAHNP LH RTKHMELD+F +REK
Sbjct: 308 LASLTAEILWLQSLLTELQCKFSTPRILCDNLSTVSLAHNPTLHHRTKHMELDIFFVREK 367

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNSQ 310
           VL K  S+  +P + Q AD LTKPL  ++F  LRGKL + N Q
Sbjct: 368 VLSKHLSVSHVPAQDQWADILTKPLSAVKFGLLRGKLRVFNKQ 410


>gb|KYP75334.1| Copia protein [Cajanus cajan]
          Length = 198

 Score =  121 bits (304), Expect = 2e-31
 Identities = 64/102 (62%), Positives = 74/102 (72%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E LWIQ+LL+EL      PVVYCDN+STV+LAHNPVLHARTKHMELDLF +REK
Sbjct: 92  LALATAEALWIQTLLSELHVSHHTPVVYCDNMSTVALAHNPVLHARTKHMELDLFFVREK 151

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V      +  +P   Q AD LTK L   RF +LR KL LV+S
Sbjct: 152 VAANSLHVVHVPAIDQYADILTKALSPSRFCELRTKLKLVDS 193


>gb|KYP59729.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 254

 Score =  122 bits (306), Expect = 4e-31
 Identities = 64/102 (62%), Positives = 75/102 (73%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E+LWIQ+LL+EL      PVVYCDN+STV+LAHNPVLHARTKHMELDLF +REK
Sbjct: 148 LALATAEVLWIQTLLSELHVSHHTPVVYCDNMSTVALAHNPVLHARTKHMELDLFFVREK 207

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V      +  +P   Q AD LTK L   RF +LR KL LV+S
Sbjct: 208 VAANSLHVVHVPAIDQYADILTKALSPSRFCELRTKLKLVDS 249


>gb|KYP75232.1| Copia protein [Cajanus cajan]
          Length = 250

 Score =  121 bits (304), Expect = 7e-31
 Identities = 64/102 (62%), Positives = 74/102 (72%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E LWIQ+LL+EL      PVVYCDN+STV+LAHNPVLHARTKHMELDLF +REK
Sbjct: 144 LALATAEALWIQTLLSELHVSHHTPVVYCDNMSTVALAHNPVLHARTKHMELDLFFVREK 203

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V      +  +P   Q AD LTK L   RF +LR KL LV+S
Sbjct: 204 VAANSLHVVHVPAIDQYADILTKALSPSRFCELRTKLKLVDS 245


>gb|KHN41724.1| Copia protein, partial [Glycine soja]
          Length = 260

 Score =  121 bits (303), Expect = 1e-30
 Identities = 58/102 (56%), Positives = 75/102 (73%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA A SE+LW+QSLL EL  P  PPV+YCDN S V+++HNPVLH+RTKHMELD+F +REK
Sbjct: 150 LAHAASEVLWLQSLLHELKVPIPPPVIYCDNQSAVAISHNPVLHSRTKHMELDIFFVREK 209

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           VL K   + +IP +LQ AD LTK L    F + R KL ++++
Sbjct: 210 VLNKSLVVSYIPAQLQVADILTKSLSKHLFYNFRSKLRVLST 251


>gb|PNX92571.1| histone deacetylase [Trifolium pratense]
          Length = 1488

 Score =  127 bits (318), Expect = 2e-30
 Identities = 62/101 (61%), Positives = 75/101 (74%)
 Frame = +2

Query: 2    LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
            LA AT+ELLWIQ+LLTEL  PF  P + CDN S V LAHNP++H+RTKHME+DLF +REK
Sbjct: 1384 LAHATAELLWIQTLLTELHVPFAAPTILCDNQSAVMLAHNPIMHSRTKHMEIDLFFVREK 1443

Query: 182  VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVN 304
            V+ KQ S+  IPG  Q AD LTKPL T +F  LR KL + +
Sbjct: 1444 VISKQLSVLHIPGTDQWADVLTKPLSTAKFLSLRPKLNVAS 1484


>gb|PNX93473.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1181

 Score =  126 bits (317), Expect = 2e-30
 Identities = 60/103 (58%), Positives = 77/103 (74%)
 Frame = +2

Query: 2    LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
            LA AT+++LW+Q+LL EL  PF  P +YCDN S V LAHNPVLH+RTKHME+D+F +REK
Sbjct: 1079 LAQATADVLWVQTLLNELTVPFTTPTIYCDNQSAVLLAHNPVLHSRTKHMEIDVFFVREK 1138

Query: 182  VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNSQ 310
            VL KQ ++  IPG  Q AD LTKP+ T +F  +R KL + +SQ
Sbjct: 1139 VLAKQLTVVHIPGSTQLADVLTKPVSTDKFLSMRSKLNVRDSQ 1181


>gb|PNY08092.1| histone deacetylase [Trifolium pratense]
          Length = 1339

 Score =  126 bits (316), Expect = 3e-30
 Identities = 62/102 (60%), Positives = 76/102 (74%)
 Frame = +2

Query: 2    LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
            LA AT++ LW+Q+LL EL  PF  PV+YCDN S V LAHNP+LH+RTKHME+DLF +REK
Sbjct: 1237 LAQATADALWVQTLLKELTVPFLAPVIYCDNQSAVLLAHNPILHSRTKHMEIDLFFVREK 1296

Query: 182  VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
            VL KQ S+  IPG  Q AD LTKP+ T +F  +R KL + NS
Sbjct: 1297 VLAKQLSVIHIPGTDQLADILTKPVSTDKFLFMRSKLNVTNS 1338


>gb|KYP30996.1| Copia protein [Cajanus cajan]
          Length = 197

 Score =  118 bits (296), Expect = 3e-30
 Identities = 60/102 (58%), Positives = 73/102 (71%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E+ WIQ+LL+EL      P+++CDNLSTV+LAHNPVLH RTKHMELDLF +REK
Sbjct: 92  LALATAEVTWIQTLLSELQVNHSTPIIFCDNLSTVALAHNPVLHVRTKHMELDLFFVREK 151

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V  K   +  +P   Q AD LTK L   RF DLR K+ +V S
Sbjct: 152 VAAKCLQVVHVPAIDQCADVLTKALSPSRFCDLRSKIQVVES 193


>gb|KYP53273.1| Copia protein [Cajanus cajan]
          Length = 198

 Score =  118 bits (296), Expect = 3e-30
 Identities = 61/102 (59%), Positives = 74/102 (72%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E+LWIQ+LL++L     PPV+Y DN+STV+LAHNPVLHARTKHMELDLF +REK
Sbjct: 92  LALATAEVLWIQTLLSKLHVSHHPPVIYYDNMSTVALAHNPVLHARTKHMELDLFFVREK 151

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V      +  +P   Q AD LTK L   RF +LR KL L +S
Sbjct: 152 VAANLLRVVHVPAADQYADILTKSLSPSRFCELRSKLKLADS 193


>gb|KYP40911.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
 gb|KYP44175.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 255

 Score =  120 bits (300), Expect = 3e-30
 Identities = 61/102 (59%), Positives = 74/102 (72%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LALAT+E+ WIQ+LL+EL      P+++CDNLSTV+LAHNPVLHARTKHMELDLF +REK
Sbjct: 150 LALATTEVTWIQTLLSELKVTHSTPIIFCDNLSTVALAHNPVLHARTKHMELDLFFVREK 209

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNS 307
           V  K   +  +P   Q AD LTK L   RF +LR KL +V S
Sbjct: 210 VAAKCLQVVHVPAIDQCADVLTKALSPTRFCELRSKLQVVES 251


>ref|XP_020234407.1| uncharacterized protein LOC109814403 [Cajanus cajan]
 gb|KYP48081.1| Copia protein [Cajanus cajan]
          Length = 258

 Score =  120 bits (300), Expect = 3e-30
 Identities = 61/100 (61%), Positives = 76/100 (76%)
 Frame = +2

Query: 2   LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
           LA ATSE+LWIQ+LL EL  PF  P +YC+N +TVSLAHNPVLH+RTKHME++LF +REK
Sbjct: 150 LAQATSEVLWIQTLLHELRVPFSTPTIYCNNQNTVSLAHNPVLHSRTKHMEINLFFVREK 209

Query: 182 VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLV 301
           VL    +++ I G+ Q ADALTKPL +  F  LR KL +V
Sbjct: 210 VLANPLTIQHILGQDQWADALTKPLSSTIFLFLRDKLKVV 249


>dbj|GAU31266.1| hypothetical protein TSUD_153410 [Trifolium subterraneum]
          Length = 844

 Score =  125 bits (313), Expect = 7e-30
 Identities = 59/103 (57%), Positives = 77/103 (74%)
 Frame = +2

Query: 2    LALATSELLWIQSLLTELGTPFKPPVVYCDNLSTVSLAHNPVLHARTKHMELDLFLLREK 181
            LA AT+++LW+Q+LL EL  PF  P +YCDN S V LAHNP+LH+RTKHME+D+F +REK
Sbjct: 742  LAQATADVLWVQTLLKELTVPFTTPTIYCDNQSAVLLAHNPILHSRTKHMEIDVFFVREK 801

Query: 182  VLQKQFSLKFIPGELQRADALTKPLPTLRFEDLRGKLTLVNSQ 310
            VL KQ ++  IPG  Q AD LTKP+ T +F  +R KL + +SQ
Sbjct: 802  VLVKQLTVVHIPGTTQLADVLTKPVSTDKFLSMRSKLNVRDSQ 844


Top