BLASTX nr result

ID: Astragalus23_contig00028362 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00028362
         (395 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense]    62   2e-08
dbj|GAU37126.1| hypothetical protein TSUD_278780 [Trifolium subt...    60   6e-08
gb|PNX90798.1| F-box protein, partial [Trifolium pratense]             60   6e-08
gb|KHN45890.1| Retrovirus-related Pol polyprotein from transposo...    60   7e-08
dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subt...    60   8e-08
gb|PNX98468.1| putative copia-type polyprotein, partial [Trifoli...    59   1e-07
gb|PNX62592.1| retrovirus-related Pol polyprotein from transposo...    57   5e-07
ref|XP_012574203.1| PREDICTED: uncharacterized protein LOC105852...    57   9e-07
gb|PNX86900.1| F-box protein [Trifolium pratense]                      57   1e-06
dbj|GAU25734.1| hypothetical protein TSUD_216650 [Trifolium subt...    56   2e-06
dbj|GAU26184.1| hypothetical protein TSUD_354060 [Trifolium subt...    56   2e-06
dbj|GAU18816.1| hypothetical protein TSUD_81050 [Trifolium subte...    56   2e-06
gb|OIW01880.1| hypothetical protein TanjilG_31062 [Lupinus angus...    55   3e-06
dbj|GAU46754.1| hypothetical protein TSUD_402800 [Trifolium subt...    55   3e-06
gb|PNX90720.1| pectinesterase, partial [Trifolium pratense]            55   5e-06
ref|XP_019455195.1| PREDICTED: uncharacterized protein LOC109356...    54   7e-06
ref|XP_019451834.1| PREDICTED: uncharacterized protein LOC109353...    54   8e-06
dbj|GAU23301.1| hypothetical protein TSUD_237550 [Trifolium subt...    54   8e-06
ref|XP_019451905.1| PREDICTED: uncharacterized protein LOC109354...    54   1e-05

>gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense]
          Length = 762

 Score = 61.6 bits (148), Expect = 2e-08
 Identities = 39/109 (35%), Positives = 55/109 (50%)
 Frame = -1

Query: 386 RSSEKETEQALLSYSSKRNWKAGNXXXXXXXXXXXXXXXXXSFDKKKGDLSKVQCYNCQQ 207
           + SE   +QAL +  +K+  K  N                 +  KKK +  ++QCYNCQ+
Sbjct: 211 KQSEDSNDQALQAQYNKKG-KNQNSNEGNGKNQDSNQQENSNGQKKKFNKKEIQCYNCQK 269

Query: 206 FGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVEDEIRLDE 60
           +GH+  +CK+KKVPR+K  EAKF          E  MLMAV  E   D+
Sbjct: 270 WGHFAAECKSKKVPREKTDEAKFV-YDKQEEDPESSMLMAVIKEEEDDD 317


>dbj|GAU37126.1| hypothetical protein TSUD_278780 [Trifolium subterraneum]
          Length = 870

 Score = 60.5 bits (145), Expect = 6e-08
 Identities = 40/117 (34%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLS------YSSKRNWKA-------GNXXXXXXXXXXXXXXXXXSFD 255
           ++QRSS K  EQAL +      Y  K  WK         +                    
Sbjct: 204 VKQRSSNKAVEQALQAKIQNKNYKGKDKWKKKKEESENSSKNSKTQAAGSIKGNQNKKNP 263

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAV 84
           KKK D   +QCYNCQ +GHY  +C +KKV R    EA+FAN       S++ +LMA+
Sbjct: 264 KKKIDKKDIQCYNCQNYGHYARECNSKKVERGDKDEAQFAN--GGGSDSDDSLLMAI 318


>gb|PNX90798.1| F-box protein, partial [Trifolium pratense]
          Length = 264

 Score = 59.7 bits (143), Expect = 6e-08
 Identities = 42/118 (35%), Positives = 53/118 (44%), Gaps = 14/118 (11%)
 Frame = -1

Query: 395 IRQRSSEKETEQAL------LSYSSKRNWK--------AGNXXXXXXXXXXXXXXXXXSF 258
           ++QRSS K  EQAL       S   K  WK        A                     
Sbjct: 80  VKQRSSNKAVEQALQAKFQNKSKKGKEKWKKKKEESENASKNSKAQGAESSKGNNQNKKN 139

Query: 257 DKKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAV 84
            KKK D   VQCYNCQ+ GHY  +C +KKV R+   EA+FAN       S++ +LMA+
Sbjct: 140 FKKKIDKKDVQCYNCQKLGHYARECHSKKVDREDKDEAQFAN--GGGSDSDDSLLMAI 195


>gb|KHN45890.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 374

 Score = 60.1 bits (144), Expect = 7e-08
 Identities = 41/128 (32%), Positives = 61/128 (47%), Gaps = 15/128 (11%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKR---------NWKAGNXXXXXXXXXXXXXXXXXSFDKKKG 243
           I+ +S+E   +QAL +   K+         N  + N                    KKK 
Sbjct: 209 IKDKSNESAADQALQAQHQKKGKYKKGKRKNQNSKNSNEGTSKGHDHSSQEGNKGQKKKI 268

Query: 242 DLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAV------E 81
           +   +QCYNCQ++GH+  +CK+KKVPR++  EAKF +       SE  MLMA+      +
Sbjct: 269 NKKDIQCYNCQKWGHFAAECKSKKVPREESDEAKFVH-DNEGYDSEGAMLMAIIKEEDDD 327

Query: 80  DEIRLDEG 57
           D+  LD G
Sbjct: 328 DQWYLDTG 335


>dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subterraneum]
          Length = 1318

 Score = 60.1 bits (144), Expect = 8e-08
 Identities = 36/101 (35%), Positives = 45/101 (44%), Gaps = 13/101 (12%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLS------YSSKRNWKA-------GNXXXXXXXXXXXXXXXXXSFD 255
           ++QRSS K  EQAL +      Y  K  WK         +                    
Sbjct: 204 VKQRSSNKAVEQALQAKIQNKNYKGKDKWKKKKEEPENSSKNSKTQAVGSIKGNQNKKNP 263

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFAN 132
           KKK D   +QCYNCQ +GHY  +C +KKV R    EA+FAN
Sbjct: 264 KKKIDKKDIQCYNCQNYGHYARECNSKKVERGDKDEAQFAN 304


>gb|PNX98468.1| putative copia-type polyprotein, partial [Trifolium pratense]
          Length = 1267

 Score = 59.3 bits (142), Expect = 1e-07
 Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 13/117 (11%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRNWK-------------AGNXXXXXXXXXXXXXXXXXSFD 255
           ++QRSS K  EQAL +    +N K             + +                    
Sbjct: 204 VKQRSSSKAVEQALQAKVQNKNHKGKDKCKKKKDDSESSSKNSKNQAGESSKGNQNKKNF 263

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAV 84
           KKK D   VQCYNCQ+ GHY  +C +KKV RD   EA+FAN       S++ +LMA+
Sbjct: 264 KKKVDKKDVQCYNCQKHGHYARECHSKKVDRDDKDEAQFAN--GGGSESDDSLLMAI 318


>gb|PNX62592.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 233

 Score = 57.0 bits (136), Expect = 5e-07
 Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 7/120 (5%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRN----WKAGNXXXXXXXXXXXXXXXXXSFDKKKGDLSKV 228
           + +RS ++  +QAL + + K+N    WK  N                   +KK      +
Sbjct: 17  VNERSKDRGIDQALQAQTFKKNGGNKWKGKNKYENGNSQNDSNNKGKKQMNKKN-----I 71

Query: 227 QCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEE---LMLMAVEDEIRLDEG 57
           QC+NCQ++GH+ ++C+ +KVPR  + E   AN       SE+   LM+    D++   EG
Sbjct: 72  QCHNCQKYGHFASECRGQKVPRQYNNEESKANMAKNDSGSEQDPLLMMATTNDDLDKQEG 131


>ref|XP_012574203.1| PREDICTED: uncharacterized protein LOC105852602 [Cicer arietinum]
          Length = 300

 Score = 56.6 bits (135), Expect = 9e-07
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 16/104 (15%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRNW------KAGNXXXXXXXXXXXXXXXXXSFD------- 255
           I+QRSS+K  EQAL + +SK+N+      K G                  + +       
Sbjct: 76  IKQRSSDKVIEQALQAQTSKKNFIDRRKFKKGKWKNQKRKRSCEDTSEQGAHEGNKKEGN 135

Query: 254 ---KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFAN 132
              KK  D   +QC+NCQ++GH+ N+CK+ K  + KD E ++AN
Sbjct: 136 TKYKKMFDKKGIQCFNCQKYGHFANECKHNKSLQRKDDEVQYAN 179


>gb|PNX86900.1| F-box protein [Trifolium pratense]
          Length = 405

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 36/130 (27%), Positives = 59/130 (45%), Gaps = 17/130 (13%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRN----WKAGNXXXXXXXXXXXXXXXXXSFD--------- 255
           + +RS ++  +QAL + + K+N    WK  N                   +         
Sbjct: 174 VNERSKDRGIDQALQAQTFKKNGGNKWKGKNKYKNGNSQSDSSKKEQDKGESSKNGSNNK 233

Query: 254 -KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEE---LMLMA 87
            KKK +   +QCYNCQ++GH+ ++C+ +KVPR  + E   AN       SE+   LM+  
Sbjct: 234 GKKKMNKKNIQCYNCQKYGHFASECRGQKVPRQYNNEESKANMAKNDSGSEQDPLLMMAT 293

Query: 86  VEDEIRLDEG 57
             D++   EG
Sbjct: 294 TNDDLDKQEG 303


>dbj|GAU25734.1| hypothetical protein TSUD_216650 [Trifolium subterraneum]
          Length = 787

 Score = 55.8 bits (133), Expect = 2e-06
 Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 3/63 (4%)
 Frame = -1

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPR---DKDAEAKFANXXXXXXXSEELMLMAV 84
           KK  +   +QCYNCQ++GH+ +KC+NKKVPR   ++++EA  A           LM+  +
Sbjct: 258 KKFMNKKNIQCYNCQKYGHFASKCRNKKVPRQYNNEESEANMAQDDDGSETDVVLMMAII 317

Query: 83  EDE 75
           +DE
Sbjct: 318 DDE 320


>dbj|GAU26184.1| hypothetical protein TSUD_354060 [Trifolium subterraneum]
          Length = 819

 Score = 55.8 bits (133), Expect = 2e-06
 Identities = 30/81 (37%), Positives = 46/81 (56%), Gaps = 5/81 (6%)
 Frame = -1

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPR---DKDAEAKFANXXXXXXXSEELMLMAV 84
           KK  +   +QCYNCQ++GH+ +KC+NKKVPR   ++++EA  A           LM+  +
Sbjct: 320 KKFMNKKNIQCYNCQKYGHFASKCRNKKVPRQYNNEESEANMAQDDDGSETDVVLMMTII 379

Query: 83  EDEIRLDEGIA--ESSEQNSG 27
           +D+    E  A  +   QNSG
Sbjct: 380 DDDNDGSECKAPMKKKSQNSG 400


>dbj|GAU18816.1| hypothetical protein TSUD_81050 [Trifolium subterraneum]
          Length = 1380

 Score = 55.8 bits (133), Expect = 2e-06
 Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
 Frame = -1

Query: 386 RSSEKETEQALLSYSSKR------NWKAGNXXXXXXXXXXXXXXXXXSFDKKKGDLSKVQ 225
           +  EKETE+AL + S K+      +WK                        KK     +Q
Sbjct: 211 KKDEKETEKALFTQSQKKGSGSYESWKKKGKGKWKSNKNEGGNGKG-----KKKSKEHIQ 265

Query: 224 CYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVEDE 75
           CYNCQ++GH+ ++C N KVPR K+ EA+ A         E + L+A  DE
Sbjct: 266 CYNCQKWGHFADECVNPKVPRKKNEEAQLAR----DSDEEVVALVATIDE 311


>gb|OIW01880.1| hypothetical protein TanjilG_31062 [Lupinus angustifolius]
          Length = 365

 Score = 55.5 bits (132), Expect = 3e-06
 Identities = 28/81 (34%), Positives = 49/81 (60%), Gaps = 5/81 (6%)
 Frame = -1

Query: 251 KKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMA----- 87
           KK D SK+QC+NC+ +GH+ ++CK K+V + K+ EA+           EE++LMA     
Sbjct: 139 KKKDKSKIQCFNCRNWGHFASECKEKRVIQTKEEEARLEK---DEESEEEVLLMARSIDS 195

Query: 86  VEDEIRLDEGIAESSEQNSGN 24
           +E+  + D+ +   + Q+SG+
Sbjct: 196 LEEANKSDDALLMVTNQSSGS 216


>dbj|GAU46754.1| hypothetical protein TSUD_402800 [Trifolium subterraneum]
          Length = 584

 Score = 55.5 bits (132), Expect = 3e-06
 Identities = 38/133 (28%), Positives = 60/133 (45%), Gaps = 20/133 (15%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRNWKAGNXXXXXXXXXXXXXXXXXSFDKKKGDLSK----- 231
           I  R+  + T+QAL +++SK+    GN                    KK GD  +     
Sbjct: 175 INARAKNRTTDQALWAHTSKK----GNGNKIKGKDQFKKENSQQEIPKKNGDQGESSKYD 230

Query: 230 -------------VQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSE--ELM 96
                        +QCYNCQ++GH+ ++C+ KKVPR  + E   AN       SE   ++
Sbjct: 231 GNGKGKKFMNKKNIQCYNCQKYGHFASECRGKKVPRQYNNEESEANMAQDDNGSETYAVL 290

Query: 95  LMAVEDEIRLDEG 57
           +MA+ D+   D+G
Sbjct: 291 MMAIIDDGYDDKG 303


>gb|PNX90720.1| pectinesterase, partial [Trifolium pratense]
          Length = 334

 Score = 54.7 bits (130), Expect = 5e-06
 Identities = 34/131 (25%), Positives = 57/131 (43%), Gaps = 9/131 (6%)
 Frame = -1

Query: 389 QRSSEKETEQALLS----YSSKRNWKAGNXXXXXXXXXXXXXXXXXS-----FDKKKGDL 237
           +R   KE EQAL +    + SK+ W+  N                        + KKG  
Sbjct: 202 ERDHGKEDEQALYAKFKKFQSKKKWQKKNESKKGKESDEDKPESSKKERGGSVNSKKGSK 261

Query: 236 SKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVEDEIRLDEG 57
             +QC+NCQ+FG + ++C+ +KVPR  + EA  A            M+   ++    DE 
Sbjct: 262 KHIQCFNCQEFGXFASECRGQKVPRQYNEEANVAQDDSTSEEDVNFMVTVTDE----DES 317

Query: 56  IAESSEQNSGN 24
            A  ++ ++G+
Sbjct: 318 EAHMTKNDNGS 328


>ref|XP_019455195.1| PREDICTED: uncharacterized protein LOC109356326 [Lupinus
           angustifolius]
          Length = 489

 Score = 54.3 bits (129), Expect = 7e-06
 Identities = 28/58 (48%), Positives = 38/58 (65%)
 Frame = -1

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVE 81
           KKK D SK+QCYNC  +GH+ ++CK KK  + K+AEA+ A         EE++LMA E
Sbjct: 291 KKKKDKSKIQCYNCNNWGHFASECKFKK--KGKEAEARLAK---DEESDEEVLLMAEE 343


>ref|XP_019451834.1| PREDICTED: uncharacterized protein LOC109353933 [Lupinus
           angustifolius]
          Length = 688

 Score = 54.3 bits (129), Expect = 8e-06
 Identities = 28/78 (35%), Positives = 42/78 (53%)
 Frame = -1

Query: 248 KGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVEDEIR 69
           K D SK+QC+NC+ +GHY ++CK K+  ++K+ EA+ A         EE++LMA      
Sbjct: 274 KKDKSKIQCFNCRNWGHYASECKEKRATQNKEEEARLAK---DDESEEEVLLMA------ 324

Query: 68  LDEGIAESSEQNSGNTCI 15
               I  S E N  N  +
Sbjct: 325 --RSIDPSQEANESNNAL 340


>dbj|GAU23301.1| hypothetical protein TSUD_237550 [Trifolium subterraneum]
          Length = 692

 Score = 54.3 bits (129), Expect = 8e-06
 Identities = 37/127 (29%), Positives = 56/127 (44%), Gaps = 20/127 (15%)
 Frame = -1

Query: 395 IRQRSSEKETEQALLSYSSKRNWKAGNXXXXXXXXXXXXXXXXXSFDKKKGDLSK----- 231
           I  R+  K  +QAL +++SK+    GN                    KK GD  +     
Sbjct: 146 INARAKNKTNDQALWAHTSKK----GNGNNNKGKDQFKKENSPQESSKKNGDQGESSKSN 201

Query: 230 -------------VQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSE--ELM 96
                        +QCYNCQ++GH+ ++C+ KKVPR  + E   AN       SE   L+
Sbjct: 202 GNGKGKKFMNKKNIQCYNCQKYGHFASECRGKKVPRQYNHEESKANLAQNDSGSEADPLL 261

Query: 95  LMAVEDE 75
           +MA+ +E
Sbjct: 262 MMAITNE 268


>ref|XP_019451905.1| PREDICTED: uncharacterized protein LOC109354003 [Lupinus
           angustifolius]
          Length = 418

 Score = 53.9 bits (128), Expect = 1e-05
 Identities = 32/72 (44%), Positives = 43/72 (59%)
 Frame = -1

Query: 254 KKKGDLSKVQCYNCQQFGHYRNKCKNKKVPRDKDAEAKFANXXXXXXXSEELMLMAVEDE 75
           KKK D SK+QCYNC  +GH+ ++CK KK+  +K+AEA+ A         EE++LMA E  
Sbjct: 266 KKKKDKSKIQCYNCNNWGHFASECKFKKM--NKEAEARLAK---DEESDEEVLLMA-EGM 319

Query: 74  IRLDEGIAESSE 39
           I    G   S E
Sbjct: 320 ISASVGETTSDE 331


Top