BLASTX nr result

ID: Astragalus23_contig00033565 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00033565
         (346 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_016172842.1| uncharacterized protein LOC107615265 [Arachi...    91   8e-19
ref|XP_015385514.1| PREDICTED: uncharacterized protein LOC107176...    86   1e-18
gb|KRH54809.1| hypothetical protein GLYMA_06G210500 [Glycine max]      82   2e-18
ref|XP_016195605.1| uncharacterized protein LOC107636622 [Arachi...    84   2e-18
ref|XP_016164695.1| uncharacterized protein LOC107607236 [Arachi...    89   5e-18
gb|KYP59627.1| Gypsy retrotransposon integrase-like protein 1 [C...    87   1e-17
gb|KYP51055.1| Gypsy retrotransposon integrase-like protein 1, p...    79   1e-17
ref|XP_016195081.1| uncharacterized protein LOC107636060 [Arachi...    87   2e-17
gb|KYP55059.1| Gypsy retrotransposon integrase-like protein 1 [C...    86   3e-17
ref|XP_024038287.1| uncharacterized protein LOC112097335 [Citrus...    86   3e-17
gb|KYP67718.1| Pol polyprotein [Cajanus cajan]                         86   3e-17
ref|XP_020215686.1| uncharacterized protein LOC109799522 [Cajanu...    86   3e-17
ref|XP_015960249.1| uncharacterized protein LOC107484143 [Arachi...    86   3e-17
gb|KYP38150.1| Uncharacterized protein Mb2253c family [Cajanus c...    78   3e-17
gb|KYP63291.1| Retrovirus-related Pol polyprotein from transposo...    76   4e-17
gb|KYP33134.1| Retrovirus-related Pol polyprotein from transposo...    78   4e-17
gb|KYP58222.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]    76   4e-17
ref|XP_020225282.1| uncharacterized protein LOC109807168 [Cajanu...    86   4e-17
ref|XP_016165268.1| uncharacterized protein LOC107607885 [Arachi...    86   4e-17
ref|XP_015959802.1| uncharacterized protein LOC107483704 [Arachi...    84   5e-17

>ref|XP_016172842.1| uncharacterized protein LOC107615265 [Arachis ipaensis]
          Length = 466

 Score = 90.5 bits (223), Expect = 8e-19
 Identities = 45/113 (39%), Positives = 63/113 (55%)
 Frame = -1

Query: 346 SSPNSQARKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEP 167
           + P    R L++     S  S+T    LPI              SW  PIL Y++ GV P
Sbjct: 263 TKPGQGNRSLIQEVVRTSSVSTTTDTGLPI----------SDQESWTSPILQYLLNGVLP 312

Query: 166 PDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGIC 8
            D  +AK I+R+A++Y+++ G +YKR  S PLLKC+E E+  Y+L EIHEG C
Sbjct: 313 EDPKEAKQIKREAANYTIVTGQLYKRGFSQPLLKCVEPEDTEYILREIHEGCC 365


>ref|XP_015385514.1| PREDICTED: uncharacterized protein LOC107176899 [Citrus sinensis]
 ref|XP_015385515.1| PREDICTED: uncharacterized protein LOC107176900 [Citrus sinensis]
          Length = 167

 Score = 85.5 bits (210), Expect = 1e-18
 Identities = 36/70 (51%), Positives = 52/70 (74%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SWI PI+ Y+  GV PPDK  A+ +R +AS Y++IDG +Y+R  + P L+C++ ++A YV
Sbjct: 62  SWIDPIISYLRDGVLPPDKLRARKVRAQASRYTMIDGVLYRRGYTLPFLRCLDEDDADYV 121

Query: 34  LAEIHEGICG 5
           L E+HEGICG
Sbjct: 122 LREVHEGICG 131


>gb|KRH54809.1| hypothetical protein GLYMA_06G210500 [Glycine max]
          Length = 81

 Score = 82.4 bits (202), Expect = 2e-18
 Identities = 32/70 (45%), Positives = 51/70 (72%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           +W+ P   ++I GV P D+N  + ++RKAS Y+++DG ++KR L+ PLLKC+  ++  YV
Sbjct: 9   NWMTPYRNFLIQGVLPSDENGTRCLKRKASYYAILDGELFKRGLTKPLLKCLNNQQIDYV 68

Query: 34  LAEIHEGICG 5
           + E+HEGICG
Sbjct: 69  MKELHEGICG 78


>ref|XP_016195605.1| uncharacterized protein LOC107636622 [Arachis ipaensis]
          Length = 120

 Score = 83.6 bits (205), Expect = 2e-18
 Identities = 42/107 (39%), Positives = 58/107 (54%)
 Frame = -1

Query: 325 RKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAK 146
           R L +        S+T  + LP+              SW  PIL Y+I G+ P D  +  
Sbjct: 11  RSLFQEVARTPSVSATPDIVLPV----------SNQESWTFPILQYLIDGMLPEDSKEVN 60

Query: 145 LIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5
            I+R+A++Y+VI G +YKR  S PLLKCIE  +  Y+L E+HEG CG
Sbjct: 61  RIKREAANYTVITGQLYKRGFSQPLLKCIEPGDTEYILREVHEGCCG 107


>ref|XP_016164695.1| uncharacterized protein LOC107607236 [Arachis ipaensis]
          Length = 768

 Score = 88.6 bits (218), Expect = 5e-18
 Identities = 38/70 (54%), Positives = 49/70 (70%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW  PIL Y++ G  PPD  + K IRR+A++Y+V+ G +YKR  S PLLKC+E E   Y+
Sbjct: 382 SWTHPILQYLLDGTLPPDPKEGKRIRREAANYTVVTGQLYKRGFSQPLLKCVEPENTEYI 441

Query: 34  LAEIHEGICG 5
           L EIHEG CG
Sbjct: 442 LREIHEGCCG 451


>gb|KYP59627.1| Gypsy retrotransposon integrase-like protein 1 [Cajanus cajan]
          Length = 435

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 45/111 (40%), Positives = 70/111 (63%)
 Frame = -1

Query: 337 NSQARKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDK 158
           +SQ    L  T HL   +S+    +P  C+P      + ++SW+  I+ +I+ G EP D 
Sbjct: 70  SSQKPGQLRSTIHLELPTSS----IPQECMPIE----EPSNSWMTGIMNFIVNGSEPADP 121

Query: 157 NDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5
            DAK I+ KA+ YS++ G +Y+R  S+PLLKC++ ++A YV+ E+HEGICG
Sbjct: 122 IDAKKIQTKAARYSMVAGELYRRGFSTPLLKCLDQQQADYVIREVHEGICG 172


>gb|KYP51055.1| Gypsy retrotransposon integrase-like protein 1, partial [Cajanus
           cajan]
          Length = 411

 Score = 79.3 bits (194), Expect(2) = 1e-17
 Identities = 33/69 (47%), Positives = 50/69 (72%)
 Frame = -1

Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32
           W+  I GY+  G+ P DKN+A+ IR +++ + +I   ++KR +SSPLLKC+   +AAYV+
Sbjct: 44  WMAGISGYLKEGILPEDKNEARKIRMRSAKFVIIRDELFKRGVSSPLLKCLTASQAAYVI 103

Query: 31  AEIHEGICG 5
            EIH+GICG
Sbjct: 104 KEIHQGICG 112



 Score = 37.7 bits (86), Expect(2) = 1e-17
 Identities = 19/34 (55%), Positives = 22/34 (64%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSVSPDPVFAS 243
           L+KLASTK  G H+T IQE LH PS+    V  S
Sbjct: 4   LSKLASTKRPGQHQTIIQETLHSPSLDDKVVNVS 37


>ref|XP_016195081.1| uncharacterized protein LOC107636060 [Arachis ipaensis]
          Length = 862

 Score = 86.7 bits (213), Expect = 2e-17
 Identities = 37/70 (52%), Positives = 49/70 (70%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW  PIL Y++ G  PPD  + K I+R+A++Y+++ G +YKR  S PLLKCIE E   Y+
Sbjct: 460 SWTYPILQYLLDGTLPPDPKEEKRIKREAANYTIVTGQLYKRGFSQPLLKCIEPENTEYI 519

Query: 34  LAEIHEGICG 5
           L EIHEG CG
Sbjct: 520 LREIHEGCCG 529


>gb|KYP55059.1| Gypsy retrotransposon integrase-like protein 1 [Cajanus cajan]
          Length = 467

 Score = 86.3 bits (212), Expect = 3e-17
 Identities = 37/70 (52%), Positives = 53/70 (75%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW+  IL +I+ G EP + ++AK IR +A+ YSV+ G +Y+R  S+PLLKCI+ ++A YV
Sbjct: 175 SWMTEILNFIVNGTEPAEPSEAKRIRTQAAQYSVVAGELYRRGFSTPLLKCIDHQQANYV 234

Query: 34  LAEIHEGICG 5
           + EIHEGICG
Sbjct: 235 IGEIHEGICG 244


>ref|XP_024038287.1| uncharacterized protein LOC112097335 [Citrus clementina]
          Length = 549

 Score = 86.3 bits (212), Expect = 3e-17
 Identities = 37/70 (52%), Positives = 52/70 (74%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW+ PIL YI  GV P DK  A+ ++ +A+ Y+++DG +Y+R  + PLL+C++ EEA YV
Sbjct: 221 SWMDPILAYIRDGVLPEDKRQARKLKCRAARYTLLDGVLYRRGFTLPLLRCVDDEEADYV 280

Query: 34  LAEIHEGICG 5
           L EIHEGICG
Sbjct: 281 LREIHEGICG 290


>gb|KYP67718.1| Pol polyprotein [Cajanus cajan]
          Length = 673

 Score = 86.3 bits (212), Expect = 3e-17
 Identities = 46/104 (44%), Positives = 65/104 (62%)
 Frame = -1

Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137
           L  T HL   +S+    +P  C+P      + T SW+  I+ YII+  EP D  +AK +R
Sbjct: 376 LRSTIHLELPTSS----IPQECIPIE----RPTSSWMTNIINYIISSSEPSDPLEAKKVR 427

Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5
            +A+ YS+I G +Y+R  S+PLLKC++  +A YVL EIHEGICG
Sbjct: 428 TQAARYSLIAGELYRRGFSTPLLKCLDQPQADYVLREIHEGICG 471


>ref|XP_020215686.1| uncharacterized protein LOC109799522 [Cajanus cajan]
          Length = 747

 Score = 86.3 bits (212), Expect = 3e-17
 Identities = 46/104 (44%), Positives = 65/104 (62%)
 Frame = -1

Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137
           L  T HL   +S+    +P  C+P      + T SW+  I+ YII+  EP D  +AK +R
Sbjct: 450 LRSTIHLELPTSS----IPQECIPIE----RPTSSWMTNIINYIISSSEPSDPLEAKKVR 501

Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5
            +A+ YS+I G +Y+R  S+PLLKC++  +A YVL EIHEGICG
Sbjct: 502 TQAARYSLIAGELYRRGFSTPLLKCLDQPQADYVLREIHEGICG 545


>ref|XP_015960249.1| uncharacterized protein LOC107484143 [Arachis duranensis]
          Length = 425

 Score = 85.9 bits (211), Expect(2) = 3e-17
 Identities = 35/70 (50%), Positives = 49/70 (70%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW  PIL Y++ G  PPD  + K I+R+A++Y+++ G +YKR  S PLLKC+E  +  Y+
Sbjct: 202 SWTHPILQYLLDGTLPPDPKEGKRIKREAANYTIVTGQLYKRGFSQPLLKCVEPRDTEYI 261

Query: 34  LAEIHEGICG 5
           L EIHEG CG
Sbjct: 262 LREIHEGCCG 271



 Score = 30.0 bits (66), Expect(2) = 3e-17
 Identities = 14/27 (51%), Positives = 20/27 (74%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSVS 264
           L+KLASTK    +++ IQE++  PSVS
Sbjct: 162 LSKLASTKPGHGNKSLIQEVVRSPSVS 188


>gb|KYP38150.1| Uncharacterized protein Mb2253c family [Cajanus cajan]
          Length = 289

 Score = 77.8 bits (190), Expect(2) = 3e-17
 Identities = 33/73 (45%), Positives = 51/73 (69%)
 Frame = -1

Query: 223 KTHSWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEA 44
           K H W+  I  Y+  GV P DK++A+ IR +++ + ++D  ++KR +S+PLLKC+   +A
Sbjct: 199 KDHGWMTGIWSYLKEGVLPKDKDEAQKIRVRSAKFVIVDDELFKRGISTPLLKCLTAPQA 258

Query: 43  AYVLAEIHEGICG 5
           AYV+ EIH GICG
Sbjct: 259 AYVIEEIHWGICG 271



 Score = 38.1 bits (87), Expect(2) = 3e-17
 Identities = 17/26 (65%), Positives = 20/26 (76%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267
           L+KLASTK  G HRT IQE +H PS+
Sbjct: 163 LSKLASTKRPGQHRTIIQETMHSPSL 188


>gb|KYP63291.1| Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus
           cajan]
          Length = 1133

 Score = 76.3 bits (186), Expect(2) = 4e-17
 Identities = 33/69 (47%), Positives = 48/69 (69%)
 Frame = -1

Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32
           W+  I  Y+  GV P DKN A+ IR +++ + +I   ++KR +SSPLLKC+   +AAYV+
Sbjct: 723 WMASIWRYLKEGVLPEDKNAARKIRMRSTKFVIIGDELFKRGISSPLLKCLTASQAAYVI 782

Query: 31  AEIHEGICG 5
            EIH+GICG
Sbjct: 783 REIHQGICG 791



 Score = 39.3 bits (90), Expect(2) = 4e-17
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267
           L+KLA+TK  G HRTFIQE LH PS+
Sbjct: 683 LSKLANTKRPGQHRTFIQEPLHSPSL 708


>gb|KYP33134.1| Retrovirus-related Pol polyprotein from transposon opus [Cajanus
           cajan]
          Length = 915

 Score = 77.8 bits (190), Expect(2) = 4e-17
 Identities = 33/69 (47%), Positives = 49/69 (71%)
 Frame = -1

Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32
           W+  I  Y+  GV P DKN A+ +R +A+ + +ID  ++KR ++SPLLKC+   +AAYV+
Sbjct: 591 WMTNIWKYLKEGVLPEDKNKARKVRMRAAKFVIIDDELFKRGIASPLLKCLTASQAAYVI 650

Query: 31  AEIHEGICG 5
            EIH+GICG
Sbjct: 651 KEIHQGICG 659



 Score = 37.7 bits (86), Expect(2) = 4e-17
 Identities = 17/26 (65%), Positives = 20/26 (76%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267
           L+KLA+TK  G HRT IQE LH PS+
Sbjct: 551 LSKLANTKRPGQHRTIIQETLHSPSL 576


>gb|KYP58222.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 480

 Score = 76.3 bits (186), Expect(2) = 4e-17
 Identities = 32/74 (43%), Positives = 51/74 (68%)
 Frame = -1

Query: 226 KKTHSWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEE 47
           K+   W+  I GY+  G+ P DK++A+ IR +++ + +I   ++K  +SSPLLKC+   +
Sbjct: 182 KEDLGWMAGIWGYLKEGILPEDKDEAQKIRMRSAKFVIIGDELFKHGVSSPLLKCLTASQ 241

Query: 46  AAYVLAEIHEGICG 5
           AAYV+ EIH+GICG
Sbjct: 242 AAYVIREIHQGICG 255



 Score = 39.3 bits (90), Expect(2) = 4e-17
 Identities = 19/31 (61%), Positives = 21/31 (67%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSVSPDPV 252
           L+KLASTK  G HRT IQE LH PS+    V
Sbjct: 147 LSKLASTKRPGQHRTIIQETLHSPSLDDKAV 177


>ref|XP_020225282.1| uncharacterized protein LOC109807168 [Cajanus cajan]
          Length = 411

 Score = 85.5 bits (210), Expect = 4e-17
 Identities = 46/104 (44%), Positives = 62/104 (59%)
 Frame = -1

Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137
           L  T HL     T P  +P  C+    T  + T +WI  I  Y+  G EP D + AK +R
Sbjct: 175 LRTTLHLEL---TTPSVVPTECM----TIGEPTRTWITDITNYLEHGKEPSDPSAAKKLR 227

Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5
            +A+ YS++ G +Y+R  S PLLKC++ E+A YVL EIHEGICG
Sbjct: 228 TQAARYSMVGGELYRRGFSVPLLKCVDAEQANYVLREIHEGICG 271


>ref|XP_016165268.1| uncharacterized protein LOC107607885 [Arachis ipaensis]
          Length = 1341

 Score = 85.9 bits (211), Expect = 4e-17
 Identities = 37/70 (52%), Positives = 48/70 (68%)
 Frame = -1

Query: 214  SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
            SW  PIL Y+  G  PPD  + K I+R+A++Y+V+ G +YKR  S PLLKC+E E   Y+
Sbjct: 971  SWTYPILQYLFDGTLPPDPKEGKRIKREAANYTVVAGQLYKRGFSQPLLKCVEPENTGYI 1030

Query: 34   LAEIHEGICG 5
            L EIHEG CG
Sbjct: 1031 LHEIHEGCCG 1040


>ref|XP_015959802.1| uncharacterized protein LOC107483704 [Arachis duranensis]
          Length = 590

 Score = 83.6 bits (205), Expect(2) = 5e-17
 Identities = 36/70 (51%), Positives = 49/70 (70%)
 Frame = -1

Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35
           SW  PIL Y++ G  PPD  + + I+R+A++Y++I G +YKR  S PLLKCIE  +  Y+
Sbjct: 328 SWTYPILQYLLDGTLPPDPKEERRIKREAANYTIIAGQLYKRGFSQPLLKCIEPGDTEYI 387

Query: 34  LAEIHEGICG 5
           L EIHEG CG
Sbjct: 388 LREIHEGCCG 397



 Score = 31.6 bits (70), Expect(2) = 5e-17
 Identities = 15/27 (55%), Positives = 21/27 (77%)
 Frame = -3

Query: 344 LAKLASTKALGNHRTFIQEILHKPSVS 264
           L+KLASTK+   +R+ IQE++  PSVS
Sbjct: 288 LSKLASTKSGHGNRSLIQEVVKSPSVS 314


Top