BLASTX nr result
ID: Astragalus23_contig00033565
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00033565 (346 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_016172842.1| uncharacterized protein LOC107615265 [Arachi... 91 8e-19 ref|XP_015385514.1| PREDICTED: uncharacterized protein LOC107176... 86 1e-18 gb|KRH54809.1| hypothetical protein GLYMA_06G210500 [Glycine max] 82 2e-18 ref|XP_016195605.1| uncharacterized protein LOC107636622 [Arachi... 84 2e-18 ref|XP_016164695.1| uncharacterized protein LOC107607236 [Arachi... 89 5e-18 gb|KYP59627.1| Gypsy retrotransposon integrase-like protein 1 [C... 87 1e-17 gb|KYP51055.1| Gypsy retrotransposon integrase-like protein 1, p... 79 1e-17 ref|XP_016195081.1| uncharacterized protein LOC107636060 [Arachi... 87 2e-17 gb|KYP55059.1| Gypsy retrotransposon integrase-like protein 1 [C... 86 3e-17 ref|XP_024038287.1| uncharacterized protein LOC112097335 [Citrus... 86 3e-17 gb|KYP67718.1| Pol polyprotein [Cajanus cajan] 86 3e-17 ref|XP_020215686.1| uncharacterized protein LOC109799522 [Cajanu... 86 3e-17 ref|XP_015960249.1| uncharacterized protein LOC107484143 [Arachi... 86 3e-17 gb|KYP38150.1| Uncharacterized protein Mb2253c family [Cajanus c... 78 3e-17 gb|KYP63291.1| Retrovirus-related Pol polyprotein from transposo... 76 4e-17 gb|KYP33134.1| Retrovirus-related Pol polyprotein from transposo... 78 4e-17 gb|KYP58222.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 76 4e-17 ref|XP_020225282.1| uncharacterized protein LOC109807168 [Cajanu... 86 4e-17 ref|XP_016165268.1| uncharacterized protein LOC107607885 [Arachi... 86 4e-17 ref|XP_015959802.1| uncharacterized protein LOC107483704 [Arachi... 84 5e-17 >ref|XP_016172842.1| uncharacterized protein LOC107615265 [Arachis ipaensis] Length = 466 Score = 90.5 bits (223), Expect = 8e-19 Identities = 45/113 (39%), Positives = 63/113 (55%) Frame = -1 Query: 346 SSPNSQARKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEP 167 + P R L++ S S+T LPI SW PIL Y++ GV P Sbjct: 263 TKPGQGNRSLIQEVVRTSSVSTTTDTGLPI----------SDQESWTSPILQYLLNGVLP 312 Query: 166 PDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGIC 8 D +AK I+R+A++Y+++ G +YKR S PLLKC+E E+ Y+L EIHEG C Sbjct: 313 EDPKEAKQIKREAANYTIVTGQLYKRGFSQPLLKCVEPEDTEYILREIHEGCC 365 >ref|XP_015385514.1| PREDICTED: uncharacterized protein LOC107176899 [Citrus sinensis] ref|XP_015385515.1| PREDICTED: uncharacterized protein LOC107176900 [Citrus sinensis] Length = 167 Score = 85.5 bits (210), Expect = 1e-18 Identities = 36/70 (51%), Positives = 52/70 (74%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SWI PI+ Y+ GV PPDK A+ +R +AS Y++IDG +Y+R + P L+C++ ++A YV Sbjct: 62 SWIDPIISYLRDGVLPPDKLRARKVRAQASRYTMIDGVLYRRGYTLPFLRCLDEDDADYV 121 Query: 34 LAEIHEGICG 5 L E+HEGICG Sbjct: 122 LREVHEGICG 131 >gb|KRH54809.1| hypothetical protein GLYMA_06G210500 [Glycine max] Length = 81 Score = 82.4 bits (202), Expect = 2e-18 Identities = 32/70 (45%), Positives = 51/70 (72%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 +W+ P ++I GV P D+N + ++RKAS Y+++DG ++KR L+ PLLKC+ ++ YV Sbjct: 9 NWMTPYRNFLIQGVLPSDENGTRCLKRKASYYAILDGELFKRGLTKPLLKCLNNQQIDYV 68 Query: 34 LAEIHEGICG 5 + E+HEGICG Sbjct: 69 MKELHEGICG 78 >ref|XP_016195605.1| uncharacterized protein LOC107636622 [Arachis ipaensis] Length = 120 Score = 83.6 bits (205), Expect = 2e-18 Identities = 42/107 (39%), Positives = 58/107 (54%) Frame = -1 Query: 325 RKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAK 146 R L + S+T + LP+ SW PIL Y+I G+ P D + Sbjct: 11 RSLFQEVARTPSVSATPDIVLPV----------SNQESWTFPILQYLIDGMLPEDSKEVN 60 Query: 145 LIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5 I+R+A++Y+VI G +YKR S PLLKCIE + Y+L E+HEG CG Sbjct: 61 RIKREAANYTVITGQLYKRGFSQPLLKCIEPGDTEYILREVHEGCCG 107 >ref|XP_016164695.1| uncharacterized protein LOC107607236 [Arachis ipaensis] Length = 768 Score = 88.6 bits (218), Expect = 5e-18 Identities = 38/70 (54%), Positives = 49/70 (70%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW PIL Y++ G PPD + K IRR+A++Y+V+ G +YKR S PLLKC+E E Y+ Sbjct: 382 SWTHPILQYLLDGTLPPDPKEGKRIRREAANYTVVTGQLYKRGFSQPLLKCVEPENTEYI 441 Query: 34 LAEIHEGICG 5 L EIHEG CG Sbjct: 442 LREIHEGCCG 451 >gb|KYP59627.1| Gypsy retrotransposon integrase-like protein 1 [Cajanus cajan] Length = 435 Score = 87.0 bits (214), Expect = 1e-17 Identities = 45/111 (40%), Positives = 70/111 (63%) Frame = -1 Query: 337 NSQARKLLEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDK 158 +SQ L T HL +S+ +P C+P + ++SW+ I+ +I+ G EP D Sbjct: 70 SSQKPGQLRSTIHLELPTSS----IPQECMPIE----EPSNSWMTGIMNFIVNGSEPADP 121 Query: 157 NDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5 DAK I+ KA+ YS++ G +Y+R S+PLLKC++ ++A YV+ E+HEGICG Sbjct: 122 IDAKKIQTKAARYSMVAGELYRRGFSTPLLKCLDQQQADYVIREVHEGICG 172 >gb|KYP51055.1| Gypsy retrotransposon integrase-like protein 1, partial [Cajanus cajan] Length = 411 Score = 79.3 bits (194), Expect(2) = 1e-17 Identities = 33/69 (47%), Positives = 50/69 (72%) Frame = -1 Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32 W+ I GY+ G+ P DKN+A+ IR +++ + +I ++KR +SSPLLKC+ +AAYV+ Sbjct: 44 WMAGISGYLKEGILPEDKNEARKIRMRSAKFVIIRDELFKRGVSSPLLKCLTASQAAYVI 103 Query: 31 AEIHEGICG 5 EIH+GICG Sbjct: 104 KEIHQGICG 112 Score = 37.7 bits (86), Expect(2) = 1e-17 Identities = 19/34 (55%), Positives = 22/34 (64%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSVSPDPVFAS 243 L+KLASTK G H+T IQE LH PS+ V S Sbjct: 4 LSKLASTKRPGQHQTIIQETLHSPSLDDKVVNVS 37 >ref|XP_016195081.1| uncharacterized protein LOC107636060 [Arachis ipaensis] Length = 862 Score = 86.7 bits (213), Expect = 2e-17 Identities = 37/70 (52%), Positives = 49/70 (70%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW PIL Y++ G PPD + K I+R+A++Y+++ G +YKR S PLLKCIE E Y+ Sbjct: 460 SWTYPILQYLLDGTLPPDPKEEKRIKREAANYTIVTGQLYKRGFSQPLLKCIEPENTEYI 519 Query: 34 LAEIHEGICG 5 L EIHEG CG Sbjct: 520 LREIHEGCCG 529 >gb|KYP55059.1| Gypsy retrotransposon integrase-like protein 1 [Cajanus cajan] Length = 467 Score = 86.3 bits (212), Expect = 3e-17 Identities = 37/70 (52%), Positives = 53/70 (75%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW+ IL +I+ G EP + ++AK IR +A+ YSV+ G +Y+R S+PLLKCI+ ++A YV Sbjct: 175 SWMTEILNFIVNGTEPAEPSEAKRIRTQAAQYSVVAGELYRRGFSTPLLKCIDHQQANYV 234 Query: 34 LAEIHEGICG 5 + EIHEGICG Sbjct: 235 IGEIHEGICG 244 >ref|XP_024038287.1| uncharacterized protein LOC112097335 [Citrus clementina] Length = 549 Score = 86.3 bits (212), Expect = 3e-17 Identities = 37/70 (52%), Positives = 52/70 (74%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW+ PIL YI GV P DK A+ ++ +A+ Y+++DG +Y+R + PLL+C++ EEA YV Sbjct: 221 SWMDPILAYIRDGVLPEDKRQARKLKCRAARYTLLDGVLYRRGFTLPLLRCVDDEEADYV 280 Query: 34 LAEIHEGICG 5 L EIHEGICG Sbjct: 281 LREIHEGICG 290 >gb|KYP67718.1| Pol polyprotein [Cajanus cajan] Length = 673 Score = 86.3 bits (212), Expect = 3e-17 Identities = 46/104 (44%), Positives = 65/104 (62%) Frame = -1 Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137 L T HL +S+ +P C+P + T SW+ I+ YII+ EP D +AK +R Sbjct: 376 LRSTIHLELPTSS----IPQECIPIE----RPTSSWMTNIINYIISSSEPSDPLEAKKVR 427 Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5 +A+ YS+I G +Y+R S+PLLKC++ +A YVL EIHEGICG Sbjct: 428 TQAARYSLIAGELYRRGFSTPLLKCLDQPQADYVLREIHEGICG 471 >ref|XP_020215686.1| uncharacterized protein LOC109799522 [Cajanus cajan] Length = 747 Score = 86.3 bits (212), Expect = 3e-17 Identities = 46/104 (44%), Positives = 65/104 (62%) Frame = -1 Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137 L T HL +S+ +P C+P + T SW+ I+ YII+ EP D +AK +R Sbjct: 450 LRSTIHLELPTSS----IPQECIPIE----RPTSSWMTNIINYIISSSEPSDPLEAKKVR 501 Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5 +A+ YS+I G +Y+R S+PLLKC++ +A YVL EIHEGICG Sbjct: 502 TQAARYSLIAGELYRRGFSTPLLKCLDQPQADYVLREIHEGICG 545 >ref|XP_015960249.1| uncharacterized protein LOC107484143 [Arachis duranensis] Length = 425 Score = 85.9 bits (211), Expect(2) = 3e-17 Identities = 35/70 (50%), Positives = 49/70 (70%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW PIL Y++ G PPD + K I+R+A++Y+++ G +YKR S PLLKC+E + Y+ Sbjct: 202 SWTHPILQYLLDGTLPPDPKEGKRIKREAANYTIVTGQLYKRGFSQPLLKCVEPRDTEYI 261 Query: 34 LAEIHEGICG 5 L EIHEG CG Sbjct: 262 LREIHEGCCG 271 Score = 30.0 bits (66), Expect(2) = 3e-17 Identities = 14/27 (51%), Positives = 20/27 (74%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSVS 264 L+KLASTK +++ IQE++ PSVS Sbjct: 162 LSKLASTKPGHGNKSLIQEVVRSPSVS 188 >gb|KYP38150.1| Uncharacterized protein Mb2253c family [Cajanus cajan] Length = 289 Score = 77.8 bits (190), Expect(2) = 3e-17 Identities = 33/73 (45%), Positives = 51/73 (69%) Frame = -1 Query: 223 KTHSWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEA 44 K H W+ I Y+ GV P DK++A+ IR +++ + ++D ++KR +S+PLLKC+ +A Sbjct: 199 KDHGWMTGIWSYLKEGVLPKDKDEAQKIRVRSAKFVIVDDELFKRGISTPLLKCLTAPQA 258 Query: 43 AYVLAEIHEGICG 5 AYV+ EIH GICG Sbjct: 259 AYVIEEIHWGICG 271 Score = 38.1 bits (87), Expect(2) = 3e-17 Identities = 17/26 (65%), Positives = 20/26 (76%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267 L+KLASTK G HRT IQE +H PS+ Sbjct: 163 LSKLASTKRPGQHRTIIQETMHSPSL 188 >gb|KYP63291.1| Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus cajan] Length = 1133 Score = 76.3 bits (186), Expect(2) = 4e-17 Identities = 33/69 (47%), Positives = 48/69 (69%) Frame = -1 Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32 W+ I Y+ GV P DKN A+ IR +++ + +I ++KR +SSPLLKC+ +AAYV+ Sbjct: 723 WMASIWRYLKEGVLPEDKNAARKIRMRSTKFVIIGDELFKRGISSPLLKCLTASQAAYVI 782 Query: 31 AEIHEGICG 5 EIH+GICG Sbjct: 783 REIHQGICG 791 Score = 39.3 bits (90), Expect(2) = 4e-17 Identities = 18/26 (69%), Positives = 21/26 (80%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267 L+KLA+TK G HRTFIQE LH PS+ Sbjct: 683 LSKLANTKRPGQHRTFIQEPLHSPSL 708 >gb|KYP33134.1| Retrovirus-related Pol polyprotein from transposon opus [Cajanus cajan] Length = 915 Score = 77.8 bits (190), Expect(2) = 4e-17 Identities = 33/69 (47%), Positives = 49/69 (71%) Frame = -1 Query: 211 WIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVL 32 W+ I Y+ GV P DKN A+ +R +A+ + +ID ++KR ++SPLLKC+ +AAYV+ Sbjct: 591 WMTNIWKYLKEGVLPEDKNKARKVRMRAAKFVIIDDELFKRGIASPLLKCLTASQAAYVI 650 Query: 31 AEIHEGICG 5 EIH+GICG Sbjct: 651 KEIHQGICG 659 Score = 37.7 bits (86), Expect(2) = 4e-17 Identities = 17/26 (65%), Positives = 20/26 (76%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSV 267 L+KLA+TK G HRT IQE LH PS+ Sbjct: 551 LSKLANTKRPGQHRTIIQETLHSPSL 576 >gb|KYP58222.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 480 Score = 76.3 bits (186), Expect(2) = 4e-17 Identities = 32/74 (43%), Positives = 51/74 (68%) Frame = -1 Query: 226 KKTHSWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEE 47 K+ W+ I GY+ G+ P DK++A+ IR +++ + +I ++K +SSPLLKC+ + Sbjct: 182 KEDLGWMAGIWGYLKEGILPEDKDEAQKIRMRSAKFVIIGDELFKHGVSSPLLKCLTASQ 241 Query: 46 AAYVLAEIHEGICG 5 AAYV+ EIH+GICG Sbjct: 242 AAYVIREIHQGICG 255 Score = 39.3 bits (90), Expect(2) = 4e-17 Identities = 19/31 (61%), Positives = 21/31 (67%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSVSPDPV 252 L+KLASTK G HRT IQE LH PS+ V Sbjct: 147 LSKLASTKRPGQHRTIIQETLHSPSLDDKAV 177 >ref|XP_020225282.1| uncharacterized protein LOC109807168 [Cajanus cajan] Length = 411 Score = 85.5 bits (210), Expect = 4e-17 Identities = 46/104 (44%), Positives = 62/104 (59%) Frame = -1 Query: 316 LEITEHLSKKSSTNPVYLPILCLPRPFT**KKTHSWIGPILGYIIAGVEPPDKNDAKLIR 137 L T HL T P +P C+ T + T +WI I Y+ G EP D + AK +R Sbjct: 175 LRTTLHLEL---TTPSVVPTECM----TIGEPTRTWITDITNYLEHGKEPSDPSAAKKLR 227 Query: 136 RKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYVLAEIHEGICG 5 +A+ YS++ G +Y+R S PLLKC++ E+A YVL EIHEGICG Sbjct: 228 TQAARYSMVGGELYRRGFSVPLLKCVDAEQANYVLREIHEGICG 271 >ref|XP_016165268.1| uncharacterized protein LOC107607885 [Arachis ipaensis] Length = 1341 Score = 85.9 bits (211), Expect = 4e-17 Identities = 37/70 (52%), Positives = 48/70 (68%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW PIL Y+ G PPD + K I+R+A++Y+V+ G +YKR S PLLKC+E E Y+ Sbjct: 971 SWTYPILQYLFDGTLPPDPKEGKRIKREAANYTVVAGQLYKRGFSQPLLKCVEPENTGYI 1030 Query: 34 LAEIHEGICG 5 L EIHEG CG Sbjct: 1031 LHEIHEGCCG 1040 >ref|XP_015959802.1| uncharacterized protein LOC107483704 [Arachis duranensis] Length = 590 Score = 83.6 bits (205), Expect(2) = 5e-17 Identities = 36/70 (51%), Positives = 49/70 (70%) Frame = -1 Query: 214 SWIGPILGYIIAGVEPPDKNDAKLIRRKASSYSVIDGTMYKRSLSSPLLKCIEGEEAAYV 35 SW PIL Y++ G PPD + + I+R+A++Y++I G +YKR S PLLKCIE + Y+ Sbjct: 328 SWTYPILQYLLDGTLPPDPKEERRIKREAANYTIIAGQLYKRGFSQPLLKCIEPGDTEYI 387 Query: 34 LAEIHEGICG 5 L EIHEG CG Sbjct: 388 LREIHEGCCG 397 Score = 31.6 bits (70), Expect(2) = 5e-17 Identities = 15/27 (55%), Positives = 21/27 (77%) Frame = -3 Query: 344 LAKLASTKALGNHRTFIQEILHKPSVS 264 L+KLASTK+ +R+ IQE++ PSVS Sbjct: 288 LSKLASTKSGHGNRSLIQEVVKSPSVS 314