BLASTX nr result
ID: Astragalus22_contig00035289
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00035289 (324 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KYP48259.1| Retrovirus-related Pol polyprotein from transposo... 103 3e-24 gb|PNX99971.1| hypothetical protein L195_g023244 [Trifolium prat... 102 6e-24 gb|PNX55526.1| retrovirus-related Pol polyprotein from transposo... 97 4e-22 ref|XP_019450657.1| PREDICTED: uncharacterized protein LOC109352... 99 6e-22 ref|XP_022154919.1| uncharacterized protein LOC111022065 [Momord... 97 3e-21 gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygro... 96 4e-21 gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) ... 97 5e-21 ref|XP_023908905.1| uncharacterized protein LOC112020549 [Quercu... 96 9e-21 dbj|GAU28547.1| hypothetical protein TSUD_268860 [Trifolium subt... 96 9e-21 dbj|GAU29493.1| hypothetical protein TSUD_360380 [Trifolium subt... 96 9e-21 gb|PNX84823.1| retrovirus-related Pol polyprotein from transposo... 95 1e-20 gb|KYP45565.1| Retrovirus-related Pol polyprotein from transposo... 95 2e-20 gb|KYP68601.1| Retrovirus-related Pol polyprotein from transposo... 93 8e-20 ref|XP_015574216.1| PREDICTED: uncharacterized protein LOC107261... 92 9e-20 gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposo... 92 2e-19 dbj|GAU37804.1| hypothetical protein TSUD_276210, partial [Trifo... 92 3e-19 gb|PNX94008.1| retrovirus-related Pol polyprotein from transposo... 92 3e-19 dbj|GAU20491.1| hypothetical protein TSUD_130490 [Trifolium subt... 91 4e-19 ref|XP_017974499.1| PREDICTED: uncharacterized protein LOC108661... 91 5e-19 gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposo... 91 5e-19 >gb|KYP48259.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 365 Score = 103 bits (258), Expect = 3e-24 Identities = 48/109 (44%), Positives = 67/109 (61%), Gaps = 3/109 (2%) Frame = +2 Query: 5 PSTTLVP--PPLRRSSRAHNPPTHLQEYDC-NSVLYPIQTHLTYDNLSPSYKHFVLNVSS 175 P ++ VP P R+SSR + P +L YDC N++LYPI ++TY+NLS +KHF+ VS+ Sbjct: 168 PQSSHVPYLPGTRKSSRQTHKPGYLDAYDCSNAILYPIHDYITYNNLSAEFKHFIGQVSN 227 Query: 176 VFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 +EP +HQAV EW+ NDTW++ LP KR +GCKW Sbjct: 228 TYEPIFYHQAVNYLEWRQAMSEELQALEANDTWTLAKLPKGKRCIGCKW 276 >gb|PNX99971.1| hypothetical protein L195_g023244 [Trifolium pratense] Length = 345 Score = 102 bits (255), Expect = 6e-24 Identities = 49/94 (52%), Positives = 61/94 (64%), Gaps = 2/94 (2%) Frame = +2 Query: 26 PPLRRSSRAHNPPTHLQEYDCNSVL--YPIQTHLTYDNLSPSYKHFVLNVSSVFEPTHFH 199 P LRRSSR PP HL YDCNSV +PIQ+ LTYD+LSPSY ++ VSS +EP H Sbjct: 237 PVLRRSSRPSKPPAHLDLYDCNSVTVTHPIQSFLTYDHLSPSYMAYISQVSSFYEPQFNH 296 Query: 200 QAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANK 301 QA++ PEWQ N+TW+++PLPA K Sbjct: 297 QAIQYPEWQQAVAAELVALESNNTWTVMPLPAGK 330 >gb|PNX55526.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 304 Score = 97.4 bits (241), Expect = 4e-22 Identities = 46/127 (36%), Positives = 71/127 (55%), Gaps = 20/127 (15%) Frame = +2 Query: 2 PPSTTLVPPP--LRRSSRAHNPPTHLQEYDCNSVL------------------YPIQTHL 121 P + ++PPP LRRS+R NPP +LQ++ CN + YP+ + + Sbjct: 167 PSLSPIIPPPITLRRSTRPSNPPGYLQDFHCNLISTSNNDSIQSTSASSSECKYPLSSFI 226 Query: 122 TYDNLSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANK 301 +Y NLS S+KHF N+S++ EP+ + +A+ +W+ N+TWS+ LP NK Sbjct: 227 SYQNLSTSHKHFAFNISTLTEPSSYEEAMHDEQWKNAVNVELAALLKNNTWSMTTLPPNK 286 Query: 302 RAVGCKW 322 +AVGCKW Sbjct: 287 KAVGCKW 293 >ref|XP_019450657.1| PREDICTED: uncharacterized protein LOC109352928 [Lupinus angustifolius] Length = 854 Score = 99.4 bits (246), Expect = 6e-22 Identities = 50/118 (42%), Positives = 64/118 (54%), Gaps = 11/118 (9%) Frame = +2 Query: 2 PPSTTLVPPPLRRSSRAHNPPTHLQEYDCNSVL-----------YPIQTHLTYDNLSPSY 148 P S+T P R S R PPTHL++Y CN L YPI + LTY+NLS +Y Sbjct: 647 PTSSTSQPTNNRHSLRIRRPPTHLRDYHCNLTLSQPLPPSCNTKYPISSVLTYNNLSKNY 706 Query: 149 KHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 KHF LNVS EP + QA +P W N TW+++PLP +K ++GCKW Sbjct: 707 KHFCLNVSIHQEPKSYKQASSIPCWHQAMQQELLALDLNHTWTLMPLPPDKSSIGCKW 764 >ref|XP_022154919.1| uncharacterized protein LOC111022065 [Momordica charantia] Length = 923 Score = 97.4 bits (241), Expect = 3e-21 Identities = 47/107 (43%), Positives = 62/107 (57%), Gaps = 10/107 (9%) Frame = +2 Query: 32 LRRSSRAHNPPTHLQEYDC----------NSVLYPIQTHLTYDNLSPSYKHFVLNVSSVF 181 LRRSSR P++L++Y C +SV YP+Q +L Y+NLS SYK FVL+VS + Sbjct: 665 LRRSSRVAQRPSYLRDYHCGLIQATDHSASSVFYPLQKYLDYNNLSASYKEFVLSVSCDY 724 Query: 182 EPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 EP +HQAV W+ N TWS+VPLP ++GCKW Sbjct: 725 EPQFYHQAVPFSHWREAMRAELHAMEANHTWSVVPLPYEHHSIGCKW 771 >gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygrometricum] Length = 442 Score = 96.3 bits (238), Expect = 4e-21 Identities = 51/112 (45%), Positives = 61/112 (54%), Gaps = 7/112 (6%) Frame = +2 Query: 2 PPSTTLVPPPLRRSSRAHNPPTHLQEYDC-------NSVLYPIQTHLTYDNLSPSYKHFV 160 P T L RS R +PP HLQ+Y C S YP+ + + Y NLSPS+++FV Sbjct: 223 PDPTPLNSQQQSRSKRTSHPPHHLQDYHCYMISSPSTSTAYPLCSFVDYSNLSPSHRNFV 282 Query: 161 LNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGC 316 NVSSV EPT F QAV LPEW+ N TWS+V LP K VGC Sbjct: 283 NNVSSVIEPTTFSQAVVLPEWRQAMNDELKALELNHTWSVVSLPLGKSMVGC 334 >gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum] Length = 1404 Score = 96.7 bits (239), Expect = 5e-21 Identities = 47/103 (45%), Positives = 60/103 (58%), Gaps = 8/103 (7%) Frame = +2 Query: 38 RSSRAHNPPTHLQEYDCNSV--------LYPIQTHLTYDNLSPSYKHFVLNVSSVFEPTH 193 R+SR HN P+HL++Y C S+ +PI + Y LS S++ FV N+SS+ EPT Sbjct: 835 RTSRPHNTPSHLRDYHCYSISTPCSTSTAHPIHPLVNYSKLSSSHRAFVQNISSILEPTT 894 Query: 194 FHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 F QAV LPEW+ N TWSIV LP K AVGC+W Sbjct: 895 FSQAVSLPEWRQAMDEELKALELNHTWSIVSLPQGKSAVGCRW 937 >ref|XP_023908905.1| uncharacterized protein LOC112020549 [Quercus suber] Length = 722 Score = 95.9 bits (237), Expect = 9e-21 Identities = 48/115 (41%), Positives = 66/115 (57%), Gaps = 9/115 (7%) Frame = +2 Query: 5 PSTTLVPPPLRRSSRAHNPPTHLQEYDCNSVL--------YPIQTHLTYDNLSPSYKHFV 160 PST+ P L+RS+R+H PP +L +Y C SV Y + +L Y +L PS+K FV Sbjct: 565 PSTS--NPTLKRSTRSHKPPPYLYQYACKSVSTKPHSGLPYDVSAYLDYSHLGPSFKSFV 622 Query: 161 LNVSSV-FEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 + V+S +P FHQAV+ PEW+ +TWS+VPLP K +GCKW Sbjct: 623 MIVNSTPLDPVSFHQAVQYPEWKAAMDKKIEVLEVTNTWSLVPLPPGKSPIGCKW 677 >dbj|GAU28547.1| hypothetical protein TSUD_268860 [Trifolium subterraneum] Length = 1059 Score = 95.9 bits (237), Expect = 9e-21 Identities = 41/116 (35%), Positives = 70/116 (60%), Gaps = 17/116 (14%) Frame = +2 Query: 26 PPLRRSSRAHNPPTHLQEYDCNSVL-----------------YPIQTHLTYDNLSPSYKH 154 PPLR+S+R +PP +LQ++ CN + YP+ + ++Y +LSP+++H Sbjct: 693 PPLRKSTRITHPPGYLQDFHCNLLANTIQSSSADTSNSSTSKYPLSSFISYQHLSPTHQH 752 Query: 155 FVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 + LN+SS+ EPT + +A+ W+ N+TW++VPLP++K+A+GCKW Sbjct: 753 YTLNLSSLSEPTSYEKAISDENWKGAIKTELNALMKNNTWNLVPLPSHKKAIGCKW 808 >dbj|GAU29493.1| hypothetical protein TSUD_360380 [Trifolium subterraneum] Length = 1200 Score = 95.9 bits (237), Expect = 9e-21 Identities = 45/124 (36%), Positives = 67/124 (54%), Gaps = 18/124 (14%) Frame = +2 Query: 5 PSTTLVPPPLRRSSRAHNPPTHLQEYDCN------------------SVLYPIQTHLTYD 130 P + P PLR+S+R NPP +LQ+Y CN S YP+ LTY Sbjct: 624 PLASTSPIPLRKSTRLTNPPPYLQDYHCNLLTSTIHDSPSSADITSSSSKYPLSAFLTYQ 683 Query: 131 NLSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAV 310 +LS ++ HF++N+S++ EPT + +A+K W +TW + PLPA+K+A+ Sbjct: 684 HLSLAHTHFIMNLSTISEPTSYEEALKNENWTSAIKAELSALMNTNTWILAPLPAHKKAI 743 Query: 311 GCKW 322 GCKW Sbjct: 744 GCKW 747 >gb|PNX84823.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 452 Score = 95.1 bits (235), Expect = 1e-20 Identities = 45/123 (36%), Positives = 69/123 (56%), Gaps = 18/123 (14%) Frame = +2 Query: 8 STTLVPPPLRRSSRAHNPPTHLQEYDCNSVL------------------YPIQTHLTYDN 133 S +L P PLRRS+R NPP +LQ++ C+ + YP+ + ++Y N Sbjct: 166 SPSLSPIPLRRSTRPSNPPGYLQDFHCSLLTTSNNDSIPSTSISSSDCKYPLSSFISYQN 225 Query: 134 LSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVG 313 LS S+KHF N+S++ EP+ + +A+ +W+ N+TWS+ LP NK+AVG Sbjct: 226 LSTSHKHFAFNISTLTEPSSYEEAMHDEQWKNAVNTELAALLKNNTWSMTTLPPNKKAVG 285 Query: 314 CKW 322 CKW Sbjct: 286 CKW 288 >gb|KYP45565.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 818 Score = 95.1 bits (235), Expect = 2e-20 Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 6/105 (5%) Frame = +2 Query: 26 PPLRRSSRAHNPPTHLQEYDCNSVL------YPIQTHLTYDNLSPSYKHFVLNVSSVFEP 187 PP RRS RA NPP +L +Y C +V YPIQ +L Y LS S++H++ +S FEP Sbjct: 246 PPPRRSDRATNPPRYLSDYHCYNVTDSAITAYPIQNYLDYSKLSNSHRHYICQISEHFEP 305 Query: 188 THFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 + QA+K W+ N TW +VPLP K+ + CKW Sbjct: 306 QTYAQAIKYTSWKQAISDELVAMEVNHTWDVVPLPPEKKPISCKW 350 >gb|KYP68601.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 549 Score = 93.2 bits (230), Expect = 8e-20 Identities = 45/110 (40%), Positives = 64/110 (58%), Gaps = 10/110 (9%) Frame = +2 Query: 23 PPPLRRSSRAHNPPTHLQEYDCN----------SVLYPIQTHLTYDNLSPSYKHFVLNVS 172 P PLRRS R PPT+L+++ CN + +PI + + Y+ LS S+ H+VL++S Sbjct: 399 PLPLRRSQRIPFPPTYLKDFHCNFLTSSPQLSKGISHPISSVINYNTLSSSHLHYVLSLS 458 Query: 173 SVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 + EP +HQAVKL EW N+TW+IV LP+ K +GCKW Sbjct: 459 THEEPKFYHQAVKLQEWVVAMKAEIDALTANNTWTIVDLPSGKHPIGCKW 508 >ref|XP_015574216.1| PREDICTED: uncharacterized protein LOC107261203 [Ricinus communis] Length = 410 Score = 92.4 bits (228), Expect = 9e-20 Identities = 43/97 (44%), Positives = 60/97 (61%) Frame = +2 Query: 32 LRRSSRAHNPPTHLQEYDCNSVLYPIQTHLTYDNLSPSYKHFVLNVSSVFEPTHFHQAVK 211 L+RS+R PPT L++Y NSVLYPI +++Y +SPSYK ++ ++S EPT F+ A Sbjct: 204 LQRSTRPTRPPTRLKDYVSNSVLYPIHHYISYTLVSPSYKAYLNTLTSHTEPTSFYDANT 263 Query: 212 LPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 P W N+TW +V LP+NK+ VGCKW Sbjct: 264 NPNWCKAMQEELQALEKNETWDLVTLPSNKKLVGCKW 300 >gb|KYP55668.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1136 Score = 92.4 bits (228), Expect = 2e-19 Identities = 42/117 (35%), Positives = 64/117 (54%), Gaps = 10/117 (8%) Frame = +2 Query: 2 PPSTTLVPPPLRRSSRAHNPPTHLQEY----------DCNSVLYPIQTHLTYDNLSPSYK 151 PP PPPLRRS+R PPT+LQ++ + +P+ + ++YD LSPS+ Sbjct: 552 PPDQHSSPPPLRRSTRPRRPPTYLQDFHGAFTSTGPHSSTGIRHPLHSFISYDRLSPSFH 611 Query: 152 HFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 H+V ++SSV +P +F +A K W N+TW + LP +K A+GC+W Sbjct: 612 HYVFSISSVTKPKNFVEASKSDSWLKAMHEEISALEANNTWVLTTLPPHKTAIGCRW 668 >dbj|GAU37804.1| hypothetical protein TSUD_276210, partial [Trifolium subterraneum] Length = 633 Score = 91.7 bits (226), Expect = 3e-19 Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 22/129 (17%) Frame = +2 Query: 2 PPSTTLVPPP-----LRRSSRAHNPPTHLQEYDCN-----------------SVLYPIQT 115 P STTL P LR+S+R +PPT+LQ+Y CN S YPI + Sbjct: 91 PSSTTLTNKPIDYVPLRQSTRNCHPPTYLQDYYCNHLSNTIHDSSGNMEPSSSCKYPISS 150 Query: 116 HLTYDNLSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPA 295 ++Y N+S ++KH++LN+S++ EPT + +A+ W+ N+TW +V LP Sbjct: 151 FISYQNISSAHKHYLLNISTISEPTCYEKAICDENWRTAIQAELTALEKNNTWKLVSLPP 210 Query: 296 NKRAVGCKW 322 +K ++GCKW Sbjct: 211 HKHSIGCKW 219 >gb|PNX94008.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 1063 Score = 91.7 bits (226), Expect = 3e-19 Identities = 46/124 (37%), Positives = 68/124 (54%), Gaps = 19/124 (15%) Frame = +2 Query: 8 STTLVPP-PLRRSSRAHNPPTHLQEYDCN------------------SVLYPIQTHLTYD 130 +TT PP LRRS+R P +LQ+Y CN S YPI + +TY Sbjct: 541 NTTPSPPIQLRRSTRPTTMPGYLQDYHCNLLTPAIHASHSSASNLSSSSKYPISSFMTYQ 600 Query: 131 NLSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAV 310 NLSP++ H+++N+S++ EPT + +A+K W +TW++ LPA+KRA+ Sbjct: 601 NLSPAHTHYIMNLSTITEPTSYEEALKDENWTNAIKAELSAMMHTNTWNLAHLPAHKRAI 660 Query: 311 GCKW 322 GCKW Sbjct: 661 GCKW 664 >dbj|GAU20491.1| hypothetical protein TSUD_130490 [Trifolium subterraneum] Length = 1127 Score = 91.3 bits (225), Expect = 4e-19 Identities = 44/126 (34%), Positives = 72/126 (57%), Gaps = 20/126 (15%) Frame = +2 Query: 5 PSTTLVPP--PLRRSSRAHNPPTHLQEYDCNSVL------------------YPIQTHLT 124 PS T+ P P+R+S+RA +PP +LQ+Y CN + YP+ + L+ Sbjct: 577 PSPTIPQPIVPIRKSNRASHPPGYLQDYHCNLLTTPSHDLVPPTSTSSSQCKYPLSSFLS 636 Query: 125 YDNLSPSYKHFVLNVSSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKR 304 Y +LS ++ HFV N+S++ EPT + +A+ +W+ N+TWS+ LP++K+ Sbjct: 637 YKDLSSTHTHFVCNLSTLTEPTSYEEAMHDEQWKNAISSEMSALMKNNTWSMTTLPSHKK 696 Query: 305 AVGCKW 322 A+GCKW Sbjct: 697 AIGCKW 702 >ref|XP_017974499.1| PREDICTED: uncharacterized protein LOC108661577 [Theobroma cacao] Length = 553 Score = 90.9 bits (224), Expect = 5e-19 Identities = 41/107 (38%), Positives = 61/107 (57%), Gaps = 7/107 (6%) Frame = +2 Query: 23 PPPLRRSSRAHNPPTHLQEYDCNS-------VLYPIQTHLTYDNLSPSYKHFVLNVSSVF 181 P +R+S+R +PP +L+ Y C YPI +L+ + LSP +K F + +S + Sbjct: 434 PSNIRKSTRQRHPPKYLEAYHCTFPTQANFVTKYPITKYLSSNQLSPDHKVFTVALSHIL 493 Query: 182 EPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 EPT++HQAVK +W+ N TW++VPLP N A+GCKW Sbjct: 494 EPTYYHQAVKHVQWREAMQSELDALEANGTWTVVPLPPNSHAIGCKW 540 >gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1316 Score = 90.9 bits (224), Expect = 5e-19 Identities = 42/111 (37%), Positives = 62/111 (55%), Gaps = 11/111 (9%) Frame = +2 Query: 23 PPPLRRSSRAHNPPTHLQEY-----------DCNSVLYPIQTHLTYDNLSPSYKHFVLNV 169 PPPLRRS+R PPT+LQ++ + +P+ + L+YD LSPS+ H+V ++ Sbjct: 738 PPPLRRSTRPRRPPTYLQDFHGAFTSTSTAHSSTGIRHPLHSFLSYDLLSPSFHHYVFSI 797 Query: 170 SSVFEPTHFHQAVKLPEWQXXXXXXXXXXXXNDTWSIVPLPANKRAVGCKW 322 SSV EP +F +A K W N+TW + LP +K A+GC+W Sbjct: 798 SSVTEPKNFAEASKSDSWLKAMHEEIFALEANNTWVLTTLPPHKTAIGCRW 848