BLASTX nr result
ID: Astragalus22_contig00022983
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00022983 (1054 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Gly... 430 e-148 gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposo... 413 e-129 ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797... 372 e-125 gb|KHN37157.1| hypothetical protein glysoja_046755, partial [Gly... 310 e-103 gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus... 175 6e-50 gb|KYP37941.1| hypothetical protein KK1_040830 [Cajanus cajan] 177 9e-50 gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus... 179 9e-50 gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposo... 187 4e-49 gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposo... 185 2e-48 gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan] 172 2e-48 gb|PNX63257.1| retrovirus-related Pol polyprotein from transposo... 171 8e-48 ref|XP_019442260.1| PREDICTED: uncharacterized protein LOC109346... 170 2e-47 gb|PNX90864.1| retrovirus-related Pol polyprotein from transposo... 174 3e-47 dbj|GAU15285.1| hypothetical protein TSUD_03520 [Trifolium subte... 173 4e-47 gb|KYP33001.1| hypothetical protein KK1_046197, partial [Cajanus... 175 6e-47 gb|PNX63176.1| retrovirus-related Pol polyprotein from transposo... 167 3e-46 ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanu... 170 5e-46 gb|PNY10805.1| histone deacetylase, partial [Trifolium pratense] 176 7e-46 gb|PNX57709.1| retrovirus-related Pol polyprotein from transposo... 166 8e-46 gb|PNX56029.1| retrovirus-related Pol polyprotein from transposo... 166 8e-46 >gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Glycine soja] Length = 323 Score = 430 bits (1106), Expect = e-148 Identities = 221/272 (81%), Positives = 233/272 (85%) Frame = -3 Query: 884 ILVTPPSTITPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASE 705 IL TP STITPLS+FNYKISVKLD TNYLVWL QIEPVLRAHRLHRFCV+ EIPPQYASE Sbjct: 1 ILATPNSTITPLSSFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASE 60 Query: 704 EDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTK 525 DRLAN+ENP +SNWE SPAILPSVIGCKHTFQLWENIHQSFQSKTK Sbjct: 61 HDRLANIENPAFSNWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTK 120 Query: 524 AQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEA 345 AQAR LRTQLRTTKKGSS+ISEFLAKIKHISDSL SIGESV+LQDQLDVILEGLPNEFE+ Sbjct: 121 AQARQLRTQLRTTKKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFES 180 Query: 344 LVTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQSQPISKTPNLSNPNFA 165 LVTLINSKIEWFDLEEIRALLLAHE RLDKARITEEA+SLN +QSQP SK PN NPN A Sbjct: 181 LVTLINSKIEWFDLEEIRALLLAHEQRLDKARITEEAASLNFTQSQPNSKIPNSVNPNSA 240 Query: 164 TENPVAPQANYTAGNPNSDNSLSQNNAYKGNN 69 TE +APQAN+T GN NS N SQNN +K NN Sbjct: 241 TETQIAPQANWTTGNSNSGNYDSQNNNFKNNN 272 >gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1429 Score = 413 bits (1061), Expect = e-129 Identities = 211/259 (81%), Positives = 223/259 (86%) Frame = -3 Query: 845 TFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDYS 666 +FNYKISVKLD TNYLVWL QIEPVLRAHRLHRFCV+ EIPPQYASE DRLAN+ENP +S Sbjct: 1 SFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENPAFS 60 Query: 665 NWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTT 486 NWE SPAILPSVIGCKHTFQLWENIHQSFQSKTKAQAR LRTQLRTT Sbjct: 61 NWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTT 120 Query: 485 KKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWFD 306 KKGSS+ISEFLAKIKHISDSL SIGESV+LQDQLDVILEGLPNEFE+LVTLINSKIEWFD Sbjct: 121 KKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFD 180 Query: 305 LEEIRALLLAHEPRLDKARITEEASSLNLSQSQPISKTPNLSNPNFATENPVAPQANYTA 126 LEEIRALLLAHE RLDKARITEEA+SLN +QSQP SKTPN NPN ATE +APQAN+T Sbjct: 181 LEEIRALLLAHEQRLDKARITEEAASLNFTQSQPNSKTPNSVNPNSATETQIAPQANWTT 240 Query: 125 GNPNSDNSLSQNNAYKGNN 69 GN NS N SQNN +K NN Sbjct: 241 GNSNSGNYDSQNNNFKNNN 259 >ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797270 [Glycine max] Length = 329 Score = 372 bits (956), Expect = e-125 Identities = 191/239 (79%), Positives = 204/239 (85%) Frame = -3 Query: 785 QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPA 606 +IEPVLRAHRLHRFCV+ EIPPQYASE DRLAN+EN +SNWE SPA Sbjct: 12 KIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENSAFSNWELQDQFFLAWLQSSLSPA 71 Query: 605 ILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDS 426 ILPSVIGCKHTFQLWENIHQSFQSKTKAQAR LRTQLRTTKKGSS+ISEFLAKIKHISDS Sbjct: 72 ILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTTKKGSSSISEFLAKIKHISDS 131 Query: 425 LLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKARI 246 L SIGESV+LQDQLDVILEGLPNEFE+LVTLINSKIEWF+LEEIRALLLAHE RLDKARI Sbjct: 132 LTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFNLEEIRALLLAHEQRLDKARI 191 Query: 245 TEEASSLNLSQSQPISKTPNLSNPNFATENPVAPQANYTAGNPNSDNSLSQNNAYKGNN 69 TEEA+SLN +QSQP SKTPN NPN ATE +APQAN+T GN NS N SQNN +K NN Sbjct: 192 TEEAASLNFTQSQPNSKTPNSVNPNSATETQIAPQANWTTGNSNSGNYDSQNNNFKNNN 250 >gb|KHN37157.1| hypothetical protein glysoja_046755, partial [Glycine soja] Length = 194 Score = 310 bits (793), Expect = e-103 Identities = 159/194 (81%), Positives = 168/194 (86%) Frame = -3 Query: 701 DRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKA 522 DRLAN+ENP +SNWE SPAILPSVIGCKHTFQLWENIHQSFQSKTKA Sbjct: 1 DRLANIENPAFSNWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKA 60 Query: 521 QARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEAL 342 QAR LRTQLRTTKKGSS+ISEFLAKIKHISDSL SIGESV+LQDQLDVILEGLPNEFE+L Sbjct: 61 QARQLRTQLRTTKKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESL 120 Query: 341 VTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQSQPISKTPNLSNPNFAT 162 VTLINSKIEWFDLEEIRALLLAHE RLDKARITEEA+SLN +QSQP SKTPN NPN AT Sbjct: 121 VTLINSKIEWFDLEEIRALLLAHEQRLDKARITEEAASLNFTQSQPNSKTPNSVNPNSAT 180 Query: 161 ENPVAPQANYTAGN 120 E +APQAN+T GN Sbjct: 181 ETQIAPQANWTTGN 194 >gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus cajan] Length = 202 Score = 175 bits (443), Expect = 6e-50 Identities = 85/200 (42%), Positives = 131/200 (65%) Frame = -3 Query: 857 TPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVEN 678 +P TF + IS KLD NYL+W Q+EPV++ HRLH F V+ +IPP++ + D+ N + Sbjct: 2 SPSLTFAHTISEKLDTKNYLLWCQQVEPVIKGHRLHHFLVNPQIPPKFLTISDKDENCVS 61 Query: 677 PDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQ 498 +Y WE S +L VIGCK +FQ+W+ IH+ F + T A+AR LR+ Sbjct: 62 EEYLAWEQQDQLLLSWLQSSMSKDMLTHVIGCKSSFQIWDKIHEYFHAHTNAKARQLRSD 121 Query: 497 LRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKI 318 LR+T + TIS++L +I+ + DSL +IG+SV+ ++ L ++L+GLP E+E+ V+LI+S+ Sbjct: 122 LRSTTLDNGTISDYLLRIQSLVDSLTAIGDSVSSKEHLGIVLDGLPEEYESTVSLISSRF 181 Query: 317 EWFDLEEIRALLLAHEPRLD 258 + +EE+ LLLAHE RL+ Sbjct: 182 DVLSIEEVETLLLAHESRLN 201 >gb|KYP37941.1| hypothetical protein KK1_040830 [Cajanus cajan] Length = 296 Score = 177 bits (450), Expect = 9e-50 Identities = 105/273 (38%), Positives = 161/273 (58%), Gaps = 1/273 (0%) Frame = -3 Query: 926 SSMASHPHTPRTPPILVTPPSTITPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHR 747 SS + PH P V P S+ P TF + IS KLD NYL+ Q+EPV++ HRLH Sbjct: 8 SSSTAPPHFQAAP---VKPSSS--PSLTFAHTISEKLDTKNYLLGCQQVEPVIKGHRLHH 62 Query: 746 FCVSLEIPPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQ 567 F V+ +I P++ + ++ N + +Y WE S +L VIGCK +FQ Sbjct: 63 FLVNPQILPKFLTVSNKDENRVSKEYLAWEQQDQLLLSWLQSSMSKDMLARVIGCKSSFQ 122 Query: 566 LWENIHQSFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQ 387 +W+ IH F + T A+AR L + LR+T + TIS++L +I+ + DSL +IG+SV+ ++ Sbjct: 123 IWDKIHAYFHAHTNAKARQLHSDLRSTTLDNCTISDYLLRIQSLVDSLTAIGDSVSSKEH 182 Query: 386 LDVILEGLPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQSQ 207 LD++LEGLP E+E+ V+LI+S+ + +EE+ LLLAHE RL+K + + S+NL +S Sbjct: 183 LDIVLEGLPGEYESTVSLISSRFDVLSIEEVETLLLAHEFRLEKFK-KKNLISVNLLESS 241 Query: 206 PISKTPNLS-NPNFATENPVAPQANYTAGNPNS 111 S TP L N A ++ P ++ G P++ Sbjct: 242 SGSNTPALQPQANLAHQDSQFP--SFRGGRPSA 272 >gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus cajan] Length = 363 Score = 179 bits (455), Expect = 9e-50 Identities = 99/240 (41%), Positives = 146/240 (60%) Frame = -3 Query: 899 PRTPPILVTPPSTITPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPP 720 P TP + PP P TF + IS KLD NYL+W Q++PV++ HRLH F V+ +IP Sbjct: 1 PLTPFLPKNPPPNSHPSLTFAHTISEKLDTKNYLLWCQQVKPVIKGHRLHHFLVNPQIPQ 60 Query: 719 QYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSF 540 ++ + DR + Y WE S +L VIGCK +FQLW+ IH F Sbjct: 61 KFLNLADRDVGRISEPYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKTSFQLWDKIHSYF 120 Query: 539 QSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLP 360 S A+AR LR +LR+T + TIS+++ +I+ + D+L +IG+SV+ ++ LD+ILEGLP Sbjct: 121 HSHMNAKARQLRNELRSTNLENQTISDYVLQIQTLVDTLTAIGDSVSPKEHLDIILEGLP 180 Query: 359 NEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQSQPISKTPNLS 180 E+E+ V+LI+S+ + +EE+ LLL HE RLDK + + A SLN++ + + PNLS Sbjct: 181 EEYESTVSLISSRFDLLSIEEVETLLLGHESRLDKFK-KKVAVSLNVTTT---TLEPNLS 236 >gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1408 Score = 187 bits (475), Expect = 4e-49 Identities = 109/293 (37%), Positives = 169/293 (57%), Gaps = 11/293 (3%) Frame = -3 Query: 920 MASHPHTPRTPPILVTPPSTITPLS---------TFNYKISVKLDETNYLVWL*QIEPVL 768 M S P P PPI P +T+ P + TF++ IS KLD NYL+W Q+EPV+ Sbjct: 1 MNSSPPPPPPPPIHAHPVNTVPPKNPPSNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVI 60 Query: 767 RAHRLHRFCVSLEIPPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVI 588 + HRLH + V+ +IP ++A+ DR A + Y WE S +L VI Sbjct: 61 KGHRLHHYLVNPQIPQKFATLADRDAGHISESYLAWEQQDQLLLSWLQSSMSKDMLTRVI 120 Query: 587 GCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGE 408 GCK +FQLW+ IH F S A+AR LR +LR+T + +ISE++ +I+ + D+L +IG+ Sbjct: 121 GCKSSFQLWDKIHTYFHSHMNAKARQLRNELRSTTLDNLSISEYVLRIQTLVDALTAIGD 180 Query: 407 SVTLQDQLDVILEGLPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASS 228 SV+ ++ LD+ILEGLP E+E+ V+LI+S+ + ++E+ LLL HE RLDK + + A+S Sbjct: 181 SVSPKEHLDIILEGLPEEYESTVSLISSRFDLLTIDEVETLLLGHESRLDKFK-KKAAAS 239 Query: 227 LNLSQSQPISKTPNLSNP--NFATENPVAPQANYTAGNPNSDNSLSQNNAYKG 75 +N++ + P+ +NP + +N + ++ G NS N A +G Sbjct: 240 INVT-TAVTEPDPSATNPQAHLTHQNNQSGPSHRRGGRTNSRGGRFSNWAGRG 291 >gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1102 Score = 185 bits (469), Expect = 2e-48 Identities = 106/271 (39%), Positives = 156/271 (57%), Gaps = 9/271 (3%) Frame = -3 Query: 920 MASHPHTPRTPPILVTPPSTITPLS---------TFNYKISVKLDETNYLVWL*QIEPVL 768 M S P P PPI P +T+ P + TF++ IS KLD NYL+W Q+EPV+ Sbjct: 1 MNSSPPPPPPPPIHANPVNTVPPKNPPPNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVI 60 Query: 767 RAHRLHRFCVSLEIPPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVI 588 + HRLH + V+ +IP ++A+ DR A + Y WE S +L VI Sbjct: 61 KGHRLHHYLVNPQIPQKFATLADRDAGRISESYLAWEQQDQLLLSWLQSSMSKDMLTRVI 120 Query: 587 GCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGE 408 GCK +FQLW+ IH F S A+AR LR +LR T + +ISE++ +I+ + D+L +IG Sbjct: 121 GCKSSFQLWDKIHSYFHSHMNAKARQLRNELRNTSLENLSISEYVLRIQTLVDALTAIGN 180 Query: 407 SVTLQDQLDVILEGLPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKARITEEASS 228 SV+ ++ LD+ILEGLP E+E+ V+LI+S + ++E+ LLL HE RLDK + + A+S Sbjct: 181 SVSPKEHLDIILEGLPEEYESTVSLISSHFDLLTIDEVETLLLGHESRLDKFK-KKVAAS 239 Query: 227 LNLSQSQPISKTPNLSNPNFATENPVAPQAN 135 +N+ T + PN + NP A A+ Sbjct: 240 INV--------TTTTTEPNPSVTNPQAHLAH 262 >gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan] Length = 255 Score = 172 bits (437), Expect = 2e-48 Identities = 87/219 (39%), Positives = 138/219 (63%), Gaps = 1/219 (0%) Frame = -3 Query: 866 STITPLSTF-NYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLA 690 + ++P S F + I+ KLD++NYL W QIEP +++H+L RF V+ +IPP+Y + DR + Sbjct: 5 NNLSPFSQFFSNSIAEKLDDSNYLHWRQQIEPAIKSHKLQRFVVNPQIPPRYLTNADRDS 64 Query: 689 NVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARH 510 ++ NP Y WE S IL VIG H++Q+W+ +H+ F ++TKA+AR Sbjct: 65 DIVNPAYETWEVQDQMLLTWLQSTLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQ 124 Query: 509 LRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLI 330 LRT L +T ++ +FL +IK+I+D L +G V+L++ +DV+LEGLP E+ +V++I Sbjct: 125 LRTDLCSTTLDGKSMRDFLTQIKNIADELAGVGSPVSLEEYVDVVLEGLPQEYAPVVSVI 184 Query: 329 NSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQ 213 SK + E+ ALLLAHE R ++ R + S+N +Q Sbjct: 185 ESKFVTPPIAEVEALLLAHESRANQFRKQSFSPSINYTQ 223 >gb|PNX63257.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 247 Score = 171 bits (433), Expect = 8e-48 Identities = 88/215 (40%), Positives = 134/215 (62%), Gaps = 2/215 (0%) Frame = -3 Query: 887 PILVTPPSTITPLSTFNY--KISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQY 714 P LV S+ N+ K+S+KLD+ N+L+W Q+E V+ +++LHRF V+ EIP +Y Sbjct: 5 PFLVGGSSSGNNSGVVNFAPKLSIKLDDKNFLLWNQQVEGVILSNKLHRFVVNPEIPAKY 64 Query: 713 ASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQS 534 SE DR ++ + Y W + ++LP IGC H FQ+WE IH+ F + Sbjct: 65 NSESDRELDIVSEAYDKWIVQDQMLFTWLLSTLAESVLPRTIGCCHAFQVWEQIHKYFNA 124 Query: 533 KTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNE 354 KA+ R LR++L+T KKG+ TI+EF+ +++ I+D+LLS+G+SVT QDQ+D IL+GLP E Sbjct: 125 HLKAKVRQLRSELKTIKKGTKTITEFVLRVRAIADTLLSVGDSVTEQDQIDSILDGLPEE 184 Query: 353 FEALVTLINSKIEWFDLEEIRALLLAHEPRLDKAR 249 + V +I + + L +I LLL E +L+K R Sbjct: 185 YNPFVMMIYGRSDSPSLYDIEGLLLVQESQLEKFR 219 >ref|XP_019442260.1| PREDICTED: uncharacterized protein LOC109346975 [Lupinus angustifolius] Length = 246 Score = 170 bits (430), Expect = 2e-47 Identities = 86/217 (39%), Positives = 138/217 (63%), Gaps = 2/217 (0%) Frame = -3 Query: 899 PRTPPILVTPPSTITP--LSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEI 726 P T P L PST+ P TF++ +SVKL E NY +W Q+E VL +HRL RF V+ +I Sbjct: 5 PSTSPFLSQRPSTVIPPPTVTFSHNVSVKLSEKNYFIWKQQVEAVLASHRLERFVVNPQI 64 Query: 725 PPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQ 546 P ++ SEEDR NP + WE S ++ V+G KH +Q+W+ +H Sbjct: 65 PFRFLSEEDRDLQRMNPAFLQWEEQDQHLVSWLLQSLSESVQSRVLGLKHAWQIWDEVHT 124 Query: 545 SFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEG 366 F S +A+++ L ++LRT KKGS +++EF+++IK + SL ++GE ++ ++Q+ ++LEG Sbjct: 125 FFDSLIRARSQLLLSELRTIKKGSQSVTEFVSRIKALIHSLAAVGEVISDREQVRLVLEG 184 Query: 365 LPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDK 255 LP+E E+ VT+IN++IE ++ +LL+AHE L++ Sbjct: 185 LPSECESFVTVINNRIESCSFIQLESLLMAHEAYLER 221 >gb|PNX90864.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 401 Score = 174 bits (441), Expect = 3e-47 Identities = 94/252 (37%), Positives = 150/252 (59%), Gaps = 7/252 (2%) Frame = -3 Query: 920 MASHPHTPRTP----PILVTPPSTI-TPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHR 756 M + P T P I + P+T + S + I++KLDE N+L+W Q+ V+ AH Sbjct: 1 MTTTPSTDEIPIADSGIAIDLPNTKGSTKSALTHSITIKLDEKNFLLWSQQVNGVITAHN 60 Query: 755 LHRFCVSLEIPPQYASEEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKH 576 LHRF V+ EIP Q+AS DRL + +Y W S +LP V+ CKH Sbjct: 61 LHRFVVNPEIPLQFASVADRLDGKTSDEYQRWLFKDQSLFTWLLSTISDNVLPRVLSCKH 120 Query: 575 TFQLWENIHQSFQSKTKAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTL 396 +++W+ IH+ F S KA+AR LR++L+ TKK + +++E+L +IK I +SL+++G+ V++ Sbjct: 121 AYEVWDTIHKYFNSVLKARARQLRSELKNTKKLTRSVNEYLLRIKSIVNSLIAVGDVVSV 180 Query: 395 QDQLDVILEGLPNEFEALVTLINSKIEWFDLEEIRALLLAHEPRLDKAR--ITEEASSLN 222 Q+Q+D +LEGLP E+ + V L+ S+ E +E++ ALLL E + +K R +T + S N Sbjct: 181 QEQVDAVLEGLPEEYNSFVMLVYSRFETPTVEDVEALLLLQEVQFEKFRQELTNPSVSAN 240 Query: 221 LSQSQPISKTPN 186 ++ S +PN Sbjct: 241 VAHMNSKSNSPN 252 >dbj|GAU15285.1| hypothetical protein TSUD_03520 [Trifolium subterraneum] Length = 392 Score = 173 bits (439), Expect = 4e-47 Identities = 93/259 (35%), Positives = 152/259 (58%), Gaps = 2/259 (0%) Frame = -3 Query: 887 PILVTPPSTITPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYAS 708 P++V + T ++F K+ KLD+ NYL+W Q+E V+ A++LHRF V+ +IP +YAS Sbjct: 10 PVMVESTGSSTNAASFTPKL--KLDDGNYLLWSQQVEGVILANKLHRFVVNPQIPAKYAS 67 Query: 707 EEDRLANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKT 528 E DR + + Y W + ++LP IGC+H FQ+W+ IH+ F++ Sbjct: 68 ESDRELDRVSEAYDKWLVQDQMLFTWLLSTLAESVLPRTIGCRHAFQVWDQIHKHFEAHL 127 Query: 527 KAQARHLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFE 348 KA+ R LR++L+ KKG+ +I+EF+ +++ I+D+L+SIG+S++ QDQ+D ILEGLP E+ Sbjct: 128 KAKVRQLRSELKNVKKGTKSITEFVLRVRVIADTLISIGDSISEQDQIDSILEGLPEEYN 187 Query: 347 ALVTLINSKIEWFDLEEIRALLLAHEPRLDKAR--ITEEASSLNLSQSQPISKTPNLSNP 174 V +I + + L +I LLL E +L+K R ++ ++S NL+ S+ Sbjct: 188 PFVMMIYGRSDSPSLYDIEGLLLVQESQLEKFRQELSTPSASANLAHSRGGRGNSGARGR 247 Query: 173 NFATENPVAPQANYTAGNP 117 +T P A+ T P Sbjct: 248 GRSTRGRGRPAASPTGNRP 266 >gb|KYP33001.1| hypothetical protein KK1_046197, partial [Cajanus cajan] Length = 470 Score = 175 bits (443), Expect = 6e-47 Identities = 98/268 (36%), Positives = 158/268 (58%), Gaps = 2/268 (0%) Frame = -3 Query: 872 PPSTITPLSTFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRL 693 PP P TF++ IS KL NYL+W Q+EPV++ HRLH + V+ +IP ++A+ DR Sbjct: 5 PPPNSHPSLTFSHTISEKLGTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRD 64 Query: 692 ANVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQAR 513 A + Y WE S +L VIGCK +FQLW+ IH F S A+A Sbjct: 65 AGCISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKAC 124 Query: 512 HLRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTL 333 LR +L +T + +ISE++ +I+ + D+L +IG+SV+L++ LD+ILEGLP E+E+ ++L Sbjct: 125 QLRNELCSTSLENLSISEYVLRIQTLVDALTAIGDSVSLKEHLDIILEGLPEEYESTMSL 184 Query: 332 INSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQSQPISKTPNLSNP--NFATE 159 I+S+ + ++E+ LLL HE RLDK + + A+ +N++ + I P+++NP + A + Sbjct: 185 ISSRFDLLTIDEVETLLLGHESRLDKFK-KKAAAYINVT-TATIEPNPSVTNPQAHLAHQ 242 Query: 158 NPVAPQANYTAGNPNSDNSLSQNNAYKG 75 + ++ G+ N N A +G Sbjct: 243 ENQSGFSHRRGGHTNFRGGRFSNRAGRG 270 >gb|PNX63176.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 273 Score = 167 bits (424), Expect = 3e-46 Identities = 81/198 (40%), Positives = 128/198 (64%) Frame = -3 Query: 842 FNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDYSN 663 F K+S+KLDE N+L+W Q+E V+ +H+LHRF V+ +IP +Y SE DR + + Y Sbjct: 46 FAPKLSIKLDEKNFLLWHQQVEGVIISHKLHRFVVNPQIPAKYDSEADRELDEVSAAYDK 105 Query: 662 WEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTTK 483 W + ++LP I C+H FQ+WE IH F ++TKA+ R LR++L+T K Sbjct: 106 WLVQDQMLFTWLLSTLAESVLPRTISCRHAFQVWEEIHNYFNAQTKAKIRQLRSELKTVK 165 Query: 482 KGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWFDL 303 KG+ TI+EF+ +++ ++++L+SIG+SV+ QDQ+D IL+GLP E+ V ++ + + L Sbjct: 166 KGTKTITEFILRVRAVANTLISIGDSVSEQDQIDSILDGLPEEYNPFVMMMYGRSDSPSL 225 Query: 302 EEIRALLLAHEPRLDKAR 249 +I LLL E +L+K R Sbjct: 226 FDIEGLLLVQESQLEKFR 243 >ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanus cajan] Length = 385 Score = 170 bits (431), Expect = 5e-46 Identities = 87/219 (39%), Positives = 136/219 (62%), Gaps = 1/219 (0%) Frame = -3 Query: 866 STITPLSTF-NYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLA 690 + ++P S F + I+ KLD++NYL W QIEPV+++H+L RF V+ + PPQY + DR + Sbjct: 5 NNLSPFSQFFSNSIAEKLDDSNYLHWRQQIEPVIKSHKLQRFVVNPQSPPQYLTNADRDS 64 Query: 689 NVENPDYSNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARH 510 ++ NP Y WE S IL VIG H++Q+W+ +H+ F ++TKA AR Sbjct: 65 DIVNPAYETWEVQDQMLLTWLQSTLSKTILSHVIGSVHSYQVWDKVHEYFHTQTKACARQ 124 Query: 509 LRTQLRTTKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLI 330 LRT L +T ++ +FL +IK+I+D L +G V+L++ +D +LEGLP E+ +V++I Sbjct: 125 LRTDLCSTTLDGKSMRDFLTQIKNIADELAGVGSLVSLEEYVDAVLEGLPQEYAPVVSVI 184 Query: 329 NSKIEWFDLEEIRALLLAHEPRLDKARITEEASSLNLSQ 213 SK + E+ ALLLAHE R ++ R + S+N +Q Sbjct: 185 ESKFVTPPIAEVEALLLAHESRANRFRKQSFSPSINYTQ 223 >gb|PNY10805.1| histone deacetylase, partial [Trifolium pratense] Length = 720 Score = 176 bits (446), Expect = 7e-46 Identities = 88/214 (41%), Positives = 138/214 (64%), Gaps = 2/214 (0%) Frame = -3 Query: 845 TFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDYS 666 +F K+S+KLD+ NYL+W Q+E V+ AH+LH+F V+ +IP +YASE DRL + Y Sbjct: 23 SFAPKLSIKLDDKNYLLWNQQVEGVILAHKLHKFVVNPQIPMKYASESDRLLDKVTDAYD 82 Query: 665 NWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRTT 486 W + ++LP +GC+H FQ+W+ IH+ F + KA+ R LR++L+T Sbjct: 83 QWLVQDQMLFTWLLSTLAESVLPRTVGCRHAFQVWDQIHKYFDAHLKAKVRQLRSELKTV 142 Query: 485 KKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWFD 306 KKG+ +ISEF+ +++ I+D+L+SIG+S++ QDQ+D ILEGLP E+ V +I + + Sbjct: 143 KKGTKSISEFVLRVRAIADTLISIGDSISEQDQIDSILEGLPEEYNPFVMMIYGRSDSPS 202 Query: 305 LEEIRALLLAHEPRLDKAR--ITEEASSLNLSQS 210 L +I LLL E +L K R ++ A+S N++ S Sbjct: 203 LFDIEGLLLVQESQLAKFRQELSLPAASANVAHS 236 >gb|PNX57709.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 242 Score = 166 bits (419), Expect = 8e-46 Identities = 81/200 (40%), Positives = 125/200 (62%) Frame = -3 Query: 848 STFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDY 669 S + +++KLDE NYL+W Q+ V+ AH LHRF V+ EIP QY S DRL + +Y Sbjct: 30 SGLTHSLTIKLDEKNYLLWNQQVNGVITAHNLHRFVVNPEIPLQYTSVADRLDGKNSDEY 89 Query: 668 SNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRT 489 W S +LP V+ CKH+ ++W+ IH+ F S K++AR LR++L+ Sbjct: 90 QRWLFKDQTLFTWLLSTISDGVLPRVLNCKHSHEVWDTIHKFFNSVLKSRARQLRSELKN 149 Query: 488 TKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWF 309 TKK S +++E+L ++K I +SL+++G+ V+ Q+Q+D ILEGLP EF + V ++ S+ E Sbjct: 150 TKKLSKSVNEYLLRVKSIVNSLIAVGDVVSEQEQVDAILEGLPEEFNSFVMMVYSRFETP 209 Query: 308 DLEEIRALLLAHEPRLDKAR 249 +E + ALLL E + +K R Sbjct: 210 TVENVEALLLLQEVQFEKFR 229 >gb|PNX56029.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 242 Score = 166 bits (419), Expect = 8e-46 Identities = 81/200 (40%), Positives = 125/200 (62%) Frame = -3 Query: 848 STFNYKISVKLDETNYLVWL*QIEPVLRAHRLHRFCVSLEIPPQYASEEDRLANVENPDY 669 S + +++KLDE NYL+W Q+ V+ AH LHRF V+ EIP QY S DRL + +Y Sbjct: 30 SGLTHSLTIKLDEKNYLLWNQQVNGVITAHNLHRFVVNPEIPLQYTSVADRLDGKNSDEY 89 Query: 668 SNWEXXXXXXXXXXXXXXSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARHLRTQLRT 489 W S +LP V+ CKH+ ++W+ IH+ F S K++AR LR++L+ Sbjct: 90 QRWLFKDQTLFTWLLSTISDGVLPRVLNCKHSHEVWDTIHKFFNSVLKSRARQLRSELKN 149 Query: 488 TKKGSSTISEFLAKIKHISDSLLSIGESVTLQDQLDVILEGLPNEFEALVTLINSKIEWF 309 TKK S +++E+L ++K I +SL+++G+ V+ Q+Q+D ILEGLP EF + V ++ S+ E Sbjct: 150 TKKLSKSVNEYLLRVKSIVNSLIAVGDVVSEQEQVDAILEGLPEEFNSFVMMVYSRFETP 209 Query: 308 DLEEIRALLLAHEPRLDKAR 249 +E + ALLL E + +K R Sbjct: 210 TVENVEALLLLQEVQFEKFR 229