BLASTX nr result
ID: Astragalus23_contig00026104
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus23_contig00026104 (1000 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Gly... 253 5e-79 gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposo... 260 9e-75 ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797... 232 8e-71 gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposo... 190 3e-50 gb|KYP33001.1| hypothetical protein KK1_046197, partial [Cajanus... 179 6e-49 gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposo... 186 8e-49 gb|PNY02430.1| retrovirus-related Pol polyprotein from transposo... 184 4e-48 gb|KHN37157.1| hypothetical protein glysoja_046755, partial [Gly... 164 3e-46 gb|KYP61342.1| Retrovirus-related Pol polyprotein from transposo... 177 1e-45 gb|KYP40244.1| Retrovirus-related Pol polyprotein from transposo... 172 2e-44 gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan] 161 3e-44 gb|PNX90878.1| retrovirus-related Pol polyprotein from transposo... 160 4e-43 ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanu... 161 8e-43 dbj|GAU15285.1| hypothetical protein TSUD_03520 [Trifolium subte... 161 1e-42 gb|KYP47787.1| Retrovirus-related Pol polyprotein from transposo... 167 2e-42 gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus... 152 3e-41 dbj|GAU19342.1| hypothetical protein TSUD_336290 [Trifolium subt... 162 8e-41 gb|PNX81124.1| histone deacetylase, partial [Trifolium pratense] 160 1e-40 gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus... 155 2e-40 gb|PNY05793.1| histone deacetylase [Trifolium pratense] 162 2e-40 >gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Glycine soja] Length = 323 Score = 253 bits (646), Expect = 5e-79 Identities = 133/319 (41%), Positives = 179/319 (56%), Gaps = 17/319 (5%) Frame = +2 Query: 26 STNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDAD 205 +T ++ P SF Y++S KL N +WLQQ+EPVLRAH+LHRFCV+PE+P +Y ++ D Sbjct: 3 ATPNSTITPLSSFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHD 62 Query: 206 RLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQ 385 RLA IENPA+ WE P+IL VIGC++TFQLW+ IHQ +KTK Q Sbjct: 63 RLANIENPAFSNWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQ 122 Query: 386 ARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESIT 565 ARQLR +L TKKG SI+EF A++K I+D+L SIGE++SLQ+Q DV+L+ LP E+ES+ Sbjct: 123 ARQLRTQLRTTKKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLV 182 Query: 566 TLLSTKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAV 745 TL+++K EW HE R+ K + ASLN +Q+ S+ + A Sbjct: 183 TLINSKIEWFDLEEIRALLLAHEQRLDK-ARITEEAASLNFTQSQPNSKIPNSVNPNSAT 241 Query: 746 TTDTIPQAQFTA-----------------SXXXXXXXXXXXXXXXXXXXXXSNVQCQVCS 874 T PQA +T + S VQCQVC Sbjct: 242 ETQIAPQANWTTGNSNSGNYDSQNNNFKNNNQSRGRGGRNGRGNRGGRGGRSTVQCQVCH 301 Query: 875 RFGHDASVCYHRYTPQFAA 931 R GHDAS CYHR+ + + Sbjct: 302 RTGHDASYCYHRFNAAYGS 320 >gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1429 Score = 260 bits (665), Expect = 9e-75 Identities = 139/323 (43%), Positives = 184/323 (56%), Gaps = 17/323 (5%) Frame = +2 Query: 59 SFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENPAYV 238 SF Y++S KL N +WLQQ+EPVLRAH+LHRFCV+PE+P +Y ++ DRLA IENPA+ Sbjct: 1 SFNYKISVKLDATNYLVWLQQIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENPAFS 60 Query: 239 EWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAELHNT 418 WE P+IL VIGC++TFQLW+ IHQ +KTK QARQLR +L T Sbjct: 61 NWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTT 120 Query: 419 KKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSEWXX 598 KKG SI+EF A++K I+D+L SIGE++SLQ+Q DV+L+ LP E+ES+ TL+++K EW Sbjct: 121 KKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFD 180 Query: 599 XXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDTIPQAQFT 778 HE R+ K + ASLN +Q+ S++ + A T PQA +T Sbjct: 181 LEEIRALLLAHEQRLDK-ARITEEAASLNFTQSQPNSKTPNSVNPNSATETQIAPQANWT 239 Query: 779 A-----------------SXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYH 907 + S VQCQVC R GHDAS CYH Sbjct: 240 TGNSNSGNYDSQNNNFKNNNQSRGRGGRNGRGNRGGRGGRSTVQCQVCHRTGHDASYCYH 299 Query: 908 RYTPQFAAPAMVPNFGNPYQYVR 976 R+ + + + GNPYQYVR Sbjct: 300 RFNAAYGSNQPYVH-GNPYQYVR 321 >ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797270 [Glycine max] Length = 329 Score = 232 bits (592), Expect = 8e-71 Identities = 126/303 (41%), Positives = 169/303 (55%), Gaps = 17/303 (5%) Frame = +2 Query: 119 QVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENPAYVEWEXXXXXXXXXXXXXXXPS 298 ++EPVLRAH+LHRFCV+PE+P +Y ++ DRLA IEN A+ WE P+ Sbjct: 12 KIEPVLRAHRLHRFCVTPEIPPQYASEHDRLANIENSAFSNWELQDQFFLAWLQSSLSPA 71 Query: 299 ILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAELHNTKKGERSITEFFARLKTITDA 478 IL VIGC++TFQLW+ IHQ +KTK QARQLR +L TKKG SI+EF A++K I+D+ Sbjct: 72 ILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLRTTKKGSSSISEFLAKIKHISDS 131 Query: 479 LLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSEWXXXXXXXXXXXXHESRIAKQTD 658 L SIGE++SLQ+Q DV+L+ LP E+ES+ TL+++K EW HE R+ K Sbjct: 132 LTSIGESVSLQDQLDVILEGLPNEFESLVTLINSKIEWFNLEEIRALLLAHEQRLDK-AR 190 Query: 659 VATAVASLNLSQAISKSESQPKSTDDIAVTTDTIPQAQFTA-----------------SX 787 + ASLN +Q+ S++ + A T PQA +T + Sbjct: 191 ITEEAASLNFTQSQPNSKTPNSVNPNSATETQIAPQANWTTGNSNSGNYDSQNNNFKNNN 250 Query: 788 XXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQFAAPAMVPNFGNPYQ 967 S VQCQVC GHDAS CYHR+ + + + GNPYQ Sbjct: 251 QSRGRGGRNGRGNRGGRGGRSTVQCQVCHCTGHDASYCYHRFNAAYGSNQPYVH-GNPYQ 309 Query: 968 YVR 976 YVR Sbjct: 310 YVR 312 >gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1408 Score = 190 bits (482), Expect = 3e-50 Identities = 113/332 (34%), Positives = 172/332 (51%), Gaps = 13/332 (3%) Frame = +2 Query: 26 STNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDAD 205 S + PSL +F + +SEKL N LW QQVEPV++ H+LH + V+P++P ++ T AD Sbjct: 28 SNSHPSL----TFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLAD 83 Query: 206 RLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQ 385 R A + +Y+ WE +L +VIGC+ +FQLWDKIH + H+ + Sbjct: 84 RDAGHISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHTYFHSHMNAK 143 Query: 386 ARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESIT 565 ARQLR EL +T SI+E+ R++T+ DAL +IG+++S +E D++L+ LP EYES Sbjct: 144 ARQLRNELRSTTLDNLSISEYVLRIQTLVDALTAIGDSVSPKEHLDIILEGLPEEYESTV 203 Query: 566 TLLSTKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAV 745 +L+S++ + HESR+ K A AS+N++ A+ +E P +T+ Sbjct: 204 SLISSRFDLLTIDEVETLLLGHESRLDKFK--KKAAASINVTTAV--TEPDPSATN---- 255 Query: 746 TTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSN-----------VQCQVCSRFGHDA 892 PQA T SN QCQVC R+GH A Sbjct: 256 -----PQAHLTHQNNQSGPSHRRGGRTNSRGGRFSNWAGRGRGRFAGYQCQVCHRYGHVA 310 Query: 893 SVCYHRYTPQF--AAPAMVPNFGNPYQYVRPG 982 S CY+R+ + ++P P + + Q+ PG Sbjct: 311 SACYYRFDETYVPSSPLEAPAYPSNNQHTNPG 342 >gb|KYP33001.1| hypothetical protein KK1_046197, partial [Cajanus cajan] Length = 470 Score = 179 bits (455), Expect = 6e-49 Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 4/319 (1%) Frame = +2 Query: 38 PSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAE 217 P+ P +F + +SEKL N LW QQVEPV++ H+LH + V+P++P ++ T ADR A Sbjct: 7 PNSHPSLTFSHTISEKLGTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRDAG 66 Query: 218 IENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQL 397 + +Y+ WE +L +VIGC+ +FQLWDKIH + H+ +A QL Sbjct: 67 CISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKACQL 126 Query: 398 RAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLS 577 R EL +T SI+E+ R++T+ DAL +IG+++SL+E D++L+ LP EYES +L+S Sbjct: 127 RNELCSTSLENLSISEYVLRIQTLVDALTAIGDSVSLKEHLDIILEGLPEEYESTMSLIS 186 Query: 578 TKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDT 757 ++ + HESR+ K A A ++ + E P T+ A Sbjct: 187 SRFDLLTIDEVETLLLGHESRLDKFKKKAAAYINVTTATI----EPNPSVTNPQAHLAHQ 242 Query: 758 IPQAQFT--ASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQF-- 925 Q+ F+ + QCQVC R+ H AS CY+R+ + Sbjct: 243 ENQSGFSHRRGGHTNFRGGRFSNRAGRGRGRFAAYQCQVCHRYEHVASACYYRFDETYVP 302 Query: 926 AAPAMVPNFGNPYQYVRPG 982 ++P P + + Q+ PG Sbjct: 303 SSPLEAPAYHSINQHTNPG 321 >gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1102 Score = 186 bits (471), Expect = 8e-49 Identities = 107/319 (33%), Positives = 165/319 (51%), Gaps = 4/319 (1%) Frame = +2 Query: 38 PSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAE 217 P+ P +F + +SEKL N LW QQVEPV++ H+LH + V+P++P ++ T ADR A Sbjct: 28 PNSHPSLTFSHTISEKLDTKNYLLWCQQVEPVIKGHRLHHYLVNPQIPQKFATLADRDAG 87 Query: 218 IENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQL 397 + +Y+ WE +L +VIGC+ +FQLWDKIH + H+ +ARQL Sbjct: 88 RISESYLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKARQL 147 Query: 398 RAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLS 577 R EL NT SI+E+ R++T+ DAL +IG ++S +E D++L+ LP EYES +L+S Sbjct: 148 RNELRNTSLENLSISEYVLRIQTLVDALTAIGNSVSPKEHLDIILEGLPEEYESTVSLIS 207 Query: 578 TKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDT 757 + + HESR+ K AS+N++ + +E P T+ A Sbjct: 208 SHFDLLTIDEVETLLLGHESRLDKFK--KKVAASINVT--TTTTEPNPSVTNPQAHLAHQ 263 Query: 758 IPQAQFT--ASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQF-- 925 Q+ F+ + QCQVC R+GH AS CY+R+ + Sbjct: 264 ENQSGFSHRQGGRTNFRGGRFSNRAGRGRGRFAGYQCQVCHRYGHVASACYYRFDETYVP 323 Query: 926 AAPAMVPNFGNPYQYVRPG 982 ++P P + + Q+ PG Sbjct: 324 SSPLEAPAYHSINQHTNPG 342 >gb|PNY02430.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 1064 Score = 184 bits (466), Expect = 4e-48 Identities = 104/322 (32%), Positives = 161/322 (50%), Gaps = 9/322 (2%) Frame = +2 Query: 62 FQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENPAYVE 241 F ++SEKLT DN PLW QQ+EP + AH L F V P VP ++LTD+DR + NPAY+ Sbjct: 30 FSLKISEKLTEDNFPLWRQQIEPYINAHNLTEFVVCPRVPPQFLTDSDRATGVTNPAYLS 89 Query: 242 WEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAELHNTK 421 W IL ++IGC ++ +LWD++ + + + +AR LR EL Sbjct: 90 WRSRDGMLLSWLQSTLSSEILTRMIGCSFSHELWDRLFAYFQKQIRAKARHLRVELRTHT 149 Query: 422 KGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSEWXXX 601 +RS+ E+ +++ DAL SIG+ + DV+L+ LP+++ + +++ + + Sbjct: 150 LADRSVKEYLLQIRKTVDALASIGDPLPPSHHIDVILEGLPSDFAPVVSVIEGRFDAIDL 209 Query: 602 XXXXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDT----IPQ 766 HE R+ K + V + VASLNL+ A S S + + D T++ P+ Sbjct: 210 DEVEVLLLAHELRMEKFKKRVISDVASLNLTHA-SSSTAPVTNGDSNETPTESPPPPSPE 268 Query: 767 AQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQF----AAP 934 + + S++QCQVC++FGH A C+HR+ QF A P Sbjct: 269 PDYNSFRGSRGGRGGRGGRGGRGRGRNSDLQCQVCAKFGHSALNCWHRFNQQFQGNPAPP 328 Query: 935 AMVPNFGNPYQYVRPGYVPPAA 1000 P +GNPY G PP A Sbjct: 329 VPQPRYGNPYGNPY-GNAPPQA 349 >gb|KHN37157.1| hypothetical protein glysoja_046755, partial [Glycine soja] Length = 194 Score = 164 bits (416), Expect = 3e-46 Identities = 86/192 (44%), Positives = 116/192 (60%) Frame = +2 Query: 203 DRLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKV 382 DRLA IENPA+ WE P+IL VIGC++TFQLW+ IHQ +KTK Sbjct: 1 DRLANIENPAFSNWELQDQLLLAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKA 60 Query: 383 QARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESI 562 QARQLR +L TKKG SI+EF A++K I+D+L SIGE++SLQ+Q DV+L+ LP E+ES+ Sbjct: 61 QARQLRTQLRTTKKGSSSISEFLAKIKHISDSLTSIGESVSLQDQLDVILEGLPNEFESL 120 Query: 563 TTLLSTKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIA 742 TL+++K EW HE R+ K + ASLN +Q+ S++ + A Sbjct: 121 VTLINSKIEWFDLEEIRALLLAHEQRLDK-ARITEEAASLNFTQSQPNSKTPNSVNPNSA 179 Query: 743 VTTDTIPQAQFT 778 T PQA +T Sbjct: 180 TETQIAPQANWT 191 >gb|KYP61342.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1358 Score = 177 bits (448), Expect = 1e-45 Identities = 105/321 (32%), Positives = 162/321 (50%), Gaps = 5/321 (1%) Frame = +2 Query: 14 MASSSTNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYL 193 MA+S+ + P + F + ++EKL N W QQ+EPV+++HKL RF V+P++P RYL Sbjct: 1 MATSNNSSPFSQFFSN---SIAEKLDDSNYLHWRQQIEPVIKSHKLQRFVVNPQIPPRYL 57 Query: 194 TDADRLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAK 373 TDADR ++I NPAY WE SIL++VIG +++Q+WDK+H++ H + Sbjct: 58 TDADRDSDIVNPAYETWEVQDQMLLTWLQSTLSKSILSRVIGSVHSYQVWDKVHEYFHTQ 117 Query: 374 TKVQARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEY 553 TK +ARQLR +L +T +S+ +F ++KTI D L +G +SL+E D VL+ LP EY Sbjct: 118 TKARARQLRTDLRSTTLDGQSMRDFLTQIKTIADELAGVGSPVSLEEYVDAVLEGLPQEY 177 Query: 554 ESITTLLSTKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTD 733 + +++ +K HESR A + + S+N +Q S+ Sbjct: 178 APVVSVIESKFVTPPIAEVEALLLAHESR-ANRFRKQSFSPSINYTQGYSRG-------- 228 Query: 734 DIAVTTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHR- 910 + +N QCQ+C ++GH A+VC++R Sbjct: 229 ---------------SVSGGHSGRRGGRGSGRGRGGRFANFQCQICFKYGHTANVCFYRA 273 Query: 911 ---YTP-QFAAPAMVPNFGNP 961 Y P + AMV N P Sbjct: 274 DVNYQPAESLVLAMVANTSQP 294 >gb|KYP40244.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 720 Score = 172 bits (435), Expect = 2e-44 Identities = 99/309 (32%), Positives = 155/309 (50%), Gaps = 6/309 (1%) Frame = +2 Query: 41 SLKPFKSF-QYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAE 217 +L PF F ++EKL N W QQ++P++++HKL RF V+P++P RYLTDADR + Sbjct: 6 NLSPFSQFFSNSIAEKLDDSNYLHWRQQIKPIIKSHKLQRFVVNPQIPPRYLTDADRDYD 65 Query: 218 IENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQL 397 I NPAY WE +IL++VIG +++Q+WDK+H++ H +TK +ARQL Sbjct: 66 IVNPAYETWEVQDQMLLTWLQSMLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQL 125 Query: 398 RAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLS 577 R +L +T +S+ +F ++K I D L +G +SL+E DVVL+ LP EY + +++ Sbjct: 126 RTDLRSTTLDGKSMRDFLTQIKNIADQLAGVGSPMSLEEYVDVVLEGLPQEYTPVVSVIE 185 Query: 578 TKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDT 757 +K HESR+ + + + S+N +Q S+ + Sbjct: 186 SKFVTPPIAEVEALLLAHESRVNRFRKQSFS-PSINYTQGYSRG---------------S 229 Query: 758 IPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHR-----YTPQ 922 I F +N CQ C ++GH A+VC++R + Sbjct: 230 ISGESFRDRDGGHSGCRGGQGSGRGRGGRFANFHCQNCFKYGHTANVCFYRADVNYQLVE 289 Query: 923 FAAPAMVPN 949 F AMV N Sbjct: 290 FLVLAMVAN 298 >gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan] Length = 255 Score = 161 bits (408), Expect = 3e-44 Identities = 84/223 (37%), Positives = 129/223 (57%), Gaps = 1/223 (0%) Frame = +2 Query: 41 SLKPFKSF-QYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAE 217 +L PF F ++EKL N W QQ+EP +++HKL RF V+P++P RYLT+ADR ++ Sbjct: 6 NLSPFSQFFSNSIAEKLDDSNYLHWRQQIEPAIKSHKLQRFVVNPQIPPRYLTNADRDSD 65 Query: 218 IENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQL 397 I NPAY WE +IL++VIG +++Q+WDK+H++ H +TK +ARQL Sbjct: 66 IVNPAYETWEVQDQMLLTWLQSTLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQL 125 Query: 398 RAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLS 577 R +L +T +S+ +F ++K I D L +G +SL+E DVVL+ LP EY + +++ Sbjct: 126 RTDLCSTTLDGKSMRDFLTQIKNIADELAGVGSPVSLEEYVDVVLEGLPQEYAPVVSVIE 185 Query: 578 TKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAISK 706 +K HESR A Q + S+N +Q S+ Sbjct: 186 SKFVTPPIAEVEALLLAHESR-ANQFRKQSFSPSINYTQGYSR 227 >gb|PNX90878.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 324 Score = 160 bits (406), Expect = 4e-43 Identities = 89/304 (29%), Positives = 153/304 (50%), Gaps = 1/304 (0%) Frame = +2 Query: 26 STNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDAD 205 +TN+ S K + ++ KL N LW QQV V+ AH LHRF V+PE+P ++ + D Sbjct: 21 NTNKESTK--SGLTHSLTIKLDEKNFLLWSQQVNGVITAHNLHRFVVNPEIPLQFASVTD 78 Query: 206 RLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQ 385 RL + Y +W +L +V+ C++++++W+KIH + ++ K + Sbjct: 79 RLDGKNSDEYQKWLFKDQSLFTWLLSTISDGVLPRVLSCKHSYEVWEKIHTYFNSVLKSR 138 Query: 386 ARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESIT 565 ARQLR+EL NTKK R++ E+ R+K+I ++L++IG+ +S +EQ D +L+ L E+ Sbjct: 139 ARQLRSELKNTKKHSRTVNEYLLRIKSIVNSLIAIGDVVSEREQVDAILEGLSEEFNPFV 198 Query: 566 TLLSTKSEWXXXXXXXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSESQPKSTDDIA 742 ++ ++S+ ES+ K + ++A S N++Q SK + ++ Sbjct: 199 MMVYSRSDTPKVEDVEALLLLQESQFEKFRQELANPSVSANVAQIESKDSNHNSDSEGQD 258 Query: 743 VTTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQ 922 T+ + VQCQ+CSR HDAS+C++RY P Sbjct: 259 SGTE-----HYNVKTHRGRGRNKGRGRSRGKTPNTGRVQCQICSRKNHDASICWYRYDPS 313 Query: 923 FAAP 934 + P Sbjct: 314 SSRP 317 >ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanus cajan] Length = 385 Score = 161 bits (408), Expect = 8e-43 Identities = 94/292 (32%), Positives = 148/292 (50%), Gaps = 2/292 (0%) Frame = +2 Query: 41 SLKPFKSF-QYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAE 217 +L PF F ++EKL N W QQ+EPV+++HKL RF V+P+ P +YLT+ADR ++ Sbjct: 6 NLSPFSQFFSNSIAEKLDDSNYLHWRQQIEPVIKSHKLQRFVVNPQSPPQYLTNADRDSD 65 Query: 218 IENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQL 397 I NPAY WE +IL+ VIG +++Q+WDK+H++ H +TK ARQL Sbjct: 66 IVNPAYETWEVQDQMLLTWLQSTLSKTILSHVIGSVHSYQVWDKVHEYFHTQTKACARQL 125 Query: 398 RAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLS 577 R +L +T +S+ +F ++K I D L +G +SL+E D VL+ LP EY + +++ Sbjct: 126 RTDLCSTTLDGKSMRDFLTQIKNIADELAGVGSLVSLEEYVDAVLEGLPQEYAPVVSVIE 185 Query: 578 TKSEWXXXXXXXXXXXXHESRIAKQTDVATAVASLNLSQAIS-KSESQPKSTDDIAVTTD 754 +K HESR A + + S+N +Q S S S + + + Sbjct: 186 SKFVTPPIAEVEALLLAHESR-ANRFRKQSFSPSINYTQGYSCGSISGGRGSGN------ 238 Query: 755 TIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHR 910 +N QCQ+C ++GH A++C++R Sbjct: 239 --------------SGRRGGRSSGRGRGSRFANFQCQICFKYGHTANICFYR 276 >dbj|GAU15285.1| hypothetical protein TSUD_03520 [Trifolium subterraneum] Length = 392 Score = 161 bits (407), Expect = 1e-42 Identities = 98/302 (32%), Positives = 148/302 (49%), Gaps = 1/302 (0%) Frame = +2 Query: 23 SSTNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDA 202 SSTN S P KL N LW QQVE V+ A+KLHRF V+P++P +Y +++ Sbjct: 18 SSTNAASFTP--------KLKLDDGNYLLWSQQVEGVILANKLHRFVVNPQIPAKYASES 69 Query: 203 DRLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKV 382 DR + + AY +W S+L + IGCR+ FQ+WD+IH+H A K Sbjct: 70 DRELDRVSEAYDKWLVQDQMLFTWLLSTLAESVLPRTIGCRHAFQVWDQIHKHFEAHLKA 129 Query: 383 QARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESI 562 + RQLR+EL N KKG +SITEF R++ I D L+SIG++IS Q+Q D +L+ LP EY Sbjct: 130 KVRQLRSELKNVKKGTKSITEFVLRVRVIADTLISIGDSISEQDQIDSILEGLPEEYNPF 189 Query: 563 TTLLSTKSEWXXXXXXXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSESQPKSTDDI 739 ++ +S+ ES++ K + +++T AS NL+ + + Sbjct: 190 VMMIYGRSDSPSLYDIEGLLLVQESQLEKFRQELSTPSASANLAHSRGGRGNSGARGRGR 249 Query: 740 AVTTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTP 919 + P A T + CQ+C ++GH C++R+ Sbjct: 250 STRGRGRPAASPTG----------------------NRPTCQLCGKYGHHVIDCWYRFDE 287 Query: 920 QF 925 F Sbjct: 288 NF 289 >gb|KYP47787.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1299 Score = 167 bits (424), Expect = 2e-42 Identities = 107/334 (32%), Positives = 161/334 (48%), Gaps = 5/334 (1%) Frame = +2 Query: 5 LSSMASSSTNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPT 184 ++S ASSS + S +F V+ KL N W QQV V+RAH L RF V+P++P Sbjct: 1 MASSASSSDHSAS----HTFSQSVTCKLDDRNFMTWQQQVTAVIRAHDLERFVVNPKIPL 56 Query: 185 RYLTDADRLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHS 364 ++LT DR + NP Y W+ SI A+VI R+++Q+WD + QH Sbjct: 57 KFLTAEDRDSNTINPEYTVWDRKDSLLFSWLLSTLSESIQARVISYRHSYQIWDLVFQHF 116 Query: 365 HAKTKVQARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLP 544 H+ TKV+ QL EL KKG RS +E+ R+ TI + L S+G +S +E + ++ LP Sbjct: 117 HSLTKVKVAQLHPELRTIKKGTRSCSEYLLRISTIIEMLASVGSPVSPREHAECIIGGLP 176 Query: 545 AEYE---SITTLLSTKSEWXXXXXXXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSE 712 EY+ S+ T +++ + E+R+ + + V A++NL+Q + S+ Sbjct: 177 PEYDSLISVVTAFASRDDTFSPAELENVILAQEARLDQAKIVVLQEPATINLAQTVPVSQ 236 Query: 713 SQPKSTDDIAVTTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDA 892 P + +IA + Q S VQCQ+C + GH+A Sbjct: 237 VPPNA--NIAFQQPQLSHPQTNTYQNFPTQTREGGRGRRGGRNRGSPVQCQICHKRGHEA 294 Query: 893 SVCYHRYTPQFAAPAMVPNFGN-PYQYVRPGYVP 991 S CY R+TP A P FG+ P RP P Sbjct: 295 STCYQRFTPMLAP---YPTFGSGPVFPSRPSAAP 325 >gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus cajan] Length = 202 Score = 152 bits (384), Expect = 3e-41 Identities = 72/198 (36%), Positives = 118/198 (59%) Frame = +2 Query: 50 PFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENP 229 P +F + +SEKL N LW QQVEPV++ H+LH F V+P++P ++LT +D+ + Sbjct: 3 PSLTFAHTISEKLDTKNYLLWCQQVEPVIKGHRLHHFLVNPQIPPKFLTISDKDENCVSE 62 Query: 230 AYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAEL 409 Y+ WE +L VIGC+ +FQ+WDKIH++ HA T +ARQLR++L Sbjct: 63 EYLAWEQQDQLLLSWLQSSMSKDMLTHVIGCKSSFQIWDKIHEYFHAHTNAKARQLRSDL 122 Query: 410 HNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSE 589 +T +I+++ R++++ D+L +IG+++S +E +VL LP EYES +L+S++ + Sbjct: 123 RSTTLDNGTISDYLLRIQSLVDSLTAIGDSVSSKEHLGIVLDGLPEEYESTVSLISSRFD 182 Query: 590 WXXXXXXXXXXXXHESRI 643 HESR+ Sbjct: 183 VLSIEEVETLLLAHESRL 200 >dbj|GAU19342.1| hypothetical protein TSUD_336290 [Trifolium subterraneum] Length = 1442 Score = 162 bits (411), Expect = 8e-41 Identities = 95/311 (30%), Positives = 158/311 (50%), Gaps = 2/311 (0%) Frame = +2 Query: 68 YQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENPAYVEWE 247 + ++ KL N LW QQV V+ AH LHRF V+P++P ++ +DADR+A+ + Y +W Sbjct: 35 HSLTIKLDEKNYLLWNQQVNGVITAHDLHRFIVNPQIPIQFASDADRVADRTSDEYRQWI 94 Query: 248 XXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAELHNTKKG 427 S+L +V+GC++ FQ+WD+IH++ H+ + +ARQLR+EL NTKK Sbjct: 95 FKDQTLFTWLLSTLSDSVLPRVLGCKHAFQVWDQIHKYFHSVLQARARQLRSELKNTKKA 154 Query: 428 ERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSEWXXXXX 607 RS+ E+ R+K+I ++LL++G+ +S +EQ D +L+ LP E+ S ++ ++ + Sbjct: 155 SRSVGEYLLRIKSIVNSLLAVGDLVSDREQVDAILEGLPEEFNSFVMMVYSRFDTPTVED 214 Query: 608 XXXXXXXHESRIAKQTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDTIPQAQFTASX 787 E++ K A S++ A++ S+ S D + T Sbjct: 215 VEALLLLQEAQFEKFRQ-ELASPSVSAHVALTDSKMSDNSVDQDSHEVGTEHYVAGKGRG 273 Query: 788 XXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQFAAPAMV-PNFGNPY 964 QCQ+CS+ HDA C++RY P +P+M+ G+ Sbjct: 274 RGKGRGKGRSRGRGSYSGGNQGTQCQICSKSSHDAVNCWYRYHP---SPSMMNAPRGHAV 330 Query: 965 QYVR-PGYVPP 994 + R P Y PP Sbjct: 331 AHSRPPPYNPP 341 >gb|PNX81124.1| histone deacetylase, partial [Trifolium pratense] Length = 660 Score = 160 bits (405), Expect = 1e-40 Identities = 89/304 (29%), Positives = 155/304 (50%), Gaps = 1/304 (0%) Frame = +2 Query: 26 STNQPSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDAD 205 +T++ S K + ++ KL N LW QQV V+ AH LHR V+PE+P ++ + D Sbjct: 21 NTSRESTK--SGLTHSLTIKLDEKNFLLWSQQVNGVITAHNLHRLVVNPEIPLQFASVTD 78 Query: 206 RLAEIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQ 385 RL + Y +W +L +V+ C++++++W+KIH + ++ K + Sbjct: 79 RLDGKNSEEYQKWLFKDQTLFTWLLSTISDGVLPRVLSCKHSYEVWEKIHTYFNSVLKSR 138 Query: 386 ARQLRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESIT 565 ARQLR+EL NTKK R++ E+ R+K+I ++L++IG+ +S +EQ D +L+ L E+ Sbjct: 139 ARQLRSELKNTKKHSRTVNEYLLRIKSIVNSLIAIGDVVSEREQVDAILEGLSEEFNPFV 198 Query: 566 TLLSTKSEWXXXXXXXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSESQPKSTDDIA 742 ++ ++++ ES++ K + ++A S N++Q SK+ + S + Sbjct: 199 MMVYSRTDTPSVEDVEALLLLQESQLEKFRQELANPSVSANVAQIESKNSNNSDSEGQES 258 Query: 743 VTTDTIPQAQFTASXXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQ 922 T AQ VQCQ+CS++ HDAS+C+HRY P Sbjct: 259 ATEHYNFNAQ---RGRGRNKGRGRGRGKAQVASNTGKVQCQICSKYNHDASICWHRYDPS 315 Query: 923 FAAP 934 + P Sbjct: 316 SSRP 319 >gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus cajan] Length = 363 Score = 155 bits (391), Expect = 2e-40 Identities = 82/228 (35%), Positives = 129/228 (56%), Gaps = 8/228 (3%) Frame = +2 Query: 38 PSLKPFKSFQYQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADR-LA 214 P+ P +F + +SEKL N LW QQV+PV++ H+LH F V+P++P ++L ADR + Sbjct: 12 PNSHPSLTFAHTISEKLDTKNYLLWCQQVKPVIKGHRLHHFLVNPQIPQKFLNLADRDVG 71 Query: 215 EIENPAYVEWEXXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQ 394 I P Y+ WE +L +VIGC+ +FQLWDKIH + H+ +ARQ Sbjct: 72 RISEP-YLAWEQQDQLLLSWLQSSMSKDMLTRVIGCKTSFQLWDKIHSYFHSHMNAKARQ 130 Query: 395 LRAELHNTKKGERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLL 574 LR EL +T ++I+++ +++T+ D L +IG+++S +E D++L+ LP EYES +L+ Sbjct: 131 LRNELRSTNLENQTISDYVLQIQTLVDTLTAIGDSVSPKEHLDIILEGLPEEYESTVSLI 190 Query: 575 STKSEWXXXXXXXXXXXXHESR-------IAKQTDVATAVASLNLSQA 697 S++ + HESR +A +V T NLS A Sbjct: 191 SSRFDLLSIEEVETLLLGHESRLDKFKKKVAVSLNVTTTTLEPNLSLA 238 >gb|PNY05793.1| histone deacetylase [Trifolium pratense] Length = 1485 Score = 162 bits (409), Expect = 2e-40 Identities = 95/313 (30%), Positives = 159/313 (50%), Gaps = 14/313 (4%) Frame = +2 Query: 68 YQVSEKLTGDNLPLWLQQVEPVLRAHKLHRFCVSPEVPTRYLTDADRLAEIENPAYVEWE 247 + ++ KL N LW QQV V+ AH LHRF ++P++P ++ T+ +R Y +W Sbjct: 47 HSLTIKLDEKNFLLWSQQVNGVITAHNLHRFILNPKIPLKFATEEERATNTFCDEYRKWL 106 Query: 248 XXXXXXXXXXXXXXXPSILAKVIGCRYTFQLWDKIHQHSHAKTKVQARQLRAELHNTKKG 427 +L +V+GC++ +Q+WDKIH++ ++ K +ARQLR+EL NTKK Sbjct: 107 MQDQTLFTWLLSTLSDGVLPRVLGCKHAYQVWDKIHKYFNSLLKARARQLRSELKNTKKL 166 Query: 428 ERSITEFFARLKTITDALLSIGETISLQEQYDVVLQVLPAEYESITTLLSTKSEWXXXXX 607 RSI E+ R+KTI D+L +IG+T+S QE D +L+ LP E+ S ++ ++ + Sbjct: 167 ARSIGEYLLRIKTIIDSLTAIGDTVSDQEHIDAILEGLPEEFNSFVMMIYSRLDTPTVED 226 Query: 608 XXXXXXXHESRIAK-QTDVATAVASLNLSQAISKSESQPKSTDDIAVTTDTIPQAQFTAS 784 E++ K + ++A+ S NL+Q+ +K+ ++ + ++ V T+ Sbjct: 227 VEALLMVQEAQFEKFRQELASPNVSANLAQSEAKTSAESVNHENTEVGTEHYA----AGG 282 Query: 785 XXXXXXXXXXXXXXXXXXXXXSNVQCQVCSRFGHDASVCYHRYTPQ----------FAA- 931 S+ CQ+C + GH+AS C++RY P FA+ Sbjct: 283 KGRGRGRGRGRGRGRSSKNPHSDSTCQICGKNGHEASGCWYRYDPNPTPKQFKFPGFASS 342 Query: 932 --PAMVPNFGNPY 964 P+M P NPY Sbjct: 343 SNPSMRPPSYNPY 355