BLASTX nr result
ID: Rehmannia31_contig00018239
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00018239
         (804 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu...   224   6e-67
ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotei...   183   3e-50
gb|AAK29467.1| polyprotein-like [Solanum chilense]                    154   7e-39
gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposo...   148   1e-38
gb|PKI48613.1| hypothetical protein CRG98_031032 [Punica granatum]    148   4e-38
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   152   6e-38
gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposo...   149   4e-37
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   149   4e-37
gb|OTG02614.1| putative zinc finger, CCHC-type, Ribonuclease H-l...   149   4e-37
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   146   6e-36
gb|KYP50611.1| Retrovirus-related Pol polyprotein from transposo...   142   1e-35
gb|KYP48283.1| Retrovirus-related Pol polyprotein from transposo...   139   1e-34
gb|OAE31341.1| hypothetical protein AXG93_4510s1170 [Marchantia ...   137   1e-34
gb|KYP46254.1| Retrovirus-related Pol polyprotein from transposo...   141   1e-34
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   141   2e-34
gb|ACL97387.1| Gag-Pol polyprotein [Lotus japonicus]                  140   4e-34
gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subc...   140   4e-34
gb|ABA98656.1| retrotransposon protein, putative, Ty1-copia subc...   140   4e-34
gb|KYP34487.1| Retrovirus-related Pol polyprotein from transposo...   137   9e-34
gb|PNX74094.1| copia LTR rider, partial [Trifolium pratense]          139   1e-33

>ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum]
          Length = 472

 Score = 224 bits (571), Expect = 6e-67
 Identities = 124/271 (45%), Positives = 173/271 (63%), Gaps = 8/271 (2%)
 Frame = +3

Query: 6   EVMHVRGRSQY-----RFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGH 170
           E+  VRGR+++     R++++ +     N++ +                  CYNCG  GH
Sbjct: 193 EINSVRGRTRFGNFNSRYNSRSRSKTKTNRSKSRPRETNLRDDKIRDRR--CYNCGTKGH 250

Query: 171 YIREC--PNKKGNQNQSNDQANLASTS-ENAGDIFMVTGICDVHIVNSVHSSTVCENEWL 341
           YI++C  P ++      +D+  +++ S E+ G++F+V      +  NSV  ST   +EWL
Sbjct: 251 YIKDCRKPRRENRDRNYDDKEKVSNVSIESNGEVFVV------YEANSV--STFDMHEWL 302

Query: 342 IDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRH 521
           IDS CTFHMSPFK++F+N K     FVSMANEKKC + G+GDI L FD GY   LKNVR+
Sbjct: 303 IDSGCTFHMSPFKDIFTNLKYEHAGFVSMANEKKCEIKGLGDISLCFD-GYKMLLKNVRY 361

Query: 522 VPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNS 701
           VPDL +NL+SCAALE++GLEGR+G GLMKI+KGSLV+FKA ++ NLY+C     S+   +
Sbjct: 362 VPDLSHNLISCAALEENGLEGRWGKGLMKIMKGSLVVFKAERKRNLYIC---TASYDNIA 418

Query: 702 VNVVQEDKTDLWHKRLGHMSSKGLEILHKAG 794
            +V   D T LWHKRLGH+S KGL+ L + G
Sbjct: 419 ASVSVCDLTSLWHKRLGHISQKGLDFLKRDG 449

>ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Erythranthe guttata]
          Length = 598

 Score = 183 bits (465), Expect = 3e-50
 Identities = 93/169 (55%), Positives = 122/169 (72%)
 Frame = +3

Query: 297 VNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICL 476
           VNSV +S + ENEWL+DSAC++HM+P + +FS+Y +MKN  V++A+     V GIG +CL
Sbjct: 5   VNSVLAS-LSENEWLLDSACSYHMTPRREVFSDYVQMKNCGVTLADGTMIVVNGIGTVCL 63

Query: 477 KFDSGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNN 656
           KF SG   TLKNVRHVP L +NL+SCA LEDDG  G +G+G M I+KGS  +FKA++  N
Sbjct: 64
KFVSGSVLTLKNVRHVPTLSHNLISCAVLEDDGFRGDWGDGCMNIMKGSRYLFKALRMGN 123 Query: 657 LYVCIGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803 +YVC + + S+NVVQ D ++LWHK LGHMS+K L ILHK FG Sbjct: 124 MYVCSAE----SSASMNVVQNDLSELWHKGLGHMSNKWLSILHKNQYFG 168 >gb|AAK29467.1| polyprotein-like [Solanum chilense] Length = 1328 Score = 154 bits (390), Expect = 7e-39 Identities = 85/220 (38%), Positives = 125/220 (56%), Gaps = 5/220 (2%) Frame = +3 Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTS---ENAGDIFMVTGICD--VHIVNSV 308 CYNC + GH+ R+CPN K + +S+ Q N +T+ +N D+ ++ + +H+ + Sbjct: 233 CYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGT- 291 Query: 309 HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDS 488 E+EW++D+A ++H +P ++LF Y V M N + GIGDIC K + Sbjct: 292 ------ESEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNV 345 Query: 489 GYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVC 668 G LK+VRHVPDL NL+S AL+ DG E F N ++ KG+LVI K V R LY Sbjct: 346 GCTLVLKDVRHVPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRT 405 Query: 669 IGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHK 788 + N+ + +E+ DLWHKR+GH S KGL+IL K Sbjct: 406 NAEICQGELNAAH--EENSADLWHKRMGHTSEKGLQILSK 443 >gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Apostasia shenzhenica] Length = 365 Score = 148 bits (373), Expect = 1e-38 Identities = 89/257 (34%), Positives = 128/257 (49%), Gaps = 2/257 (0%) Frame = +3 Query: 21 RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200 RGRS+ +F NQ ++ + +N N CY C + GH+ R+CP K Sbjct: 91 RGRSKNKFGNQYRYRSISKENDNR-----------------CYYCKKEGHWKRDCPKKSK 133 Query: 201 NQNQ--SNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMSP 374 Q Q S ++A++AS E + +C ++S S W++DS C++HM P Sbjct: 134 QQQQKKSGEEASVASRLEKDSET-----LCTFSCMDSSDS-------WILDSDCSYHMCP 181 Query: 375 FKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMSC 554 F++ FS Y V M N +C GIG I +K G TL VRHVPDL L+S Sbjct: 182 FRDWFSTYSIHDGGRVIMGNNSECKSVGIGTIKIKMFDGVIRTLTEVRHVPDLRKGLISL 241 Query: 555 
AALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTDL 734 L+ G +G++K+ KG+ V+ K K +LY IGK ++ + +D T L Sbjct: 242 GTLDASGCTFIGSDGIIKVKKGAPVVMKGEKIESLYRLIGKTITGDIAVTSSTDDDDTML 301 Query: 735 WHKRLGHMSSKGLEILH 785 WH RLGHMS +GL LH Sbjct: 302 WHARLGHMSERGLLELH 318 >gb|PKI48613.1| hypothetical protein CRG98_031032 [Punica granatum] Length = 435 Score = 148 bits (373), Expect = 4e-38 Identities = 88/260 (33%), Positives = 132/260 (50%), Gaps = 2/260 (0%) Frame = +3 Query: 9 VMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECP 188 V RGRSQ + N +++ N CY+ G+ GHY REC Sbjct: 84 VTEQRGRSQSK--------NSSSKHGNSGDKSRGRSKSKTRKVVTCYHYGKEGHYKRECR 135 Query: 189 NKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTF 362 K NQN + + T+ A G+ ++V CD VN T ++ W+ D+ +F Sbjct: 136 ALKKNQNGNGESKKEEGTTTVASDGETYIV---CDEAYVNF----TCQDSTWVADTGVSF 188 Query: 363 HMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNN 542 H++P ++ FS+Y +V M N + C + GIGD+CL+ + G LK VRHVP++ N Sbjct: 189 HVTPHRDFFSSYTTGDYGYVRMGNGQSCKIVGIGDVCLETELGCKLLLKKVRHVPEIRLN 248 Query: 543 LMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQED 722 L+S L D+G F NG K+ KGSL++ + K + LY + S VNV ++ Sbjct: 249 LISMGQLNDEGYSNEFSNGRWKLSKGSLIVARGQKTDTLYRLRARHNS---GQVNVAEDY 305 Query: 723 KTDLWHKRLGHMSSKGLEIL 782 LWH+RL H+S KG++IL Sbjct: 306 SIKLWHRRLRHISEKGIQIL 325 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 152 bits (383), Expect = 6e-38 Identities = 86/227 (37%), Positives = 125/227 (55%), Gaps = 12/227 (5%) Frame = +3 Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323 CYNC + GH+ R+CPN + + +++ Q N +T+ + + ++V ++ Sbjct: 232 CYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQN--------NDNVVLFINEEEE 283 Query: 324 
C------ENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFD 485 C E+EW++D+A + H +P ++LF Y V M N + GIGDIC+K + Sbjct: 284 CMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTN 343 Query: 486 SGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLY- 662 G LK+VRHVPDL NL+S AL+ DG E F N ++ KGSLVI K V R LY Sbjct: 344 VGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYR 403 Query: 663 ----VCIGKPVSFAGNSVNVVQED-KTDLWHKRLGHMSSKGLEILHK 788 +C G+ +N Q++ DLWHKR+GHMS KGL+IL K Sbjct: 404 TNAEICQGE--------LNAAQDEISVDLWHKRMGHMSEKGLQILAK 442 >gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 690 Score = 149 bits (375), Expect = 4e-37 Identities = 92/270 (34%), Positives = 136/270 (50%), Gaps = 3/270 (1%) Frame = +3 Query: 3 GEVMHVRGRSQYR--FDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYI 176 GE + VRGR+Q + N++ + + SN C C + GH I Sbjct: 197 GEGLSVRGRTQEKGSTSNKKSRSKSRGRKSNKT----------------CRYCKKFGHDI 240 Query: 177 RECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSAC 356 +C K Q + N A + D D ++ SV S + EW++DS C Sbjct: 241 SDCFILKKKQERQEKGKNPAEAANVETD-------SDGDVMISVSSDKRSKTEWILDSGC 293 Query: 357 TFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLC 536 TFHM P+K+LF+ + + + V M N+ +C + GIG I +K G TL NVR +PDL Sbjct: 294 TFHMCPYKDLFTTLEPVDSGVVLMGNDTQCKIAGIGTIQIKTHDGTIKTLSNVRFIPDLK 353 Query: 537 NNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVS-FAGNSVNVV 713 NL+S LE G + G++K+ KG++V+ KA + +LY+ G V+ A S ++ Sbjct: 354 RNLISLGTLESLGCKYSAEGGVLKVSKGAIVLLKANRIGSLYILQGSIVTGSAAVSSSMS 413 Query: 714 QEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803 +D T LWH RLGHMS KG+ +L K G G Sbjct: 414 DKDATKLWHMRLGHMSEKGMHLLSKQGLLG 443 >gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1335 Score = 149 bits (377), Expect = 4e-37 Identities = 81/224 (36%), Positives = 129/224 (57%), Gaps = 6/224 (2%) Frame = +3 Query: 144 CYNCGEVGHYIREC-----PNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSV 308 C+ CG+ GH+ ++C NK Q N +++LA ++E 
++ + +V Sbjct: 221 CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMVLLATDETLVVTDS 280 Query: 309 HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDS 488 + NEW++D+ C+FHM+P K+ F ++KE+ + +V M N+ V GIG I ++ Sbjct: 281 IA-----NEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKGIGSIKIRNSD 335 Query: 489 GYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVC 668 G L +VR++P++ NL+S LED G + +G++KI+KG I K KR+ LY+ Sbjct: 336 GSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCSTILKGQKRDTLYIL 395 Query: 669 IGKPVSFAGNSVNVVQ-EDKTDLWHKRLGHMSSKGLEILHKAGC 797 G V+ G S + + +D+T LWH RLGHMS KG+EIL K GC Sbjct: 396 DG--VTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGC 437 >gb|OTG02614.1| putative zinc finger, CCHC-type, Ribonuclease H-like domain, GAG-pre-integrase domain protein [Helianthus annuus] Length = 702 Score = 149 bits (375), Expect = 4e-37 Identities = 79/226 (34%), Positives = 121/226 (53%), Gaps = 11/226 (4%) Frame = +3 Query: 144 CYNCGEVGHYIRECP-----------NKKGNQNQSNDQANLASTSENAGDIFMVTGICDV 290 C++CG GH I+ C N K N ++ +D N + A + F + CD Sbjct: 365 CHHCGRKGHTIKFCRQLKKEKKKADYNNKKNNHKKDDGGNDTAEVNTATEEFFIC--CDD 422 Query: 291 HIVNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDI 470 +VN ++ W++DS T H++ ++ FS+Y V M N + G+GD+ Sbjct: 423 DVVNITRD----DSSWVVDSGATCHVTSQRDFFSSYTPGDFGVVKMGNNGLSKIIGVGDV 478 Query: 471 CLKFDSGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKR 650 CLKFD+G L NV+HV D+ NL+S L+DDG FG+G+ K+ +GSL++ + + Sbjct: 479 CLKFDTGMELVLHNVKHVSDIRLNLISAGLLDDDGYHSTFGDGVWKLTRGSLIVARGKRS 538 Query: 651 NNLYVCIGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHK 788 + LY + P + ++V D T+LWHKRLGHMS KG+ IL K Sbjct: 539 SKLY--MAHPKISTDSVHSLVDNDMTELWHKRLGHMSEKGMHILLK 582 >gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum] Length = 1309 Score = 146 bits (368), Expect = 6e-36 Identities = 85/270 (31%), Positives = 135/270 (50%), Gaps = 3/270 (1%) Frame = +3 Query: 3 GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182 GE ++VRGR+ R +K H++Q+ C+ C + GH+ ++ Sbjct: 195 
GEGLNVRGRTYKRESRNEKGGKHRSQSRTRGKLK-------------CFVCHKEGHFKKD 241 Query: 183 CPNKKG-NQNQSNDQANLASTSEN--AGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSA 353 CP+++ N + D + A S+ + ++ +V S T ++ W++DS Sbjct: 242 CPDRRARNPERRKDPGDAAVVSDGYESAEVLVV-------------SRTNKQDCWVMDSG 288 Query: 354 CTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDL 533 C+FHM P K+ F N E ++ V + N ++C V GIG + LK G T+ VR+VPDL Sbjct: 289 CSFHMCPIKSWFQNLVEEESGHVLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDL 348 Query: 534 CNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVV 713 NL+S L+ G + G MK++KGSL + + + N LY+ V+ + N+ V Sbjct: 349 RRNLLSIGMLDSKGFNVKIEGGTMKVIKGSLTVMRGSQDNGLYILEASTVTGSSNAA-VG 407 Query: 714 QEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803 +K LWH RLGH+S KGL L K G Sbjct: 408 GANKARLWHLRLGHVSEKGLVELSKQNLLG 437 >gb|KYP50611.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 448 Score = 142 bits (357), Expect = 1e-35 Identities = 81/223 (36%), Positives = 118/223 (52%), Gaps = 3/223 (1%) Frame = +3 Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323 C C E GH+ +CP KK + + S+S+N +V I D H ++ Sbjct: 1 CNYCKEPGHWKNDCPKKKNQKPTAVTVQESTSSSDNE----LVLSIVDNHQQSA------ 50 Query: 324 CENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYT 503 ++W++DS C++HM P ++ F Y+E V M N+ C GIG I L+ G T Sbjct: 51 --DQWVLDSGCSYHMCPNRSWFLTYEERLGGRVFMGNDMPCKTVGIGTIQLRMHDGVIRT 108 Query: 504 LKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPV 683 L VRHVPDL NL+S L+ G + NG+M+I +GS V+ + K+ NLY+ G Sbjct: 109 LTEVRHVPDLKKNLISVGVLDSKGFKCNVKNGVMEIKRGSTVVMRGFKKGNLYMLQGSTS 168 Query: 684 SFAGNSVNVVQE---DKTDLWHKRLGHMSSKGLEILHKAGCFG 803 S + SV+V ++ D T LWH RLGHMS +G+ IL + G Sbjct: 169 SIS-ESVSVAEKNIPDLTYLWHMRLGHMSERGMMILSRQQLLG 210 >gb|KYP48283.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 431 Score = 139 bits (349), Expect = 1e-34 Identities = 85/258 (32%), Positives = 128/258 (49%) Frame = +3 Query: 21 
RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200 RG+S+ DN+ K NH++ N++ C+NCG+ GHY +C N Sbjct: 192 RGKSR---DNRSKSRNHRSSNNSKTIK--------------CWNCGQTGHYKNQCKNAPK 234 Query: 201 NQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMSPFK 380 NQ ++ +AN+ASTS D ++ S+ S E W++DS +FH + + Sbjct: 235 NQ-EAKAEANIASTSGR-----------DDALICSLESK---EESWVLDSGASFHATSQR 279 Query: 381 NLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMSCAA 560 F NY V + NE+ C + G G + +K G + LKNVRH+PDL NL+S Sbjct: 280 EFFENYVPGNLGKVYLGNEQSCEIVGKGVVKIKL-KGSVWELKNVRHIPDLTKNLISVGQ 338 Query: 561 LEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTDLWH 740 L +G F KI KG++ I + K LY G A + + + D +LW+ Sbjct: 339 LASEGYTTTFHGDNWKISKGAMTIARGKKSGTLYKTAG-----AYHLIAIAANDNPNLWY 393 Query: 741 KRLGHMSSKGLEILHKAG 794 +RLGHMS KG++I+H G Sbjct: 394 QRLGHMSEKGMKIMHSKG 411 >gb|OAE31341.1| hypothetical protein AXG93_4510s1170 [Marchantia polymorpha subsp. ruderalis] Length = 344 Score = 137 bits (344), Expect = 1e-34 Identities = 77/222 (34%), Positives = 119/222 (53%), Gaps = 2/222 (0%) Frame = +3 Query: 144 CYNCGEVGHYIRECPN--KKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSS 317 C+ C ++GH ++C + KK +NQ++ + + S N GD G D ++ + S Sbjct: 84 CFYCNKMGHLKKDCYSWIKKTKENQAST-SQVHQDSANFGD-----GYSDGEVLMA--SG 135 Query: 318 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYA 497 + +EW++DS CT+HM+ K+ S+Y+E+ V M N C+V GIG + +K G Sbjct: 136 KLKASEWILDSGCTYHMTHNKHWLSDYQELNEGKVIMGNNHSCSVAGIGSVSIKMADGVV 195 Query: 498 YTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGK 677 L+NVR VP+L NL+S L+D G + G M I KG+ I K +K LY IG+ Sbjct: 196 RILENVRWVPELSRNLISIGILDDLGYTNKIEQGSMYIAKGA-TILKGMKIGGLYYLIGE 254 Query: 678 PVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803 + + K LWH+RLGH+S KGL+++ K G Sbjct: 255 TLYGVASVATTQDHSKATLWHRRLGHISEKGLQLMSKQNLLG 296 >gb|KYP46254.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 609 Score = 141 bits (355), Expect = 1e-34 Identities = 84/223 (37%), 
Positives = 116/223 (52%), Gaps = 3/223 (1%) Frame = +3 Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323 C C E GH+ ++CP KK NQ A S+ EN +V I D H HS+ Sbjct: 163 CNYCKEPGHWKKDCP-KKRNQKSVAVAAQEDSSFENE----LVLSIVDNH----QHSA-- 211 Query: 324 CENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYT 503 +W++DS C++HM P ++ F Y++ V M N+ C GIG I +K G T Sbjct: 212 --EQWVLDSGCSYHMCPNRSWFLTYEKKSGGDVFMGNDMACKTIGIGTIQIKMHDGVIRT 269 Query: 504 LKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPV 683 L VRHVPDL NL+S L G + G+M+I KGS V+ + +K+ NLY+ G Sbjct: 270 LTEVRHVPDLKKNLISVGVLHTKGFKCNVEGGVMEISKGSTVVIRGIKKGNLYMLQGS-T 328 Query: 684 SFAGNSVNVVQE---DKTDLWHKRLGHMSSKGLEILHKAGCFG 803 + SV+V + D T LWH RLGHMS +G+ +L K G Sbjct: 329 NLISESVSVADKHTPDLTHLWHMRLGHMSERGMMVLRKQKLLG 371 >gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group] gb|ABA92739.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1373 Score = 141 bits (356), Expect = 2e-34 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%) Frame = +3 Query: 3 GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182 GE +HVRGR++ R N++ +D S C C H I E Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244 Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347 C + + ++ A+ A++ +++GD +V C +EW++D Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293 Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524 SAC+FH+ +N FS+YK + K V M ++ C + GIG + +K D G TLKNVR++ Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353 Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701 P + NL+S + L+ +G + +G++K+ KGSLV K V LYV G ++ + ++ Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSA 413 Query: 702 VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788 + D KT+LWH RLGHMS G+ L K Sbjct: 414 
AAAITNDEPSKTNLWHMRLGHMSHLGMTELMK 445 >gb|ACL97387.1| Gag-Pol polyprotein [Lotus japonicus] Length = 1305 Score = 140 bits (354), Expect = 4e-34 Identities = 80/219 (36%), Positives = 121/219 (55%), Gaps = 6/219 (2%) Frame = +3 Query: 144 CYNCGEVGHYIREC-PNKKGNQNQSN---DQANLASTSENAGDIFMVTGICDVHIVNSVH 311 CYNCG+ GH ++C NKK + S Q +ASTS++ ++ + S Sbjct: 230 CYNCGKRGHLKKDCWSNKKSGEKSSEASTSQGCVASTSDDGEVLYSEAAV-------STK 282 Query: 312 SSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSG 491 + W++DS T+HM+P ++ F Y+ + V M N+ + GIG + +K G Sbjct: 283 GKNRLTDVWIVDSGATWHMTPRRDWFCTYEPVSEGNVFMGNDHALEIVGIGTVKIKMYDG 342 Query: 492 YAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVK-RNNLYVC 668 TL+ VRHV +L NL+S L+D G + G++K++KGSLV+ KA K NLY+ Sbjct: 343 TIRTLQEVRHVKELAKNLLSVGQLDDLGYKYDIQGGILKVVKGSLVVMKAKKVAANLYML 402 Query: 669 IGKPVSFAGNSVNV-VQEDKTDLWHKRLGHMSSKGLEIL 782 +G A SV V QE+ T +WH+RLGHMS +GL++L Sbjct: 403 LGDTWQMADASVAVGSQEETTMMWHRRLGHMSERGLKVL 441 >gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1333 Score = 140 bits (354), Expect = 4e-34 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%) Frame = +3 Query: 3 GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182 GE +HVRGR++ R N++ +D S C C H I E Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244 Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347 C + + ++ A+ A++ +++GD +V C +EW++D Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293 Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524 SAC+FH+ +N FS+YK + K V M ++ C + GIG + +K D G TLKNVR++ Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353 Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701 P + NL+S + L+ +G + +G++K+ KGSLV K + LYV G ++ + ++ Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDLNSAKLYVLRGCTLTGSDSA 413 Query: 702 
VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788 V D KT+LWH RLGHMS G+ L K Sbjct: 414 AAAVTNDEPSKTNLWHMRLGHMSHLGMTELMK 445 >gb|ABA98656.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1333 Score = 140 bits (354), Expect = 4e-34 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%) Frame = +3 Query: 3 GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182 GE +HVRGR++ R N++ +D S C C H I E Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244 Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347 C + + ++ A+ A++ +++GD +V C +EW++D Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293 Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524 SAC+FH+ +N FS+YK + K V M ++ C + GIG + +K D G TLKNVR++ Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353 Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701 P + NL+S + L+ +G + +G++K+ KGSLV K + LYV G ++ + ++ Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDLNSAKLYVLRGCTLTGSDSA 413 Query: 702 VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788 V D KT+LWH RLGHMS G+ L K Sbjct: 414 AAAVTNDEPSKTNLWHMRLGHMSHLGMTELMK 445 >gb|KYP34487.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 485 Score = 137 bits (345), Expect = 9e-34 Identities = 84/261 (32%), Positives = 131/261 (50%), Gaps = 3/261 (1%) Frame = +3 Query: 21 RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200 RGRS+ R Q KF N C+NC + GH+ +C K Sbjct: 218 RGRSKSRAKGQPKFRND----------------------IVCWNCDKRGHFTNQCKAPKK 255 Query: 201 NQN---QSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMS 371 N+N + +D++ A+T E D ++ S+ S W++DS +FH + Sbjct: 256 NKNHKKRDDDESANAATDE-----------IDDALICSLDSPI---ESWIMDSGASFHTT 301 Query: 372 PFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMS 551 P L +NY + V +A+ K N+ G GDI ++ SG +TLKNVRH+P L NL+S Sbjct: 302 
PSNELLTNYVSGRFGKVYLADGKPLNIVGKGDIAIRTSSGSHWTLKNVRHIPALKRNLIS 361 Query: 552 CAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTD 731 L+D+G E FG+G K+ KG+L++ + KR +LY+ + + + N + Sbjct: 362 VGQLDDEGHETTFGDGAWKVKKGNLIVARGKKRGSLYMVADENMIAVTEAAN-----NSF 416 Query: 732 LWHKRLGHMSSKGLEILHKAG 794 LWH+RLGHMS KG++++ G Sbjct: 417 LWHQRLGHMSEKGMKLMATKG 437 >gb|PNX74094.1| copia LTR rider, partial [Trifolium pratense] Length = 876 Score = 139 bits (351), Expect = 1e-33 Identities = 80/222 (36%), Positives = 125/222 (56%), Gaps = 7/222 (3%) Frame = +3 Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVHIVNSVHSS 317 CY+C ++GH+ R+CPN K + S ANL T ++ GDI V SS Sbjct: 159 CYSCKQIGHWKRDCPNIKQGSSTS---ANLVHTDDSCSEGDILCV-------------SS 202 Query: 318 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYA 497 + C + W++DS C++HM+P + F+ ++ FV + ++K C + GIG I + D G Sbjct: 203 SKCTDAWILDSGCSYHMTPNREWFTTFRSGSFGFVYLGDDKACAITGIGQIKIAMDDGGV 262 Query: 498 YTLKNVRHVPDLCNNLMSCAALEDDGLEGRF--GNGLMKILKGSLVIFKAVKR--NNLYV 665 TL NVR++P+L NL+S L+++G R ++K+ KG+L + + VKR N+Y Sbjct: 263 RTLTNVRYIPELRKNLISLGTLQENGYSYRSDRDRDILKVSKGALTVMR-VKRTAGNIYK 321 Query: 666 CIGKPVSFAGNSVNVVQE-DKTDLWHKRLGHMSSKGLEILHK 788 +G V G+ +V + D T LWH RLGH+S +G+ LHK Sbjct: 322 LLGNTV--VGDVASVESDNDATKLWHLRLGHLSERGMMELHK 361