BLASTX nr result
ID: Lithospermum23_contig00009515
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00009515 (2104 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_010058631.1 PREDICTED: uncharacterized protein LOC104446483 [...   140   1e-49
JAT53351.1 Retrovirus-related Pol polyprotein LINE-1 [Anthurium ...   132   6e-47
JAU74353.1 hypothetical protein LE_TR15446_c14_g1_i1_g.48810, pa...   144   4e-46
JAU49119.1 hypothetical protein LC_TR10504_c5_g1_i1_g.36974, par...   144   4e-46
JAU25120.1 LINE-1 retrotransposable element ORF2 protein, partia...   129   1e-41
XP_013699633.1 PREDICTED: uncharacterized protein LOC106403339 [...   112   2e-41
XP_010068169.1 PREDICTED: uncharacterized protein LOC104455000 [...   112   5e-41
XP_010058169.1 PREDICTED: uncharacterized protein LOC104445938 [...   137   8e-41
JAT50758.1 Retrovirus-related Pol polyprotein LINE-1, partial [A...   120   1e-40
JAT63902.1 LINE-1 reverse transcriptase [Anthurium amnicola]          113   1e-36
XP_010541224.1 PREDICTED: uncharacterized protein LOC104814739 [...   110   2e-36
XP_011013113.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...   102   3e-36
XP_013650404.1 PREDICTED: uncharacterized protein LOC106354909 [...    97   3e-35
OMO64412.1 reverse transcriptase [Corchorus capsularis]               154   5e-35
JAT47559.1 Retrovirus-related Pol polyprotein LINE-1, partial [A...    99   1e-34
XP_013651296.1 PREDICTED: uncharacterized protein LOC106355988 [...    99   2e-33
XP_009140327.1 PREDICTED: uncharacterized protein LOC103864320 [...    97   5e-33
JAU41414.1 LINE-1 retrotransposable element ORF2 protein, partia...   107   5e-33
CCA66198.1 hypothetical protein [Beta vulgaris subsp.
vulgaris] 114 1e-32 XP_010671205.1 PREDICTED: uncharacterized protein LOC104888072 [... 119 2e-31 >XP_010058631.1 PREDICTED: uncharacterized protein LOC104446483 [Eucalyptus grandis] Length = 1755 Score = 140 bits (354), Expect(2) = 1e-49 Identities = 90/307 (29%), Positives = 160/307 (52%), Gaps = 10/307 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE- 1328 ++ W + + AP SDH + V +L KPFK+ +W + + ++ + Sbjct: 752 NAAWNTAFSYSEANFLAPGVSDHSPMVVRILPT-PISRKPFKFFNYWMSHPNFFELVRQI 810 Query: 1327 ---EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGI---- 1169 + G M +L+ +L++ + RLK LN E +S+I R E L E N + + Sbjct: 811 WELRMSGTPMFVLYSKLRSLKRRLKLLNKEAYSDISARTSE-ARRLLLEAQNAIQLDPHN 869 Query: 1168 KICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTM 989 + ++A + ++ ++ + KSR W K+GD NT FFH S+K H++NR+ ++ Sbjct: 870 QALADAEKNHLHIFSDLRLKEESFYRQKSRIRWLKEGDLNTKFFHHSVKRGHLRNRVLSI 929 Query: 988 HNKDGVIVKDFDQVKSIVIDFYKELF--ATPQEDTSIAATLNKFISDNIEDSDRSFLDQP 815 + VI D +V+ + +D ++ L +TP S+ + ++ ++D+ + QP Sbjct: 930 SDGSNVIT-DEAEVQRLFVDHFQNLLSASTPSAIPSV-EEIRANLASTLDDNHIQAISQP 987 Query: 814 FTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNST 635 FT EEI+ + + G+ PGPDGF ++F+K +W V V+ A++ FF+T + R NST Sbjct: 988 FTDEEIKSTLFSLASGKAPGPDGFNVDFFKRSWDIVGPSVLLAIRDFFSTGQLLREINST 1047 Query: 634 IIALIPK 614 I+ LIPK Sbjct: 1048 ILTLIPK 1054 Score = 87.0 bits (214), Expect(2) = 1e-49 Identities = 55/173 (31%), Positives = 85/173 (49%), Gaps = 7/173 (4%) Frame = -3 Query: 620 PKKTRACAYN----ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL 453 PK A N I+CCNT+YKC+T +LANRL + L +I Q+A+V GR IS I+L Sbjct: 1053 PKTPNASMVNDFRPIACCNTVYKCITKLLANRLASILPSIISVSQSAFVKGRRISDNIML 1112 Query: 452 MQDLSVWL**ELRCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLG 273 Q+L E + +++ F+ + + + Sbjct: 1113 AQELFAHFHHEPYFPKNIIKVDFSKAYDSVDWKFIELSLQAFGFPSIFIDRIMTCIRTPK 1172 Query: 272 FQLQSMGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMVKFQISQ---RFF 123 F + G GFF S +G+R GDP+SPY+F ++ME F+ ++ + S+ RFF 
Sbjct: 1173 FSIALNGDLHGFFPSGRGIRQGDPISPYIFTLVMEVFTGIINARTSKPGFRFF 1225 >JAT53351.1 Retrovirus-related Pol polyprotein LINE-1 [Anthurium amnicola] Length = 1225 Score = 132 bits (332), Expect(2) = 6e-47 Identities = 84/308 (27%), Positives = 153/308 (49%), Gaps = 11/308 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE- 1328 + EW+ S +H SDH + +S +D L + PKPFK+ W+ + D L II E Sbjct: 212 NEEWITLFPSSLLHYGPAFMSDHSLMLISTVDDLPKEPKPFKFHCMWTSHPDFLNIIKEA 271 Query: 1327 ---EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNI-------DTRVIEKGHELECECSN* 1178 + G+ M +LK+ + L++ N F ++ ++++ +L+ + N Sbjct: 272 WKSDSTGSPMFTFCQKLKSVKNALRSWNKYGFGDVLANISCAKKKLVQMQGQLQDDPLN- 330 Query: 1177 MGIKICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRI 998 I +E ++ + + ++ + K++Q W GD N+ FFH ++K M N I Sbjct: 331 -PDLISNEKSVREE--FSRAILAENSLARQKAKQFWLTQGDTNSKFFHAAIKARRMFNSI 387 Query: 997 STMHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQ 818 N GVI++D QVKS + F+++L +D I + + IS + + DR+ L++ Sbjct: 388 RKCRNAQGVILEDITQVKSYTLSFFQQLL---NQDRIIDSNPHLEISKILVEEDRNLLNR 444 Query: 817 PFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNS 638 ++ ++I+ ++ ++PGPDGFP EF++ W V + A+ F T + + + Sbjct: 445 RYSDDDIKAVVMKSPKMKSPGPDGFPAEFFQFCWDIVGKDFCSAIHNFLITGKLLKQVGT 504 Query: 637 TIIALIPK 614 T I LIPK Sbjct: 505 TFITLIPK 512 Score = 86.3 bits (212), Expect(2) = 6e-47 Identities = 51/148 (34%), Positives = 78/148 (52%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 IS CN +YK ++ +LANRLK L K+I Q A++ GR I ILL DL + + R Sbjct: 525 ISLCNFVYKVISKLLANRLKKVLDKIISPHQMAFIEGRKIQDSILLANDLVKNIHSKSRG 584 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L + + F+ + ++ + ++ + + + GS GFF Sbjct: 585 NVSALKADLRKAFDSVHRPFIYFMMQKMGFPLEFIDRIRACLEVARYSILFNGSPMGFFD 644 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147 SS G+R GDPLSPYLF+++ME FS M++ Sbjct: 645 SSNGIRQGDPLSPYLFVLVMEGFSVMMR 672 >JAU74353.1 hypothetical protein 
LE_TR15446_c14_g1_i1_g.48810, partial [Noccaea caerulescens] Length = 1124 Score = 144 bits (364), Expect(2) = 4e-46 Identities = 94/311 (30%), Positives = 155/311 (49%), Gaps = 12/311 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQ-GPKPFKYMPFWSQYSDSLRIINE 1328 + WLL S AP SDH V ++ G +PF++ F ++ L + E Sbjct: 111 NDNWLLLFPSSHCIFEAPEFSDHTPCHVKLVTPPPNFGTRPFRFFNFVAKLPSFLHCVQE 170 Query: 1327 EIEG-----NDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIK- 1166 + ++ L +LK + LK L E+FSN++ RVIE EL+ + + Sbjct: 171 SWDAAGSSATNLKTLGFKLKTLKRPLKTLCKEKFSNLEMRVIEANDELKSIQLQVLNVPS 230 Query: 1165 ---ICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRIS 995 + E DK I + Q+ ++ + +SR W +GD +T FFH K + N I Sbjct: 231 PAILLMEQTARDKWMI--LRQAEESFFRQRSRIKWLAEGDFDTRFFHLVTKARNSSNAIK 288 Query: 994 TMHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTS--IAATLNKFISDNIEDSDRSFLD 821 + DG + V +Y++++ T + D ++ L + I ++ D+ R + Sbjct: 289 YLKKDDGTRTTTLEDVHLHAQTYYEKIYNTLKGDYCPFLSVLLERVILNHCSDAQRRLMK 348 Query: 820 QPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFN 641 FT+E I ++ M L +TPGPDGFP+EF+KA+W + + V++AVK FFAT MP+ N Sbjct: 349 SDFTSESIISSLSKMPLNKTPGPDGFPVEFFKASWGVIGKEVIDAVKEFFATSFMPKALN 408 Query: 640 STIIALIPKKQ 608 +T + LIPK++ Sbjct: 409 ATSLVLIPKRR 419 Score = 71.2 bits (173), Expect(2) = 4e-46 Identities = 49/156 (31%), Positives = 81/156 (51%), Gaps = 9/156 (5%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL---------MQDLS 438 ISC NT+YK +T +L++RLK+ L ++I QTA++ R + +LL +DLS Sbjct: 430 ISCLNTVYKLITRLLSDRLKSALPEIILPNQTAFIKDRLLLENVLLAAEVINGYHRKDLS 489 Query: 437 VWL**ELRCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQS 258 + ++ A DS + + C+ S+ I +L++ +K + + + Sbjct: 490 PRITLKIDIA-KAFDSMRWDFILSCL--------SAYKIPSELIAWIKSCISSPSYSVSI 540 Query: 257 MGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMV 150 G+ G+FK GLR GDPLSP LF++ M S M+ Sbjct: 541 NGTTSGYFKGRTGLRQGDPLSPILFVMAMNVLSLML 576 >JAU49119.1 hypothetical protein LC_TR10504_c5_g1_i1_g.36974, partial [Noccaea caerulescens] Length = 
533 Score = 144 bits (364), Expect(2) = 4e-46 Identities = 94/311 (30%), Positives = 155/311 (49%), Gaps = 12/311 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQ-GPKPFKYMPFWSQYSDSLRIINE 1328 + WLL S AP SDH V ++ G +PF++ F ++ L + E Sbjct: 44 NDNWLLLFPSSHCIFEAPEFSDHTPCHVKLVTPPPNFGTRPFRFFNFVAKLPSFLHCVQE 103 Query: 1327 EIEG-----NDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIK- 1166 + ++ L +LK + LK L E+FSN++ RVIE EL+ + + Sbjct: 104 SWDAAGSSATNLKTLGFKLKTLKRPLKTLCKEKFSNLEMRVIEANDELKSIQLQVLNVPS 163 Query: 1165 ---ICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRIS 995 + E DK I + Q+ ++ + +SR W +GD +T FFH K + N I Sbjct: 164 PAILLMEQTARDKWMI--LRQAEESFFRQRSRIKWLAEGDFDTRFFHLVTKARNSSNAIK 221 Query: 994 TMHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTS--IAATLNKFISDNIEDSDRSFLD 821 + DG + V +Y++++ T + D ++ L + I ++ D+ R + Sbjct: 222 YLKKDDGTRTTTLEDVHLHAQTYYEKIYNTLKGDYCPFLSVLLERVILNHCSDAQRRLMK 281 Query: 820 QPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFN 641 FT+E I ++ M L +TPGPDGFP+EF+KA+W + + V++AVK FFAT MP+ N Sbjct: 282 SDFTSESIISSLSKMPLNKTPGPDGFPVEFFKASWGVIGKEVIDAVKEFFATSFMPKALN 341 Query: 640 STIIALIPKKQ 608 +T + LIPK++ Sbjct: 342 ATSLVLIPKRR 352 Score = 71.2 bits (173), Expect(2) = 4e-46 Identities = 49/156 (31%), Positives = 81/156 (51%), Gaps = 9/156 (5%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL---------MQDLS 438 ISC NT+YK +T +L++RLK+ L ++I QTA++ R + +LL +DLS Sbjct: 363 ISCLNTVYKLITRLLSDRLKSALPEIILPNQTAFIKDRLLLENVLLAAEVINGYHRKDLS 422 Query: 437 VWL**ELRCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQS 258 + ++ A DS + + C+ S+ I +L++ +K + + + Sbjct: 423 PRITLKIDIA-KAFDSMRWDFILSCL--------SAYKIPSELIAWIKSCISSPSYSVSI 473 Query: 257 MGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMV 150 G+ G+FK GLR GDPLSP LF++ M S M+ Sbjct: 474 NGTTSGYFKGRTGLRQGDPLSPILFVMAMNVLSLML 509 >JAU25120.1 LINE-1 retrotransposable element ORF2 protein, partial [Noccaea caerulescens] Length = 942 Score = 129 bits (325), Expect(2) = 1e-41 Identities = 78/237 (32%), 
Positives = 127/237 (53%), Gaps = 6/237 (2%) Frame = -1 Query: 1300 LHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIK----ICSEARCSDK* 1133 L +LK + LK L E+FSN++ RVIE EL+ + + + E DK Sbjct: 3 LGFKLKTLKRPLKTLCKEKFSNLEMRVIEANDELKSIQLQVLNVPSPAILLMEQTARDKW 62 Query: 1132 *I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFD 953 I + Q+ ++ + +SR W +GD +T FFH K + N I + DG + Sbjct: 63 MI--LRQAEESFFRQRSRIKWLAEGDFDTRFFHLVTKARNSSNAIKYLKKDDGTRTTTLE 120 Query: 952 QVKSIVIDFYKELFATPQEDTS--IAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLG 779 V +Y++++ T + D ++ L + I ++ D+ RS + T+E I ++ Sbjct: 121 DVHLHAQTYYEKIYNTLKGDCCPFLSVLLERVILNHCSDAQRSLMKSDVTSESIISSLSK 180 Query: 778 MKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPKKQ 608 M L +TPGPDGFP+EF+KA+W + + V++AVK FFAT MP+ N+T + LIPK++ Sbjct: 181 MPLNKTPGPDGFPVEFFKASWGVIGKEVIDAVKEFFATSFMPKALNATSLVLIPKRR 237 Score = 71.2 bits (173), Expect(2) = 1e-41 Identities = 49/156 (31%), Positives = 81/156 (51%), Gaps = 9/156 (5%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL---------MQDLS 438 ISC NT+YK +T +L++RLK+ L ++I QTA++ R + +LL +DLS Sbjct: 248 ISCLNTVYKLITRLLSDRLKSALPEIILPNQTAFIKDRLLLENVLLAAEVINGYHRKDLS 307 Query: 437 VWL**ELRCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQS 258 + ++ A DS + + C+ S+ I +L++ +K + + + Sbjct: 308 PRITLKIDIA-KAFDSMRWDFILSCL--------SAYKIPSELIAWIKSCISSPSYSVSI 358 Query: 257 MGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMV 150 G+ G+FK GLR GDPLSP LF++ M S M+ Sbjct: 359 NGTTSGYFKGRTGLRQGDPLSPILFVMAMNVLSLML 394 >XP_013699633.1 PREDICTED: uncharacterized protein LOC106403339 [Brassica napus] Length = 1455 Score = 112 bits (279), Expect(2) = 2e-41 Identities = 74/270 (27%), Positives = 133/270 (49%), Gaps = 10/270 (3%) Frame = -1 Query: 1390 KPFKYMPFWSQYSDSLRIINE-----EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDT 1226 KPFK+ + + D II G+ M ++ +LK + ++ + E +S+++ Sbjct: 488 KPFKFFTMLNNHPDFAEIIYSCWHSLPFSGSKMLLVSKKLKELKSIIRTFSKENYSDLEK 547 Query: 1225 RVIEKGHELE-CE---CSN*MGIKICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDG 1058 RV E ELE C+ +N E 
K + + Q+ ++ + +SR W +G Sbjct: 548 RVAESFSELESCQQALLTNPTPDLAKQERDAHKKWSL--LAQAEESFLRQRSRILWLAEG 605 Query: 1057 DANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDFYKELFATP-QEDTSIA 881 D+N+ FFH++L QN+I + + V++ D ++K V+ +Y+ L P TS Sbjct: 606 DSNSAFFHRALMTQISQNQICFLLDARDVVIDDLQELKDHVLSYYENLLGGPVAATTSSP 665 Query: 880 ATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVRE 701 + + + + L PFTA+EI+ + ++PGPDG+P EF+ A W TV Sbjct: 666 SLIAALVPYRCTTEAGNCLSAPFTAQEIKDVFFSLPRNKSPGPDGYPAEFFTAQWHTVGP 725 Query: 700 FVVEAVKTFFATCSMPRYFNSTIIALIPKK 611 ++ AV F ++ + +N+T++ LI KK Sbjct: 726 DMISAVMEFLSSGRILTQWNATVLTLIRKK 755 Score = 88.2 bits (217), Expect(2) = 2e-41 Identities = 52/152 (34%), Positives = 80/152 (52%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 ISCCNTIYK + +LANRLK L +I Q+A++PGR ++ +LL +L + Sbjct: 767 ISCCNTIYKVASKLLANRLKQILPSIISNSQSAFIPGRSLAENVLLATELVESYKWKSIS 826 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L + F+ + L+ V ++ ++ F + G G+FK Sbjct: 827 KRSMLKVDLQKAFDTVNWDFVINTLTGLNFLVSFVNLIRHCITTTRFSVSINGELCGYFK 886 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVKFQIS 135 ++GLR GDPLSPYLF+++ME F +M+K S Sbjct: 887 GTRGLRQGDPLSPYLFVLVMEVFCQMLKKNFS 918 >XP_010068169.1 PREDICTED: uncharacterized protein LOC104455000 [Eucalyptus grandis] Length = 1576 Score = 112 bits (281), Expect(2) = 5e-41 Identities = 71/232 (30%), Positives = 123/232 (53%), Gaps = 6/232 (2%) Frame = -1 Query: 1291 RLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIKICSEARCSDK**I*NVLQ 1112 +L++ + RLK LN E + +I R + L E N + ++A + V Sbjct: 792 KLRSLKRRLKLLNREAYYDISARTSD-ARRLLVEAQNAIQFDPHNQALADAEKDHLRVFS 850 Query: 1111 S----RKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVK 944 ++ + KSR W K+GD NT FFH S+K H++NRI ++ + VI D +V+ Sbjct: 851 DLRLKEESFYRQKSRVRWLKEGDLNTRFFHHSVKRGHLRNRILSISDGSNVIT-DEVEVQ 909 Query: 943 SIVIDFYKELF--ATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKL 770 + ++ ++ L ATP S+ + ++ ++D+ + QPFT EEI+ + + Sbjct: 910 
QLFVNHFQNLLSDATPSAVPSV-EEIRANLASTLDDNQIQAISQPFTEEEIQTTLFSLAT 968 Query: 769 GETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPK 614 G+ PGPDGF ++F+K +W V V+ A++ F +T + R N+TI+ L+PK Sbjct: 969 GKAPGPDGFNVDFFKQSWDIVGPSVLLAIRDFLSTGQLLREINTTILTLVPK 1020 Score = 86.3 bits (212), Expect(2) = 5e-41 Identities = 53/161 (32%), Positives = 78/161 (48%), Gaps = 4/161 (2%) Frame = -3 Query: 620 PKKTRACAYN----ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL 453 PK A N I+CCNTIYKC+T +LANRL + L +I Q A+V GR IS I+L Sbjct: 1019 PKSPNASTVNDFRPIACCNTIYKCITKLLANRLASMLPSIISPPQNAFVKGRRISDNIML 1078 Query: 452 MQDLSVWL**ELRCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLG 273 Q+L E + +++ F+ + + + Sbjct: 1079 AQELFAHFHHEPYFPKNIIKVDFSKAYDSVDWNFIESTLQAFGFPPTFIDRIMTCIRTPK 1138 Query: 272 FQLQSMGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMV 150 F + G GFF S +G+R GDPLSPY+F ++ME F+ ++ Sbjct: 1139 FSIALNGELHGFFPSGRGIRQGDPLSPYIFTLVMEVFAGII 1179 >XP_010058169.1 PREDICTED: uncharacterized protein LOC104445938 [Eucalyptus grandis] Length = 1495 Score = 137 bits (345), Expect(2) = 8e-41 Identities = 93/309 (30%), Positives = 151/309 (48%), Gaps = 12/309 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE- 1328 + +W AP+ SDH + V ++ + KPFK+ +W + D R+++E Sbjct: 662 NGQWNAEFSFSKASFLAPSISDHTPMVVKIMP-IPTSSKPFKFFNYWMTHPDFTRLVSEA 720 Query: 1327 ---EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIKICS 1157 G+ M +L+ +L+ + +LK LN E FS I R E L+ + Sbjct: 721 WASSFYGSPMFILYAKLRLLKSKLKQLNRESFSEISLRTEEARRTLQLTQDAIQDDPMNP 780 Query: 1156 EARCSDK**I*NVLQSR---KAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMH 986 ++K I R ++ + KSR W KDGD NT FFH+ + ++QNRI ++ Sbjct: 781 FLVETEKQQIQAFSALRLQEESFYRQKSRVRWLKDGDLNTKFFHQVVNRRYLQNRIISLT 840 Query: 985 NKDGVIVKDFDQVKSIVIDFYKELFATPQE-----DTSIAATLNKFISDNIEDSDRSFLD 821 N D V D +V+ + +D +++L A P I A LN F ++ FL Sbjct: 841 NGD-VTTSDPAEVQKMFVDHFRDLLAAPPSLSCPVKEEIQAVLNHF----LDAEQVCFLS 895 Query: 820 QPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFN 641 +P T EI+ + + 
+G+ PG DGF +EF+K +W V V+ AV+ FF T + + N Sbjct: 896 RPITDMEIKETLFSLAVGKAPGLDGFNVEFFKHSWDIVGTSVISAVRDFFETGELLKQIN 955 Query: 640 STIIALIPK 614 +TI+ L+PK Sbjct: 956 ATILVLVPK 964 Score = 60.8 bits (146), Expect(2) = 8e-41 Identities = 27/50 (54%), Positives = 36/50 (72%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDL 441 I+CCNTIYKC+T ++ANR+ L +I + Q A+V GR IS ILL Q+L Sbjct: 977 IACCNTIYKCITKLIANRMTHVLPSIISSTQNAFVKGRHISDNILLAQEL 1026 >JAT50758.1 Retrovirus-related Pol polyprotein LINE-1, partial [Anthurium amnicola] Length = 1070 Score = 120 bits (302), Expect(2) = 1e-40 Identities = 80/308 (25%), Positives = 152/308 (49%), Gaps = 11/308 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE- 1328 + EW+ S +H SDH + +S +D + + PKPFK+ + D +++++E Sbjct: 204 NEEWISEFPSSLLHYGPSLISDHAMMQISTVDDIPKEPKPFKFHAMCVSHPDFIKVVSEA 263 Query: 1327 ---EIEGNDMDMLHIRLKNDRGRLKALN----NEEFSNIDT---RVIEKGHELECECSN* 1178 +G+ + ++LK+ + L+ N + ++NI++ ++++ +L+ N Sbjct: 264 WHSHSKGSPLFTFCLKLKSVKAALRKWNLAGFGDIYANINSSRKKLLDFQGQLQHAPLNE 323 Query: 1177 MGIKICSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRI 998 I AR + + + + K++Q W GD+NT FFH ++K M N I Sbjct: 324 DLISKEKAARMEYS----QAIHAENMMARQKAKQFWLSQGDSNTKFFHAAIKSRRMLNSI 379 Query: 997 STMHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQ 818 N+ G +++D QVKS + +++ L +D + IS + ++ L + Sbjct: 380 RKCKNEQGDLIEDLSQVKSHTLLYFQNLL---NQDRLMNHPGQIEISKTLNADEQLSLSR 436 Query: 817 PFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNS 638 F+ EEI+ ++ ++PGPDGF EF+++ W V + + EAV+ FF T + + + Sbjct: 437 GFSVEEIKQVVMDSPKLKSPGPDGFTAEFFQSCWDIVGKDLCEAVQHFFITGRLLKQVGA 496 Query: 637 TIIALIPK 614 T I LIPK Sbjct: 497 TFITLIPK 504 Score = 76.6 bits (187), Expect(2) = 1e-40 Identities = 50/148 (33%), Positives = 73/148 (49%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 IS CN IYK ++ +LA R+ L K+I Q A++ GR I I L DL + + R Sbjct: 517 
ISLCNFIYKVISKLLAGRMHRVLEKLISPHQMAFIKGRKIQDSIFLANDLVKNIHSKSRG 576 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L + + F+ + + + ++ + F + GS GFF Sbjct: 577 NLSVLKADLRKAFDSIHRPFIYFMMQKMGFPMVFIDWIRTCLEVAKFSILFNGSPLGFFG 636 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147 SS G+R GDPLSPYLF+I ME FS M++ Sbjct: 637 SSSGIRQGDPLSPYLFVIAMEGFSAMME 664 >JAT63902.1 LINE-1 reverse transcriptase [Anthurium amnicola] Length = 825 Score = 113 bits (282), Expect(2) = 1e-36 Identities = 69/283 (24%), Positives = 135/283 (47%), Gaps = 9/283 (3%) Frame = -1 Query: 1429 LDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE----EIEGNDMDMLHIRLKNDRGRLK 1262 + ++ +D + + PKPFK+ W+ + D L+I+ E + G+ M +LK+ + LK Sbjct: 1 MHITTIDDILKEPKPFKFHSMWTSHPDFLKIVREAWHTHLSGSPMFSFCQKLKSVKAALK 60 Query: 1261 ALNNEEFSNIDTRVIEKGHEL-----ECECSN*MGIKICSEARCSDK**I*NVLQSRKAV 1097 N + F +I + +L + + S I E + + + + Sbjct: 61 RWNMQGFGDIYANISSSKKKLMDIQGQLQLSPLNEDLIYKEKSARSE--FSQAILAENMM 118 Query: 1096 CQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDFYKE 917 + K++Q W GD N+ FFH ++K M N I N+ G +++D +VKS + ++++ Sbjct: 119 ARQKAKQFWLSQGDTNSRFFHAAIKARRMVNSIRKCKNEQGELLEDISEVKSYTLSYFQK 178 Query: 916 LFATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPDGFPL 737 L +D + I + + D+ FL F+ E+I+ ++ ++PGPD F Sbjct: 179 LL---NQDRIVTPPSQVEIINTLTAEDQQFLSGGFSDEDIKLVVMNSPKMKSPGPDRFLA 235 Query: 736 EFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPKKQ 608 EF++ W+ + + + ++ FF T + + +T I LIPK + Sbjct: 236 EFFQTCWTIIGQDLCATMQHFFTTGRLLKQVGATFITLIPKNE 278 Score = 71.2 bits (173), Expect(2) = 1e-36 Identities = 45/148 (30%), Positives = 71/148 (47%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 IS CN +YK ++ ++A +L L K+I + Q A++ R I + L DL + R Sbjct: 289 ISLCNFLYKVISKLIAGKLHKVLDKIISSHQMAFIKRRKIQDSLFLANDLVKNIHSRHRG 348 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L + + F+ + + + ++ F + GS GFF Sbjct: 349 NISVLKADLRKAFDSVHRPFIYSMLQKMGFPMTFIHWIRSCLEAAKFSILFNGSPLGFFG 408 Query: 
230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147 SS G+R GDPLSPYLF+I ME FS +++ Sbjct: 409 SSNGIRQGDPLSPYLFVIAMEGFSALMQ 436 >XP_010541224.1 PREDICTED: uncharacterized protein LOC104814739 [Tarenaya hassleriana] Length = 1303 Score = 110 bits (276), Expect(2) = 2e-36 Identities = 79/307 (25%), Positives = 142/307 (46%), Gaps = 10/307 (3%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKP-FKYMPFWSQYSDSLRIIN- 1331 + WLL+ P SDH V++ S + FK++ + SD ++ Sbjct: 355 NEHWLLTYNDSYATFDQPGPSDHCPCKVTLNPSTSPRRRTSFKFLNSLTLLSDFRPLVEK 414 Query: 1330 ----EEIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIKI 1163 E I+G++M + LK+ + L+ L E SNI RV E EL+ + + Sbjct: 415 VWNAEHIQGSNMFRISSNLKSLKHHLRRLTKESCSNIQQRVAEAFGELKVAQNQLLQNPT 474 Query: 1162 CSEAR--CSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTM 989 AR + + + + ++ + KSR W K+G+ NT +FH+ + N I + Sbjct: 475 LESARREVALRATWSTLAVAEESYFRQKSRIRWLKEGEQNTKYFHRVVLARQATNSIRFL 534 Query: 988 HNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTSI--AATLNKFISDNIEDSDRSFLDQP 815 N G+ D +K++ + Y+++ T + L + + + D SFL++ Sbjct: 535 VNDAGLKTYDVVMMKAMAVQHYRDILGTHDHNIRCWSKTELQQILPFRCSEEDNSFLEEI 594 Query: 814 FTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNST 635 T +I + + + PGPDG+P+EF K+ W + + AVK FF + + + +N+T Sbjct: 595 PTESDIRSVVFELPKNKAPGPDGYPVEFLKSCWDLIGADINPAVKEFFTSEKILKQWNTT 654 Query: 634 IIALIPK 614 +I+LIPK Sbjct: 655 LISLIPK 661 Score = 72.4 bits (176), Expect(2) = 2e-36 Identities = 42/147 (28%), Positives = 75/147 (51%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 ISCCNT+YK ++ +L +L+ + + + + QTA++ G IS +LL +L + Sbjct: 674 ISCCNTVYKIISKLLVRKLQRIMPQAVASNQTAFISGCNISENVLLAIELVADFKPRDQA 733 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L ++ FL +L++ + ++ F + G GFF+ Sbjct: 734 QKALLKVDLSKAFDSVHWEFLLCILQALNLPGKFVGWIRECITTPTFSISFNGESTGFFE 793 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMV 150 +GLR GDPLSP+LF+++M+ SR++ Sbjct: 794 GKRGLRQGDPLSPHLFVLVMDVLSRLL 820 >XP_011013113.1 PREDICTED: LOW QUALITY 
PROTEIN: uncharacterized protein LOC105117228 [Populus euphratica] Length = 2627 Score = 102 bits (253), Expect(2) = 3e-36 Identities = 77/302 (25%), Positives = 136/302 (45%), Gaps = 8/302 (2%) Frame = -1 Query: 1495 WLLSMRSCSVHIPAPTE-SDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINEE-- 1325 W R VH P +DH V + + QG + FK+ W+ + L +++ Sbjct: 1713 WSSIHRQTHVHFDTPGAFTDHSPAKVCLSQHI-QGRRSFKFFNMWASHDKFLDVVSTNWH 1771 Query: 1324 --IEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHEL---ECECSN*MGIKIC 1160 + G M +L RLK + LKALN+ F++I RV EL + + + M + Sbjct: 1772 SAVYGTPMYVLCRRLKLLKRHLKALNSLHFNHISERVSRLETELANHQLDLQHDMDNQSL 1831 Query: 1159 SEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNK 980 E + + ++ + K C K + + K+ D + FFH L +H +N I + Sbjct: 1832 LEQEMLLRSKLSSLKFAEKQFCSQKIKCNFLKESDTGSKFFHALLNHNHRKNFIPAIMTS 1891 Query: 979 DGVIVKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAEE 800 G + +V S+ ++++++ P I + + + + + L P + EE Sbjct: 1892 QGHLTSSLKEVGSVFVNYFQQQLGIPTPVLPIDSAVVQS-GPCLSSGSQDLLLAPVSCEE 1950 Query: 799 IEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALI 620 I A+ + + PGPDG+ F+K W +RE AV+ FF + + + N +IIAL+ Sbjct: 1951 IRKAVFSIGDDKAPGPDGYSSLFFKQAWHIIREDFCSAVQDFFHSGKLLKQLNHSIIALV 2010 Query: 619 PK 614 PK Sbjct: 2011 PK 2012 Score = 80.9 bits (198), Expect(2) = 3e-36 Identities = 54/156 (34%), Positives = 73/156 (46%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 ISCCN IYK + ILA RL L +I Q A++ GR +S I L+Q+L + Sbjct: 2025 ISCCNVIYKVIAKILATRLALALMDIISPYQNAFLGGRFMSDNINLVQELLRQYGRKRSS 2084 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 L + + FL L V +S + + + + G GFF+ Sbjct: 2085 PRSLLKVDFRKAFDSVQWNFLENLLRHLGFPVPFVSLIMQCVSTTSYSVAVNGDLHGFFQ 2144 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVKFQISQRFF 123 G+R GDPLSPYLF+ ME FSRM+K Q F Sbjct: 2145 GQSGVRQGDPLSPYLFLCCMEYFSRMLKLVSQQEGF 2180 Score = 102 bits (254), Expect(2) = 5e-31 Identities = 78/306 (25%), Positives = 141/306 (46%), Gaps = 9/306 (2%) Frame = -1 Query: 1495 
WLLSMRSCSVHIPAP-TESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE--- 1328 W VH P T SDH + V + + FK+ W ++ D ++ E Sbjct: 201 WSSFQHFVHVHFSTPGTFSDHSPISVCIGPQYKPKRTSFKFFNMWVEHQDYQSLLLEHWH 260 Query: 1327 -EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECS----N*MGIKI 1163 E+ G+ M +L +LK +G LK LN F +I RV +L+ S + I++ Sbjct: 261 AEVYGSPMYVLCRKLKLLKGPLKQLNKLHFGHISERVCRAEAQLDQHQSLLQVHKDNIQL 320 Query: 1162 CSEARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHN 983 + R + + N+ K K + + +D D T+FFH + H +N I T+H Sbjct: 321 LEQDR-KLRLELVNLKSFEKMFYSQKLKYNFFRDCDRGTSFFHALMNQKHKKNFIPTIHR 379 Query: 982 KDGVIVKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAE 803 DG + +V + I F+ +L T + + ++ + I+ S + L +++ Sbjct: 380 SDGSLTTSQSEVGDVFIKFFSQLLGTSGATSPLDESVVGY-GPCIDPSLHASLLANVSSD 438 Query: 802 EIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIAL 623 +I+ + + ++PGPDG+ F+K +W V + AV++FF + + + N +IIAL Sbjct: 439 DIKAVLFSIGDNKSPGPDGYSAFFFKKSWDVVGPDLCAAVQSFFQSGQLLKQINHSIIAL 498 Query: 622 IPKKQE 605 +PK + Sbjct: 499 VPKSAQ 504 Score = 63.2 bits (152), Expect(2) = 5e-31 Identities = 45/148 (30%), Positives = 67/148 (45%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 ISCCN + K ++ ILA R+ L +I Q A++ GR ++ I L+Q+L + Sbjct: 514 ISCCNVVDKIISKILATRMGRVLDSIISPLQNAFLGGRRMNDNINLLQELLRHYERKRAS 573 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 + + + FL L + + + + G GFF Sbjct: 574 PRCLIKIDFRKAFDSVQWPFLRHLLLLLGFPDQFVHLVMTCVETASYSVAVNGELFGFFP 633 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147 G+R GDPLSPYLFII ME SRM++ Sbjct: 634 GKCGVRQGDPLSPYLFIICMEYLSRMLR 661 >XP_013650404.1 PREDICTED: uncharacterized protein LOC106354909 [Brassica napus] XP_013694977.1 PREDICTED: uncharacterized protein LOC106399045 [Brassica napus] Length = 1651 Score = 97.1 bits (240), Expect(2) = 3e-35 Identities = 70/285 (24%), Positives = 131/285 (45%), Gaps = 8/285 (2%) Frame = -1 Query: 1444 SDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRII-----NEEIEGNDMDMLHIRLKN 1280 SDH + V 
+ + PF++ F S+ L +I + I G++M + +LK Sbjct: 668 SDHASMSVILQSDKVKHRIPFRFFNFLLLDSELLPMIAWLWFSTNIVGSEMFRVSKKLKA 727 Query: 1279 DRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MG--IKICSEARCSDK**I*NVLQSR 1106 + ++ N + +SN++ R E L N + +EA + + +L++ Sbjct: 728 LKNPIRYFNRDRYSNLEKRAEEAQANLNLIQHNLLSDPTPAIAEAEADAQRKLGILLKAE 787 Query: 1105 KAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDF 926 +A ++ +W + GD +++FHK + QN I + G + D + + + Sbjct: 788 QAFLFQRTNISWLQVGDCGSHYFHKLMATRRSQNHIHLLLGPSGERFETRDAIHAHCLAH 847 Query: 925 YKELFATPQEDTSI-AATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPD 749 + E T S ++ ++ N D+ R L + F+ +EI+ A + ++ GPD Sbjct: 848 FTEFLGTSATQPSFDPQDISLLLNYNCSDNQRKKLQEDFSEQEIKDAFFSLPRNKSCGPD 907 Query: 748 GFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPK 614 GF EF+ WS + ++ AVK FF + R +NST++ LIPK Sbjct: 908 GFSSEFFIGCWSIIGPEIISAVKEFFREGKLLRQWNSTMLVLIPK 952 Score = 82.4 bits (202), Expect(2) = 3e-35 Identities = 49/148 (33%), Positives = 79/148 (53%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411 ISC NT+YK ++ +L RLK L VI Q+A++PGR ++ +LL +L + Sbjct: 965 ISCLNTLYKFISRLLTGRLKEALIPVISHAQSAFMPGRLLTENVLLATELVQGYKRKNIS 1024 Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231 + L + F+ +L + + ++ + F + G G+FK Sbjct: 1025 SRAMLKVDLRKAFDSIRWNFVIASLQALGMPLRFVNWISECITSASFTICVNGESGGYFK 1084 Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147 S++GLR GDPLSPYLF+++ME FSR++K Sbjct: 1085 STRGLRQGDPLSPYLFVLVMEVFSRLLK 1112 >OMO64412.1 reverse transcriptase [Corchorus capsularis] Length = 1793 Score = 154 bits (389), Expect = 5e-35 Identities = 107/411 (26%), Positives = 192/411 (46%), Gaps = 24/411 (5%) Frame = -1 Query: 1504 SSEWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE- 1328 ++EWL V P SDH + + K PKPFK+ +W+++++ L++IN+ Sbjct: 1097 NAEWLQCFADSMVEFMLPDVSDHCLMFIRTDVKFFSPPKPFKFFNYWTKHAEFLQLINDV 1156 Query: 1327 ---EIEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIKICS 1157 + GN M L + K+ + LK N + F N+ +V+EK E+ + GI + Sbjct: 1157 
WSGNVTGNRMQRLSKKFKSLKSVLKEFNRKHFGNLPQKVVEKKAEI---ATLQKGILVSP 1213 Query: 1156 EARCSDK**I*N-----VLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRIST 992 A + I + + ++ + KSR W ++GD NT FFHK++K+ + +N I Sbjct: 1214 AADLIQRHKIATNELHELQLAEESFYKQKSRVQWLQEGDLNTGFFHKTMKIRNKRNDIRL 1273 Query: 991 MHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTSI--AATLNKFISDNIEDSDRSFLDQ 818 ++ ++G + + +++ +++Y+ ++ S A L + ++I + + + Sbjct: 1274 LYKENGDRLHTYAEIRDEAVNYYQNFLGKKDDNISSCPADLLQDILQNSISEELARKIIE 1333 Query: 817 PFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNS 638 P T EEI + + PDG+ F+KA WS V + V+ A+ FF + + NS Sbjct: 1334 PVTEEEIRATFFSLNSNKASSPDGYSAHFFKAAWSIVGKDVITAILYFFESSRISPGINS 1393 Query: 637 TIIALIPKKQEPVHITFLAAIPYINV*PLSWQID*KQH*AR**EQSKLLMCL-------- 482 T+IAL+PK P H+T I + ++ K ++L CL Sbjct: 1394 TVIALVPKVDNPSHMTDFRPISCCT---MIYKCIAKI------LANRLKQCLPGLISIKQ 1444 Query: 481 -----GDAFLVEFC*CKT*VCGYDKSSGVPRCALTVDIMKAYDTVLEFSWG 344 G + + + V Y+KS PRCA+ +D+ KA+DT+ WG Sbjct: 1445 SAFIEGRSIIDNVLMAQELVSRYNKSQLSPRCAIKIDLRKAFDTL---DWG 1492 Score = 77.8 bits (190), Expect = 5e-11 Identities = 49/159 (30%), Positives = 82/159 (51%), Gaps = 3/159 (1%) Frame = -3 Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDL-SVWL**EL- 417 ISCC IYKC+ ILANRLK L +I +Q+A++ GR I +L+ Q+L S + +L Sbjct: 1414 ISCCTMIYKCIAKILANRLKQCLPGLISIKQSAFIEGRSIIDNVLMAQELVSRYNKSQLS 1473 Query: 416 -RCA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDG 240 RCA + + F+ +L+ ++ + F + G +G Sbjct: 1474 PRCA---IKIDLRKAFDTLDWGFILNIFKALNFPKQFVNWISTCITTTRFSVSINGGLEG 1530 Query: 239 FFKSSKGLRHGDPLSPYLFIIIMECFSRMVKFQISQRFF 123 +F ++G+R GDPLSPY+F++ M S+++ + + F Sbjct: 1531 YFTGARGVRQGDPLSPYIFVMAMNTLSKLLDYGVDHGVF 1569 >JAT47559.1 Retrovirus-related Pol polyprotein LINE-1, partial [Anthurium amnicola] Length = 447 Score = 98.6 bits (244), Expect(2) = 1e-34 Identities = 50/155 (32%), Positives = 85/155 (54%) Frame = -1 Query: 1078 QTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDFYKELFATPQ 899 Q W GD N+ FFH ++K M N I N G +++D DQVKS + ++++L 
Sbjct: 1 QFWLSQGDTNSRFFHAAIKARRMLNSIRKCKNDQGDLIEDLDQVKSHTLLYFQKLL---N 57

Query: 898 EDTSIAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKAT 719
+D + IS + ++D+ L + F+ EEI+ ++ ++PGPDGF EF+++
Sbjct: 58 QDRLLTHPTQFEISKTLNENDQQSLSRGFSVEEIKQVVMDSPKLKSPGPDGFTAEFFQSC 117

Query: 718 WSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPK 614
W V + + +A++ FF T + + +T I LIPK
Sbjct: 118 WDIVGKDLCDAIQYFFTTGRLLKQVGATFITLIPK 152

 Score = 79.3 bits (194), Expect(2) = 1e-34
 Identities = 52/148 (35%), Positives = 71/148 (47%)
 Frame = -3

Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411
IS CN IYK ++ +LA R+ L K+I Q A++ GR I I L DL + + R
Sbjct: 165 ISLCNFIYKVISKLLAGRMNRVLDKLISPHQMAFIKGRKIQDSIFLANDLLKNIHSKSRG 224

Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231
L + + F+ + + +K F + GS GFF
Sbjct: 225 NISVLKADLRKAFDSVHRPFIYSMMQKMGFPTLFIDWIKTCLEAAKFSILFNGSPLGFFG 284

Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147
SS G+R GDPLSPYLF+I ME FS M+K
Sbjct: 285 SSNGIRQGDPLSPYLFVIAMEGFSAMMK 312

>XP_013651296.1 PREDICTED: uncharacterized protein LOC106355988 [Brassica napus]
 Length = 1803

 Score = 98.6 bits (244), Expect(2) = 2e-33
 Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 11/291 (3%)
 Frame = -1

Query: 1453 PTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINEE-----IEGNDMDMLHIR 1289
P +SDH + KPFK+ + + + +++E +EG+ L R
Sbjct: 714 PKQSDHAPCLFRIPSVSRHQRKPFKFYHHITDHPEYSSVVSEAWANVVVEGSHQFKLVRR 773

Query: 1288 LKNDRGRLKALNNEEFSNIDTRVIEKG---HELECECSN*MGIKICSEARCSDK**I*NV 1118
+K + L+ LN +S I RV + L+ SE I NV
Sbjct: 774 MKLLKTDLRRLNKTHYSGITGRVKHQSAIVERLQTSLLTQPDPATASEEHRERA--ILNV 831

Query: 1117 L-QSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKS 941
L + + +SR W GD NT F+HK++ + +N I + + G + D +K
Sbjct: 832 LLNAEHKFFRQRSRVRWADVGDRNTTFYHKTVTERNSRNHIHYLLDDSGRFLGSLDDIKE 891

Query: 940 IVIDFYKELFATPQEDTS--IAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLG 767
+++ + + S +L + ++ D +++L + EI+G + M L
Sbjct: 892 HSASYFQGILGETELPVSPVTVESLQELLTFRCSDMQKAYLKRDVLEAEIKGTIFSMPLN 951

Query: 766 ETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPK 614
++PGPDG+ EF KA+W TV VV AV FF + + N+T I LIPK
Sbjct: 952 KSPGPDGYSFEFLKASWETVGGDVVAAVAEFFRNGRLLKDLNTTAITLIPK 1002

 Score = 75.1 bits (183), Expect(2) = 2e-33
 Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 17/183 (9%)
 Frame = -3

Query: 620 PKKTRACAYN----ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL 453
PK AC ISCCN +YK +T I+ANRLK L I Q+A++ GR + +LL
Sbjct: 1001 PKSAAACKLRDYRPISCCNIVYKVITKIIANRLKPILQSSISRSQSAFLKGRSLGENVLL 1060

Query: 452 MQDL-----------SVWL**ELRCA--*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVD 312
+L S L ++R A +C D F+ K +
Sbjct: 1061 AVELIRKYESPTCGKSSMLKIDIRKAFDTICWD-------------FVIKVLQAQGFPPI 1107

Query: 311 LLSGLKPV*ALLGFQLQSMGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMVKFQISQ 132
++ ++ + F + G GFF KGLR GD +SPYLFI++ME S++++ +
Sbjct: 1108 FVTWIRECISTPRFSVAINGELAGFFPGKKGLRQGDAISPYLFIMVMEVLSKLLEKAVED 1167

Query: 131 RFF 123
F
Sbjct: 1168 GAF 1170

>XP_009140327.1 PREDICTED: uncharacterized protein LOC103864320 [Brassica rapa]
 Length = 1688

 Score = 97.1 bits (240), Expect(2) = 5e-33
 Identities = 73/270 (27%), Positives = 122/270 (45%), Gaps = 11/270 (4%)
 Frame = -1

Query: 1390 KPFKYMPFWSQYSDSLRIINEE-----IEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDT 1226
KPFK+ + + + +++E +EG+ L R+K + L+ LN +S I
Sbjct: 708 KPFKFYHHITDHPEYSSVVSEAWANVVVEGSHQFKLVRRMKLLKTDLRRLNKTHYSGITG 767

Query: 1225 RVIEKG---HELECECSN*MGIKICSEARCSDK**I*NVL-QSRKAVCQCKSRQTWDKDG 1058
RV + L+ SE I NVL + + +SR W G
Sbjct: 768 RVKHQSAIVERLQTSLLTQPDPATASEEHRERA--ILNVLLNAEHKFFRQRSRVRWADVG 825

Query: 1057 DANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDFYKELFATPQEDTS--I 884
D NT F+HK++ + +N I + + G + D +K +++ + + S
Sbjct: 826 DRNTTFYHKTVTERNSRNHIHYLLDDSGRFLGSLDDIKEHSASYFQGILGETELPVSPVT 885

Query: 883 AATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPDGFPLEFYKATWSTVR 704
+L + ++ D +++L + EI+G + M L ++PGPDG+ EF KA+W TV
Sbjct: 886 VESLQELLTFRCSDMQKAYLKRDVLEAEIKGTIFSMPLNKSPGPDGYSFEFLKASWETVG 945

Query: 703 EFVVEAVKTFFATCSMPRYFNSTIIALIPK 614
VV AV FF + + N+T I LIPK
Sbjct: 946 GDVVAAVAEFFKNGRLLKDLNTTAITLIPK 975

 Score = 75.1 bits (183), Expect(2) = 5e-33
 Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 17/183 (9%)
 Frame = -3

Query: 620 PKKTRACAYN----ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILL 453
PK AC ISCCN +YK +T I+ANRLK L I Q+A++ GR + +LL
Sbjct: 974 PKSAAACKLRDYRPISCCNIVYKVITKIIANRLKPILQSSISRSQSAFLKGRSLGENVLL 1033

Query: 452 MQDL-----------SVWL**ELRCA--*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVD 312
+L S L ++R A +C D F+ K +
Sbjct: 1034 AVELIRKYESPTCGKSSMLKIDIRKAFDTICWD-------------FVIKVLQAQGFPPI 1080

Query: 311 LLSGLKPV*ALLGFQLQSMGSKDGFFKSSKGLRHGDPLSPYLFIIIMECFSRMVKFQISQ 132
++ ++ + F + G GFF KGLR GD +SPYLFI++ME S++++ +
Sbjct: 1081 FVTWIRECISTPRFSVAINGELAGFFPGKKGLRQGDAISPYLFIMVMEVLSKLLEKAVED 1140

Query: 131 RFF 123
F
Sbjct: 1141 GAF 1143

>JAU41414.1 LINE-1 retrotransposable element ORF2 protein, partial [Noccaea caerulescens]
 Length = 631

 Score = 107 bits (266), Expect(2) = 5e-33
 Identities = 56/160 (35%), Positives = 87/160 (54%), Gaps = 2/160 (1%)
 Frame = -1

Query: 1087 KSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVIVKDFDQVKSIVIDFYKELFA 908
KSR W K+GD+N+ FFH+S+K + +N I + N DG + + D +K + +D+Y+ L
Sbjct: 55 KSRVRWLKEGDSNSGFFHRSVKANLSRNAIHHLRNSDGQKIYNPDLIKRMAVDYYRNLLG 114

Query: 907 TPQEDTS--IAATLNKFISDNIEDSDRSFLDQPFTAEEIEGAMLGMKLGETPGPDGFPLE 734
+P + A L + S +D L + E+I A + + PGPDGF +E
Sbjct: 115 SPDNEVHPYSVAQLQELHSFRCDDDSALLLSSLPSNEDIMKAFFSLPKNKAPGPDGFTME 174

Query: 733 FYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPK 614
F+ +W V ++ AVK FF + R NST+I+LIPK
Sbjct: 175 FFYYSWDLVGSSLIGAVKDFFTGSYILRQVNSTVISLIPK 214

 Score = 65.1 bits (157), Expect(2) = 5e-33
 Identities = 42/147 (28%), Positives = 69/147 (46%)
 Frame = -3

Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411
IS CNT+YK ++ IL++RLK +V+ Q +V GR ++ ILL +L
Sbjct: 227 ISLCNTVYKVISRILSSRLKLLTPRVVQRNQVGFVKGRLLTENILLASELVKDFNKPGIV 286

Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231
CL + + FL + ++ + + + G GFFK
Sbjct: 287 TRGCLQIDITKAYDNVNWDFLRNILEAFELPATFREWINLCISSPHYSISVNGELSGFFK 346

Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMV 150
KGLR GDP+S LF++ M+ S+++
Sbjct: 347 GEKGLRQGDPISSSLFVLAMDILSKLL 373

>CCA66198.1 hypothetical protein [Beta vulgaris subsp. vulgaris]
 Length = 1381

 Score = 114 bits (285), Expect(2) = 1e-32
 Identities = 83/306 (27%), Positives = 144/306 (47%), Gaps = 6/306 (1%)
 Frame = -1

Query: 1501 SEWLLSMRSCSVHIPAPTESDHW--YLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINE 1328
+EW+ + +V I + SDH L S++D GP+PFK+ W + + I+ +
Sbjct: 201 AEWIEKFPALAVSILNRSISDHCPLLLQSSIVD---WGPRPFKFQDVWLSHKGCMEIVEK 257

Query: 1327 E-IEGNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELECECSN*MGIKICSE- 1154
I+ ++ ++ +LK + LK N+E F NID ++ + E++ S + E
Sbjct: 258 AWIQSKELTLMQ-KLKKVKLDLKTWNSESFGNIDANILLREAEIQKWDSEANSRDLEPEE 316

Query: 1153 --ARCSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNK 980
R + + L+ ++ +SR W K GD NT FFH + +N IS++ +
Sbjct: 317 IKTRAQAQLELWEWLKKKEIYWAQQSRIKWLKSGDRNTKFFHICASIRRSKNNISSILLQ 376

Query: 979 DGVIVKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAEE 800
G ++D +K + ++K LF ED T + +S + PF+ E
Sbjct: 377 -GKKIEDPIIIKEEAVKYFKNLFT---EDFKERPTFTNLSFKKLSESQAFSISAPFSTTE 432

Query: 799 IEGAMLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALI 620
I+ A+ ++PGPDGF +F KA+W ++ ++ F+ T +PR N IALI
Sbjct: 433 IDEAVASCNPSKSPGPDGFNFKFIKASWDLIKHDFYSIIQEFWHTGILPRGSNVAFIALI 492

Query: 619 PKKQEP 602
K + P
Sbjct: 493 AKIESP 498

 Score = 56.6 bits (135), Expect(2) = 1e-32
 Identities = 40/148 (27%), Positives = 70/148 (47%)
 Frame = -3

Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGRCISGGILLMQDLSVWL**ELRC 411
IS +YK ++ +LA RLK ++ ++G Q++++ GR I IL+ +L +
Sbjct: 507 ISMVGCVYKIISKLLAGRLKQVMNDLVGPHQSSFIEGRQILDSILIASELFESCKRRKKA 566

Query: 410 A*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFFK 231
M L +++ FL S + + + + GS FK
Sbjct: 567 TVM-LKIDFHKAFDSVSWSFLDWTLSQMGFPPRWKKWISSCVSSAAASVLLNGSPSLPFK 625

Query: 230 SSKGLRHGDPLSPYLFIIIMECFSRMVK 147
+GLR GDPLSP+LF++++E + M+K
Sbjct: 626 LQRGLRQGDPLSPFLFVLVVEVMNLMIK 653

>XP_010671205.1 PREDICTED: uncharacterized protein LOC104888072 [Beta vulgaris subsp. vulgaris]
 Length = 1592

 Score = 119 bits (297), Expect(2) = 2e-31
 Identities = 88/302 (29%), Positives = 144/302 (47%), Gaps = 3/302 (0%)
 Frame = -1

Query: 1498 EWLLSMRSCSVHIPAPTESDHWYLDVSVLDKLSQGPKPFKYMPFWSQYSDSLRIINEEIE 1319
EWL + + V + SDH L V D+ + GPKPF++ W D L+I+ +
Sbjct: 413 EWLSKLPTIKVDLLQRGLSDHCPLLVHTKDQ-NWGPKPFRFQNCWLTDPDCLKIVKNVWQ 471

Query: 1318 GNDMDMLHIRLKNDRGRLKALNNEEFSNIDTRVIEKGHELEC--ECSN*MGIKICS-EAR 1148
+ +LK + RL N EF NIDT++ + +E++ E +N ++ + R
Sbjct: 472 ESAALQTREKLKEVKKRLNEWNQNEFGNIDTKIKKLENEIQRLDEINNFRDLEAQEVDNR 531

Query: 1147 CSDK**I*NVLQSRKAVCQCKSRQTWDKDGDANTNFFHKSLKLHHMQNRISTMHNKDGVI 968
+ + ++ ++ SR +W K+GD NT FFH +N I+++ DG
Sbjct: 532 KKAQSELWVWMKRKELYWAQNSRISWLKEGDRNTKFFHDIASNKRRKNSINSIII-DGQP 590

Query: 967 VKDFDQVKSIVIDFYKELFATPQEDTSIAATLNKFISDNIEDSDRSFLDQPFTAEEIEGA 788
V D +K+ F+K +F +E+ I + + + S L PF+ EEI+ A
Sbjct: 591 VDDPSCIKNEARAFFKGIF---REEYDIRPHFDNLNFKQVTEEQGSQLTLPFSREEIDNA 647

Query: 787 MLGMKLGETPGPDGFPLEFYKATWSTVREFVVEAVKTFFATCSMPRYFNSTIIALIPKKQ 608
+ + PGPDGF +F K+ W V+ + E V F+A+ +P+ N IALIPK
Sbjct: 648 VASCDSDKAPGPDGFNFKFIKSAWDIVKHDIYEMVHKFWASSQLPQGCNVAYIALIPKID 707

Query: 607 EP 602
P
Sbjct: 708 NP 709

 Score = 47.8 bits (112), Expect(2) = 2e-31
 Identities = 37/148 (25%), Positives = 70/148 (47%), Gaps = 1/148 (0%)
 Frame = -3

Query: 590 ISCCNTIYKCVTTILANRLKTTLSKVIGTEQTAYVPGR-CISGGILLMQDLSVWL**ELR 414
IS +YK + ++A+RL+ +S +IGT Q++Y+ GR + G ++ + + +
Sbjct: 718 ISMVGCLYKIIAKLMASRLQKIMSSLIGTLQSSYIEGRQILDGALVAGEIIDSYKKNGKE 777

Query: 413 CA*MCLDS*YNEGL*HCVGVFLGKQ*SSLDIHVDLLSGLKPV*ALLGFQLQSMGSKDGFF 234
LD +++ FL ++ + + + GS F
Sbjct: 778 AILFKLD--FHKAYDSVSWGFLKWVLEQMNFPSKWREWIMSCVSSAYASILVNGSPSAPF 835

Query: 233 KSSKGLRHGDPLSPYLFIIIMECFSRMV 150
K +GLR GDPLSP+LF++I E ++++
Sbjct: 836 KLQRGLRQGDPLSPFLFLLIGEVLNQVI 863