BLASTX nr result
ID: Ephedra25_contig00021777
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00021777 (1365 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster...   149   3e-33
ref|XP_006590044.1| PREDICTED: uncharacterized protein LOC102665...   148   6e-33
gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao]   147   1e-32
gb|ACI62137.1| polyprotein [Drosophila melanogaster]                  146   2e-32
emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera]   145   3e-32
emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]   145   5e-32
gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao]   144   9e-32
gb|AAT38797.2| Polyprotein, putative [Solanum demissum]               119   1e-31
emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]   144   1e-31
emb|CAN60238.1| hypothetical protein VITISV_032906 [Vitis vinifera]   143   2e-31
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         143   2e-31
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   143   2e-31
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             128   3e-31
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   142   3e-31
emb|CBI37296.3| unnamed protein product [Vitis vinifera]              142   3e-31
ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898...   142   3e-31
emb|CAN71037.1| hypothetical protein VITISV_011061 [Vitis vinifera]   108   4e-31
gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-ty...   110   5e-31
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...
141 6e-31 emb|CAB75469.1| copia-type reverse transcriptase-like protein [A... 140 2e-30 >pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster) transposon 1731 gi|8702|emb|CAA30503.1| unnamed protein product [Drosophila melanogaster] Length = 982 Score = 149 bits (375), Expect = 3e-33 Identities = 83/206 (40%), Positives = 111/206 (53%), Gaps = 5/206 (2%) Frame = +3 Query: 750 LFIMKPIPTECFLITS*ISNLWHNRFGHVNNECLS---RISATVSHSXXXXXXXXXCDNC 920 L++ + CF +LWH R GH+N L R C C Sbjct: 118 LYMFQGKHNSCFAAVDADGSLWHKRNGHLNTSSLQEMVRKKMVYGVEKVVFKPDAVCKTC 177 Query: 921 ITAKLHKKPFNKSSRTTTR-CLEIIHLDLCGPIN-PSTLHEKYILTFSDDLSKMT*VYFL 1094 + AK+H +PF K++R+ L++IH DLCGP + PS KY LTF DD S+ VYFL Sbjct: 178 MLAKIHVQPFPKTTRSRAEELLDMIHSDLCGPFSTPSLAGSKYFLTFIDDKSRRIFVYFL 237 Query: 1095 RHKSEVFKYFLKFRKRV*TENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNP 1274 R K EVF F++F+K V + KI C DNG EF +N F DY GI ++ +P+ P Sbjct: 238 RKKDEVFTKFVEFKKLVERQTGRKIKCIRSDNGGEFVNNVFDDYLKAHGIARQLTIPHTP 297 Query: 1275 QQNGVVERKNKTLIEMIQSQLHSSQL 1352 QQNGV ER N+TL+EM + L S+L Sbjct: 298 QQNGVAERANRTLVEMARCMLLQSEL 323 >ref|XP_006590044.1| PREDICTED: uncharacterized protein LOC102665857 [Glycine max] Length = 678 Score = 148 bits (373), Expect = 6e-33 Identities = 82/185 (44%), Positives = 110/185 (59%), Gaps = 4/185 (2%) Frame = +3 Query: 810 LWHNRFGHVNNECLSRISAT--VSHSXXXXXXXXXCDNCITAKLHKKPFNKSSRT-TTRC 980 LWH RFGH+N + L R++ V C+ C+ K +K F K S T T+ Sbjct: 384 LWHLRFGHLNFDGLERLAKKEMVRDLPSINHPDQLCERCLIGKQFRKSFPKESTTRATKP 443 Query: 981 LEIIHLDLCGPINPSTLHE-KYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TEN 1157 LE+IH D+CGPI P++ + KY L F DD S+ T VYFL+ KSEVF+ F KF+ V E+ Sbjct: 444 LELIHTDVCGPIKPNSFGKNKYFLLFIDDYSRKTWVYFLKEKSEVFENFKKFKALVEKES 503 Query: 1158 HTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQL 1337 I D G EFTSN+F Y GIR+ VP +PQQNGV ERKN+T++ M++S L Sbjct: 504 GLSIKAMRSDRGGEFTSNKFNKYCEDHGIRRPLTVPRSPQQNGVAERKNRTILNMVRSML 563 Query: 1338 HSSQL 1352 S ++ Sbjct: 564 KSKKM 568 >gb|EOY11267.1| Uncharacterized protein 
TCM_026511 [Theobroma cacao] Length = 1318 Score = 147 bits (371), Expect = 1e-32 Identities = 76/187 (40%), Positives = 112/187 (59%), Gaps = 4/187 (2%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT--VSHSXXXXXXXXXCDNCITAKLHKKPFNKSSRT-TT 974 + LWH R GH+N + + + + V+ C+ C+ K + PF K S+T T Sbjct: 456 ARLWHRRLGHINYQFIKNMGSLNLVNDMPIITEVEKTCEVCLQGKQSRHPFPKQSQTRTA 515 Query: 975 RCLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*T 1151 L++IH D+CGPI +L+ KY + F DD S+ ++FL+ KSE +YF+KF+ V Sbjct: 516 NRLQLIHTDICGPIGTLSLNGNKYFILFIDDFSRFCWIFFLKQKSEAIQYFMKFKVLVEK 575 Query: 1152 ENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQS 1331 + KI DNG E+TSNEFK T++GI++ VPY+PQQNGV ERKN+T++EMI+ Sbjct: 576 QTDQKIKALRSDNGSEYTSNEFKALLTQEGIKQFLTVPYSPQQNGVSERKNRTIMEMIRC 635 Query: 1332 QLHSSQL 1352 L Q+ Sbjct: 636 LLFEQQM 642 >gb|ACI62137.1| polyprotein [Drosophila melanogaster] Length = 1319 Score = 146 bits (369), Expect = 2e-32 Identities = 76/185 (41%), Positives = 107/185 (57%), Gaps = 5/185 (2%) Frame = +3 Query: 813 WHNRFGHVNNECLSRISA---TVSHSXXXXXXXXXCDNCITAKLHKKPFNKSS-RTTTRC 980 WHNRFGH+N +CL I + CD C AK+H PF ++S R T Sbjct: 403 WHNRFGHLNFQCLKEIKEKELVIGMDFKNMSVNINCDTCNMAKIHVLPFPQNSERATQSV 462 Query: 981 LEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TEN 1157 LE++H D+CGP+N S+L KY +TF DD S+ +YF+ K+EVF F F+ V + Sbjct: 463 LELVHSDVCGPMNVSSLGGNKYFVTFIDDYSRKIFIYFMHAKNEVFDKFKLFKSYVECQT 522 Query: 1158 HTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQL 1337 KI DNG E+ + +F +Y GI+++ VPY PQQNGV ER N+T++EM +S L Sbjct: 523 GKKIKALRSDNGTEYVNRQFTEYLNTCGIKRQLTVPYTPQQNGVAERANRTIVEMAKSML 582 Query: 1338 HSSQL 1352 ++L Sbjct: 583 IHAKL 587 >emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera] Length = 1274 Score = 145 bits (367), Expect = 3e-32 Identities = 79/187 (42%), Positives = 113/187 (60%), Gaps = 3/187 (1%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT-VSHSXXXXXXXXXCDNCITAKLHKKPFNKS-SRTTTR 977 SNLWH R+GH+N + L +S + C+ CI K KKPF K SR + Sbjct: 437 
SNLWHLRYGHLNVKGLKLLSKKEMVFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASS 496 Query: 978 CLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 CLEIIH DLCGP+ ++ +Y L F+DD S+M+ VYFL+ K+E F+ F KF+ V + Sbjct: 497 CLEIIHADLCGPMQTASFGGSRYFLLFTDDHSRMSWVYFLQSKAETFETFKKFKAFVEKQ 556 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EF SN+FK + ++G+ +E PY+P+QNGV ERKN+T++EM +S Sbjct: 557 SGKCIKVLRTDRGGEFLSNDFKVFCEEEGLHRELTTPYSPEQNGVAERKNRTVVEMARSM 616 Query: 1335 LHSSQLS 1355 + + LS Sbjct: 617 MKAKNLS 623 >emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera] Length = 1472 Score = 145 bits (365), Expect = 5e-32 Identities = 79/187 (42%), Positives = 112/187 (59%), Gaps = 3/187 (1%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT-VSHSXXXXXXXXXCDNCITAKLHKKPFNKS-SRTTTR 977 SNLWH R+GH+N + L +S + C+ CI K KKPF K SR + Sbjct: 422 SNLWHLRYGHLNVKGLKLLSKKEMVFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASS 481 Query: 978 CLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 CLEIIH DLCGP+ ++ +Y L F+DD S+M+ VYFL+ K+E F+ F KF+ V + Sbjct: 482 CLEIIHADLCGPMQTASFGGSRYFLLFTDDHSRMSWVYFLQSKAETFETFKKFKAFVEKQ 541 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EF SN+FK + ++G+ +E PY+P QNGV ERKN+T++EM +S Sbjct: 542 SGKCIKVLRTDRGGEFLSNDFKVFXEEEGLHRELTTPYSPXQNGVAERKNRTVVEMARSM 601 Query: 1335 LHSSQLS 1355 + + LS Sbjct: 602 MKAKNLS 608 >gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao] Length = 1266 Score = 144 bits (363), Expect = 9e-32 Identities = 75/187 (40%), Positives = 111/187 (59%), Gaps = 4/187 (2%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT--VSHSXXXXXXXXXCDNCITAKLHKKPFNKSSRT-TT 974 + LWH R GH+N + + + + V+ C+ C+ K + PF K S+T T Sbjct: 456 ARLWHRRLGHINYQFIKNMGSLNLVNDMPVITEVEKTCEVCLQGKQSRHPFPKQSQTRAT 515 Query: 975 RCLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*T 1151 L++IH D+CGPI +L+ KY + F DD S+ ++FL+ KSE +YF+KF+ V Sbjct: 516 
NRLQLIHTDICGPIGTLSLNGNKYFILFIDDFSRFCWIFFLKQKSEAIQYFMKFKVLVEK 575 Query: 1152 ENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQS 1331 + KI DNG E+TSNEFK T++GI++ V Y+PQQNGV ERKN+T++EMI+ Sbjct: 576 QTDQKIKALRSDNGSEYTSNEFKALLTQEGIKQFLTVTYSPQQNGVSERKNRTIMEMIRC 635 Query: 1332 QLHSSQL 1352 L Q+ Sbjct: 636 LLFEQQM 642 >gb|AAT38797.2| Polyprotein, putative [Solanum demissum] Length = 1793 Score = 119 bits (299), Expect(3) = 1e-31 Identities = 71/183 (38%), Positives = 100/183 (54%), Gaps = 5/183 (2%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT--VSHSXXXXXXXXXCDNCITAKLHKKPF--NKSSRTT 971 +NLWH RFGH N ++ + V + C+ C K K PF N+ R Sbjct: 566 TNLWHKRFGHFNLRSIAEMKKKELVENMPEFLSNAQVCETCQQGKQTKLPFQANQVWRAN 625 Query: 972 TRCLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV* 1148 + L++IH D+CGPI +L KY L F DD ++M VYF+R KSEVF F +F+ V Sbjct: 626 QK-LQLIHTDVCGPIKTDSLSGNKYFLLFIDDYTRMCWVYFIRLKSEVFDVFKQFKALVE 684 Query: 1149 TENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQ 1328 + + +I DNG E TS +F ++ I + +PY PQQNGV ERKN+T++EM + Sbjct: 685 NQCNLRIKALRSDNGGEHTSFQFVEFCNSTCIECQLTLPYTPQQNGVSERKNRTVMEMAR 744 Query: 1329 SQL 1337 L Sbjct: 745 CLL 747 Score = 34.7 bits (78), Expect(3) = 1e-31 Identities = 41/168 (24%), Positives = 64/168 (38%), Gaps = 21/168 (12%) Frame = +2 Query: 8 LSSWDAFKDSIYLIRNRMPFDALCQALLEKEQGNNQKKSANYE-VQFVQKQ--------- 157 +SS++ KD ++ L AL +EQ N ++ E VQKQ Sbjct: 286 ISSFEKSKDL-----GKLSLGELMGALQAQEQRRNMRRDKFTEGAVSVQKQIFGKGKQQV 340 Query: 158 --NKKRFQDKGRYEGK-KG*NKYCHICDRNNHDTKYYFF-------NAKGPNYNPNRRSR 307 N K D G G K C C R H KY ++ N K + Sbjct: 341 NQNNKVKHDGGNNSGDVKKKFPPCKYCKRTTHLEKYCWWRVDAICGNCKQTGHISKVCKS 400 Query: 308 AKDLQQNSETHEANMVEIKEDQVY-VLYTNRDNNNAGWYLDLGCNSHM 448 + + + A+ + EDQ++ V Y + + ++ W LD GC H+ Sbjct: 401 RANASGSLQAQVADAADAHEDQLFAVSYFSINESSDSWILDSGCTHHL 448 Score = 31.2 bits (69), Expect(3) = 1e-31 Identities = 20/82 (24%), Positives = 40/82 (48%) Frame = +1 Query: 502 VKTAGNDELPIQERGDITLNFGNDTIRLTIVLYVEGLRKNLISLYKLLTEGYNLRAFQKE 681 
VK + + ++ RG ++++ + + +LY + +NL+S+ ++L Y+L K Sbjct: 466 VKVGNGEAVEVKGRGTMSISIISGIKTIPDILYTPDMSQNLLSVGQMLENNYSLHF--KN 523 Query: 682 NEETCCQISYDNKIVVKVQNIM 747 +E S VK+ NIM Sbjct: 524 HECVVSDPSGVELFYVKMSNIM 545 >emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera] Length = 1183 Score = 144 bits (362), Expect = 1e-31 Identities = 78/187 (41%), Positives = 113/187 (60%), Gaps = 3/187 (1%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT-VSHSXXXXXXXXXCDNCITAKLHKKPFNKS-SRTTTR 977 SNLWH R+GH+N + L +S + C+ CI K KKPF K SR + Sbjct: 371 SNLWHLRYGHLNVKGLKLLSKKEMVFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASS 430 Query: 978 CLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 CLEIIH DLCGP+ ++ +Y L F++D S+M+ VYFL+ K+E F+ F KF+ V + Sbjct: 431 CLEIIHADLCGPMQTASFGGSRYFLLFTNDHSRMSWVYFLQSKAETFETFKKFKAFVEKQ 490 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EF SN+FK + ++G+ +E PY+P+QNGV ERKN+T++EM +S Sbjct: 491 SGKCIKVLRTDRGGEFLSNDFKVFCEEEGLHRELTTPYSPEQNGVAERKNRTVVEMARSM 550 Query: 1335 LHSSQLS 1355 + + LS Sbjct: 551 MKAKNLS 557 >emb|CAN60238.1| hypothetical protein VITISV_032906 [Vitis vinifera] Length = 1430 Score = 143 bits (360), Expect = 2e-31 Identities = 78/187 (41%), Positives = 112/187 (59%), Gaps = 3/187 (1%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRISAT-VSHSXXXXXXXXXCDNCITAKLHKKPFNKS-SRTTTR 977 SNLWH R+GH+N + L +S + C+ CI K KKPF K SR + Sbjct: 548 SNLWHLRYGHLNVKGLKLLSKKEMVFELPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASS 607 Query: 978 CLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 CLEIIH DLCGP+ ++ +Y L F+DD S+M+ VYFL+ K+E F+ F KF+ V + Sbjct: 608 CLEIIHADLCGPMQTASFGGSRYFLLFTDDHSRMSWVYFLQSKAETFETFKKFKAFVEKQ 667 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D EF SN+FK + ++G+ +E PY+P+QNGV ERKN+T++EM +S Sbjct: 668 SGKCIKVLRTDRXGEFLSNDFKVFCEEEGLHRELTTPYSPEQNGVAERKNRTVVEMARSM 727 Query: 1335 LHSSQLS 1355 + + LS Sbjct: 728 MXAKNLS 734 >emb|CAB71063.1| copia-type polyprotein [Arabidopsis 
thaliana] Length = 1352 Score = 143 bits (360), Expect = 2e-31 Identities = 86/186 (46%), Positives = 107/186 (57%), Gaps = 5/186 (2%) Frame = +3 Query: 810 LWHNRFGHVNN---ECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNK-SSRTTTR 977 LWH RFGH+N E LSR V C+ C+ K K F K SS + Sbjct: 467 LWHLRFGHLNFGGLELLSR-KEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQK 525 Query: 978 CLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE+IH D+CGPI P +L + Y L F DD S+ T VYFL+ KSEVF+ F KF+ V E Sbjct: 526 PLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKE 585 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EFTS EF Y GIR++ VP +PQQNGVVERKN+T++EM +S Sbjct: 586 SGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSM 645 Query: 1335 LHSSQL 1352 L S +L Sbjct: 646 LKSKRL 651 >gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana] Length = 1352 Score = 143 bits (360), Expect = 2e-31 Identities = 86/186 (46%), Positives = 107/186 (57%), Gaps = 5/186 (2%) Frame = +3 Query: 810 LWHNRFGHVNN---ECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNK-SSRTTTR 977 LWH RFGH+N E LSR V C+ C+ K K F K SS + Sbjct: 467 LWHLRFGHLNFGGLELLSR-KEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQK 525 Query: 978 CLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE+IH D+CGPI P +L + Y L F DD S+ T VYFL+ KSEVF+ F KF+ V E Sbjct: 526 PLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKE 585 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EFTS EF Y GIR++ VP +PQQNGVVERKN+T++EM +S Sbjct: 586 SGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSM 645 Query: 1335 LHSSQL 1352 L S +L Sbjct: 646 LKSKRL 651 >dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] Length = 1298 Score = 128 bits (321), Expect(3) = 3e-31 Identities = 67/181 (37%), Positives = 104/181 (57%) Frame = +3 Query: 810 LWHNRFGHVNNECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNKSSRTTTRCLEI 989 LWH + GH++++ + + C++CIT+K H+ F+ S+ LE+ Sbjct: 415 
LWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSRGKVVLEL 474 Query: 990 IHLDLCGPINPSTLHEKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TENHTKI 1169 +H D+ PS KY ++F DD S+ VY ++ KS+VF F F+ RV ++ KI Sbjct: 475 VHSDVWQAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKARVELDSGKKI 534 Query: 1170 ACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQLHSSQ 1349 CF DNG E+TS EF D+ K+GI+++ V Y PQQNGV ER N+TL+E ++ L ++ Sbjct: 535 KCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLERTRAMLRAAG 594 Query: 1350 L 1352 L Sbjct: 595 L 595 Score = 31.2 bits (69), Expect(3) = 3e-31 Identities = 33/146 (22%), Positives = 58/146 (39%), Gaps = 7/146 (4%) Frame = +2 Query: 44 LIRNRMPFDALCQALLEKEQGNNQKKSANYEVQ------FVQKQNKKRFQDKGRYEGKKG 205 ++ + + FD + A+LE+E K+ +Q ++ ++ +R Q GR K Sbjct: 163 ILTDYLVFDDVAAAVLEEESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRGRSKSS 222 Query: 206 *-NKYCHICDRNNHDTKYYFFNAKGPNYNPNRRSRAKDLQQNSETHEANMVEIKEDQVYV 382 N C+ C + H K + A+ N N S + D ++ EA++ Sbjct: 223 KKNLTCYNCGKKGHLKKDCWNLAQNSNPQGNVASTSDD--GSALCCEASIAR-------- 272 Query: 383 LYTNRDNNNAGWYLDLGCNSHMTGKK 460 R W +D G HMT +K Sbjct: 273 --EGRKRFADIWLIDSGATYHMTSRK 296 Score = 24.6 bits (52), Expect(3) = 3e-31 Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 1/86 (1%) Frame = +1 Query: 490 NGKFVKTAGNDELPIQERGDITLNFGNDTIR-LTIVLYVEGLRKNLISLYKLLTEGYNLR 666 +G V + + L I G I L + T++ + V +V+GL+KNL+S Y +L Sbjct: 306 SGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLS-YGILDNS---- 360 Query: 667 AFQKENEETCCQISYDNKIVVKVQNI 744 A Q E ++ +I +V+K + I Sbjct: 361 ATQIETQKGVMKIFQGALVVMKGEKI 386 >gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1352 Score = 142 bits (358), Expect = 3e-31 Identities = 85/186 (45%), Positives = 106/186 (56%), Gaps = 5/186 (2%) Frame = +3 Query: 810 LWHNRFGHVNN---ECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNK-SSRTTTR 977 LWH RFGH+N E LSR V C+ C+ K K F K SS + Sbjct: 467 LWHLRFGHLNFGGLELLSR-KEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQK 525 Query: 978 
CLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE+IH D+CGPI P +L + Y L F DD S+ T VYFL+ KSEVF+ F KF+ V E Sbjct: 526 SLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKE 585 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EFTS EF Y GIR++ VP +PQQNGV ERKN+T++EM +S Sbjct: 586 SGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSM 645 Query: 1335 LHSSQL 1352 L S +L Sbjct: 646 LKSKRL 651 >emb|CBI37296.3| unnamed protein product [Vitis vinifera] Length = 3048 Score = 142 bits (358), Expect = 3e-31 Identities = 80/209 (38%), Positives = 115/209 (55%), Gaps = 6/209 (2%) Frame = +3 Query: 756 IMKPIPTECF-LITS*ISNLWHNRFGHVNNECLSRISAT--VSHSXXXXXXXXXCDNCIT 926 + +PI + CF +T I LWH R+GH++ + L + V+ C +C+ Sbjct: 434 LSQPISSTCFNTVTEDILQLWHCRYGHLSFQGLKTLQQRKMVNGLPQFQPPSKLCKDCLV 493 Query: 927 AKLHKKPFNKSSR-TTTRCLEIIHLDLCGPINP-STLHEKYILTFSDDLSKMT*VYFLRH 1100 K H+ K S L+++H D+CGPINP S ++Y+LTF+DD S+ T VYFL Sbjct: 494 GKQHRSSIPKKSNWRAAEILQLVHADICGPINPISNSKKRYLLTFTDDFSRKTWVYFLVE 553 Query: 1101 KSEVFKYFLKFRKRV*TENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQ 1280 KSE F F F+ V E + + C D G EFTS EF + GIR++ Y PQQ Sbjct: 554 KSEAFAVFKSFKTYVEKETSSFLRCLRTDRGGEFTSQEFAIFCDVHGIRRQLTAAYTPQQ 613 Query: 1281 NGVVERKNKTLIEMIQSQLHSSQL-STKW 1364 NGV ERKN+T++ M++S L + +L T W Sbjct: 614 NGVAERKNRTIMNMVRSMLSAKKLPKTFW 642 >ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898|gb|EDW75799.1| GK15001 [Drosophila willistoni] Length = 1249 Score = 142 bits (358), Expect = 3e-31 Identities = 78/213 (36%), Positives = 117/213 (54%), Gaps = 5/213 (2%) Frame = +3 Query: 729 KSTEHNGLFIMKPIPTECFLITS*ISNLWHNRFGHVNNECLSRISA---TVSHSXXXXXX 899 K+ + LF+ + F +LWH RFGH+N + L++I++ S Sbjct: 367 KAKKIGNLFVFEAESENLFAAVGEDVSLWHKRFGHLNYKSLTQIASKGLVRGLSVTNFAP 426 Query: 900 XXXCDNCITAKLHKKPFNKSSRT-TTRCLEIIHLDLCGPINPSTLH-EKYILTFSDDLSK 1073 C C+ +K+H +PF K + + ++ L+++H D+CGP +L +Y LTF DD S+ Sbjct: 427 
NTPCKTCMVSKIHVQPFPKMTESRSSELLQLVHSDVCGPFGTKSLGGSRYFLTFIDDKSR 486 Query: 1074 MT*VYFLRHKSEVFKYFLKFRKRV*TENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKE 1253 VYFL+ K EVF FL+F+ V + K+ C DNG E+ +N F DY K GI ++ Sbjct: 487 RIFVYFLKGKDEVFGKFLEFKSLVERQTGKKLKCIRSDNGREYVNNAFDDYLKKNGILRQ 546 Query: 1254 EIVPYNPQQNGVVERKNKTLIEMIQSQLHSSQL 1352 + Y PQQNGV ER N+TL+EM + L S L Sbjct: 547 LTIAYTPQQNGVAERANRTLVEMSRCLLAQSGL 579 >emb|CAN71037.1| hypothetical protein VITISV_011061 [Vitis vinifera] Length = 1220 Score = 108 bits (271), Expect(3) = 4e-31 Identities = 67/187 (35%), Positives = 97/187 (51%), Gaps = 4/187 (2%) Frame = +3 Query: 804 SNLWHNRFGHVNNECLSRI--SATVSHSXXXXXXXXXCDNCITAKLHKKPFNKS-SRTTT 974 S +WH +GH N + L + + V C++C K ++PF ++ S+ T Sbjct: 433 SVVWHKSYGHFNLKSLRFMQEAGMVEDMLEISVNAQTCESCELGKQQRQPFPQNMSKRAT 492 Query: 975 RCLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*T 1151 LE+IH +CGP++ ++L Y F DDLS+MT VYFL+ KS+V F F+K V T Sbjct: 493 HELELIHSYICGPMSIASLSNNVYFALFIDDLSRMTWVYFLKTKSQVLSMFKSFKKMVET 552 Query: 1152 ENHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQS 1331 ++ + DNG E+ S EF PY PQQN V ERKNKT++EM + Sbjct: 553 QSGQNVKVLIIDNGGEYISKEF-----------NLTAPYLPQQNEVSERKNKTVMEMARC 601 Query: 1332 QLHSSQL 1352 L +L Sbjct: 602 MLFEKRL 608 Score = 42.7 bits (99), Expect(3) = 4e-31 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 17/116 (14%) Frame = +2 Query: 155 QNKKRFQD-KGRYEG--KKG*NKYCHICDRNNHDTKYYFFNAKGPNYNPNR--------- 298 Q KK F++ KG+ EG +KG C C R NH K + K P +N N Sbjct: 203 QGKKFFKNNKGKVEGFSRKGKFPSCFHCRRTNHAEKDCWNKGK-PLFNCNFCNKLCHSEK 261 Query: 299 --RSRAKDLQQNSETHEANMVEIKEDQVYVLYTNRDNNNAG---WYLDLGCNSHMT 451 R++ K QQ E + + + E K D ++ ++ ++ W +D GC SHMT Sbjct: 262 YCRAKKKQSQQQPEKNASVIEENKNDDEHLFMASQTLSSHELNTWLIDSGCTSHMT 317 Score = 32.0 bits (71), Expect(3) = 4e-31 Identities = 20/79 (25%), Positives = 38/79 (48%) Frame = +1 Query: 502 VKTAGNDELPIQERGDITLNFGNDTIRLTIVLYVEGLRKNLISLYKLLTEGYNLRAFQKE 681 VK + + + +G I ++ T +T VLY+ L +NL+S+ ++L GY + Sbjct: 334 
VKLGNGEVVQAKGKGTIAISTKRGTKIVTNVLYIPDLDQNLLSVAQMLRNGYAV----SF 389 Query: 682 NEETCCQISYDNKIVVKVQ 738 E C + K + K++ Sbjct: 390 KENFCFITNVQEKEIAKIK 408 >gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-type; Peptidase aspartic, catalytic [Medicago truncatula] Length = 1715 Score = 110 bits (275), Expect(3) = 5e-31 Identities = 68/187 (36%), Positives = 99/187 (52%), Gaps = 5/187 (2%) Frame = +3 Query: 810 LWHNRFGHVNNECLSRISA---TVSHSXXXXXXXXXCDNCITAKLHKKPF-NKSSRTTTR 977 +WH R GH N +S+IS C C K+ K F +K +T+R Sbjct: 799 VWHKRLGHANWRLISKISKLQLVKGLPNIDYHSDALCGACQKGKIVKSSFKSKDIVSTSR 858 Query: 978 CLEIIHLDLCGPINPSTLH-EKYILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE++H+DL GP+N ++L+ KY L DD S+ T V F++ K + F F ++ +E Sbjct: 859 PLELLHIDLFGPVNTASLYGSKYGLVIVDDYSRWTWVKFIKSKDYACEVFSSFCTQIQSE 918 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 KI D+G EF + F+ + K GI E P PQQNGVVERKN+TL EM ++ Sbjct: 919 KELKILKVRSDHGGEFENEPFELFCEKHGILHEFSSPRTPQQNGVVERKNRTLQEMARTM 978 Query: 1335 LHSSQLS 1355 +H + L+ Sbjct: 979 IHENNLA 985 Score = 40.8 bits (94), Expect(3) = 5e-31 Identities = 32/100 (32%), Positives = 51/100 (51%), Gaps = 4/100 (4%) Frame = +1 Query: 457 KMTFLNLKVSNNGKFVKTAGNDELPIQERGDITLNFGNDTIRLTIVLYVEGLRKNLISLY 636 K FL L + + G+ VK GN I G I GN +I + V V+GL+ NL+S+ Sbjct: 684 KALFLTLTMKDGGE-VKFGGNQTGKIIGTGTI----GNSSISINNVWLVDGLKHNLLSIS 738 Query: 637 KLLTEGYNLRAFQKENEETCCQISYDNKIVV----KVQNI 744 + GY++ F K N C ++ D+K + +V+N+ Sbjct: 739 QFCDNGYDV-TFSKTN---CTLVNKDDKSITFKGKRVENV 774 Score = 32.0 bits (71), Expect(3) = 5e-31 Identities = 11/15 (73%), Positives = 13/15 (86%) Frame = +2 Query: 416 WYLDLGCNSHMTGKK 460 WYLD GC+ HMTG+K Sbjct: 670 WYLDSGCSRHMTGEK 684 >gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana] gi|12321387|gb|AAG50765.1|AC079131_10 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1320 Score = 141 bits (356), Expect = 6e-31 Identities = 85/186 (45%), Positives = 106/186 (56%), Gaps = 
5/186 (2%) Frame = +3 Query: 810 LWHNRFGHVNN---ECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNK-SSRTTTR 977 LWH RFGH+N E LSR V C+ C+ K K F K SS + Sbjct: 467 LWHLRFGHLNFGGLELLSR-KEMVRGLPCINHPNQVCEGCLLGKQFKMSFPKESSSRAQK 525 Query: 978 CLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE+IH D+CGPI P +L + Y L F DD S+ T VYFL+ KSEVF+ F KF+ V E Sbjct: 526 PLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKE 585 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D G EFTS EF Y GIR++ VP +PQQNGV ERKN+T++EM +S Sbjct: 586 SGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSM 645 Query: 1335 LHSSQL 1352 L S +L Sbjct: 646 LKSKRL 651 >emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1272 Score = 140 bits (352), Expect = 2e-30 Identities = 84/186 (45%), Positives = 106/186 (56%), Gaps = 5/186 (2%) Frame = +3 Query: 810 LWHNRFGHVNN---ECLSRISATVSHSXXXXXXXXXCDNCITAKLHKKPFNK-SSRTTTR 977 LWH RFGH+N E LSR V C+ C+ K F K SS + Sbjct: 467 LWHLRFGHLNFGGLELLSR-KEMVRGLPCINHPNQVCEGCLLGNQFKMSFPKESSSRAQK 525 Query: 978 CLEIIHLDLCGPINPSTLHEK-YILTFSDDLSKMT*VYFLRHKSEVFKYFLKFRKRV*TE 1154 LE+IH D+CGPI P +L + Y L F DD S+ T VYFL+ KSEVF+ F KF+ V E Sbjct: 526 PLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKE 585 Query: 1155 NHTKIACFCFDNGIEFTSNEFKDYYTKKGIRKEEIVPYNPQQNGVVERKNKTLIEMIQSQ 1334 + I D+G EFTS EF Y GIR++ VP +PQQNGV ERKN+T++EM +S Sbjct: 586 SGLVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSM 645 Query: 1335 LHSSQL 1352 L S +L Sbjct: 646 LKSKRL 651