BLASTX nr result
ID: Mentha24_contig00024088
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha24_contig00024088 (2071 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAK29467.1| polyprotein-like [Solanum chilense] 127 2e-26 sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly... 122 6e-25 ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatul... 118 1e-23 ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago trun... 116 5e-23 ref|XP_007038204.1| Transducin/WD40 repeat-like superfamily prot... 114 2e-22 emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera] 110 2e-21 ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobrom... 110 2e-21 emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] 109 6e-21 gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subc... 108 7e-21 gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subc... 108 1e-20 emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] 107 2e-20 gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subc... 106 5e-20 gb|ABD96963.1| hypothetical protein [Cleome spinosa] 106 5e-20 emb|CAA31653.1| polyprotein [Arabidopsis thaliana] 104 1e-19 emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis] 104 1e-19 emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] 100 4e-18 emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] 98 1e-17 ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225... 96 8e-17 gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop... 96 8e-17 emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] ... 95 1e-16 >gb|AAK29467.1| polyprotein-like [Solanum chilense] Length = 1328 Score = 127 bits (319), Expect = 2e-26 Identities = 83/265 (31%), Positives = 134/265 (50%), Gaps = 3/265 (1%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSD-FGMWKRKMKCILIDKRAYKAI--TLEYXXXXXXXXXXXXXXLA 1333 MS + Y + F+G F MW+R+MK +LI + +KA+ + A Sbjct: 1 MSGVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKA 60 Query: 1334 ISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSIT 1513 S I L L+D V+ ++ + +SA +W KLE LY +L ++++L + ++ +D + Sbjct: 61 ASAIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFL 120 Query: 1514 ENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNAL 1693 +LNV N LI + G K + VLLN++P SY + + I +G+D + L V +AL Sbjct: 121 SHLNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSAL 180 Query: 1694 KHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 1873 EK ++ +VF + SY + + + SG + K + + K R C Sbjct: 181 LLNEK----MRKKPENHGQVFITESRGRSYQRSSSNYGRSG---ARGKSKVRSKSKARNC 233 Query: 1874 YNCGEIGHYVRDCPNPKRNQKGEQA 1948 YNC + GH+ RDCPNPKR KGE + Sbjct: 234 YNCDQPGHFKRDCPNPKRG-KGESS 257 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease gi|20045|emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 122 bits (306), Expect = 6e-25 Identities = 74/262 (28%), Positives = 134/262 (51%), Gaps = 2/262 (0%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXX--LAI 1336 MS + Y + F+G + F W+R+M+ +LI + +K + ++ A Sbjct: 1 MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60 Query: 1337 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 1516 S I L LSD V+ ++ + D+A+ +W +LE+LY +L ++++L + ++ + + Sbjct: 61 SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 120 Query: 1517 NLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 1696 +LNVFN LI + G K + +LLN++P SY ++ + I +G+ + L V +AL Sbjct: 121 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 180 Query: 1697 HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 1876 EK ++ + G+ SY + + + SG + K + + + R CY Sbjct: 181 LNEK----MRKKPENQGQALITEGRGRSYQRSSNNYGRSG---ARGKSKNRSKSRVRNCY 233 Query: 1877 NCGEIGHYVRDCPNPKRNQKGE 1942 NC + GH+ RDCPNP++ KGE Sbjct: 234 NCNQPGHFKRDCPNPRKG-KGE 254 >ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatula] gi|355500592|gb|AES81795.1| Ubiquitin-protein ligase [Medicago truncatula] Length = 1405 Score = 118 bits (295), Expect = 1e-23 Identities = 86/265 (32%), Positives = 132/265 (49%), Gaps = 7/265 (2%) Frame = +2 Query: 1193 FDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AISVICLGLSDC 1366 F G +DFG+WK KM+ +LI ++ KA+ E + A S + L L D Sbjct: 10 FTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVLCLGDK 69 Query: 1367 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 1546 VL V +A +W KLE+LY SLA + FL + +SF++ +K+I E L FNK++ Sbjct: 70 VLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILD 129 Query: 1547 DIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRD-KVTLDVVMNALKHKE----KD 1711 D++ + D +LL A+P+S+ K + YG++ VTL+ V AL+ KE KD Sbjct: 130 DLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKELTKSKD 189 Query: 1712 LNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEI 1891 L H +G+ S + N G+ +KS +K F+ C+NC ++ Sbjct: 190 LT-------------HEHGEGLSVTRGNGGGRGN-RRKSGNKSRFE-------CFNCHKM 228 Query: 1892 GHYVRDCPNPKRNQKGEQANVVSAG 1966 GH+ +DCP G A +VS G Sbjct: 229 GHFKKDCP----EINGNSAQIVSEG 249 >ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago truncatula] gi|355514659|gb|AES96282.1| Cc-nbs-lrr resistance protein [Medicago truncatula] Length = 1104 Score = 116 bits (290), Expect = 5e-23 Identities = 86/265 (32%), Positives = 130/265 (49%), Gaps = 7/265 (2%) Frame = +2 Query: 1193 FDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AISVICLGLSDC 1366 F G +DFG+WK KM+ +LI ++ KA+ E + A S + L L D Sbjct: 10 FTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVLCLGDK 69 Query: 1367 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 1546 VL V +A +W KLE+LY SLA + FL + +SF++ +K+I E L FNK++ Sbjct: 70 VLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILD 129 Query: 1547 DIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRD-KVTLDVVMNALKHKE----KD 1711 D++ + D +LL A+P+S+ K + YG++ VTL+ V AL+ KE KD Sbjct: 130 DLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKELTKSKD 189 Query: 1712 LNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEI 1891 L H G S + N G+ +KS +K F+ C+NC ++ Sbjct: 190 LT-------------HEYGDGLSVTRGNGGGRGN-RRKSGNKSRFE-------CFNCHKM 228 Query: 1892 GHYVRDCPNPKRNQKGEQANVVSAG 1966 GH+ +DCP G A +VS G Sbjct: 229 GHFKKDCP----EINGNSAQIVSEG 249 >ref|XP_007038204.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao] gi|508775449|gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao] Length = 1029 Score = 114 bits (285), Expect = 2e-22 Identities = 80/272 (29%), Positives = 134/272 (49%), Gaps = 8/272 (2%) Frame = +2 Query: 1166 SSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AIS 1339 SS Y + F+G +DF +W+ KM+ +L+ + KA+ + + A S Sbjct: 126 SSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKGKEHLPSNLSDGEKDDLMKKAHS 185 Query: 1340 VICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITEN 1519 VI L LSD VL V + +SA +W KLE++Y SL +++++ + ++ K+ S+ + Sbjct: 186 VILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYMKQRLYTLKMSEGTSVNTH 245 Query: 1520 LNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKH 1699 ++ FN++I D+K K D +LL +P SY + + YGRD +T + V +L Sbjct: 246 IDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRASLNF 305 Query: 1700 KE--KDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 1873 KE K + ++N + V + G G +K D+ K R K + C Sbjct: 306 KELKKKVGGIRNENQAEGLVVN---------------RGRGKEKGLDRKG-KSRAKGKTC 349 Query: 1874 YNCGEIGHYVRDC----PNPKRNQKGEQANVV 1957 +NCG+ GH+ +DC + K N+ ANVV Sbjct: 350 WNCGQKGHFRQDCTKFKDDEKFNKSENTANVV 381 >emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera] Length = 950 Score = 110 bits (276), Expect = 2e-21 Identities = 79/261 (30%), Positives = 130/261 (49%), Gaps = 2/261 (0%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AI 1336 MSS + + F+GS+DF +WK KMK +L+ ++ +AI E + A Sbjct: 1 MSSQKFEVEKFNGSNDFTLWKLKMKALLVQQKCAQAIEGEETLPVGLTAVEKEEVVSRAH 60 Query: 1337 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 1516 S I L L+D VL V + +A LW K E+ Y + SL ++++ + K+ + + Sbjct: 61 SAILLSLADEVLREVADETTAVGLWRKFESKYQKKSLTNRLYQKRQLHTLKMSEGMQVRD 120 Query: 1517 NLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 1696 +LN FN++I D+ G K + +LL ++P SY + + YGR+ ++ + V +AL+ Sbjct: 121 HLNNFNRIILDLNGVGVKVEEEDQAMILLCSLPSSYENFVDTMMYGRBSISXNDVKDALQ 180 Query: 1697 HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 1876 KE L+ + SG ++ G T S + E NG G KS K RC+ Sbjct: 181 SKE-----LQKLVSGSEEGSVETGLTVSRGRSMER-NGGGRSKSXSKSK-----AAMRCF 229 Query: 1877 NCGEIGHYVRDCPNPKRNQKG 1939 + E GH+ ++CP + QKG Sbjct: 230 HXKEKGHFRKNCP---QRQKG 247 >ref|XP_007028624.1| Uncharacterized protein TCM_024518 [Theobroma cacao] gi|508717229|gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] Length = 277 Score = 110 bits (275), Expect = 2e-21 Identities = 80/272 (29%), Positives = 131/272 (48%), Gaps = 8/272 (2%) Frame = +2 Query: 1166 SSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AIS 1339 SS Y + F+G +DF +W+ KM +L+ + KA+ + + A S Sbjct: 4 SSTKYEIEKFNGRNDFSLWRVKMCALLVQQGLLKALKGKEHLPSNLSDSEKDDLMEKAHS 63 Query: 1340 VICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITEN 1519 I L LSD VL V + +SA +W KLE++Y SL +++++ + ++ K+ S+ + Sbjct: 64 AILLTLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYMKQRLYTLKMSEGTSVNTH 123 Query: 1520 LNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKH 1699 ++ FN++I D+K K D +LL +P SY + + YGRD +T + V L Sbjct: 124 IDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAYLNS 183 Query: 1700 KE--KDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 1873 KE K + ++N + V + G G +K DK K R K + C Sbjct: 184 KELKKKVGGIRNENQAEGLVVN---------------RGRGKEKGLDKKG-KSRAKGKTC 227 Query: 1874 YNCGEIGHYVRDC----PNPKRNQKGEQANVV 1957 +NCG+ GH+ +DC + K N+ ANVV Sbjct: 228 WNCGQKGHFRQDCTKFKDDEKFNKSENTANVV 259 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 109 bits (272), Expect = 6e-21 Identities = 77/265 (29%), Positives = 128/265 (48%), Gaps = 2/265 (0%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AI 1336 M ++ + + F G +DFG+W+ KM+ +L+ + A+ E L A Sbjct: 1 MGTVKFDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAH 60 Query: 1337 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 1516 I L L D L V SA L KLE+LY SLA+++ ++FK+ + SI E Sbjct: 61 GAIILSLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEE 120 Query: 1517 NLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 1696 +L+ FNK+I D+K + +LL ++ SY ++K AI YGRD +T D V + L Sbjct: 121 HLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILH 180 Query: 1697 HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 1876 +E L+ + + ++ GK+ + K +N K K + K +C+ Sbjct: 181 ARE--LHKQEESKEELGEGLNIRGKSKKREK---------KKGNNSKSRSKSKTKKFKCF 229 Query: 1877 NCGEIGHYVRDCPNPKRNQKGEQAN 1951 C + GH+ +DCP+ ++N + N Sbjct: 230 ICHKEGHFKKDCPDMRQNTXKKTMN 254 >gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 415 Score = 108 bits (271), Expect = 7e-21 Identities = 69/264 (26%), Positives = 132/264 (50%) Frame = +2 Query: 1178 YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXLAISVICLGL 1357 + +V FDG+ +F +W+ ++K +L + KA+ A + I L L Sbjct: 9 FEVVKFDGTGNFVLWQMRLKDLLAQQGISKALQETMPEKIDADKWNEMKAQAAATIRLSL 68 Query: 1358 SDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNK 1537 SD V+ V + S K++W+KL +L+ SL S+++L + + ++ + ++++VFN+ Sbjct: 69 SDSVMYQVMDEKSPKEIWDKLASLHMSKSLTSKLYLKQQLYGLQVQEESDLRKHVDVFNQ 128 Query: 1538 LIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKHKEKDLN 1717 L+ D+ + K D +LL ++P SY V + + +G+D V + ++++L Sbjct: 129 LVVDLSKLDVKLDDEDKAIILLCSLPLSYEHVVTTLTHGKDTVKTEEIISSL-------- 180 Query: 1718 LLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEIGH 1897 L +++ + S ++H + +G KS +KGA RCY C E GH Sbjct: 181 LARDLRRSKKNEATKASQGKSLLVKDKHDHEAGVSKSKEKGA--------RCYKCHEFGH 232 Query: 1898 YVRDCPNPKRNQKGEQANVVSAGE 1969 R+CP K+ +KG A++ + G+ Sbjct: 233 IRRNCPLLKK-RKGGIASLAARGD 255 >gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 425 Score = 108 bits (269), Expect = 1e-20 Identities = 73/272 (26%), Positives = 135/272 (49%), Gaps = 3/272 (1%) Frame = +2 Query: 1163 MSSMV---YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXLA 1333 M++MV + +V FDG+ +F +W+ ++K +L + KA+ A Sbjct: 1 MAAMVVSKFEVVKFDGTGNFILWQMRLKDLLAQQGISKALEETMPEKMDAGKWEEMKAQA 60 Query: 1334 ISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSIT 1513 + I L LSD V+ V + + K++W+KL +LY SL S+++L + + ++ + Sbjct: 61 AATIRLSLSDSVMYPVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLR 120 Query: 1514 ENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNAL 1693 ++++VFN+L+ D+ + K D +LL ++P SY V + + +G+D V + +++L Sbjct: 121 KHVDVFNQLVVDLSKLDVKLDDEDMAIILLCSLPPSYEHVVTTLMHGKDTVKTEEKISSL 180 Query: 1694 KHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRC 1873 L +++ + S +H + +G KS DKGA RC Sbjct: 181 --------LARDLRRSNKNEAMEASQAESLLVKAKHDHEAGVSKSKDKGA--------RC 224 Query: 1874 YNCGEIGHYVRDCPNPKRNQKGEQANVVSAGE 1969 Y C E GH R+CP K+ +KG A++ + G+ Sbjct: 225 YKCHEFGHIRRNCPLLKK-RKGGIASLAARGD 255 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 107 bits (268), Expect = 2e-20 Identities = 75/265 (28%), Positives = 124/265 (46%), Gaps = 2/265 (0%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXL--AI 1336 M + + + F G +DFG+ + KM+ +L+ + A+ E L A Sbjct: 1 MGTAKFDVEKFTGKNDFGLXRLKMRALLVQQGLQDALLGEKNLPSTMQEKQKIELLEKAH 60 Query: 1337 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 1516 S I L L D VL SA ++W KLE+LY SLA+++ ++FK+ SI Sbjct: 61 SAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEX 120 Query: 1517 NLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 1696 +L+ FNK+I D++ D +LL ++ SY ++K AI YGRD +T D V + L Sbjct: 121 HLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILH 180 Query: 1697 HKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCY 1876 +E SG + ++ G++ + K N K K + K +C+ Sbjct: 181 ARELQKQEESKEESG--EGLNIRGRSEKREK----------KGKNSKSRSKSKTKKFKCF 228 Query: 1877 NCGEIGHYVRDCPNPKRNQKGEQAN 1951 C + GH+ +DCP+ ++N + N Sbjct: 229 ICHKEGHFKKDCPDRRQNTVKKTVN 253 >gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 424 Score = 106 bits (264), Expect = 5e-20 Identities = 67/254 (26%), Positives = 124/254 (48%) Frame = +2 Query: 1178 YGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLEYXXXXXXXXXXXXXXLAISVICLGL 1357 + +V FDG+ +F +W+ ++K +L + KA+ A + I L L Sbjct: 9 FEVVKFDGTGNFVLWQMRLKDLLAQQGISKALEETMPEKMDAGKWEEMKAQAAATIRLSL 68 Query: 1358 SDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNK 1537 SD V+ V + ++K++W KL +LY SL S+++L + + ++ + ++++VFN+ Sbjct: 69 SDSVMYQVMDEKTSKEIWVKLTSLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVDVFNQ 128 Query: 1538 LIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALKHKEKDLN 1717 L+ D+ + K D +LL ++P SY V + + +G+D + +++ + L +DL Sbjct: 129 LVVDLSKLDVKLDDEDKAIILLCSLPPSYEHVVTILTHGKDTIKTEIISSLL---ARDLR 185 Query: 1718 LLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRRCYNCGEIGH 1897 K + + S +H + +G KS +KGA RCY C E GH Sbjct: 186 RSKKNEA------MEASQAESLLVKAKHDHEAGVSKSKEKGA--------RCYKCHEFGH 231 Query: 1898 YVRDCPNPKRNQKG 1939 R+CP K+ + G Sbjct: 232 IRRNCPLLKKRKDG 245 >gb|ABD96963.1| hypothetical protein [Cleome spinosa] Length = 408 Score = 106 bits (264), Expect = 5e-20 Identities = 63/211 (29%), Positives = 111/211 (52%) Frame = +2 Query: 1331 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 1510 A ++I L L+D VL V + +A +W KLE L+ E SL ++M+L + F++D +++I Sbjct: 98 ARNLIVLALADQVLRKVISERTAFGIWRKLERLHIEQSLPNRMYLMQRVSGFRMDSSRTI 157 Query: 1511 TENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 1690 ENL++F KL+ D+ K + Y LLN++P +Y ++ +KY R ++++ V A Sbjct: 158 EENLDIFQKLLSDLHSLNVKVEEEYQAVYLLNSLPPAYEQLREVLKYSRATISVEEVKAA 217 Query: 1691 LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRR 1870 + KE +L + G + V GK +G G KK+ D+ Sbjct: 218 ARMKELELLAQGTLTRGTGEGLVVKGKPEK--------SGGGKKKAKDQ---------VE 260 Query: 1871 CYNCGEIGHYVRDCPNPKRNQKGEQANVVSA 1963 C+ CG+ GHY ++C + + ++ E VV++ Sbjct: 261 CWYCGKKGHYKKECRSRRAKEETEGKGVVAS 291 >emb|CAA31653.1| polyprotein [Arabidopsis thaliana] Length = 1291 Score = 104 bits (260), Expect = 1e-19 Identities = 65/218 (29%), Positives = 106/218 (48%), Gaps = 3/218 (1%) Frame = +2 Query: 1331 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 1510 A+++I + D VL +D+ SA ++WE L Y ETSL +++++ F+SFK++ TKSI Sbjct: 90 AMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMNDTKSI 149 Query: 1511 TENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 1690 EN+N F K++ ++ ++ + LN + Y +K +KYG ++L V++A Sbjct: 150 NENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLKDVISA 209 Query: 1691 LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSY---YQHNEHANGSGDKKSNDKGAFKPRYK 1861 + E++LN K V + N ++ HN+ G G KSN Sbjct: 210 ARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSNAKL----- 264 Query: 1862 PRRCYNCGEIGHYVRDCPNPKRNQKGEQANVVSAGETT 1975 C+ C + GH +D KR K E N AG T Sbjct: 265 --TCWYCKKEGHVKKDYFARKR--KLESENPGEAGVIT 298 >emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis] Length = 1334 Score = 104 bits (260), Expect = 1e-19 Identities = 70/267 (26%), Positives = 125/267 (46%), Gaps = 5/267 (1%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAITLE----YXXXXXXXXXXXXXXL 1330 MS + + F DF +WK KMK +L+ + A+ E Sbjct: 1 MSLPRHEIEKFTIGGDFSLWKLKMKALLVHQGLESALDEEDLEASTGSGIDDKRRQIQNR 60 Query: 1331 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 1510 A S + L L D +L + +A +W K+ETL + SLA ++FL + ++F + +I Sbjct: 61 AHSTLILSLGDSILREISEEKTALGIWNKVETLCMKKSLAHRLFLKKRLYTFSMREGVTI 120 Query: 1511 TENLNVFNKLIKDIKQTGD-KGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMN 1687 ++++ FNK+I D++ + K D + LL+++P+SY + YGR +TL+ V Sbjct: 121 QDHIDTFNKIILDLEGVENVKICDEDKAFFLLSSLPKSYEGFVDTMLYGRTTLTLEDVKA 180 Query: 1688 ALKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPR 1867 +L KE N +G + K + ++ +G + ++ K K R Sbjct: 181 SLSSKEIQKNCELETSNGEGLMARTEKKKDQKNKNQGKGHGKNQETADKK------KKKR 234 Query: 1868 RCYNCGEIGHYVRDCPNPKRNQKGEQA 1948 +C+ C + GHY+RDC K+ + E++ Sbjct: 235 KCFYCRKEGHYIRDCFEKKKKESQEKS 261 >emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] Length = 560 Score = 99.8 bits (247), Expect = 4e-18 Identities = 61/218 (27%), Positives = 107/218 (49%), Gaps = 3/218 (1%) Frame = +2 Query: 1331 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 1510 A+++I + D VL +D+ SA ++W+ L Y ETSL +++++ F+SFK++ +KSI Sbjct: 78 AMNIIITHVGDAVLRKIDHCKSAAEMWKTLNKQYMETSLPNRIYVQLKFYSFKMNDSKSI 137 Query: 1511 TENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 1690 EN+N F K++ ++ ++ + LN + Y +K +KYG ++L V+++ Sbjct: 138 NENVNEFLKIVAELSSLEINVVEEVRAILFLNGLSSRYSQLKHTLKYGNKALSLQDVISS 197 Query: 1691 LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQH---NEHANGSGDKKSNDKGAFKPRYK 1861 + E++L+ K V + N + ++ N+ G G KSN Sbjct: 198 ARSLERELDEQKETDKNTSTVLYTNERGRPLTRNQNQNKGGQGRGRSKSNSNAKL----- 252 Query: 1862 PRRCYNCGEIGHYVRDCPNPKRNQKGEQANVVSAGETT 1975 C+ C + GH +DC KR K E N AG T Sbjct: 253 --TCWYCKKEGHVKKDCFARKR--KLESENPGEAGVIT 286 >emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] Length = 939 Score = 98.2 bits (243), Expect = 1e-17 Identities = 58/186 (31%), Positives = 95/186 (51%), Gaps = 2/186 (1%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKCILIDKRAYKAIT--LEYXXXXXXXXXXXXXXLAI 1336 M S+ + F G +DF +W+ +MK IL + A+ E A Sbjct: 1 MGSIKSEIERFIGKNDFNVWRMRMKAILFQQGVKDALKDESELPVTMTAKEKSDIDEKAY 60 Query: 1337 SVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITE 1516 +I L L D L +AK +W KLE LY + SL+++++L E + FK+ +SI + Sbjct: 61 HLIILALGDKALREFSEETTAKGVWNKLEQLYMQNSLSNRLYLKERLYGFKMQEDRSIAD 120 Query: 1517 NLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNALK 1696 NL+ F K++ ++ G K D ++L ++P Y + K +KYGR +TL+ V +AL+ Sbjct: 121 NLDDFAKIVLEMSNIGIKVDDEDKAVLVLKSLPGLYSNFKETMKYGRKTLTLEEVQSALR 180 Query: 1697 HKEKDL 1714 KE +L Sbjct: 181 SKELEL 186 >ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225243 [Cucumis sativus] Length = 158 Score = 95.5 bits (236), Expect = 8e-17 Identities = 50/145 (34%), Positives = 84/145 (57%), Gaps = 2/145 (1%) Frame = +2 Query: 1193 FDGSSDFGMWKRKMKCILIDKRAYKAIT--LEYXXXXXXXXXXXXXXLAISVICLGLSDC 1366 FDG DF +WK K+K +L ++A+KA+ LE +A + L +SD Sbjct: 11 FDGKGDFALWKAKIKALLGQQKAHKALLDPLELPTILTATQKEEIKLIAYGTLILNISDN 70 Query: 1367 VLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSITENLNVFNKLIK 1546 ++ V ++A +W+KLE+LYA L +++ L E F++K+D +K++TENL+ F K++ Sbjct: 71 IIRQVLEEETAHKVWKKLESLYATKDLPNKICLREKIFTYKMDSSKTLTENLDEFKKIVS 130 Query: 1547 DIKQTGDKGIDVYAPYVLLNAIPES 1621 + K DK D +VLLN +P++ Sbjct: 131 NFKSLEDKLDDENEAFVLLNFLPKA 155 >gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1137 Score = 95.5 bits (236), Expect = 8e-17 Identities = 61/212 (28%), Positives = 108/212 (50%), Gaps = 4/212 (1%) Frame = +2 Query: 1331 AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETFFSFKIDVTKSI 1510 A+ +I + + D VL +++N +A + W L+ LY SL ++++L +++++ +K++ Sbjct: 44 AMDMIFINVGDKVLRNIENSKTAAEAWATLDKLYLVKSLPNRVYLQLKVYNYRMQDSKTL 103 Query: 1511 TENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGRDKVTLDVVMNA 1690 EN++ F K+I D+ + D ++L+A+P+SY +K +KYGR+ + LD V++A Sbjct: 104 EENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKYGREGIKLDDVISA 163 Query: 1691 LKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSNDKGAFKPRYKPRR 1870 K KE +L + +V GK+ A GS KS + + Sbjct: 164 AKSKELELRDSSGGSRPVGEGLYVRGKS--------QARGSDGPKSTE--------GKKV 207 Query: 1871 CYNCGEIGHYVRDC----PNPKRNQKGEQANV 1954 C+ CG+ GH+ R C K N GE A V Sbjct: 208 CWICGKEGHFKRQCYKWLEKNKANGAGETALV 239 >emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] gi|7267743|emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] Length = 1230 Score = 95.1 bits (235), Expect = 1e-16 Identities = 80/288 (27%), Positives = 130/288 (45%), Gaps = 19/288 (6%) Frame = +2 Query: 1163 MSSMVYGLVPFDGSSDFGMWKRKMKC------ILIDKRAYKAIT--LEYXXXXXXXXXXX 1318 MSS + FDG D+ +WK K+ + + R ++++ LE Sbjct: 1 MSSARVEMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGD 60 Query: 1319 XXXL-------AISVICLGLSDCVLMHVDNIDSAKDLWEKLETLYAETSLASQMFLFETF 1477 L A S I L +SD VL +A + E L+ LY +L ++++L + Sbjct: 61 KEALMEEKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKL 120 Query: 1478 FSFKIDVTKSITENLNVFNKLIKDIKQTGDKGIDVYAPYVLLNAIPESYGDVKSAIKYGR 1657 +S+K+ S+ N++ F +LI D++ T D +LL ++P+ + +K +KYG Sbjct: 121 YSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGS 180 Query: 1658 DKVTLDV--VMNALKHKEKDLNLLKNVHSGPDKVFHVNGKTNSYYQHNEHANGSGDKKSN 1831 + TL V V+ A+ KE +L K G + +V K E S K+ Sbjct: 181 GRTTLSVDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKP-------ETRGMSEQKEKG 233 Query: 1832 DKGAFKPRYKP-RRCYNCGEIGHYVRDCPNP-KRNQKGEQANVVSAGE 1969 +KG + R K + C+ CGE GH+ CPN K+ KG+ S GE Sbjct: 234 NKGRSRSRSKGWKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGE 281