BLASTX nr result
ID: Mentha29_contig00033003
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00033003 (1065 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498... 127 5e-36 ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [A... 122 1e-33 ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612... 126 4e-33 ref|XP_007023601.1| Uncharacterized protein TCM_027661 [Theobrom... 114 2e-31 emb|CAB81134.1| putative athila transposon protein [Arabidopsis ... 129 2e-27 dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis t... 129 2e-27 gb|AAF63125.1|AC009526_10 Similar to Athila ORF 1 [Arabidopsis t... 128 4e-27 gb|AAB18645.1| unknown [Hordeum vulgare] 116 5e-27 gb|AAD20430.1| putative Athila retroelement ORF1 protein [Arabid... 127 6e-27 emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera] 127 8e-27 pir||C85079 hypothetical protein AT4g08050 [imported] - Arabidop... 127 1e-26 gb|AAF67369.1| Hypothetical protein T15F17.a [Arabidopsis thaliana] 126 1e-26 gb|AAF67381.1| Hypothetical protein T15F17.m [Arabidopsis thaliana] 126 1e-26 gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis] 101 2e-26 ref|XP_006596755.1| PREDICTED: uncharacterized protein LOC102663... 113 3e-26 gb|AAF63128.1|AC009526_13 Similar to Athila ORF 1 [Arabidopsis t... 125 4e-26 pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrot... 124 7e-26 gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana] 122 3e-25 ref|XP_006575927.1| PREDICTED: uncharacterized protein LOC102669... 115 3e-25 gb|AAD19759.1| putative Athila retroelement ORF1 protein [Arabid... 121 4e-25 >ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498022 [Cicer arietinum] Length = 544 Score = 127 bits (320), Expect(2) = 5e-36 Identities = 58/120 (48%), Positives = 85/120 (70%) Frame = +2 Query: 365 MVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGFSLSDRDKDWYKGLN 544 MV+ +QL G P +DPN +LS+ L+ CDT+K+NGV+ + IR+++F F L DR + W L Sbjct: 1 MVQQKQLSGTPTDDPNLYLSISLESCDTLKMNGVTYDTIRLRLFPFPLRDRARAWLHSLP 60 Query: 545 KANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAWKRYRELLRKCPQHG 724 +I T D L +AFL +YFPPSK +L + I+ FSQ+ E+L+EAW+ ++E+LR CP HG Sbjct: 61 SESITTWDQLKQAFLGRYFPPSKTAQLRNQITSFSQKEGESLYEAWENFKEMLRLCPHHG 120 Score = 51.6 bits (122), Expect(2) = 5e-36 Identities = 25/70 (35%), Positives = 40/70 (57%) Frame = +3 Query: 771 TTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVEDDRYEALAWE 950 TTR VD +G +N+N +E+ +IE+M N YQW DR+P K E D + +A + Sbjct: 137 TTRMTVDDDAGGAFINKNIEESYALIEDMEHNHYQWSSDRSPHNKGGMYEVDALDHIASK 196 Query: 951 PANMKKKYEE 980 + +K+E+ Sbjct: 197 VDALFQKFEK 206 >ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda] gi|548830333|gb|ERM93404.1| hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda] Length = 379 Score = 122 bits (307), Expect(2) = 1e-33 Identities = 59/135 (43%), Positives = 85/135 (62%), Gaps = 1/135 (0%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499 I A FELK + QM++ Q G P EDP+ HL FL++ D+ K+ GVS +R+++F Sbjct: 43 IQAPQFELKPVMFQMLQTVGQFSGMPTEDPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFP 102 Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679 FSL DR + W L ++ +DL + FL KYFPP++ + S+I F Q DE+ +A Sbjct: 103 FSLRDRARSWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDESTSDA 162 Query: 680 WKRYRELLRKCPQHG 724 W+R++ELLRKCP HG Sbjct: 163 WERFKELLRKCPHHG 177 Score = 48.5 bits (114), Expect(2) = 1e-33 Identities = 31/100 (31%), Positives = 57/100 (57%), Gaps = 2/100 (2%) Frame = +3 Query: 765 HTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAP-SRKIASV-EDDRYEA 938 + +R ++DA++ +++++ EA I+E +A+N+YQW RAP SRK+A V E D A Sbjct: 192 NAASRMVLDASANGAILSKSYNEAFEILETIASNNYQWSNTRAPTSRKVAGVLEVDAITA 251 Query: 939 LAWEPANMKKKYEEERKAHIQSVQSQWNVEFYRREDVSYV 1058 L + A+M + + +++Q ++ +DVS V Sbjct: 252 LTAQMASMTNVLKNLSIGNAKNIQPAAAIQ---SDDVSCV 288 >ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612045 [Citrus sinensis] Length = 810 Score = 126 bits (317), Expect(2) = 4e-33 Identities = 61/135 (45%), Positives = 88/135 (65%), Gaps = 1/135 (0%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499 + ANNFELK + QM++ Q G P +D + HL +FL++ D K+ G S A+R+++F Sbjct: 70 VQANNFELKPVMFQMLQTVGQFNGLPSKDLHPHLKLFLEVSDAFKIAGASQEALRLRLFS 129 Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679 FSL DR + W L +I T DL FL KYFPP+K +L ++I+ F Q DE+L +A Sbjct: 130 FSLRDRARAWLNSLPPDSITTWSDLADKFLLKYFPPTKNAKLRNEITSFHQLEDESLCDA 189 Query: 680 WKRYRELLRKCPQHG 724 W+R++ELLR+CP HG Sbjct: 190 WERFKELLRRCPHHG 204 Score = 42.7 bits (99), Expect(2) = 4e-33 Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 2/68 (2%) Frame = +3 Query: 771 TTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDR-APSRKIASVED-DRYEALA 944 +TR IVDA++ L+ ++ EA I+E +A N+YQWP R A +R A V + D AL+ Sbjct: 221 STRLIVDASANGALLFKSYNEAYEILERIANNNYQWPSTRQAATRGTAGVHNVDALTALS 280 Query: 945 WEPANMKK 968 + ++ K Sbjct: 281 AQVTSLTK 288 >ref|XP_007023601.1| Uncharacterized protein TCM_027661 [Theobroma cacao] gi|508778967|gb|EOY26223.1| Uncharacterized protein TCM_027661 [Theobroma cacao] Length = 250 Score = 114 bits (284), Expect(2) = 2e-31 Identities = 57/123 (46%), Positives = 81/123 (65%), Gaps = 1/123 (0%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499 I NNFE+K +IQM++ Q G P +D NA++ FL+ICDT K NGV+N+ IR+++F Sbjct: 54 IQVNNFEIKLPIIQMIQTSIQFGRSPNDDLNAYIVNFLEICDTFKHNGVTNDVIRLRLFP 113 Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679 FSL D+ K W L + I T DDL + FL+K FPP+K + + I+ F Q E+L+EA Sbjct: 114 FSLRDKIKSWLNSLIASFISTRDDLAQKFLAKLFPPTKTANMWNGITSFVQFNPESLYEA 173 Query: 680 WKR 688 W+R Sbjct: 174 WER 176 Score = 49.7 bits (117), Expect(2) = 2e-31 Identities = 25/74 (33%), Positives = 44/74 (59%), Gaps = 1/74 (1%) Frame = +3 Query: 759 WSHTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVED-DRYE 935 W TT +DAT+ L++++ EA +++E+A N+YQWP ++ RK+ASV + D Sbjct: 174 WERTT----IDATTSGALMDKSIDEAYDLLKEIAFNNYQWPCEKLVLRKVASVHELDGIN 229 Query: 936 ALAWEPANMKKKYE 977 A + + KK++ Sbjct: 230 AFTAQVTVLSKKFD 243 >emb|CAB81134.1| putative athila transposon protein [Arabidopsis thaliana] Length = 866 Score = 129 bits (325), Expect = 2e-27 Identities = 64/141 (45%), Positives = 90/141 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 I NNFE+KSGLI M++ + G PMEDP HL F +C+ K+NGVS + ++++F F Sbjct: 34 IQNNNFEIKSGLISMIQGNKFYGLPMEDPLDHLDEFDRLCNLTKINGVSADGFKLRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L +I T DD KAFLSK+F ++ RL ++IS FSQ+ E+ EAW Sbjct: 94 SLGDKAHIWEKNLPHDSIITWDDCKKAFLSKFFSNARTARLRNEISGFSQKTGESFCEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ +CP HGFT+ L Sbjct: 154 ERFKGYTNQCPHHGFTKASML 174 >dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1864 Score = 129 bits (324), Expect = 2e-27 Identities = 62/141 (43%), Positives = 90/141 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNFE+KSGLI MV+ + G PMEDP HL F +C K+NGVS + ++++F F Sbjct: 66 VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINGVSEDGFKLRLFPF 125 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L + +I + +D KAFL+K+F S+ RL +DIS F+Q +ET EAW Sbjct: 126 SLGDKAHQWEKSLLQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFCEAW 185 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ +CP HGF++ L Sbjct: 186 ERFKGYQTQCPHHGFSKASLL 206 >gb|AAF63125.1|AC009526_10 Similar to Athila ORF 1 [Arabidopsis thaliana] Length = 823 Score = 128 bits (322), Expect = 4e-27 Identities = 62/141 (43%), Positives = 90/141 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 I NNFE+KSGLI M++ + G PMEDP HL F +C K+NGVS ++ ++++F F Sbjct: 70 IQNNNFEIKSGLISMIQSNKFHGLPMEDPLDHLDNFDRLCSLTKINGVSEDSFKLRLFPF 129 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L ++ TLDD KAFL+K+F S+ RL ++IS F+Q+ E+ EAW Sbjct: 130 SLGDKAHLWEKTLPVDSVDTLDDCKKAFLAKFFSNSRTARLRNEISGFNQKNSESFAEAW 189 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ +CP HGF + L Sbjct: 190 ERFKGYSTQCPHHGFKKASLL 210 >gb|AAB18645.1| unknown [Hordeum vulgare] Length = 337 Score = 116 bits (291), Expect(2) = 5e-27 Identities = 53/135 (39%), Positives = 85/135 (62%) Frame = +2 Query: 326 NANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGFS 505 NA ++E+ + L+ +V EQ G P ED +HL+ F+++CD K V N+ I++++F FS Sbjct: 28 NAESYEINAALLNLVMKEQFSGLPSEDVASHLNTFIELCDMQKKKDVDNDVIKLKLFPFS 87 Query: 506 LSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAWK 685 L DR K W+ L K++I + D A++SKYFPP+K L +DI F Q E + +AW+ Sbjct: 88 LRDRAKTWFSSLPKSSIDSWDKCKDAYISKYFPPAKIISLRNDIMNFKQLDHEHVAQAWE 147 Query: 686 RYRELLRKCPQHGFT 730 R + ++R CP +G + Sbjct: 148 RMKLMIRNCPANGLS 162 Score = 32.3 bits (72), Expect(2) = 5e-27 Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%) Frame = +3 Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAP-SRKIASVED 923 +R+I+D+ +G + EA ++++ + N QW +R+P S+K+ +E+ Sbjct: 178 SRNILDSATGGTFMEITLGEATKLLDNIMTNYSQWHTERSPTSKKVHVIEE 228 >gb|AAD20430.1| putative Athila retroelement ORF1 protein [Arabidopsis thaliana] Length = 622 Score = 127 bits (320), Expect = 6e-27 Identities = 62/151 (41%), Positives = 95/151 (62%) Frame = +2 Query: 281 PISNGTIQGGEWGTINANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLN 460 P G G + + NNFE+KS LI MV+ + G MEDP HL F +C TVK+N Sbjct: 38 PNIRGNRNGIQAPPVENNNFEIKSSLINMVQTSKFHGLSMEDPLDHLEQFDMLCSTVKIN 97 Query: 461 GVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDIS 640 G+S +A ++++F FSL DR + W K L + +I + D +AFLSK+F ++ RL ++IS Sbjct: 98 GISEDAFKLRLFPFSLGDRARIWEKNLPQRSITSWDQCKRAFLSKFFSTTRTARLRNEIS 157 Query: 641 YFSQQGDETLFEAWKRYRELLRKCPQHGFTE 733 F+Q+ +E+ EAW+R++ +CP HGF++ Sbjct: 158 SFTQRSNESFCEAWERFKGYKMQCPHHGFSK 188 >emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera] Length = 437 Score = 127 bits (319), Expect = 8e-27 Identities = 63/127 (49%), Positives = 89/127 (70%), Gaps = 1/127 (0%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499 I ANNFE+K +IQM++ Q GG +DPN H++ FL+ICDT K NGV ++AIR+++F Sbjct: 241 IQANNFEIKLAIIQMIRSSVQFGGLANDDPNLHIANFLEICDTFKHNGVIDDAIRLRLFP 300 Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679 FSL+++ K W L I T D L AFL+KYFPP+K+ ++ +DI+ F QQ E+L+EA Sbjct: 301 FSLNNKAKAWLISLPPGTITTWDGLVNAFLTKYFPPAKSIKMRNDITNFLQQDQESLYEA 360 Query: 680 WKRYREL 700 W+R EL Sbjct: 361 WERKLEL 367 >pir||C85079 hypothetical protein AT4g08050 [imported] - Arabidopsis thaliana gi|5724774|gb|AAD48078.1|AF160183_5 contains similarity to retrotransposons; may be a pseudogene [Arabidopsis thaliana] gi|7267445|emb|CAB81142.1| AT4g08050 [Arabidopsis thaliana] Length = 1428 Score = 127 bits (318), Expect = 1e-26 Identities = 62/141 (43%), Positives = 89/141 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 I NNFE+KSGLI M++ + G PMEDP HL F +C+ K+NGVS + ++++F F Sbjct: 34 IQNNNFEIKSGLISMIQGNKFHGLPMEDPLDHLDEFDRLCNLTKINGVSEDGFKLRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L+ +I T DD KAFLSK+F ++ RL ++I FSQ+ E+ EAW Sbjct: 94 SLGDKAHIWEKNLSHDSITTWDDYKKAFLSKFFSNARTARLRNEIYGFSQKTGESFCEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ +CP H FT+ L Sbjct: 154 ERFKGYTNQCPHHSFTKASLL 174 >gb|AAF67369.1| Hypothetical protein T15F17.a [Arabidopsis thaliana] Length = 703 Score = 126 bits (317), Expect = 1e-26 Identities = 61/141 (43%), Positives = 89/141 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNFE+KSGLI MV+ + G PMEDP HL F +C K+NGVS + ++++F F Sbjct: 34 VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINGVSEDGFKLRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L + +I + +D KAFL+K+F S+ RL +DIS F+Q +ET EAW Sbjct: 94 SLGDKAHQWEKSLPQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFCEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 + ++ +CP HGF++ L Sbjct: 154 ECFKGYQTQCPHHGFSKASLL 174 >gb|AAF67381.1| Hypothetical protein T15F17.m [Arabidopsis thaliana] Length = 346 Score = 126 bits (317), Expect = 1e-26 Identities = 62/141 (43%), Positives = 88/141 (62%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNF++KS LI MV+ + G PMEDP HL F +CD K+NGVS + ++++F F Sbjct: 34 VQNNNFKIKSSLIAMVQGNKFHGLPMEDPLDHLDEFERLCDLTKINGVSEDGFKLRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L + +I T DD K FL K+F S+ RL ++IS F+Q+ +E+ EAW Sbjct: 94 SLGDKAHLWEKTLPQGSITTWDDCKKVFLEKFFSNSRTARLWNEISGFTQKQNESFCEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ KCP HGF E L Sbjct: 154 ERFKGYQTKCPHHGFKEASLL 174 >gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis] Length = 275 Score = 101 bits (251), Expect(2) = 2e-26 Identities = 44/93 (47%), Positives = 69/93 (74%) Frame = +2 Query: 455 LNGVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISD 634 +NGVS++AI++++F FSL D+ + W + L +I T D L +AFL+KYFPPSK +L + Sbjct: 1 MNGVSDDAIKLRLFPFSLRDKARAWLQSLPPGSITTWDQLSEAFLAKYFPPSKTAQLRNQ 60 Query: 635 ISYFSQQGDETLFEAWKRYRELLRKCPQHGFTE 733 I+ F+Q+ E+L++AW+RY++LLR CP HG + Sbjct: 61 ITTFTQKEGESLYDAWERYKDLLRMCPHHGLED 93 Score = 45.8 bits (107), Expect(2) = 2e-26 Identities = 21/69 (30%), Positives = 42/69 (60%) Frame = +3 Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVEDDRYEALAWEP 953 TR VDA +G L+N++ ++A ++IE+MA N +QW +R+ +K + D + +A Sbjct: 108 TRMTVDAAAGGALMNKSVRDAKQLIEDMAQNHFQWSGERSLPKKSGRYDVDALDHIASRV 167 Query: 954 ANMKKKYEE 980 + +K+++ Sbjct: 168 DALFQKFDK 176 >ref|XP_006596755.1| PREDICTED: uncharacterized protein LOC102663452 [Glycine max] Length = 378 Score = 113 bits (282), Expect(2) = 3e-26 Identities = 62/163 (38%), Positives = 91/163 (55%) Frame = +2 Query: 257 TTSSQPYVPISNGTIQGGEWGTINANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLD 436 ++SS PY SN + + A N LIQ++++ G P EDP AHL+ +++ Sbjct: 14 SSSSVPYFFTSNACPE------VQAQNITYPHSLIQLIQNNLFHGLPNEDPCAHLATYIE 67 Query: 437 ICDTVKLNGVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKA 616 IC+T++L GV + +R+ +F FSLS K W + T +++ + FL KYFP SK Sbjct: 68 ICNTIRLAGVPEDVVRLSLFSFSLSREAKRWLHSFKGNGLKTWEEVVEKFLKKYFPESKM 127 Query: 617 QRLISDISYFSQQGDETLFEAWKRYRELLRKCPQHGFTEGQQL 745 + IS F Q E+L EA +R+ LLRK P HGF+E QL Sbjct: 128 TEGKTSISSFHQFPKESLSEALERFHGLLRKTPAHGFSEPIQL 170 Score = 33.1 bits (74), Expect(2) = 3e-26 Identities = 16/48 (33%), Positives = 29/48 (60%) Frame = +3 Query: 750 FL*WSHTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRA 893 F W + ++ ++DA +G + ++ KEA+ +IE MAA+ + DRA Sbjct: 173 FTDWLRSQSKQLLDAFAGGKIKLKSPKEAIELIENMAASDHAILHDRA 220 >gb|AAF63128.1|AC009526_13 Similar to Athila ORF 1 [Arabidopsis thaliana] Length = 780 Score = 125 bits (313), Expect = 4e-26 Identities = 61/141 (43%), Positives = 87/141 (61%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 I NNFE+KSGLI M++ + G PMEDP HL F C K+NGVS ++ ++++F F Sbjct: 34 IQNNNFEIKSGLISMIQSNKFHGLPMEDPLDHLDNFDRFCSLTKINGVSEDSFKLRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D W K L ++ T DD KAFL+K+F S+ RL ++IS F+Q+ E+ EAW Sbjct: 94 SLGDEAHLWEKTLLVDSVDTWDDCKKAFLAKFFSNSRTARLRNEISGFNQKNSESFAEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ +CP HGF + L Sbjct: 154 ERFKRYSTQCPHHGFKKASLL 174 >pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrotransposon Athila gi|806535|emb|CAA57397.1| unnamed protein product [Arabidopsis thaliana] Length = 935 Score = 124 bits (311), Expect = 7e-26 Identities = 63/141 (44%), Positives = 88/141 (62%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNFE+KSGLI MV+ + G MEDP HL F +C K+NGVS + ++++F F Sbjct: 66 VQNNNFEIKSGLIAMVQGNKFHGLLMEDPLDHLDEFERLCRLTKINGVSEDGFKLRLFPF 125 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L +I T DD KAFL+K+F S+ RL ++IS F+Q+ +E+ EAW Sbjct: 126 SLGDKAHLWEKTLPHGSITTWDDCKKAFLAKFFSNSRTARLRNEISGFTQKQNESFCEAW 185 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ KCP HGF E L Sbjct: 186 ERFKGYPTKCPHHGFKEASLL 206 >gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana] Length = 1586 Score = 122 bits (306), Expect = 3e-25 Identities = 58/133 (43%), Positives = 85/133 (63%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNFE+KSGLI MV+ + G PMEDP HL F +C K+N VS + ++++F F Sbjct: 66 VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINRVSEDGFKLRLFPF 125 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L + +I + +D KAFL+K+F S+ RL +DIS F+Q +ET +EAW Sbjct: 126 SLGDKAHQWEKSLPQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFYEAW 185 Query: 683 KRYRELLRKCPQH 721 +R++ +CP H Sbjct: 186 ERFKGYQTQCPHH 198 >ref|XP_006575927.1| PREDICTED: uncharacterized protein LOC102669817 [Glycine max] Length = 352 Score = 115 bits (287), Expect(2) = 3e-25 Identities = 60/145 (41%), Positives = 88/145 (60%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + A N LIQ++++ G P EDP AHL+ +++IC+TV+L GV +A+R+ +F F Sbjct: 30 VQAQNITYPHSLIQLIQNNLFHGLPNEDPYAHLATYIEICNTVRLAGVPEDAVRLSLFLF 89 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL K W+ ++ T D++ + FL KYFP SK + IS F + DE+L EA Sbjct: 90 SLFGDAKRWFHSFKGNSLKTWDEVVEKFLKKYFPESKTAEGKAAISSFHEFPDESLSEAL 149 Query: 683 KRYRELLRKCPQHGFTEGQQLPLSI 757 +R+R LLRK P HGF+E QL + I Sbjct: 150 ERFRGLLRKTPTHGFSELIQLNIFI 174 Score = 28.1 bits (61), Expect(2) = 3e-25 Identities = 12/40 (30%), Positives = 26/40 (65%) Frame = +3 Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRA 893 ++ ++DA++G + + +EA+ +I+ MAA+ + DRA Sbjct: 181 SKQLLDASAGRKIKLKTPEEAMELIKNMAASDHAILRDRA 220 >gb|AAD19759.1| putative Athila retroelement ORF1 protein [Arabidopsis thaliana] Length = 750 Score = 121 bits (304), Expect = 4e-25 Identities = 61/141 (43%), Positives = 87/141 (61%) Frame = +2 Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502 + NNFE+KSGLI MV+ + G PMED HL F +CD K+NGVS + + ++F F Sbjct: 34 VQNNNFEIKSGLIAMVQGNKFHGLPMEDSLDHLDEFERLCDLTKINGVSEDGFKFRLFPF 93 Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682 SL D+ W K L + +I T DD KAF +K+F S+ RL ++IS F+Q+ +E+ EAW Sbjct: 94 SLGDKTHLWEKTLPQNSITTWDDCKKAFFAKFFSNSRTARLRNEISGFTQKQNESFCEAW 153 Query: 683 KRYRELLRKCPQHGFTEGQQL 745 +R++ KCP GF + L Sbjct: 154 ERFKGYQTKCPHPGFKQASLL 174