BLASTX nr result

ID: Mentha29_contig00033003 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00033003
         (1065 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498...   127   5e-36
ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [A...   122   1e-33
ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612...   126   4e-33
ref|XP_007023601.1| Uncharacterized protein TCM_027661 [Theobrom...   114   2e-31
emb|CAB81134.1| putative athila transposon protein [Arabidopsis ...   129   2e-27
dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis t...   129   2e-27
gb|AAF63125.1|AC009526_10 Similar to Athila ORF 1 [Arabidopsis t...   128   4e-27
gb|AAB18645.1| unknown [Hordeum vulgare]                              116   5e-27
gb|AAD20430.1| putative Athila retroelement ORF1 protein [Arabid...   127   6e-27
emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]   127   8e-27
pir||C85079 hypothetical protein AT4g08050 [imported] - Arabidop...   127   1e-26
gb|AAF67369.1| Hypothetical protein T15F17.a [Arabidopsis thaliana]   126   1e-26
gb|AAF67381.1| Hypothetical protein T15F17.m [Arabidopsis thaliana]   126   1e-26
gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis]    101   2e-26
ref|XP_006596755.1| PREDICTED: uncharacterized protein LOC102663...   113   3e-26
gb|AAF63128.1|AC009526_13 Similar to Athila ORF 1 [Arabidopsis t...   125   4e-26
pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrot...   124   7e-26
gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana]             122   3e-25
ref|XP_006575927.1| PREDICTED: uncharacterized protein LOC102669...   115   3e-25
gb|AAD19759.1| putative Athila retroelement ORF1 protein [Arabid...   121   4e-25

>ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498022 [Cicer arietinum]
          Length = 544

 Score =  127 bits (320), Expect(2) = 5e-36
 Identities = 58/120 (48%), Positives = 85/120 (70%)
 Frame = +2

Query: 365 MVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGFSLSDRDKDWYKGLN 544
           MV+ +QL G P +DPN +LS+ L+ CDT+K+NGV+ + IR+++F F L DR + W   L 
Sbjct: 1   MVQQKQLSGTPTDDPNLYLSISLESCDTLKMNGVTYDTIRLRLFPFPLRDRARAWLHSLP 60

Query: 545 KANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAWKRYRELLRKCPQHG 724
             +I T D L +AFL +YFPPSK  +L + I+ FSQ+  E+L+EAW+ ++E+LR CP HG
Sbjct: 61  SESITTWDQLKQAFLGRYFPPSKTAQLRNQITSFSQKEGESLYEAWENFKEMLRLCPHHG 120



 Score = 51.6 bits (122), Expect(2) = 5e-36
 Identities = 25/70 (35%), Positives = 40/70 (57%)
 Frame = +3

Query: 771 TTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVEDDRYEALAWE 950
           TTR  VD  +G   +N+N +E+  +IE+M  N YQW  DR+P  K    E D  + +A +
Sbjct: 137 TTRMTVDDDAGGAFINKNIEESYALIEDMEHNHYQWSSDRSPHNKGGMYEVDALDHIASK 196

Query: 951 PANMKKKYEE 980
              + +K+E+
Sbjct: 197 VDALFQKFEK 206


>ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]
           gi|548830333|gb|ERM93404.1| hypothetical protein
           AMTR_s04947p00003620 [Amborella trichopoda]
          Length = 379

 Score =  122 bits (307), Expect(2) = 1e-33
 Identities = 59/135 (43%), Positives = 85/135 (62%), Gaps = 1/135 (0%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499
           I A  FELK  + QM++   Q  G P EDP+ HL  FL++ D+ K+ GVS   +R+++F 
Sbjct: 43  IQAPQFELKPVMFQMLQTVGQFSGMPTEDPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFP 102

Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679
           FSL DR + W   L   ++   +DL + FL KYFPP++  +  S+I  F Q  DE+  +A
Sbjct: 103 FSLRDRARSWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDESTSDA 162

Query: 680 WKRYRELLRKCPQHG 724
           W+R++ELLRKCP HG
Sbjct: 163 WERFKELLRKCPHHG 177



 Score = 48.5 bits (114), Expect(2) = 1e-33
 Identities = 31/100 (31%), Positives = 57/100 (57%), Gaps = 2/100 (2%)
 Frame = +3

Query: 765  HTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAP-SRKIASV-EDDRYEA 938
            +  +R ++DA++   +++++  EA  I+E +A+N+YQW   RAP SRK+A V E D   A
Sbjct: 192  NAASRMVLDASANGAILSKSYNEAFEILETIASNNYQWSNTRAPTSRKVAGVLEVDAITA 251

Query: 939  LAWEPANMKKKYEEERKAHIQSVQSQWNVEFYRREDVSYV 1058
            L  + A+M    +     + +++Q    ++    +DVS V
Sbjct: 252  LTAQMASMTNVLKNLSIGNAKNIQPAAAIQ---SDDVSCV 288


>ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612045 [Citrus sinensis]
          Length = 810

 Score =  126 bits (317), Expect(2) = 4e-33
 Identities = 61/135 (45%), Positives = 88/135 (65%), Gaps = 1/135 (0%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499
           + ANNFELK  + QM++   Q  G P +D + HL +FL++ D  K+ G S  A+R+++F 
Sbjct: 70  VQANNFELKPVMFQMLQTVGQFNGLPSKDLHPHLKLFLEVSDAFKIAGASQEALRLRLFS 129

Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679
           FSL DR + W   L   +I T  DL   FL KYFPP+K  +L ++I+ F Q  DE+L +A
Sbjct: 130 FSLRDRARAWLNSLPPDSITTWSDLADKFLLKYFPPTKNAKLRNEITSFHQLEDESLCDA 189

Query: 680 WKRYRELLRKCPQHG 724
           W+R++ELLR+CP HG
Sbjct: 190 WERFKELLRRCPHHG 204



 Score = 42.7 bits (99), Expect(2) = 4e-33
 Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
 Frame = +3

Query: 771 TTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDR-APSRKIASVED-DRYEALA 944
           +TR IVDA++   L+ ++  EA  I+E +A N+YQWP  R A +R  A V + D   AL+
Sbjct: 221 STRLIVDASANGALLFKSYNEAYEILERIANNNYQWPSTRQAATRGTAGVHNVDALTALS 280

Query: 945 WEPANMKK 968
            +  ++ K
Sbjct: 281 AQVTSLTK 288


>ref|XP_007023601.1| Uncharacterized protein TCM_027661 [Theobroma cacao]
           gi|508778967|gb|EOY26223.1| Uncharacterized protein
           TCM_027661 [Theobroma cacao]
          Length = 250

 Score =  114 bits (284), Expect(2) = 2e-31
 Identities = 57/123 (46%), Positives = 81/123 (65%), Gaps = 1/123 (0%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499
           I  NNFE+K  +IQM++   Q G  P +D NA++  FL+ICDT K NGV+N+ IR+++F 
Sbjct: 54  IQVNNFEIKLPIIQMIQTSIQFGRSPNDDLNAYIVNFLEICDTFKHNGVTNDVIRLRLFP 113

Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679
           FSL D+ K W   L  + I T DDL + FL+K FPP+K   + + I+ F Q   E+L+EA
Sbjct: 114 FSLRDKIKSWLNSLIASFISTRDDLAQKFLAKLFPPTKTANMWNGITSFVQFNPESLYEA 173

Query: 680 WKR 688
           W+R
Sbjct: 174 WER 176



 Score = 49.7 bits (117), Expect(2) = 2e-31
 Identities = 25/74 (33%), Positives = 44/74 (59%), Gaps = 1/74 (1%)
 Frame = +3

Query: 759 WSHTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVED-DRYE 935
           W  TT    +DAT+   L++++  EA  +++E+A N+YQWP ++   RK+ASV + D   
Sbjct: 174 WERTT----IDATTSGALMDKSIDEAYDLLKEIAFNNYQWPCEKLVLRKVASVHELDGIN 229

Query: 936 ALAWEPANMKKKYE 977
           A   +   + KK++
Sbjct: 230 AFTAQVTVLSKKFD 243


>emb|CAB81134.1| putative athila transposon protein [Arabidopsis thaliana]
          Length = 866

 Score =  129 bits (325), Expect = 2e-27
 Identities = 64/141 (45%), Positives = 90/141 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           I  NNFE+KSGLI M++  +  G PMEDP  HL  F  +C+  K+NGVS +  ++++F F
Sbjct: 34  IQNNNFEIKSGLISMIQGNKFYGLPMEDPLDHLDEFDRLCNLTKINGVSADGFKLRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L   +I T DD  KAFLSK+F  ++  RL ++IS FSQ+  E+  EAW
Sbjct: 94  SLGDKAHIWEKNLPHDSIITWDDCKKAFLSKFFSNARTARLRNEISGFSQKTGESFCEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    +CP HGFT+   L
Sbjct: 154 ERFKGYTNQCPHHGFTKASML 174


>dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1864

 Score =  129 bits (324), Expect = 2e-27
 Identities = 62/141 (43%), Positives = 90/141 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNFE+KSGLI MV+  +  G PMEDP  HL  F  +C   K+NGVS +  ++++F F
Sbjct: 66  VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINGVSEDGFKLRLFPF 125

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L + +I + +D  KAFL+K+F  S+  RL +DIS F+Q  +ET  EAW
Sbjct: 126 SLGDKAHQWEKSLLQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFCEAW 185

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    +CP HGF++   L
Sbjct: 186 ERFKGYQTQCPHHGFSKASLL 206


>gb|AAF63125.1|AC009526_10 Similar to Athila ORF 1 [Arabidopsis thaliana]
          Length = 823

 Score =  128 bits (322), Expect = 4e-27
 Identities = 62/141 (43%), Positives = 90/141 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           I  NNFE+KSGLI M++  +  G PMEDP  HL  F  +C   K+NGVS ++ ++++F F
Sbjct: 70  IQNNNFEIKSGLISMIQSNKFHGLPMEDPLDHLDNFDRLCSLTKINGVSEDSFKLRLFPF 129

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L   ++ TLDD  KAFL+K+F  S+  RL ++IS F+Q+  E+  EAW
Sbjct: 130 SLGDKAHLWEKTLPVDSVDTLDDCKKAFLAKFFSNSRTARLRNEISGFNQKNSESFAEAW 189

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    +CP HGF +   L
Sbjct: 190 ERFKGYSTQCPHHGFKKASLL 210


>gb|AAB18645.1| unknown [Hordeum vulgare]
          Length = 337

 Score =  116 bits (291), Expect(2) = 5e-27
 Identities = 53/135 (39%), Positives = 85/135 (62%)
 Frame = +2

Query: 326 NANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGFS 505
           NA ++E+ + L+ +V  EQ  G P ED  +HL+ F+++CD  K   V N+ I++++F FS
Sbjct: 28  NAESYEINAALLNLVMKEQFSGLPSEDVASHLNTFIELCDMQKKKDVDNDVIKLKLFPFS 87

Query: 506 LSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAWK 685
           L DR K W+  L K++I + D    A++SKYFPP+K   L +DI  F Q   E + +AW+
Sbjct: 88  LRDRAKTWFSSLPKSSIDSWDKCKDAYISKYFPPAKIISLRNDIMNFKQLDHEHVAQAWE 147

Query: 686 RYRELLRKCPQHGFT 730
           R + ++R CP +G +
Sbjct: 148 RMKLMIRNCPANGLS 162



 Score = 32.3 bits (72), Expect(2) = 5e-27
 Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
 Frame = +3

Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAP-SRKIASVED 923
           +R+I+D+ +G   +     EA ++++ +  N  QW  +R+P S+K+  +E+
Sbjct: 178 SRNILDSATGGTFMEITLGEATKLLDNIMTNYSQWHTERSPTSKKVHVIEE 228


>gb|AAD20430.1| putative Athila retroelement ORF1 protein [Arabidopsis thaliana]
          Length = 622

 Score =  127 bits (320), Expect = 6e-27
 Identities = 62/151 (41%), Positives = 95/151 (62%)
 Frame = +2

Query: 281 PISNGTIQGGEWGTINANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLN 460
           P   G   G +   +  NNFE+KS LI MV+  +  G  MEDP  HL  F  +C TVK+N
Sbjct: 38  PNIRGNRNGIQAPPVENNNFEIKSSLINMVQTSKFHGLSMEDPLDHLEQFDMLCSTVKIN 97

Query: 461 GVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDIS 640
           G+S +A ++++F FSL DR + W K L + +I + D   +AFLSK+F  ++  RL ++IS
Sbjct: 98  GISEDAFKLRLFPFSLGDRARIWEKNLPQRSITSWDQCKRAFLSKFFSTTRTARLRNEIS 157

Query: 641 YFSQQGDETLFEAWKRYRELLRKCPQHGFTE 733
            F+Q+ +E+  EAW+R++    +CP HGF++
Sbjct: 158 SFTQRSNESFCEAWERFKGYKMQCPHHGFSK 188


>emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]
          Length = 437

 Score =  127 bits (319), Expect = 8e-27
 Identities = 63/127 (49%), Positives = 89/127 (70%), Gaps = 1/127 (0%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDE-QLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFG 499
           I ANNFE+K  +IQM++   Q GG   +DPN H++ FL+ICDT K NGV ++AIR+++F 
Sbjct: 241 IQANNFEIKLAIIQMIRSSVQFGGLANDDPNLHIANFLEICDTFKHNGVIDDAIRLRLFP 300

Query: 500 FSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEA 679
           FSL+++ K W   L    I T D L  AFL+KYFPP+K+ ++ +DI+ F QQ  E+L+EA
Sbjct: 301 FSLNNKAKAWLISLPPGTITTWDGLVNAFLTKYFPPAKSIKMRNDITNFLQQDQESLYEA 360

Query: 680 WKRYREL 700
           W+R  EL
Sbjct: 361 WERKLEL 367


>pir||C85079 hypothetical protein AT4g08050 [imported] - Arabidopsis thaliana
           gi|5724774|gb|AAD48078.1|AF160183_5 contains similarity
           to retrotransposons; may be a pseudogene [Arabidopsis
           thaliana] gi|7267445|emb|CAB81142.1| AT4g08050
           [Arabidopsis thaliana]
          Length = 1428

 Score =  127 bits (318), Expect = 1e-26
 Identities = 62/141 (43%), Positives = 89/141 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           I  NNFE+KSGLI M++  +  G PMEDP  HL  F  +C+  K+NGVS +  ++++F F
Sbjct: 34  IQNNNFEIKSGLISMIQGNKFHGLPMEDPLDHLDEFDRLCNLTKINGVSEDGFKLRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L+  +I T DD  KAFLSK+F  ++  RL ++I  FSQ+  E+  EAW
Sbjct: 94  SLGDKAHIWEKNLSHDSITTWDDYKKAFLSKFFSNARTARLRNEIYGFSQKTGESFCEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    +CP H FT+   L
Sbjct: 154 ERFKGYTNQCPHHSFTKASLL 174


>gb|AAF67369.1| Hypothetical protein T15F17.a [Arabidopsis thaliana]
          Length = 703

 Score =  126 bits (317), Expect = 1e-26
 Identities = 61/141 (43%), Positives = 89/141 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNFE+KSGLI MV+  +  G PMEDP  HL  F  +C   K+NGVS +  ++++F F
Sbjct: 34  VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINGVSEDGFKLRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L + +I + +D  KAFL+K+F  S+  RL +DIS F+Q  +ET  EAW
Sbjct: 94  SLGDKAHQWEKSLPQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFCEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           + ++    +CP HGF++   L
Sbjct: 154 ECFKGYQTQCPHHGFSKASLL 174


>gb|AAF67381.1| Hypothetical protein T15F17.m [Arabidopsis thaliana]
          Length = 346

 Score =  126 bits (317), Expect = 1e-26
 Identities = 62/141 (43%), Positives = 88/141 (62%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNF++KS LI MV+  +  G PMEDP  HL  F  +CD  K+NGVS +  ++++F F
Sbjct: 34  VQNNNFKIKSSLIAMVQGNKFHGLPMEDPLDHLDEFERLCDLTKINGVSEDGFKLRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L + +I T DD  K FL K+F  S+  RL ++IS F+Q+ +E+  EAW
Sbjct: 94  SLGDKAHLWEKTLPQGSITTWDDCKKVFLEKFFSNSRTARLWNEISGFTQKQNESFCEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    KCP HGF E   L
Sbjct: 154 ERFKGYQTKCPHHGFKEASLL 174


>gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis]
          Length = 275

 Score =  101 bits (251), Expect(2) = 2e-26
 Identities = 44/93 (47%), Positives = 69/93 (74%)
 Frame = +2

Query: 455 LNGVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISD 634
           +NGVS++AI++++F FSL D+ + W + L   +I T D L +AFL+KYFPPSK  +L + 
Sbjct: 1   MNGVSDDAIKLRLFPFSLRDKARAWLQSLPPGSITTWDQLSEAFLAKYFPPSKTAQLRNQ 60

Query: 635 ISYFSQQGDETLFEAWKRYRELLRKCPQHGFTE 733
           I+ F+Q+  E+L++AW+RY++LLR CP HG  +
Sbjct: 61  ITTFTQKEGESLYDAWERYKDLLRMCPHHGLED 93



 Score = 45.8 bits (107), Expect(2) = 2e-26
 Identities = 21/69 (30%), Positives = 42/69 (60%)
 Frame = +3

Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRAPSRKIASVEDDRYEALAWEP 953
           TR  VDA +G  L+N++ ++A ++IE+MA N +QW  +R+  +K    + D  + +A   
Sbjct: 108 TRMTVDAAAGGALMNKSVRDAKQLIEDMAQNHFQWSGERSLPKKSGRYDVDALDHIASRV 167

Query: 954 ANMKKKYEE 980
             + +K+++
Sbjct: 168 DALFQKFDK 176


>ref|XP_006596755.1| PREDICTED: uncharacterized protein LOC102663452 [Glycine max]
          Length = 378

 Score =  113 bits (282), Expect(2) = 3e-26
 Identities = 62/163 (38%), Positives = 91/163 (55%)
 Frame = +2

Query: 257 TTSSQPYVPISNGTIQGGEWGTINANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLD 436
           ++SS PY   SN   +      + A N      LIQ++++    G P EDP AHL+ +++
Sbjct: 14  SSSSVPYFFTSNACPE------VQAQNITYPHSLIQLIQNNLFHGLPNEDPCAHLATYIE 67

Query: 437 ICDTVKLNGVSNNAIRIQMFGFSLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKA 616
           IC+T++L GV  + +R+ +F FSLS   K W        + T +++ + FL KYFP SK 
Sbjct: 68  ICNTIRLAGVPEDVVRLSLFSFSLSREAKRWLHSFKGNGLKTWEEVVEKFLKKYFPESKM 127

Query: 617 QRLISDISYFSQQGDETLFEAWKRYRELLRKCPQHGFTEGQQL 745
               + IS F Q   E+L EA +R+  LLRK P HGF+E  QL
Sbjct: 128 TEGKTSISSFHQFPKESLSEALERFHGLLRKTPAHGFSEPIQL 170



 Score = 33.1 bits (74), Expect(2) = 3e-26
 Identities = 16/48 (33%), Positives = 29/48 (60%)
 Frame = +3

Query: 750 FL*WSHTTTRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRA 893
           F  W  + ++ ++DA +G  +  ++ KEA+ +IE MAA+ +    DRA
Sbjct: 173 FTDWLRSQSKQLLDAFAGGKIKLKSPKEAIELIENMAASDHAILHDRA 220


>gb|AAF63128.1|AC009526_13 Similar to Athila ORF 1 [Arabidopsis thaliana]
          Length = 780

 Score =  125 bits (313), Expect = 4e-26
 Identities = 61/141 (43%), Positives = 87/141 (61%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           I  NNFE+KSGLI M++  +  G PMEDP  HL  F   C   K+NGVS ++ ++++F F
Sbjct: 34  IQNNNFEIKSGLISMIQSNKFHGLPMEDPLDHLDNFDRFCSLTKINGVSEDSFKLRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D    W K L   ++ T DD  KAFL+K+F  S+  RL ++IS F+Q+  E+  EAW
Sbjct: 94  SLGDEAHLWEKTLLVDSVDTWDDCKKAFLAKFFSNSRTARLRNEISGFNQKNSESFAEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    +CP HGF +   L
Sbjct: 154 ERFKRYSTQCPHHGFKKASLL 174


>pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrotransposon
           Athila gi|806535|emb|CAA57397.1| unnamed protein product
           [Arabidopsis thaliana]
          Length = 935

 Score =  124 bits (311), Expect = 7e-26
 Identities = 63/141 (44%), Positives = 88/141 (62%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNFE+KSGLI MV+  +  G  MEDP  HL  F  +C   K+NGVS +  ++++F F
Sbjct: 66  VQNNNFEIKSGLIAMVQGNKFHGLLMEDPLDHLDEFERLCRLTKINGVSEDGFKLRLFPF 125

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L   +I T DD  KAFL+K+F  S+  RL ++IS F+Q+ +E+  EAW
Sbjct: 126 SLGDKAHLWEKTLPHGSITTWDDCKKAFLAKFFSNSRTARLRNEISGFTQKQNESFCEAW 185

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    KCP HGF E   L
Sbjct: 186 ERFKGYPTKCPHHGFKEASLL 206


>gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana]
          Length = 1586

 Score =  122 bits (306), Expect = 3e-25
 Identities = 58/133 (43%), Positives = 85/133 (63%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNFE+KSGLI MV+  +  G PMEDP  HL  F  +C   K+N VS +  ++++F F
Sbjct: 66  VQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKINRVSEDGFKLRLFPF 125

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L + +I + +D  KAFL+K+F  S+  RL +DIS F+Q  +ET +EAW
Sbjct: 126 SLGDKAHQWEKSLPQGSITSWNDCKKAFLAKFFSNSRTARLRNDISGFTQTNNETFYEAW 185

Query: 683 KRYRELLRKCPQH 721
           +R++    +CP H
Sbjct: 186 ERFKGYQTQCPHH 198


>ref|XP_006575927.1| PREDICTED: uncharacterized protein LOC102669817 [Glycine max]
          Length = 352

 Score =  115 bits (287), Expect(2) = 3e-25
 Identities = 60/145 (41%), Positives = 88/145 (60%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           + A N      LIQ++++    G P EDP AHL+ +++IC+TV+L GV  +A+R+ +F F
Sbjct: 30  VQAQNITYPHSLIQLIQNNLFHGLPNEDPYAHLATYIEICNTVRLAGVPEDAVRLSLFLF 89

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL    K W+      ++ T D++ + FL KYFP SK     + IS F +  DE+L EA 
Sbjct: 90  SLFGDAKRWFHSFKGNSLKTWDEVVEKFLKKYFPESKTAEGKAAISSFHEFPDESLSEAL 149

Query: 683 KRYRELLRKCPQHGFTEGQQLPLSI 757
           +R+R LLRK P HGF+E  QL + I
Sbjct: 150 ERFRGLLRKTPTHGFSELIQLNIFI 174



 Score = 28.1 bits (61), Expect(2) = 3e-25
 Identities = 12/40 (30%), Positives = 26/40 (65%)
 Frame = +3

Query: 774 TRSIVDATSGECLVNRNAKEALRIIEEMAANSYQWPMDRA 893
           ++ ++DA++G  +  +  +EA+ +I+ MAA+ +    DRA
Sbjct: 181 SKQLLDASAGRKIKLKTPEEAMELIKNMAASDHAILRDRA 220


>gb|AAD19759.1| putative Athila retroelement ORF1 protein [Arabidopsis thaliana]
          Length = 750

 Score =  121 bits (304), Expect = 4e-25
 Identities = 61/141 (43%), Positives = 87/141 (61%)
 Frame = +2

Query: 323 INANNFELKSGLIQMVKDEQLGGEPMEDPNAHLSMFLDICDTVKLNGVSNNAIRIQMFGF 502
           +  NNFE+KSGLI MV+  +  G PMED   HL  F  +CD  K+NGVS +  + ++F F
Sbjct: 34  VQNNNFEIKSGLIAMVQGNKFHGLPMEDSLDHLDEFERLCDLTKINGVSEDGFKFRLFPF 93

Query: 503 SLSDRDKDWYKGLNKANIHTLDDLCKAFLSKYFPPSKAQRLISDISYFSQQGDETLFEAW 682
           SL D+   W K L + +I T DD  KAF +K+F  S+  RL ++IS F+Q+ +E+  EAW
Sbjct: 94  SLGDKTHLWEKTLPQNSITTWDDCKKAFFAKFFSNSRTARLRNEISGFTQKQNESFCEAW 153

Query: 683 KRYRELLRKCPQHGFTEGQQL 745
           +R++    KCP  GF +   L
Sbjct: 154 ERFKGYQTKCPHPGFKQASLL 174


Top