BLASTX nr result

ID: Mentha22_contig00047209 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00047209
         (1452 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   498   e-138
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   474   e-131
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   440   e-121
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   405   e-110
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   379   e-102
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   373   e-100
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   349   2e-93
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             328   4e-87
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   319   2e-84
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   316   2e-83
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   313   9e-83
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                307   7e-81
emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thal...   306   1e-80
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               302   3e-79
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   300   1e-78
gb|AAD15471.1| putative non-LTR retroelement reverse transcripta...   300   1e-78
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       292   2e-76
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   292   3e-76
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   290   1e-75
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               285   3e-74

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  498 bits (1282), Expect = e-138
 Identities = 228/471 (48%), Positives = 324/471 (68%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            I DNILL+ ELI+GY RK++SPRC++K+D++KAYDSVEW FL  +L E GFP +F  WIM
Sbjct: 556  IADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIM 615

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++V+Y + VNG   +PF ARKG+RQGDP+SP+LF +CMEYLSR L ELK +  F +H
Sbjct: 616  ECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFH 675

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+C++L ITH  FADDLL+F R D +S+  M      F   SGL A+  KS +YF GV+D
Sbjct: 676  PKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDD 735

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
               R +     M  G LPF YLGVPL+++KL+  QC+PLV+ I +R  TW AKLLSYAGR
Sbjct: 736  ETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGR 795

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            +QLIKS+++ +  YW  +F L +KVI+ +++ CR FLWTG+   +++A VAW  +  PK 
Sbjct: 796  LQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKS 855

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG N+ NMK WN+A + KLLW I+ K+D +W++W+H YYIK +D+L + I  Q +W++R
Sbjct: 856  RGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILR 915

Query: 1081 KILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVW 1260
            KI+ AR+++  + + DE+     FS+K+ Y  +    ERV W +++C N A  K KFI+W
Sbjct: 916  KIVKARDHLSNIGDWDEICIGDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILW 975

Query: 1261 LLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNV 1413
            ++LH +L T DR+ R+G+  D    LC    ET+ H+FF C ++  VW  +
Sbjct: 976  MMLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKI 1026


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  474 bits (1220), Expect = e-131
 Identities = 217/481 (45%), Positives = 322/481 (66%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            I DNILL+ ELI+GY R+++SPRC+IK+D++KAYDSVEW FL  +L+ELGFP  F  WIM
Sbjct: 559  IGDNILLATELIRGYNRRHVSPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIM 618

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
            +C+ +V+Y + +NG    PF A+KG+RQGDP+SP+LF + MEYLSR +G + ++  F +H
Sbjct: 619  ACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFH 678

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+C+++ +TH  FADDLL+F+R D +S+  +M   + F + SGL+A+  KSC+YFGGV  
Sbjct: 679  PKCERIKLTHLMFADDLLMFARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCH 738

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
             E   +     M  GSLPF YLGVPL+++KL+  QC+PL+ KI  R   W A LLSYAGR
Sbjct: 739  EEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGR 798

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            +QL+K+++  +  YW Q+F LP+K+IK ++  CR FLWTG    S +A VAW+ +  PK 
Sbjct: 799  LQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKS 858

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG+N+ NM LWN+A I KLLW I  K+D +W++WV+ YYIK +++  + +    SW++R
Sbjct: 859  TGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILR 918

Query: 1081 KILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVW 1260
            KI  +RE + R    + V    +FS+K+ Y  L  + E V W +++C N A  K +FI+W
Sbjct: 919  KIFESRELLTRTGGWEAVSNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILW 978

Query: 1261 LLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKE 1440
            L +  +L T +R+ R+   V   C +C    ET+ H+FF C +++ +W  V  +   + +
Sbjct: 979  LAMLNRLATAERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQ 1038

Query: 1441 A 1443
            A
Sbjct: 1039 A 1039


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  440 bits (1131), Expect = e-121
 Identities = 208/474 (43%), Positives = 307/474 (64%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + D+++L+ EL++GY+RK+ +P+CM++ID+QKAYD+V W  L  +L ELGFP QF  WIM
Sbjct: 384  LHDHVMLAFELLRGYERKHGTPKCMLQIDIQKAYDTVHWDALEHILRELGFPDQFIKWIM 443

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
              + SVTYV  +NG       AR+GIRQGDPISP LF++ MEYL+R L +L +   F YH
Sbjct: 444  IAVRSVTYVFNINGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYH 503

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
             +C+K+ IT+ CFADDLLLFSRGD+ SVQ+M+   + F    GL  N  K  +Y G V+ 
Sbjct: 504  SKCEKMKITNLCFADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDI 563

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              K  +L  +G  EG +PF YLG+PLS++KL+++  Q L+ KI+ R++ W+A LLSYAGR
Sbjct: 564  NVKEQLLLISGFKEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGR 623

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            VQLI+SV+     +W Q   LP+ VI  I   CR FLW G ++ SR++ +AWEKV  PK 
Sbjct: 624  VQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKI 683

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG+NI N+ +WN+ +I KLLW +  K D +WI+W+H YYI+G+ +  M + +  SW++ 
Sbjct: 684  NGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMS 743

Query: 1081 KILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVW 1260
             ++  R  + +  +R     Q  F +K++Y+ L  E E++ W  ++C N A  +  F +W
Sbjct: 744  SMMKLRPLLLQYQSR----MQDVFKMKKIYLALFEESEKMSWRTLMCNNLARPRALFCLW 799

Query: 1261 LLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAW 1422
               H +L + DRL +FG+ VD+ C  C +  E+ +H+FF C   + +W  V  W
Sbjct: 800  QACHFRLASKDRLIKFGLNVDANCAFC-SSMESHEHLFFGCIELKTIWTAVLNW 852


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  405 bits (1040), Expect = e-110
 Identities = 180/344 (52%), Positives = 252/344 (73%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            I DNI+L+HEL+K Y RKN+SPRCM+KID+ KAYDSVEWPFL QV+E LGFP  F+ W+M
Sbjct: 364  IGDNIILAHELVKAYTRKNVSPRCMLKIDLHKAYDSVEWPFLEQVMEGLGFPDLFTKWVM 423

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+ +V Y + VNG+  + F A KG+RQGDP+SP+LF I MEYLSR L  LK++  F+YH
Sbjct: 424  KCVKTVNYTIVVNGQNTQRFDAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYH 483

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+  KL +TH CFADDLLLFSRGDL S++ + +    F + SGL+AN  KS +Y GGV+ 
Sbjct: 484  PKYAKLDVTHLCFADDLLLFSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQM 543

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              ++ I+   G     LPF YLGVPLS++KL+  Q  PL++K++ R+++W AK LSYAGR
Sbjct: 544  EVRQQIIQQLGYTIEELPFKYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGR 603

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
             QL+K+V+ G+   W Q+F++P K+IK I+  CR +LW+G    +++AL+AW+KV  PK 
Sbjct: 604  AQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKY 663

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGR 1032
             GG+ + N+K+WN++ + KL W +  K+D +WI+W+H YYIKG+
Sbjct: 664  EGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIHAYYIKGQ 707


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  379 bits (973), Expect = e-102
 Identities = 177/425 (41%), Positives = 272/425 (64%)
 Frame = +1

Query: 178  MSCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRY 357
            M  +++V+Y   VNG   E   AR+G+RQGDPISP LFVI ME L+R L +++++  F Y
Sbjct: 1    MIAVSTVSYRFNVNGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNY 60

Query: 358  HPRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVE 537
            HP+C KL IT+ CFADDLLLFSRGD  SV MMM+  + F + +GL  N  K  L   G++
Sbjct: 61   HPKCDKLKITNLCFADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGID 120

Query: 538  DAEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAG 717
               KR IL  +G  EG LPF YLGVP++++KLS     PL+ KI+ ++  W A+LLSYAG
Sbjct: 121  AVTKREILEVSGFQEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAG 180

Query: 718  RVQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPK 897
            R+QL+ SV+  +  YW   F  P+ V++ I+  CRIFLWTG    SR++ VAW+++  P+
Sbjct: 181  RLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPR 240

Query: 898  QAGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVV 1077
              GG+NI ++ +WN+A + KLLW +  K+D++W++W+  YY+K  +L+ + +    SW++
Sbjct: 241  SCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIM 300

Query: 1078 RKILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIV 1257
            + IL  RE + ++ N +E++ + S ++ ++Y  L    +R  W  ++  N A  +  FI+
Sbjct: 301  KAILKQREDLEKIDNMEELMIRGSINMGKLYRKLQDCGQRKEWKNLLYGNTARPRANFIL 360

Query: 1258 WLLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEK 1437
            WL  H +L T DRL ++G++ D +CC C + EE+++H+FF C  ++RVW  V  W     
Sbjct: 361  WLACHGRLSTKDRLCKYGMIDDKSCCFC-SEEESMNHLFFVCDNSKRVWMEVLQWVQIRH 419

Query: 1438 EAVRW 1452
            +   W
Sbjct: 420  DPSDW 424


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  373 bits (958), Expect = e-100
 Identities = 172/440 (39%), Positives = 274/440 (62%), Gaps = 1/440 (0%)
 Frame = +1

Query: 88   VQKAYDSVEWPFLNQVLEELGFPYQFSHWIMSCLTSVTYVLTVNGEVLEPFVARKGIRQG 267
            V++ YD V+W  L  VL E G P +F  W+M  +T+V Y   +NGE+      + GI QG
Sbjct: 74   VEETYDMVDWGALEGVLTEFGLPKKFIGWVMKVITTVNYRFNINGELSNVLETKIGIWQG 133

Query: 268  DPISPYLFVICMEYLSRSLGELKQNAGFRYHPRCKKLGITHACFADDLLLFSRGDLASVQ 447
            DPISP LFV+ MEY +R + ++++N  F +H +C++LGITH  FADD+ L  RGD  S++
Sbjct: 134  DPISPLLFVLMMEYFNRIMVKMQRNPSFNHHSQCERLGITHLSFADDVFLLCRGDKKSIK 193

Query: 448  MMMQVLDHFGEVSGLKANQMKSCLYFGGVEDAEKRNILAATGMMEGSLPFSYLGVPLSAQ 627
            M+++    F + +GL+ N  K  ++ GG+     + I   TG  EG+LP  YLGVPLS +
Sbjct: 194  MIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITGFEEGTLPVRYLGVPLSCK 253

Query: 628  KLSVRQCQPLVQKILHRMSTWAAKLLSYAGRVQLIKSVVAGIHMYWCQVFVLPQKVIKFI 807
            KL+V    PLV+KI+ ++  W++KLLS AGR+QL++S++  I  YW  VF +P+KVI+ I
Sbjct: 254  KLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKI 313

Query: 808  QQACRIFLWTGRASASRRALVAWEKVVLPKQAGGMNIGNMKLWNQATICKLLWRIQQKKD 987
               CR F+W+G A   R++LVAW++V  P + GG+N+ N++LWN   + K LW I  K+D
Sbjct: 314  DSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKED 373

Query: 988  AVWIQWVHIYYIKGRDLLEMPIPQQGSWVVRKILGAREYVRRLP-NRDEVLQQRSFSVKR 1164
             +W++W+H Y++KG +++   I    +W+++ ++  R  V  L     E+L++R FS+K+
Sbjct: 374  NLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQVNNLQLVWIEMLRKRKFSMKQ 433

Query: 1165 VYMGLLGEVERVPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLRRFGIMVDSTCCLCD 1344
            VYM L+ +  ++ W +++  N A  +    +WL    +L T  RL+   ++  S C LC 
Sbjct: 434  VYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQCSLCSLCK 493

Query: 1345 TGEETLDHMFFECQFARRVW 1404
              +E LDH+ F C+  + +W
Sbjct: 494  EQDEDLDHLMFSCRVTKAIW 513


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  349 bits (896), Expect = 2e-93
 Identities = 167/407 (41%), Positives = 245/407 (60%), Gaps = 1/407 (0%)
 Frame = +1

Query: 235  PFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYHPRCKKLGITHACFADDLL 414
            P  A++GIRQGDPISP LFV+ MEYL+R L +L+ +  F +H +C+KLGITH  FADD+L
Sbjct: 462  PIAAKRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVL 521

Query: 415  LFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVEDAEKRNILAATGMMEGSLP 594
            LF RGD+ SV+MM+ V++ F   +GL  N  K  +YFGGV+   K  I   +   EG LP
Sbjct: 522  LFCRGDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLP 581

Query: 595  FSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGRVQLIKSVVAGIHMYWCQV 774
              YLGVPL+++KL+++   PL+ KI  R+  W +KLL+  GRVQ++   +  I  +W Q 
Sbjct: 582  VRYLGVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQC 641

Query: 775  FVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQAGGMNIGNMKLWNQATIC 954
              +P  VIK I   CR F+W+     +R++ +AW  V  PK  GG+NI N+K+WN  T+ 
Sbjct: 642  LPIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVL 701

Query: 955  KLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVRKILGAREYVRRL-PNRDE 1131
              LW + +K D +W++W+H +YIK   ++   +    SWV++ +L  REY+  L P  DE
Sbjct: 702  NCLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWDE 761

Query: 1132 VLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLRRFG 1311
            +L    F +K+ Y  ++ E +RV W+ ++ +N A  +     WL  H +L T DRL RFG
Sbjct: 762  LLNSERFKMKKAYDKMM-EADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFG 820

Query: 1312 IMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKEAVRW 1452
            ++ D    LC   EET +H+ F C+ A  +W NV    G +     W
Sbjct: 821  MITDKIWSLCKEVEETQNHILFSCKVATDIWSNVLNRIGIDHVPQEW 867


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  328 bits (841), Expect = 4e-87
 Identities = 167/484 (34%), Positives = 269/484 (55%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y ++++S RC IKID+ KA++SV+W F+  +L  + FP +F HWIM
Sbjct: 98   LIENLLLATELVKDYHKESVSSRCAIKIDISKAFNSVQWSFIRNILLSMDFPMEFVHWIM 157

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ ++ + VNGE++  F +++G+RQG  +SPYLFV+ M+ LS+ L +      F YH
Sbjct: 158  LCISTASFSVQVNGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYH 217

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
             RCK+L +TH  FADDL++ S G + S+  +++V D F + SGLK +  KS +Y  GV +
Sbjct: 218  SRCKELSLTHLSFADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTE 277

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
                 I        G LP  YLG+PL  ++L+     PL++ I  ++ TW  + LSYAGR
Sbjct: 278  DVYHEIQNRYQFDVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGR 337

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + LI SV+  I  +W   F LP++ I+ I + C  FLW+G     R+  V W  V  PKQ
Sbjct: 338  LNLITSVLWSICNFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQ 397

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG+ + ++K  N+ +  KL+WRI    +++W++W+  Y +K      +        V+ 
Sbjct: 398  EGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSVQTTTNMDSVLW 457

Query: 1081 KILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVW 1260
            +     EY+ +   RD   Q R+ S              V W   +    A  K  F  W
Sbjct: 458  RGRN-DEYMPKFSTRDTWNQTRNTSTP------------VTWHMGIWFAHATPKFSFCAW 504

Query: 1261 LLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKE 1440
            L +  +L T D++ ++   +  TC LC+   ET +H+FF C +   +W+N+A    + K 
Sbjct: 505  LAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKF 564

Query: 1441 AVRW 1452
            +  W
Sbjct: 565  STNW 568


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  319 bits (818), Expect = 2e-84
 Identities = 167/501 (33%), Positives = 262/501 (52%), Gaps = 30/501 (5%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y +  +SPRC +KID+ KA+DSV+WPFL   L  L  P +F HWI 
Sbjct: 837  LMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWIN 896

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ ++ + VNG           +RQG  +SPYLFVICM  LS  L +      F YH
Sbjct: 897  LCISTASFSVQVNG-----------LRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYH 945

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            PRC+ +G+TH CFADD+++FS G   S++ ++ +   F   SGL  +  KS L+   +  
Sbjct: 946  PRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISS 1005

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
                +ILA      GSLP  YLG+PL  +++++  C PL++KI  R+S+W  + LSYAGR
Sbjct: 1006 ETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGR 1065

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            +QL+ SV++ +  +W   F LP+  I+ I+Q    FLW+G      +A VAW  V  PK 
Sbjct: 1066 LQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKS 1125

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG+ + ++   N+    KL+WR+   K ++W+ W+                   + ++R
Sbjct: 1126 EGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQ------------------NNLIR 1167

Query: 1081 KILGAREYVRRLPNRDEVLQQRSFSVKRVY-MGLLGEVER-------------------- 1197
             +  A    RR  +RD++L      ++++   G+  E +R                    
Sbjct: 1168 TVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIW 1227

Query: 1198 ---------VPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLRRFGIMVDSTCCLCDTG 1350
                       W K +  + A  K  FI WL  H +L T D++  +   + S C LC+  
Sbjct: 1228 HQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNIS 1287

Query: 1351 EETLDHMFFECQFARRVWDNV 1413
             E+ DH+FF C F+  +WD +
Sbjct: 1288 AESRDHLFFSCNFSSHIWDRL 1308


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  316 bits (809), Expect = 2e-83
 Identities = 147/373 (39%), Positives = 230/373 (61%), Gaps = 1/373 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y +  IS RC IKID+ KA+DSV+WPFL  V   LGFP +F HWI 
Sbjct: 571  LIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWIN 630

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+T+ ++ + VNGE+   F + +G+RQG  +SPYLFVICM+ LS+ L +      F YH
Sbjct: 631  ICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYH 690

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+CK +G+TH  FADDL++ S G + S++ +++V D F + SGL+ +  KS +Y  G+  
Sbjct: 691  PKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSA 750

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              +  +        G LP  YLG+PL  ++LS   C PL++++  R+ +W ++ LSYAGR
Sbjct: 751  TARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGR 810

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + LI SV+  I  +W   F LP+K I+ +++ C  FLW+G    S +A ++W  V  PK 
Sbjct: 811  LNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKD 870

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEM-PIPQQGSWVV 1077
             GG+ + ++K  N     KL+W+I    +++W++WV  + ++     E+     QGSW+ 
Sbjct: 871  EGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIW 930

Query: 1078 RKILGAREYVRRL 1116
            +K+L  RE  + L
Sbjct: 931  KKLLKYREVAKTL 943



 Score = 61.6 bits (148), Expect = 8e-07
 Identities = 31/102 (30%), Positives = 47/102 (46%)
 Frame = +1

Query: 1147 SFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLRRFGIMVDS 1326
            +FS +  +        RVPW KV+  + A  K  F  WL  H +L T DR+  +   + +
Sbjct: 1037 TFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIAT 1096

Query: 1327 TCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKEAVRW 1452
             C  C    ET DH+FF C F   +W ++A    + +    W
Sbjct: 1097 DCIFCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHW 1138


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  313 bits (803), Expect = 9e-83
 Identities = 170/485 (35%), Positives = 261/485 (53%), Gaps = 1/485 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            I DNILL+ E+I  Y + +  PRC   +D+ KA D+VEW F+   L+    P     WI 
Sbjct: 392  IGDNILLAQEIICDYHKADGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIK 451

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGE-LKQNAGFRY 357
            SC++S  + + VNGE+   F  R+G+RQGDP+SPYLFVI ME LS  +   +  +  FRY
Sbjct: 452  SCISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRY 511

Query: 358  HPRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVE 537
            H RC +L ++H CFADDLL+F  GD  SV+ +     +F  +S LKAN  +S ++  GV+
Sbjct: 512  HWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVD 571

Query: 538  DAEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAG 717
                 ++L  T    G+ P  YLG+PL   KL ++ C PL+ +I  R+ +W  K+LS+AG
Sbjct: 572  GNSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAG 631

Query: 718  RVQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPK 897
            R+QLI+SV++ I +YW    +LP+KV+K I++  R FLW G  S      VAW ++ LPK
Sbjct: 632  RLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPK 691

Query: 898  QAGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVV 1077
              GG+ I ++  WN+A +   +W +       W  WV +Y +KG      P+P   SW  
Sbjct: 692  CEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNW 751

Query: 1078 RKILGAREYVRRLPNRDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIV 1257
            RK+L  RE        + +   R+ S+       LG +  + W+  +      +K   + 
Sbjct: 752  RKLLKIRELCCSF-FVNIIGDGRATSLWFDNWHPLGPL-TLRWSSNIIGESGLSKSAMLT 809

Query: 1258 WLLLHAKLVTCDRLRRFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEK 1437
                ++     + LR    +V     +     ET +H+FF+C ++  +W +V + C   K
Sbjct: 810  PNGFYSTSSAWNTLRPSRFIVPWYRLVWFVA-ETHNHLFFDCAYSFGIWTHVLSKCDVSK 868

Query: 1438 EAVRW 1452
              + W
Sbjct: 869  PLLPW 873


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  307 bits (787), Expect = 7e-81
 Identities = 147/373 (39%), Positives = 224/373 (60%), Gaps = 1/373 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y + +ISPRC +KID+ KA+DSV+W FL   LE L FP  F HWI 
Sbjct: 144  LIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHWIK 203

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ T+ + VNGE+   F +++G+RQG  +SPYLFVICM  LS  +     +    YH
Sbjct: 204  LCISTATFSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYH 263

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+CKKL +TH CFADDL++F  G   SV+ ++ +   F   SGL  +  KS LY  GV +
Sbjct: 264  PKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSE 323

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              + NIL+A     G LP  YLG+PL  ++++     PL+ K+  ++S+W A+ LSYAGR
Sbjct: 324  LNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGR 383

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + LI SV+  +  +W   + LP   IK I++ C  FLW+G     ++A + W  +   KQ
Sbjct: 384  LALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQ 443

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYI-KGRDLLEMPIPQQGSWVV 1077
             GG+ I ++   N+ +  KL+WR+  ++ ++W+ WV  Y I KG           GSW+ 
Sbjct: 444  EGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMW 503

Query: 1078 RKILGAREYVRRL 1116
            +K+L  R+  + +
Sbjct: 504  KKLLKYRDVAKSM 516


>emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|7267919|emb|CAB78261.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 662

 Score =  306 bits (785), Expect = 1e-80
 Identities = 144/376 (38%), Positives = 228/376 (60%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y + ++S RC IKID+ KA+DSV+W FL  VL  L FP +F HWIM
Sbjct: 89   LIENLLLATELVKDYHKDSVSERCAIKIDISKAFDSVQWSFLRNVLLTLDFPQEFVHWIM 148

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+T+ ++ + VN E+   F + +G+RQG  ++PYLFVI M+ LS+ L        F YH
Sbjct: 149  LCVTTASFSVQVNRELAGYFNSLRGLRQGCSLTPYLFVIVMDVLSKKLDRAAGLRKFGYH 208

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+CK LG+TH  FADD+++ + G L S++ +++V D F + SGLK +  K+ +YF G+  
Sbjct: 209  PKCKNLGLTHLSFADDIMVLTDGKLRSLEGIVEVFDSFAKQSGLKISMAKTTIYFAGISK 268

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
            +  +          G LP  YL +PL  ++ + +   PL+++I  R+ TW A+ LSYAGR
Sbjct: 269  SVCKEFEDQFHFAVGRLPVRYLCLPLVTKRFTSQDYSPLLEQIKRRIGTWTARFLSYAGR 328

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + L+ SV+  I  +W   F LP++ ++ I + C  FLW+G   ++ +A +AWE V  PK+
Sbjct: 329  LNLVSSVLWSICNFWLSAFRLPRECVREIDKLCSAFLWSGPELSTNKAKIAWETVCRPKR 388

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVR 1080
             GG+ + ++K  N     KL+WRI  + D++W+QW+  Y +K           QGSW+ +
Sbjct: 389  EGGLGLQSIKEANDVCCLKLIWRIVSQGDSLWVQWIRTYLLKRNTFWSFRSASQGSWMWK 448

Query: 1081 KILGAREYVRRLPNRD 1128
            K+L  R+  +     D
Sbjct: 449  KLLKYRDTAKAFSKVD 464


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  302 bits (773), Expect = 3e-79
 Identities = 147/371 (39%), Positives = 229/371 (61%), Gaps = 1/371 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y + +IS RC IKID+ KA+DSV+W FL   L  + F   F HWI 
Sbjct: 218  LIENLLLATELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWIN 277

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+T+ ++ + VNG+++  F +++G+RQG  +SPYLFVICM+ LS+ L +      F +H
Sbjct: 278  LCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFH 337

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+C++LG+TH  FADDL++ S G   S++ +++V D F + SGL+ +  KS LY  GV  
Sbjct: 338  PKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSP 397

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              K+ I A      G LP  YLG+PL  ++L+     PL+++I  R++TW  +  S+AGR
Sbjct: 398  IIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGR 457

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
              LIKSV+  I  +W   F LP++ I+ I + C  FLW+G   +S +A ++W+ V  PK 
Sbjct: 458  FNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKA 517

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEM-PIPQQGSWVV 1077
             GG+ + N+K  N  +  KL+WRI    +++W +WV  Y I+ + +  +      GSW+ 
Sbjct: 518  EGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIW 577

Query: 1078 RKILGAREYVR 1110
            RKIL  R+  +
Sbjct: 578  RKILKIRDVAK 588



 Score = 58.9 bits (141), Expect = 5e-06
 Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
 Frame = +1

Query: 1150 FSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLRRFGIM--VD 1323
            FS +  +  +      V W K V    A  K     WL +H +L T DR+ ++     V 
Sbjct: 685  FSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVS 744

Query: 1324 STCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKEAVRW 1452
              C LC    +TL+H+FF C +A  VW  +A    + + + RW
Sbjct: 745  GNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRW 787


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  300 bits (768), Expect = 1e-78
 Identities = 139/373 (37%), Positives = 227/373 (60%), Gaps = 1/373 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y +++++PRC +KID+ KA+DSV+W FL   LE L FP  F HWI 
Sbjct: 868  LMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIK 927

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ T+ + VNGE+   F + +G+RQG  +SPYLFVICM  LS  + E   +    YH
Sbjct: 928  LCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYH 987

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+C+K+G+TH CFADDL++F  G   S++ ++ V   F   SGL+ +  KS +Y  GV  
Sbjct: 988  PKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSA 1047

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
            +++   L++     G LP  YLG+PL  ++++     PL++ +  ++S+W A+ LSYAGR
Sbjct: 1048 SDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGR 1107

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + L+ SV+  I  +W   + LP   I+ I++ C  FLW+G     ++A +AW  +  PK+
Sbjct: 1108 LALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKK 1167

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYI-KGRDLLEMPIPQQGSWVV 1077
             GG+ I ++   N+ +  KL+WR+   + ++W+ W+  + I KG           GSW+ 
Sbjct: 1168 EGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMW 1227

Query: 1078 RKILGAREYVRRL 1116
            +K+L  RE  + +
Sbjct: 1228 KKLLKYRELAKSM 1240


>gb|AAD15471.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1277

 Score =  300 bits (768), Expect = 1e-78
 Identities = 145/373 (38%), Positives = 221/373 (59%), Gaps = 1/373 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y + +ISPRC +KID+ KA+DSV+W FL   LE L FP +F HWI 
Sbjct: 713  LMENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALKFPEKFRHWIK 772

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ T+ + VN E    F +++G+RQG  +SPYLFVICM  LS  +     +    YH
Sbjct: 773  LCISTATFSVQVNSEQAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYH 832

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P+CKKL +TH CFADDL++F  G   SV+ ++ +   F   SGL  +  KS LY   V +
Sbjct: 833  PKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKDFAGKSGLHISLEKSTLYLAEVSE 892

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              + NIL+A     G LP  YLG PL  ++++     PL+ K+  ++S+W A+ LSYAGR
Sbjct: 893  LNRNNILSAFPFASGQLPVRYLGFPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGR 952

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + LI SV+  +  +W   + LP   IK I++ C  FLW+G     ++A + W  +   KQ
Sbjct: 953  LALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQ 1012

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYI-KGRDLLEMPIPQQGSWVV 1077
             GG+ I ++   N+ +  KL+WR+  ++ ++W+ WV  Y I KG           GSW+ 
Sbjct: 1013 EGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMW 1072

Query: 1078 RKILGAREYVRRL 1116
            +K+L  R+  + +
Sbjct: 1073 KKLLNYRDVAKSM 1085


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  292 bits (748), Expect = 2e-76
 Identities = 142/364 (39%), Positives = 216/364 (59%)
 Frame = +1

Query: 7    DNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIMSC 186
            +N+LL+ +L+ GY   NISPR M+K+D++KA+DSV W F+   L  L  P +F +WI  C
Sbjct: 567  ENVLLATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQC 626

Query: 187  LTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYHPR 366
            +++ T+ +++NG     F + KG+RQGDP+SPYLFV+ ME  S  L    ++    YHP+
Sbjct: 627  ISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPK 686

Query: 367  CKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVEDAE 546
               L I+H  FADD+++F  G   S+  + + LD F   SGLK N+ KS LY  G+   E
Sbjct: 687  ASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLE 746

Query: 547  KRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGRVQ 726
              N  AA G   G+LP  YLG+PL  +KL + + +PL++KI  R  +W  K LS+AGR+Q
Sbjct: 747  S-NANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQ 805

Query: 727  LIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQAG 906
            LI SV+ G   +W   F+LP+  IK I+  C  FLW+G    ++   V+W  + LPK  G
Sbjct: 806  LISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEG 865

Query: 907  GMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQGSWVVRKI 1086
            G+ +  +  WN+    +L+WR+   KD++W  W H++++       +   Q  SW  +++
Sbjct: 866  GLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRL 925

Query: 1087 LGAR 1098
            L  R
Sbjct: 926  LSLR 929


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  292 bits (747), Expect = 3e-76
 Identities = 139/371 (37%), Positives = 225/371 (60%), Gaps = 1/371 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y + +IS RC +KID+ KA+DS++W FL  VL  + FP +F HWI 
Sbjct: 292  LIENVLLATELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWIS 351

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+++ ++ + VNGE+   F + +G+RQG  +SPYLFVI M+ LSR L +      F YH
Sbjct: 352  LCMSTASFSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYH 411

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            PRCK LG+TH CFADDL++ + G + SV  +++VL+ F    GLK    K+ LY  GV D
Sbjct: 412  PRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSD 471

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              ++ + +      G LP  YLG+PL  ++L+     PL+ +I  R+  W ++ LS+AGR
Sbjct: 472  HSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGR 531

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            + LI SV+  I  +W   F LP++ I  I +     LW+G     ++A V+W+++  PK+
Sbjct: 532  LSLINSVLWSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKK 591

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEMPIPQQ-GSWVV 1077
             GG+ + +++  N+ +  KL+WR+   +D++W++W  +  +K      +      GSW+ 
Sbjct: 592  EGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIW 651

Query: 1078 RKILGAREYVR 1110
            R++L  RE  +
Sbjct: 652  RRLLKHREVAK 662



 Score = 59.7 bits (143), Expect = 3e-06
 Identities = 30/110 (27%), Positives = 55/110 (50%)
 Frame = +1

Query: 1123 RDEVLQQRSFSVKRVYMGLLGEVERVPWAKVVCQNPAPAKCKFIVWLLLHAKLVTCDRLR 1302
            +++V + R FS K  +  +     +  W K V    A  K  F  WL +  +L T DR+ 
Sbjct: 752  KEDVFKAR-FSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMM 810

Query: 1303 RFGIMVDSTCCLCDTGEETLDHMFFECQFARRVWDNVAAWCGEEKEAVRW 1452
             +     +TC  C +  ET DH+FF+C ++  +W ++A    +++ + +W
Sbjct: 811  TWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKDRFSTKW 860


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  290 bits (741), Expect = 1e-75
 Identities = 141/371 (38%), Positives = 229/371 (61%), Gaps = 1/371 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ EL+K Y +++IS R  +KID+ KA+D V+WPFL  VL+ +  P  F HWI 
Sbjct: 718  MMENLLLASELVKDYHKESISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIE 777

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+ + ++ + VNGE+   F + +G+RQG  +SPYL+VICM  LS  L +        YH
Sbjct: 778  LCIGTASFSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYH 837

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            PRC+ + +TH CFADD+++FS G   S+Q  + + + F  +S LK +  KS ++  G+  
Sbjct: 838  PRCRNMNLTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISP 897

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              K +IL       G+LP  YLG+PL  ++++     PLV+KI  R+++W  + LS+AGR
Sbjct: 898  NAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGR 957

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
            +QLIKSV++ I  +W  VF LP+  ++ I++    FLW+G    +++A +AW +V   K+
Sbjct: 958  LQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKE 1017

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDLLEM-PIPQQGSWVV 1077
             GG+ +  +K  N+ ++ KL+WRI   +D++W++WV+ + I+      +      GSW+ 
Sbjct: 1018 EGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLW 1077

Query: 1078 RKILGAREYVR 1110
            RKIL  R+  R
Sbjct: 1078 RKILKQRDKAR 1088


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  285 bits (730), Expect = 3e-74
 Identities = 140/373 (37%), Positives = 224/373 (60%), Gaps = 2/373 (0%)
 Frame = +1

Query: 1    IFDNILLSHELIKGYQRKNISPRCMIKIDVQKAYDSVEWPFLNQVLEELGFPYQFSHWIM 180
            + +N+LL+ +L+K Y + +IS RC IKID+ KA DSV+W FL   L  + FP  F HWI 
Sbjct: 124  LIENVLLATDLVKDYHKDSISERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIHWIR 183

Query: 181  SCLTSVTYVLTVNGEVLEPFVARKGIRQGDPISPYLFVICMEYLSRSLGELKQNAGFRYH 360
             C+T+ ++ + VNGE+   F + +G+RQG  +SPYLFVICM+ LS+ L ++       YH
Sbjct: 184  LCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYH 243

Query: 361  PRCKKLGITHACFADDLLLFSRGDLASVQMMMQVLDHFGEVSGLKANQMKSCLYFGGVED 540
            P CK++G+TH  FADDL++ + G   S++ +++V D F + SGLK +  KS ++  G+  
Sbjct: 244  PHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSS 303

Query: 541  AEKRNILAATGMMEGSLPFSYLGVPLSAQKLSVRQCQPLVQKILHRMSTWAAKLLSYAGR 720
              +  +        G LP  YLG+PL  ++LS     PL+++I  R+ +W+++ LS+AGR
Sbjct: 304  TSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGR 363

Query: 721  VQLIKSVVAGIHMYWCQVFVLPQKVIKFIQQACRIFLWTGRASASRRALVAWEKVVLPKQ 900
              LI S++     +W   F LP+  I+ I++ C  FLW+G    S++A ++W +V  PK 
Sbjct: 364  FNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKS 423

Query: 901  AGGMNIGNMKLWNQATICKLLWRIQQKKDAVWIQWVHIYYIKGRDL--LEMPIPQQGSWV 1074
             GG+ + ++K  N     KL+WRI    D++W++WV    +K R++  +       GSW+
Sbjct: 424  EGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEHNLLK-REIFWIVKENANLGSWI 482

Query: 1075 VRKILGAREYVRR 1113
             +KIL  R   +R
Sbjct: 483  WKKILKYRGVAKR 495


Top