BLASTX nr result

ID: Zingiber25_contig00014057 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00014057
         (3855 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   624   e-175
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   622   e-175
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   578   e-162
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       567   e-158
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   558   e-156
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   525   e-146
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   514   e-142
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   503   e-139
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           503   e-139
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   498   e-137
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   494   e-136
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   489   e-135
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   486   e-134
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              453   e-124
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   453   e-124
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   446   e-122
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   442   e-121
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   435   e-119
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   429   e-117
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   419   e-114

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  624 bits (1608), Expect = e-175
 Identities = 363/1098 (33%), Positives = 587/1098 (53%), Gaps = 21/1098 (1%)
 Frame = +3

Query: 486  MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANN 665
            M   SWN+RG +   K   +++ L   +I V AL+ET++ +     +  +        NN
Sbjct: 1    MLCVSWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNN 60

Query: 666  FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 845
            +++S R RI + W    V++ +     Q M C ++        +A  VYG H+I  R+ L
Sbjct: 61   YSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQDQSHKLKMVA--VYGLHTIADRKSL 118

Query: 846  WNSLTLLGENIAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIG 1025
            W+ L    +   +P II+GD+N++    D+  G  V++ E  D ++F+    L++ RS  
Sbjct: 119  WSGLLQCVQQ-QDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRSTW 177

Query: 1026 CRLTWTNGSVS-----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREK 1190
               +W+N S+      S++D+A VN  WL    E  V++LPPG +SDHS  + + +    
Sbjct: 178  SYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPG-ISDHSPLLFNLMTGRP 236

Query: 1191 RCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI 1370
            +  +PFKF N+      F + V  +W+   S N +F L+A    LK   REL ++  Q I
Sbjct: 237  QGGKPFKFMNVMAEQGEFLETVEKAWN---SVNGRFKLQAIWLNLKAVKRELKQMKTQKI 293

Query: 1371 SEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRATL-----LAEAEKLFYQQRAKSVYL 1535
                 + K++    Q          ND      ++ +      +  E    QQ+++  +L
Sbjct: 294  GLAHEKVKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWL 353

Query: 1536 KNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLG-KTGQCT 1712
            +  D  +K F   VK  +  N I  L  + G      DEV EE + +++ LLG +     
Sbjct: 354  QQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLM 413

Query: 1713 PLDSNTISGNHLSQAQ-REELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKI 1889
             +D NT+ G     AQ +E L+  + + EI  AL  IG DKAPG DGFNA F+  +W  I
Sbjct: 414  GVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSI 473

Query: 1890 GDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSA 2069
              +  A ++EFF++ ++ +  N   + L+PK  H+  V ++RPI+CCTV YK+ISK+L+ 
Sbjct: 474  KQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTN 533

Query: 2070 RLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVD 2249
            R+  ++  +++ AQ  F+  R I +NI LA EL+R Y RK +SPRC++KVD++KA+D+V+
Sbjct: 534  RMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVE 593

Query: 2250 WGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFG 2429
            W FL + L  +GFP +F GWI EC+STVSYSV +NG     F  R+GLRQGDP+SP LF 
Sbjct: 594  WSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFA 653

Query: 2430 LCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDF 2609
            LC+E  SR ++E      F++HP+C  + +THL +ADDLL+F RAD SSL+ +      F
Sbjct: 654  LCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKF 713

Query: 2610 GDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGS 2789
               +GL A+  KSN+Y  G++++   EL      Q G  PFRYLG+PL +++L  A    
Sbjct: 714  SHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKP 773

Query: 2790 LLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW 2969
            L+E IT +  +W    LS+AGR++LI+S+L  ++ +W  I P+   VI  +  + R F+W
Sbjct: 774  LVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLW 833

Query: 2970 S-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQ 3134
            +     TK  P++W+TI  PK  GG+ + +++ WN   + + LW I+ +RD LWV+WIH 
Sbjct: 834  TGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHS 893

Query: 3135 HYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGTALAY 3314
            +Y++ +D          + +++ +++ RD L    ++IG       E     +     AY
Sbjct: 894  YYIKRQDILTVNISNQTTWILRKIVKARDHL----SNIG----DWDEICIGDKFSMKKAY 945

Query: 3315 DFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCALCQQH 3485
                    +V W  +I  +   PK +F +W++ H +L T D +S    +      LC+  
Sbjct: 946  KKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRND 1005

Query: 3486 EESNGHIFFECAIAKSLLDRVVRWMDIKFDITSMAKLL-DVFQRHYKGKGRQVKARYLAV 3662
             E+  H+FF C+ +  +  ++   M       S  +++  V  +  K KG+ +   Y   
Sbjct: 1006 GETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKKKGKLIVMLY--- 1062

Query: 3663 SSMIYLIWEARNRSRFEG 3716
            +  +Y IW+ RN+  F G
Sbjct: 1063 TEFVYAIWKQRNKRTFTG 1080


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  622 bits (1603), Expect = e-175
 Identities = 367/1102 (33%), Positives = 582/1102 (52%), Gaps = 24/1102 (2%)
 Frame = +3

Query: 486  MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANN 665
            MKI +WN+RG +  +K   V+H L  ++I + +L ET++       I ++  +     NN
Sbjct: 1    MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60

Query: 666  FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 845
            +  S RGRI + W  + V+++VL  T Q +   V+ S    +F    VYG H+I  R+ L
Sbjct: 61   YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120

Query: 846  WNSLTLLGENIAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIG 1025
            W  L        EP I++GDYN++ + QD+  G  VS  E  DL  FV    L++  + G
Sbjct: 121  WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180

Query: 1026 CRLTWTNGSV-----SSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREK 1190
               +W N S+     SS++D++ VN  W+    +  VE+   G +SDHS  I +   +  
Sbjct: 181  LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIFNLATQHD 239

Query: 1191 RCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI 1370
               RPFKF N      GF ++V  +W    S N +F +K    RL+   R L   H +  
Sbjct: 240  EGGRPFKFLNFLADQNGFVEVVKEAWG---SANHRFKMKNIWVRLQAVKRALKSFHSKKF 296

Query: 1371 SEQALRAKSVLEDAQRSLLSGAGPPNDYEVIR---KRATLLAEAEKL------FYQQRAK 1523
            S    +A   +E+ +R L +    P   +V     +   L+A+  K         +Q+++
Sbjct: 297  S----KAHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSR 352

Query: 1524 SVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLG-KT 1700
              +L   D  +KFF   +K    RN I+ L    GD+ T   E+  E   +++ LLG  +
Sbjct: 353  IQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSS 412

Query: 1701 GQCTPLDSNTIS-GNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAA 1877
             Q   +D + +  G  LS     +LV PIT +EI  AL DI   KAPG DGFN+ F+  +
Sbjct: 413  SQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKS 472

Query: 1878 WDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISK 2057
            W  I  +    + +FF +G + K  N T++ LIPK D +    DYRPI+CC+  YK+ISK
Sbjct: 473  WLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISK 532

Query: 2058 VLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAF 2237
            +L+ RL  V++ ++D AQ  F+ +R IG+NI LA EL+R Y R+ +SPRC+IKVD++KA+
Sbjct: 533  ILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKAY 592

Query: 2238 DTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSP 2417
            D+V+W FL S L   GFP  F  WI  C+ TVSYS+ LNG     F  ++GLRQGDPLSP
Sbjct: 593  DSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSP 652

Query: 2418 LLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGC 2597
             LF L +E  SR M    +   F++HP+C  ++LTHL +ADDLL+FARAD SS++ IM  
Sbjct: 653  FLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAA 712

Query: 2598 LTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIA 2777
               F   +GL+A+  KS +Y  G+  +   +L        G  PFRYLG+PLA+++L  +
Sbjct: 713  FNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFS 772

Query: 2778 NYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISR 2957
                L++ IT +   W  H LS+AGR++L++++L  ++ +W  I P+P  +I  +    R
Sbjct: 773  QCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCR 832

Query: 2958 SFVW-----STKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVK 3122
             F+W     ++   P++W  +  PK  GG  + ++  WN   + + LW I  ++D LWV+
Sbjct: 833  KFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVR 892

Query: 3123 WIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGT 3302
            W++ +Y++ ++  +     + S +++ + + R+ LL RT    A    +     +     
Sbjct: 893  WVNAYYIKRQNIENVTVSSNTSWILRKIFESRE-LLTRTGGWEAVSNHMNFSIKK----- 946

Query: 3303 ALAYDFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCAL 3473
               Y   +     V W  +I  +   PK +F +WL    +L TA+ +S    + S +C +
Sbjct: 947  --TYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKM 1004

Query: 3474 CQQHEESNGHIFFECAIAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARY 3653
            C    E+  H+FF C  +K +  +V+ +++++    + AK     ++    K R  K   
Sbjct: 1005 CGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADAQAKKELAIKKARSTKDRN-KLYV 1063

Query: 3654 LAVSSMIYLIWEARNRSRFEGI 3719
            +  +  +Y IW  RN   F GI
Sbjct: 1064 MMFTESVYAIWLLRNAKVFRGI 1085


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  578 bits (1490), Expect = e-162
 Identities = 328/954 (34%), Positives = 507/954 (53%), Gaps = 21/954 (2%)
 Frame = +3

Query: 498  SWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNFNYS 677
            SWN+RGF+ +++    +   K  +    +++ET++ +      +   F G     N+ ++
Sbjct: 6    SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFA 65

Query: 678  NRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSL 857
              GRI ++WD   V + VL  + Q + C V+       F+ TFVY  +    RR LW+ L
Sbjct: 66   ALGRIWVVWDP-AVEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSEL 124

Query: 858  TLLGENIA---EPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGC 1028
             LL  N     +PWII+GD+N  L   D   G       + +    + T  + DL   G 
Sbjct: 125  ELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGN 184

Query: 1029 RLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRC 1196
              TW N      ++ K+DR +VN  WLIA   SY  F      SDH    V+   +    
Sbjct: 185  HYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAME-FSDHCPSCVNISNQSGGR 243

Query: 1197 NRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDKLHFQHIS 1373
            N+PFK  N    H  F + +  +WD     G+A F L  K   LKG +R  ++ H+  + 
Sbjct: 244  NKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLE 303

Query: 1374 EQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRA----TLLAEAEKLFYQQRAKSVYLKN 1541
            ++ ++A   L+  Q +LL  A P +    + K A      LA AE+ F  Q+++ ++LK 
Sbjct: 304  KRVVQAAQNLKTCQNNLL--AAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKC 361

Query: 1542 SDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLD 1721
             D  T FFH ++      N I  L  Q+G    + DE+    V +F++L G +      +
Sbjct: 362  GDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAE 421

Query: 1722 S----NTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKI 1889
                 N+++     +  R+ L   ++  +IK+  F + ++K+PGPDG+ ++F+   W  +
Sbjct: 422  GISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIV 481

Query: 1890 GDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSA 2069
            G   IAAV+EFF  G+LL QWN T++ ++PK  ++  +T++RPISCC   YKVISK+L+ 
Sbjct: 482  GPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISKLLAR 541

Query: 2070 RLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVD 2249
            RL N++   + P+Q AFVK R + EN+ LA EL++ + +  IS R ++KVDL+KAFD+V 
Sbjct: 542  RLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSRGVLKVDLRKAFDSVG 601

Query: 2250 WGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFG 2429
            WGF++  L     P +F  WI +CI++ S+S++++G L G+F G +GLRQGDPLSP LF 
Sbjct: 602  WGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFV 661

Query: 2430 LCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDF 2609
            + +E+ SR ++         YHP+ +EV ++ LA+ADDL++F     SSL  I   L  F
Sbjct: 662  IAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESF 721

Query: 2610 GDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGS 2789
             + +GL  N  KS +Y AG+E+  + + L   GF  G FPFRYLG+PL   +LR ++Y  
Sbjct: 722  KNLSGLEMNTEKSAVYTAGLEDTDKEDTLAF-GFVNGTFPFRYLGLPLLHRKLRRSDYSQ 780

Query: 2790 LLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW 2969
            L++ I  +   W   TLS AGR++LI SV+     FWLS   +P   +  I  +   F+W
Sbjct: 781  LIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLW 840

Query: 2970 STK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQ 3134
                       +SW   CLPK EGG GLR+   WN TL  R +W + +RRDSLWV W H 
Sbjct: 841  GNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHA 900

Query: 3135 HYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGEN 3296
            + LR  +FW+       S + K +L +R +  +           L+ W+    N
Sbjct: 901  NRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSN 954


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  567 bits (1461), Expect = e-158
 Identities = 347/963 (36%), Positives = 509/963 (52%), Gaps = 27/963 (2%)
 Frame = +3

Query: 501  WNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNFNYSN 680
            WNIRGF+      G +  +K  +     +IET +        +     G     N+ +S+
Sbjct: 8    WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67

Query: 681  RGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLT 860
             G+I ++WD   V + V+  + Q + C V    +    + + VY  + + +R+ LW  + 
Sbjct: 68   LGKIWVMWDPS-VQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIV 126

Query: 861  ------LLGENIAEPWIIMGDYNSLLTVQDKEGGQPVS---NYELHDLERFVHTCGLVDL 1013
                  ++G+    PW+++GD+N +L  Q  E   PVS   +  + D    +    L DL
Sbjct: 127  NMVVSGIIGDR---PWLVLGDFNQVLNPQ--EHSNPVSLNVDINMRDFRDCLLAAELSDL 181

Query: 1014 RSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDH-SCGIVSTL 1178
            R  G   TW N S    V+ K+DR +VN  W  A + S +        SDH SCG+V   
Sbjct: 182  RYKGNTFTWWNKSHTTPVAKKIDRILVNDSWN-ALFPSSLGIFGSLDFSDHVSCGVVLEE 240

Query: 1179 AREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDKL 1355
               K   RPFKF+N    +  F +LV  +W    + G++ F +  KL  LK  +++  +L
Sbjct: 241  TSIK-AKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRL 299

Query: 1356 HFQHISEQALRAKSVLEDAQRSLLSGAGPPN-DYEVIRKRAT-LLAEAEKLFYQQRAKSV 1529
            ++  + ++   A   L   Q   L+   P N  +E+  +R   +L  AE+ F++Q+++  
Sbjct: 300  NYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFRQKSRIS 359

Query: 1530 YLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQC 1709
            +    D  TK+FH +    N  N+I  L   +G    S + + +    YF  LLG   + 
Sbjct: 360  WFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGD--EV 417

Query: 1710 TPL-----DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTA 1874
             P      D N +     S AQ  EL    +NE+I+ ALF +  +K+ GPDGF A+F+  
Sbjct: 418  DPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFID 477

Query: 1875 AWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVIS 2054
            +W  +G +   A+KEFF  G LLKQWN T+I LIPK  +  C +D+RPISC    YKVI+
Sbjct: 478  SWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIA 537

Query: 2055 KVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKA 2234
            ++L+ RL  ++S ++  AQ AF+  RS+ EN+ LA +L+  Y    ISPR M+KVDLKKA
Sbjct: 538  RLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVDLKKA 597

Query: 2235 FDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLS 2414
            FD+V W F+++AL     P++F  WI +CIST +++VS+NGG  GFF   +GLRQGDPLS
Sbjct: 598  FDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLS 657

Query: 2415 PLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMG 2594
            P LF L +E FS  +         HYHP+ + + ++HL +ADD+++F      SL+ I  
Sbjct: 658  PYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICE 717

Query: 2595 CLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRI 2774
             L DF   +GL+ N  KS+LY+AG+  QL        GF  G  P RYLG+PL   +LRI
Sbjct: 718  TLDDFASWSGLKVNKDKSHLYLAGL-NQLESNANAAYGFPIGTLPIRYLGLPLMNRKLRI 776

Query: 2775 ANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGIS 2954
            A Y  LLE IT +  SW    LS AGRI+LI SV+ G   FW+S   +P G I +I  + 
Sbjct: 777  AEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLC 836

Query: 2955 RSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWV 3119
              F+WS      K   +SW+ +CLPK EGG GLR L  WN TL  R +W +   +DSLW 
Sbjct: 837  SRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWA 896

Query: 3120 KWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENG 3299
             W H H+L    FW   G  SDS   K LL +R +  Q         +K   W+   +N 
Sbjct: 897  DWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWY---DNW 953

Query: 3300 TAL 3308
            T+L
Sbjct: 954  TSL 956


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  558 bits (1437), Expect = e-156
 Identities = 323/939 (34%), Positives = 503/939 (53%), Gaps = 31/939 (3%)
 Frame = +3

Query: 501  WNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNFNYSN 680
            WN+RG +K+ KH  ++  +++   Q   L+ET++ +  +  ++ + F    +  N+ ++ 
Sbjct: 6    WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNR 65

Query: 681  RGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLT 860
            RGRI +LW  + V L  +  + Q + C V+       F  +FVY  + +  R+ LW+ L 
Sbjct: 66   RGRIWVLWRKN-VRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELK 124

Query: 861  LLGENIA---EPWIIMGDYNSLLTVQDKEGG--QPVSNYELHDLERFVHTCGLVDLRSIG 1025
               ++     +PW ++GD+N  L + +       P+    + D ++ ++ C L D+ + G
Sbjct: 125  DHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQG 184

Query: 1026 CRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIV--STLARE 1187
               TW N    G +  KLDR ++N  W     +SY  F   GC     C I   S    +
Sbjct: 185  PLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAGGCSDHLRCRISLNSEAGNK 244

Query: 1188 KRCNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLKGKLRELDKL 1355
             +  +PFKF N  T  E F+ +V++ W +     +S +  F     L  LK K+R + + 
Sbjct: 245  VQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSMARD 304

Query: 1356 HFQHISEQALRAKSVLEDAQRSLLSGAGPP--NDYEVIRKRATLLAEAEKLFYQQRAKSV 1529
               ++S++A  A  +L   Q   L+        +      R   +A  E+ + +Q++K  
Sbjct: 305  RLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDRVAILEEKYLKQKSKLH 364

Query: 1530 YLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQC 1709
            + +  D+ TK FH         N I ++    G   T  DE+  E   +F++ L    Q 
Sbjct: 365  WCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFL----QL 420

Query: 1710 TPLDSNTISGNHLSQ--------AQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKF 1865
             P D   ++   L Q        A ++ L+ P+T EEI+  LF + +DK+PGPDG+ ++F
Sbjct: 421  IPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEF 480

Query: 1866 YTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYK 2045
            + A W+ IGD+F  AV+ FF  G L K  N T +ALIPK   +  + DYRPISCC V YK
Sbjct: 481  FKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYK 540

Query: 2046 VISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDL 2225
            VISK+++ RL  V+   +   Q AFVKDR + EN+ LA EL++ Y +  IS RC IK+D+
Sbjct: 541  VISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDI 600

Query: 2226 KKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGD 2405
             KAFD+V W FL++     GFP++F  WI+ CI+T S+SV +NG L G+F   RGLRQG 
Sbjct: 601  SKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGC 660

Query: 2406 PLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNL 2585
             LSP LF +C++V S+ + +A+    F YHP+C  + LTHL++ADDL++ +   + S+  
Sbjct: 661  ALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIER 720

Query: 2586 IMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATER 2765
            I+    +F   +GLR +  KS +Y+AG+    R E+     F  G  P RYLG+PL T+R
Sbjct: 721  IIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKR 780

Query: 2766 LRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIY 2945
            L   +   LLE + K++ SW    LS+AGR+ LI SVL  +  FWL+   +P   I ++ 
Sbjct: 781  LSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELE 840

Query: 2946 GISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDS 3110
             +  +F+WS     +    ISW  +C PK+EGG GLR L+  N     + +W I S  +S
Sbjct: 841  KMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNS 900

Query: 3111 LWVKWIHQHYLRSEDFWHWRGKPSDSPLI-KCLLQIRDV 3224
            LWVKW+ QH LR+  FW  +   S    I K LL+ R+V
Sbjct: 901  LWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREV 939



 Score = 62.4 bits (150), Expect = 2e-06
 Identities = 41/137 (29%), Positives = 60/137 (43%), Gaps = 9/137 (6%)
 Frame = +3

Query: 3324 RPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKM---CALCQQHEES 3494
            R    +VPW  VIW S   PK+ F  WL AHG+L T D +    + +   C  CQ   E+
Sbjct: 1048 RSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLET 1107

Query: 3495 NGHIFFECAIAKSLLDRVVRWMDI-----KFDITS-MAKLLDVFQRHYKGKGRQVKARYL 3656
              H+FF C+         V W+D+     K   TS    +++        +      RY+
Sbjct: 1108 RDHLFFTCSFTS------VIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYV 1161

Query: 3657 AVSSMIYLIWEARNRSR 3707
               + IY++W  RN  R
Sbjct: 1162 -FQATIYIVWRERNGRR 1177


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  525 bits (1351), Expect = e-146
 Identities = 313/830 (37%), Positives = 450/830 (54%), Gaps = 22/830 (2%)
 Frame = +3

Query: 801  TFVYGFHSIMARRPLWNSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVS-NYEL 968
            +FVY     + R+ LWN +     +   I +PW ++GD+N +L   +       + +   
Sbjct: 4    SFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRPT 63

Query: 969  HDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPP 1136
                  +    L DL   G   TW N      V+ KLDR +VN +W      S   F  P
Sbjct: 64   RIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLFGEP 123

Query: 1137 GCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSW-DEPISGNAQFALKAK 1313
               SDHS   +S ++   R  +PF+F N     E F  L+   W    ++G+A + +  K
Sbjct: 124  D-FSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVK 182

Query: 1314 LTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPN---DYEVIRKRATLL 1484
            L  LK  +R+  + ++  I ++   A   L  AQ  LL+   P N   + E  RK   +L
Sbjct: 183  LKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEAETQRKWR-IL 241

Query: 1485 AEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEE 1664
            AEAE  F+ QR++  +L+  D  + +FH +       N I  L+   GD       +   
Sbjct: 242  AEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENH 301

Query: 1665 FVGYFQDLLGKTGQCTPL----DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADK 1832
             V YFQ  LG   Q  PL    D + +     S AQ+  L  P ++E+IKNA F +  +K
Sbjct: 302  CVEYFQSNLGSE-QGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNK 360

Query: 1833 APGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDY 2012
            A GPDGF+ +F+ A W  IG +   A+ EFF  GKLLKQWN T++ LIPK  ++  ++D+
Sbjct: 361  ASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDF 420

Query: 2013 RPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKR 2192
            RPISC    YKVISK+L+ RL + + + +  +Q AF+  R   EN+ LA EL+  Y +K 
Sbjct: 421  RPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKN 480

Query: 2193 ISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGF 2372
            I+P  M+KVDL+KAFD+V W F++SAL     P++F  WI EC+ST S+SV LNG   G 
Sbjct: 481  IAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSAGH 540

Query: 2373 FSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLL 2552
            F   +GLRQGDP+SP LF L +EVFS  ++         YHP+ +++E++HL +ADD+++
Sbjct: 541  FWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMI 600

Query: 2553 FARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPF 2732
            F     SSL+ I+  L DF   +GL  N  K+ LY AG+  Q   + +   GF+ G  P 
Sbjct: 601  FFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGL-SQSESDSMASYGFKLGSLPV 659

Query: 2733 RYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSIL 2912
            RYLG+PL + +L IA Y  L+E IT +  SW    LS AGR++L+ SV+ G+  FW+S  
Sbjct: 660  RYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSF 719

Query: 2913 PIPSGVIDKIYGISRSFVWSTK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLLCR 3077
             +P G I KI  +   F+WS++        ++WS +CLPK EGG GLR     N TL  R
Sbjct: 720  ILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLR 779

Query: 3078 TLWNIQSRRDSLWVKWIHQHYL-RSEDFWHWRGKPSDSPLIKCLLQIRDV 3224
             +W + S   SLWV W  QH L +S  FW+   KP DS   KCLL++R V
Sbjct: 780  MIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVV 829


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  514 bits (1323), Expect = e-142
 Identities = 347/1114 (31%), Positives = 529/1114 (47%), Gaps = 45/1114 (4%)
 Frame = +3

Query: 573  QVMALIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVSLDVLMATSQY 752
            Q   ++ET++ +  + +I  + F    M +N+ ++  GRI ++W    V L V+  +SQ 
Sbjct: 332  QFGCVLETRVIESKVPVIFAKVFKDWQMVSNYEFNRLGRIWVVWSSS-VQLQVIFKSSQM 390

Query: 753  MHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIA---EPWIIMGDYNSLLT 923
            + C VR    D  F+ +F+Y  + +  R+ LW  L  L  ++A   +PW++ GD+N  L 
Sbjct: 391  IVCLVRVEHYDVEFICSFIYASNFVEERKKLWQDLHNLQNSVAFRNKPWLLFGDFNETLK 450

Query: 924  VQDKEGG--QPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVN 1085
            +++       P+    + D +  V  C L D+R+ G   TW N    G +  KLDR ++N
Sbjct: 451  MEEHSSYAVSPMVTPGMRDFQIVVRYCSLEDMRTHGPLFTWGNKRNEGLICKKLDRVLLN 510

Query: 1086 SQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASS 1265
             ++  A   SY      GC SDH  G     +  ++   PFKF N+   H  F   V   
Sbjct: 511  PEYNSAYPHSYCIMDSGGC-SDHLRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDF 569

Query: 1266 WDEPI----SGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSG 1433
            W        S +  F    KL  LK  L++L +    ++S+   RA    E+  R     
Sbjct: 570  WKNTTELFPSTSTLFRFSKKLKELKPILKDLSR---NNLSDLTRRATYAYEELCRCQTKS 626

Query: 1434 AGPPNDYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKL 1613
                N ++++          E L +++  K  +L N                   AI ++
Sbjct: 627  LTTLNPHDIVD---------ESLAFERWEKERHLLN-------------------AIHEV 658

Query: 1614 TKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLDSNTISGNHL--------SQAQREE 1769
                G    + D++  E V +F DLL       P D   IS + L        S  ++  
Sbjct: 659  MDPQGTRPPNQDDIKIEAVRFFSDLLSSQ----PSDFTGISVDELKGILQYRYSLHEQNL 714

Query: 1770 LVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQ 1949
            LV  IT  E+    F I  +K+PGPDG+  +F+   W  IG +   A+K FF +G L K 
Sbjct: 715  LVAEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKG 774

Query: 1950 WNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKD 2129
             N T +ALIPK  ++  + DYRPISCC V YK ISK+L+ RL  ++   + P Q AF+ D
Sbjct: 775  LNSTILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISD 834

Query: 2130 RSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGW 2309
            R + EN+ LA EL++ Y +  +SPRC +K+DL KAFD+V W FL++ L     P++F  W
Sbjct: 835  RLLMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHW 894

Query: 2310 IHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFH 2489
            I+ CIST S+SV +N           GLRQG  LSP LF +C+ V S  + + +    F 
Sbjct: 895  INLCISTASFSVQVN-----------GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFG 943

Query: 2490 YHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGM 2669
            YHPRC  + LTHL +ADD+++F+     SL  ++    DF   +GL  +  KS L++A +
Sbjct: 944  YHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASI 1003

Query: 2670 EEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHA 2849
              +    +L    F  G  P RYLG+PL T+R+ +A+   LLE I  ++ SW    LS+A
Sbjct: 1004 SSETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYA 1063

Query: 2850 GRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPPISWSTICLP 3014
            GR++L+ SV+  +  FW+S   +P   I +I  IS +F+WS          ++W  +C P
Sbjct: 1064 GRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKP 1123

Query: 3015 KEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLR--SEDFWHWRGKPSDS 3188
            K EGG GLR L   N     + +W + S + SLWV WI  + +R  +E     R +    
Sbjct: 1124 KSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRD 1183

Query: 3189 PLIKCLLQIRDVLLQR----------TTSIGA---AKMKLTE-WFAEGENGTALAYDFFR 3326
             ++  + +  + LL R            SIG    AK    E W    E G         
Sbjct: 1184 DILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIREQGLV------- 1236

Query: 3327 PKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTAD---SLSYEPSKMCALCQQHEESN 3497
                   W   IW S   PK  F  WL AH +L T D   S +   S +C LC    ES 
Sbjct: 1237 -----KQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESR 1291

Query: 3498 GHIFFECAIAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARYLAVSSMIY 3677
             H+FF C  +  + DR+ R + +    T+   LL +         ++   RY+   + I+
Sbjct: 1292 DHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDFSGTKRFLLRYV-FQATIH 1350

Query: 3678 LIWEARNRSRFEGIVPCVERMFHKVQIHVWRFVD 3779
             +W  RN+ R  G +P           H+ +F+D
Sbjct: 1351 TLWRERNKRR-HGDLPIPSD-------HIIKFID 1376


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  503 bits (1295), Expect = e-139
 Identities = 294/797 (36%), Positives = 436/797 (54%), Gaps = 22/797 (2%)
 Frame = +3

Query: 894  IMGDYNSLLTVQDKEGGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VS 1058
            ++GD+N +L  Q+      ++ +  + D    +    L DL   G   TW N S    ++
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 1059 SKLDRAMVNSQWLIADYESYVEFLPPGCL--SDH-SCGIVSTLAREKRCNRPFKFYNMWT 1229
             KLDR + N  W      S+  F   G L  SDH SCG+V   A      RPFKF+N   
Sbjct: 61   KKLDRILANDSWCNLYPSSHGLF---GNLDFSDHVSCGVVLE-ANGISAKRPFKFFNFLL 116

Query: 1230 LHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLE 1406
             +E F ++V  +W    + G++ + +  KL  +K  +++  +L++  I  +   A  +L 
Sbjct: 117  KNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLI 176

Query: 1407 DAQRSLLSGAGPPN---DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLV 1577
              Q   L+     N   + E  RK   LL+ AE+ F+ QR++  +    D  T +FH +V
Sbjct: 177  TCQNLTLANPSVSNAALELEAQRKWV-LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235

Query: 1578 KRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL---DSNTISGNHL 1748
                  N I  L   +G    S   + +  V Y++ LLG       +   D N +     
Sbjct: 236  DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC 295

Query: 1749 SQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFH 1928
            SQ Q  EL    T++EIK A   +  +K  GPDG++ +F+   W  IG + +AA+ EFF 
Sbjct: 296  SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 1929 HGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPA 2108
             G+LLKQWN T++ LIPK+ ++  ++++RPISC    YKVISK+L++RL  ++S+++  +
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415

Query: 2109 QVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGF 2288
            Q AF+  RS+ EN+ LA E++  Y R  ISPR M+KVDLKKAFD+V W F+ +AL     
Sbjct: 416  QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475

Query: 2289 PQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA 2468
            P+++  WIH+CI+T S+++S+NG   GFF   +GLRQGDPLSP LF L +EVFS+ +   
Sbjct: 476  PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535

Query: 2469 SRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKS 2648
                  HYHP+  ++ ++HL +ADD+++F     SS++ I   L DF D +GL+ N  KS
Sbjct: 536  YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595

Query: 2649 NLYVAGME--EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVIS 2822
             L+ AG++  E++        GF  G FP RYLG+PL   +LRIA+YG LLE ++ ++ S
Sbjct: 596  QLFQAGLDLSERITSAAY---GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 2823 WPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPP 2987
            W    LS AGR +LI SV+ G+  FW+S   +P G I KI  +   F+W+      K   
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 2988 ISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHW 3167
            +SW   CLPK EGG G R    WN TLL R +W +  R  SLW +W   H L    FW  
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQV 772

Query: 3168 RGKPSDSPLIKCLLQIR 3218
                +D    K LL +R
Sbjct: 773  NALQTDPWTWKMLLNLR 789


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  503 bits (1295), Expect = e-139
 Identities = 294/797 (36%), Positives = 436/797 (54%), Gaps = 22/797 (2%)
 Frame = +3

Query: 894  IMGDYNSLLTVQDKEGGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VS 1058
            ++GD+N +L  Q+      ++ +  + D    +    L DL   G   TW N S    ++
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 1059 SKLDRAMVNSQWLIADYESYVEFLPPGCL--SDH-SCGIVSTLAREKRCNRPFKFYNMWT 1229
             KLDR + N  W      S+  F   G L  SDH SCG+V   A      RPFKF+N   
Sbjct: 61   KKLDRILANDSWCNLYPSSHGLF---GNLDFSDHVSCGVVLE-ANGISAKRPFKFFNFLL 116

Query: 1230 LHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLE 1406
             +E F ++V  +W    + G++ + +  KL  +K  +++  +L++  I  +   A  +L 
Sbjct: 117  KNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLI 176

Query: 1407 DAQRSLLSGAGPPN---DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLV 1577
              Q   L+     N   + E  RK   LL+ AE+ F+ QR++  +    D  T +FH +V
Sbjct: 177  TCQNLTLANPSVSNAALELEAQRKWV-LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMV 235

Query: 1578 KRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL---DSNTISGNHL 1748
                  N I  L   +G    S   + +  V Y++ LLG       +   D N +     
Sbjct: 236  DSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC 295

Query: 1749 SQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFH 1928
            SQ Q  EL    T++EIK A   +  +K  GPDG++ +F+   W  IG + +AA+ EFF 
Sbjct: 296  SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 1929 HGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPA 2108
             G+LLKQWN T++ LIPK+ ++  ++++RPISC    YKVISK+L++RL  ++S+++  +
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415

Query: 2109 QVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGF 2288
            Q AF+  RS+ EN+ LA E++  Y R  ISPR M+KVDLKKAFD+V W F+ +AL     
Sbjct: 416  QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475

Query: 2289 PQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA 2468
            P+++  WIH+CI+T S+++S+NG   GFF   +GLRQGDPLSP LF L +EVFS+ +   
Sbjct: 476  PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535

Query: 2469 SRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKS 2648
                  HYHP+  ++ ++HL +ADD+++F     SS++ I   L DF D +GL+ N  KS
Sbjct: 536  YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595

Query: 2649 NLYVAGME--EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVIS 2822
             L+ AG++  E++        GF  G FP RYLG+PL   +LRIA+YG LLE ++ ++ S
Sbjct: 596  QLFQAGLDLSERITSAAY---GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 2823 WPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPP 2987
            W    LS AGR +LI SV+ G+  FW+S   +P G I KI  +   F+W+      K   
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 2988 ISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHW 3167
            +SW   CLPK EGG G R    WN TLL R +W +  R  SLW +W   H L    FW  
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQV 772

Query: 3168 RGKPSDSPLIKCLLQIR 3218
                +D    K LL +R
Sbjct: 773  NALQTDPWTWKMLLNLR 789


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score =  498 bits (1281), Expect = e-137
 Identities = 293/915 (32%), Positives = 472/915 (51%), Gaps = 24/915 (2%)
 Frame = +3

Query: 486  MKIASWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANN 665
            MK+  WNIRG +   +   V+  +    + V   +ET +  +  + ++     G  M +N
Sbjct: 1    MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENANSVLASTLPGWRMDSN 60

Query: 666  FNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPL 845
            +  S  GRI ++WD   +S+ V   T Q M C ++     + F   FVYG +S + RR L
Sbjct: 61   YCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSL 119

Query: 846  WNSLTLLGENI---AEPWIIMGDYNSLLTVQDKEG-GQPVSNYE-LHDLERFVHTCGLVD 1010
            W  + +L         PW+++GD+N +    +     Q + N   + DL+  +    L D
Sbjct: 120  WEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSD 179

Query: 1011 LRSIGCRLTWTN----GSVSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTL 1178
            L S G   TW+N      +  KLDRA+ N +W      +   F PPG  SDH+  I+   
Sbjct: 180  LPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILID 238

Query: 1179 AREKRCNRPFKFYNMWTLHEGFQDLVASSWD-EPISGNAQFALKAKLTRLKGKLRELDKL 1355
             +     + FK+++  + H  +   ++++W+   + G+  F+L+  L   K   R L++L
Sbjct: 239  NQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTLNRL 298

Query: 1356 HFQHISEQALRAKSVLEDAQRSLLSGAGPPNDY-----EVIRKRATLLAEAEKLFYQQRA 1520
             F +I ++  ++ + LED Q  LL+    P+D       V RK+    A A + F++Q++
Sbjct: 299  RFSNIQQRTAQSLTRLEDIQVELLTS---PSDTLFRREHVARKQWIFFAAALESFFRQKS 355

Query: 1521 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKT 1700
            +  +L   D  T+FFH  V  +   N I  L    G    ++D++    + Y+  LLG  
Sbjct: 356  RIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIP 415

Query: 1701 GQ-CTPLDSNTISGN---HLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFY 1868
             +  TP     I G            +L    + EEI   LF +  +KAPGPDGF  +F+
Sbjct: 416  SENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFF 475

Query: 1869 TAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKV 2048
              AW  +    +AA++EFF  G L + +N T+I LIPK   +  +T +RP++CCT  YKV
Sbjct: 476  IEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKV 535

Query: 2049 ISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLK 2228
            I++++S RL   +   +   QV F+K R + EN+ LA EL+  +     + R  ++VD+ 
Sbjct: 536  ITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCLQVDIS 595

Query: 2229 KAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDP 2408
            KA+D V+W FL++ L     P  F  WI  CIS+ SYS++ NG L GFF G++G+RQGDP
Sbjct: 596  KAYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKGIRQGDP 655

Query: 2409 LSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLI 2588
            +S  LF L ++V S+S+   +    F+ HP C    +THL++ADD+L+F+    SS+  I
Sbjct: 656  MSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAASSIAGI 715

Query: 2589 MGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERL 2768
            +  L DF   +GL  N  K+ L + G        L    G   G  P RYLG+PL ++++
Sbjct: 716  LTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVPLMSQKM 775

Query: 2769 RIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYG 2948
            R  +Y  L++ I  +  SW    LS AGR++L++SV+     FW S+   P+  + K+  
Sbjct: 776  RRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASVFIFPNQCLQKLEQ 835

Query: 2949 ISRSFVW-----STKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSL 3113
            +  +F+W     S +   ISW+ +C PKE GG GL+ L +WN  L  + +W + +   SL
Sbjct: 836  MCNAFLWSGAPNSARGAKISWNIVCSPKEAGGLGLKRLSSWNRILALKLIWLLFTSAGSL 895

Query: 3114 WVKWIHQHYLRSEDF 3158
            WV W+  H +   +F
Sbjct: 896  WVSWVRLHLIGRRNF 910


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  494 bits (1272), Expect = e-136
 Identities = 304/904 (33%), Positives = 466/904 (51%), Gaps = 28/904 (3%)
 Frame = +3

Query: 489  KIASWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNF 668
            K+  WN+RGF+ +    G +      +     LIET +        +     G     N+
Sbjct: 4    KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63

Query: 669  NYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLW 848
             +S  G+I +LWD   V + V+  + Q + C +    +   F+ + VY  +    R+ LW
Sbjct: 64   EFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122

Query: 849  NSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTC----GLV 1007
            N L  L  +   +   WI++GD+N +L  +         N  +    R   +C     L 
Sbjct: 123  NELVQLALSPVVVGRSWIVLGDFNQILNPES------AINANIGRKIRAFRSCLLDSDLY 176

Query: 1008 DLRSIGCRLTWTNGSVS----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVST 1175
            DL   G   TW N   S     K+DR +VN  W      +Y  F  P      SC +V  
Sbjct: 177  DLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLD 236

Query: 1176 LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDK 1352
             A  K   RPF+F+N +  +  F  L+  +W    +SG+A + +  KL  LK  +    +
Sbjct: 237  PAVLK-AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSR 295

Query: 1353 LHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRAT----LLAEAEKLFYQQRA 1520
             ++  I ++   A +++   QR  L+   P   +  +   AT    +LA+AE+ F+ Q++
Sbjct: 296  ENYSDIEKRVSEAHAIVLHRQRITLTN--PSVVHATLELEATRKWQILAKAEESFFCQKS 353

Query: 1521 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSL----DEVAEEFVGYFQDL 1688
               +L   D  T +FH +       N I  L    G+   +     + + E    +F+ L
Sbjct: 354  SISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESL 413

Query: 1689 L-GKTGQ--CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNA 1859
            L G  G+      D N +     S  Q  +L    ++ +I+ A F +  +KA GPDG+++
Sbjct: 414  LCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSS 473

Query: 1860 KFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVF 2039
            +F+   W  +G +   AV+EFF  G+LLKQWN T++ LIPK  +S  +TD+RPISC    
Sbjct: 474  EFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTL 533

Query: 2040 YKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKV 2219
            YKVI+K+L++RL  +++ ++ P+Q AF+  R + EN+ LA E++  Y  K IS R M+KV
Sbjct: 534  YKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKV 593

Query: 2220 DLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQ 2399
            DL+KAFD+V W F++SA      P++F  WI++CIST  +SV +NG   GFF   +GLRQ
Sbjct: 594  DLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQ 653

Query: 2400 GDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSL 2579
            GDPLSP LF L +EVFS  +K        HYHP+ A++ ++HL +ADD+++F     SSL
Sbjct: 654  GDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSSSL 713

Query: 2580 NLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLAT 2759
            + I   L DF   +GL  N  K+NLY+AG +E     + H  GF     P RYLG+PL +
Sbjct: 714  HGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIRYLGLPLMS 772

Query: 2760 ERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDK 2939
             +L+I+ Y      + K+  SW   +LS AGR++LI SV+ G+  FW+S   +  G + K
Sbjct: 773  RKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKK 827

Query: 2940 IYGISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRR 3104
            I  +   F+WS     +K   I+WS +CLPK EGG GLR    WN T   R +W + +  
Sbjct: 828  IESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVGLRRFTPWNKTFYLRFIWPLFADN 887

Query: 3105 DSLW 3116
            D LW
Sbjct: 888  DVLW 891


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
            [Arabidopsis thaliana]
          Length = 893

 Score =  489 bits (1258), Expect = e-135
 Identities = 302/904 (33%), Positives = 464/904 (51%), Gaps = 28/904 (3%)
 Frame = +3

Query: 489  KIASWNIRGFHKALKHGGVQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNF 668
            K+  WN+RGF+ +    G +      +     LIET +        +     G     N+
Sbjct: 4    KLFCWNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEKKFISNLLPGWSFVENY 63

Query: 669  NYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLW 848
             +S  G+I +LWD   V + V+  + Q + C +    +   F+ + VY  +    R+ LW
Sbjct: 64   EFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRKELW 122

Query: 849  NSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTC----GLV 1007
            N L  L  +   +   WI++GD+N +L  +         N  +    R   +C     L 
Sbjct: 123  NELVQLALSPVVVGRSWIVLGDFNQILNPES------AINANIGRKIRAFRSCLLDSDLY 176

Query: 1008 DLRSIGCRLTWTNGSVS----SKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVST 1175
            DL   G   TW N   S     K+DR +VN  W      +Y  F  P      SC +V  
Sbjct: 177  DLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLD 236

Query: 1176 LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE-PISGNAQFALKAKLTRLKGKLRELDK 1352
             A  K   RPF+F+N +  +  F  L+  +W    +SG+A + +  KL  LK  +    +
Sbjct: 237  PAVLK-AKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSR 295

Query: 1353 LHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRAT----LLAEAEKLFYQQRA 1520
             ++  I ++   A +++   QR  L+   P   +  +   AT    +LA+AE+ F+ Q++
Sbjct: 296  ENYSDIEKRVSEAHAIVLHRQRITLTN--PSVVHATLELEATRKWQILAKAEESFFCQKS 353

Query: 1521 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSL----DEVAEEFVGYFQDL 1688
               +L   D  T +FH +       N I  L    G+   +     + + E    +F+ L
Sbjct: 354  SISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESL 413

Query: 1689 L-GKTGQ--CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNA 1859
            L G  G+      D N +     S  Q  +L    ++ +I+ A F +  +KA GPDG+++
Sbjct: 414  LCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSS 473

Query: 1860 KFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVF 2039
            +F+   W  +G +   AV+EFF  G+LLKQWN T++ LIPK  +S  +TD+RPISC    
Sbjct: 474  EFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLNTL 533

Query: 2040 YKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKV 2219
            YKVI+K+L++RL  +++ ++ P+Q AF+  R + EN+ LA E++  Y  K IS R M+KV
Sbjct: 534  YKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGMLKV 593

Query: 2220 DLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQ 2399
            DL+KAFD+V W F++SA      P++F  WI++CIST  +SV +NG   GFF   +GLRQ
Sbjct: 594  DLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGLRQ 653

Query: 2400 GDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSL 2579
            GDPLSP LF L +EVFS  +K         YHP+ A++ ++HL +ADD+++F     SSL
Sbjct: 654  GDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSSSL 713

Query: 2580 NLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLAT 2759
            + I   L DF   +GL  N  K+NLY+AG +E     + H  GF     P RYLG+PL +
Sbjct: 714  HGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIRYLGLPLMS 772

Query: 2760 ERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDK 2939
             +L+I+ Y      + K+  SW   +LS AGR++LI SV+ G+  FW+S   +  G + K
Sbjct: 773  RKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCVKK 827

Query: 2940 IYGISRSFVWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRR 3104
            I  +   F+WS     +K   I+WS +CLPK EGG  LR    WN T   R +W + +  
Sbjct: 828  IESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVALRRFTPWNKTFYLRFIWPLFADN 887

Query: 3105 DSLW 3116
            D LW
Sbjct: 888  DVLW 891


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  486 bits (1251), Expect = e-134
 Identities = 334/1070 (31%), Positives = 514/1070 (48%), Gaps = 65/1070 (6%)
 Frame = +3

Query: 768  RCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIA---EPWIIMGDYNSLLTVQDKE 938
            R    D   + + VY  +  + R+ LW  L LL  +++   +PWI++GD+N +L   +  
Sbjct: 45   RTEFLDDSVVVSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHS 104

Query: 939  GGQPVS-NYELHDLERFVHTCGLVDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIA 1103
                ++ N  +      +    L DL   G   TW N S    V+ KLDR +VN  W   
Sbjct: 105  QATSLNVNRRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSR 164

Query: 1104 DYESYVEFLPPGCLSDH-SCGIVST--LAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE 1274
               +Y  F  P   SDH SCG++    + REKR   PF+FYN    +  F  LV   W  
Sbjct: 165  FPSAYAVFGEPD-FSDHASCGVIINPLMHREKR---PFRFYNFLLQNPDFISLVGELWYS 220

Query: 1275 -PISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPN- 1448
              + G++ F +  KL  LK  +R     +F ++ ++   A +++   Q   LS    PN 
Sbjct: 221  INVVGSSMFKMSKKLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNA 280

Query: 1449 --DYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQ 1622
              + E  RK   +L +AE+ F+ QR++  ++   D  T +FH +       N I  +   
Sbjct: 281  ALEMEAQRKWL-ILVKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDD 339

Query: 1623 SGDETTSLDEVAEEFVGYFQDLLGKTGQCTPL-----DSNTISGNHLSQAQREELVVPIT 1787
            +G +  +   + E  + YF +LLG  G+  P      D + +     S  Q++EL +  +
Sbjct: 340  NGVKIDTQLGIKEHCIEYFSNLLG--GEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFS 397

Query: 1788 NEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSI 1967
             ++IK+A F   ++K  GPDGF  +F+   W  IG +   AV EFF    LLKQWN T++
Sbjct: 398  RQDIKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTL 457

Query: 1968 ALIPKSDHSPCVTDYRPISCCT----VFYKVISKVLSARLANVMSSLLDPAQVAFVKDRS 2135
             LIPK  ++  + D+RPISC        YKVI+++L+ RL  ++S ++ P Q AF+  R 
Sbjct: 458  VLIPKITNASKMNDFRPISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRF 517

Query: 2136 IGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIH 2315
            + EN+ LA EL++ Y R+ I PR M+KVDL+KAFD++ W F++SAL   G P +F  WI 
Sbjct: 518  LAENVLLATELVQGYNRQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWIT 577

Query: 2316 ECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYH 2495
            +CIST ++SV +NG   GFF   RGLRQG+PLSP LF L +EVFS  +    +    HYH
Sbjct: 578  QCISTPTFSVCVNGNTGGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYH 637

Query: 2496 PRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEE 2675
            P+ + + ++HL +ADD+++F     SSL+ I   L DF   +GL  N  K++LY+AG++ 
Sbjct: 638  PKTSPLSISHLMFADDIMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAGLDR 697

Query: 2676 QLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGR 2855
                                 +       +LRIA YG LLE + K+  SW    LS AGR
Sbjct: 698  ---------------------IEASTIARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGR 736

Query: 2856 IELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS-----TKHPPISWSTICLPKE 3020
            ++LI SV+ G+  FW+S   +P G + +I  +   F+WS      K   ++WS +CLPKE
Sbjct: 737  VQLIASVISGIINFWISTFILPKGCVKRIEALCARFLWSGNIDVKKGAKVAWSEVCLPKE 796

Query: 3021 EGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWV-KWIHQHYL------------------ 3143
            EGG GLR       T+L  TLW+   ++ S W   W     L                  
Sbjct: 797  EGGVGLR-----RFTVLNTTLWD--GKKISFWFDNWSPLGPLFKLFGSSGPRALCIPIQA 849

Query: 3144 ----RSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIGAAKMKLTEWFAEGENGTALA 3311
                   D       P     +  L+ +  + L    S     + + + F       A  
Sbjct: 850  KVADACSDVGWLISPPRTDQALALLIHLTTIALPCFDSSPDTFVWIVDDFTCHGFSAART 909

Query: 3312 YDFFRPKKPKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLS---YEPSKMCALCQQ 3482
            ++  RPKKP   W   +W    +PKH F +W+    +L T   L+      +  C LC  
Sbjct: 910  WEAMRPKKPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTTDCCLCSS 969

Query: 3483 HEESNGHIFFECAIAKSLLDRVVRWMDIKFDIT-------SMAKLLDVFQRHYKGKGRQV 3641
              ES  H+   C  +       V W  + F +T       S A+LL  + R    K   +
Sbjct: 970  RPESRDHLLLYCVFS------AVIWKLVFFRLTPSQAIFNSWAELLS-WTRINSSKAPSL 1022

Query: 3642 KARYLAVSSMIYLIWEARNRSRFEGIVPCVERMFHKVQ---IHVWRFVDI 3782
              R +A  + ++ +W+ RN      I      +FH +     +++R++ I
Sbjct: 1023 -LRKIAAQASVFHLWKQRNNVLHNSIFISPATVFHFIDRELENLYRYIQI 1071


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score =  453 bits (1165), Expect = e-124
 Identities = 328/1124 (29%), Positives = 521/1124 (46%), Gaps = 27/1124 (2%)
 Frame = +3

Query: 408  STTSYANRQPKWAT-ITGPSSPLYTWIMKIASWNIRGFHKALKHGGVQHLLKQKEIQVMA 584
            ST +      KW   I  P + L  ++   A  N R   +      V+  +    + V  
Sbjct: 22   STKAKKKELKKWHNRIRPPVTELQGFVDGRAGLNSRNRQRV-----VRSWIASNNLLVGC 76

Query: 585  LIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVSLDVLMATSQYMHCR 764
             +ET +  +  + ++     G  M +N+  S  GRI ++WD   +S+ V   T Q M C 
Sbjct: 77   FLETHVAQENANSVLASTLPGWRMDSNYCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCS 135

Query: 765  VRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENI---AEPWIIMGDYNSLLTVQDK 935
            ++     + F   FVYG +S + RR LW  + +L         PW+++GD+N +    + 
Sbjct: 136  IKIPSLLQSFAVAFVYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEH 195

Query: 936  EG-GQPVSNYE-LHDLERFVHTCGLVDLRSIGCRLTWTN----GSVSSKLDRAMVNSQWL 1097
                Q + N   + DL+  +    L DL S G   TW+N      +  KLDRA+ N +W 
Sbjct: 196  YSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWF 255

Query: 1098 IADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSWDE- 1274
                 +   F PPG  SDH+  I+    +     + FK+++  + H  +   ++++W+E 
Sbjct: 256  AVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEEN 314

Query: 1275 PISGNAQFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDY 1454
             + G+  F+L+  L   K   R L++L F +I ++  ++ + LED Q  LL+    P+D 
Sbjct: 315  TLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTS---PSDT 371

Query: 1455 -----EVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTK 1619
                  V RK+    A A + F++Q+++  +L   D  T+FFH  V  +   N I  L  
Sbjct: 372  LFRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRG 431

Query: 1620 QSGDETTSLDEVAEEFVGYFQDLLGKTGQ-CTPLDSNTISGN---HLSQAQREELVVPIT 1787
              G    ++D++    + Y+  LLG   +  TP     I G            +L    +
Sbjct: 432  DDGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPS 491

Query: 1788 NEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSI 1967
             EEI   LF +  +KAPGPDGF  +F+  AW  +    +AA++EFF  G L + +N T+I
Sbjct: 492  EEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAI 551

Query: 1968 ALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGEN 2147
             LIPK   +  +T +RP++CCT  YKVI++++S RL   +   +   QV F+K R + EN
Sbjct: 552  TLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCEN 611

Query: 2148 IHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECIS 2327
            + LA EL+  +     + R  ++VD+ KA+D V+W FL++ L     P  F  WI  CIS
Sbjct: 612  VLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCIS 671

Query: 2328 TVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCA 2507
            + SYS++ NG L GFF G++G+RQGDP+S  LF L ++V S+S+   +    F+ HP C 
Sbjct: 672  SASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCL 731

Query: 2508 EVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRG 2687
               +THL++ADD+L+F+    SS+  I+  L DF   +GL  N  K+ L + G       
Sbjct: 732  APIITHLSFADDVLVFSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARNR 791

Query: 2688 ELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELI 2867
             L    G   G  P RYLG+PL ++++R  +Y  L++ I  +  SW    LS AGR++L+
Sbjct: 792  SLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQLL 851

Query: 2868 RSVLQGVECFWLSILPIPSGVIDKIYGISRSFVWS---TKHPPISWSTICLPKEEGGYGL 3038
              + +  +   L     P  + +   G++ SF W    T H P+   T        G  L
Sbjct: 852  NWIWR--KLCKLRPFARPFIICEVGSGVTASF-WHDNWTDHGPLLHLTGPAGPLLAGLPL 908

Query: 3039 RDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIR 3218
              +       L    W I S R    V  + Q  L           PS + LI C     
Sbjct: 909  NSVV---RDALRDDTWRISSSRSRNPVITLLQRVL-----------PSAASLIDC--PHD 952

Query: 3219 DVLLQRTTSIGAAKMKLTEWFAEGENGTALAYDFFRPKKPKVPWFSVIWKSILMPKHRFT 3398
            D  L           K+          TA  + + +P    V W   +W    +PK  F 
Sbjct: 953  DTYL----------WKIGHHAPSNRFSTADTWSYLQPSSTSVLWHKAVWFKDHVPKQAFI 1002

Query: 3399 VWLLAHGKLRTADSL---SYEPSKMCALCQQHEESNGHIFFECAIAKSLLDRVVRWMDIK 3569
             W++AH +L T D L    +     C LC   +ES  H+FF C  +  +    +R +++ 
Sbjct: 1003 CWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESREHLFFRCQFSSEIWSFFMRALNLN 1062

Query: 3570 FDITSMAKLLDVFQRHYKGKGRQVK-ARYLAVSSMIYLIWEARN 3698
                 M  LL         + R +     L   + +Y IW  RN
Sbjct: 1063 PPPQFMHCLLWTLT---ASRDRNITLITKLLFHASVYFIWRERN 1103


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  453 bits (1165), Expect = e-124
 Identities = 274/813 (33%), Positives = 426/813 (52%), Gaps = 34/813 (4%)
 Frame = +3

Query: 888  WIIMGDYNSLLTVQDKEGGQPVSNYELHDLER-----FVHTCGLVDLRSIGCRLTWTNGS 1052
            W++  D N+ LT   K       +  L   E      F++    V+ R    R+ W +  
Sbjct: 431  WVVWRD-NARLTPVFKSSQMITCSILLEGKEEEFFCSFIYASNFVEER----RILWED-- 483

Query: 1053 VSSKLDRAMV-NSQWLIA-DYESYVE-----------FLPPGCLSDHSCGIVSTLAREKR 1193
            + S  D  ++ N  W++  D+   +E           + PPG       G +   A    
Sbjct: 484  IRSHHDSPLIRNKPWILCGDFNEILEGGEHSNYDNSPYTPPGMRDFQEIGRLMLEAAATG 543

Query: 1194 CNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLKGKLRELDKLHF 1361
              +PFKF N+ T    F  +V S W       +S +A +    KL  LK  LREL K   
Sbjct: 544  GRKPFKFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKL 603

Query: 1362 QHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRA--TLLAEAEKLFYQQRAKSVYL 1535
              + ++   A  +L + Q + L+        E ++     T L+E E+ F +Q++K  ++
Sbjct: 604  GDLPKRTREAHILLCEKQATTLANPSQETIAEELKAYTDWTHLSELEEGFLKQKSKLHWM 663

Query: 1536 KNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQ--- 1706
               D    +FH   +    RN+I ++   + +   + +E+  E   +F + L +      
Sbjct: 664  NVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFH 723

Query: 1707 -CTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWD 1883
              +  D   +     S   +  L   +T EEI+  LF +  +K+PGPDG+ ++F+ A W 
Sbjct: 724  GISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWS 783

Query: 1884 KIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVL 2063
              G DFIAA++ FF  G L K  N T +ALIPK D +  + DYRPISCC V YKVISK+L
Sbjct: 784  LTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKIL 843

Query: 2064 SARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDT 2243
            + RL  ++ S +   Q AFVK+R + EN+ LA EL++ Y ++ ++PRC +K+D+ KAFD+
Sbjct: 844  ANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDS 903

Query: 2244 VDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLL 2423
            V W FL++ L    FP+ F  WI  CIST ++SV +NG L GFF   RGLRQG  LSP L
Sbjct: 904  VQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYL 963

Query: 2424 FGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLT 2603
            F +C+ V S  + EA+      YHP+C ++ LTHL +ADDL++F      S+  ++    
Sbjct: 964  FVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFK 1023

Query: 2604 DFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANY 2783
            +F  ++GL+ +  KS +Y+AG+    R + L    F  G  P RYLG+PL T+++  A+Y
Sbjct: 1024 EFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADY 1083

Query: 2784 GSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSF 2963
              L+EA+  K+ SW   +LS+AGR+ L+ SV+  +  FW+S   +P+G I +I  +  +F
Sbjct: 1084 SPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAF 1143

Query: 2964 VWS-----TKHPPISWSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWI 3128
            +WS      K   I+WS+IC PK+EGG G++ L   N     + +W + S + SLWV WI
Sbjct: 1144 LWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWI 1203

Query: 3129 HQHYLRSEDFWHWRGKPS-DSPLIKCLLQIRDV 3224
                +R   FW    + S  S + K LL+ R++
Sbjct: 1204 WTFIIRKGTFWSANERSSLGSWMWKKLLKYREL 1236


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  446 bits (1146), Expect = e-122
 Identities = 270/760 (35%), Positives = 407/760 (53%), Gaps = 30/760 (3%)
 Frame = +3

Query: 834  RRPLWNSLTLLGENI---AEPWIIMGDYNSLLTVQDKEGGQ--PVSNYELHDLERFVHTC 998
            R+ LWN L    ++    ++PWII GD+N +L +++    +  PV+   + D +  V+ C
Sbjct: 4    RKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHC 63

Query: 999  GLVDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGI 1166
             + DL   G   TW+N      ++ KLDR +VN  WL +   SY  F   GC     C I
Sbjct: 64   SITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRI 123

Query: 1167 VSTLAREK--RCNRPFKFYNMWTLHEGFQDLVASSWDEP----ISGNAQFALKAKLTRLK 1328
               +      +  RPFKF N+ T  E F   V S W+E     +S ++ F    KL  LK
Sbjct: 124  NLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLK 183

Query: 1329 GKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKRATL--LAEAEKL 1502
              LR L K    ++ +Q   A   L   Q   ++   P +  E     A    +A  E+ 
Sbjct: 184  PLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKWDHIAVLEEK 243

Query: 1503 FYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQ 1682
            F +QR+K  +L   DR  K FH  V     +N+I ++    G   +  +++  E   +F+
Sbjct: 244  FLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFR 303

Query: 1683 DLLGKTGQCTPLDSNTISGNHL--------SQAQREELVVPITNEEIKNALFDIGADKAP 1838
            + L    Q  P D   I+   L        S + +E L   ++ EEI   +F +  DK+P
Sbjct: 304  EFL----QLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSP 359

Query: 1839 GPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRP 2018
            GPDG+ A+FY  AW+ IG +FI A++ FF  G L K  N T +ALIPK   +  + DYRP
Sbjct: 360  GPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRP 419

Query: 2019 ISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRIS 2198
            ISCC V YKVISK+++ RL  V+   +   Q AFVKDR + EN+ LA E+++ Y +  +S
Sbjct: 420  ISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVS 479

Query: 2199 PRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFS 2378
             RC +K+D+ KAFD+V W FL++ L    FP +F  WI  CI+T S+SV +NG L G FS
Sbjct: 480  SRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFS 539

Query: 2379 GRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFA 2558
              R LRQG  LSP LF + ++V S+ + +A     F YHP+C  + LTHL++ADDL++ +
Sbjct: 540  SARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILS 599

Query: 2559 RADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRY 2738
               V S++ I+  L +F   +GL+ +  KS +Y+AG++  +  E++    F  G  P RY
Sbjct: 600  DGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRY 659

Query: 2739 LGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLSILPI 2918
            LG+PL ++RL  ++   L+E + KK+ +W    LS AGR+ LI S L  +  FW++   +
Sbjct: 660  LGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRL 719

Query: 2919 PSGVIDKIYGISRSFVW-----STKHPPISWSTICLPKEE 3023
            P   I +I  +  +F+W     S+    +SW  IC PK+E
Sbjct: 720  PRACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKKE 759


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  442 bits (1138), Expect = e-121
 Identities = 276/912 (30%), Positives = 432/912 (47%), Gaps = 39/912 (4%)
 Frame = +3

Query: 1113 SYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLVASSWDEPISGNA 1292
            S + FLPPG +SDH+  +V      +    PFKF+N     E F  +V++ W   + G+ 
Sbjct: 137  SVLSFLPPG-ISDHAAMVVKVGLPFRIRKAPFKFFNFLADREDFIPIVSAVWATNVWGSK 195

Query: 1293 QFALKAKLTRLKGKLRELDKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVIRKR 1472
            QF +  KL  +K + + L+     ++ E+ L+ KS ++                      
Sbjct: 196  QFQVWRKLKLVKNQFKLLNC----NVVEKLLKKKSRVQ---------------------- 229

Query: 1473 ATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDE 1652
                               +LK  D+ + FF   + ++  RN I  + +  G +      
Sbjct: 230  -------------------WLKKGDKNSTFFFKTMTKHRNRNRIATINRSDGPDLAK--S 268

Query: 1653 VAEEFVGYFQDLLGKTGQCTPLDSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADK 1832
            +  EF                                       T+++I+   F +  +K
Sbjct: 269  LCNEF---------------------------------------THDDIRAVFFSMNPNK 289

Query: 1833 APGPDGFNAKFYTAAWDKIGDDFIAA-VKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTD 2009
            +PGPDGFN  F+  AW  IGD+ +AA VKEFF +G LL + N T I L+PK  +   ++D
Sbjct: 290  SPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNSTIITLVPKVANPTTMSD 349

Query: 2010 YRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARK 2189
            +RPISCC  FYK+I+K+L+ RL   +  ++ P+Q  F+  R IG+NI LAQE++  Y + 
Sbjct: 350  FRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKA 409

Query: 2190 RISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYG 2369
               PRC   VD+ KA DTV+W F+++ L  +  P    GWI  CIS+  +SV +NG L G
Sbjct: 410  DGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAG 469

Query: 2370 FFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEASRLDG-FHYHPRCAEVELTHLAYADDL 2546
            FF+ RRGLRQGDPLSP LF + +EV S  ++        F YH RC ++ L+HL +ADDL
Sbjct: 470  FFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDL 529

Query: 2547 LLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVF 2726
            L+F   D +S+  +    ++F   + L+AN  +S +++AG++      +L +T F  G  
Sbjct: 530  LMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTC 589

Query: 2727 PFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAGRIELIRSVLQGVECFWLS 2906
            P RYLGIPL T +LR+ +   LL+ I  ++ SW    LS AGR++LI+SVL  ++ +W S
Sbjct: 590  PVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWAS 649

Query: 2907 ILPIPSGVIDKIYGISRSFVWSTK-----HPPISWSTICLPKEEGGYGLRDLQAWNMTLL 3071
             L +P  V+  I    R F+W+          ++WS ICLPK EGG G++DL  WN  L+
Sbjct: 650  HLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALM 709

Query: 3072 CRTLWNIQSRRDSLWVKWIHQHYLRSEDFWHWRGKPSDSPLIKCLLQIRDVLLQRTTSIG 3251
               +WN+ S   + W  W+  + L+   FW+       S   + LL+IR++      +I 
Sbjct: 710  ISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCSFFVNII 769

Query: 3252 AAKMKLTEWFAE----------------GENG--------------TALAYDFFRPKKPK 3341
                  + WF                  GE+G              T+ A++  RP +  
Sbjct: 770  GDGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFI 829

Query: 3342 VPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKMCALCQQHEESNGHIFFECA 3521
            VPW+ ++W         F                               E++ H+FF+CA
Sbjct: 830  VPWYRLVW---------FVA-----------------------------ETHNHLFFDCA 851

Query: 3522 IAKSLLDRVVRWMDIKFDITSMAKLLDVFQRHYKGKGRQVKARYLAVSSMIYLIWEARNR 3701
             +  +   V+   D+   +   +  +     ++KG    V    LA+ +++Y IW  RN 
Sbjct: 852  YSFGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNN 911

Query: 3702 SRF--EGIVPCV 3731
             RF  E + P V
Sbjct: 912  RRFRNESLPPAV 923


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  435 bits (1118), Expect = e-119
 Identities = 303/1088 (27%), Positives = 500/1088 (45%), Gaps = 90/1088 (8%)
 Frame = +3

Query: 543  VQHLLKQKEIQVMALIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDMHRVS 722
            ++ L     ++++A++E  +D    +   ++      + NN           +W  H V 
Sbjct: 869  IKKLQLMHRLKILAILEPMVDTSKAEYFRRKMGFEKVIVNNSQK--------IWLFHSVE 920

Query: 723  L--DVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIAEPWII 896
               +VL+   Q +H RV     D     TFVY   +   R PLWN L  L  ++  PWI+
Sbjct: 921  FICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLRNLAADMEGPWIV 980

Query: 897  MGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTNGSVSSKLDRA 1076
             GD+N +L  +++  G       + D    +  CGL+D    G   TWTN  +  +LDR 
Sbjct: 981  GGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTWTNNRMFQRLDRM 1040

Query: 1077 MVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGFQDLV 1256
            + N QW+     + ++ L     SDH   ++S     ++    F+F + W LH  F   V
Sbjct: 1041 VYNQQWINKFPITRIQHLNRDG-SDHCPLLLSCSNSSEKAPSSFRFLHAWALHHNFNASV 1099

Query: 1257 ASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI------SEQALRAKSVLEDAQR 1418
              +W+ PI+G+   A  +K  RLK  L+  +K  F  I      +E+ +    +L   ++
Sbjct: 1100 EGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEKRVEECEILHQQEQ 1159

Query: 1419 SLLSGAGPPNDYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRNNKRN 1598
            ++ S       Y  + K+ ++    E++F++Q++   ++   +R TKFFH  +++   R+
Sbjct: 1160 TIGSRIQLNKSYAQLNKQLSM----EEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRS 1215

Query: 1599 AIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLDSNTISGNHLSQAQREELVV 1778
             I K+ +Q G+     +++ +  + +F  LL           +++  + +S      L  
Sbjct: 1216 HIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSLCPSIISDTDNGFLCA 1275

Query: 1779 PITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHGKLLKQWNH 1958
              T +E+K A+F I  + A GPDGF++ FY   WD I  D   AVKEFFH   + +    
Sbjct: 1276 EPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTS 1335

Query: 1959 TSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQVAFVKDRSI 2138
            T++ LIPK+  +   +++RPIS CTV  K+I+K+L+ RLA ++ S++   Q  FV  R I
Sbjct: 1336 TTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLI 1395

Query: 2139 GENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQQFCGWIHE 2318
             +NI LAQEL+ K  +K       +K+D+ KA+D +DW FL   L   GF  Q+ G I +
Sbjct: 1396 SDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQK 1455

Query: 2319 CISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA-SRLDGFHYH 2495
            CIS   +S+ LNG   G+F   RGLRQGD +SP LF L  E  +R +     +    HY 
Sbjct: 1456 CISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLNALYDQYPSLHYS 1515

Query: 2496 PRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSNLYV-AGME 2672
              C+ + ++HLA+ADD+++FA    S+L  IM  L ++   +G R NP KS +     M 
Sbjct: 1516 SGCS-LSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTNMA 1574

Query: 2673 EQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWPKHTLSHAG 2852
               R  +L  TGF     P  YLG PL     ++  +  L+  I +++  W   TLS  G
Sbjct: 1575 SSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPGG 1634

Query: 2853 RIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW--STKHPPI---SWSTICLPK 3017
            RI L+RS L  +  + L +L  P  V+++I  +  +F+W  ST    I   SW  I LP 
Sbjct: 1635 RITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTASKRIHWASWGKIALPI 1694

Query: 3018 EEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSE--------------- 3152
             EGG  +R+++        +  W  ++  +SLW +++   Y   +               
Sbjct: 1695 AEGGLDIRNVEDVCEAFSMKLWWRFRT-TNSLWTQFMRAKYCGGQLPTDVQPKLHDSQTW 1753

Query: 3153 -----------------------DFWH--WRGKPSDSPLI-------KCLLQIRDVLLQR 3236
                                    FWH  W G   + PL+         + Q+ D  L  
Sbjct: 1754 KRMVTISSITEQNIRWRIGHGELFFWHDCWMG---EEPLVNRNQAFASSMAQVSDFFLNN 1810

Query: 3237 TTSIGAAKMKLTEWFAE----------------------GENGTALAYDFFRPKKPKVPW 3350
            + ++   K  L +   E                      G+  T  A+   R +K + P 
Sbjct: 1811 SWNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPV 1870

Query: 3351 FSVIWKSILMPKHRFTVWLLAHG------KLRTADSLSYEPSKMCALCQQHEESNGHIFF 3512
            F+ IW   +     F +W L H       K++T     ++ +  C  C+  EES  H+ +
Sbjct: 1871 FNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTK---GFQLASRCRCCKS-EESLMHVMW 1926

Query: 3513 ECAIAKSL 3536
            +  +A  +
Sbjct: 1927 KNPVANQV 1934


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  429 bits (1102), Expect = e-117
 Identities = 297/1086 (27%), Positives = 507/1086 (46%), Gaps = 91/1086 (8%)
 Frame = +3

Query: 543  VQHLLKQKE----IQVMALIETKLDDDALDIIMQRKFSGMGMANNFNYSNRGRILLLWDM 710
            +Q  LK+ +    I+++A++E  +D    +   ++    +G       S++     +W  
Sbjct: 902  IQRRLKKLQLMHRIKILAILEPMVDISKAEFFRRK----LGFEKVIVNSSQK----IWLF 953

Query: 711  HRVSL--DVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRPLWNSLTLLGENIAE 884
            H + L  D+++   Q +H R+     ++ F ATFVY   +   R  LW+ L  L  +  E
Sbjct: 954  HSLELHSDIILDHPQCLHVRLTSPWLEKPFFATFVYAKCTRSERTLLWDCLRRLAADNEE 1013

Query: 885  PWIIMGDYNSLLTVQDKEGGQPVSNYELHDLERFVHTCGLVDLRSIGCRLTWTNGSVSSK 1064
            PW++ GD+N +L  +++  G       + D    +  CGL+D    G   TWTN  +  +
Sbjct: 1014 PWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTWTNNRMFQR 1073

Query: 1065 LDRAMVNSQWLIADYESYVEFLPPGCLSDHSCGIVSTLAREKRCNRPFKFYNMWTLHEGF 1244
            LDR + N QW+     + ++ L     SDH   ++S     ++    F+F + W LH  F
Sbjct: 1074 LDRVVYNHQWINMFPITRIQHLNRDG-SDHCPLLISCFISSEKSPSSFRFQHAWVLHHDF 1132

Query: 1245 QDLVASSWDEPISGNAQFALKAKLTRLKGKLRELDKLHFQHI------SEQALRAKSVLE 1406
            +  V  +W+ PI+G+   A   K  RLK  L+  +K  F  I      +E+ +    +L 
Sbjct: 1133 KTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILH 1192

Query: 1407 DAQRSLLSGAGPPNDYEVIRKRATLLAEAEKLFYQQRAKSVYLKNSDRCTKFFHDLVKRN 1586
              ++++ S       Y  + K+  +    E++F++Q++   ++   +R TKFFH  +++ 
Sbjct: 1193 QQEQTVGSRINLNKSYAQLNKQLNV----EEIFWKQKSGVKWVVEGERNTKFFHMRMQKK 1248

Query: 1587 NKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKTGQCTPLD----SNTISGNHLSQ 1754
              R+ I K+ +  G      +++ +  + YF  LL    +  P D     N++  + +S 
Sbjct: 1249 RIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLL----KAEPCDISRFQNSLIPSIISN 1304

Query: 1755 AQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYTAAWDKIGDDFIAAVKEFFHHG 1934
            ++ E L      +E+K+A+FDI  + A GPDGF++ FY   W+ I  D + AV++FFH  
Sbjct: 1305 SENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDFFHGA 1364

Query: 1935 KLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVISKVLSARLANVMSSLLDPAQV 2114
             + +    T++ L+PK   +   +++RPIS CTV  K+I+K+LS RLA ++ S++   Q 
Sbjct: 1365 NIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIITENQS 1424

Query: 2115 AFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKKAFDTVDWGFLMSALVGYGFPQ 2294
             FV  R I +NI LAQEL+RK   K       +K+D+ KA+D +DW FL+  L  +GF +
Sbjct: 1425 GFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNE 1484

Query: 2295 QFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPLSPLLFGLCLEVFSRSMKEA-S 2471
            Q+ G I +CIS   +S+ LNG + G+F   RGLRQGD +SP LF L  E  SR +     
Sbjct: 1485 QWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLNALYD 1544

Query: 2472 RLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIMGCLTDFGDKAGLRANPLKSN 2651
            +    HY      + ++HLA+ADD+L+F     S+L  I+  L ++ + +G R N  KS 
Sbjct: 1545 QYPSLHYSSG-VPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSC 1603

Query: 2652 LYV-AGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLRIANYGSLLEAITKKVISWP 2828
                  +    R  +   TGF   + P  YLG PL     ++  +  L+  I +++  W 
Sbjct: 1604 FVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWE 1663

Query: 2829 KHTLSHAGRIELIRSVLQGVECFWLSILPIPSGVIDKIYGISRSFVW-----STKHPPIS 2993
               LS  GRI L+RSVL  +  + L +L  P  V++++  +  SF+W     S +    S
Sbjct: 1664 NKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWAS 1723

Query: 2994 WSTICLPKEEGGYGLRDLQAWNMTLLCRTLWNIQSRRDSLWVKWIHQHYLRSE------- 3152
            W+ I LP  EGG  +R L         +  W  ++  DSLW +++   Y R +       
Sbjct: 1724 WAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRT-TDSLWTRFMRMKYCRGQLPMQTQP 1782

Query: 3153 -------------------------------DFWH--WRGK----PSDSPLIKCLLQIRD 3221
                                            FWH  W G+     S+      ++Q+ D
Sbjct: 1783 KLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTSSMVQVCD 1842

Query: 3222 VLLQRTTSIGAAKMKLTEWFAE----------------------GENGTALAYDFFRPKK 3335
                 + +I   K  L +   +                      G+  T  A+   R +K
Sbjct: 1843 FFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRK 1902

Query: 3336 PKVPWFSVIWKSILMPKHRFTVWLLAHGKLRTADSLSYEPSKMCALCQ--QHEESNGHIF 3509
               P F+ IW   +     F +W L H  +     +  +  ++ + C+  + EES  H+ 
Sbjct: 1903 VVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEESIMHVM 1962

Query: 3510 FECAIA 3527
            ++  +A
Sbjct: 1963 WDNPVA 1968


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
            putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  419 bits (1078), Expect = e-114
 Identities = 260/733 (35%), Positives = 392/733 (53%), Gaps = 17/733 (2%)
 Frame = +3

Query: 663  NFNYSNRGRILLLWDMHRVSLDVLMATSQYMHCRVRCSITDRIFLATFVYGFHSIMARRP 842
            N+ +S+ G+I +LWD   V + ++  + Q + C V    +    + + VY  +    R+ 
Sbjct: 54   NYGFSDLGKIWVLWDPS-VEVVIVAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKE 112

Query: 843  LWNSLTLLGEN---IAEPWIIMGDYNSLLTVQDKEGGQPVS---NYELHDLERFVHTCGL 1004
            LW  +T L  +      PWI++GD+N +L     E  + VS   +  + D    +    L
Sbjct: 113  LWREITALVASPVTFNRPWILLGDFNQVL--HPHEHSRHVSLNVDRRIRDFRECLLDAEL 170

Query: 1005 VDLRSIGCRLTWTNGS----VSSKLDRAMVNSQWLIADYESYVEFLPPGCLSDH-SCGIV 1169
             DL   G   TW N S    V+ K+DR +VN  W      S+  F PP   SDH SCG+V
Sbjct: 171  SDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPD-FSDHASCGVV 229

Query: 1170 STLAREKRCNRPFKFYNMWTLHEGFQDLVASSW-DEPISGNAQFALKAKLTRLKGKLREL 1346
              L   K   RPFKF+N    +  F +LV   W    + G++ F +  KL  LK  +++ 
Sbjct: 230  LELDPIK-AKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKALKKPIKDF 288

Query: 1347 DKLHFQHISEQALRAKSVLEDAQRSLLSGAGPPNDYEVI--RKRATLLAEAEKLFYQQRA 1520
             +L++ ++ ++   A   L   Q   L      N    +  +++  +LA AE+ F++QR+
Sbjct: 289  SRLNYSNLEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRS 348

Query: 1521 KSVYLKNSDRCTKFFHDLVKRNNKRNAIIKLTKQSGDETTSLDEVAEEFVGYFQDLLGKT 1700
            +  +    D  T++FH +       N I  L   SG +  S   +A+    YF++LL   
Sbjct: 349  RVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCALYFENLLSDD 408

Query: 1701 GQCTPL---DSNTISGNHLSQAQREELVVPITNEEIKNALFDIGADKAPGPDGFNAKFYT 1871
                 L   D N +       +Q  +L    ++E+IK A F + ++KA GPDGF      
Sbjct: 409  NDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACGPDGFPVT--- 465

Query: 1872 AAWDKIGDDFIAAVKEFFHHGKLLKQWNHTSIALIPKSDHSPCVTDYRPISCCTVFYKVI 2051
                       AAV+EFF  G LLKQWN T+I LIPK  ++ C +D+RPISC    YKVI
Sbjct: 466  -----------AAVREFFISGNLLKQWNATTIVLIPKFPNASCTSDFRPISCMNTLYKVI 514

Query: 2052 SKVLSARLANVMSSLLDPAQVAFVKDRSIGENIHLAQELLRKYARKRISPRCMIKVDLKK 2231
            +++L+ RL  ++S ++ P+Q AF+  R + EN+ LA E++  Y  + IS R M+KVDL+K
Sbjct: 515  ARLLTDRLQKLLSCVISPSQSAFLPGRLLAENVLLATEMVHGYNWRNISLRGMLKVDLRK 574

Query: 2232 AFDTVDWGFLMSALVGYGFPQQFCGWIHECISTVSYSVSLNGGLYGFFSGRRGLRQGDPL 2411
            AFD+V W F+++AL+  G P +F  WIH+CIST +++VS+NG   GFF   +GLRQGDPL
Sbjct: 575  AFDSVRWEFIIAALLALGVPTKFINWIHQCISTPTFTVSVNGCCGGFFKSAKGLRQGDPL 634

Query: 2412 SPLLFGLCLEVFSRSMKEASRLDGFHYHPRCAEVELTHLAYADDLLLFARADVSSLNLIM 2591
            SP LF L +EVFS+ +          YHP+ +++ ++HL +ADD+++F     SSL+ I 
Sbjct: 635  SPYLFVLAMEVFSKLLNSRFDSGYIRYHPKASDLSISHLMFADDVMIFFDGGSSSLHGIC 694

Query: 2592 GCLTDFGDKAGLRANPLKSNLYVAGMEEQLRGELLHITGFQQGVFPFRYLGIPLATERLR 2771
              L DF   +GL+ N  KS+ + AG+E+  R  L    GF QG  P RYLG+PL   +LR
Sbjct: 695  ETLEDFASWSGLKVNNDKSHFFCAGLEQAERNSLA-AYGFPQGCLPIRYLGLPLMCRKLR 753

Query: 2772 IANYGSLLEAITK 2810
            IA Y  LLE   K
Sbjct: 754  IAEYEPLLEKSPK 766


Top