BLASTX nr result

ID: Rehmannia25_contig00001067 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00001067
         (3997 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   659   0.0  
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   645   0.0  
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   588   e-165
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       541   e-151
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   526   e-146
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   524   e-145
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   505   e-140
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   501   e-139
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   495   e-137
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           495   e-137
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   493   e-136
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   493   e-136
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   491   e-136
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              481   e-132
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   479   e-132
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   478   e-132
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   460   e-126
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   453   e-124
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   443   e-121
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   437   e-119

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  659 bits (1700), Expect = 0.0
 Identities = 372/1060 (35%), Positives = 576/1060 (54%), Gaps = 31/1060 (2%)
 Frame = -3

Query: 3947 VQNKFPG-WLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTS 3771
            +Q KF   W   NN+     GRI + W ++ V++ VL    Q+I   +      N    +
Sbjct: 47   IQKKFGNRWSWINNYACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMA 106

Query: 3770 FIYGLNTVGERRCLWNKMLELGDLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEFHD 3591
             +YGL+T+ +R+ LW ++     +   P +L+GD+N + +  ++LNG   +     +   
Sbjct: 107  AVYGLHTIADRKVLWEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRS 166

Query: 3590 TCAYLGLSEVQSTGCYFTWTNNTIWC-----RLDRALINSAWSNSNWRCSADIPVPGNVS 3426
                  L E  +TG +++W N +I       R+D++ +N AW N       +    G +S
Sbjct: 167  FVLKAQLLEAPTTGLFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-IS 225

Query: 3425 DHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW---NLNFWGKAQFILCKKLK 3255
            DHSP++ +   Q+    +PFKF N  A    F+  V+ AW   N  F  K  ++   +L+
Sbjct: 226  DHSPLIFNLATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWV---RLQ 282

Query: 3254 ALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAK 3075
            A+K +LK  ++  FS    +V++++  L   Q       + S L +  K+L A+    + 
Sbjct: 283  AVKRALKSFHSKKFSKAHCQVEELRRKLAAVQALPEVSQV-SELQEEEKDLIAQLRKWST 341

Query: 3074 AERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVIL---EDGSKTSSFDDLSK 2904
             + S L QK++  +L+  D N+KFF + IK   +RK  N ++L   + G + +   ++  
Sbjct: 342  IDESILKQKSRIQWLSLGDSNSKFFFTAIK---VRKARNKIVLLQNDRGDQLTENTEIQN 398

Query: 2903 AFVNYFKNLFGTSF-QTSPVDLQTLQSGPCIDEDDFNLLSSPITQQAIKIALFDIEDERS 2727
               N+++ L GTS  Q   +DL  ++ G  +       L  PIT Q I  AL DI+D ++
Sbjct: 399  EICNFYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKA 458

Query: 2726 PGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX-----------R 2580
            PG DGF+S FFKKSW V+  ++   + +FF++  + + IN T                 R
Sbjct: 459  PGLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYR 518

Query: 2579 PIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRI 2400
            PIACC+ +YK+I+KIL  +L++V+ +++D AQ+ FI  R+I DNI LA E+IR Y R  +
Sbjct: 519  PIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHV 578

Query: 2399 SPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICINGSLQGKF 2220
            SPRC IKVD++KAYD++ W FLE +L  LGFP +FI W+M CV T SYSI +NG     F
Sbjct: 579  SPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPF 638

Query: 2219 PGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLF 2040
              ++GLRQGDP+SP LF L MEYLSR +        F +HP+CE +K+THL FADDL++F
Sbjct: 639  DAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMF 698

Query: 2039 SRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILNLLNFPLGTLPVR 1860
            +R D  S+  ++   N F  ASGL  +  KS ++  G+  ++ + + + +  P+G+LP R
Sbjct: 699  ARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFR 758

Query: 1859 YLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFP 1680
            YLGVPLA++KLN +   PL D+I +    W A+ LSYAGRL L+K++L  ++ +W QIFP
Sbjct: 759  YLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFP 818

Query: 1679 LPSTVIDRINRLCRVFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKI 1515
            LP  +I  +   CR FLW         +P+ W  +  P   GGL + ++  WNKA + K+
Sbjct: 819  LPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKL 878

Query: 1514 LWNIHSKADSLWVRWVHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSLDLAI 1335
            LW I  K D LWVRWV+++Y+K Q+I       + S +L++I + R E+L + G  + A+
Sbjct: 879  LWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGGWE-AV 936

Query: 1334 SNLSLFANSKGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINN 1155
            SN   F+        K Y L +E      W   +  N   PK  F +WLA  +RLAT   
Sbjct: 937  SNHMNFS------IKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAER 990

Query: 1154 LK--YLDVDPTCKLCGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKWTK 981
            +     DV P CK+CGN +E   HLFF+CI ++ +W +V  +L         A      K
Sbjct: 991  VSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADAQAKKELAIK 1050

Query: 980  KEHGAPALKKARAVAVCCTIYQIWKARNCQVFEGLNFAED 861
            K        K   +    ++Y IW  RN +VF G+   ++
Sbjct: 1051 KARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQN 1090


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  645 bits (1665), Expect = 0.0
 Identities = 369/1062 (34%), Positives = 566/1062 (53%), Gaps = 34/1062 (3%)
 Frame = -3

Query: 3926 WLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSFIYGLNTV 3747
            W   NN+      RI I W  + V++ +  T  QL+ C +  +  +  ++   +YGL+T+
Sbjct: 55   WKWLNNYSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQDQSHKLKMVA--VYGLHTI 112

Query: 3746 GERRCLWNKMLELGDLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEFHDTCAYLGLS 3567
             +R+ LW+ +L+     D P +++GDFN + + +++L G   T    E+F        L 
Sbjct: 113  ADRKSLWSGLLQCVQQQD-PMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLI 171

Query: 3566 EVQSTGCYFTWTNNTIW-----CRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVS 3402
            E +ST  Y++W+N++I       R+D+A +N  W       S     PG +SDHSP++ +
Sbjct: 172  ESRSTWSYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPG-ISDHSPLLFN 230

Query: 3401 FFEQNLVLSKPFKFFNMWALHPDFLNTVQTAWN-LNFWGKAQFILCKKLKALKPSLKELN 3225
                     KPFKF N+ A   +FL TV+ AWN +N   K Q I    LKA+K  LK++ 
Sbjct: 231  LMTGRPQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLN-LKAVKRELKQMK 289

Query: 3224 NFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAKAERSFLSQKA 3045
                     +VK ++  L++ Q Q   D  N  +    K +       +  E S L QK+
Sbjct: 290  TQKIGLAHEKVKNLRHQLQDLQSQDDFDH-NDIMQTDAKSIMNDLRHWSHIEDSILQQKS 348

Query: 3044 KCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGTS 2865
            +  +L   D N+K F + +K      +++ +  EDG      D++ +  + ++K L GT 
Sbjct: 349  RITWLQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTR 408

Query: 2864 FQT-SPVDLQTLQSGPCIDEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKK 2688
              T   VDL T++ G C+       L   +    I  AL  I ++++PG DGF++ FFKK
Sbjct: 409  ASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKK 468

Query: 2687 SWDVVGNDVIAAVSEFFDSSKILRQIN-----------HTXXXXXXRPIACCNVVYKVIT 2541
            SW  +  ++ A + EFF++S++ R IN           H       RPIACC V+YK+I+
Sbjct: 469  SWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIIS 528

Query: 2540 KILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKA 2361
            K+L ++++ ++ ++++ AQS FI GR+I DNI LA E+IR Y R  +SPRC +KVD++KA
Sbjct: 529  KMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKA 588

Query: 2360 YDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMS 2181
            YD++ WSFLE +L   GFP  F+ W+MECVST SYS+ +NG     F  ++GLRQGDPMS
Sbjct: 589  YDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMS 648

Query: 2180 PALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILID 2001
            P LF LCMEYLSR L+    +  F +HP+CE L ITHL FADDL++F R D  S+  +  
Sbjct: 649  PFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNV 708

Query: 2000 CLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNV 1821
                F +ASGL  +  KSN++  G+  +    + + ++  LG LP RYLGVPL ++KL  
Sbjct: 709  AFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTY 768

Query: 1820 NHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLC 1641
                PL + I +    W A  LSYAGRL LIKS+L  ++ +W  IFPL   VI  + ++C
Sbjct: 769  AQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVC 828

Query: 1640 RVFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWV 1476
            R FLW   T     +P+ W+ +  P   GG  + ++  WN+A + K+LW I  K D LWV
Sbjct: 829  RKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWV 888

Query: 1475 RWVHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSLDLAISNLSLFANSKGLC 1296
            RW+HS+Y+K Q I T +     + +L++I   R+  L+  G  D       +    K   
Sbjct: 889  RWIHSYYIKRQDILTVNISNQTTWILRKIVKARDH-LSNIGDWD------EICIGDK-FS 940

Query: 1295 SSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--DPTCK 1122
              K Y    E G +  W   +  N+  PK  F +W+  ++RL T++ +    V  D   +
Sbjct: 941  MKKAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYR 1000

Query: 1121 LCGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHS-------MSTIAGALKWTKKEHGAP 963
            LC N  E   HLFF C  +  +W ++   ++F +S       +S++ G  +         
Sbjct: 1001 LCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQAR--------- 1051

Query: 962  ALKKARAVAVCCT--IYQIWKARNCQVFEGLNFAEDTVFLKI 843
              KK + + +  T  +Y IWK RN + F G N  E+ V  KI
Sbjct: 1052 -KKKGKLIVMLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  588 bits (1517), Expect = e-165
 Identities = 329/901 (36%), Positives = 506/901 (56%), Gaps = 29/901 (3%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            LET V  + +R    + + FPGW +  N+     GRI ++W+ + V++ VL    Q I C
Sbjct: 36   LETRV--KEHRARRSLLSSFPGWKSVCNYEFAALGRIWVVWDPA-VEVTVLSKSDQTISC 92

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLELG---DLIDSPWLLLGDFNTIKNPDE 3642
             +         + +F+Y +N    RR LW+++  L       D PW++LGDFN   +P +
Sbjct: 93   TVKLPHISTEFVVTFVYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLDPVD 152

Query: 3641 KLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTW----TNNTIWCRLDRALINSAWSN 3474
               G    ++ +EEF +      +S++   G ++TW     NN I  ++DR L+N +W  
Sbjct: 153  ASTGGSRITRGMEEFRECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLI 212

Query: 3473 SNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAWN-LN 3297
            ++   S         SDH P  V+   Q+   +KPFK  N    HP+F+  ++  W+ L 
Sbjct: 213  AS-PLSYGSFCAMEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLA 271

Query: 3296 FWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCD 3117
            + G A F L KK K LK +++  N  H+S +  RV Q    LK  Q  L   P +S L  
Sbjct: 272  YQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAG 330

Query: 3116 SVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILEDG 2937
              KE       LA AE  FL QK++  +L   D NT FFH ++       +++ ++ + G
Sbjct: 331  LEKEAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTG 390

Query: 2936 SKTSSFDDLSKAFVNYFKNLFGTSFQTSPVD----LQTLQSGPCIDEDDFNLLSSPITQQ 2769
             +  + D+L    V++FK LFG+S      +    + +L    C DE+   LL + +++ 
Sbjct: 391  RRIENTDELQTHCVDFFKELFGSSSHLISAEGISQINSLTRFKC-DENTRQLLEAEVSEA 449

Query: 2768 AIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXX 2589
             IK   F +   +SPGPDG++S FFKK+W +VG  +IAAV EFF S ++L Q N T    
Sbjct: 450  DIKSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTM 509

Query: 2588 XX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIF 2442
                         RPI+CCN +YKVI+K+LA +LE+++P  I P+QSAF+ GR +T+N+ 
Sbjct: 510  VPKKPNADRITEFRPISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVL 569

Query: 2441 LAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTA 2262
            LA E+++ + +  IS R  +KVDL+KA+D++ W F+ + L     PP F++W+ +C+++ 
Sbjct: 570  LATELVQGFGQANISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITST 629

Query: 2261 SYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETL 2082
            S+SI ++GSL G F G +GLRQGDP+SP+LF++ ME LSRLL+ + +  +  YHP+   +
Sbjct: 630  SFSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEV 689

Query: 2081 KITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVG-DKLDN 1905
            +I+ LAFADDLM+F  G   S++ +   L  FKN SGL++N+ KS ++TAG+   DK D 
Sbjct: 690  RISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDT 749

Query: 1904 ILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIK 1725
            +     F  GT P RYLG+PL  +KL  + Y+ L D+IA+  N W   +LS+AGRL LI 
Sbjct: 750  L--AFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLIS 807

Query: 1724 SVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTS-----PIKWSKVCLPFDEGGLG 1560
            SV+     FWL  F LP   +  I ++C  FLWG + +      + W   CLP  EGGLG
Sbjct: 808  SVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLG 867

Query: 1559 LRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWDPKKSDSCLLKRICDI 1380
            LR+  +WNK L  +++W + ++ DSLWV W H+  L++ + W  +     S + K I  +
Sbjct: 868  LRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGL 927

Query: 1379 R 1377
            R
Sbjct: 928  R 928



 Score = 65.9 bits (159), Expect = 1e-07
 Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 4/166 (2%)
 Frame = -3

Query: 1316 ANSKGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV 1137
            ++S    S   ++  R+      W +AVW     PKY+F  W+A  +RL       +   
Sbjct: 1028 SSSTSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWST 1087

Query: 1136 D--PTCKLCGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKWTKKEHG-- 969
            +    C +C    E   HLF  C +  L+W +V      S         ++W     G  
Sbjct: 1088 NRPSLCCVCQRETETRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLSNQGSF 1147

Query: 968  APALKKARAVAVCCTIYQIWKARNCQVFEGLNFAEDTVFLKIQKDV 831
            +  LKK   +AV   I+ IWK RN ++   ++ +   +F +I + +
Sbjct: 1148 SGTLKK---LAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSI 1190


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  541 bits (1394), Expect = e-151
 Identities = 322/905 (35%), Positives = 484/905 (53%), Gaps = 33/905 (3%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            +ET V    +R    +    PGW    N+     G+I ++W+ S V + V+    Q+I C
Sbjct: 37   IETHVKQPKDRK--FINALLPGWSFVENYAFSDLGKIWVMWDPS-VQVVVVAKSLQMITC 93

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLEL---GDLIDSPWLLLGDFNTIKNPDE 3642
             +    S + I+ S +Y  N V  R+ LW +++ +   G + D PWL+LGDFN + NP E
Sbjct: 94   EVLLPGSPSWIIVSVVYAANEVASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQE 153

Query: 3641 KLNGEPFTSK-SVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAWS 3477
              N        ++ +F D      LS+++  G  FTW N +    +  ++DR L+N +W 
Sbjct: 154  HSNPVSLNVDINMRDFRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSW- 212

Query: 3476 NSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW-NL 3300
            N+ +  S  I    + SDH    V   E ++   +PFKFFN    + DFLN V+  W  L
Sbjct: 213  NALFPSSLGIFGSLDFSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTL 272

Query: 3299 NFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDP--LNSA 3126
            N  G + F + KKLKALK  +K+ +  ++S +  R K+    L   Q +   DP  +N++
Sbjct: 273  NVVGSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINAS 332

Query: 3125 LCDSVKELKAKETF--LAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSV 2952
                  EL+A+  +  L  AE SF  QK++  +    D NTK+FH +    +    ++++
Sbjct: 333  F-----ELEAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISAL 387

Query: 2951 ILEDGSKTSSFDDLSKAFVNYFKNLFGTS---FQTSPVDLQTLQSGPCIDEDDFNLLSSP 2781
               +G    S + +     +YF +L G     +     D+  L S  C       L  S 
Sbjct: 388  YDGNGKLVDSQEGILDLCASYFGSLLGDEVDPYLMEQNDMNLLLSYRCSPAQVCEL-EST 446

Query: 2780 ITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHT 2601
             + + I+ ALF +   +S GPDGF++ FF  SW +VG +V  A+ EFF S  +L+Q N T
Sbjct: 447  FSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNAT 506

Query: 2600 XXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNIT 2454
                             RPI+C N +YKVI ++L  +L+ ++  +I  AQSAF+ GR++ 
Sbjct: 507  TIVLIPKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLA 566

Query: 2453 DNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMEC 2274
            +N+ LA +++  Y    ISPR  +KVDLKKA+D++ W F+   L  L  P  FI+W+ +C
Sbjct: 567  ENVLLATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQC 626

Query: 2273 VSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPR 2094
            +ST ++++ ING   G F   +GLRQGDP+SP LF+L ME  S LL  R  +    YHP+
Sbjct: 627  ISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPK 686

Query: 2093 CETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDK 1914
               L I+HL FADD+M+F  G   S+  + + L+DF + SGL VN  KS+L+ AG+  ++
Sbjct: 687  ASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGL--NQ 744

Query: 1913 LDNILN-LLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRL 1737
            L++  N    FP+GTLP+RYLG+PL  +KL +  Y PL ++I +    W    LS+AGR+
Sbjct: 745  LESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRI 804

Query: 1736 LLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTSPIK-----WSKVCLPFDE 1572
             LI SV+ G   FW+  F LP   I RI  LC  FLW  N    K     W+ +CLP  E
Sbjct: 805  QLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSE 864

Query: 1571 GGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWDPKKSDSCLLKR 1392
            GGLGLR +  WNK L  +++W +    DSLW  W H  +L   S W  +  +SDS   KR
Sbjct: 865  GGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKR 924

Query: 1391 ICDIR 1377
            +  +R
Sbjct: 925  LLSLR 929



 Score = 60.1 bits (144), Expect = 8e-06
 Identities = 39/169 (23%), Positives = 75/169 (44%), Gaps = 6/169 (3%)
 Frame = -3

Query: 1307 KGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLK---YLDV 1137
            +G  ++K ++  R       W S++W     PKY+F +W++  +RL T   L    ++  
Sbjct: 1031 QGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQS 1090

Query: 1136 DPTCKLCGNYLENASHLFFDCIVTRLLWDRV-KKWLKFSHSMSTIAGALKWTKKE--HGA 966
            D  C LC    E+  HL   C  +  +W  V ++        S+ +  L W ++      
Sbjct: 1091 D-ACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEAP 1149

Query: 965  PALKKARAVAVCCTIYQIWKARNCQVFEGLNFAEDTVFLKIQKDVYKVV 819
            P L+K  +  V   +Y +W+ RN  +   L  A   +F  + +++  ++
Sbjct: 1150 PLLRKIVSQVV---VYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRNII 1195


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  526 bits (1356), Expect = e-146
 Identities = 314/892 (35%), Positives = 465/892 (52%), Gaps = 39/892 (4%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            +ET V  + ++   +V   F  W    N+   + GRI +LW  + V L  +    QL+ C
Sbjct: 35   VETRV--KESKVSQLVGKLFKDWSILTNYEHNRRGRIWVLWRKN-VRLSPIYKSCQLLTC 91

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLELGDLIDSP------WLLLGDFNTIKN 3651
             +  +  Q+    SF+Y  N V ER+ LW+   EL D  DSP      W LLGDFN   +
Sbjct: 92   SVKLEDRQDEFFCSFVYASNYVEERKVLWS---ELKDHYDSPIIRHKPWTLLGDFNETLD 148

Query: 3650 PDEKLNG--EPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALIN 3489
              E       P  +  + +F     Y  L+++ + G  FTW N      I  +LDR LIN
Sbjct: 149  IAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQGPLFTWCNKREHGLIMKKLDRVLIN 208

Query: 3488 SAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQ--NLVLS-KPFKFFNMWALHPDFLNTV 3318
              W N  +  S  +   G  SDH    +S   +  N V   KPFKF N      DF   V
Sbjct: 209  DCW-NQTFSQSYSVFEAGGCSDHLRCRISLNSEAGNKVQGLKPFKFVNALTDMEDFKPMV 267

Query: 3317 QTAWN----LNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQL 3150
             T W     L       F   K LK LKP ++ +      ++S +  +    L   Q   
Sbjct: 268  STYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSMARDRLGNLSKKANEAYKILCAKQHVN 327

Query: 3149 HTDPLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLR 2970
             T+P + A+ +      ++   +A  E  +L QK+K  +    D+NTK FH         
Sbjct: 328  LTNPSSMAM-EEENAAYSRWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAH 386

Query: 2969 KQMNSVILEDGSKTSSFDDL---SKAFVNYFKNLFGTSFQTSPV-DLQTLQSGPCIDEDD 2802
              +  ++  DG   +  D++   ++ F   F  L    F+   + +LQ L    C D D 
Sbjct: 387  NTIREILSNDGIVKTKGDEIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQ 446

Query: 2801 FNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKI 2622
             +L+  P+T + I+  LF +  ++SPGPDG++S FFK +W+++G++   AV  FF    +
Sbjct: 447  QSLIR-PVTAEEIRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFL 505

Query: 2621 LRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAF 2475
             + IN T                 RPI+CCNV+YKVI+KI+A++L+ V+PK I   QSAF
Sbjct: 506  PKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAF 565

Query: 2474 ISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIF 2295
            +  R + +N+ LA E++++Y +  IS RCAIK+D+ KA+D++ W FL  V   LGFP  F
Sbjct: 566  VKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREF 625

Query: 2294 ISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTS 2115
            I W+  C++TAS+S+ +NG L G F   RGLRQG  +SP LF++CM+ LS++L       
Sbjct: 626  IHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAAR 685

Query: 2114 TFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFT 1935
             F YHP+C+T+ +THL+FADDLM+ S G + S++ +I   ++F   SGL ++  KS ++ 
Sbjct: 686  HFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYL 745

Query: 1934 AGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSL 1755
            AG+     + + +   F  G LPVRYLG+PL  ++L+     PL +++   I  WT+  L
Sbjct: 746  AGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFL 805

Query: 1754 SYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPIKWSKV 1590
            SYAGRL LI SVL  +  FWL  F LP   I  + ++C  FLW       N + I W  V
Sbjct: 806  SYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMV 865

Query: 1589 CLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIW 1434
            C P DEGGLGLR +   N     K++W I S ++SLWV+WV    L+N S W
Sbjct: 866  CKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFW 917



 Score = 63.5 bits (153), Expect = 7e-07
 Identities = 42/132 (31%), Positives = 60/132 (45%), Gaps = 7/132 (5%)
 Frame = -3

Query: 1271 REAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLAT----INNLKYLDVDPTCKLCGNYL 1104
            R    +  WH  +W +   PKYSFC WLA + RL T    IN    +  D  C  C   L
Sbjct: 1048 RSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATD--CIFCQGTL 1105

Query: 1103 ENASHLFFDCIVTRLLWDRVKKWL---KFSHSMSTIAGALKWTKKEHGAPALKKARAVAV 933
            E   HLFF C  T ++W  + + +   +++    +I  A+  T  +H        R V  
Sbjct: 1106 ETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAI--TNSQHHRVEWFLRRYV-F 1162

Query: 932  CCTIYQIWKARN 897
              TIY +W+ RN
Sbjct: 1163 QATIYIVWRERN 1174


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score =  524 bits (1349), Expect = e-145
 Identities = 308/873 (35%), Positives = 462/873 (52%), Gaps = 31/873 (3%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            LET V  ++   + ++ +  PGW   +N+   + GRI I+W+ S   L    TD Q++ C
Sbjct: 35   LETHVAQEN--ANSVLASTLPGWRMDSNYCCSELGRIWIVWDPSISVLVFKRTD-QIMFC 91

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLELG---DLIDSPWLLLGDFNTIKNPDE 3642
             +       S   +F+YG N+  +RR LW  +L L     L  +PWLLLGDFN I    E
Sbjct: 92   SIKIPSLLQSFAVAFVYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASE 151

Query: 3641 --KLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTWTN----NTIWCRLDRALINSAW 3480
               +N      + +E+         LS++ S G +FTW+N    N I  +LDRAL N  W
Sbjct: 152  HYSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEW 211

Query: 3479 SNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAWNL 3300
                    A    PG+ SDH+P ++    Q     K FK+F+  + HP +L  + TAW  
Sbjct: 212  FAVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEA 270

Query: 3299 N-FWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSAL 3123
            N   G   F L + LK  K   + LN   FS+I  R  Q  T L++ Q++L T P +  L
Sbjct: 271  NTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTL 329

Query: 3122 CDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILE 2943
                   + +  F A A  SF  QK++  +L+  D NT+FFH  +  +     +  +  +
Sbjct: 330  FRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGD 389

Query: 2942 DGSKTSSFDDLSKAFVNYFKNLFGT-SFQTSPVDLQTLQSGPCIDEDDFNLLSSPIT--- 2775
            DG +  + D +    + Y+ +L G  S   +P  ++ ++       D F  L+S +T   
Sbjct: 390  DGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSF--LASQLTTIP 447

Query: 2774 -QQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTX 2598
             ++ I   LF +   ++PGPDGF   FF ++W +V + V+AA+ EFF S  + R  N T 
Sbjct: 448  SEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATA 507

Query: 2597 XXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITD 2451
                            RP+ACC  +YKVIT+I++ +L+  + + +   Q  FI GR + +
Sbjct: 508  ITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCE 567

Query: 2450 NIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECV 2271
            N+ LA E++ N+     + R  ++VD+ KAYD ++W FL  +L  L  P +FI W+  C+
Sbjct: 568  NVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCI 627

Query: 2270 STASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRC 2091
            S+ASYSI  NG L G F GK+G+RQGDPMS  LF+L M+ LS+ L +      F  HP C
Sbjct: 628  SSASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNC 687

Query: 2090 ETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKL 1911
                ITHL+FADD+++FS G   S+  ++  L+DF+  SGL +N  K+ L   G    + 
Sbjct: 688  LAPIITHLSFADDVLVFSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARN 747

Query: 1910 DNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLL 1731
             ++ + L    G+LPVRYLGVPL +QK+    Y PL DRI S    WTA  LS+AGRL L
Sbjct: 748  RSLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQL 807

Query: 1730 IKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPIKWSKVCLPFDEGG 1566
            +KSV+     FW  +F  P+  + ++ ++C  FLW         + I W+ VC P + GG
Sbjct: 808  LKSVIYSTINFWASVFIFPNQCLQKLEQMCNAFLWSGAPNSARGAKISWNIVCSPKEAGG 867

Query: 1565 LGLRDVHSWNKALLAKILWNIHSKADSLWVRWV 1467
            LGL+ + SWN+ L  K++W + + A SLWV WV
Sbjct: 868  LGLKRLSSWNRILALKLIWLLFTSAGSLWVSWV 900


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  505 bits (1300), Expect = e-140
 Identities = 348/1089 (31%), Positives = 520/1089 (47%), Gaps = 57/1089 (5%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            LET V+   ++   I    F  W   +N+   + GRI ++W+SS V LQV+    Q+I C
Sbjct: 337  LETRVI--ESKVPVIFAKVFKDWQMVSNYEFNRLGRIWVVWSSS-VQLQVIFKSSQMIVC 393

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLELGDLI---DSPWLLLGDFNTIKNPDE 3642
             +  +      + SFIY  N V ER+ LW  +  L + +   + PWLL GDFN     +E
Sbjct: 394  LVRVEHYDVEFICSFIYASNFVEERKKLWQDLHNLQNSVAFRNKPWLLFGDFNETLKMEE 453

Query: 3641 KLNG--EPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAW 3480
              +    P  +  + +F     Y  L ++++ G  FTW N      I  +LDR L+N  +
Sbjct: 454  HSSYAVSPMVTPGMRDFQIVVRYCSLEDMRTHGPLFTWGNKRNEGLICKKLDRVLLNPEY 513

Query: 3479 SNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKP---FKFFNMWALHPDFLNTVQTA 3309
             NS +  S  I   G  SDH   +   F     + KP   FKF N+ A HP+F+  V+  
Sbjct: 514  -NSAYPHSYCIMDSGGCSDH---LRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDF 569

Query: 3308 WN----LNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTD 3141
            W     L       F   KKLK LKP LK+L+  + S ++ R       L   Q +  T 
Sbjct: 570  WKNTTELFPSTSTLFRFSKKLKELKPILKDLSRNNLSDLTRRATYAYEELCRCQTKSLTT 629

Query: 3140 PLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQM 2961
             LN    D V E  A E +                                K   L   +
Sbjct: 630  -LNPH--DIVDESLAFERWE-------------------------------KERHLLNAI 655

Query: 2960 NSVILEDGSKTSSFDDLSKAFVNYFKNLFGTSFQTSPVDLQTLQSGPCIDE--------- 2808
            + V+   G++  + DD+    V +F +L  +     P D     +G  +DE         
Sbjct: 656  HEVMDPQGTRPPNQDDIKIEAVRFFSDLLSSQ----PSDF----TGISVDELKGILQYRY 707

Query: 2807 --DDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFD 2634
               + NLL + IT+  +    F I   +SPGPDG++  FF+++W V+G +V  A+  FF 
Sbjct: 708  SLHEQNLLVAEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFT 767

Query: 2633 SSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPA 2487
               + + +N T                 RPI+CCNV+YK I+K+LA++L+ ++P+ I P 
Sbjct: 768  YGFLPKGLNSTILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPN 827

Query: 2486 QSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGF 2307
            QSAFIS R + +N+ LA E++++Y +  +SPRCA+K+DL KA+D++ W FL   L  L  
Sbjct: 828  QSAFISDRLLMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDI 887

Query: 2306 PPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVR 2127
            P  FI W+  C+STAS+S+ +NG           LRQG  +SP LF++CM  LS +L   
Sbjct: 888  PEKFIHWINLCISTASFSVQVNG-----------LRQGCSLSPYLFVICMNVLSAMLDKG 936

Query: 2126 TNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKS 1947
                 F YHPRC  + +THL FADD+M+FS G   S++ ++    DF   SGL+++  KS
Sbjct: 937  AVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKS 996

Query: 1946 NLFTAGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWT 1767
             LF A I  +   +IL    F  G+LPVRYLG+PL  +++ +    PL ++I S I+ W 
Sbjct: 997  TLFMASISSETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWK 1056

Query: 1766 ANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-GKNTSP----IK 1602
               LSYAGRL L+ SV+  +  FW+  F LP   I  I ++   FLW G + +P    + 
Sbjct: 1057 NRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVA 1116

Query: 1601 WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWDP 1422
            W  VC P  EGGLGLR +   NK    K++W + S   SLWV W+     +N  I T   
Sbjct: 1117 WHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWI-----QNNLIRTVAE 1171

Query: 1421 KKSDSCLLKRICDIRNEILAKFGSLDLAISNLSLFANSKGLC------------SSKMYD 1278
              S         DI N+I  +   L+  +         + LC            S +++ 
Sbjct: 1172 ALSSHRRRSHRDDILNDIEEE---LEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWH 1228

Query: 1277 LFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLD--VDPTCKLCGNYL 1104
              RE G    WH A+W +   PK++F  WLA +DRL T + +   +  +   C LC    
Sbjct: 1229 QIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISA 1288

Query: 1103 ENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKWTKKEHGAPALKKARAVAVCCT 924
            E+  HLFF C  +  +WDR+ + L      +     L     +  +   +         T
Sbjct: 1289 ESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDFSGTKRFLLRYVFQAT 1348

Query: 923  IYQIWKARN 897
            I+ +W+ RN
Sbjct: 1349 IHTLWRERN 1357


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  501 bits (1291), Expect = e-139
 Identities = 298/833 (35%), Positives = 449/833 (53%), Gaps = 37/833 (4%)
 Frame = -3

Query: 3782 ILTSFIYGLNTVGERRCLWNKMLELGD---LIDSPWLLLGDFNTIKNPDEKLNGEPFT-S 3615
            ++ SF+Y       R+ LWN++++  +   +ID PW +LGDFN I +P E    + F   
Sbjct: 1    MVLSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVD 60

Query: 3614 KSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAWSNSNWRCSADI 3447
            +    F +T     L+++   G  FTW N      +  +LDR L+N  W+ + +  S  +
Sbjct: 61   RPTRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTT-FPSSLGL 119

Query: 3446 PVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW-NLNFWGKAQFIL 3270
                + SDHS   +S    +    KPF+F N      +FL+ +   W + +  G A + +
Sbjct: 120  FGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRV 179

Query: 3269 CKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKAKE 3090
              KLKALK  +++ +  ++S I  R K+   AL  +Q  L   P  S       E + K 
Sbjct: 180  SVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEA-ETQRKW 238

Query: 3089 TFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSV-ILED--GSKTSSF 2919
              LA+AE SF  Q+++ ++L   D N+ +FH   K  S R+ +N +  L D  G +    
Sbjct: 239  RILAEAEASFFYQRSRVNWLREGDMNSSYFH---KMASARQSLNHIHFLSDPVGDRIEGQ 295

Query: 2918 DDLSKAFVNYFKNLFGTSFQTSPV----DLQTLQSGPCIDEDDFNLLSSPITQQAIKIAL 2751
             +L    V YF++  G+  Q  P+    D+  L S  C      +L  +P + + IK A 
Sbjct: 296  QNLENHCVEYFQSNLGSE-QGLPLFEQADISNLLSYRCSPAQQVSL-DTPFSSEQIKNAF 353

Query: 2750 FDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX---- 2583
            F +   ++ GPDGFS  FF   W ++G +V  A+ EFF S K+L+Q N T          
Sbjct: 354  FSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITN 413

Query: 2582 -------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEII 2424
                   RPI+C N VYKVI+K+L  +L+  +P  I  +QSAF+ GR   +N+ LA E++
Sbjct: 414  ASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELV 473

Query: 2423 RNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICI 2244
              Y +  I+P   +KVDL+KA+D++ W F+   L  L  P  F  W++EC+STAS+S+ +
Sbjct: 474  HGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVIL 533

Query: 2243 NGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLA 2064
            NG   G F   +GLRQGDPMSP LF+L ME  S LL+ R  +    YHP+   L+I+HL 
Sbjct: 534  NGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLM 593

Query: 2063 FADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILNLLNF 1884
            FADD+M+F  G   S+  +++ L DF   SGL +N+ K+ L+ AG+   + D++ +   F
Sbjct: 594  FADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMAS-YGF 652

Query: 1883 PLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVE 1704
             LG+LPVRYLG+PL ++KL +  YAPL ++I +  N W    LS+AGR+ L+ SV+ G+ 
Sbjct: 653  KLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIV 712

Query: 1703 CFWLQIFPLPSTVIDRINRLCRVFLWG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSW 1539
             FW+  F LP   I +I  LC  FLW      K  + + WS+VCLP  EGG+GLR     
Sbjct: 713  NFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVS 772

Query: 1538 NKALLAKILWNIHSKADSLWVRWVHSFYL-KNQSIWTWDPKKSDS----CLLK 1395
            N+ L  +++W + S + SLWV W     L K+ S W    K  DS    CLL+
Sbjct: 773  NRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLR 825


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  495 bits (1274), Expect = e-137
 Identities = 280/796 (35%), Positives = 433/796 (54%), Gaps = 28/796 (3%)
 Frame = -3

Query: 3680 LLGDFNTIKNPDEKLNGEPFT-SKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IW 3516
            +LGDFN +  P E  N       + + +F    + + LS++   G  FTW N +    I 
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 3515 CRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHP 3336
             +LDR L N +W N  +  S  +    + SDH    V      +   +PFKFFN    + 
Sbjct: 61   KKLDRILANDSWCNL-YPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNE 119

Query: 3335 DFLNTVQTAW-NLNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQ 3159
            DFLN V   W + N  G + + + KKLKA+K  +K+ +  ++S I  R K+    L   Q
Sbjct: 120  DFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQ 179

Query: 3158 IQLHTDPLNSALCDSVKELKA--KETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIK 2985
               +    N ++ ++  EL+A  K   L+ AE SF  Q+++  +    D NT +FH ++ 
Sbjct: 180  ---NLTLANPSVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 2984 RNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGT---SFQTSPVDLQTLQSGPCI 2814
                   +NS++  +G    S   +    V Y++ L G+    F     D+  L +  C 
Sbjct: 237  SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC- 295

Query: 2813 DEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFD 2634
             +D  + L    T   IK A   +   ++ GPDG+S  FF+ +W ++G +V+AA+ EFFD
Sbjct: 296  SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 2633 SSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPA 2487
            S ++L+Q N T                 RPI+C N +YKVI+K+L S+L+ ++  +I  +
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415

Query: 2486 QSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGF 2307
            QSAF+ GR++ +N+ LA E++  Y R+ ISPR  +KVDLKKA+D++ W F+   L  L  
Sbjct: 416  QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475

Query: 2306 PPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVR 2127
            P  +I+W+ +C++T S++I +NG+  G F   +GLRQGDP+SP LF+L ME  S+LL  R
Sbjct: 476  PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535

Query: 2126 TNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKS 1947
             ++    YHP+   L I+HL FADD+M+F  G   S+  + + L+DF + SGL VN  KS
Sbjct: 536  YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595

Query: 1946 NLFTAGIVGDKLDNILN-LLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKW 1770
             LF AG+  D  + I +    FP GT P+RYLG+PL  +KL +  Y PL +++++ +  W
Sbjct: 596  QLFQAGL--DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSW 653

Query: 1769 TANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPI 1605
             + +LS+AGR  LI SV+ G+  FW+  F LP   I +I  LC  FLW     G+ +S +
Sbjct: 654  VSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKV 713

Query: 1604 KWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWD 1425
             W   CLP  EGGLG R    WNK LL +++W +  +  SLW +W     L + S W  +
Sbjct: 714  SWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVN 773

Query: 1424 PKKSDSCLLKRICDIR 1377
              ++D    K + ++R
Sbjct: 774  ALQTDPWTWKMLLNLR 789



 Score = 60.5 bits (145), Expect = 6e-06
 Identities = 43/168 (25%), Positives = 70/168 (41%), Gaps = 3/168 (1%)
 Frame = -3

Query: 1307 KGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKY--LDVD 1134
            +G  ++K +++ R   P   W  +VW     PK++F  W A  +RL T   L    L   
Sbjct: 890  QGFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949

Query: 1133 PTCKLCGNYLENASHLFFDCIVTRLLWDRV-KKWLKFSHSMSTIAGALKWTKKEHGAPAL 957
              C LC    E   HL   C  +  +W  V  +       + T A  L WT++   A A 
Sbjct: 950  AECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAA-AP 1008

Query: 956  KKARAVAVCCTIYQIWKARNCQVFEGLNFAEDTVFLKIQKDVYKVVFN 813
               R V     +Y +W+ RN  +   L  +   VF  + +++  V+ +
Sbjct: 1009 SLLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILS 1056


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  495 bits (1274), Expect = e-137
 Identities = 280/796 (35%), Positives = 433/796 (54%), Gaps = 28/796 (3%)
 Frame = -3

Query: 3680 LLGDFNTIKNPDEKLNGEPFT-SKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IW 3516
            +LGDFN +  P E  N       + + +F    + + LS++   G  FTW N +    I 
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 3515 CRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHP 3336
             +LDR L N +W N  +  S  +    + SDH    V      +   +PFKFFN    + 
Sbjct: 61   KKLDRILANDSWCNL-YPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNE 119

Query: 3335 DFLNTVQTAW-NLNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQ 3159
            DFLN V   W + N  G + + + KKLKA+K  +K+ +  ++S I  R K+    L   Q
Sbjct: 120  DFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQ 179

Query: 3158 IQLHTDPLNSALCDSVKELKA--KETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIK 2985
               +    N ++ ++  EL+A  K   L+ AE SF  Q+++  +    D NT +FH ++ 
Sbjct: 180  ---NLTLANPSVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 2984 RNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGT---SFQTSPVDLQTLQSGPCI 2814
                   +NS++  +G    S   +    V Y++ L G+    F     D+  L +  C 
Sbjct: 237  SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRC- 295

Query: 2813 DEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFD 2634
             +D  + L    T   IK A   +   ++ GPDG+S  FF+ +W ++G +V+AA+ EFFD
Sbjct: 296  SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 2633 SSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPA 2487
            S ++L+Q N T                 RPI+C N +YKVI+K+L S+L+ ++  +I  +
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415

Query: 2486 QSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGF 2307
            QSAF+ GR++ +N+ LA E++  Y R+ ISPR  +KVDLKKA+D++ W F+   L  L  
Sbjct: 416  QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475

Query: 2306 PPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVR 2127
            P  +I+W+ +C++T S++I +NG+  G F   +GLRQGDP+SP LF+L ME  S+LL  R
Sbjct: 476  PERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR 535

Query: 2126 TNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKS 1947
             ++    YHP+   L I+HL FADD+M+F  G   S+  + + L+DF + SGL VN  KS
Sbjct: 536  YDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKS 595

Query: 1946 NLFTAGIVGDKLDNILN-LLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKW 1770
             LF AG+  D  + I +    FP GT P+RYLG+PL  +KL +  Y PL +++++ +  W
Sbjct: 596  QLFQAGL--DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSW 653

Query: 1769 TANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPI 1605
             + +LS+AGR  LI SV+ G+  FW+  F LP   I +I  LC  FLW     G+ +S +
Sbjct: 654  VSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKV 713

Query: 1604 KWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWD 1425
             W   CLP  EGGLG R    WNK LL +++W +  +  SLW +W     L + S W  +
Sbjct: 714  SWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVN 773

Query: 1424 PKKSDSCLLKRICDIR 1377
              ++D    K + ++R
Sbjct: 774  ALQTDPWTWKMLLNLR 789



 Score = 60.1 bits (144), Expect = 8e-06
 Identities = 43/168 (25%), Positives = 70/168 (41%), Gaps = 3/168 (1%)
 Frame = -3

Query: 1307 KGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKY--LDVD 1134
            +G  ++K +++ R   P   W  +VW     PK++F  W A  +RL T   L    L   
Sbjct: 890  QGFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949

Query: 1133 PTCKLCGNYLENASHLFFDCIVTRLLWDRV-KKWLKFSHSMSTIAGALKWTKKEHGAPAL 957
              C LC    E   HL   C  +  +W  V  +       + T A  L WT++   A A 
Sbjct: 950  AECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAA-AP 1008

Query: 956  KKARAVAVCCTIYQIWKARNCQVFEGLNFAEDTVFLKIQKDVYKVVFN 813
               R V     +Y +W+ RN  +   L  +   VF  + +++  V+ +
Sbjct: 1009 SLLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILS 1056


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  493 bits (1269), Expect = e-136
 Identities = 301/860 (35%), Positives = 457/860 (53%), Gaps = 37/860 (4%)
 Frame = -3

Query: 3947 VQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSF 3768
            + N  PGW    N+     G+I +LW+ S V + V+    Q+I C L    S +  + S 
Sbjct: 50   ISNLLPGWSFVENYEFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSI 108

Query: 3767 IYGLNTVGERRCLWNKMLELG---DLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEF 3597
            +Y  N  G R+ LWN++++L     ++   W++LGDFN I NP+  +N      + +  F
Sbjct: 109  VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINAN--IGRKIRAF 166

Query: 3596 HDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAWSNSNWRCSADIPVPGNV 3429
                    L ++   G  +TW N      +  ++DR L+N  W+       A+   P + 
Sbjct: 167  RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEP-DF 225

Query: 3428 SDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW-NLNFWGKAQFILCKKLKA 3252
            SDHS   V      L   +PF+FFN +  +PDFL  ++  W + N  G A + + KKLK 
Sbjct: 226  SDHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKH 285

Query: 3251 LKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKA--KETFLA 3078
            LK  +   +  ++S I  RV +    + + Q    T+P   ++  +  EL+A  K   LA
Sbjct: 286  LKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP---SVVHATLELEATRKWQILA 342

Query: 3077 KAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSV--ILED-GSKTSSFDDLS 2907
            KAE SF  QK+   +L   D NT +FH   K   +RK +N++  +++D G +  +   + 
Sbjct: 343  KAEESFFCQKSSISWLYEGDNNTAYFH---KMADMRKSINTINFLIDDFGERIETQQGIK 399

Query: 2906 KAF----VNYFKNLF-GTSFQTSPV--DLQTLQSGPCIDEDDFNLLSSPITQQAIKIALF 2748
            +       N+F++L  G   + S    D+  L S  C   D  N L    +   I+ A F
Sbjct: 400  EGIKEHSCNFFESLLCGVEGENSLAQSDMNLLLSFRC-SVDQINDLERSFSDLDIQEAFF 458

Query: 2747 DIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX----- 2583
             +   ++ GPDG+SS FFK  W VVG +V  AV EFF S ++L+Q N T           
Sbjct: 459  SLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNS 518

Query: 2582 ------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIR 2421
                  RPI+C N +YKVI K+L S+L+ ++ ++I P+QSAF+ GR +++N+ LA EI+ 
Sbjct: 519  SKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVH 578

Query: 2420 NYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICIN 2241
             Y    IS R  +KVDL+KA+D++ W F+      L  P  F+ W+ +C+ST  +S+ +N
Sbjct: 579  GYNTKNISSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVN 638

Query: 2240 GSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAF 2061
            GS  G F   +GLRQGDP+SP LF+L ME  S LLK R +     YHP+   L I+HL F
Sbjct: 639  GSSSGFFKSNKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMF 698

Query: 2060 ADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNI-LNLLNF 1884
            ADD+M+F  G   S+  + + L+DF + SGL VN  K+NL+ AG   D+++ + ++   F
Sbjct: 699  ADDVMVFFDGGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGT--DEVEALAISHYGF 756

Query: 1883 PLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVE 1704
            P+ TLP+RYLG+PL ++KL ++ Y      +      W   SLS+AGR+ LI SV+ G+ 
Sbjct: 757  PISTLPIRYLGLPLMSRKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLV 811

Query: 1703 CFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSW 1539
             FW+  F L    + +I  LC  FLW         + I WS VCLP +EGG+GLR    W
Sbjct: 812  NFWMSTFVLLLGCVKKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVGLRRFTPW 871

Query: 1538 NKALLAKILWNIHSKADSLW 1479
            NK    + +W + +  D LW
Sbjct: 872  NKTFYLRFIWPLFADNDVLW 891


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  493 bits (1269), Expect = e-136
 Identities = 333/1113 (29%), Positives = 513/1113 (46%), Gaps = 113/1113 (10%)
 Frame = -3

Query: 3890 GRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSFIYGLNTVGERRCLWNKMLE 3711
            GRI ++W  +     V  +  Q+I C +  +  +     SFIY  N V ERR LW  +  
Sbjct: 428  GRIWVVWRDNARLTPVFKSS-QMITCSILLEGKEEEFFCSFIYASNFVEERRILWEDIRS 486

Query: 3710 LGD---LIDSPWLLLGDFNTIKNPDEKLN--GEPFTSKSVEEFHDTCAYLGLSEVQSTGC 3546
              D   + + PW+L GDFN I    E  N    P+T              G+ + Q  G 
Sbjct: 487  HHDSPLIRNKPWILCGDFNEILEGGEHSNYDNSPYTPP------------GMRDFQEIG- 533

Query: 3545 YFTWTNNTIWCRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPF 3366
                          R ++ +A +                                  KPF
Sbjct: 534  --------------RLMLEAAATGGR-------------------------------KPF 548

Query: 3365 KFFNMWALHPDFLNTVQTAWN----LNFWGKAQFILCKKLKALKPSLKELNNFHFSHISS 3198
            KF N+    P FL  V++ W     L     A +   KKLK LKP L+EL       +  
Sbjct: 549  KFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKLGDLPK 608

Query: 3197 RVKQVKTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSD 3018
            R ++    L   Q     +P    + + +K      T L++ E  FL QK+K  ++N  D
Sbjct: 609  RTREAHILLCEKQATTLANPSQETIAEELKAY-TDWTHLSELEEGFLKQKSKLHWMNVGD 667

Query: 3017 RNTKFFHSIIKRNSLRKQMNSVILEDGSKTSSFDDL---SKAFVNYFKNLFGTSFQTSPV 2847
             N  +FH   +   +R  +  +   +     + +++   ++ F N F N     F    V
Sbjct: 668  GNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFHGISV 727

Query: 2846 -DLQTLQSGPCIDEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVG 2670
             DL+ L S  C   D  N+L+  +T + I+  LF + + +SPGPDG++S FFK +W + G
Sbjct: 728  EDLRNLMSYRCSVTDQ-NILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTG 786

Query: 2669 NDVIAAVSEFFDSSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASK 2523
             D IAA+  FF    + + +N T                 RPI+CCNV+YKVI+KILA++
Sbjct: 787  PDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILANR 846

Query: 2522 LESVVPKLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISW 2343
            L+ ++P  I   QSAF+  R + +N+ LA E++++Y +  ++PRCA+K+D+ KA+D++ W
Sbjct: 847  LKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQW 906

Query: 2342 SFLEQVLNGLGFPPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLL 2163
             FL   L  L FP  F  W+  C+STA++S+ +NG L G F   RGLRQG  +SP LF++
Sbjct: 907  QFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVI 966

Query: 2162 CMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFK 1983
            CM  LS ++          YHP+CE + +THL FADDLM+F  G   S++ +I+   +F 
Sbjct: 967  CMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFA 1026

Query: 1982 NASGLDVNSFKSNLFTAGIVGDKLDNILNLLNFPL--GTLPVRYLGVPLAAQKLNVNHYA 1809
              SGL ++  KS ++ AG+     D +  L +FP   G LPVRYLG+PL  +++    Y+
Sbjct: 1027 GRSGLQISLEKSTIYLAGVSAS--DRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYS 1084

Query: 1808 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 1629
            PL + + + I+ WTA SLSYAGRL L+ SV+  +  FW+  + LP+  I  I +LC  FL
Sbjct: 1085 PLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFL 1144

Query: 1628 W-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVH 1464
            W         + I WS +C P  EGGLG++ +   NK    K++W + S   SLWV W+ 
Sbjct: 1145 WSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIW 1204

Query: 1463 SFYLKNQSIWTWDPKKS-DSCLLKRICDIR------NEILAKFGS--------------- 1350
            +F ++  + W+ + + S  S + K++   R      +++  + GS               
Sbjct: 1205 TFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRL 1264

Query: 1349 ---------LDLAI---SNL-------------SLFANSKGLCSSKMYDLFREAGP---- 1257
                     +DL I   +NL             +   N       ++    REAGP    
Sbjct: 1265 LDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIYNRINAEIQRLQQQEREAGPDISL 1324

Query: 1256 ---------KTFWHSAVWKN-----------------FIPPKYSFCVWLAFNDRLATINN 1155
                     K F     W N                 +  PKYSF +WL   +RL+T + 
Sbjct: 1325 WRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDR 1384

Query: 1154 LKYLDVDP--TCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL---KFSHSMSTIAGALK 990
            +K  +     TC LC N  E   HLFF C  T  +W+ + + L    +S   + +   L 
Sbjct: 1385 IKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLC 1444

Query: 989  WTKKEHGAPALKKARAVAVCCTIYQIWKARNCQ 891
             +        L +        +IY IW+ RN +
Sbjct: 1445 TSNLPRDHLFLFR---YVFQASIYHIWRERNAR 1474


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
            [Arabidopsis thaliana]
          Length = 893

 Score =  491 bits (1265), Expect = e-136
 Identities = 300/860 (34%), Positives = 457/860 (53%), Gaps = 37/860 (4%)
 Frame = -3

Query: 3947 VQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSF 3768
            + N  PGW    N+     G+I +LW+ S V + V+    Q+I C L    S +  + S 
Sbjct: 50   ISNLLPGWSFVENYEFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSI 108

Query: 3767 IYGLNTVGERRCLWNKMLELG---DLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEF 3597
            +Y  N  G R+ LWN++++L     ++   W++LGDFN I NP+  +N      + +  F
Sbjct: 109  VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINAN--IGRKIRAF 166

Query: 3596 HDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAWSNSNWRCSADIPVPGNV 3429
                    L ++   G  +TW N      +  ++DR L+N  W+       A+   P + 
Sbjct: 167  RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEP-DF 225

Query: 3428 SDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW-NLNFWGKAQFILCKKLKA 3252
            SDHS   V      L   +PF+FFN +  +PDFL  ++  W + N  G A + + KKLK 
Sbjct: 226  SDHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKH 285

Query: 3251 LKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKA--KETFLA 3078
            LK  +   +  ++S I  RV +    + + Q    T+P   ++  +  EL+A  K   LA
Sbjct: 286  LKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP---SVVHATLELEATRKWQILA 342

Query: 3077 KAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSV--ILED-GSKTSSFDDLS 2907
            KAE SF  QK+   +L   D NT +FH   K   +RK +N++  +++D G +  +   + 
Sbjct: 343  KAEESFFCQKSSISWLYEGDNNTAYFH---KMADMRKSINTINFLIDDFGERIETQQGIK 399

Query: 2906 KAF----VNYFKNLF-GTSFQTSPV--DLQTLQSGPCIDEDDFNLLSSPITQQAIKIALF 2748
            +       N+F++L  G   + S    D+  L S  C   D  N L    +   I+ A F
Sbjct: 400  EGIKEHSCNFFESLLCGVEGENSLAQSDMNLLLSFRC-SVDQINDLERSFSDLDIQEAFF 458

Query: 2747 DIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX----- 2583
             +   ++ GPDG+SS FFK  W VVG +V  AV EFF S ++L+Q N T           
Sbjct: 459  SLPRNKASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNS 518

Query: 2582 ------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIR 2421
                  RPI+C N +YKVI K+L S+L+ ++ ++I P+QSAF+ GR +++N+ LA EI+ 
Sbjct: 519  SKMTDFRPISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVH 578

Query: 2420 NYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICIN 2241
             Y    IS R  +KVDL+KA+D++ W F+      L  P  F+ W+ +C+ST  +S+ +N
Sbjct: 579  GYNTKNISSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVN 638

Query: 2240 GSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAF 2061
            GS  G F   +GLRQGDP+SP LF+L ME  S LLK R +    +YHP+   L I+HL F
Sbjct: 639  GSSSGFFKSNKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMF 698

Query: 2060 ADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNI-LNLLNF 1884
            ADD+M+F  G   S+  + + L+DF + SGL VN  K+NL+ AG   D+++ + ++   F
Sbjct: 699  ADDVMVFFDGGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGT--DEVEALAISHYGF 756

Query: 1883 PLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVE 1704
            P+ TLP+RYLG+PL ++KL ++ Y      +      W   SLS+AGR+ LI SV+ G+ 
Sbjct: 757  PISTLPIRYLGLPLMSRKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLV 811

Query: 1703 CFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSW 1539
             FW+  F L    + +I  LC  FLW         + I WS VCLP +EGG+ LR    W
Sbjct: 812  NFWMSTFVLLLGCVKKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVALRRFTPW 871

Query: 1538 NKALLAKILWNIHSKADSLW 1479
            NK    + +W + +  D LW
Sbjct: 872  NKTFYLRFIWPLFADNDVLW 891


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score =  481 bits (1237), Expect = e-132
 Identities = 334/1074 (31%), Positives = 511/1074 (47%), Gaps = 39/1074 (3%)
 Frame = -3

Query: 3992 LETEVVPQSNRCDFIVQNKFPGWLATNNFHLIKNGRILILWNSSTVDLQVLDTDPQLIHC 3813
            LET V  ++   + ++ +  PGW   +N+   + GRI I+W+ S   L    TD Q++ C
Sbjct: 78   LETHVAQEN--ANSVLASTLPGWRMDSNYCCSELGRIWIVWDPSISVLVFKRTD-QIMFC 134

Query: 3812 CLTCKISQNSILTSFIYGLNTVGERRCLWNKMLELG---DLIDSPWLLLGDFNTIKNPDE 3642
             +       S   +F+YG N+  +RR LW  +L L     L  +PWLLLGDFN I    E
Sbjct: 135  SIKIPSLLQSFAVAFVYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASE 194

Query: 3641 --KLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTWTN----NTIWCRLDRALINSAW 3480
               +N      + +E+         LS++ S G +FTW+N    N I  +LDRAL N  W
Sbjct: 195  HYSINQSLLNLRGMEDLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEW 254

Query: 3479 SNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAWNL 3300
                    A    PG+ SDH+P ++    Q     K FK+F+  + HP +L  + TAW  
Sbjct: 255  FAVFPSALAVFDPPGD-SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEE 313

Query: 3299 N-FWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSAL 3123
            N   G   F L + LK  K   + LN   FS+I  R  Q  T L++ Q++L T P +  L
Sbjct: 314  NTLVGSHMFSLRQHLKVAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTL 372

Query: 3122 CDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILE 2943
                   + +  F A A  SF  QK++  +L+  D NT+FFH  +  +     +  +  +
Sbjct: 373  FRREHVARKQWIFFAAALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGD 432

Query: 2942 DGSKTSSFDDLSKAFVNYFKNLFGT-SFQTSPVDLQTLQSGPCIDEDDFNLLSSPIT--- 2775
            DG +  + D +    + Y+ +L G  S   +P  ++ ++       D F  L+S +T   
Sbjct: 433  DGFRVENVDQIKGMLIAYYSHLLGIPSENVTPFSVEKIKGLLPFRCDSF--LASQLTTIP 490

Query: 2774 -QQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTX 2598
             ++ I   LF +   ++PGPDGF   FF ++W +V + V+AA+ EFF S  + R  N T 
Sbjct: 491  SEEEITQVLFSMPRNKAPGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATA 550

Query: 2597 XXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITD 2451
                            RP+ACC  +YKVIT+I++ +L+  + + +   Q  FI GR + +
Sbjct: 551  ITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCE 610

Query: 2450 NIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECV 2271
            N+ LA E++ N+     + R  ++VD+ KAYD ++W FL  +L  L  P +FI W+  C+
Sbjct: 611  NVLLASELVDNFEADGETTRGCLQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCI 670

Query: 2270 STASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRC 2091
            S+ASYSI  NG L G F GK+G+RQGDPMS  LF+L M+ LS+ L +      F  HP C
Sbjct: 671  SSASYSIAFNGELIGFFQGKKGIRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNC 730

Query: 2090 ETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKL 1911
                ITHL+FADD+++FS G   S+  ++  L+DF+  SGL +N  K+ L   G    + 
Sbjct: 731  LAPIITHLSFADDVLVFSDGAASSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARN 790

Query: 1910 DNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLL 1731
             ++ + L    G+LPVRYLGVPL +QK+    Y PL DRI S    WTA  LS+AGRL L
Sbjct: 791  RSLADNLGITHGSLPVRYLGVPLMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQL 850

Query: 1730 IKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTSPIKWSKVCLPFDEGGLGLRD 1551
            +  + +  +   L+ F  P  +I  +        W  N     W+      D G L    
Sbjct: 851  LNWIWR--KLCKLRPFARP-FIICEVGSGVTASFWHDN-----WT------DHGPL---- 892

Query: 1550 VHSWNKA--LLAKILWNI---HSKADSLWVRWVHSFYLKNQSIWTWDPKKSDSCLLKRIC 1386
            +H    A  LLA +  N     +  D  W                   + S S     + 
Sbjct: 893  LHLTGPAGPLLAGLPLNSVVRDALRDDTW-------------------RISSSRSRNPVI 933

Query: 1385 DIRNEILAKFGSL------DLAISNLSLFANSKGLCSSKMYDLFREAGPKTFWHSAVWKN 1224
             +   +L    SL      D  +  +   A S    ++  +   + +     WH AVW  
Sbjct: 934  TLLQRVLPSAASLIDCPHDDTYLWKIGHHAPSNRFSTADTWSYLQPSSTSVLWHKAVWFK 993

Query: 1223 FIPPKYSFCVWLAFNDRLATINNLKY--LDVDPTCKLCGNYLENASHLFFDCIVTRLLWD 1050
               PK +F  W+  ++RL T + L+     + PTC LC +  E+  HLFF C  +  +W 
Sbjct: 994  DHVPKQAFICWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESREHLFFRCQFSSEIWS 1053

Query: 1049 RVKKWLKFSHSMSTIAGALKWTKKEHGAPALKKARAVAVCCTIYQIWKARNCQV 888
               + L   +        L WT        +     +    ++Y IW+ RN ++
Sbjct: 1054 FFMRALNL-NPPPQFMHCLLWTLTASRDRNITLITKLLFHASVYFIWRERNLRI 1106


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  479 bits (1233), Expect = e-132
 Identities = 309/975 (31%), Positives = 458/975 (46%), Gaps = 25/975 (2%)
 Frame = -3

Query: 3680 LLGDFNTIKNPDEKLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNTIWCRLDR 3501
            ++GDFN  +  +E + G    + +++EF+                               
Sbjct: 95   VIGDFNVTRRCEETIGGNSRFTNAMDEFNS------------------------------ 124

Query: 3500 ALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNT 3321
             L NS   + N+   + +P PG +SDH+ +VV       +   PFKFFN  A   DF+  
Sbjct: 125  CLHNSKLDDLNYSVLSFLP-PG-ISDHAAMVVKVGLPFRIRKAPFKFFNFLADREDFIPI 182

Query: 3320 VQTAWNLNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTD 3141
            V   W  N WG  QF + +KLK +K   K LN                            
Sbjct: 183  VSAVWATNVWGSKQFQVWRKLKLVKNQFKLLN---------------------------- 214

Query: 3140 PLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQM 2961
                  C+ V++L              L +K++  +L   D+N+ FF   + ++  R ++
Sbjct: 215  ------CNVVEKL--------------LKKKSRVQWLKKGDKNSTFFFKTMTKHRNRNRI 254

Query: 2960 NSVILEDGSKTSSFDDLSKAFVNYFKNLFGTSFQTSPVDLQTLQSGPCIDEDDFNLLSSP 2781
             ++   DG       DL+K+  N F                                   
Sbjct: 255  ATINRSDGP------DLAKSLCNEF----------------------------------- 273

Query: 2780 ITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAA-VSEFFDSSKILRQINH 2604
             T   I+   F +   +SPGPDGF+  FF+K+W V+G++V+AA V EFF    +L ++N 
Sbjct: 274  -THDDIRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNS 332

Query: 2603 TXXXXXX-----------RPIACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNI 2457
            T                 RPI+CCN  YK+I K+LA++L+  +  ++ P+QS FI GR I
Sbjct: 333  TIITLVPKVANPTTMSDFRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRI 392

Query: 2456 TDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVME 2277
             DNI LAQEII +Y +    PRC   VD+ KA DT+ W F+   L     P   I W+  
Sbjct: 393  GDNILLAQEIICDYHKADGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKS 452

Query: 2276 CVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTS-TFKYH 2100
            C+S+A +S+C+NG L G F  +RGLRQGDP+SP LF++ ME LS  ++ R N S  F+YH
Sbjct: 453  CISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYH 512

Query: 2099 PRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVG 1920
             RC+ L ++HL FADDL++F  GD  SV+ L D  ++F++ S L  N  +S +F AG+ G
Sbjct: 513  WRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDG 572

Query: 1919 DKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGR 1740
            +  D++L + NF LGT PVRYLG+PL   KL +   +PL DRI + I  W    LS+AGR
Sbjct: 573  NSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGR 632

Query: 1739 LLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLW-----GKNTSPIKWSKVCLPFD 1575
            L LI+SVL  ++ +W     LP  V+  I +  R FLW     G+  + + WS++CLP  
Sbjct: 633  LQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKC 692

Query: 1574 EGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQSIWTWDPKKSDSCLLK 1395
            EGGLG++D+H WNKAL+   +WN+ S + + W  WV  + LK  S W        +  L 
Sbjct: 693  EGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFW--------NAPLP 744

Query: 1394 RICDIRNEILAKFGSLDLAISNLSLFAN--SKGLCSSKMYDLFREAGPKTF-WHSAVWKN 1224
             IC      L K   L       S F N    G  +S  +D +   GP T  W S +   
Sbjct: 745  SICSWNWRKLLKIREL-----CCSFFVNIIGDGRATSLWFDNWHPLGPLTLRWSSNIIGE 799

Query: 1223 FIPPKYSFCVWLAFNDRLATINNLK-YLDVDPTCKLCGNYLENASHLFFDCIVTRLLWDR 1047
                K +      F    +  N L+    + P  +L     E  +HLFFDC  +  +W  
Sbjct: 800  SGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLFFDCAYSFGIWTH 859

Query: 1046 VKKWLKFSHSMSTIAGALKWTK---KEHGAPALKKARAVAVCCTIYQIWKARNCQVFEGL 876
            V      S  +   +  + W     K +  P +     +A+   +Y IW+ RN + F   
Sbjct: 860  VLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVV--ILKLALQAVVYAIWRERNNRRFRNE 917

Query: 875  NFAEDTVFLKIQKDV 831
            +     VF  I + +
Sbjct: 918  SLPPAVVFKGIVESI 932


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  478 bits (1230), Expect = e-132
 Identities = 325/1048 (31%), Positives = 511/1048 (48%), Gaps = 62/1048 (5%)
 Frame = -3

Query: 3788 NSILTSFIYGLNTVGERRCLWNKMLELGDLIDS---PWLLLGDFNTIKNPDEKLNGEPFT 3618
            +S++ S +Y  N    R+ LW ++L L   +     PW++LGDFN +  P E        
Sbjct: 51   DSVVVSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLN 110

Query: 3617 -SKSVEEFHDTCAYLGLSEVQSTGCYFTWTNNT----IWCRLDRALINSAWSNSNWRCSA 3453
             ++ ++ F D      L ++   G  FTW N +    +  +LDR L+N +W +      A
Sbjct: 111  VNRRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYA 170

Query: 3452 DIPVPGNVSDHSPIVVSFFEQNLVLSKPFKFFNMWALHPDFLNTVQTAW-NLNFWGKAQF 3276
                P + SDH+   V          +PF+F+N    +PDF++ V   W ++N  G + F
Sbjct: 171  VFGEP-DFSDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMF 229

Query: 3275 ILCKKLKALKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPL--NSALCDSVKEL 3102
             + KKLKALK  ++  +  +FS++  RVK+    +   Q +  +DP   N+AL     E 
Sbjct: 230  KMSKKLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAAL---EMEA 286

Query: 3101 KAKETFLAKAERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILEDGSKTSS 2922
            + K   L KAE SF  Q+++  ++   D NT +FH +         ++ +I ++G K  +
Sbjct: 287  QRKWLILVKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDT 346

Query: 2921 FDDLSKAFVNYFKNLFGTSFQTSPV---DLQTLQSGPCIDEDDFNLLSSPITQQAIKIAL 2751
               + +  + YF NL G       +   D   L    C   D    L+   ++Q IK A 
Sbjct: 347  QLGIKEHCIEYFSNLLGGEVGPPMLIQEDFDLLLPFRC-SHDQKKELAMSFSRQDIKSAF 405

Query: 2750 FDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX---- 2583
            F     ++ GPDGF   FFK++W V+G +V  AVSEFF SS +L+Q N T          
Sbjct: 406  FSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITN 465

Query: 2582 -------RPIACCN----VVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLA 2436
                   RPI+C +     +YKVI ++L ++L+ ++ ++I P QSAF+ GR + +N+ LA
Sbjct: 466  ASKMNDFRPISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLA 525

Query: 2435 QEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASY 2256
             E+++ Y R  I PR  +KVDL+KA+D+I W F+   L  +G P  F+ W+ +C+ST ++
Sbjct: 526  TELVQGYNRQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTF 585

Query: 2255 SICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKI 2076
            S+C+NG+  G F   RGLRQG+P+SP LF+L ME  S LL  R       YHP+   L I
Sbjct: 586  SVCVNGNTGGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSI 645

Query: 2075 THLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILN 1896
            +HL FADD+M+F  G   S+  + + L DF   SGL +N  K++L+ AG+  D+++    
Sbjct: 646  SHLMFADDIMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAGL--DRIE---- 699

Query: 1895 LLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVL 1716
                               A+KL +  Y PL +++A     W+   LS+AGR+ LI SV+
Sbjct: 700  ---------------ASTIARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVI 744

Query: 1715 QGVECFWLQIFPLPSTVIDRINRLCRVFLWG-----KNTSPIKWSKVCLPFDEGGLGLRD 1551
             G+  FW+  F LP   + RI  LC  FLW      K  + + WS+VCLP +EGG+GLR 
Sbjct: 745  SGIINFWISTFILPKGCVKRIEALCARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLR- 803

Query: 1550 VHSWNKALLAKILWNIHSKADSLWV-RWV---HSFYLKNQS------------------- 1440
                   +L   LW+   K  S W   W      F L   S                   
Sbjct: 804  ----RFTVLNTTLWD--GKKISFWFDNWSPLGPLFKLFGSSGPRALCIPIQAKVADACSD 857

Query: 1439 -IWTWDPKKSDSCLLKRICDIRNEILAKFGSLDLAISNLSLFANSKGLCSSKMYDLFREA 1263
              W   P ++D  L   I      +     S D  +  +  F    G  +++ ++  R  
Sbjct: 858  VGWLISPPRTDQALALLIHLTTIALPCFDSSPDTFVWIVDDF-TCHGFSAARTWEAMRPK 916

Query: 1262 GPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDVDPT--CKLCGNYLENASH 1089
             P   W  +VW     PK++F +W++  +RL T   L    V  T  C LC +  E+  H
Sbjct: 917  KPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTTDCCLCSSRPESRDH 976

Query: 1088 LFFDCIVTRLLWDRVKKWLKFSHSM-STIAGALKWTK-KEHGAPALKKARAVAVCCTIYQ 915
            L   C+ + ++W  V   L  S ++ ++ A  L WT+     AP+L   R +A   +++ 
Sbjct: 977  LLLYCVFSAVIWKLVFFRLTPSQAIFNSWAELLSWTRINSSKAPSL--LRKIAAQASVFH 1034

Query: 914  IWKARNCQVFEGLNFAEDTVFLKIQKDV 831
            +WK RN  +   +  +  TVF  I +++
Sbjct: 1035 LWKQRNNVLHNSIFISPATVFHFIDREL 1062


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  460 bits (1183), Expect = e-126
 Identities = 271/817 (33%), Positives = 406/817 (49%), Gaps = 109/817 (13%)
 Frame = -3

Query: 3020 DRNTKFFHSIIKRNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGTSFQTSPVDL 2841
            DRN K FH  I        +  ++  DG   +S  D+    VNYF++      QT P D 
Sbjct: 91   DRNNKTFHRAITTREAVNSIREIVTRDGLVVTSQQDIQTEAVNYFQDFL----QTIPADY 146

Query: 2840 QTLQSGPCIDE-----------DDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFF 2694
            +    G C++E           DD  LL+  +T + IK  +F +  ++SPGPDG++S F+
Sbjct: 147  E----GMCVEELENLLPFRCSEDDHRLLTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFY 202

Query: 2693 KKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX-----------RPIACCNVVYKV 2547
            K SW+++G++VI A+  FF    + + +N T                 RPI+CCNV+YK 
Sbjct: 203  KASWEIIGDEVIIAIQSFFAKGFLPKGVNSTILALIPKKKEAREIKDYRPISCCNVLYKA 262

Query: 2546 ITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLK 2367
            I+KILA++L+ ++PK I   QSAF+  R + +N+ LA E++++Y +  IS RCA+K+D+ 
Sbjct: 263  ISKILANRLKRILPKFIVGNQSAFVKDRLLIENVLLATELVKDYHKDSISTRCAMKIDIS 322

Query: 2366 KAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDP 2187
            KA+D++ WSFL  VL  + FP  FI W+  C+STAS+SI +NG L G F   RGLRQG  
Sbjct: 323  KAFDSLQWSFLTHVLAAMNFPGEFIHWISLCMSTASFSIQVNGELAGYFRSARGLRQGCS 382

Query: 2186 MSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKIL 2007
            +SP LF++ M+ LSR+L        F YHPRC+TL +THL FADDLM+ + G + SV  +
Sbjct: 383  LSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLCFADDLMILTDGKIRSVDGI 442

Query: 2006 IDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKL 1827
            +  LN F    GL +   K+ L+ AG+       + +  +F +G LPVRYLG+PL  ++L
Sbjct: 443  VKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSFGVGKLPVRYLGLPLVTKRL 502

Query: 1826 NVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINR 1647
              + Y+PL D+I   I  WT+  LS+AGRL LI SVL  +  FW+  F LP   I+ INR
Sbjct: 503  TTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINR 562

Query: 1646 LCRVFLW-GKNTSP----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSL 1482
            +    LW G   +P    + W ++C P  EGGLGL+ +   NK    K++W + S  DSL
Sbjct: 563  ISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSL 622

Query: 1481 WVRWVHSFYLKNQSI-----------WTWDPKKSDSCLLKRIC--DIRNEILAKF----- 1356
            WV+W     LK +S            W W        + K  C  ++ N +   F     
Sbjct: 623  WVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNW 682

Query: 1355 -------------GSLDLAISNLSLFANS------------------------------- 1308
                         G++D+ IS     A +                               
Sbjct: 683  SEKGPLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIE 742

Query: 1307 --------------KGLCSSK-MYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDR 1173
                          K   S+K  ++  R +  +  WH  VW     PK+SFC WLA  +R
Sbjct: 743  LEDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNR 802

Query: 1172 LATINNLKYLD--VDPTCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL---KFSHSMST 1008
            L+T + +   +     TC  C + +E   HLFF C  +  +W  + K +   +FS   S 
Sbjct: 803  LSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKDRFSTKWSA 862

Query: 1007 IAGALKWTKKEHGAPALKKARAVAVCCTIYQIWKARN 897
            +   +  ++ +     L +        +I+ IW+ RN
Sbjct: 863  VVNYISDSQPDRIQSFLSR---YTFQVSIHSIWRERN 896


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  453 bits (1165), Expect = e-124
 Identities = 269/820 (32%), Positives = 428/820 (52%), Gaps = 38/820 (4%)
 Frame = -3

Query: 3743 ERRCLWNKMLELGD---LIDSPWLLLGDFNTIKNPDEKLNGE--PFTSKSVEEFHDTCAY 3579
            ER+ LWN + +  D   +   PW++ GDFN I + +E  N    P T+  + +F     +
Sbjct: 3    ERKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNH 62

Query: 3578 LGLSEVQSTGCYFTWTN----NTIWCRLDRALINSAWSNSNWRCSADIPVPGNVSDH--S 3417
              ++++   G  FTW+N    + I  +LDR L+N  W  S  R S  +   G  SDH   
Sbjct: 63   CSITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQSFPR-SYSVFEAGGCSDHLRC 121

Query: 3416 PIVVSFFEQNLVLSK-PFKFFNMWALHPDFLNTVQTAWN----LNFWGKAQFILCKKLKA 3252
             I ++     +V  K PFKF N+      F+ TV++ WN    +     + F   KKLK 
Sbjct: 122  RINLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKG 181

Query: 3251 LKPSLKELNNFHFSHISSRVKQVKTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAKA 3072
            LKP L+ L      ++  + K+    L   Q     +P  S++ +   E  AK   +A  
Sbjct: 182  LKPLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEE-NEAYAKWDHIAVL 240

Query: 3071 ERSFLSQKAKCDFLNNSDRNTKFFHSIIKRNSLRKQMNSVILEDGSKTSSFDDLSKAFVN 2892
            E  FL Q++K  +L+  DRN K FH  +     +  +  +I  DGS  S  + +     +
Sbjct: 241  EEKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEH 300

Query: 2891 YFKN---LFGTSFQTSPVD-LQTLQSGPCIDEDDFNLLSSPITQQAIKIALFDIEDERSP 2724
            +F+    L    F+   V+ LQ L    C D D   +L++ ++ + I   +F + +++SP
Sbjct: 301  HFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDK-EMLTNHVSAEEIHKVVFSMPNDKSP 359

Query: 2723 GPDGFSSGFFKKSWDVVGNDVIAAVSEFFDSSKILRQINHTXXXXXX-----------RP 2577
            GPDG+++ F+K +W+++G + I A+  FF    + + IN T                 RP
Sbjct: 360  GPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRP 419

Query: 2576 IACCNVVYKVITKILASKLESVVPKLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRIS 2397
            I+CCNV+YKVI+KI+A++L+ V+PK I   QSAF+  R + +N+ LA EI+++Y +  +S
Sbjct: 420  ISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVS 479

Query: 2396 PRCAIKVDLKKAYDTISWSFLEQVLNGLGFPPIFISWVMECVSTASYSICINGSLQGKFP 2217
             RCA+K+D+ KA+D++ W FL  VL  + FPP F  W+  C++TAS+S+ +NG L G F 
Sbjct: 480  SRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFS 539

Query: 2216 GKRGLRQGDPMSPALFLLCMEYLSRLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFS 2037
              R LRQG  +SP LF++ M+ LS++L        F YHP+C  + +THL+FADDLM+ S
Sbjct: 540  SARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILS 599

Query: 2036 RGDLPSVKILIDCLNDFKNASGLDVNSFKSNLFTAGIVGDKLDNILNLLNFPLGTLPVRY 1857
             G + S+  ++  L +F   SGL ++  KS ++ AG+       I+   +F +G LPVRY
Sbjct: 600  DGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRY 659

Query: 1856 LGVPLAAQKLNVNHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPL 1677
            LG+PL +++L  +   PL +++   I  WT+  LS+AGRL LI S L  +  FW+  F L
Sbjct: 660  LGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRL 719

Query: 1676 PSTVIDRINRLCRVFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKIL 1512
            P   I  I++LC  FLW       N + + W  +C P  E         +W+K      +
Sbjct: 720  PRACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKKE---------AWHKG-----V 765

Query: 1511 WNIHSKADSLWVRW--VHSFYLKNQSIWTWDPKKSDSCLL 1398
            W  H      +  W  + +     Q +  W+ + S  C+L
Sbjct: 766  WFAHETPKHSFCVWLAIWNKLSTGQRMQHWNLQSSVGCVL 805



 Score = 71.6 bits (174), Expect = 3e-09
 Identities = 42/125 (33%), Positives = 66/125 (52%), Gaps = 5/125 (4%)
 Frame = -3

Query: 1256 KTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDVDPT--CKLCGNYLENASHLF 1083
            K  WH  VW     PK+SFCVWLA  ++L+T   +++ ++  +  C LC N LE   HLF
Sbjct: 758  KEAWHKGVWFAHETPKHSFCVWLAIWNKLSTGQRMQHWNLQSSVGCVLCNNNLETRDHLF 817

Query: 1082 FDCIVTRLLWDRVKKWL---KFSHSMSTIAGALKWTKKEHGAPALKKARAVAVCCTIYQI 912
            F C  T  +W+ + K L    ++    TI   +  + + H   +   AR+V +  ++Y I
Sbjct: 818  FSCAYTSGIWEALAKNLLQRSYTTDWQTIISYV--SGQCHDRVSCFLARSV-LQASVYTI 874

Query: 911  WKARN 897
            W+ RN
Sbjct: 875  WRERN 879


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  443 bits (1140), Expect = e-121
 Identities = 278/861 (32%), Positives = 436/861 (50%), Gaps = 20/861 (2%)
 Frame = -3

Query: 3902 LIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSFIYGLNTVGERRCLWN 3723
            ++ N + + L++S     +VL   PQ +H  +T       I T+F+Y   T  ER  LWN
Sbjct: 906  IVNNSQKIWLFHSVEFICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWN 965

Query: 3722 KMLELGDLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCY 3543
             +  L   ++ PW++ GDFN I   +E+L G      S+E+F       GL +    G  
Sbjct: 966  CLRNLAADMEGPWIVGGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNP 1025

Query: 3542 FTWTNNTIWCRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFK 3363
            FTWTNN ++ RLDR + N  W N  +  +    +  + SDH P+++S    +      F+
Sbjct: 1026 FTWTNNRMFQRLDRMVYNQQWINK-FPITRIQHLNRDGSDHCPLLLSCSNSSEKAPSSFR 1084

Query: 3362 FFNMWALHPDFLNTVQTAWNLNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQV 3183
            F + WALH +F  +V+  WNL   G        K K LK  LK  N   F  I S +K+ 
Sbjct: 1085 FLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEA 1144

Query: 3182 KTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKF 3003
            +  ++  +I LH           + +  A+       E  F  QK+   ++   +RNTKF
Sbjct: 1145 EKRVEECEI-LHQQEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERNTKF 1203

Query: 3002 FHSIIKRNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGTSFQTSPVDLQTLQSG 2823
            FH  +++  +R  +  +  +DG+     + L ++ +++F +L     +    D    QS 
Sbjct: 1204 FHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLL----KAESCDDTRFQSS 1259

Query: 2822 PC---IDEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAA 2652
             C   I + D   L +  T Q +K A+F I+ E + GPDGFSS F+++ WD++ +D+  A
Sbjct: 1260 LCPSIISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEA 1319

Query: 2651 VSEFFDSSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVP 2505
            V EFF  + I + +  T                 RPI+ C V+ K+ITKILA++L  ++P
Sbjct: 1320 VKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILP 1379

Query: 2504 KLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQV 2325
             +I   QS F+ GR I+DNI LAQE+I    +       A+K+D+ KAYD + WSFL +V
Sbjct: 1380 SIITENQSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKV 1439

Query: 2324 LNGLGFPPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLS 2145
            L  LGF   +I  + +C+S   +S+ +NG   G F  +RGLRQGD +SP LF+L  EYL+
Sbjct: 1440 LQHLGFNAQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLA 1499

Query: 2144 RLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLD 1965
            R L    +     ++    +L ++HLAFADD+++F+ G   +++ ++  L +++  SG  
Sbjct: 1500 RGLNALYDQYPSLHYSSGCSLSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQR 1559

Query: 1964 VNSFKSNLFT-AGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIA 1788
            +N  KS + T   +   +   IL    F    LP+ YLG PL      V  +  L  +I 
Sbjct: 1560 INPQKSCVVTHTNMASSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIE 1619

Query: 1787 SYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTSP 1608
              I  W   +LS  GR+ L++S L  +  + LQ+   P  V++RINRL   FLWG +T+ 
Sbjct: 1620 ERITGWENKTLSPGGRITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTAS 1679

Query: 1607 -----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQ 1443
                   W K+ LP  EGGL +R+V    +A   K+ W   +  +SLW +++ + Y   Q
Sbjct: 1680 KRIHWASWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRT-TNSLWTQFMRAKYCGGQ 1738

Query: 1442 SIWTWDPKKSDSCLLKRICDI 1380
                  PK  DS   KR+  I
Sbjct: 1739 LPTDVQPKLHDSQTWKRMVTI 1759


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  437 bits (1123), Expect = e-119
 Identities = 273/858 (31%), Positives = 429/858 (50%), Gaps = 20/858 (2%)
 Frame = -3

Query: 3902 LIKNGRILILWNSSTVDLQVLDTDPQLIHCCLTCKISQNSILTSFIYGLNTVGERRCLWN 3723
            ++ + + + L++S  +   ++   PQ +H  LT    +     +F+Y   T  ER  LW+
Sbjct: 943  IVNSSQKIWLFHSLELHSDIILDHPQCLHVRLTSPWLEKPFFATFVYAKCTRSERTLLWD 1002

Query: 3722 KMLELGDLIDSPWLLLGDFNTIKNPDEKLNGEPFTSKSVEEFHDTCAYLGLSEVQSTGCY 3543
             +  L    + PWL+ GDFN I   +E+L G      S+E+F       GL +    G  
Sbjct: 1003 CLRRLAADNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNP 1062

Query: 3542 FTWTNNTIWCRLDRALINSAWSNSNWRCSADIPVPGNVSDHSPIVVSFFEQNLVLSKPFK 3363
            FTWTNN ++ RLDR + N  W N  +  +    +  + SDH P+++S F  +      F+
Sbjct: 1063 FTWTNNRMFQRLDRVVYNHQWINM-FPITRIQHLNRDGSDHCPLLISCFISSEKSPSSFR 1121

Query: 3362 FFNMWALHPDFLNTVQTAWNLNFWGKAQFILCKKLKALKPSLKELNNFHFSHISSRVKQV 3183
            F + W LH DF  +V+  WNL   G        K   LK  LK  N   F  I S++K+ 
Sbjct: 1122 FQHAWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEA 1181

Query: 3182 KTALKNSQIQLHTDPLNSALCDSVKELKAKETFLAKAERSFLSQKAKCDFLNNSDRNTKF 3003
            +  ++  +I LH          ++ +  A+       E  F  QK+   ++   +RNTKF
Sbjct: 1182 EKRVEECEI-LHQQEQTVGSRINLNKSYAQLNKQLNVEEIFWKQKSGVKWVVEGERNTKF 1240

Query: 3002 FHSIIKRNSLRKQMNSVILEDGSKTSSFDDLSKAFVNYFKNLFGTSFQTSPVDLQTLQSG 2823
            FH  +++  +R  +  V   DG      + L ++ + YF +L     +  P D+   Q+ 
Sbjct: 1241 FHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLL----KAEPCDISRFQNS 1296

Query: 2822 ---PCIDEDDFNLLSSPITQQAIKIALFDIEDERSPGPDGFSSGFFKKSWDVVGNDVIAA 2652
                 I   +  LL +    Q +K A+FDI+ E + GPDGFSS F+++ W+ + +D++ A
Sbjct: 1297 LIPSIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDA 1356

Query: 2651 VSEFFDSSKILRQINHTXXXXXX-----------RPIACCNVVYKVITKILASKLESVVP 2505
            V +FF  + I R +  T                 RPI+ C V+ K+ITK+L+++L  ++P
Sbjct: 1357 VRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILP 1416

Query: 2504 KLIDPAQSAFISGRNITDNIFLAQEIIRNYARVRISPRCAIKVDLKKAYDTISWSFLEQV 2325
             +I   QS F+ GR I+DNI LAQE+IR           A+K+D+ KAYD + WSFL +V
Sbjct: 1417 SIITENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKV 1476

Query: 2324 LNGLGFPPIFISWVMECVSTASYSICINGSLQGKFPGKRGLRQGDPMSPALFLLCMEYLS 2145
            L   GF   +I  + +C+S   +S+ +NG ++G F  +RGLRQGD +SP LF+L  EYLS
Sbjct: 1477 LQHFGFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLS 1536

Query: 2144 RLLKVRTNTSTFKYHPRCETLKITHLAFADDLMLFSRGDLPSVKILIDCLNDFKNASGLD 1965
            R L    +     ++     L ++HLAFADD+++F+ G   +++ ++  L +++  SG  
Sbjct: 1537 RGLNALYDQYPSLHYSSGVPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQR 1596

Query: 1964 VNSFKSNLFT-AGIVGDKLDNILNLLNFPLGTLPVRYLGVPLAAQKLNVNHYAPLYDRIA 1788
            +N+ KS   T   I   +   I     F    LP+ YLG PL      V  +  L  +I 
Sbjct: 1597 INAQKSCFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIE 1656

Query: 1787 SYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWG----- 1623
              I  W    LS  GR+ L++SVL  +  + LQ+   P  V++R+NRL   FLWG     
Sbjct: 1657 ERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAAS 1716

Query: 1622 KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWVHSFYLKNQ 1443
            K      W+K+ LP  EGGL +R +    +A   K+ W   +  DSLW R++   Y + Q
Sbjct: 1717 KRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFRT-TDSLWTRFMRMKYCRGQ 1775

Query: 1442 SIWTWDPKKSDSCLLKRI 1389
                  PK  DS   KR+
Sbjct: 1776 LPMQTQPKLHDSQTWKRM 1793


Top