BLASTX nr result

ID: Mentha26_contig00039396 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00039396
         (1480 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   501   e-139
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   474   e-131
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...   439   e-120
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   423   e-115
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   399   e-108
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   399   e-108
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   399   e-108
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               394   e-107
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   393   e-106
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               390   e-106
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                389   e-105
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   379   e-102
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   377   e-102
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   369   1e-99
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   364   5e-98
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           364   5e-98
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       361   4e-97
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   359   2e-96
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   349   2e-93
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   347   1e-92

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  501 bits (1291), Expect = e-139
 Identities = 235/455 (51%), Positives = 329/455 (72%)
 Frame = +1

Query: 109  EIREFYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFSMDSSKA 288
            EI EFY+ L+GT A  L  VD NT+  G  L    +++L+  V   E+ EAL  + + KA
Sbjct: 396  EILEFYKKLLGTRASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKA 455

Query: 289  PGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFR 468
            PG+DGFN YFFK+ W  I +E+   +Q+FF+   + + +N  ++TL+PK  +A+ VK+FR
Sbjct: 456  PGLDGFNAYFFKKSWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFR 515

Query: 469  PIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQL 648
            PIACCTV+YKIISK+L NR+K ++  ++++ QS F+ GR I DNI+L+ ELI+ YTRK +
Sbjct: 516  PIACCTVIYKIISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHM 575

Query: 649  SPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAF 828
            SPRC++KVDI+KAYDSVEW FL+ +L E GFP RF+ WIM C++TVSY + VNG  T+ F
Sbjct: 576  SPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPF 635

Query: 829  EARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLF 1008
            +ARKG+RQGDP+SP+LF +CMEYL+RCL EL  +  F++HPKC+R+ + HL FADDLL+F
Sbjct: 636  QARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMF 695

Query: 1009 TRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFK 1188
             R D +S+  +     KF+ ASGL A+  KSNIYF GV     +E+ +   M  GELPF+
Sbjct: 696  CRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFR 755

Query: 1189 YLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFI 1368
            YLGVPL+SKKL+  QC+PL++ +  R   W +KLLSYAGR+QL+KS+L  +Q YW+ IF 
Sbjct: 756  YLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFP 815

Query: 1369 LPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            L +K+++ ++  CR FLWTG+TE +K+A +AW  I
Sbjct: 816  LSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATI 850


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  474 bits (1219), Expect = e-131
 Identities = 225/453 (49%), Positives = 318/453 (70%)
 Frame = +1

Query: 109  EIREFYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFSMDSSKA 288
            EI  FY+ L+GT++ +L  +D + +  G KL  T    LV  +T +E+ +AL  +D +KA
Sbjct: 399  EICNFYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKA 458

Query: 289  PGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFR 468
            PG+DGFN  FFK+ W +I +E+   +  FF  G + K +N   +TLIPK D A   KD+R
Sbjct: 459  PGLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYR 518

Query: 469  PIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQL 648
            PIACC+ LYKIISKIL  RL+ V+  ++   Q+ F+  R I DNI+L+ ELI+ Y R+ +
Sbjct: 519  PIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHV 578

Query: 649  SPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAF 828
            SPRC++KVDI+KAYDSVEW FL+ ML ELGFP  FI WIM C+ TVSY I +NG  +  F
Sbjct: 579  SPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPF 638

Query: 829  EARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLF 1008
            +A+KG+RQGDP+SP+LF + MEYL+RC+  +  +  F++HPKC+R+ L HL FADDLL+F
Sbjct: 639  DAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMF 698

Query: 1009 TRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFK 1188
             R D +SI ++++  + F+ ASGL+A+  KS IYFGGV     +++ +   M  G LPF+
Sbjct: 699  ARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFR 758

Query: 1189 YLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFI 1368
            YLGVPL+SKKL+  QC+PLI K+  R   W + LLSYAGR+QL+K++L+ +Q YW QIF 
Sbjct: 759  YLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFP 818

Query: 1369 LPQKILKLIQSACRTFLWTGRTEVSKRALIAWE 1467
            LP+K++K +++ CR FLWTG  + S +A +AW+
Sbjct: 819  LPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWD 851


>gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 402

 Score =  439 bits (1128), Expect = e-120
 Identities = 202/322 (62%), Positives = 258/322 (80%)
 Frame = +1

Query: 109  EIREFYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFSMDSSKA 288
            EIR FY  LMG++ + L MVDKN + RGP L   QQ  L +  T  EVK  LFSMDSSKA
Sbjct: 79   EIRGFYLKLMGSSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAVEVKNVLFSMDSSKA 138

Query: 289  PGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFR 468
            PGIDG+NV+FFK  W+IIG+ VI A+  FF  G +PK +N   +TL+PK  N ++VK+FR
Sbjct: 139  PGIDGYNVHFFKCSWNIIGDSVIDAILDFFKTGFMPKIINCTYMTLLPKEVNVTSVKNFR 198

Query: 469  PIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQL 648
            PIACC+V+YKIISKIL +R++ VL++++S+NQSAFVKGR+IFDNIILSHEL+KSY+RK +
Sbjct: 199  PIACCSVIYKIISKILTSRMQGVLNSVVSENQSAFVKGRVIFDNIILSHELVKSYSRKGI 258

Query: 649  SPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAF 828
            SPRCMVK+D+QKAY+SVEWPF+K +++ELGF ++F+NW+M CLTT SY  N+NGDLT  F
Sbjct: 259  SPRCMVKIDLQKAYNSVEWPFIKHLMLELGFSYKFVNWVMGCLTTASYTFNINGDLTRPF 318

Query: 829  EARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLF 1008
             A+KG+RQGDPISPYLFVICMEYLN CL++L  N  F +HP+CKR+ LIH+CF DDLLLF
Sbjct: 319  AAKKGLRQGDPISPYLFVICMEYLNICLIQLRKNAAFRFHPRCKRLNLIHVCFVDDLLLF 378

Query: 1009 TRGDINSIQQLLSVLDKFAAAS 1074
            +RGD++S+ QL      F+AAS
Sbjct: 379  SRGDVDSVSQLFEAFSLFSAAS 400


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  423 bits (1087), Expect = e-115
 Identities = 199/420 (47%), Positives = 287/420 (68%)
 Frame = +1

Query: 214  QKALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDL 393
            Q++L+  VT +E+++ LF M S K+PG DG+   FFK  W IIG+E   AVQ FF+ G L
Sbjct: 446  QQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFL 505

Query: 394  PKEVNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAF 573
            PK +N  ++ LIPK   A  +KD+RPI+CC VLYK+ISKI+ANRLK+VL   I+ NQSAF
Sbjct: 506  PKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAF 565

Query: 574  VKGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRF 753
            VK RL+ +N++L+ EL+K Y +  +S RC +K+DI KA+DSV+WPFL  +   LGFP  F
Sbjct: 566  VKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREF 625

Query: 754  INWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNR 933
            I+WI  C+TT S+ + VNG+L   F++ +G+RQG  +SPYLFVICM+ L++ L +    R
Sbjct: 626  IHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAAR 685

Query: 934  LFHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYF 1113
             F YHPKCK +GL HL FADDL++ + G I SI++++ V D+FA  SGL+ +  KS +Y 
Sbjct: 686  HFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYL 745

Query: 1114 GGVGAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLL 1293
             G+ A  + E+ +      G+LP +YLG+PL +K+LS   C PL++++ +RI  W S+ L
Sbjct: 746  AGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFL 805

Query: 1294 SYAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            SYAGR+ L+ SVL+ I  +W   F LP+K ++ ++  C  FLW+G    S +A I+W  +
Sbjct: 806  SYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMV 865


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  399 bits (1026), Expect = e-108
 Identities = 197/445 (44%), Positives = 288/445 (64%)
 Frame = +1

Query: 139  GTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYF 318
            G + E+LR    N M+   +  VT Q  L   VTG+E+++ LF+M ++K+PG DG+   F
Sbjct: 724  GISVEDLR----NLMSY--RCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEF 777

Query: 319  FKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPIACCTVLYK 498
            FK  W + G + I A+Q FF  G LPK +N  ++ LIPK D A  +KD+RPI+CC VLYK
Sbjct: 778  FKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYK 837

Query: 499  IISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDI 678
            +ISKILANRLK++L + I  NQSAFVK RL+ +N++L+ EL+K Y ++ ++PRC +K+DI
Sbjct: 838  VISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDI 897

Query: 679  QKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGD 858
             KA+DSV+W FL   L  L FP  F +WI  C++T ++ + VNG+L   F + +G+RQG 
Sbjct: 898  SKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGC 957

Query: 859  PISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQ 1038
             +SPYLFVICM  L+  + E   +R   YHPKC+++GL HLCFADDL++F  G   SI+ 
Sbjct: 958  ALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEG 1017

Query: 1039 LLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFKYLGVPLSSKK 1218
            +++V  +FA  SGL+ +  KS IY  GV A  + + L       G+LP +YLG+PL +K+
Sbjct: 1018 VINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQ 1077

Query: 1219 LSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQ 1398
            ++     PLI+ +  +I+ W ++ LSYAGR+ L+ SV+  I  +W   + LP   ++ I+
Sbjct: 1078 MTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIE 1137

Query: 1399 SACRTFLWTGRTEVSKRALIAWEKI 1473
              C  FLW+G     K+A IAW  I
Sbjct: 1138 KLCSAFLWSGPVLNPKKAKIAWSSI 1162


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  399 bits (1024), Expect = e-108
 Identities = 194/419 (46%), Positives = 275/419 (65%)
 Frame = +1

Query: 217  KALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLP 396
            + L   VTG+E+K+ +FSM   K+PG DG+   F+K  W IIG+EVI A+Q FF+ G LP
Sbjct: 168  RLLTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFYKASWEIIGDEVIIAIQSFFAKGFLP 227

Query: 397  KEVNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAFV 576
            K VN  ++ LIPK   A  +KD+RPI+CC VLYK ISKILANRLK +L   I  NQSAFV
Sbjct: 228  KGVNSTILALIPKKKEAREIKDYRPISCCNVLYKAISKILANRLKRILPKFIVGNQSAFV 287

Query: 577  KGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFI 756
            K RL+ +N++L+ EL+K Y +  +S RC +K+DI KA+DS++W FL  +L  + FP  FI
Sbjct: 288  KDRLLIENVLLATELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFI 347

Query: 757  NWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRL 936
            +WI  C++T S+ I VNG+L   F + +G+RQG  +SPYLFVI M+ L+R L +    R 
Sbjct: 348  HWISLCMSTASFSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGARE 407

Query: 937  FHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFG 1116
            F YHP+CK +GL HLCFADDL++ T G I S+  ++ VL++FAA  GLK    K+ +Y  
Sbjct: 408  FGYHPRCKTLGLTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLA 467

Query: 1117 GVGAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLS 1296
            GV  H +Q +        G+LP +YLG+PL +K+L+     PLI ++ +RI  W S+ LS
Sbjct: 468  GVSDHSRQLMSSRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLS 527

Query: 1297 YAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            +AGR+ L+ SVL+ I  +W   F LP++ +  I       LW+G     K+A ++W++I
Sbjct: 528  FAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEI 586


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  399 bits (1024), Expect = e-108
 Identities = 201/461 (43%), Positives = 292/461 (63%), Gaps = 4/461 (0%)
 Frame = +1

Query: 103  EMEIREFYQSLM----GTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFS 270
            E   REF Q +     G A EEL+ +     +   K  +T        V+ +E+ + +FS
Sbjct: 299  EHHFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNH------VSAEEIHKVVFS 352

Query: 271  MDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNAS 450
            M + K+PG DG+   F+K  W+IIG E I A+Q FF+ G LPK +N  ++ LIPK   A 
Sbjct: 353  MPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAK 412

Query: 451  AVKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKS 630
             +KD+RPI+CC VLYK+ISKI+ANRLK+VL   I  NQSAFVK RL+ +N++L+ E++K 
Sbjct: 413  EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKD 472

Query: 631  YTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNG 810
            Y +  +S RC +K+DI KA+DSV+W FL  +L  + FP  F +WI  C+TT S+ + VNG
Sbjct: 473  YHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNG 532

Query: 811  DLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFA 990
            +L   F + + +RQG  +SPYLFVI M+ L++ L +    R F YHPKC+ +GL HL FA
Sbjct: 533  ELAGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFA 592

Query: 991  DDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCE 1170
            DDL++ + G + SI  ++ VL +FA  SGLK +  KS +Y  GV A + QEI++      
Sbjct: 593  DDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDV 652

Query: 1171 GELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVY 1350
            G+LP +YLG+PL SK+L+   C PLI+++ ++I  W S+ LS+AGR+ L+ S L+ I  +
Sbjct: 653  GKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNF 712

Query: 1351 WSQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            W   F LP+  ++ I   C  FLW+G    S +A ++WE I
Sbjct: 713  WMAAFRLPRACIREIDKLCSAFLWSGTELSSNKAKVSWEAI 753


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  394 bits (1011), Expect = e-107
 Identities = 194/456 (42%), Positives = 291/456 (63%), Gaps = 2/456 (0%)
 Frame = +1

Query: 106  MEIREFYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQ--KALVAAVTGKEVKEALFSMDS 279
            +E  +F++  +    E+   V+   +    + + T    + L   V+ +E+K  LFSM  
Sbjct: 55   VEAEKFFKEFLQLIPEDFVGVEVRELQDLLQFRCTNSDNEMLTREVSSEEIKTVLFSMPK 114

Query: 280  SKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVK 459
             K+PG DG+   F+K  W IIG+E    VQ FF  G LPK +N  ++ LIPK   A  ++
Sbjct: 115  DKSPGPDGYTSEFYKATWDIIGQEFTLPVQSFFQKGFLPKGINSIILALIPKKLAAKEMR 174

Query: 460  DFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTR 639
            D+RPI+CC VLYK+ISKI+ANRLK++L   I++NQSAFVK RL+ +N++L+ EL+K Y +
Sbjct: 175  DYRPISCCNVLYKVISKIIANRLKLLLPRFIAENQSAFVKDRLLIENLLLATELVKDYHK 234

Query: 640  KQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLT 819
              +S RC +K+DI KA+DSV+W FL   L+ + F   FI+WI  C+TT S+ + VNGDL 
Sbjct: 235  DSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWINLCITTASFSVQVNGDLV 294

Query: 820  EAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDL 999
              F++++G+RQG  +SPYLFVICM+ L++ L +    R F +HPKC+R+GL HL FADDL
Sbjct: 295  GYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLTHLSFADDL 354

Query: 1000 LLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGEL 1179
            ++ + G   SI+ +L V D+F   SGL+ +  KS +Y  GV   +KQEI        G+L
Sbjct: 355  MVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLFDVGQL 414

Query: 1180 PFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQ 1359
            P +YLG+PL +K+L+     PL++++ +RI  W  +  S+AGR  L+KSVL+ I  +W  
Sbjct: 415  PVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNFWLA 474

Query: 1360 IFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWE 1467
             F LP++ ++ I   C +FLW+G    S +A I+W+
Sbjct: 475  AFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWD 510


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  393 bits (1009), Expect = e-106
 Identities = 188/335 (56%), Positives = 254/335 (75%), Gaps = 3/335 (0%)
 Frame = +1

Query: 478  CCTVLYKIIS---KILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQL 648
            C T++ + I    K + N    V+ TIISD+Q+ F+ GR I DNIIL+HEL+K+YTRK +
Sbjct: 324  CATIIEQEIIEALKSIGNDKAPVIHTIISDSQAGFIPGRKIGDNIILAHELVKAYTRKNV 383

Query: 649  SPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAF 828
            SPRCM+K+D+ KAYDSVEWPFL+Q++  LGFP  F  W+M C+ TV+Y I VNG  T+ F
Sbjct: 384  SPRCMLKIDLHKAYDSVEWPFLEQVMEGLGFPDLFTKWVMKCVKTVNYTIVVNGQNTQRF 443

Query: 829  EARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLF 1008
            +A KG+RQGDP+SP+LF I MEYL+R L  L +++ F YHPK  ++ + HLCFADDLLLF
Sbjct: 444  DAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLLF 503

Query: 1009 TRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFK 1188
            +RGD+NSI+ L     +F+ ASGL+AN  KS+IY GGV   ++Q+I++  G    ELPFK
Sbjct: 504  SRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFK 563

Query: 1189 YLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFI 1368
            YLGVPLSSKKL+ +Q  PLI+K++ RIN W +K LSYAGR QL+K+VLFG+Q  W+Q+FI
Sbjct: 564  YLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFI 623

Query: 1369 LPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            +P KI+KLI+  CR++LW+G   V+K+ALIAW+K+
Sbjct: 624  IPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDKV 658


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  390 bits (1003), Expect = e-106
 Identities = 188/417 (45%), Positives = 278/417 (66%)
 Frame = +1

Query: 223  LVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKE 402
            L   V+ +E+K+ LFSM + K+PG DGF   FFK  W I+G E I A+Q FF++G LPK 
Sbjct: 2    LTRVVSAEEIKKVLFSMPNDKSPGPDGFTSEFFKESWEILGPEFILAIQSFFALGFLPKG 61

Query: 403  VNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKG 582
            VN  ++ LIPK   +  +KD+RPI+CC V+YK+ISKILANRLK++L   I+ NQS+FVK 
Sbjct: 62   VNSTILALIPKKLESKEMKDYRPISCCNVMYKVISKILANRLKLLLPQFIAGNQSSFVKD 121

Query: 583  RLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINW 762
            RL+ +N++L+ +L+K Y +  +S RC +K+DI KA DSV+W FL   L  + FP  FI+W
Sbjct: 122  RLLIENVLLATDLVKDYHKDSISERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIHW 181

Query: 763  IMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFH 942
            I  C+TT S+ + VNG+L   F++ +G+RQG  +SPYLFVICM+ L++ L ++       
Sbjct: 182  IRLCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIG 241

Query: 943  YHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGV 1122
            YHP CKR+GL HL FADDL++ T G   SI+ ++ V D F+  SGLK +  KS I+  G+
Sbjct: 242  YHPHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGL 301

Query: 1123 GAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYA 1302
             +  + ++        GELP +YLG+PL +K+LS V   PLI+++ +RI  W+S+ LS+A
Sbjct: 302  SSTSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFA 361

Query: 1303 GRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
            GR  L+ S+++    +W   F LP+  ++ I+  C +FLW+G    SK+A I+W ++
Sbjct: 362  GRFNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQV 418


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  389 bits (998), Expect = e-105
 Identities = 183/426 (42%), Positives = 272/426 (63%)
 Frame = +1

Query: 196  KLQVTQQKALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQF 375
            +   T Q  L   VT +E ++ LF+M S+K PG DG+   FFK  W I G++ I A++ F
Sbjct: 13   RCSATDQDMLTREVTSEENQKVLFAMPSNKFPGPDGYTSEFFKATWSITGQDFIAAIKSF 72

Query: 376  FSVGDLPKEVNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRLKVVLDTIIS 555
            F  G LPK +N  ++ LIPK D A+ ++D+RPI+CC V+YK+ISKI+ANRLKV+L T I 
Sbjct: 73   FIKGFLPKGLNATILALIPKKDEATLMRDYRPISCCNVIYKVISKIIANRLKVMLPTFIL 132

Query: 556  DNQSAFVKGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIEL 735
             NQSAFV+ RL+ +N++L+ EL+K Y +  +SPRC +K+DI KA+DSV+W FL   L  L
Sbjct: 133  QNQSAFVRERLLIENVLLATELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEAL 192

Query: 736  GFPHRFINWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLL 915
             FP  F +WI  C++T ++ + VNG+L   F +++G+RQG  +SPYLFVICM  L+  + 
Sbjct: 193  NFPENFCHWIKLCISTATFSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMID 252

Query: 916  ELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQL 1095
                +R   YHPKCK++ L HLCFADDL++F  G   S++ ++++  +FA  SGL  +  
Sbjct: 253  VAAVHRNIGYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLE 312

Query: 1096 KSNIYFGGVGAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINC 1275
            KS +Y  GV    +  IL       G+LP +YLG+PL +K+++     PL+ K+  +I+ 
Sbjct: 313  KSTLYLAGVSELNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISS 372

Query: 1276 WASKLLSYAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWTGRTEVSKRAL 1455
            W ++ LSYAGR+ L+ SV+  +  +W   + LP   +K I+  C  FLW+G     K+A 
Sbjct: 373  WTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAK 432

Query: 1456 IAWEKI 1473
            I W  +
Sbjct: 433  ITWTSL 438


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  379 bits (972), Expect = e-102
 Identities = 192/438 (43%), Positives = 272/438 (62%), Gaps = 4/438 (0%)
 Frame = +1

Query: 178  TMNR--GPKLQVTQQKALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEE 351
            T+NR  GP L     K+L    T  +++   FSM+ +K+PG DGFN  FF++ W +IG+ 
Sbjct: 256  TINRSDGPDLA----KSLCNEFTHDDIRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDN 311

Query: 352  VIHA-VQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRL 528
            V+ A V++FFS G L  E+N  +ITL+PK  N + + DFRPI+CC   YKII+K+LANRL
Sbjct: 312  VVAAAVKEFFSYGSLLMELNSTIITLVPKVANPTTMSDFRPISCCNTFYKIIAKLLANRL 371

Query: 529  KVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWP 708
            K  L  I+  +QS F+ GR I DNI+L+ E+I  Y +    PRC   VD+ KA D+VEW 
Sbjct: 372  KGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKADGQPRCTFMVDMMKANDTVEWD 431

Query: 709  FLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVIC 888
            F+   L     P   I WI +C+++  + + VNG+L   F  R+G+RQGDP+SPYLFVI 
Sbjct: 432  FIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIA 491

Query: 889  MEYLNRCL-LELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFA 1065
            ME L+ C+   +  +  F YH +C ++ L HLCFADDLL+F  GD NS++ L      F 
Sbjct: 492  MEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFE 551

Query: 1066 AASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPL 1245
            + S LKAN  +S I+  GV  +    +L+++    G  P +YLG+PL + KL +  C PL
Sbjct: 552  SLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPL 611

Query: 1246 IKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWT 1425
            + ++  RI  W +K+LS+AGR+QL++SVL  IQVYW+   ILP+K+LK I+   R FLW 
Sbjct: 612  LDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWA 671

Query: 1426 GRTEVSKRALIAWEKIML 1479
            G         +AW +I L
Sbjct: 672  GNCSGRAATKVAWSEICL 689


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  377 bits (967), Expect = e-102
 Identities = 179/421 (42%), Positives = 269/421 (63%)
 Frame = +1

Query: 211  QQKALVAAVTGKEVKEALFSMDSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGD 390
            +Q  LVA +T  EV +  FS+  +K+PG DG+ V FF+  W +IG+EV  A++ FF+ G 
Sbjct: 711  EQNLLVAEITEAEVMKVFFSIPLNKSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGF 770

Query: 391  LPKEVNVALITLIPKCDNASAVKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSA 570
            LPK +N  ++ LIPK   A  +KD+RPI+CC VLYK ISK+LANRLK +L   I+ NQSA
Sbjct: 771  LPKGLNSTILALIPKRTYAKEMKDYRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSA 830

Query: 571  FVKGRLIFDNIILSHELIKSYTRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHR 750
            F+  RL+ +N++L+ EL+K Y +  LSPRC +K+D+ KA+DSV+WPFL   L  L  P +
Sbjct: 831  FISDRLLMENLLLASELVKDYHKDGLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEK 890

Query: 751  FINWIMTCLTTVSYVINVNGDLTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDN 930
            FI+WI  C++T S+ + VN           G+RQG  +SPYLFVICM  L+  L +    
Sbjct: 891  FIHWINLCISTASFSVQVN-----------GLRQGCSLSPYLFVICMNVLSAMLDKGAVE 939

Query: 931  RLFHYHPKCKRVGLIHLCFADDLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIY 1110
            + F YHP+C+ +GL HLCFADD+++F+ G  +S++ +L++   FAA SGL  +  KS ++
Sbjct: 940  KRFGYHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAAFSGLNISLEKSTLF 999

Query: 1111 FGGVGAHLKQEILELSGMCEGELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKL 1290
               + +     IL       G LP +YLG+PL +K++++  C PL++K+  RI+ W ++ 
Sbjct: 1000 MASISSETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLLEKIRSRISSWKNRF 1059

Query: 1291 LSYAGRVQLMKSVLFGIQVYWSQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEK 1470
            LSYAGR+QL+ SV+  +  +W   F LP+  ++ I+     FLW+G      +A +AW  
Sbjct: 1060 LSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHD 1119

Query: 1471 I 1473
            +
Sbjct: 1120 V 1120


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  369 bits (948), Expect = 1e-99
 Identities = 175/400 (43%), Positives = 264/400 (66%)
 Frame = +1

Query: 274  DSSKAPGIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASA 453
            ++ K+PG DG+ V FFK  W ++G +++ A+Q FF  G LPK +N  ++ LI K    S 
Sbjct: 613  EAHKSPGPDGYTVEFFKTAWPVLGRDLVIAIQSFFLKGFLPKGINTTILALISKKHEVSG 672

Query: 454  VKDFRPIACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSY 633
            +KD+RPI+CC VLYKI+SK++ANRLK +L   I+ NQSAF+K RL+ +N++L+ EL+K Y
Sbjct: 673  MKDYRPISCCNVLYKIVSKLMANRLKEILPASIAPNQSAFIKDRLMMENLLLASELVKDY 732

Query: 634  TRKQLSPRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGD 813
             ++ +S R  +K+DI KA+D V+WPFL  +L  +  P  FI+WI  C+ T S+ + VNG+
Sbjct: 733  HKESISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELCIGTASFSVQVNGE 792

Query: 814  LTEAFEARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFAD 993
            L+  F + +G+RQG  +SPYL+VICM  L+  L +    +   YHP+C+ + L HLCFAD
Sbjct: 793  LSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLCFAD 852

Query: 994  DLLLFTRGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEG 1173
            D+++F+ G   SIQ  L++ +KFAA S LK +  KS I+  G+  + K  IL+      G
Sbjct: 853  DIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPFELG 912

Query: 1174 ELPFKYLGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYW 1353
             LP KYLG+PL +K+++     PL++K+  RI  W ++ LS+AGR+QL+KSVL  I  +W
Sbjct: 913  TLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFW 972

Query: 1354 SQIFILPQKILKLIQSACRTFLWTGRTEVSKRALIAWEKI 1473
              +F LP+  L+ I+     FLW+G    +K+A IAW ++
Sbjct: 973  LSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEV 1012


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  364 bits (935), Expect = 5e-98
 Identities = 184/451 (40%), Positives = 276/451 (61%), Gaps = 3/451 (0%)
 Frame = +1

Query: 121  FYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQKA--LVAAVTGKEVKEALFSMDSSKAPG 294
            +Y+ L+G+      M ++  MN     + +Q +   L  + T  E+K A  S+  +K  G
Sbjct: 268  YYERLLGSIESPFSM-EQEDMNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSG 326

Query: 295  IDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPI 474
             DG++V FF+  W IIG EV+ A+ +FF  G L K+ N   + LIPK  NA  + +FRPI
Sbjct: 327  PDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI 386

Query: 475  ACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSP 654
            +C   LYK+ISK+L +RL+ +L  +I  +QSAF+ GR + +N++L+ E++  Y R  +SP
Sbjct: 387  SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNISP 446

Query: 655  RCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEA 834
            R M+KVD++KA+DSV+W F+   L  L  P R+INWI  C+TT S+ I+VNG     F +
Sbjct: 447  RGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRS 506

Query: 835  RKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTR 1014
             KG+RQGDP+SPYLFV+ ME  ++ L    D+   HYHPK   + + HL FADD+++F  
Sbjct: 507  TKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFD 566

Query: 1015 GDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELS-GMCEGELPFKY 1191
            G  +S+  +   LD FA  SGLK N+ KS ++  G+   L + I   + G   G  P +Y
Sbjct: 567  GGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERITSAAYGFPAGTFPIRY 624

Query: 1192 LGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFIL 1371
            LG+PL  +KL +    PL++K+  R+  W SK LS+AGR QL+ SV+FG+  +W   F+L
Sbjct: 625  LGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLL 684

Query: 1372 PQKILKLIQSACRTFLWTGRTEVSKRALIAW 1464
            P+  +K I+S C  FLW G  +  K + ++W
Sbjct: 685  PKGCIKKIESLCSKFLWAGSIDGRKSSKVSW 715


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  364 bits (935), Expect = 5e-98
 Identities = 184/451 (40%), Positives = 276/451 (61%), Gaps = 3/451 (0%)
 Frame = +1

Query: 121  FYQSLMGTAAEELRMVDKNTMNRGPKLQVTQQKA--LVAAVTGKEVKEALFSMDSSKAPG 294
            +Y+ L+G+      M ++  MN     + +Q +   L  + T  E+K A  S+  +K  G
Sbjct: 268  YYERLLGSIESPFSM-EQEDMNLLLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSG 326

Query: 295  IDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPI 474
             DG++V FF+  W IIG EV+ A+ +FF  G L K+ N   + LIPK  NA  + +FRPI
Sbjct: 327  PDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTSNACTISEFRPI 386

Query: 475  ACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSP 654
            +C   LYK+ISK+L +RL+ +L  +I  +QSAF+ GR + +N++L+ E++  Y R  +SP
Sbjct: 387  SCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNISP 446

Query: 655  RCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEA 834
            R M+KVD++KA+DSV+W F+   L  L  P R+INWI  C+TT S+ I+VNG     F +
Sbjct: 447  RGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRS 506

Query: 835  RKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTR 1014
             KG+RQGDP+SPYLFV+ ME  ++ L    D+   HYHPK   + + HL FADD+++F  
Sbjct: 507  TKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFD 566

Query: 1015 GDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELS-GMCEGELPFKY 1191
            G  +S+  +   LD FA  SGLK N+ KS ++  G+   L + I   + G   G  P +Y
Sbjct: 567  GGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERITSAAYGFPAGTFPIRY 624

Query: 1192 LGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFIL 1371
            LG+PL  +KL +    PL++K+  R+  W SK LS+AGR QL+ SV+FG+  +W   F+L
Sbjct: 625  LGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLL 684

Query: 1372 PQKILKLIQSACRTFLWTGRTEVSKRALIAW 1464
            P+  +K I+S C  FLW G  +  K + ++W
Sbjct: 685  PKGCIKKIESLCSKFLWAGSIDGRKSSKVSW 715


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  361 bits (927), Expect = 4e-97
 Identities = 181/455 (39%), Positives = 280/455 (61%), Gaps = 2/455 (0%)
 Frame = +1

Query: 121  FYQSLMGTAAEELRMVDKNTMNR--GPKLQVTQQKALVAAVTGKEVKEALFSMDSSKAPG 294
            ++ SL+G   +   M ++N MN     +    Q   L +  + ++++ ALFS+  +K+ G
Sbjct: 408  YFGSLLGDEVDPYLM-EQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCG 466

Query: 295  IDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPI 474
             DGF   FF   W I+G EV  A+++FFS G L K+ N   I LIPK  N +   DFRPI
Sbjct: 467  PDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPI 526

Query: 475  ACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSP 654
            +C   LYK+I+++L +RL+ +L  +IS  QSAF+ GR + +N++L+ +L+  Y    +SP
Sbjct: 527  SCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISP 586

Query: 655  RCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEA 834
            R M+KVD++KA+DSV W F+   L  L  P +FINWI  C++T ++ +++NG     F++
Sbjct: 587  RGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKS 646

Query: 835  RKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTR 1014
             KG+RQGDP+SPYLFV+ ME  +  L    ++ L HYHPK   + + HL FADD+++F  
Sbjct: 647  TKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFD 706

Query: 1015 GDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFKYL 1194
            G   S+  +   LD FA+ SGLK N+ KS++Y  G+   L+       G   G LP +YL
Sbjct: 707  GGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLN-QLESNANAAYGFPIGTLPIRYL 765

Query: 1195 GVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFILP 1374
            G+PL ++KL + + +PL++K+  R   W +K LS+AGR+QL+ SV+FG   +W   F+LP
Sbjct: 766  GLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLP 825

Query: 1375 QKILKLIQSACRTFLWTGRTEVSKRALIAWEKIML 1479
            +  +K I+S C  FLW+G  E +K   ++W  + L
Sbjct: 826  KGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCL 860


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  359 bits (921), Expect = 2e-96
 Identities = 185/456 (40%), Positives = 281/456 (61%), Gaps = 2/456 (0%)
 Frame = +1

Query: 118  EFYQSLMGTAAEELRMVDKNTMNR--GPKLQVTQQKALVAAVTGKEVKEALFSMDSSKAP 291
            E++QS +G+  + L + ++  ++     +    QQ +L    + +++K A FS+  +KA 
Sbjct: 304  EYFQSNLGSE-QGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNKAS 362

Query: 292  GIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRP 471
            G DGF+  FF  CW IIG EV  A+ +FF+ G L K+ N   + LIPK  NAS++ DFRP
Sbjct: 363  GPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDFRP 422

Query: 472  IACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLS 651
            I+C   +YK+ISK+L +RLK  L   IS +QSAF+ GRL  +N++L+ EL+  Y +K ++
Sbjct: 423  ISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKNIA 482

Query: 652  PRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFE 831
            P  M+KVD++KA+DSV W F+   L  L  P +F  WI+ CL+T S+ + +NG     F 
Sbjct: 483  PSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSAGHFW 542

Query: 832  ARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFT 1011
            + KG+RQGDP+SPYLFV+ ME  +  L     +    YHPK  ++ + HL FADD+++F 
Sbjct: 543  SSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMIFF 602

Query: 1012 RGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFKY 1191
             G  +S+  ++  L+ FA  SGL  N  K+ +Y  G+ +  + + +   G   G LP +Y
Sbjct: 603  DGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGL-SQSESDSMASYGFKLGSLPVRY 661

Query: 1192 LGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFIL 1371
            LG+PL S+KL++ +  PLI+K+  R N W  +LLS+AGRVQL+ SV+ GI  +W   FIL
Sbjct: 662  LGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFIL 721

Query: 1372 PQKILKLIQSACRTFLWTGRTEVSKRALIAWEKIML 1479
            P   +K I+S C  FLW+ R +    A +AW ++ L
Sbjct: 722  PLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCL 757


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  349 bits (895), Expect = 2e-93
 Identities = 182/456 (39%), Positives = 277/456 (60%), Gaps = 2/456 (0%)
 Frame = +1

Query: 118  EFYQSLMGTAAEELRMVDKNTMNRGPKLQVTQ--QKALVAAVTGKEVKEALFSMDSSKAP 291
            +F++ L G+++  +     + +N   + +  +  ++ L A V+  ++K   F++ S+K+P
Sbjct: 405  DFFKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSP 464

Query: 292  GIDGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRP 471
            G DG+   FFK+ W I+G  +I AVQ+FF  G L  + N   +T++PK  NA  + +FRP
Sbjct: 465  GPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRP 524

Query: 472  IACCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLS 651
            I+CC  +YK+ISK+LA RL+ +L   IS +QSAFVKGRL+ +N++L+ EL++ + +  +S
Sbjct: 525  ISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANIS 584

Query: 652  PRCMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFE 831
             R ++KVD++KA+DSV W F+ + L     P RF+NWI  C+T+ S+ INV+G L   F+
Sbjct: 585  SRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFK 644

Query: 832  ARKGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFT 1011
              KG+RQGDP+SP LFVI ME L+R L     +    YHPK   V +  L FADDL++F 
Sbjct: 645  GSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFY 704

Query: 1012 RGDINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELSGMCEGELPFKY 1191
             G  +S++ + SVL+ F   SGL+ N  KS +Y  G+    K++ L   G   G  PF+Y
Sbjct: 705  DGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTLAF-GFVNGTFPFRY 763

Query: 1192 LGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFIL 1371
            LG+PL  +KL       LI K+  R N WA+K LS+AGR+QL+ SV++    +W   FIL
Sbjct: 764  LGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFIL 823

Query: 1372 PQKILKLIQSACRTFLWTGRTEVSKRALIAWEKIML 1479
            P+  LK I+  C  FLW           ++W+   L
Sbjct: 824  PKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCL 859


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  347 bits (889), Expect = 1e-92
 Identities = 183/456 (40%), Positives = 274/456 (60%), Gaps = 3/456 (0%)
 Frame = +1

Query: 121  FYQSLM-GTAAEELRMVDKNTMNRGPKLQVTQQKALVAAVTGKEVKEALFSMDSSKAPGI 297
            F++SL+ G   E         +    +  V Q   L  + +  +++EA FS+  +KA G 
Sbjct: 409  FFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGP 468

Query: 298  DGFNVYFFKRCWHIIGEEVIHAVQQFFSVGDLPKEVNVALITLIPKCDNASAVKDFRPIA 477
            DG++  FFK  W ++G EV  AVQ+FF  G L K+ N   + LIPK  N+S + DFRPI+
Sbjct: 469  DGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPIS 528

Query: 478  CCTVLYKIISKILANRLKVVLDTIISDNQSAFVKGRLIFDNIILSHELIKSYTRKQLSPR 657
            C   LYK+I+K+L +RLK +L+ +IS +QSAF+ GRL+ +N++L+ E++  Y  K +S R
Sbjct: 529  CLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSR 588

Query: 658  CMVKVDIQKAYDSVEWPFLKQMLIELGFPHRFINWIMTCLTTVSYVINVNGDLTEAFEAR 837
             M+KVD++KA+DSV W F+      L  P +F+ WI  C++T  + + VNG  +  F++ 
Sbjct: 589  GMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSN 648

Query: 838  KGIRQGDPISPYLFVICMEYLNRCLLELTDNRLFHYHPKCKRVGLIHLCFADDLLLFTRG 1017
            KG+RQGDP+SPYLFV+ ME  +  L    D    HYHPK   + + HL FADD+++F  G
Sbjct: 649  KGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDG 708

Query: 1018 DINSIQQLLSVLDKFAAASGLKANQLKSNIYFGGVGAHLKQEILELS--GMCEGELPFKY 1191
              +S+  +   LD FA+ SGL  N+ K+N+Y  G     + E L +S  G     LP +Y
Sbjct: 709  GSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGTD---EVEALAISHYGFPISTLPIRY 765

Query: 1192 LGVPLSSKKLSVVQCQPLIKKMLQRINCWASKLLSYAGRVQLMKSVLFGIQVYWSQIFIL 1371
            LG+PL S+KL + + +     +++R   WA K LS+AGRVQL+ SV+ G+  +W   F+L
Sbjct: 766  LGLPLMSRKLKISEYE-----LVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVL 820

Query: 1372 PQKILKLIQSACRTFLWTGRTEVSKRALIAWEKIML 1479
                +K I+S C  FLW+G  + SK A IAW  + L
Sbjct: 821  LLGCVKKIESLCSRFLWSGSIDASKGAKIAWSGVCL 856


Top