BLASTX nr result

ID: Mentha25_contig00025233 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00025233
         (2465 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   724   0.0  
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   718   0.0  
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   518   e-144
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   482   e-133
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   482   e-133
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   482   e-133
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       481   e-133
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   468   e-129
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...   466   e-128
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   458   e-126
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           458   e-126
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   454   e-124
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   453   e-124
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   451   e-124
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   441   e-121
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   431   e-118
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   427   e-117
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   426   e-116
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   422   e-115
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   421   e-115

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  724 bits (1870), Expect = 0.0
 Identities = 369/827 (44%), Positives = 518/827 (62%), Gaps = 8/827 (0%)
 Frame = -2

Query: 2461 RGWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALN 2282
            + W+ +NNY+ +   RIW+ W P              ++ ++ D        ++AVY L+
Sbjct: 53   KDWKWLNNYSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQD--QSHKLKMVAVYGLH 110

Query: 2281 TGEGRKELWDFAKRKMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGFIDVGE 2102
            T   RK LW     + V   +PM++ GDFNA+  S DR  G +V+ A+ EDFQ F+    
Sbjct: 111  TIADRKSLWS-GLLQCVQQQDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSN 169

Query: 2101 LHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHCPQML 1922
            L E R +   Y+WSN+  G  RV S ID+ + N+ WL  Y++V VQ L  G+SDH P + 
Sbjct: 170  LIESRSTWSYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPGISDHSPLLF 229

Query: 1921 LFGNCRQRAGL-FRFFNVLADHEDFEGIIREHWGAWRSGNILRDIWRKCIKLKGPLKSLN 1745
                 R + G  F+F NV+A+  +F   + + W +      L+ IW     +K  LK + 
Sbjct: 230  NLMTGRPQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAVKRELKQMK 289

Query: 1744 TKWFARVGDRVQGLREQLARVQHDHDCSXXXXXXXXXEK-------WSNIEERIWQQKSR 1586
            T+      ++V+ LR QL  +Q   D           +        WS+IE+ I QQKSR
Sbjct: 290  TQKIGLAHEKVKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSR 349

Query: 1585 VDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYMNLMGSCA 1406
            + WL+ GD N+K F    K R   N I+ L   DG      +++ EE+  FY  L+G+ A
Sbjct: 350  ITWLQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRA 409

Query: 1405 SELQVVNKDIMRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGFNACFFKKS 1226
            S L  V+ + +R G  L++Q +  LI+E    E+ +AL  + ++KAPG+DGFNA FFKKS
Sbjct: 410  STLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKS 469

Query: 1225 WEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCSVLYKIISK 1046
            W  I +E+   +Q+FF N ++ R IN  ++TLLPKV +A+ VK+FRPIACC+V+YKIISK
Sbjct: 470  WGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISK 529

Query: 1045 ILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMIKVDIQKAY 866
            +L NRMK ++ +V+ + QS FIPGR I DNILL+ EL++GYTRK +SPRC++KVDI+KAY
Sbjct: 530  MLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAY 589

Query: 865  DSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGIRQGDPISP 686
            DSVEW F+E +L E GFP R++ WIM C+STVSYS+LVNG  T+ F+AR+G+RQGDP+SP
Sbjct: 590  DSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSP 649

Query: 685  YLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQQAMKV 506
            +LF +CMEYL+RC  ELK +  F +HPKC++  + H+ FADDLL+F R D+ S+      
Sbjct: 650  FLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVA 709

Query: 505  LDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPLTSHKLSVM 326
               F+  SGL A+  KS IYF GV D+    + D   M  G LPF+YLGVPLTS KL+  
Sbjct: 710  FQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYA 769

Query: 325  QCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRVQQACR 146
            QCKPLV+ I  R   W AKLLSYAGR+QLIKS++  +Q YW+ IF L +KV++ V++ CR
Sbjct: 770  QCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCR 829

Query: 145  IFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFKHL 5
             FLWTGK E + +A VAW  + +PKS GG N++N+  WN+AA+ K L
Sbjct: 830  KFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLL 876


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  718 bits (1853), Expect = 0.0
 Identities = 366/825 (44%), Positives = 514/825 (62%), Gaps = 8/825 (0%)
 Frame = -2

Query: 2455 WECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALNTG 2276
            W  +NNY  +  GRIWV W                I  EV +        + AVY L+T 
Sbjct: 55   WSWINNYACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTI 114

Query: 2275 EGRKELWDFAKRKMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGFIDVGELH 2096
              RK LW+     +   +EP ++ GD+NA+ S++DR  G  VS+A+  D + F+   +L 
Sbjct: 115  ADRKVLWEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLL 174

Query: 2095 EVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHCPQMLLF 1916
            E   +G  Y+W+N   G  R+ S ID+ F NV W+++Y DVVV+    G+SDH P +   
Sbjct: 175  EAPTTGLFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPLIFNL 234

Query: 1915 GNCRQRAGL-FRFFNVLADHEDFEGIIREHWGAWRSGNILRDIWRKCIKLKGPLKSLNTK 1739
                   G  F+F N LAD   F  +++E WG+      +++IW +   +K  LKS ++K
Sbjct: 235  ATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSK 294

Query: 1738 WFARVGDRVQGLREQLARVQHDHDCSXXXXXXXXXE-------KWSNIEERIWQQKSRVD 1580
             F++   +V+ LR +LA VQ   + S         +       KWS I+E I +QKSR+ 
Sbjct: 295  KFSKAHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQ 354

Query: 1579 WLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYMNLMGSCASE 1400
            WL LGD+N+KFF    K+R+  N I  L    G +     +I  E+  FY  L+G+ +S+
Sbjct: 355  WLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQ 414

Query: 1399 LQVVNKDIMRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGFNACFFKKSWE 1220
            L+ ++  ++R G +L++     L++  T  E+  AL  +D  KAPG+DGFN+ FFKKSW 
Sbjct: 415  LEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWL 474

Query: 1219 FIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCSVLYKIISKIL 1040
             I +E+   +  FF NG + + IN   +TL+PK+  A   KD+RPIACCS LYKIISKIL
Sbjct: 475  VIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKIL 534

Query: 1039 ANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMIKVDIQKAYDS 860
              R++ V+ +V+   Q+ FIP R I DNILL+ EL++GY R+ VSPRC+IKVDI+KAYDS
Sbjct: 535  TKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKAYDS 594

Query: 859  VEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGIRQGDPISPYL 680
            VEWVF+E ML ELGFP  +I+WIMAC+ TVSYSIL+NG  +  F+A++G+RQGDP+SP+L
Sbjct: 595  VEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFL 654

Query: 679  FVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQQAMKVLD 500
            F + MEYL+RC G +  +  F +HPKC++  L H+ FADDLL+F R D  S+ + M   +
Sbjct: 655  FALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFN 714

Query: 499  HFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPLTSHKLSVMQC 320
             F++ SGL+A+  KSCIYFGGV  +    + D+  M  GSLPF+YLGVPL S KL+  QC
Sbjct: 715  SFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQC 774

Query: 319  KPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRVQQACRIF 140
            KPL+D I  R   W A LLSYAGR+QL+K++++ +Q YW QIF LP+K++K V+  CR F
Sbjct: 775  KPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKF 834

Query: 139  LWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFKHL 5
            LWTG  + S +A VAWD + QPKS GGLN+ N+  WNKAAI K L
Sbjct: 835  LWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLL 879


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  518 bits (1334), Expect = e-144
 Identities = 301/840 (35%), Positives = 451/840 (53%), Gaps = 23/840 (2%)
 Frame = -2

Query: 2461 RGWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLA-VYAL 2285
            + W  + NY     GRIWV W  R             +L   + L   + +   + VYA 
Sbjct: 53   KDWSILTNYEHNRRGRIWVLW--RKNVRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYAS 110

Query: 2284 NTGEGRKELWDFAKRKM---VAVNEPMVVGGDFNAILSSEDRFQGAV--VSQADVEDFQG 2120
            N  E RK LW   K      +  ++P  + GDFN  L   +  Q  V  +    + DFQ 
Sbjct: 111  NYVEERKVLWSELKDHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQ 170

Query: 2119 FIDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSD 1940
             I+   L ++   GP +TW N +E    +   +DR   N  W   +S       A G SD
Sbjct: 171  VINYCSLTDMAAQGPLFTWCNKRE-HGLIMKKLDRVLINDCWNQTFSQSYSVFEAGGCSD 229

Query: 1939 HCPQMLLF----GNCRQRAGLFRFFNVLADHEDFEGIIREHWGAWR----SGNILRDIWR 1784
            H    +      GN  Q    F+F N L D EDF+ ++  +W        S + L    +
Sbjct: 230  HLRCRISLNSEAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSK 289

Query: 1783 KCIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQH----DHDCSXXXXXXXXXEKWSNI 1616
                LK  ++S+       +  +     + L   QH    +              +W  +
Sbjct: 290  NLKGLKPKIRSMARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDRV 349

Query: 1615 ---EERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEE 1445
               EE+  +QKS++ W ++GD NTK FH  A  R   N I  +   DG      ++I  E
Sbjct: 350  AILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAE 409

Query: 1444 VRRFYMNLMGSCASELQVVN-KDIMRRGP-RLTSQQQRDLIKECTDAEVKDALFCMDSNK 1271
              RF+   +    ++ + V   ++ +  P R +   Q+ LI+  T  E++  LF M S+K
Sbjct: 410  AERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDK 469

Query: 1270 APGVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDF 1091
            +PG DG+ + FFK +WE IG+E T AVQ FF  G LP+ IN  ++ L+PK   A  +KD+
Sbjct: 470  SPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDY 529

Query: 1090 RPIACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQ 911
            RPI+CC+VLYK+ISKI+ANR+K+VL   I   QSAF+  RL+ +N+LL+ ELVK Y +  
Sbjct: 530  RPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDT 589

Query: 910  VSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEM 731
            +S RC IK+DI KA+DSV+W F+  + + LGFP  +I WI  C++T S+S+ VNGE+   
Sbjct: 590  ISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGY 649

Query: 730  FEARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLV 551
            F++ RG+RQG  +SPYLFVICM+ L++   +  + R F YHPKCK  GL H+SFADDL+V
Sbjct: 650  FQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMV 709

Query: 550  FTRGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPF 371
             + G   S+++ +KV D FA+ SGLR +  KS +Y  G+    ++ + D+     G LP 
Sbjct: 710  LSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPV 769

Query: 370  KYLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIF 191
            +YLG+PL + +LS   C PL++ + +RI  W+++ LSYAGR+ LI SV++ I  +W   F
Sbjct: 770  RYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAF 829

Query: 190  VLPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFK 11
             LP+K ++ +++ C  FLW+G    SN+A ++W  V +PK  GGL + +L E N     K
Sbjct: 830  RLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLK 889


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  482 bits (1241), Expect = e-133
 Identities = 285/828 (34%), Positives = 436/828 (52%), Gaps = 17/828 (2%)
 Frame = -2

Query: 2458 GWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALNT 2279
            GW  V NY  +V G+IWV W+P              I  E+L   +  W  V  VYA N 
Sbjct: 56   GWSFVENYEFSVLGKIWVLWDPSVKVVVIGRSLQM-ITCELLLPDSPSWFVVSIVYASNE 114

Query: 2278 GEGRKELWDFAKR---KMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGFIDV 2108
               RKELW+   +     V V    +V GDFN IL+ E      +  +  +  F+  +  
Sbjct: 115  EGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAFRSCLLD 172

Query: 2107 GELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHCP- 1931
             +L+++ + G  YTW N     R +   IDR   N  W   +            SDH   
Sbjct: 173  SDLYDLVYKGSSYTWWNKCSS-RPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSC 231

Query: 1930 QMLLFGNCRQRAGLFRFFNVLADHEDFEGIIREHWGAWR-SGNILRDIWRKCIKLKGPLK 1754
            +++L     +    FRFFN    + DF  +IRE+W +   SG+ +  + +K   LK P+ 
Sbjct: 232  EVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPIC 291

Query: 1753 SLNTKWFARVGDRVQGLREQLARVQH----DHDCSXXXXXXXXXEKW---SNIEERIWQQ 1595
              + + ++ +  RV      +   Q     +              KW   +  EE  + Q
Sbjct: 292  CFSRENYSDIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEESFFCQ 351

Query: 1594 KSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRR----FYM 1427
            KS + WL  GD NT +FH  A MR++ N IN L    G     Q+ I E ++     F+ 
Sbjct: 352  KSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFE 411

Query: 1426 NLMGSCASELQVVNKDI-MRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGF 1250
            +L+     E  +   D+ +    R +  Q  DL +  +D ++++A F +  NKA G DG+
Sbjct: 412  SLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGY 471

Query: 1249 NACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCS 1070
            ++ FFK  W  +G EVT AVQ+FFR+GQL ++ N   + L+PK+ N+S + DFRPI+C +
Sbjct: 472  SSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLN 531

Query: 1069 VLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMI 890
             LYK+I+K+L +R+K +LN+VI   QSAF+PGRL+ +N+LL+ E+V GY  K +S R M+
Sbjct: 532  TLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGML 591

Query: 889  KVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGI 710
            KVD++KA+DSV W F+      L  P +++ WI  C+ST  +S++VNG  +  F++ +G+
Sbjct: 592  KVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGL 651

Query: 709  RQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRD 530
            RQGDP+SPYLFV+ ME  +             YHPK     + H+ FADD++VF  G   
Sbjct: 652  RQGDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFDGGSS 711

Query: 529  SVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPL 350
            S+    + LD FA  SGL  N+ K+ +Y  G  D+++ + +   G    +LP +YLG+PL
Sbjct: 712  SLHGISEALDDFASWSGLHVNKDKTNLYLAGT-DEVEALAISHYGFPISTLPIRYLGLPL 770

Query: 349  TSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVL 170
             S KL + + +     +++R   W+ K LS+AGRVQLI SV+ G+  +W   FVL    +
Sbjct: 771  MSRKLKISEYE-----LVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCV 825

Query: 169  KRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNK 26
            K+++  C  FLW+G  + S  A +AW  V  PK+ GG+ +     WNK
Sbjct: 826  KKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVGLRRFTPWNK 873


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  482 bits (1241), Expect = e-133
 Identities = 287/829 (34%), Positives = 434/829 (52%), Gaps = 18/829 (2%)
 Frame = -2

Query: 2458 GWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALNT 2279
            GW+ V NY  A  GRIWV W+P                   L   + ++  V  VYA+N 
Sbjct: 55   GWKSVCNYEFAALGRIWVVWDPAVEVTVLSKSDQTISCTVKLPHISTEFV-VTFVYAVNC 113

Query: 2278 GEGRKELWDFAKRKMVAVNE-----PMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGFI 2114
              GR+ LW  ++ +++A N+     P ++ GDFN  L   D   G       +E+F+  +
Sbjct: 114  RYGRRRLW--SELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECL 171

Query: 2113 DVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHC 1934
                + ++ + G  YTW NNQE    +   IDR   N  WL           A   SDHC
Sbjct: 172  LTSNISDLPFRGNHYTWWNNQENNP-IAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHC 230

Query: 1933 PQMLLFGNCRQRAGL---FRFFNVLADHEDFEGIIREHWGAWR-SGNILRDIWRKCIKLK 1766
            P  +   N  Q  G    F+  N L  H +F   IR  W      G+ +  + +K   LK
Sbjct: 231  PSCVNISN--QSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLK 288

Query: 1765 GPLKSLNTKWFARVGDRVQGLREQLARVQHDHDCSXXXXXXXXXEK----WSNI---EER 1607
            G +++ N + ++ +  RV    + L   Q++   +         ++    W+ +   EER
Sbjct: 289  GTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAPSSYLAGLEKEAHRSWAELALAEER 348

Query: 1606 IWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYM 1427
               QKSRV WLK GD+NT FFH     RR  N I++L    G      +++      F+ 
Sbjct: 349  FLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFK 408

Query: 1426 NLMGSCASELQVVNKDIMRRGPRLTSQQQ-RDLIK-ECTDAEVKDALFCMDSNKAPGVDG 1253
             L GS +  +       +    R    +  R L++ E ++A++K   F + SNK+PG DG
Sbjct: 409  ELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDG 468

Query: 1252 FNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACC 1073
            + + FFKK+W  +G  +  AVQ+FFR+G+L  + N   +T++PK PNA  + +FRPI+CC
Sbjct: 469  YTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCC 528

Query: 1072 SVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCM 893
            + +YK+ISK+LA R++ +L   I   QSAF+ GRL+ +N+LL+ ELV+G+ +  +S R +
Sbjct: 529  NAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSRGV 588

Query: 892  IKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRG 713
            +KVD++KA+DSV W F+ + L     P R++ WI  C+++ S+SI V+G +   F+  +G
Sbjct: 589  LKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKG 648

Query: 712  IRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDR 533
            +RQGDP+SP LFVI ME L+R      S+    YHPK  +  +  ++FADDL++F  G  
Sbjct: 649  LRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKA 708

Query: 532  DSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVP 353
             S++    VL+ F  +SGL  N  KS +Y  G++D  K   L   G   G+ PF+YLG+P
Sbjct: 709  SSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYLGLP 767

Query: 352  LTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKV 173
            L   KL       L+D I  R N W+ K LS+AGR+QLI SV++    +W   F+LP+  
Sbjct: 768  LLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCC 827

Query: 172  LKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNK 26
            LK ++Q C  FLW           V+W     PK+ GGL + N   WNK
Sbjct: 828  LKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNK 876


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
            [Arabidopsis thaliana]
          Length = 893

 Score =  482 bits (1241), Expect = e-133
 Identities = 285/828 (34%), Positives = 437/828 (52%), Gaps = 17/828 (2%)
 Frame = -2

Query: 2458 GWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALNT 2279
            GW  V NY  +V G+IWV W+P              I  E+L   +  W  V  VYA N 
Sbjct: 56   GWSFVENYEFSVLGKIWVLWDPSVKVVVIGRSLQM-ITCELLLPDSPSWFVVSIVYASNE 114

Query: 2278 GEGRKELWDFAKR---KMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGFIDV 2108
               RKELW+   +     V V    +V GDFN IL+ E      +  +  +  F+  +  
Sbjct: 115  EGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAFRSCLLD 172

Query: 2107 GELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHCP- 1931
             +L+++ + G  YTW N     R +   IDR   N  W   +            SDH   
Sbjct: 173  SDLYDLVYKGSSYTWWNKCSS-RPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSC 231

Query: 1930 QMLLFGNCRQRAGLFRFFNVLADHEDFEGIIREHWGAWR-SGNILRDIWRKCIKLKGPLK 1754
            +++L     +    FRFFN    + DF  +IRE+W +   SG+ +  + +K   LK P+ 
Sbjct: 232  EVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPIC 291

Query: 1753 SLNTKWFARVGDRVQGLREQLARVQH----DHDCSXXXXXXXXXEKW---SNIEERIWQQ 1595
              + + ++ +  RV      +   Q     +              KW   +  EE  + Q
Sbjct: 292  CFSRENYSDIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEESFFCQ 351

Query: 1594 KSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRR----FYM 1427
            KS + WL  GD NT +FH  A MR++ N IN L    G     Q+ I E ++     F+ 
Sbjct: 352  KSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFE 411

Query: 1426 NLMGSCASELQVVNKDI-MRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGF 1250
            +L+     E  +   D+ +    R +  Q  DL +  +D ++++A F +  NKA G DG+
Sbjct: 412  SLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGY 471

Query: 1249 NACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCS 1070
            ++ FFK  W  +G EVT AVQ+FFR+GQL ++ N   + L+PK+ N+S + DFRPI+C +
Sbjct: 472  SSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPISCLN 531

Query: 1069 VLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMI 890
             LYK+I+K+L +R+K +LN+VI   QSAF+PGRL+ +N+LL+ E+V GY  K +S R M+
Sbjct: 532  TLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISSRGML 591

Query: 889  KVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGI 710
            KVD++KA+DSV W F+      L  P +++ WI  C+ST  +S++VNG  +  F++ +G+
Sbjct: 592  KVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKSNKGL 651

Query: 709  RQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRD 530
            RQGDP+SPYLFV+ ME  +            +YHPK     + H+ FADD++VF  G   
Sbjct: 652  RQGDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFDGGSS 711

Query: 529  SVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPL 350
            S+    + LD FA  SGL  N+ K+ +Y  G  D+++ + +   G    +LP +YLG+PL
Sbjct: 712  SLHGISEALDDFASWSGLHVNKDKTNLYLAGT-DEVEALAISHYGFPISTLPIRYLGLPL 770

Query: 349  TSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVL 170
             S KL + + +     +++R   W+ K LS+AGRVQLI SV+ G+  +W   FVL    +
Sbjct: 771  MSRKLKISEYE-----LVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLLLGCV 825

Query: 169  KRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNK 26
            K+++  C  FLW+G  + S  A +AW  V  PK+ GG+ +     WNK
Sbjct: 826  KKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVALRRFTPWNK 873


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  481 bits (1239), Expect = e-133
 Identities = 282/825 (34%), Positives = 440/825 (53%), Gaps = 14/825 (1%)
 Frame = -2

Query: 2458 GWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALNT 2279
            GW  V NY  +  G+IWV W+P              I  EVL  G+  W  V  VYA N 
Sbjct: 56   GWSFVENYAFSDLGKIWVMWDPSVQVVVVAKSLQM-ITCEVLLPGSPSWIIVSVVYAANE 114

Query: 2278 GEGRKELWDFAKRKMVAV---NEPMVVGGDFNAILSSEDRFQGAVVS-QADVEDFQGFID 2111
               RKELW      +V+    + P +V GDFN +L+ ++      ++   ++ DF+  + 
Sbjct: 115  VASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLL 174

Query: 2110 VGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDHCP 1931
              EL ++R+ G  +TW N       V   IDR   N  W   +   +    +   SDH  
Sbjct: 175  AAELSDLRYKGNTFTWWNKSH-TTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVS 233

Query: 1930 QMLLFGNCRQRAGL-FRFFNVLADHEDFEGIIREHWGAWRS-GNILRDIWRKCIKLKGPL 1757
              ++      +A   F+FFN L  + DF  ++R++W      G+ +  + +K   LK P+
Sbjct: 234  CGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPI 293

Query: 1756 KSLNTKWFARVGDRVQGLREQLARVQH----DHDCSXXXXXXXXXEKWSNI---EERIWQ 1598
            K  +   ++ +  R +   + L   Q     D              KW  +   EE  ++
Sbjct: 294  KDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFR 353

Query: 1597 QKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYMNLM 1418
            QKSR+ W   GD NTK+FH  A  R ++N+I+ L   +G     QE I++    ++ +L+
Sbjct: 354  QKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLL 413

Query: 1417 GSCASELQVVNKDI-MRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGFNAC 1241
            G       +   D+ +    R +  Q  +L    ++ +++ ALF +  NK+ G DGF A 
Sbjct: 414  GDEVDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAE 473

Query: 1240 FFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCSVLY 1061
            FF  SW  +G EVT A+++FF +G L ++ N   I L+PK+ N +   DFRPI+C + LY
Sbjct: 474  FFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLY 533

Query: 1060 KIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMIKVD 881
            K+I+++L +R++ +L+ VI   QSAF+PGR + +N+LL+ +LV GY    +SPR M+KVD
Sbjct: 534  KVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVD 593

Query: 880  IQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGIRQG 701
            ++KA+DSV W FV   L  L  P ++I WI  C+ST ++++ +NG     F++ +G+RQG
Sbjct: 594  LKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQG 653

Query: 700  DPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQ 521
            DP+SPYLFV+ ME  +        + L  YHPK     + H+ FADD+++F  G   S+ 
Sbjct: 654  DPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLH 713

Query: 520  QAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPLTSH 341
               + LD FA  SGL+ N+ KS +Y  G+ + ++       G   G+LP +YLG+PL + 
Sbjct: 714  GICETLDDFASWSGLKVNKDKSHLYLAGL-NQLESNANAAYGFPIGTLPIRYLGLPLMNR 772

Query: 340  KLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRV 161
            KL + + +PL++ I  R   W  K LS+AGR+QLI SV+FG   +W   F+LP+  +KR+
Sbjct: 773  KLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRI 832

Query: 160  QQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNK 26
            +  C  FLW+G  E +    V+W  +  PKS GGL +  L EWNK
Sbjct: 833  ESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  468 bits (1203), Expect = e-129
 Identities = 265/778 (34%), Positives = 418/778 (53%), Gaps = 22/778 (2%)
 Frame = -2

Query: 2275 EGRKELWDFAKRKM---VAVNEPMVVGGDFNAILSSEDRFQGAV--VSQADVEDFQGFID 2111
            E RKELW+  +      +  ++P ++ GDFN IL  E+        V+   + DFQ  ++
Sbjct: 2    EERKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVN 61

Query: 2110 VGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDH-- 1937
               + ++ + GP +TWSN +E +  +   +DR   N  WL  +        A G SDH  
Sbjct: 62   HCSITDLAYHGPLFTWSNKRENDL-IAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLR 120

Query: 1936 CPQMLLFGNCRQRAGL--FRFFNVLADHEDFEGIIREHWGAWRSGNI-LRDIWRKCIKLK 1766
            C   L  G      G   F+F NV+ + E F   +  +W    +  +    ++R   KLK
Sbjct: 121  CRINLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLK 180

Query: 1765 GPLKSLNTKWFARVGDRVQGLREQL-------ARVQHDHDCSXXXXXXXXXEKWSNI--- 1616
            G    L      R+G+ V+  +E         A    +   S          KW +I   
Sbjct: 181  GLKPLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKWDHIAVL 240

Query: 1615 EERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRR 1436
            EE+  +Q+S++ WL +GD N K FH     R   N+I  +   DGS    +E+I  E   
Sbjct: 241  EEKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEH 300

Query: 1435 FYMNLMGSCASELQ-VVNKDIMRRGPRLTSQQQRDLIKECTDAE-VKDALFCMDSNKAPG 1262
             +   +    ++ + +  +++    P   S   ++++     AE +   +F M ++K+PG
Sbjct: 301  HFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPG 360

Query: 1261 VDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPI 1082
             DG+ A F+K +W  IG E   A+Q FF  G LP+ IN  ++ L+PK   A  +KD+RPI
Sbjct: 361  PDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRPI 420

Query: 1081 ACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSP 902
            +CC+VLYK+ISKI+ANR+K+VL   I   QSAF+  RL+ +N+LL+ E+VK Y +  VS 
Sbjct: 421  SCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVSS 480

Query: 901  RCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEA 722
            RC +K+DI KA+DSV+W F+  +L  + FP  +  WI  C++T S+S+ VNGE+  +F +
Sbjct: 481  RCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFSS 540

Query: 721  RRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTR 542
             R +RQG  +SPYLFVI M+ L++   +    R F YHPKC+  GL H+SFADDL++ + 
Sbjct: 541  ARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILSD 600

Query: 541  GDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYL 362
            G   S+   +KVL  FA+ SGL+ +  KS +Y  GV+  +   I+ +     G LP +YL
Sbjct: 601  GKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRYL 660

Query: 361  GVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLP 182
            G+PL S +L+   C PL++ + ++I  W+++ LS+AGR+ LI S ++ I  +W   F LP
Sbjct: 661  GLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRLP 720

Query: 181  QKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFKH 8
            +  ++ + + C  FLW+G    SN+A V+W+ + +PK            W+K   F H
Sbjct: 721  RACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKK---------EAWHKGVWFAH 769


>gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 402

 Score =  466 bits (1200), Expect = e-128
 Identities = 216/383 (56%), Positives = 285/383 (74%)
 Frame = -2

Query: 1630 KWSNIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIM 1451
            KWS IEE+IW QKSR +W++LGD+NTKFFHAYAK RR  N I  L   DG+       I 
Sbjct: 18   KWSTIEEKIWMQKSRANWIQLGDSNTKFFHAYAKERRCQNNIKFLITEDGTRIDKHNLIK 77

Query: 1450 EEVRRFYMNLMGSCASELQVVNKDIMRRGPRLTSQQQRDLIKECTDAEVKDALFCMDSNK 1271
            EE+R FY+ LMGS    L +V+K++++RGP L+  QQ  L  + T  EVK+ LF MDS+K
Sbjct: 78   EEIRGFYLKLMGSSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAVEVKNVLFSMDSSK 137

Query: 1270 APGVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDF 1091
            APG+DG+N  FFK SW  IG+ V  A+  FF+ G +P+ IN   +TLLPK  N +SVK+F
Sbjct: 138  APGIDGYNVHFFKCSWNIIGDSVIDAILDFFKTGFMPKIINCTYMTLLPKEVNVTSVKNF 197

Query: 1090 RPIACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQ 911
            RPIACCSV+YKIISKIL +RM+ VLN V+ + QSAF+ GR+IFDNI+LSHELVK Y+RK 
Sbjct: 198  RPIACCSVIYKIISKILTSRMQGVLNSVVSENQSAFVKGRVIFDNIILSHELVKSYSRKG 257

Query: 910  VSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEM 731
            +SPRCM+K+D+QKAY+SVEW F++ ++ ELGF Y+++ W+M CL+T SY+  +NG++T  
Sbjct: 258  ISPRCMVKIDLQKAYNSVEWPFIKHLMLELGFSYKFVNWVMGCLTTASYTFNINGDLTRP 317

Query: 730  FEARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLV 551
            F A++G+RQGDPISPYLFVICMEYLN C  +L+ N  F++HP+CK+  L+HV F DDLL+
Sbjct: 318  FAAKKGLRQGDPISPYLFVICMEYLNICLIQLRKNAAFRFHPRCKRLNLIHVCFVDDLLL 377

Query: 550  FTRGDRDSVQQAMKVLDHFAEVS 482
            F+RGD DSV Q  +    F+  S
Sbjct: 378  FSRGDVDSVSQLFEAFSLFSAAS 400


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  458 bits (1179), Expect = e-126
 Identities = 257/742 (34%), Positives = 401/742 (54%), Gaps = 11/742 (1%)
 Frame = -2

Query: 2203 GDFNAILSSEDRFQGAVVS-QADVEDFQGFIDVGELHEVRWSGPQYTWSNNQEGERRVCS 2027
            GDFN +L  ++      ++    + DF   +   EL ++ + G  +TW N +   R +  
Sbjct: 3    GDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWN-KSSIRPIAK 61

Query: 2026 NIDRCFANVKWLDEYSDVVVQRLAKGVSDHCP-QMLLFGNCRQRAGLFRFFNVLADHEDF 1850
             +DR  AN  W + Y            SDH    ++L  N       F+FFN L  +EDF
Sbjct: 62   KLDRILANDSWCNLYPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNEDF 121

Query: 1849 EGIIREHWGAWRS-GNILRDIWRKCIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQH- 1676
              ++ ++W +    G+ +  + +K   +K P+K  +   ++ +  R +   E L   Q+ 
Sbjct: 122  LNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQNL 181

Query: 1675 ---DHDCSXXXXXXXXXEKW---SNIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNT 1514
               +   S          KW   S  EE  + Q+SRV W   GD+NT +FH     R++ 
Sbjct: 182  TLANPSVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSF 241

Query: 1513 NAINHLTRMDGSECWGQEQIMEEVRRFYMNLMGSCASELQVVNKDI-MRRGPRLTSQQQR 1337
            N IN L   +G     Q+ I++    +Y  L+GS  S   +  +D+ +    R +  Q  
Sbjct: 242  NTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQDQCS 301

Query: 1336 DLIKECTDAEVKDALFCMDSNKAPGVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPR 1157
            +L K  TD E+K A   +  NK  G DG++  FF+ +W  IG EV  A+ +FF +GQL +
Sbjct: 302  ELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLK 361

Query: 1156 EINVALITLLPKVPNASSVKDFRPIACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIP 977
            + N   + L+PK  NA ++ +FRPI+C + LYK+ISK+L +R++ +L+ VIG  QSAF+P
Sbjct: 362  QWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLP 421

Query: 976  GRLIFDNILLSHELVKGYTRKQVSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQ 797
            GR + +N+LL+ E+V GY R  +SPR M+KVD++KA+DSV+W FV   L  L  P RYI 
Sbjct: 422  GRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYIN 481

Query: 796  WIMACLSTVSYSILVNGEVTEMFEARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLF 617
            WI  C++T S++I VNG     F + +G+RQGDP+SPYLFV+ ME  ++       +   
Sbjct: 482  WIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYI 541

Query: 616  KYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGG 437
             YHPK     + H+ FADD+++F  G   S+    + LD FA+ SGL+ N+ KS ++  G
Sbjct: 542  HYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAG 601

Query: 436  VKDDMKHVILDQTGMCEGSLPFKYLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSY 257
            + D  + +     G   G+ P +YLG+PL   KL +    PL++ +  R+  W +K LS+
Sbjct: 602  L-DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSF 660

Query: 256  AGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQ 77
            AGR QLI SV+FG+  +W   F+LP+  +K+++  C  FLW G  +    + V+W     
Sbjct: 661  AGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCL 720

Query: 76   PKSGGGLNILNLNEWNKAAIFK 11
            PKS GGL   +  EWNK  + +
Sbjct: 721  PKSEGGLGFRSFGEWNKTLLLR 742


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  458 bits (1179), Expect = e-126
 Identities = 257/742 (34%), Positives = 401/742 (54%), Gaps = 11/742 (1%)
 Frame = -2

Query: 2203 GDFNAILSSEDRFQGAVVS-QADVEDFQGFIDVGELHEVRWSGPQYTWSNNQEGERRVCS 2027
            GDFN +L  ++      ++    + DF   +   EL ++ + G  +TW N +   R +  
Sbjct: 3    GDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWN-KSSIRPIAK 61

Query: 2026 NIDRCFANVKWLDEYSDVVVQRLAKGVSDHCP-QMLLFGNCRQRAGLFRFFNVLADHEDF 1850
             +DR  AN  W + Y            SDH    ++L  N       F+FFN L  +EDF
Sbjct: 62   KLDRILANDSWCNLYPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNEDF 121

Query: 1849 EGIIREHWGAWRS-GNILRDIWRKCIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQH- 1676
              ++ ++W +    G+ +  + +K   +K P+K  +   ++ +  R +   E L   Q+ 
Sbjct: 122  LNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQNL 181

Query: 1675 ---DHDCSXXXXXXXXXEKW---SNIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNT 1514
               +   S          KW   S  EE  + Q+SRV W   GD+NT +FH     R++ 
Sbjct: 182  TLANPSVSNAALELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSF 241

Query: 1513 NAINHLTRMDGSECWGQEQIMEEVRRFYMNLMGSCASELQVVNKDI-MRRGPRLTSQQQR 1337
            N IN L   +G     Q+ I++    +Y  L+GS  S   +  +D+ +    R +  Q  
Sbjct: 242  NTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQDQCS 301

Query: 1336 DLIKECTDAEVKDALFCMDSNKAPGVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPR 1157
            +L K  TD E+K A   +  NK  G DG++  FF+ +W  IG EV  A+ +FF +GQL +
Sbjct: 302  ELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLK 361

Query: 1156 EINVALITLLPKVPNASSVKDFRPIACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIP 977
            + N   + L+PK  NA ++ +FRPI+C + LYK+ISK+L +R++ +L+ VIG  QSAF+P
Sbjct: 362  QWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLP 421

Query: 976  GRLIFDNILLSHELVKGYTRKQVSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQ 797
            GR + +N+LL+ E+V GY R  +SPR M+KVD++KA+DSV+W FV   L  L  P RYI 
Sbjct: 422  GRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYIN 481

Query: 796  WIMACLSTVSYSILVNGEVTEMFEARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLF 617
            WI  C++T S++I VNG     F + +G+RQGDP+SPYLFV+ ME  ++       +   
Sbjct: 482  WIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYI 541

Query: 616  KYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGG 437
             YHPK     + H+ FADD+++F  G   S+    + LD FA+ SGL+ N+ KS ++  G
Sbjct: 542  HYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAG 601

Query: 436  VKDDMKHVILDQTGMCEGSLPFKYLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSY 257
            + D  + +     G   G+ P +YLG+PL   KL +    PL++ +  R+  W +K LS+
Sbjct: 602  L-DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSF 660

Query: 256  AGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQ 77
            AGR QLI SV+FG+  +W   F+LP+  +K+++  C  FLW G  +    + V+W     
Sbjct: 661  AGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCL 720

Query: 76   PKSGGGLNILNLNEWNKAAIFK 11
            PKS GGL   +  EWNK  + +
Sbjct: 721  PKSEGGLGFRSFGEWNKTLLLR 742


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  454 bits (1167), Expect = e-124
 Identities = 277/836 (33%), Positives = 429/836 (51%), Gaps = 19/836 (2%)
 Frame = -2

Query: 2461 RGWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLAVYALN 2282
            + W+ V+NY     GRIWV W+               +    ++    ++     +YA N
Sbjct: 355  KDWQMVSNYEFNRLGRIWVVWSSSVQLQVIFKSSQMIVCLVRVEHYDVEFICSF-IYASN 413

Query: 2281 TGEGRKELWDFAKRKMVAV---NEPMVVGGDFNAILSSEDRFQGAVVSQAD--VEDFQGF 2117
              E RK+LW        +V   N+P ++ GDFN  L  E+    AV       + DFQ  
Sbjct: 414  FVEERKKLWQDLHNLQNSVAFRNKPWLLFGDFNETLKMEEHSSYAVSPMVTPGMRDFQIV 473

Query: 2116 IDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDH 1937
            +    L ++R  GP +TW N +  E  +C  +DR   N ++   Y        + G SDH
Sbjct: 474  VRYCSLEDMRTHGPLFTWGNKRN-EGLICKKLDRVLLNPEYNSAYPHSYCIMDSGGCSDH 532

Query: 1936 CPQMLLFGNCRQRA-GLFRFFNVLADHEDFEGIIREHWG----AWRSGNILRDIWRKCIK 1772
                    +  Q+  G F+F NV+A H +F   + + W      + S + L    +K  +
Sbjct: 533  LRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDFWKNTTELFPSTSTLFRFSKKLKE 592

Query: 1771 LKGPLKSLNTKWFARVGDRVQGLREQLARVQ-------HDHDCSXXXXXXXXXEKWSNIE 1613
            LK  LK L+    + +  R     E+L R Q       + HD                ++
Sbjct: 593  LKPILKDLSRNNLSDLTRRATYAYEELCRCQTKSLTTLNPHDI---------------VD 637

Query: 1612 ERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRF 1433
            E +                   F  + K R   NAI+ +    G+    Q+ I  E  RF
Sbjct: 638  ESL------------------AFERWEKERHLLNAIHEVMDPQGTRPPNQDDIKIEAVRF 679

Query: 1432 YMNLMGSCASELQVVNKDIMRR--GPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGV 1259
            + +L+ S  S+   ++ D ++     R +  +Q  L+ E T+AEV    F +  NK+PG 
Sbjct: 680  FSDLLSSQPSDFTGISVDELKGILQYRYSLHEQNLLVAEITEAEVMKVFFSIPLNKSPGP 739

Query: 1258 DGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIA 1079
            DG+   FF+++W  IG+EVT A++ FF  G LP+ +N  ++ L+PK   A  +KD+RPI+
Sbjct: 740  DGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKGLNSTILALIPKRTYAKEMKDYRPIS 799

Query: 1078 CCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPR 899
            CC+VLYK ISK+LANR+K +L + I   QSAFI  RL+ +N+LL+ ELVK Y +  +SPR
Sbjct: 800  CCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISDRLLMENLLLASELVKDYHKDGLSPR 859

Query: 898  CMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEAR 719
            C +K+D+ KA+DSV+W F+   L+ L  P ++I WI  C+ST S+S+ VN          
Sbjct: 860  CAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWINLCISTASFSVQVN---------- 909

Query: 718  RGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRG 539
             G+RQG  +SPYLFVICM  L+    +    + F YHP+C+  GL H+ FADD++VF+ G
Sbjct: 910  -GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAG 968

Query: 538  DRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLG 359
               S++  + +   FA  SGL  +  KS ++   +  +    IL +     GSLP +YLG
Sbjct: 969  SAHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLG 1028

Query: 358  VPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQ 179
            +PL + ++++  C PL++ I  RI+ W  + LSYAGR+QL+ SV+  +  +W   F LP+
Sbjct: 1029 LPLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPR 1088

Query: 178  KVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFK 11
              ++ ++Q    FLW+G     ++A VAW  V +PKS GGL + +L + NK   FK
Sbjct: 1089 ACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFK 1144


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  453 bits (1165), Expect = e-124
 Identities = 255/673 (37%), Positives = 355/673 (52%), Gaps = 3/673 (0%)
 Frame = -2

Query: 2014 CFANVKWLDEYSDVVVQRLAKGVSDHCPQMLLFG-NCRQRAGLFRFFNVLADHEDFEGII 1838
            C  N K LD+ +  V+  L  G+SDH   ++  G   R R   F+FFN LAD EDF  I+
Sbjct: 125  CLHNSK-LDDLNYSVLSFLPPGISDHAAMVVKVGLPFRIRKAPFKFFNFLADREDFIPIV 183

Query: 1837 REHWGAWRSGNILRDIWRKCIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQHDHDCSX 1658
               W     G+    +WRK   +K   K LN                             
Sbjct: 184  SAVWATNVWGSKQFQVWRKLKLVKNQFKLLNC---------------------------- 215

Query: 1657 XXXXXXXXEKWSNIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGS 1478
                        N+ E++ ++KSRV WLK GD N+ FF       RN N I  + R DG 
Sbjct: 216  ------------NVVEKLLKKKSRVQWLKKGDKNSTFFFKTMTKHRNRNRIATINRSDGP 263

Query: 1477 ECWGQEQIMEEVRRFYMNLMGSCASELQVVNKDIMRRGPRLTSQQQRDLIKECTDAEVKD 1298
            +                                             + L  E T  +++ 
Sbjct: 264  DL-------------------------------------------AKSLCNEFTHDDIRA 280

Query: 1297 ALFCMDSNKAPGVDGFNACFFKKSWEFIGEEVTRA-VQQFFRNGQLPREINVALITLLPK 1121
              F M+ NK+PG DGFN CFF+K+W  IG+ V  A V++FF  G L  E+N  +ITL+PK
Sbjct: 281  VFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMELNSTIITLVPK 340

Query: 1120 VPNASSVKDFRPIACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSH 941
            V N +++ DFRPI+CC+  YKII+K+LANR+K  L+ ++G  QS FIPGR I DNILL+ 
Sbjct: 341  VANPTTMSDFRPISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQ 400

Query: 940  ELVKGYTRKQVSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYS 761
            E++  Y +    PRC   VD+ KA D+VEW F+   L     P   I WI +C+S+  +S
Sbjct: 401  EIICDYHKADGQPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFS 460

Query: 760  ILVNGEVTEMFEARRGIRQGDPISPYLFVICMEYLNRCF-GELKSNRLFKYHPKCKKFGL 584
            + VNGE+   F  RRG+RQGDP+SPYLFVI ME L+ C    +  +  F+YH +C +  L
Sbjct: 461  VCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNL 520

Query: 583  VHVSFADDLLVFTRGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILD 404
             H+ FADDLL+F  GD +SV+       +F  +S L+AN  +S I+  GV  +    +L 
Sbjct: 521  SHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQ 580

Query: 403  QTGMCEGSLPFKYLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVV 224
             T    G+ P +YLG+PL + KL +  C PL+D I  RI  W  K+LS+AGR+QLI+SV+
Sbjct: 581  VTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVL 640

Query: 223  FGIQMYWSQIFVLPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILN 44
              IQ+YW+   +LP+KVLK +++  R FLW G         VAW ++  PK  GGL I +
Sbjct: 641  SSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKD 700

Query: 43   LNEWNKAAIFKHL 5
            L+ WNKA +  H+
Sbjct: 701  LHCWNKALMISHI 713


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score =  451 bits (1161), Expect = e-124
 Identities = 266/834 (31%), Positives = 435/834 (52%), Gaps = 18/834 (2%)
 Frame = -2

Query: 2458 GWECVNNYNSAVNGRIWVAWNPRXXXXXXXXXXXXAILFEVLDLGTGKWQNVLA-VYALN 2282
            GW   +NY  +  GRIW+ W+P              I+F  + + +      +A VY  N
Sbjct: 54   GWRMDSNYCCSELGRIWIVWDP--SISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRN 111

Query: 2281 TGEGRKELWD----FAKRKMVAVNEPMVVGGDFNAILSSEDRF--QGAVVSQADVEDFQG 2120
            +   R+ LW+     ++   ++V  P ++ GDFN I ++ + +    ++++   +ED Q 
Sbjct: 112  SELDRRSLWEDILVLSRTSPLSVT-PWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQC 170

Query: 2119 FIDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSD 1940
             +   +L ++   G  +TWSN+Q+ +  +   +DR  AN +W   +   +      G SD
Sbjct: 171  CLRDSQLSDLPSRGVFFTWSNHQQ-DNPILRKLDRALANGEWFAVFPSALAVFDPPGDSD 229

Query: 1939 HCPQMLLFGNCRQRA-GLFRFFNVLADHEDFEGIIREHWGA-WRSGNILRDIWRKCIKLK 1766
            H P ++L  N    +   F++F+ L+ H  +   +   W A    G+ +  + +     K
Sbjct: 230  HAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAK 289

Query: 1765 GPLKSLNTKWFARVGD-------RVQGLREQLARVQHDHDCSXXXXXXXXXEKWSNIEER 1607
               ++LN   F+ +         R++ ++ +L     D               ++   E 
Sbjct: 290  LCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVARKQWIFFAAALES 349

Query: 1606 IWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYM 1427
             ++QKSR+ WL  GDANT+FFH      + TN I  L   DG      +QI   +  +Y 
Sbjct: 350  FFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYS 409

Query: 1426 NLMGSCASELQVVNKDIMR--RGPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDG 1253
            +L+G  +  +   + + ++     R  S     L    ++ E+   LF M  NKAPG DG
Sbjct: 410  HLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDG 469

Query: 1252 FNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACC 1073
            F   FF ++W  +   V  A+++FF +G LPR  N   ITL+PKV  A  +  FRP+ACC
Sbjct: 470  FPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACC 529

Query: 1072 SVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCM 893
            + +YK+I++I++ R+K+ ++  +   Q  FI GRL+ +N+LL+ ELV  +     + R  
Sbjct: 530  TTIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGC 589

Query: 892  IKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRG 713
            ++VDI KAYD+V W F+  +L  L  P  +I WI  C+S+ SYSI  NGE+   F+ ++G
Sbjct: 590  LQVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGELIGFFQGKKG 649

Query: 712  IRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDR 533
            IRQGDP+S +LFV+ M+ L++       N LF  HP C    + H+SFADD+LVF+ G  
Sbjct: 650  IRQGDPMSSHLFVLVMDVLSKSLDLGALNGLFNLHPNCLAPIITHLSFADDVLVFSDGAA 709

Query: 532  DSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVP 353
             S+   + +LD F + SGL  N+ K+ +   G        + D  G+  GSLP +YLGVP
Sbjct: 710  SSIAGILTILDDFRQGSGLGINREKTELLLDGGNFARNRSLADNLGITHGSLPVRYLGVP 769

Query: 352  LTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKV 173
            L S K+     +PLVD I  R   W+A+ LS+AGR+QL+KSV++    +W+ +F+ P + 
Sbjct: 770  LMSQKMRRQDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASVFIFPNQC 829

Query: 172  LKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFK 11
            L++++Q C  FLW+G    +  A ++W+ V  PK  GGL +  L+ WN+    K
Sbjct: 830  LQKLEQMCNAFLWSGAPNSARGAKISWNIVCSPKEAGGLGLKRLSSWNRILALK 883


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  441 bits (1135), Expect = e-121
 Identities = 233/639 (36%), Positives = 355/639 (55%), Gaps = 13/639 (2%)
 Frame = -2

Query: 1888 FRFFNVLADHEDFEGIIREHWGA----WRSGNILRDIWRKCIKLKGPLKSLNTKWFARVG 1721
            F+F NVL     F  ++  HW +    + S + L    +K   LK  L+ L  +    + 
Sbjct: 548  FKFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKLGDLP 607

Query: 1720 DRVQG----LREQLARVQHDHDCSXXXXXXXXXEKW---SNIEERIWQQKSRVDWLKLGD 1562
             R +     L E+ A    +               W   S +EE   +QKS++ W+ +GD
Sbjct: 608  KRTREAHILLCEKQATTLANPSQETIAEELKAYTDWTHLSELEEGFLKQKSKLHWMNVGD 667

Query: 1561 ANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFYMNLMGSCASELQVVNK 1382
             N  +FH  A++R+  N+I  +   +       E+I  E  RF+   +   + +   ++ 
Sbjct: 668  GNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFHGISV 727

Query: 1381 DIMRR--GPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAPGVDGFNACFFKKSWEFIGE 1208
            + +R     R +   Q  L +E T  E++  LF M +NK+PG DG+ + FFK +W   G 
Sbjct: 728  EDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLTGP 787

Query: 1207 EVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACCSVLYKIISKILANRM 1028
            +   A+Q FF  G LP+ +N  ++ L+PK   A  +KD+RPI+CC+VLYK+ISKILANR+
Sbjct: 788  DFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILANRL 847

Query: 1027 KIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCMIKVDIQKAYDSVEWV 848
            K++L   I   QSAF+  RL+ +N+LL+ ELVK Y ++ V+PRC +K+DI KA+DSV+W 
Sbjct: 848  KLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQ 907

Query: 847  FVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRGIRQGDPISPYLFVIC 668
            F+   L  L FP  +  WI  C+ST ++S+ VNGE+   F + RG+RQG  +SPYLFVIC
Sbjct: 908  FLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVIC 967

Query: 667  MEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDRDSVQQAMKVLDHFAE 488
            M  L+    E   +R   YHPKC+K GL H+ FADDL+VF  G + S++  + V   FA 
Sbjct: 968  MNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAG 1027

Query: 487  VSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKYLGVPLTSHKLSVMQCKPLV 308
             SGL+ +  KS IY  GV    +   L       G LP +YLG+PL + +++     PL+
Sbjct: 1028 RSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLI 1087

Query: 307  DGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQKVLKRVQQACRIFLWTG 128
            + +  +I+ W+A+ LSYAGR+ L+ SV+  I  +W   + LP   ++ +++ C  FLW+G
Sbjct: 1088 EAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSG 1147

Query: 127  KGEMSNRALVAWDKVMQPKSGGGLNILNLNEWNKAAIFK 11
                  +A +AW  + QPK  GGL I +L E NK +  K
Sbjct: 1148 PVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLK 1186


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  431 bits (1108), Expect = e-118
 Identities = 262/765 (34%), Positives = 401/765 (52%), Gaps = 16/765 (2%)
 Frame = -2

Query: 2296 VYALNTGEGRKELW----DFAKRKMVAVNEPMVVGGDFNAILS-SEDRFQGAVVSQADVE 2132
            VYA      R+ LW    DF+    V +++P  V GDFN IL  SE              
Sbjct: 6    VYASTDEVTRQILWNEIVDFSNDPCV-IDKPWTVLGDFNQILHPSEHSTSDGFNVDRPTR 64

Query: 2131 DFQGFIDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAK 1952
             F+  I +  L ++ + G  +TW N +     V   +DR   N KW   +   +      
Sbjct: 65   IFRETILLASLTDLSFRGNTFTWWNKRS-RAPVAKKLDRILVNDKWTTTFPSSLGLFGEP 123

Query: 1951 GVSDH--CPQMLLFGNCRQRAGLFRFFNVLADHEDFEGIIREHWGAWR-SGNILRDIWRK 1781
              SDH  C   L+  + R +   FRF N L   E+F  +I   W +   +G+ +  +  K
Sbjct: 124  DFSDHSSCELSLMSASPRSKKP-FRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVK 182

Query: 1780 CIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQH---DHDC-SXXXXXXXXXEKW---S 1622
               LK  ++  +   ++ +  R +   + L   Q       C S          KW   +
Sbjct: 183  LKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEAETQRKWRILA 242

Query: 1621 NIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEV 1442
              E   + Q+SRV+WL+ GD N+ +FH  A  R++ N I+ L+   G    GQ+ +    
Sbjct: 243  EAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENHC 302

Query: 1441 RRFYMNLMGSCASELQVVNKDIMRR-GPRLTSQQQRDLIKECTDAEVKDALFCMDSNKAP 1265
              ++ + +GS          DI      R +  QQ  L    +  ++K+A F +  NKA 
Sbjct: 303  VEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNKAS 362

Query: 1264 GVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRP 1085
            G DGF+  FF   W  IG EVT A+ +FF +G+L ++ N   + L+PK+ NASS+ DFRP
Sbjct: 363  GPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMSDFRP 422

Query: 1084 IACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVS 905
            I+C + +YK+ISK+L +R+K  L   I   QSAF+PGRL  +N+LL+ ELV GY +K ++
Sbjct: 423  ISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKNIA 482

Query: 904  PRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFE 725
            P  M+KVD++KA+DSV W F+   L  L  P ++  WI+ CLST S+S+++NG     F 
Sbjct: 483  PSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSAGHFW 542

Query: 724  ARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFT 545
            + +G+RQGDP+SPYLFV+ ME  +       ++    YHPK  +  + H+ FADD+++F 
Sbjct: 543  SSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMIFF 602

Query: 544  RGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGSLPFKY 365
             G   S+   ++ L+ FA  SGL  N  K+ +Y  G+       +    G   GSLP +Y
Sbjct: 603  DGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMASY-GFKLGSLPVRY 661

Query: 364  LGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVL 185
            LG+PL S KL++ +  PL++ I  R N W  +LLS+AGRVQL+ SV+ GI  +W   F+L
Sbjct: 662  LGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFIL 721

Query: 184  PQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNI 50
            P   +K+++  C  FLW+ + +    A VAW +V  PK+ GG+ +
Sbjct: 722  PLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGL 766


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  427 bits (1099), Expect = e-117
 Identities = 256/782 (32%), Positives = 405/782 (51%), Gaps = 23/782 (2%)
 Frame = -2

Query: 2305 VLAVYALNTGEGRKELWDFAKRKMVAVN---EPMVVGGDFNAILSSEDRFQGAVVS-QAD 2138
            V  VYA N    RKELW+      V+++   +P ++ GDFN +L   +  Q   ++    
Sbjct: 55   VSIVYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVNRR 114

Query: 2137 VEDFQGFIDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRL 1958
            ++ F+  +   EL ++ + G  +TW N +   R V   +DR   N  W   +        
Sbjct: 115  MKVFRDCLFEAELCDLVFKGNTFTWWN-KSATRPVAKKLDRILVNESWCSRFPSAYAVFG 173

Query: 1957 AKGVSDHCPQMLLFGNCRQRAGL-FRFFNVLADHEDFEGIIREHWGAWRS-GNILRDIWR 1784
                SDH    ++      R    FRF+N L  + DF  ++ E W +    G+ +  + +
Sbjct: 174  EPDFSDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSK 233

Query: 1783 KCIKLKGPLKSLNTKWFARVGDRVQGLREQLARVQH----DHDCSXXXXXXXXXEKWSNI 1616
            K   LK P+++ + + F+ +  RV+     +   Q+    D              KW  +
Sbjct: 234  KLKALKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAALEMEAQRKWLIL 293

Query: 1615 ---EERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEE 1445
               EE  + Q+SRV W+  GD+NT +FH  A  R+  N I+ +   +G +   Q  I E 
Sbjct: 294  VKAEESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEH 353

Query: 1444 VRRFYMNLMGSCASELQVVNKDIMRRGP-RLTSQQQRDLIKECTDAEVKDALFCMDSNKA 1268
               ++ NL+G       ++ +D     P R +  Q+++L    +  ++K A F   SNK 
Sbjct: 354  CIEYFSNLLGGEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKT 413

Query: 1267 PGVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFR 1088
             G DGF   FFK++W  IG EVT AV +FF +  L ++ N   + L+PK+ NAS + DFR
Sbjct: 414  SGPDGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNASKMNDFR 473

Query: 1087 PIACCS----VLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYT 920
            PI+C       LYK+I+++L NR++ +L+ VI   QSAF+PGR + +N+LL+ ELV+GY 
Sbjct: 474  PISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATELVQGYN 533

Query: 919  RKQVSPRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEV 740
            R+ + PR M+KVD++KA+DS+ W F+   L  +G P R++ WI  C+ST ++S+ VNG  
Sbjct: 534  RQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSVCVNGNT 593

Query: 739  TEMFEARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADD 560
               F++ RG+RQG+P+SP+LFV+ ME  +             YHPK     + H+ FADD
Sbjct: 594  GGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISHLMFADD 653

Query: 559  LLVFTRGDRDSVQQAMKVLDHFAEVSGLRANQLKSCIYFGGVKDDMKHVILDQTGMCEGS 380
            ++VF  G   S+    + L+ FA  SGL  N+ K+ +Y  G         LD+       
Sbjct: 654  IMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAG---------LDR------- 697

Query: 379  LPFKYLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWS 200
                 +     + KL + +  PL++ + +R   WS K LS+AGRVQLI SV+ GI  +W 
Sbjct: 698  -----IEASTIARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWI 752

Query: 199  QIFVLPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGL-----NILNLNE 35
              F+LP+  +KR++  C  FLW+G  ++   A VAW +V  PK  GG+      +LN   
Sbjct: 753  STFILPKGCVKRIEALCARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLNTTL 812

Query: 34   WN 29
            W+
Sbjct: 813  WD 814


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  426 bits (1096), Expect = e-116
 Identities = 262/771 (33%), Positives = 399/771 (51%), Gaps = 14/771 (1%)
 Frame = -2

Query: 2305 VLAVYALNTGEGRKELWDFAKRKMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDF 2126
            V  VYA  T   R  LWD  +R    +  P +VGGDFN IL  E+R  G+   +  +EDF
Sbjct: 1155 VTFVYAKCTRSERTLLWDCLRRLAADIEVPWLVGGDFNIILKREERLYGSAPHEGAMEDF 1214

Query: 2125 QG-FIDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKG 1949
                +D G L +  + G  +TW+NN     R+   +DR   N  W++++    +Q L + 
Sbjct: 1215 ASTLLDCGLL-DGGFEGNPFTWTNN-----RMFQRLDRIVYNHHWINKFPITRIQHLNRD 1268

Query: 1948 VSDHCPQMLLFGNCRQRA-GLFRFFNVLADHEDFEGIIREHWGAWRSGNILRDIWRKCIK 1772
             SDHCP ++   N  ++A   FRF +    H DF+  +  +W    +G+ L+  W K  +
Sbjct: 1269 GSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHR 1328

Query: 1771 LKGPLKSLNTKWFARVGDRVQGLREQLARVQ-----HDHDCSXXXXXXXXXE-----KWS 1622
            LK  LK  N   F   GD    L+E   RV+     H ++ +               K  
Sbjct: 1329 LKQHLKWWNKVMF---GDIFSKLKEAEKRVEECEILHQNEQTVESIIKLNKSYAQLNKQL 1385

Query: 1621 NIEERIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEV 1442
            NIEE  W+QKS V W+  G+ NTKFFH   + +R  + I  +   DG     QEQ+ +  
Sbjct: 1386 NIEEIFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSA 1445

Query: 1441 RRFYMNLMGSCASELQVVNKDIMRRGPRLTSQQQRDLI-KECTDAEVKDALFCMDSNKAP 1265
             +++ +L+     +     + ++   P + S  + +L+  E    EVKDA+F +D   A 
Sbjct: 1446 IKYFSSLLKFEPCDDSRFQRSLI---PSIISNSENELLCAEPNLQEVKDAVFGIDPESAA 1502

Query: 1264 GVDGFNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRP 1085
            G DGF++ F+++ W  I  ++  AV+ FF    +PR +    + LLPK P+AS   DFRP
Sbjct: 1503 GPDGFSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRP 1562

Query: 1084 IACCSVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVS 905
            I+ C+V+ KII+K+L+NR+  +L  +I + QS F+ GRLI DNILL+ EL+     K   
Sbjct: 1563 ISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRG 1622

Query: 904  PRCMIKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFE 725
                +K+D+ KAYD ++W F+ ++L   GF  ++I  I  C+S   +S+L+NG     F+
Sbjct: 1623 GNLALKLDMMKAYDRLDWSFLIKVLQHFGFNDQWIGMIQKCISNCWFSLLLNGRTEGYFK 1682

Query: 724  ARRGIRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFT 545
              RG+RQGDPISP LF+I  EYL+R    L       ++       + H++FADD+L+FT
Sbjct: 1683 FERGLRQGDPISPQLFLIAAEYLSRGLNALYEQYPSLHYSTGVSIPVSHLAFADDVLIFT 1742

Query: 544  RGDRDSVQQAMKVLDHFAEVSGLRANQLKSC-IYFGGVKDDMKHVILDQTGMCEGSLPFK 368
             G + ++Q+ +  L  + E+S  R N  KSC +    V    + +I   TG     LP  
Sbjct: 1743 NGSKSALQRILAFLQEYEEISRQRINAQKSCFVTHTNVSSSRRQIIAQTTGFNHQLLPIT 1802

Query: 367  YLGVPLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFV 188
            YLG PL      V+    LV  I +RI  W  K+LS  GR+ L+KSV+  + +Y  Q+  
Sbjct: 1803 YLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLKSVLTSLPIYLFQVLK 1862

Query: 187  LPQKVLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNE 35
             P  VL+R+ +    FLW G          +W K+  P   GGL+I +L E
Sbjct: 1863 PPVCVLERINRIFNSFLWGGSAASKKIHWTSWAKISLPVKEGGLDIRSLAE 1913


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  422 bits (1085), Expect = e-115
 Identities = 254/767 (33%), Positives = 398/767 (51%), Gaps = 13/767 (1%)
 Frame = -2

Query: 2296 VYALNTGEGRKELWDFAKRKMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGF 2117
            VYA  T   R  LWD  +R      EP +VGGDFN IL  E+R  G+   +  +EDF   
Sbjct: 988  VYAKCTRSERTLLWDCLRRLAADNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASV 1047

Query: 2116 IDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDH 1937
            +    L +  + G  +TW+NN     R+   +DR   N +W++ +    +Q L +  SDH
Sbjct: 1048 LLDCGLLDGGFEGNPFTWTNN-----RMFQRLDRVVYNHQWINMFPITRIQHLNRDGSDH 1102

Query: 1936 CPQML-LFGNCRQRAGLFRFFNVLADHEDFEGIIREHWGAWRSGNILRDIWRKCIKLKGP 1760
            CP ++  F +  +    FRF +    H DF+  +  +W    +G+ L+  W K  +LK  
Sbjct: 1103 CPLLISCFISSEKSPSSFRFQHAWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQH 1162

Query: 1759 LKSLNTKWFARVGDRVQGLREQLARVQ-----HDHDCSXXXXXXXXXE-----KWSNIEE 1610
            LK  N   F   GD    L+E   RV+     H  + +               K  N+EE
Sbjct: 1163 LKWWNKAVF---GDIFSKLKEAEKRVEECEILHQQEQTVGSRINLNKSYAQLNKQLNVEE 1219

Query: 1609 RIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFY 1430
              W+QKS V W+  G+ NTKFFH   + +R  + I  +   DG     QEQ+ +    ++
Sbjct: 1220 IFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYF 1279

Query: 1429 MNLMGSCASELQVVNKDIMRRGPRLTSQQQRDLI-KECTDAEVKDALFCMDSNKAPGVDG 1253
             +L+ +   ++      ++   P + S  + +L+  E    EVKDA+F +D   A G DG
Sbjct: 1280 SSLLKAEPCDISRFQNSLI---PSIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDG 1336

Query: 1252 FNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACC 1073
            F++ F+++ W  I  ++  AV+ FF    +PR +    + LLPK  +AS   +FRPI+ C
Sbjct: 1337 FSSYFYQQCWNTIAHDLLDAVRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLC 1396

Query: 1072 SVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCM 893
            +V+ KII+K+L+NR+  +L  +I + QS F+ GRLI DNILL+ EL++    K       
Sbjct: 1397 TVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLA 1456

Query: 892  IKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRG 713
            +K+D+ KAYD ++W F+ ++L   GF  ++I  I  C+S   +S+L+NG +   F++ RG
Sbjct: 1457 LKLDMMKAYDRLDWSFLIKVLQHFGFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERG 1516

Query: 712  IRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDR 533
            +RQGD ISP LF++  EYL+R    L       ++       + H++FADD+L+FT G +
Sbjct: 1517 LRQGDSISPQLFILAAEYLSRGLNALYDQYPSLHYSSGVPLSVSHLAFADDVLIFTNGSK 1576

Query: 532  DSVQQAMKVLDHFAEVSGLRANQLKSC-IYFGGVKDDMKHVILDQTGMCEGSLPFKYLGV 356
             ++Q+ +  L  + E+SG R N  KSC +    + +  + +I   TG     LP  YLG 
Sbjct: 1577 SALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGA 1636

Query: 355  PLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQK 176
            PL      V+    LV  I +RI  W  K+LS  GR+ L++SV+  + +Y  Q+   P  
Sbjct: 1637 PLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVC 1696

Query: 175  VLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNE 35
            VL+RV +    FLW G          +W K+  P + GGL+I +L E
Sbjct: 1697 VLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAE 1743


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  421 bits (1082), Expect = e-115
 Identities = 254/767 (33%), Positives = 389/767 (50%), Gaps = 13/767 (1%)
 Frame = -2

Query: 2296 VYALNTGEGRKELWDFAKRKMVAVNEPMVVGGDFNAILSSEDRFQGAVVSQADVEDFQGF 2117
            VYA  T   R+ELW   +     +  P +VGGDFN+I+S ++R  GA+     +ED    
Sbjct: 952  VYAKCTRIERRELWTSLRIISDGMQAPWLVGGDFNSIVSCDERLNGAIPHDGSMEDLSST 1011

Query: 2116 IDVGELHEVRWSGPQYTWSNNQEGERRVCSNIDRCFANVKWLDEYSDVVVQRLAKGVSDH 1937
            +    L +  + G  +TW+NN     R+   +DR   N +W + +S   VQ L +  SDH
Sbjct: 1012 LFDCGLLDAGFEGNSFTWTNN-----RMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDH 1066

Query: 1936 CPQMLLFGNCRQRA-GLFRFFNVLADHEDFEGIIREHWGAWRSGNILRDIWRKCIKLKGP 1760
            CP ++   N  QR    FRF +    H DF   + + W        L   W K  +LK  
Sbjct: 1067 CPLLISCSNTNQRGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRD 1126

Query: 1759 LKSLNTKWFARVGDRVQGLR-------EQLARVQHDHDCSXXXXXXXXXEKWS---NIEE 1610
            LK  N   F   GD  + LR       ++    Q +   +          K +   +IEE
Sbjct: 1127 LKWWNKHIF---GDIFKILRLAEVEAEQRELNFQQNPSAANRELMHKAYAKLNRQLSIEE 1183

Query: 1609 RIWQQKSRVDWLKLGDANTKFFHAYAKMRRNTNAINHLTRMDGSECWGQEQIMEEVRRFY 1430
              WQQKS V WL  G+ NTKFFH   + +R  N I  +   +G+       I      F+
Sbjct: 1184 LFWQQKSGVKWLVEGERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFF 1243

Query: 1429 MNLMGSCASELQVVNKDIMRRGPRLTSQQQRDLIKECTDA-EVKDALFCMDSNKAPGVDG 1253
             NL+ +   ++   +  I    PR+ S    + +       EVK+A+F ++ +   G DG
Sbjct: 1244 QNLLKAEQCDISRFDPSIT---PRIISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDG 1300

Query: 1252 FNACFFKKSWEFIGEEVTRAVQQFFRNGQLPREINVALITLLPKVPNASSVKDFRPIACC 1073
            F++ F++  W+ I +++  AV  FF+   LPR I    + LLPK  N S   +FRPI+ C
Sbjct: 1301 FSSLFYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLC 1360

Query: 1072 SVLYKIISKILANRMKIVLNDVIGDCQSAFIPGRLIFDNILLSHELVKGYTRKQVSPRCM 893
            +VL KI++K+LANR+  +L  +I + QS F+ GRLI DNILL+ ELV     +      +
Sbjct: 1361 TVLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVDKINARSRGGNVV 1420

Query: 892  IKVDIQKAYDSVEWVFVEQMLSELGFPYRYIQWIMACLSTVSYSILVNGEVTEMFEARRG 713
            +K+D+ KAYD + W F+  M+ + GF   +I  I AC+S   +S+L+NG +   F++ RG
Sbjct: 1421 LKLDMAKAYDRLNWEFLYLMMEQFGFNALWINMIKACISNCWFSLLINGSLVGYFKSERG 1480

Query: 712  IRQGDPISPYLFVICMEYLNRCFGELKSNRLFKYHPKCKKFGLVHVSFADDLLVFTRGDR 533
            +RQGD ISP LF++  EYL+R   +L S     ++       + H++FADD+++FT G  
Sbjct: 1481 LRQGDSISPSLFILAAEYLSRGLNQLFSRYNSLHYLSGCSMSVSHLAFADDIVIFTNGCH 1540

Query: 532  DSVQQAMKVLDHFAEVSGLRANQLKSC-IYFGGVKDDMKHVILDQTGMCEGSLPFKYLGV 356
             ++Q+ +  L  + +VSG + N  KSC I   G     + +I   TG    +LP  YLG 
Sbjct: 1541 SALQKILVFLQEYEQVSGQQVNHQKSCFITANGCPLSRRQIIAQVTGFQHKTLPVTYLGA 1600

Query: 355  PLTSHKLSVMQCKPLVDGILQRINCWSAKLLSYAGRVQLIKSVVFGIQMYWSQIFVLPQK 176
            PL      V     L+  I  RI+ W  K+LS   R+ L++SV+  + MY  Q+   P  
Sbjct: 1601 PLHKGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSVLSSLPMYLLQVLKPPAI 1660

Query: 175  VLKRVQQACRIFLWTGKGEMSNRALVAWDKVMQPKSGGGLNILNLNE 35
            V++++++    FLW    E       AW+K+  P S GGL+I NL +
Sbjct: 1661 VIEKIERLFNSFLWGDSNEGKRMHWAAWNKINFPCSEGGLDIRNLKD 1707

