BLASTX nr result

ID: Atropa21_contig00026717

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00026717
         (3836 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   308   e-140
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   347   3e-98
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   190   2e-71
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   195   7e-62
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   214   2e-61
ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261...   236   8e-59
ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256...   230   3e-57
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...   228   2e-56
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   201   5e-55
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               177   2e-54
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   220   4e-54
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   203   9e-53
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   164   2e-52
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       184   8e-52
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   211   3e-51
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   158   4e-50
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   164   1e-49
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   191   2e-49
emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|72677...   191   2e-49
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   204   2e-49

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  308 bits (790), Expect(5) = e-140
 Identities = 189/559 (33%), Positives = 290/559 (51%), Gaps = 16/559 (2%)
 Frame = +1

Query: 1219 SDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTMYRLWCNLNYC 1398
            SDHSP+   ++       K F+F+N++AE  + L  VEK+W        +  +W NL   
Sbjct: 222  SDHSPLLFNLMTGRPQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAV 281

Query: 1399 KETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELTNG*IYRT 1578
            K  LK++K + +G    ++   R + + +Q+Q        +  + K  M +L +      
Sbjct: 282  KRELKQMKTQKIGLAHEKVKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRHW----- 336

Query: 1579 KF*KKIQSSLDKEKR---W------E*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEI 1731
                 I+ S+ ++K    W         ++   V    + N I ++     RV+Q   E+
Sbjct: 337  ---SHIEDSILQQKSRITWLQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEV 393

Query: 1732 ESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQELFGIVNN 1911
            + EIL+FYK LL   A  +  V+L  +R G  L  Q +  +   +    I + L GI N+
Sbjct: 394  QEEILEFYKKLLGTRASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGND 453

Query: 1912 KAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKD 2091
            KAPG+DG+N YFFK +W  ++ ++   + EFF   R+ R +N  +V L+PK  H   VK+
Sbjct: 454  KAPGLDGFNAYFFKKSWGSIKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKE 513

Query: 2092 YRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK 2271
            +RPIACC ++YKIISK+++ R+KG+I  ++ ++QS FIPG+ I+DNI+L+ E + GYTRK
Sbjct: 514  FRPIACCTVIYKIISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRK 573

Query: 2272 *ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKI-------VTGL*IASTVNGEMTD 2430
             +SPRC++KVD+ KAYDSVEW F++ +L    FP +        V+ +  +  VNG  T 
Sbjct: 574  HMSPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQ 633

Query: 2431 IMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL*FADYLL 2610
              +ARKGLR G P  P              E L              +    L FAD LL
Sbjct: 634  PFQARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLL 693

Query: 2611 LFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEYEEGKLP 2790
            +F R D   +  +   F  FS  SGL A+  KS +YF  VD  T   + D +  + G+LP
Sbjct: 694  MFCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELP 753

Query: 2791 FKYFGVPLSNNFGGQDHCK 2847
            F+Y GVPL++       CK
Sbjct: 754  FRYLGVPLTSKKLTYAQCK 772



 Score =  115 bits (287), Expect(5) = e-140
 Identities = 59/157 (37%), Positives = 90/157 (57%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR*DIHVMQIPKQVA*M 3234
            K  GG N++N++ WN+ A+ KLLWA   K+ KLW+  IH+YYI+R DI  + I  Q   +
Sbjct: 854  KSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWI 913

Query: 3235 IRKILQVRKYWPTPGDTNSLIIGRKFHVATAYNRLSGKELNATWSKLLYQNIDEPKHNFI 3414
            +RKI++ R +    GD + + IG KF +  AY ++S       W +L+  N   PK  FI
Sbjct: 914  LRKIVKARDHLSNIGDWDEICIGDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFI 973

Query: 3415 LGLNLHGKLRIQDKLLK*GVKVIADCVLCCNAPKTRQ 3525
            L + LH +L   D++ + GV+   +  LC N  +T Q
Sbjct: 974  LWMMLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQ 1010



 Score = 69.3 bits (168), Expect(5) = e-140
 Identities = 38/83 (45%), Positives = 52/83 (62%), Gaps = 3/83 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKVI---ELVFRSYLW 2995
            LV+ IT +  +WM K LSY  RL LIK +L  +Q Y + +F +SKKVI   E V R +LW
Sbjct: 774  LVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLW 833

Query: 2996 SGEASITKKAMMAWDKVCLPKKQ 3064
            +G+   TKKA +AW  +  PK +
Sbjct: 834  TGKTEETKKAPVAWATIQRPKSR 856



 Score = 66.6 bits (161), Expect(5) = e-140
 Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 7/149 (4%)
 Frame = +2

Query: 731  NNYVHVVNGRI*VLRREAKVAVTVHETYGQYIQCLVTDRGTAFQCLLIVIYES*SLEERK 910
            NNY H    RI +  R A V VT+  T  Q + C + D+    +  ++ +Y   ++ +RK
Sbjct: 59   NNYSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQDQSHKLK--MVAVYGLHTIADRK 116

Query: 911  KLWVGLLKLGACIAT--PWSICGDFNSPLSSEDITCGNLVGDVEIRDFQLVVDTLVLTDM 1084
             LW GLL+   C+    P  I GDFN+   S D   G LV D E  DFQ  +    L + 
Sbjct: 117  SLWSGLLQ---CVQQQDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIES 173

Query: 1085 KAT*RVLTWTNG-----HVWSKIGRALCN 1156
            ++T    +W+N       V S+I +A  N
Sbjct: 174  RSTWSYYSWSNSSIGRDRVLSRIDKAYVN 202



 Score = 33.1 bits (74), Expect(5) = e-140
 Identities = 14/54 (25%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
 Frame = +3

Query: 573 CIF*NVKSLNIPFKQRE---YLKKYKVCLAGLVKTKVKKHKFQTCLYRIARGWQ 725
           C+  NV+ +N PFK +E   +L  +K+ +  L++T+V++        ++ + W+
Sbjct: 3   CVSWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWK 56


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  347 bits (889), Expect(2) = 3e-98
 Identities = 236/819 (28%), Positives = 389/819 (47%), Gaps = 40/819 (4%)
 Frame = +1

Query: 1189 IIAHFQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTM 1368
            ++  ++E   SDHSP+   +    +   + F+F+N +A+    + +V+++W        M
Sbjct: 215  VVVEYREAGISDHSPLIFNLATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKM 274

Query: 1369 YRLWCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMG 1548
              +W  L   K  LK   ++       +++E R K  A+QA   V+   EL   EK+ + 
Sbjct: 275  KNIWVRLQAVKRALKSFHSKKFSKAHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIA 334

Query: 1549 ELTNG*IYRTKF*KKIQSSLDKEK---RW------E*HIYILNV*RSES*NSIPLIKDAT 1701
            +L             I  S+ K+K   +W          +   +   ++ N I L+++  
Sbjct: 335  QLRKW--------STIDESILKQKSRIQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDR 386

Query: 1702 CRVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYI 1881
               L  +TEI++EI  FY+ LL  ++ ++  ++L ++R G  L       +   IT + I
Sbjct: 387  GDQLTENTEIQNEICNFYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEI 446

Query: 1882 KQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMP 2061
             Q L  I + KAPG+DG+N+ FFK +W +++ ++ E +++FF    + + +N T V L+P
Sbjct: 447  DQALADIDDTKAPGLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIP 506

Query: 2062 KRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILS 2241
            K    +  KDYRPIACC  +YKIISK+++ R++ VI  ++  +Q+ FIP + I DNI+L+
Sbjct: 507  KIDEAKHAKDYRPIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLA 566

Query: 2242 HEHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKI-------VTGL*I 2400
             E + GY R+ +SPRC+IKVD+ KAYDSVEW F++ +LK + FP          V  +  
Sbjct: 567  TELIRGYNRRHVSPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSY 626

Query: 2401 ASTVNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGD 2580
            +  +NG  +    A+KGLR G P  P     +          + +            +  
Sbjct: 627  SILLNGIPSIPFDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKL 686

Query: 2581 NIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILD 2760
              L FAD LL+FAR D   I  +   F  FS  SGL+A++ KS +YFG V       + D
Sbjct: 687  THLMFADDLLMFARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLAD 746

Query: 2761 MLEYEEGKLPFKYFGVPLSNNFGGQDHCK--------------SHKLDEKISLLC*ETLI 2898
             ++   G LPF+Y GVPL++       CK              +H L     L   +T++
Sbjct: 747  RIQMPIGSLPFRYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTIL 806

Query: 2899 DQGCSIWS-----PSLLVSTIFDV*XXXXXXXXXXXXXWRGLYY*E-----GYDGMG*SL 3048
                + W      P  L+  +                 W G           +D +    
Sbjct: 807  YSMQNYWGQIFPLPKKLIKAV---------ETTCRKFLWTGTVDTSYKAPVAWDFL---Q 854

Query: 3049 SAKKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR*DIHVMQIPKQVA 3228
              K  GGLN+ N+ +WN+ AI KLLWA + K+ KLW+  ++ YYI+R +I  + +    +
Sbjct: 855  QPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTS 914

Query: 3229 *MIRKILQVRKYWPTPGDTNSLIIGRKFHVATAYNRLSGKELNATWSKLLYQNIDEPKHN 3408
             ++RKI + R+     G   ++     F +   Y  L     N  W +L+  N   PK  
Sbjct: 915  WILRKIFESRELLTRTGGWEAVSNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQ 974

Query: 3409 FILGLNLHGKLRIQDKLLK*GVKVIADCVLCCNAPKTRQ 3525
            FIL L +  +L   +++ +    V   C +C N  +T Q
Sbjct: 975  FILWLAMLNRLATAERVSRWNRDVSPLCKMCGNEIETIQ 1013



 Score = 42.7 bits (99), Expect(2) = 3e-98
 Identities = 39/158 (24%), Positives = 62/158 (39%), Gaps = 5/158 (3%)
 Frame = +2

Query: 716  RMATCNNYVHVVNGRI*VLRREAKVAVTVHETYGQYIQCLVTDRGTAFQCLLIVIYES*S 895
            R +  NNY     GRI V      V + V     Q I   V +        +  +Y   +
Sbjct: 54   RWSWINNYACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHT 113

Query: 896  LEERKKLWVGLLKLGACIATPWSICGDFNSPLSSEDITCGNLVGDVEIRDFQLVVDTLVL 1075
            + +RK LW  L    +    P  + GD+N+  S++D   GN V + E  D +  V    L
Sbjct: 114  IADRKVLWEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQL 173

Query: 1076 TDMKAT*RVLTWTN-----GHVWSKIGRALCNAT*VVQ 1174
             +   T    +W N       + S+I ++  N   + Q
Sbjct: 174  LEAPTTGLFYSWNNKSIGADRISSRIDKSFVNVAWINQ 211


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  190 bits (482), Expect(4) = 2e-71
 Identities = 141/541 (26%), Positives = 244/541 (45%), Gaps = 9/541 (1%)
 Frame = +1

Query: 1219 SDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQ----GTTMYRLWCN 1386
            SDH      +       +  F+F N++A   + +  VE  W+   +     +T++R    
Sbjct: 530  SDHLRGRFHLRSAIQKPKGPFKFTNVIAAHPEFMPKVEDFWKNTTELFPSTSTLFRFSKK 589

Query: 1387 LNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELTNG* 1566
            L   K  LK L   NL  +  R   A ++    Q +    L P  +++E  A        
Sbjct: 590  LKELKPILKDLSRNNLSDLTRRATYAYEELCRCQTKSLTTLNPHDIVDESLAF------- 642

Query: 1567 IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIESEIL 1746
                             +RWE   ++LN        +I  + D          +I+ E +
Sbjct: 643  -----------------ERWEKERHLLN--------AIHEVMDPQGTRPPNQDDIKIEAV 677

Query: 1747 QFYKGLLS-----FTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQELFGIVNN 1911
            +F+  LLS     FT I +  +   +  +     + +Q  + + IT   + +  F I  N
Sbjct: 678  RFFSDLLSSQPSDFTGISVDELKGILQYR---YSLHEQNLLVAEITEAEVMKVFFSIPLN 734

Query: 1912 KAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKD 2091
            K+PG DGY   FF+ TW ++  +V  ++  FF    L + +N T++ L+PKR++ + +KD
Sbjct: 735  KSPGPDGYTVEFFRETWSVIGQEVTMAIKSFFTYGFLPKGLNSTILALIPKRTYAKEMKD 794

Query: 2092 YRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK 2271
            YRPI+CC ++YK ISK+++ R+K ++   +  +QSAFI  +L+ +N++L+ E V  Y + 
Sbjct: 795  YRPISCCNVLYKAISKLLANRLKCLLPEFIAPNQSAFISDRLLMENLLLASELVKDYHKD 854

Query: 2272 *ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIVTGL*IASTVNGEMTDIMKARKG 2451
             +SPRC +K+DL KA+DSV+W F+   L  +  P K +  + +  +       +     G
Sbjct: 855  GLSPRCAMKIDLSKAFDSVQWPFLLNTLAALDIPEKFIHWINLCISTASFSVQV----NG 910

Query: 2452 LRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL*FADYLLLFARKDL 2631
            LR G    P        +   M +                +G   L FAD +++F+    
Sbjct: 911  LRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGSA 970

Query: 2632 KFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEYEEGKLPFKYFGVP 2811
              +  +   F  F+  SGL  +L KS ++   +   T   IL    ++ G LP +Y G+P
Sbjct: 971  HSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGLP 1030

Query: 2812 L 2814
            L
Sbjct: 1031 L 1031



 Score = 58.2 bits (139), Expect(4) = 2e-71
 Identities = 52/151 (34%), Positives = 70/151 (46%), Gaps = 9/151 (5%)
 Frame = +2

Query: 731  NNYVHVVNGRI*VLRREAKVAVTVHETYGQYIQCLVTDRGTAFQCLLIVIYES*SLEERK 910
            +NY     GRI V+   + V + V     Q I CLV       + +   IY S  +EERK
Sbjct: 361  SNYEFNRLGRIWVVW-SSSVQLQVIFKSSQMIVCLVRVEHYDVEFICSFIYASNFVEERK 419

Query: 911  KLWVGLLKLGACIA---TPWSICGDFNSPLSSEDITCGNLVGDVE--IRDFQLVVDTLVL 1075
            KLW  L  L   +A    PW + GDFN  L  E+ +   +   V   +RDFQ+VV    L
Sbjct: 420  KLWQDLHNLQNSVAFRNKPWLLFGDFNETLKMEEHSSYAVSPMVTPGMRDFQIVVRYCSL 479

Query: 1076 TDMKAT*RVLTWTN----GHVWSKIGRALCN 1156
             DM+    + TW N    G +  K+ R L N
Sbjct: 480  EDMRTHGPLFTWGNKRNEGLICKKLDRVLLN 510



 Score = 54.3 bits (129), Expect(4) = 2e-71
 Identities = 27/85 (31%), Positives = 48/85 (56%), Gaps = 3/85 (3%)
 Frame = +2

Query: 2819 ITLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSK---KVIELVFRSY 2989
            + L++KI ++++SW  ++LSY  RL L+  V+  +  +    F + +   + IE +  ++
Sbjct: 1042 LPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAF 1101

Query: 2990 LWSGEASITKKAMMAWDKVCLPKKQ 3064
            LWSG      KA +AW  VC PK +
Sbjct: 1102 LWSGTDLNPHKAKVAWHDVCKPKSE 1126



 Score = 39.7 bits (91), Expect(4) = 2e-71
 Identities = 41/172 (23%), Positives = 67/172 (38%), Gaps = 16/172 (9%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQ---------R*DIHVM 3207
            K  GGL + +L   N++   KL+W     K  LW+  I    I+         R   H  
Sbjct: 1124 KSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRD 1183

Query: 3208 QIPKQVA*MIRKIL-------QVRKYWPTPGDTNSLIIGRKFHVATAYNRLSGKELNATW 3366
             I   +   + K+L       Q R    + G         KF     ++++  + L   W
Sbjct: 1184 DILNDIEEELEKLLCRGICTEQDRSLCRSIGGQ----FKAKFFSPEIWHQIREQGLVKQW 1239

Query: 3367 SKLLYQNIDEPKHNFILGLNLHGKLRIQDKLLK*GVKVIADCVLCCNAPKTR 3522
             K ++ +   PK  FI  L  H +L   DK+      + + CVLC  + ++R
Sbjct: 1240 HKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESR 1291


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  195 bits (495), Expect(3) = 7e-62
 Identities = 121/353 (34%), Positives = 189/353 (53%), Gaps = 9/353 (2%)
 Frame = +1

Query: 1783 RISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQELFGIVNNKAPGIDGYNTYFFKTTW 1962
            RI+ +N +    GP L       +C+  T + I+   F +  NK+PG DG+N  FF+  W
Sbjct: 253  RIATINRS---DGPDLAKS----LCNEFTHDDIRAVFFSMNPNKSPGPDGFNGCFFQKAW 305

Query: 1963 EIV-QNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKDYRPIACCFIVYKIISK 2139
             ++  N V  +V EFF    LL  +N T++ L+PK ++P T+ D+RPI+CC   YKII+K
Sbjct: 306  LVIGDNVVAAAVKEFFSYGSLLMELNSTIITLVPKVANPTTMSDFRPISCCNTFYKIIAK 365

Query: 2140 VISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK*ISPRCMIKVDL*KAY 2319
            +++ R+KG +  I+G SQS FIPG+ I DNI+L+ E +  Y +    PRC   VD+ KA 
Sbjct: 366  LLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKADGQPRCTFMVDMMKAN 425

Query: 2320 DSVEWYFIKQILKGMRFP-------RKIVTGL*IASTVNGEMTDIMKARKGLRVGRPNVP 2478
            D+VEW FI   L+    P       +  ++    +  VNGE+      R+GLR G P  P
Sbjct: 426  DTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAGFFARRRGLRQGDPLSP 485

Query: 2479 -LHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL*FADYLLLFARKDLKFIMLLKD 2655
             L + A + +   +   +             ++  + L FAD LL+F   D   +  L D
Sbjct: 486  YLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHD 545

Query: 2656 KFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEYEEGKLPFKYFGVPL 2814
             F+ F  +S LKAN+S+S+++   VD  + + +L +  +  G  P +Y G+PL
Sbjct: 546  AFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTCPVRYLGIPL 598



 Score = 62.4 bits (150), Expect(3) = 7e-62
 Identities = 32/81 (39%), Positives = 47/81 (58%), Gaps = 3/81 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKV---IELVFRSYLW 2995
            L+D+I  ++ SW  K LS+  RL LI+ VL  +Q Y +   ++ KKV   IE   R +LW
Sbjct: 611  LLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLW 670

Query: 2996 SGEASITKKAMMAWDKVCLPK 3058
            +G  S      +AW ++CLPK
Sbjct: 671  AGNCSGRAATKVAWSEICLPK 691



 Score = 31.6 bits (70), Expect(3) = 7e-62
 Identities = 16/69 (23%), Positives = 29/69 (42%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR*DIHVMQIPKQVA*M 3234
            K  GGL I +L  WN+  +   +W         W   +  Y ++        +P   +  
Sbjct: 691  KCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWN 750

Query: 3235 IRKILQVRK 3261
             RK+L++R+
Sbjct: 751  WRKLLKIRE 759


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  214 bits (546), Expect(3) = 2e-61
 Identities = 177/675 (26%), Positives = 298/675 (44%), Gaps = 23/675 (3%)
 Frame = +1

Query: 1216 FSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQ-HYQGTTMYRLWCNLN 1392
            FSDH P  + +   S  + K F+  N +    + +  +  +W +  YQG+ M+ L     
Sbjct: 226  FSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSK 285

Query: 1393 YCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELTNG*IY 1572
            + K T++    E+   ++ R+ +A    +  Q  +  A    L   EKEA        + 
Sbjct: 286  FLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAPSSYLAGLEKEAHRSWAELALA 345

Query: 1573 RTKF*-KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIESEILQ 1749
              +F  +K +    K        +   +    + N I  + D T R ++   E+++  + 
Sbjct: 346  EERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVD 405

Query: 1750 FYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMC--SNITREYIKQELFGIVNNKAPG 1923
            F+K L   ++  IS   ++ +         +       + ++   IK E F + +NK+PG
Sbjct: 406  FFKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPG 465

Query: 1924 IDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKDYRPI 2103
             DGY + FFK TW IV   +  +V EFF   RLL   N T V ++PK+ + + + ++RPI
Sbjct: 466  PDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPI 525

Query: 2104 ACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK*ISP 2283
            +CC  +YK+ISK+++ R++ ++   +  SQSAF+ G+L+++N++L+ E V G+ +  IS 
Sbjct: 526  SCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISS 585

Query: 2284 RCMIKVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL*IASTVNGEMTDIMKA 2442
            R ++KVDL KA+DSV W FI + LK    P       ++ +T    +  V+G +    K 
Sbjct: 586  RGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKG 645

Query: 2443 RKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL*FADYLLLFAR 2622
             KGLR G P  P        I   + E       I       EV  + L FAD L++F  
Sbjct: 646  SKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYD 705

Query: 2623 KDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEYEEGKLPFKYF 2802
                 +  +K     F ++SGL+ N  KS VY   ++   K   L    +  G  PF+Y 
Sbjct: 706  GKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYL 764

Query: 2803 GVP-LSNNFGGQDHCKS-HKLDEKISLLC*ETLIDQGCSIWSPSLLVST--------IFD 2952
            G+P L       D+ +   K+  + +    +TL   G      S++ ST        I  
Sbjct: 765  GLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILP 824

Query: 2953 V*XXXXXXXXXXXXXWRGLYY*EGYDGMG*SLSA--KKAGGLNILNLRIWNQVAICKLLW 3126
                           W       G   +    S   K  GGL + N   WN+    +L+W
Sbjct: 825  KCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIW 884

Query: 3127 AFSQKKIKLWIT*IH 3171
                ++  LW+   H
Sbjct: 885  MLFARRDSLWVAWNH 899



 Score = 43.5 bits (101), Expect(3) = 2e-61
 Identities = 44/148 (29%), Positives = 61/148 (41%), Gaps = 7/148 (4%)
 Frame = +2

Query: 734  NYVHVVNGRI*VLRREAKVAVTVHETYGQYIQCLVTDRGTAFQCLLIVIYES*SLEERKK 913
            NY     GRI V+   A V VTV     Q I C V     + + ++  +Y       R++
Sbjct: 61   NYEFAALGRIWVVWDPA-VEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRR 119

Query: 914  LWVGLLKLGACIAT---PWSICGDFNSPLSSEDITCGNLVGDVEIRDFQLVVDTLVLTDM 1084
            LW  L  L A   T   PW I GDFN  L   D + G       + +F+  + T  ++D+
Sbjct: 120  LWSELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDL 179

Query: 1085 KAT*RVLTWTNGH----VWSKIGRALCN 1156
                   TW N      +  KI R L N
Sbjct: 180  PFRGNHYTWWNNQENNPIAKKIDRILVN 207



 Score = 29.6 bits (65), Expect(3) = 2e-61
 Identities = 15/52 (28%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
 Frame = +3

Query: 585 NVKSLNIPFKQREYLKKYKVCLA---GLVKTKVKKHKFQTCLYRIARGWQHV 731
           NV+  N   ++R + K +K+  A    +++T+VK+H+ +  L     GW+ V
Sbjct: 8   NVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSV 59


>ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261795 [Solanum
            lycopersicum]
          Length = 413

 Score =  236 bits (601), Expect = 8e-59
 Identities = 137/392 (34%), Positives = 217/392 (55%), Gaps = 3/392 (0%)
 Frame = +1

Query: 1219 SDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTMYRLWCNLNYC 1398
            SDH P+H  +    +  +  F+  N++ E +  L +V+K W+Q +    M  +W NL   
Sbjct: 55   SDHIPMHFLLHQSYHQIKVSFKLFNVLIEHKSFLELVDKVWKQKHGSEVMKEIWYNLKEL 114

Query: 1399 KETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELTNG*IYRT 1578
            +  L++L  +    I   I++ R +   +Q Q+      EL   EK+ + ++        
Sbjct: 115  QPVLRQLNRKEFQYIGQNIEKKRIELVELQEQLYSQASDELFTKEKDLLIKVDKW----- 169

Query: 1579 KF*KKIQSSLDKEK---RWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIESEILQ 1749
                 I+ S  ++K   RW      + +  +++     +IK+   R  ++H         
Sbjct: 170  ---SMIEESALRQKARARW------ITLGDAKNKYFSSVIKE---RNQKKHIRS------ 211

Query: 1750 FYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQELFGIVNNKAPGID 1929
                       ++  +N  ++++GP    QQ+  +C++IT + I   L    N+KAPGID
Sbjct: 212  -----------KLPAINAQVMKRGPVSSRQQRIQLCTDITEQEIYSTLQSYGNDKAPGID 260

Query: 1930 GYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKDYRPIAC 2109
            GYN  FFK TW+I++ DV E+V  FF   +L +  N TLV L+PK   P+TVK+Y PIAC
Sbjct: 261  GYNALFFKHTWKIIKKDVIEAVKNFFTTGKLFKPFNCTLVSLIPKVQCPKTVKEYTPIAC 320

Query: 2110 CFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK*ISPRC 2289
            C ++YKIISKVI+ R+  VI  ++ +SQ+ FIPG+ I+DNIIL+HE V  YTRK ISPR 
Sbjct: 321  CTVLYKIISKVITRRMHDVIHDVICESQAGFIPGRKIADNIILAHELVKTYTRKNISPRI 380

Query: 2290 MIKVDL*KAYDSVEWYFIKQILKGMRFPRKIV 2385
            ++K+DL KAYDSVEW F++Q++ G+ FP   +
Sbjct: 381  ILKIDLHKAYDSVEWPFLEQVMVGLGFPEMFI 412


>ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256917 [Solanum
            lycopersicum]
          Length = 421

 Score =  230 bits (587), Expect = 3e-57
 Identities = 145/379 (38%), Positives = 194/379 (51%), Gaps = 7/379 (1%)
 Frame = +1

Query: 1705 RVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIK 1884
            R+L    EI+ E++ FYK L+  +A+                            T E I 
Sbjct: 81   RMLYEPQEIQDEVVLFYKSLMGTSAV----------------------------TEEKIF 112

Query: 1885 QELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPK 2064
              L  I N+KAPGIDGYN +FFK TW+I++ND+ E V  FF   +L +  N TLV L+PK
Sbjct: 113  AALQSIGNDKAPGIDGYNAFFFKYTWKIIKNDIIEVVQSFFKPGKLFKPFNCTLVSLIPK 172

Query: 2065 RSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSH 2244
               P+ VK+YR I CC ++YKIISKVI+ R+  VI  ++  SQ  FI G+ IS+NI+L+H
Sbjct: 173  VQSPKNVKEYRTITCCTVLYKIISKVITNRMHDVIHNVICDSQVGFILGRKISENILLAH 232

Query: 2245 EHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL*IA 2403
            E VN YTRK ISPR M+K+DL K YDSVEW F+KQ++ G+ FP          V  +   
Sbjct: 233  ELVNSYTRKNISPRSMLKIDLQKVYDSVEWPFLKQVMVGLGFPDMFTQWVMHCVKTVNYT 292

Query: 2404 STVNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDN 2583
              VNG+ T    A +                     Y +                     
Sbjct: 293  IVVNGQTTQRFDAAR-------------------LFYCYNN------------------- 314

Query: 2584 IL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDM 2763
                   LLLF+R DL  I  LK  F  FS  SG +ANL+KS +Y G V +  +  I+  
Sbjct: 315  -------LLLFSRGDLNSIKALKGCFLEFSQASGQQANLNKSSIYCGGVQMEVRQQIVRQ 367

Query: 2764 LEYEEGKLPFKYFGVPLSN 2820
            L Y+  ++PFKY GVPLS+
Sbjct: 368  LHYKMEEIPFKYLGVPLSS 386


>gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 402

 Score =  228 bits (581), Expect = 2e-56
 Identities = 125/331 (37%), Positives = 196/331 (59%), Gaps = 7/331 (2%)
 Frame = +1

Query: 1711 LQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQE 1890
            + +H  I+ EI  FY  L+  +   + +V+  ++++GP L   QQ  +CS  T   +K  
Sbjct: 70   IDKHNLIKEEIRGFYLKLMGSSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAVEVKNV 129

Query: 1891 LFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRS 2070
            LF + ++KAPGIDGYN +FFK +W I+ + V +++++FF    + + +N T + L+PK  
Sbjct: 130  LFSMDSSKAPGIDGYNVHFFKCSWNIIGDSVIDAILDFFKTGFMPKIINCTYMTLLPKEV 189

Query: 2071 HPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEH 2250
            +  +VK++RPIACC ++YKIISK++++R++GV++ ++ ++QSAF+ G++I DNIILSHE 
Sbjct: 190  NVTSVKNFRPIACCSVIYKIISKILTSRMQGVLNSVVSENQSAFVKGRVIFDNIILSHEL 249

Query: 2251 VNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIV-------TGL*IAST 2409
            V  Y+RK ISPRCM+K+DL KAY+SVEW FIK ++  + F  K V       T       
Sbjct: 250  VKSYSRKGISPRCMVKIDLQKAYNSVEWPFIKHLMLELGFSYKFVNWVMGCLTTASYTFN 309

Query: 2410 VNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL 2589
            +NG++T    A+KGLR G P  P                L +            +    +
Sbjct: 310  INGDLTRPFAAKKGLRQGDPISPYLFVICMEYLNICLIQLRKNAAFRFHPRCKRLNLIHV 369

Query: 2590 *FADYLLLFARKDLKFIMLLKDKFALFSDVS 2682
             F D LLLF+R D+  +  L + F+LFS  S
Sbjct: 370  CFVDDLLLFSRGDVDSVSQLFEAFSLFSAAS 400


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  201 bits (511), Expect(2) = 5e-55
 Identities = 152/556 (27%), Positives = 265/556 (47%), Gaps = 18/556 (3%)
 Frame = +1

Query: 1201 FQENHFSDHSPIHIEVLMDSNSK---RKHFRFINIVAE*EKLLHIVEKSWQQ----HYQG 1359
            F+    SDH    I + + + +    ++ F+F+N++ E E  +  VE  W +        
Sbjct: 110  FEAGGCSDHLRCRINLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMST 169

Query: 1360 TTMYRLWCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKE 1539
            ++++R    L   K  L+ L  E LG++  +  EA +     QA       P  M  E E
Sbjct: 170  SSLFRFSKKLKGLKPLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENE 229

Query: 1540 AMGELTNG*IYRTKF*KKIQSS--LDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVL 1713
            A  +  +  +   KF K+      LD   R     +   V R E+ NSI  I      V 
Sbjct: 230  AYAKWDHIAVLEEKFLKQRSKLHWLDIGDRNNKAFHRAVVAR-EAQNSIREIICHDGSVA 288

Query: 1714 QRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKG-PTLGIQQQ*DMCSN-ITREYIKQ 1887
             +  +I++E    ++  L         + +  L+   P        +M +N ++ E I +
Sbjct: 289  SQEEKIKTEAEHHFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHK 348

Query: 1888 ELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKR 2067
             +F + N+K+PG DGY   F+K  W I+  +   ++  FF K  L + +N T++ L+PK+
Sbjct: 349  VVFSMPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKK 408

Query: 2068 SHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHE 2247
               + +KDYRPI+CC ++YK+ISK+I+ R+K V+   +  +QSAF+  +L+ +N++L+ E
Sbjct: 409  KEAKEMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATE 468

Query: 2248 HVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKI-------VTGL*IAS 2406
             V  Y +  +S RC +K+D+ KA+DSV+W F+  +L+ M FP +        +T    + 
Sbjct: 469  IVKDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSV 528

Query: 2407 TVNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNI 2586
             VNGE+  +  + + LR G    P     +  +   M +                +G   
Sbjct: 529  QVNGELAGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTH 588

Query: 2587 L*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDML 2766
            L FAD L++ +   ++ I  +      F+  SGLK ++ KS +Y   V  +    I+   
Sbjct: 589  LSFADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKF 648

Query: 2767 EYEEGKLPFKYFGVPL 2814
             ++ GKLP +Y G+PL
Sbjct: 649  SFDVGKLPVRYLGLPL 664



 Score = 43.9 bits (102), Expect(2) = 5e-55
 Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 9/96 (9%)
 Frame = +2

Query: 896  LEERKKLWVGLLKLG---ACIATPWSICGDFNSPLSSEDITCG--NLVGDVEIRDFQLVV 1060
            +EERK+LW  L          + PW I GDFN  L  E+ +    N V    +RDFQ+ V
Sbjct: 1    MEERKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAV 60

Query: 1061 DTLVLTDMKAT*RVLTWTNGH----VWSKIGRALCN 1156
            +   +TD+     + TW+N      +  K+ R L N
Sbjct: 61   NHCSITDLAYHGPLFTWSNKRENDLIAKKLDRVLVN 96



 Score = 63.2 bits (152), Expect = 9e-07
 Identities = 55/221 (24%), Positives = 99/221 (44%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2462 GDLMSPYIFVLLMEYLGICLRGLQGEPEFNYHPRC*KLGITYCDLQIIFFYLL-GKI*SL 2638
            G  +SPY+FV+ M+ L   L    G  +F YHP+C  +G+T+         L  GK+ S+
Sbjct: 547  GCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILSDGKVRSI 606

Query: 2639 *CY------------LRTNLPYSLMYRD*KQT*VRV----KYTLGE*M*PQKMLS*ICWN 2770
                           L+ ++  S MY    Q  V      K++      P + L     +
Sbjct: 607  DGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRYLGLPLVS 666

Query: 2771 MRKGNYHSNILEYLFQITLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFL 2950
             R        L     + L++++  K+ +W  ++LS+  RL LI   L+ +  +    F 
Sbjct: 667  KR--------LTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFR 718

Query: 2951 MSK---KVIELVFRSYLWSGEASITKKAMMAWDKVCLPKKQ 3064
            + +   + I+ +  ++LWSG    + KA ++W+ +C PKK+
Sbjct: 719  LPRACIREIDKLCSAFLWSGTELSSNKAKVSWEAICKPKKE 759


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  177 bits (449), Expect(3) = 2e-54
 Identities = 103/324 (31%), Positives = 173/324 (53%), Gaps = 7/324 (2%)
 Frame = +1

Query: 1864 ITREYIKQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKT 2043
            ++ E IK  LF +  +K+PG DGY + F+K TW+I+  +    V  FF K  L + +N  
Sbjct: 100  VSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTLPVQSFFQKGFLPKGINSI 159

Query: 2044 LVILMPKRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLIS 2223
            ++ L+PK+   + ++DYRPI+CC ++YK+ISK+I+ R+K ++   + ++QSAF+  +L+ 
Sbjct: 160  ILALIPKKLAAKEMRDYRPISCCNVLYKVISKIIANRLKLLLPRFIAENQSAFVKDRLLI 219

Query: 2224 DNIILSHEHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIVTGL*IA 2403
            +N++L+ E V  Y +  IS RC IK+D+ KA+DSV+W F+   L  M F    +  + + 
Sbjct: 220  ENLLLATELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWINLC 279

Query: 2404 ST-------VNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS* 2562
             T       VNG++    ++++GLR G    P        +   M +             
Sbjct: 280  ITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPK 339

Query: 2563 MLEVGDNIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVAT 2742
               +G   L FAD L++ +    + I  + + F  F   SGL+ +L KS +Y   V    
Sbjct: 340  CQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPII 399

Query: 2743 KNVILDMLEYEEGKLPFKYFGVPL 2814
            K  I     ++ G+LP +Y G+PL
Sbjct: 400  KQEIAAKFLFDVGQLPVRYLGLPL 423



 Score = 50.4 bits (119), Expect(3) = 2e-54
 Identities = 25/83 (30%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKVIELVFR---SYLW 2995
            L+++I  ++ +W  ++ S+  R  LIK VL+ +  +    F + ++ I  + +   S+LW
Sbjct: 436  LLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLW 495

Query: 2996 SGEASITKKAMMAWDKVCLPKKQ 3064
            SG    + KA ++WD VC PK +
Sbjct: 496  SGSEMSSHKAKISWDIVCKPKAE 518



 Score = 36.2 bits (82), Expect(3) = 2e-54
 Identities = 54/241 (22%), Positives = 80/241 (33%), Gaps = 86/241 (35%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR*DIHVMQIPKQVA*M 3234
            K  GGL + NL+  N V+  KL+W        LW   +  Y I++  I  ++    +   
Sbjct: 516  KAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSW 575

Query: 3235 I-RKILQVR-----------------KYW-----------PTPGDTNSLIIG--RKFHVA 3321
            I RKIL++R                  +W            T GD  ++ +G  R+  VA
Sbjct: 576  IWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVA 635

Query: 3322 TAYNRLSGKE-----LNATWSKLLYQNIDE------------------------------ 3396
             A+ R S +      LN     + YQ I                                
Sbjct: 636  DAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWHLIK 695

Query: 3397 ------------------PKHNFILGLNLHGKLRIQDKLLK*GV--KVIADCVLCCNAPK 3516
                              PK+     L +H +L   D++LK      V  +CVLC N  K
Sbjct: 696  ATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSK 755

Query: 3517 T 3519
            T
Sbjct: 756  T 756


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  220 bits (560), Expect = 4e-54
 Identities = 183/681 (26%), Positives = 310/681 (45%), Gaps = 41/681 (6%)
 Frame = +1

Query: 1270 RKHFRFINIVAE*EKLLHIVEKSWQQ----HYQGTTMYRLWCNLNYCKETLKKLKAENLG 1437
            RK F+F+N++ +  + L +VE  W      +   + +YR    L   K  L++L  E LG
Sbjct: 545  RKPFKFVNVLTKLPQFLPVVESHWASSAPLYVSTSALYRFSKKLKTLKPHLRELGKEKLG 604

Query: 1438 SIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELTNG*IYRTKF*KKIQSSLDKE 1617
             +  R  EA       QA        E +  E +A  + T+          +++    K+
Sbjct: 605  DLPKRTREAHILLCEKQATTLANPSQETIAEELKAYTDWTHL--------SELEEGFLKQ 656

Query: 1618 KRWE*HIYILNV*RSES*------------NSIPLIKDATCRVLQRHTEIESEILQFYKG 1761
            K     ++ +NV    +             NSI  I+      LQ   EI+ E  +F+  
Sbjct: 657  KS---KLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNE 713

Query: 1762 LLSFTAIRISVVNLTILRK--GPTLGIQQQ*DMCSNITREYIKQELFGIVNNKAPGIDGY 1935
             L+  +     +++  LR        +  Q  +   +T E I++ LF + NNK+PG DGY
Sbjct: 714  FLNRQSGDFHGISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGY 773

Query: 1936 NTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKDYRPIACCF 2115
             + FFK TW +   D   ++  FF K  L + +N T++ L+PK+     +KDYRPI+CC 
Sbjct: 774  TSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCN 833

Query: 2116 IVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK*ISPRCMI 2295
            ++YK+ISK+++ R+K ++   + Q+QSAF+  +L+ +N++L+ E V  Y ++ ++PRC +
Sbjct: 834  VLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTPRCAM 893

Query: 2296 KVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL*IASTVNGEMTDIMKARKGL 2454
            K+D+ KA+DSV+W F+   L+ + FP       +  ++    +  VNGE+     + +GL
Sbjct: 894  KIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGL 953

Query: 2455 RVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL*FADYLLLFARKDLK 2634
            R G    P        +  +M +       I       ++G   L FAD L++F      
Sbjct: 954  RQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQW 1013

Query: 2635 FIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEYEEGKLPFKYFGVP- 2811
             I  + + F  F+  SGL+ +L KS +Y   V  + +   L    +  G+LP +Y G+P 
Sbjct: 1014 SIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPL 1073

Query: 2812 LSNNFGGQDHCK-SHKLDEKISLLC*ETLIDQGCSIWSPSLLVSTIFDV*XXXXXXXXXX 2988
            L+      D+      +  KIS     +L   G      +LL S I  +           
Sbjct: 1074 LTKQMTTADYSPLIEAVKTKISSWTARSLSYAG----RLALLNSVIVSIANFWMSAYRLP 1129

Query: 2989 XXXWRGL-YY*EGYDGMG*SLSAKKA-------------GGLNILNLRIWNQVAICKLLW 3126
                R +      +   G  L+ KKA             GGL I +L   N+V+  KL+W
Sbjct: 1130 AGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIW 1189

Query: 3127 AFSQKKIKLWIT*IHTYYIQR 3189
                 +  LW+T I T+ I++
Sbjct: 1190 RLLSTQPSLWVTWIWTFIIRK 1210


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  203 bits (516), Expect(2) = 9e-53
 Identities = 157/554 (28%), Positives = 262/554 (47%), Gaps = 16/554 (2%)
 Frame = +1

Query: 1201 FQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSW-QQHYQGTTMYRL 1377
            F E  FSDHS   + ++  S   +K FRF N + + E  L ++   W      G+ MYR+
Sbjct: 120  FGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRV 179

Query: 1378 WCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGELT 1557
               L   K+ ++    +N   I+ R  EA D     Q+ +  +  P     E E   +  
Sbjct: 180  SVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEAETQRKWR 239

Query: 1558 N-G*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIE 1734
                   + F ++ + +  +E       +       +S N I  + D     ++    +E
Sbjct: 240  ILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLE 299

Query: 1735 SEILQFYK-------GLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQEL 1893
            +  +++++       GL  F    IS  NL   R  P     QQ  + +  + E IK   
Sbjct: 300  NHCVEYFQSNLGSEQGLPLFEQADIS--NLLSYRCSPA----QQVSLDTPFSSEQIKNAF 353

Query: 1894 FGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSH 2073
            F +  NKA G DG++  FF   W I+  +V E++ EFF   +LL+  N T ++L+PK ++
Sbjct: 354  FSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITN 413

Query: 2074 PETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHV 2253
              ++ D+RPI+C   VYK+ISK+++ R+K  +   +  SQSAF+PG+L  +N++L+ E V
Sbjct: 414  ASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELV 473

Query: 2254 NGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRK----IVTGL*IAS---TV 2412
            +GY +K I+P  M+KVDL KA+DSV W FI   L+ +  P K    I+  L  AS    +
Sbjct: 474  HGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVIL 533

Query: 2413 NGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL* 2592
            NG       + KGLR G P  P        +F  + +       I       ++  + L 
Sbjct: 534  NGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLM 593

Query: 2593 FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLEY 2772
            FAD +++F       +  + +    F+  SGL  N +K+Q+Y   +  +  + +     +
Sbjct: 594  FADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMAS-YGF 652

Query: 2773 EEGKLPFKYFGVPL 2814
            + G LP +Y G+PL
Sbjct: 653  KLGSLPVRYLGLPL 666



 Score = 34.3 bits (77), Expect(2) = 9e-53
 Identities = 29/98 (29%), Positives = 40/98 (40%), Gaps = 4/98 (4%)
 Frame = +2

Query: 866  LLIVIYES*SLEERKKLWVGLLKLG---ACIATPWSICGDFNSPL-SSEDITCGNLVGDV 1033
            +L  +Y S     R+ LW  ++        I  PW++ GDFN  L  SE  T      D 
Sbjct: 2    VLSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDR 61

Query: 1034 EIRDFQLVVDTLVLTDMKAT*RVLTWTNGHVWSKIGRA 1147
              R F+  +    LTD+       TW     W+K  RA
Sbjct: 62   PTRIFRETILLASLTDLSFRGNTFTW-----WNKRSRA 94



 Score = 60.1 bits (144), Expect(2) = 7e-07
 Identities = 62/222 (27%), Positives = 99/222 (44%), Gaps = 21/222 (9%)
 Frame = +2

Query: 2462 GDLMSPYIFVLLMEYLGICLRGLQGEPEFNYHPRC*KLGIT---YCDLQIIFF-----YL 2617
            GD MSPY+FVL ME     L+         YHP+  +L I+   + D  +IFF      L
Sbjct: 550  GDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMIFFDGKSSSL 609

Query: 2618 LGKI*SL*CYL----------RTNLPYSLMYRD*KQT*VRVKYTLGE*M*PQKMLS*ICW 2767
             G + SL  +           +T L ++ + +    +     + LG    P + L     
Sbjct: 610  HGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMASYGFKLGSL--PVRYLGLPLM 667

Query: 2768 NMRKGNYHSNILEYLFQITLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLF 2947
            + +       I EY     L++KITA+  SW+ + LS+  R+ L+  V+ G+  +    F
Sbjct: 668  SRKL-----TIAEYA---PLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSF 719

Query: 2948 LMSK---KVIELVFRSYLWSGEASITKKAMMAWDKVCLPKKQ 3064
            ++     K IE +   +LWS        A +AW +VCLPK +
Sbjct: 720  ILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAE 761



 Score = 23.1 bits (48), Expect(2) = 7e-07
 Identities = 7/35 (20%), Positives = 15/35 (42%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWI 3159
            K  GG+ +    + N+    +++W        LW+
Sbjct: 759  KAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWV 793


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  164 bits (414), Expect(4) = 2e-52
 Identities = 133/561 (23%), Positives = 256/561 (45%), Gaps = 13/561 (2%)
 Frame = +1

Query: 1171 SKGPGTIIAHFQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQH 1350
            +K P T I H   +  SDH P+ I     S      FRF +           VE +W   
Sbjct: 1083 NKFPVTRIQHLNRDG-SDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLP 1141

Query: 1351 YQGTTMYRLWCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLN 1530
              G+ +   W   +  K+ LK       G I  ++ EA  + E  +         E  + 
Sbjct: 1142 INGSGLQAFWSKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILHQQEQTFESRIK 1201

Query: 1531 EKEAMGELTNG*IYRTKF*KK---IQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDAT 1701
              ++  +L         F K+   ++  ++ E+  +   + + + +    + I  ++D  
Sbjct: 1202 LNKSYAQLNKQLNIEELFWKQKSGVKWVVEGERNTK--FFHMRMQKKRIRSHIFKVQDPE 1259

Query: 1702 CRVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DM-CSNITREY 1878
             R ++   +++   ++++  LL       S    +++   P++    + ++ C+  + + 
Sbjct: 1260 GRWIEDQEQLKHSAIEYFSSLLKVEPCYDSRFQSSLI---PSIISNSENELLCAEPSLQE 1316

Query: 1879 IKQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILM 2058
            +K  +FGI +  A G DG+++YF++  W I+  D+ ++V +FF    + R V  T +IL+
Sbjct: 1317 VKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGVTSTTLILL 1376

Query: 2059 PKRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIIL 2238
            PK+S      D+RPI+ C ++ KII+K++S R+  V+  I+ ++QS F+ G+LISDNI+L
Sbjct: 1377 PKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGRLISDNILL 1436

Query: 2239 SHEHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL* 2397
            + E +     K       +K+D+ KAYD ++W F+ ++L+   F        +K ++   
Sbjct: 1437 AQELIGKLNTKSRGGNLALKLDMMKAYDKLDWSFLFKVLQHFGFNGQWIKMIQKCISNCW 1496

Query: 2398 IASTVNGEMTDIMKARKGLRVGRPNVP-LHLRATDGIFRYMFEGLTRRT*I*LSS*MLEV 2574
             +  +NG      K+ +GLR G    P L + A + + R +     +   +  SS  + +
Sbjct: 1497 FSLLLNGRTEGYFKSERGLRQGDSISPQLFIIAAEYLSRGLNALYDQYPSLHYSS-GVSI 1555

Query: 2575 GDNIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKS-QVYFGRVDVATKNV 2751
              + L FAD +L+F       +  +      + ++SG + N+ KS  V    V  + + +
Sbjct: 1556 SVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTNVSSSRRQI 1615

Query: 2752 ILDMLEYEEGKLPFKYFGVPL 2814
            I     +    L   Y G PL
Sbjct: 1616 IAQTTGFSHQLLLITYLGAPL 1636



 Score = 47.4 bits (111), Expect(4) = 2e-52
 Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKVIELV---FRSYLW 2995
            LV KI  ++T W  K LS   R+ L++ VL  +  Y  Q+      V+E V   F S+LW
Sbjct: 1649 LVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPICVLERVNRIFNSFLW 1708

Query: 2996 SGEASITKKAMMAWDKVCLPKKQ 3064
             G A+  K    +W K+ LP K+
Sbjct: 1709 GGSAASKKIHWASWAKISLPIKE 1731



 Score = 45.4 bits (106), Expect(4) = 2e-52
 Identities = 27/94 (28%), Positives = 42/94 (44%)
 Frame = +2

Query: 875  VIYES*SLEERKKLWVGLLKLGACIATPWSICGDFNSPLSSEDITCGNLVGDVEIRDFQL 1054
            ++Y   +  ER  LW  L +L   I  PW + GDFN  L  E+   G+   +  + DF  
Sbjct: 985  IVYAKCTRSERTLLWDCLRRLADDIEVPWLVGGDFNVILKREERLYGSAPHEGAMEDFAS 1044

Query: 1055 VVDTLVLTDMKAT*RVLTWTNGHVWSKIGRALCN 1156
             +    L D        TWTN  ++ ++ R + N
Sbjct: 1045 TLLDCGLLDGGFEGNSFTWTNNRMFQRLDRIVYN 1078



 Score = 21.2 bits (43), Expect(4) = 2e-52
 Identities = 11/25 (44%), Positives = 13/25 (52%)
 Frame = +1

Query: 3058 KAGGLNILNLRIWNQVAICKLLWAF 3132
            K GGL+I NL    +    KL W F
Sbjct: 1730 KEGGLDIRNLAEVFEAFSMKLWWRF 1754


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  184 bits (467), Expect(2) = 8e-52
 Identities = 185/718 (25%), Positives = 301/718 (41%), Gaps = 60/718 (8%)
 Frame = +1

Query: 1216 FSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSW-QQHYQGTTMYRLWCNLN 1392
            FSDH    + +   S   ++ F+F N + +    L++V  +W   +  G++M+R+   L 
Sbjct: 228  FSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLK 287

Query: 1393 YCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVALRPELMLNEKEAMGE---LTNG 1563
              K+ +K     N   ++ R  EA D     Q +      P     E EA  +   LT  
Sbjct: 288  ALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAA 347

Query: 1564 *IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIESEI 1743
                + F +K + S   E       +        S NSI  + D   +++     I    
Sbjct: 348  --EESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLC 405

Query: 1744 LQFYKGLLSFTA----IRISVVNLTI-LRKGPTLGIQQQ*DMCSNITREYIKQELFGIVN 1908
              ++  LL        +  + +NL +  R  P     Q  ++ S  + E I+  LF +  
Sbjct: 406  ASYFGSLLGDEVDPYLMEQNDMNLLLSYRCSPA----QVCELESTFSNEDIRAALFSLPR 461

Query: 1909 NKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVK 2088
            NK+ G DG+   FF  +W IV  +V +++ EFF    LL+  N T ++L+PK  +P    
Sbjct: 462  NKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTS 521

Query: 2089 DYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTR 2268
            D+RPI+C   +YK+I+++++ R++ ++ G++  +QSAF+PG+ +++N++L+ + V+GY  
Sbjct: 522  DFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNW 581

Query: 2269 K*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIVTGL-------*IASTVNGEMT 2427
              ISPR M+KVDL KA+DSV W F+   L+ +  P K +  +           ++NG   
Sbjct: 582  SNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNG 641

Query: 2428 DIMKARKGLRVGRPNVP-------------LHLRATDGIFRYMFEGLTRRT*I*LSS*ML 2568
               K+ KGLR G P  P             LH R   G+  Y  +         LS   L
Sbjct: 642  GFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASN------LSISHL 695

Query: 2569 EVGDNIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKN 2748
               D+++ F D         L  I    D FA +   SGLK N  KS +Y   ++    N
Sbjct: 696  MFADDVMIFFD----GGSFSLHGICETLDDFASW---SGLKVNKDKSHLYLAGLNQLESN 748

Query: 2749 VILDMLEYEEGKLPFKYFGVPLSN---------------------------NFGGQDHCK 2847
                   +  G LP +Y G+PL N                           +F G+    
Sbjct: 749  ANA-AYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLI 807

Query: 2848 SHKLDEKISLLC*ETLIDQGCSIWSPSLLVSTIFDV*XXXXXXXXXXXXXWRGLYY*EGY 3027
            S  +   I+      L+ +GC     SL    +                 W G    E  
Sbjct: 808  SSVIFGSINFWMSTFLLPKGCIKRIESLCSRFL-----------------WSGNI--EQA 848

Query: 3028 DGMG*SLSA----KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR 3189
             G+  S +A    K  GGL +  L  WN+    +L+W     K  LW    H +++ R
Sbjct: 849  KGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSR 906



 Score = 50.1 bits (118), Expect(2) = 8e-52
 Identities = 43/149 (28%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
 Frame = +2

Query: 734  NYVHVVNGRI*VLRREAKVAVTVHETYGQYIQCLVTDRGTAFQCLLIVIYES*SLEERKK 913
            NY     G+I V+   +   V V ++  Q I C V   G+    ++ V+Y +  +  RK+
Sbjct: 62   NYAFSDLGKIWVMWDPSVQVVVVAKSL-QMITCEVLLPGSPSWIIVSVVYAANEVASRKE 120

Query: 914  LWVGLLKL---GACIATPWSICGDFNSPLS-SEDITCGNLVGDVEIRDFQLVVDTLVLTD 1081
            LW+ ++ +   G     PW + GDFN  L+  E     +L  D+ +RDF+  +    L+D
Sbjct: 121  LWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSD 180

Query: 1082 MKAT*RVLTWTNGH----VWSKIGRALCN 1156
            ++      TW N      V  KI R L N
Sbjct: 181  LRYKGNTFTWWNKSHTTPVAKKIDRILVN 209


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  211 bits (536), Expect = 3e-51
 Identities = 159/559 (28%), Positives = 277/559 (49%), Gaps = 21/559 (3%)
 Frame = +1

Query: 1201 FQENHFSDHSPIHIEVLMDSNSK---RKHFRFINIVAE*EKLLHIVEKSWQQH----YQG 1359
            F+    SDH    I +  ++ +K    K F+F+N + + E    +V   W+         
Sbjct: 222  FEAGGCSDHLRCRISLNSEAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILST 281

Query: 1360 TTMYRLWCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVAL-RPELMLNEK 1536
            +T++R   NL   K  ++ +  + LG++  + +EA   ++ + A+  V L  P  M  E+
Sbjct: 282  STLFRFSKNLKGLKPKIRSMARDRLGNLSKKANEA---YKILCAKQHVNLTNPSSMAMEE 338

Query: 1537 E--AMGELTNG*IYRTKF*KKIQSSLDKEKRWE*HIYILN--V*RSES*NSIPLIKDATC 1704
            E  A        I   K+ K+ +S L   +  + +    +      E+ N+I  I     
Sbjct: 339  ENAAYSRWDRVAILEEKYLKQ-KSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDG 397

Query: 1705 RVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQ--Q*DMCSNITREY 1878
             V  +  EI++E  +F++  L         V +T L++   +      Q  +   +T E 
Sbjct: 398  IVKTKGDEIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEE 457

Query: 1879 IKQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILM 2058
            I++ LF + ++K+PG DGY + FFK TWEI+ ++   +V  FF K  L + +N T++ L+
Sbjct: 458  IRKVLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALI 517

Query: 2059 PKRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIIL 2238
            PK++    +KDYRPI+CC ++YK+ISK+I+ R+K V+   +  +QSAF+  +L+ +N++L
Sbjct: 518  PKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLL 577

Query: 2239 SHEHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIVTGL*IAST--- 2409
            + E V  Y +  IS RC IK+D+ KA+DSV+W F+  +   + FPR+ +  + I  T   
Sbjct: 578  ATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTAS 637

Query: 2410 ----VNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVG 2577
                VNGE+    ++ +GLR G    P        +   M +                +G
Sbjct: 638  FSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMG 697

Query: 2578 DNIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVIL 2757
               L FAD L++ +   ++ I  +   F  F+  SGL+ +L KS VY   +    +N + 
Sbjct: 698  LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVA 757

Query: 2758 DMLEYEEGKLPFKYFGVPL 2814
            D   +  G+LP +Y G+PL
Sbjct: 758  DRFPFSSGQLPVRYLGLPL 776



 Score = 62.8 bits (151), Expect(2) = 1e-09
 Identities = 58/221 (26%), Positives = 94/221 (42%), Gaps = 20/221 (9%)
 Frame = +2

Query: 2462 GDLMSPYIFVLLMEYLGICLRGLQGEPEFNYHPRC*KLGITYCDLQIIFFYLL-GKI*SL 2638
            G  +SPY+FV+ M+ L   L        F YHP+C  +G+T+         L  GKI S+
Sbjct: 659  GCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSI 718

Query: 2639 *CY------------LRTNLPYSLMY----RD*KQT*VRVKYTLGE*M*PQKMLS*ICWN 2770
                           LR +L  S +Y        +  V  ++       P + L      
Sbjct: 719  ERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLIT 778

Query: 2771 MRKGNYHSNILEYLFQITLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFL 2950
             R        L     + L++++  ++ SW  ++LSY  RL LI  VL+ +  +    F 
Sbjct: 779  KR--------LSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFR 830

Query: 2951 MSKKVI---ELVFRSYLWSGEASITKKAMMAWDKVCLPKKQ 3064
            + +K I   E +  ++LWSG    + KA ++W  VC PK +
Sbjct: 831  LPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDE 871



 Score = 29.6 bits (65), Expect(2) = 1e-09
 Identities = 18/70 (25%), Positives = 33/70 (47%), Gaps = 1/70 (1%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLWIT*IHTYYIQR*DI-HVMQIPKQVA* 3231
            K  GGL + +L+  N V   KL+W        LW+  +  + ++      V Q   Q + 
Sbjct: 869  KDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSW 928

Query: 3232 MIRKILQVRK 3261
            + +K+L+ R+
Sbjct: 929  IWKKLLKYRE 938


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  158 bits (399), Expect(3) = 4e-50
 Identities = 131/559 (23%), Positives = 257/559 (45%), Gaps = 14/559 (2%)
 Frame = +1

Query: 1180 PGTIIAHFQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQG 1359
            P T I H   +  SDH P+ I   + S      FRF +           VE +W     G
Sbjct: 1088 PITRIQHLNRDG-SDHCPLLISCFISSEKSPSSFRFQHAWVLHHDFKTSVEGNWNLPING 1146

Query: 1360 TTMYRLWCNLNYCKETLKKLKAENLGSIDGRIDEARDKFEAI----QAQITVALRPELML 1527
            + +   W   +  K+ LK       G I  ++ EA  + E      Q + TV  R  L  
Sbjct: 1147 SGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILHQQEQTVGSRINLNK 1206

Query: 1528 NEKEAMGELTNG*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCR 1707
            +  +   +L    I+  K    ++  ++ E+  +   + + + +    + I  +++   R
Sbjct: 1207 SYAQLNKQLNVEEIF-WKQKSGVKWVVEGERNTK--FFHMRMQKKRIRSHIFKVQEPDGR 1263

Query: 1708 VLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DM-CSNITREYIK 1884
             ++   +++   ++++  LL      IS    +++   P++    + ++ C+    + +K
Sbjct: 1264 WIEDQEQLKQSAIEYFSSLLKAEPCDISRFQNSLI---PSIISNSENELLCAEPNLQEVK 1320

Query: 1885 QELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPK 2064
              +F I    A G DG+++YF++  W  + +D+ ++V +FF    + R V  T ++L+PK
Sbjct: 1321 DAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDFFHGANIPRGVTSTTLVLLPK 1380

Query: 2065 RSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSH 2244
            +S      ++RPI+ C ++ KII+K++S R+  ++  I+ ++QS F+ G+LISDNI+L+ 
Sbjct: 1381 KSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQ 1440

Query: 2245 EHVNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL*IA 2403
            E +     K       +K+D+ KAYD ++W F+ ++L+   F        +K ++    +
Sbjct: 1441 ELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNEQWIGMIQKCISNCWFS 1500

Query: 2404 STVNGEMTDIMKARKGLRVGRPNVP-LHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGD 2580
              +NG +    K+ +GLR G    P L + A + + R +     +   +  SS  + +  
Sbjct: 1501 LLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLNALYDQYPSLHYSS-GVPLSV 1559

Query: 2581 NIL*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKS-QVYFGRVDVATKNVIL 2757
            + L FAD +L+F       +  +      + ++SG + N  KS  V    +  + + +I 
Sbjct: 1560 SHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIA 1619

Query: 2758 DMLEYEEGKLPFKYFGVPL 2814
                +    LP  Y G PL
Sbjct: 1620 QATGFNHQLLPITYLGAPL 1638



 Score = 45.8 bits (107), Expect(3) = 4e-50
 Identities = 28/93 (30%), Positives = 42/93 (45%)
 Frame = +2

Query: 878  IYES*SLEERKKLWVGLLKLGACIATPWSICGDFNSPLSSEDITCGNLVGDVEIRDFQLV 1057
            +Y   +  ER  LW  L +L A    PW + GDFN  L  E+   G+   +  + DF  V
Sbjct: 988  VYAKCTRSERTLLWDCLRRLAADNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASV 1047

Query: 1058 VDTLVLTDMKAT*RVLTWTNGHVWSKIGRALCN 1156
            +    L D        TWTN  ++ ++ R + N
Sbjct: 1048 LLDCGLLDGGFEGNPFTWTNNRMFQRLDRVVYN 1080



 Score = 45.4 bits (106), Expect(3) = 4e-50
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKVIELV---FRSYLW 2995
            LV KI  ++T W  K LS   R+ L++ VL  +  Y  Q+      V+E V   F S+LW
Sbjct: 1651 LVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLW 1710

Query: 2996 SGEASITKKAMMAWDKVCLP 3055
             G A+  +    +W K+ LP
Sbjct: 1711 GGSAASKRIHWASWAKIALP 1730


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  164 bits (414), Expect(3) = 1e-49
 Identities = 136/554 (24%), Positives = 266/554 (48%), Gaps = 22/554 (3%)
 Frame = +1

Query: 1219 SDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTMYRLWCNLNYC 1398
            SDH P+ I     S      FRF++   +    L  VE+SWQ     + +   W      
Sbjct: 803  SDHCPLLISCATASQKGPSTFRFLHAWTKHHDFLPFVERSWQVPLNSSGLTAFWIKQQRL 862

Query: 1399 KETLKKLKAENLGSIDGRIDEAR---DKFEAIQAQITVALRPELM------LNEKEAMGE 1551
            K  LK    +  G I  ++  A    +K E    Q   ++   LM      LN + ++ E
Sbjct: 863  KRDLKWWNKQIFGDIFEKLKRAEIEAEKREKEFQQDPSSINRNLMNKAYAKLNRQLSIEE 922

Query: 1552 LTNG*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEI 1731
            L     ++ K    ++  ++ E+  +   + L + +    N+I  I+D+   + +    I
Sbjct: 923  L----FWQQK--SGVKWLVEGERNTK--FFHLRMRKKRVRNNIFRIQDSEGNIYEDPQYI 974

Query: 1732 ESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITREYIKQELFGIVNN 1911
            ++  +Q+++ LL+      S  + +++ +  T+ I     +C+  + + IK+ +F I  +
Sbjct: 975  QNSAVQYFQNLLTAEQCDFSRFDPSLIPR--TISITDNEFLCAAPSLKEIKEVVFNIDKD 1032

Query: 1912 KAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRSHPETVKD 2091
               G DG+++ F++  W+I++ D+ E+V++FF    + + V  T ++L+PK+ +     D
Sbjct: 1033 SVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQGVTSTTLVLLPKKPNSCQWSD 1092

Query: 2092 YRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEHVNGYTRK 2271
            +RPI+ C ++ KI++K ++ R+  ++  I+ ++QS F+ G+LISDNI+L+ E V     K
Sbjct: 1093 FRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVGKLDAK 1152

Query: 2272 *ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFP-------RKIVTGL*IASTVNGEMTD 2430
                  ++K+D+ KAYD + W F+  ++K   F        +  ++    +  +NG +  
Sbjct: 1153 ARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGFNDRWISMIKACISNCWFSLLINGSLVG 1212

Query: 2431 IMKARKGLRVGRPNVP-LHLRATDGIFRYMFEGLTR-RT*I*LSS*MLEVGDNIL*FADY 2604
              K+ +GLR G    P L + A D + R + +   R ++ + LS   + +    L FAD 
Sbjct: 1213 YFKSERGLRQGDSISPLLFVLAADYLSRGINQLFNRHKSLLYLSGCFMPISH--LAFADD 1270

Query: 2605 LLLF---ARKDLKFIMLLKDKFALFSDVSGLKANLSKS-QVYFGRVDVATKNVILDMLEY 2772
            +++F    R  L+ I++   +   + +VSG + N  KS  +      +  + +I     +
Sbjct: 1271 IVIFTNGCRPALQKILVFLQE---YEEVSGQQVNHQKSCFITANGCPMTRRQIIAHTTGF 1327

Query: 2773 EEGKLPFKYFGVPL 2814
            +   LP  Y G PL
Sbjct: 1328 QHKTLPVIYLGAPL 1341



 Score = 45.4 bits (106), Expect(3) = 1e-49
 Identities = 26/93 (27%), Positives = 43/93 (46%)
 Frame = +2

Query: 878  IYES*SLEERKKLWVGLLKLGACIATPWSICGDFNSPLSSEDITCGNLVGDVEIRDFQLV 1057
            +Y   + +ER +LW  L  L + +  PW + GDFN+ +S  +   G       + DF   
Sbjct: 691  VYAKCTRQERLELWNCLRSLSSDMQGPWMVGGDFNTIVSCAERLNGAPPHGGSMEDFVAT 750

Query: 1058 VDTLVLTDMKAT*RVLTWTNGHVWSKIGRALCN 1156
            +    L D        TWTN H++ ++ R + N
Sbjct: 751  LFDCGLIDAGFEGNSFTWTNNHMFQRLDRVVYN 783



 Score = 38.5 bits (88), Expect(3) = 1e-49
 Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 3/81 (3%)
 Frame = +2

Query: 2822 TLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKVIELV---FRSYL 2992
            +L+ KI  +++ W  K LS   R+ L++ VL  +  Y  Q+      VIE +   F S+L
Sbjct: 1353 SLITKIRDRISGWENKTLSPGGRITLLRSVLSSLPLYLLQVLKPPVVVIEKIERLFNSFL 1412

Query: 2993 WSGEASITKKAMMAWDKVCLP 3055
            W    +  +    AW K+  P
Sbjct: 1413 WGDSTNDKRIHWAAWHKLTFP 1433


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  191 bits (486), Expect(2) = 2e-49
 Identities = 155/567 (27%), Positives = 265/567 (46%), Gaps = 30/567 (5%)
 Frame = +1

Query: 1201 FQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTMYRLW 1380
            F E   SDH P+ + +      K + FRF   + E       V+  W +   G   + L 
Sbjct: 595  FLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRLLEVPHFKTYVKAGWNKAINGQRKH-LP 653

Query: 1381 CNLNYCKETLKKLKAEN-------LGSIDGRIDEA--------RDKFEAIQAQITVALRP 1515
              +  C++ + KLK ++       +  +   +D+A        R     IQ ++TVA R 
Sbjct: 654  DQVRTCRQAMAKLKHKSNLNSRIRINQLQAALDKAMSSVNRTERRTISHIQRELTVAYRD 713

Query: 1516 ELMLNEKEAMGELTNG*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKD 1695
            E    ++++  +        T+F      +                    S N +  IKD
Sbjct: 714  EERYWQQKSRNQWMKEGDRNTEFFHACTKT------------------RFSVNRLVTIKD 755

Query: 1696 ATCRVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITRE 1875
                + +   EI     +F+  +       +S+++    +  P +  Q   D+  +++  
Sbjct: 756  EEGMIYRGDKEIGVHAQEFFTKVYESNGRPVSIIDFAGFK--PIVTEQINDDLTKDLSDL 813

Query: 1876 YIKQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVIL 2055
             I   +  I ++KAPG DG    F+K+ WEIV  DV + V  FF    + +++N T + +
Sbjct: 814  EIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSINHTNICM 873

Query: 2056 MPKRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNII 2235
            +PK ++PET+ DYRPIA C ++YKIISK +  R+KG +D I+  SQ+AFIPG+L++DN++
Sbjct: 874  IPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGRLVNDNVM 933

Query: 2236 LSHEHVNGY-TRK*ISPRCM-IKVDL*KAYDSVEWYFIKQILKGMRFPRK-------IVT 2388
            ++HE ++   TRK +S   M +K D+ KAYD VEW F++  ++   F           V 
Sbjct: 934  IAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIKWIMGAVK 993

Query: 2389 GL*IASTVNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*ML 2568
             +  +  VNG     ++ ++G+R G P  P        I  ++ +       I      +
Sbjct: 994  SVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIKNRVAEGDI----RGI 1049

Query: 2569 EVGDNI-----L*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFG-RV 2730
             +G+ +     L FAD  L F + +++    LKD F ++   SG K N+SKS + FG RV
Sbjct: 1050 RIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINMSKSMITFGSRV 1109

Query: 2731 DVATKNVILDMLEYEEGKLPFKYFGVP 2811
               T+N + ++L  +      KY G+P
Sbjct: 1110 HGTTQNRLKNILGIQSHGGGGKYLGLP 1136



 Score = 35.0 bits (79), Expect(2) = 2e-49
 Identities = 18/83 (21%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKV---IELVFRSYLW 2995
            +++++  + +SW  KYLS   +  ++K V   +  Y    F +   +   IE +  ++ W
Sbjct: 1150 IIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWW 1209

Query: 2996 SGEASITKKAMMAWDKVCLPKKQ 3064
               A   +   +AW ++   KK+
Sbjct: 1210 EKNAKKREIPWIAWKRLQYSKKE 1232


>emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|7267781|emb|CAB81184.1|
            putative protein [Arabidopsis thaliana]
          Length = 1294

 Score =  191 bits (486), Expect(2) = 2e-49
 Identities = 155/567 (27%), Positives = 265/567 (46%), Gaps = 30/567 (5%)
 Frame = +1

Query: 1201 FQENHFSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSWQQHYQGTTMYRLW 1380
            F E   SDH P+ + +      K + FRF   + E       V+  W +   G   + L 
Sbjct: 575  FLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRLLEVPHFKTYVKAGWNKAINGQRKH-LP 633

Query: 1381 CNLNYCKETLKKLKAEN-------LGSIDGRIDEA--------RDKFEAIQAQITVALRP 1515
              +  C++ + KLK ++       +  +   +D+A        R     IQ ++TVA R 
Sbjct: 634  DQVRTCRQAMAKLKHKSNLNSRIRINQLQAALDKAMSSVNRTERRTISHIQRELTVAYRD 693

Query: 1516 ELMLNEKEAMGELTNG*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKD 1695
            E    ++++  +        T+F      +                    S N +  IKD
Sbjct: 694  EERYWQQKSRNQWMKEGDRNTEFFHACTKT------------------RFSVNRLVTIKD 735

Query: 1696 ATCRVLQRHTEIESEILQFYKGLLSFTAIRISVVNLTILRKGPTLGIQQQ*DMCSNITRE 1875
                + +   EI     +F+  +       +S+++    +  P +  Q   D+  +++  
Sbjct: 736  EEGMIYRGDKEIGVHAQEFFTKVYESNGRPVSIIDFAGFK--PIVTEQINDDLTKDLSDL 793

Query: 1876 YIKQELFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVIL 2055
             I   +  I ++KAPG DG    F+K+ WEIV  DV + V  FF    + +++N T + +
Sbjct: 794  EIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSINHTNICM 853

Query: 2056 MPKRSHPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNII 2235
            +PK ++PET+ DYRPIA C ++YKIISK +  R+KG +D I+  SQ+AFIPG+L++DN++
Sbjct: 854  IPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGRLVNDNVM 913

Query: 2236 LSHEHVNGY-TRK*ISPRCM-IKVDL*KAYDSVEWYFIKQILKGMRFPRK-------IVT 2388
            ++HE ++   TRK +S   M +K D+ KAYD VEW F++  ++   F           V 
Sbjct: 914  IAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIKWIMGAVK 973

Query: 2389 GL*IASTVNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*ML 2568
             +  +  VNG     ++ ++G+R G P  P        I  ++ +       I      +
Sbjct: 974  SVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIKNRVAEGDI----RGI 1029

Query: 2569 EVGDNI-----L*FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFG-RV 2730
             +G+ +     L FAD  L F + +++    LKD F ++   SG K N+SKS + FG RV
Sbjct: 1030 RIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINMSKSMITFGSRV 1089

Query: 2731 DVATKNVILDMLEYEEGKLPFKYFGVP 2811
               T+N + ++L  +      KY G+P
Sbjct: 1090 HGTTQNRLKNILGIQSHGGGGKYLGLP 1116



 Score = 35.0 bits (79), Expect(2) = 2e-49
 Identities = 18/83 (21%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
 Frame = +2

Query: 2825 LVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLFLMSKKV---IELVFRSYLW 2995
            +++++  + +SW  KYLS   +  ++K V   +  Y    F +   +   IE +  ++ W
Sbjct: 1130 IIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWW 1189

Query: 2996 SGEASITKKAMMAWDKVCLPKKQ 3064
               A   +   +AW ++   KK+
Sbjct: 1190 EKNAKKREIPWIAWKRLQYSKKE 1212


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  204 bits (519), Expect = 2e-49
 Identities = 156/555 (28%), Positives = 269/555 (48%), Gaps = 22/555 (3%)
 Frame = +1

Query: 1216 FSDHSPIHIEVLMDSNSKRKHFRFINIVAE*EKLLHIVEKSW-QQHYQGTTMYRLWCNLN 1392
            FSDH    + +  +  S ++ F+F N + + E  L++V  +W   +  G++MYR+   L 
Sbjct: 88   FSDHVSCGVVLEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLK 147

Query: 1393 YCKETLKKLKAENLGSIDGRIDEARDKFEAIQAQITVA------LRPELMLNEKEAMGEL 1554
              K+ +K     N   I+ R  EA +     Q  +T+A         EL    K  +   
Sbjct: 148  AMKKPIKDFSRLNYSGIELRTKEAHELLITCQ-NLTLANPSVSNAALELEAQRKWVLLSC 206

Query: 1555 TNG*IYRTKF*KKIQSSLDKEKRWE*HIYILNV*RSES*NSIPLIKDATCRVLQRHTEIE 1734
                   + F ++ + S   E     H +   V   +S N+I  + D+   ++     I 
Sbjct: 207  AE----ESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGIL 262

Query: 1735 SEILQFYKGLLSFTAIRISV----VNLTILRKGPTLGIQQQ*DMCSNITREY----IKQE 1890
               + +Y+ LL       S+    +NL +     T    Q  D CS + + +    IK  
Sbjct: 263  DHCVTYYERLLGSIESPFSMEQEDMNLLL-----TYRCSQ--DQCSELEKSFTDDEIKAA 315

Query: 1891 LFGIVNNKAPGIDGYNTYFFKTTWEIVQNDVCESVMEFF*KIRLLRAVNKTLVILMPKRS 2070
               +  NK  G DGY+  FF+ TW I+  +V  ++ EFF   +LL+  N T ++L+PK S
Sbjct: 316  FKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPKTS 375

Query: 2071 HPETVKDYRPIACCFIVYKIISKVISARIKGVIDGIMGQSQSAFIPGKLISDNIILSHEH 2250
            +  T+ ++RPI+C   +YK+ISK++++R++G++  ++G SQSAF+PG+ +++N++L+ E 
Sbjct: 376  NACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEM 435

Query: 2251 VNGYTRK*ISPRCMIKVDL*KAYDSVEWYFIKQILKGMRFPRKIVTGL*IAST------- 2409
            V+GY R  ISPR M+KVDL KA+DSV+W F+   L+ +  P + +  +    T       
Sbjct: 436  VHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTIS 495

Query: 2410 VNGEMTDIMKARKGLRVGRPNVPLHLRATDGIFRYMFEGLTRRT*I*LSS*MLEVGDNIL 2589
            VNG      ++ KGLR G P  P        +F  +         I       ++  + L
Sbjct: 496  VNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHL 555

Query: 2590 *FADYLLLFARKDLKFIMLLKDKFALFSDVSGLKANLSKSQVYFGRVDVATKNVILDMLE 2769
             FAD +++F       +  + +    F+D SGLK N  KSQ++   +D+ ++ +      
Sbjct: 556  MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDL-SERITSAAYG 614

Query: 2770 YEEGKLPFKYFGVPL 2814
            +  G  P +Y G+PL
Sbjct: 615  FPAGTFPIRYLGLPL 629



 Score = 64.3 bits (155), Expect(2) = 7e-10
 Identities = 61/222 (27%), Positives = 99/222 (44%), Gaps = 21/222 (9%)
 Frame = +2

Query: 2462 GDLMSPYIFVLLMEYLGICLRGLQGEPEFNYHPRC*KLGIT---YCDLQIIFF-----YL 2617
            GD +SPY+FVL ME     L         +YHP+   L I+   + D  +IFF      +
Sbjct: 513  GDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSM 572

Query: 2618 LGKI*SL*CY-----LRTNLPYSLMYR---D*KQT*VRVKYTLGE*M*PQKMLS*--ICW 2767
             G   +L  +     L+ N   S +++   D  +      Y       P + L    +C 
Sbjct: 573  HGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSAAYGFPAGTFPIRYLGLPLMCR 632

Query: 2768 NMRKGNYHSNILEYLFQITLVDKITAKVTSWMKKYLSYVRRL*LIKVVLFGVQAY*SQLF 2947
             +R  +Y            L++K++A++ SW+ K LS+  R  LI  V+FG+  +    F
Sbjct: 633  KLRIADYGP----------LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTF 682

Query: 2948 LMSK---KVIELVFRSYLWSGEASITKKAMMAWDKVCLPKKQ 3064
            L+ K   K IE +   +LW+G     K + ++W   CLPK +
Sbjct: 683  LLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSE 724



 Score = 29.3 bits (64), Expect(2) = 7e-10
 Identities = 10/34 (29%), Positives = 16/34 (47%)
 Frame = +1

Query: 3055 KKAGGLNILNLRIWNQVAICKLLWAFSQKKIKLW 3156
            K  GGL   +   WN+  + +L+W    +   LW
Sbjct: 722  KSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLW 755